CuGenDBv2

Lag0035136 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0035136
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Retrovirus-related Pol polyprotein from transposon TNT 1-94
Genome location: chr3:15454186..15454737
RNA-Seq expression: Lag0035136
Synteny: Lag0035136
Gene Ontology terms:
  GO:0015074 - DNA integration (biological process)
  GO:0016310 - phosphorylation (biological process)
  GO:0003676 - nucleic acid binding (molecular function)
  GO:0008270 - zinc ion binding (molecular function)
  GO:0016301 - kinase activity (molecular function)
InterPro domains: IPR005162 - Retrotransposon gag domain
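The genome location above can be split into its parts for downstream use. The sketch below is only an illustration: the helper names `parse_location` and `span_length` are hypothetical, and it assumes the coordinates are 1-based and inclusive, which this page does not state. Under that assumption the span comes to 552 bp.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start: int, end: int) -> int:
    """Length of the interval, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


chrom, start, end = parse_location("chr3:15454186..15454737")
print(chrom, start, end, span_length(start, end))
```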


Homology
GenBank top hits (e-value, % identity, alignment)
KAA0065480.1 Cysteine-rich RLK (receptor-like protein kinase) 8 [Cucumis melo var. makuwa] | e-value: 7.4e-52 | identity: 66.67%
Query:  AWKCKNDTITSWILNFVSKEIAASINYVGFAQEVWDELHTRFKQSNGSHIYQLRKDLITIAQGTSSVETYHTKLKTIWQDLVDYRPTYDCSWGGIKPIIE
        AW C ND + SWILN VSKEIAASI Y+G  +E+WDEL  RFKQSNG  IYQLRK+ +T+ QG  ++ETY+TKLKTIWQ+L +YR T DC+ GG+KP I+
Subjt:  AWKCKNDTITSWILNFVSKEIAASINYVGFAQEVWDELHTRFKQSNGSHIYQLRKDLITIAQGTSSVETYHTKLKTIWQDLVDYRPTYDCSWGGIKPIIE

Query:  HMESELVMFFLMGLNDSYSSVRAQILLMNPIPDITKVFSLIIQEERQRIA
        H+ESE +M FLMGLNDSY++VRAQILLM P+P I  VFSL+IQEE+QR A
Subjt:  HMESELVMFFLMGLNDSYSSVRAQILLMNPIPDITKVFSLIIQEERQRIA

KHN29861.1 hypothetical protein glysoja_034105, partial [Glycine soja] | e-value: 1.7e-40 | identity: 48.6%
Query:  KPEGKS---TAWKCKNDTITSWILNFVSKEIAASINYVGFAQEVWDELHTRFKQSNGSHIYQLRKDLITIAQGTSSVETYHTKLKTIWQDLVDYRPTYDC
        KPE K+   TAW   N+   SW+ N VSKEI  +I +    +E+WD+L TRF + NG  I+QL+  L+++ QGT  + TY+TKLK+IW++L DY+PT+ C
Subjt:  KPEGKS---TAWKCKNDTITSWILNFVSKEIAASINYVGFAQEVWDELHTRFKQSNGSHIYQLRKDLITIAQGTSSVETYHTKLKTIWQDLVDYRPTYDC

Query:  SWGGIKPIIEHMESELVMFFLMGLNDSYSSVRAQILLMNPIPDITKVFSLIIQEERQR--IASNSISSTSPD--HSVNT
        + GG++ +  H ESE VM FLMGLNDS+S ++ QILL NP+P I  VFSLI+QE+ QR  + ++S ++TS +   SVN+
Subjt:  SWGGIKPIIEHMESELVMFFLMGLNDSYSSVRAQILLMNPIPDITKVFSLIIQEERQR--IASNSISSTSPD--HSVNT

XP_016199461.1 uncharacterized protein LOC107640454 [Arachis ipaensis] | e-value: 3.5e-41 | identity: 53.38%
Query:  AWKCKNDTITSWILNFVSKEIAASINYVGFAQEVWDELHTRFKQSNGSHIYQLRKDLITIAQGTSSVETYHTKLKTIWQDLVDYRPTYDCSWGGIKPIIE
        +W+C N+ +T+W+LN +SK+IAAS+ Y G A  +W +L TRF QSNG  I++L+K L+T+ QG  SV  Y TKLKTIW++   +RP   C+ GG K    
Subjt:  AWKCKNDTITSWILNFVSKEIAASINYVGFAQEVWDELHTRFKQSNGSHIYQLRKDLITIAQGTSSVETYHTKLKTIWQDLVDYRPTYDCSWGGIKPIIE

Query:  HMESELVMFFLMGLNDSYSSVRAQILLMNPIPDITKVFSLIIQEERQR
        H++ E V+FFLMGLNDS+S++R QILL +P+P I K+FSL++QEERQR
Subjt:  HMESELVMFFLMGLNDSYSSVRAQILLMNPIPDITKVFSLIIQEERQR

XP_022145891.1 uncharacterized protein LOC111015239 [Momordica charantia] | e-value: 1.7e-51 | identity: 64.33%
Query:  KKPEGK-STAWKCKNDTITSWILNFVSKEIAASINYVGFAQEVWDELHTRFKQSNGSHIYQLRKDLITIAQGTSSVETYHTKLKTIWQDLVDYRPTYDCS
        KKP G    AWKC ND ITSWI+N VSKEIAASI Y G A+++WDEL  RF+QS+   I+QLRK+L+T  QGT S+E Y+TKLKT+WQ+L DYRPT DC+
Subjt:  KKPEGK-STAWKCKNDTITSWILNFVSKEIAASINYVGFAQEVWDELHTRFKQSNGSHIYQLRKDLITIAQGTSSVETYHTKLKTIWQDLVDYRPTYDCS

Query:  WGGIKPIIEHMESELVMFFLMGLNDSYSSVRAQILLMNPIPDITKVFSLIIQEERQR
          G+K + E  +SE VM FLMGLN+SY+ +RAQILLM+PIP + KVFSL+IQEERQR
Subjt:  WGGIKPIIEHMESELVMFFLMGLNDSYSSVRAQILLMNPIPDITKVFSLIIQEERQR

XP_022155284.1 uncharacterized protein LOC111022420 [Momordica charantia] | e-value: 2.0e-41 | identity: 53.05%
Query:  KKPEGKSTAWKCKNDTITSWILNFVSKEIAASINYVGFAQEVWDELHTRFKQSNGSHIYQLRKDLITIAQGTSSVETYHTKLKTIWQDLVDYRPTYDCSW
        K  +   +AWKC ND I  WI+N VS++IAAS+ Y   A ++W+EL  RF+QSNG  IYQLRK+ +TI       E Y+TKLKT+WQ+L +Y  +  C+ 
Subjt:  KKPEGKSTAWKCKNDTITSWILNFVSKEIAASINYVGFAQEVWDELHTRFKQSNGSHIYQLRKDLITIAQGTSSVETYHTKLKTIWQDLVDYRPTYDCSW

Query:  GGIKPIIEHMESELVMFFLMGLNDSYSSVRAQILLMNPIPDITKVFSLIIQEERQRIASNSISS
        GG+K ++ H  SE VM FLMGLN+SY+ VRAQIL M+P+P I KVFSL+IQEE  R   N I S
Subjt:  GGIKPIIEHMESELVMFFLMGLNDSYSSVRAQILLMNPIPDITKVFSLIIQEERQRIASNSISS

TrEMBL top hits (e-value, % identity, alignment)
A0A0B2R675 Retrotrans_gag domain-containing protein (Fragment) | e-value: 8.3e-41 | identity: 48.6%
Query:  KPEGKS---TAWKCKNDTITSWILNFVSKEIAASINYVGFAQEVWDELHTRFKQSNGSHIYQLRKDLITIAQGTSSVETYHTKLKTIWQDLVDYRPTYDC
        KPE K+   TAW   N+   SW+ N VSKEI  +I +    +E+WD+L TRF + NG  I+QL+  L+++ QGT  + TY+TKLK+IW++L DY+PT+ C
Subjt:  KPEGKS---TAWKCKNDTITSWILNFVSKEIAASINYVGFAQEVWDELHTRFKQSNGSHIYQLRKDLITIAQGTSSVETYHTKLKTIWQDLVDYRPTYDC

Query:  SWGGIKPIIEHMESELVMFFLMGLNDSYSSVRAQILLMNPIPDITKVFSLIIQEERQR--IASNSISSTSPD--HSVNT
        + GG++ +  H ESE VM FLMGLNDS+S ++ QILL NP+P I  VFSLI+QE+ QR  + ++S ++TS +   SVN+
Subjt:  SWGGIKPIIEHMESELVMFFLMGLNDSYSSVRAQILLMNPIPDITKVFSLIIQEERQR--IASNSISSTSPD--HSVNT

A0A151U9A5 Retrovirus-related Pol polyprotein from transposon TNT 1-94 | e-value: 2.4e-40 | identity: 49.32%
Query:  AWKCKNDTITSWILNFVSKEIAASINYVGFAQEVWDELHTRFKQSNGSHIYQLRKDLITIAQGTSSVETYHTKLKTIWQDLVDYRPTYDCSWGGIKPIIE
        +W+  N+ + SW+LNF+SK++ AS+ Y   A  +W++L  RF+Q NG  ++QLR+DL+T+ QG+ ++  Y TK+K +W++L +Y+P++ C+ GGIKP I+
Subjt:  AWKCKNDTITSWILNFVSKEIAASINYVGFAQEVWDELHTRFKQSNGSHIYQLRKDLITIAQGTSSVETYHTKLKTIWQDLVDYRPTYDCSWGGIKPIIE

Query:  HMESELVMFFLMGLNDSYSSVRAQILLMNPIPDITKVFSLIIQEERQR
        H +SE  M FLMGLN+ YS +R QILLM+PIP I K FSL++QEE+Q+
Subjt:  HMESELVMFFLMGLNDSYSSVRAQILLMNPIPDITKVFSLIIQEERQR

A0A5A7VE66 Cysteine-rich RLK (Receptor-like protein kinase) 8 | e-value: 3.6e-52 | identity: 66.67%
Query:  AWKCKNDTITSWILNFVSKEIAASINYVGFAQEVWDELHTRFKQSNGSHIYQLRKDLITIAQGTSSVETYHTKLKTIWQDLVDYRPTYDCSWGGIKPIIE
        AW C ND + SWILN VSKEIAASI Y+G  +E+WDEL  RFKQSNG  IYQLRK+ +T+ QG  ++ETY+TKLKTIWQ+L +YR T DC+ GG+KP I+
Subjt:  AWKCKNDTITSWILNFVSKEIAASINYVGFAQEVWDELHTRFKQSNGSHIYQLRKDLITIAQGTSSVETYHTKLKTIWQDLVDYRPTYDCSWGGIKPIIE

Query:  HMESELVMFFLMGLNDSYSSVRAQILLMNPIPDITKVFSLIIQEERQRIA
        H+ESE +M FLMGLNDSY++VRAQILLM P+P I  VFSL+IQEE+QR A
Subjt:  HMESELVMFFLMGLNDSYSSVRAQILLMNPIPDITKVFSLIIQEERQRIA

A0A6J1CXR2 uncharacterized protein LOC111015239 | e-value: 8.0e-52 | identity: 64.33%
Query:  KKPEGK-STAWKCKNDTITSWILNFVSKEIAASINYVGFAQEVWDELHTRFKQSNGSHIYQLRKDLITIAQGTSSVETYHTKLKTIWQDLVDYRPTYDCS
        KKP G    AWKC ND ITSWI+N VSKEIAASI Y G A+++WDEL  RF+QS+   I+QLRK+L+T  QGT S+E Y+TKLKT+WQ+L DYRPT DC+
Subjt:  KKPEGK-STAWKCKNDTITSWILNFVSKEIAASINYVGFAQEVWDELHTRFKQSNGSHIYQLRKDLITIAQGTSSVETYHTKLKTIWQDLVDYRPTYDCS

Query:  WGGIKPIIEHMESELVMFFLMGLNDSYSSVRAQILLMNPIPDITKVFSLIIQEERQR
          G+K + E  +SE VM FLMGLN+SY+ +RAQILLM+PIP + KVFSL+IQEERQR
Subjt:  WGGIKPIIEHMESELVMFFLMGLNDSYSSVRAQILLMNPIPDITKVFSLIIQEERQR

A0A6J1DPT8 uncharacterized protein LOC111022420 | e-value: 9.8e-42 | identity: 53.05%
Query:  KKPEGKSTAWKCKNDTITSWILNFVSKEIAASINYVGFAQEVWDELHTRFKQSNGSHIYQLRKDLITIAQGTSSVETYHTKLKTIWQDLVDYRPTYDCSW
        K  +   +AWKC ND I  WI+N VS++IAAS+ Y   A ++W+EL  RF+QSNG  IYQLRK+ +TI       E Y+TKLKT+WQ+L +Y  +  C+ 
Subjt:  KKPEGKSTAWKCKNDTITSWILNFVSKEIAASINYVGFAQEVWDELHTRFKQSNGSHIYQLRKDLITIAQGTSSVETYHTKLKTIWQDLVDYRPTYDCSW

Query:  GGIKPIIEHMESELVMFFLMGLNDSYSSVRAQILLMNPIPDITKVFSLIIQEERQRIASNSISS
        GG+K ++ H  SE VM FLMGLN+SY+ VRAQIL M+P+P I KVFSL+IQEE  R   N I S
Subjt:  GGIKPIIEHMESELVMFFLMGLNDSYSSVRAQILLMNPIPDITKVFSLIIQEERQRIASNSISS

SwissProt top hits (e-value, % identity, alignment)
No hits found

Arabidopsis top hits (e-value, % identity, alignment)
AT1G21280.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162); Has 707 Blast hits to 705 proteins in 25 species: Archae - 0; Bacteria - 0; Metazoa - 4; Fungi - 0; Plants - 703; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). | e-value: 7.5e-18 | identity: 31.79%
Query:  WKCKNDTITSWILNFVSKEIAASINYVGFAQEVWDELHTRFKQSNGSHIYQLRKDLITIAQGTSSVETYHTKLKTIWQDLVDYRPTYDCSWGG-----IK
        W+  N  +  W++N ++ ++  S+ Y   A ++W++L   F       IYQLR+ L T+ QG  SVE Y  KL  +W +L +Y P  +C  GG      K
Subjt:  WKCKNDTITSWILNFVSKEIAASINYVGFAQEVWDELHTRFKQSNGSHIYQLRKDLITIAQGTSSVETYHTKLKTIWQDLVDYRPTYDCSWGG-----IK

Query:  PIIEHMESELVMFFLMG--LNDSYSSVRAQILLMNPIPDITKVFSLIIQEE
           E  E E    FLMG  LN  + +V  +I+   P P + + F+++   E
Subjt:  PIIEHMESELVMFFLMG--LNDSYSSVRAQILLMNPIPDITKVFSLIIQEE


Sequences
CDS sequence
ATGGCAGCATCAAGAAAGAAGCCTGAAGGAAAATCCACTGCATGGAAGTGTAAGAATGACACTATCACTTCCTGGATCCTGAATTTCGTTTCAAAGGAGATCGCTGCAAG
CATCAATTATGTCGGATTTGCTCAAGAAGTCTGGGACGAACTCCACACTCGCTTCAAGCAAAGCAACGGGTCGCACATTTATCAGCTTCGCAAAGATCTTATTACTATCG
CTCAAGGTACTTCTTCGGTCGAAACCTACCACACCAAATTGAAGACAATTTGGCAAGATTTAGTTGATTATCGACCAACATACGACTGCTCTTGGGGAGGTATCAAACCG
ATCATCGAGCACATGGAGTCTGAGCTTGTGATGTTCTTTCTGATGGGACTCAATGACTCTTACTCCTCTGTTCGCGCACAAATCCTCCTTATGAACCCTATCCCTGACAT
TACTAAAGTTTTTTCGCTGATCATACAAGAAGAGCGTCAAAGAATCGCAAGTAATTCCATTTCCTCTACTTCGCCTGATCACAGTGTAAATACTATTCTGTTATTCTGTT
AA
mRNA sequence
ATGGCAGCATCAAGAAAGAAGCCTGAAGGAAAATCCACTGCATGGAAGTGTAAGAATGACACTATCACTTCCTGGATCCTGAATTTCGTTTCAAAGGAGATCGCTGCAAG
CATCAATTATGTCGGATTTGCTCAAGAAGTCTGGGACGAACTCCACACTCGCTTCAAGCAAAGCAACGGGTCGCACATTTATCAGCTTCGCAAAGATCTTATTACTATCG
CTCAAGGTACTTCTTCGGTCGAAACCTACCACACCAAATTGAAGACAATTTGGCAAGATTTAGTTGATTATCGACCAACATACGACTGCTCTTGGGGAGGTATCAAACCG
ATCATCGAGCACATGGAGTCTGAGCTTGTGATGTTCTTTCTGATGGGACTCAATGACTCTTACTCCTCTGTTCGCGCACAAATCCTCCTTATGAACCCTATCCCTGACAT
TACTAAAGTTTTTTCGCTGATCATACAAGAAGAGCGTCAAAGAATCGCAAGTAATTCCATTTCCTCTACTTCGCCTGATCACAGTGTAAATACTATTCTGTTATTCTGTT
AA
Protein sequence
MAASRKKPEGKSTAWKCKNDTITSWILNFVSKEIAASINYVGFAQEVWDELHTRFKQSNGSHIYQLRKDLITIAQGTSSVETYHTKLKTIWQDLVDYRPTYDCSWGGIKP
IIEHMESELVMFFLMGLNDSYSSVRAQILLMNPIPDITKVFSLIIQEERQRIASNSISSTSPDHSVNTILLFC
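
The CDS and protein sequences above can be cross-checked by translating the CDS with the standard genetic code. The sketch below is illustrative only: the `translate` helper is hypothetical, it assumes the standard nuclear code (NCBI translation table 1) applies, and it uses just the first 30 and last 18 nucleotides of the CDS rather than the full record.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame CDS, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks from wrapped text
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


cds_start = "ATGGCAGCATCAAGAAAGAAGCCTGAAGGA"  # first 30 nt of the CDS
cds_end = "ATTCTGTTATTCTGTTAA"                # last 18 nt, including the stop
print(translate(cds_start))  # MAASRKKPEG
print(translate(cds_end))    # ILLFC
```

Both fragments agree with the start (`MAASRKKPEG`) and end (`ILLFC`) of the listed protein sequence. Passing the whole wrapped CDS as one string works the same way, since whitespace is removed before translation.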