; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0038207 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0038207
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionReverse transcriptase
Genome locationchr2:13804069..13804623
RNA-Seq ExpressionLag0038207
SyntenyLag0038207
Gene Ontology termsGO:0090304 - nucleic acid metabolic process (biological process)
GO:0005488 - binding (molecular function)
GO:0140640 - catalytic activity, acting on a nucleic acid (molecular function)
InterPro domainsIPR000477 - Reverse transcriptase domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA3469681.1 reverse transcriptase [Gossypium australe]5.5e-4754.75Show/hide
Query:  DHILETIPRTITEEQNAELTRSFTKEEIYGVIKKMHPTKAHGPDGIQAVFYQKYWDIVGNDVCSYCLQFLNNEKRLDPINNTYIVLILKVKEPISMKDFW
        D I   I   ITEE NA+L   FT EEI+  + +M PTKA G DG+ A+FYQK W I+G DV  YCLQ LNN   +  IN T IVLI KV  P ++  F 
Subjt:  DHILETIPRTITEEQNAELTRSFTKEEIYGVIKKMHPTKAHGPDGIQAVFYQKYWDIVGNDVCSYCLQFLNNEKRLDPINNTYIVLILKVKEPISMKDFW

Query:  PISLCLVLYKVIAKILANRMKSVLDKIISPSQSTFVSGRLISDNTIIGFECIHSVKNRRRGKGGVVALKLDMSKAYDRV
        PISLC V+YK+IAK++ANR+  V+ K I P+QS FV GRLI+DN ++ +E +H++KN++ GK G++A+KLDMSKAYDRV
Subjt:  PISLCLVLYKVIAKILANRMKSVLDKIISPSQSTFVSGRLISDNTIIGFECIHSVKNRRRGKGGVVALKLDMSKAYDRV

KAA3480199.1 reverse transcriptase [Gossypium australe]7.2e-4753.63Show/hide
Query:  DHILETIPRTITEEQNAELTRSFTKEEIYGVIKKMHPTKAHGPDGIQAVFYQKYWDIVGNDVCSYCLQFLNNEKRLDPINNTYIVLILKVKEPISMKDFW
        D I   I   ++EE NA+L   FT+EEI+  + +M PTKA G DG+ A+FYQK W I+G DV  YCLQ LNN   +  IN T IVLI KV  P ++  F 
Subjt:  DHILETIPRTITEEQNAELTRSFTKEEIYGVIKKMHPTKAHGPDGIQAVFYQKYWDIVGNDVCSYCLQFLNNEKRLDPINNTYIVLILKVKEPISMKDFW

Query:  PISLCLVLYKVIAKILANRMKSVLDKIISPSQSTFVSGRLISDNTIIGFECIHSVKNRRRGKGGVVALKLDMSKAYDRV
        PISLC V+YK+IAK++ANR++ V+ K I P+QS FV GRLI+DN ++ +E +H++KN++ GK G++A+KLDMSKAYDRV
Subjt:  PISLCLVLYKVIAKILANRMKSVLDKIISPSQSTFVSGRLISDNTIIGFECIHSVKNRRRGKGGVVALKLDMSKAYDRV

OMO59710.1 reverse transcriptase [Corchorus capsularis]1.9e-4755.31Show/hide
Query:  DHILETIPRTITEEQNAELTRSFTKEEIYGVIKKMHPTKAHGPDGIQAVFYQKYWDIVGNDVCSYCLQFLNNEKRLDPINNTYIVLILKVKEPISMKDFW
        D ILE +  +IT E N  L   FT EEI+  +K++HPTKA GPDG+   F++K+W IVG+DV S+CL F +    L   N T+IVLI KV +P ++  F 
Subjt:  DHILETIPRTITEEQNAELTRSFTKEEIYGVIKKMHPTKAHGPDGIQAVFYQKYWDIVGNDVCSYCLQFLNNEKRLDPINNTYIVLILKVKEPISMKDFW

Query:  PISLCLVLYKVIAKILANRMKSVLDKIISPSQSTFVSGRLISDNTIIGFECIHSVKNRRRGKGGVVALKLDMSKAYDRV
        PISLC VLYK+I+K+L NR+KS+L   IS SQS FV GRLI+DN ++ FE +HS+K+R+ GK G  ALKLDMSKAYDRV
Subjt:  PISLCLVLYKVIAKILANRMKSVLDKIISPSQSTFVSGRLISDNTIIGFECIHSVKNRRRGKGGVVALKLDMSKAYDRV

XP_012461392.1 PREDICTED: uncharacterized protein LOC105781396 [Gossypium raimondii]1.1e-4756.42Show/hide
Query:  DHILETIPRTITEEQNAELTRSFTKEEIYGVIKKMHPTKAHGPDGIQAVFYQKYWDIVGNDVCSYCLQFLNNEKRLDPINNTYIVLILKVKEPISMKDFW
        DHIL  I R I  + N +LT  +++EEI   + +M PTKA G DG  A+FYQK W IVG+DV S+CLQ LN    + PIN T IVLI KV  P +M  F 
Subjt:  DHILETIPRTITEEQNAELTRSFTKEEIYGVIKKMHPTKAHGPDGIQAVFYQKYWDIVGNDVCSYCLQFLNNEKRLDPINNTYIVLILKVKEPISMKDFW

Query:  PISLCLVLYKVIAKILANRMKSVLDKIISPSQSTFVSGRLISDNTIIGFECIHSVKNRRRGKGGVVALKLDMSKAYDRV
        PISLC VLYK+IAK+LANR +SV+ K I  +QS FV GRLISDN ++ +E +H +K +R GK G +A+KLDMSKAYDRV
Subjt:  PISLCLVLYKVIAKILANRMKSVLDKIISPSQSTFVSGRLISDNTIIGFECIHSVKNRRRGKGGVVALKLDMSKAYDRV

XP_023881891.1 uncharacterized protein LOC111994244 [Quercus suber]2.0e-4954.24Show/hide
Query:  ILETIPRTITEEQNAELTRSFTKEEIYGVIKKMHPTKAHGPDGIQAVFYQKYWDIVGNDVCSYCLQFLNNEKRLDPINNTYIVLILKVKEPISMKDFWPI
        +L+ IP T+TEE N  L + FT+EEI   + +MHPTKA GPDG+ A+F+QKYW+IVGND+    L  LN+   +  IN T I L+ K+K P  M DF PI
Subjt:  ILETIPRTITEEQNAELTRSFTKEEIYGVIKKMHPTKAHGPDGIQAVFYQKYWDIVGNDVCSYCLQFLNNEKRLDPINNTYIVLILKVKEPISMKDFWPI

Query:  SLCLVLYKVIAKILANRMKSVLDKIISPSQSTFVSGRLISDNTIIGFECIHSVKNRRRGKGGVVALKLDMSKAYDRV
        SLC V+YK+I+K+LANR+K++L +IIS +QS F+SGRLI+DN ++ FE +H +++++ GK G  A+KLDMSKAYDRV
Subjt:  SLCLVLYKVIAKILANRMKSVLDKIISPSQSTFVSGRLISDNTIIGFECIHSVKNRRRGKGGVVALKLDMSKAYDRV

TrEMBL top hitse value%identityAlignment
A0A1R3GNW3 Reverse transcriptase9.2e-4855.31Show/hide
Query:  DHILETIPRTITEEQNAELTRSFTKEEIYGVIKKMHPTKAHGPDGIQAVFYQKYWDIVGNDVCSYCLQFLNNEKRLDPINNTYIVLILKVKEPISMKDFW
        D ILE +  +IT E N  L   FT EEI+  +K++HPTKA GPDG+   F++K+W IVG+DV S+CL F +    L   N T+IVLI KV +P ++  F 
Subjt:  DHILETIPRTITEEQNAELTRSFTKEEIYGVIKKMHPTKAHGPDGIQAVFYQKYWDIVGNDVCSYCLQFLNNEKRLDPINNTYIVLILKVKEPISMKDFW

Query:  PISLCLVLYKVIAKILANRMKSVLDKIISPSQSTFVSGRLISDNTIIGFECIHSVKNRRRGKGGVVALKLDMSKAYDRV
        PISLC VLYK+I+K+L NR+KS+L   IS SQS FV GRLI+DN ++ FE +HS+K+R+ GK G  ALKLDMSKAYDRV
Subjt:  PISLCLVLYKVIAKILANRMKSVLDKIISPSQSTFVSGRLISDNTIIGFECIHSVKNRRRGKGGVVALKLDMSKAYDRV

A0A2N9E9A1 Reverse transcriptase domain-containing protein7.1e-4854.75Show/hide
Query:  DHILETIPRTITEEQNAELTRSFTKEEIYGVIKKMHPTKAHGPDGIQAVFYQKYWDIVGNDVCSYCLQFLNNEKRLDPINNTYIVLILKVKEPISMKDFW
        + ++E IPR +T E N  LT+ F   E+   +K+M P K+ GPDG   +FYQKYW I+G DV    L  LN+ K L  IN+T+I LI KVK P S+ DF 
Subjt:  DHILETIPRTITEEQNAELTRSFTKEEIYGVIKKMHPTKAHGPDGIQAVFYQKYWDIVGNDVCSYCLQFLNNEKRLDPINNTYIVLILKVKEPISMKDFW

Query:  PISLCLVLYKVIAKILANRMKSVLDKIISPSQSTFVSGRLISDNTIIGFECIHSVKNRRRGKGGVVALKLDMSKAYDRV
        PISLC V+YK+I+K+L NR+KS+L +I+S SQS FV GRLI+DN ++ FE +H +  +RRGK G VALKLDMSKAYDRV
Subjt:  PISLCLVLYKVIAKILANRMKSVLDKIISPSQSTFVSGRLISDNTIIGFECIHSVKNRRRGKGGVVALKLDMSKAYDRV

A0A5B6V0I7 Reverse transcriptase7.8e-4753.59Show/hide
Query:  DNDHILETIPRTITEEQNAELTRSFTKEEIYGVIKKMHPTKAHGPDGIQAVFYQKYWDIVGNDVCSYCLQFLNNEKRLDPINNTYIVLILKVKEPISMKD
        + D IL  I   +TEE N  L  SFTKEEI+  + +M PTKA G DG++A+FYQK W I+G +V +YCL  LNN   L  IN T IVL+ KV  P ++  
Subjt:  DNDHILETIPRTITEEQNAELTRSFTKEEIYGVIKKMHPTKAHGPDGIQAVFYQKYWDIVGNDVCSYCLQFLNNEKRLDPINNTYIVLILKVKEPISMKD

Query:  FWPISLCLVLYKVIAKILANRMKSVLDKIISPSQSTFVSGRLISDNTIIGFECIHSVKNRRRGKGGVVALKLDMSKAYDRV
        F PISLC V+YKVIAK LANR+++++ K I  SQS FV GRLISDN ++ +E +H+++N++ GK G++A+KLDMSKA+DRV
Subjt:  FWPISLCLVLYKVIAKILANRMKSVLDKIISPSQSTFVSGRLISDNTIIGFECIHSVKNRRRGKGGVVALKLDMSKAYDRV

A0A5B6VKR7 Reverse transcriptase2.7e-4754.75Show/hide
Query:  DHILETIPRTITEEQNAELTRSFTKEEIYGVIKKMHPTKAHGPDGIQAVFYQKYWDIVGNDVCSYCLQFLNNEKRLDPINNTYIVLILKVKEPISMKDFW
        D I   I   ITEE NA+L   FT EEI+  + +M PTKA G DG+ A+FYQK W I+G DV  YCLQ LNN   +  IN T IVLI KV  P ++  F 
Subjt:  DHILETIPRTITEEQNAELTRSFTKEEIYGVIKKMHPTKAHGPDGIQAVFYQKYWDIVGNDVCSYCLQFLNNEKRLDPINNTYIVLILKVKEPISMKDFW

Query:  PISLCLVLYKVIAKILANRMKSVLDKIISPSQSTFVSGRLISDNTIIGFECIHSVKNRRRGKGGVVALKLDMSKAYDRV
        PISLC V+YK+IAK++ANR+  V+ K I P+QS FV GRLI+DN ++ +E +H++KN++ GK G++A+KLDMSKAYDRV
Subjt:  PISLCLVLYKVIAKILANRMKSVLDKIISPSQSTFVSGRLISDNTIIGFECIHSVKNRRRGKGGVVALKLDMSKAYDRV

A0A5B6WEP5 Reverse transcriptase3.5e-4753.63Show/hide
Query:  DHILETIPRTITEEQNAELTRSFTKEEIYGVIKKMHPTKAHGPDGIQAVFYQKYWDIVGNDVCSYCLQFLNNEKRLDPINNTYIVLILKVKEPISMKDFW
        D I   I   ++EE NA+L   FT+EEI+  + +M PTKA G DG+ A+FYQK W I+G DV  YCLQ LNN   +  IN T IVLI KV  P ++  F 
Subjt:  DHILETIPRTITEEQNAELTRSFTKEEIYGVIKKMHPTKAHGPDGIQAVFYQKYWDIVGNDVCSYCLQFLNNEKRLDPINNTYIVLILKVKEPISMKDFW

Query:  PISLCLVLYKVIAKILANRMKSVLDKIISPSQSTFVSGRLISDNTIIGFECIHSVKNRRRGKGGVVALKLDMSKAYDRV
        PISLC V+YK+IAK++ANR++ V+ K I P+QS FV GRLI+DN ++ +E +H++KN++ GK G++A+KLDMSKAYDRV
Subjt:  PISLCLVLYKVIAKILANRMKSVLDKIISPSQSTFVSGRLISDNTIIGFECIHSVKNRRRGKGGVVALKLDMSKAYDRV

SwissProt top hitse value%identityAlignment
O00370 LINE-1 retrotransposable element ORF2 protein7.8e-1229.47Show/hide
Query:  MEDNDHILE--TIPRTITEEQNAELTRSFTKEEIYGVIKKMHPTKAHGPDGIQAVFYQKYWDIVGNDVCSYCLQFLNNEKRLDPINNTY----IVLILKV
        +E+ D  L+  T+PR + +E+   L R  T  EI  +I  +   K+ GPDG  A FYQ+Y +    ++  + L+   + ++   + N++    I+LI K 
Subjt:  MEDNDHILE--TIPRTITEEQNAELTRSFTKEEIYGVIKKMHPTKAHGPDGIQAVFYQKYWDIVGNDVCSYCLQFLNNEKRLDPINNTY----IVLILKV

Query:  -KEPISMKDFWPISLCLVLYKVIAKILANRMKSVLDKIISPSQSTFVSGRLISDNTIIGFECIHSVKNRRRGKGGVVALKLDMSKAYDRV
         ++    ++F PISL  +  K++ KILANR++  + K+I   Q  F+ G     N       I  + NR + K  V+ + +D  KA+D++
Subjt:  -KEPISMKDFWPISLCLVLYKVIAKILANRMKSVLDKIISPSQSTFVSGRLISDNTIIGFECIHSVKNRRRGKGGVVALKLDMSKAYDRV

P08548 LINE-1 reverse transcriptase homolog1.1e-0827.89Show/hide
Query:  MEDNDHILET--IPRTITEEQNAELTRSFTKEEIYGVIKKMHPTKAHGPDGIQAVFYQKYWDIVGNDVCSYCLQFLNNEKRLDPINNTY----IVLILKV
        +++ D  LE   +PR +++++   L R  +  EI   I+ +   K+ GPDG  + FYQ + +    ++    L    N ++   + NT+    I LI K 
Subjt:  MEDNDHILET--IPRTITEEQNAELTRSFTKEEIYGVIKKMHPTKAHGPDGIQAVFYQKYWDIVGNDVCSYCLQFLNNEKRLDPINNTY----IVLILKV

Query:  -KEPISMKDFWPISLCLVLYKVIAKILANRMKSVLDKIISPSQSTFVSGRLISDNTIIGFECIHSVKNRRRGKGGVVALKLDMSKAYDRV
         K+P   +++ PISL  +  K++ KIL NR++  + KII   Q  F+ G     N       I  + N+ + K  ++ L +D  KA+D +
Subjt:  -KEPISMKDFWPISLCLVLYKVIAKILANRMKSVLDKIISPSQSTFVSGRLISDNTIIGFECIHSVKNRRRGKGGVVALKLDMSKAYDRV

P11369 LINE-1 retrotransposable element ORF2 protein9.6e-1028.42Show/hide
Query:  MEDNDHILE--TIPRTITEEQNAELTRSFTKEEIYGVIKKMHPTKAHGPDGIQAVFYQKYWDIVGNDVCSYCLQFLNNEKRLDPINNTY----IVLILK-
        +++ D  L+   +P+ + ++Q   L    + +EI  VI  +   K+ GPDG  A FYQ + +    D+     +  +  +    + N++    I LI K 
Subjt:  MEDNDHILE--TIPRTITEEQNAELTRSFTKEEIYGVIKKMHPTKAHGPDGIQAVFYQKYWDIVGNDVCSYCLQFLNNEKRLDPINNTY----IVLILK-

Query:  VKEPISMKDFWPISLCLVLYKVIAKILANRMKSVLDKIISPSQSTFVSGRLISDNTIIGFECIHSVKNRRRGKGGVVALKLDMSKAYDRV
         K+P  +++F PISL  +  K++ KILANR++  +  II P Q  F+ G     N       IH + N+ + K  ++ + LD  KA+D++
Subjt:  VKEPISMKDFWPISLCLVLYKVIAKILANRMKSVLDKIISPSQSTFVSGRLISDNTIIGFECIHSVKNRRRGKGGVVALKLDMSKAYDRV

P14381 Transposon TX1 uncharacterized 149 kDa protein2.9e-1430.18Show/hide
Query:  ITEEQNAELTRSFTKEEIYGVIKKMHPTKAHGPDGIQAVFYQKYWDIVGNDVCSYCLQFLNNEKRLDPINNTYIVLILKVKEPISMKDFWPISLCLVLYK
        ++E +   L    T +E+   ++ M   K+ G DG+   F+Q +WD +G D      +     +         + L+ K  +   +K++ P+SL    YK
Subjt:  ITEEQNAELTRSFTKEEIYGVIKKMHPTKAHGPDGIQAVFYQKYWDIVGNDVCSYCLQFLNNEKRLDPINNTYIVLILKVKEPISMKDFWPISLCLVLYK

Query:  VIAKILANRMKSVLDKIISPSQSTFVSGRLISDNTIIGFECIHSVKNRRRGKGGVVALKLDMSKAYDRV
        ++AK ++ R+KSVL ++I P QS  V GR I DN  +  + +H     RR    +  L LD  KA+DRV
Subjt:  VIAKILANRMKSVLDKIISPSQSTFVSGRLISDNTIIGFECIHSVKNRRRGKGGVVALKLDMSKAYDRV

Arabidopsis top hitse value%identityAlignment
AT1G43760.1 DNAse I-like superfamily protein2.3e-0633.33Show/hide
Query:  EEIYGVIKKMHPTKAHGPDGIQAVFYQKYWDIVGNDVCSYCLQFLNNEKRLDPINNTYIVLILKVKEPISMKDFWPISLCLVLYKVI
        +EI   +  M   KA GPD   A F+ + W +V +   +   +F      L   N T I LI KV     +  F P+S C V+YK+I
Subjt:  EEIYGVIKKMHPTKAHGPDGIQAVFYQKYWDIVGNDVCSYCLQFLNNEKRLDPINNTYIVLILKVKEPISMKDFWPISLCLVLYKVI

AT4G20520.1 RNA binding;RNA-directed DNA polymerases8.3e-0939.06Show/hide
Query:  LANRMKSVLDKIISPSQSTFVSGRLISDNTIIGFECIHSVKNRRRGKGGVVALKLDMSKAYDRV
        +  R+K ++  +I P+Q++F+ GR+ +DN +   E +HS++ R++G  G + LKLD+ KAYDR+
Subjt:  LANRMKSVLDKIISPSQSTFVSGRLISDNTIIGFECIHSVKNRRRGKGGVVALKLDMSKAYDRV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAAGATAATGATCATATCTTGGAAACAATCCCACGGACAATCACGGAGGAGCAAAATGCAGAACTTACGAGGAGTTTTACCAAAGAGGAAATTTATGGAGTTATAAA
GAAGATGCACCCAACTAAAGCTCACGGGCCGGATGGCATTCAGGCAGTCTTCTACCAAAAATATTGGGACATAGTGGGTAATGATGTGTGTTCTTATTGTCTTCAGTTTC
TTAACAACGAGAAGAGACTGGATCCAATTAACAATACATATATTGTCCTCATTTTGAAGGTTAAAGAACCCATATCAATGAAAGACTTTTGGCCTATTAGTCTCTGCTTA
GTCCTTTACAAAGTGATTGCTAAAATCCTTGCCAATAGAATGAAATCGGTGTTGGATAAGATTATATCCCCGAGCCAATCAACCTTTGTGTCGGGAAGACTAATCTCTGA
TAATACCATAATTGGTTTTGAATGCATCCATTCGGTTAAAAATAGAAGGCGTGGTAAGGGAGGGGTGGTTGCTCTAAAGCTGGATATGAGCAAAGCTTATGACCGAGTGG
GATGA
mRNA sequenceShow/hide mRNA sequence
ATGGAAGATAATGATCATATCTTGGAAACAATCCCACGGACAATCACGGAGGAGCAAAATGCAGAACTTACGAGGAGTTTTACCAAAGAGGAAATTTATGGAGTTATAAA
GAAGATGCACCCAACTAAAGCTCACGGGCCGGATGGCATTCAGGCAGTCTTCTACCAAAAATATTGGGACATAGTGGGTAATGATGTGTGTTCTTATTGTCTTCAGTTTC
TTAACAACGAGAAGAGACTGGATCCAATTAACAATACATATATTGTCCTCATTTTGAAGGTTAAAGAACCCATATCAATGAAAGACTTTTGGCCTATTAGTCTCTGCTTA
GTCCTTTACAAAGTGATTGCTAAAATCCTTGCCAATAGAATGAAATCGGTGTTGGATAAGATTATATCCCCGAGCCAATCAACCTTTGTGTCGGGAAGACTAATCTCTGA
TAATACCATAATTGGTTTTGAATGCATCCATTCGGTTAAAAATAGAAGGCGTGGTAAGGGAGGGGTGGTTGCTCTAAAGCTGGATATGAGCAAAGCTTATGACCGAGTGG
GATGA
Protein sequenceShow/hide protein sequence
MEDNDHILETIPRTITEEQNAELTRSFTKEEIYGVIKKMHPTKAHGPDGIQAVFYQKYWDIVGNDVCSYCLQFLNNEKRLDPINNTYIVLILKVKEPISMKDFWPISLCL
VLYKVIAKILANRMKSVLDKIISPSQSTFVSGRLISDNTIIGFECIHSVKNRRRGKGGVVALKLDMSKAYDRVG