CuGenDBv2

Lag0036547 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0036547
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Reverse transcriptase domain-containing protein
Genome location: chr3:48303133..48303857
RNA-Seq Expression: Lag0036547
Synteny: Lag0036547
Gene Ontology terms: NA
InterPro domains: IPR036691 - Endonuclease/exonuclease/phosphatase superfamily
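
The genome location in the summary is written as `seqid:start..end`. A minimal sketch of parsing that string and computing the span length follows. It assumes the coordinates are 1-based and inclusive, which this page does not state, and `parse_location` and `span_length` are hypothetical helper names rather than part of any CuGenDB tooling.

```python
import re


def parse_location(location):
    """Split a 'seqid:start..end' string into (seqid, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start, end):
    """Length of the span, ASSUMING 1-based inclusive coordinates."""
    return abs(end - start) + 1


seqid, start, end = parse_location("chr3:48303133..48303857")
print(seqid, start, end, span_length(start, end))
```

Under that coordinate assumption, the gene model covers 725 bp on chr3.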


Homology
GenBank top hits (e value, % identity, alignment)
KAA3459022.1 reverse transcriptase [Gossypium australe] (e value: 8.2e-21; identity: 38.65%)
Query:  GGLPRVERDMQDFIDNIDICGLSDPGFIGPEYTWCNNHFQTDLIWELLDRILVNVEIQVWCNNFKVFHLPLIASDHWPLLADWNKDRPNQTQRRLNFPRR
        G LPR E  M+DF   ++ C L D GF GP +TW         I E +DR +          N+ + HLP   SDH PLLA        +T+  LN   R
Subjt:  GGLPRVERDMQDFIDNIDICGLSDPGFIGPEYTWCNNHFQTDLIWELLDRILVNVEIQVWCNNFKVFHLPLIASDHWPLLADWNKDRPNQTQRRLNFPRR

Query:  FEEGWVKLEMVQMECELGNLLEDDETYWRQRAREDWLNWGDRNTKWFHLRASTRRKMNRVRGL
         +E  V  E+++++  L   ++  E YW QRARE+WL  GD+NT +FH  AS RR++N ++GL
Subjt:  FEEGWVKLEMVQMECELGNLLEDDETYWRQRAREDWLNWGDRNTKWFHLRASTRRKMNRVRGL

XP_022158772.1 uncharacterized protein LOC111025237 [Momordica charantia] (e value: 7.2e-25; identity: 38.01%)
Query:  FIDNIDICGLSDPGFIGPEYTWCNNHFQTDLIWELLDRILVNVEIQVWCNNFKVFHLPLIASDHWPLLADWNKDRPNQTQRRLNFPRRFEEGWVK-----
        F + +D C L+D GF G ++TWCNN F  D +W+ LDR L N        +  + HLP   SDH  +        P+   +R + P RFEE WV+     
Subjt:  FIDNIDICGLSDPGFIGPEYTWCNNHFQTDLIWELLDRILVNVEIQVWCNNFKVFHLPLIASDHWPLLADWNKDRPNQTQRRLNFPRRFEEGWVK-----

Query:  -------LEMV---QMECELGNLLEDDETYWRQRAREDWLNWGDRNTKWFHLRASTRRKMNRVRGLMDEGG
               L+      +E +L  LLE +E +W+QR+REDWL WGD N KWFH +A+ R+  N + G+ D+ G
Subjt:  -------LEMV---QMECELGNLLEDDETYWRQRAREDWLNWGDRNTKWFHLRASTRRKMNRVRGLMDEGG

XP_023918603.1 uncharacterized protein LOC112030149 [Quercus suber] (e value: 2.6e-22; identity: 34.69%)
Query:  MGGLPRVERDMQDFIDNIDICGLSDPGFIGPEYTWCNNHFQTDLIWELLDRILVNVEIQVWCNNFKVFHLPLIASDHWPLLADWNKDRPNQTQRRLNFPR
        +GG  R    MQ F D +D CG  D GF+G ++TW + H+    +WE LDR +   E      + +V HL +  SDH PLL      +P+    +   P 
Subjt:  MGGLPRVERDMQDFIDNIDICGLSDPGFIGPEYTWCNNHFQTDLIWELLDRILVNVEIQVWCNNFKVFHLPLIASDHWPLLADWNKDRPNQTQRRLNFPR

Query:  RFEEGWV-----------KLEMVQ----------------MECELGNLLEDDETYWRQRAREDWLNWGDRNTKWFHLRASTRRKMNRVRGLMDEGG
        RFE+ W+           + ++VQ                +E E+  LL+ +   WRQRA+  WL  GDRNTK+FH +AS RR+ N ++GL D  G
Subjt:  RFEEGWV-----------KLEMVQ----------------MECELGNLLEDDETYWRQRAREDWLNWGDRNTKWFHLRASTRRKMNRVRGLMDEGG

XP_030924992.1 uncharacterized protein LOC115952038 [Quercus lobata] (e value: 2.2e-21; identity: 32.73%)
Query:  GGLPRVERDMQDFIDNIDICGLSDPGFIGPEYTWCNNHFQTDLIWELLDRILVNVEIQVWCNNFKVFHLPLIASDHWP--LLADWNKDRPNQTQRRLNFP
        GG  R ++ MQ F D +D CG  D GF G  +TWCNN F   L+W  LDR + + E  +   + ++ HL   +SDH P  L +D    R  + QR    P
Subjt:  GGLPRVERDMQDFIDNIDICGLSDPGFIGPEYTWCNNHFQTDLIWELLDRILVNVEIQVWCNNFKVFHLPLIASDHWP--LLADWNKDRPNQTQRRLNFP

Query:  RRFEEGWVKLEMVQ--------------------------------------------------MECELGNLLEDDETYWRQRAREDWLNWGDRNTKWFH
         RFEE W K E  +                                                  +  E+ NLL+ +E  W QRA+ DWL +GD+N+K+FH
Subjt:  RRFEEGWVKLEMVQ--------------------------------------------------MECELGNLLEDDETYWRQRAREDWLNWGDRNTKWFH

Query:  LRASTRRKMNRVRGLMDEGG
         RA+ R K N + GL D  G
Subjt:  LRASTRRKMNRVRGLMDEGG

XP_030940268.1 uncharacterized protein LOC115965235 [Quercus lobata] (e value: 2.2e-21; identity: 31.12%)
Query:  MGGLPRVERDMQDFIDNIDICGLSDPGFIGPEYTWCNNHFQTDLIWELLDRILVNVEIQVWCNNFKVFHLPLIASDHWPLLADW-NKDRPNQTQRRLNFP
        +GG  R +R MQ F D +D CG  D GF G  +TWCNN F   L+W  LDR L + E  +   + ++ HL   +SDH P+   W   D  ++   R   P
Subjt:  MGGLPRVERDMQDFIDNIDICGLSDPGFIGPEYTWCNNHFQTDLIWELLDRILVNVEIQVWCNNFKVFHLPLIASDHWPLLADW-NKDRPNQTQRRLNFP

Query:  RRFEEGWVKLE-----------------------------------------------MVQ-----------------------MECELGNLLEDDETYW
         RFEE W+K E                                               +VQ                       ++ E+  LL+ +E  W
Subjt:  RRFEEGWVKLE-----------------------------------------------MVQ-----------------------MECELGNLLEDDETYW

Query:  RQRAREDWLNWGDRNTKWFHLRASTRRKMNRVRGLMDEGGI
         QRA+ DWL +GDRN+K+FH RAS R K N + GL D+ G+
Subjt:  RQRAREDWLNWGDRNTKWFHLRASTRRKMNRVRGLMDEGGI

TrEMBL top hits (e value, % identity, alignment)
A0A2N9H1U1 Reverse transcriptase domain-containing protein (e value: 9.5e-23; identity: 32.51%)
Query:  GGLPRVERDMQDFIDNIDICGLSDPGFIGPEYTWCNNHFQTDLIWELLDRILVNVEIQVWCNNF---KVFHLPLIASDHWPLLADWNKDRPNQTQRRLN-
        GG  R  R MQDF D ID+CG  D G+ GP +TWCNN   +  +WE LDR L  +    W NNF   ++FHL    SDH+P+       +P  T  R   
Subjt:  GGLPRVERDMQDFIDNIDICGLSDPGFIGPEYTWCNNHFQTDLIWELLDRILVNVEIQVWCNNF---KVFHLPLIASDHWPLLADWNKDRPNQTQRRLN-

Query:  FPRRFEEGWV---------------------------KLEMVQME------CELGN-------------------------------------LLEDDET
         P RFEE W+                           K+   +ME      C+ GN                                     LL  +E 
Subjt:  FPRRFEEGWV---------------------------KLEMVQME------CELGN-------------------------------------LLEDDET

Query:  YWRQRAREDWLNWGDRNTKWFHLRASTRRKMNRVRGLMDEGGI
         W QRAR  WL  GDRNT++FH  AS RR+ N +  L D  G+
Subjt:  YWRQRAREDWLNWGDRNTKWFHLRASTRRKMNRVRGLMDEGGI

A0A2N9HKV4 Uncharacterized protein (e value: 2.5e-23; identity: 35.47%)
Query:  GGLPRVERDMQDFIDNIDICGLSDPGFIGPEYTWCNNHFQTDLIWELLDRILVNVEIQVWCNNFKVFHLPLIASDHWPL---------------------
        GG PR +  MQ F   +D CG  D GF GPE+TWCNN      IW  LDR +VN E  +   + +V H+P   SDH PL                     
Subjt:  GGLPRVERDMQDFIDNIDICGLSDPGFIGPEYTWCNNHFQTDLIWELLDRILVNVEIQVWCNNFKVFHLPLIASDHWPL---------------------

Query:  -LAD----------W--NKDRPNQTQ---------RRLNFPRRFEEGWV------------KLEMVQM-----------ECELGNLLEDDETYWRQRARE
         L D          W  N D  +  Q         RRL F  R   G V            K E++ M             ELG LLE +E  W QR+R 
Subjt:  -LAD----------W--NKDRPNQTQ---------RRLNFPRRFEEGWV------------KLEMVQM-----------ECELGNLLEDDETYWRQRARE

Query:  DWLNWGDRNTKWFHLRASTRRKMNRVRGLMDEGG
         WL  GDRNT++FH RAS RR+ N + GL D+ G
Subjt:  DWLNWGDRNTKWFHLRASTRRKMNRVRGLMDEGG

A0A2N9I6L8 Reverse transcriptase domain-containing protein (e value: 3.3e-23; identity: 35.04%)
Query:  GGLPRVERDMQDFIDNIDICGLSDPGFIGPEYTWCNNHFQTDLIWELLDRILVNVEIQVWCNNFKVFHLPLIASDHWPL---------------------
        GG PR +  MQ F   +D CG  D GF GPE+TWCNN      +W  LDR +VN E  +   + +V H+P   SDH PL                     
Subjt:  GGLPRVERDMQDFIDNIDICGLSDPGFIGPEYTWCNNHFQTDLIWELLDRILVNVEIQVWCNNFKVFHLPLIASDHWPL---------------------

Query:  -LAD----------W--NKDRPNQTQ---------RRLNFPRRFEEGWV------------KLEMVQM-----------ECELGNLLEDDETYWRQRARE
         L D          W  N D  +  Q         RRL F  R   G V            K E++ M             ELG LLE +E  W QR+R 
Subjt:  -LAD----------W--NKDRPNQTQ---------RRLNFPRRFEEGWV------------KLEMVQM-----------ECELGNLLEDDETYWRQRARE

Query:  DWLNWGDRNTKWFHLRASTRRKMNRVRGLMDEGG
         WL  GDRNT++FH RAS RR+ N + GL D+ G
Subjt:  DWLNWGDRNTKWFHLRASTRRKMNRVRGLMDEGG

A0A2N9IIR5 Uncharacterized protein (e value: 7.7e-25; identity: 33.76%)
Query:  GGLPRVERDMQDFIDNIDICGLSDPGFIGPEYTWCNNHFQTDLIWELLDRILVNVEIQVWCNNFKVFHLPLIASDHWPLL--------------------
        GG PR +  MQ F D +D CG  D GF GPE+TWCNN      +WE LDR++VN E  +      V+H+    SDH PL                     
Subjt:  GGLPRVERDMQDFIDNIDICGLSDPGFIGPEYTWCNNHFQTDLIWELLDRILVNVEIQVWCNNFKVFHLPLIASDHWPLL--------------------

Query:  ------------ADW--NKDRPNQTQ--RRLNFPRRFEEGWVKL------------------------------EMVQMECELGNLLEDDETYWRQRARE
                    A W  N D P   Q   R+N  RR    W +                               ++V +  EL  LL  +ET W QR+R 
Subjt:  ------------ADW--NKDRPNQTQ--RRLNFPRRFEEGWVKL------------------------------EMVQMECELGNLLEDDETYWRQRARE

Query:  DWLNWGDRNTKWFHLRASTRRKMNRVRGLMDEGG
         WL  GDRNT++FH RAS RR+ N + GL DE G
Subjt:  DWLNWGDRNTKWFHLRASTRRKMNRVRGLMDEGG

A0A6J1DY29 uncharacterized protein LOC111025237 (e value: 3.5e-25; identity: 38.01%)
Query:  FIDNIDICGLSDPGFIGPEYTWCNNHFQTDLIWELLDRILVNVEIQVWCNNFKVFHLPLIASDHWPLLADWNKDRPNQTQRRLNFPRRFEEGWVK-----
        F + +D C L+D GF G ++TWCNN F  D +W+ LDR L N        +  + HLP   SDH  +        P+   +R + P RFEE WV+     
Subjt:  FIDNIDICGLSDPGFIGPEYTWCNNHFQTDLIWELLDRILVNVEIQVWCNNFKVFHLPLIASDHWPLLADWNKDRPNQTQRRLNFPRRFEEGWVK-----

Query:  -------LEMV---QMECELGNLLEDDETYWRQRAREDWLNWGDRNTKWFHLRASTRRKMNRVRGLMDEGG
               L+      +E +L  LLE +E +W+QR+REDWL WGD N KWFH +A+ R+  N + G+ D+ G
Subjt:  -------LEMV---QMECELGNLLEDDETYWRQRAREDWLNWGDRNTKWFHLRASTRRKMNRVRGLMDEGG

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
No hits found

Sequences
CDS sequence
ATGGGTGGGTTGCCAAGGGTTGAACGGGATATGCAAGATTTTATAGACAACATAGATATTTGTGGTCTTAGTGATCCGGGTTTCATTGGACCTGAATATACGTGGTGTAA
CAATCATTTCCAAACTGATTTGATCTGGGAGCTACTAGATAGAATTCTTGTAAATGTGGAAATACAAGTCTGGTGTAATAACTTCAAGGTTTTCCACCTTCCTTTGATAG
CTTCAGATCATTGGCCGTTACTTGCAGATTGGAATAAAGATCGGCCAAACCAAACTCAACGAAGATTAAATTTTCCTAGAAGATTCGAAGAAGGTTGGGTGAAGTTGGAA
ATGGTTCAAATGGAGTGTGAATTGGGAAACTTGTTGGAGGATGACGAGACTTATTGGCGGCAAAGAGCGAGAGAGGATTGGCTTAACTGGGGTGATAGGAACACTAAATG
GTTCCACCTAAGAGCGTCCACTAGGAGAAAGATGAATAGAGTTCGAGGATTGATGGATGAGGGCGGAATTTGA
mRNA sequence
ATGGGTGGGTTGCCAAGGGTTGAACGGGATATGCAAGATTTTATAGACAACATAGATATTTGTGGTCTTAGTGATCCGGGTTTCATTGGACCTGAATATACGTGGTGTAA
CAATCATTTCCAAACTGATTTGATCTGGGAGCTACTAGATAGAATTCTTGTAAATGTGGAAATACAAGTCTGGTGTAATAACTTCAAGGTTTTCCACCTTCCTTTGATAG
CTTCAGATCATTGGCCGTTACTTGCAGATTGGAATAAAGATCGGCCAAACCAAACTCAACGAAGATTAAATTTTCCTAGAAGATTCGAAGAAGGTTGGGTGAAGTTGGAA
ATGGTTCAAATGGAGTGTGAATTGGGAAACTTGTTGGAGGATGACGAGACTTATTGGCGGCAAAGAGCGAGAGAGGATTGGCTTAACTGGGGTGATAGGAACACTAAATG
GTTCCACCTAAGAGCGTCCACTAGGAGAAAGATGAATAGAGTTCGAGGATTGATGGATGAGGGCGGAATTTGA
Protein sequence
MGGLPRVERDMQDFIDNIDICGLSDPGFIGPEYTWCNNHFQTDLIWELLDRILVNVEIQVWCNNFKVFHLPLIASDHWPLLADWNKDRPNQTQRRLNFPRRFEEGWVKLE
MVQMECELGNLLEDDETYWRQRAREDWLNWGDRNTKWFHLRASTRRKMNRVRGLMDEGGI
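
The protein sequence can be cross-checked against the CDS by translating it codon by codon. The sketch below assumes the standard genetic code (NCBI translation table 1), which this page does not state explicitly. `translate` is a hypothetical helper written for illustration, and only the first 30 nucleotides of the CDS are used as input.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons ordered over the bases T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds, to_stop=True):
    """Translate a CDS in frame 1; unknown codons become 'X'."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*" and to_stop:
            break  # stop codon ends the protein
        protein.append(aa)
    return "".join(protein)


# First 30 nucleotides of the Lag0036547 CDS listed above.
print(translate("ATGGGTGGGTTGCCAAGGGTTGAACGGGAT"))  # MGGLPRVERD
```

The output matches the first ten residues of the protein sequence. Passing the full CDS with its line breaks intact would work the same way, since whitespace is removed before translation.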