; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0031372 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0031372
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationchr11:7719874..7720680
RNA-Seq ExpressionLag0031372
SyntenyLag0031372
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
POO04030.1 Endonuclease/exonuclease/phosphatase [Trema orientale]2.8e-3132.58Show/hide
Query:  SSTFKVYHLPLIAPDHRPLLA---EWSIEQRSYMRNDKDRPRRFEIGWIKYEECREIIEHTWRDV-RQPDQRSIHDKAKECLRRLSTWSRIKYDGSIKGA
        S + ++ HL     DHR LL    +    Q SY++    R  RF+  W++ +E  EII++ W+D+  +     +      C   L+ WS+ K+ G +K  
Subjt:  SSTFKVYHLPLIAPDHRPLLA---EWSIEQRSYMRNDKDRPRRFEIGWIKYEECREIIEHTWRDV-RQPDQRSIHDKAKECLRRLSTWSRIKYDGSIKGA

Query:  IAKVEKDIQNLSTHDD--NRNLNELIQKEKELENLLEDDEIYWRQRAREEWLQWGDRNTKWFHMKASNRRKVNKVHGLMDEWGNWTKDDIGMESIATQYF
        + K +K +Q L+  DD    + + L+Q+E  L+ LL  +EIYW+QR+R  W+Q GD NTK+FH KA+ RRK NK+  + D++G     D  +E++   YF
Subjt:  IAKVEKDIQNLSTHDD--NRNLNELIQKEKELENLLEDDEIYWRQRAREEWLQWGDRNTKWFHMKASNRRKVNKVHGLMDEWGNWTKDDIGMESIATQYF

Query:  QMLFQSSNPDLDSIGWILDTIPSSITEEQNIALLKAFSCEEIHMVVKNMQPSKLQGQTEFRQFF
        Q +F S   + +++  +L TI  SIT+  N  L++ F   ++   +++M P +  GQ      F
Subjt:  QMLFQSSNPDLDSIGWILDTIPSSITEEQNIALLKAFSCEEIHMVVKNMQPSKLQGQTEFRQFF

XP_015382878.1 uncharacterized protein LOC107175701 [Citrus sinensis]6.0e-3434.94Show/hide
Query:  HLPLIAPDHRPLLAEWSIEQ---RSYMRNDKDRPRRFEIGWIKYEECREIIEHTWRD----VRQPDQRSIHDKAKECLRRLSTWSRIKYDGSIK--GAIA
        HL   + DH P+L E  +E+   + YMR    R   +E  W  YE+CREI++H W+D     +         K+KE L  L  WS+ +++G  K    + 
Subjt:  HLPLIAPDHRPLLAEWSIEQ---RSYMRNDKDRPRRFEIGWIKYEECREIIEHTWRD----VRQPDQRSIHDKAKECLRRLSTWSRIKYDGSIK--GAIA

Query:  KVEKDIQNLSTHDDNRNLNELIQKEKELENLLEDDEIYWRQRAREEWLQWGDRNTKWFHMKASNRRKVNKVHGLMDEWGNWTKDDIGMESIATQYFQMLF
           K+++N  +H DNR   E+ + E +++N+L D+EI+W+QR+R +WL+ GDRN K+FH KAS ++K N+++GL DE G W ++D  +E +   YF  +F
Subjt:  KVEKDIQNLSTHDDNRNLNELIQKEKELENLLEDDEIYWRQRAREEWLQWGDRNTKWFHMKASNRRKVNKVHGLMDEWGNWTKDDIGMESIATQYFQMLF

Query:  QSSNPDLDSIGWILDTIPSSITEEQNIALLKAFSCEEIHMVVKNMQPSK
         ++NP    +   L+T+P+ +TE+    + + F+ EE+   +  M P+K
Subjt:  QSSNPDLDSIGWILDTIPSSITEEQNIALLKAFSCEEIHMVVKNMQPSK

XP_023881891.1 uncharacterized protein LOC111994244 [Quercus suber]2.1e-3435.91Show/hide
Query:  KVYHLPLIAPDHRPLLAEWSIEQRSYMRNDKDRPR-RFEIGWIKYEECREIIEHTWRDVRQPDQ-RSIHDKAKECLRRLSTWSRIKYDGSIKGAIAKVEK
        KV+HL     DH  LL    I   + ++   +R R +FE  W + E+C++II+  W    + +  R I  + + C   LS W+++ + G+I   I + ++
Subjt:  KVYHLPLIAPDHRPLLAEWSIEQRSYMRNDKDRPR-RFEIGWIKYEECREIIEHTWRDVRQPDQ-RSIHDKAKECLRRLSTWSRIKYDGSIKGAIAKVEK

Query:  DIQNLSTHDDNRNL-NELIQKEKELENLLEDDEIYWRQRAREEWLQWGDRNTKWFHMKASNRRKVNKVHGLMDEWGNWTKDDIGMESIATQYFQMLFQSS
         +  L   D N +L  E+    KE+  LL+ +EI W+QR+R +WL  GDRNTK+FH KAS+RR+ N ++G+MDE GNW     G+  +A  YFQ ++ SS
Subjt:  DIQNLSTHDDNRNL-NELIQKEKELENLLEDDEIYWRQRAREEWLQWGDRNTKWFHMKASNRRKVNKVHGLMDEWGNWTKDDIGMESIATQYFQMLFQSS

Query:  NPDLDSIGWILDTIPSSITEEQNIALLKAFSCEEIHMVVKNMQPSKLQGQTEFRQFFIK
         P    I  +LD IP+++TEE N +L++ F+ EEI   +  M P+K  G       F +
Subjt:  NPDLDSIGWILDTIPSSITEEQNIALLKAFSCEEIHMVVKNMQPSKLQGQTEFRQFFIK

XP_023906330.1 uncharacterized protein LOC112018052 [Quercus suber]5.6e-3233.62Show/hide
Query:  KDRPRRFEIGWIKYEECREIIEHTWR---DVRQPDQRSIHDKAKECLRRLSTWSRIKYDGSIKGAIAKVEKDIQNLSTHDDNRNLNELIQKEKELENLLE
        K+ PRRFE  W+  +EC  IIE  WR   +V  P    +  K K     L  W ++ + G+ K ++    ++++ L++ +D   L  + + + E+ N+L 
Subjt:  KDRPRRFEIGWIKYEECREIIEHTWR---DVRQPDQRSIHDKAKECLRRLSTWSRIKYDGSIKGAIAKVEKDIQNLSTHDDNRNLNELIQKEKELENLLE

Query:  DDEIYWRQRAREEWLQWGDRNTKWFHMKASNRRKVNKVHGLMDEWGNWTKDDIGMESIATQYFQMLFQSSNPDLDSIGWILDTIPSSITEEQNIALLKAF
         +E+ WRQR+R  WL  GD+NTK+FH +A+ R++ N++ G+ +  G W  D+  +  IA +YFQ LF SS P+ D +  +L+ +   +T+E N  LL+ +
Subjt:  DDEIYWRQRAREEWLQWGDRNTKWFHMKASNRRKVNKVHGLMDEWGNWTKDDIGMESIATQYFQMLFQSSNPDLDSIGWILDTIPSSITEEQNIALLKAF

Query:  SCEEIHMVVKNMQPSKLQGQTEFRQFFIK
        + EEI   +  M PSK  G      FF +
Subjt:  SCEEIHMVVKNMQPSKLQGQTEFRQFFIK

XP_030479239.1 uncharacterized protein LOC115696480 [Cannabis sativa]2.8e-3133.82Show/hide
Query:  WSSTF---KVYHLPLIAPDHRPLLAEWSIEQRSYMRNDKDRPR---RFEIGWIKYEECREIIEHTWRDVRQPDQ-RSIHDKAKECLRRLSTWSRIKYDGS
        W   F   +++HL   + DHR +    S+  +        RPR   RFE  W+K EECR +I + W+     D   S+      C   L TW   KY G 
Subjt:  WSSTF---KVYHLPLIAPDHRPLLAEWSIEQRSYMRNDKDRPR---RFEIGWIKYEECREIIEHTWRDVRQPDQ-RSIHDKAKECLRRLSTWSRIKYDGS

Query:  IKGAIAKVEK---DIQNLSTHDDNRNLNELIQKEKELENLLEDDEIYWRQRAREEWLQWGDRNTKWFHMKASNRRKVNKVHGLMDEWGNWTKDDIGMESI
        +K  I K +K   ++ NLS    +++  +++  EK L++LLE +E+YW+QRAR +WL+ GD NTK+FH +A +R   NK+  L    G     D      
Subjt:  IKGAIAKVEK---DIQNLSTHDDNRNLNELIQKEKELENLLEDDEIYWRQRAREEWLQWGDRNTKWFHMKASNRRKVNKVHGLMDEWGNWTKDDIGMESI

Query:  ATQYFQMLFQSSNPDLDSIGWILDTIPSSITEEQNIALLKAFSCEEIHMVVKNMQPSKLQGQTEFRQFFIKSIGI
        A+ +F  LF +S  DL+++  I+  I +SITE+ N +LLK F+  ++   +K+M P K     E   +F  + GI
Subjt:  ATQYFQMLFQSSNPDLDSIGWILDTIPSSITEEQNIALLKAFSCEEIHMVVKNMQPSKLQGQTEFRQFFIKSIGI

TrEMBL top hitse value%identityAlignment
A0A2N9GPZ7 Reverse transcriptase domain-containing protein7.1e-3333.72Show/hide
Query:  VYHLPLIAPDHRPLLAEWSIEQRSYMRNDKDRPRRFEIGWIKYEECREIIEHTWRD--VRQPDQRSIHDKAKECLRRLSTWSRIKYDGSIKGAIAKVEKD
        V HL +   DH P+L +  I     ++  K +  RFE  WIK E+CRE+I+H W D          + +K K C   L  WSR ++ GS+  +I +  + 
Subjt:  VYHLPLIAPDHRPLLAEWSIEQRSYMRNDKDRPRRFEIGWIKYEECREIIEHTWRD--VRQPDQRSIHDKAKECLRRLSTWSRIKYDGSIKGAIAKVEKD

Query:  IQNLSTHDDNRNLNELIQKEKELENLLEDDEIYWRQRAREEWLQWGDRNTKWFHMKASNRRKVNKVHGLMDEWGNWTKDDIGMESIATQYFQMLFQSSNP
        +Q+L     +     +++ + +L  LLE +EI+WRQR+R  W+  GD+NTK+FH + + RR+ N + GL D  G W  +   +  IA  YFQ +F SSNP
Subjt:  IQNLSTHDDNRNLNELIQKEKELENLLEDDEIYWRQRAREEWLQWGDRNTKWFHMKASNRRKVNKVHGLMDEWGNWTKDDIGMESIATQYFQMLFQSSNP

Query:  DLDSIGWILDTIPSSITEEQNIALLKAFSCEEIHMVVKNMQPSKLQGQTEFRQFFIKS
          +SI  +L  + S +T   N  L   F+ +E+ + +K M P+K  G       F ++
Subjt:  DLDSIGWILDTIPSSITEEQNIALLKAFSCEEIHMVVKNMQPSKLQGQTEFRQFFIKS

A0A2N9I611 Uncharacterized protein1.9e-3334.21Show/hide
Query:  IWSSTFK---VYHLPLIAPDHRPLLAEWSIEQRSYMRNDKDRPRRFEIGWIKYEECREIIEHTWRDVRQPDQRSIH--DKAKECLRRLSTWSRIKYDGSI
        +W   FK   + ++P+   DH  +     +     +R    R  RFE  W K+EEC ++I   WRD   P+  +    +K K C + L  WS+  + G+I
Subjt:  IWSSTFK---VYHLPLIAPDHRPLLAEWSIEQRSYMRNDKDRPRRFEIGWIKYEECREIIEHTWRDVRQPDQRSIH--DKAKECLRRLSTWSRIKYDGSI

Query:  KGAIAKVEKDIQNLSTHDDNRNLNELIQKEKELENLLEDDEIYWRQRAREEWLQWGDRNTKWFHMKASNRRKVNKVHGLMDEWGNWTKDDIGMESIATQY
        K  + ++E+ ++ L +++  ++  ++   + E+  LLE +E+YW+QRAR  WL+ GDRNTK+FH KA+ R+K N V GL+D+ G W +D   ME IA +Y
Subjt:  KGAIAKVEKDIQNLSTHDDNRNLNELIQKEKELENLLEDDEIYWRQRAREEWLQWGDRNTKWFHMKASNRRKVNKVHGLMDEWGNWTKDDIGMESIATQY

Query:  FQMLFQSSNP-DLDSIGWILDTIPSSITEEQNIALLKAFSCEEIHMVVKNMQPSKLQGQTEFRQFF
        F+ +F S+N  DLDS     + I   +T+  N +L   F  EE+   +  M  SK  G   F   F
Subjt:  FQMLFQSSNP-DLDSIGWILDTIPSSITEEQNIALLKAFSCEEIHMVVKNMQPSKLQGQTEFRQFF

A0A2N9IDC0 Uncharacterized protein4.9e-3435.55Show/hide
Query:  WSSTFK---VYHLPLIAPDHRPLLAEWSIEQRSYMRNDKDRPRRFEIGWIKYEECREIIEHTWRD--VRQPDQRSIHDKAKECLRRLSTWSRIKYDGSIK
        W S F+   V+HLP+   DH P+L   SI     ++  K +  +FE  W K EECR +IE TW            + +K K C   L  WSR+K+ GS  
Subjt:  WSSTFK---VYHLPLIAPDHRPLLAEWSIEQRSYMRNDKDRPRRFEIGWIKYEECREIIEHTWRD--VRQPDQRSIHDKAKECLRRLSTWSRIKYDGSIK

Query:  GAIAKVEKDIQNLSTHDDNRNLNELIQKEKELENLLEDDEIYWRQRAREEWLQWGDRNTKWFHMKASNRRKVNKVHGLMDEWGNWTKDDIGMESIATQYF
         +I    + +Q+L       +   +++ + EL  LLE +EI+WRQR+R  W+  GD+NTK+FH + ++RR++N + GL D  G W  D   +  IA  YF
Subjt:  GAIAKVEKDIQNLSTHDDNRNLNELIQKEKELENLLEDDEIYWRQRAREEWLQWGDRNTKWFHMKASNRRKVNKVHGLMDEWGNWTKDDIGMESIATQYF

Query:  QMLFQSSNPDLDSIGWILDTIPSSITEEQNIALLKAFSCEEIHMVVKNMQPSKLQG
        + +F S NP LDSI   L+ +   +T E N  L++ F+ +E+   +++M P+K  G
Subjt:  QMLFQSSNPDLDSIGWILDTIPSSITEEQNIALLKAFSCEEIHMVVKNMQPSKLQG

A0A2N9IPS8 Reverse transcriptase domain-containing protein7.1e-3333.72Show/hide
Query:  VYHLPLIAPDHRPLLAEWSIEQRSYMRNDKDRPRRFEIGWIKYEECREIIEHTWRD--VRQPDQRSIHDKAKECLRRLSTWSRIKYDGSIKGAIAKVEKD
        V HL +   DH P+L +  I     ++  K +  RFE  WIK E+CRE+I+H W D          + +K K C   L  WSR ++ GS+  +I +  + 
Subjt:  VYHLPLIAPDHRPLLAEWSIEQRSYMRNDKDRPRRFEIGWIKYEECREIIEHTWRD--VRQPDQRSIHDKAKECLRRLSTWSRIKYDGSIKGAIAKVEKD

Query:  IQNLSTHDDNRNLNELIQKEKELENLLEDDEIYWRQRAREEWLQWGDRNTKWFHMKASNRRKVNKVHGLMDEWGNWTKDDIGMESIATQYFQMLFQSSNP
        +Q+L     +     +++ + +L  LLE +EI+WRQR+R  W+  GD+NTK+FH + + RR+ N + GL D  G W  +   +  IA  YFQ +F SSNP
Subjt:  IQNLSTHDDNRNLNELIQKEKELENLLEDDEIYWRQRAREEWLQWGDRNTKWFHMKASNRRKVNKVHGLMDEWGNWTKDDIGMESIATQYFQMLFQSSNP

Query:  DLDSIGWILDTIPSSITEEQNIALLKAFSCEEIHMVVKNMQPSKLQGQTEFRQFFIKS
          +SI  +L  + S +T   N  L   F+ +E+ + +K M P+K  G       F ++
Subjt:  DLDSIGWILDTIPSSITEEQNIALLKAFSCEEIHMVVKNMQPSKLQGQTEFRQFFIKS

A0A803PY51 Uncharacterized protein1.2e-3233.33Show/hide
Query:  IWSSTFK---VYHLPLIAPDHRPLLAEWSIEQRSYMRNDKDRPR----RFEIGWIKYEECREIIEHTWRDVRQPDQRSIH-DKAKECLRRLSTWSRIKYD
        +W  TF+    +HL     DHR +  +  +   S  +  + RPR    RFE  W++ EE   +I+  W  V   +   I  +  + C   L  W + K+ 
Subjt:  IWSSTFK---VYHLPLIAPDHRPLLAEWSIEQRSYMRNDKDRPR----RFEIGWIKYEECREIIEHTWRDVRQPDQRSIH-DKAKECLRRLSTWSRIKYD

Query:  GSIKGAIAKVEKDIQNLSTHDDNRN--LNELIQKEKELENLLEDDEIYWRQRAREEWLQWGDRNTKWFHMKASNRRKVNKVHGLMDEWGNWTKDDIGMES
        GS K  I++ +K +++L+  +D     + EL   E  L++LL  +E YW+QR+R +WLQ GD+NTK+FH  AS+RRK N +  L D+ G       GM  
Subjt:  GSIKGAIAKVEKDIQNLSTHDDNRN--LNELIQKEKELENLLEDDEIYWRQRAREEWLQWGDRNTKWFHMKASNRRKVNKVHGLMDEWGNWTKDDIGMES

Query:  IATQYFQMLFQSSNPDLDSIGWILDTIPSSITEEQNIALLKAFSCEEIHMVVKNMQPSKLQGQTEFRQFF
        I T YF  LF++++ DLD++   +  IP+++T   N +L++ F+CEEI+  +K + P K  G       F
Subjt:  IATQYFQMLFQSSNPDLDSIGWILDTIPSSITEEQNIALLKAFSCEEIHMVVKNMQPSKLQGQTEFRQFF

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGAATATGGAGTAGCACATTTAAGGTCTATCATCTCCCCCTTATAGCACCGGATCACAGGCCACTGCTAGCAGAATGGTCAATAGAGCAGAGGAGTTACATGAGAAA
TGATAAGGATCGTCCAAGAAGATTTGAGATCGGTTGGATCAAATATGAGGAATGTCGGGAGATAATAGAACATACTTGGAGAGATGTGAGACAGCCCGATCAGAGATCGA
TTCACGACAAGGCTAAGGAGTGCCTGCGCCGGTTATCAACATGGAGTCGAATCAAGTATGATGGGTCTATTAAAGGAGCCATAGCTAAAGTGGAAAAGGATATTCAGAAT
TTATCAACCCATGATGACAACAGGAATCTGAATGAGTTGATTCAAAAAGAAAAAGAGTTAGAAAATTTACTTGAAGATGATGAGATTTACTGGAGGCAAAGGGCTAGGGA
GGAGTGGCTTCAATGGGGTGATAGGAATACCAAGTGGTTTCATATGAAGGCGTCCAATAGACGGAAAGTGAATAAAGTTCATGGTTTGATGGATGAGTGGGGAAATTGGA
CAAAGGATGATATTGGTATGGAAAGCATTGCTACCCAATATTTTCAAATGCTTTTCCAATCATCTAATCCAGACTTGGATTCTATTGGGTGGATTTTGGACACAATCCCA
TCAAGTATCACTGAGGAACAGAACATTGCTCTTTTGAAAGCTTTCTCTTGTGAAGAAATTCATATGGTTGTCAAGAACATGCAGCCATCAAAGCTCCAGGGCCAGACGGA
ATTCAGGCAATTTTTTATCAAAAGTATTGGGATATAG
mRNA sequenceShow/hide mRNA sequence
ATGAGAATATGGAGTAGCACATTTAAGGTCTATCATCTCCCCCTTATAGCACCGGATCACAGGCCACTGCTAGCAGAATGGTCAATAGAGCAGAGGAGTTACATGAGAAA
TGATAAGGATCGTCCAAGAAGATTTGAGATCGGTTGGATCAAATATGAGGAATGTCGGGAGATAATAGAACATACTTGGAGAGATGTGAGACAGCCCGATCAGAGATCGA
TTCACGACAAGGCTAAGGAGTGCCTGCGCCGGTTATCAACATGGAGTCGAATCAAGTATGATGGGTCTATTAAAGGAGCCATAGCTAAAGTGGAAAAGGATATTCAGAAT
TTATCAACCCATGATGACAACAGGAATCTGAATGAGTTGATTCAAAAAGAAAAAGAGTTAGAAAATTTACTTGAAGATGATGAGATTTACTGGAGGCAAAGGGCTAGGGA
GGAGTGGCTTCAATGGGGTGATAGGAATACCAAGTGGTTTCATATGAAGGCGTCCAATAGACGGAAAGTGAATAAAGTTCATGGTTTGATGGATGAGTGGGGAAATTGGA
CAAAGGATGATATTGGTATGGAAAGCATTGCTACCCAATATTTTCAAATGCTTTTCCAATCATCTAATCCAGACTTGGATTCTATTGGGTGGATTTTGGACACAATCCCA
TCAAGTATCACTGAGGAACAGAACATTGCTCTTTTGAAAGCTTTCTCTTGTGAAGAAATTCATATGGTTGTCAAGAACATGCAGCCATCAAAGCTCCAGGGCCAGACGGA
ATTCAGGCAATTTTTTATCAAAAGTATTGGGATATAG
Protein sequenceShow/hide protein sequence
MRIWSSTFKVYHLPLIAPDHRPLLAEWSIEQRSYMRNDKDRPRRFEIGWIKYEECREIIEHTWRDVRQPDQRSIHDKAKECLRRLSTWSRIKYDGSIKGAIAKVEKDIQN
LSTHDDNRNLNELIQKEKELENLLEDDEIYWRQRAREEWLQWGDRNTKWFHMKASNRRKVNKVHGLMDEWGNWTKDDIGMESIATQYFQMLFQSSNPDLDSIGWILDTIP
SSITEEQNIALLKAFSCEEIHMVVKNMQPSKLQGQTEFRQFFIKSIGI