CuGenDBv2

Lag0022734 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0022734
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Reverse transcriptase domain-containing protein
Genome location: chr7:36702085..36702825
RNA-Seq Expression: Lag0022734
Synteny: Lag0022734
Gene Ontology terms: NA
InterPro domains: IPR036691 - Endonuclease/exonuclease/phosphatase superfamily
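
The genome location above is given as a `sequence:start..end` string. As a minimal sketch, assuming the coordinates are 1-based and inclusive (the record itself does not state the convention), the string can be parsed and the span length computed as follows. The helper names `parse_location` and `span_length` are hypothetical and are not part of any CuGenDB tooling.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'name:start..end' string into (name, start, end)."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    name, start, end = match.groups()
    return name, int(start), int(end)


def span_length(start: int, end: int) -> int:
    """Length of the interval, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


name, start, end = parse_location("chr7:36702085..36702825")
print(name, span_length(start, end))  # prints: chr7 741
```

Under a 0-based, half-open convention the length would instead be `end - start`, so the convention should be confirmed against the database documentation before relying on this number.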


Homology

GenBank top hits (e value, % identity, alignment)
KAG2725981.1 hypothetical protein I3760_01G090600 [Carya illinoinensis]; e value: 4.3e-39; % identity: 37.02
Query:  WRFTRIYGNPQREKHQETWALMNRLKDTSGMPWVVGGDFNEITNNSEKLGGLIRAEKDIQDFRECIDICGLSDPGFMGSAYTWCNNHFQTEIIWERLDRF
        W+FT +YG+P+ EK +ETW+L+  L+    +PW+V GDFNE+ +  EKLGG  R E+ +Q FR  ID C L D GF G  YTWCN  ++  ++ ERLDRF
Subjt:  WRFTRIYGNPQREKHQETWALMNRLKDTSGMPWVVGGDFNEITNNSEKLGGLIRAEKDIQDFRECIDICGLSDPGFMGSAYTWCNNHFQTEIIWERLDRF

Query:  LINPEMICWCKEFKVHHLPFMASDHCPLLAEWSKERTTQRMNGFKFPRRFEDVWVKYEDCKEIVDQIWRGARHSGR--RSIVEKSKECISRLSSWSRRKY
        +  P        F+V H    +SDH P++   S  +   R   F    RFE +WV+ ++C +++ + W+G  H+ R   SI+++   C   L  W+R+K+
Subjt:  LINPEMICWCKEFKVHHLPFMASDHCPLLAEWSKERTTQRMNGFKFPRRFEDVWVKYEDCKEIVDQIWRGARHSGR--RSIVEKSKECISRLSSWSRRKY

Query:  EGSIKCAISRKEKELQEISNNNDKSNMMDRIQKGK
         G +K  I R  + LQ I      S   + ++K +
Subjt:  EGSIKCAISRKEKELQEISNNNDKSNMMDRIQKGK

XP_022155286.1 uncharacterized protein LOC111022423 [Momordica charantia]; e value: 5.1e-48; % identity: 42.92
Query:  MVSDTEGIWRFTRIYGNPQREKHQETWALMNRLKDTSGMPWVVGGDFNEITNNSEKLGGLIRAEKDIQDFRECIDICGLSDPGFMGSAYTWCNNHFQTEI
        MV +    WRFT IYG+  ++   ETW L+ RL     +PW++GGDFNEI  NSEKL G+ R +  +Q+F++ +D+CGL DPGF+G  +TWC+ H   + 
Subjt:  MVSDTEGIWRFTRIYGNPQREKHQETWALMNRLKDTSGMPWVVGGDFNEITNNSEKLGGLIRAEKDIQDFRECIDICGLSDPGFMGSAYTWCNNHFQTEI

Query:  IWERLDRFLINPEMICWCKEFKVHHLPFMASDHCPLLAEW--SKERTTQRMNGFKFPRRFEDVWVKYEDCKEIVDQIWRGARHSGRRSIVEKSKECISRL
        IWERLDRFLIN  +    +  ++ HL F+ASDH P+LAEW    E T  R  G + P RFE+ W  +++CKEIV ++W             K   C+  L
Subjt:  IWERLDRFLINPEMICWCKEFKVHHLPFMASDHCPLLAEW--SKERTTQRMNGFKFPRRFEDVWVKYEDCKEIVDQIWRGARHSGRRSIVEKSKECISRL

Query:  SSWSRRKYEGSIKCAISRKEKELQEI
          W+  +  GS++ AI RKE E+Q +
Subjt:  SSWSRRKYEGSIKCAISRKEKELQEI

XP_023895448.1 uncharacterized protein LOC112007343 [Quercus suber]; e value: 6.3e-38; % identity: 37.07
Query:  WRFTRIYGNPQREKHQETWALMNRLKDTSGMPWVVGGDFNEITNNSEKLGGLIRAEKDIQDFRECIDICGLSDPGFMGSAYTWCNNHFQTEIIWERLDRF
        WR T  YGNP     + +WAL+  L     +PW+  GDFNEIT   EK GG +R+E+ ++ FR+ +D CG  D GF+GS +TWCNN F  E+ W  LDR 
Subjt:  WRFTRIYGNPQREKHQETWALMNRLKDTSGMPWVVGGDFNEITNNSEKLGGLIRAEKDIQDFRECIDICGLSDPGFMGSAYTWCNNHFQTEIIWERLDRF

Query:  LINPEMICWCKEFKVHHLPFMASDHCPLLAEW-SKERTTQRMNGFKFPRRFEDVWVKYEDCKEIVDQIWRG-ARHSGRRSIVEKSKECISRLSSWSRRKY
        +  P    +    +VHHLP   SDHCPL   W   +    R      P RFE VW+K E C+ ++ + W G         +++K   C S L +WSR  +
Subjt:  LINPEMICWCKEFKVHHLPFMASDHCPLLAEW-SKERTTQRMNGFKFPRRFEDVWVKYEDCKEIVDQIWRG-ARHSGRRSIVEKSKECISRLSSWSRRKY

Query:  EGSIKCAISRKEKELQEISNNNDKSNMMDRIQ
         G+I+  +++K+K+L +    +      D+I+
Subjt:  EGSIKCAISRKEKELQEISNNNDKSNMMDRIQ

XP_030923017.1 uncharacterized protein LOC115949892 [Quercus lobata]; e value: 1.3e-38; % identity: 39.07
Query:  WRFTRIYGNPQREKHQETWALMNRLKDTSGMPWVVGGDFNEITNNSEKLGGLIRAEKDIQDFRECIDICGLSDPGFMGSAYTWCNNHFQTEIIWERLDRF
        WRFT  YG P+    +++W+++  L     +PWV  GDFNEIT   EK GG IR EK +QDFR+C+D+CGL D GF G  +TWCN  +   ++W RLDR 
Subjt:  WRFTRIYGNPQREKHQETWALMNRLKDTSGMPWVVGGDFNEITNNSEKLGGLIRAEKDIQDFRECIDICGLSDPGFMGSAYTWCNNHFQTEIIWERLDRF

Query:  LINPEMICWCKEFKVHHLPFMASDHCPLLAEWSKERTTQ-RMNGFKFPRRFEDVWVKYEDCKEIVDQIW-RGARHSGRRSIVEKSKECISRLSSWSRRKY
        +   + I      ++HHLP  +SDH PL   W      Q R    K P RFE +W+  E C+ +V  +W R         I+ K +EC ++L  W +  +
Subjt:  LINPEMICWCKEFKVHHLPFMASDHCPLLAEWSKERTTQ-RMNGFKFPRRFEDVWVKYEDCKEIVDQIW-RGARHSGRRSIVEKSKECISRLSSWSRRKY

Query:  EGSIKCAISRKEKEL
         G+++ A++R  K L
Subjt:  EGSIKCAISRKEKEL

XP_030970102.1 uncharacterized protein LOC115990406 [Quercus lobata]; e value: 4.8e-38; % identity: 38.6
Query:  WRFTRIYGNPQREKHQETWALMNRLKDTSGMPWVVGGDFNEITNNSEKLGGLIRAEKDIQDFRECIDICGLSDPGFMGSAYTWCNNHFQTEIIWERLDRF
        WRFT  YG P+    +++W+++  L     +PWV  GDFNEIT   EK GG IR EK +QDFR+C+D+CGL D GF G  +TWCN  +   ++W RLDR 
Subjt:  WRFTRIYGNPQREKHQETWALMNRLKDTSGMPWVVGGDFNEITNNSEKLGGLIRAEKDIQDFRECIDICGLSDPGFMGSAYTWCNNHFQTEIIWERLDRF

Query:  LINPEMICWCKEFKVHHLPFMASDHCPLLAEWSKERTTQ-RMNGFKFPRRFEDVWVKYEDCKEIVDQIW-RGARHSGRRSIVEKSKECISRLSSWSRRKY
        +   + I      ++HHLP  +SDH PL   W      Q R +  K P RFE +W+  E C+ +V  +W R         ++ K +EC  +L  W +  +
Subjt:  LINPEMICWCKEFKVHHLPFMASDHCPLLAEWSKERTTQ-RMNGFKFPRRFEDVWVKYEDCKEIVDQIW-RGARHSGRRSIVEKSKECISRLSSWSRRKY

Query:  EGSIKCAISRKEKEL
         G++  A++R  K L
Subjt:  EGSIKCAISRKEKEL

TrEMBL top hits (e value, % identity, alignment)
A0A2N9ERX7 CCHC-type domain-containing protein; e value: 1.4e-38; % identity: 37.08
Query:  MVSDTEGIWRFTRIYGNPQREKHQETWALMNRLKDTSGMPWVVGGDFNEITNNSEKLGGLIRAEKDIQDFRECIDICGLSDPGFMGSAYTWCNNHFQTEI
        +V DT  ++R TR YGNP+  K +ETWAL+  L   +  PWV  GDFNE+ + +E++G     +  I+DFRE +D C L D GF+G+ +TW      +  
Subjt:  MVSDTEGIWRFTRIYGNPQREKHQETWALMNRLKDTSGMPWVVGGDFNEITNNSEKLGGLIRAEKDIQDFRECIDICGLSDPGFMGSAYTWCNNHFQTEI

Query:  IWERLDRFLINPEMICWCKEF---KVHHLPFMASDHCPLLAEWSKERTTQRMNGFKFPRRFEDVWVKYEDCKEIVDQIWRGARHSGRR--SIVEKSKECI
        I ERLDR L +   + W  +F   KV HL  ++SDHCPLL E  +    ++    K    F+ +W+K + CK +++Q W      G     + EK K C 
Subjt:  IWERLDRFLINPEMICWCKEF---KVHHLPFMASDHCPLLAEWSKERTTQRMNGFKFPRRFEDVWVKYEDCKEIVDQIWRGARHSGRR--SIVEKSKECI

Query:  SRLSSWSRRKYEGSIKCAISRKEKELQEISNNNDKSNMMD
          L SWS+ ++ GS+   I  K K+LQ ++N     N +D
Subjt:  SRLSSWSRRKYEGSIKCAISRKEKELQEISNNNDKSNMMD

A0A2N9FVV5 Reverse transcriptase domain-containing protein; e value: 2.1e-39; % identity: 37.27
Query:  TEGIWRFTRIYGNPQREKHQETWALMNRLKDTSGMPWVVGGDFNEITNNSEKLGGLIRAEKDIQDFRECIDICGLSDPGFMGSAYTWCNNHFQTEIIWER
        T+ IWRF   YG P+ +    +W ++  L   S +PW   GDFNE+ +  EK GG  R E  +Q FR+ +D CG  D GF G  +TWCNN      +WE+
Subjt:  TEGIWRFTRIYGNPQREKHQETWALMNRLKDTSGMPWVVGGDFNEITNNSEKLGGLIRAEKDIQDFRECIDICGLSDPGFMGSAYTWCNNHFQTEIIWER

Query:  LDRFLINPEMICWCKEFKVHHLPFMASDHCPLLAEWSKERTTQRMNGFKFPRRFEDVWVKYEDCKEIVDQIWRGAR-HSGRRSIVEKSKECISRLSSWSR
        LDR ++N E +   +E++VHH+    SDH PL    +  R  +R   F    RFE +W+  E CK+ ++  WRG R  S    + ++ + C +RL SWS 
Subjt:  LDRFLINPEMICWCKEFKVHHLPFMASDHCPLLAEWSKERTTQRMNGFKFPRRFEDVWVKYEDCKEIVDQIWRGAR-HSGRRSIVEKSKECISRLSSWSR

Query:  RKYEGSIKCAISRKEKELQE
          + GS+   ++ K K+L E
Subjt:  RKYEGSIKCAISRKEKELQE

A0A2N9IMU2 Reverse transcriptase domain-containing protein; e value: 7.2e-40; % identity: 37.17
Query:  EGIWRFTRIYGNPQREKHQETWALMNRLKDTSGMPWVVGGDFNEITNNSEKLGGLIRAEKDIQDFRECIDICGLSDPGFMGSAYTWCNNHFQTEIIWERL
        E  WRFT  YG  +  +H E+W L++ L   S +PW   GD+NE+T+  EK+GG+I +E+ +QDFR+ ID CG  D GF+G  +TWCNN   +  IWERL
Subjt:  EGIWRFTRIYGNPQREKHQETWALMNRLKDTSGMPWVVGGDFNEITNNSEKLGGLIRAEKDIQDFRECIDICGLSDPGFMGSAYTWCNNHFQTEIIWERL

Query:  DRFLINPEMICWCKEFKVHHLPFMASDHCPL---LAEWSKERTTQRMNGFKFPRRFEDVWVKYEDCKEIVDQIWR-GARHSGRRSIVEKSKECISRLSSW
        DR L   E +       +HH+    SDHCPL   L   +   ++QR      P RFE++W+    C+++V+Q W+   R      + +K + C   L SW
Subjt:  DRFLINPEMICWCKEFKVHHLPFMASDHCPL---LAEWSKERTTQRMNGFKFPRRFEDVWVKYEDCKEIVDQIWR-GARHSGRRSIVEKSKECISRLSSW

Query:  SRRKYEGSIKCAISRKE--KELQEIS
        S+  +    +  I +K   KE++ +S
Subjt:  SRRKYEGSIKCAISRKE--KELQEIS

A0A2N9IWN7 Uncharacterized protein; e value: 1.2e-37; % identity: 35.87
Query:  WRFTRIYGNPQREKHQETWALMNRLKDTSGMPWVVGGDFNEITNNSEKLGGLIRAEKDIQDFRECIDICGLSDPGFMGSAYTWCNNHFQTEIIWERLDRF
        WRFT  YG  +  +  E+W+L+  L   S +PW   GDFNE+ +  EK GG IR+ + +QDFR+ ID CG  D G+ GS +TWCNN   T  +WERLDR 
Subjt:  WRFTRIYGNPQREKHQETWALMNRLKDTSGMPWVVGGDFNEITNNSEKLGGLIRAEKDIQDFRECIDICGLSDPGFMGSAYTWCNNHFQTEIIWERLDRF

Query:  LINPEMICWCKEFKVHHLPFMASDHCPLLAEWSKERTTQRMNGFKFPRRFEDVWVKYEDCKEIVDQIWRGARH-SGRRSIVEKSKECISRLSSWSRRKYE
        L     I      +VHHL  ++SDHCP+  ++S    ++      F  RF+++W+ +  CKE +   W+  +H +    + +K + C + L  WSR  + 
Subjt:  LINPEMICWCKEFKVHHLPFMASDHCPLLAEWSKERTTQRMNGFKFPRRFEDVWVKYEDCKEIVDQIWRGARH-SGRRSIVEKSKECISRLSSWSRRKYE

Query:  GSIKCAISRKEKELQEISNNNDK
        G++   + +K   L+E  + + K
Subjt:  GSIKCAISRKEKELQEISNNNDK

A0A6J1DRA0 uncharacterized protein LOC111022423; e value: 2.5e-48; % identity: 42.92
Query:  MVSDTEGIWRFTRIYGNPQREKHQETWALMNRLKDTSGMPWVVGGDFNEITNNSEKLGGLIRAEKDIQDFRECIDICGLSDPGFMGSAYTWCNNHFQTEI
        MV +    WRFT IYG+  ++   ETW L+ RL     +PW++GGDFNEI  NSEKL G+ R +  +Q+F++ +D+CGL DPGF+G  +TWC+ H   + 
Subjt:  MVSDTEGIWRFTRIYGNPQREKHQETWALMNRLKDTSGMPWVVGGDFNEITNNSEKLGGLIRAEKDIQDFRECIDICGLSDPGFMGSAYTWCNNHFQTEI

Query:  IWERLDRFLINPEMICWCKEFKVHHLPFMASDHCPLLAEW--SKERTTQRMNGFKFPRRFEDVWVKYEDCKEIVDQIWRGARHSGRRSIVEKSKECISRL
        IWERLDRFLIN  +    +  ++ HL F+ASDH P+LAEW    E T  R  G + P RFE+ W  +++CKEIV ++W             K   C+  L
Subjt:  IWERLDRFLINPEMICWCKEFKVHHLPFMASDHCPLLAEW--SKERTTQRMNGFKFPRRFEDVWVKYEDCKEIVDQIWRGARHSGRRSIVEKSKECISRL

Query:  SSWSRRKYEGSIKCAISRKEKELQEI
          W+  +  GS++ AI RKE E+Q +
Subjt:  SSWSRRKYEGSIKCAISRKEKELQEI

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
No hits found

Sequences

CDS sequence
ATGGTTTCAGACACAGAGGGAATTTGGAGGTTTACTAGAATCTACGGAAATCCTCAGCGCGAAAAGCATCAAGAGACTTGGGCACTTATGAACAGATTGAAGGACACATC
TGGAATGCCATGGGTAGTGGGTGGTGATTTTAATGAGATCACTAATAATTCTGAAAAATTGGGCGGTTTAATCCGGGCAGAAAAAGATATTCAAGACTTCAGAGAATGTA
TAGACATTTGTGGCCTTAGTGATCCAGGGTTCATGGGCTCGGCGTATACCTGGTGCAACAACCATTTCCAAACTGAAATCATATGGGAACGGTTGGACAGATTCCTTATA
AATCCAGAAATGATATGTTGGTGCAAGGAGTTTAAGGTTCATCATCTTCCTTTCATGGCCTCAGATCATTGCCCGCTATTGGCTGAATGGTCAAAGGAAAGGACAACTCA
AAGGATGAATGGATTCAAGTTTCCTAGAAGATTTGAGGATGTTTGGGTGAAATATGAAGACTGCAAGGAAATTGTGGATCAGATTTGGAGGGGTGCTCGTCATTCAGGCA
GGAGATCAATTGTGGAAAAGAGTAAGGAATGCATATCTAGACTTTCATCATGGAGTCGAAGGAAATATGAGGGATCGATCAAATGTGCTATTTCGAGAAAGGAGAAGGAG
CTTCAGGAGATTAGCAACAATAATGACAAAAGTAATATGATGGATAGGATTCAGAAAGGAAAGAGCTTGAAAACCTGTTAG

mRNA sequence
ATGGTTTCAGACACAGAGGGAATTTGGAGGTTTACTAGAATCTACGGAAATCCTCAGCGCGAAAAGCATCAAGAGACTTGGGCACTTATGAACAGATTGAAGGACACATC
TGGAATGCCATGGGTAGTGGGTGGTGATTTTAATGAGATCACTAATAATTCTGAAAAATTGGGCGGTTTAATCCGGGCAGAAAAAGATATTCAAGACTTCAGAGAATGTA
TAGACATTTGTGGCCTTAGTGATCCAGGGTTCATGGGCTCGGCGTATACCTGGTGCAACAACCATTTCCAAACTGAAATCATATGGGAACGGTTGGACAGATTCCTTATA
AATCCAGAAATGATATGTTGGTGCAAGGAGTTTAAGGTTCATCATCTTCCTTTCATGGCCTCAGATCATTGCCCGCTATTGGCTGAATGGTCAAAGGAAAGGACAACTCA
AAGGATGAATGGATTCAAGTTTCCTAGAAGATTTGAGGATGTTTGGGTGAAATATGAAGACTGCAAGGAAATTGTGGATCAGATTTGGAGGGGTGCTCGTCATTCAGGCA
GGAGATCAATTGTGGAAAAGAGTAAGGAATGCATATCTAGACTTTCATCATGGAGTCGAAGGAAATATGAGGGATCGATCAAATGTGCTATTTCGAGAAAGGAGAAGGAG
CTTCAGGAGATTAGCAACAATAATGACAAAAGTAATATGATGGATAGGATTCAGAAAGGAAAGAGCTTGAAAACCTGTTAG

Protein sequence
MVSDTEGIWRFTRIYGNPQREKHQETWALMNRLKDTSGMPWVVGGDFNEITNNSEKLGGLIRAEKDIQDFRECIDICGLSDPGFMGSAYTWCNNHFQTEIIWERLDRFLI
NPEMICWCKEFKVHHLPFMASDHCPLLAEWSKERTTQRMNGFKFPRRFEDVWVKYEDCKEIVDQIWRGARHSGRRSIVEKSKECISRLSSWSRRKYEGSIKCAISRKEKE
LQEISNNNDKSNMMDRIQKGKSLKTC
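
The CDS and protein sequences listed above can be cross-checked by translating the nucleotide sequence. The following is a minimal sketch, assuming the standard nuclear genetic code applies to this gene; the function name `translate` is hypothetical, and the two inputs are the first 30 and last 9 nucleotides of the CDS as listed in this record.

```python
# Standard genetic code, built in TCAG order (an assumption for this gene).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = dict(
    zip(
        (a + b + c for a in BASES for b in BASES for c in BASES),
        AMINO_ACIDS,
    )
)


def translate(cds: str) -> str:
    """Translate an in-frame DNA sequence; '*' marks a stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# First 30 nt of the CDS: should match the start of the protein sequence.
print(translate("ATGGTTTCAGACACAGAGGGAATTTGGAGG"))  # prints: MVSDTEGIWR
# Last 9 nt of the CDS: the final two residues followed by the stop codon.
print(translate("ACCTGTTAG"))  # prints: TC*
```

To check the full record, the CDS lines would be joined into one string before calling `translate`, and the result (minus the trailing `*`) compared against the joined protein lines.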