CuGenDBv2

Lag0006084 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0006084
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Reverse transcriptase domain-containing protein
Genome location: chr6:37112757..37113531
RNA-Seq Expression: Lag0006084
Synteny: Lag0006084
Gene Ontology terms: NA
InterPro domains: NA
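
The genome location field follows a `chromosome:start..end` pattern. As a minimal sketch, assuming the coordinates are 1-based and inclusive (the page does not state this), the field can be split and the genomic span computed as follows. The function names `parse_location` and `span_length` are hypothetical helpers, not part of any CuGenDB tool.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognized location format: {location!r}")
    chrom, start, end = match.groups()
    return chrom, int(start), int(end)


def span_length(start: int, end: int) -> int:
    """Length of the interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


chrom, start, end = parse_location("chr6:37112757..37113531")
print(chrom, start, end, span_length(start, end))
```

Under the inclusive-coordinate assumption, the span for Lag0006084 is 775 bp.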


Homology
GenBank top hits (e value, % identity, alignment)
PNX95452.1 ribonuclease H, partial [Trifolium pratense] | e value: 3.4e-48 | % identity: 40.08
Query:  MGFSEGWIKNIMKCVETVTFSVLINCVSQEEFKTERGIRQGDPLSPYIFLM--------------------------------------------SSKEE
        MGF       IM+CV  VTFS+LIN    + F  +RG+RQGDPLSPY+F++                                            ++KEE
Subjt:  MGFSEGWIKNIMKCVETVTFSVLINCVSQEEFKTERGIRQGDPLSPYIFLM--------------------------------------------SSKEE

Query:  CLVIKNVFKSYELASGQAINLDKSRFMVSKNVNQGEAHVIGDFLDIKKANNLGTYLGMPSNTSRRKAQMFSKIKEKIRNILQGWSERLFSTVGKEILIKE
          VIKN+   Y+ ASGQ +N+DKS  M SK   Q     + + L +K+  +   YLGMP+   R K ++F  I+++I N L+GW E+  S  G+  LIK 
Subjt:  CLVIKNVFKSYELASGQAINLDKSRFMVSKNVNQGEAHVIGDFLDIKKANNLGTYLGMPSNTSRRKAQMFSKIKEKIRNILQGWSERLFSTVGKEILIKE

Query:  VIQAIPTYSMSCFKLPKGLCKDINKMCARFWWGAIGDKRKLHWSKWKELCKSKVAGG
        V QAIPTY MSCF LPK LC  I +M  +FWWG   DK+K+HW +W  LCK+K  GG
Subjt:  VIQAIPTYSMSCFKLPKGLCKDINKMCARFWWGAIGDKRKLHWSKWKELCKSKVAGG

XP_023909336.1 uncharacterized protein LOC112020997 [Quercus suber] | e value: 3.4e-48 | % identity: 39.3
Query:  MGFSEGWIKNIMKCVETVTFSVLINCVSQEEFKTERGIRQGDPLSPYIFLM--------------------------------------------SSKEE
        +GF+  WI  I  C+ TV+FS+LIN         +RG+RQGDPLSPY+FL+                                            ++  E
Subjt:  MGFSEGWIKNIMKCVETVTFSVLINCVSQEEFKTERGIRQGDPLSPYIFLM--------------------------------------------SSKEE

Query:  CLVIKNVFKSYELASGQAINLDKSRFMVSKNVNQGEAHVIGDFLDIKKANNLGTYLGMPSNTSRRKAQMFSKIKEKIRNILQGWSERLFSTVGKEILIKE
        C  +  + + YE ASGQ IN DK++   S N NQ   + I   L +  ++ L  YLG+PS   R K Q FS I+E+I   +QGW E+L S  GKE+LIK 
Subjt:  CLVIKNVFKSYELASGQAINLDKSRFMVSKNVNQGEAHVIGDFLDIKKANNLGTYLGMPSNTSRRKAQMFSKIKEKIRNILQGWSERLFSTVGKEILIKE

Query:  VIQAIPTYSMSCFKLPKGLCKDINKMCARFWWGAIGDKRKLHWSKWKELCKSKVAGG
        ++QA+PTYSM+CFKLP+ LCKDI  +  +FWWG  G++RK HW  W ++C  K  GG
Subjt:  VIQAIPTYSMSCFKLPKGLCKDINKMCARFWWGAIGDKRKLHWSKWKELCKSKVAGG

XP_024177965.2 uncharacterized protein LOC112183883 [Rosa chinensis] | e value: 4.0e-49 | % identity: 39.69
Query:  MGFSEGWIKNIMKCVETVTFSVLINCVSQEEFKTERGIRQGDPLSPYIFLMSSK--------------------------------------------EE
        +GF++ W+  IMKCV TV ++ LIN   +      RG+RQGDPLSPY+FL+ ++                                             E
Subjt:  MGFSEGWIKNIMKCVETVTFSVLINCVSQEEFKTERGIRQGDPLSPYIFLMSSK--------------------------------------------EE

Query:  CLVIKNVFKSYELASGQAINLDKSRFMVSKNVNQGEAHVIGDFLDIKKANNLGTYLGMPSNTSRRKAQMFSKIKEKIRNILQGWSERLFSTVGKEILIKE
        CL I+NV   YELASGQ IN  KS  + SK V + +   +  FL +       TYLG+P+   R + + F+ IKEK+   L GW  +L S+ GK++LI+ 
Subjt:  CLVIKNVFKSYELASGQAINLDKSRFMVSKNVNQGEAHVIGDFLDIKKANNLGTYLGMPSNTSRRKAQMFSKIKEKIRNILQGWSERLFSTVGKEILIKE

Query:  VIQAIPTYSMSCFKLPKGLCKDINKMCARFWWGAIGDKRKLHWSKWKELCKSKVAGG
        V QA+P+Y+MSCF LPKG C D+++MCARFWWG+  ++RK+HW  W+ LC+SK  GG
Subjt:  VIQAIPTYSMSCFKLPKGLCKDINKMCARFWWGAIGDKRKLHWSKWKELCKSKVAGG

XP_030497433.1 uncharacterized protein LOC115713087 [Cannabis sativa] | e value: 3.4e-48 | % identity: 43.22
Query:  MGFSEGWIKNIMKCVETVTFSVLINCVSQEEFKTERGIRQGDPLSPYIFLMSS-----------------------KEECLVIKNVFKSYELASGQAINL
        +G+ E WI+ IM+CV +V+FSVLIN      F   RG+RQGD LSPY+FL+ S                       ++EC  +  +F+ Y   SGQ INL
Subjt:  MGFSEGWIKNIMKCVETVTFSVLINCVSQEEFKTERGIRQGDPLSPYIFLMSS-----------------------KEECLVIKNVFKSYELASGQAINL

Query:  DKSRFMVSKNVNQGEAHVIGDFLDIKKANNLGTYLGMPSNTSRRKAQMFSKIKEKIRNILQGWSERLFSTVGKEILIKEVIQAIPTYSMSCFKLPKGLCK
        +KS   +S +V++     + + L +    N   YLG+PS   RRK ++F  IK+K  N L+ W   +FS  GKEILIK VIQAIP+Y MSCF+LPK L K
Subjt:  DKSRFMVSKNVNQGEAHVIGDFLDIKKANNLGTYLGMPSNTSRRKAQMFSKIKEKIRNILQGWSERLFSTVGKEILIKEVIQAIPTYSMSCFKLPKGLCK

Query:  DINKMCARFWWGAIGDKRKLHWSKWKELCKSKVAGG
        +++ + A FWWG   + +KLHWS W +LCK K  GG
Subjt:  DINKMCARFWWGAIGDKRKLHWSKWKELCKSKVAGG

XP_030969743.1 uncharacterized protein LOC115990020 [Quercus lobata] | e value: 2.0e-48 | % identity: 42.02
Query:  MGFSEGWIKNIMKCVETVTFSVLINCVSQEEFKTERGIRQGDPLSPYIFLM--------------------------------------------SSKEE
        +GF E W+  IM+C+ TVT+S+L+N   +      RG+RQGDPLSPY+FL                                             SS EE
Subjt:  MGFSEGWIKNIMKCVETVTFSVLINCVSQEEFKTERGIRQGDPLSPYIFLM--------------------------------------------SSKEE

Query:  CLVIKNVFKSYELASGQAINLDKSRFMVSKNVNQGEAHVIGDFLDIKKANNLGTYLGMPSNTSRRKAQMFSKIKEKIRNILQGWSERLFSTVGKEILIKE
        C  IK +   YE ASGQ +N DK+    SKN +      I + L +   ++   YLG+PS   R K   F++IKE+I   +QGW E+L S  GKEI+IK 
Subjt:  CLVIKNVFKSYELASGQAINLDKSRFMVSKNVNQGEAHVIGDFLDIKKANNLGTYLGMPSNTSRRKAQMFSKIKEKIRNILQGWSERLFSTVGKEILIKE

Query:  VIQAIPTYSMSCFKLPKGLCKDINKMCARFWWGAIGDKRKLHWSKWKELCKSKVAGG
        V+Q+IPTYSMS FKLP GLCKDI  M  +FWWG  G+ RKLHW  W  LC SK  GG
Subjt:  VIQAIPTYSMSCFKLPKGLCKDINKMCARFWWGAIGDKRKLHWSKWKELCKSKVAGG

TrEMBL top hits (e value, % identity, alignment)
A0A2N9EK17 Reverse transcriptase domain-containing protein | e value: 1.1e-49 | % identity: 42.67
Query:  MGFSEGWIKNIMKCVETVTFSVLINCVSQEEFKTERGIRQGDPLSPYIFLMSS-------------------KEECLVIKNVFKSYELASGQAINLDKSR
        MGF   W+  IM+C+ TV +SVLIN       +  RG+RQGDPLSPY+FL+ +                   + + + +K +  +YE+ASGQ +N +KS 
Subjt:  MGFSEGWIKNIMKCVETVTFSVLINCVSQEEFKTERGIRQGDPLSPYIFLMSS-------------------KEECLVIKNVFKSYELASGQAINLDKSR

Query:  FMVSKNVNQGEAHVIGDFLDIKKANNLGTYLGMPSNTSRRKAQMFSKIKEKIRNILQGWSERLFSTVGKEILIKEVIQAIPTYSMSCFKLPKGLCKDINK
        F  SKN  Q     I   L     +N G YLG+P    R+K Q F + K +I   LQGW  +L S  G+E+LIK V  AIPTY+MSCFK+P+ LC +I  
Subjt:  FMVSKNVNQGEAHVIGDFLDIKKANNLGTYLGMPSNTSRRKAQMFSKIKEKIRNILQGWSERLFSTVGKEILIKEVIQAIPTYSMSCFKLPKGLCKDINK

Query:  MCARFWWGAIGDKRKLHWSKWKELCKSKVAGG
        + +RFWWG  GD+RK+HW KW++L + K  GG
Subjt:  MCARFWWGAIGDKRKLHWSKWKELCKSKVAGG

A0A2N9EMD0 Uncharacterized protein | e value: 1.1e-49 | % identity: 42.86
Query:  MGFSEGWIKNIMKCVETVTFSVLINCVSQEEFKTERGIRQGDPLSPYIFLMSSKEE-----------CLVIKNVFKSYELASGQAINLDKSRFMVSKNVN
        +GF   WI  I +C+ TV++S+L+N       +  RG+RQGDPLSPY+FL+ ++++           C  I+ +   YE ASGQ +N DK+    SK+  
Subjt:  MGFSEGWIKNIMKCVETVTFSVLINCVSQEEFKTERGIRQGDPLSPYIFLMSSKEE-----------CLVIKNVFKSYELASGQAINLDKSRFMVSKNVN

Query:  QGEAHVIGDFLDIKKANNLGTYLGMPSNTSRRKAQMFSKIKEKIRNILQGWSERLFSTVGKEILIKEVIQAIPTYSMSCFKLPKGLCKDINKMCARFWWG
        +   HVI + L +        YLG+PS   R + + F+KIKE++   L+GW E+L S  G+EILIK V QAIPTYSMSCF+LP  LC ++  M  RFWW 
Subjt:  QGEAHVIGDFLDIKKANNLGTYLGMPSNTSRRKAQMFSKIKEKIRNILQGWSERLFSTVGKEILIKEVIQAIPTYSMSCFKLPKGLCKDINKMCARFWWG

Query:  AIGDKRKLHWSKWKELCKSKVAGG
           ++RK+HW  W++LCK K  GG
Subjt:  AIGDKRKLHWSKWKELCKSKVAGG

A0A2N9HYS7 Uncharacterized protein | e value: 1.0e-50 | % identity: 45.41
Query:  MGFSEGWIKNIMKCVETVTFSVLINCVSQEEFKTERGIRQGDPLSPYIFLMSSKE-ECLVIKNVFKS---------------YELASGQAINLDKSRFMV
        MGF + WI  IM+C+ TVT+S+LIN          RG+RQGDP+SPY+FL+ ++    L+ K  F+                YE ASGQ +N  K+    
Subjt:  MGFSEGWIKNIMKCVETVTFSVLINCVSQEEFKTERGIRQGDPLSPYIFLMSSKE-ECLVIKNVFKS---------------YELASGQAINLDKSRFMV

Query:  SKNVNQGEAHVIGDFLDIKKANNLGTYLGMPSNTSRRKAQMFSKIKEKIRNILQGWSERLFSTVGKEILIKEVIQAIPTYSMSCFKLPKGLCKDINKMCA
        SKN  Q +   I + L +        YLG+PS   + K   FS+IKE++ + ++GW E+L S  G+EILIK V+QAIPTY+M+CFKLP  LCK+I  +  
Subjt:  SKNVNQGEAHVIGDFLDIKKANNLGTYLGMPSNTSRRKAQMFSKIKEKIRNILQGWSERLFSTVGKEILIKEVIQAIPTYSMSCFKLPKGLCKDINKMCA

Query:  RFWWGAIGDKRKLHWSKWKELCKSKVAGG
        RFWWG  GDKRK+HW KW++LC+SK AGG
Subjt:  RFWWGAIGDKRKLHWSKWKELCKSKVAGG

A0A2N9J7Z5 Reverse transcriptase domain-containing protein | e value: 1.1e-49 | % identity: 40.47
Query:  MGFSEGWIKNIMKCVETVTFSVLINCVSQEEFKTERGIRQGDPLSPYIFLM--------------------------------------------SSKEE
        MGF+  WI  +M+CV TV++SVL+N       K  RG+RQGDPLSPY+FL+                                            ++ EE
Subjt:  MGFSEGWIKNIMKCVETVTFSVLINCVSQEEFKTERGIRQGDPLSPYIFLM--------------------------------------------SSKEE

Query:  CLVIKNVFKSYELASGQAINLDKSRFMVSKNVNQGEAHVIGDFLDIKKANNLGTYLGMPSNTSRRKAQMFSKIKEKIRNILQGWSERLFSTVGKEILIKE
        C  I+N+  SYE ASGQ +N  K+    S N ++     + + L +        YLG+PS   R K   F+ IKE++   LQGW E+L S  GKEILIK 
Subjt:  CLVIKNVFKSYELASGQAINLDKSRFMVSKNVNQGEAHVIGDFLDIKKANNLGTYLGMPSNTSRRKAQMFSKIKEKIRNILQGWSERLFSTVGKEILIKE

Query:  VIQAIPTYSMSCFKLPKGLCKDINKMCARFWWGAIGDKRKLHWSKWKELCKSKVAGG
        V+QA+PTYSM CFKLP+ LCKDI  M  +F+WG   DKR++HW KW+ LC+ K+ GG
Subjt:  VIQAIPTYSMSCFKLPKGLCKDINKMCARFWWGAIGDKRKLHWSKWKELCKSKVAGG

A0A803Q9W0 Uncharacterized protein | e value: 3.9e-50 | % identity: 38.91
Query:  MGFSEGWIKNIMKCVETVTFSVLINCVSQEEFKTERGIRQGDPLSPYIFLMSSK--------------------------------------------EE
        +G+ + W+  IM C+++++FS+L+N     +    RG+RQGDPLSPY+FL+ S+                                             +
Subjt:  MGFSEGWIKNIMKCVETVTFSVLINCVSQEEFKTERGIRQGDPLSPYIFLMSSK--------------------------------------------EE

Query:  CLVIKNVFKSYELASGQAINLDKSRFMVSKNVNQGEAHVIGDFLDIKKANNLGTYLGMPSNTSRRKAQMFSKIKEKIRNILQGWSERLFSTVGKEILIKE
        C  +K++   Y L SGQ IN DKS   V K +N G+  ++   L +K  +    YLG+P++  ++K ++F  I+ KIR  LQGW   LFS  G+EIL+K 
Subjt:  CLVIKNVFKSYELASGQAINLDKSRFMVSKNVNQGEAHVIGDFLDIKKANNLGTYLGMPSNTSRRKAQMFSKIKEKIRNILQGWSERLFSTVGKEILIKE

Query:  VIQAIPTYSMSCFKLPKGLCKDINKMCARFWWGAIGDKRKLHWSKWKELCKSKVAGG
        +IQAIPTY MSCF+LPK L KDI+ M ARFWWG+   K+K HW  WK+LCK K  GG
Subjt:  VIQAIPTYSMSCFKLPKGLCKDINKMCARFWWGAIGDKRKLHWSKWKELCKSKVAGG

SwissProt top hits (e value, % identity, alignment)
P93295 Uncharacterized mitochondrial protein AtMg00310 | e value: 2.2e-10 | % identity: 48.15
Query:  AIPTYSMSCFKLPKGLCKDINKMCARFWWGAIGDKRKLHWSKWKELCKSKVAGG
        A+P Y+MSCF+L K LCK +      FWW +  +KRK+ W  W++LCKSK   G
Subjt:  AIPTYSMSCFKLPKGLCKDINKMCARFWWGAIGDKRKLHWSKWKELCKSKVAGG
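
The % identity column can be cross-checked against the alignment midline, where a letter marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. The sketch below is illustrative only: it assumes identity is defined as identical positions divided by alignment length, and `percent_identity` is a hypothetical helper rather than anything provided by the database. The two strings are copied from the P93295 alignment above.

```python
def percent_identity(query: str, midline: str) -> float:
    """Identical positions (letters in the midline) over alignment length, in percent."""
    if len(midline) > len(query):
        raise ValueError("midline is longer than the aligned query")
    identical = sum(1 for ch in midline if ch.isalpha())
    return round(100.0 * identical / len(query), 2)


# Query and midline lines of the SwissProt hit P93295, as shown on this page.
query = "AIPTYSMSCFKLPKGLCKDINKMCARFWWGAIGDKRKLHWSKWKELCKSKVAGG"
midline = "A+P Y+MSCF+L K LCK +      FWW +  +KRK+ W  W++LCKSK   G"

print(percent_identity(query, midline))
```

With 26 identical positions over 54 aligned columns, this counting gives 48.15, which agrees with the value listed for this hit.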

Arabidopsis top hits (e value, % identity, alignment)
AT4G29090.1 Ribonuclease H-like superfamily protein | e value: 2.5e-09 | % identity: 44.44
Query:  AIPTYSMSCFKLPKGLCKDINKMCARFWWGAIGDKRKLHWSKWKELCKSKVAGG
        A+PTY+M+CF LPK +CK I  + A FWW    + + +HW  W  L   K  GG
Subjt:  AIPTYSMSCFKLPKGLCKDINKMCARFWWGAIGDKRKLHWSKWKELCKSKVAGG

ATMG00310.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein | e value: 1.6e-11 | % identity: 48.15
Query:  AIPTYSMSCFKLPKGLCKDINKMCARFWWGAIGDKRKLHWSKWKELCKSKVAGG
        A+P Y+MSCF+L K LCK +      FWW +  +KRK+ W  W++LCKSK   G
Subjt:  AIPTYSMSCFKLPKGLCKDINKMCARFWWGAIGDKRKLHWSKWKELCKSKVAGG


Sequences
CDS sequence
ATGGGGTTCAGCGAGGGATGGATAAAGAATATCATGAAATGCGTGGAAACGGTGACTTTCTCGGTTCTTATCAACTGTGTGTCGCAGGAAGAGTTCAAGACAGAGCGTGG
CATTCGCCAAGGGGACCCTCTATCCCCTTACATTTTCCTAATGTCTTCAAAGGAGGAATGCCTTGTTATCAAGAACGTGTTCAAGTCTTACGAGTTGGCTTCGGGTCAGG
CCATTAACCTCGATAAATCTAGGTTTATGGTGAGCAAAAATGTGAACCAAGGAGAAGCCCACGTTATTGGTGACTTTCTTGATATAAAGAAAGCCAATAACTTGGGAACC
TATCTTGGAATGCCTTCTAATACAAGCAGAAGAAAGGCCCAAATGTTTAGTAAGATCAAGGAAAAGATAAGAAATATCCTGCAAGGGTGGAGTGAGAGACTTTTTTCAAC
TGTAGGCAAAGAAATCCTTATCAAAGAAGTGATCCAGGCTATTCCCACTTATTCGATGAGCTGTTTTAAACTCCCAAAAGGGCTATGCAAGGATATTAACAAAATGTGTG
CTAGATTCTGGTGGGGTGCGATAGGGGACAAGAGGAAGTTACATTGGTCCAAGTGGAAGGAGTTGTGTAAAAGCAAGGTTGCGGGGGGATAG
mRNA sequence
ATGGGGTTCAGCGAGGGATGGATAAAGAATATCATGAAATGCGTGGAAACGGTGACTTTCTCGGTTCTTATCAACTGTGTGTCGCAGGAAGAGTTCAAGACAGAGCGTGG
CATTCGCCAAGGGGACCCTCTATCCCCTTACATTTTCCTAATGTCTTCAAAGGAGGAATGCCTTGTTATCAAGAACGTGTTCAAGTCTTACGAGTTGGCTTCGGGTCAGG
CCATTAACCTCGATAAATCTAGGTTTATGGTGAGCAAAAATGTGAACCAAGGAGAAGCCCACGTTATTGGTGACTTTCTTGATATAAAGAAAGCCAATAACTTGGGAACC
TATCTTGGAATGCCTTCTAATACAAGCAGAAGAAAGGCCCAAATGTTTAGTAAGATCAAGGAAAAGATAAGAAATATCCTGCAAGGGTGGAGTGAGAGACTTTTTTCAAC
TGTAGGCAAAGAAATCCTTATCAAAGAAGTGATCCAGGCTATTCCCACTTATTCGATGAGCTGTTTTAAACTCCCAAAAGGGCTATGCAAGGATATTAACAAAATGTGTG
CTAGATTCTGGTGGGGTGCGATAGGGGACAAGAGGAAGTTACATTGGTCCAAGTGGAAGGAGTTGTGTAAAAGCAAGGTTGCGGGGGGATAG
Protein sequence
MGFSEGWIKNIMKCVETVTFSVLINCVSQEEFKTERGIRQGDPLSPYIFLMSSKEECLVIKNVFKSYELASGQAINLDKSRFMVSKNVNQGEAHVIGDFLDIKKANNLGT
YLGMPSNTSRRKAQMFSKIKEKIRNILQGWSERLFSTVGKEILIKEVIQAIPTYSMSCFKLPKGLCKDINKMCARFWWGAIGDKRKLHWSKWKELCKSKVAGG
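
The CDS and protein records can be checked against each other by translating the CDS. The following is a minimal sketch that assumes the standard nuclear genetic code and a CDS that starts in frame. `translate` and `CODON_TABLE` are names chosen for this illustration. Only the first seven codons of the CDS are used here, so paste the full sequence to compare against the whole protein record.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame DNA coding sequence; stop codons become '*'."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# First 21 nucleotides of the Lag0006084 CDS shown above.
print(translate("ATGGGGTTCAGCGAGGGATGG"))  # MGFSEGW
```

The printed peptide matches the first seven residues of the protein sequence, and the final codon of the CDS (TAG) translates to a stop.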