; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0035757 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0035757
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationchr3:29726391..29727529
RNA-Seq ExpressionLag0035757
SyntenyLag0035757
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022131662.1 uncharacterized protein LOC111004787 [Momordica charantia]6.5e-2936.18Show/hide
Query:  MRCFRLPNNVCEDINQICTKFWWGSMGDKKKMHWMSWKNLCRSKDIKGLGFRELRTFNQAMLAKFSWCILRNSNNLLAKTLKGR----------------
        M  FRLP   CE+++Q+  +FWWGS  D KK+HWMSW+ +C  K++ GL FR+L  FNQA++AK  W +L+N N L+++ LK +                
Subjt:  MRCFRLPNNVCEDINQICTKFWWGSMGDKKKMHWMSWKNLCRSKDIKGLGFRELRTFNQAMLAKFSWCILRNSNNLLAKTLKGR----------------

Query:  ---WK----------------VDNGRYIELAKDPWINREGSSKPLVVSENLKGLKVKCLIDENNRWIVSKIKESFLQEDVDEILNIPLGVATSKDEIVW
           WK                V NG  I +  DPWI R  S +P+        +KV  LI+ N +W V  I   F +ED D IL++P+    S D  +W
Subjt:  ---WK----------------VDNGRYIELAKDPWINREGSSKPLVVSENLKGLKVKCLIDENNRWIVSKIKESFLQEDVDEILNIPLGVATSKDEIVW

XP_022155286.1 uncharacterized protein LOC111022423 [Momordica charantia]7.2e-3638.69Show/hide
Query:  MRCFRLPNNVCEDINQICTKFWWGSMGDKKKMHWMSWKNLCRSKDIKGLGFRELRTFNQAMLAKFSWCILRNSNNLLAKTLKG-----------------
        + CF+LP ++C +++QI  +FWWGS  + +K+HW SWK LC  KD+ G+GFR++  FNQAMLAK SW ILR+ ++LLAKTL+G                 
Subjt:  MRCFRLPNNVCEDINQICTKFWWGSMGDKKKMHWMSWKNLCRSKDIKGLGFRELRTFNQAMLAKFSWCILRNSNNLLAKTLKG-----------------

Query:  ------------------RWKVDNGRYIELAKDPWINREGSSKPLVVSENLKGLKVKCLIDENNRWIVSKIKESFLQEDVDEILNIPLGVATSKDEIVW
                          RWKV NG  I L+ DPW+ R+G+  P+    +++   V  L++   RW   K++ESF+  + D IL  PL      DEI+W
Subjt:  ------------------RWKVDNGRYIELAKDPWINREGSSKPLVVSENLKGLKVKCLIDENNRWIVSKIKESFLQEDVDEILNIPLGVATSKDEIVW

XP_023880941.1 uncharacterized protein LOC111993328 [Quercus suber]6.5e-2929.56Show/hide
Query:  MRCFRLPNNVCEDINQICTKFWWGSMGDKKKMHWMSWKNLCRSKDIKGLGFRELRTFNQAMLAKFSWCILRNSNNLLAKTLK------------------
        M CF LP+++C+D+N + + FWWG    ++K+ W++W+ LC  K+  GLGFR+LR FN A+LAK  W I +N+N+L  K LK                  
Subjt:  MRCFRLPNNVCEDINQICTKFWWGSMGDKKKMHWMSWKNLCRSKDIKGLGFRELRTFNQAMLAKFSWCILRNSNNLLAKTLK------------------

Query:  -----------------GRWKVDNGRYIELAKDPWINREGSSKPLVVSENLKGLKVKCLID-ENNRWIVSKIKESFLQEDVDEILNIPLGVATSKDEIVW
                          RW + NG ++++ KD WI    S K +    +L   KV CL+D     W V K++ SFL  + + IL I +      D ++W
Subjt:  -----------------GRWKVDNGRYIELAKDPWINREGSSKPLVVSENLKGLKVKCLID-ENNRWIVSKIKESFLQEDVDEILNIPLGVATSKDEIVW

Query:  ---------------------KSRWSSMD-----YWEWLTNNPDAEELEKAILTMWNILQFKNQILNNNKVKPA
                             K R + M       WE     PD  E E   +T W++   +N + + +K K A
Subjt:  ---------------------KSRWSSMD-----YWEWLTNNPDAEELEKAILTMWNILQFKNQILNNNKVKPA

XP_024033483.1 uncharacterized protein LOC112095606 [Citrus clementina]9.4e-2836.99Show/hide
Query:  MRCFRLPNNVCEDINQICTKFWWGSMGDKKKMHWMSWKNLCRSKDIKGLGFRELRTFNQAMLAKFSWCILRNSNNLLAKTLKGR---------WKVDNGR
        M  F++P  VC DI +    FWWGS  DK+ +HW  W+ L ++K    +GFR+  +FNQA++AK  W IL+  ++L+AK L+ R         W++ +G+
Subjt:  MRCFRLPNNVCEDINQICTKFWWGSMGDKKKMHWMSWKNLCRSKDIKGLGFRELRTFNQAMLAKFSWCILRNSNNLLAKTLKGR---------WKVDNGR

Query:  YIELAKDPWINREGSSKPLVVSENLKGLKVKCLIDENNRWIVSKIKESFLQEDVDEILNIPLGVATSKDEIVW
         I++ KD WI +  + KP+V     +   V  LI+E N+W  S+I   F + D D+I+ IPL  +  +D I+W
Subjt:  YIELAKDPWINREGSSKPLVVSENLKGLKVKCLIDENNRWIVSKIKESFLQEDVDEILNIPLGVATSKDEIVW

XP_030486817.1 uncharacterized protein LOC115703723 [Cannabis sativa]1.5e-2834.62Show/hide
Query:  MRCFRLPNNVCEDINQICTKFWWGSMGDKKKMHWMSWKNLCRSKDIKGLGFRELRTFNQAMLAKFSWCILRNSNNLLAKTLKGR----------------
        M CFRLP  +C++I  +  +FWWGS GD +K+HW +W++LCRSK + GLGFR L  FNQAMLAK +W I    ++LL+ TLK R                
Subjt:  MRCFRLPNNVCEDINQICTKFWWGSMGDKKKMHWMSWKNLCRSKDIKGLGFRELRTFNQAMLAKFSWCILRNSNNLLAKTLKGR----------------

Query:  -------------------WKVDNGRYIELAKDPWINREGSSKPLVVSENLKGLKVKCLIDENNRWIVSKIKESFLQEDVDEILNIPLGVATSKDEIVWK
                           WKV NG  I+  KD WI      + L   ++ +  K+   ID    W + +++  F    +++ILN+P      KDE +W 
Subjt:  -------------------WKVDNGRYIELAKDPWINREGSSKPLVVSENLKGLKVKCLIDENNRWIVSKIKESFLQEDVDEILNIPLGVATSKDEIVWK

Query:  SRWSSMDY
           S + Y
Subjt:  SRWSSMDY

TrEMBL top hitse value%identityAlignment
A0A2N9EWI8 Uncharacterized protein5.4e-2938.51Show/hide
Query:  MRCFRLPNNVCEDINQICTKFWWGSMGDKKKMHWMSWKNLCRSKDIKGLGFRELRTFNQAMLAKFSWCILRNSNNLL-----AKTLKG---RWKVDNGRY
        M CF+LP  +C++I+ + T+FWWG  G+++K+HW+S K LC++K   G+GFR+L+ FNQA+LA+  W +L+N N+L+     AK + G   RW+V NG  
Subjt:  MRCFRLPNNVCEDINQICTKFWWGSMGDKKKMHWMSWKNLCRSKDIKGLGFRELRTFNQAMLAKFSWCILRNSNNLL-----AKTLKG---RWKVDNGRY

Query:  IELAKDPWINREGSSKPLVVSENL-KGLKVKCLIDENN-RWIVSKIKESFLQEDVDEILNIPLGVATSKDEIVW
        I + KD WI+   + + +   + L +   V  LI+++   W V  + E FL  D++ I+ IPL +    D +VW
Subjt:  IELAKDPWINREGSSKPLVVSENL-KGLKVKCLIDENN-RWIVSKIKESFLQEDVDEILNIPLGVATSKDEIVW

A0A2N9F0P5 Uncharacterized protein2.7e-2836.26Show/hide
Query:  MRCFRLPNNVCEDINQICTKFWWGSMGDKKKMHWMSWKNLCRSKDIKGLGFRELRTFNQAMLAKFSWCILRNSNNLLAKTLK--GRWKVDNGRYIELAKD
        M CF+LP+++C ++N + + FWWG       +HWM W+ LC SK+  GLGFR+L+TFN A+LAK  W IL+   +L+A+ L+   RW + +G+ +E+ +D
Subjt:  MRCFRLPNNVCEDINQICTKFWWGSMGDKKKMHWMSWKNLCRSKDIKGLGFRELRTFNQAMLAKFSWCILRNSNNLLAKTLK--GRWKVDNGRYIELAKD

Query:  PWINREGSSKPLVVSENLKGLKV-----KCLIDENNRWIVSKIKESFLQEDVDEILNIPLGVATSKDEIVW
        PW+  +GS    +V  + +G  +       +++++ RW V  IKE F + + + I++IPL      D + W
Subjt:  PWINREGSSKPLVVSENLKGLKV-----KCLIDENNRWIVSKIKESFLQEDVDEILNIPLGVATSKDEIVW

A0A6J1BRN0 uncharacterized protein LOC1110047873.2e-2936.18Show/hide
Query:  MRCFRLPNNVCEDINQICTKFWWGSMGDKKKMHWMSWKNLCRSKDIKGLGFRELRTFNQAMLAKFSWCILRNSNNLLAKTLKGR----------------
        M  FRLP   CE+++Q+  +FWWGS  D KK+HWMSW+ +C  K++ GL FR+L  FNQA++AK  W +L+N N L+++ LK +                
Subjt:  MRCFRLPNNVCEDINQICTKFWWGSMGDKKKMHWMSWKNLCRSKDIKGLGFRELRTFNQAMLAKFSWCILRNSNNLLAKTLKGR----------------

Query:  ---WK----------------VDNGRYIELAKDPWINREGSSKPLVVSENLKGLKVKCLIDENNRWIVSKIKESFLQEDVDEILNIPLGVATSKDEIVW
           WK                V NG  I +  DPWI R  S +P+        +KV  LI+ N +W V  I   F +ED D IL++P+    S D  +W
Subjt:  ---WK----------------VDNGRYIELAKDPWINREGSSKPLVVSENLKGLKVKCLIDENNRWIVSKIKESFLQEDVDEILNIPLGVATSKDEIVW

A0A6J1DRA0 uncharacterized protein LOC1110224233.5e-3638.69Show/hide
Query:  MRCFRLPNNVCEDINQICTKFWWGSMGDKKKMHWMSWKNLCRSKDIKGLGFRELRTFNQAMLAKFSWCILRNSNNLLAKTLKG-----------------
        + CF+LP ++C +++QI  +FWWGS  + +K+HW SWK LC  KD+ G+GFR++  FNQAMLAK SW ILR+ ++LLAKTL+G                 
Subjt:  MRCFRLPNNVCEDINQICTKFWWGSMGDKKKMHWMSWKNLCRSKDIKGLGFRELRTFNQAMLAKFSWCILRNSNNLLAKTLKG-----------------

Query:  ------------------RWKVDNGRYIELAKDPWINREGSSKPLVVSENLKGLKVKCLIDENNRWIVSKIKESFLQEDVDEILNIPLGVATSKDEIVW
                          RWKV NG  I L+ DPW+ R+G+  P+    +++   V  L++   RW   K++ESF+  + D IL  PL      DEI+W
Subjt:  ------------------RWKVDNGRYIELAKDPWINREGSSKPLVVSENLKGLKVKCLIDENNRWIVSKIKESFLQEDVDEILNIPLGVATSKDEIVW

A0A803Q8H5 Uncharacterized protein3.5e-2838.55Show/hide
Query:  MRCFRLPNNVCEDINQICTKFWWGSMGDKKKMHWMSWKNLCRSKDIKGLGFRELRTFNQAMLAKFSWCILRNSNNLLAKTLKGR--------WKVDNGRY
        M CFRLP +VC+ I ++  +FWWGSMG   K+HW +W NLC SK   GLGFR L   NQAM+AK +W +L N N+LLA  LK +        WKV NG  
Subjt:  MRCFRLPNNVCEDINQICTKFWWGSMGDKKKMHWMSWKNLCRSKDIKGLGFRELRTFNQAMLAKFSWCILRNSNNLLAKTLKGR--------WKVDNGRY

Query:  IELAKDPWINREGSSKPLVVSENLKGLKVKCLIDENNRWIVSKIKESFLQEDVDEILNIPLGVATSKDEIVWKSRWSSM
        I   ++ W+      K + +       +V   IDEN     +K+ + F    V  IL +P+G    +D ++W    + M
Subjt:  IELAKDPWINREGSSKPLVVSENLKGLKVKCLIDENNRWIVSKIKESFLQEDVDEILNIPLGVATSKDEIVWKSRWSSM

SwissProt top hitse value%identityAlignment
P0C2F6 Putative ribonuclease H protein At1g657506.4e-1130.19Show/hide
Query:  MRCFRLPNNVCEDINQICTKFWWGSMGDKKKMHWMSWKNLCRSKDIKGLGFRELRTFNQAMLAKFSWCILRNSNNLLAKTLKGRWKVDNGRYIELAKDPW
        M    LP ++   ++Q+   F WGS  +KKK H + W  +C  K   GLG R  ++ N+A+++K  W +L+  N+L    L+ ++ V      E+    W
Subjt:  MRCFRLPNNVCEDINQICTKFWWGSMGDKKKMHWMSWKNLCRSKDIKGLGFRELRTFNQAMLAKFSWCILRNSNNLLAKTLKGRWKVDNGRYIELAKDPW

Query:  INREGS
        +  +GS
Subjt:  INREGS

P93295 Uncharacterized mitochondrial protein AtMg003102.3e-1645.35Show/hide
Query:  MRCFRLPNNVCEDINQICTKFWWGSMGDKKKMHWMSWKNLCRSK-DIKGLGFRELRTFNQAMLAKFSWCILRNSNNLLAKTLKGRW
        M CFRL   +C+ +    T+FWW S  +K+K+ W++W+ LC+SK D  GLGFR+L  FNQA+LAK S+ I+   + LL++ L+ R+
Subjt:  MRCFRLPNNVCEDINQICTKFWWGSMGDKKKMHWMSWKNLCRSK-DIKGLGFRELRTFNQAMLAKFSWCILRNSNNLLAKTLKGRW

Arabidopsis top hitse value%identityAlignment
AT4G29090.1 Ribonuclease H-like superfamily protein9.8e-1538.82Show/hide
Query:  MRCFRLPNNVCEDINQICTKFWWGSMGDKKKMHWMSWKNLCRSKDIKGLGFRELRTFNQAMLAKFSWCILRNSNNLLAKTLKGRW
        M CF LP  VC+ I  +   FWW +  + K MHW +W +L   K   G+GF+++  FN A+L K  W +L    +L+AK  K R+
Subjt:  MRCFRLPNNVCEDINQICTKFWWGSMGDKKKMHWMSWKNLCRSKDIKGLGFRELRTFNQAMLAKFSWCILRNSNNLLAKTLKGRW

ATMG00310.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein1.6e-1745.35Show/hide
Query:  MRCFRLPNNVCEDINQICTKFWWGSMGDKKKMHWMSWKNLCRSK-DIKGLGFRELRTFNQAMLAKFSWCILRNSNNLLAKTLKGRW
        M CFRL   +C+ +    T+FWW S  +K+K+ W++W+ LC+SK D  GLGFR+L  FNQA+LAK S+ I+   + LL++ L+ R+
Subjt:  MRCFRLPNNVCEDINQICTKFWWGSMGDKKKMHWMSWKNLCRSK-DIKGLGFRELRTFNQAMLAKFSWCILRNSNNLLAKTLKGRW


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCGTTGCTTCAGGCTTCCTAACAATGTGTGTGAGGACATTAACCAGATTTGCACTAAGTTTTGGTGGGGCTCGATGGGAGATAAAAAGAAGATGCATTGGATGAGTTG
GAAGAACCTCTGCAGAAGCAAGGATATCAAGGGGTTGGGATTCAGGGAGCTTAGGACATTCAACCAAGCTATGCTAGCTAAGTTTAGTTGGTGCATCCTTAGAAATTCGA
ACAACCTCCTAGCTAAGACATTAAAAGGGAGATGGAAAGTAGACAATGGTAGATACATCGAGCTTGCCAAGGATCCGTGGATCAACAGAGAAGGGAGTAGCAAGCCACTC
GTAGTTTCAGAGAATCTCAAAGGCCTAAAAGTCAAATGCTTGATTGATGAAAATAACAGATGGATAGTCTCCAAGATAAAAGAAAGTTTTCTACAGGAAGATGTGGATGA
GATTTTGAACATTCCTTTAGGAGTGGCCACGTCTAAAGATGAGATTGTTTGGAAGTCAAGATGGAGTAGCATGGATTATTGGGAATGGCTTACCAACAACCCAGATGCAG
AGGAATTGGAGAAGGCTATCCTAACGATGTGGAATATTTTGCAGTTCAAGAATCAAATTCTCAACAACAACAAAGTTAAGCCAGCTGATTCCAAACTTCCAATTCTTGTA
GCAAAGAGTATCAACGAGTCTATTGGAAGAAAATAA
mRNA sequenceShow/hide mRNA sequence
ATGCGTTGCTTCAGGCTTCCTAACAATGTGTGTGAGGACATTAACCAGATTTGCACTAAGTTTTGGTGGGGCTCGATGGGAGATAAAAAGAAGATGCATTGGATGAGTTG
GAAGAACCTCTGCAGAAGCAAGGATATCAAGGGGTTGGGATTCAGGGAGCTTAGGACATTCAACCAAGCTATGCTAGCTAAGTTTAGTTGGTGCATCCTTAGAAATTCGA
ACAACCTCCTAGCTAAGACATTAAAAGGGAGATGGAAAGTAGACAATGGTAGATACATCGAGCTTGCCAAGGATCCGTGGATCAACAGAGAAGGGAGTAGCAAGCCACTC
GTAGTTTCAGAGAATCTCAAAGGCCTAAAAGTCAAATGCTTGATTGATGAAAATAACAGATGGATAGTCTCCAAGATAAAAGAAAGTTTTCTACAGGAAGATGTGGATGA
GATTTTGAACATTCCTTTAGGAGTGGCCACGTCTAAAGATGAGATTGTTTGGAAGTCAAGATGGAGTAGCATGGATTATTGGGAATGGCTTACCAACAACCCAGATGCAG
AGGAATTGGAGAAGGCTATCCTAACGATGTGGAATATTTTGCAGTTCAAGAATCAAATTCTCAACAACAACAAAGTTAAGCCAGCTGATTCCAAACTTCCAATTCTTGTA
GCAAAGAGTATCAACGAGTCTATTGGAAGAAAATAA
Protein sequenceShow/hide protein sequence
MRCFRLPNNVCEDINQICTKFWWGSMGDKKKMHWMSWKNLCRSKDIKGLGFRELRTFNQAMLAKFSWCILRNSNNLLAKTLKGRWKVDNGRYIELAKDPWINREGSSKPL
VVSENLKGLKVKCLIDENNRWIVSKIKESFLQEDVDEILNIPLGVATSKDEIVWKSRWSSMDYWEWLTNNPDAEELEKAILTMWNILQFKNQILNNNKVKPADSKLPILV
AKSINESIGRK