CuGenDBv2

Lag0018837 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0018837
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: RNA-directed DNA polymerase (reverse transcriptase)-related family protein
Genome location: chr5:35222358..35223615
RNA-Seq Expression: Lag0018837
Synteny: Lag0018837
Gene Ontology terms: NA
InterPro domains: NA
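
The genome location string can be split into chromosome, start, and end for downstream use. The sketch below is illustrative only: `parse_location` is a hypothetical helper, not part of CuGenDB, and the span calculation assumes the coordinates are 1-based and inclusive, which this record does not state.

```python
import re


def parse_location(location):
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


chrom, start, end = parse_location("chr5:35222358..35223615")
# Assumption: 1-based, inclusive coordinates, so the span is end - start + 1.
span = end - start + 1
print(chrom, start, end, span)
```

Under that assumption the gene spans 1258 bp on chr5.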


Homology
GenBank top hits (e value, % identity, alignment)
XP_022131662.1 uncharacterized protein LOC111004787 [Momordica charantia] (e value 1.2e-44, identity 53.16%)
Query:  MSCFRLPKGILSKISSLCASFWWGSSGTKNRIHWRKWSDLCQPKELGGLNFRDLETFNQAMLAKQGWRVLVNPESTVARVLRGKYFPHSDLFEATSHSNS
        MS FRLPKG   ++S + A FWWGSS    ++HW  W  +C PKELGGLNFRDLE FNQA++AKQ WRVL NP   V+RVL+ KYF  S + +AT   N 
Subjt:  MSCFRLPKGILSKISSLCASFWWGSSGTKNRIHWRKWSDLCQPKELGGLNFRDLETFNQAMLAKQGWRVLVNPESTVARVLRGKYFPHSDLFEATSHSNS

Query:  SFFWKSFIWGLELLKGGCRKQIRNGLSVRVLEDPWIPRPWTFKIFGCKDQCEDLKVAD
        S+FWK FIWG +LL  G R ++ NG ++ +  DPWIPRP++F+         D+KVAD
Subjt:  SFFWKSFIWGLELLKGGCRKQIRNGLSVRVLEDPWIPRPWTFKIFGCKDQCEDLKVAD

XP_024195790.1 uncharacterized protein LOC112198938 [Rosa chinensis] (e value 5.3e-40, identity 47.17%)
Query:  MSCFRLPKGILSKISSLCASFWWGSSGTKNRIHWRKWSDLCQPKELGGLNFRDLETFNQAMLAKQGWRVLVNPESTVARVLRGKYFPHSDLFEATSHSNS
        MSCF LPK +  ++  L A FWWG  G + +IHW  W  LC PK+ GGL FRD+  FN A+LAKQGWR++  P+S +A+V + +YFP++D   A  H  +
Subjt:  MSCFRLPKGILSKISSLCASFWWGSSGTKNRIHWRKWSDLCQPKELGGLNFRDLETFNQAMLAKQGWRVLVNPESTVARVLRGKYFPHSDLFEATSHSNS

Query:  SFFWKSFIWGLELLKGGCRKQIRNGLSVRVLEDPWIPRPWTFKIFGCKDQ-CEDLKVAD
        SF W+S + G +LLK G R Q+ NG  + +  DPW+P P+ FK F    Q  EDL+V D
Subjt:  SFFWKSFIWGLELLKGGCRKQIRNGLSVRVLEDPWIPRPWTFKIFGCKDQ-CEDLKVAD

XP_030502610.1 uncharacterized protein LOC115717775 [Cannabis sativa] (e value 3.5e-39, identity 38.14%)
Query:  MSCFRLPKGILSKISSLCASFWWGSSGTKNRIHWRKWSDLCQPKELGGLNFRDLETFNQAMLAKQGWRVLVNPESTVARVLRGKYFPHSDLFEATSHSNS
        MSCF++P  I  K+  L A FWWGS G   + HW+ WS LC  K  GGL FR L   NQA+LAKQ WR+   PES  +R+L+ +YF +S   EA+   + 
Subjt:  MSCFRLPKGILSKISSLCASFWWGSSGTKNRIHWRKWSDLCQPKELGGLNFRDLETFNQAMLAKQGWRVLVNPESTVARVLRGKYFPHSDLFEATSHSNS

Query:  SFFWKSFIWGLELLKGGCRKQIRNGLSVRVLEDPWIPRPWTFKI---------FGCKDQCEDL-------KVADFKTGYKMGMKMAAGASASSSVCISGW
        SF W S +WG +LLK G   ++ NG  +R L+DPW+P                 G  ++ + L        +   K+ Y +   ++   S+S+    + W
Subjt:  SFFWKSFIWGLELLKGGCRKQIRNGLSVRVLEDPWIPRPWTFKI---------FGCKDQCEDL-------KVADFKTGYKMGMKMAAGASASSSVCISGW

Query:  WKKLWKLNIPSKIKS
        WKK W L IP KIK+
Subjt:  WKKLWKLNIPSKIKS

XP_030505522.1 uncharacterized protein LOC115720515 [Cannabis sativa] (e value 1.6e-39, identity 35.77%)
Query:  MSCFRLPKGILSKISSLCASFWWGSSGTKNRIHWRKWSDLCQPKELGGLNFRDLETFNQAMLAKQGWRVLVNPESTVARVLRGKYFPHSDLFEATSHSNS
        MSCF +P+G   +I  L A +WWGS  +K +IHWR W  LC  K  GGL FR    +NQA+LAKQ WRVL+NP S +A+VL+ +YFP     EAT  S+ 
Subjt:  MSCFRLPKGILSKISSLCASFWWGSSGTKNRIHWRKWSDLCQPKELGGLNFRDLETFNQAMLAKQGWRVLVNPESTVARVLRGKYFPHSDLFEATSHSNS

Query:  SFFWKSFIWGLELLKGGCRKQIRNGLSVRVLEDPWIPRPWTF-----------------------------KIFGCKDQCEDLKVA--------------
        S  W+  +WG ELL  G R++I NG + RV +DPWIPRP +F                             ++F   D    L +               
Subjt:  SFFWKSFIWGLELLKGGCRKQIRNGLSVRVLEDPWIPRPWTF-----------------------------KIFGCKDQCEDLKVA--------------

Query:  ------DFKTGYKMGMKMAAGASASSSVCISGWWKKLWKLNIPSKI
                K+GY + +       +SS+  ++ WW   W + IP KI
Subjt:  ------DFKTGYKMGMKMAAGASASSSVCISGWWKKLWKLNIPSKI

XP_042946916.1 uncharacterized mitochondrial protein AtMg00310-like [Carya illinoinensis] (e value 1.2e-39, identity 38.26%)
Query:  MSCFRLPKGILSKISSLCASFWWGSSGTKNRIHWRKWSDLCQPKELGGLNFRDLETFNQAMLAKQGWRVLVNPESTVARVLRGKYFPHSDLFEATSHSNS
        MSCF  PK +  ++  + A FWWG    +N+IHW +W  LC  K  GG+ FRDL  FN A+LAKQGWR+L N +S + +V + KYFP+S LF++   ++S
Subjt:  MSCFRLPKGILSKISSLCASFWWGSSGTKNRIHWRKWSDLCQPKELGGLNFRDLETFNQAMLAKQGWRVLVNPESTVARVLRGKYFPHSDLFEATSHSNS

Query:  SFFWKSFIWGLELLKGGCRKQIRNGLSVRVLEDPWIPRPWTFKIFGCKDQCEDLKV---ADFKTGY---KMGMKMAAGASASSSVCISGWWKKLWKLNIP
        S+ WK     L+ L+ GCR ++ NG +VR+ +DPW+P      +    +  E+LKV    D  TG+   +M   ++ G S+S S   + +WK LW L +P
Subjt:  SFFWKSFIWGLELLKGGCRKQIRNGLSVRVLEDPWIPRPWTFKIFGCKDQCEDLKV---ADFKTGY---KMGMKMAAGASASSSVCISGWWKKLWKLNIP

Query:  SKIKSLFGGLSITVSPVWLSGEKHTSVDDS
         K+K           P +L+ +K   +DD+
Subjt:  SKIKSLFGGLSITVSPVWLSGEKHTSVDDS

TrEMBL top hits (e value, % identity, alignment)
A0A803NHG3 Uncharacterized protein (e value 2.0e-45, identity 37.1%)
Query:  MSCFRLPKGILSKISSLCASFWWGSSGTKNRIHWRKWSDLCQPKELGGLNFRDLETFNQAMLAKQGWRVLVNPESTVARVLRGKYFPHSDLFEATSHSNS
        MSC+RL K  ++ I  + A FWWGS+  K +IHW KW  LC+PKE GGL FRDLE FNQA+LAKQ WR L  P S  ++VL+  YFPH  +  A   +++
Subjt:  MSCFRLPKGILSKISSLCASFWWGSSGTKNRIHWRKWSDLCQPKELGGLNFRDLETFNQAMLAKQGWRVLVNPESTVARVLRGKYFPHSDLFEATSHSNS

Query:  SFFWKSFIWGLELLKGGCRKQIRNGLSVRVLEDPWIPRPWTFKIFGCKDQCEDLKVADF-----------------------------------------
        SF W+S +WG E++  G R ++ NG  VRVLEDPW+PRP +FK++      + L V D                                          
Subjt:  SFFWKSFIWGLELLKGGCRKQIRNGLSVRVLEDPWIPRPWTFKIFGCKDQCEDLKVADF-----------------------------------------

Query:  ---------KTGYKMGMKMAAGASASSSVCISGWWKKLWKLNIPSKIK
                 ++GY+M  ++    +      +  WW+KLWKL +P K+K
Subjt:  ---------KTGYKMGMKMAAGASASSSVCISGWWKKLWKLNIPSKIK

A0A803PM68 Uncharacterized protein (e value 2.0e-45, identity 39.11%)
Query:  MSCFRLPKGILSKISSLCASFWWGSSGTKNRIHWRKWSDLCQPKELGGLNFRDLETFNQAMLAKQGWRVLVNPESTVARVLRGKYFPHSDLFEATSHSNS
        MSCFRLPK  ++ I S+ A FWWGSS    +IHW KWS LC+ KE GGL FRDL  FNQA+LAKQ WR +  P S  ++VL+  YFP+  + EA S +++
Subjt:  MSCFRLPKGILSKISSLCASFWWGSSGTKNRIHWRKWSDLCQPKELGGLNFRDLETFNQAMLAKQGWRVLVNPESTVARVLRGKYFPHSDLFEATSHSNS

Query:  SFFWKSFIWGLELLKGGCRKQIRNGLSVRVLEDPWIPRPWTFKIFGCKDQCEDLKVADF-----------------------------------------
        SF W+S +WG ++++ G R +I NG SVRVLEDPW+PRP TFK++      E + V D                                          
Subjt:  SFFWKSFIWGLELLKGGCRKQIRNGLSVRVLEDPWIPRPWTFKIFGCKDQCEDLKVADF-----------------------------------------

Query:  ---------KTGYKMGMKMAAGASASSSVCISGWWKKLWKLNIPSKIK
                 ++GY+M   +      S++     WW+ LWKL IP K+K
Subjt:  ---------KTGYKMGMKMAAGASASSSVCISGWWKKLWKLNIPSKIK

A0A803PWX1 Uncharacterized protein (e value 3.4e-48, identity 46.41%)
Query:  MSCFRLPKGILSKISSLCASFWWGSSGTKNRIHWRKWSDLCQPKELGGLNFRDLETFNQAMLAKQGWRVLVNPESTVARVLRGKYFPHSDLFEATSHSNS
        MSCFRLPK  ++ I S+ A FWWGSS   ++IHW KW  LC+ KE GGL FRDL  FNQA+LAKQ WR +  P S  +RVL+  Y+P+  + EA S +++
Subjt:  MSCFRLPKGILSKISSLCASFWWGSSGTKNRIHWRKWSDLCQPKELGGLNFRDLETFNQAMLAKQGWRVLVNPESTVARVLRGKYFPHSDLFEATSHSNS

Query:  SFFWKSFIWGLELLKGGCRKQIRNGLSVRVLEDPWIPRPWTFKIFGCKDQCEDLKVADFKTGYKMGMKMAAGAS-----------ASSSVCISGWWKKLW
        SF W+S +WG ++++ G R +I NG SVRV++DPW+PRP TFKI+      + L V D K   +MG+   +G              S +     WW KLW
Subjt:  SFFWKSFIWGLELLKGGCRKQIRNGLSVRVLEDPWIPRPWTFKIFGCKDQCEDLKVADFKTGYKMGMKMAAGAS-----------ASSSVCISGWWKKLW

Query:  KLNIPSKIK
        KL IP K+K
Subjt:  KLNIPSKIK

A0A803Q1K6 Uncharacterized protein (e value 2.0e-45, identity 39.52%)
Query:  MSCFRLPKGILSKISSLCASFWWGSSGTKNRIHWRKWSDLCQPKELGGLNFRDLETFNQAMLAKQGWRVLVNPESTVARVLRGKYFPHSDLFEATSHSNS
        MSCF+LPK  +S +  + + FWWGSS  + +IHW KW  LC+PK+ GGL FRDL  FNQA+LAKQ WR L +P+   +RVL+  YFP   + EA   +N+
Subjt:  MSCFRLPKGILSKISSLCASFWWGSSGTKNRIHWRKWSDLCQPKELGGLNFRDLETFNQAMLAKQGWRVLVNPESTVARVLRGKYFPHSDLFEATSHSNS

Query:  SFFWKSFIWGLELLKGGCRKQIRNGLSVRVLEDPWIPRPWTFKIFG-----CKDQCEDLKVAD-------------------------------------
        SF  +S +WG +L+  G R ++ NG SVRVLEDPW+PRP TFK++            DLK+AD                                     
Subjt:  SFFWKSFIWGLELLKGGCRKQIRNGLSVRVLEDPWIPRPWTFKIFG-----CKDQCEDLKVAD-------------------------------------

Query:  --------FKTGYKMGMKMAAGASASSSVCISGWWKKLWKLNIPSKIK
                 K+GY+M   +      S+   I  WWKKLW+LN P K+K
Subjt:  --------FKTGYKMGMKMAAGASASSSVCISGWWKKLWKLNIPSKIK

A0A803QQT2 Uncharacterized protein (e value 7.0e-46, identity 39.52%)
Query:  MSCFRLPKGILSKISSLCASFWWGSSGTKNRIHWRKWSDLCQPKELGGLNFRDLETFNQAMLAKQGWRVLVNPESTVARVLRGKYFPHSDLFEATSHSNS
        MSCF+LPK  +S +  + + FWWGS   + +IHW KW  LC+PK+ GGL FRDL  FNQA+LAKQ WR L +P+   +RVL+  YFP   + EA   +N+
Subjt:  MSCFRLPKGILSKISSLCASFWWGSSGTKNRIHWRKWSDLCQPKELGGLNFRDLETFNQAMLAKQGWRVLVNPESTVARVLRGKYFPHSDLFEATSHSNS

Query:  SFFWKSFIWGLELLKGGCRKQIRNGLSVRVLEDPWIPRPWTFKIFG-----CKDQCEDLKVAD-------------------------------------
        SF W+S +WG +L+  G R ++ NG SVRVLEDPW+PRP TFK++            DLK+AD                                     
Subjt:  SFFWKSFIWGLELLKGGCRKQIRNGLSVRVLEDPWIPRPWTFKIFG-----CKDQCEDLKVAD-------------------------------------

Query:  --------FKTGYKMGMKMAAGASASSSVCISGWWKKLWKLNIPSKIK
                 K+GY+M          S+   I  WWKKLW+L IP K+K
Subjt:  --------FKTGYKMGMKMAAGASASSSVCISGWWKKLWKLNIPSKIK

SwissProt top hits (e value, % identity, alignment)
P0C2F6 Putative ribonuclease H protein At1g65750 (e value 8.1e-15, identity 33.09%)
Query:  MSCFRLPKGILSKISSLCASFWWGSSGTKNRIHWRKWSDLCQPKELGGLNFRDLETFNQAMLAKQGWRVLVNPESTVARVLRGKYFPHS--DLFEATSHS
        MS   LP+ IL+++  L  +F WGS+  K + H  KWS +C PK+ GGL  R  ++ N+A+++K GWR+L    S    VL+ KY      D        
Subjt:  MSCFRLPKGILSKISSLCASFWWGSSGTKNRIHWRKWSDLCQPKELGGLNFRDLETFNQAMLAKQGWRVLVNPESTVARVLRGKYFPHS--DLFEATSHS

Query:  NSSFFWKSFIWGL-ELLKGGCRKQIRNGLSVRVLEDPWI
        + S  W+S   GL +++  G      +G  +R   D W+
Subjt:  NSSFFWKSFIWGL-ELLKGGCRKQIRNGLSVRVLEDPWI

P93295 Uncharacterized mitochondrial protein AtMg00310 (e value 2.3e-30, identity 45.26%)
Query:  MSCFRLPKGILSKISSLCASFWWGSSGTKNRIHWRKWSDLCQPKE-LGGLNFRDLETFNQAMLAKQGWRVLVNPESTVARVLRGKYFPHSDLFEATSHSN
        MSCFRL K +  K++S    FWW S   K +I W  W  LC+ KE  GGL FRDL  FNQA+LAKQ +R++  P + ++R+LR +YFPHS + E +  + 
Subjt:  MSCFRLPKGILSKISSLCASFWWGSSGTKNRIHWRKWSDLCQPKE-LGGLNFRDLETFNQAMLAKQGWRVLVNPESTVARVLRGKYFPHSDLFEATSHSN

Query:  SSFFWKSFIWGLELLKGGCRKQIRNGLSVRVLEDPWI
         S+ W+S I G ELL  G  + I +G+  +V  D WI
Subjt:  SSFFWKSFIWGLELLKGGCRKQIRNGLSVRVLEDPWI

Arabidopsis top hits (e value, % identity, alignment)
AT3G24255.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein (e value 4.4e-08, identity 30.09%)
Query:  MSCFRLPKGILSKISSLCASFWWGSSGTKNRIHWRKWSDLCQPKELGGLNFRDLETFNQAMLAKQGWRVLVNPESTVARVLRGKYFPHSDL------FEA
        MS FRLP   + +I S+C+SF W       +     WSD+C PK+ GGL  R L+  N+       W +  N  +T+   +  K   H  L       + 
Subjt:  MSCFRLPKGILSKISSLCASFWWGSSGTKNRIHWRKWSDLCQPKELGGLNFRDLETFNQAMLAKQGWRVLVNPESTVARVLRGKYFPHSDL------FEA

Query:  TSHSNSSFFWKSF
         + SN+SF++ ++
Subjt:  TSHSNSSFFWKSF

AT4G29090.1 Ribonuclease H-like superfamily protein (e value 2.5e-27, identity 39.71%)
Query:  MSCFRLPKGILSKISSLCASFWWGSSGTKNRIHWRKWSDLCQPKELGGLNFRDLETFNQAMLAKQGWRVLVNPESTVARVLRGKYFPHSDLFEATSHSNS
        M+CF LPK +  +I S+ A FWW +      +HW+ W  L   K  GG+ F+D+E FN A+L KQ WR+L  PES +A+V + +YF  SD   A   S  
Subjt:  MSCFRLPKGILSKISSLCASFWWGSSGTKNRIHWRKWSDLCQPKELGGLNFRDLETFNQAMLAKQGWRVLVNPESTVARVLRGKYFPHSDLFEATSHSNS

Query:  SFFWKSFIWGLELLKGGCRKQIRNGLSVRVLEDPWI
        SF WKS     E+L+ G R  + NG  + +    W+
Subjt:  SFFWKSFIWGLELLKGGCRKQIRNGLSVRVLEDPWI

ATMG00310.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein (e value 1.7e-31, identity 45.26%)
Query:  MSCFRLPKGILSKISSLCASFWWGSSGTKNRIHWRKWSDLCQPKE-LGGLNFRDLETFNQAMLAKQGWRVLVNPESTVARVLRGKYFPHSDLFEATSHSN
        MSCFRL K +  K++S    FWW S   K +I W  W  LC+ KE  GGL FRDL  FNQA+LAKQ +R++  P + ++R+LR +YFPHS + E +  + 
Subjt:  MSCFRLPKGILSKISSLCASFWWGSSGTKNRIHWRKWSDLCQPKE-LGGLNFRDLETFNQAMLAKQGWRVLVNPESTVARVLRGKYFPHSDLFEATSHSN

Query:  SSFFWKSFIWGLELLKGGCRKQIRNGLSVRVLEDPWI
         S+ W+S I G ELL  G  + I +G+  +V  D WI
Subjt:  SSFFWKSFIWGLELLKGGCRKQIRNGLSVRVLEDPWI


Sequences
CDS sequence
ATGAGTTGCTTCCGTCTGCCTAAAGGTATTCTTTCAAAGATATCTTCCCTCTGTGCTAGCTTTTGGTGGGGGTCATCAGGTACAAAGAATCGAATCCACTGGCGAAAATG
GAGTGATCTTTGTCAGCCTAAGGAGTTGGGAGGTCTCAACTTCAGAGACTTGGAAACTTTTAACCAAGCAATGTTGGCTAAGCAAGGTTGGAGAGTTTTAGTCAATCCAG
AATCAACGGTGGCTAGAGTCTTAAGAGGAAAATATTTTCCACATTCTGATCTATTTGAGGCAACATCGCATTCCAATTCTTCCTTCTTTTGGAAGAGTTTTATATGGGGT
CTCGAGCTATTGAAAGGTGGTTGTAGGAAACAAATCAGAAATGGGTTATCGGTTAGGGTGCTGGAAGATCCTTGGATTCCTCGACCATGGACGTTCAAAATTTTTGGTTG
TAAGGACCAGTGTGAAGATCTCAAAGTGGCTGATTTTAAGACTGGGTACAAAATGGGGATGAAGATGGCTGCAGGGGCATCCGCTTCAAGCTCGGTGTGTATTAGTGGCT
GGTGGAAGAAATTGTGGAAGTTGAATATCCCAAGCAAAATAAAATCTTTATTTGGTGGACTTTCTATCACTGTATCCCCTGTATGGTTAAGCGGTGAAAAACATACCTCC
GTTGACGACTCTGAATACTTCAACGCCTTTCTTGATTTTGGACCGAGCGTCAACAATTGGGACACCACCAATTTGATCTTCCTCACTATAACTCTGAGATTAGGAACGTA
G
mRNA sequence
ATGAGTTGCTTCCGTCTGCCTAAAGGTATTCTTTCAAAGATATCTTCCCTCTGTGCTAGCTTTTGGTGGGGGTCATCAGGTACAAAGAATCGAATCCACTGGCGAAAATG
GAGTGATCTTTGTCAGCCTAAGGAGTTGGGAGGTCTCAACTTCAGAGACTTGGAAACTTTTAACCAAGCAATGTTGGCTAAGCAAGGTTGGAGAGTTTTAGTCAATCCAG
AATCAACGGTGGCTAGAGTCTTAAGAGGAAAATATTTTCCACATTCTGATCTATTTGAGGCAACATCGCATTCCAATTCTTCCTTCTTTTGGAAGAGTTTTATATGGGGT
CTCGAGCTATTGAAAGGTGGTTGTAGGAAACAAATCAGAAATGGGTTATCGGTTAGGGTGCTGGAAGATCCTTGGATTCCTCGACCATGGACGTTCAAAATTTTTGGTTG
TAAGGACCAGTGTGAAGATCTCAAAGTGGCTGATTTTAAGACTGGGTACAAAATGGGGATGAAGATGGCTGCAGGGGCATCCGCTTCAAGCTCGGTGTGTATTAGTGGCT
GGTGGAAGAAATTGTGGAAGTTGAATATCCCAAGCAAAATAAAATCTTTATTTGGTGGACTTTCTATCACTGTATCCCCTGTATGGTTAAGCGGTGAAAAACATACCTCC
GTTGACGACTCTGAATACTTCAACGCCTTTCTTGATTTTGGACCGAGCGTCAACAATTGGGACACCACCAATTTGATCTTCCTCACTATAACTCTGAGATTAGGAACGTA
G
Protein sequence
MSCFRLPKGILSKISSLCASFWWGSSGTKNRIHWRKWSDLCQPKELGGLNFRDLETFNQAMLAKQGWRVLVNPESTVARVLRGKYFPHSDLFEATSHSNSSFFWKSFIWG
LELLKGGCRKQIRNGLSVRVLEDPWIPRPWTFKIFGCKDQCEDLKVADFKTGYKMGMKMAAGASASSSVCISGWWKKLWKLNIPSKIKSLFGGLSITVSPVWLSGEKHTS
VDDSEYFNAFLDFGPSVNNWDTTNLIFLTITLRLGT
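
The protein sequence should correspond to a translation of the CDS. The following minimal sketch checks this on the first and last codons of the CDS listed in this record. It assumes the standard genetic code (NCBI translation table 1), which the record does not state explicitly; `translate` and the variable names are hypothetical and chosen for illustration.

```python
from itertools import product

# Standard genetic code (assumed), with codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a coding sequence in frame 1, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 and last 15 nucleotides of the CDS shown in this record.
cds_start = "ATGAGTTGCTTCCGTCTGCCTAAAGGTATT"
cds_end = "AGATTAGGAACGTAG"

print(translate(cds_start))  # MSCFRLPKGI
print(translate(cds_end))    # RLGT
```

Both fragments agree with the start (MSCFRLPKGI) and end (RLGT) of the protein sequence. To check the whole record, join the CDS lines into one string and pass it to `translate`.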