; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmoCh09G012800 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene IDCmoCh09G012800
OrganismCucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
DescriptionRetrovirus-related Pol polyprotein from transposon TNT 1-94
Genome locationCmo_Chr09:11305967..11307033
RNA-Seq ExpressionCmoCh09G012800
SyntenyCmoCh09G012800
Gene Ontology termsGO:0006259 - DNA metabolic process (biological process)
GO:0006397 - mRNA processing (biological process)
GO:0006811 - ion transport (biological process)
GO:0006950 - response to stress (biological process)
GO:0007165 - signal transduction (biological process)
GO:0019684 - photosynthesis, light reaction (biological process)
GO:0005737 - cytoplasm (cellular component)
GO:0016020 - membrane (cellular component)
GO:0043231 - intracellular membrane-bounded organelle (cellular component)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
GO:0016772 - transferase activity, transferring phosphorus-containing groups (molecular function)
GO:0036094 - small molecule binding (molecular function)
GO:0043168 - anion binding (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAF3636042.1 hypothetical protein FXO37_25670 [Capsicum annuum]7.0e-5954.3Show/hide
Query:  MESSKIGIEKFDGSDFSFWKMQIEDYLYQKDLHEPLLGVVLDTMTTEQWKLKDRKALGMIRLTLSRNVAFNIIKEKTTSDLMKALSNMYEKLSTMNKVYL
        ME SK+GIEKFDGSDFSFWKMQIEDYLYQKDLH+ L GV  ++M  E+WKLKDR+ALG IRLTLSRNVAFNI KEKTTSDL+KALSNMYEK   MNKVYL
Subjt:  MESSKIGIEKFDGSDFSFWKMQIEDYLYQKDLHEPLLGVVLDTMTTEQWKLKDRKALGMIRLTLSRNVAFNIIKEKTTSDLMKALSNMYEKLSTMNKVYL

Query:  RRRLFNLQMSE---------------------------------------------------------------------------EDIGDALILNMDSS
          RLFNLQ+S+                                                                           EDIGDALIL+M+S 
Subjt:  RRRLFNLQMSE---------------------------------------------------------------------------EDIGDALILNMDSS

Query:  IESWILDSSASFHSSPNKELFRNFKSGNFEKVYLADNKYLEIKGKGDVCIKTPAGN
        +ESWIL+S ASFHSSP+KE+F+NFK   F KVYLA+NK L I+GKGDVCIKT A N
Subjt:  IESWILDSSASFHSSPNKELFRNFKSGNFEKVYLADNKYLEIKGKGDVCIKTPAGN

KAF3680274.1 putative 50S ribosomal protein L18-like [Capsicum annuum]4.8e-5258.22Show/hide
Query:  KMQIEDYLYQKDLHEPLLGVVLDTMTTEQWKLKDRKALGMIRLTLSRNVAFNIIKEKTTSDLMKALSNMYEKLSTMNKVYLRRRLFNLQMSE--------
        +M+IEDYLYQKDLHEPL G+  +++  E WKLKDR+AL +I LTLSRNVAFNI+KEKTT DL+KALSNMYE  S +NKVYL RRLFNLQM E        
Subjt:  KMQIEDYLYQKDLHEPLLGVVLDTMTTEQWKLKDRKALGMIRLTLSRNVAFNIIKEKTTSDLMKALSNMYEKLSTMNKVYLRRRLFNLQMSE--------

Query:  -----------------ED--------------------------IGDALILNMDSSIESWILDSSASFHSSPNKELFRNFKSGNFEKVYLADNKYLEIK
                         ED                          IGD+LIL++DS +ESWILDS ASFHSS +KELF+NFKSGNF KVYLADNK L IK
Subjt:  -----------------ED--------------------------IGDALILNMDSSIESWILDSSASFHSSPNKELFRNFKSGNFEKVYLADNKYLEIK

Query:  GKGDVCIKTPAGN
        GK DVCIKTPAGN
Subjt:  GKGDVCIKTPAGN

KAG7011443.1 hypothetical protein SDJN02_26349, partial [Cucurbita argyrosperma subsp. argyrosperma]3.8e-5787.97Show/hide
Query:  MRCCAILTTGIRTRQKSPRFVKMESSKIGIEKFDGSDFSFWKMQIEDYLYQKDLHEPLLGVVLDTMTTEQWKLKDRKALGMIRLTLSRNVAFNIIKEKTT
        +RCCAILTTGIR R KSPR +KMESSKIGIEKFDGSDF FWKMQIEDYLY+KDLHEPL GV LDTMTTEQWKLKDR+AL +IRLTLSRN AFNIIKEKTT
Subjt:  MRCCAILTTGIRTRQKSPRFVKMESSKIGIEKFDGSDFSFWKMQIEDYLYQKDLHEPLLGVVLDTMTTEQWKLKDRKALGMIRLTLSRNVAFNIIKEKTT

Query:  SDLMKALSNMYEKLSTMNKVYLRRRLFNLQMSE
        SDL+KALSNMYEKLS MNKVYL RRLFNLQMSE
Subjt:  SDLMKALSNMYEKLSTMNKVYLRRRLFNLQMSE

VFQ62075.1 unnamed protein product [Cuscuta campestris]1.5e-5347.77Show/hide
Query:  MESSKIGIEKFDGSDFSFWKMQIEDYLYQKDLHEPLLGVVLDTMTTEQWKLKDRKALGMIRLTLSRNVAFNIIKEKTTSDLMKALSNMYEKLSTMNK---
        ME SK+GIEKFDGSDF FWKMQIEDYLYQKDLHEPL GV  D+MT EQWKLKDR+ALGMI LTL++NVAFNI+KE TT+ L+KALSNMYEK S MNK   
Subjt:  MESSKIGIEKFDGSDFSFWKMQIEDYLYQKDLHEPLLGVVLDTMTTEQWKLKDRKALGMIRLTLSRNVAFNIIKEKTTSDLMKALSNMYEKLSTMNK---

Query:  --------------------------------VYLRRRLFNLQM--------------------------------------------------------
                                        V L   +   +M                                                        
Subjt:  --------------------------------VYLRRRLFNLQM--------------------------------------------------------

Query:  -------------------SEEDIGDALILNMDSSIESWILDSSASFHSSPNKELFRNFKSGNFEKVYLADNKYLEIKGKGDVCIKTPAGN
                           S EDIGDALIL++DS +ESWILDS ASFHSSP+KE F+NFKSGNF KVYLADNK L I+GKGDV IKTPAGN
Subjt:  -------------------SEEDIGDALILNMDSSIESWILDSSASFHSSPNKELFRNFKSGNFEKVYLADNKYLEIKGKGDVCIKTPAGN

VFQ92713.1 unnamed protein product [Cuscuta campestris]1.6e-5057.21Show/hide
Query:  MQIEDYLYQKDLHEPLLGVVLDTMTTEQWKLKDRKALGMIRLTLSRNVAFNIIKEKTTSDLMKALSNMYEKLSTMNK-----------------------
        MQIEDYLYQKDLHEPL GV  D+MT EQWKLKDR+ALGMIRLTL++NVAFNI+KE TT+ LMKALSNMYEK   MNK                       
Subjt:  MQIEDYLYQKDLHEPLLGVVLDTMTTEQWKLKDRKALGMIRLTLSRNVAFNIIKEKTTSDLMKALSNMYEKLSTMNK-----------------------

Query:  ------------VYLRRRLFNLQM-------------------SEEDIGDALILNMDSSIESWILDSSASFHSSPNKELFRNFKSGNFEKVYLADNKYLE
                    V L    F +                     S EDIGDALIL++DS +ESWILDS ASFHSSP+KELF+NFKSGNF KVYLADNK L 
Subjt:  ------------VYLRRRLFNLQM-------------------SEEDIGDALILNMDSSIESWILDSSASFHSSPNKELFRNFKSGNFEKVYLADNKYLE

Query:  IKGKGDVCIKTPAGN
        I+GKGDV IKTP GN
Subjt:  IKGKGDVCIKTPAGN

TrEMBL top hitse value%identityAlignment
A0A484KC47 CCHC-type domain-containing protein7.3e-5447.77Show/hide
Query:  MESSKIGIEKFDGSDFSFWKMQIEDYLYQKDLHEPLLGVVLDTMTTEQWKLKDRKALGMIRLTLSRNVAFNIIKEKTTSDLMKALSNMYEKLSTMNK---
        ME SK+GIEKFDGSDF FWKMQIEDYLYQKDLHEPL GV  D+MT EQWKLKDR+ALGMI LTL++NVAFNI+KE TT+ L+KALSNMYEK S MNK   
Subjt:  MESSKIGIEKFDGSDFSFWKMQIEDYLYQKDLHEPLLGVVLDTMTTEQWKLKDRKALGMIRLTLSRNVAFNIIKEKTTSDLMKALSNMYEKLSTMNK---

Query:  --------------------------------VYLRRRLFNLQM--------------------------------------------------------
                                        V L   +   +M                                                        
Subjt:  --------------------------------VYLRRRLFNLQM--------------------------------------------------------

Query:  -------------------SEEDIGDALILNMDSSIESWILDSSASFHSSPNKELFRNFKSGNFEKVYLADNKYLEIKGKGDVCIKTPAGN
                           S EDIGDALIL++DS +ESWILDS ASFHSSP+KE F+NFKSGNF KVYLADNK L I+GKGDV IKTPAGN
Subjt:  -------------------SEEDIGDALILNMDSSIESWILDSSASFHSSPNKELFRNFKSGNFEKVYLADNKYLEIKGKGDVCIKTPAGN

A0A484MUU4 gag_pre-integrs domain-containing protein7.5e-5157.21Show/hide
Query:  MQIEDYLYQKDLHEPLLGVVLDTMTTEQWKLKDRKALGMIRLTLSRNVAFNIIKEKTTSDLMKALSNMYEKLSTMNK-----------------------
        MQIEDYLYQKDLHEPL GV  D+MT EQWKLKDR+ALGMIRLTL++NVAFNI+KE TT+ LMKALSNMYEK   MNK                       
Subjt:  MQIEDYLYQKDLHEPLLGVVLDTMTTEQWKLKDRKALGMIRLTLSRNVAFNIIKEKTTSDLMKALSNMYEKLSTMNK-----------------------

Query:  ------------VYLRRRLFNLQM-------------------SEEDIGDALILNMDSSIESWILDSSASFHSSPNKELFRNFKSGNFEKVYLADNKYLE
                    V L    F +                     S EDIGDALIL++DS +ESWILDS ASFHSSP+KELF+NFKSGNF KVYLADNK L 
Subjt:  ------------VYLRRRLFNLQM-------------------SEEDIGDALILNMDSSIESWILDSSASFHSSPNKELFRNFKSGNFEKVYLADNKYLE

Query:  IKGKGDVCIKTPAGN
        I+GKGDV IKTP GN
Subjt:  IKGKGDVCIKTPAGN

A0A484NK44 CCHC-type domain-containing protein1.6e-4580.67Show/hide
Query:  QKSPRFVKMESSKIGIEKFDGSDFSFWKMQIEDYLYQKDLHEPLLGVVLDTMTTEQWKLKDRKALGMIRLTLSRNVAFNIIKEKTTSDLMKALSNMYEKL
        ++S R +KMESSK+GIEKFDGSDF FWKMQIEDYLYQKDLHEPL GV  D+MT EQWKLKDR+ALGMIRLTL++NVAFNI+KE TT+ LMKALSNMYEK 
Subjt:  QKSPRFVKMESSKIGIEKFDGSDFSFWKMQIEDYLYQKDLHEPLLGVVLDTMTTEQWKLKDRKALGMIRLTLSRNVAFNIIKEKTTSDLMKALSNMYEKL

Query:  STMNKVYLRRRLFNLQMSE
        S MNKVYL RRLFNLQM E
Subjt:  STMNKVYLRRRLFNLQMSE

A0A6A2XNT2 NB-ARC domain-containing protein8.9e-4455.91Show/hide
Query:  ESSKIGIEKFDGSDFSFWKMQIEDYLYQKDLHEPLLGVVLDTMTTEQWKLKDRKALGMIRLTLSRNVAFNIIKEKTTSDLMKALSNMYEKLSTM------
        +  K+ IEKFDG+DF FWKMQIED+LYQK+L++PLLG   + M  E W L DR+ALG+IRLTLSRNVAFNI KEKTT+ LM ALSN +    T       
Subjt:  ESSKIGIEKFDGSDFSFWKMQIEDYLYQKDLHEPLLGVVLDTMTTEQWKLKDRKALGMIRLTLSRNVAFNIIKEKTTSDLMKALSNMYEKLSTM------

Query:  -NKVYLR--RRLFNLQMSEEDIGDALILNMDSSIESWILDSSASFHSSPNKELFRNFKSGNFEKVYLADNKYLEIKGKGDVCIKTP
         NK+     R L +  ++EE  GDA+IL+++S IESWILDS ASFHS+P +E+  N+ SG+F KV+LAD++ L+I GKGD+ +K P
Subjt:  -NKVYLR--RRLFNLQMSEEDIGDALILNMDSSIESWILDSSASFHSSPNKELFRNFKSGNFEKVYLADNKYLEIKGKGDVCIKTP

A0A6A2Y6G9 Integrase catalytic domain-containing protein1.2e-4345.92Show/hide
Query:  ESSKIGIEKFDGSDFSFWKMQIEDYLYQKDLHEPLLGVVLDTMTTEQWKLKDRKALGMIRLTLSRNVAFNIIKEKTTSDLMKALSNMYEKLSTMNKVYLR
        +  K+ IEKFDG+DF FWKMQIED+LYQK+L++PL G   + M  E W L DRKALG+IRLTLSRN+AFNI KEKTT+ LM ALS+MYEK S  NKV+L 
Subjt:  ESSKIGIEKFDGSDFSFWKMQIEDYLYQKDLHEPLLGVVLDTMTTEQWKLKDRKALGMIRLTLSRNVAFNIIKEKTTSDLMKALSNMYEKLSTMNKVYLR

Query:  RRLFNLQMSE--------------------------------------------------------EDIGDALILNMDSSIESWILDSSASFHSSPNKEL
        RRLFNL+M+E                                                        E+  DA+IL++++ IESWILDS ASFHS+P +E+
Subjt:  RRLFNLQMSE--------------------------------------------------------EDIGDALILNMDSSIESWILDSSASFHSSPNKEL

Query:  FRNFKSGNFEKVYLADNKYLEIKGKGDVCIKTP
          N+ S +F  V+LA ++ L+I GK D+ +K P
Subjt:  FRNFKSGNFEKVYLADNKYLEIKGKGDVCIKTP

SwissProt top hitse value%identityAlignment
P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-942.1e-1037.72Show/hide
Query:  MESSKIGIEKFDGSD-FSFWKMQIEDYLYQKDLHEPL--LGVVLDTMTTEQWKLKDRKALGMIRLTLSRNVAFNIIKEKTTSDLMKALSNMYEKLSTMNK
        M   K  + KF+G + FS W+ ++ D L Q+ LH+ L       DTM  E W   D +A   IRL LS +V  NII E T   +   L ++Y   +  NK
Subjt:  MESSKIGIEKFDGSD-FSFWKMQIEDYLYQKDLHEPL--LGVVLDTMTTEQWKLKDRKALGMIRLTLSRNVAFNIIKEKTTSDLMKALSNMYEKLSTMNK

Query:  VYLRRRLFNLQMSE
        +YL+++L+ L MSE
Subjt:  VYLRRRLFNLQMSE

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-949.3e-0634.43Show/hide
Query:  LNMDSSIESWILDSSASFHSSPNKELFRNFKSGNFEKVYLADNKYLEIKGKGDVCIKTPAG
        +++      W++D++AS H++P ++LF  + +G+F  V + +  Y +I G GD+CIKT  G
Subjt:  LNMDSSIESWILDSSASFHSSPNKELFRNFKSGNFEKVYLADNKYLEIKGKGDVCIKTPAG

Arabidopsis top hitse value%identityAlignment
AT3G29785.1 unknown protein1.7e-1845Show/hide
Query:  EKFDGSDFSFWKMQIEDYLYQKDLHEPLLGVVLDTMTTEQWKLKDRKALGMIRLTLSRNVAFNIIKEKTTSDLMKALSNMYEKLSTMNKVYLRRRLFNLQ
        +K DG+ +SF +M+IEDYLY K LH+P LG  ++TM+ + W +  R+ L +IRLT+S+N+A N+ KEK+   LMK LS++Y+K ST N V       +++
Subjt:  EKFDGSDFSFWKMQIEDYLYQKDLHEPLLGVVLDTMTTEQWKLKDRKALGMIRLTLSRNVAFNIIKEKTTSDLMKALSNMYEKLSTMNKVYLRRRLFNLQ


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCGTTGTTGTGCGATCCTAACAACTGGTATCAGAACTAGGCAGAAGTCACCACGCTTTGTGAAGATGGAAAGTTCAAAGATTGGAATTGAGAAGTTCGATGGATCCGA
TTTCAGTTTCTGGAAGATGCAGATTGAAGATTATCTGTACCAGAAAGATCTTCACGAACCCTTGTTGGGAGTGGTGCTGGATACCATGACCACGGAGCAGTGGAAGCTCA
AGGATAGAAAAGCCTTAGGGATGATCCGGTTAACACTATCCAGAAACGTGGCGTTTAACATCATCAAGGAGAAGACAACGTCAGATCTGATGAAGGCGCTGTCGAATATG
TACGAAAAACTGTCAACTATGAACAAGGTGTATTTGAGGCGGAGATTGTTCAATCTACAGATGTCTGAAGAAGACATTGGGGATGCTCTAATTCTCAACATGGACAGTTC
GATTGAATCCTGGATTTTGGATTCAAGTGCATCTTTTCATTCGTCTCCAAATAAAGAGTTGTTCCGGAATTTCAAGTCTGGAAATTTCGAGAAGGTGTATCTTGCCGACA
ACAAATATTTGGAGATTAAAGGAAAAGGAGATGTCTGCATAAAAACTCCAGCAGGAAATTAG
mRNA sequenceShow/hide mRNA sequence
ATGCGTTGTTGTGCGATCCTAACAACTGGTATCAGAACTAGGCAGAAGTCACCACGCTTTGTGAAGATGGAAAGTTCAAAGATTGGAATTGAGAAGTTCGATGGATCCGA
TTTCAGTTTCTGGAAGATGCAGATTGAAGATTATCTGTACCAGAAAGATCTTCACGAACCCTTGTTGGGAGTGGTGCTGGATACCATGACCACGGAGCAGTGGAAGCTCA
AGGATAGAAAAGCCTTAGGGATGATCCGGTTAACACTATCCAGAAACGTGGCGTTTAACATCATCAAGGAGAAGACAACGTCAGATCTGATGAAGGCGCTGTCGAATATG
TACGAAAAACTGTCAACTATGAACAAGGTGTATTTGAGGCGGAGATTGTTCAATCTACAGATGTCTGAAGAAGACATTGGGGATGCTCTAATTCTCAACATGGACAGTTC
GATTGAATCCTGGATTTTGGATTCAAGTGCATCTTTTCATTCGTCTCCAAATAAAGAGTTGTTCCGGAATTTCAAGTCTGGAAATTTCGAGAAGGTGTATCTTGCCGACA
ACAAATATTTGGAGATTAAAGGAAAAGGAGATGTCTGCATAAAAACTCCAGCAGGAAATTAG
Protein sequenceShow/hide protein sequence
MRCCAILTTGIRTRQKSPRFVKMESSKIGIEKFDGSDFSFWKMQIEDYLYQKDLHEPLLGVVLDTMTTEQWKLKDRKALGMIRLTLSRNVAFNIIKEKTTSDLMKALSNM
YEKLSTMNKVYLRRRLFNLQMSEEDIGDALILNMDSSIESWILDSSASFHSSPNKELFRNFKSGNFEKVYLADNKYLEIKGKGDVCIKTPAGN