CuGenDBv2

CmoCh01G008970 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene ID: CmoCh01G008970
Organism: Cucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
Description: Retrovirus-related Pol polyprotein from transposon TNT 1-94
Genome location: Cmo_Chr01:4990470..4992421
RNA-Seq Expression: CmoCh01G008970
Synteny: CmoCh01G008970
Gene Ontology terms: GO:0003676 - nucleic acid binding (molecular function)
                     GO:0008270 - zinc ion binding (molecular function)
InterPro domains: IPR025314 - Domain of unknown function DUF4219
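
The genome location field uses a `seqid:start..end` layout. A minimal Python sketch for splitting such a string and computing the span it covers is given here. The function names `parse_location` and `span_length` are hypothetical, and the length calculation assumes 1-based, inclusive coordinates, which this record does not state explicitly.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'seqid:start..end' string into (seqid, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    seqid, start, end = match.groups()
    return seqid, int(start), int(end)


def span_length(start: int, end: int) -> int:
    """Number of bases covered, assuming 1-based inclusive coordinates."""
    return end - start + 1


seqid, start, end = parse_location("Cmo_Chr01:4990470..4992421")
print(seqid, start, end, span_length(start, end))
```

Under the inclusive-coordinate assumption the locus spans 1952 bases.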


Homology
GenBank top hits (e value, % identity, alignment)
XP_022949097.1 GRB10-interacting GYF protein 1-like [Cucurbita moschata] | e value: 2.7e-63 | % identity: 94.48
Query:  MLQKKTSKEIGDSMRRKYQVSTRVKHAQLQAVRRDFENFQMQKGEMANDYIGKFNYIVCSIEEANNVEEMQIDELQSSLLVHEQKLNHTSAIEEIKALKI
        MLQKKTSKEIGDSMRRKYQVSTRVKHAQLQAVRRDFENFQMQKGEMANDYIGK    V S+ +ANNVEEMQIDELQSSLLVHEQKLNHTSAIEEIKALKI
Subjt:  MLQKKTSKEIGDSMRRKYQVSTRVKHAQLQAVRRDFENFQMQKGEMANDYIGKFNYIVCSIEEANNVEEMQIDELQSSLLVHEQKLNHTSAIEEIKALKI

Query:  STPGEASSSRSRGQRRGRGHGRGGREKNGYVGKSADLVRDDYDNK
        STPGEASSSRSRGQRRGRGHGRGGREKNGYVGKSADLVRDDYDNK
Subjt:  STPGEASSSRSRGQRRGRGHGRGGREKNGYVGKSADLVRDDYDNK

XP_022975469.1 uncharacterized protein LOC111474837, partial [Cucurbita maxima] | e value: 8.4e-57 | % identity: 62.45
Query:  MIAERQVDNYVQPAISRLD----------------------------EPNEGEVLLVVQQQRLAASKLKDLKVKNYLFQSIDRTILETMLQKKTSKEIGD
        M AE+QV+NYVQPAI R D                            EPNEGEVL   +QQ LAASKLKDLKVKNYLFQSIDRTILETMLQKKTSKEI D
Subjt:  MIAERQVDNYVQPAISRLD----------------------------EPNEGEVLLVVQQQRLAASKLKDLKVKNYLFQSIDRTILETMLQKKTSKEIGD

Query:  SMRRKYQVSTRVKHAQLQAVRRDFENFQMQKGEMANDYIG-------------------------------KFNYIVCSIEEANNVEEMQIDELQSSLLV
        SMRRKYQ STRVK AQLQAVRRDFE  Q QKGE+ NDYIG                               KF+YIVCSIEEANNVEEMQIDELQSSLLV
Subjt:  SMRRKYQVSTRVKHAQLQAVRRDFENFQMQKGEMANDYIG-------------------------------KFNYIVCSIEEANNVEEMQIDELQSSLLV

Query:  HEQKLNHTSAIEEIKALKISTPGEASSSR
         E++LN TSAIEE+ ALKISTP E SSSR
Subjt:  HEQKLNHTSAIEEIKALKISTPGEASSSR

XP_022979572.1 uncharacterized protein LOC111479253 [Cucurbita maxima] | e value: 5.4e-72 | % identity: 64.39
Query:  MIAERQVDNYVQPAISRLD----------------------------EPNEGEVLLVVQQQRLAASKLKDLKVKNYLFQSIDRTILETMLQKKTSKEIGD
        M AE+QV+NYVQPAI R D                            EPNEGEVL   +QQ L ASKLKDLKVKNYLFQSIDRTILETMLQKKTSKEI D
Subjt:  MIAERQVDNYVQPAISRLD----------------------------EPNEGEVLLVVQQQRLAASKLKDLKVKNYLFQSIDRTILETMLQKKTSKEIGD

Query:  SMRRKYQVSTRVKHAQLQAVRRDFENFQMQKGEMANDYIG-------------------------------KFNYIVCSIEEANNVEEMQIDELQSSLLV
        SMRRKYQ STRVK AQLQAVRRDFE  QMQKGE  NDYIG                               KF+YIVCSIEEANNVEEMQIDELQSSLLV
Subjt:  SMRRKYQVSTRVKHAQLQAVRRDFENFQMQKGEMANDYIG-------------------------------KFNYIVCSIEEANNVEEMQIDELQSSLLV

Query:  HEQKLNHTSAIEEIKALKISTPGEASSSRSRGQRRGRGHGRGGREKNGYVGKSADLVRDDYDNK
        HEQ+LN TSAIEE+  LKISTP E SSSR RGQRRGRG  RG RE++G VG+SADLVR+DYDNK
Subjt:  HEQKLNHTSAIEEIKALKISTPGEASSSRSRGQRRGRGHGRGGREKNGYVGKSADLVRDDYDNK

XP_022989543.1 uncharacterized protein LOC111486606 [Cucurbita maxima] | e value: 1.3e-57 | % identity: 57.2
Query:  MIAERQVDNYVQPAISRLD----------------------------EPNEGEVLLVVQQQRLAASKLKDLKVKNYLFQSIDRTILETMLQKKTSKEIGD
        M AE+QV+NYVQPAI R D                            EPNEGEVL   +QQ LAASKLKDLK                   KKT KEI D
Subjt:  MIAERQVDNYVQPAISRLD----------------------------EPNEGEVLLVVQQQRLAASKLKDLKVKNYLFQSIDRTILETMLQKKTSKEIGD

Query:  SMRRKYQVSTRVKHAQLQAVRRDFENFQMQKGEMANDYIG-------------------------------KFNYIVCSIEEANNVEEMQIDELQSSLLV
        SMRRKYQ STRVK AQLQAVRRDFE  QMQKGE  NDYIG                               KF+YIVCSIEEANNVEEMQIDELQSSLLV
Subjt:  SMRRKYQVSTRVKHAQLQAVRRDFENFQMQKGEMANDYIG-------------------------------KFNYIVCSIEEANNVEEMQIDELQSSLLV

Query:  HEQKLNHTSAIEEIKALKISTPGEASSSRSRGQRRGRGHGRGGREKNGYVGKSADLVRDDYDNK
        HEQ+LN TSAIEE+  LKISTP E SSSR RGQRRGRG  RG RE++G VG+SADLVR+DYDNK
Subjt:  HEQKLNHTSAIEEIKALKISTPGEASSSRSRGQRRGRGHGRGGREKNGYVGKSADLVRDDYDNK

XP_023526391.1 uncharacterized protein LOC111789905 [Cucurbita pepo subsp. pepo] | e value: 5.3e-51 | % identity: 58.82
Query:  MIAERQVDNYVQPAISRLDEPNEGEVLLV-----------VQQQRLAASKLKDLKVKNYLFQSIDRTILETMLQKKTSKEIGDSMRRKYQVSTRVKHAQL
        M  ERQVDN +QPAISR +   +   +L+           + +Q+L ASKLKDLKVKNYLFQ IDRTILETMLQKKTSKEI DSMRRKYQ STRVK AQL
Subjt:  MIAERQVDNYVQPAISRLDEPNEGEVLLV-----------VQQQRLAASKLKDLKVKNYLFQSIDRTILETMLQKKTSKEIGDSMRRKYQVSTRVKHAQL

Query:  QAVRRDFENFQMQKGEMANDYIGKFNYIVCSIE-EANNVEEMQIDELQSSLLVHE---QKLNHTSAIEEIKALKISTPGEASSSRSRGQRRGRGHGRGGR
        QAVRRDFE  QMQKGE  NDYI K   +   +    +++ ++ + E     L  +    KLN TSA+EE+ ALKISTP E SSSR RGQRR R  GRGG+
Subjt:  QAVRRDFENFQMQKGEMANDYIGKFNYIVCSIE-EANNVEEMQIDELQSSLLVHE---QKLNHTSAIEEIKALKISTPGEASSSRSRGQRRGRGHGRGGR

Query:  EKNGYVGKSADLVRDDYDNKE
        E++G VG+ ADLVRD+YDNK+
Subjt:  EKNGYVGKSADLVRDDYDNKE

TrEMBL top hits (e value, % identity, alignment)
A0A6J1GB35 GRB10-interacting GYF protein 1-like | e value: 1.3e-63 | % identity: 94.48
Query:  MLQKKTSKEIGDSMRRKYQVSTRVKHAQLQAVRRDFENFQMQKGEMANDYIGKFNYIVCSIEEANNVEEMQIDELQSSLLVHEQKLNHTSAIEEIKALKI
        MLQKKTSKEIGDSMRRKYQVSTRVKHAQLQAVRRDFENFQMQKGEMANDYIGK    V S+ +ANNVEEMQIDELQSSLLVHEQKLNHTSAIEEIKALKI
Subjt:  MLQKKTSKEIGDSMRRKYQVSTRVKHAQLQAVRRDFENFQMQKGEMANDYIGKFNYIVCSIEEANNVEEMQIDELQSSLLVHEQKLNHTSAIEEIKALKI

Query:  STPGEASSSRSRGQRRGRGHGRGGREKNGYVGKSADLVRDDYDNK
        STPGEASSSRSRGQRRGRGHGRGGREKNGYVGKSADLVRDDYDNK
Subjt:  STPGEASSSRSRGQRRGRGHGRGGREKNGYVGKSADLVRDDYDNK

A0A6J1GT20 uncharacterized protein LOC111456724 | e value: 7.2e-46 | % identity: 68.71
Query:  MRRKYQVSTRVKHAQLQAVRRDFENFQMQKGEMANDYIG-------------------------------KFNYIVCSIEEANNVEEMQIDELQSSLLVH
        MRRKYQVSTRVK AQLQAVRRDFE  QMQKGE  N YIG                               KF+YIVCSIEEANNV+E QIDELQSSLLVH
Subjt:  MRRKYQVSTRVKHAQLQAVRRDFENFQMQKGEMANDYIG-------------------------------KFNYIVCSIEEANNVEEMQIDELQSSLLVH

Query:  EQKLNHTSAIEEIKALKISTPGEASSSRSRGQRRGRGHGRGGREKNGYVGKSADLVRDDYDNK
        EQKLN TSAIEE+  LKISTP E SSSR RGQRRGRGHGRGGRE++G VG+SADLVRDDYDNK
Subjt:  EQKLNHTSAIEEIKALKISTPGEASSSRSRGQRRGRGHGRGGREKNGYVGKSADLVRDDYDNK

A0A6J1IKL3 uncharacterized protein LOC111474837 | e value: 4.1e-57 | % identity: 62.45
Query:  MIAERQVDNYVQPAISRLD----------------------------EPNEGEVLLVVQQQRLAASKLKDLKVKNYLFQSIDRTILETMLQKKTSKEIGD
        M AE+QV+NYVQPAI R D                            EPNEGEVL   +QQ LAASKLKDLKVKNYLFQSIDRTILETMLQKKTSKEI D
Subjt:  MIAERQVDNYVQPAISRLD----------------------------EPNEGEVLLVVQQQRLAASKLKDLKVKNYLFQSIDRTILETMLQKKTSKEIGD

Query:  SMRRKYQVSTRVKHAQLQAVRRDFENFQMQKGEMANDYIG-------------------------------KFNYIVCSIEEANNVEEMQIDELQSSLLV
        SMRRKYQ STRVK AQLQAVRRDFE  Q QKGE+ NDYIG                               KF+YIVCSIEEANNVEEMQIDELQSSLLV
Subjt:  SMRRKYQVSTRVKHAQLQAVRRDFENFQMQKGEMANDYIG-------------------------------KFNYIVCSIEEANNVEEMQIDELQSSLLV

Query:  HEQKLNHTSAIEEIKALKISTPGEASSSR
         E++LN TSAIEE+ ALKISTP E SSSR
Subjt:  HEQKLNHTSAIEEIKALKISTPGEASSSR

A0A6J1IP26 uncharacterized protein LOC111479253 | e value: 2.6e-72 | % identity: 64.39
Query:  MIAERQVDNYVQPAISRLD----------------------------EPNEGEVLLVVQQQRLAASKLKDLKVKNYLFQSIDRTILETMLQKKTSKEIGD
        M AE+QV+NYVQPAI R D                            EPNEGEVL   +QQ L ASKLKDLKVKNYLFQSIDRTILETMLQKKTSKEI D
Subjt:  MIAERQVDNYVQPAISRLD----------------------------EPNEGEVLLVVQQQRLAASKLKDLKVKNYLFQSIDRTILETMLQKKTSKEIGD

Query:  SMRRKYQVSTRVKHAQLQAVRRDFENFQMQKGEMANDYIG-------------------------------KFNYIVCSIEEANNVEEMQIDELQSSLLV
        SMRRKYQ STRVK AQLQAVRRDFE  QMQKGE  NDYIG                               KF+YIVCSIEEANNVEEMQIDELQSSLLV
Subjt:  SMRRKYQVSTRVKHAQLQAVRRDFENFQMQKGEMANDYIG-------------------------------KFNYIVCSIEEANNVEEMQIDELQSSLLV

Query:  HEQKLNHTSAIEEIKALKISTPGEASSSRSRGQRRGRGHGRGGREKNGYVGKSADLVRDDYDNK
        HEQ+LN TSAIEE+  LKISTP E SSSR RGQRRGRG  RG RE++G VG+SADLVR+DYDNK
Subjt:  HEQKLNHTSAIEEIKALKISTPGEASSSRSRGQRRGRGHGRGGREKNGYVGKSADLVRDDYDNK

A0A6J1JQL3 uncharacterized protein LOC111486606 | e value: 6.3e-58 | % identity: 57.2
Query:  MIAERQVDNYVQPAISRLD----------------------------EPNEGEVLLVVQQQRLAASKLKDLKVKNYLFQSIDRTILETMLQKKTSKEIGD
        M AE+QV+NYVQPAI R D                            EPNEGEVL   +QQ LAASKLKDLK                   KKT KEI D
Subjt:  MIAERQVDNYVQPAISRLD----------------------------EPNEGEVLLVVQQQRLAASKLKDLKVKNYLFQSIDRTILETMLQKKTSKEIGD

Query:  SMRRKYQVSTRVKHAQLQAVRRDFENFQMQKGEMANDYIG-------------------------------KFNYIVCSIEEANNVEEMQIDELQSSLLV
        SMRRKYQ STRVK AQLQAVRRDFE  QMQKGE  NDYIG                               KF+YIVCSIEEANNVEEMQIDELQSSLLV
Subjt:  SMRRKYQVSTRVKHAQLQAVRRDFENFQMQKGEMANDYIG-------------------------------KFNYIVCSIEEANNVEEMQIDELQSSLLV

Query:  HEQKLNHTSAIEEIKALKISTPGEASSSRSRGQRRGRGHGRGGREKNGYVGKSADLVRDDYDNK
        HEQ+LN TSAIEE+  LKISTP E SSSR RGQRRGRG  RG RE++G VG+SADLVR+DYDNK
Subjt:  HEQKLNHTSAIEEIKALKISTPGEASSSRSRGQRRGRGHGRGGREKNGYVGKSADLVRDDYDNK

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
AT1G48720.1 unknown protein | e value: 3.8e-07 | % identity: 51.11
Query:  MANMMGQMPLPRLTKTNYENWSIQMKALLDLQDAWEVVEDAFKEP
        MA+      +P LTK+NY+NWS++MKA+L   D WE+VE  F EP
Subjt:  MANMMGQMPLPRLTKTNYENWSIQMKALLDLQDAWEVVEDAFKEP
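
The % identity values in the hit tables can be related to the alignment blocks by counting identical columns in the middle line, where a letter marks an identical residue, `+` a similar one, and a space a mismatch or gap. The Python sketch below does this for the AT1G48720.1 alignment. The function name `percent_identity` is hypothetical, and the sketch assumes identity is defined as identical columns divided by total alignment columns.

```python
def percent_identity(query: str, midline: str) -> float:
    """Percent identity from a BLAST-style query line and its midline.

    Assumes letters in the midline mark identical columns and that the
    query line (gaps included) gives the total number of columns.
    """
    # Trailing spaces in the midline are often stripped; pad to query length.
    midline = midline.ljust(len(query))
    identical = sum(1 for ch in midline if ch.isalpha())
    return 100.0 * identical / len(query)


query = "MANMMGQMPLPRLTKTNYENWSIQMKALLDLQDAWEVVEDAFKEP"
# Midline of the same alignment, with runs of spaces written explicitly.
midline = (
    "MA+" + " " * 6 + "+P LTK+NY+NWS++MKA+L" + " " * 3 + "D WE+VE" + " " * 2 + "F EP"
)

print(round(percent_identity(query, midline), 2))
```

For this alignment, 23 of 45 columns are identical, which matches the 51.11 reported in the table.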


Sequences
CDS sequence
ATGATAGCAGAAAGACAAGTTGATAACTATGTTCAGCCCGCCATTTCGAGATTAGATGAGCCAAATGAAGGTGAGGTACTCTTAGTTGTTCAACAGCAACGGTTGGCAGC
ATCGAAGTTGAAGGACCTGAAGGTCAAGAATTACTTATTCCAGTCCATTGATCGAACTATATTGGAAACGATGCTTCAGAAGAAGACATCCAAGGAAATTGGGGATTCAA
TGAGGAGGAAGTATCAAGTCTCAACGAGAGTAAAGCATGCTCAACTTCAAGCAGTTAGAAGAGACTTTGAAAATTTTCAAATGCAAAAGGGAGAAATGGCCAATGATTAT
ATTGGCAAGTTCAATTATATTGTATGTTCTATTGAAGAAGCAAACAATGTCGAAGAAATGCAAATAGATGAACTTCAAAGCTCCTTACTAGTGCATGAGCAGAAGCTTAA
TCACACAAGTGCAATAGAGGAGATAAAAGCCTTAAAGATATCCACACCAGGTGAAGCTTCAAGCTCCAGAAGTAGAGGGCAAAGAAGAGGCAGAGGTCATGGAAGAGGCG
GTCGAGAAAAAAATGGATACGTTGGCAAGTCAGCAGATCTAGTCAGAGACGATTATGACAACAAAGAGCTGATGGCGAACATGATGGGACAAATGCCGCTACCACGATTG
ACGAAGACGAACTACGAGAATTGGAGCATCCAAATGAAAGCTCTTCTTGATTTGCAAGACGCATGGGAGGTGGTCGAAGACGCTTTCAAAGAACCGATTGATACCACGGG
TTATACGGTGGCGTAA
mRNA sequence
ATGATAGCAGAAAGACAAGTTGATAACTATGTTCAGCCCGCCATTTCGAGATTAGATGAGCCAAATGAAGGTGAGGTACTCTTAGTTGTTCAACAGCAACGGTTGGCAGC
ATCGAAGTTGAAGGACCTGAAGGTCAAGAATTACTTATTCCAGTCCATTGATCGAACTATATTGGAAACGATGCTTCAGAAGAAGACATCCAAGGAAATTGGGGATTCAA
TGAGGAGGAAGTATCAAGTCTCAACGAGAGTAAAGCATGCTCAACTTCAAGCAGTTAGAAGAGACTTTGAAAATTTTCAAATGCAAAAGGGAGAAATGGCCAATGATTAT
ATTGGCAAGTTCAATTATATTGTATGTTCTATTGAAGAAGCAAACAATGTCGAAGAAATGCAAATAGATGAACTTCAAAGCTCCTTACTAGTGCATGAGCAGAAGCTTAA
TCACACAAGTGCAATAGAGGAGATAAAAGCCTTAAAGATATCCACACCAGGTGAAGCTTCAAGCTCCAGAAGTAGAGGGCAAAGAAGAGGCAGAGGTCATGGAAGAGGCG
GTCGAGAAAAAAATGGATACGTTGGCAAGTCAGCAGATCTAGTCAGAGACGATTATGACAACAAAGAGCTGATGGCGAACATGATGGGACAAATGCCGCTACCACGATTG
ACGAAGACGAACTACGAGAATTGGAGCATCCAAATGAAAGCTCTTCTTGATTTGCAAGACGCATGGGAGGTGGTCGAAGACGCTTTCAAAGAACCGATTGATACCACGGG
TTATACGGTGGCGTAA
Protein sequence
MIAERQVDNYVQPAISRLDEPNEGEVLLVVQQQRLAASKLKDLKVKNYLFQSIDRTILETMLQKKTSKEIGDSMRRKYQVSTRVKHAQLQAVRRDFENFQMQKGEMANDY
IGKFNYIVCSIEEANNVEEMQIDELQSSLLVHEQKLNHTSAIEEIKALKISTPGEASSSRSRGQRRGRGHGRGGREKNGYVGKSADLVRDDYDNKELMANMMGQMPLPRL
TKTNYENWSIQMKALLDLQDAWEVVEDAFKEPIDTTGYTVA
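
The CDS and protein sequences can be cross-checked by translating the CDS with the standard genetic code. The following is a minimal Python sketch under that assumption. The helper name `translate` and its `to_stop` parameter are hypothetical, and only the first 30 and last 18 bases of the CDS are used here for brevity.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str, to_stop: bool = True) -> str:
    """Translate a DNA coding sequence in frame 1.

    Whitespace and line breaks are ignored; codons containing characters
    other than A, C, G, T are translated as 'X'.
    """
    seq = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")
        if aa == "*" and to_stop:
            break
        protein.append(aa)
    return "".join(protein)


cds_start = "ATGATAGCAGAAAGACAAGTTGATAACTAT"  # first 30 bases of the CDS
cds_end = "GGTTATACGGTGGCGTAA"               # last 18 bases, including the stop codon

print(translate(cds_start))
print(translate(cds_end))
```

The two fragments translate to MIAERQVDNY and GYTVA, the first ten and last five residues of the protein sequence. Passing the full CDS, line breaks included, should yield the complete protein sequence in the same way.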