CuGenDBv2

CmoCh19G004690 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene ID: CmoCh19G004690
Organism: Cucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
Description: Retrovirus-related Pol polyprotein from transposon TNT 1-94
Genome location: Cmo_Chr19:5621499..5623507
RNA-Seq Expression: CmoCh19G004690
Synteny: CmoCh19G004690
Gene Ontology terms: GO:0015074 - DNA integration (biological process)
                     GO:0003676 - nucleic acid binding (molecular function)
InterPro domains: NA
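
The genome location field above follows a `seqid:start..end` pattern. A minimal sketch of splitting it into its parts is shown below; the function names `parse_location` and `span_length` are hypothetical, and the length calculation assumes the coordinates are 1-based and inclusive, which the record itself does not state.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'seqid:start..end' string into (seqid, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognized location string: {location!r}")
    seqid, start, end = match.groups()
    return seqid, int(start), int(end)


def span_length(start: int, end: int) -> int:
    """Length of the span, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


seqid, start, end = parse_location("Cmo_Chr19:5621499..5623507")
print(seqid, start, end, span_length(start, end))
```

Under the inclusive-coordinate assumption the span covers 2009 positions; with a half-open convention it would be one fewer.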


Homology
GenBank top hits (e value, % identity, alignment)
XP_016679117.1 uncharacterized protein LOC107898077 [Gossypium hirsutum] (e value: 6.5e-39; % identity: 70.14)
Query:  MKESKSVKEYSNRLLSIANKVRLLGSVLNDSRIVEKLLVTLLEMFEATITTLKNTKDLSNISLIELLNALQAQEQRRSMR------------------PD
        MKES+SVKEY  RLLSIANKVRLLGS LNDSRIVEKLLVT+LE FEAT TTL+NTKDLS ISL+ELLNALQAQEQRRSMR                  PD
Subjt:  MKESKSVKEYSNRLLSIANKVRLLGSVLNDSRIVEKLLVTLLEMFEATITTLKNTKDLSNISLIELLNALQAQEQRRSMR------------------PD

Query:  AKCSKCNQLGHEVVICKVKGQVKDIDAQVKRDKLDKKAEAGFFV
        AKCSKCNQLGHE VICKVKGQV+++DAQV    +D++ E   FV
Subjt:  AKCSKCNQLGHEVVICKVKGQVKDIDAQVKRDKLDKKAEAGFFV

XP_016679117.1 uncharacterized protein LOC107898077 [Gossypium hirsutum] (e value: 1.7e-26; % identity: 66.34)
Query:  QVKRDKLDKKAEAGFFVGYNTISKAYRVFQPHTNRVIVSRNVLFAGNEQWNWKELTKVNKISNAPNNLIFGSMLEESEDERQEESEDERQDALVDDAPVR
        QVKRDKLDKKAEA  FVGY+T+SKAYRVFQPHT RVIVSR+V F  NEQWNW++ TK N+  +APN    GS L        EE EDE QD L DDAPVR
Subjt:  QVKRDKLDKKAEAGFFVGYNTISKAYRVFQPHTNRVIVSRNVLFAGNEQWNWKELTKVNKISNAPNNLIFGSMLEESEDERQEESEDERQDALVDDAPVR

Query:  G
        G
Subjt:  G

XP_016679117.1 uncharacterized protein LOC107898077 [Gossypium hirsutum] (e value: 2.1e-37; % identity: 72.87)
Query:  MKESKSVKEYSNRLLSIANKVRLLGSVLNDSRIVEKLLVTLLEMFEATITTLKNTKDLSNISLIELLNALQAQEQRRSM------------------RPD
        MKES+SVKEYS+RLLSIANKVRLLGS LNDSRIVE LLVT+ E FEATITTL+NTKDLS I L++LLNALQ QEQRRSM                  RPD
Subjt:  MKESKSVKEYSNRLLSIANKVRLLGSVLNDSRIVEKLLVTLLEMFEATITTLKNTKDLSNISLIELLNALQAQEQRRSM------------------RPD

Query:  AKCSKCNQLGHEVVICKVKGQVKDIDAQV
        AKCSKCNQLGHE VICKVKGQV+++DAQV
Subjt:  AKCSKCNQLGHEVVICKVKGQVKDIDAQV

XP_022959005.1 uncharacterized protein LOC111460124 [Cucurbita moschata] (e value: 6.1e-45; % identity: 30.18)
Query:  MKESKSVKEYSNRLLSIANKVRLLGSVLNDSRIVEKLLVTLLEMFEATITTLKNTKDLSNISLIELLNALQAQEQRRSM---------------------
        MK+S+SVKEYSNRLL+IANKVRLLGS+LNDSRIVEKLLVT+ E FEATITTL+NTKDLS ISL ELLNALQAQEQ+RSM                     
Subjt:  MKESKSVKEYSNRLLSIANKVRLLGSVLNDSRIVEKLLVTLLEMFEATITTLKNTKDLSNISLIELLNALQAQEQRRSM---------------------

Query:  ------------------------------------------------RPDAKCSKCNQLGHEVVICKVKGQVKDIDA----------------------
                                                        RPDA CSKCNQLGHE VICKVK  VK++DA                      
Subjt:  ------------------------------------------------RPDAKCSKCNQLGHEVVICKVKGQVKDIDA----------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  -------------------------------------------------------------------------------------------------QVK
                                                                                                         QVK
Subjt:  -------------------------------------------------------------------------------------------------QVK

Query:  RDKLDKKAEAGFFVGYNTISKAYRVFQPHTNRVIVSRNVLFAGNEQWNWKELTKVNKISNAPNNLIFGSMLEESEDERQEESEDERQDALVDDAPVRGGA
        RDKLDKK+E G FVGY+TISKAYRVFQPHT+RVIVSR+V FA N+QWNW+ELTKVN+ISNAPN L FG MLEESEDERQ++SEDERQDALVDDAPVRGGA
Subjt:  RDKLDKKAEAGFFVGYNTISKAYRVFQPHTNRVIVSRNVLFAGNEQWNWKELTKVNKISNAPNNLIFGSMLEESEDERQEESEDERQDALVDDAPVRGGA

Query:  FGD
        FGD
Subjt:  FGD

XP_022959074.1 uncharacterized protein LOC111460172 [Cucurbita moschata] (e value: 2.0e-40; % identity: 65)
Query:  MKESKSVKEYSNRLLSIANKVRLLGSVLNDSRIVEKLLVTLLEMFEATITTLKNTKDLSNISLIELLNALQAQEQRRSMRPDAKCSKCNQLGHEVVICKV
        MKES+SVKEYS+RLLSIANKVRLLGS+LNDSRIVEKLLVT+LE FEATITTL+NTKDLS I LIELLNALQ QE                          
Subjt:  MKESKSVKEYSNRLLSIANKVRLLGSVLNDSRIVEKLLVTLLEMFEATITTLKNTKDLSNISLIELLNALQAQEQRRSMRPDAKCSKCNQLGHEVVICKV

Query:  KGQVKDIDAQVKRDKLDKKAEAGFFVGYNTISKAYRVFQPHTNRVIVSRNVLFAGNEQWN
                 +VK DKLDKKA+AG FVGYN ISKAYRVFQPHT+ VI+ R+V FA NEQWN
Subjt:  KGQVKDIDAQVKRDKLDKKAEAGFFVGYNTISKAYRVFQPHTNRVIVSRNVLFAGNEQWN

XP_022971788.1 uncharacterized protein LOC111470466 [Cucurbita maxima] (e value: 2.3e-60; % identity: 55.19)
Query:  MKESKSVKEYSNRLLSIANKVRLLGSVLNDSRIVEKLLVTLLEMFE------ATITTLKNTKDLSNISLIELLNALQAQEQRRS----------------
        MK+S+SVKEYSNRLLSIANKVRLLGSVLNDS IVEKLLVT+ E  +       T  T +  + +    +IE   AL  + Q                   
Subjt:  MKESKSVKEYSNRLLSIANKVRLLGSVLNDSRIVEKLLVTLLEMFE------ATITTLKNTKDLSNISLIELLNALQAQEQRRS----------------

Query:  ---------------------------------MRPDAKCSKCNQLGHEVVICKVKGQVKDIDAQVKRDKLDKKAEAGFFVGYNTISKAYRVFQPHTNRV
                                          RPDAKCSKCNQLGHE VICKVKGQ+K++DAQ + D  DKK EA  FVGYN ISKAYRVFQPHT+RV
Subjt:  ---------------------------------MRPDAKCSKCNQLGHEVVICKVKGQVKDIDAQVKRDKLDKKAEAGFFVGYNTISKAYRVFQPHTNRV

Query:  IVSRNVLFAGNEQWNWKELTKVNKISNAPNNLIFGSMLEESEDERQEESEDERQDALVDDAPVRGGAFGD
        I+SR+V FAGNEQWNW+ELTKV +ISNA NNLIFGSML        EESEDERQD LVDDAP+RGG FGD
Subjt:  IVSRNVLFAGNEQWNWKELTKVNKISNAPNNLIFGSMLEESEDERQEESEDERQDALVDDAPVRGGAFGD

TrEMBL top hits (e value, % identity, alignment)
A0A1U8ILX5 uncharacterized protein LOC107898077 (e value: 3.1e-39; % identity: 70.14)
Query:  MKESKSVKEYSNRLLSIANKVRLLGSVLNDSRIVEKLLVTLLEMFEATITTLKNTKDLSNISLIELLNALQAQEQRRSMR------------------PD
        MKES+SVKEY  RLLSIANKVRLLGS LNDSRIVEKLLVT+LE FEAT TTL+NTKDLS ISL+ELLNALQAQEQRRSMR                  PD
Subjt:  MKESKSVKEYSNRLLSIANKVRLLGSVLNDSRIVEKLLVTLLEMFEATITTLKNTKDLSNISLIELLNALQAQEQRRSMR------------------PD

Query:  AKCSKCNQLGHEVVICKVKGQVKDIDAQVKRDKLDKKAEAGFFV
        AKCSKCNQLGHE VICKVKGQV+++DAQV    +D++ E   FV
Subjt:  AKCSKCNQLGHEVVICKVKGQVKDIDAQVKRDKLDKKAEAGFFV

A0A1U8ILX5 uncharacterized protein LOC107898077 (e value: 8.0e-27; % identity: 66.34)
Query:  QVKRDKLDKKAEAGFFVGYNTISKAYRVFQPHTNRVIVSRNVLFAGNEQWNWKELTKVNKISNAPNNLIFGSMLEESEDERQEESEDERQDALVDDAPVR
        QVKRDKLDKKAEA  FVGY+T+SKAYRVFQPHT RVIVSR+V F  NEQWNW++ TK N+  +APN    GS L        EE EDE QD L DDAPVR
Subjt:  QVKRDKLDKKAEAGFFVGYNTISKAYRVFQPHTNRVIVSRNVLFAGNEQWNWKELTKVNKISNAPNNLIFGSMLEESEDERQEESEDERQDALVDDAPVR

Query:  G
        G
Subjt:  G

A0A1U8KX28 uncharacterized protein LOC107921672 (e value: 2.4e-39; % identity: 59.41)
Query:  MKESKSVKEYSNRLLSIANKVRLLGSVLNDSRIVEKLLVTLLEMFEATITTLKNTKDLSNISLIELLNALQAQEQRRSM------------------RPD
        MKES+ VKEY +RL SIANKVRLLGS LNDSRIVEKLLVT+ E FEATITTL+NTKDLS ISL+ELLNALQ QEQRRSM                  RPD
Subjt:  MKESKSVKEYSNRLLSIANKVRLLGSVLNDSRIVEKLLVTLLEMFEATITTLKNTKDLSNISLIELLNALQAQEQRRSM------------------RPD

Query:  AKCSKCNQLGHEVVICKVKGQVKDIDAQ-VKRDKLDKKAEAGFFVGYNTISKAYRVFQPHTNRVIVSRNV
        AKCSKCNQLGHE VICKVKGQV+++DAQ V +++ D+     +F G   +S+++ +   HTN +   + +
Subjt:  AKCSKCNQLGHEVVICKVKGQVKDIDAQ-VKRDKLDKKAEAGFFVGYNTISKAYRVFQPHTNRVIVSRNV

A0A6J1H529 uncharacterized protein LOC111460124 (e value: 2.9e-45; % identity: 30.18)
Query:  MKESKSVKEYSNRLLSIANKVRLLGSVLNDSRIVEKLLVTLLEMFEATITTLKNTKDLSNISLIELLNALQAQEQRRSM---------------------
        MK+S+SVKEYSNRLL+IANKVRLLGS+LNDSRIVEKLLVT+ E FEATITTL+NTKDLS ISL ELLNALQAQEQ+RSM                     
Subjt:  MKESKSVKEYSNRLLSIANKVRLLGSVLNDSRIVEKLLVTLLEMFEATITTLKNTKDLSNISLIELLNALQAQEQRRSM---------------------

Query:  ------------------------------------------------RPDAKCSKCNQLGHEVVICKVKGQVKDIDA----------------------
                                                        RPDA CSKCNQLGHE VICKVK  VK++DA                      
Subjt:  ------------------------------------------------RPDAKCSKCNQLGHEVVICKVKGQVKDIDA----------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  -------------------------------------------------------------------------------------------------QVK
                                                                                                         QVK
Subjt:  -------------------------------------------------------------------------------------------------QVK

Query:  RDKLDKKAEAGFFVGYNTISKAYRVFQPHTNRVIVSRNVLFAGNEQWNWKELTKVNKISNAPNNLIFGSMLEESEDERQEESEDERQDALVDDAPVRGGA
        RDKLDKK+E G FVGY+TISKAYRVFQPHT+RVIVSR+V FA N+QWNW+ELTKVN+ISNAPN L FG MLEESEDERQ++SEDERQDALVDDAPVRGGA
Subjt:  RDKLDKKAEAGFFVGYNTISKAYRVFQPHTNRVIVSRNVLFAGNEQWNWKELTKVNKISNAPNNLIFGSMLEESEDERQEESEDERQDALVDDAPVRGGA

Query:  FGD
        FGD
Subjt:  FGD

A0A6J1H6Y2 uncharacterized protein LOC111460172 (e value: 9.8e-41; % identity: 65)
Query:  MKESKSVKEYSNRLLSIANKVRLLGSVLNDSRIVEKLLVTLLEMFEATITTLKNTKDLSNISLIELLNALQAQEQRRSMRPDAKCSKCNQLGHEVVICKV
        MKES+SVKEYS+RLLSIANKVRLLGS+LNDSRIVEKLLVT+LE FEATITTL+NTKDLS I LIELLNALQ QE                          
Subjt:  MKESKSVKEYSNRLLSIANKVRLLGSVLNDSRIVEKLLVTLLEMFEATITTLKNTKDLSNISLIELLNALQAQEQRRSMRPDAKCSKCNQLGHEVVICKV

Query:  KGQVKDIDAQVKRDKLDKKAEAGFFVGYNTISKAYRVFQPHTNRVIVSRNVLFAGNEQWN
                 +VK DKLDKKA+AG FVGYN ISKAYRVFQPHT+ VI+ R+V FA NEQWN
Subjt:  KGQVKDIDAQVKRDKLDKKAEAGFFVGYNTISKAYRVFQPHTNRVIVSRNVLFAGNEQWN

A0A6J1I7Z7 uncharacterized protein LOC111470466 (e value: 1.1e-60; % identity: 55.19)
Query:  MKESKSVKEYSNRLLSIANKVRLLGSVLNDSRIVEKLLVTLLEMFE------ATITTLKNTKDLSNISLIELLNALQAQEQRRS----------------
        MK+S+SVKEYSNRLLSIANKVRLLGSVLNDS IVEKLLVT+ E  +       T  T +  + +    +IE   AL  + Q                   
Subjt:  MKESKSVKEYSNRLLSIANKVRLLGSVLNDSRIVEKLLVTLLEMFE------ATITTLKNTKDLSNISLIELLNALQAQEQRRS----------------

Query:  ---------------------------------MRPDAKCSKCNQLGHEVVICKVKGQVKDIDAQVKRDKLDKKAEAGFFVGYNTISKAYRVFQPHTNRV
                                          RPDAKCSKCNQLGHE VICKVKGQ+K++DAQ + D  DKK EA  FVGYN ISKAYRVFQPHT+RV
Subjt:  ---------------------------------MRPDAKCSKCNQLGHEVVICKVKGQVKDIDAQVKRDKLDKKAEAGFFVGYNTISKAYRVFQPHTNRV

Query:  IVSRNVLFAGNEQWNWKELTKVNKISNAPNNLIFGSMLEESEDERQEESEDERQDALVDDAPVRGGAFGD
        I+SR+V FAGNEQWNW+ELTKV +ISNA NNLIFGSML        EESEDERQD LVDDAP+RGG FGD
Subjt:  IVSRNVLFAGNEQWNWKELTKVNKISNAPNNLIFGSMLEESEDERQEESEDERQDALVDDAPVRGGAFGD

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
No hits found

Sequences
CDS sequence
ATGAAGGAGTCGAAGTCGGTGAAAGAGTACTCTAACAGACTTCTCAGCATTGCCAACAAGGTGAGATTGCTTGGTTCTGTGTTAAATGATTCCAGGATCGTTGAAAAGCT
GCTAGTCACTCTTCTAGAGATGTTTGAAGCCACCATTACTACTCTGAAGAACACCAAAGACCTGTCAAATATTTCTCTTATAGAACTCTTGAATGCTTTACAAGCACAAG
AGCAAAGAAGGTCTATGAGACCTGATGCCAAATGCTCCAAATGCAATCAACTTGGACATGAAGTTGTGATCTGCAAAGTCAAAGGACAGGTGAAAGACATAGATGCACAG
GTCAAGCGTGATAAGCTTGACAAAAAGGCAGAAGCCGGCTTCTTTGTTGGGTATAACACTATATCCAAAGCTTATAGAGTTTTTCAACCACACACTAATCGTGTTATTGT
GAGCCGAAATGTTCTTTTTGCTGGAAATGAGCAATGGAATTGGAAAGAATTGACGAAGGTGAATAAGATTTCTAATGCACCAAACAATTTAATCTTTGGTAGCATGTTAG
AAGAGTCTGAAGATGAACGACAAGAAGAGTCTGAAGATGAACGACAAGATGCTTTGGTTGATGATGCACCTGTCAGAGGAGGAGCTTTTGGTGATTGA
mRNA sequence
ATGAAGGAGTCGAAGTCGGTGAAAGAGTACTCTAACAGACTTCTCAGCATTGCCAACAAGGTGAGATTGCTTGGTTCTGTGTTAAATGATTCCAGGATCGTTGAAAAGCT
GCTAGTCACTCTTCTAGAGATGTTTGAAGCCACCATTACTACTCTGAAGAACACCAAAGACCTGTCAAATATTTCTCTTATAGAACTCTTGAATGCTTTACAAGCACAAG
AGCAAAGAAGGTCTATGAGACCTGATGCCAAATGCTCCAAATGCAATCAACTTGGACATGAAGTTGTGATCTGCAAAGTCAAAGGACAGGTGAAAGACATAGATGCACAG
GTCAAGCGTGATAAGCTTGACAAAAAGGCAGAAGCCGGCTTCTTTGTTGGGTATAACACTATATCCAAAGCTTATAGAGTTTTTCAACCACACACTAATCGTGTTATTGT
GAGCCGAAATGTTCTTTTTGCTGGAAATGAGCAATGGAATTGGAAAGAATTGACGAAGGTGAATAAGATTTCTAATGCACCAAACAATTTAATCTTTGGTAGCATGTTAG
AAGAGTCTGAAGATGAACGACAAGAAGAGTCTGAAGATGAACGACAAGATGCTTTGGTTGATGATGCACCTGTCAGAGGAGGAGCTTTTGGTGATTGA
Protein sequence
MKESKSVKEYSNRLLSIANKVRLLGSVLNDSRIVEKLLVTLLEMFEATITTLKNTKDLSNISLIELLNALQAQEQRRSMRPDAKCSKCNQLGHEVVICKVKGQVKDIDAQ
VKRDKLDKKAEAGFFVGYNTISKAYRVFQPHTNRVIVSRNVLFAGNEQWNWKELTKVNKISNAPNNLIFGSMLEESEDERQEESEDERQDALVDDAPVRGGAFGD
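
The protein sequence above can be checked against the CDS by translating it codon by codon. The sketch below is illustrative rather than CuGenDB's own method: it assumes the standard genetic code (NCBI translation table 1) and a reading frame starting at the first base, and the names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Standard genetic code (NCBI table 1), listed in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks from wrapped records
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        if amino_acid == "*":  # stop codon ends the protein
            break
        protein.append(amino_acid)
    return "".join(protein)


# First ten codons of the CDS sequence listed in this record.
print(translate("ATGAAGGAGTCGAAGTCGGTGAAAGAGTAC"))  # MKESKSVKEY
```

Passing the full wrapped CDS block to `translate` and comparing the result with the protein block is one way to confirm that the two sequences in the record agree.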