; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmoCh20G011240 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene IDCmoCh20G011240
OrganismCucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
DescriptionRetrovirus-related Pol polyprotein from transposon TNT 1-94
Genome locationCmo_Chr20:10740480..10743382
RNA-Seq ExpressionCmoCh20G011240
SyntenyCmoCh20G011240
Gene Ontology termsGO:0000226 - microtubule cytoskeleton organization (biological process)
GO:0044237 - cellular metabolic process (biological process)
GO:0044238 - primary metabolic process (biological process)
GO:0071704 - organic substance metabolic process (biological process)
GO:0005488 - binding (molecular function)
GO:0016740 - transferase activity (molecular function)
InterPro domainsIPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAF3680275.1 putative disease resistance RPP13-like protein 1-like [Capsicum annuum]1.5e-3662.59Show/hide
Query:  DAFIKLFWDVKNKRILRHDDVIFDENVLYKDKETNGSGTTKQMGDEVELRKNSLSDVVADTQGTLEIVADELEVEQVTPEHVLKRSSIAIRVSDRYVPSL
        D F   FWD +NK+ILRH D+ FDENVLYKD+E     TTKQ+G E+EL K++  DV ADTQ TLEI+ +E EVEQVTPE V +RSS  IR  DRY PSL
Subjt:  DAFIKLFWDVKNKRILRHDDVIFDENVLYKDKETNGSGTTKQMGDEVELRKNSLSDVVADTQGTLEIVADELEVEQVTPEHVLKRSSIAIRVSDRYVPSL

Query:  HCLLLTDEGEPELFDEVLQFEDTTKWEKIMDDGMFRLQK
        H LLLTDEGEPE   E LQ ED  KWE+ MDD M  L+K
Subjt:  HCLLLTDEGEPELFDEVLQFEDTTKWEKIMDDGMFRLQK

PHU02774.1 IAA-amino acid hydrolase ILR1-like 1 [Capsicum chinense]1.2e-2562.16Show/hide
Query:  DAFIKLFWDVKNKRILRHDDVIFDENVLYKDKETNGSGTTKQMGDEVELRKNSLSDVVADTQGTLEIVADELEVEQVTPEHVLKRSSIAIRVSDRYVPSL
        D F  +FWD +NK+ILRH DV FDENVLYKDKE     TTKQ+G E+EL K++   V  DTQ T E V +EL+ EQVTPE V +RSS   R  DRY PSL
Subjt:  DAFIKLFWDVKNKRILRHDDVIFDENVLYKDKETNGSGTTKQMGDEVELRKNSLSDVVADTQGTLEIVADELEVEQVTPEHVLKRSSIAIRVSDRYVPSL

Query:  HCLLLTDEGEP
        H LLLTDEGEP
Subjt:  HCLLLTDEGEP

RVW40116.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera]1.2e-2532.55Show/hide
Query:  IGVKWVFNTKLNENGEVDKYKARLVAKGYAQQHGIDYIKVFTPMARWVTIRMIIDL--------------------------------------------
        IGVKWV+ TKLN++G VDKYKARLV KGY Q+ G+DY KVF P+A+  TIR+++ +                                            
Subjt:  IGVKWVFNTKLNENGEVDKYKARLVAKGYAQQHGIDYIKVFTPMARWVTIRMIIDL--------------------------------------------

Query:  -------------------------VRLTKDKEEAKINATMHKQLIGSLMYLTAT--------------------------------SDGAVSWSSKKQP
                                 ++L+K     ++++T++KQ++GSLMYLT+T                                + GA++WSSKKQ 
Subjt:  -------------------------VRLTKDKEEAKINATMHKQLIGSLMYLTAT--------------------------------SDGAVSWSSKKQP

Query:  VVTLSTTEAEFTAAVSCACQ-VYQDGVIELKHCVTQEQVTDIITEPLKLDAFIKL
        +VTLSTTEAEF AA S +CQ ++   ++E+ +   Q+Q+ DI+T+PLK   F+KL
Subjt:  VVTLSTTEAEFTAAVSCACQ-VYQDGVIELKHCVTQEQVTDIITEPLKLDAFIKL

VFQ67296.1 unnamed protein product [Cuscuta campestris]1.8e-2661.74Show/hide
Query:  KLFWDVKNKRILRHDDVIFDENVLYKDKETNGSGTTKQMGDEVELRKNSLSDVVADTQGTLEIVADELEVEQVTPEHVLKRSSIAIRVSDRYVPSLHCLL
        K FWD KN+ ILRH DV FDE+++YKD     S TTKQ+G EVEL K++  DV  +TQ T +IV  E EVEQVTPE VL++SS  IR  DRY PSLH LL
Subjt:  KLFWDVKNKRILRHDDVIFDENVLYKDKETNGSGTTKQMGDEVELRKNSLSDVVADTQGTLEIVADELEVEQVTPEHVLKRSSIAIRVSDRYVPSLHCLL

Query:  LTDEGEPELFDEVLQ
        LTDEGEPE  DE +Q
Subjt:  LTDEGEPELFDEVLQ

VFQ82754.1 unnamed protein product [Cuscuta campestris]4.0e-3463.16Show/hide
Query:  FWDVKNKRILRHDDVIFDENVLYKDKETNGSGTTKQMGDEVELRKNSLSDVVADTQGTLEIVADELEVEQVTPEHVLKRSSIAIRVSDRYVPSLHCLLLT
        FWD KN++ILRH DV FDE+VLYKD+E     TTKQ+G EVEL K++  +V A+TQ T E  A+E EVEQVTPE VL+RSS   RV DRY PSLH LLLT
Subjt:  FWDVKNKRILRHDDVIFDENVLYKDKETNGSGTTKQMGDEVELRKNSLSDVVADTQGTLEIVADELEVEQVTPEHVLKRSSIAIRVSDRYVPSLHCLLLT

Query:  DEGEPELFDEVLQFEDTTKWEKIMDDGMFRLQK
        DEGEPE FDE +Q ED+ KWE+ MDD M  L++
Subjt:  DEGEPELFDEVLQFEDTTKWEKIMDDGMFRLQK

TrEMBL top hitse value%identityAlignment
A0A2N9FH80 Integrase catalytic domain-containing protein1.9e-2932.56Show/hide
Query:  KIGVKWVFNTKLNENGEVDKYKARLVAKGYAQQHGIDYIKVFTPMARWVTIRM-----------------------------------------------
        KIGVKWVF TKLNENGEVDK KARLVAKGYAQQ+GIDY +VF P+ARW TIRM                                               
Subjt:  KIGVKWVFNTKLNENGEVDKYKARLVAKGYAQQHGIDYIKVFTPMARWVTIRM-----------------------------------------------

Query:  ---------------------------------------------------------------------------------------------IIDLVRL
                                                                                                     I+  VRL
Subjt:  ---------------------------------------------------------------------------------------------IIDLVRL

Query:  TKDKEEAKINATMHKQLIGSLMYLTATSDGAVSWSSKKQPVVTLSTTEAEFTAAVSCACQ----------------------------------------
         KD+E AK+NATM+KQL+GSLMYLTAT  G         PVV LSTTEAEF  A SCACQ                                        
Subjt:  TKDKEEAKINATMHKQLIGSLMYLTATSDGAVSWSSKKQPVVTLSTTEAEFTAAVSCACQ----------------------------------------

Query:  -------------VYQDGVIELKHCVTQEQVTDIITEPLKLDAFIKL
                     + +DGV+ELKHCVTQEQV DI+T+PLKLD F+KL
Subjt:  -------------VYQDGVIELKHCVTQEQVTDIITEPLKLDAFIKL

A0A2N9HRC6 Uncharacterized protein1.3e-2730.81Show/hide
Query:  KIGVKWVFNTKLNENGEVDKYKARLVAKGYAQQHGIDYIKVFTPMARWVTIRMIIDL-------------------------------------------
        KIGVKWVF TKLNENGEVDK KARLVAKGYAQQ+GIDY +VF P+ARW TIRM+I L                                           
Subjt:  KIGVKWVFNTKLNENGEVDKYKARLVAKGYAQQHGIDYIKVFTPMARWVTIRMIIDL-------------------------------------------

Query:  ---------------------------------------------------------------------------------------VRLTKDKEEAKIN
                                                                                               VRL KD+E AK+N
Subjt:  ---------------------------------------------------------------------------------------VRLTKDKEEAKIN

Query:  ATMHKQLIGSLMYLTATS--------------------------------DGAVSWS--------------SKKQPVVTLSTTEAEFTAAVSCACQ----
        ATM+KQL+GSLMYLTAT                                  G V                  KKQPVV LSTTE EF AA SCACQ    
Subjt:  ATMHKQLIGSLMYLTATS--------------------------------DGAVSWS--------------SKKQPVVTLSTTEAEFTAAVSCACQ----

Query:  -------------------------------------------------VYQDGVIELKHCVTQEQVTDIITEPLKLDAFIKL
                                                         + +DGV+ELKHCVTQEQV DI+T+PLKLD F+KL
Subjt:  -------------------------------------------------VYQDGVIELKHCVTQEQVTDIITEPLKLDAFIKL

A0A438DX60 Retrovirus-related Pol polyprotein from transposon TNT 1-945.6e-2632.55Show/hide
Query:  IGVKWVFNTKLNENGEVDKYKARLVAKGYAQQHGIDYIKVFTPMARWVTIRMIIDL--------------------------------------------
        IGVKWV+ TKLN++G VDKYKARLV KGY Q+ G+DY KVF P+A+  TIR+++ +                                            
Subjt:  IGVKWVFNTKLNENGEVDKYKARLVAKGYAQQHGIDYIKVFTPMARWVTIRMIIDL--------------------------------------------

Query:  -------------------------VRLTKDKEEAKINATMHKQLIGSLMYLTAT--------------------------------SDGAVSWSSKKQP
                                 ++L+K     ++++T++KQ++GSLMYLT+T                                + GA++WSSKKQ 
Subjt:  -------------------------VRLTKDKEEAKINATMHKQLIGSLMYLTAT--------------------------------SDGAVSWSSKKQP

Query:  VVTLSTTEAEFTAAVSCACQ-VYQDGVIELKHCVTQEQVTDIITEPLKLDAFIKL
        +VTLSTTEAEF AA S +CQ ++   ++E+ +   Q+Q+ DI+T+PLK   F+KL
Subjt:  VVTLSTTEAEFTAAVSCACQ-VYQDGVIELKHCVTQEQVTDIITEPLKLDAFIKL

A0A484KXZ0 Uncharacterized protein8.7e-2761.74Show/hide
Query:  KLFWDVKNKRILRHDDVIFDENVLYKDKETNGSGTTKQMGDEVELRKNSLSDVVADTQGTLEIVADELEVEQVTPEHVLKRSSIAIRVSDRYVPSLHCLL
        K FWD KN+ ILRH DV FDE+++YKD     S TTKQ+G EVEL K++  DV  +TQ T +IV  E EVEQVTPE VL++SS  IR  DRY PSLH LL
Subjt:  KLFWDVKNKRILRHDDVIFDENVLYKDKETNGSGTTKQMGDEVELRKNSLSDVVADTQGTLEIVADELEVEQVTPEHVLKRSSIAIRVSDRYVPSLHCLL

Query:  LTDEGEPELFDEVLQ
        LTDEGEPE  DE +Q
Subjt:  LTDEGEPELFDEVLQ

A0A484M2S3 Uncharacterized protein1.9e-3463.16Show/hide
Query:  FWDVKNKRILRHDDVIFDENVLYKDKETNGSGTTKQMGDEVELRKNSLSDVVADTQGTLEIVADELEVEQVTPEHVLKRSSIAIRVSDRYVPSLHCLLLT
        FWD KN++ILRH DV FDE+VLYKD+E     TTKQ+G EVEL K++  +V A+TQ T E  A+E EVEQVTPE VL+RSS   RV DRY PSLH LLLT
Subjt:  FWDVKNKRILRHDDVIFDENVLYKDKETNGSGTTKQMGDEVELRKNSLSDVVADTQGTLEIVADELEVEQVTPEHVLKRSSIAIRVSDRYVPSLHCLLLT

Query:  DEGEPELFDEVLQFEDTTKWEKIMDDGMFRLQK
        DEGEPE FDE +Q ED+ KWE+ MDD M  L++
Subjt:  DEGEPELFDEVLQFEDTTKWEKIMDDGMFRLQK

SwissProt top hitse value%identityAlignment
P04146 Copia protein1.1e-0750Show/hide
Query:  KWVFNTKLNENGEVDKYKARLVAKGYAQQHGIDYIKVFTPMARWVTIRMIIDLV
        +WVF+ K NE G   +YKARLVA+G+ Q++ IDY + F P+AR  + R I+ LV
Subjt:  KWVFNTKLNENGEVDKYKARLVAKGYAQQHGIDYIKVFTPMARWVTIRMIIDLV

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-943.7e-0643.4Show/hide
Query:  KWVFNTKLNENGEVDKYKARLVAKGYAQQHGIDYIKVFTPMARWVTIRMIIDL
        KWVF  K + + ++ +YKARLV KG+ Q+ GID+ ++F+P+ +  +IR I+ L
Subjt:  KWVFNTKLNENGEVDKYKARLVAKGYAQQHGIDYIKVFTPMARWVTIRMIIDL

P92520 Uncharacterized mitochondrial protein AtMg008201.2e-0946.55Show/hide
Query:  IGVKWVFNTKLNENGEVDKYKARLVAKGYAQQHGIDYIKVFTPMARWVTIRMIIDLVR
        +G KWVF TKL+ +G +D+ KARLVAKG+ Q+ GI +++ ++P+ R  TIR I+++ +
Subjt:  IGVKWVFNTKLNENGEVDKYKARLVAKGYAQQHGIDYIKVFTPMARWVTIRMIIDLVR

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE14.6e-0944.44Show/hide
Query:  IGVKWVFNTKLNENGEVDKYKARLVAKGYAQQHGIDYIKVFTPMARWVTIRMII
        +G +W+F  K N +G +++YKARLVAKGY Q+ G+DY + F+P+ +  +IR+++
Subjt:  IGVKWVFNTKLNENGEVDKYKARLVAKGYAQQHGIDYIKVFTPMARWVTIRMII

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE23.5e-0944.44Show/hide
Query:  IGVKWVFNTKLNENGEVDKYKARLVAKGYAQQHGIDYIKVFTPMARWVTIRMII
        +G +W+F  K N +G +++YKARLVAKGY Q+ G+DY + F+P+ +  +IR+++
Subjt:  IGVKWVFNTKLNENGEVDKYKARLVAKGYAQQHGIDYIKVFTPMARWVTIRMII

Arabidopsis top hitse value%identityAlignment
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 84.6e-1241.89Show/hide
Query:  IGVKWVFNTKLNENGEVDKYKARLVAKGYAQQHGIDYIKVFTPMARWVTIRMIIDLVRLTKDKEEAKINATMHK
        IG KWV+  K N +G +++YKARLVAKGY QQ GID+I+ F+P+ +  ++++I+ +         A  N T+H+
Subjt:  IGVKWVFNTKLNENGEVDKYKARLVAKGYAQQHGIDYIKVFTPMARWVTIRMIIDLVRLTKDKEEAKINATMHK

ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase)8.6e-1146.55Show/hide
Query:  IGVKWVFNTKLNENGEVDKYKARLVAKGYAQQHGIDYIKVFTPMARWVTIRMIIDLVR
        +G KWVF TKL+ +G +D+ KARLVAKG+ Q+ GI +++ ++P+ R  TIR I+++ +
Subjt:  IGVKWVFNTKLNENGEVDKYKARLVAKGYAQQHGIDYIKVFTPMARWVTIRMIIDLVR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAGATTGGAGTAAAGTGGGTTTTCAACACCAAACTCAACGAAAATGGTGAAGTTGACAAGTATAAGGCTAGGTTGGTAGCAAAAGGTTATGCACAACAACATGGTAT
AGACTATATCAAGGTGTTTACACCGATGGCTAGGTGGGTTACTATTCGAATGATAATTGATTTGGTTAGACTGACAAAGGATAAAGAAGAAGCTAAGATCAATGCTACCA
TGCATAAACAATTGATTGGAAGCCTTATGTATCTGACTGCAACAAGTGATGGAGCTGTGTCTTGGTCCTCCAAAAAACAACCTGTTGTTACTTTGTCCACTACTGAAGCA
GAATTTACGGCAGCCGTGTCTTGTGCTTGTCAAGTGTATCAAGATGGAGTTATTGAGCTAAAGCATTGTGTCACACAAGAACAAGTTACAGATATTATAACAGAACCACT
GAAGCTGGATGCATTCATAAAACTATTTTGGGATGTCAAGAATAAGAGAATCCTGAGACATGATGACGTGATTTTTGATGAAAATGTCTTGTACAAGGACAAAGAGACGA
ATGGTTCTGGGACAACGAAGCAAATGGGAGATGAGGTTGAGTTGCGAAAAAATTCACTTAGTGATGTTGTAGCAGATACTCAAGGAACTCTTGAGATTGTTGCTGATGAA
CTAGAGGTGGAGCAAGTGACACCTGAGCATGTGTTGAAAAGATCATCCATAGCTATTAGAGTATCAGATAGGTATGTACCTTCATTACACTGTCTGTTGTTGACTGATGA
AGGGGAACCAGAACTCTTTGATGAGGTCCTACAATTCGAAGATACAACCAAGTGGGAGAAAATCATGGATGATGGGATGTTTAGGCTTCAGAAATGTGTTGCTCTTTCAT
CTACTAAAACCGAGTACGTGGCTATAGCTAAAGCTTGA
mRNA sequenceShow/hide mRNA sequence
ATGAAGATTGGAGTAAAGTGGGTTTTCAACACCAAACTCAACGAAAATGGTGAAGTTGACAAGTATAAGGCTAGGTTGGTAGCAAAAGGTTATGCACAACAACATGGTAT
AGACTATATCAAGGTGTTTACACCGATGGCTAGGTGGGTTACTATTCGAATGATAATTGATTTGGTTAGACTGACAAAGGATAAAGAAGAAGCTAAGATCAATGCTACCA
TGCATAAACAATTGATTGGAAGCCTTATGTATCTGACTGCAACAAGTGATGGAGCTGTGTCTTGGTCCTCCAAAAAACAACCTGTTGTTACTTTGTCCACTACTGAAGCA
GAATTTACGGCAGCCGTGTCTTGTGCTTGTCAAGTGTATCAAGATGGAGTTATTGAGCTAAAGCATTGTGTCACACAAGAACAAGTTACAGATATTATAACAGAACCACT
GAAGCTGGATGCATTCATAAAACTATTTTGGGATGTCAAGAATAAGAGAATCCTGAGACATGATGACGTGATTTTTGATGAAAATGTCTTGTACAAGGACAAAGAGACGA
ATGGTTCTGGGACAACGAAGCAAATGGGAGATGAGGTTGAGTTGCGAAAAAATTCACTTAGTGATGTTGTAGCAGATACTCAAGGAACTCTTGAGATTGTTGCTGATGAA
CTAGAGGTGGAGCAAGTGACACCTGAGCATGTGTTGAAAAGATCATCCATAGCTATTAGAGTATCAGATAGGTATGTACCTTCATTACACTGTCTGTTGTTGACTGATGA
AGGGGAACCAGAACTCTTTGATGAGGTCCTACAATTCGAAGATACAACCAAGTGGGAGAAAATCATGGATGATGGGATGTTTAGGCTTCAGAAATGTGTTGCTCTTTCAT
CTACTAAAACCGAGTACGTGGCTATAGCTAAAGCTTGA
Protein sequenceShow/hide protein sequence
MKIGVKWVFNTKLNENGEVDKYKARLVAKGYAQQHGIDYIKVFTPMARWVTIRMIIDLVRLTKDKEEAKINATMHKQLIGSLMYLTATSDGAVSWSSKKQPVVTLSTTEA
EFTAAVSCACQVYQDGVIELKHCVTQEQVTDIITEPLKLDAFIKLFWDVKNKRILRHDDVIFDENVLYKDKETNGSGTTKQMGDEVELRKNSLSDVVADTQGTLEIVADE
LEVEQVTPEHVLKRSSIAIRVSDRYVPSLHCLLLTDEGEPELFDEVLQFEDTTKWEKIMDDGMFRLQKCVALSSTKTEYVAIAKA