; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmaCh02G004960 (gene) of Cucurbita maxima (Rimu) v1.1 genome

Gene IDCmaCh02G004960
OrganismCucurbita maxima Rimu (Cucurbita maxima (Rimu) v1.1)
DescriptionRetrovirus-related Pol polyprotein from transposon TNT 1-94
Genome locationCma_Chr02:2646248..2646865
RNA-Seq ExpressionCmaCh02G004960
SyntenyCmaCh02G004960
Gene Ontology termsGO:0006259 - DNA metabolic process (biological process)
GO:0007015 - actin filament organization (biological process)
GO:0034645 - cellular macromolecule biosynthetic process (biological process)
GO:0016020 - membrane (cellular component)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
GO:0016779 - nucleotidyltransferase activity (molecular function)
InterPro domainsIPR036875 - Zinc finger, CCHC-type superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAF3643966.1 Pleiotropic drug resistance protein 1 [Capsicum annuum]2.2e-7377.18Show/hide
Query:  MIRLTLSRNVAFNIIKEKTTSDLMKALSNMYEKPSAMNKVYLMRRLFNLQMSEGGSVVDHINEFNMIVSQLSSVEINFEDEIKTLILMSSLPESWDTVVA
        +IRLTLSRNVAFNI+KEKTTSDL+KALSNMYEKPSA NKVYLMRRLFNLQM E GSV DHINEFNMIVSQL SV+INFEDEIK LILMSSLPE   T+V 
Subjt:  MIRLTLSRNVAFNIIKEKTTSDLMKALSNMYEKPSAMNKVYLMRRLFNLQMSEGGSVVDHINEFNMIVSQLSSVEINFEDEIKTLILMSSLPESWDTVVA

Query:  AISSSRGSDKLKFDKIRDVVLSESTRKREIGNSSSSALSVDQRGRSKPKSPN-KGRSKSKNREKSPNRPNVTCWNCGEKVHFRTGCTRPKRKQNHKSGDD
        AISSS GS+KLKFDKIRDVV S+S RKREIG SS SALSVD+RGR + +  N   RSKSKNR KSP + NVTCWNCGEK HF T C +PK+ +N KSGDD
Subjt:  AISSSRGSDKLKFDKIRDVVLSESTRKREIGNSSSSALSVDQRGRSKPKSPN-KGRSKSKNREKSPNRPNVTCWNCGEKVHFRTGCTRPKRKQNHKSGDD

Query:  DDSINS
        +DS+NS
Subjt:  DDSINS

KAG7011443.1 hypothetical protein SDJN02_26349, partial [Cucurbita argyrosperma subsp. argyrosperma]2.5e-5665.52Show/hide
Query:  MIRLTLSRNVAFNIIKEKTTSDLMKALSNMYEKPSAMNKVYLMRRLFNLQMSEGGSVVDHINEFNMIVSQLSSVEINFEDEIKTLILMSSLPESWDTVVA
        +IRLTLSRN AFNIIKEKTTSDL+KALSNMYEK SAMNKVYLMRRLFNLQMSEGGS+ D+INEFNMIVS+LS VEINF+DEIK LILMSSLPESWDTVVA
Subjt:  MIRLTLSRNVAFNIIKEKTTSDLMKALSNMYEKPSAMNKVYLMRRLFNLQMSEGGSVVDHINEFNMIVSQLSSVEINFEDEIKTLILMSSLPESWDTVVA

Query:  AISSSRGSDKLKFDKIRDVVLSESTRKREIGNSSSSALSVDQRGRSKPKSPNKGRSKSKNREKSPNRPNVTCWNCGEKVHFRTGCTRPKRKQNHKSGDDD
        AI+SSRGSDKLKFD+IRD+VL ES R R+ G+SS  ALS D                                           CT+ K+KQNHKS DDD
Subjt:  AISSSRGSDKLKFDKIRDVVLSESTRKREIGNSSSSALSVDQRGRSKPKSPNKGRSKSKNREKSPNRPNVTCWNCGEKVHFRTGCTRPKRKQNHKSGDDD

Query:  DSI
        DSI
Subjt:  DSI

VFQ59121.1 unnamed protein product [Cuscuta campestris]9.2e-5977.91Show/hide
Query:  MIRLTLSRNVAFNIIKEKTTSDLMKALSNMYEKPSAMNKVYLMRRLFNLQMSEGGSVVDHINEFNMIVSQLSSVEINFEDEIKTLILMSSLPESWDTVVA
        MIRLTL++NVAFNI+KE TT+ LMKALSN+YEKPSAMNKVYLMRRLFNLQM E GSV +HIN+FNMIVSQL  VEINFEDEIK LIL+SS+PESWD VVA
Subjt:  MIRLTLSRNVAFNIIKEKTTSDLMKALSNMYEKPSAMNKVYLMRRLFNLQMSEGGSVVDHINEFNMIVSQLSSVEINFEDEIKTLILMSSLPESWDTVVA

Query:  AISSSRGSDKLKFDKIRDVVLSESTRKREIGNSSSSALSVDQRGRSKPKSPNK-GRSKSKNREKSPNRPNVT
        AISSSRGS+KL+FD+IRDVVLSES RKRE+ +SS SALSVD+RGR K K  ++ GRSKSKNR KSPNR  +T
Subjt:  AISSSRGSDKLKFDKIRDVVLSESTRKREIGNSSSSALSVDQRGRSKPKSPNK-GRSKSKNREKSPNRPNVT

VFQ69914.1 unnamed protein product [Cuscuta campestris]2.3e-6277.65Show/hide
Query:  MNKVYLMRRLFNLQMSEGGSVVDHINEFNMIVSQLSSVEINFEDEIKTLILMSSLPESWDTVVAAISSSRGSDKLKFDKIRDVVLSESTRKREIGNSSSS
        MNKVYLMRRLFNLQM E GSV +HIN+FNMIVSQL SVEINFEDEIK LIL+SS+PESWDTVVAAISSSRGS+KL+FD+IRDVVLSES RKRE+G+SS S
Subjt:  MNKVYLMRRLFNLQMSEGGSVVDHINEFNMIVSQLSSVEINFEDEIKTLILMSSLPESWDTVVAAISSSRGSDKLKFDKIRDVVLSESTRKREIGNSSSS

Query:  ALSVDQRGRSKPKSPNK-GRSKSKNREKSPNRPNVTCWNCGEKVHFRTGCTRPKRKQNHKSGDDDDSINS
        ALSVD++GRSK K  ++ GRSKSKNR KSPNR N+TCWNCG+K HF+  C +PK+KQN KSGDD DS+NS
Subjt:  ALSVDQRGRSKPKSPNK-GRSKSKNREKSPNRPNVTCWNCGEKVHFRTGCTRPKRKQNHKSGDDDDSINS

VFR00719.1 unnamed protein product [Cuscuta campestris]8.3e-7678.79Show/hide
Query:  NVAFNIIKEKTTSDLMKALSNMYEKPSAMNKVYLMRRLFNLQMSEGGSVVDHINEFNMIVSQLSSVEINFEDEIKTLILMSSLPESWDTVVAAISSSRGS
        NVAFNI+KE TT+ LMKALSNMYEKPSAMNKVYLMRRLFNLQM E GSV +HIN+FNMIVSQL SVEINFEDEIK LIL+SS+ ESWDTVVAAISSSRGS
Subjt:  NVAFNIIKEKTTSDLMKALSNMYEKPSAMNKVYLMRRLFNLQMSEGGSVVDHINEFNMIVSQLSSVEINFEDEIKTLILMSSLPESWDTVVAAISSSRGS

Query:  DKLKFDKIRDVVLSESTRKREIGNSSSSALSVDQRGRSKPKSPNK-GRSKSKNREKSPNRPNVTCWNCGEKVHFRTGCTRPKRKQNHKSGDDDDSINS
        +KL+FD+IRDVVLSES RKRE+G+SS SALSVDQ+GRSK K  ++ GRSKSKNR KSPNR N+TCWNCG+K HF+  C +PK+KQN KSGDD DS+NS
Subjt:  DKLKFDKIRDVVLSESTRKREIGNSSSSALSVDQRGRSKPKSPNK-GRSKSKNREKSPNRPNVTCWNCGEKVHFRTGCTRPKRKQNHKSGDDDDSINS

TrEMBL top hitse value%identityAlignment
A0A484K039 Uncharacterized protein4.4e-5977.91Show/hide
Query:  MIRLTLSRNVAFNIIKEKTTSDLMKALSNMYEKPSAMNKVYLMRRLFNLQMSEGGSVVDHINEFNMIVSQLSSVEINFEDEIKTLILMSSLPESWDTVVA
        MIRLTL++NVAFNI+KE TT+ LMKALSN+YEKPSAMNKVYLMRRLFNLQM E GSV +HIN+FNMIVSQL  VEINFEDEIK LIL+SS+PESWD VVA
Subjt:  MIRLTLSRNVAFNIIKEKTTSDLMKALSNMYEKPSAMNKVYLMRRLFNLQMSEGGSVVDHINEFNMIVSQLSSVEINFEDEIKTLILMSSLPESWDTVVA

Query:  AISSSRGSDKLKFDKIRDVVLSESTRKREIGNSSSSALSVDQRGRSKPKSPNK-GRSKSKNREKSPNRPNVT
        AISSSRGS+KL+FD+IRDVVLSES RKRE+ +SS SALSVD+RGR K K  ++ GRSKSKNR KSPNR  +T
Subjt:  AISSSRGSDKLKFDKIRDVVLSESTRKREIGNSSSSALSVDQRGRSKPKSPNK-GRSKSKNREKSPNRPNVT

A0A484KZ82 CCHC-type domain-containing protein1.1e-6277.65Show/hide
Query:  MNKVYLMRRLFNLQMSEGGSVVDHINEFNMIVSQLSSVEINFEDEIKTLILMSSLPESWDTVVAAISSSRGSDKLKFDKIRDVVLSESTRKREIGNSSSS
        MNKVYLMRRLFNLQM E GSV +HIN+FNMIVSQL SVEINFEDEIK LIL+SS+PESWDTVVAAISSSRGS+KL+FD+IRDVVLSES RKRE+G+SS S
Subjt:  MNKVYLMRRLFNLQMSEGGSVVDHINEFNMIVSQLSSVEINFEDEIKTLILMSSLPESWDTVVAAISSSRGSDKLKFDKIRDVVLSESTRKREIGNSSSS

Query:  ALSVDQRGRSKPKSPNK-GRSKSKNREKSPNRPNVTCWNCGEKVHFRTGCTRPKRKQNHKSGDDDDSINS
        ALSVD++GRSK K  ++ GRSKSKNR KSPNR N+TCWNCG+K HF+  C +PK+KQN KSGDD DS+NS
Subjt:  ALSVDQRGRSKPKSPNK-GRSKSKNREKSPNRPNVTCWNCGEKVHFRTGCTRPKRKQNHKSGDDDDSINS

A0A484NK44 CCHC-type domain-containing protein4.0e-7678.79Show/hide
Query:  NVAFNIIKEKTTSDLMKALSNMYEKPSAMNKVYLMRRLFNLQMSEGGSVVDHINEFNMIVSQLSSVEINFEDEIKTLILMSSLPESWDTVVAAISSSRGS
        NVAFNI+KE TT+ LMKALSNMYEKPSAMNKVYLMRRLFNLQM E GSV +HIN+FNMIVSQL SVEINFEDEIK LIL+SS+ ESWDTVVAAISSSRGS
Subjt:  NVAFNIIKEKTTSDLMKALSNMYEKPSAMNKVYLMRRLFNLQMSEGGSVVDHINEFNMIVSQLSSVEINFEDEIKTLILMSSLPESWDTVVAAISSSRGS

Query:  DKLKFDKIRDVVLSESTRKREIGNSSSSALSVDQRGRSKPKSPNK-GRSKSKNREKSPNRPNVTCWNCGEKVHFRTGCTRPKRKQNHKSGDDDDSINS
        +KL+FD+IRDVVLSES RKRE+G+SS SALSVDQ+GRSK K  ++ GRSKSKNR KSPNR N+TCWNCG+K HF+  C +PK+KQN KSGDD DS+NS
Subjt:  DKLKFDKIRDVVLSESTRKREIGNSSSSALSVDQRGRSKPKSPNK-GRSKSKNREKSPNRPNVTCWNCGEKVHFRTGCTRPKRKQNHKSGDDDDSINS

A0A5B7BAK4 Uncharacterized protein2.5e-5458.46Show/hide
Query:  MIRLTLSRNVAFNIIKEKTTSDLMKALSNMYEKPSAMNKVYLMRRLFNLQMSEGGSVVDHINEFNMIVSQLSSVEINFEDEIKTLILMSSLPESWDTVVA
        ++RLTL+RNVAFNI KEKTT+ LM ALSNMYEKPSA NKVYLMRRLFNL+MSEG SV +H+NEFN++ +QLSSVEI F+DEI+ LIL+SSLPESW+  V 
Subjt:  MIRLTLSRNVAFNIIKEKTTSDLMKALSNMYEKPSAMNKVYLMRRLFNLQMSEGGSVVDHINEFNMIVSQLSSVEINFEDEIKTLILMSSLPESWDTVVA

Query:  AISSSRGSDKLKFDKIRDVVLSESTRKREIGNSSSSALSVDQRGRSKPKSPNKGRSKSKNREKSPNRPNVTCWNCGEKVHFRTGCTRPKRKQNHK
        A+SSS G+ KLK+D +RD++LSE  R+RE G SS SAL+V+ RGR+  ++ +  RS+S+  +    +    CWNCG+  H +  C  PK+++  K
Subjt:  AISSSRGSDKLKFDKIRDVVLSESTRKREIGNSSSSALSVDQRGRSKPKSPNKGRSKSKNREKSPNRPNVTCWNCGEKVHFRTGCTRPKRKQNHK

A0A6A3BK95 Uncharacterized protein9.6e-5460.73Show/hide
Query:  MIRLTLSRNVAFNIIKEKTTSDLMKALSNMYEKPSAMNKVYLMRRLFNLQMSEGGSVVDHINEFNMIVSQLSSVEINFEDEIKTLILMSSLPESWDTVVA
        +IRLTLSRNVAFNI KEKTT+ LM ALS+MYEKPSA NKV+LMRRLFNL+M+EG SV  H+NE N I +QLSSVEI F+DE++ LIL+SSLP+SW+  V 
Subjt:  MIRLTLSRNVAFNIIKEKTTSDLMKALSNMYEKPSAMNKVYLMRRLFNLQMSEGGSVVDHINEFNMIVSQLSSVEINFEDEIKTLILMSSLPESWDTVVA

Query:  AISSSRGSDKLKFDKIRDVVLSESTRKREIGN-SSSSALSVDQRGRSKPKSPNKGRSKSKNREKSPNRPNVTCWNCGEKVHFRTGCTRPKR
        A+SSS G++KLKFD +RD+VLSE  R+RE G  S+SSAL  + RGR+  ++ N+GRSKS+  +      + TC+NCG+K HF+  C  PK+
Subjt:  AISSSRGSDKLKFDKIRDVVLSESTRKREIGN-SSSSALSVDQRGRSKPKSPNKGRSKSKNREKSPNRPNVTCWNCGEKVHFRTGCTRPKR

SwissProt top hitse value%identityAlignment
P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-941.1e-2235.61Show/hide
Query:  IRLTLSRNVAFNIIKEKTTSDLMKALSNMYEKPSAMNKVYLMRRLFNLQMSEGGSVVDHINEFNMIVSQLSSVEINFEDEIKTLILMSSLPESWDTVVAA
        IRL LS +V  NII E T   +   L ++Y   +  NK+YL ++L+ L MSEG + + H+N FN +++QL+++ +  E+E K ++L++SLP S+D +   
Subjt:  IRLTLSRNVAFNIIKEKTTSDLMKALSNMYEKPSAMNKVYLMRRLFNLQMSEGGSVVDHINEFNMIVSQLSSVEINFEDEIKTLILMSSLPESWDTVVAA

Query:  ISSSRGSDKLKFDKIRDVVLSESTRKREIGNSSSSALSVDQRGRSKPKSPNK-----GRSKSKNREKSPNRPNVTCWNCGEKVHFRTGCTRPKRKQNHKS
        I   + + +LK D    ++L+E  RK+    +   AL  + RGRS  +S N       R KSKNR KS  R    C+NC +  HF+  C  P++ +   S
Subjt:  ISSSRGSDKLKFDKIRDVVLSESTRKREIGNSSSSALSVDQRGRSKPKSPNK-----GRSKSKNREKSPNRPNVTCWNCGEKVHFRTGCTRPKRKQNHKS

Query:  GDDDD
        G  +D
Subjt:  GDDDD

Arabidopsis top hitse value%identityAlignment
AT3G29785.1 unknown protein1.6e-0555Show/hide
Query:  MIRLTLSRNVAFNIIKEKTTSDLMKALSNMYEKPSAMNKV
        +IRLT+S+N+A N+ KEK+   LMK LS++Y+KPS  N V
Subjt:  MIRLTLSRNVAFNIIKEKTTSDLMKALSNMYEKPSAMNKV

AT4G35820.1 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein9.0e-0426.6Show/hide
Query:  MYEKPSAMNKVYLMRRLFNLQMSEGGSVVDHINEFNMIVSQLSSVEINFEDEIKTLILMSSLPESWDTVVAAISSSRGSDKLKFDKIRDVVLSE
        M +  S    +YL +RL  L++ E   ++ HIN F+ +V +  SV++  E++ K +IL+ SL     T+  ++     S +   ++  +V+  E
Subjt:  MYEKPSAMNKVYLMRRLFNLQMSEGGSVVDHINEFNMIVSQLSSVEINFEDEIKTLILMSSLPESWDTVVAAISSSRGSDKLKFDKIRDVVLSE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATCCGGTTGACGCTATCCAGAAACGTGGCTTTTAATATCATCAAGGAGAAGACAACGTCAGATCTGATGAAGGCGCTGTCGAATATGTACGAAAAACCGTCG
GCTATGAACAAGGTGTATTTGATGCGAAGATTGTTCAACCTACAGATGTCTGAAGGTGGATCTGTTGTTGATCATATAAATGAATTCAATATGATCGTAAGTCAA
CTGAGTTCGGTGGAAATTAATTTCGAGGATGAAATTAAAACATTGATTTTGATGTCATCTTTACCCGAGTCGTGGGATACTGTTGTTGCCGCAATCAGCAGTTCG
CGAGGATCTGATAAACTGAAGTTTGATAAAATTCGAGATGTAGTTCTTAGCGAAAGTACTCGCAAAAGAGAAATTGGAAATTCATCTAGTAGTGCTCTCAGTGTT
GACCAACGGGGAAGAAGTAAACCGAAGAGCCCAAACAAAGGGCGATCAAAATCAAAGAACCGAGAAAAATCTCCAAATAGACCAAACGTAACGTGTTGGAATTGT
GGAGAAAAAGTTCACTTTCGGACAGGTTGTACAAGACCAAAGAGGAAGCAGAATCACAAATCTGGAGATGACGATGATTCTATAAATTCATAA
mRNA sequenceShow/hide mRNA sequence
ATGATCCGGTTGACGCTATCCAGAAACGTGGCTTTTAATATCATCAAGGAGAAGACAACGTCAGATCTGATGAAGGCGCTGTCGAATATGTACGAAAAACCGTCG
GCTATGAACAAGGTGTATTTGATGCGAAGATTGTTCAACCTACAGATGTCTGAAGGTGGATCTGTTGTTGATCATATAAATGAATTCAATATGATCGTAAGTCAA
CTGAGTTCGGTGGAAATTAATTTCGAGGATGAAATTAAAACATTGATTTTGATGTCATCTTTACCCGAGTCGTGGGATACTGTTGTTGCCGCAATCAGCAGTTCG
CGAGGATCTGATAAACTGAAGTTTGATAAAATTCGAGATGTAGTTCTTAGCGAAAGTACTCGCAAAAGAGAAATTGGAAATTCATCTAGTAGTGCTCTCAGTGTT
GACCAACGGGGAAGAAGTAAACCGAAGAGCCCAAACAAAGGGCGATCAAAATCAAAGAACCGAGAAAAATCTCCAAATAGACCAAACGTAACGTGTTGGAATTGT
GGAGAAAAAGTTCACTTTCGGACAGGTTGTACAAGACCAAAGAGGAAGCAGAATCACAAATCTGGAGATGACGATGATTCTATAAATTCATAA
Protein sequenceShow/hide protein sequence
MIRLTLSRNVAFNIIKEKTTSDLMKALSNMYEKPSAMNKVYLMRRLFNLQMSEGGSVVDHINEFNMIVSQLSSVEINFEDEIKTLILMSSLPESWDTVVAAISSS
RGSDKLKFDKIRDVVLSESTRKREIGNSSSSALSVDQRGRSKPKSPNKGRSKSKNREKSPNRPNVTCWNCGEKVHFRTGCTRPKRKQNHKSGDDDDSINS