; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CaUC06G118080 (gene) of Watermelon (USVL246-FR2) v1 genome

Gene IDCaUC06G118080
OrganismCitrullus amarus (Watermelon (USVL246-FR2) v1)
DescriptionTransposon Ty3-I Gag-Pol polyprotein
Genome locationCiama_Chr06:22749763..22751733
RNA-Seq ExpressionCaUC06G118080
SyntenyCaUC06G118080
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR001878 - Zinc finger, CCHC-type
IPR036875 - Zinc finger, CCHC-type superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0032636.1 Transposon Ty3-I Gag-Pol polyprotein [Cucumis melo var. makuwa]6.1e-1761.84Show/hide
Query:  SALEWNKKDKLEGEETSEPKGKFVVAKRAKAKSSNAKKNEALKEVREKSSSIECWKCKVFGHMSKDCVNKRVMVIR
        S+  WNK   +   E+ + +GKFV A++ + +SSN KK EALKEVREK+SSI+CWKCK FGHMSKDC+NKRVMV+R
Subjt:  SALEWNKKDKLEGEETSEPKGKFVVAKRAKAKSSNAKKNEALKEVREKSSSIECWKCKVFGHMSKDCVNKRVMVIR

TYK06180.1 Transposon Ty3-I Gag-Pol polyprotein [Cucumis melo var. makuwa]1.8e-1678.33Show/hide
Query:  SEPKGKFVVAKRAKAKSSNAKKNEALKEVREKSSSIECWKCKVFGHMSKDCVNKRVMVIR
        SE KGKFVVAKR +A+SSN+KKNEA KE +EKS+S++CWKCK FGHMSKDCVNK+VMVIR
Subjt:  SEPKGKFVVAKRAKAKSSNAKKNEALKEVREKSSSIECWKCKVFGHMSKDCVNKRVMVIR

XP_022153198.1 uncharacterized protein LOC111020753 [Momordica charantia]9.5e-1869.44Show/hide
Query:  WNKKDKLEGEETSEPKGKFVVAKRAKAKSSNAKKNEALKEVREKSSSIECWKCKVFGHMSKDCVNKRVMVIR
        WNK   +   E+ +PK KFV AKR + +SSN K+NEA KEVREK+SSI+CWKCK FGHMSKDCVNKRVMVIR
Subjt:  WNKKDKLEGEETSEPKGKFVVAKRAKAKSSNAKKNEALKEVREKSSSIECWKCKVFGHMSKDCVNKRVMVIR

XP_022158838.1 uncharacterized protein LOC111025303 [Momordica charantia]7.3e-1869.44Show/hide
Query:  WNKKDKLEGEETSEPKGKFVVAKRAKAKSSNAKKNEALKEVREKSSSIECWKCKVFGHMSKDCVNKRVMVIR
        W K   +   E+ +PKGKFV AKR + +SSN KKNEA KEVRE +SSI+CWKCK FGHMSKDCVNKRVMVIR
Subjt:  WNKKDKLEGEETSEPKGKFVVAKRAKAKSSNAKKNEALKEVREKSSSIECWKCKVFGHMSKDCVNKRVMVIR

XP_022932136.1 LOW QUALITY PROTEIN: uncharacterized protein LOC111438459, partial [Cucurbita moschata]3.6e-1768.06Show/hide
Query:  WNKKDKLEGEETSEPKGKFVVAKRAKAKSSNAKKNEALKEVREKSSSIECWKCKVFGHMSKDCVNKRVMVIR
        WNK   +   E+  PK KFV AKR +A+SSN KKNEA K V+EKSSSI+CWKCK FGHMSK+CVNK+VMVIR
Subjt:  WNKKDKLEGEETSEPKGKFVVAKRAKAKSSNAKKNEALKEVREKSSSIECWKCKVFGHMSKDCVNKRVMVIR

TrEMBL top hitse value%identityAlignment
A0A5A7SRG3 Transposon Ty3-I Gag-Pol polyprotein3.0e-1761.84Show/hide
Query:  SALEWNKKDKLEGEETSEPKGKFVVAKRAKAKSSNAKKNEALKEVREKSSSIECWKCKVFGHMSKDCVNKRVMVIR
        S+  WNK   +   E+ + +GKFV A++ + +SSN KK EALKEVREK+SSI+CWKCK FGHMSKDC+NKRVMV+R
Subjt:  SALEWNKKDKLEGEETSEPKGKFVVAKRAKAKSSNAKKNEALKEVREKSSSIECWKCKVFGHMSKDCVNKRVMVIR

A0A5D3C4C3 Transposon Ty3-I Gag-Pol polyprotein8.7e-1778.33Show/hide
Query:  SEPKGKFVVAKRAKAKSSNAKKNEALKEVREKSSSIECWKCKVFGHMSKDCVNKRVMVIR
        SE KGKFVVAKR +A+SSN+KKNEA KE +EKS+S++CWKCK FGHMSKDCVNK+VMVIR
Subjt:  SEPKGKFVVAKRAKAKSSNAKKNEALKEVREKSSSIECWKCKVFGHMSKDCVNKRVMVIR

A0A6J1DGU9 uncharacterized protein LOC1110207534.6e-1869.44Show/hide
Query:  WNKKDKLEGEETSEPKGKFVVAKRAKAKSSNAKKNEALKEVREKSSSIECWKCKVFGHMSKDCVNKRVMVIR
        WNK   +   E+ +PK KFV AKR + +SSN K+NEA KEVREK+SSI+CWKCK FGHMSKDCVNKRVMVIR
Subjt:  WNKKDKLEGEETSEPKGKFVVAKRAKAKSSNAKKNEALKEVREKSSSIECWKCKVFGHMSKDCVNKRVMVIR

A0A6J1DX75 uncharacterized protein LOC1110253033.5e-1869.44Show/hide
Query:  WNKKDKLEGEETSEPKGKFVVAKRAKAKSSNAKKNEALKEVREKSSSIECWKCKVFGHMSKDCVNKRVMVIR
        W K   +   E+ +PKGKFV AKR + +SSN KKNEA KEVRE +SSI+CWKCK FGHMSKDCVNKRVMVIR
Subjt:  WNKKDKLEGEETSEPKGKFVVAKRAKAKSSNAKKNEALKEVREKSSSIECWKCKVFGHMSKDCVNKRVMVIR

A0A6J1EVI6 LOW QUALITY PROTEIN: uncharacterized protein LOC1114384591.7e-1768.06Show/hide
Query:  WNKKDKLEGEETSEPKGKFVVAKRAKAKSSNAKKNEALKEVREKSSSIECWKCKVFGHMSKDCVNKRVMVIR
        WNK   +   E+  PK KFV AKR +A+SSN KKNEA K V+EKSSSI+CWKCK FGHMSK+CVNK+VMVIR
Subjt:  WNKKDKLEGEETSEPKGKFVVAKRAKAKSSNAKKNEALKEVREKSSSIECWKCKVFGHMSKDCVNKRVMVIR

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTAGATCCTAGAAAGACCTCTAGCTTTGCATCTAACGAACATTGCGGGCACTTATCATTTGGTATTAGAGCATTTGGTCATCATCCTTGGTCAGCTCTAGAG
TGGAACAAGAAAGACAAGTTGGAGGGAGAGGAAACGAGTGAACCAAAAGGGAAGTTTGTGGTTGCCAAAAGAGCGAAGGCCAAGAGCTCCAATGCTAAAAAGAAT
GAAGCTCTAAAGGAGGTTAGAGAGAAGTCTAGTTCTATTGAATGTTGGAAGTGCAAGGTGTTTGGGCACATGAGCAAAGATTGTGTCAATAAAAGGGTCATGGTG
ATAAGAAAATGGGATACTTGA
mRNA sequenceShow/hide mRNA sequence
ATGTTAGATCCTAGAAAGACCTCTAGCTTTGCATCTAACGAACATTGCGGGCACTTATCATTTGGTATTAGAGCATTTGGTCATCATCCTTGGTCAGCTCTAGAG
TGGAACAAGAAAGACAAGTTGGAGGGAGAGGAAACGAGTGAACCAAAAGGGAAGTTTGTGGTTGCCAAAAGAGCGAAGGCCAAGAGCTCCAATGCTAAAAAGAAT
GAAGCTCTAAAGGAGGTTAGAGAGAAGTCTAGTTCTATTGAATGTTGGAAGTGCAAGGTGTTTGGGCACATGAGCAAAGATTGTGTCAATAAAAGGGTCATGGTG
ATAAGAAAATGGGATACTTGA
Protein sequenceShow/hide protein sequence
MLDPRKTSSFASNEHCGHLSFGIRAFGHHPWSALEWNKKDKLEGEETSEPKGKFVVAKRAKAKSSNAKKNEALKEVREKSSSIECWKCKVFGHMSKDCVNKRVMV
IRKWDT