CuGenDBv2

CSPI07G08520 (gene) of Cucumber (PI 183967) v1 genome

Gene ID: CSPI07G08520
Organism: Cucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
Description: CCHC-type domain-containing protein
Genome location: Chr7:6335188..6336238
RNA-Seq Expression: CSPI07G08520
Synteny: CSPI07G08520
Gene Ontology terms:
GO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domains:
IPR001878 - Zinc finger, CCHC-type
IPR025314 - Domain of unknown function DUF4219
IPR036875 - Zinc finger, CCHC-type superfamily
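
The genome location field above follows a `chromosome:start..end` pattern. A minimal sketch of parsing it is shown here; the function name `parse_location` is hypothetical, and the length calculation assumes 1-based inclusive coordinates, which this record does not state explicitly.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'Chr:start..end' string into (chromosome, start, end)."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    chrom, start, end = match.groups()
    return chrom, int(start), int(end)


chrom, start, end = parse_location("Chr7:6335188..6336238")
# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_length = end - start + 1
print(chrom, start, end, span_length)
```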


Homology
GenBank top hits (e-value, % identity, alignment)
XP_022931810.1 uncharacterized protein LOC111438099 [Cucurbita moschata] (e-value 2.7e-116, 71.67% identity)
Query:  MANL-LANGIVEGQSTSRPPYFDGSNYAYWKARMKIYLQSIDYNLWLIVAKGPYVPMKNIDNVDKPKLEEEYDENEMKKCSFNTKAINCLYCDLSKDEFN
        MANL + NG  EGQSTSRPPYFDG+NY  WKARMKIYLQS+DY LWL V+ GPY+P+K ++N++ PKLE E+DE++MKKCS N  AINCLYC LS DEFN
Subjt:  MANL-LANGIVEGQSTSRPPYFDGSNYAYWKARMKIYLQSIDYNLWLIVAKGPYVPMKNIDNVDKPKLEEEYDENEMKKCSFNTKAINCLYCDLSKDEFN

Query:  RMSMCSSAQEIWNTLEITHEGTNQVKESKISMFVHNYELFKMDANETITDMFTRFTNIINALKGLGKVYTTSENVRKILRSLPKTWEAK-----------
        R+ MCSSA EIW TLE+THEGTNQVKE+KISM VHNYELFKM+ NE I DMFTRFTNI+NALK LGKVY+TSENVRKILRSLPK+WEAK           
Subjt:  RMSMCSSAQEIWNTLEITHEGTNQVKESKISMFVHNYELFKMDANETITDMFTRFTNIINALKGLGKVYTTSENVRKILRSLPKTWEAK-----------

Query:  ----RSLIGSLMTHEIVMKEHLEDESKKRKSIALKTISLEVDPKDEDGLDEDDIAYFSRKYKNFIKRKKYFKKHLSTQKESKGEKSKKDEVICYECKRSG
              L+GSLMTHEI M  H+E+ESKK+KSIALK  S++VD +DED LDEDD+AYF+RKYKNFIKRKK FKKH + QKESKGEKSK DEVICYECK+ G
Subjt:  ----RSLIGSLMTHEIVMKEHLEDESKKRKSIALKTISLEVDPKDEDGLDEDDIAYFSRKYKNFIKRKKYFKKHLSTQKESKGEKSKKDEVICYECKRSG

XP_022971787.1 uncharacterized protein LOC111470465, partial [Cucurbita maxima] (e-value 2.8e-97, 69.73% identity)
Query:  MANL-LANGIVEGQSTSRPPYFDGSNYAYWKARMKIYLQSIDYNLWLIVAKGPYVPMKNIDNVDKPKLEEEYDENEMKKCSFNTKAINCLYCDLSKDEFN
        MANL + NG  EGQSTSRPPYFDG+NY  WKARMKIYLQS+D+ LWL V+ GPY+P+K ++N++ PKLE E+DE++MKKCS N  AINCLYC LS DEFN
Subjt:  MANL-LANGIVEGQSTSRPPYFDGSNYAYWKARMKIYLQSIDYNLWLIVAKGPYVPMKNIDNVDKPKLEEEYDENEMKKCSFNTKAINCLYCDLSKDEFN

Query:  RMSMCSSAQEIWNTLEITHEGTNQVKESKISMFVHNYELFKMDANETITDMFTRFTNIINALKGLGKVYTTSENVRKILRSLPKTWEAK-----------
        R+ MCSSA EIW TLE+THEGTNQVKE+KISM VHNYELFKM+ NE I DMFTRFTNI+NALK LGKVY+TSENVRKILRSLPK+WEAK           
Subjt:  RMSMCSSAQEIWNTLEITHEGTNQVKESKISMFVHNYELFKMDANETITDMFTRFTNIINALKGLGKVYTTSENVRKILRSLPKTWEAK-----------

Query:  ----RSLIGSLMTHEIVMKEHLEDESKKRKSIALKTISLEVDPKDEDGLDEDDIAYFSRKY
              L+GSLMTHEI M  H+E+ESKK+KSIALK  S++VD +DED LDEDD+AYF+RKY
Subjt:  ----RSLIGSLMTHEIVMKEHLEDESKKRKSIALKTISLEVDPKDEDGLDEDDIAYFSRKY

XP_031739764.1 uncharacterized protein LOC116403291 [Cucumis sativus] (e-value 8.1e-113, 88.33% identity)
Query:  MKIYLQSIDYNLWLIVAKGPYVPMKNIDNVDKPKLEEEYDENEMKKCSFNTKAINCLYCDLSKDEFNRMSMCSSAQEIWNTLEITHEGTNQVKESKISMF
        MKIYLQSIDYNLWLIVAKGPYVPMKN+DNVD PKLEEEYDENEMKKCSFN KAINCLYC LSKDEFNR+SMCSSAQEIWNTLEITHEGTNQVKESKISMF
Subjt:  MKIYLQSIDYNLWLIVAKGPYVPMKNIDNVDKPKLEEEYDENEMKKCSFNTKAINCLYCDLSKDEFNRMSMCSSAQEIWNTLEITHEGTNQVKESKISMF

Query:  VHNYELFKMDANETITDMFTRFTNIINALKGLGKVYTTSENVRKILRSLPKTWEAK---------------RSLIGSLMTHEIVMKEHLEDESKKRKSIA
        VHNYELFKMDANETITDMFTRFTNIINALKGLGKVYTTSENVRKILRSLPKTWEAK                 LIGSLMTHEI+MKEHLEDESKK+KSIA
Subjt:  VHNYELFKMDANETITDMFTRFTNIINALKGLGKVYTTSENVRKILRSLPKTWEAK---------------RSLIGSLMTHEIVMKEHLEDESKKRKSIA

Query:  LKTISLEVDPKDEDGLDEDDIAYFSRKYKNFIKRKKYFKK
        LKTISLEVDP+DEDGLDEDDIAYFSRKYKNFIKRKK F++
Subjt:  LKTISLEVDPKDEDGLDEDDIAYFSRKYKNFIKRKKYFKK

XP_031741720.1 uncharacterized protein LOC116403915 [Cucumis sativus] (e-value 1.1e-178, 92.55% identity)
Query:  MANLLANGIVEGQSTSRPPYFDGSNYAYWKARMKIYLQSIDYNLWLIVAKGPYVPMKNIDNVDKPKLEEEYDENEMKKCSFNTKAINCLYCDLSKDEFNR
        MANLLANGIVEGQSTSRPPYFDGSNYAYWKARMKIYLQSIDYNLWLIVAKGPYVPMKN+DNVD PKLEEEYDENEMKKCSFN KAINCLYC LSKDEFNR
Subjt:  MANLLANGIVEGQSTSRPPYFDGSNYAYWKARMKIYLQSIDYNLWLIVAKGPYVPMKNIDNVDKPKLEEEYDENEMKKCSFNTKAINCLYCDLSKDEFNR

Query:  MSMCSSAQEIWNTLEITHEGTNQVKESKISMFVHNYELFKMDANETITDMFTRFTNIINALKGLGKVYTTSENVRKILRSLPKTWEAK------------
        +SMCSSAQEIWNTLEITHEGTNQVKESKISMFVHNYELFKMDANETITDMFTRFTNIINALKGLGKVYTTSENVRKILRSLPKTWEAK            
Subjt:  MSMCSSAQEIWNTLEITHEGTNQVKESKISMFVHNYELFKMDANETITDMFTRFTNIINALKGLGKVYTTSENVRKILRSLPKTWEAK------------

Query:  ---RSLIGSLMTHEIVMKEHLEDESKKRKSIALKTISLEVDPKDEDGLDEDDIAYFSRKYKNFIKRKKYFKKHLSTQKESKGEKSKKDEVICYECKRSGH
             LIGSLMTHEI+MKEHLEDESKK+KSIALKTISLEVDP+DEDGLDEDDIAYFSRKYKNFIKRKKYFKKHLSTQKESKGEKSKKDEVICYECKRSGH
Subjt:  ---RSLIGSLMTHEIVMKEHLEDESKKRKSIALKTISLEVDPKDEDGLDEDDIAYFSRKYKNFIKRKKYFKKHLSTQKESKGEKSKKDEVICYECKRSGH

Query:  IRTDCPLLKSSKKSKKKAMKATWDDSSESESEVEEMANLGLMAHSDKED
        IRTDCPLLKSSKKSKKKAMKATWDDSSESESEVEEMANLGLMAHSDK+D
Subjt:  IRTDCPLLKSSKKSKKKAMKATWDDSSESESEVEEMANLGLMAHSDKED

XP_038895919.1 uncharacterized protein LOC120084093 [Benincasa hispida] (e-value 4.2e-101, 55.71% identity)
Query:  MANLLANGIVEGQSTSRPPYFDGSNYAYWKARMKIYLQSIDYNLWLIVAKGPYVPMKNIDNVDKPKLEEEYDENEMKKCSFNTKAINCLYCDLSKDEFNR
        MA    NG  EGQST+RPP FDG+NYA+WK RM+IYL SIDYNLW IV  GP +P K +DN D PK E++ ++ + KK S N KA+NCL+C L  +EFN+
Subjt:  MANLLANGIVEGQSTSRPPYFDGSNYAYWKARMKIYLQSIDYNLWLIVAKGPYVPMKNIDNVDKPKLEEEYDENEMKKCSFNTKAINCLYCDLSKDEFNR

Query:  MSMCSSAQEIWNTLEITHEGTNQVKESKISMFVHNYELFKMDANETITDMFTRFTNIINALKGLGKVYTTSENVRKILRSLPKTWEAK------------
        +S C+SA+EIW+ L++THEGTNQVKESKISM VHNY+LFKMDANETI++MFTRFTNI+N LKGLGK YTTSENVRKILRSLPK+WEAK            
Subjt:  MSMCSSAQEIWNTLEITHEGTNQVKESKISMFVHNYELFKMDANETITDMFTRFTNIINALKGLGKVYTTSENVRKILRSLPKTWEAK------------

Query:  ---RSLIGSLMTHEIVMKEHLEDESKKRKSIALKTISLEVDPKDEDGLDEDDIAYFSRKYKNFIKRKKYFKKHLSTQKESKGEKSKKDEVICYECKRSGH
             L+GSLM HEI+MK ++E++ KK+K++ LK+  ++ D + E  L++++ AY ++K+K   +++ + KK     +E KGEKS +D +ICYECK+ GH
Subjt:  ---RSLIGSLMTHEIVMKEHLEDESKKRKSIALKTISLEVDPKDEDGLDEDDIAYFSRKYKNFIKRKKYFKKHLSTQKESKGEKSKKDEVICYECKRSGH

Query:  IRTDCPLLKSSKKSKKKAMKATWDDSSESESEVEE-MANLGLMAHSDKED
        +  D P  K+  K  +KAMKAT D+S ESE E EE +ANL +MA  D +D
Subjt:  IRTDCPLLKSSKKSKKKAMKATWDDSSESESEVEE-MANLGLMAHSDKED

TrEMBL top hits (e-value, % identity, alignment)
A0A2N9ECJ4 CCHC-type domain-containing protein (e-value 1.5e-83, 48.29% identity)
Query:  MANLLANGIVEGQSTSRPPYFDGSNYAYWKARMKIYLQSIDYNLWLIVAKGPYVPMKNIDNVDKPKLEEEYDENEMKKCSFNTKAINCLYCDLSKDEFNR
        MAN     ++EGQST RPP F GS+Y YWK RM +Y++  DY++W I+A GP++P K ++     KLE E++E ++K    N KA++ LYC L   E+NR
Subjt:  MANLLANGIVEGQSTSRPPYFDGSNYAYWKARMKIYLQSIDYNLWLIVAKGPYVPMKNIDNVDKPKLEEEYDENEMKKCSFNTKAINCLYCDLSKDEFNR

Query:  MSMCSSAQEIWNTLEITHEGTNQVKESKISMFVHNYELFKMDANETITDMFTRFTNIINALKGLGKVYTTSENVRKILRSLPKTWEAK------------
        +S C SA+EIW+ LE+T+EGTNQVKESK++M VH YELF M  +E I++M TRFTNI+N LK LGK+YT  ENVRKILRSLPK WEAK            
Subjt:  MSMCSSAQEIWNTLEITHEGTNQVKESKISMFVHNYELFKMDANETITDMFTRFTNIINALKGLGKVYTTSENVRKILRSLPKTWEAK------------

Query:  ---RSLIGSLMTHEIVMKEHLEDES-KKRKSIALKTISLEVDPKDEDGLDEDDIAYFSRKYKNFIKRKKYFKKHLSTQKESKGEKSKKDEVICYECKRSG
             L GSLMT+E+ M   +E+E  K +K+ ALK+   + D  +E+  +E++IA  +R +K F+K+KK F +    + E+KGE SK +   CY+CK+ G
Subjt:  ---RSLIGSLMTHEIVMKEHLEDES-KKRKSIALKTISLEVDPKDEDGLDEDDIAYFSRKYKNFIKRKKYFKKHLSTQKESKGEKSKKDEVICYECKRSG

Query:  HIRTDCPLL-KSSKKSKKKAMKATWDDSSESESE----VEEMANLGLMAH
        H + +CP + K + K KKKA+KATWDDS ES+S+      E+ANL L+ +
Subjt:  HIRTDCPLL-KSSKKSKKKAMKATWDDSSESESE----VEEMANLGLMAH

A0A2N9EN48 CCHC-type domain-containing protein (e-value 1.6e-85, 47.89% identity)
Query:  MANLLANGIVEGQSTSRPPYFDGSNYAYWKARMKIYLQSIDYNLWLIVAKGPYVPMKNIDNVDKPKLEEEYDENEMKKCSFNTKAINCLYCDLSKDEFNR
        MAN     ++EGQST RPP F GS+Y YWK RM +Y+++ DY++W ++A GP++P K ++     KLE E++E ++K    N KA++ LYC L  +E+NR
Subjt:  MANLLANGIVEGQSTSRPPYFDGSNYAYWKARMKIYLQSIDYNLWLIVAKGPYVPMKNIDNVDKPKLEEEYDENEMKKCSFNTKAINCLYCDLSKDEFNR

Query:  MSMCSSAQEIWNTLEITHEGTNQVKESKISMFVHNYELFKMDANETITDMFTRFTNIINALKGLGKVYTTSENVRKILRSLPKTWEAKRS----------
        +  C SA+EIW+ LE+T+EGTNQVKESK+SM VH YELF M  +E I++M TRFTNI+N LK LGK+YT  ENVRKILRSLPK WEAK++          
Subjt:  MSMCSSAQEIWNTLEITHEGTNQVKESKISMFVHNYELFKMDANETITDMFTRFTNIINALKGLGKVYTTSENVRKILRSLPKTWEAKRS----------

Query:  -----LIGSLMTHEIVMKEHLEDES-KKRKSIALKTISLEVDPKDEDGLDEDDIAYFSRKYKNFIKRKKYFKKHLSTQKESKGEKSKKDEVICYECKRSG
             L GSLMT+E+ M   +E+E  K +K+ ALK+   + D  +E+  +E++IA  +RK+K F+K+KK F +    + E++GE SK +  ICY+CK+ G
Subjt:  -----LIGSLMTHEIVMKEHLEDES-KKRKSIALKTISLEVDPKDEDGLDEDDIAYFSRKYKNFIKRKKYFKKHLSTQKESKGEKSKKDEVICYECKRSG

Query:  HIRTDCPLL-KSSKKSKKKAMKATWDDSSESESE----VEEMANLGLMAHSDKED
        H + +CP + K + K KKKA+KATWDDS ES+S+      E+ANL L+ + ++ D
Subjt:  HIRTDCPLL-KSSKKSKKKAMKATWDDSSESESE----VEEMANLGLMAHSDKED

A0A2N9I7S8 CCHC-type domain-containing protein (e-value 1.0e-84, 48.13% identity)
Query:  IVEGQSTSRPPYFDGSNYAYWKARMKIYLQSIDYNLWLIVAKGPYVPMKNIDNVDKPKLEEEYDENEMKKCSFNTKAINCLYCDLSKDEFNRMSMCSSAQ
        ++EGQST RPP F GS+Y YWK RM +Y+++ DY++W ++A GP++P K ++     KLE E++E ++K    N KA++ LYC L  +E+NR+  C SA+
Subjt:  IVEGQSTSRPPYFDGSNYAYWKARMKIYLQSIDYNLWLIVAKGPYVPMKNIDNVDKPKLEEEYDENEMKKCSFNTKAINCLYCDLSKDEFNRMSMCSSAQ

Query:  EIWNTLEITHEGTNQVKESKISMFVHNYELFKMDANETITDMFTRFTNIINALKGLGKVYTTSENVRKILRSLPKTWEAKRS---------------LIG
        EIW+ LE+T+EGTNQVKESK+SM VH YELF M  +E I++M TRFTNI+N LK LGK+YT  ENVRKILRSLPK WEAK++               L G
Subjt:  EIWNTLEITHEGTNQVKESKISMFVHNYELFKMDANETITDMFTRFTNIINALKGLGKVYTTSENVRKILRSLPKTWEAKRS---------------LIG

Query:  SLMTHEIVMKEHLEDES-KKRKSIALKTISLEVDPKDEDGLDEDDIAYFSRKYKNFIKRKKYFKKHLSTQKESKGEKSKKDEVICYECKRSGHIRTDCPL
        SLMT+E+ M   +E+E  K +K+ ALK+   + D  +E+  +E++IA  +RK+K F+K+KK F +    + E++GE SK +  ICY+CK+ GH + +CP 
Subjt:  SLMTHEIVMKEHLEDES-KKRKSIALKTISLEVDPKDEDGLDEDDIAYFSRKYKNFIKRKKYFKKHLSTQKESKGEKSKKDEVICYECKRSGHIRTDCPL

Query:  L-KSSKKSKKKAMKATWDDSSESESE----VEEMANLGLMAHSDKED
        + K + K KKKA+KATWDDS ES+S+      E+ANL L+ + ++ D
Subjt:  L-KSSKKSKKKAMKATWDDSSESESE----VEEMANLGLMAHSDKED

A0A6J1F0H1 uncharacterized protein LOC111438099 (e-value 1.3e-116, 71.67% identity)
Query:  MANL-LANGIVEGQSTSRPPYFDGSNYAYWKARMKIYLQSIDYNLWLIVAKGPYVPMKNIDNVDKPKLEEEYDENEMKKCSFNTKAINCLYCDLSKDEFN
        MANL + NG  EGQSTSRPPYFDG+NY  WKARMKIYLQS+DY LWL V+ GPY+P+K ++N++ PKLE E+DE++MKKCS N  AINCLYC LS DEFN
Subjt:  MANL-LANGIVEGQSTSRPPYFDGSNYAYWKARMKIYLQSIDYNLWLIVAKGPYVPMKNIDNVDKPKLEEEYDENEMKKCSFNTKAINCLYCDLSKDEFN

Query:  RMSMCSSAQEIWNTLEITHEGTNQVKESKISMFVHNYELFKMDANETITDMFTRFTNIINALKGLGKVYTTSENVRKILRSLPKTWEAK-----------
        R+ MCSSA EIW TLE+THEGTNQVKE+KISM VHNYELFKM+ NE I DMFTRFTNI+NALK LGKVY+TSENVRKILRSLPK+WEAK           
Subjt:  RMSMCSSAQEIWNTLEITHEGTNQVKESKISMFVHNYELFKMDANETITDMFTRFTNIINALKGLGKVYTTSENVRKILRSLPKTWEAK-----------

Query:  ----RSLIGSLMTHEIVMKEHLEDESKKRKSIALKTISLEVDPKDEDGLDEDDIAYFSRKYKNFIKRKKYFKKHLSTQKESKGEKSKKDEVICYECKRSG
              L+GSLMTHEI M  H+E+ESKK+KSIALK  S++VD +DED LDEDD+AYF+RKYKNFIKRKK FKKH + QKESKGEKSK DEVICYECK+ G
Subjt:  ----RSLIGSLMTHEIVMKEHLEDESKKRKSIALKTISLEVDPKDEDGLDEDDIAYFSRKYKNFIKRKKYFKKHLSTQKESKGEKSKKDEVICYECKRSG

A0A6J1I2X4 uncharacterized protein LOC111470465 (e-value 1.4e-97, 69.73% identity)
Query:  MANL-LANGIVEGQSTSRPPYFDGSNYAYWKARMKIYLQSIDYNLWLIVAKGPYVPMKNIDNVDKPKLEEEYDENEMKKCSFNTKAINCLYCDLSKDEFN
        MANL + NG  EGQSTSRPPYFDG+NY  WKARMKIYLQS+D+ LWL V+ GPY+P+K ++N++ PKLE E+DE++MKKCS N  AINCLYC LS DEFN
Subjt:  MANL-LANGIVEGQSTSRPPYFDGSNYAYWKARMKIYLQSIDYNLWLIVAKGPYVPMKNIDNVDKPKLEEEYDENEMKKCSFNTKAINCLYCDLSKDEFN

Query:  RMSMCSSAQEIWNTLEITHEGTNQVKESKISMFVHNYELFKMDANETITDMFTRFTNIINALKGLGKVYTTSENVRKILRSLPKTWEAK-----------
        R+ MCSSA EIW TLE+THEGTNQVKE+KISM VHNYELFKM+ NE I DMFTRFTNI+NALK LGKVY+TSENVRKILRSLPK+WEAK           
Subjt:  RMSMCSSAQEIWNTLEITHEGTNQVKESKISMFVHNYELFKMDANETITDMFTRFTNIINALKGLGKVYTTSENVRKILRSLPKTWEAK-----------

Query:  ----RSLIGSLMTHEIVMKEHLEDESKKRKSIALKTISLEVDPKDEDGLDEDDIAYFSRKY
              L+GSLMTHEI M  H+E+ESKK+KSIALK  S++VD +DED LDEDD+AYF+RKY
Subjt:  ----RSLIGSLMTHEIVMKEHLEDESKKRKSIALKTISLEVDPKDEDGLDEDDIAYFSRKY

SwissProt top hits (e-value, % identity, alignment)
No hits found
Arabidopsis top hits (e-value, % identity, alignment)
No hits found

Sequences
CDS sequence
ATGGCAAACCTATTGGCAAATGGTATTGTTGAAGGTCAATCTACTTCTAGACCTCCTTATTTTGATGGTTCAAATTATGCATATTGGAAAGCTAGAATGAAAATTTATTT
GCAATCTATTGACTATAATTTGTGGTTAATTGTTGCTAAAGGTCCTTATGTACCCATGAAAAATATTGATAATGTTGATAAGCCTAAATTAGAAGAAGAGTATGATGAAA
ATGAAATGAAAAAGTGTTCTTTTAATACTAAGGCTATTAATTGTTTGTATTGTGACTTGAGTAAAGATGAATTTAATAGAATGTCCATGTGTTCTTCCGCTCAAGAAATT
TGGAATACTCTTGAAATTACTCATGAAGGAACAAATCAAGTTAAAGAGTCTAAAATTAGCATGTTTGTTCATAATTATGAATTGTTTAAGATGGATGCTAATGAGACTAT
CACCGATATGTTTACTAGATTTACTAACATCATAAATGCTTTGAAGGGTCTTGGTAAAGTTTATACAACTTCCGAAAATGTTAGAAAAATTCTAAGGTCTCTACCTAAGA
CTTGGGAAGCTAAGAGGAGCCTTATTGGCTCACTCATGACTCATGAGATCGTCATGAAGGAGCATTTGGAGGATGAGTCCAAAAAGAGAAAGAGCATTGCATTAAAGACT
ATCTCGTTGGAAGTTGATCCCAAAGATGAGGATGGCCTGGATGAAGATGACATTGCTTATTTCTCACGTAAGTACAAAAATTTCATAAAAAGAAAGAAATATTTCAAGAA
ACACCTATCAACCCAAAAAGAGTCAAAAGGTGAGAAAAGCAAAAAGGATGAGGTGATTTGTTATGAATGTAAAAGATCGGGTCATATAAGAACGGATTGTCCTCTCCTTA
AATCATCTAAGAAATCCAAGAAGAAGGCAATGAAGGCTACATGGGATGATAGTAGTGAAAGTGAAAGTGAAGTTGAAGAAATGGCAAATCTTGGTCTCATGGCTCATAGT
GACAAAGAAGATTAA
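
As a rough consistency check, the CDS above can be translated and compared with the protein sequence listed on this page. The sketch below assumes the standard nuclear genetic code (NCBI translation table 1), which the record does not state; `translate` is a hypothetical helper, and only the first 30 and last 15 nucleotides of the CDS are used to keep the example short.

```python
from itertools import product

# Standard genetic code (assumed: NCBI translation table 1), in TCAG codon order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame coding sequence; '*' marks a stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


cds_start = "ATGGCAAACCTATTGGCAAATGGTATTGTT"  # first 30 nt of the CDS above
cds_end = "GACAAAGAAGATTAA"                   # last 15 nt of the CDS above

print(translate(cds_start))  # N-terminal residues
print(translate(cds_end))    # C-terminal residues followed by the stop codon
```

The two fragments translate to the beginning (`MANLLANGIV`) and end (`DKED`, then a stop) of the protein sequence given on this page.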
mRNA sequence
ATGGCAAACCTATTGGCAAATGGTATTGTTGAAGGTCAATCTACTTCTAGACCTCCTTATTTTGATGGTTCAAATTATGCATATTGGAAAGCTAGAATGAAAATTTATTT
GCAATCTATTGACTATAATTTGTGGTTAATTGTTGCTAAAGGTCCTTATGTACCCATGAAAAATATTGATAATGTTGATAAGCCTAAATTAGAAGAAGAGTATGATGAAA
ATGAAATGAAAAAGTGTTCTTTTAATACTAAGGCTATTAATTGTTTGTATTGTGACTTGAGTAAAGATGAATTTAATAGAATGTCCATGTGTTCTTCCGCTCAAGAAATT
TGGAATACTCTTGAAATTACTCATGAAGGAACAAATCAAGTTAAAGAGTCTAAAATTAGCATGTTTGTTCATAATTATGAATTGTTTAAGATGGATGCTAATGAGACTAT
CACCGATATGTTTACTAGATTTACTAACATCATAAATGCTTTGAAGGGTCTTGGTAAAGTTTATACAACTTCCGAAAATGTTAGAAAAATTCTAAGGTCTCTACCTAAGA
CTTGGGAAGCTAAGAGGAGCCTTATTGGCTCACTCATGACTCATGAGATCGTCATGAAGGAGCATTTGGAGGATGAGTCCAAAAAGAGAAAGAGCATTGCATTAAAGACT
ATCTCGTTGGAAGTTGATCCCAAAGATGAGGATGGCCTGGATGAAGATGACATTGCTTATTTCTCACGTAAGTACAAAAATTTCATAAAAAGAAAGAAATATTTCAAGAA
ACACCTATCAACCCAAAAAGAGTCAAAAGGTGAGAAAAGCAAAAAGGATGAGGTGATTTGTTATGAATGTAAAAGATCGGGTCATATAAGAACGGATTGTCCTCTCCTTA
AATCATCTAAGAAATCCAAGAAGAAGGCAATGAAGGCTACATGGGATGATAGTAGTGAAAGTGAAAGTGAAGTTGAAGAAATGGCAAATCTTGGTCTCATGGCTCATAGT
GACAAAGAAGATTAA
Protein sequence
MANLLANGIVEGQSTSRPPYFDGSNYAYWKARMKIYLQSIDYNLWLIVAKGPYVPMKNIDNVDKPKLEEEYDENEMKKCSFNTKAINCLYCDLSKDEFNRMSMCSSAQEI
WNTLEITHEGTNQVKESKISMFVHNYELFKMDANETITDMFTRFTNIINALKGLGKVYTTSENVRKILRSLPKTWEAKRSLIGSLMTHEIVMKEHLEDESKKRKSIALKT
ISLEVDPKDEDGLDEDDIAYFSRKYKNFIKRKKYFKKHLSTQKESKGEKSKKDEVICYECKRSGHIRTDCPLLKSSKKSKKKAMKATWDDSSESESEVEEMANLGLMAHS
DKED
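
The InterPro annotation IPR001878 (Zinc finger, CCHC-type) suggests the protein should contain a zinc-knuckle motif. The sketch below scans the protein sequence for the commonly cited C-X2-C-X4-H-X4-C spacing; the regular expression is an illustrative assumption, not the InterPro signature itself, which is based on profile models rather than a simple pattern.

```python
import re

# Protein sequence copied from this record.
PROTEIN = (
    "MANLLANGIVEGQSTSRPPYFDGSNYAYWKARMKIYLQSIDYNLWLIVAKGPYVPMKNIDNVDKPKLEEEYDENEMKKCSFNTKAINCLYCDLSKDEFNRMSMCSSAQEI"
    "WNTLEITHEGTNQVKESKISMFVHNYELFKMDANETITDMFTRFTNIINALKGLGKVYTTSENVRKILRSLPKTWEAKRSLIGSLMTHEIVMKEHLEDESKKRKSIALKT"
    "ISLEVDPKDEDGLDEDDIAYFSRKYKNFIKRKKYFKKHLSTQKESKGEKSKKDEVICYECKRSGHIRTDCPLLKSSKKSKKKAMKATWDDSSESESEVEEMANLGLMAHS"
    "DKED"
)

# Assumed simplified CCHC zinc-knuckle pattern: C-X2-C-X4-H-X4-C.
CCHC_PATTERN = re.compile(r"C.{2}C.{4}H.{4}C")

matches = [(m.start() + 1, m.group()) for m in CCHC_PATTERN.finditer(PROTEIN)]
for position, motif in matches:
    # Positions are reported 1-based, as is conventional for protein residues.
    print(position, motif)
```

On this sequence the pattern finds a single candidate, `CYECKRSGHIRTDC`, near the C-terminus, in the region that also aligns across the homologs listed above.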