; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CcUC11G210890 (gene) of Watermelon (PI 537277) v1 genome

Gene IDCcUC11G210890
OrganismCitrullus colocynthis (Watermelon (PI 537277) v1)
DescriptionNucleic acid-binding, OB-fold containing protein
Genome locationCicolChr11:2775521..2781051
RNA-Seq ExpressionCcUC11G210890
SyntenyCcUC11G210890
Gene Ontology termsGO:0006352 - DNA-templated transcription, initiation (biological process)
GO:0005736 - RNA polymerase I complex (cellular component)
GO:0003899 - DNA-directed 5'-3' RNA polymerase activity (molecular function)
InterPro domainsIPR036898 - RNA polymerase Rpb7-like, N-terminal domain superfamily
IPR045113 - RNA polymerase Rpb7-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008466499.1 PREDICTED: uncharacterized protein LOC103503892 isoform X1 [Cucumis melo]1.4e-10187.5Show/hide
Query:  MEGLKVSEANMVVYVHPSKSKKVSQAVLRELGAMLLKFDEKFEGVLLAYEAKIIDRNAKILSGVHPYFGVTIKAKLLLFSPKPNMLVEGKVVKLRQESIH
        MEGLKVS+ANMVVYVHPSKSKKVSQAVLRELGAMLLKFDEKFEGVLLAYEAKIID+NAKILSGVHPYFGVTIKAKLLLFSPKPNML+EGKVVKLRQESIH
Subjt:  MEGLKVSEANMVVYVHPSKSKKVSQAVLRELGAMLLKFDEKFEGVLLAYEAKIIDRNAKILSGVHPYFGVTIKAKLLLFSPKPNMLVEGKVVKLRQESIH

Query:  IIVLGFASAVITDEVIRDEFKHRTKHGEEMFVSRAHKHHVIKVGTMIRFLVKSFDEEILHISGSLVPSHTGSIHWLEKNLVEGSVTNSNKRKMRENEGDV
        +IVLGFASAVIT E IRDEFKHRTKHGEEMFVSRAHKHHVIKVGTMIRFLVKSFDEEILHISGSLVPSHTGSIHWLEKN VEGSV+ S+K+KMRENEG V
Subjt:  IIVLGFASAVITDEVIRDEFKHRTKHGEEMFVSRAHKHHVIKVGTMIRFLVKSFDEEILHISGSLVPSHTGSIHWLEKNLVEGSVTNSNKRKMRENEGDV

Query:  ----SVAPDANTLISNNDGRSKTRKPKTTRIS
            +V  D N +I  ND + KT+K KTTRIS
Subjt:  ----SVAPDANTLISNNDGRSKTRKPKTTRIS

XP_011652444.1 uncharacterized protein LOC101216589 [Cucumis sativus]1.0e-10187.5Show/hide
Query:  MEGLKVSEANMVVYVHPSKSKKVSQAVLRELGAMLLKFDEKFEGVLLAYEAKIIDRNAKILSGVHPYFGVTIKAKLLLFSPKPNMLVEGKVVKLRQESIH
        MEGLKVS+ANMVVYVHPSKSKK+SQAVLRELGAMLLKFDEKFEGVLLAYEAKIID+NAKILSGVHPYFGVTIKAKLLLFSPKPNML+EGKVVKLRQESIH
Subjt:  MEGLKVSEANMVVYVHPSKSKKVSQAVLRELGAMLLKFDEKFEGVLLAYEAKIIDRNAKILSGVHPYFGVTIKAKLLLFSPKPNMLVEGKVVKLRQESIH

Query:  IIVLGFASAVITDEVIRDEFKHRTKHGEEMFVSRAHKHHVIKVGTMIRFLVKSFDEEILHISGSLVPSHTGSIHWLEKNLVEGSVTNSNKRKMRENEGDV
        +IVLGFASAVIT E IRDEFKHRTKHGEEMFVSRAHKHH+IKVGTMIR LVKSFDEEILHISGSLVPSHTGSIHWLEKN VEGSVT S KRKMRENEG V
Subjt:  IIVLGFASAVITDEVIRDEFKHRTKHGEEMFVSRAHKHHVIKVGTMIRFLVKSFDEEILHISGSLVPSHTGSIHWLEKNLVEGSVTNSNKRKMRENEGDV

Query:  ----SVAPDANTLISNNDGRSKTRKPKTTRIS
            SV  D N +I NND + KT+K K TRIS
Subjt:  ----SVAPDANTLISNNDGRSKTRKPKTTRIS

XP_038889707.1 uncharacterized protein LOC120079556 isoform X1 [Benincasa hispida]1.6e-10588.6Show/hide
Query:  MEGLKVSEANMVVYVHPSKSKKVSQAVLRELGAMLLKFDEKFEGVLLAYEAKIIDRNAKILSGVHPYFGVTIKAKLLLFSPKPNMLVEGKVVKLRQESIH
        MEGLKVS+AN+VVYVHPSKSKKVSQAVLRELG MLLKFDEKFEGVLLAYEA IID+NAKILSGVHPYFGVTIKAKLLLFSPKPNML+EGKVVKLRQESIH
Subjt:  MEGLKVSEANMVVYVHPSKSKKVSQAVLRELGAMLLKFDEKFEGVLLAYEAKIIDRNAKILSGVHPYFGVTIKAKLLLFSPKPNMLVEGKVVKLRQESIH

Query:  IIVLGFASAVITDEVIRDEFKHRTKHGEEMFVSRAHKHHVIKVGTMIRFLVKSFDEEILHISGSLVPSHTGSIHWLEKNLVEGSVTNSNKRKMRENEGDV
        +IVLGF+S VITDE IRDEFKHRTKHGEEMFVSR HKHHVIKVGTMIRFLVKSFDEEILHISGSLVPSHTGSIHWLEKN VEGSVTNS+K+K RENEG  
Subjt:  IIVLGFASAVITDEVIRDEFKHRTKHGEEMFVSRAHKHHVIKVGTMIRFLVKSFDEEILHISGSLVPSHTGSIHWLEKNLVEGSVTNSNKRKMRENEGDV

Query:  SVAPDANTLISNNDGRSKTRKPKTTRIS
        SV+ DAN LI NND +SKT++ KTTRIS
Subjt:  SVAPDANTLISNNDGRSKTRKPKTTRIS

XP_038889709.1 uncharacterized protein LOC120079556 isoform X2 [Benincasa hispida]5.0e-10488.6Show/hide
Query:  MEGLKVSEANMVVYVHPSKSKKVSQAVLRELGAMLLKFDEKFEGVLLAYEAKIIDRNAKILSGVHPYFGVTIKAKLLLFSPKPNMLVEGKVVKLRQESIH
        MEGLKVS+AN+VVYVHPSKSKKVSQAVLRELG MLLKFDEKFEGVLLAYEA IID+NAKILSGVHPYFGVTIKAKLLLFSPKPNML+EGKVVKLRQESIH
Subjt:  MEGLKVSEANMVVYVHPSKSKKVSQAVLRELGAMLLKFDEKFEGVLLAYEAKIIDRNAKILSGVHPYFGVTIKAKLLLFSPKPNMLVEGKVVKLRQESIH

Query:  IIVLGFASAVITDEVIRDEFKHRTKHGEEMFVSRAHKHHVIKVGTMIRFLVKSFDEEILHISGSLVPSHTGSIHWLEKNLVEGSVTNSNKRKMRENEGDV
        +IVLGF+S VITDE IRDEFKHRTKHGEEMFVSR HKHHVIKVGTMIRFLVKSFDEEILHISGSLVPSHTGSIHWLEKN VEGSVTNS K+K RENEG  
Subjt:  IIVLGFASAVITDEVIRDEFKHRTKHGEEMFVSRAHKHHVIKVGTMIRFLVKSFDEEILHISGSLVPSHTGSIHWLEKNLVEGSVTNSNKRKMRENEGDV

Query:  SVAPDANTLISNNDGRSKTRKPKTTRIS
        SV+ DAN LI NND +SKT++ KTTRIS
Subjt:  SVAPDANTLISNNDGRSKTRKPKTTRIS

XP_038898978.1 uncharacterized protein LOC120086413 isoform X3 [Benincasa hispida]5.0e-10487.93Show/hide
Query:  MEGLKVSEANMVVYVHPSKSKKVSQAVLRELGAMLLKFDEKFEGVLLAYEAKIIDRNAKILSGVHPYFGVTIKAKLLLFSPKPNMLVEGKVVKLRQESIH
        M+GLKVS+ANMVVYVHPSKSKKVSQAVLR LGAMLLKFDEKFEGVLLAYEAKIID+ AKILSGVHPYFGVTIKAKLLLFSPKPNMLVEGKVVKLRQESIH
Subjt:  MEGLKVSEANMVVYVHPSKSKKVSQAVLRELGAMLLKFDEKFEGVLLAYEAKIIDRNAKILSGVHPYFGVTIKAKLLLFSPKPNMLVEGKVVKLRQESIH

Query:  IIVLGFASAVITDEVIRDEFKHRTKHGEEMFVSRAHKHHVIKVGTMIRFLVKSFDEEILHISGSLVPSHTGSIHWLEKNLVEGSVTNSNKRKMRENEGDV
        +IVLGFASAVIT+E IRDEFKHRTKHGEEMFVSRAHKHHVIKVGTMIRFLVKSFDEEILHI+GSLVPSHTGSIHWLEKN VEG+VT+S+K+KMRENEG+V
Subjt:  IIVLGFASAVITDEVIRDEFKHRTKHGEEMFVSRAHKHHVIKVGTMIRFLVKSFDEEILHISGSLVPSHTGSIHWLEKNLVEGSVTNSNKRKMRENEGDV

Query:  ----SVAPDANTLISNNDGRSKTRKPKTTRIS
            SVA + N LI NND +SKT+K KTTRIS
Subjt:  ----SVAPDANTLISNNDGRSKTRKPKTTRIS

TrEMBL top hitse value%identityAlignment
A0A0A0LGG8 Uncharacterized protein5.1e-10287.5Show/hide
Query:  MEGLKVSEANMVVYVHPSKSKKVSQAVLRELGAMLLKFDEKFEGVLLAYEAKIIDRNAKILSGVHPYFGVTIKAKLLLFSPKPNMLVEGKVVKLRQESIH
        MEGLKVS+ANMVVYVHPSKSKK+SQAVLRELGAMLLKFDEKFEGVLLAYEAKIID+NAKILSGVHPYFGVTIKAKLLLFSPKPNML+EGKVVKLRQESIH
Subjt:  MEGLKVSEANMVVYVHPSKSKKVSQAVLRELGAMLLKFDEKFEGVLLAYEAKIIDRNAKILSGVHPYFGVTIKAKLLLFSPKPNMLVEGKVVKLRQESIH

Query:  IIVLGFASAVITDEVIRDEFKHRTKHGEEMFVSRAHKHHVIKVGTMIRFLVKSFDEEILHISGSLVPSHTGSIHWLEKNLVEGSVTNSNKRKMRENEGDV
        +IVLGFASAVIT E IRDEFKHRTKHGEEMFVSRAHKHH+IKVGTMIR LVKSFDEEILHISGSLVPSHTGSIHWLEKN VEGSVT S KRKMRENEG V
Subjt:  IIVLGFASAVITDEVIRDEFKHRTKHGEEMFVSRAHKHHVIKVGTMIRFLVKSFDEEILHISGSLVPSHTGSIHWLEKNLVEGSVTNSNKRKMRENEGDV

Query:  ----SVAPDANTLISNNDGRSKTRKPKTTRIS
            SV  D N +I NND + KT+K K TRIS
Subjt:  ----SVAPDANTLISNNDGRSKTRKPKTTRIS

A0A1S3CRF8 uncharacterized protein LOC103503892 isoform X21.6e-10087.5Show/hide
Query:  MEGLKVSEANMVVYVHPSKSKKVSQAVLRELGAMLLKFDEKFEGVLLAYEAKIIDRNAKILSGVHPYFGVTIKAKLLLFSPKPNMLVEGKVVKLRQESIH
        MEGLKVS+ANMVVYVHPSKSKKVSQAVLRELGAMLLKFDEKFEGVLLAYEAKIID+NAKILSGVHPYFGVTIKAKLLLFSPKPNML+EGKVVKLRQESIH
Subjt:  MEGLKVSEANMVVYVHPSKSKKVSQAVLRELGAMLLKFDEKFEGVLLAYEAKIIDRNAKILSGVHPYFGVTIKAKLLLFSPKPNMLVEGKVVKLRQESIH

Query:  IIVLGFASAVITDEVIRDEFKHRTKHGEEMFVSRAHKHHVIKVGTMIRFLVKSFDEEILHISGSLVPSHTGSIHWLEKNLVEGSVTNSNKRKMRENEGDV
        +IVLGFASAVIT E IRDEFKHRTKHGEEMFVSRAHKHHVIKVGTMIRFLVKSFDEEILHISGSLVPSHTGSIHWLEKN VEGSV+ S K+KMRENEG V
Subjt:  IIVLGFASAVITDEVIRDEFKHRTKHGEEMFVSRAHKHHVIKVGTMIRFLVKSFDEEILHISGSLVPSHTGSIHWLEKNLVEGSVTNSNKRKMRENEGDV

Query:  ----SVAPDANTLISNNDGRSKTRKPKTTRIS
            +V  D N +I  ND + KT+K KTTRIS
Subjt:  ----SVAPDANTLISNNDGRSKTRKPKTTRIS

A0A1S4E5I5 uncharacterized protein LOC103503892 isoform X16.6e-10287.5Show/hide
Query:  MEGLKVSEANMVVYVHPSKSKKVSQAVLRELGAMLLKFDEKFEGVLLAYEAKIIDRNAKILSGVHPYFGVTIKAKLLLFSPKPNMLVEGKVVKLRQESIH
        MEGLKVS+ANMVVYVHPSKSKKVSQAVLRELGAMLLKFDEKFEGVLLAYEAKIID+NAKILSGVHPYFGVTIKAKLLLFSPKPNML+EGKVVKLRQESIH
Subjt:  MEGLKVSEANMVVYVHPSKSKKVSQAVLRELGAMLLKFDEKFEGVLLAYEAKIIDRNAKILSGVHPYFGVTIKAKLLLFSPKPNMLVEGKVVKLRQESIH

Query:  IIVLGFASAVITDEVIRDEFKHRTKHGEEMFVSRAHKHHVIKVGTMIRFLVKSFDEEILHISGSLVPSHTGSIHWLEKNLVEGSVTNSNKRKMRENEGDV
        +IVLGFASAVIT E IRDEFKHRTKHGEEMFVSRAHKHHVIKVGTMIRFLVKSFDEEILHISGSLVPSHTGSIHWLEKN VEGSV+ S+K+KMRENEG V
Subjt:  IIVLGFASAVITDEVIRDEFKHRTKHGEEMFVSRAHKHHVIKVGTMIRFLVKSFDEEILHISGSLVPSHTGSIHWLEKNLVEGSVTNSNKRKMRENEGDV

Query:  ----SVAPDANTLISNNDGRSKTRKPKTTRIS
            +V  D N +I  ND + KT+K KTTRIS
Subjt:  ----SVAPDANTLISNNDGRSKTRKPKTTRIS

A0A5D3E6A0 Putative DNA-directed RNA polymerase I subunit RPA436.6e-10287.5Show/hide
Query:  MEGLKVSEANMVVYVHPSKSKKVSQAVLRELGAMLLKFDEKFEGVLLAYEAKIIDRNAKILSGVHPYFGVTIKAKLLLFSPKPNMLVEGKVVKLRQESIH
        MEGLKVS+ANMVVYVHPSKSKKVSQAVLRELGAMLLKFDEKFEGVLLAYEAKIID+NAKILSGVHPYFGVTIKAKLLLFSPKPNML+EGKVVKLRQESIH
Subjt:  MEGLKVSEANMVVYVHPSKSKKVSQAVLRELGAMLLKFDEKFEGVLLAYEAKIIDRNAKILSGVHPYFGVTIKAKLLLFSPKPNMLVEGKVVKLRQESIH

Query:  IIVLGFASAVITDEVIRDEFKHRTKHGEEMFVSRAHKHHVIKVGTMIRFLVKSFDEEILHISGSLVPSHTGSIHWLEKNLVEGSVTNSNKRKMRENEGDV
        +IVLGFASAVIT E IRDEFKHRTKHGEEMFVSRAHKHHVIKVGTMIRFLVKSFDEEILHISGSLVPSHTGSIHWLEKN VEGSV+ S+K+KMRENEG V
Subjt:  IIVLGFASAVITDEVIRDEFKHRTKHGEEMFVSRAHKHHVIKVGTMIRFLVKSFDEEILHISGSLVPSHTGSIHWLEKNLVEGSVTNSNKRKMRENEGDV

Query:  ----SVAPDANTLISNNDGRSKTRKPKTTRIS
            +V  D N +I  ND + KT+K KTTRIS
Subjt:  ----SVAPDANTLISNNDGRSKTRKPKTTRIS

A0A6J1D5Y9 uncharacterized protein LOC1110167571.3e-10083.62Show/hide
Query:  MEGLKVSEANMVVYVHPSKSKKVSQAVLRELGAMLLKFDEKFEGVLLAYEAKIIDRNAKILSGVHPYFGVTIKAKLLLFSPKPNMLVEGKVVKLRQESIH
        MEGLKVS+AN+VVYVHPSKSKKVSQAVLRELGAMLLKFDE+FEGVLLAY+AKI D++AKILSGVHPYFGVT+KAKLLLFSPKPNML+EGKVVKLRQES+H
Subjt:  MEGLKVSEANMVVYVHPSKSKKVSQAVLRELGAMLLKFDEKFEGVLLAYEAKIIDRNAKILSGVHPYFGVTIKAKLLLFSPKPNMLVEGKVVKLRQESIH

Query:  IIVLGFASAVITDEVIRDEFKHRTKHGEEMFVSRAHKHHVIKVGTMIRFLVKSFDEEILHISGSLVPSHTGSIHWLEKNLVEGSVTNSNKRKMRENEGDV
        +IVLGFASA ITDE IRDEFKHRTKH EEMFVSRAHKHHVIKVGTMIRFLVKSFDEEILHISGSLVPSHTGSIHWLEKN +EGSVT+  ++K R+NEG+ 
Subjt:  IIVLGFASAVITDEVIRDEFKHRTKHGEEMFVSRAHKHHVIKVGTMIRFLVKSFDEEILHISGSLVPSHTGSIHWLEKNLVEGSVTNSNKRKMRENEGDV

Query:  ----SVAPDANTLISNNDGRSKTRKPKTTRIS
            SVA D N LI NND +SKT+K KT+RIS
Subjt:  ----SVAPDANTLISNNDGRSKTRKPKTTRIS

SwissProt top hitse value%identityAlignment
O43036 DNA-directed RNA polymerase I subunit rpa432.1e-0422.09Show/hide
Query:  NMVVYVHPSKSKKVSQAVLRELGAMLLKFDEKFEGVLLAYE-AKIIDRNAKILSGVHPYFGVTIKAKLLLFSPKPNMLVEGKVVKLRQESIHIIVLGFAS
        ++ + + P  S+    A+   + +M+L    +  G++LAY+  + ++++AK++    P+  + ++  +L+FSPK    +EGK+  +    I +++LG  +
Subjt:  NMVVYVHPSKSKKVSQAVLRELGAMLLKFDEKFEGVLLAYE-AKIIDRNAKILSGVHPYFGVTIKAKLLLFSPKPNMLVEGKVVKLRQESIHIIVLGFAS

Query:  AVITDEVI-RDEFKHRTKHGEEMFVSRAHKHHVIKVGTMIRFLVKSFDEE--ILHISGSLVPS
        A I  + I +D         EE    + +  ++++ G  + F+V     E  +  + G+L  S
Subjt:  AVITDEVI-RDEFKHRTKHGEEMFVSRAHKHHVIKVGTMIRFLVKSFDEE--ILHISGSLVPS

Arabidopsis top hitse value%identityAlignment
AT1G75670.1 DNA-directed RNA polymerases4.8e-6054.12Show/hide
Query:  MEGLKVSEANMVVYVHPSKSKKVSQAVLRELGAMLLKFDEKFEGVLLAYEAKIIDRNAKILSGVHPYFGVTIKAKLLLFSPKPNMLVEGKVVKLRQESIH
        MEGLK+SEA +++++HPS+S+ V Q + REL ++L +++E F+GVLLAY+A +  + AKIL+G+HPYFGV +  +LLLF PKP   VEGK+VK+  ESIH
Subjt:  MEGLKVSEANMVVYVHPSKSKKVSQAVLRELGAMLLKFDEKFEGVLLAYEAKIIDRNAKILSGVHPYFGVTIKAKLLLFSPKPNMLVEGKVVKLRQESIH

Query:  IIVLGFASAVITDEVIRDEFKHRTKHGEEMFVSRAHKHHVIKVGTMIRFLVKSFDEEILHISGSLVPSHTGSIHWLEKNLVEGSVTNSNKRKMR
        +IVLGF++AVITD  IR+EFK+R + GE  FVSR+HK H +K+GTM+R  V+SFDEE++HI+GSL+P +TG + WLEK   E   T+ + ++ +
Subjt:  IIVLGFASAVITDEVIRDEFKHRTKHGEEMFVSRAHKHHVIKVGTMIRFLVKSFDEEILHISGSLVPSHTGSIHWLEKNLVEGSVTNSNKRKMR

AT1G75670.2 DNA-directed RNA polymerases4.8e-6054.12Show/hide
Query:  MEGLKVSEANMVVYVHPSKSKKVSQAVLRELGAMLLKFDEKFEGVLLAYEAKIIDRNAKILSGVHPYFGVTIKAKLLLFSPKPNMLVEGKVVKLRQESIH
        MEGLK+SEA +++++HPS+S+ V Q + REL ++L +++E F+GVLLAY+A +  + AKIL+G+HPYFGV +  +LLLF PKP   VEGK+VK+  ESIH
Subjt:  MEGLKVSEANMVVYVHPSKSKKVSQAVLRELGAMLLKFDEKFEGVLLAYEAKIIDRNAKILSGVHPYFGVTIKAKLLLFSPKPNMLVEGKVVKLRQESIH

Query:  IIVLGFASAVITDEVIRDEFKHRTKHGEEMFVSRAHKHHVIKVGTMIRFLVKSFDEEILHISGSLVPSHTGSIHWLEKNLVEGSVTNSNKRKMR
        +IVLGF++AVITD  IR+EFK+R + GE  FVSR+HK H +K+GTM+R  V+SFDEE++HI+GSL+P +TG + WLEK   E   T+ + ++ +
Subjt:  IIVLGFASAVITDEVIRDEFKHRTKHGEEMFVSRAHKHHVIKVGTMIRFLVKSFDEEILHISGSLVPSHTGSIHWLEKNLVEGSVTNSNKRKMR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAGTTTAAGATTGATGAACTAAGATATGCTTGTAGCAGCCACATCCGTCCGCCTACAGCCGGCCGCTGCACCACCACAGTTCGAGTCGACGTCTGCCGCTCGCCGTG
CTGGGTGCCTACGCCGACTGCTCACGCCGTTGCTGATTCTCCTCCTCCACCACCCTCGTCGGAGTTCATCGGGTCCGGTGAGTGTGCAGTGGAATCAAGCTGTGATCGTT
GGAACTTTGGAGGTCTGGATCAAGTGACTTTTGGGGTAGGAGTTTTGGACGTTCCAAGTGAATTGAAGTGTGGGTTATGCAATATAAAGAGAGTTAAGCCCGCTAGTCTT
GGTTTCTATTTCAGTGTTGAGAATCCTGATGATCTGGATGATGCGGGTGTCTCAAAACCTAAGCTTATGTTTTGCTTCTTGTGTGAGTGCCATACACATTTGGACGTCGG
TGTGACACAAATGTGTATACCAACATCTGCAACACACGTGGTAATAAAAGTTTGCTGTTTTTACCTTCTTCGAGCTCTCTTTCTTCTCTGGTTGTTGTGGATTCTTCTCC
TCCTTGTTTCGGTTGCCATTTCAATGGAGGGGCTCAAGGTTTCAGAGGCTAATATGGTTGTTTACGTTCACCCATCTAAAAGTAAGAAGGTTTCGCAAGCGGTGCTTCGA
GAGCTTGGTGCTATGCTTCTAAAATTTGATGAAAAATTTGAAGGCGTACTTCTGGCTTATGAGGCCAAAATTATCGATAGAAATGCGAAGATTCTATCTGGAGTGCATCC
CTATTTCGGTGTGACAATAAAGGCAAAGCTATTACTTTTCTCTCCAAAGCCAAACATGCTTGTAGAGGGAAAGGTGGTGAAGCTTAGGCAAGAATCAATCCATATTATTG
TCTTAGGTTTTGCTTCTGCTGTAATAACCGATGAAGTCATTCGCGACGAATTCAAGCATAGAACAAAACATGGAGAAGAAATGTTTGTCAGCAGAGCTCACAAGCACCAT
GTAATAAAGGTTGGGACAATGATACGATTTTTGGTGAAGAGTTTTGATGAGGAAATATTGCATATCTCTGGATCTCTAGTTCCATCTCACACAGGGAGCATCCATTGGTT
GGAGAAGAATTTGGTTGAAGGTTCAGTGACTAATAGCAACAAAAGGAAGATGAGAGAAAATGAGGGAGACGTGAGCGTTGCGCCAGATGCAAATACACTTATCTCGAACA
ACGACGGTAGATCAAAAACCAGAAAGCCAAAAACAACCAGAATATCTTGA
mRNA sequenceShow/hide mRNA sequence
ATGGAGTTTAAGATTGATGAACTAAGATATGCTTGTAGCAGCCACATCCGTCCGCCTACAGCCGGCCGCTGCACCACCACAGTTCGAGTCGACGTCTGCCGCTCGCCGTG
CTGGGTGCCTACGCCGACTGCTCACGCCGTTGCTGATTCTCCTCCTCCACCACCCTCGTCGGAGTTCATCGGGTCCGGTGAGTGTGCAGTGGAATCAAGCTGTGATCGTT
GGAACTTTGGAGGTCTGGATCAAGTGACTTTTGGGGTAGGAGTTTTGGACGTTCCAAGTGAATTGAAGTGTGGGTTATGCAATATAAAGAGAGTTAAGCCCGCTAGTCTT
GGTTTCTATTTCAGTGTTGAGAATCCTGATGATCTGGATGATGCGGGTGTCTCAAAACCTAAGCTTATGTTTTGCTTCTTGTGTGAGTGCCATACACATTTGGACGTCGG
TGTGACACAAATGTGTATACCAACATCTGCAACACACGTGGTAATAAAAGTTTGCTGTTTTTACCTTCTTCGAGCTCTCTTTCTTCTCTGGTTGTTGTGGATTCTTCTCC
TCCTTGTTTCGGTTGCCATTTCAATGGAGGGGCTCAAGGTTTCAGAGGCTAATATGGTTGTTTACGTTCACCCATCTAAAAGTAAGAAGGTTTCGCAAGCGGTGCTTCGA
GAGCTTGGTGCTATGCTTCTAAAATTTGATGAAAAATTTGAAGGCGTACTTCTGGCTTATGAGGCCAAAATTATCGATAGAAATGCGAAGATTCTATCTGGAGTGCATCC
CTATTTCGGTGTGACAATAAAGGCAAAGCTATTACTTTTCTCTCCAAAGCCAAACATGCTTGTAGAGGGAAAGGTGGTGAAGCTTAGGCAAGAATCAATCCATATTATTG
TCTTAGGTTTTGCTTCTGCTGTAATAACCGATGAAGTCATTCGCGACGAATTCAAGCATAGAACAAAACATGGAGAAGAAATGTTTGTCAGCAGAGCTCACAAGCACCAT
GTAATAAAGGTTGGGACAATGATACGATTTTTGGTGAAGAGTTTTGATGAGGAAATATTGCATATCTCTGGATCTCTAGTTCCATCTCACACAGGGAGCATCCATTGGTT
GGAGAAGAATTTGGTTGAAGGTTCAGTGACTAATAGCAACAAAAGGAAGATGAGAGAAAATGAGGGAGACGTGAGCGTTGCGCCAGATGCAAATACACTTATCTCGAACA
ACGACGGTAGATCAAAAACCAGAAAGCCAAAAACAACCAGAATATCTTGAAGACTGCTAGTTGTGATATAACATAGATTGTTTTTGTATCAGGGTGATGATGATTCAAAC
TAATTCAAAGCAAGAGATCCTCATTGTTTTCCTATAGTAAGAAAGGCTTGGGAGATGCACCTTGCAGTTGGGAAGAAGATCAATGATATAACTTTATATATCAAATGCAG
CTCCCCATTTTTGTAACTGTTATGAAGAGAGCCCAAATGCTTGTATTTTGGCTTCTTTGGCTTCCTTTAATAAGCTTGTCTAATAACAATTTATCAATTTTGAGTCTACC
TTTAAATTAAAAATTTTTGTTGGAAGTTAGAAGTTTACCATTTGGGC
Protein sequenceShow/hide protein sequence
MEFKIDELRYACSSHIRPPTAGRCTTTVRVDVCRSPCWVPTPTAHAVADSPPPPPSSEFIGSGECAVESSCDRWNFGGLDQVTFGVGVLDVPSELKCGLCNIKRVKPASL
GFYFSVENPDDLDDAGVSKPKLMFCFLCECHTHLDVGVTQMCIPTSATHVVIKVCCFYLLRALFLLWLLWILLLLVSVAISMEGLKVSEANMVVYVHPSKSKKVSQAVLR
ELGAMLLKFDEKFEGVLLAYEAKIIDRNAKILSGVHPYFGVTIKAKLLLFSPKPNMLVEGKVVKLRQESIHIIVLGFASAVITDEVIRDEFKHRTKHGEEMFVSRAHKHH
VIKVGTMIRFLVKSFDEEILHISGSLVPSHTGSIHWLEKNLVEGSVTNSNKRKMRENEGDVSVAPDANTLISNNDGRSKTRKPKTTRIS