; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cla97C11G209010 (gene) of Watermelon (97103) v2.5 genome

Gene IDCla97C11G209010
OrganismCitrullus lanatus subsp. vulgaris cv. 97103 (Watermelon (97103) v2.5)
DescriptionNucleic acid-binding, OB-fold containing protein
Genome locationCla97Chr11:2587696..2594191
RNA-Seq ExpressionCla97C11G209010
SyntenyCla97C11G209010
Gene Ontology termsGO:0006352 - DNA-templated transcription, initiation (biological process)
GO:0005736 - RNA polymerase I complex (cellular component)
GO:0003899 - DNA-directed 5'-3' RNA polymerase activity (molecular function)
InterPro domainsIPR036898 - RNA polymerase Rpb7-like, N-terminal domain superfamily
IPR045113 - RNA polymerase Rpb7-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008466499.1 PREDICTED: uncharacterized protein LOC103503892 isoform X1 [Cucumis melo]2.5e-10388.36Show/hide
Query:  MEGLKVSEANMVVYVHPSKSKKVSQAVLRELGAMLLKFDEKFEGVLLAYEAKIIDKNAKILSGVHPYFGVTIKAKLLLFSPKPNMLLEGKLVKLRQESIH
        MEGLKVS+ANMVVYVHPSKSKKVSQAVLRELGAMLLKFDEKFEGVLLAYEAKIIDKNAKILSGVHPYFGVTIKAKLLLFSPKPNMLLEGK+VKLRQESIH
Subjt:  MEGLKVSEANMVVYVHPSKSKKVSQAVLRELGAMLLKFDEKFEGVLLAYEAKIIDKNAKILSGVHPYFGVTIKAKLLLFSPKPNMLLEGKLVKLRQESIH

Query:  IIVLGFASAVITDEDIRDEFKHRTKHGEEMFVSRAHKHHVIKVGTMIRFLVKSFDEEILHISGSLVPSHTGSIHWLEKNLVEGSVTNSNKRKMRENEGDV
        +IVLGFASAVIT EDIRDEFKHRTKHGEEMFVSRAHKHHVIKVGTMIRFLVKSFDEEILHISGSLVPSHTGSIHWLEKN VEGSV+ S+K+KMRENEG V
Subjt:  IIVLGFASAVITDEDIRDEFKHRTKHGEEMFVSRAHKHHVIKVGTMIRFLVKSFDEEILHISGSLVPSHTGSIHWLEKNLVEGSVTNSNKRKMRENEGDV

Query:  ----SIVPDANTLISNNDGRSKTRKQKTTRIS
            ++  D N +I  ND + KT+KQKTTRIS
Subjt:  ----SIVPDANTLISNNDGRSKTRKQKTTRIS

XP_011652444.1 uncharacterized protein LOC101216589 [Cucumis sativus]1.9e-10388.36Show/hide
Query:  MEGLKVSEANMVVYVHPSKSKKVSQAVLRELGAMLLKFDEKFEGVLLAYEAKIIDKNAKILSGVHPYFGVTIKAKLLLFSPKPNMLLEGKLVKLRQESIH
        MEGLKVS+ANMVVYVHPSKSKK+SQAVLRELGAMLLKFDEKFEGVLLAYEAKIIDKNAKILSGVHPYFGVTIKAKLLLFSPKPNMLLEGK+VKLRQESIH
Subjt:  MEGLKVSEANMVVYVHPSKSKKVSQAVLRELGAMLLKFDEKFEGVLLAYEAKIIDKNAKILSGVHPYFGVTIKAKLLLFSPKPNMLLEGKLVKLRQESIH

Query:  IIVLGFASAVITDEDIRDEFKHRTKHGEEMFVSRAHKHHVIKVGTMIRFLVKSFDEEILHISGSLVPSHTGSIHWLEKNLVEGSVTNSNKRKMRENEGDV
        +IVLGFASAVIT EDIRDEFKHRTKHGEEMFVSRAHKHH+IKVGTMIR LVKSFDEEILHISGSLVPSHTGSIHWLEKN VEGSVT S KRKMRENEG V
Subjt:  IIVLGFASAVITDEDIRDEFKHRTKHGEEMFVSRAHKHHVIKVGTMIRFLVKSFDEEILHISGSLVPSHTGSIHWLEKNLVEGSVTNSNKRKMRENEGDV

Query:  ----SIVPDANTLISNNDGRSKTRKQKTTRIS
            S+  D N +I NND + KT+KQK TRIS
Subjt:  ----SIVPDANTLISNNDGRSKTRKQKTTRIS

XP_038889707.1 uncharacterized protein LOC120079556 isoform X1 [Benincasa hispida]3.7e-10789.47Show/hide
Query:  MEGLKVSEANMVVYVHPSKSKKVSQAVLRELGAMLLKFDEKFEGVLLAYEAKIIDKNAKILSGVHPYFGVTIKAKLLLFSPKPNMLLEGKLVKLRQESIH
        MEGLKVS+AN+VVYVHPSKSKKVSQAVLRELG MLLKFDEKFEGVLLAYEA IIDKNAKILSGVHPYFGVTIKAKLLLFSPKPNMLLEGK+VKLRQESIH
Subjt:  MEGLKVSEANMVVYVHPSKSKKVSQAVLRELGAMLLKFDEKFEGVLLAYEAKIIDKNAKILSGVHPYFGVTIKAKLLLFSPKPNMLLEGKLVKLRQESIH

Query:  IIVLGFASAVITDEDIRDEFKHRTKHGEEMFVSRAHKHHVIKVGTMIRFLVKSFDEEILHISGSLVPSHTGSIHWLEKNLVEGSVTNSNKRKMRENEGDV
        +IVLGF+S VITDEDIRDEFKHRTKHGEEMFVSR HKHHVIKVGTMIRFLVKSFDEEILHISGSLVPSHTGSIHWLEKN VEGSVTNS+K+K RENEG  
Subjt:  IIVLGFASAVITDEDIRDEFKHRTKHGEEMFVSRAHKHHVIKVGTMIRFLVKSFDEEILHISGSLVPSHTGSIHWLEKNLVEGSVTNSNKRKMRENEGDV

Query:  SIVPDANTLISNNDGRSKTRKQKTTRIS
        S+  DAN LI NND +SKT++QKTTRIS
Subjt:  SIVPDANTLISNNDGRSKTRKQKTTRIS

XP_038889709.1 uncharacterized protein LOC120079556 isoform X2 [Benincasa hispida]9.2e-10689.47Show/hide
Query:  MEGLKVSEANMVVYVHPSKSKKVSQAVLRELGAMLLKFDEKFEGVLLAYEAKIIDKNAKILSGVHPYFGVTIKAKLLLFSPKPNMLLEGKLVKLRQESIH
        MEGLKVS+AN+VVYVHPSKSKKVSQAVLRELG MLLKFDEKFEGVLLAYEA IIDKNAKILSGVHPYFGVTIKAKLLLFSPKPNMLLEGK+VKLRQESIH
Subjt:  MEGLKVSEANMVVYVHPSKSKKVSQAVLRELGAMLLKFDEKFEGVLLAYEAKIIDKNAKILSGVHPYFGVTIKAKLLLFSPKPNMLLEGKLVKLRQESIH

Query:  IIVLGFASAVITDEDIRDEFKHRTKHGEEMFVSRAHKHHVIKVGTMIRFLVKSFDEEILHISGSLVPSHTGSIHWLEKNLVEGSVTNSNKRKMRENEGDV
        +IVLGF+S VITDEDIRDEFKHRTKHGEEMFVSR HKHHVIKVGTMIRFLVKSFDEEILHISGSLVPSHTGSIHWLEKN VEGSVTNS K+K RENEG  
Subjt:  IIVLGFASAVITDEDIRDEFKHRTKHGEEMFVSRAHKHHVIKVGTMIRFLVKSFDEEILHISGSLVPSHTGSIHWLEKNLVEGSVTNSNKRKMRENEGDV

Query:  SIVPDANTLISNNDGRSKTRKQKTTRIS
        S+  DAN LI NND +SKT++QKTTRIS
Subjt:  SIVPDANTLISNNDGRSKTRKQKTTRIS

XP_038898978.1 uncharacterized protein LOC120086413 isoform X3 [Benincasa hispida]7.8e-10587.5Show/hide
Query:  MEGLKVSEANMVVYVHPSKSKKVSQAVLRELGAMLLKFDEKFEGVLLAYEAKIIDKNAKILSGVHPYFGVTIKAKLLLFSPKPNMLLEGKLVKLRQESIH
        M+GLKVS+ANMVVYVHPSKSKKVSQAVLR LGAMLLKFDEKFEGVLLAYEAKIIDK AKILSGVHPYFGVTIKAKLLLFSPKPNML+EGK+VKLRQESIH
Subjt:  MEGLKVSEANMVVYVHPSKSKKVSQAVLRELGAMLLKFDEKFEGVLLAYEAKIIDKNAKILSGVHPYFGVTIKAKLLLFSPKPNMLLEGKLVKLRQESIH

Query:  IIVLGFASAVITDEDIRDEFKHRTKHGEEMFVSRAHKHHVIKVGTMIRFLVKSFDEEILHISGSLVPSHTGSIHWLEKNLVEGSVTNSNKRKMRENEGDV
        +IVLGFASAVIT+EDIRDEFKHRTKHGEEMFVSRAHKHHVIKVGTMIRFLVKSFDEEILHI+GSLVPSHTGSIHWLEKN VEG+VT+S+K+KMRENEG+V
Subjt:  IIVLGFASAVITDEDIRDEFKHRTKHGEEMFVSRAHKHHVIKVGTMIRFLVKSFDEEILHISGSLVPSHTGSIHWLEKNLVEGSVTNSNKRKMRENEGDV

Query:  ----SIVPDANTLISNNDGRSKTRKQKTTRIS
            S+  + N LI NND +SKT+KQKTTRIS
Subjt:  ----SIVPDANTLISNNDGRSKTRKQKTTRIS

TrEMBL top hitse value%identityAlignment
A0A0A0LGG8 Uncharacterized protein9.3e-10488.36Show/hide
Query:  MEGLKVSEANMVVYVHPSKSKKVSQAVLRELGAMLLKFDEKFEGVLLAYEAKIIDKNAKILSGVHPYFGVTIKAKLLLFSPKPNMLLEGKLVKLRQESIH
        MEGLKVS+ANMVVYVHPSKSKK+SQAVLRELGAMLLKFDEKFEGVLLAYEAKIIDKNAKILSGVHPYFGVTIKAKLLLFSPKPNMLLEGK+VKLRQESIH
Subjt:  MEGLKVSEANMVVYVHPSKSKKVSQAVLRELGAMLLKFDEKFEGVLLAYEAKIIDKNAKILSGVHPYFGVTIKAKLLLFSPKPNMLLEGKLVKLRQESIH

Query:  IIVLGFASAVITDEDIRDEFKHRTKHGEEMFVSRAHKHHVIKVGTMIRFLVKSFDEEILHISGSLVPSHTGSIHWLEKNLVEGSVTNSNKRKMRENEGDV
        +IVLGFASAVIT EDIRDEFKHRTKHGEEMFVSRAHKHH+IKVGTMIR LVKSFDEEILHISGSLVPSHTGSIHWLEKN VEGSVT S KRKMRENEG V
Subjt:  IIVLGFASAVITDEDIRDEFKHRTKHGEEMFVSRAHKHHVIKVGTMIRFLVKSFDEEILHISGSLVPSHTGSIHWLEKNLVEGSVTNSNKRKMRENEGDV

Query:  ----SIVPDANTLISNNDGRSKTRKQKTTRIS
            S+  D N +I NND + KT+KQK TRIS
Subjt:  ----SIVPDANTLISNNDGRSKTRKQKTTRIS

A0A1S3CRF8 uncharacterized protein LOC103503892 isoform X23.9e-10288.36Show/hide
Query:  MEGLKVSEANMVVYVHPSKSKKVSQAVLRELGAMLLKFDEKFEGVLLAYEAKIIDKNAKILSGVHPYFGVTIKAKLLLFSPKPNMLLEGKLVKLRQESIH
        MEGLKVS+ANMVVYVHPSKSKKVSQAVLRELGAMLLKFDEKFEGVLLAYEAKIIDKNAKILSGVHPYFGVTIKAKLLLFSPKPNMLLEGK+VKLRQESIH
Subjt:  MEGLKVSEANMVVYVHPSKSKKVSQAVLRELGAMLLKFDEKFEGVLLAYEAKIIDKNAKILSGVHPYFGVTIKAKLLLFSPKPNMLLEGKLVKLRQESIH

Query:  IIVLGFASAVITDEDIRDEFKHRTKHGEEMFVSRAHKHHVIKVGTMIRFLVKSFDEEILHISGSLVPSHTGSIHWLEKNLVEGSVTNSNKRKMRENEGDV
        +IVLGFASAVIT EDIRDEFKHRTKHGEEMFVSRAHKHHVIKVGTMIRFLVKSFDEEILHISGSLVPSHTGSIHWLEKN VEGSV+ S K+KMRENEG V
Subjt:  IIVLGFASAVITDEDIRDEFKHRTKHGEEMFVSRAHKHHVIKVGTMIRFLVKSFDEEILHISGSLVPSHTGSIHWLEKNLVEGSVTNSNKRKMRENEGDV

Query:  ----SIVPDANTLISNNDGRSKTRKQKTTRIS
            ++  D N +I  ND + KT+KQKTTRIS
Subjt:  ----SIVPDANTLISNNDGRSKTRKQKTTRIS

A0A1S4E5I5 uncharacterized protein LOC103503892 isoform X11.2e-10388.36Show/hide
Query:  MEGLKVSEANMVVYVHPSKSKKVSQAVLRELGAMLLKFDEKFEGVLLAYEAKIIDKNAKILSGVHPYFGVTIKAKLLLFSPKPNMLLEGKLVKLRQESIH
        MEGLKVS+ANMVVYVHPSKSKKVSQAVLRELGAMLLKFDEKFEGVLLAYEAKIIDKNAKILSGVHPYFGVTIKAKLLLFSPKPNMLLEGK+VKLRQESIH
Subjt:  MEGLKVSEANMVVYVHPSKSKKVSQAVLRELGAMLLKFDEKFEGVLLAYEAKIIDKNAKILSGVHPYFGVTIKAKLLLFSPKPNMLLEGKLVKLRQESIH

Query:  IIVLGFASAVITDEDIRDEFKHRTKHGEEMFVSRAHKHHVIKVGTMIRFLVKSFDEEILHISGSLVPSHTGSIHWLEKNLVEGSVTNSNKRKMRENEGDV
        +IVLGFASAVIT EDIRDEFKHRTKHGEEMFVSRAHKHHVIKVGTMIRFLVKSFDEEILHISGSLVPSHTGSIHWLEKN VEGSV+ S+K+KMRENEG V
Subjt:  IIVLGFASAVITDEDIRDEFKHRTKHGEEMFVSRAHKHHVIKVGTMIRFLVKSFDEEILHISGSLVPSHTGSIHWLEKNLVEGSVTNSNKRKMRENEGDV

Query:  ----SIVPDANTLISNNDGRSKTRKQKTTRIS
            ++  D N +I  ND + KT+KQKTTRIS
Subjt:  ----SIVPDANTLISNNDGRSKTRKQKTTRIS

A0A5D3E6A0 Putative DNA-directed RNA polymerase I subunit RPA431.2e-10388.36Show/hide
Query:  MEGLKVSEANMVVYVHPSKSKKVSQAVLRELGAMLLKFDEKFEGVLLAYEAKIIDKNAKILSGVHPYFGVTIKAKLLLFSPKPNMLLEGKLVKLRQESIH
        MEGLKVS+ANMVVYVHPSKSKKVSQAVLRELGAMLLKFDEKFEGVLLAYEAKIIDKNAKILSGVHPYFGVTIKAKLLLFSPKPNMLLEGK+VKLRQESIH
Subjt:  MEGLKVSEANMVVYVHPSKSKKVSQAVLRELGAMLLKFDEKFEGVLLAYEAKIIDKNAKILSGVHPYFGVTIKAKLLLFSPKPNMLLEGKLVKLRQESIH

Query:  IIVLGFASAVITDEDIRDEFKHRTKHGEEMFVSRAHKHHVIKVGTMIRFLVKSFDEEILHISGSLVPSHTGSIHWLEKNLVEGSVTNSNKRKMRENEGDV
        +IVLGFASAVIT EDIRDEFKHRTKHGEEMFVSRAHKHHVIKVGTMIRFLVKSFDEEILHISGSLVPSHTGSIHWLEKN VEGSV+ S+K+KMRENEG V
Subjt:  IIVLGFASAVITDEDIRDEFKHRTKHGEEMFVSRAHKHHVIKVGTMIRFLVKSFDEEILHISGSLVPSHTGSIHWLEKNLVEGSVTNSNKRKMRENEGDV

Query:  ----SIVPDANTLISNNDGRSKTRKQKTTRIS
            ++  D N +I  ND + KT+KQKTTRIS
Subjt:  ----SIVPDANTLISNNDGRSKTRKQKTTRIS

A0A6J1D5Y9 uncharacterized protein LOC1110167573.9e-10284.05Show/hide
Query:  MEGLKVSEANMVVYVHPSKSKKVSQAVLRELGAMLLKFDEKFEGVLLAYEAKIIDKNAKILSGVHPYFGVTIKAKLLLFSPKPNMLLEGKLVKLRQESIH
        MEGLKVS+AN+VVYVHPSKSKKVSQAVLRELGAMLLKFDE+FEGVLLAY+AKI DK+AKILSGVHPYFGVT+KAKLLLFSPKPNMLLEGK+VKLRQES+H
Subjt:  MEGLKVSEANMVVYVHPSKSKKVSQAVLRELGAMLLKFDEKFEGVLLAYEAKIIDKNAKILSGVHPYFGVTIKAKLLLFSPKPNMLLEGKLVKLRQESIH

Query:  IIVLGFASAVITDEDIRDEFKHRTKHGEEMFVSRAHKHHVIKVGTMIRFLVKSFDEEILHISGSLVPSHTGSIHWLEKNLVEGSVTNSNKRKMRENEGDV
        +IVLGFASA ITDEDIRDEFKHRTKH EEMFVSRAHKHHVIKVGTMIRFLVKSFDEEILHISGSLVPSHTGSIHWLEKN +EGSVT+  ++K R+NEG+ 
Subjt:  IIVLGFASAVITDEDIRDEFKHRTKHGEEMFVSRAHKHHVIKVGTMIRFLVKSFDEEILHISGSLVPSHTGSIHWLEKNLVEGSVTNSNKRKMRENEGDV

Query:  ----SIVPDANTLISNNDGRSKTRKQKTTRIS
            S+  D N LI NND +SKT+KQKT+RIS
Subjt:  ----SIVPDANTLISNNDGRSKTRKQKTTRIS

SwissProt top hitse value%identityAlignment
O43036 DNA-directed RNA polymerase I subunit rpa434.3e-0523.31Show/hide
Query:  NMVVYVHPSKSKKVSQAVLRELGAMLLKFDEKFEGVLLAYE-AKIIDKNAKILSGVHPYFGVTIKAKLLLFSPKPNMLLEGKLVKLRQESIHIIVLGFAS
        ++ + + P  S+    A+   + +M+L    +  G++LAY+  + ++K+AK++    P+  + ++  +L+FSPK    LEGK+  +    I +++LG  +
Subjt:  NMVVYVHPSKSKKVSQAVLRELGAMLLKFDEKFEGVLLAYE-AKIIDKNAKILSGVHPYFGVTIKAKLLLFSPKPNMLLEGKLVKLRQESIHIIVLGFAS

Query:  AVITDEDI-RDEFKHRTKHGEEMFVSRAHKHHVIKVGTMIRFLVKSFDEE--ILHISGSLVPS
        A I  + I +D         EE    + +  ++++ G  + F+V     E  +  + G+L  S
Subjt:  AVITDEDI-RDEFKHRTKHGEEMFVSRAHKHHVIKVGTMIRFLVKSFDEE--ILHISGSLVPS

Arabidopsis top hitse value%identityAlignment
AT1G75670.1 DNA-directed RNA polymerases7.3e-6154.64Show/hide
Query:  MEGLKVSEANMVVYVHPSKSKKVSQAVLRELGAMLLKFDEKFEGVLLAYEAKIIDKNAKILSGVHPYFGVTIKAKLLLFSPKPNMLLEGKLVKLRQESIH
        MEGLK+SEA +++++HPS+S+ V Q + REL ++L +++E F+GVLLAY+A +  K AKIL+G+HPYFGV +  +LLLF PKP   +EGK+VK+  ESIH
Subjt:  MEGLKVSEANMVVYVHPSKSKKVSQAVLRELGAMLLKFDEKFEGVLLAYEAKIIDKNAKILSGVHPYFGVTIKAKLLLFSPKPNMLLEGKLVKLRQESIH

Query:  IIVLGFASAVITDEDIRDEFKHRTKHGEEMFVSRAHKHHVIKVGTMIRFLVKSFDEEILHISGSLVPSHTGSIHWLEKNLVEGSVTNSNKRKMR
        +IVLGF++AVITD DIR+EFK+R + GE  FVSR+HK H +K+GTM+R  V+SFDEE++HI+GSL+P +TG + WLEK   E   T+ + ++ +
Subjt:  IIVLGFASAVITDEDIRDEFKHRTKHGEEMFVSRAHKHHVIKVGTMIRFLVKSFDEEILHISGSLVPSHTGSIHWLEKNLVEGSVTNSNKRKMR

AT1G75670.2 DNA-directed RNA polymerases7.3e-6154.64Show/hide
Query:  MEGLKVSEANMVVYVHPSKSKKVSQAVLRELGAMLLKFDEKFEGVLLAYEAKIIDKNAKILSGVHPYFGVTIKAKLLLFSPKPNMLLEGKLVKLRQESIH
        MEGLK+SEA +++++HPS+S+ V Q + REL ++L +++E F+GVLLAY+A +  K AKIL+G+HPYFGV +  +LLLF PKP   +EGK+VK+  ESIH
Subjt:  MEGLKVSEANMVVYVHPSKSKKVSQAVLRELGAMLLKFDEKFEGVLLAYEAKIIDKNAKILSGVHPYFGVTIKAKLLLFSPKPNMLLEGKLVKLRQESIH

Query:  IIVLGFASAVITDEDIRDEFKHRTKHGEEMFVSRAHKHHVIKVGTMIRFLVKSFDEEILHISGSLVPSHTGSIHWLEKNLVEGSVTNSNKRKMR
        +IVLGF++AVITD DIR+EFK+R + GE  FVSR+HK H +K+GTM+R  V+SFDEE++HI+GSL+P +TG + WLEK   E   T+ + ++ +
Subjt:  IIVLGFASAVITDEDIRDEFKHRTKHGEEMFVSRAHKHHVIKVGTMIRFLVKSFDEEILHISGSLVPSHTGSIHWLEKNLVEGSVTNSNKRKMR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAGTTTAAGATTGATGAACTAAGATATGCTTGTAGCAGCCACATCCGTCCGCCTACAGCCGGCCGCTGCACCACCACCGTTCGAGTCGACGTCTGCCGCTCGCCGTA
CTGGGTGCCTACGCCGACTGCTCACGCCGTTGCTGATTCTCCTCCTCCACCACCCTCGTCGGAGTTCATCGGGTCCGGTGAGTGTGCTGTGGATTCAAGCTGTGATCGTT
GGAACTTTGGAGGTCTGGATCAAGTGACTTTTGGGGTAGGAGTTTTGGACGTTCCAAGGGAATTGAAGTGTGGGTTATGCAATATAAAGAGAGTTAAGCCCGCTAGTCTT
GGTTTCTATTTCAGTGTTGAGAATCCTGATGGTCTGGATAATGCGGGTGTCTCAAAACCTAAGCTTATGTTTTACTTCTTGTGTGAGTGCCATACACATTTGGACGTCGG
TGTGACACAAATGTGTACGCCAACATTTGCAACACACGTGGTAATAAAAGTTCGCTGTTTTTACCTTCTTCGAGCTCTCTTTCTTCTCTGGTTGTTGTGGATTCTTCTCC
TCCTTGTTTCGGTTGCCATTTCAATGGAGGGGCTCAAGGTTTCAGAGGCTAATATGGTTGTTTACGTTCACCCATCTAAAAGTAAGAAGGTTTCGCAAGCGGTGCTTCGA
GAGCTTGGTGCTATGCTTCTAAAATTTGATGAAAAATTTGAAGGCGTTCTTCTGGCTTATGAGGCCAAAATTATCGATAAAAATGCGAAGATTCTGTCTGGAGTGCATCC
CTATTTTGGTGTGACAATAAAGGCAAAGCTATTACTTTTCTCTCCAAAGCCAAACATGCTTTTAGAGGGAAAGTTGGTGAAGCTTAGGCAAGAATCAATCCATATTATTG
TCTTAGGTTTTGCTTCTGCTGTAATAACCGATGAAGACATTCGCGACGAATTCAAGCATAGAACAAAACATGGAGAAGAAATGTTTGTCAGCAGAGCTCACAAGCACCAT
GTAATAAAGGTTGGGACAATGATACGATTTTTGGTGAAGAGTTTTGATGAGGAAATATTGCATATCTCTGGATCTCTAGTTCCATCTCACACAGGGAGCATCCATTGGTT
GGAGAAGAATTTGGTTGAAGGTTCAGTGACTAATAGCAACAAAAGGAAGATGAGAGAAAATGAGGGAGACGTTAGCATTGTGCCAGATGCAAATACACTTATCTCGAACA
ACGACGGTAGATCAAAAACCAGAAAGCAAAAAACTACCAGAATATCTTGA
mRNA sequenceShow/hide mRNA sequence
ATGGAGTTTAAGATTGATGAACTAAGATATGCTTGTAGCAGCCACATCCGTCCGCCTACAGCCGGCCGCTGCACCACCACCGTTCGAGTCGACGTCTGCCGCTCGCCGTA
CTGGGTGCCTACGCCGACTGCTCACGCCGTTGCTGATTCTCCTCCTCCACCACCCTCGTCGGAGTTCATCGGGTCCGGTGAGTGTGCTGTGGATTCAAGCTGTGATCGTT
GGAACTTTGGAGGTCTGGATCAAGTGACTTTTGGGGTAGGAGTTTTGGACGTTCCAAGGGAATTGAAGTGTGGGTTATGCAATATAAAGAGAGTTAAGCCCGCTAGTCTT
GGTTTCTATTTCAGTGTTGAGAATCCTGATGGTCTGGATAATGCGGGTGTCTCAAAACCTAAGCTTATGTTTTACTTCTTGTGTGAGTGCCATACACATTTGGACGTCGG
TGTGACACAAATGTGTACGCCAACATTTGCAACACACGTGGTAATAAAAGTTCGCTGTTTTTACCTTCTTCGAGCTCTCTTTCTTCTCTGGTTGTTGTGGATTCTTCTCC
TCCTTGTTTCGGTTGCCATTTCAATGGAGGGGCTCAAGGTTTCAGAGGCTAATATGGTTGTTTACGTTCACCCATCTAAAAGTAAGAAGGTTTCGCAAGCGGTGCTTCGA
GAGCTTGGTGCTATGCTTCTAAAATTTGATGAAAAATTTGAAGGCGTTCTTCTGGCTTATGAGGCCAAAATTATCGATAAAAATGCGAAGATTCTGTCTGGAGTGCATCC
CTATTTTGGTGTGACAATAAAGGCAAAGCTATTACTTTTCTCTCCAAAGCCAAACATGCTTTTAGAGGGAAAGTTGGTGAAGCTTAGGCAAGAATCAATCCATATTATTG
TCTTAGGTTTTGCTTCTGCTGTAATAACCGATGAAGACATTCGCGACGAATTCAAGCATAGAACAAAACATGGAGAAGAAATGTTTGTCAGCAGAGCTCACAAGCACCAT
GTAATAAAGGTTGGGACAATGATACGATTTTTGGTGAAGAGTTTTGATGAGGAAATATTGCATATCTCTGGATCTCTAGTTCCATCTCACACAGGGAGCATCCATTGGTT
GGAGAAGAATTTGGTTGAAGGTTCAGTGACTAATAGCAACAAAAGGAAGATGAGAGAAAATGAGGGAGACGTTAGCATTGTGCCAGATGCAAATACACTTATCTCGAACA
ACGACGGTAGATCAAAAACCAGAAAGCAAAAAACTACCAGAATATCTTGAAGACTGCTAGTTGTGATATAACATAGATTGTTTTTGTATCAGGGTGATGATGATTCAAAC
TAATTCAAAGCAAGAGATCCTCATTGTTTTCCTATAGTAAGAAAGGCTTGGGATATGCACCTTGCAGTTGGGAAGAAGATCAATGATATAACTTTATATATCAAATGCAG
CTCCCCATTTTTGCAACTGTTATGAAGAGAGCCCAAATGCTTGTATTGTGGCTTCTTTGGCTTCCTTTAATAAGTTTGTCTAATAACAATTTATCAATTTTGAGTCTACC
TTTAAATTAAAAAATTTTGTTGGAAGTTAGAAGTTTACCATTTGGAC
Protein sequenceShow/hide protein sequence
MEFKIDELRYACSSHIRPPTAGRCTTTVRVDVCRSPYWVPTPTAHAVADSPPPPPSSEFIGSGECAVDSSCDRWNFGGLDQVTFGVGVLDVPRELKCGLCNIKRVKPASL
GFYFSVENPDGLDNAGVSKPKLMFYFLCECHTHLDVGVTQMCTPTFATHVVIKVRCFYLLRALFLLWLLWILLLLVSVAISMEGLKVSEANMVVYVHPSKSKKVSQAVLR
ELGAMLLKFDEKFEGVLLAYEAKIIDKNAKILSGVHPYFGVTIKAKLLLFSPKPNMLLEGKLVKLRQESIHIIVLGFASAVITDEDIRDEFKHRTKHGEEMFVSRAHKHH
VIKVGTMIRFLVKSFDEEILHISGSLVPSHTGSIHWLEKNLVEGSVTNSNKRKMRENEGDVSIVPDANTLISNNDGRSKTRKQKTTRIS