; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MC11g0120 (gene) of Bitter gourd (Dali-11) v1 genome

Gene IDMC11g0120
OrganismMomordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
DescriptionNucleic acid-binding, OB-fold containing protein
Genome locationMC11:857540..860472
RNA-Seq ExpressionMC11g0120
SyntenyMC11g0120
Gene Ontology termsGO:0006352 - DNA-templated transcription, initiation (biological process)
GO:0005736 - RNA polymerase I complex (cellular component)
GO:0003899 - DNA-directed 5'-3' RNA polymerase activity (molecular function)
InterPro domainsIPR036898 - RNA polymerase Rpb7-like, N-terminal domain superfamily
IPR045113 - RNA polymerase Rpb7-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7037901.1 rpa43, partial [Cucurbita argyrosperma subsp. argyrosperma]1.30e-14392.24Show/hide
Query:  MEGLKVSDANLVVYVHPSKSKKVSQAVLRELGAMLLKFDERFEGVLLAYDAKITDKSAKILSGVHPYFGVTLKAKLLLFSPKPNMLLEGKVVKLRQESVH
        MEGLKVSDANLV+YVHPSKSKKVSQAVLRELGAMLLKFDERFEGVLLAY+A I DKSAKILSGVHPYFGVT+KAKLLLFSPKPNMLLEGKVVKLRQES+H
Subjt:  MEGLKVSDANLVVYVHPSKSKKVSQAVLRELGAMLLKFDERFEGVLLAYDAKITDKSAKILSGVHPYFGVTLKAKLLLFSPKPNMLLEGKVVKLRQESVH

Query:  VIVLGFASAAITDEDIRDEFKHRTKHEEEMFVSRAHKHHVIKVGTMIRFLVKSFDEEILHISGSLVPSHTGSIHWLEKNSIEGSVTDRRRKKTRDNEGES
        VIVLGFASA ITDEDIRDEFKHRTKH EEMFVSRA+KHHVIKVGTM+RFLVKSFDEEILHISGSLVPSHTGSIH LEKNS+EGS T R RKKTRDN+ ES
Subjt:  VIVLGFASAAITDEDIRDEFKHRTKHEEEMFVSRAHKHHVIKVGTMIRFLVKSFDEEILHISGSLVPSHTGSIHWLEKNSIEGSVTDRRRKKTRDNEGES

Query:  LLQDSVATDVNALILNNDHQSKTKKQKTSRIS
        LLQDSVATDVNAL+LNNDHQSKTKKQKTSRIS
Subjt:  LLQDSVATDVNALILNNDHQSKTKKQKTSRIS

XP_022148351.1 uncharacterized protein LOC111016757 [Momordica charantia]1.31e-158100Show/hide
Query:  MEGLKVSDANLVVYVHPSKSKKVSQAVLRELGAMLLKFDERFEGVLLAYDAKITDKSAKILSGVHPYFGVTLKAKLLLFSPKPNMLLEGKVVKLRQESVH
        MEGLKVSDANLVVYVHPSKSKKVSQAVLRELGAMLLKFDERFEGVLLAYDAKITDKSAKILSGVHPYFGVTLKAKLLLFSPKPNMLLEGKVVKLRQESVH
Subjt:  MEGLKVSDANLVVYVHPSKSKKVSQAVLRELGAMLLKFDERFEGVLLAYDAKITDKSAKILSGVHPYFGVTLKAKLLLFSPKPNMLLEGKVVKLRQESVH

Query:  VIVLGFASAAITDEDIRDEFKHRTKHEEEMFVSRAHKHHVIKVGTMIRFLVKSFDEEILHISGSLVPSHTGSIHWLEKNSIEGSVTDRRRKKTRDNEGES
        VIVLGFASAAITDEDIRDEFKHRTKHEEEMFVSRAHKHHVIKVGTMIRFLVKSFDEEILHISGSLVPSHTGSIHWLEKNSIEGSVTDRRRKKTRDNEGES
Subjt:  VIVLGFASAAITDEDIRDEFKHRTKHEEEMFVSRAHKHHVIKVGTMIRFLVKSFDEEILHISGSLVPSHTGSIHWLEKNSIEGSVTDRRRKKTRDNEGES

Query:  LLQDSVATDVNALILNNDHQSKTKKQKTSRIS
        LLQDSVATDVNALILNNDHQSKTKKQKTSRIS
Subjt:  LLQDSVATDVNALILNNDHQSKTKKQKTSRIS

XP_022940043.1 uncharacterized protein LOC111445794 isoform X1 [Cucurbita moschata]4.15e-14692.24Show/hide
Query:  MEGLKVSDANLVVYVHPSKSKKVSQAVLRELGAMLLKFDERFEGVLLAYDAKITDKSAKILSGVHPYFGVTLKAKLLLFSPKPNMLLEGKVVKLRQESVH
        MEGLKVSDANLV+YVHPSKSKKVSQAVLRELGAMLLKFDERFEGVLLAY+A I DKSAKILSGVHPYFGVT+KAKLLLFSPKPNMLLEGKVVKLRQES+H
Subjt:  MEGLKVSDANLVVYVHPSKSKKVSQAVLRELGAMLLKFDERFEGVLLAYDAKITDKSAKILSGVHPYFGVTLKAKLLLFSPKPNMLLEGKVVKLRQESVH

Query:  VIVLGFASAAITDEDIRDEFKHRTKHEEEMFVSRAHKHHVIKVGTMIRFLVKSFDEEILHISGSLVPSHTGSIHWLEKNSIEGSVTDRRRKKTRDNEGES
        VIVLGFASA ITDEDIR+EFKHRTKH EEMFVSRA+KHHVIKVGTM+RFLVKSFDEEILHISGSLVPSHTGSIH LEKNS+EGSVT R RKKTRDN+ ES
Subjt:  VIVLGFASAAITDEDIRDEFKHRTKHEEEMFVSRAHKHHVIKVGTMIRFLVKSFDEEILHISGSLVPSHTGSIHWLEKNSIEGSVTDRRRKKTRDNEGES

Query:  LLQDSVATDVNALILNNDHQSKTKKQKTSRIS
        LLQDSVATDVNAL+LNNDHQSKTKKQKTSRIS
Subjt:  LLQDSVATDVNALILNNDHQSKTKKQKTSRIS

XP_022940045.1 uncharacterized protein LOC111445794 isoform X2 [Cucurbita moschata]2.70e-14492.24Show/hide
Query:  MEGLKVSDANLVVYVHPSKSKKVSQAVLRELGAMLLKFDERFEGVLLAYDAKITDKSAKILSGVHPYFGVTLKAKLLLFSPKPNMLLEGKVVKLRQESVH
        MEGLKVSDANLV+YVHPSKSKKVSQAVLRELGAMLLKFDERFEGVLLAY+A I DKSAKILSGVHPYFGVT+KAKLLLFSPKPNMLLEGKVVKLRQES+H
Subjt:  MEGLKVSDANLVVYVHPSKSKKVSQAVLRELGAMLLKFDERFEGVLLAYDAKITDKSAKILSGVHPYFGVTLKAKLLLFSPKPNMLLEGKVVKLRQESVH

Query:  VIVLGFASAAITDEDIRDEFKHRTKHEEEMFVSRAHKHHVIKVGTMIRFLVKSFDEEILHISGSLVPSHTGSIHWLEKNSIEGSVTDRRRKKTRDNEGES
        VIVLGFASA ITDEDIR+EFKHRTKH EEMFVSRA+KHHVIKVGTM+RFLVKSFDEEILHISGSLVPSHTGSIH LEKNS+EGSVT R RKKTRDN+ ES
Subjt:  VIVLGFASAAITDEDIRDEFKHRTKHEEEMFVSRAHKHHVIKVGTMIRFLVKSFDEEILHISGSLVPSHTGSIHWLEKNSIEGSVTDRRRKKTRDNEGES

Query:  LLQDSVATDVNALILNNDHQSKTKKQKTSRIS
        LLQDSVATDVNAL+LNNDHQSKTKKQKTSRIS
Subjt:  LLQDSVATDVNALILNNDHQSKTKKQKTSRIS

XP_023523351.1 uncharacterized protein LOC111787571 [Cucurbita pepo subsp. pepo]3.41e-14591.81Show/hide
Query:  MEGLKVSDANLVVYVHPSKSKKVSQAVLRELGAMLLKFDERFEGVLLAYDAKITDKSAKILSGVHPYFGVTLKAKLLLFSPKPNMLLEGKVVKLRQESVH
        MEGLKVSDANLV+YVHPSKSKKVSQAVLRELGAMLLKFDERFEGVLLAY+A I DKSAKILSGVHPYFGVT+KAKLLLFSPKPNMLLEGKVVKLRQES+H
Subjt:  MEGLKVSDANLVVYVHPSKSKKVSQAVLRELGAMLLKFDERFEGVLLAYDAKITDKSAKILSGVHPYFGVTLKAKLLLFSPKPNMLLEGKVVKLRQESVH

Query:  VIVLGFASAAITDEDIRDEFKHRTKHEEEMFVSRAHKHHVIKVGTMIRFLVKSFDEEILHISGSLVPSHTGSIHWLEKNSIEGSVTDRRRKKTRDNEGES
        VIVLGFASA ITDEDIRDEFKHRTKH EEMFVSRA+KHHVIKVGTM+RFLVKSFDEEILHISGSLVPSHTGSIH LEKNS+EGSVT+R RKK RD + ES
Subjt:  VIVLGFASAAITDEDIRDEFKHRTKHEEEMFVSRAHKHHVIKVGTMIRFLVKSFDEEILHISGSLVPSHTGSIHWLEKNSIEGSVTDRRRKKTRDNEGES

Query:  LLQDSVATDVNALILNNDHQSKTKKQKTSRIS
        LLQDSVATDVNAL+LNNDHQSKTKKQKTSRIS
Subjt:  LLQDSVATDVNALILNNDHQSKTKKQKTSRIS

TrEMBL top hitse value%identityAlignment
A0A6J1D5Y9 uncharacterized protein LOC1110167576.37e-159100Show/hide
Query:  MEGLKVSDANLVVYVHPSKSKKVSQAVLRELGAMLLKFDERFEGVLLAYDAKITDKSAKILSGVHPYFGVTLKAKLLLFSPKPNMLLEGKVVKLRQESVH
        MEGLKVSDANLVVYVHPSKSKKVSQAVLRELGAMLLKFDERFEGVLLAYDAKITDKSAKILSGVHPYFGVTLKAKLLLFSPKPNMLLEGKVVKLRQESVH
Subjt:  MEGLKVSDANLVVYVHPSKSKKVSQAVLRELGAMLLKFDERFEGVLLAYDAKITDKSAKILSGVHPYFGVTLKAKLLLFSPKPNMLLEGKVVKLRQESVH

Query:  VIVLGFASAAITDEDIRDEFKHRTKHEEEMFVSRAHKHHVIKVGTMIRFLVKSFDEEILHISGSLVPSHTGSIHWLEKNSIEGSVTDRRRKKTRDNEGES
        VIVLGFASAAITDEDIRDEFKHRTKHEEEMFVSRAHKHHVIKVGTMIRFLVKSFDEEILHISGSLVPSHTGSIHWLEKNSIEGSVTDRRRKKTRDNEGES
Subjt:  VIVLGFASAAITDEDIRDEFKHRTKHEEEMFVSRAHKHHVIKVGTMIRFLVKSFDEEILHISGSLVPSHTGSIHWLEKNSIEGSVTDRRRKKTRDNEGES

Query:  LLQDSVATDVNALILNNDHQSKTKKQKTSRIS
        LLQDSVATDVNALILNNDHQSKTKKQKTSRIS
Subjt:  LLQDSVATDVNALILNNDHQSKTKKQKTSRIS

A0A6J1FN75 uncharacterized protein LOC111445794 isoform X12.01e-14692.24Show/hide
Query:  MEGLKVSDANLVVYVHPSKSKKVSQAVLRELGAMLLKFDERFEGVLLAYDAKITDKSAKILSGVHPYFGVTLKAKLLLFSPKPNMLLEGKVVKLRQESVH
        MEGLKVSDANLV+YVHPSKSKKVSQAVLRELGAMLLKFDERFEGVLLAY+A I DKSAKILSGVHPYFGVT+KAKLLLFSPKPNMLLEGKVVKLRQES+H
Subjt:  MEGLKVSDANLVVYVHPSKSKKVSQAVLRELGAMLLKFDERFEGVLLAYDAKITDKSAKILSGVHPYFGVTLKAKLLLFSPKPNMLLEGKVVKLRQESVH

Query:  VIVLGFASAAITDEDIRDEFKHRTKHEEEMFVSRAHKHHVIKVGTMIRFLVKSFDEEILHISGSLVPSHTGSIHWLEKNSIEGSVTDRRRKKTRDNEGES
        VIVLGFASA ITDEDIR+EFKHRTKH EEMFVSRA+KHHVIKVGTM+RFLVKSFDEEILHISGSLVPSHTGSIH LEKNS+EGSVT R RKKTRDN+ ES
Subjt:  VIVLGFASAAITDEDIRDEFKHRTKHEEEMFVSRAHKHHVIKVGTMIRFLVKSFDEEILHISGSLVPSHTGSIHWLEKNSIEGSVTDRRRKKTRDNEGES

Query:  LLQDSVATDVNALILNNDHQSKTKKQKTSRIS
        LLQDSVATDVNAL+LNNDHQSKTKKQKTSRIS
Subjt:  LLQDSVATDVNALILNNDHQSKTKKQKTSRIS

A0A6J1FPG3 uncharacterized protein LOC111445794 isoform X21.31e-14492.24Show/hide
Query:  MEGLKVSDANLVVYVHPSKSKKVSQAVLRELGAMLLKFDERFEGVLLAYDAKITDKSAKILSGVHPYFGVTLKAKLLLFSPKPNMLLEGKVVKLRQESVH
        MEGLKVSDANLV+YVHPSKSKKVSQAVLRELGAMLLKFDERFEGVLLAY+A I DKSAKILSGVHPYFGVT+KAKLLLFSPKPNMLLEGKVVKLRQES+H
Subjt:  MEGLKVSDANLVVYVHPSKSKKVSQAVLRELGAMLLKFDERFEGVLLAYDAKITDKSAKILSGVHPYFGVTLKAKLLLFSPKPNMLLEGKVVKLRQESVH

Query:  VIVLGFASAAITDEDIRDEFKHRTKHEEEMFVSRAHKHHVIKVGTMIRFLVKSFDEEILHISGSLVPSHTGSIHWLEKNSIEGSVTDRRRKKTRDNEGES
        VIVLGFASA ITDEDIR+EFKHRTKH EEMFVSRA+KHHVIKVGTM+RFLVKSFDEEILHISGSLVPSHTGSIH LEKNS+EGSVT R RKKTRDN+ ES
Subjt:  VIVLGFASAAITDEDIRDEFKHRTKHEEEMFVSRAHKHHVIKVGTMIRFLVKSFDEEILHISGSLVPSHTGSIHWLEKNSIEGSVTDRRRKKTRDNEGES

Query:  LLQDSVATDVNALILNNDHQSKTKKQKTSRIS
        LLQDSVATDVNAL+LNNDHQSKTKKQKTSRIS
Subjt:  LLQDSVATDVNALILNNDHQSKTKKQKTSRIS

A0A6J1IWP0 uncharacterized protein LOC111481277 isoform X17.92e-14391.63Show/hide
Query:  MEGLKVSDANLVVYVHPSKSKKVSQAVLRELGAMLLKFDERFEGVLLAYDAKITDKSAKILSGVHPYFGVTLKAKLLLFSPKPNMLLEGKVVKLRQESVH
        MEGLKVSDANLV+YVHPSKSKKVSQAVLR+LGAMLLKFDERFEGVLLAY+A I DKSAKILSGVHPYFGVT+KAKLLLFSPKPNMLLEGKVVKLRQES+H
Subjt:  MEGLKVSDANLVVYVHPSKSKKVSQAVLRELGAMLLKFDERFEGVLLAYDAKITDKSAKILSGVHPYFGVTLKAKLLLFSPKPNMLLEGKVVKLRQESVH

Query:  VIVLGFASAAITDEDIRDEFKHRTKHEEEMFVSRAHKHHVIKVGTMIRFLVKSFDEEILHISGSLVPSHTGSIHWLEKNSIEGSVTDRRRKKTRDNEGES
        VIVLGFASA ITDEDIR+EFKHRTKH EEMFVSRAHKHHVIKVGTM+RFLVKSFDEEILHISGSLVPSHTGSIH LEKNS+EGSVT+R RKK RDN+ ES
Subjt:  VIVLGFASAAITDEDIRDEFKHRTKHEEEMFVSRAHKHHVIKVGTMIRFLVKSFDEEILHISGSLVPSHTGSIHWLEKNSIEGSVTDRRRKKTRDNEGES

Query:  LLQDSVATDVNALILNNDHQSKTKKQK
        LLQDSVATDVNAL+LNNDHQSKTKKQK
Subjt:  LLQDSVATDVNALILNNDHQSKTKKQK

A0A6J1J4W1 uncharacterized protein LOC111481277 isoform X27.32e-14191.63Show/hide
Query:  MEGLKVSDANLVVYVHPSKSKKVSQAVLRELGAMLLKFDERFEGVLLAYDAKITDKSAKILSGVHPYFGVTLKAKLLLFSPKPNMLLEGKVVKLRQESVH
        MEGLKVSDANLV+YVHPSKSKKVSQAVLR+LGAMLLKFDERFEGVLLAY+A I DKSAKILSGVHPYFGVT+KAKLLLFSPKPNMLLEGKVVKLRQES+H
Subjt:  MEGLKVSDANLVVYVHPSKSKKVSQAVLRELGAMLLKFDERFEGVLLAYDAKITDKSAKILSGVHPYFGVTLKAKLLLFSPKPNMLLEGKVVKLRQESVH

Query:  VIVLGFASAAITDEDIRDEFKHRTKHEEEMFVSRAHKHHVIKVGTMIRFLVKSFDEEILHISGSLVPSHTGSIHWLEKNSIEGSVTDRRRKKTRDNEGES
        VIVLGFASA ITDEDIR+EFKHRTKH EEMFVSRAHKHHVIKVGTM+RFLVKSFDEEILHISGSLVPSHTGSIH LEKNS+EGSVT R RKK RDN+ ES
Subjt:  VIVLGFASAAITDEDIRDEFKHRTKHEEEMFVSRAHKHHVIKVGTMIRFLVKSFDEEILHISGSLVPSHTGSIHWLEKNSIEGSVTDRRRKKTRDNEGES

Query:  LLQDSVATDVNALILNNDHQSKTKKQK
        LLQDSVATDVNAL+LNNDHQSKTKKQK
Subjt:  LLQDSVATDVNALILNNDHQSKTKKQK

SwissProt top hitse value%identityAlignment
O43036 DNA-directed RNA polymerase I subunit rpa431.3e-0625.15Show/hide
Query:  NLVVYVHPSKSKKVSQAVLRELGAMLLKFDERFEGVLLAYD-AKITDKSAKILSGVHPYFGVTLKAKLLLFSPKPNMLLEGKVVKLRQESVHVIVLGFAS
        +L + + P  S+    A+   + +M+L    R  G++LAYD  +  +KSAK++    P+  + ++  +L+FSPK    LEGK+  +    + +++LG  +
Subjt:  NLVVYVHPSKSKKVSQAVLRELGAMLLKFDERFEGVLLAYD-AKITDKSAKILSGVHPYFGVTLKAKLLLFSPKPNMLLEGKVVKLRQESVHVIVLGFAS

Query:  AAITDEDI-RDEFKHRTKHEEEMFVSRAHKHHVIKVGTMIRFLVKSFDEE--ILHISGSLVPS
        A+I  + I +D         EE    + +  ++++ G  + F+V     E  +  + G+L  S
Subjt:  AAITDEDI-RDEFKHRTKHEEEMFVSRAHKHHVIKVGTMIRFLVKSFDEE--ILHISGSLVPS

Arabidopsis top hitse value%identityAlignment
AT1G75670.1 DNA-directed RNA polymerases2.8e-6256.19Show/hide
Query:  MEGLKVSDANLVVYVHPSKSKKVSQAVLRELGAMLLKFDERFEGVLLAYDAKITDKSAKILSGVHPYFGVTLKAKLLLFSPKPNMLLEGKVVKLRQESVH
        MEGLK+S+A L++++HPS+S+ V Q + REL ++L +++E F+GVLLAYDA +  K AKIL+G+HPYFGV +  +LLLF PKP   +EGK+VK+  ES+H
Subjt:  MEGLKVSDANLVVYVHPSKSKKVSQAVLRELGAMLLKFDERFEGVLLAYDAKITDKSAKILSGVHPYFGVTLKAKLLLFSPKPNMLLEGKVVKLRQESVH

Query:  VIVLGFASAAITDEDIRDEFKHRTKHEEEMFVSRAHKHHVIKVGTMIRFLVKSFDEEILHISGSLVPSHTGSIHWLEKNSIEGSVTDRRRKKTR
        VIVLGF++A ITD DIR+EFK+R +  E  FVSR+HK H +K+GTM+R  V+SFDEE++HI+GSL+P +TG + WLEK S E   TDR  K+ +
Subjt:  VIVLGFASAAITDEDIRDEFKHRTKHEEEMFVSRAHKHHVIKVGTMIRFLVKSFDEEILHISGSLVPSHTGSIHWLEKNSIEGSVTDRRRKKTR

AT1G75670.2 DNA-directed RNA polymerases2.8e-6256.19Show/hide
Query:  MEGLKVSDANLVVYVHPSKSKKVSQAVLRELGAMLLKFDERFEGVLLAYDAKITDKSAKILSGVHPYFGVTLKAKLLLFSPKPNMLLEGKVVKLRQESVH
        MEGLK+S+A L++++HPS+S+ V Q + REL ++L +++E F+GVLLAYDA +  K AKIL+G+HPYFGV +  +LLLF PKP   +EGK+VK+  ES+H
Subjt:  MEGLKVSDANLVVYVHPSKSKKVSQAVLRELGAMLLKFDERFEGVLLAYDAKITDKSAKILSGVHPYFGVTLKAKLLLFSPKPNMLLEGKVVKLRQESVH

Query:  VIVLGFASAAITDEDIRDEFKHRTKHEEEMFVSRAHKHHVIKVGTMIRFLVKSFDEEILHISGSLVPSHTGSIHWLEKNSIEGSVTDRRRKKTR
        VIVLGF++A ITD DIR+EFK+R +  E  FVSR+HK H +K+GTM+R  V+SFDEE++HI+GSL+P +TG + WLEK S E   TDR  K+ +
Subjt:  VIVLGFASAAITDEDIRDEFKHRTKHEEEMFVSRAHKHHVIKVGTMIRFLVKSFDEEILHISGSLVPSHTGSIHWLEKNSIEGSVTDRRRKKTR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAGGGACTGAAGGTTTCCGATGCCAATTTGGTCGTTTACGTTCACCCTTCCAAAAGTAAGAAGGTTTCGCAGGCGGTGCTGCGGGAGCTCGGCGCTATGCTTCTCAA
GTTTGATGAAAGATTTGAAGGCGTGCTACTGGCTTATGACGCCAAAATTACTGATAAAAGTGCAAAGATTCTATCTGGAGTGCATCCCTATTTTGGCGTGACACTAAAGG
CAAAACTTTTGCTTTTCTCTCCAAAACCAAACATGCTTTTAGAGGGAAAGGTGGTGAAGCTTAGGCAAGAATCGGTCCATGTTATTGTCCTAGGTTTTGCTTCTGCTGCA
ATAACCGATGAAGACATTCGAGATGAATTCAAGCATAGAACAAAACACGAGGAAGAAATGTTTGTCAGCAGAGCTCATAAGCACCATGTGATAAAGGTTGGGACAATGAT
ACGATTTTTGGTGAAGAGTTTTGATGAGGAAATATTGCATATATCTGGATCGTTAGTTCCATCTCACACAGGGAGCATCCATTGGTTGGAGAAGAATTCGATCGAGGGTT
CAGTAACTGATAGGAGAAGAAAGAAGACAAGAGACAACGAGGGAGAATCATTGTTGCAGGATAGTGTTGCTACCGATGTCAATGCACTTATCTTGAACAATGACCATCAG
TCAAAAACCAAAAAGCAGAAAACTAGCAGAATATCTTGA
mRNA sequenceShow/hide mRNA sequence
CCGCCCCAATGGCGTGTTGGGCCTGTAATCCATTATGTCTTTCAGGCCCAAATACTGACTAGGGCTCCACGGAGGGGAGGGGTAGGCGCTTGCTTGATTCCCTCGATTTT
GTTTGAACCGAACACCGGAAACGCACGTGTAGATACCGCTCCCTGTTTATAGTGTTTTTCTCCGGCCGCGGCGGGTTCTTCTCCTCCTTGTTTCGATGGAGGGACTGAAG
GTTTCCGATGCCAATTTGGTCGTTTACGTTCACCCTTCCAAAAGTAAGAAGGTTTCGCAGGCGGTGCTGCGGGAGCTCGGCGCTATGCTTCTCAAGTTTGATGAAAGATT
TGAAGGCGTGCTACTGGCTTATGACGCCAAAATTACTGATAAAAGTGCAAAGATTCTATCTGGAGTGCATCCCTATTTTGGCGTGACACTAAAGGCAAAACTTTTGCTTT
TCTCTCCAAAACCAAACATGCTTTTAGAGGGAAAGGTGGTGAAGCTTAGGCAAGAATCGGTCCATGTTATTGTCCTAGGTTTTGCTTCTGCTGCAATAACCGATGAAGAC
ATTCGAGATGAATTCAAGCATAGAACAAAACACGAGGAAGAAATGTTTGTCAGCAGAGCTCATAAGCACCATGTGATAAAGGTTGGGACAATGATACGATTTTTGGTGAA
GAGTTTTGATGAGGAAATATTGCATATATCTGGATCGTTAGTTCCATCTCACACAGGGAGCATCCATTGGTTGGAGAAGAATTCGATCGAGGGTTCAGTAACTGATAGGA
GAAGAAAGAAGACAAGAGACAACGAGGGAGAATCATTGTTGCAGGATAGTGTTGCTACCGATGTCAATGCACTTATCTTGAACAATGACCATCAGTCAAAAACCAAAAAG
CAGAAAACTAGCAGAATATCTTGAAGGCTGCTAATTGACATACAGCATAGATCGTTTCTGTATCAGGTATGGGCTGGGGGTGCTCTGTCGAGATCAACCCGCTGCAGGAT
GGAATGAATTAGAAGAGAGAATGGGTTTTCACAAAATTTGTGAACCGGGTTCACTCCATCCAACTCCCATGTGACCCATCACTGTTGTTTAATCAGATGAATACAAACAC
AGAAATATGACAAAGTTCTCGTTTTTTCCTTTTTCCTTTTTCCTTTTTCTGGCGAGAAAGCCGGATTTGTAACATGGGATGAACTATGAAGTAAGAAGTACAAAACTCTT
CAATCCAATCAAACTACAGTTTAAAAAAATGGTGTCTTTTCTCCT
Protein sequenceShow/hide protein sequence
MEGLKVSDANLVVYVHPSKSKKVSQAVLRELGAMLLKFDERFEGVLLAYDAKITDKSAKILSGVHPYFGVTLKAKLLLFSPKPNMLLEGKVVKLRQESVHVIVLGFASAA
ITDEDIRDEFKHRTKHEEEMFVSRAHKHHVIKVGTMIRFLVKSFDEEILHISGSLVPSHTGSIHWLEKNSIEGSVTDRRRKKTRDNEGESLLQDSVATDVNALILNNDHQ
SKTKKQKTSRIS