CuGenDBv2

MS021864 (gene) of Bitter gourd (TR) v1 genome

Gene ID: MS021864
Organism: Momordica charantia cv. TR (Bitter gourd (TR) v1)
Description: DNA-directed RNA polymerases
Genome location: scaffold1:862306..864145
RNA-Seq Expression: MS021864
Synteny: MS021864
Gene Ontology terms:
    GO:0006352 - DNA-templated transcription, initiation (biological process)
    GO:0005736 - RNA polymerase I complex (cellular component)
    GO:0003899 - DNA-directed 5'-3' RNA polymerase activity (molecular function)
InterPro domains:
    IPR036898 - RNA polymerase Rpb7-like, N-terminal domain superfamily
    IPR045113 - RNA polymerase Rpb7-like


Homology
GenBank top hits | e value | % identity
KAG7037901.1 rpa43, partial [Cucurbita argyrosperma subsp. argyrosperma] | 3.1e-111 | 92.21
Query:  MEGLKVSDANLVVYVHPSKSKKVSQAVLRELGAMLLKFDERFEGVLLAYDAKITDKSAKILSGVHPYFGVTLKAKLLLFSPKPNMLLEGKVVKLRQESVH
        MEGLKVSDANLV+YVHPSKSKKVSQAVLRELGAMLLKFDERFEGVLLAY+A I DKSAKILSGVHPYFGVT+KAKLLLFSPKPNMLLEGKVVKLRQES+H
Subjt:  MEGLKVSDANLVVYVHPSKSKKVSQAVLRELGAMLLKFDERFEGVLLAYDAKITDKSAKILSGVHPYFGVTLKAKLLLFSPKPNMLLEGKVVKLRQESVH

Query:  VIVLGFASAAITDEDIRDEFKHRTKHEEEMFVSRAHKHHVIKVGTMIRFLVKSFDEEILHISGSLVPSHTGSIHWLEKNSIEGSVTDRRKKTRDNEGESL
        VIVLGFASA ITDEDIRDEFKHRTKH EEMFVSRA+KHHVIKVGTM+RFLVKSFDEEILHISGSLVPSHTGSIH LEKNS+EGS T  RKKTRDN+ ESL
Subjt:  VIVLGFASAAITDEDIRDEFKHRTKHEEEMFVSRAHKHHVIKVGTMIRFLVKSFDEEILHISGSLVPSHTGSIHWLEKNSIEGSVTDRRKKTRDNEGESL

Query:  LQDSVATDVNALILNNDHQSKTKKQKTSRIS
        LQDSVATDVNAL+LNNDHQSKTKKQKTSRIS
Subjt:  LQDSVATDVNALILNNDHQSKTKKQKTSRIS

XP_022148351.1 uncharacterized protein LOC111016757 [Momordica charantia] | 1.0e-119 | 99.57
Query:  MEGLKVSDANLVVYVHPSKSKKVSQAVLRELGAMLLKFDERFEGVLLAYDAKITDKSAKILSGVHPYFGVTLKAKLLLFSPKPNMLLEGKVVKLRQESVH
        MEGLKVSDANLVVYVHPSKSKKVSQAVLRELGAMLLKFDERFEGVLLAYDAKITDKSAKILSGVHPYFGVTLKAKLLLFSPKPNMLLEGKVVKLRQESVH
Subjt:  MEGLKVSDANLVVYVHPSKSKKVSQAVLRELGAMLLKFDERFEGVLLAYDAKITDKSAKILSGVHPYFGVTLKAKLLLFSPKPNMLLEGKVVKLRQESVH

Query:  VIVLGFASAAITDEDIRDEFKHRTKHEEEMFVSRAHKHHVIKVGTMIRFLVKSFDEEILHISGSLVPSHTGSIHWLEKNSIEGSVTD-RRKKTRDNEGES
        VIVLGFASAAITDEDIRDEFKHRTKHEEEMFVSRAHKHHVIKVGTMIRFLVKSFDEEILHISGSLVPSHTGSIHWLEKNSIEGSVTD RRKKTRDNEGES
Subjt:  VIVLGFASAAITDEDIRDEFKHRTKHEEEMFVSRAHKHHVIKVGTMIRFLVKSFDEEILHISGSLVPSHTGSIHWLEKNSIEGSVTD-RRKKTRDNEGES

Query:  LLQDSVATDVNALILNNDHQSKTKKQKTSRIS
        LLQDSVATDVNALILNNDHQSKTKKQKTSRIS
Subjt:  LLQDSVATDVNALILNNDHQSKTKKQKTSRIS

XP_022940043.1 uncharacterized protein LOC111445794 isoform X1 [Cucurbita moschata] | 6.8e-111 | 92.24
Query:  MEGLKVSDANLVVYVHPSKSKKVSQAVLRELGAMLLKFDERFEGVLLAYDAKITDKSAKILSGVHPYFGVTLKAKLLLFSPKPNMLLEGKVVKLRQESVH
        MEGLKVSDANLV+YVHPSKSKKVSQAVLRELGAMLLKFDERFEGVLLAY+A I DKSAKILSGVHPYFGVT+KAKLLLFSPKPNMLLEGKVVKLRQES+H
Subjt:  MEGLKVSDANLVVYVHPSKSKKVSQAVLRELGAMLLKFDERFEGVLLAYDAKITDKSAKILSGVHPYFGVTLKAKLLLFSPKPNMLLEGKVVKLRQESVH

Query:  VIVLGFASAAITDEDIRDEFKHRTKHEEEMFVSRAHKHHVIKVGTMIRFLVKSFDEEILHISGSLVPSHTGSIHWLEKNSIEGSVTDR-RKKTRDNEGES
        VIVLGFASA ITDEDIR+EFKHRTKH EEMFVSRA+KHHVIKVGTM+RFLVKSFDEEILHISGSLVPSHTGSIH LEKNS+EGSVT R RKKTRDN+ ES
Subjt:  VIVLGFASAAITDEDIRDEFKHRTKHEEEMFVSRAHKHHVIKVGTMIRFLVKSFDEEILHISGSLVPSHTGSIHWLEKNSIEGSVTDR-RKKTRDNEGES

Query:  LLQDSVATDVNALILNNDHQSKTKKQKTSRIS
        LLQDSVATDVNAL+LNNDHQSKTKKQKTSRIS
Subjt:  LLQDSVATDVNALILNNDHQSKTKKQKTSRIS

XP_022940045.1 uncharacterized protein LOC111445794 isoform X2 [Cucurbita moschata] | 2.3e-111 | 92.21
Query:  MEGLKVSDANLVVYVHPSKSKKVSQAVLRELGAMLLKFDERFEGVLLAYDAKITDKSAKILSGVHPYFGVTLKAKLLLFSPKPNMLLEGKVVKLRQESVH
        MEGLKVSDANLV+YVHPSKSKKVSQAVLRELGAMLLKFDERFEGVLLAY+A I DKSAKILSGVHPYFGVT+KAKLLLFSPKPNMLLEGKVVKLRQES+H
Subjt:  MEGLKVSDANLVVYVHPSKSKKVSQAVLRELGAMLLKFDERFEGVLLAYDAKITDKSAKILSGVHPYFGVTLKAKLLLFSPKPNMLLEGKVVKLRQESVH

Query:  VIVLGFASAAITDEDIRDEFKHRTKHEEEMFVSRAHKHHVIKVGTMIRFLVKSFDEEILHISGSLVPSHTGSIHWLEKNSIEGSVTDRRKKTRDNEGESL
        VIVLGFASA ITDEDIR+EFKHRTKH EEMFVSRA+KHHVIKVGTM+RFLVKSFDEEILHISGSLVPSHTGSIH LEKNS+EGSVT  RKKTRDN+ ESL
Subjt:  VIVLGFASAAITDEDIRDEFKHRTKHEEEMFVSRAHKHHVIKVGTMIRFLVKSFDEEILHISGSLVPSHTGSIHWLEKNSIEGSVTDRRKKTRDNEGESL

Query:  LQDSVATDVNALILNNDHQSKTKKQKTSRIS
        LQDSVATDVNAL+LNNDHQSKTKKQKTSRIS
Subjt:  LQDSVATDVNALILNNDHQSKTKKQKTSRIS

XP_023523351.1 uncharacterized protein LOC111787571 [Cucurbita pepo subsp. pepo] | 3.4e-110 | 91.81
Query:  MEGLKVSDANLVVYVHPSKSKKVSQAVLRELGAMLLKFDERFEGVLLAYDAKITDKSAKILSGVHPYFGVTLKAKLLLFSPKPNMLLEGKVVKLRQESVH
        MEGLKVSDANLV+YVHPSKSKKVSQAVLRELGAMLLKFDERFEGVLLAY+A I DKSAKILSGVHPYFGVT+KAKLLLFSPKPNMLLEGKVVKLRQES+H
Subjt:  MEGLKVSDANLVVYVHPSKSKKVSQAVLRELGAMLLKFDERFEGVLLAYDAKITDKSAKILSGVHPYFGVTLKAKLLLFSPKPNMLLEGKVVKLRQESVH

Query:  VIVLGFASAAITDEDIRDEFKHRTKHEEEMFVSRAHKHHVIKVGTMIRFLVKSFDEEILHISGSLVPSHTGSIHWLEKNSIEGSVTDR-RKKTRDNEGES
        VIVLGFASA ITDEDIRDEFKHRTKH EEMFVSRA+KHHVIKVGTM+RFLVKSFDEEILHISGSLVPSHTGSIH LEKNS+EGSVT+R RKK RD + ES
Subjt:  VIVLGFASAAITDEDIRDEFKHRTKHEEEMFVSRAHKHHVIKVGTMIRFLVKSFDEEILHISGSLVPSHTGSIHWLEKNSIEGSVTDR-RKKTRDNEGES

Query:  LLQDSVATDVNALILNNDHQSKTKKQKTSRIS
        LLQDSVATDVNAL+LNNDHQSKTKKQKTSRIS
Subjt:  LLQDSVATDVNALILNNDHQSKTKKQKTSRIS

TrEMBL top hits | e value | % identity
A0A6J1D5Y9 uncharacterized protein LOC111016757 | 5.1e-120 | 99.57
Query:  MEGLKVSDANLVVYVHPSKSKKVSQAVLRELGAMLLKFDERFEGVLLAYDAKITDKSAKILSGVHPYFGVTLKAKLLLFSPKPNMLLEGKVVKLRQESVH
        MEGLKVSDANLVVYVHPSKSKKVSQAVLRELGAMLLKFDERFEGVLLAYDAKITDKSAKILSGVHPYFGVTLKAKLLLFSPKPNMLLEGKVVKLRQESVH
Subjt:  MEGLKVSDANLVVYVHPSKSKKVSQAVLRELGAMLLKFDERFEGVLLAYDAKITDKSAKILSGVHPYFGVTLKAKLLLFSPKPNMLLEGKVVKLRQESVH

Query:  VIVLGFASAAITDEDIRDEFKHRTKHEEEMFVSRAHKHHVIKVGTMIRFLVKSFDEEILHISGSLVPSHTGSIHWLEKNSIEGSVTD-RRKKTRDNEGES
        VIVLGFASAAITDEDIRDEFKHRTKHEEEMFVSRAHKHHVIKVGTMIRFLVKSFDEEILHISGSLVPSHTGSIHWLEKNSIEGSVTD RRKKTRDNEGES
Subjt:  VIVLGFASAAITDEDIRDEFKHRTKHEEEMFVSRAHKHHVIKVGTMIRFLVKSFDEEILHISGSLVPSHTGSIHWLEKNSIEGSVTD-RRKKTRDNEGES

Query:  LLQDSVATDVNALILNNDHQSKTKKQKTSRIS
        LLQDSVATDVNALILNNDHQSKTKKQKTSRIS
Subjt:  LLQDSVATDVNALILNNDHQSKTKKQKTSRIS

A0A6J1FN75 uncharacterized protein LOC111445794 isoform X1 | 3.3e-111 | 92.24
Query:  MEGLKVSDANLVVYVHPSKSKKVSQAVLRELGAMLLKFDERFEGVLLAYDAKITDKSAKILSGVHPYFGVTLKAKLLLFSPKPNMLLEGKVVKLRQESVH
        MEGLKVSDANLV+YVHPSKSKKVSQAVLRELGAMLLKFDERFEGVLLAY+A I DKSAKILSGVHPYFGVT+KAKLLLFSPKPNMLLEGKVVKLRQES+H
Subjt:  MEGLKVSDANLVVYVHPSKSKKVSQAVLRELGAMLLKFDERFEGVLLAYDAKITDKSAKILSGVHPYFGVTLKAKLLLFSPKPNMLLEGKVVKLRQESVH

Query:  VIVLGFASAAITDEDIRDEFKHRTKHEEEMFVSRAHKHHVIKVGTMIRFLVKSFDEEILHISGSLVPSHTGSIHWLEKNSIEGSVTDR-RKKTRDNEGES
        VIVLGFASA ITDEDIR+EFKHRTKH EEMFVSRA+KHHVIKVGTM+RFLVKSFDEEILHISGSLVPSHTGSIH LEKNS+EGSVT R RKKTRDN+ ES
Subjt:  VIVLGFASAAITDEDIRDEFKHRTKHEEEMFVSRAHKHHVIKVGTMIRFLVKSFDEEILHISGSLVPSHTGSIHWLEKNSIEGSVTDR-RKKTRDNEGES

Query:  LLQDSVATDVNALILNNDHQSKTKKQKTSRIS
        LLQDSVATDVNAL+LNNDHQSKTKKQKTSRIS
Subjt:  LLQDSVATDVNALILNNDHQSKTKKQKTSRIS

A0A6J1FPG3 uncharacterized protein LOC111445794 isoform X2 | 1.1e-111 | 92.21
Query:  MEGLKVSDANLVVYVHPSKSKKVSQAVLRELGAMLLKFDERFEGVLLAYDAKITDKSAKILSGVHPYFGVTLKAKLLLFSPKPNMLLEGKVVKLRQESVH
        MEGLKVSDANLV+YVHPSKSKKVSQAVLRELGAMLLKFDERFEGVLLAY+A I DKSAKILSGVHPYFGVT+KAKLLLFSPKPNMLLEGKVVKLRQES+H
Subjt:  MEGLKVSDANLVVYVHPSKSKKVSQAVLRELGAMLLKFDERFEGVLLAYDAKITDKSAKILSGVHPYFGVTLKAKLLLFSPKPNMLLEGKVVKLRQESVH

Query:  VIVLGFASAAITDEDIRDEFKHRTKHEEEMFVSRAHKHHVIKVGTMIRFLVKSFDEEILHISGSLVPSHTGSIHWLEKNSIEGSVTDRRKKTRDNEGESL
        VIVLGFASA ITDEDIR+EFKHRTKH EEMFVSRA+KHHVIKVGTM+RFLVKSFDEEILHISGSLVPSHTGSIH LEKNS+EGSVT  RKKTRDN+ ESL
Subjt:  VIVLGFASAAITDEDIRDEFKHRTKHEEEMFVSRAHKHHVIKVGTMIRFLVKSFDEEILHISGSLVPSHTGSIHWLEKNSIEGSVTDRRKKTRDNEGESL

Query:  LQDSVATDVNALILNNDHQSKTKKQKTSRIS
        LQDSVATDVNAL+LNNDHQSKTKKQKTSRIS
Subjt:  LQDSVATDVNALILNNDHQSKTKKQKTSRIS

A0A6J1IWP0 uncharacterized protein LOC111481277 isoform X1 | 9.0e-109 | 91.63
Query:  MEGLKVSDANLVVYVHPSKSKKVSQAVLRELGAMLLKFDERFEGVLLAYDAKITDKSAKILSGVHPYFGVTLKAKLLLFSPKPNMLLEGKVVKLRQESVH
        MEGLKVSDANLV+YVHPSKSKKVSQAVLR+LGAMLLKFDERFEGVLLAY+A I DKSAKILSGVHPYFGVT+KAKLLLFSPKPNMLLEGKVVKLRQES+H
Subjt:  MEGLKVSDANLVVYVHPSKSKKVSQAVLRELGAMLLKFDERFEGVLLAYDAKITDKSAKILSGVHPYFGVTLKAKLLLFSPKPNMLLEGKVVKLRQESVH

Query:  VIVLGFASAAITDEDIRDEFKHRTKHEEEMFVSRAHKHHVIKVGTMIRFLVKSFDEEILHISGSLVPSHTGSIHWLEKNSIEGSVTDR-RKKTRDNEGES
        VIVLGFASA ITDEDIR+EFKHRTKH EEMFVSRAHKHHVIKVGTM+RFLVKSFDEEILHISGSLVPSHTGSIH LEKNS+EGSVT+R RKK RDN+ ES
Subjt:  VIVLGFASAAITDEDIRDEFKHRTKHEEEMFVSRAHKHHVIKVGTMIRFLVKSFDEEILHISGSLVPSHTGSIHWLEKNSIEGSVTDR-RKKTRDNEGES

Query:  LLQDSVATDVNALILNNDHQSKTKKQK
        LLQDSVATDVNAL+LNNDHQSKTKKQK
Subjt:  LLQDSVATDVNALILNNDHQSKTKKQK

A0A6J1J4W1 uncharacterized protein LOC111481277 isoform X2 | 4.0e-109 | 91.59
Query:  MEGLKVSDANLVVYVHPSKSKKVSQAVLRELGAMLLKFDERFEGVLLAYDAKITDKSAKILSGVHPYFGVTLKAKLLLFSPKPNMLLEGKVVKLRQESVH
        MEGLKVSDANLV+YVHPSKSKKVSQAVLR+LGAMLLKFDERFEGVLLAY+A I DKSAKILSGVHPYFGVT+KAKLLLFSPKPNMLLEGKVVKLRQES+H
Subjt:  MEGLKVSDANLVVYVHPSKSKKVSQAVLRELGAMLLKFDERFEGVLLAYDAKITDKSAKILSGVHPYFGVTLKAKLLLFSPKPNMLLEGKVVKLRQESVH

Query:  VIVLGFASAAITDEDIRDEFKHRTKHEEEMFVSRAHKHHVIKVGTMIRFLVKSFDEEILHISGSLVPSHTGSIHWLEKNSIEGSVTDRRKKTRDNEGESL
        VIVLGFASA ITDEDIR+EFKHRTKH EEMFVSRAHKHHVIKVGTM+RFLVKSFDEEILHISGSLVPSHTGSIH LEKNS+EGSVT  RKK RDN+ ESL
Subjt:  VIVLGFASAAITDEDIRDEFKHRTKHEEEMFVSRAHKHHVIKVGTMIRFLVKSFDEEILHISGSLVPSHTGSIHWLEKNSIEGSVTDRRKKTRDNEGESL

Query:  LQDSVATDVNALILNNDHQSKTKKQK
        LQDSVATDVNAL+LNNDHQSKTKKQK
Subjt:  LQDSVATDVNALILNNDHQSKTKKQK

SwissProt top hits | e value | % identity
O43036 DNA-directed RNA polymerase I subunit rpa43 | 1.2e-06 | 25.15
Query:  NLVVYVHPSKSKKVSQAVLRELGAMLLKFDERFEGVLLAYD-AKITDKSAKILSGVHPYFGVTLKAKLLLFSPKPNMLLEGKVVKLRQESVHVIVLGFAS
        +L + + P  S+    A+   + +M+L    R  G++LAYD  +  +KSAK++    P+  + ++  +L+FSPK    LEGK+  +    + +++LG  +
Subjt:  NLVVYVHPSKSKKVSQAVLRELGAMLLKFDERFEGVLLAYD-AKITDKSAKILSGVHPYFGVTLKAKLLLFSPKPNMLLEGKVVKLRQESVHVIVLGFAS

Query:  AAITDEDI-RDEFKHRTKHEEEMFVSRAHKHHVIKVGTMIRFLVKSFDEE--ILHISGSLVPS
        A+I  + I +D         EE    + +  ++++ G  + F+V     E  +  + G+L  S
Subjt:  AAITDEDI-RDEFKHRTKHEEEMFVSRAHKHHVIKVGTMIRFLVKSFDEE--ILHISGSLVPS

Arabidopsis top hits | e value | % identity
AT1G75670.1 DNA-directed RNA polymerases | 2.8e-62 | 56.99
Query:  MEGLKVSDANLVVYVHPSKSKKVSQAVLRELGAMLLKFDERFEGVLLAYDAKITDKSAKILSGVHPYFGVTLKAKLLLFSPKPNMLLEGKVVKLRQESVH
        MEGLK+S+A L++++HPS+S+ V Q + REL ++L +++E F+GVLLAYDA +  K AKIL+G+HPYFGV +  +LLLF PKP   +EGK+VK+  ES+H
Subjt:  MEGLKVSDANLVVYVHPSKSKKVSQAVLRELGAMLLKFDERFEGVLLAYDAKITDKSAKILSGVHPYFGVTLKAKLLLFSPKPNMLLEGKVVKLRQESVH

Query:  VIVLGFASAAITDEDIRDEFKHRTKHEEEMFVSRAHKHHVIKVGTMIRFLVKSFDEEILHISGSLVPSHTGSIHWLEKNSIEGSVTDRRKKTR
        VIVLGF++A ITD DIR+EFK+R +  E  FVSR+HK H +K+GTM+R  V+SFDEE++HI+GSL+P +TG + WLEK S E   TDR  K R
Subjt:  VIVLGFASAAITDEDIRDEFKHRTKHEEEMFVSRAHKHHVIKVGTMIRFLVKSFDEEILHISGSLVPSHTGSIHWLEKNSIEGSVTDRRKKTR

AT1G75670.2 DNA-directed RNA polymerases | 2.8e-62 | 56.99
Query:  MEGLKVSDANLVVYVHPSKSKKVSQAVLRELGAMLLKFDERFEGVLLAYDAKITDKSAKILSGVHPYFGVTLKAKLLLFSPKPNMLLEGKVVKLRQESVH
        MEGLK+S+A L++++HPS+S+ V Q + REL ++L +++E F+GVLLAYDA +  K AKIL+G+HPYFGV +  +LLLF PKP   +EGK+VK+  ES+H
Subjt:  MEGLKVSDANLVVYVHPSKSKKVSQAVLRELGAMLLKFDERFEGVLLAYDAKITDKSAKILSGVHPYFGVTLKAKLLLFSPKPNMLLEGKVVKLRQESVH

Query:  VIVLGFASAAITDEDIRDEFKHRTKHEEEMFVSRAHKHHVIKVGTMIRFLVKSFDEEILHISGSLVPSHTGSIHWLEKNSIEGSVTDRRKKTR
        VIVLGF++A ITD DIR+EFK+R +  E  FVSR+HK H +K+GTM+R  V+SFDEE++HI+GSL+P +TG + WLEK S E   TDR  K R
Subjt:  VIVLGFASAAITDEDIRDEFKHRTKHEEEMFVSRAHKHHVIKVGTMIRFLVKSFDEEILHISGSLVPSHTGSIHWLEKNSIEGSVTDRRKKTR


Sequences
CDS sequence
ATGGAGGGACTGAAGGTTTCCGATGCCAATTTGGTCGTTTACGTTCACCCTTCCAAAAGTAAGAAGGTTTCGCAGGCGGTGCTGCGGGAGCTCGGCGCTATGCTTCTCAA
GTTTGATGAAAGATTTGAAGGCGTGCTACTGGCTTATGACGCCAAAATTACTGATAAAAGTGCAAAGATTCTATCTGGAGTGCATCCCTATTTTGGCGTGACACTAAAGG
CAAAACTTTTGCTTTTCTCTCCAAAACCAAACATGCTTTTAGAGGGAAAGGTGGTGAAGCTTAGGCAAGAATCGGTCCATGTTATTGTCCTAGGTTTTGCTTCTGCTGCA
ATAACTGATGAAGACATTCGAGATGAATTCAAGCATAGAACAAAACACGAGGAAGAAATGTTTGTCAGCAGAGCTCATAAGCACCATGTGATAAAGGTTGGGACGATGAT
ACGATTTTTGGTGAAGAGTTTTGATGAGGAAATATTGCATATATCTGGATCGTTAGTTCCATCTCACACAGGGAGCATCCATTGGTTGGAGAAGAATTCGATCGAGGGTT
CAGTAACTGATAGAAGAAAGAAGACAAGAGACAACGAGGGAGAATCATTGTTGCAGGATAGTGTTGCTACCGATGTCAATGCACTTATCTTGAACAATGACCATCAGTCA
AAAACCAAAAAGCAGAAAACTAGCAGAATATCT
mRNA sequence
ATGGAGGGACTGAAGGTTTCCGATGCCAATTTGGTCGTTTACGTTCACCCTTCCAAAAGTAAGAAGGTTTCGCAGGCGGTGCTGCGGGAGCTCGGCGCTATGCTTCTCAA
GTTTGATGAAAGATTTGAAGGCGTGCTACTGGCTTATGACGCCAAAATTACTGATAAAAGTGCAAAGATTCTATCTGGAGTGCATCCCTATTTTGGCGTGACACTAAAGG
CAAAACTTTTGCTTTTCTCTCCAAAACCAAACATGCTTTTAGAGGGAAAGGTGGTGAAGCTTAGGCAAGAATCGGTCCATGTTATTGTCCTAGGTTTTGCTTCTGCTGCA
ATAACTGATGAAGACATTCGAGATGAATTCAAGCATAGAACAAAACACGAGGAAGAAATGTTTGTCAGCAGAGCTCATAAGCACCATGTGATAAAGGTTGGGACGATGAT
ACGATTTTTGGTGAAGAGTTTTGATGAGGAAATATTGCATATATCTGGATCGTTAGTTCCATCTCACACAGGGAGCATCCATTGGTTGGAGAAGAATTCGATCGAGGGTT
CAGTAACTGATAGAAGAAAGAAGACAAGAGACAACGAGGGAGAATCATTGTTGCAGGATAGTGTTGCTACCGATGTCAATGCACTTATCTTGAACAATGACCATCAGTCA
AAAACCAAAAAGCAGAAAACTAGCAGAATATCT
Protein sequence
MEGLKVSDANLVVYVHPSKSKKVSQAVLRELGAMLLKFDERFEGVLLAYDAKITDKSAKILSGVHPYFGVTLKAKLLLFSPKPNMLLEGKVVKLRQESVHVIVLGFASAA
ITDEDIRDEFKHRTKHEEEMFVSRAHKHHVIKVGTMIRFLVKSFDEEILHISGSLVPSHTGSIHWLEKNSIEGSVTDRRKKTRDNEGESLLQDSVATDVNALILNNDHQS
KTKKQKTSRIS
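
Consistency check (illustrative sketch)
The CDS and protein sequences listed above can be cross-checked by translating the CDS with the standard genetic code. The following is a minimal sketch, not part of the database record: the names `translate`, `CODON_TABLE`, `CDS_START`, and `PROTEIN_START` are hypothetical, and only the first 30 nucleotides of the CDS and the first 10 residues of the protein are used to keep the example short. It assumes the standard nuclear codon table applies to this gene.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order
# (first base varies slowest, third base fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence (DNA, 5'->3') into one-letter amino acids."""
    cds = "".join(cds.split()).upper()  # drop line breaks and whitespace
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# First 30 nt of the CDS and first 10 residues of the protein, copied from the record.
CDS_START = "ATGGAGGGACTGAAGGTTTCCGATGCCAAT"
PROTEIN_START = "MEGLKVSDAN"

print(translate(CDS_START) == PROTEIN_START)
```

To check the whole record, the full CDS can be pasted in as a multi-line string (whitespace is stripped before translation) and the result compared against the full protein sequence.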