; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmaCh01G019600 (gene) of Cucurbita maxima (Rimu) v1.1 genome

Gene IDCmaCh01G019600
OrganismCucurbita maxima Rimu (Cucurbita maxima (Rimu) v1.1)
DescriptionNucleic acid-binding, OB-fold containing protein
Genome locationCma_Chr01:12660087..12662018
RNA-Seq ExpressionCmaCh01G019600
SyntenyCmaCh01G019600
Gene Ontology termsGO:0006352 - DNA-templated transcription, initiation (biological process)
GO:0005736 - RNA polymerase I complex (cellular component)
GO:0003899 - DNA-directed 5'-3' RNA polymerase activity (molecular function)
InterPro domainsIPR036898 - RNA polymerase Rpb7-like, N-terminal domain superfamily
IPR045113 - RNA polymerase Rpb7-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022940043.1 uncharacterized protein LOC111445794 isoform X1 [Cucurbita moschata]1.4e-11698.24Show/hide
Query:  MEGLKVSDANLVIYVHPSKSKKVSQAVLRDLGAMLLKFDERFEGVLLAYEANIIDKSAKILSGVHPYFGVTIKAKLLLFSPKPNMLLEGKVVKLRQESIH
        MEGLKVSDANLVIYVHPSKSKKVSQAVLR+LGAMLLKFDERFEGVLLAYEANIIDKSAKILSGVHPYFGVTIKAKLLLFSPKPNMLLEGKVVKLRQESIH
Subjt:  MEGLKVSDANLVIYVHPSKSKKVSQAVLRDLGAMLLKFDERFEGVLLAYEANIIDKSAKILSGVHPYFGVTIKAKLLLFSPKPNMLLEGKVVKLRQESIH

Query:  VIVLGFASAVITDEDIRNEFKHRTKHGEEMFVSRAHKHHVIKVGTMVRFLVKSFDEEILHISGSLVPSHTGSIHCLEKNSVEGSVTNRSRKKMRDNDRES
        VIVLGFASAVITDEDIRNEFKHRTKHGEEMFVSRA+KHHVIKVGTMVRFLVKSFDEEILHISGSLVPSHTGSIHCLEKNSVEGSVT+RSRKK RDNDRES
Subjt:  VIVLGFASAVITDEDIRNEFKHRTKHGEEMFVSRAHKHHVIKVGTMVRFLVKSFDEEILHISGSLVPSHTGSIHCLEKNSVEGSVTNRSRKKMRDNDRES

Query:  LLQDSVATDVNALLLNNDHQSKTKKQK
        LLQDSVATDVNALLLNNDHQSKTKKQK
Subjt:  LLQDSVATDVNALLLNNDHQSKTKKQK

XP_022940045.1 uncharacterized protein LOC111445794 isoform X2 [Cucurbita moschata]4.5e-11598.24Show/hide
Query:  MEGLKVSDANLVIYVHPSKSKKVSQAVLRDLGAMLLKFDERFEGVLLAYEANIIDKSAKILSGVHPYFGVTIKAKLLLFSPKPNMLLEGKVVKLRQESIH
        MEGLKVSDANLVIYVHPSKSKKVSQAVLR+LGAMLLKFDERFEGVLLAYEANIIDKSAKILSGVHPYFGVTIKAKLLLFSPKPNMLLEGKVVKLRQESIH
Subjt:  MEGLKVSDANLVIYVHPSKSKKVSQAVLRDLGAMLLKFDERFEGVLLAYEANIIDKSAKILSGVHPYFGVTIKAKLLLFSPKPNMLLEGKVVKLRQESIH

Query:  VIVLGFASAVITDEDIRNEFKHRTKHGEEMFVSRAHKHHVIKVGTMVRFLVKSFDEEILHISGSLVPSHTGSIHCLEKNSVEGSVTNRSRKKMRDNDRES
        VIVLGFASAVITDEDIRNEFKHRTKHGEEMFVSRA+KHHVIKVGTMVRFLVKSFDEEILHISGSLVPSHTGSIHCLEKNSVEGSVT RSRKK RDNDRES
Subjt:  VIVLGFASAVITDEDIRNEFKHRTKHGEEMFVSRAHKHHVIKVGTMVRFLVKSFDEEILHISGSLVPSHTGSIHCLEKNSVEGSVTNRSRKKMRDNDRES

Query:  LLQDSVATDVNALLLNNDHQSKTKKQK
        LLQDSVATDVNALLLNNDHQSKTKKQK
Subjt:  LLQDSVATDVNALLLNNDHQSKTKKQK

XP_022982461.1 uncharacterized protein LOC111481277 isoform X1 [Cucurbita maxima]1.4e-119100Show/hide
Query:  MEGLKVSDANLVIYVHPSKSKKVSQAVLRDLGAMLLKFDERFEGVLLAYEANIIDKSAKILSGVHPYFGVTIKAKLLLFSPKPNMLLEGKVVKLRQESIH
        MEGLKVSDANLVIYVHPSKSKKVSQAVLRDLGAMLLKFDERFEGVLLAYEANIIDKSAKILSGVHPYFGVTIKAKLLLFSPKPNMLLEGKVVKLRQESIH
Subjt:  MEGLKVSDANLVIYVHPSKSKKVSQAVLRDLGAMLLKFDERFEGVLLAYEANIIDKSAKILSGVHPYFGVTIKAKLLLFSPKPNMLLEGKVVKLRQESIH

Query:  VIVLGFASAVITDEDIRNEFKHRTKHGEEMFVSRAHKHHVIKVGTMVRFLVKSFDEEILHISGSLVPSHTGSIHCLEKNSVEGSVTNRSRKKMRDNDRES
        VIVLGFASAVITDEDIRNEFKHRTKHGEEMFVSRAHKHHVIKVGTMVRFLVKSFDEEILHISGSLVPSHTGSIHCLEKNSVEGSVTNRSRKKMRDNDRES
Subjt:  VIVLGFASAVITDEDIRNEFKHRTKHGEEMFVSRAHKHHVIKVGTMVRFLVKSFDEEILHISGSLVPSHTGSIHCLEKNSVEGSVTNRSRKKMRDNDRES

Query:  LLQDSVATDVNALLLNNDHQSKTKKQKN
        LLQDSVATDVNALLLNNDHQSKTKKQKN
Subjt:  LLQDSVATDVNALLLNNDHQSKTKKQKN

XP_022982463.1 uncharacterized protein LOC111481277 isoform X2 [Cucurbita maxima]1.3e-11799.56Show/hide
Query:  MEGLKVSDANLVIYVHPSKSKKVSQAVLRDLGAMLLKFDERFEGVLLAYEANIIDKSAKILSGVHPYFGVTIKAKLLLFSPKPNMLLEGKVVKLRQESIH
        MEGLKVSDANLVIYVHPSKSKKVSQAVLRDLGAMLLKFDERFEGVLLAYEANIIDKSAKILSGVHPYFGVTIKAKLLLFSPKPNMLLEGKVVKLRQESIH
Subjt:  MEGLKVSDANLVIYVHPSKSKKVSQAVLRDLGAMLLKFDERFEGVLLAYEANIIDKSAKILSGVHPYFGVTIKAKLLLFSPKPNMLLEGKVVKLRQESIH

Query:  VIVLGFASAVITDEDIRNEFKHRTKHGEEMFVSRAHKHHVIKVGTMVRFLVKSFDEEILHISGSLVPSHTGSIHCLEKNSVEGSVTNRSRKKMRDNDRES
        VIVLGFASAVITDEDIRNEFKHRTKHGEEMFVSRAHKHHVIKVGTMVRFLVKSFDEEILHISGSLVPSHTGSIHCLEKNSVEGSVT RSRKKMRDNDRES
Subjt:  VIVLGFASAVITDEDIRNEFKHRTKHGEEMFVSRAHKHHVIKVGTMVRFLVKSFDEEILHISGSLVPSHTGSIHCLEKNSVEGSVTNRSRKKMRDNDRES

Query:  LLQDSVATDVNALLLNNDHQSKTKKQKN
        LLQDSVATDVNALLLNNDHQSKTKKQKN
Subjt:  LLQDSVATDVNALLLNNDHQSKTKKQKN

XP_023523351.1 uncharacterized protein LOC111787571 [Cucurbita pepo subsp. pepo]1.4e-11698.24Show/hide
Query:  MEGLKVSDANLVIYVHPSKSKKVSQAVLRDLGAMLLKFDERFEGVLLAYEANIIDKSAKILSGVHPYFGVTIKAKLLLFSPKPNMLLEGKVVKLRQESIH
        MEGLKVSDANLVIYVHPSKSKKVSQAVLR+LGAMLLKFDERFEGVLLAYEANIIDKSAKILSGVHPYFGVTIKAKLLLFSPKPNMLLEGKVVKLRQESIH
Subjt:  MEGLKVSDANLVIYVHPSKSKKVSQAVLRDLGAMLLKFDERFEGVLLAYEANIIDKSAKILSGVHPYFGVTIKAKLLLFSPKPNMLLEGKVVKLRQESIH

Query:  VIVLGFASAVITDEDIRNEFKHRTKHGEEMFVSRAHKHHVIKVGTMVRFLVKSFDEEILHISGSLVPSHTGSIHCLEKNSVEGSVTNRSRKKMRDNDRES
        VIVLGFASAVITDEDIR+EFKHRTKHGEEMFVSRA+KHHVIKVGTMVRFLVKSFDEEILHISGSLVPSHTGSIHCLEKNSVEGSVTNRSRKKMRD DRES
Subjt:  VIVLGFASAVITDEDIRNEFKHRTKHGEEMFVSRAHKHHVIKVGTMVRFLVKSFDEEILHISGSLVPSHTGSIHCLEKNSVEGSVTNRSRKKMRDNDRES

Query:  LLQDSVATDVNALLLNNDHQSKTKKQK
        LLQDSVATDVNALLLNNDHQSKTKKQK
Subjt:  LLQDSVATDVNALLLNNDHQSKTKKQK

TrEMBL top hitse value%identityAlignment
A0A6J1D5Y9 uncharacterized protein LOC1110167574.7e-11091.63Show/hide
Query:  MEGLKVSDANLVIYVHPSKSKKVSQAVLRDLGAMLLKFDERFEGVLLAYEANIIDKSAKILSGVHPYFGVTIKAKLLLFSPKPNMLLEGKVVKLRQESIH
        MEGLKVSDANLV+YVHPSKSKKVSQAVLR+LGAMLLKFDERFEGVLLAY+A I DKSAKILSGVHPYFGVT+KAKLLLFSPKPNMLLEGKVVKLRQES+H
Subjt:  MEGLKVSDANLVIYVHPSKSKKVSQAVLRDLGAMLLKFDERFEGVLLAYEANIIDKSAKILSGVHPYFGVTIKAKLLLFSPKPNMLLEGKVVKLRQESIH

Query:  VIVLGFASAVITDEDIRNEFKHRTKHGEEMFVSRAHKHHVIKVGTMVRFLVKSFDEEILHISGSLVPSHTGSIHCLEKNSVEGSVTNRSRKKMRDNDRES
        VIVLGFASA ITDEDIR+EFKHRTKH EEMFVSRAHKHHVIKVGTM+RFLVKSFDEEILHISGSLVPSHTGSIH LEKNS+EGSVT+R RKK RDN+ ES
Subjt:  VIVLGFASAVITDEDIRNEFKHRTKHGEEMFVSRAHKHHVIKVGTMVRFLVKSFDEEILHISGSLVPSHTGSIHCLEKNSVEGSVTNRSRKKMRDNDRES

Query:  LLQDSVATDVNALLLNNDHQSKTKKQK
        LLQDSVATDVNAL+LNNDHQSKTKKQK
Subjt:  LLQDSVATDVNALLLNNDHQSKTKKQK

A0A6J1FN75 uncharacterized protein LOC111445794 isoform X16.8e-11798.24Show/hide
Query:  MEGLKVSDANLVIYVHPSKSKKVSQAVLRDLGAMLLKFDERFEGVLLAYEANIIDKSAKILSGVHPYFGVTIKAKLLLFSPKPNMLLEGKVVKLRQESIH
        MEGLKVSDANLVIYVHPSKSKKVSQAVLR+LGAMLLKFDERFEGVLLAYEANIIDKSAKILSGVHPYFGVTIKAKLLLFSPKPNMLLEGKVVKLRQESIH
Subjt:  MEGLKVSDANLVIYVHPSKSKKVSQAVLRDLGAMLLKFDERFEGVLLAYEANIIDKSAKILSGVHPYFGVTIKAKLLLFSPKPNMLLEGKVVKLRQESIH

Query:  VIVLGFASAVITDEDIRNEFKHRTKHGEEMFVSRAHKHHVIKVGTMVRFLVKSFDEEILHISGSLVPSHTGSIHCLEKNSVEGSVTNRSRKKMRDNDRES
        VIVLGFASAVITDEDIRNEFKHRTKHGEEMFVSRA+KHHVIKVGTMVRFLVKSFDEEILHISGSLVPSHTGSIHCLEKNSVEGSVT+RSRKK RDNDRES
Subjt:  VIVLGFASAVITDEDIRNEFKHRTKHGEEMFVSRAHKHHVIKVGTMVRFLVKSFDEEILHISGSLVPSHTGSIHCLEKNSVEGSVTNRSRKKMRDNDRES

Query:  LLQDSVATDVNALLLNNDHQSKTKKQK
        LLQDSVATDVNALLLNNDHQSKTKKQK
Subjt:  LLQDSVATDVNALLLNNDHQSKTKKQK

A0A6J1FPG3 uncharacterized protein LOC111445794 isoform X22.2e-11598.24Show/hide
Query:  MEGLKVSDANLVIYVHPSKSKKVSQAVLRDLGAMLLKFDERFEGVLLAYEANIIDKSAKILSGVHPYFGVTIKAKLLLFSPKPNMLLEGKVVKLRQESIH
        MEGLKVSDANLVIYVHPSKSKKVSQAVLR+LGAMLLKFDERFEGVLLAYEANIIDKSAKILSGVHPYFGVTIKAKLLLFSPKPNMLLEGKVVKLRQESIH
Subjt:  MEGLKVSDANLVIYVHPSKSKKVSQAVLRDLGAMLLKFDERFEGVLLAYEANIIDKSAKILSGVHPYFGVTIKAKLLLFSPKPNMLLEGKVVKLRQESIH

Query:  VIVLGFASAVITDEDIRNEFKHRTKHGEEMFVSRAHKHHVIKVGTMVRFLVKSFDEEILHISGSLVPSHTGSIHCLEKNSVEGSVTNRSRKKMRDNDRES
        VIVLGFASAVITDEDIRNEFKHRTKHGEEMFVSRA+KHHVIKVGTMVRFLVKSFDEEILHISGSLVPSHTGSIHCLEKNSVEGSVT RSRKK RDNDRES
Subjt:  VIVLGFASAVITDEDIRNEFKHRTKHGEEMFVSRAHKHHVIKVGTMVRFLVKSFDEEILHISGSLVPSHTGSIHCLEKNSVEGSVTNRSRKKMRDNDRES

Query:  LLQDSVATDVNALLLNNDHQSKTKKQK
        LLQDSVATDVNALLLNNDHQSKTKKQK
Subjt:  LLQDSVATDVNALLLNNDHQSKTKKQK

A0A6J1IWP0 uncharacterized protein LOC111481277 isoform X16.6e-120100Show/hide
Query:  MEGLKVSDANLVIYVHPSKSKKVSQAVLRDLGAMLLKFDERFEGVLLAYEANIIDKSAKILSGVHPYFGVTIKAKLLLFSPKPNMLLEGKVVKLRQESIH
        MEGLKVSDANLVIYVHPSKSKKVSQAVLRDLGAMLLKFDERFEGVLLAYEANIIDKSAKILSGVHPYFGVTIKAKLLLFSPKPNMLLEGKVVKLRQESIH
Subjt:  MEGLKVSDANLVIYVHPSKSKKVSQAVLRDLGAMLLKFDERFEGVLLAYEANIIDKSAKILSGVHPYFGVTIKAKLLLFSPKPNMLLEGKVVKLRQESIH

Query:  VIVLGFASAVITDEDIRNEFKHRTKHGEEMFVSRAHKHHVIKVGTMVRFLVKSFDEEILHISGSLVPSHTGSIHCLEKNSVEGSVTNRSRKKMRDNDRES
        VIVLGFASAVITDEDIRNEFKHRTKHGEEMFVSRAHKHHVIKVGTMVRFLVKSFDEEILHISGSLVPSHTGSIHCLEKNSVEGSVTNRSRKKMRDNDRES
Subjt:  VIVLGFASAVITDEDIRNEFKHRTKHGEEMFVSRAHKHHVIKVGTMVRFLVKSFDEEILHISGSLVPSHTGSIHCLEKNSVEGSVTNRSRKKMRDNDRES

Query:  LLQDSVATDVNALLLNNDHQSKTKKQKN
        LLQDSVATDVNALLLNNDHQSKTKKQKN
Subjt:  LLQDSVATDVNALLLNNDHQSKTKKQKN

A0A6J1J4W1 uncharacterized protein LOC111481277 isoform X26.1e-11899.56Show/hide
Query:  MEGLKVSDANLVIYVHPSKSKKVSQAVLRDLGAMLLKFDERFEGVLLAYEANIIDKSAKILSGVHPYFGVTIKAKLLLFSPKPNMLLEGKVVKLRQESIH
        MEGLKVSDANLVIYVHPSKSKKVSQAVLRDLGAMLLKFDERFEGVLLAYEANIIDKSAKILSGVHPYFGVTIKAKLLLFSPKPNMLLEGKVVKLRQESIH
Subjt:  MEGLKVSDANLVIYVHPSKSKKVSQAVLRDLGAMLLKFDERFEGVLLAYEANIIDKSAKILSGVHPYFGVTIKAKLLLFSPKPNMLLEGKVVKLRQESIH

Query:  VIVLGFASAVITDEDIRNEFKHRTKHGEEMFVSRAHKHHVIKVGTMVRFLVKSFDEEILHISGSLVPSHTGSIHCLEKNSVEGSVTNRSRKKMRDNDRES
        VIVLGFASAVITDEDIRNEFKHRTKHGEEMFVSRAHKHHVIKVGTMVRFLVKSFDEEILHISGSLVPSHTGSIHCLEKNSVEGSVT RSRKKMRDNDRES
Subjt:  VIVLGFASAVITDEDIRNEFKHRTKHGEEMFVSRAHKHHVIKVGTMVRFLVKSFDEEILHISGSLVPSHTGSIHCLEKNSVEGSVTNRSRKKMRDNDRES

Query:  LLQDSVATDVNALLLNNDHQSKTKKQKN
        LLQDSVATDVNALLLNNDHQSKTKKQKN
Subjt:  LLQDSVATDVNALLLNNDHQSKTKKQKN

SwissProt top hitse value%identityAlignment
O43036 DNA-directed RNA polymerase I subunit rpa433.6e-0624.54Show/hide
Query:  NLVIYVHPSKSKKVSQAVLRDLGAMLLKFDERFEGVLLAYE-ANIIDKSAKILSGVHPYFGVTIKAKLLLFSPKPNMLLEGKVVKLRQESIHVIVLGFAS
        +L + + P  S+    A+   + +M+L    R  G++LAY+    ++KSAK++    P+  + ++  +L+FSPK    LEGK+  +    I +++LG  +
Subjt:  NLVIYVHPSKSKKVSQAVLRDLGAMLLKFDERFEGVLLAYE-ANIIDKSAKILSGVHPYFGVTIKAKLLLFSPKPNMLLEGKVVKLRQESIHVIVLGFAS

Query:  AVITDEDIRNEFKH-RTKHGEEMFVSRAHKHHVIKVGTMVRFLVKSFDEE--ILHISGSLVPS
        A I  + I  ++        EE    + +  ++++ G  + F+V     E  +  + G+L  S
Subjt:  AVITDEDIRNEFKH-RTKHGEEMFVSRAHKHHVIKVGTMVRFLVKSFDEE--ILHISGSLVPS

Arabidopsis top hitse value%identityAlignment
AT1G75670.1 DNA-directed RNA polymerases2.3e-6156.19Show/hide
Query:  MEGLKVSDANLVIYVHPSKSKKVSQAVLRDLGAMLLKFDERFEGVLLAYEANIIDKSAKILSGVHPYFGVTIKAKLLLFSPKPNMLLEGKVVKLRQESIH
        MEGLK+S+A L+I++HPS+S+ V Q + R+L ++L +++E F+GVLLAY+A +  K AKIL+G+HPYFGV +  +LLLF PKP   +EGK+VK+  ESIH
Subjt:  MEGLKVSDANLVIYVHPSKSKKVSQAVLRDLGAMLLKFDERFEGVLLAYEANIIDKSAKILSGVHPYFGVTIKAKLLLFSPKPNMLLEGKVVKLRQESIH

Query:  VIVLGFASAVITDEDIRNEFKHRTKHGEEMFVSRAHKHHVIKVGTMVRFLVKSFDEEILHISGSLVPSHTGSIHCLEKNSVEGSVTNRSRKKMR
        VIVLGF++AVITD DIR EFK+R + GE  FVSR+HK H +K+GTM+R  V+SFDEE++HI+GSL+P +TG +  LEK S E   T+R  K+ +
Subjt:  VIVLGFASAVITDEDIRNEFKHRTKHGEEMFVSRAHKHHVIKVGTMVRFLVKSFDEEILHISGSLVPSHTGSIHCLEKNSVEGSVTNRSRKKMR

AT1G75670.2 DNA-directed RNA polymerases2.3e-6156.19Show/hide
Query:  MEGLKVSDANLVIYVHPSKSKKVSQAVLRDLGAMLLKFDERFEGVLLAYEANIIDKSAKILSGVHPYFGVTIKAKLLLFSPKPNMLLEGKVVKLRQESIH
        MEGLK+S+A L+I++HPS+S+ V Q + R+L ++L +++E F+GVLLAY+A +  K AKIL+G+HPYFGV +  +LLLF PKP   +EGK+VK+  ESIH
Subjt:  MEGLKVSDANLVIYVHPSKSKKVSQAVLRDLGAMLLKFDERFEGVLLAYEANIIDKSAKILSGVHPYFGVTIKAKLLLFSPKPNMLLEGKVVKLRQESIH

Query:  VIVLGFASAVITDEDIRNEFKHRTKHGEEMFVSRAHKHHVIKVGTMVRFLVKSFDEEILHISGSLVPSHTGSIHCLEKNSVEGSVTNRSRKKMR
        VIVLGF++AVITD DIR EFK+R + GE  FVSR+HK H +K+GTM+R  V+SFDEE++HI+GSL+P +TG +  LEK S E   T+R  K+ +
Subjt:  VIVLGFASAVITDEDIRNEFKHRTKHGEEMFVSRAHKHHVIKVGTMVRFLVKSFDEEILHISGSLVPSHTGSIHCLEKNSVEGSVTNRSRKKMR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAGGGGCTTAAGGTTTCCGACGCCAATTTGGTTATTTACGTTCACCCATCCAAAAGTAAGAAGGTTTCGCAAGCGGTGCTTCGAGACCTCGGCGCTATGCTTCTGAA
ATTTGACGAAAGGTTTGAAGGTGTCCTACTGGCTTATGAGGCCAATATTATTGATAAAAGTGCGAAGATTCTATCTGGAGTGCATCCGTATTTTGGTGTGACAATCAAGG
CAAAGCTATTACTTTTCTCTCCGAAGCCGAACATGCTTTTAGAGGGAAAGGTGGTGAAGCTTAGGCAAGAATCAATCCATGTTATTGTCTTAGGTTTTGCTTCTGCTGTA
ATAACCGATGAAGACATTCGCAATGAATTCAAGCATAGAACAAAACATGGAGAAGAAATGTTTGTCAGCAGAGCTCACAAGCACCATGTGATAAAGGTTGGGACGATGGT
ACGATTTTTGGTGAAGAGTTTTGATGAGGAAATATTGCACATATCTGGATCTCTAGTTCCATCTCACACAGGGAGCATCCATTGCTTGGAGAAAAATTCAGTTGAAGGTT
CAGTAACTAATAGGAGTAGAAAGAAGATGAGAGATAACGACAGAGAATCATTGTTGCAAGATAGTGTTGCCACTGATGTAAATGCACTTCTCTTGAACAATGACCATCAA
TCTAAAACCAAAAAACAAAAAAACTAG
mRNA sequenceShow/hide mRNA sequence
CGGCCCTTTTCTTCGTCGTCCATTATGTTATGGACTAGGGCTTCATGGGGGGTGACATACGCACACTCCTTGCTTTTACCTTGCTTTTACCTTGCTTCCAGCTAATTCCT
CTCCTCCTCCTTTGTTTCGCTCGCCGTTTCAATGGAGGGGCTTAAGGTTTCCGACGCCAATTTGGTTATTTACGTTCACCCATCCAAAAGTAAGAAGGTTTCGCAAGCGG
TGCTTCGAGACCTCGGCGCTATGCTTCTGAAATTTGACGAAAGGTTTGAAGGTGTCCTACTGGCTTATGAGGCCAATATTATTGATAAAAGTGCGAAGATTCTATCTGGA
GTGCATCCGTATTTTGGTGTGACAATCAAGGCAAAGCTATTACTTTTCTCTCCGAAGCCGAACATGCTTTTAGAGGGAAAGGTGGTGAAGCTTAGGCAAGAATCAATCCA
TGTTATTGTCTTAGGTTTTGCTTCTGCTGTAATAACCGATGAAGACATTCGCAATGAATTCAAGCATAGAACAAAACATGGAGAAGAAATGTTTGTCAGCAGAGCTCACA
AGCACCATGTGATAAAGGTTGGGACGATGGTACGATTTTTGGTGAAGAGTTTTGATGAGGAAATATTGCACATATCTGGATCTCTAGTTCCATCTCACACAGGGAGCATC
CATTGCTTGGAGAAAAATTCAGTTGAAGGTTCAGTAACTAATAGGAGTAGAAAGAAGATGAGAGATAACGACAGAGAATCATTGTTGCAAGATAGTGTTGCCACTGATGT
AAATGCACTTCTCTTGAACAATGACCATCAATCTAAAACCAAAAAACAAAAAAACTAG
Protein sequenceShow/hide protein sequence
MEGLKVSDANLVIYVHPSKSKKVSQAVLRDLGAMLLKFDERFEGVLLAYEANIIDKSAKILSGVHPYFGVTIKAKLLLFSPKPNMLLEGKVVKLRQESIHVIVLGFASAV
ITDEDIRNEFKHRTKHGEEMFVSRAHKHHVIKVGTMVRFLVKSFDEEILHISGSLVPSHTGSIHCLEKNSVEGSVTNRSRKKMRDNDRESLLQDSVATDVNALLLNNDHQ
SKTKKQKN