; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Carg17187 (gene) of Silver-seed gourd (SMH-JMG-627) v2 genome

Gene IDCarg17187
OrganismCucurbita argyrosperma subsp. argyrosperma cv. SMH-JMG-627 (Silver-seed gourd (SMH-JMG-627) v2)
DescriptionNucleic acid-binding, OB-fold containing protein
Genome locationCarg_Chr01:12887370..12889701
RNA-Seq ExpressionCarg17187
SyntenyCarg17187
Gene Ontology termsGO:0006352 - DNA-templated transcription, initiation (biological process)
GO:0005736 - RNA polymerase I complex (cellular component)
GO:0003899 - DNA-directed 5'-3' RNA polymerase activity (molecular function)
InterPro domainsIPR036898 - RNA polymerase Rpb7-like, N-terminal domain superfamily
IPR045113 - RNA polymerase Rpb7-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6608583.1 hypothetical protein SDJN03_01925, partial [Cucurbita argyrosperma subsp. sororia]1.6e-12499.18Show/hide
Query:  MGSHTLDCVSSRGFYLASSLFFFFSSSFVSLAVSMEGLKVSDANLVIYVHPSKSKKVSQAVLRELGAMLLKFDERFEGVLLAYEANIIDKSAKILSGVHP
        MGSHTLDCVSSRGFYLASSL FFFSSSFVSLAVSMEGLKVSDANLVIYVHPSKSKKVSQAVLRELGAMLLKFDERFEGVLLAYEANIIDKSAKILSGVHP
Subjt:  MGSHTLDCVSSRGFYLASSLFFFFSSSFVSLAVSMEGLKVSDANLVIYVHPSKSKKVSQAVLRELGAMLLKFDERFEGVLLAYEANIIDKSAKILSGVHP

Query:  YFGVTIKAKLLLFSPKPNMLLEGKVVKLRQESIHVIVLGFASAVITDEDIRDEFKHRTKHGEEMFVSRANKHHVIKVGTMVRFLVKSFDEEILHISGSLV
        YFGVTIKAKLLLFSPKPNMLLEGKVVKLRQESIHVIVLGFASAVITDEDIRDEFKHRTKHGEEMFVSRANKHHVIKVGTMVRFLVKSFDEEILHISGSLV
Subjt:  YFGVTIKAKLLLFSPKPNMLLEGKVVKLRQESIHVIVLGFASAVITDEDIRDEFKHRTKHGEEMFVSRANKHHVIKVGTMVRFLVKSFDEEILHISGSLV

Query:  PSHTGSIHCLEKNSVEGSETRSRKKTRDNDKESLLQDSVATDV
        PSHTGSIHCLEKNSVEGSETRSRKKTRDNDKESLLQDSVATD+
Subjt:  PSHTGSIHCLEKNSVEGSETRSRKKTRDNDKESLLQDSVATDV

KAG7037901.1 rpa43, partial [Cucurbita argyrosperma subsp. argyrosperma]1.5e-138100Show/hide
Query:  MGSHTLDCVSSRGFYLASSLFFFFSSSFVSLAVSMEGLKVSDANLVIYVHPSKSKKVSQAVLRELGAMLLKFDERFEGVLLAYEANIIDKSAKILSGVHP
        MGSHTLDCVSSRGFYLASSLFFFFSSSFVSLAVSMEGLKVSDANLVIYVHPSKSKKVSQAVLRELGAMLLKFDERFEGVLLAYEANIIDKSAKILSGVHP
Subjt:  MGSHTLDCVSSRGFYLASSLFFFFSSSFVSLAVSMEGLKVSDANLVIYVHPSKSKKVSQAVLRELGAMLLKFDERFEGVLLAYEANIIDKSAKILSGVHP

Query:  YFGVTIKAKLLLFSPKPNMLLEGKVVKLRQESIHVIVLGFASAVITDEDIRDEFKHRTKHGEEMFVSRANKHHVIKVGTMVRFLVKSFDEEILHISGSLV
        YFGVTIKAKLLLFSPKPNMLLEGKVVKLRQESIHVIVLGFASAVITDEDIRDEFKHRTKHGEEMFVSRANKHHVIKVGTMVRFLVKSFDEEILHISGSLV
Subjt:  YFGVTIKAKLLLFSPKPNMLLEGKVVKLRQESIHVIVLGFASAVITDEDIRDEFKHRTKHGEEMFVSRANKHHVIKVGTMVRFLVKSFDEEILHISGSLV

Query:  PSHTGSIHCLEKNSVEGSETRSRKKTRDNDKESLLQDSVATDVNALLLNNDHQSKTKKQKTSRIS
        PSHTGSIHCLEKNSVEGSETRSRKKTRDNDKESLLQDSVATDVNALLLNNDHQSKTKKQKTSRIS
Subjt:  PSHTGSIHCLEKNSVEGSETRSRKKTRDNDKESLLQDSVATDVNALLLNNDHQSKTKKQKTSRIS

XP_022940043.1 uncharacterized protein LOC111445794 isoform X1 [Cucurbita moschata]5.6e-11798.28Show/hide
Query:  MEGLKVSDANLVIYVHPSKSKKVSQAVLRELGAMLLKFDERFEGVLLAYEANIIDKSAKILSGVHPYFGVTIKAKLLLFSPKPNMLLEGKVVKLRQESIH
        MEGLKVSDANLVIYVHPSKSKKVSQAVLRELGAMLLKFDERFEGVLLAYEANIIDKSAKILSGVHPYFGVTIKAKLLLFSPKPNMLLEGKVVKLRQESIH
Subjt:  MEGLKVSDANLVIYVHPSKSKKVSQAVLRELGAMLLKFDERFEGVLLAYEANIIDKSAKILSGVHPYFGVTIKAKLLLFSPKPNMLLEGKVVKLRQESIH

Query:  VIVLGFASAVITDEDIRDEFKHRTKHGEEMFVSRANKHHVIKVGTMVRFLVKSFDEEILHISGSLVPSHTGSIHCLEKNSVEGSET-RSRKKTRDNDKES
        VIVLGFASAVITDEDIR+EFKHRTKHGEEMFVSRANKHHVIKVGTMVRFLVKSFDEEILHISGSLVPSHTGSIHCLEKNSVEGS T RSRKKTRDND+ES
Subjt:  VIVLGFASAVITDEDIRDEFKHRTKHGEEMFVSRANKHHVIKVGTMVRFLVKSFDEEILHISGSLVPSHTGSIHCLEKNSVEGSET-RSRKKTRDNDKES

Query:  LLQDSVATDVNALLLNNDHQSKTKKQKTSRIS
        LLQDSVATDVNALLLNNDHQSKTKKQKTSRIS
Subjt:  LLQDSVATDVNALLLNNDHQSKTKKQKTSRIS

XP_022940045.1 uncharacterized protein LOC111445794 isoform X2 [Cucurbita moschata]2.3e-11898.7Show/hide
Query:  MEGLKVSDANLVIYVHPSKSKKVSQAVLRELGAMLLKFDERFEGVLLAYEANIIDKSAKILSGVHPYFGVTIKAKLLLFSPKPNMLLEGKVVKLRQESIH
        MEGLKVSDANLVIYVHPSKSKKVSQAVLRELGAMLLKFDERFEGVLLAYEANIIDKSAKILSGVHPYFGVTIKAKLLLFSPKPNMLLEGKVVKLRQESIH
Subjt:  MEGLKVSDANLVIYVHPSKSKKVSQAVLRELGAMLLKFDERFEGVLLAYEANIIDKSAKILSGVHPYFGVTIKAKLLLFSPKPNMLLEGKVVKLRQESIH

Query:  VIVLGFASAVITDEDIRDEFKHRTKHGEEMFVSRANKHHVIKVGTMVRFLVKSFDEEILHISGSLVPSHTGSIHCLEKNSVEGSETRSRKKTRDNDKESL
        VIVLGFASAVITDEDIR+EFKHRTKHGEEMFVSRANKHHVIKVGTMVRFLVKSFDEEILHISGSLVPSHTGSIHCLEKNSVEGS TRSRKKTRDND+ESL
Subjt:  VIVLGFASAVITDEDIRDEFKHRTKHGEEMFVSRANKHHVIKVGTMVRFLVKSFDEEILHISGSLVPSHTGSIHCLEKNSVEGSETRSRKKTRDNDKESL

Query:  LQDSVATDVNALLLNNDHQSKTKKQKTSRIS
        LQDSVATDVNALLLNNDHQSKTKKQKTSRIS
Subjt:  LQDSVATDVNALLLNNDHQSKTKKQKTSRIS

XP_023523351.1 uncharacterized protein LOC111787571 [Cucurbita pepo subsp. pepo]3.6e-11697.84Show/hide
Query:  MEGLKVSDANLVIYVHPSKSKKVSQAVLRELGAMLLKFDERFEGVLLAYEANIIDKSAKILSGVHPYFGVTIKAKLLLFSPKPNMLLEGKVVKLRQESIH
        MEGLKVSDANLVIYVHPSKSKKVSQAVLRELGAMLLKFDERFEGVLLAYEANIIDKSAKILSGVHPYFGVTIKAKLLLFSPKPNMLLEGKVVKLRQESIH
Subjt:  MEGLKVSDANLVIYVHPSKSKKVSQAVLRELGAMLLKFDERFEGVLLAYEANIIDKSAKILSGVHPYFGVTIKAKLLLFSPKPNMLLEGKVVKLRQESIH

Query:  VIVLGFASAVITDEDIRDEFKHRTKHGEEMFVSRANKHHVIKVGTMVRFLVKSFDEEILHISGSLVPSHTGSIHCLEKNSVEGSET-RSRKKTRDNDKES
        VIVLGFASAVITDEDIRDEFKHRTKHGEEMFVSRANKHHVIKVGTMVRFLVKSFDEEILHISGSLVPSHTGSIHCLEKNSVEGS T RSRKK RD D+ES
Subjt:  VIVLGFASAVITDEDIRDEFKHRTKHGEEMFVSRANKHHVIKVGTMVRFLVKSFDEEILHISGSLVPSHTGSIHCLEKNSVEGSET-RSRKKTRDNDKES

Query:  LLQDSVATDVNALLLNNDHQSKTKKQKTSRIS
        LLQDSVATDVNALLLNNDHQSKTKKQKTSRIS
Subjt:  LLQDSVATDVNALLLNNDHQSKTKKQKTSRIS

TrEMBL top hitse value%identityAlignment
A0A6J1D5Y9 uncharacterized protein LOC1110167571.4e-11092.24Show/hide
Query:  MEGLKVSDANLVIYVHPSKSKKVSQAVLRELGAMLLKFDERFEGVLLAYEANIIDKSAKILSGVHPYFGVTIKAKLLLFSPKPNMLLEGKVVKLRQESIH
        MEGLKVSDANLV+YVHPSKSKKVSQAVLRELGAMLLKFDERFEGVLLAY+A I DKSAKILSGVHPYFGVT+KAKLLLFSPKPNMLLEGKVVKLRQES+H
Subjt:  MEGLKVSDANLVIYVHPSKSKKVSQAVLRELGAMLLKFDERFEGVLLAYEANIIDKSAKILSGVHPYFGVTIKAKLLLFSPKPNMLLEGKVVKLRQESIH

Query:  VIVLGFASAVITDEDIRDEFKHRTKHGEEMFVSRANKHHVIKVGTMVRFLVKSFDEEILHISGSLVPSHTGSIHCLEKNSVEGSET-RSRKKTRDNDKES
        VIVLGFASA ITDEDIRDEFKHRTKH EEMFVSRA+KHHVIKVGTM+RFLVKSFDEEILHISGSLVPSHTGSIH LEKNS+EGS T R RKKTRDN+ ES
Subjt:  VIVLGFASAVITDEDIRDEFKHRTKHGEEMFVSRANKHHVIKVGTMVRFLVKSFDEEILHISGSLVPSHTGSIHCLEKNSVEGSET-RSRKKTRDNDKES

Query:  LLQDSVATDVNALLLNNDHQSKTKKQKTSRIS
        LLQDSVATDVNAL+LNNDHQSKTKKQKTSRIS
Subjt:  LLQDSVATDVNALLLNNDHQSKTKKQKTSRIS

A0A6J1FN75 uncharacterized protein LOC111445794 isoform X12.7e-11798.28Show/hide
Query:  MEGLKVSDANLVIYVHPSKSKKVSQAVLRELGAMLLKFDERFEGVLLAYEANIIDKSAKILSGVHPYFGVTIKAKLLLFSPKPNMLLEGKVVKLRQESIH
        MEGLKVSDANLVIYVHPSKSKKVSQAVLRELGAMLLKFDERFEGVLLAYEANIIDKSAKILSGVHPYFGVTIKAKLLLFSPKPNMLLEGKVVKLRQESIH
Subjt:  MEGLKVSDANLVIYVHPSKSKKVSQAVLRELGAMLLKFDERFEGVLLAYEANIIDKSAKILSGVHPYFGVTIKAKLLLFSPKPNMLLEGKVVKLRQESIH

Query:  VIVLGFASAVITDEDIRDEFKHRTKHGEEMFVSRANKHHVIKVGTMVRFLVKSFDEEILHISGSLVPSHTGSIHCLEKNSVEGSET-RSRKKTRDNDKES
        VIVLGFASAVITDEDIR+EFKHRTKHGEEMFVSRANKHHVIKVGTMVRFLVKSFDEEILHISGSLVPSHTGSIHCLEKNSVEGS T RSRKKTRDND+ES
Subjt:  VIVLGFASAVITDEDIRDEFKHRTKHGEEMFVSRANKHHVIKVGTMVRFLVKSFDEEILHISGSLVPSHTGSIHCLEKNSVEGSET-RSRKKTRDNDKES

Query:  LLQDSVATDVNALLLNNDHQSKTKKQKTSRIS
        LLQDSVATDVNALLLNNDHQSKTKKQKTSRIS
Subjt:  LLQDSVATDVNALLLNNDHQSKTKKQKTSRIS

A0A6J1FPG3 uncharacterized protein LOC111445794 isoform X21.1e-11898.7Show/hide
Query:  MEGLKVSDANLVIYVHPSKSKKVSQAVLRELGAMLLKFDERFEGVLLAYEANIIDKSAKILSGVHPYFGVTIKAKLLLFSPKPNMLLEGKVVKLRQESIH
        MEGLKVSDANLVIYVHPSKSKKVSQAVLRELGAMLLKFDERFEGVLLAYEANIIDKSAKILSGVHPYFGVTIKAKLLLFSPKPNMLLEGKVVKLRQESIH
Subjt:  MEGLKVSDANLVIYVHPSKSKKVSQAVLRELGAMLLKFDERFEGVLLAYEANIIDKSAKILSGVHPYFGVTIKAKLLLFSPKPNMLLEGKVVKLRQESIH

Query:  VIVLGFASAVITDEDIRDEFKHRTKHGEEMFVSRANKHHVIKVGTMVRFLVKSFDEEILHISGSLVPSHTGSIHCLEKNSVEGSETRSRKKTRDNDKESL
        VIVLGFASAVITDEDIR+EFKHRTKHGEEMFVSRANKHHVIKVGTMVRFLVKSFDEEILHISGSLVPSHTGSIHCLEKNSVEGS TRSRKKTRDND+ESL
Subjt:  VIVLGFASAVITDEDIRDEFKHRTKHGEEMFVSRANKHHVIKVGTMVRFLVKSFDEEILHISGSLVPSHTGSIHCLEKNSVEGSETRSRKKTRDNDKESL

Query:  LQDSVATDVNALLLNNDHQSKTKKQKTSRIS
        LQDSVATDVNALLLNNDHQSKTKKQKTSRIS
Subjt:  LQDSVATDVNALLLNNDHQSKTKKQKTSRIS

A0A6J1IWP0 uncharacterized protein LOC111481277 isoform X13.1e-11396.92Show/hide
Query:  MEGLKVSDANLVIYVHPSKSKKVSQAVLRELGAMLLKFDERFEGVLLAYEANIIDKSAKILSGVHPYFGVTIKAKLLLFSPKPNMLLEGKVVKLRQESIH
        MEGLKVSDANLVIYVHPSKSKKVSQAVLR+LGAMLLKFDERFEGVLLAYEANIIDKSAKILSGVHPYFGVTIKAKLLLFSPKPNMLLEGKVVKLRQESIH
Subjt:  MEGLKVSDANLVIYVHPSKSKKVSQAVLRELGAMLLKFDERFEGVLLAYEANIIDKSAKILSGVHPYFGVTIKAKLLLFSPKPNMLLEGKVVKLRQESIH

Query:  VIVLGFASAVITDEDIRDEFKHRTKHGEEMFVSRANKHHVIKVGTMVRFLVKSFDEEILHISGSLVPSHTGSIHCLEKNSVEGSET-RSRKKTRDNDKES
        VIVLGFASAVITDEDIR+EFKHRTKHGEEMFVSRA+KHHVIKVGTMVRFLVKSFDEEILHISGSLVPSHTGSIHCLEKNSVEGS T RSRKK RDND+ES
Subjt:  VIVLGFASAVITDEDIRDEFKHRTKHGEEMFVSRANKHHVIKVGTMVRFLVKSFDEEILHISGSLVPSHTGSIHCLEKNSVEGSET-RSRKKTRDNDKES

Query:  LLQDSVATDVNALLLNNDHQSKTKKQK
        LLQDSVATDVNALLLNNDHQSKTKKQK
Subjt:  LLQDSVATDVNALLLNNDHQSKTKKQK

A0A6J1J4W1 uncharacterized protein LOC111481277 isoform X21.3e-11497.35Show/hide
Query:  MEGLKVSDANLVIYVHPSKSKKVSQAVLRELGAMLLKFDERFEGVLLAYEANIIDKSAKILSGVHPYFGVTIKAKLLLFSPKPNMLLEGKVVKLRQESIH
        MEGLKVSDANLVIYVHPSKSKKVSQAVLR+LGAMLLKFDERFEGVLLAYEANIIDKSAKILSGVHPYFGVTIKAKLLLFSPKPNMLLEGKVVKLRQESIH
Subjt:  MEGLKVSDANLVIYVHPSKSKKVSQAVLRELGAMLLKFDERFEGVLLAYEANIIDKSAKILSGVHPYFGVTIKAKLLLFSPKPNMLLEGKVVKLRQESIH

Query:  VIVLGFASAVITDEDIRDEFKHRTKHGEEMFVSRANKHHVIKVGTMVRFLVKSFDEEILHISGSLVPSHTGSIHCLEKNSVEGSETRSRKKTRDNDKESL
        VIVLGFASAVITDEDIR+EFKHRTKHGEEMFVSRA+KHHVIKVGTMVRFLVKSFDEEILHISGSLVPSHTGSIHCLEKNSVEGS TRSRKK RDND+ESL
Subjt:  VIVLGFASAVITDEDIRDEFKHRTKHGEEMFVSRANKHHVIKVGTMVRFLVKSFDEEILHISGSLVPSHTGSIHCLEKNSVEGSETRSRKKTRDNDKESL

Query:  LQDSVATDVNALLLNNDHQSKTKKQK
        LQDSVATDVNALLLNNDHQSKTKKQK
Subjt:  LQDSVATDVNALLLNNDHQSKTKKQK

SwissProt top hitse value%identityAlignment
O43036 DNA-directed RNA polymerase I subunit rpa436.4e-0725.77Show/hide
Query:  NLVIYVHPSKSKKVSQAVLRELGAMLLKFDERFEGVLLAYE-ANIIDKSAKILSGVHPYFGVTIKAKLLLFSPKPNMLLEGKVVKLRQESIHVIVLGFAS
        +L + + P  S+    A+   + +M+L    R  G++LAY+    ++KSAK++    P+  + ++  +L+FSPK    LEGK+  +    I +++LG  +
Subjt:  NLVIYVHPSKSKKVSQAVLRELGAMLLKFDERFEGVLLAYE-ANIIDKSAKILSGVHPYFGVTIKAKLLLFSPKPNMLLEGKVVKLRQESIHVIVLGFAS

Query:  AVITDEDI-RDEFKHRTKHGEEMFVSRANKHHVIKVGTMVRFLVKSFDEE--ILHISGSLVPS
        A I  + I +D         EE    + N  ++++ G  + F+V     E  +  + G+L  S
Subjt:  AVITDEDI-RDEFKHRTKHGEEMFVSRANKHHVIKVGTMVRFLVKSFDEE--ILHISGSLVPS

Arabidopsis top hitse value%identityAlignment
AT1G75670.1 DNA-directed RNA polymerases1.5e-5956.48Show/hide
Query:  MEGLKVSDANLVIYVHPSKSKKVSQAVLRELGAMLLKFDERFEGVLLAYEANIIDKSAKILSGVHPYFGVTIKAKLLLFSPKPNMLLEGKVVKLRQESIH
        MEGLK+S+A L+I++HPS+S+ V Q + REL ++L +++E F+GVLLAY+A +  K AKIL+G+HPYFGV +  +LLLF PKP   +EGK+VK+  ESIH
Subjt:  MEGLKVSDANLVIYVHPSKSKKVSQAVLRELGAMLLKFDERFEGVLLAYEANIIDKSAKILSGVHPYFGVTIKAKLLLFSPKPNMLLEGKVVKLRQESIH

Query:  VIVLGFASAVITDEDIRDEFKHRTKHGEEMFVSRANKHHVIKVGTMVRFLVKSFDEEILHISGSLVPSHTGSIHCLEKNSVEGSETRSRKKTR
        VIVLGF++AVITD DIR+EFK+R + GE  FVSR++K H +K+GTM+R  V+SFDEE++HI+GSL+P +TG +  LEK S E   T    K R
Subjt:  VIVLGFASAVITDEDIRDEFKHRTKHGEEMFVSRANKHHVIKVGTMVRFLVKSFDEEILHISGSLVPSHTGSIHCLEKNSVEGSETRSRKKTR

AT1G75670.2 DNA-directed RNA polymerases1.5e-5956.48Show/hide
Query:  MEGLKVSDANLVIYVHPSKSKKVSQAVLRELGAMLLKFDERFEGVLLAYEANIIDKSAKILSGVHPYFGVTIKAKLLLFSPKPNMLLEGKVVKLRQESIH
        MEGLK+S+A L+I++HPS+S+ V Q + REL ++L +++E F+GVLLAY+A +  K AKIL+G+HPYFGV +  +LLLF PKP   +EGK+VK+  ESIH
Subjt:  MEGLKVSDANLVIYVHPSKSKKVSQAVLRELGAMLLKFDERFEGVLLAYEANIIDKSAKILSGVHPYFGVTIKAKLLLFSPKPNMLLEGKVVKLRQESIH

Query:  VIVLGFASAVITDEDIRDEFKHRTKHGEEMFVSRANKHHVIKVGTMVRFLVKSFDEEILHISGSLVPSHTGSIHCLEKNSVEGSETRSRKKTR
        VIVLGF++AVITD DIR+EFK+R + GE  FVSR++K H +K+GTM+R  V+SFDEE++HI+GSL+P +TG +  LEK S E   T    K R
Subjt:  VIVLGFASAVITDEDIRDEFKHRTKHGEEMFVSRANKHHVIKVGTMVRFLVKSFDEEILHISGSLVPSHTGSIHCLEKNSVEGSETRSRKKTR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGGTCACATACGCTCGATTGTGTTTCTTCTCGCGGCTTTTACCTTGCTTCGAGCTTATTCTTCTTCTTCTCCTCCTCCTTTGTTTCGCTCGCCGTTTCAATGGAGGG
GCTTAAGGTTTCCGACGCCAATTTGGTTATCTACGTTCACCCATCCAAAAGTAAGAAGGTTTCGCAAGCGGTGCTCCGAGAGCTCGGCGCTATGCTTCTGAAATTTGACG
AAAGGTTTGAAGGTGTCCTACTGGCTTATGAGGCCAATATTATTGATAAAAGTGCGAAGATTCTATCTGGAGTGCATCCATATTTTGGTGTGACAATAAAGGCAAAGCTA
TTACTTTTCTCTCCGAAGCCGAACATGCTTTTAGAGGGAAAGGTGGTGAAGCTTAGGCAAGAATCAATCCATGTTATTGTCTTGGGTTTTGCTTCTGCTGTAATAACCGA
TGAAGACATTCGCGATGAATTCAAGCATAGAACAAAACATGGAGAAGAAATGTTTGTCAGCAGAGCTAACAAGCACCATGTGATAAAGGTTGGGACAATGGTACGATTTT
TGGTGAAGAGTTTTGATGAGGAAATATTGCACATCTCTGGATCTCTAGTTCCATCTCACACAGGGAGCATCCATTGCTTGGAGAAAAATTCAGTTGAAGGTTCAGAAACT
AGGAGTAGAAAGAAGACGAGAGATAACGACAAAGAATCATTGTTGCAAGATAGTGTTGCCACTGATGTAAATGCACTTCTCTTGAACAATGACCATCAATCTAAAACCAA
AAAACAAAAAACTAGCAGAATATCTTGA
mRNA sequenceShow/hide mRNA sequence
ATGGGGTCACATACGCTCGATTGTGTTTCTTCTCGCGGCTTTTACCTTGCTTCGAGCTTATTCTTCTTCTTCTCCTCCTCCTTTGTTTCGCTCGCCGTTTCAATGGAGGG
GCTTAAGGTTTCCGACGCCAATTTGGTTATCTACGTTCACCCATCCAAAAGTAAGAAGGTTTCGCAAGCGGTGCTCCGAGAGCTCGGCGCTATGCTTCTGAAATTTGACG
AAAGGTTTGAAGGTGTCCTACTGGCTTATGAGGCCAATATTATTGATAAAAGTGCGAAGATTCTATCTGGAGTGCATCCATATTTTGGTGTGACAATAAAGGCAAAGCTA
TTACTTTTCTCTCCGAAGCCGAACATGCTTTTAGAGGGAAAGGTGGTGAAGCTTAGGCAAGAATCAATCCATGTTATTGTCTTGGGTTTTGCTTCTGCTGTAATAACCGA
TGAAGACATTCGCGATGAATTCAAGCATAGAACAAAACATGGAGAAGAAATGTTTGTCAGCAGAGCTAACAAGCACCATGTGATAAAGGTTGGGACAATGGTACGATTTT
TGGTGAAGAGTTTTGATGAGGAAATATTGCACATCTCTGGATCTCTAGTTCCATCTCACACAGGGAGCATCCATTGCTTGGAGAAAAATTCAGTTGAAGGTTCAGAAACT
AGGAGTAGAAAGAAGACGAGAGATAACGACAAAGAATCATTGTTGCAAGATAGTGTTGCCACTGATGTAAATGCACTTCTCTTGAACAATGACCATCAATCTAAAACCAA
AAAACAAAAAACTAGCAGAATATCTTGAAGACTGCTAATTGTCATACAACATAGATCATTTTTGTATCAGGGTGATGATGATTCAAATCAGAGATCCTCGTTGTTTTCCC
ATTGCAAGATATGCTTGGGAAAGAGATGCGCCTTTCACTTGGGAATATCTGACGTGACATCACATTATATCAGGGTGTAGATAAATGCAGTTTCCCATCTTTGTAATTGT
CTTCGAAGTAGCCCAAATCCTTGTTTTATGGGCTTAGCTCTAGTAGACATTAGTGAGCAAATGATTTTTGTTTAGTCCAATGTGCATTAGGTAAGGATAGGGGGAGGACA
GATGTTATCAACCTAGACTTCCCTAAAAAACTGTGAAATTTTGATATTATATTTCTTAGTTTCATTCAGAATCACACACACCACAAAAAACAACACCACAACCAGACCGT
TTGTTTGTCAATCATTATAAGGAG
Protein sequenceShow/hide protein sequence
MGSHTLDCVSSRGFYLASSLFFFFSSSFVSLAVSMEGLKVSDANLVIYVHPSKSKKVSQAVLRELGAMLLKFDERFEGVLLAYEANIIDKSAKILSGVHPYFGVTIKAKL
LLFSPKPNMLLEGKVVKLRQESIHVIVLGFASAVITDEDIRDEFKHRTKHGEEMFVSRANKHHVIKVGTMVRFLVKSFDEEILHISGSLVPSHTGSIHCLEKNSVEGSET
RSRKKTRDNDKESLLQDSVATDVNALLLNNDHQSKTKKQKTSRIS