; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0006405 (gene) of Snake gourd v1 genome

Gene IDTan0006405
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionNucleic acid-binding, OB-fold containing protein
Genome locationLG01:115705991..115708600
RNA-Seq ExpressionTan0006405
SyntenyTan0006405
Gene Ontology termsGO:0006352 - DNA-templated transcription, initiation (biological process)
GO:0005736 - RNA polymerase I complex (cellular component)
GO:0003899 - DNA-directed 5'-3' RNA polymerase activity (molecular function)
InterPro domainsIPR036898 - RNA polymerase Rpb7-like, N-terminal domain superfamily
IPR045113 - RNA polymerase Rpb7-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022148351.1 uncharacterized protein LOC111016757 [Momordica charantia]9.6e-11390.52Show/hide
Query:  MEGLKVSDANMIIYVHPSKSKKVSQAVLRELGAMLLKFDEKFEGVLLAYEAKIIDKSAKILSGVHPYFGVTIKAKLLLFSPKPNMLLEGKVVKLRQESIH
        MEGLKVSDAN+++YVHPSKSKKVSQAVLRELGAMLLKFDE+FEGVLLAY+AKI DKSAKILSGVHPYFGVT+KAKLLLFSPKPNMLLEGKVVKLRQES+H
Subjt:  MEGLKVSDANMIIYVHPSKSKKVSQAVLRELGAMLLKFDEKFEGVLLAYEAKIIDKSAKILSGVHPYFGVTIKAKLLLFSPKPNMLLEGKVVKLRQESIH

Query:  VIVLGFASAVITDEDIRDEFKHRTKHGDEMFVSRAHKHHVLKVGTMVRFLVKSFDEEILHISGSLVPSHTGSIHWLEKNSVEGSVTNRSRKKTRDNDGES
        VIVLGFASA ITDEDIRDEFKHRTKH +EMFVSRAHKHHV+KVGTM+RFLVKSFDEEILHISGSLVPSHTGSIHWLEKNS+EGSVT+R RKKTRDN+GES
Subjt:  VIVLGFASAVITDEDIRDEFKHRTKHGDEMFVSRAHKHHVLKVGTMVRFLVKSFDEEILHISGSLVPSHTGSIHWLEKNSVEGSVTNRSRKKTRDNDGES

Query:  LLQDSVATEVNALLLNDDHQTKTKKQKTSRKS
        LLQDSVAT+VNAL+LN+DHQ+KTKKQKTSR S
Subjt:  LLQDSVATEVNALLLNDDHQTKTKKQKTSRKS

XP_022940043.1 uncharacterized protein LOC111445794 isoform X1 [Cucurbita moschata]1.2e-11293.53Show/hide
Query:  MEGLKVSDANMIIYVHPSKSKKVSQAVLRELGAMLLKFDEKFEGVLLAYEAKIIDKSAKILSGVHPYFGVTIKAKLLLFSPKPNMLLEGKVVKLRQESIH
        MEGLKVSDAN++IYVHPSKSKKVSQAVLRELGAMLLKFDE+FEGVLLAYEA IIDKSAKILSGVHPYFGVTIKAKLLLFSPKPNMLLEGKVVKLRQESIH
Subjt:  MEGLKVSDANMIIYVHPSKSKKVSQAVLRELGAMLLKFDEKFEGVLLAYEAKIIDKSAKILSGVHPYFGVTIKAKLLLFSPKPNMLLEGKVVKLRQESIH

Query:  VIVLGFASAVITDEDIRDEFKHRTKHGDEMFVSRAHKHHVLKVGTMVRFLVKSFDEEILHISGSLVPSHTGSIHWLEKNSVEGSVTNRSRKKTRDNDGES
        VIVLGFASAVITDEDIR+EFKHRTKHG+EMFVSRA+KHHV+KVGTMVRFLVKSFDEEILHISGSLVPSHTGSIH LEKNSVEGSVT+RSRKKTRDND ES
Subjt:  VIVLGFASAVITDEDIRDEFKHRTKHGDEMFVSRAHKHHVLKVGTMVRFLVKSFDEEILHISGSLVPSHTGSIHWLEKNSVEGSVTNRSRKKTRDNDGES

Query:  LLQDSVATEVNALLLNDDHQTKTKKQKTSRKS
        LLQDSVAT+VNALLLN+DHQ+KTKKQKTSR S
Subjt:  LLQDSVATEVNALLLNDDHQTKTKKQKTSRKS

XP_022940045.1 uncharacterized protein LOC111445794 isoform X2 [Cucurbita moschata]3.1e-11193.53Show/hide
Query:  MEGLKVSDANMIIYVHPSKSKKVSQAVLRELGAMLLKFDEKFEGVLLAYEAKIIDKSAKILSGVHPYFGVTIKAKLLLFSPKPNMLLEGKVVKLRQESIH
        MEGLKVSDAN++IYVHPSKSKKVSQAVLRELGAMLLKFDE+FEGVLLAYEA IIDKSAKILSGVHPYFGVTIKAKLLLFSPKPNMLLEGKVVKLRQESIH
Subjt:  MEGLKVSDANMIIYVHPSKSKKVSQAVLRELGAMLLKFDEKFEGVLLAYEAKIIDKSAKILSGVHPYFGVTIKAKLLLFSPKPNMLLEGKVVKLRQESIH

Query:  VIVLGFASAVITDEDIRDEFKHRTKHGDEMFVSRAHKHHVLKVGTMVRFLVKSFDEEILHISGSLVPSHTGSIHWLEKNSVEGSVTNRSRKKTRDNDGES
        VIVLGFASAVITDEDIR+EFKHRTKHG+EMFVSRA+KHHV+KVGTMVRFLVKSFDEEILHISGSLVPSHTGSIH LEKNSVEGSVT RSRKKTRDND ES
Subjt:  VIVLGFASAVITDEDIRDEFKHRTKHGDEMFVSRAHKHHVLKVGTMVRFLVKSFDEEILHISGSLVPSHTGSIHWLEKNSVEGSVTNRSRKKTRDNDGES

Query:  LLQDSVATEVNALLLNDDHQTKTKKQKTSRKS
        LLQDSVAT+VNALLLN+DHQ+KTKKQKTSR S
Subjt:  LLQDSVATEVNALLLNDDHQTKTKKQKTSRKS

XP_022982461.1 uncharacterized protein LOC111481277 isoform X1 [Cucurbita maxima]1.4e-11193.83Show/hide
Query:  MEGLKVSDANMIIYVHPSKSKKVSQAVLRELGAMLLKFDEKFEGVLLAYEAKIIDKSAKILSGVHPYFGVTIKAKLLLFSPKPNMLLEGKVVKLRQESIH
        MEGLKVSDAN++IYVHPSKSKKVSQAVLR+LGAMLLKFDE+FEGVLLAYEA IIDKSAKILSGVHPYFGVTIKAKLLLFSPKPNMLLEGKVVKLRQESIH
Subjt:  MEGLKVSDANMIIYVHPSKSKKVSQAVLRELGAMLLKFDEKFEGVLLAYEAKIIDKSAKILSGVHPYFGVTIKAKLLLFSPKPNMLLEGKVVKLRQESIH

Query:  VIVLGFASAVITDEDIRDEFKHRTKHGDEMFVSRAHKHHVLKVGTMVRFLVKSFDEEILHISGSLVPSHTGSIHWLEKNSVEGSVTNRSRKKTRDNDGES
        VIVLGFASAVITDEDIR+EFKHRTKHG+EMFVSRAHKHHV+KVGTMVRFLVKSFDEEILHISGSLVPSHTGSIH LEKNSVEGSVTNRSRKK RDND ES
Subjt:  VIVLGFASAVITDEDIRDEFKHRTKHGDEMFVSRAHKHHVLKVGTMVRFLVKSFDEEILHISGSLVPSHTGSIHWLEKNSVEGSVTNRSRKKTRDNDGES

Query:  LLQDSVATEVNALLLNDDHQTKTKKQK
        LLQDSVAT+VNALLLN+DHQ+KTKKQK
Subjt:  LLQDSVATEVNALLLNDDHQTKTKKQK

XP_023523351.1 uncharacterized protein LOC111787571 [Cucurbita pepo subsp. pepo]2.1e-11293.53Show/hide
Query:  MEGLKVSDANMIIYVHPSKSKKVSQAVLRELGAMLLKFDEKFEGVLLAYEAKIIDKSAKILSGVHPYFGVTIKAKLLLFSPKPNMLLEGKVVKLRQESIH
        MEGLKVSDAN++IYVHPSKSKKVSQAVLRELGAMLLKFDE+FEGVLLAYEA IIDKSAKILSGVHPYFGVTIKAKLLLFSPKPNMLLEGKVVKLRQESIH
Subjt:  MEGLKVSDANMIIYVHPSKSKKVSQAVLRELGAMLLKFDEKFEGVLLAYEAKIIDKSAKILSGVHPYFGVTIKAKLLLFSPKPNMLLEGKVVKLRQESIH

Query:  VIVLGFASAVITDEDIRDEFKHRTKHGDEMFVSRAHKHHVLKVGTMVRFLVKSFDEEILHISGSLVPSHTGSIHWLEKNSVEGSVTNRSRKKTRDNDGES
        VIVLGFASAVITDEDIRDEFKHRTKHG+EMFVSRA+KHHV+KVGTMVRFLVKSFDEEILHISGSLVPSHTGSIH LEKNSVEGSVTNRSRKK RD D ES
Subjt:  VIVLGFASAVITDEDIRDEFKHRTKHGDEMFVSRAHKHHVLKVGTMVRFLVKSFDEEILHISGSLVPSHTGSIHWLEKNSVEGSVTNRSRKKTRDNDGES

Query:  LLQDSVATEVNALLLNDDHQTKTKKQKTSRKS
        LLQDSVAT+VNALLLN+DHQ+KTKKQKTSR S
Subjt:  LLQDSVATEVNALLLNDDHQTKTKKQKTSRKS

TrEMBL top hitse value%identityAlignment
A0A6J1D5Y9 uncharacterized protein LOC1110167574.6e-11390.52Show/hide
Query:  MEGLKVSDANMIIYVHPSKSKKVSQAVLRELGAMLLKFDEKFEGVLLAYEAKIIDKSAKILSGVHPYFGVTIKAKLLLFSPKPNMLLEGKVVKLRQESIH
        MEGLKVSDAN+++YVHPSKSKKVSQAVLRELGAMLLKFDE+FEGVLLAY+AKI DKSAKILSGVHPYFGVT+KAKLLLFSPKPNMLLEGKVVKLRQES+H
Subjt:  MEGLKVSDANMIIYVHPSKSKKVSQAVLRELGAMLLKFDEKFEGVLLAYEAKIIDKSAKILSGVHPYFGVTIKAKLLLFSPKPNMLLEGKVVKLRQESIH

Query:  VIVLGFASAVITDEDIRDEFKHRTKHGDEMFVSRAHKHHVLKVGTMVRFLVKSFDEEILHISGSLVPSHTGSIHWLEKNSVEGSVTNRSRKKTRDNDGES
        VIVLGFASA ITDEDIRDEFKHRTKH +EMFVSRAHKHHV+KVGTM+RFLVKSFDEEILHISGSLVPSHTGSIHWLEKNS+EGSVT+R RKKTRDN+GES
Subjt:  VIVLGFASAVITDEDIRDEFKHRTKHGDEMFVSRAHKHHVLKVGTMVRFLVKSFDEEILHISGSLVPSHTGSIHWLEKNSVEGSVTNRSRKKTRDNDGES

Query:  LLQDSVATEVNALLLNDDHQTKTKKQKTSRKS
        LLQDSVAT+VNAL+LN+DHQ+KTKKQKTSR S
Subjt:  LLQDSVATEVNALLLNDDHQTKTKKQKTSRKS

A0A6J1FN75 uncharacterized protein LOC111445794 isoform X16.0e-11393.53Show/hide
Query:  MEGLKVSDANMIIYVHPSKSKKVSQAVLRELGAMLLKFDEKFEGVLLAYEAKIIDKSAKILSGVHPYFGVTIKAKLLLFSPKPNMLLEGKVVKLRQESIH
        MEGLKVSDAN++IYVHPSKSKKVSQAVLRELGAMLLKFDE+FEGVLLAYEA IIDKSAKILSGVHPYFGVTIKAKLLLFSPKPNMLLEGKVVKLRQESIH
Subjt:  MEGLKVSDANMIIYVHPSKSKKVSQAVLRELGAMLLKFDEKFEGVLLAYEAKIIDKSAKILSGVHPYFGVTIKAKLLLFSPKPNMLLEGKVVKLRQESIH

Query:  VIVLGFASAVITDEDIRDEFKHRTKHGDEMFVSRAHKHHVLKVGTMVRFLVKSFDEEILHISGSLVPSHTGSIHWLEKNSVEGSVTNRSRKKTRDNDGES
        VIVLGFASAVITDEDIR+EFKHRTKHG+EMFVSRA+KHHV+KVGTMVRFLVKSFDEEILHISGSLVPSHTGSIH LEKNSVEGSVT+RSRKKTRDND ES
Subjt:  VIVLGFASAVITDEDIRDEFKHRTKHGDEMFVSRAHKHHVLKVGTMVRFLVKSFDEEILHISGSLVPSHTGSIHWLEKNSVEGSVTNRSRKKTRDNDGES

Query:  LLQDSVATEVNALLLNDDHQTKTKKQKTSRKS
        LLQDSVAT+VNALLLN+DHQ+KTKKQKTSR S
Subjt:  LLQDSVATEVNALLLNDDHQTKTKKQKTSRKS

A0A6J1FPG3 uncharacterized protein LOC111445794 isoform X21.5e-11193.53Show/hide
Query:  MEGLKVSDANMIIYVHPSKSKKVSQAVLRELGAMLLKFDEKFEGVLLAYEAKIIDKSAKILSGVHPYFGVTIKAKLLLFSPKPNMLLEGKVVKLRQESIH
        MEGLKVSDAN++IYVHPSKSKKVSQAVLRELGAMLLKFDE+FEGVLLAYEA IIDKSAKILSGVHPYFGVTIKAKLLLFSPKPNMLLEGKVVKLRQESIH
Subjt:  MEGLKVSDANMIIYVHPSKSKKVSQAVLRELGAMLLKFDEKFEGVLLAYEAKIIDKSAKILSGVHPYFGVTIKAKLLLFSPKPNMLLEGKVVKLRQESIH

Query:  VIVLGFASAVITDEDIRDEFKHRTKHGDEMFVSRAHKHHVLKVGTMVRFLVKSFDEEILHISGSLVPSHTGSIHWLEKNSVEGSVTNRSRKKTRDNDGES
        VIVLGFASAVITDEDIR+EFKHRTKHG+EMFVSRA+KHHV+KVGTMVRFLVKSFDEEILHISGSLVPSHTGSIH LEKNSVEGSVT RSRKKTRDND ES
Subjt:  VIVLGFASAVITDEDIRDEFKHRTKHGDEMFVSRAHKHHVLKVGTMVRFLVKSFDEEILHISGSLVPSHTGSIHWLEKNSVEGSVTNRSRKKTRDNDGES

Query:  LLQDSVATEVNALLLNDDHQTKTKKQKTSRKS
        LLQDSVAT+VNALLLN+DHQ+KTKKQKTSR S
Subjt:  LLQDSVATEVNALLLNDDHQTKTKKQKTSRKS

A0A6J1IWP0 uncharacterized protein LOC111481277 isoform X16.7e-11293.83Show/hide
Query:  MEGLKVSDANMIIYVHPSKSKKVSQAVLRELGAMLLKFDEKFEGVLLAYEAKIIDKSAKILSGVHPYFGVTIKAKLLLFSPKPNMLLEGKVVKLRQESIH
        MEGLKVSDAN++IYVHPSKSKKVSQAVLR+LGAMLLKFDE+FEGVLLAYEA IIDKSAKILSGVHPYFGVTIKAKLLLFSPKPNMLLEGKVVKLRQESIH
Subjt:  MEGLKVSDANMIIYVHPSKSKKVSQAVLRELGAMLLKFDEKFEGVLLAYEAKIIDKSAKILSGVHPYFGVTIKAKLLLFSPKPNMLLEGKVVKLRQESIH

Query:  VIVLGFASAVITDEDIRDEFKHRTKHGDEMFVSRAHKHHVLKVGTMVRFLVKSFDEEILHISGSLVPSHTGSIHWLEKNSVEGSVTNRSRKKTRDNDGES
        VIVLGFASAVITDEDIR+EFKHRTKHG+EMFVSRAHKHHV+KVGTMVRFLVKSFDEEILHISGSLVPSHTGSIH LEKNSVEGSVTNRSRKK RDND ES
Subjt:  VIVLGFASAVITDEDIRDEFKHRTKHGDEMFVSRAHKHHVLKVGTMVRFLVKSFDEEILHISGSLVPSHTGSIHWLEKNSVEGSVTNRSRKKTRDNDGES

Query:  LLQDSVATEVNALLLNDDHQTKTKKQK
        LLQDSVAT+VNALLLN+DHQ+KTKKQK
Subjt:  LLQDSVATEVNALLLNDDHQTKTKKQK

A0A6J1J4W1 uncharacterized protein LOC111481277 isoform X26.3e-11093.39Show/hide
Query:  MEGLKVSDANMIIYVHPSKSKKVSQAVLRELGAMLLKFDEKFEGVLLAYEAKIIDKSAKILSGVHPYFGVTIKAKLLLFSPKPNMLLEGKVVKLRQESIH
        MEGLKVSDAN++IYVHPSKSKKVSQAVLR+LGAMLLKFDE+FEGVLLAYEA IIDKSAKILSGVHPYFGVTIKAKLLLFSPKPNMLLEGKVVKLRQESIH
Subjt:  MEGLKVSDANMIIYVHPSKSKKVSQAVLRELGAMLLKFDEKFEGVLLAYEAKIIDKSAKILSGVHPYFGVTIKAKLLLFSPKPNMLLEGKVVKLRQESIH

Query:  VIVLGFASAVITDEDIRDEFKHRTKHGDEMFVSRAHKHHVLKVGTMVRFLVKSFDEEILHISGSLVPSHTGSIHWLEKNSVEGSVTNRSRKKTRDNDGES
        VIVLGFASAVITDEDIR+EFKHRTKHG+EMFVSRAHKHHV+KVGTMVRFLVKSFDEEILHISGSLVPSHTGSIH LEKNSVEGSVT RSRKK RDND ES
Subjt:  VIVLGFASAVITDEDIRDEFKHRTKHGDEMFVSRAHKHHVLKVGTMVRFLVKSFDEEILHISGSLVPSHTGSIHWLEKNSVEGSVTNRSRKKTRDNDGES

Query:  LLQDSVATEVNALLLNDDHQTKTKKQK
        LLQDSVAT+VNALLLN+DHQ+KTKKQK
Subjt:  LLQDSVATEVNALLLNDDHQTKTKKQK

SwissProt top hitse value%identityAlignment
O43036 DNA-directed RNA polymerase I subunit rpa438.1e-0623.93Show/hide
Query:  NMIIYVHPSKSKKVSQAVLRELGAMLLKFDEKFEGVLLAYE-AKIIDKSAKILSGVHPYFGVTIKAKLLLFSPKPNMLLEGKVVKLRQESIHVIVLGFAS
        ++ + + P  S+    A+   + +M+L    +  G++LAY+  + ++KSAK++    P+  + ++  +L+FSPK    LEGK+  +    I +++LG  +
Subjt:  NMIIYVHPSKSKKVSQAVLRELGAMLLKFDEKFEGVLLAYE-AKIIDKSAKILSGVHPYFGVTIKAKLLLFSPKPNMLLEGKVVKLRQESIHVIVLGFAS

Query:  AVITDEDI-RDEFKHRTKHGDEMFVSRAHKHHVLKVGTMVRFLVKSFDEE--ILHISGSLVPS
        A I  + I +D         +E    + +  ++L+ G  + F+V     E  +  + G+L  S
Subjt:  AVITDEDI-RDEFKHRTKHGDEMFVSRAHKHHVLKVGTMVRFLVKSFDEE--ILHISGSLVPS

Arabidopsis top hitse value%identityAlignment
AT1G75670.1 DNA-directed RNA polymerases9.7e-6356.7Show/hide
Query:  MEGLKVSDANMIIYVHPSKSKKVSQAVLRELGAMLLKFDEKFEGVLLAYEAKIIDKSAKILSGVHPYFGVTIKAKLLLFSPKPNMLLEGKVVKLRQESIH
        MEGLK+S+A ++I++HPS+S+ V Q + REL ++L +++E F+GVLLAY+A +  K AKIL+G+HPYFGV +  +LLLF PKP   +EGK+VK+  ESIH
Subjt:  MEGLKVSDANMIIYVHPSKSKKVSQAVLRELGAMLLKFDEKFEGVLLAYEAKIIDKSAKILSGVHPYFGVTIKAKLLLFSPKPNMLLEGKVVKLRQESIH

Query:  VIVLGFASAVITDEDIRDEFKHRTKHGDEMFVSRAHKHHVLKVGTMVRFLVKSFDEEILHISGSLVPSHTGSIHWLEKNSVEGSVTNRSRKKTR
        VIVLGF++AVITD DIR+EFK+R + G+  FVSR+HK H LK+GTM+R  V+SFDEE++HI+GSL+P +TG + WLEK S E   T+R  K+ +
Subjt:  VIVLGFASAVITDEDIRDEFKHRTKHGDEMFVSRAHKHHVLKVGTMVRFLVKSFDEEILHISGSLVPSHTGSIHWLEKNSVEGSVTNRSRKKTR

AT1G75670.2 DNA-directed RNA polymerases9.7e-6356.7Show/hide
Query:  MEGLKVSDANMIIYVHPSKSKKVSQAVLRELGAMLLKFDEKFEGVLLAYEAKIIDKSAKILSGVHPYFGVTIKAKLLLFSPKPNMLLEGKVVKLRQESIH
        MEGLK+S+A ++I++HPS+S+ V Q + REL ++L +++E F+GVLLAY+A +  K AKIL+G+HPYFGV +  +LLLF PKP   +EGK+VK+  ESIH
Subjt:  MEGLKVSDANMIIYVHPSKSKKVSQAVLRELGAMLLKFDEKFEGVLLAYEAKIIDKSAKILSGVHPYFGVTIKAKLLLFSPKPNMLLEGKVVKLRQESIH

Query:  VIVLGFASAVITDEDIRDEFKHRTKHGDEMFVSRAHKHHVLKVGTMVRFLVKSFDEEILHISGSLVPSHTGSIHWLEKNSVEGSVTNRSRKKTR
        VIVLGF++AVITD DIR+EFK+R + G+  FVSR+HK H LK+GTM+R  V+SFDEE++HI+GSL+P +TG + WLEK S E   T+R  K+ +
Subjt:  VIVLGFASAVITDEDIRDEFKHRTKHGDEMFVSRAHKHHVLKVGTMVRFLVKSFDEEILHISGSLVPSHTGSIHWLEKNSVEGSVTNRSRKKTR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAGGGGCTAAAGGTTTCGGACGCCAATATGATTATTTACGTTCACCCATCGAAAAGTAAGAAGGTTTCGCAAGCGGTGCTTCGGGAGCTTGGTGCTATGCTTCTGAA
ATTTGATGAAAAATTTGAAGGTGTGCTACTGGCTTATGAGGCCAAAATTATTGATAAAAGTGCGAAGATTCTATCTGGAGTACATCCCTATTTTGGCGTGACAATAAAGG
CAAAGCTATTACTTTTCTCTCCAAAACCAAACATGCTTTTAGAGGGAAAGGTGGTGAAGCTTAGGCAAGAATCGATCCATGTTATTGTCTTAGGTTTTGCTTCTGCTGTA
ATAACCGATGAAGACATTCGCGATGAATTCAAGCATAGAACAAAACATGGAGATGAAATGTTTGTCAGCAGAGCTCACAAGCACCATGTGTTAAAGGTTGGGACGATGGT
ACGATTTTTGGTGAAGAGCTTTGATGAGGAAATATTGCATATCTCTGGATCTCTAGTTCCATCTCACACGGGGAGCATCCATTGGTTGGAGAAGAATTCAGTTGAAGGTT
CGGTAACTAATAGGAGTAGAAAGAAGACGAGAGATAACGACGGAGAATCATTGTTGCAGGATAGTGTTGCCACTGAAGTAAATGCACTTCTCTTGAACGATGACCATCAA
ACTAAAACGAAAAAACAAAAAACTAGCAGAAAATCTTGA
mRNA sequenceShow/hide mRNA sequence
TGTTAATAGACGTTCGTTGTTTTTAACTTCTTCGAGCTCTTTCTTCTCCGGCCGCTGCGGATTCTCCTCCTCCTTGTTTCGGTCGTCGTTTCAATGGAGGGGCTAAAGGT
TTCGGACGCCAATATGATTATTTACGTTCACCCATCGAAAAGTAAGAAGGTTTCGCAAGCGGTGCTTCGGGAGCTTGGTGCTATGCTTCTGAAATTTGATGAAAAATTTG
AAGGTGTGCTACTGGCTTATGAGGCCAAAATTATTGATAAAAGTGCGAAGATTCTATCTGGAGTACATCCCTATTTTGGCGTGACAATAAAGGCAAAGCTATTACTTTTC
TCTCCAAAACCAAACATGCTTTTAGAGGGAAAGGTGGTGAAGCTTAGGCAAGAATCGATCCATGTTATTGTCTTAGGTTTTGCTTCTGCTGTAATAACCGATGAAGACAT
TCGCGATGAATTCAAGCATAGAACAAAACATGGAGATGAAATGTTTGTCAGCAGAGCTCACAAGCACCATGTGTTAAAGGTTGGGACGATGGTACGATTTTTGGTGAAGA
GCTTTGATGAGGAAATATTGCATATCTCTGGATCTCTAGTTCCATCTCACACGGGGAGCATCCATTGGTTGGAGAAGAATTCAGTTGAAGGTTCGGTAACTAATAGGAGT
AGAAAGAAGACGAGAGATAACGACGGAGAATCATTGTTGCAGGATAGTGTTGCCACTGAAGTAAATGCACTTCTCTTGAACGATGACCATCAAACTAAAACGAAAAAACA
AAAAACTAGCAGAAAATCTTGAAGACTGCTAATTATCACACAACACAGATCGTTTTTGCATCAGGGTGTTGATGATTCAAAGCAGAGATCCTCATTGTTTTCCTATTGTA
AGAAATGCTTGGGAGAGAGACGCACCTTTCACTTGGGAATATCACCTCAACTTTGATAATGACATAACATTATATCAGGGTATGGATAAATGCAGCTTCCCATTTTTGTA
ACTGTCAAGAAGAAATAGCCCAAATCCATGTATTATGGGCTTAGGCCCAATAGACATTGGTGAGTGAATGATTTTTTCCAGCCCAAATCTTGAGTTCAATATGCAGTAGG
TAGAGAAGAAAGAGATGACAGATGTCATCAACCTAGACTTTCTGAAAAATGATATGAGATTTGGACATTATATTTCTTAATTTCATTCAAAATCACACACAACACAAAAC
AACACCACACCAGACCGTTGGTTTGTCAATCATTATAATGAGGATATGATTTGGGTTTCGCAAGTACACTCGATTTGCACCGTTGAC
Protein sequenceShow/hide protein sequence
MEGLKVSDANMIIYVHPSKSKKVSQAVLRELGAMLLKFDEKFEGVLLAYEAKIIDKSAKILSGVHPYFGVTIKAKLLLFSPKPNMLLEGKVVKLRQESIHVIVLGFASAV
ITDEDIRDEFKHRTKHGDEMFVSRAHKHHVLKVGTMVRFLVKSFDEEILHISGSLVPSHTGSIHWLEKNSVEGSVTNRSRKKTRDNDGESLLQDSVATEVNALLLNDDHQ
TKTKKQKTSRKS