; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

ClCG05G018480 (gene) of Watermelon (Charleston Gray) v2.5 genome

Gene IDClCG05G018480
OrganismCitrullus lanatus subsp. vulgaris cv. Charleston Gray (Watermelon (Charleston Gray) v2.5)
DescriptionNucleic acid-binding, OB-fold containing protein
Genome locationCG_Chr05:30753303..30755679
RNA-Seq ExpressionClCG05G018480
SyntenyClCG05G018480
Gene Ontology termsGO:0006352 - DNA-templated transcription, initiation (biological process)
GO:0005736 - RNA polymerase I complex (cellular component)
GO:0003899 - DNA-directed 5'-3' RNA polymerase activity (molecular function)
InterPro domainsIPR036898 - RNA polymerase Rpb7-like, N-terminal domain superfamily
IPR045113 - RNA polymerase Rpb7-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008466499.1 PREDICTED: uncharacterized protein LOC103503892 isoform X1 [Cucumis melo]7.4e-10588.89Show/hide
Query:  MEGLKVSDANMVIYVHPSKSKKVSQAVLRELGAMLLKFDEKFEGVLLAYEAKIVDKNGKILSGVHPYFGVTIKAKLLLFSPKPNMLLEGKVVKLRQESIH
        MEGLKVSDANMV+YVHPSKSKKVSQAVLRELGAMLLKFDEKFEGVLLAYEAKI+DKN KILSGVHPYFGVTIKAKLLLFSPKPNMLLEGKVVKLRQESIH
Subjt:  MEGLKVSDANMVIYVHPSKSKKVSQAVLRELGAMLLKFDEKFEGVLLAYEAKIVDKNGKILSGVHPYFGVTIKAKLLLFSPKPNMLLEGKVVKLRQESIH

Query:  VIVLGFASAVITDKDIRDEFKHRTKHGEEMFVSRAHKHHVIKVGTMIQFMVKSFDEEILHISGSLGPSYTGSIHWLEKNLVEGKKQNSRTSKKKTREKEG
        VIVLGFASAVIT +DIRDEFKHRTKHGEEMFVSRAHKHHVIKVGTMI+F+VKSFDEEILHISGSL PS+TGSIHWLEKN VEG    S +SKKK RE EG
Subjt:  VIVLGFASAVITDKDIRDEFKHRTKHGEEMFVSRAHKHHVIKVGTMIQFMVKSFDEEILHISGSLGPSYTGSIHWLEKNLVEGKKQNSRTSKKKTREKEG

Query:  EVMLQDSFATDPNARILNNDHQSKTKKQKTTRIS
         VMLQD+  TD NA IL NDHQ KTKKQKTTRIS
Subjt:  EVMLQDSFATDPNARILNNDHQSKTKKQKTTRIS

XP_008466500.1 PREDICTED: uncharacterized protein LOC103503892 isoform X2 [Cucumis melo]2.8e-10488.46Show/hide
Query:  MEGLKVSDANMVIYVHPSKSKKVSQAVLRELGAMLLKFDEKFEGVLLAYEAKIVDKNGKILSGVHPYFGVTIKAKLLLFSPKPNMLLEGKVVKLRQESIH
        MEGLKVSDANMV+YVHPSKSKKVSQAVLRELGAMLLKFDEKFEGVLLAYEAKI+DKN KILSGVHPYFGVTIKAKLLLFSPKPNMLLEGKVVKLRQESIH
Subjt:  MEGLKVSDANMVIYVHPSKSKKVSQAVLRELGAMLLKFDEKFEGVLLAYEAKIVDKNGKILSGVHPYFGVTIKAKLLLFSPKPNMLLEGKVVKLRQESIH

Query:  VIVLGFASAVITDKDIRDEFKHRTKHGEEMFVSRAHKHHVIKVGTMIQFMVKSFDEEILHISGSLGPSYTGSIHWLEKNLVEGKKQNSRTSKKKTREKEG
        VIVLGFASAVIT +DIRDEFKHRTKHGEEMFVSRAHKHHVIKVGTMI+F+VKSFDEEILHISGSL PS+TGSIHWLEKN VEG   +   SKKK RE EG
Subjt:  VIVLGFASAVITDKDIRDEFKHRTKHGEEMFVSRAHKHHVIKVGTMIQFMVKSFDEEILHISGSLGPSYTGSIHWLEKNLVEGKKQNSRTSKKKTREKEG

Query:  EVMLQDSFATDPNARILNNDHQSKTKKQKTTRIS
         VMLQD+  TD NA IL NDHQ KTKKQKTTRIS
Subjt:  EVMLQDSFATDPNARILNNDHQSKTKKQKTTRIS

XP_011652444.1 uncharacterized protein LOC101216589 [Cucumis sativus]6.3e-10486.75Show/hide
Query:  MEGLKVSDANMVIYVHPSKSKKVSQAVLRELGAMLLKFDEKFEGVLLAYEAKIVDKNGKILSGVHPYFGVTIKAKLLLFSPKPNMLLEGKVVKLRQESIH
        MEGLKVSDANMV+YVHPSKSKK+SQAVLRELGAMLLKFDEKFEGVLLAYEAKI+DKN KILSGVHPYFGVTIKAKLLLFSPKPNMLLEGKVVKLRQESIH
Subjt:  MEGLKVSDANMVIYVHPSKSKKVSQAVLRELGAMLLKFDEKFEGVLLAYEAKIVDKNGKILSGVHPYFGVTIKAKLLLFSPKPNMLLEGKVVKLRQESIH

Query:  VIVLGFASAVITDKDIRDEFKHRTKHGEEMFVSRAHKHHVIKVGTMIQFMVKSFDEEILHISGSLGPSYTGSIHWLEKNLVEGKKQNSRTSKKKTREKEG
        VIVLGFASAVIT +DIRDEFKHRTKHGEEMFVSRAHKHH+IKVGTMI+ +VKSFDEEILHISGSL PS+TGSIHWLEKN VEG   +   SK+K RE EG
Subjt:  VIVLGFASAVITDKDIRDEFKHRTKHGEEMFVSRAHKHHVIKVGTMIQFMVKSFDEEILHISGSLGPSYTGSIHWLEKNLVEGKKQNSRTSKKKTREKEG

Query:  EVMLQDSFATDPNARILNNDHQSKTKKQKTTRIS
         V LQDS  TD NA ILNNDHQ KTKKQK TRIS
Subjt:  EVMLQDSFATDPNARILNNDHQSKTKKQKTTRIS

XP_022148351.1 uncharacterized protein LOC111016757 [Momordica charantia]2.8e-10485.47Show/hide
Query:  MEGLKVSDANMVIYVHPSKSKKVSQAVLRELGAMLLKFDEKFEGVLLAYEAKIVDKNGKILSGVHPYFGVTIKAKLLLFSPKPNMLLEGKVVKLRQESIH
        MEGLKVSDAN+V+YVHPSKSKKVSQAVLRELGAMLLKFDE+FEGVLLAY+AKI DK+ KILSGVHPYFGVT+KAKLLLFSPKPNMLLEGKVVKLRQES+H
Subjt:  MEGLKVSDANMVIYVHPSKSKKVSQAVLRELGAMLLKFDEKFEGVLLAYEAKIVDKNGKILSGVHPYFGVTIKAKLLLFSPKPNMLLEGKVVKLRQESIH

Query:  VIVLGFASAVITDKDIRDEFKHRTKHGEEMFVSRAHKHHVIKVGTMIQFMVKSFDEEILHISGSLGPSYTGSIHWLEKNLVEGKKQNSRTSKKKTREKEG
        VIVLGFASA ITD+DIRDEFKHRTKH EEMFVSRAHKHHVIKVGTMI+F+VKSFDEEILHISGSL PS+TGSIHWLEKN +EG   + R  +KKTR+ EG
Subjt:  VIVLGFASAVITDKDIRDEFKHRTKHGEEMFVSRAHKHHVIKVGTMIQFMVKSFDEEILHISGSLGPSYTGSIHWLEKNLVEGKKQNSRTSKKKTREKEG

Query:  EVMLQDSFATDPNARILNNDHQSKTKKQKTTRIS
        E +LQDS ATD NA ILNNDHQSKTKKQKT+RIS
Subjt:  EVMLQDSFATDPNARILNNDHQSKTKKQKTTRIS

XP_038898978.1 uncharacterized protein LOC120086413 isoform X3 [Benincasa hispida]3.9e-10688.46Show/hide
Query:  MEGLKVSDANMVIYVHPSKSKKVSQAVLRELGAMLLKFDEKFEGVLLAYEAKIVDKNGKILSGVHPYFGVTIKAKLLLFSPKPNMLLEGKVVKLRQESIH
        M+GLKVSDANMV+YVHPSKSKKVSQAVLR LGAMLLKFDEKFEGVLLAYEAKI+DK  KILSGVHPYFGVTIKAKLLLFSPKPNML+EGKVVKLRQESIH
Subjt:  MEGLKVSDANMVIYVHPSKSKKVSQAVLRELGAMLLKFDEKFEGVLLAYEAKIVDKNGKILSGVHPYFGVTIKAKLLLFSPKPNMLLEGKVVKLRQESIH

Query:  VIVLGFASAVITDKDIRDEFKHRTKHGEEMFVSRAHKHHVIKVGTMIQFMVKSFDEEILHISGSLGPSYTGSIHWLEKNLVEGKKQNSRTSKKKTREKEG
        VIVLGFASAVIT++DIRDEFKHRTKHGEEMFVSRAHKHHVIKVGTMI+F+VKSFDEEILHI+GSL PS+TGSIHWLEKN VEG   +S  SKKK RE EG
Subjt:  VIVLGFASAVITDKDIRDEFKHRTKHGEEMFVSRAHKHHVIKVGTMIQFMVKSFDEEILHISGSLGPSYTGSIHWLEKNLVEGKKQNSRTSKKKTREKEG

Query:  EVMLQDSFATDPNARILNNDHQSKTKKQKTTRIS
        EVMLQDS AT+ NA ILNNDHQSKTKKQKTTRIS
Subjt:  EVMLQDSFATDPNARILNNDHQSKTKKQKTTRIS

TrEMBL top hitse value%identityAlignment
A0A0A0LGG8 Uncharacterized protein3.0e-10486.75Show/hide
Query:  MEGLKVSDANMVIYVHPSKSKKVSQAVLRELGAMLLKFDEKFEGVLLAYEAKIVDKNGKILSGVHPYFGVTIKAKLLLFSPKPNMLLEGKVVKLRQESIH
        MEGLKVSDANMV+YVHPSKSKK+SQAVLRELGAMLLKFDEKFEGVLLAYEAKI+DKN KILSGVHPYFGVTIKAKLLLFSPKPNMLLEGKVVKLRQESIH
Subjt:  MEGLKVSDANMVIYVHPSKSKKVSQAVLRELGAMLLKFDEKFEGVLLAYEAKIVDKNGKILSGVHPYFGVTIKAKLLLFSPKPNMLLEGKVVKLRQESIH

Query:  VIVLGFASAVITDKDIRDEFKHRTKHGEEMFVSRAHKHHVIKVGTMIQFMVKSFDEEILHISGSLGPSYTGSIHWLEKNLVEGKKQNSRTSKKKTREKEG
        VIVLGFASAVIT +DIRDEFKHRTKHGEEMFVSRAHKHH+IKVGTMI+ +VKSFDEEILHISGSL PS+TGSIHWLEKN VEG   +   SK+K RE EG
Subjt:  VIVLGFASAVITDKDIRDEFKHRTKHGEEMFVSRAHKHHVIKVGTMIQFMVKSFDEEILHISGSLGPSYTGSIHWLEKNLVEGKKQNSRTSKKKTREKEG

Query:  EVMLQDSFATDPNARILNNDHQSKTKKQKTTRIS
         V LQDS  TD NA ILNNDHQ KTKKQK TRIS
Subjt:  EVMLQDSFATDPNARILNNDHQSKTKKQKTTRIS

A0A1S3CRF8 uncharacterized protein LOC103503892 isoform X21.4e-10488.46Show/hide
Query:  MEGLKVSDANMVIYVHPSKSKKVSQAVLRELGAMLLKFDEKFEGVLLAYEAKIVDKNGKILSGVHPYFGVTIKAKLLLFSPKPNMLLEGKVVKLRQESIH
        MEGLKVSDANMV+YVHPSKSKKVSQAVLRELGAMLLKFDEKFEGVLLAYEAKI+DKN KILSGVHPYFGVTIKAKLLLFSPKPNMLLEGKVVKLRQESIH
Subjt:  MEGLKVSDANMVIYVHPSKSKKVSQAVLRELGAMLLKFDEKFEGVLLAYEAKIVDKNGKILSGVHPYFGVTIKAKLLLFSPKPNMLLEGKVVKLRQESIH

Query:  VIVLGFASAVITDKDIRDEFKHRTKHGEEMFVSRAHKHHVIKVGTMIQFMVKSFDEEILHISGSLGPSYTGSIHWLEKNLVEGKKQNSRTSKKKTREKEG
        VIVLGFASAVIT +DIRDEFKHRTKHGEEMFVSRAHKHHVIKVGTMI+F+VKSFDEEILHISGSL PS+TGSIHWLEKN VEG   +   SKKK RE EG
Subjt:  VIVLGFASAVITDKDIRDEFKHRTKHGEEMFVSRAHKHHVIKVGTMIQFMVKSFDEEILHISGSLGPSYTGSIHWLEKNLVEGKKQNSRTSKKKTREKEG

Query:  EVMLQDSFATDPNARILNNDHQSKTKKQKTTRIS
         VMLQD+  TD NA IL NDHQ KTKKQKTTRIS
Subjt:  EVMLQDSFATDPNARILNNDHQSKTKKQKTTRIS

A0A1S4E5I5 uncharacterized protein LOC103503892 isoform X13.6e-10588.89Show/hide
Query:  MEGLKVSDANMVIYVHPSKSKKVSQAVLRELGAMLLKFDEKFEGVLLAYEAKIVDKNGKILSGVHPYFGVTIKAKLLLFSPKPNMLLEGKVVKLRQESIH
        MEGLKVSDANMV+YVHPSKSKKVSQAVLRELGAMLLKFDEKFEGVLLAYEAKI+DKN KILSGVHPYFGVTIKAKLLLFSPKPNMLLEGKVVKLRQESIH
Subjt:  MEGLKVSDANMVIYVHPSKSKKVSQAVLRELGAMLLKFDEKFEGVLLAYEAKIVDKNGKILSGVHPYFGVTIKAKLLLFSPKPNMLLEGKVVKLRQESIH

Query:  VIVLGFASAVITDKDIRDEFKHRTKHGEEMFVSRAHKHHVIKVGTMIQFMVKSFDEEILHISGSLGPSYTGSIHWLEKNLVEGKKQNSRTSKKKTREKEG
        VIVLGFASAVIT +DIRDEFKHRTKHGEEMFVSRAHKHHVIKVGTMI+F+VKSFDEEILHISGSL PS+TGSIHWLEKN VEG    S +SKKK RE EG
Subjt:  VIVLGFASAVITDKDIRDEFKHRTKHGEEMFVSRAHKHHVIKVGTMIQFMVKSFDEEILHISGSLGPSYTGSIHWLEKNLVEGKKQNSRTSKKKTREKEG

Query:  EVMLQDSFATDPNARILNNDHQSKTKKQKTTRIS
         VMLQD+  TD NA IL NDHQ KTKKQKTTRIS
Subjt:  EVMLQDSFATDPNARILNNDHQSKTKKQKTTRIS

A0A5D3E6A0 Putative DNA-directed RNA polymerase I subunit RPA433.6e-10588.89Show/hide
Query:  MEGLKVSDANMVIYVHPSKSKKVSQAVLRELGAMLLKFDEKFEGVLLAYEAKIVDKNGKILSGVHPYFGVTIKAKLLLFSPKPNMLLEGKVVKLRQESIH
        MEGLKVSDANMV+YVHPSKSKKVSQAVLRELGAMLLKFDEKFEGVLLAYEAKI+DKN KILSGVHPYFGVTIKAKLLLFSPKPNMLLEGKVVKLRQESIH
Subjt:  MEGLKVSDANMVIYVHPSKSKKVSQAVLRELGAMLLKFDEKFEGVLLAYEAKIVDKNGKILSGVHPYFGVTIKAKLLLFSPKPNMLLEGKVVKLRQESIH

Query:  VIVLGFASAVITDKDIRDEFKHRTKHGEEMFVSRAHKHHVIKVGTMIQFMVKSFDEEILHISGSLGPSYTGSIHWLEKNLVEGKKQNSRTSKKKTREKEG
        VIVLGFASAVIT +DIRDEFKHRTKHGEEMFVSRAHKHHVIKVGTMI+F+VKSFDEEILHISGSL PS+TGSIHWLEKN VEG    S +SKKK RE EG
Subjt:  VIVLGFASAVITDKDIRDEFKHRTKHGEEMFVSRAHKHHVIKVGTMIQFMVKSFDEEILHISGSLGPSYTGSIHWLEKNLVEGKKQNSRTSKKKTREKEG

Query:  EVMLQDSFATDPNARILNNDHQSKTKKQKTTRIS
         VMLQD+  TD NA IL NDHQ KTKKQKTTRIS
Subjt:  EVMLQDSFATDPNARILNNDHQSKTKKQKTTRIS

A0A6J1D5Y9 uncharacterized protein LOC1110167571.4e-10485.47Show/hide
Query:  MEGLKVSDANMVIYVHPSKSKKVSQAVLRELGAMLLKFDEKFEGVLLAYEAKIVDKNGKILSGVHPYFGVTIKAKLLLFSPKPNMLLEGKVVKLRQESIH
        MEGLKVSDAN+V+YVHPSKSKKVSQAVLRELGAMLLKFDE+FEGVLLAY+AKI DK+ KILSGVHPYFGVT+KAKLLLFSPKPNMLLEGKVVKLRQES+H
Subjt:  MEGLKVSDANMVIYVHPSKSKKVSQAVLRELGAMLLKFDEKFEGVLLAYEAKIVDKNGKILSGVHPYFGVTIKAKLLLFSPKPNMLLEGKVVKLRQESIH

Query:  VIVLGFASAVITDKDIRDEFKHRTKHGEEMFVSRAHKHHVIKVGTMIQFMVKSFDEEILHISGSLGPSYTGSIHWLEKNLVEGKKQNSRTSKKKTREKEG
        VIVLGFASA ITD+DIRDEFKHRTKH EEMFVSRAHKHHVIKVGTMI+F+VKSFDEEILHISGSL PS+TGSIHWLEKN +EG   + R  +KKTR+ EG
Subjt:  VIVLGFASAVITDKDIRDEFKHRTKHGEEMFVSRAHKHHVIKVGTMIQFMVKSFDEEILHISGSLGPSYTGSIHWLEKNLVEGKKQNSRTSKKKTREKEG

Query:  EVMLQDSFATDPNARILNNDHQSKTKKQKTTRIS
        E +LQDS ATD NA ILNNDHQSKTKKQKT+RIS
Subjt:  EVMLQDSFATDPNARILNNDHQSKTKKQKTTRIS

SwissProt top hitse value%identityAlignment
O43036 DNA-directed RNA polymerase I subunit rpa434.8e-0623.31Show/hide
Query:  NMVIYVHPSKSKKVSQAVLRELGAMLLKFDEKFEGVLLAYE-AKIVDKNGKILSGVHPYFGVTIKAKLLLFSPKPNMLLEGKVVKLRQESIHVIVLGFAS
        ++ + + P  S+    A+   + +M+L    +  G++LAY+  + ++K+ K++    P+  + ++  +L+FSPK    LEGK+  +    I +++LG  +
Subjt:  NMVIYVHPSKSKKVSQAVLRELGAMLLKFDEKFEGVLLAYE-AKIVDKNGKILSGVHPYFGVTIKAKLLLFSPKPNMLLEGKVVKLRQESIHVIVLGFAS

Query:  AVITDKDI-RDEFKHRTKHGEEMFVSRAHKHHVIKVGTMIQFMVKSFDEE--ILHISGSLGPS
        A I  K I +D         EE    + +  ++++ G  ++F+V     E  +  + G+L  S
Subjt:  AVITDKDI-RDEFKHRTKHGEEMFVSRAHKHHVIKVGTMIQFMVKSFDEE--ILHISGSLGPS

Arabidopsis top hitse value%identityAlignment
AT1G75670.1 DNA-directed RNA polymerases5.0e-5954.12Show/hide
Query:  MEGLKVSDANMVIYVHPSKSKKVSQAVLRELGAMLLKFDEKFEGVLLAYEAKIVDKNGKILSGVHPYFGVTIKAKLLLFSPKPNMLLEGKVVKLRQESIH
        MEGLK+S+A ++I++HPS+S+ V Q + REL ++L +++E F+GVLLAY+A +  K  KIL+G+HPYFGV +  +LLLF PKP   +EGK+VK+  ESIH
Subjt:  MEGLKVSDANMVIYVHPSKSKKVSQAVLRELGAMLLKFDEKFEGVLLAYEAKIVDKNGKILSGVHPYFGVTIKAKLLLFSPKPNMLLEGKVVKLRQESIH

Query:  VIVLGFASAVITDKDIRDEFKHRTKHGEEMFVSRAHKHHVIKVGTMIQFMVKSFDEEILHISGSLGPSYTGSIHWLEKNLVEGKKQNSRTSKKK
        VIVLGF++AVITD DIR+EFK+R + GE  FVSR+HK H +K+GTM++  V+SFDEE++HI+GSL P  TG + WLEK   E    +    ++K
Subjt:  VIVLGFASAVITDKDIRDEFKHRTKHGEEMFVSRAHKHHVIKVGTMIQFMVKSFDEEILHISGSLGPSYTGSIHWLEKNLVEGKKQNSRTSKKK

AT1G75670.2 DNA-directed RNA polymerases5.0e-5954.12Show/hide
Query:  MEGLKVSDANMVIYVHPSKSKKVSQAVLRELGAMLLKFDEKFEGVLLAYEAKIVDKNGKILSGVHPYFGVTIKAKLLLFSPKPNMLLEGKVVKLRQESIH
        MEGLK+S+A ++I++HPS+S+ V Q + REL ++L +++E F+GVLLAY+A +  K  KIL+G+HPYFGV +  +LLLF PKP   +EGK+VK+  ESIH
Subjt:  MEGLKVSDANMVIYVHPSKSKKVSQAVLRELGAMLLKFDEKFEGVLLAYEAKIVDKNGKILSGVHPYFGVTIKAKLLLFSPKPNMLLEGKVVKLRQESIH

Query:  VIVLGFASAVITDKDIRDEFKHRTKHGEEMFVSRAHKHHVIKVGTMIQFMVKSFDEEILHISGSLGPSYTGSIHWLEKNLVEGKKQNSRTSKKK
        VIVLGF++AVITD DIR+EFK+R + GE  FVSR+HK H +K+GTM++  V+SFDEE++HI+GSL P  TG + WLEK   E    +    ++K
Subjt:  VIVLGFASAVITDKDIRDEFKHRTKHGEEMFVSRAHKHHVIKVGTMIQFMVKSFDEEILHISGSLGPSYTGSIHWLEKNLVEGKKQNSRTSKKK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAGGGGCTCAAGGTTTCGGATGCTAATATGGTTATTTACGTTCACCCATCCAAAAGTAAGAAGGTTTCGCAAGCGGTTCTTCGAGAGCTCGGCGCTATGCTTTTGAA
ATTTGATGAAAAATTTGAAGGCGTGCTACTGGCTTATGAAGCCAAAATTGTTGATAAAAATGGGAAGATTCTATCTGGAGTGCATCCCTATTTTGGTGTGACAATAAAGG
CAAAGCTATTACTTTTCTCTCCAAAACCAAATATGCTATTAGAGGGAAAGGTGGTGAAGCTTAGGCAAGAATCAATCCATGTTATTGTCTTAGGTTTTGCTTCTGCTGTA
ATAACCGATAAAGACATTCGCGACGAATTCAAGCATAGAACAAAACATGGAGAAGAAATGTTTGTCAGCAGAGCTCACAAGCACCATGTAATAAAGGTTGGGACAATGAT
ACAATTTATGGTGAAGAGTTTTGATGAGGAAATATTGCATATCTCTGGATCTCTAGGTCCATCTTATACAGGGAGCATCCATTGGTTGGAGAAGAATTTGGTTGAAGGCA
AGAAACAAAATTCTAGGACCAGTAAAAAGAAGACGAGAGAAAAAGAGGGAGAGGTGATGTTGCAGGATAGCTTTGCCACGGATCCAAATGCACGTATCTTGAACAATGAC
CATCAGTCAAAAACCAAAAAGCAAAAAACTACCAGAATATCTTGA
mRNA sequenceShow/hide mRNA sequence
ATGGAGGGGCTCAAGGTTTCGGATGCTAATATGGTTATTTACGTTCACCCATCCAAAAGTAAGAAGGTTTCGCAAGCGGTTCTTCGAGAGCTCGGCGCTATGCTTTTGAA
ATTTGATGAAAAATTTGAAGGCGTGCTACTGGCTTATGAAGCCAAAATTGTTGATAAAAATGGGAAGATTCTATCTGGAGTGCATCCCTATTTTGGTGTGACAATAAAGG
CAAAGCTATTACTTTTCTCTCCAAAACCAAATATGCTATTAGAGGGAAAGGTGGTGAAGCTTAGGCAAGAATCAATCCATGTTATTGTCTTAGGTTTTGCTTCTGCTGTA
ATAACCGATAAAGACATTCGCGACGAATTCAAGCATAGAACAAAACATGGAGAAGAAATGTTTGTCAGCAGAGCTCACAAGCACCATGTAATAAAGGTTGGGACAATGAT
ACAATTTATGGTGAAGAGTTTTGATGAGGAAATATTGCATATCTCTGGATCTCTAGGTCCATCTTATACAGGGAGCATCCATTGGTTGGAGAAGAATTTGGTTGAAGGCA
AGAAACAAAATTCTAGGACCAGTAAAAAGAAGACGAGAGAAAAAGAGGGAGAGGTGATGTTGCAGGATAGCTTTGCCACGGATCCAAATGCACGTATCTTGAACAATGAC
CATCAGTCAAAAACCAAAAAGCAAAAAACTACCAGAATATCTTGA
Protein sequenceShow/hide protein sequence
MEGLKVSDANMVIYVHPSKSKKVSQAVLRELGAMLLKFDEKFEGVLLAYEAKIVDKNGKILSGVHPYFGVTIKAKLLLFSPKPNMLLEGKVVKLRQESIHVIVLGFASAV
ITDKDIRDEFKHRTKHGEEMFVSRAHKHHVIKVGTMIQFMVKSFDEEILHISGSLGPSYTGSIHWLEKNLVEGKKQNSRTSKKKTREKEGEVMLQDSFATDPNARILNND
HQSKTKKQKTTRIS