CuGenDBv2

CaUC05G093640 (gene) of Watermelon (USVL246-FR2) v1 genome

Gene ID: CaUC05G093640
Organism: Citrullus amarus (Watermelon (USVL246-FR2) v1)
Description: U11/U12 small nuclear ribonucleoprotein 59 kDa protein isoform X1
Genome location: Ciama_Chr05:20683859..20687888
RNA-Seq Expression: CaUC05G093640
Synteny: CaUC05G093640
Gene Ontology terms: GO:0016021 - integral component of membrane (cellular component)
InterPro domains: NA
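
The genome location above follows a `seqid:start..end` pattern. As a minimal sketch, the snippet below splits such a string into its parts and derives the span length. The function name `parse_location` is hypothetical, and treating the coordinates as 1-based and inclusive is an assumption, not something this record states.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'seqid:start..end' string into (seqid, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    seqid, start, end = match.groups()
    return seqid, int(start), int(end)


seqid, start, end = parse_location("Ciama_Chr05:20683859..20687888")
# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end - start + 1
print(seqid, start, end, span)
```

Under that assumption the gene spans 4030 bp on Ciama_Chr05.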


Homology
GenBank top hits (e value, % identity, alignment)
XP_008437978.1 PREDICTED: U11/U12 small nuclear ribonucleoprotein 59 kDa protein isoform X1 [Cucumis melo]; e value 1.2e-96; identity 88.36%
Query:  FAPQPPWLPVSPPNPPL--SCSSFWDNLNVRDRLRDLQETLDLAKSMQKELEMLMIIKEAKSSEQSADNLPNGSSISEFSAYLKDRRINLELQELRSVEV
        F PQ PW PV PPNPPL  S SSFWDNLNVRDRLRDLQETLDLAKSMQKELEMLM+IKE K SEQSAD+LPNGSSI EF  YLKDRRI+LELQE RSVE 
Subjt:  FAPQPPWLPVSPPNPPL--SCSSFWDNLNVRDRLRDLQETLDLAKSMQKELEMLMIIKEAKSSEQSADNLPNGSSISEFSAYLKDRRINLELQELRSVEV

Query:  AAALMKRLRAQLHPFSLATDESSLWEAVRLSSKLLKAKRNKQWRKRKRKRIAESLAKERESFDQVDQEADEWRAREIAKDIAKRKVEKMKEIARIKAKEE
        AA LM++LRAQL PF L TDESSLWEAVRLS KL KAKRNKQWRKRKRKRIAESLAKERESFDQVD EADEWRAREIAKDIAKRKVEKMKEIARIKAKEE
Subjt:  AAALMKRLRAQLHPFSLATDESSLWEAVRLSSKLLKAKRNKQWRKRKRKRIAESLAKERESFDQVDQEADEWRAREIAKDIAKRKVEKMKEIARIKAKEE

Query:  KKKLDSDLELALIVEKLQELRSIRIQKLKKQG
        KKKLDS+LELALIVEKLQELRSIRIQKLKKQG
Subjt:  KKKLDSDLELALIVEKLQELRSIRIQKLKKQG

XP_016899131.1 PREDICTED: U11/U12 small nuclear ribonucleoprotein 59 kDa protein isoform X2 [Cucumis melo]; e value 1.2e-96; identity 88.36%
Query:  FAPQPPWLPVSPPNPPL--SCSSFWDNLNVRDRLRDLQETLDLAKSMQKELEMLMIIKEAKSSEQSADNLPNGSSISEFSAYLKDRRINLELQELRSVEV
        F PQ PW PV PPNPPL  S SSFWDNLNVRDRLRDLQETLDLAKSMQKELEMLM+IKE K SEQSAD+LPNGSSI EF  YLKDRRI+LELQE RSVE 
Subjt:  FAPQPPWLPVSPPNPPL--SCSSFWDNLNVRDRLRDLQETLDLAKSMQKELEMLMIIKEAKSSEQSADNLPNGSSISEFSAYLKDRRINLELQELRSVEV

Query:  AAALMKRLRAQLHPFSLATDESSLWEAVRLSSKLLKAKRNKQWRKRKRKRIAESLAKERESFDQVDQEADEWRAREIAKDIAKRKVEKMKEIARIKAKEE
        AA LM++LRAQL PF L TDESSLWEAVRLS KL KAKRNKQWRKRKRKRIAESLAKERESFDQVD EADEWRAREIAKDIAKRKVEKMKEIARIKAKEE
Subjt:  AAALMKRLRAQLHPFSLATDESSLWEAVRLSSKLLKAKRNKQWRKRKRKRIAESLAKERESFDQVDQEADEWRAREIAKDIAKRKVEKMKEIARIKAKEE

Query:  KKKLDSDLELALIVEKLQELRSIRIQKLKKQG
        KKKLDS+LELALIVEKLQELRSIRIQKLKKQG
Subjt:  KKKLDSDLELALIVEKLQELRSIRIQKLKKQG

XP_031737698.1 U11/U12 small nuclear ribonucleoprotein 59 kDa protein isoform X2 [Cucumis sativus]; e value 1.2e-93; identity 84.62%
Query:  FAPQPPWLPVSPPNPPLSCSS----FWDNLNVRDRLRDLQETLDLAKSMQKELEMLMIIKEAKSSEQSADNLPNGSSISEFSAYLKDRRINLELQELRSV
        F PQ PW PV PPNPP S SS    FWDNLNVRDRLR+LQ+TL+LAKSMQKELEMLM++KEAK SEQS D+LPNGSSI EF  YL+DRRI+LELQE RSV
Subjt:  FAPQPPWLPVSPPNPPLSCSS----FWDNLNVRDRLRDLQETLDLAKSMQKELEMLMIIKEAKSSEQSADNLPNGSSISEFSAYLKDRRINLELQELRSV

Query:  EVAAALMKRLRAQLHPFSLATDESSLWEAVRLSSKLLKAKRNKQWRKRKRKRIAESLAKERESFDQVDQEADEWRAREIAKDIAKRKVEKMKEIARIKAK
        E AA LM++LRAQLHPF L TDESSLWEAVRLS KL KAKRNKQWRKRKRKR+AESLAKERESFDQVD EADEWRAREIAKDIAKRKVEKM EIARIKAK
Subjt:  EVAAALMKRLRAQLHPFSLATDESSLWEAVRLSSKLLKAKRNKQWRKRKRKRIAESLAKERESFDQVDQEADEWRAREIAKDIAKRKVEKMKEIARIKAK

Query:  EEKKKLDSDLELALIVEKLQELRSIRIQKLKKQG
        EEKKKLDS+LELALIVEKLQELRSIRIQKLKKQG
Subjt:  EEKKKLDSDLELALIVEKLQELRSIRIQKLKKQG

XP_038877454.1 U11/U12 small nuclear ribonucleoprotein 59 kDa protein isoform X1 [Benincasa hispida]; e value 5.5e-102; identity 91.77%
Query:  FAPQPPWLPVSPPN-PPLSCSSFWDNLNVRDRLRDLQETLDLAKSMQKELEMLMIIKEAKSSEQSADNLPNGSSISEFSAYLKDRRINLELQELRSVEVA
        FAPQPPW PV PPN PP SCSSFWDNLNVRDRLRDLQ+TLDLAKSMQKELEMLMI+KE  SSEQSADNLPNGSSI EFS YLKDRRIN ELQE RSVE+A
Subjt:  FAPQPPWLPVSPPN-PPLSCSSFWDNLNVRDRLRDLQETLDLAKSMQKELEMLMIIKEAKSSEQSADNLPNGSSISEFSAYLKDRRINLELQELRSVEVA

Query:  AALMKRLRAQLHPFSLATDESSLWEAVRLSSKLLKAKRNKQWRKRKRKRIAESLAKERESFDQVDQEADEWRAREIAKDIAKRKVEKMKEIARIKAKEEK
        AALMKRLRAQLHPFSLATDESSL EAV+LS KLLKAKRNKQWRK+KRKRIAESLAKERESFDQVDQEAD+WRAREIAKDIAKRKVEKMKEIARIKAKEEK
Subjt:  AALMKRLRAQLHPFSLATDESSLWEAVRLSSKLLKAKRNKQWRKRKRKRIAESLAKERESFDQVDQEADEWRAREIAKDIAKRKVEKMKEIARIKAKEEK

Query:  KKLDSDLELALIVEKLQELRSIRIQKLKKQG
        KKLDS+LELALIVEKLQELRSIRIQKLKKQG
Subjt:  KKLDSDLELALIVEKLQELRSIRIQKLKKQG

XP_038877459.1 U11/U12 small nuclear ribonucleoprotein 59 kDa protein isoform X2 [Benincasa hispida]; e value 5.5e-102; identity 91.77%
Query:  FAPQPPWLPVSPPN-PPLSCSSFWDNLNVRDRLRDLQETLDLAKSMQKELEMLMIIKEAKSSEQSADNLPNGSSISEFSAYLKDRRINLELQELRSVEVA
        FAPQPPW PV PPN PP SCSSFWDNLNVRDRLRDLQ+TLDLAKSMQKELEMLMI+KE  SSEQSADNLPNGSSI EFS YLKDRRIN ELQE RSVE+A
Subjt:  FAPQPPWLPVSPPN-PPLSCSSFWDNLNVRDRLRDLQETLDLAKSMQKELEMLMIIKEAKSSEQSADNLPNGSSISEFSAYLKDRRINLELQELRSVEVA

Query:  AALMKRLRAQLHPFSLATDESSLWEAVRLSSKLLKAKRNKQWRKRKRKRIAESLAKERESFDQVDQEADEWRAREIAKDIAKRKVEKMKEIARIKAKEEK
        AALMKRLRAQLHPFSLATDESSL EAV+LS KLLKAKRNKQWRK+KRKRIAESLAKERESFDQVDQEAD+WRAREIAKDIAKRKVEKMKEIARIKAKEEK
Subjt:  AALMKRLRAQLHPFSLATDESSLWEAVRLSSKLLKAKRNKQWRKRKRKRIAESLAKERESFDQVDQEADEWRAREIAKDIAKRKVEKMKEIARIKAKEEK

Query:  KKLDSDLELALIVEKLQELRSIRIQKLKKQG
        KKLDS+LELALIVEKLQELRSIRIQKLKKQG
Subjt:  KKLDSDLELALIVEKLQELRSIRIQKLKKQG

TrEMBL top hits (e value, % identity, alignment)
A0A0A0L417 Uncharacterized protein; e value 5.9e-94; identity 84.62%
Query:  FAPQPPWLPVSPPNPPLSCSS----FWDNLNVRDRLRDLQETLDLAKSMQKELEMLMIIKEAKSSEQSADNLPNGSSISEFSAYLKDRRINLELQELRSV
        F PQ PW PV PPNPP S SS    FWDNLNVRDRLR+LQ+TL+LAKSMQKELEMLM++KEAK SEQS D+LPNGSSI EF  YL+DRRI+LELQE RSV
Subjt:  FAPQPPWLPVSPPNPPLSCSS----FWDNLNVRDRLRDLQETLDLAKSMQKELEMLMIIKEAKSSEQSADNLPNGSSISEFSAYLKDRRINLELQELRSV

Query:  EVAAALMKRLRAQLHPFSLATDESSLWEAVRLSSKLLKAKRNKQWRKRKRKRIAESLAKERESFDQVDQEADEWRAREIAKDIAKRKVEKMKEIARIKAK
        E AA LM++LRAQLHPF L TDESSLWEAVRLS KL KAKRNKQWRKRKRKR+AESLAKERESFDQVD EADEWRAREIAKDIAKRKVEKM EIARIKAK
Subjt:  EVAAALMKRLRAQLHPFSLATDESSLWEAVRLSSKLLKAKRNKQWRKRKRKRIAESLAKERESFDQVDQEADEWRAREIAKDIAKRKVEKMKEIARIKAK

Query:  EEKKKLDSDLELALIVEKLQELRSIRIQKLKKQG
        EEKKKLDS+LELALIVEKLQELRSIRIQKLKKQG
Subjt:  EEKKKLDSDLELALIVEKLQELRSIRIQKLKKQG

A0A1S3AVD0 U11/U12 small nuclear ribonucleoprotein 59 kDa protein isoform X1; e value 5.7e-97; identity 88.36%
Query:  FAPQPPWLPVSPPNPPL--SCSSFWDNLNVRDRLRDLQETLDLAKSMQKELEMLMIIKEAKSSEQSADNLPNGSSISEFSAYLKDRRINLELQELRSVEV
        F PQ PW PV PPNPPL  S SSFWDNLNVRDRLRDLQETLDLAKSMQKELEMLM+IKE K SEQSAD+LPNGSSI EF  YLKDRRI+LELQE RSVE 
Subjt:  FAPQPPWLPVSPPNPPL--SCSSFWDNLNVRDRLRDLQETLDLAKSMQKELEMLMIIKEAKSSEQSADNLPNGSSISEFSAYLKDRRINLELQELRSVEV

Query:  AAALMKRLRAQLHPFSLATDESSLWEAVRLSSKLLKAKRNKQWRKRKRKRIAESLAKERESFDQVDQEADEWRAREIAKDIAKRKVEKMKEIARIKAKEE
        AA LM++LRAQL PF L TDESSLWEAVRLS KL KAKRNKQWRKRKRKRIAESLAKERESFDQVD EADEWRAREIAKDIAKRKVEKMKEIARIKAKEE
Subjt:  AAALMKRLRAQLHPFSLATDESSLWEAVRLSSKLLKAKRNKQWRKRKRKRIAESLAKERESFDQVDQEADEWRAREIAKDIAKRKVEKMKEIARIKAKEE

Query:  KKKLDSDLELALIVEKLQELRSIRIQKLKKQG
        KKKLDS+LELALIVEKLQELRSIRIQKLKKQG
Subjt:  KKKLDSDLELALIVEKLQELRSIRIQKLKKQG

A0A1S4DT14 U11/U12 small nuclear ribonucleoprotein 59 kDa protein isoform X2; e value 5.7e-97; identity 88.36%
Query:  FAPQPPWLPVSPPNPPL--SCSSFWDNLNVRDRLRDLQETLDLAKSMQKELEMLMIIKEAKSSEQSADNLPNGSSISEFSAYLKDRRINLELQELRSVEV
        F PQ PW PV PPNPPL  S SSFWDNLNVRDRLRDLQETLDLAKSMQKELEMLM+IKE K SEQSAD+LPNGSSI EF  YLKDRRI+LELQE RSVE 
Subjt:  FAPQPPWLPVSPPNPPL--SCSSFWDNLNVRDRLRDLQETLDLAKSMQKELEMLMIIKEAKSSEQSADNLPNGSSISEFSAYLKDRRINLELQELRSVEV

Query:  AAALMKRLRAQLHPFSLATDESSLWEAVRLSSKLLKAKRNKQWRKRKRKRIAESLAKERESFDQVDQEADEWRAREIAKDIAKRKVEKMKEIARIKAKEE
        AA LM++LRAQL PF L TDESSLWEAVRLS KL KAKRNKQWRKRKRKRIAESLAKERESFDQVD EADEWRAREIAKDIAKRKVEKMKEIARIKAKEE
Subjt:  AAALMKRLRAQLHPFSLATDESSLWEAVRLSSKLLKAKRNKQWRKRKRKRIAESLAKERESFDQVDQEADEWRAREIAKDIAKRKVEKMKEIARIKAKEE

Query:  KKKLDSDLELALIVEKLQELRSIRIQKLKKQG
        KKKLDS+LELALIVEKLQELRSIRIQKLKKQG
Subjt:  KKKLDSDLELALIVEKLQELRSIRIQKLKKQG

A0A5D3D0C0 U11/U12 small nuclear ribonucleoprotein 59 kDa protein isoform X1; e value 5.7e-97; identity 88.36%
Query:  FAPQPPWLPVSPPNPPL--SCSSFWDNLNVRDRLRDLQETLDLAKSMQKELEMLMIIKEAKSSEQSADNLPNGSSISEFSAYLKDRRINLELQELRSVEV
        F PQ PW PV PPNPPL  S SSFWDNLNVRDRLRDLQETLDLAKSMQKELEMLM+IKE K SEQSAD+LPNGSSI EF  YLKDRRI+LELQE RSVE 
Subjt:  FAPQPPWLPVSPPNPPL--SCSSFWDNLNVRDRLRDLQETLDLAKSMQKELEMLMIIKEAKSSEQSADNLPNGSSISEFSAYLKDRRINLELQELRSVEV

Query:  AAALMKRLRAQLHPFSLATDESSLWEAVRLSSKLLKAKRNKQWRKRKRKRIAESLAKERESFDQVDQEADEWRAREIAKDIAKRKVEKMKEIARIKAKEE
        AA LM++LRAQL PF L TDESSLWEAVRLS KL KAKRNKQWRKRKRKRIAESLAKERESFDQVD EADEWRAREIAKDIAKRKVEKMKEIARIKAKEE
Subjt:  AAALMKRLRAQLHPFSLATDESSLWEAVRLSSKLLKAKRNKQWRKRKRKRIAESLAKERESFDQVDQEADEWRAREIAKDIAKRKVEKMKEIARIKAKEE

Query:  KKKLDSDLELALIVEKLQELRSIRIQKLKKQG
        KKKLDS+LELALIVEKLQELRSIRIQKLKKQG
Subjt:  KKKLDSDLELALIVEKLQELRSIRIQKLKKQG

A0A6J1IWF2 U11/U12 small nuclear ribonucleoprotein 59 kDa protein isoform X2; e value 1.3e-93; identity 86.96%
Query:  FAPQPPWLPVSPPNPPLSCSSFWDNLNVRDRLRDLQETLDLAKSMQKELEMLMIIKEAKSSEQSADNLPNGSSISEFSAYLKDRRINLELQELRSVEVAA
        FA QPP   + PPN P SCSSFWDNLNVRDRLRDLQ++LDLAKSMQKELEML IIKE+KSSEQSADNLP  SSI EFS YL+D+RI+LELQE RSVEVAA
Subjt:  FAPQPPWLPVSPPNPPLSCSSFWDNLNVRDRLRDLQETLDLAKSMQKELEMLMIIKEAKSSEQSADNLPNGSSISEFSAYLKDRRINLELQELRSVEVAA

Query:  ALMKRLRAQLHPFSLATDESSLWEAVRLSSKLLKAKRNKQWRKRKRKRIAESLAKERESFDQVDQEADEWRAREIAKDIAKRKVEKMKEIARIKAKEEKK
        ALMKRLR QLHPF + TDESSL EAV+LS+KL KAKRNKQWRKRKRKRIAESLAKERESFDQVD+EADEWRAREIAKDIAKRKVEKMKEIARIKAKEEKK
Subjt:  ALMKRLRAQLHPFSLATDESSLWEAVRLSSKLLKAKRNKQWRKRKRKRIAESLAKERESFDQVDQEADEWRAREIAKDIAKRKVEKMKEIARIKAKEEKK

Query:  KLDSDLELALIVEKLQELRSIRIQKLKKQG
        KLDS+LELALIVEKLQELRSIRIQKLKKQG
Subjt:  KLDSDLELALIVEKLQELRSIRIQKLKKQG

SwissProt top hits (e value, % identity, alignment)
Q8VYD3 U11/U12 small nuclear ribonucleoprotein 59 kDa protein; e value 1.3e-53; identity 54.27%
Query:  PQPP--WLPVSPPNPPLSCSSFWDNLNVRDRLRDLQETLDLAKSMQKELEMLMIIKEAKSSEQSADNLPNGSSISEFSAYLKDRRINLELQELRSVEVAA
        P PP  W P+ PP+PP  C  FW+  N+ D+L+ LQ+TL+LAKSM+KELE L +IK+AK    S +N    S +     YL+ R+++L  QE+ SV+ A 
Subjt:  PQPP--WLPVSPPNPPLSCSSFWDNLNVRDRLRDLQETLDLAKSMQKELEMLMIIKEAKSSEQSADNLPNGSSISEFSAYLKDRRINLELQELRSVEVAA

Query:  ALMKRLRAQLHPFSLATDESSLWE----AVRLSSKLLKAKRNKQWRKRKRKRIAESLAKERESFDQVDQEADEWRAREIAKDIAKRKVEKMKEIARIKAK
        +LM  LRAQL PF    DE+S WE    AVRL+ K+ K+ RNK W+KRKR+  AE  AKE E F+Q D+EADEWR +E+AKD+A RKV++MK I +IKAK
Subjt:  ALMKRLRAQLHPFSLATDESSLWE----AVRLSSKLLKAKRNKQWRKRKRKRIAESLAKERESFDQVDQEADEWRAREIAKDIAKRKVEKMKEIARIKAK

Query:  EEKKKLDSDLELALIVEKLQELRSIRIQKLKKQG
         E+K+L+ +LELALIVE++QELRS+RI+KLKKQG
Subjt:  EEKKKLDSDLELALIVEKLQELRSIRIQKLKKQG

Arabidopsis top hits (e value, % identity, alignment)
AT2G46200.1 unknown protein; e value 9.2e-55; identity 54.27%
Query:  PQPP--WLPVSPPNPPLSCSSFWDNLNVRDRLRDLQETLDLAKSMQKELEMLMIIKEAKSSEQSADNLPNGSSISEFSAYLKDRRINLELQELRSVEVAA
        P PP  W P+ PP+PP  C  FW+  N+ D+L+ LQ+TL+LAKSM+KELE L +IK+AK    S +N    S +     YL+ R+++L  QE+ SV+ A 
Subjt:  PQPP--WLPVSPPNPPLSCSSFWDNLNVRDRLRDLQETLDLAKSMQKELEMLMIIKEAKSSEQSADNLPNGSSISEFSAYLKDRRINLELQELRSVEVAA

Query:  ALMKRLRAQLHPFSLATDESSLWE----AVRLSSKLLKAKRNKQWRKRKRKRIAESLAKERESFDQVDQEADEWRAREIAKDIAKRKVEKMKEIARIKAK
        +LM  LRAQL PF    DE+S WE    AVRL+ K+ K+ RNK W+KRKR+  AE  AKE E F+Q D+EADEWR +E+AKD+A RKV++MK I +IKAK
Subjt:  ALMKRLRAQLHPFSLATDESSLWE----AVRLSSKLLKAKRNKQWRKRKRKRIAESLAKERESFDQVDQEADEWRAREIAKDIAKRKVEKMKEIARIKAK

Query:  EEKKKLDSDLELALIVEKLQELRSIRIQKLKKQG
         E+K+L+ +LELALIVE++QELRS+RI+KLKKQG
Subjt:  EEKKKLDSDLELALIVEKLQELRSIRIQKLKKQG

AT2G46200.2 unknown protein; e value 9.2e-55; identity 54.27%
Query:  PQPP--WLPVSPPNPPLSCSSFWDNLNVRDRLRDLQETLDLAKSMQKELEMLMIIKEAKSSEQSADNLPNGSSISEFSAYLKDRRINLELQELRSVEVAA
        P PP  W P+ PP+PP  C  FW+  N+ D+L+ LQ+TL+LAKSM+KELE L +IK+AK    S +N    S +     YL+ R+++L  QE+ SV+ A 
Subjt:  PQPP--WLPVSPPNPPLSCSSFWDNLNVRDRLRDLQETLDLAKSMQKELEMLMIIKEAKSSEQSADNLPNGSSISEFSAYLKDRRINLELQELRSVEVAA

Query:  ALMKRLRAQLHPFSLATDESSLWE----AVRLSSKLLKAKRNKQWRKRKRKRIAESLAKERESFDQVDQEADEWRAREIAKDIAKRKVEKMKEIARIKAK
        +LM  LRAQL PF    DE+S WE    AVRL+ K+ K+ RNK W+KRKR+  AE  AKE E F+Q D+EADEWR +E+AKD+A RKV++MK I +IKAK
Subjt:  ALMKRLRAQLHPFSLATDESSLWE----AVRLSSKLLKAKRNKQWRKRKRKRIAESLAKERESFDQVDQEADEWRAREIAKDIAKRKVEKMKEIARIKAK

Query:  EEKKKLDSDLELALIVEKLQELRSIRIQKLKKQG
         E+K+L+ +LELALIVE++QELRS+RI+KLKKQG
Subjt:  EEKKKLDSDLELALIVEKLQELRSIRIQKLKKQG


Sequences
CDS sequence
GCCGGACACCCACCTTCTGTTCATTCTTCTTGGCCGGCGAGCATCAGACGGCGACGACATCCACGAGGTCACGTTGACACGAGTGGACACCCTCACCTTGAAGGT
TTTACTCATGAACCCACCTTCTTTCCCTCACTTCTGGCGAGCGTTGTGGCGGTGAAACCCGCGAGTCAGCTCTGCAATGGCAACGCGATTCTCGATGGCAGCTCA
GGTATTCGAATATGTCAGTTTTTGAATTTGGGAATTGTATGGTATTTTGTTGTAGCCAAATTTGGAATTGACGTTTCTGGATTGGAGACGGAAACATTAGCCAAA
TACTCAACGGAACCGAAGCTTATACCCTTCAAATCGACGTTGGTGATGGAGGCCTCTGGCTTCAGCCTGTTTGCACCTCAACCTCCATGGTTACCGGTGTCGCCA
CCGAATCCACCGTTGTCTTGTTCATCTTTCTGGGACAACTTGAATGTGCGTGATCGTTTAAGGGATTTGCAAGAAACTCTTGATCTTGCCAAGTCAATGCAGAAA
GAGCTTGAAATGTTGATGATAATTAAAGAGGCCAAATCATCTGAGCAGAGTGCAGATAATTTGCCTAATGGTTCTTCTATTAGTGAATTTTCTGCTTACTTGAAA
GATAGAAGGATCAATTTGGAGTTGCAGGAATTGCGTTCAGTCGAAGTTGCAGCTGCTTTGATGAAAAGGTTAAGGGCTCAGCTTCATCCATTTAGCCTGGCCACT
GATGAATCGAGTCTCTGGGAAGCAGTTAGGTTATCTAGTAAATTATTGAAGGCCAAGAGAAATAAGCAATGGAGAAAGAGAAAGAGGAAGCGCATTGCTGAATCA
CTTGCAAAGGAGCGTGAAAGTTTTGACCAAGTTGATCAGGAGGCTGATGAATGGAGGGCCAGGGAGATTGCTAAGGATATTGCAAAACGCAAGGTGGAGAAGATG
AAAGAAATTGCAAGAATTAAAGCAAAAGAGGAGAAAAAGAAGTTAGATTCTGATCTTGAACTAGCACTAATAGTGGAAAAATTGCAAGAATTGCGCTCAATCAGG
ATCCAGAAACTGAAGAAACAAGGTTCAATTTCTTACTTCTTCTTTCATAACTCCATTGTTATGTATCTTTCTTAA
mRNA sequence
GCCGGACACCCACCTTCTGTTCATTCTTCTTGGCCGGCGAGCATCAGACGGCGACGACATCCACGAGGTCACGTTGACACGAGTGGACACCCTCACCTTGAAGGT
TTTACTCATGAACCCACCTTCTTTCCCTCACTTCTGGCGAGCGTTGTGGCGGTGAAACCCGCGAGTCAGCTCTGCAATGGCAACGCGATTCTCGATGGCAGCTCA
GGTATTCGAATATGTCAGTTTTTGAATTTGGGAATTGTATGGTATTTTGTTGTAGCCAAATTTGGAATTGACGTTTCTGGATTGGAGACGGAAACATTAGCCAAA
TACTCAACGGAACCGAAGCTTATACCCTTCAAATCGACGTTGGTGATGGAGGCCTCTGGCTTCAGCCTGTTTGCACCTCAACCTCCATGGTTACCGGTGTCGCCA
CCGAATCCACCGTTGTCTTGTTCATCTTTCTGGGACAACTTGAATGTGCGTGATCGTTTAAGGGATTTGCAAGAAACTCTTGATCTTGCCAAGTCAATGCAGAAA
GAGCTTGAAATGTTGATGATAATTAAAGAGGCCAAATCATCTGAGCAGAGTGCAGATAATTTGCCTAATGGTTCTTCTATTAGTGAATTTTCTGCTTACTTGAAA
GATAGAAGGATCAATTTGGAGTTGCAGGAATTGCGTTCAGTCGAAGTTGCAGCTGCTTTGATGAAAAGGTTAAGGGCTCAGCTTCATCCATTTAGCCTGGCCACT
GATGAATCGAGTCTCTGGGAAGCAGTTAGGTTATCTAGTAAATTATTGAAGGCCAAGAGAAATAAGCAATGGAGAAAGAGAAAGAGGAAGCGCATTGCTGAATCA
CTTGCAAAGGAGCGTGAAAGTTTTGACCAAGTTGATCAGGAGGCTGATGAATGGAGGGCCAGGGAGATTGCTAAGGATATTGCAAAACGCAAGGTGGAGAAGATG
AAAGAAATTGCAAGAATTAAAGCAAAAGAGGAGAAAAAGAAGTTAGATTCTGATCTTGAACTAGCACTAATAGTGGAAAAATTGCAAGAATTGCGCTCAATCAGG
ATCCAGAAACTGAAGAAACAAGGTTCAATTTCTTACTTCTTCTTTCATAACTCCATTGTTATGTATCTTTCTTAA
Protein sequence
AGHPPSVHSSWPASIRRRRHPRGHVDTSGHPHLEGFTHEPTFFPSLLASVVAVKPASQLCNGNAILDGSSGIRICQFLNLGIVWYFVVAKFGIDVSGLETETLAK
YSTEPKLIPFKSTLVMEASGFSLFAPQPPWLPVSPPNPPLSCSSFWDNLNVRDRLRDLQETLDLAKSMQKELEMLMIIKEAKSSEQSADNLPNGSSISEFSAYLK
DRRINLELQELRSVEVAAALMKRLRAQLHPFSLATDESSLWEAVRLSSKLLKAKRNKQWRKRKRKRIAESLAKERESFDQVDQEADEWRAREIAKDIAKRKVEKM
KEIARIKAKEEKKKLDSDLELALIVEKLQELRSIRIQKLKKQGSISYFFFHNSIVMYLS