CuGenDBv2

MC02g1081 (gene) of Bitter gourd (Dali-11) v1 genome

Gene ID: MC02g1081
Organism: Momordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
Description: homeobox protein 6
Genome location: MC02:9123633..9125071
RNA-Seq Expression: MC02g1081
Synteny: MC02g1081
Gene Ontology terms: NA
InterPro domains: NA
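The genome location above follows a `chromosome:start..end` pattern. A minimal sketch of splitting such a string and computing the span it covers is shown here; it assumes the coordinates are 1-based and inclusive, which the record does not state, and `parse_location` and `span_length` are hypothetical helper names used only for illustration.

```python
import re

def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    chrom, start, end = match.group(1), int(match.group(2)), int(match.group(3))
    return chrom, start, end

def span_length(start: int, end: int) -> int:
    """Length of the interval, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1

chrom, start, end = parse_location("MC02:9123633..9125071")
length = span_length(start, end)
```

Under that assumption the gene spans 1439 bp on MC02. If the coordinates were 0-based and half-open instead, the `+ 1` would be dropped.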


Homology
GenBank top hits    e value    %identity
KAG6605199.1 hypothetical protein SDJN03_02516, partial [Cucurbita argyrosperma subsp. sororia]    1.47e-77    56.86
Query:  EEEEELSFCDLPVKEEQNPIITAVQIADQDDDDEGFDFKH--RPAVAPAMCAAEEVFFQGHMLPFGRLSFSSSENGLNNNNNNLGRNLWFRSESMDHNVL
        +EEE LS CDLPVKE+Q P++       +D     FDF H   P   P MCAA+++FFQGH+LP  RLS SS     N +N+   + L  RSESMDHN+L
Subjt:  EEEEELSFCDLPVKEEQNPIITAVQIADQDDDDEGFDFKH--RPAVAPAMCAAEEVFFQGHMLPFGRLSFSSSENGLNNNNNNLGRNLWFRSESMDHNVL

Query:  RF--SSSSRSSSTSTSHYSRQCSSISNNSISIPTTSSSKARAEKNNVFHSHPSPTPQIRSFSTSSHRSRCRSSSRSRWHFFRLGLLGTPGVMELHDLKTR
        RF   SSS SS++S SHYSR CSSISNNSISIPT  +SK R + NNVFHSHPSPTPQIRSFST   RSR RSSSR  W FFR+GLL TPG MELHDLKTR
Subjt:  RF--SSSSRSSSTSTSHYSRQCSSISNNSISIPTTSSSKARAEKNNVFHSHPSPTPQIRSFSTSSHRSRCRSSSRSRWHFFRLGLLGTPGVMELHDLKTR

Query:  TTNTMATAVPKPTGTGSFLGVVSCKRSVEAIPAAP-VRSSSRNFQK-----------------------EKEKEKKAKEGEERRVSHRRTFEWLKQLSHA
        TT T A        T SFLGVVSCK+SVE IPAA  +R+ S N  K                       EKEKEK        R+SHRRTFEWLKQLSHA
Subjt:  TTNTMATAVPKPTGTGSFLGVVSCKRSVEAIPAAP-VRSSSRNFQK-----------------------EKEKEKKAKEGEERRVSHRRTFEWLKQLSHA

Query:  TLTADE
        T  AD+
Subjt:  TLTADE

XP_008457772.1 PREDICTED: homeobox protein 6 [Cucumis melo]    3.36e-81    58.9
Query:  EEEEELSFCDLPVKEEQNPI--ITAVQIADQDDDDEGFDFKH-RPAVAPAMCAAEEVFFQGHMLPFGRLSFSSSENGLNNNNNNLGRNLWFRSESMD-HN
        EEEE LS CDLPVKE+Q P   ++A  +  +D     FDF + RP  +P M  A+++FFQGHMLP  RLSFSS EN  NNN N     LW RSESMD +N
Subjt:  EEEEELSFCDLPVKEEQNPI--ITAVQIADQDDDDEGFDFKH-RPAVAPAMCAAEEVFFQGHMLPFGRLSFSSSENGLNNNNNNLGRNLWFRSESMD-HN

Query:  VLRFSSSSRSSSTSTSHYSRQCSSISNNSISIPTTSSSKARAEKNNVFHSHPSPTPQIRSFSTSSHRSRCRSSSRSRWHFFRLGLLGTPGVMELHDLKTR
        +LRF + S SSS+S SHYSR  SS+SNNSISIPTT++ K R   NNVFHSHPSPTPQIRSFSTSSHRSR    S SRW FFRLGLL TPG MELHDLKTR
Subjt:  VLRFSSSSRSSSTSTSHYSRQCSSISNNSISIPTTSSSKARAEKNNVFHSHPSPTPQIRSFSTSSHRSRCRSSSRSRWHFFRLGLLGTPGVMELHDLKTR

Query:  TTNTMATAVPKPTGTGSFLGVVSCKRSVEAIPAAPVRSSSR----------NFQKEKEKEKKAKEGEERRVSHRRTFEWLKQLSHATLTADE
        TT T  T +     T S LGVVSCKRSV+ +P     S++R          N  K + +EK+ ++ +ERRVSHRRTFEWLKQLSHAT   ++
Subjt:  TTNTMATAVPKPTGTGSFLGVVSCKRSVEAIPAAPVRSSSR----------NFQKEKEKEKKAKEGEERRVSHRRTFEWLKQLSHATLTADE

XP_011649316.1 homeobox protein 6 [Cucumis sativus]    1.69e-80    58.59
Query:  EEEEELSFCDLPVKEEQNP---IITAVQIADQDDDDEGFDFKH-RPAVAPAMCAAEEVFFQGHMLPFGRLSFSSSENGLNNNNNNLGRNLWFRSESMD-H
        EEEE LS CDLPVKE+Q P   + T V   DQD     FDF H RP  +P M  A+++FFQGHMLP  RLSFSS EN  NNN N     LW RSESMD +
Subjt:  EEEEELSFCDLPVKEEQNP---IITAVQIADQDDDDEGFDFKH-RPAVAPAMCAAEEVFFQGHMLPFGRLSFSSSENGLNNNNNNLGRNLWFRSESMD-H

Query:  NVLRFSSSSRSSSTSTSHYSRQCSSISNNSISIPTTSSSKARAEKNNVFHSHPSPTPQIRSFSTSSHRSRCRSSSRSRWHFFRLGLLGTPGVMELHDLKT
        N+LRF + S SSS+S SHYSR  SS+SNNSISIPT  +SK R   NNVFHSHPSPTPQIRSFSTSSHRSR    S SRW FFRLGLL TPG MELHDLKT
Subjt:  NVLRFSSSSRSSSTSTSHYSRQCSSISNNSISIPTTSSSKARAEKNNVFHSHPSPTPQIRSFSTSSHRSRCRSSSRSRWHFFRLGLLGTPGVMELHDLKT

Query:  RTTNTMATAVPKPTG---TGSFLGVVSCKRSVEAIPAAP-----------VRSSSRNFQKEKEKEKKAKEGEERRVSHRRTFEWLKQLSHATLTADE
        RTT T  T     T    T S LGVVSCKRSVE +P              + ++ +N    K + ++ ++ +ERRVSHRRTFEWLKQLSHAT   ++
Subjt:  RTTNTMATAVPKPTG---TGSFLGVVSCKRSVEAIPAAP-----------VRSSSRNFQKEKEKEKKAKEGEERRVSHRRTFEWLKQLSHATLTADE

XP_022149472.1 uncharacterized protein LOC111017891 [Momordica charantia]    8.14e-189    99.64
Query:  EEEEELSFCDLPVKEEQNPIITAVQIADQDDDDEGFDFKHRPAVAPAMCAAEEVFFQGHMLPFGRLSFSSSENGLNNNNNNLGRNLWFRSESMDHNVLRF
        EEEEELSFCDLPVKEEQNPIITAVQIADQDDDDEGFDFKHRPAVAPAMCAAEEVFFQGHMLPFGRLSFSSSENGLNNNNNNLGRNLWFRSESMDHNVLRF
Subjt:  EEEEELSFCDLPVKEEQNPIITAVQIADQDDDDEGFDFKHRPAVAPAMCAAEEVFFQGHMLPFGRLSFSSSENGLNNNNNNLGRNLWFRSESMDHNVLRF

Query:  SSSSRSSSTSTSHYSRQCSSISNNSISIPTTSSSKARAEKNNVFHSHPSPTPQIRSFSTSSHRSRCRSSSRSRWHFFRLGLLGTPGVMELHDLKTRTTNT
        SSSSRSSSTSTSHYSR CSSISNNSISIPTTSSSKARAEKNNVFHSHPSPTPQIRSFSTSSHRSRCRSSSRSRWHFFRLGLLGTPGVMELHDLKTRTTNT
Subjt:  SSSSRSSSTSTSHYSRQCSSISNNSISIPTTSSSKARAEKNNVFHSHPSPTPQIRSFSTSSHRSRCRSSSRSRWHFFRLGLLGTPGVMELHDLKTRTTNT

Query:  MATAVPKPTGTGSFLGVVSCKRSVEAIPAAPVRSSSRNFQKEKEKEKKAKEGEERRVSHRRTFEWLKQLSHATLTADEP
        MATAVPKPTGTGSFLGVVSCKRSVEAIPAAPVRSSSRNFQKEKEKEKKAKEGEERRVSHRRTFEWLKQLSHATLTADEP
Subjt:  MATAVPKPTGTGSFLGVVSCKRSVEAIPAAPVRSSSRNFQKEKEKEKKAKEGEERRVSHRRTFEWLKQLSHATLTADEP

XP_038902148.1 uncharacterized protein LOC120088781 [Benincasa hispida]    4.01e-86    60.42
Query:  EEEEELSFCDLPVKEEQNPIITAVQIADQDDDDEGFDFKHRPAVAPAMCAAEEVFFQGHMLPFGRLSFSSSENGLNNNNNNLGRNLWFRSESMDHNVLRF
        EEEE LSFCDLPVKE+Q P+ +A    + +D    FDF H     P M AA+E+FFQG MLP  RLSFSS  +  NN ++  G NLW RSESMDHN+LRF
Subjt:  EEEEELSFCDLPVKEEQNPIITAVQIADQDDDDEGFDFKHRPAVAPAMCAAEEVFFQGHMLPFGRLSFSSSENGLNNNNNNLGRNLWFRSESMDHNVLRF

Query:  SSSSRSSSTSTSHYSRQCSSISNNSISIPTTSSSKARAEKNNVFHSHPSPTPQIRSFSTSSHRSRCRSSSRSRWHFFRLGLLGTPGVMELHDLKTRTTNT
         + S SSS+S SHYSR  SS+SNNS+SIPT  +SKAR +KN VFHSHPSPTPQIRSFS SSHRSR    S SRW FFRLGLL TPG MELHDLKTRTT  
Subjt:  SSSSRSSSTSTSHYSRQCSSISNNSISIPTTSSSKARAEKNNVFHSHPSPTPQIRSFSTSSHRSRCRSSSRSRWHFFRLGLLGTPGVMELHDLKTRTTNT

Query:  MATAVPKPTGTGSFLGVVSCKRSVEAIPAAPVRSSSRNFQ----------KEKEKEKKAKEGEERRVSHRRTFEWLKQLSHATLTADE
         A      T T S LGVVSCKRSV+ + A  V  + RN            +EKEKEK+ ++ +ERRVSHRRTFEWLKQLSHAT   ++
Subjt:  MATAVPKPTGTGSFLGVVSCKRSVEAIPAAPVRSSSRNFQ----------KEKEKEKKAKEGEERRVSHRRTFEWLKQLSHATLTADE

TrEMBL top hits    e value    %identity
A0A0A0LPT6 Uncharacterized protein    8.17e-81    58.59
Query:  EEEEELSFCDLPVKEEQNP---IITAVQIADQDDDDEGFDFKH-RPAVAPAMCAAEEVFFQGHMLPFGRLSFSSSENGLNNNNNNLGRNLWFRSESMD-H
        EEEE LS CDLPVKE+Q P   + T V   DQD     FDF H RP  +P M  A+++FFQGHMLP  RLSFSS EN  NNN N     LW RSESMD +
Subjt:  EEEEELSFCDLPVKEEQNP---IITAVQIADQDDDDEGFDFKH-RPAVAPAMCAAEEVFFQGHMLPFGRLSFSSSENGLNNNNNNLGRNLWFRSESMD-H

Query:  NVLRFSSSSRSSSTSTSHYSRQCSSISNNSISIPTTSSSKARAEKNNVFHSHPSPTPQIRSFSTSSHRSRCRSSSRSRWHFFRLGLLGTPGVMELHDLKT
        N+LRF + S SSS+S SHYSR  SS+SNNSISIPT  +SK R   NNVFHSHPSPTPQIRSFSTSSHRSR    S SRW FFRLGLL TPG MELHDLKT
Subjt:  NVLRFSSSSRSSSTSTSHYSRQCSSISNNSISIPTTSSSKARAEKNNVFHSHPSPTPQIRSFSTSSHRSRCRSSSRSRWHFFRLGLLGTPGVMELHDLKT

Query:  RTTNTMATAVPKPTG---TGSFLGVVSCKRSVEAIPAAP-----------VRSSSRNFQKEKEKEKKAKEGEERRVSHRRTFEWLKQLSHATLTADE
        RTT T  T     T    T S LGVVSCKRSVE +P              + ++ +N    K + ++ ++ +ERRVSHRRTFEWLKQLSHAT   ++
Subjt:  RTTNTMATAVPKPTG---TGSFLGVVSCKRSVEAIPAAP-----------VRSSSRNFQKEKEKEKKAKEGEERRVSHRRTFEWLKQLSHATLTADE

A0A1S3C7L1 homeobox protein 6    1.63e-81    58.9
Query:  EEEEELSFCDLPVKEEQNPI--ITAVQIADQDDDDEGFDFKH-RPAVAPAMCAAEEVFFQGHMLPFGRLSFSSSENGLNNNNNNLGRNLWFRSESMD-HN
        EEEE LS CDLPVKE+Q P   ++A  +  +D     FDF + RP  +P M  A+++FFQGHMLP  RLSFSS EN  NNN N     LW RSESMD +N
Subjt:  EEEEELSFCDLPVKEEQNPI--ITAVQIADQDDDDEGFDFKH-RPAVAPAMCAAEEVFFQGHMLPFGRLSFSSSENGLNNNNNNLGRNLWFRSESMD-HN

Query:  VLRFSSSSRSSSTSTSHYSRQCSSISNNSISIPTTSSSKARAEKNNVFHSHPSPTPQIRSFSTSSHRSRCRSSSRSRWHFFRLGLLGTPGVMELHDLKTR
        +LRF + S SSS+S SHYSR  SS+SNNSISIPTT++ K R   NNVFHSHPSPTPQIRSFSTSSHRSR    S SRW FFRLGLL TPG MELHDLKTR
Subjt:  VLRFSSSSRSSSTSTSHYSRQCSSISNNSISIPTTSSSKARAEKNNVFHSHPSPTPQIRSFSTSSHRSRCRSSSRSRWHFFRLGLLGTPGVMELHDLKTR

Query:  TTNTMATAVPKPTGTGSFLGVVSCKRSVEAIPAAPVRSSSR----------NFQKEKEKEKKAKEGEERRVSHRRTFEWLKQLSHATLTADE
        TT T  T +     T S LGVVSCKRSV+ +P     S++R          N  K + +EK+ ++ +ERRVSHRRTFEWLKQLSHAT   ++
Subjt:  TTNTMATAVPKPTGTGSFLGVVSCKRSVEAIPAAPVRSSSR----------NFQKEKEKEKKAKEGEERRVSHRRTFEWLKQLSHATLTADE

A0A6J1D819 uncharacterized protein LOC111017891    3.94e-189    99.64
Query:  EEEEELSFCDLPVKEEQNPIITAVQIADQDDDDEGFDFKHRPAVAPAMCAAEEVFFQGHMLPFGRLSFSSSENGLNNNNNNLGRNLWFRSESMDHNVLRF
        EEEEELSFCDLPVKEEQNPIITAVQIADQDDDDEGFDFKHRPAVAPAMCAAEEVFFQGHMLPFGRLSFSSSENGLNNNNNNLGRNLWFRSESMDHNVLRF
Subjt:  EEEEELSFCDLPVKEEQNPIITAVQIADQDDDDEGFDFKHRPAVAPAMCAAEEVFFQGHMLPFGRLSFSSSENGLNNNNNNLGRNLWFRSESMDHNVLRF

Query:  SSSSRSSSTSTSHYSRQCSSISNNSISIPTTSSSKARAEKNNVFHSHPSPTPQIRSFSTSSHRSRCRSSSRSRWHFFRLGLLGTPGVMELHDLKTRTTNT
        SSSSRSSSTSTSHYSR CSSISNNSISIPTTSSSKARAEKNNVFHSHPSPTPQIRSFSTSSHRSRCRSSSRSRWHFFRLGLLGTPGVMELHDLKTRTTNT
Subjt:  SSSSRSSSTSTSHYSRQCSSISNNSISIPTTSSSKARAEKNNVFHSHPSPTPQIRSFSTSSHRSRCRSSSRSRWHFFRLGLLGTPGVMELHDLKTRTTNT

Query:  MATAVPKPTGTGSFLGVVSCKRSVEAIPAAPVRSSSRNFQKEKEKEKKAKEGEERRVSHRRTFEWLKQLSHATLTADEP
        MATAVPKPTGTGSFLGVVSCKRSVEAIPAAPVRSSSRNFQKEKEKEKKAKEGEERRVSHRRTFEWLKQLSHATLTADEP
Subjt:  MATAVPKPTGTGSFLGVVSCKRSVEAIPAAPVRSSSRNFQKEKEKEKKAKEGEERRVSHRRTFEWLKQLSHATLTADEP

A0A6J1G7B7 uncharacterized protein LOC111451408    4.65e-75    56.48
Query:  EEEEELSFCDLPVKEEQNPIITAVQIADQDDDDEGFDFKH--RPAVAPAMCAAEEVFFQGHMLPFGRLSFSSSENGLNNNNNNLGRNLWFRSESMDHNVL
        +EEE LS CDLPVKE+Q P      + +++   E FDF H   P   P MCAA+++FFQGH+LP  RLS SS     N +N+   + L  RSESMDHN+L
Subjt:  EEEEELSFCDLPVKEEQNPIITAVQIADQDDDDEGFDFKH--RPAVAPAMCAAEEVFFQGHMLPFGRLSFSSSENGLNNNNNNLGRNLWFRSESMDHNVL

Query:  RF--SSSSRSSSTSTSHYSRQCSSISNNSISIPTTSSSKARAEKNNVFHSHPSPTPQIRSFSTSSHRSRCRSSSRSRWHFFRLGLLGTPGVMELHDLKTR
        RF   SSS SS++S SHYSR CSSISNNSISIPT  +SK R + NNVFHSHPSPTPQIRSFST   RSR RSSSR  W FFR+GLL TPG MELHDLKTR
Subjt:  RF--SSSSRSSSTSTSHYSRQCSSISNNSISIPTTSSSKARAEKNNVFHSHPSPTPQIRSFSTSSHRSRCRSSSRSRWHFFRLGLLGTPGVMELHDLKTR

Query:  TTNTMATAVPKPTGTGSFLGVVSCKRSVEAIPAAP-VRSSSRNFQK-----------------------EKEKEKKAKEGEERRVSHRRTFEWLKQLSHA
        TT + A        T SFLGVVSCK+SVE IPAA  +++ S N  K                       EKEKEK        R+SHRRTFEWLKQLSHA
Subjt:  TTNTMATAVPKPTGTGSFLGVVSCKRSVEAIPAAP-VRSSSRNFQK-----------------------EKEKEKKAKEGEERRVSHRRTFEWLKQLSHA

Query:  T
        T
Subjt:  T

A0A6J1KZ54 uncharacterized protein LOC111499574    6.03e-73    55.48
Query:  EEEEELSFCDLPVKEEQNPIITAVQIADQDDDDEGFDFKHRPAVAPAMCAAEEVFFQGHMLPFGRLSFSSSENGLNNNNNNLGRNLWFRSESMDHNVLRF
        EEEE LS CDLPVKE+Q P++      D++   E FDF H P   P MCAA+++FFQGH+LP  RLS SS     N +N+   + L  RSESMDHN+LRF
Subjt:  EEEEELSFCDLPVKEEQNPIITAVQIADQDDDDEGFDFKHRPAVAPAMCAAEEVFFQGHMLPFGRLSFSSSENGLNNNNNNLGRNLWFRSESMDHNVLRF

Query:  -SSSSRSSSTSTSHYSRQCSSISNNSISIPTTSSSKARAEKNNVFHSHPSPTPQIRSFSTSSHRSRCRSSSRSRWHFFRLGLLGTPGVMELHDLKTRTTN
         + SS SS++S SHYSR CSSISNNSISIPT  +SK R + NNVFHSHPSPTPQIRSFST   R      S SRW FFR+GLL TPG MELHDLKTRTTN
Subjt:  -SSSSRSSSTSTSHYSRQCSSISNNSISIPTTSSSKARAEKNNVFHSHPSPTPQIRSFSTSSHRSRCRSSSRSRWHFFRLGLLGTPGVMELHDLKTRTTN

Query:  TMATAVPKPTGT---GSFLGVVSCKRSVEAIPAAP-VRSSSRNFQK-----------------------EKEKEKKAKEGEERRVSHRRTFEWLKQLSHA
          A       G    G+FLGVVSCK+SV+ IPAA  +++ S N  K                       EKEKEK        R+SHRRTFEWLKQLSHA
Subjt:  TMATAVPKPTGT---GSFLGVVSCKRSVEAIPAAP-VRSSSRNFQK-----------------------EKEKEKKAKEGEERRVSHRRTFEWLKQLSHA

Query:  T
        T
Subjt:  T

SwissProt top hits    e value    %identity
No hits found
Arabidopsis top hits    e value    %identity
AT5G67350.1 unknown protein    4.8e-08    29.97
Query:  EEEEELSFCDLPVKEEQNPIITAVQIADQDDDDEGFDF---------KHRPAVAPAMCAAEEVFFQGHMLPFGRLSFSSSENGLNNNNNNLGRNLWFRSE
        EEEE LS CDLP ++ +   + ++   + ++ D GF+F               AP M  A+E+FF+G +LP       S + GLN         L  RSE
Subjt:  EEEEELSFCDLPVKEEQNPIITAVQIADQDDDDEGFDF---------KHRPAVAPAMCAAEEVFFQGHMLPFGRLSFSSSENGLNNNNNNLGRNLWFRSE

Query:  SMDHNVLRFSSSSRSSSTSTSHYSRQCSSISNNSISIPTTSSSKARAEKNNVFHSHPSPTPQIRSFSTSSHR--SRCRSSSRSRWHFFRLGLLGTPGVME
        S++        S R               I NN I                  +S PSP PQIR  S+ + R  S     S S W F RLGL+ TP +  
Subjt:  SMDHNVLRFSSSSRSSSTSTSHYSRQCSSISNNSISIPTTSSSKARAEKNNVFHSHPSPTPQIRSFSTSSHR--SRCRSSSRSRWHFFRLGLLGTPGVME

Query:  LHDLKTRTTN-------------TMATAVPKPTGTG---------SFLGVVSCKRSVEA-IPAAPVR---SSSRNFQKEKEKEKK-AKEGEERRVSHRRT
          +L+T   N             T  ++  K  G+G         SF+    CK SV      APV+   SS    +K++  EKK AK+ E+  ++ +RT
Subjt:  LHDLKTRTTN-------------TMATAVPKPTGTG---------SFLGVVSCKRSVEA-IPAAPVR---SSSRNFQKEKEKEKK-AKEGEERRVSHRRT

Query:  FEWLKQL
        FEWL Q+
Subjt:  FEWLKQL


Sequences
CDS sequence
GAAGAAGAAGAAGAATTATCTTTCTGCGACCTTCCTGTAAAAGAGGAGCAAAATCCCATCATTACAGCCGTCCAAATAGCTGATCAAGATGATGATGATGAAGGGTTCGA
TTTCAAGCACCGGCCGGCGGTGGCGCCGGCGATGTGCGCGGCGGAGGAGGTGTTCTTCCAAGGCCATATGCTCCCATTTGGACGACTCTCCTTCAGTAGCTCTGAAAATG
GTTTGAATAATAATAATAATAATTTAGGGAGAAATTTGTGGTTCAGATCGGAGTCTATGGATCATAATGTGTTGAGGTTCAGCAGCAGCAGCCGGAGTAGCAGCACTTCT
ACGAGCCATTATTCCAGGCAATGTTCCAGCATTAGCAACAACTCAATTTCCATTCCAACGACGAGCAGCTCAAAAGCAAGAGCTGAGAAGAACAACGTTTTCCACTCCCA
CCCAAGTCCCACGCCCCAAATCAGATCCTTCTCAACTTCCAGTCACCGGAGCCGGTGTCGAAGCTCCTCCCGCTCCCGGTGGCACTTTTTCCGGCTCGGTCTTCTCGGAA
CGCCAGGAGTAATGGAACTTCACGATCTCAAAACTCGCACCACCAACACCATGGCCACGGCGGTGCCGAAACCAACCGGTACCGGGTCGTTTCTGGGTGTGGTGAGCTGC
AAAAGATCCGTCGAGGCAATCCCGGCGGCGCCTGTAAGGAGCAGCAGTCGCAATTTTCAAAAGGAAAAGGAAAAGGAAAAAAAGGCAAAGGAAGGAGAGGAGAGGAGGGT
GTCACATCGTAGAACATTTGAATGGCTAAAGCAGCTCTCGCATGCAACCTTGACTGCTGACGAGCCTTAG
mRNA sequence
GAAGAAGAAGAAGAATTATCTTTCTGCGACCTTCCTGTAAAAGAGGAGCAAAATCCCATCATTACAGCCGTCCAAATAGCTGATCAAGATGATGATGATGAAGGGTTCGA
TTTCAAGCACCGGCCGGCGGTGGCGCCGGCGATGTGCGCGGCGGAGGAGGTGTTCTTCCAAGGCCATATGCTCCCATTTGGACGACTCTCCTTCAGTAGCTCTGAAAATG
GTTTGAATAATAATAATAATAATTTAGGGAGAAATTTGTGGTTCAGATCGGAGTCTATGGATCATAATGTGTTGAGGTTCAGCAGCAGCAGCCGGAGTAGCAGCACTTCT
ACGAGCCATTATTCCAGGCAATGTTCCAGCATTAGCAACAACTCAATTTCCATTCCAACGACGAGCAGCTCAAAAGCAAGAGCTGAGAAGAACAACGTTTTCCACTCCCA
CCCAAGTCCCACGCCCCAAATCAGATCCTTCTCAACTTCCAGTCACCGGAGCCGGTGTCGAAGCTCCTCCCGCTCCCGGTGGCACTTTTTCCGGCTCGGTCTTCTCGGAA
CGCCAGGAGTAATGGAACTTCACGATCTCAAAACTCGCACCACCAACACCATGGCCACGGCGGTGCCGAAACCAACCGGTACCGGGTCGTTTCTGGGTGTGGTGAGCTGC
AAAAGATCCGTCGAGGCAATCCCGGCGGCGCCTGTAAGGAGCAGCAGTCGCAATTTTCAAAAGGAAAAGGAAAAGGAAAAAAAGGCAAAGGAAGGAGAGGAGAGGAGGGT
GTCACATCGTAGAACATTTGAATGGCTAAAGCAGCTCTCGCATGCAACCTTGACTGCTGACGAGCCTTAGGGACTCCTTTTTCTTTTACTCATCAACTATGTGCTCTCTC
TCGCATGCCTTGTAGTGATGTTTAATCATTTGACTGTTTCTTCTTTTCCAACTTTTTCTTTGTTTAGTACATTTGACTTAGGAGACTTTGTTATATATATATATATATAC
ACAAATTAAAATCGAAAGGGTATGTGTTTGATTTCTATTTAGTCCTTGAAATTTCATATACGATGCTTTAGTCTCT
Protein sequence
EEEEELSFCDLPVKEEQNPIITAVQIADQDDDDEGFDFKHRPAVAPAMCAAEEVFFQGHMLPFGRLSFSSSENGLNNNNNNLGRNLWFRSESMDHNVLRFSSSSRSSSTS
TSHYSRQCSSISNNSISIPTTSSSKARAEKNNVFHSHPSPTPQIRSFSTSSHRSRCRSSSRSRWHFFRLGLLGTPGVMELHDLKTRTTNTMATAVPKPTGTGSFLGVVSC
KRSVEAIPAAPVRSSSRNFQKEKEKEKKAKEGEERRVSHRRTFEWLKQLSHATLTADEP
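
The protein sequence above appears to be the frame-1 translation of the CDS under the standard genetic code. A minimal sketch for checking that relationship follows. It assumes the standard code (NCBI translation table 1), which the record does not state explicitly, and `translate` is a hypothetical helper written for illustration rather than part of any CuGenDB tooling.

```python
from itertools import product

# Standard genetic code, with codons enumerated in TCAG order
# (first base varies slowest, third base fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon.

    Codons containing anything other than A/C/G/T become 'X'.
    """
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First ten codons of the CDS shown above.
prefix = translate("GAAGAAGAAGAAGAATTATCTTTCTGCGAC")
# Last six codons of the CDS shown above, including the TAG stop.
suffix = translate("ACTGCTGACGAGCCTTAG")
```

Here `prefix` is `EEEEELSFCD` and `suffix` is `TADEP`, matching the start and end of the listed protein. Joining the CDS lines into one string and passing it to `translate` should give the full 279-residue sequence if the assumption about the genetic code holds.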