; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MS009096 (gene) of Bitter gourd (TR) v1 genome

Gene IDMS009096
OrganismMomordica charantia cv. TR (Bitter gourd (TR) v1)
Descriptionhomeobox protein 6
Genome locationscaffold687:622118..623357
RNA-Seq ExpressionMS009096
SyntenyMS009096
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6605199.1 hypothetical protein SDJN03_02516, partial [Cucurbita argyrosperma subsp. sororia]1.3e-5855.26Show/hide
Query:  NEEEEEELSFCDLPVKEEQNPIITAVQIADQDDDDEGFDFKH--RPAVAPAMCAAEEVFFQGHMLPFGRLSFSSSENGLNNNNNNLGRNLWFRSESMDHN
        +E+EEE LS CDLPVKE+Q P+     + +++   E FDF H   P   P MCAA+++FFQGH+LP  RLS SS     N +N+   + L  RSESMDHN
Subjt:  NEEEEEELSFCDLPVKEEQNPIITAVQIADQDDDDEGFDFKH--RPAVAPAMCAAEEVFFQGHMLPFGRLSFSSSENGLNNNNNNLGRNLWFRSESMDHN

Query:  VLRF--SSSSRSSSTSTSHYSRSLPIFSALNNSISIPTTSSSKARAEKNNVFHSHPSPTPQIRSFSTSSHRSRCRSSSRSRWHFFRLGLLRTPGVMELHD
        +LRF   SSS SS++S SHYSR     S  NNSISIPT S  +    +NNVFHSHPSPTPQIRSFST   RSR RSS  SRW FFR+GLLRTPG MELHD
Subjt:  VLRF--SSSSRSSSTSTSHYSRSLPIFSALNNSISIPTTSSSKARAEKNNVFHSHPSPTPQIRSFSTSSHRSRCRSSSRSRWHFFRLGLLRTPGVMELHD

Query:  LKTRTTNTTATAAPKPTGTGSFLGVVSCKRSVEAIPAA-PARSSSRNFQKEKEK----------------EKKAKEGEE-RRVSHRRTFEWLKQLSHATL
        LKTRTT T A        T SFLGVVSCK+SVE IPAA   R+ S N  K++ +                E + KE E+  R+SHRRTFEWLKQLSHAT 
Subjt:  LKTRTTNTTATAAPKPTGTGSFLGVVSCKRSVEAIPAA-PARSSSRNFQKEKEK----------------EKKAKEGEE-RRVSHRRTFEWLKQLSHATL

Query:  AADE
         AD+
Subjt:  AADE

XP_008457772.1 PREDICTED: homeobox protein 6 [Cucumis melo]2.7e-6458.5Show/hide
Query:  NEEEEEELSFCDLPVKEEQNPIITAVQIADQDDDDEGFDFKH-RPAVAPAMCAAEEVFFQGHMLPFGRLSFSSSENGLNNNNNNLGRNLWFRSESMD-HN
        ++EEEE LS CDLPVKE+Q P  +      + +D   FDF + RP  +P M  A+++FFQGHMLP  RLSF SSEN  NNN      NLW RSESMD +N
Subjt:  NEEEEEELSFCDLPVKEEQNPIITAVQIADQDDDDEGFDFKH-RPAVAPAMCAAEEVFFQGHMLPFGRLSFSSSENGLNNNNNNLGRNLWFRSESMD-HN

Query:  VLRFSSSSRSSSTSTSHYSRSLPIFSALNNSISIPTTSSSKARAEKNNVFHSHPSPTPQIRSFSTSSHRSRCRSSSRSRWHFFRLGLLRTPGVMELHDLK
        +LRF + S SSS+S SHYSRS    S  NNSISIPTT ++K R   NNVFHSHPSPTPQIRSFSTSSHRSR    S SRW FFRLGLLRTPG MELHDLK
Subjt:  VLRFSSSSRSSSTSTSHYSRSLPIFSALNNSISIPTTSSSKARAEKNNVFHSHPSPTPQIRSFSTSSHRSRCRSSSRSRWHFFRLGLLRTPGVMELHDLK

Query:  TRTTNTTATAAPKPTGTGSFLGVVSCKRSVEAIPAAPARSSSR----------NFQKEKEKEKKAKEGEERRVSHRRTFEWLKQLSHATLAADE
        TRTT TT T       T S LGVVSCKRSV+ +P     S++R          N  K + +EK+ ++ +ERRVSHRRTFEWLKQLSHAT   ++
Subjt:  TRTTNTTATAAPKPTGTGSFLGVVSCKRSVEAIPAAPARSSSR----------NFQKEKEKEKKAKEGEERRVSHRRTFEWLKQLSHATLAADE

XP_011649316.1 homeobox protein 6 [Cucumis sativus]6.0e-6458.67Show/hide
Query:  EEEEEELSFCDLPVKEEQNP---IITAVQIADQDDDDEGFDFKH-RPAVAPAMCAAEEVFFQGHMLPFGRLSFSSSENGLNNNNNNLGRNLWFRSESMD-
        EEEEE LS CDLPVKE+Q P   + T V   DQD     FDF H RP  +P M  A+++FFQGHMLP  RLSF SSEN  NNN      NLW RSESMD 
Subjt:  EEEEEELSFCDLPVKEEQNP---IITAVQIADQDDDDEGFDFKH-RPAVAPAMCAAEEVFFQGHMLPFGRLSFSSSENGLNNNNNNLGRNLWFRSESMD-

Query:  HNVLRFSSSSRSSSTSTSHYSRSLPIFSALNNSISIPTTSSSKARAEKNNVFHSHPSPTPQIRSFSTSSHRSRCRSSSRSRWHFFRLGLLRTPGVMELHD
        +N+LRF + S SSS+S SHYSRS    S  NNSISIPT  +SK R   NNVFHSHPSPTPQIRSFSTSSHRSR    S SRW FFRLGLLRTPG MELHD
Subjt:  HNVLRFSSSSRSSSTSTSHYSRSLPIFSALNNSISIPTTSSSKARAEKNNVFHSHPSPTPQIRSFSTSSHRSRCRSSSRSRWHFFRLGLLRTPGVMELHD

Query:  LKTRTTNTTATAAPKPTG---TGSFLGVVSCKRSVEAIPAAPA-----------RSSSRNFQKEKEKEKKAKEGEERRVSHRRTFEWLKQLSHATLAADE
        LKTRTT TT T     T    T S LGVVSCKRSVE +P                ++ +N    K + ++ ++ +ERRVSHRRTFEWLKQLSHAT   ++
Subjt:  LKTRTTNTTATAAPKPTG---TGSFLGVVSCKRSVEAIPAAPA-----------RSSSRNFQKEKEKEKKAKEGEERRVSHRRTFEWLKQLSHATLAADE

XP_022149472.1 uncharacterized protein LOC111017891 [Momordica charantia]6.0e-14195.47Show/hide
Query:  MARSFNEEEEEELSFCDLPVKEEQNPIITAVQIADQDDDDEGFDFKHRPAVAPAMCAAEEVFFQGHMLPFGRLSFSSSENGLNNNNNNLGRNLWFRSESM
        MARSFN EEEEELSFCDLPVKEEQNPIITAVQIADQDDDDEGFDFKHRPAVAPAMCAAEEVFFQGHMLPFGRLSFSSSENGLNNNNNNLGRNLWFRSESM
Subjt:  MARSFNEEEEEELSFCDLPVKEEQNPIITAVQIADQDDDDEGFDFKHRPAVAPAMCAAEEVFFQGHMLPFGRLSFSSSENGLNNNNNNLGRNLWFRSESM

Query:  DHNVLRFSSSSRSSSTSTSHYSRSLPIFSALNNSISIPTTSSSKARAEKNNVFHSHPSPTPQIRSFSTSSHRSRCRSSSRSRWHFFRLGLLRTPGVMELH
        DHNVLRFSSSSRSSSTSTSHYSR     S  NNSISIPTTSSSKARAEKNNVFHSHPSPTPQIRSFSTSSHRSRCRSSSRSRWHFFRLGLL TPGVMELH
Subjt:  DHNVLRFSSSSRSSSTSTSHYSRSLPIFSALNNSISIPTTSSSKARAEKNNVFHSHPSPTPQIRSFSTSSHRSRCRSSSRSRWHFFRLGLLRTPGVMELH

Query:  DLKTRTTNTTATAAPKPTGTGSFLGVVSCKRSVEAIPAAPARSSSRNFQKEKEKEKKAKEGEERRVSHRRTFEWLKQLSHATLAADE
        DLKTRTTNT ATA PKPTGTGSFLGVVSCKRSVEAIPAAP RSSSRNFQKEKEKEKKAKEGEERRVSHRRTFEWLKQLSHATL ADE
Subjt:  DLKTRTTNTTATAAPKPTGTGSFLGVVSCKRSVEAIPAAPARSSSRNFQKEKEKEKKAKEGEERRVSHRRTFEWLKQLSHATLAADE

XP_038902148.1 uncharacterized protein LOC120088781 [Benincasa hispida]8.9e-6860.34Show/hide
Query:  EEEEEELSFCDLPVKEEQNPIITAVQIADQDDDDEGFDFKHRPAVAPAMCAAEEVFFQGHMLPFGRLSFSSSENGLNNNNNNLGRNLWFRSESMDHNVLR
        EEEEE LSFCDLPVKE+Q P+ +A    + +D    FDF H     P M AA+E+FFQG MLP  RLSFSS  +  NN ++  G NLW RSESMDHN+LR
Subjt:  EEEEEELSFCDLPVKEEQNPIITAVQIADQDDDDEGFDFKHRPAVAPAMCAAEEVFFQGHMLPFGRLSFSSSENGLNNNNNNLGRNLWFRSESMDHNVLR

Query:  FSSSSRSSSTSTSHYSRSLPIFSALNNSISIPTTSSSKARAEKNNVFHSHPSPTPQIRSFSTSSHRSRCRSSSRSRWHFFRLGLLRTPGVMELHDLKTRT
        F + S SSS+S SHYSRS    S  NNS+SIPT  +SKAR +K NVFHSHPSPTPQIRSFS SSHRSR    S SRW FFRLGLLRTPG MELHDLKTRT
Subjt:  FSSSSRSSSTSTSHYSRSLPIFSALNNSISIPTTSSSKARAEKNNVFHSHPSPTPQIRSFSTSSHRSRCRSSSRSRWHFFRLGLLRTPGVMELHDLKTRT

Query:  TNTTATAAPKPTGTGSFLGVVSCKRSVEAIPAAPARS-SSRNFQK--------EKEKEKKAKEGEERRVSHRRTFEWLKQLSHATLAADE
        T T A A      T S LGVVSCKRSV+ +    A++  + N +K        EKEKEK+ ++ +ERRVSHRRTFEWLKQLSHAT   ++
Subjt:  TNTTATAAPKPTGTGSFLGVVSCKRSVEAIPAAPARS-SSRNFQK--------EKEKEKKAKEGEERRVSHRRTFEWLKQLSHATLAADE

TrEMBL top hitse value%identityAlignment
A0A0A0LPT6 Uncharacterized protein2.9e-6458.67Show/hide
Query:  EEEEEELSFCDLPVKEEQNP---IITAVQIADQDDDDEGFDFKH-RPAVAPAMCAAEEVFFQGHMLPFGRLSFSSSENGLNNNNNNLGRNLWFRSESMD-
        EEEEE LS CDLPVKE+Q P   + T V   DQD     FDF H RP  +P M  A+++FFQGHMLP  RLSF SSEN  NNN      NLW RSESMD 
Subjt:  EEEEEELSFCDLPVKEEQNP---IITAVQIADQDDDDEGFDFKH-RPAVAPAMCAAEEVFFQGHMLPFGRLSFSSSENGLNNNNNNLGRNLWFRSESMD-

Query:  HNVLRFSSSSRSSSTSTSHYSRSLPIFSALNNSISIPTTSSSKARAEKNNVFHSHPSPTPQIRSFSTSSHRSRCRSSSRSRWHFFRLGLLRTPGVMELHD
        +N+LRF + S SSS+S SHYSRS    S  NNSISIPT  +SK R   NNVFHSHPSPTPQIRSFSTSSHRSR    S SRW FFRLGLLRTPG MELHD
Subjt:  HNVLRFSSSSRSSSTSTSHYSRSLPIFSALNNSISIPTTSSSKARAEKNNVFHSHPSPTPQIRSFSTSSHRSRCRSSSRSRWHFFRLGLLRTPGVMELHD

Query:  LKTRTTNTTATAAPKPTG---TGSFLGVVSCKRSVEAIPAAPA-----------RSSSRNFQKEKEKEKKAKEGEERRVSHRRTFEWLKQLSHATLAADE
        LKTRTT TT T     T    T S LGVVSCKRSVE +P                ++ +N    K + ++ ++ +ERRVSHRRTFEWLKQLSHAT   ++
Subjt:  LKTRTTNTTATAAPKPTG---TGSFLGVVSCKRSVEAIPAAPA-----------RSSSRNFQKEKEKEKKAKEGEERRVSHRRTFEWLKQLSHATLAADE

A0A1S3C7L1 homeobox protein 61.3e-6458.5Show/hide
Query:  NEEEEEELSFCDLPVKEEQNPIITAVQIADQDDDDEGFDFKH-RPAVAPAMCAAEEVFFQGHMLPFGRLSFSSSENGLNNNNNNLGRNLWFRSESMD-HN
        ++EEEE LS CDLPVKE+Q P  +      + +D   FDF + RP  +P M  A+++FFQGHMLP  RLSF SSEN  NNN      NLW RSESMD +N
Subjt:  NEEEEEELSFCDLPVKEEQNPIITAVQIADQDDDDEGFDFKH-RPAVAPAMCAAEEVFFQGHMLPFGRLSFSSSENGLNNNNNNLGRNLWFRSESMD-HN

Query:  VLRFSSSSRSSSTSTSHYSRSLPIFSALNNSISIPTTSSSKARAEKNNVFHSHPSPTPQIRSFSTSSHRSRCRSSSRSRWHFFRLGLLRTPGVMELHDLK
        +LRF + S SSS+S SHYSRS    S  NNSISIPTT ++K R   NNVFHSHPSPTPQIRSFSTSSHRSR    S SRW FFRLGLLRTPG MELHDLK
Subjt:  VLRFSSSSRSSSTSTSHYSRSLPIFSALNNSISIPTTSSSKARAEKNNVFHSHPSPTPQIRSFSTSSHRSRCRSSSRSRWHFFRLGLLRTPGVMELHDLK

Query:  TRTTNTTATAAPKPTGTGSFLGVVSCKRSVEAIPAAPARSSSR----------NFQKEKEKEKKAKEGEERRVSHRRTFEWLKQLSHATLAADE
        TRTT TT T       T S LGVVSCKRSV+ +P     S++R          N  K + +EK+ ++ +ERRVSHRRTFEWLKQLSHAT   ++
Subjt:  TRTTNTTATAAPKPTGTGSFLGVVSCKRSVEAIPAAPARSSSR----------NFQKEKEKEKKAKEGEERRVSHRRTFEWLKQLSHATLAADE

A0A6J1D819 uncharacterized protein LOC1110178912.9e-14195.47Show/hide
Query:  MARSFNEEEEEELSFCDLPVKEEQNPIITAVQIADQDDDDEGFDFKHRPAVAPAMCAAEEVFFQGHMLPFGRLSFSSSENGLNNNNNNLGRNLWFRSESM
        MARSFN EEEEELSFCDLPVKEEQNPIITAVQIADQDDDDEGFDFKHRPAVAPAMCAAEEVFFQGHMLPFGRLSFSSSENGLNNNNNNLGRNLWFRSESM
Subjt:  MARSFNEEEEEELSFCDLPVKEEQNPIITAVQIADQDDDDEGFDFKHRPAVAPAMCAAEEVFFQGHMLPFGRLSFSSSENGLNNNNNNLGRNLWFRSESM

Query:  DHNVLRFSSSSRSSSTSTSHYSRSLPIFSALNNSISIPTTSSSKARAEKNNVFHSHPSPTPQIRSFSTSSHRSRCRSSSRSRWHFFRLGLLRTPGVMELH
        DHNVLRFSSSSRSSSTSTSHYSR     S  NNSISIPTTSSSKARAEKNNVFHSHPSPTPQIRSFSTSSHRSRCRSSSRSRWHFFRLGLL TPGVMELH
Subjt:  DHNVLRFSSSSRSSSTSTSHYSRSLPIFSALNNSISIPTTSSSKARAEKNNVFHSHPSPTPQIRSFSTSSHRSRCRSSSRSRWHFFRLGLLRTPGVMELH

Query:  DLKTRTTNTTATAAPKPTGTGSFLGVVSCKRSVEAIPAAPARSSSRNFQKEKEKEKKAKEGEERRVSHRRTFEWLKQLSHATLAADE
        DLKTRTTNT ATA PKPTGTGSFLGVVSCKRSVEAIPAAP RSSSRNFQKEKEKEKKAKEGEERRVSHRRTFEWLKQLSHATL ADE
Subjt:  DLKTRTTNTTATAAPKPTGTGSFLGVVSCKRSVEAIPAAPARSSSRNFQKEKEKEKKAKEGEERRVSHRRTFEWLKQLSHATLAADE

A0A6J1G7B7 uncharacterized protein LOC1114514082.6e-5754.82Show/hide
Query:  NEEEEEELSFCDLPVKEEQNPIITAVQIADQDDDDEGFDFKH--RPAVAPAMCAAEEVFFQGHMLPFGRLSFSSSENGLNNNNNNLGRNLWFRSESMDHN
        +E+EEE LS CDLPVKE+Q P      + +++   E FDF H   P   P MCAA+++FFQGH+LP  RLS SS     N +N+   + L  RSESMDHN
Subjt:  NEEEEEELSFCDLPVKEEQNPIITAVQIADQDDDDEGFDFKH--RPAVAPAMCAAEEVFFQGHMLPFGRLSFSSSENGLNNNNNNLGRNLWFRSESMDHN

Query:  VLRF--SSSSRSSSTSTSHYSRSLPIFSALNNSISIPTTSSSKARAEKNNVFHSHPSPTPQIRSFSTSSHRSRCRSSSRSRWHFFRLGLLRTPGVMELHD
        +LRF   SSS SS++S SHYSR     S  NNSISIPT S  +    +NNVFHSHPSPTPQIRSFST   RSR RSS  SRW FFR+GLLRTPG MELHD
Subjt:  VLRF--SSSSRSSSTSTSHYSRSLPIFSALNNSISIPTTSSSKARAEKNNVFHSHPSPTPQIRSFSTSSHRSRCRSSSRSRWHFFRLGLLRTPGVMELHD

Query:  LKTRTTNTTATAAPKPTGTGSFLGVVSCKRSVEAIPAA-PARSSSRNFQKEKEKEKKA----------------KEGEE-RRVSHRRTFEWLKQLSHATL
        LKTRTT + A        T SFLGVVSCK+SVE IPAA   ++ S N  K++ ++ K                 KE E+  R+SHRRTFEWLKQLSHAT 
Subjt:  LKTRTTNTTATAAPKPTGTGSFLGVVSCKRSVEAIPAA-PARSSSRNFQKEKEKEKKA----------------KEGEE-RRVSHRRTFEWLKQLSHATL

Query:  A
        A
Subjt:  A

A0A6J1KZ54 uncharacterized protein LOC1114995746.5e-5654.33Show/hide
Query:  EEEEEELSFCDLPVKEEQNPIITAVQIADQDDDDEGFDFKHRPAVAPAMCAAEEVFFQGHMLPFGRLSFSSSENGLNNNNNNLGRNLWFRSESMDHNVLR
        EEEEE LS CDLPVKE+Q P+     + D++   E FDF H P   P MCAA+++FFQGH+LP  RLS SS     N +N+   + L  RSESMDHN+LR
Subjt:  EEEEEELSFCDLPVKEEQNPIITAVQIADQDDDDEGFDFKHRPAVAPAMCAAEEVFFQGHMLPFGRLSFSSSENGLNNNNNNLGRNLWFRSESMDHNVLR

Query:  F-SSSSRSSSTSTSHYSRSLPIFSALNNSISIPTTSSSKARAEKNNVFHSHPSPTPQIRSFSTSSHRSRCRSSSRSRWHFFRLGLLRTPGVMELHDLKTR
        F + SS SS++S SHYSR     S  NNSISIPT S  +    +NNVFHSHPSPTPQIRSFST   R      S SRW FFR+GLLRTPG MELHDLKTR
Subjt:  F-SSSSRSSSTSTSHYSRSLPIFSALNNSISIPTTSSSKARAEKNNVFHSHPSPTPQIRSFSTSSHRSRCRSSSRSRWHFFRLGLLRTPGVMELHDLKTR

Query:  TTNTTATAAPKPTG---TGSFLGVVSCKRSVEAIPAA-PARSSSRNFQKEKEKEKKA----------------KEGEE-RRVSHRRTFEWLKQLSHATLA
        TTN  A  A    G    G+FLGVVSCK+SV+ IPAA   ++ S N  K++ ++ K                 KE E+  R+SHRRTFEWLKQLSHAT A
Subjt:  TTNTTATAAPKPTG---TGSFLGVVSCKRSVEAIPAA-PARSSSRNFQKEKEKEKKA----------------KEGEE-RRVSHRRTFEWLKQLSHATLA

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT5G67350.1 unknown protein3.4e-0929.41Show/hide
Query:  EEEEEELSFCDLPVKEEQNPIITAVQIADQDDDDEGFDF---------KHRPAVAPAMCAAEEVFFQGHMLPFGRLSFSSSENGLNNNNNNLGRNLWFRS
        EEEEE LS CDLP ++ +   + ++   + ++ D GF+F               AP M  A+E+FF+G +LP       S + GLN         L  RS
Subjt:  EEEEEELSFCDLPVKEEQNPIITAVQIADQDDDDEGFDF---------KHRPAVAPAMCAAEEVFFQGHMLPFGRLSFSSSENGLNNNNNNLGRNLWFRS

Query:  ESMDHNVLRFSSSSRSSSTSTSHYSRSLPIFSALNNSISIPTTSSSKARAEKNNVFHSHPSPTPQIRSFSTSSHR--SRCRSSSRSRWHFFRLGLLRTP-
        ES++        S R                                 + + N + +S PSP PQIR  S+ + R  S     S S W F RLGL+RTP 
Subjt:  ESMDHNVLRFSSSSRSSSTSTSHYSRSLPIFSALNNSISIPTTSSSKARAEKNNVFHSHPSPTPQIRSFSTSSHR--SRCRSSSRSRWHFFRLGLLRTP-

Query:  -------GVMELHDLKTRTTNTTATAA-PKPTGTG---------SFLGVVSCKRSVEA-IPAAPAR---SSSRNFQKEKEKEKK-AKEGEERRVSHRRTF
               G  +L   +  + ++T+T++  K  G+G         SF+    CK SV      AP +   SS    +K++  EKK AK+ E+  ++ +RTF
Subjt:  -------GVMELHDLKTRTTNTTATAA-PKPTGTG---------SFLGVVSCKRSVEA-IPAAPAR---SSSRNFQKEKEKEKK-AKEGEERRVSHRRTF

Query:  EWLKQL
        EWL Q+
Subjt:  EWLKQL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
GAGATGGCCAGAAGCTTTAACGAAGAAGAAGAAGAAGAATTATCTTTCTGCGACCTTCCTGTAAAAGAGGAGCAAAATCCCATCATTACAGCCGTCCAAATAGCTGATCA
AGATGATGATGATGAAGGGTTCGATTTCAAGCACCGGCCGGCGGTGGCGCCGGCGATGTGCGCGGCGGAGGAGGTGTTCTTCCAAGGCCATATGCTCCCATTTGGACGAC
TCTCCTTCAGTAGCTCTGAAAATGGTTTGAATAATAATAATAATAATTTAGGGAGAAATTTGTGGTTCAGATCGGAGTCTATGGATCATAATGTGTTGAGGTTCAGCAGC
AGCAGCCGGAGTAGCAGCACTTCTACAAGCCATTATTCCAGGAGCCTTCCAATATTTTCTGCTCTCAACAACTCAATTTCCATTCCAACGACGAGCAGCTCAAAAGCAAG
AGCTGAGAAGAACAACGTTTTCCACTCCCACCCAAGTCCCACGCCCCAAATCAGATCCTTCTCAACTTCCAGTCACCGGAGCCGGTGTCGAAGCTCCTCCCGCTCCCGGT
GGCACTTTTTCCGGCTCGGTCTTCTCCGAACGCCAGGAGTAATGGAACTTCACGATCTCAAAACTCGCACCACCAACACCACGGCCACGGCGGCGCCGAAACCAACCGGT
ACCGGGTCGTTTCTGGGTGTGGTGAGCTGCAAAAGATCCGTCGAGGCAATCCCGGCGGCGCCTGCAAGGAGCAGCAGTCGCAATTTTCAAAAGGAAAAGGAAAAGGAAAA
AAAGGCAAAGGAAGGAGAGGAGAGGAGGGTGTCACATCGTAGAACATTTGAATGGCTAAAGCAGCTCTCGCATGCAACCTTGGCTGCTGACGAG
mRNA sequenceShow/hide mRNA sequence
GAGATGGCCAGAAGCTTTAACGAAGAAGAAGAAGAAGAATTATCTTTCTGCGACCTTCCTGTAAAAGAGGAGCAAAATCCCATCATTACAGCCGTCCAAATAGCTGATCA
AGATGATGATGATGAAGGGTTCGATTTCAAGCACCGGCCGGCGGTGGCGCCGGCGATGTGCGCGGCGGAGGAGGTGTTCTTCCAAGGCCATATGCTCCCATTTGGACGAC
TCTCCTTCAGTAGCTCTGAAAATGGTTTGAATAATAATAATAATAATTTAGGGAGAAATTTGTGGTTCAGATCGGAGTCTATGGATCATAATGTGTTGAGGTTCAGCAGC
AGCAGCCGGAGTAGCAGCACTTCTACAAGCCATTATTCCAGGAGCCTTCCAATATTTTCTGCTCTCAACAACTCAATTTCCATTCCAACGACGAGCAGCTCAAAAGCAAG
AGCTGAGAAGAACAACGTTTTCCACTCCCACCCAAGTCCCACGCCCCAAATCAGATCCTTCTCAACTTCCAGTCACCGGAGCCGGTGTCGAAGCTCCTCCCGCTCCCGGT
GGCACTTTTTCCGGCTCGGTCTTCTCCGAACGCCAGGAGTAATGGAACTTCACGATCTCAAAACTCGCACCACCAACACCACGGCCACGGCGGCGCCGAAACCAACCGGT
ACCGGGTCGTTTCTGGGTGTGGTGAGCTGCAAAAGATCCGTCGAGGCAATCCCGGCGGCGCCTGCAAGGAGCAGCAGTCGCAATTTTCAAAAGGAAAAGGAAAAGGAAAA
AAAGGCAAAGGAAGGAGAGGAGAGGAGGGTGTCACATCGTAGAACATTTGAATGGCTAAAGCAGCTCTCGCATGCAACCTTGGCTGCTGACGAG
Protein sequenceShow/hide protein sequence
EMARSFNEEEEEELSFCDLPVKEEQNPIITAVQIADQDDDDEGFDFKHRPAVAPAMCAAEEVFFQGHMLPFGRLSFSSSENGLNNNNNNLGRNLWFRSESMDHNVLRFSS
SSRSSSTSTSHYSRSLPIFSALNNSISIPTTSSSKARAEKNNVFHSHPSPTPQIRSFSTSSHRSRCRSSSRSRWHFFRLGLLRTPGVMELHDLKTRTTNTTATAAPKPTG
TGSFLGVVSCKRSVEAIPAAPARSSSRNFQKEKEKEKKAKEGEERRVSHRRTFEWLKQLSHATLAADE