CuGenDBv2

ClCG11G015880 (gene) of Watermelon (Charleston Gray) v2.5 genome

Gene ID: ClCG11G015880
Organism: Citrullus lanatus subsp. vulgaris cv. Charleston Gray (Watermelon (Charleston Gray) v2.5)
Description: protein HAPLESS 2 isoform X1
Genome location: CG_Chr11:29183401..29187212
RNA-Seq Expression: ClCG11G015880
Synteny: ClCG11G015880
Gene Ontology terms: GO:0016021 - integral component of membrane (cellular component)
InterPro domains: NA


Homology
GenBank top hits (e value, % identity, alignment)
XP_004142706.1 protein HAPLESS 2 isoform X1 [Cucumis sativus] | e value: 1.5e-110 | % identity: 86.97
Query:  MGNVASSLASGIFSAIGKVFGSPLDFLSGRSCSSVCGSTWDFICYIENFCVANLLKMGIILILSYFVLLLLYLLHKIGIFGCIGRGLCRMIWTCLSSYFY
        MGNVASSLAS +FSAIGK+FGSPLDFLSGRSCSSVCGSTWDFICYIENFCVANLLKMG++ ILSYFVLLLLYLLHKIGIF CIGRGLCRMIWTCL+SYFY
Subjt:  MGNVASSLASGIFSAIGKVFGSPLDFLSGRSCSSVCGSTWDFICYIENFCVANLLKMGIILILSYFVLLLLYLLHKIGIFGCIGRGLCRMIWTCLSSYFY

Query:  AWEYCCTFMCIKLANVKRTRRRHFRRRDLEEEYESEEGKCQHGSTSDSSNVPDHVESRSSRRVSRRWRRNHKRSQMRKSLRPKGHGIRVRSGRVLVYGKH
        AWEYCC FMCIKLA+VKRTRRRH RRRD+EEE+E EEGKC+H STSDS+NV +HVES+SSRRVS+RWRRNH+ SQ RKSLRPKGHG+RVRSGRVLVYGKH
Subjt:  AWEYCCTFMCIKLANVKRTRRRHFRRRDLEEEYESEEGKCQHGSTSDSSNVPDHVESRSSRRVSRRWRRNHKRSQMRKSLRPKGHGIRVRSGRVLVYGKH

Query:  KRKCVEVGNHLNEKHSFGMYGSSKCVHKERKYRRGRQR
        +RK VEVGNHLNE  SFGMYGSSK VHKERKYRRGR R
Subjt:  KRKCVEVGNHLNEKHSFGMYGSSKCVHKERKYRRGRQR

XP_008444189.1 PREDICTED: protein HAPLESS 2 isoform X1 [Cucumis melo] | e value: 9.4e-113 | % identity: 89.08
Query:  MGNVASSLASGIFSAIGKVFGSPLDFLSGRSCSSVCGSTWDFICYIENFCVANLLKMGIILILSYFVLLLLYLLHKIGIFGCIGRGLCRMIWTCLSSYFY
        MGNVASSLAS +FSAIGK+FGSPLDFLSGRSCSSVCGSTWDFICYIENFCVANL+KMG++ ILSYFVLLLLYLLHKIGIFGCIGRGLCRMIWTCL+SYFY
Subjt:  MGNVASSLASGIFSAIGKVFGSPLDFLSGRSCSSVCGSTWDFICYIENFCVANLLKMGIILILSYFVLLLLYLLHKIGIFGCIGRGLCRMIWTCLSSYFY

Query:  AWEYCCTFMCIKLANVKRTRRRHFRRRDLEEEYESEEGKCQHGSTSDSSNVPDHVESRSSRRVSRRWRRNHKRSQMRKSLRPKGHGIRVRSGRVLVYGKH
        AWEYCC+FMCIKLA+VKRTRRRH RRRDLEEE+ESEEGKCQH STSDSSNV +HVESRSSR  SRRWRRNHK SQ RKSLRPKGHG+RVRSGRVLVYGKH
Subjt:  AWEYCCTFMCIKLANVKRTRRRHFRRRDLEEEYESEEGKCQHGSTSDSSNVPDHVESRSSRRVSRRWRRNHKRSQMRKSLRPKGHGIRVRSGRVLVYGKH

Query:  KRKCVEVGNHLNEKHSFGMYGSSKCVHKERKYRRGRQR
        +RK VEVGNH NE  SFGMYGSSK VHKERKYRRGRQR
Subjt:  KRKCVEVGNHLNEKHSFGMYGSSKCVHKERKYRRGRQR

XP_022154011.1 uncharacterized protein LOC111021352 [Momordica charantia] | e value: 1.0e-95 | % identity: 76.15
Query:  VMGNVASSLASGIFSAIGKVFGSPLDFLSGRSCSSVCGSTWDFICYIENFCVANLLKMGIILILSYFVLLLLYLLHKIGIFGCIGRGLCRMIWTCLSSYF
        +MGNVASS+ASG FSA+GK+F SPLDFLSG+SCSSVCGSTWDFICYIENFCVANLLK+G++LILS FV+LLLYLLHKIGIFGCI RGLCRM WTC++SYF
Subjt:  VMGNVASSLASGIFSAIGKVFGSPLDFLSGRSCSSVCGSTWDFICYIENFCVANLLKMGIILILSYFVLLLLYLLHKIGIFGCIGRGLCRMIWTCLSSYF

Query:  YAWEYCCTFMCIKLANVKRTRRRHFRRRDLEEEYESEEGKCQHGSTSDSSNVPDHVESRSSRRVSRRWRRNHKRSQMRKSLRPKGHGIRVRSGRVLVYGK
        YAW+YCCTFMCIKL +VKRTRRR  RRRDLEEE+ESE GK ++GS+SDSS+VP+ +E RSS+R SRRWR NH+ SQMRK+LRPK  GIRVRSGR LVYGK
Subjt:  YAWEYCCTFMCIKLANVKRTRRRHFRRRDLEEEYESEEGKCQHGSTSDSSNVPDHVESRSSRRVSRRWRRNHKRSQMRKSLRPKGHGIRVRSGRVLVYGK

Query:  HKRKCVEVGNHLNEKHSFGMYGSSKCVHKERKYRRGRQR
        H+RK  EV N L E HS G +GSSK VH+E +Y+RGRQ+
Subjt:  HKRKCVEVGNHLNEKHSFGMYGSSKCVHKERKYRRGRQR

XP_038897053.1 uncharacterized protein LOC120085227 isoform X1 [Benincasa hispida] | e value: 4.5e-115 | % identity: 89.92
Query:  MGNVASSLASGIFSAIGKVFGSPLDFLSGRSCSSVCGSTWDFICYIENFCVANLLKMGIILILSYFVLLLLYLLHKIGIFGCIGRGLCRMIWTCLSSYFY
        MGNVASSL SGIFSAIGKVFGSPLDFLSGRSCSSVCGSTWDFICYIENFCV+NLLKMG++LILSYFVLL L LLHKIGIFGCIGRGLC+MIWTCL+SYFY
Subjt:  MGNVASSLASGIFSAIGKVFGSPLDFLSGRSCSSVCGSTWDFICYIENFCVANLLKMGIILILSYFVLLLLYLLHKIGIFGCIGRGLCRMIWTCLSSYFY

Query:  AWEYCCTFMCIKLANVKRTRRRHFRRRDLEEEYESEEGKCQHGSTSDSSNVPDHVESRSSRRVSRRWRRNHKRSQMRKSLRPKGHGIRVRSGRVLVYGKH
        AWEYCCTFMCIKLA+VKRTRRRH RRRDLEEE+ESEEGKCQHGSTSDSSNVP+HVESRSSRR  RRWRRNH+ S+MRKSLRP+GHGIRVRSGRVLVYGKH
Subjt:  AWEYCCTFMCIKLANVKRTRRRHFRRRDLEEEYESEEGKCQHGSTSDSSNVPDHVESRSSRRVSRRWRRNHKRSQMRKSLRPKGHGIRVRSGRVLVYGKH

Query:  KRKCVEVGNHLNEKHSFGMYGSSKCVHKERKYRRGRQR
        +RK VEVGNHLNE HSFGMYGSSK VHKERKYRR  QR
Subjt:  KRKCVEVGNHLNEKHSFGMYGSSKCVHKERKYRRGRQR

XP_038897054.1 uncharacterized protein LOC120085227 isoform X2 [Benincasa hispida] | e value: 2.3e-95 | % identity: 78.99
Query:  MGNVASSLASGIFSAIGKVFGSPLDFLSGRSCSSVCGSTWDFICYIENFCVANLLKMGIILILSYFVLLLLYLLHKIGIFGCIGRGLCRMIWTCLSSYFY
        MGNVASSL SGIFSAIGKVFGSPLDFLSGRSCSSVCGSTWDFICYIENFCV+NLLKMG++LILSYF                             +SYFY
Subjt:  MGNVASSLASGIFSAIGKVFGSPLDFLSGRSCSSVCGSTWDFICYIENFCVANLLKMGIILILSYFVLLLLYLLHKIGIFGCIGRGLCRMIWTCLSSYFY

Query:  AWEYCCTFMCIKLANVKRTRRRHFRRRDLEEEYESEEGKCQHGSTSDSSNVPDHVESRSSRRVSRRWRRNHKRSQMRKSLRPKGHGIRVRSGRVLVYGKH
        AWEYCCTFMCIKLA+VKRTRRRH RRRDLEEE+ESEEGKCQHGSTSDSSNVP+HVESRSSRR  RRWRRNH+ S+MRKSLRP+GHGIRVRSGRVLVYGKH
Subjt:  AWEYCCTFMCIKLANVKRTRRRHFRRRDLEEEYESEEGKCQHGSTSDSSNVPDHVESRSSRRVSRRWRRNHKRSQMRKSLRPKGHGIRVRSGRVLVYGKH

Query:  KRKCVEVGNHLNEKHSFGMYGSSKCVHKERKYRRGRQR
        +RK VEVGNHLNE HSFGMYGSSK VHKERKYRR  QR
Subjt:  KRKCVEVGNHLNEKHSFGMYGSSKCVHKERKYRRGRQR

TrEMBL top hits (e value, % identity, alignment)
A0A0A0L1V0 Uncharacterized protein | e value: 7.2e-111 | % identity: 86.97
Query:  MGNVASSLASGIFSAIGKVFGSPLDFLSGRSCSSVCGSTWDFICYIENFCVANLLKMGIILILSYFVLLLLYLLHKIGIFGCIGRGLCRMIWTCLSSYFY
        MGNVASSLAS +FSAIGK+FGSPLDFLSGRSCSSVCGSTWDFICYIENFCVANLLKMG++ ILSYFVLLLLYLLHKIGIF CIGRGLCRMIWTCL+SYFY
Subjt:  MGNVASSLASGIFSAIGKVFGSPLDFLSGRSCSSVCGSTWDFICYIENFCVANLLKMGIILILSYFVLLLLYLLHKIGIFGCIGRGLCRMIWTCLSSYFY

Query:  AWEYCCTFMCIKLANVKRTRRRHFRRRDLEEEYESEEGKCQHGSTSDSSNVPDHVESRSSRRVSRRWRRNHKRSQMRKSLRPKGHGIRVRSGRVLVYGKH
        AWEYCC FMCIKLA+VKRTRRRH RRRD+EEE+E EEGKC+H STSDS+NV +HVES+SSRRVS+RWRRNH+ SQ RKSLRPKGHG+RVRSGRVLVYGKH
Subjt:  AWEYCCTFMCIKLANVKRTRRRHFRRRDLEEEYESEEGKCQHGSTSDSSNVPDHVESRSSRRVSRRWRRNHKRSQMRKSLRPKGHGIRVRSGRVLVYGKH

Query:  KRKCVEVGNHLNEKHSFGMYGSSKCVHKERKYRRGRQR
        +RK VEVGNHLNE  SFGMYGSSK VHKERKYRRGR R
Subjt:  KRKCVEVGNHLNEKHSFGMYGSSKCVHKERKYRRGRQR

A0A1S4DVV8 protein HAPLESS 2 isoform X1 | e value: 4.5e-113 | % identity: 89.08
Query:  MGNVASSLASGIFSAIGKVFGSPLDFLSGRSCSSVCGSTWDFICYIENFCVANLLKMGIILILSYFVLLLLYLLHKIGIFGCIGRGLCRMIWTCLSSYFY
        MGNVASSLAS +FSAIGK+FGSPLDFLSGRSCSSVCGSTWDFICYIENFCVANL+KMG++ ILSYFVLLLLYLLHKIGIFGCIGRGLCRMIWTCL+SYFY
Subjt:  MGNVASSLASGIFSAIGKVFGSPLDFLSGRSCSSVCGSTWDFICYIENFCVANLLKMGIILILSYFVLLLLYLLHKIGIFGCIGRGLCRMIWTCLSSYFY

Query:  AWEYCCTFMCIKLANVKRTRRRHFRRRDLEEEYESEEGKCQHGSTSDSSNVPDHVESRSSRRVSRRWRRNHKRSQMRKSLRPKGHGIRVRSGRVLVYGKH
        AWEYCC+FMCIKLA+VKRTRRRH RRRDLEEE+ESEEGKCQH STSDSSNV +HVESRSSR  SRRWRRNHK SQ RKSLRPKGHG+RVRSGRVLVYGKH
Subjt:  AWEYCCTFMCIKLANVKRTRRRHFRRRDLEEEYESEEGKCQHGSTSDSSNVPDHVESRSSRRVSRRWRRNHKRSQMRKSLRPKGHGIRVRSGRVLVYGKH

Query:  KRKCVEVGNHLNEKHSFGMYGSSKCVHKERKYRRGRQR
        +RK VEVGNH NE  SFGMYGSSK VHKERKYRRGRQR
Subjt:  KRKCVEVGNHLNEKHSFGMYGSSKCVHKERKYRRGRQR

A0A6J1DMF3 uncharacterized protein LOC111021352 | e value: 5.0e-96 | % identity: 76.15
Query:  VMGNVASSLASGIFSAIGKVFGSPLDFLSGRSCSSVCGSTWDFICYIENFCVANLLKMGIILILSYFVLLLLYLLHKIGIFGCIGRGLCRMIWTCLSSYF
        +MGNVASS+ASG FSA+GK+F SPLDFLSG+SCSSVCGSTWDFICYIENFCVANLLK+G++LILS FV+LLLYLLHKIGIFGCI RGLCRM WTC++SYF
Subjt:  VMGNVASSLASGIFSAIGKVFGSPLDFLSGRSCSSVCGSTWDFICYIENFCVANLLKMGIILILSYFVLLLLYLLHKIGIFGCIGRGLCRMIWTCLSSYF

Query:  YAWEYCCTFMCIKLANVKRTRRRHFRRRDLEEEYESEEGKCQHGSTSDSSNVPDHVESRSSRRVSRRWRRNHKRSQMRKSLRPKGHGIRVRSGRVLVYGK
        YAW+YCCTFMCIKL +VKRTRRR  RRRDLEEE+ESE GK ++GS+SDSS+VP+ +E RSS+R SRRWR NH+ SQMRK+LRPK  GIRVRSGR LVYGK
Subjt:  YAWEYCCTFMCIKLANVKRTRRRHFRRRDLEEEYESEEGKCQHGSTSDSSNVPDHVESRSSRRVSRRWRRNHKRSQMRKSLRPKGHGIRVRSGRVLVYGK

Query:  HKRKCVEVGNHLNEKHSFGMYGSSKCVHKERKYRRGRQR
        H+RK  EV N L E HS G +GSSK VH+E +Y+RGRQ+
Subjt:  HKRKCVEVGNHLNEKHSFGMYGSSKCVHKERKYRRGRQR

A0A6J1FIX0 uncharacterized protein LOC111445893 | e value: 3.7e-83 | % identity: 73.53
Query:  MGNVASSLASGIFSAIGKVFGSPLDFLSGRSCSSVCGSTWDFICYIENFCVANLLKMGIILILSYFVLLLLYLLHKIGIFGCIGRGLCRMIWTCLSSYFY
        MGNVASSLASG+F A+ KVFGSPLDFLSG+SCSSVCGSTWDFICYIENFCVANLLKMG+++IL+YFVLLLLYL HKIGIFGCIGRGLCRMIWTCL+SY +
Subjt:  MGNVASSLASGIFSAIGKVFGSPLDFLSGRSCSSVCGSTWDFICYIENFCVANLLKMGIILILSYFVLLLLYLLHKIGIFGCIGRGLCRMIWTCLSSYFY

Query:  AWEYCCTFMCIKLANVKRTRRRHFRRRDLEEEYESEE-GKCQHGSTSDSSNVPDHVESRSSRRVSRRWRRNHKRSQMRKSLRPKGHGIRVRSGRVLVYGK
        AWEYCCTFMCIKLA+VKRTRRRH RRRDLEEE ESEE  K ++GS+SDSSN    +ESR S+RVSR+ RR+H+ SQ  K+LRP  HGIRVRSGRVLVY K
Subjt:  AWEYCCTFMCIKLANVKRTRRRHFRRRDLEEEYESEE-GKCQHGSTSDSSNVPDHVESRSSRRVSRRWRRNHKRSQMRKSLRPKGHGIRVRSGRVLVYGK

Query:  HKRKCVEVGNHLNEKHSFGMYGSSKCVHKERKYRRGRQ
        H                    GSSK V KER YRRGRQ
Subjt:  HKRKCVEVGNHLNEKHSFGMYGSSKCVHKERKYRRGRQ

A0A6J1IVE5 uncharacterized protein LOC111480312 | e value: 1.6e-81 | % identity: 71.97
Query:  MGNVASSLASGIFSAIGKVFGSPLDFLSGRSCSSVCGSTWDFICYIENFCVANLLKMGIILILSYFVLLLLYLLHKIGIFGCIGRGLCRMIWTCLSSYFY
        MGNVASSLASG+F A+ KVFGSPLDFLSG+SCSSVCGSTWDFICYIENFCVANLLKMG+++IL+YFVLLLLYL HKIGIFGCIGRG CRMIWTCL+SY +
Subjt:  MGNVASSLASGIFSAIGKVFGSPLDFLSGRSCSSVCGSTWDFICYIENFCVANLLKMGIILILSYFVLLLLYLLHKIGIFGCIGRGLCRMIWTCLSSYFY

Query:  AWEYCCTFMCIKLANVKRTRRRHFRRRDLEEEYES-EEGKCQHGSTSDSSNVPDHVESRSSRRVSRRWRRNHKRSQMRKSLRPKGHGIRVRSGRVLVYGK
        AWEYCCTFMC+KLA+VKRTRR H RRRDLEEE ES EE K ++GS+ DSSN  + +ESR S RVSR+ RR+H+ SQ  K+LRP  HGIRVRSGRVLVY K
Subjt:  AWEYCCTFMCIKLANVKRTRRRHFRRRDLEEEYES-EEGKCQHGSTSDSSNVPDHVESRSSRRVSRRWRRNHKRSQMRKSLRPKGHGIRVRSGRVLVYGK

Query:  HKRKCVEVGNHLNEKHSFGMYGSSKCVHKERKYRRGRQR
        H                    GSSK V K+RKYRRGRQR
Subjt:  HKRKCVEVGNHLNEKHSFGMYGSSKCVHKERKYRRGRQR

SwissProt top hits (e value, % identity, alignment)
F4JP36 Protein HAPLESS 2 | e value: 5.7e-04 | % identity: 36.51
Query:  KVFGSPLDFLSGRSCSSVCGSTWDFICYIENFCVANLLKMGIILILSYFVLLLLYLLHKIGIF
        K+    +DF++G +C + C S +DF C+I+  C++ ++  G++L L     LLL+LLH+ G+F
Subjt:  KVFGSPLDFLSGRSCSSVCGSTWDFICYIENFCVANLLKMGIILILSYFVLLLLYLLHKIGIF
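
The % identity column can be cross-checked against the middle line of an alignment, where a letter marks an identical residue, `+` a conservative substitution, and a space a mismatch or gap. The following is a minimal Python sketch, assuming that convention and assuming the middle line has kept its original spacing; the function name `percent_identity` is hypothetical and not part of CuGenDB.

```python
def percent_identity(midline: str) -> float:
    """Percent of alignment columns whose midline character is a letter.

    Assumes a BLAST-style midline: letters are identical residues,
    '+' marks similar residues, spaces mark mismatches or gaps.
    """
    if not midline:
        raise ValueError("midline must not be empty")
    identical = sum(1 for ch in midline if ch.isalpha())
    return round(100.0 * identical / len(midline), 2)


# Middle line of the F4JP36 (SwissProt) alignment, copied from the record.
swissprot_midline = (
    "K+    +DF++G +C + C S +DF C+I+  C++ ++  G++L L     LLL+LLH+ G+F"
)
print(percent_identity(swissprot_midline))
```

The record reports 36.51 for this hit, which corresponds to 23 identical residues over 63 aligned columns. Multi-row alignments would need their middle lines concatenated first, and any trailing spaces lost during copying would shrink the denominator.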

Arabidopsis top hits (e value, % identity, alignment)
AT1G21722.1 unknown protein | e value: 3.3e-39 | % identity: 48.15
Query:  MGNVASSLASGIFSAIGKVFGSPLDFLSGRSCSSVCGSTWDFICYIENFCVANLLKMGIILILSYFVLLLLYLLHKIGIFGCIGRGLCRMIWTCLSSYFY
        MGNV  S  +G   +IG  FGSPLDFLSG+SCSSVC S WDFICY+ENFCVANL K  +ILILSYF L  +Y+L+K+G + CI  G  +++W  +S +FY
Subjt:  MGNVASSLASGIFSAIGKVFGSPLDFLSGRSCSSVCGSTWDFICYIENFCVANLLKMGIILILSYFVLLLLYLLHKIGIFGCIGRGLCRMIWTCLSSYFY

Query:  AWEYCCTFMCIKLANVKRTRRRHFRRRDLEEEYESEEGKCQHGSTSDSSNVPDHVESRSSRRVSRRWRRNHKRSQMRKSLRPKGHGIRV
           YCC+F C  L + KR RRR    R +EE+Y+            D+S+  D V+   S    R  R   K  ++RKSLRP+ H +RV
Subjt:  AWEYCCTFMCIKLANVKRTRRRHFRRRDLEEEYESEEGKCQHGSTSDSSNVPDHVESRSSRRVSRRWRRNHKRSQMRKSLRPKGHGIRV

AT1G78922.1 unknown protein | e value: 7.4e-15 | % identity: 29.81
Query:  MGNVASSLASGIFSAIGKVFGSPLDFLSGRSCSSVCGSTWDFICYIENFCVANLLKMGIILILSYFVLLLLYLLHKIGIFGCIGRGLCRMIWTCLSSYFY
        MGN     ++ I   IG +F +PL    GRSC  VC   WD  C+IE+FC+ ++ K+ ++  L + +L+ + LL K+GI  C+ + +C+M     ++Y++
Subjt:  MGNVASSLASGIFSAIGKVFGSPLDFLSGRSCSSVCGSTWDFICYIENFCVANLLKMGIILILSYFVLLLLYLLHKIGIFGCIGRGLCRMIWTCLSSYFY

Query:  AWEYCCTFMCIKLANVKRTRRRHFRRRDLEE---EYESEEGKCQHGSTSDSSNVPDHVESRS-SRRVSRRWRRNHKRSQM--RKSLRPKGHGIRVRSGRV
              + +C  L N+ R  RR  R  D+E    +Y S++      S+SDS +  D++  +   RR+  +   +H  S    R+ +R     + VR G  
Subjt:  AWEYCCTFMCIKLANVKRTRRRHFRRRDLEE---EYESEEGKCQHGSTSDSSNVPDHVESRS-SRRVSRRWRRNHKRSQM--RKSLRPKGHGIRVRSGRV

Query:  LVYGKHKR
           GK +R
Subjt:  LVYGKHKR

AT4G11720.1 hapless 2 | e value: 4.0e-05 | % identity: 36.51
Query:  KVFGSPLDFLSGRSCSSVCGSTWDFICYIENFCVANLLKMGIILILSYFVLLLLYLLHKIGIF
        K+    +DF++G +C + C S +DF C+I+  C++ ++  G++L L     LLL+LLH+ G+F
Subjt:  KVFGSPLDFLSGRSCSSVCGSTWDFICYIENFCVANLLKMGIILILSYFVLLLLYLLHKIGIF


Sequences
CDS sequence
ATGGGCCGACAGTCCAAATCCTCCAAATTTAACACTCAAAACTTCTTCTGCTCTGCCCGACGGTTACCCGCCATTCTTCTTCCTTCTGCTGTTCTTCATTTCCCGGTGCT
CTTCAACCATTATAATCCATGTCTTGGGTTTTCACTGTTTCGTCTTATTTCGCAGGAGTGTTTTGTAATGGGTAATGTAGCCAGTTCACTGGCCTCTGGTATTTTTTCGG
CCATTGGCAAAGTCTTTGGATCCCCACTTGATTTTCTTTCTGGAAGATCCTGCAGTTCAGTTTGTGGATCAACATGGGATTTCATATGCTACATAGAAAACTTCTGTGTT
GCCAATTTGCTAAAGATGGGAATAATCTTGATCCTTTCATACTTTGTTCTTTTGCTCCTGTATTTATTACATAAAATTGGCATCTTTGGATGTATTGGTCGGGGGCTCTG
CAGAATGATATGGACTTGTTTATCTTCTTATTTCTATGCATGGGAGTACTGCTGCACTTTCATGTGTATCAAGCTTGCCAATGTCAAAAGAACAAGAAGACGGCACTTTA
GGAGAAGAGACCTTGAAGAAGAGTATGAAAGCGAAGAAGGAAAATGTCAGCATGGGTCAACAAGTGACTCGAGTAATGTCCCCGACCATGTCGAGTCGAGAAGTAGTAGA
CGGGTGTCTCGTCGATGGAGGAGGAACCACAAACGTTCTCAAATGAGAAAATCGTTGAGGCCGAAAGGTCATGGAATTCGAGTAAGGAGTGGTAGAGTGTTGGTCTATGG
TAAGCATAAAAGAAAATGTGTTGAGGTTGGGAATCATTTGAATGAGAAACATAGCTTTGGGATGTATGGATCATCCAAATGTGTGCATAAAGAAAGGAAATATAGAAGAG
GAAGGCAAAGATGA
mRNA sequence
ATGGGCCGACAGTCCAAATCCTCCAAATTTAACACTCAAAACTTCTTCTGCTCTGCCCGACGGTTACCCGCCATTCTTCTTCCTTCTGCTGTTCTTCATTTCCCGGTGCT
CTTCAACCATTATAATCCATGTCTTGGGTTTTCACTGTTTCGTCTTATTTCGCAGGAGTGTTTTGTAATGGGTAATGTAGCCAGTTCACTGGCCTCTGGTATTTTTTCGG
CCATTGGCAAAGTCTTTGGATCCCCACTTGATTTTCTTTCTGGAAGATCCTGCAGTTCAGTTTGTGGATCAACATGGGATTTCATATGCTACATAGAAAACTTCTGTGTT
GCCAATTTGCTAAAGATGGGAATAATCTTGATCCTTTCATACTTTGTTCTTTTGCTCCTGTATTTATTACATAAAATTGGCATCTTTGGATGTATTGGTCGGGGGCTCTG
CAGAATGATATGGACTTGTTTATCTTCTTATTTCTATGCATGGGAGTACTGCTGCACTTTCATGTGTATCAAGCTTGCCAATGTCAAAAGAACAAGAAGACGGCACTTTA
GGAGAAGAGACCTTGAAGAAGAGTATGAAAGCGAAGAAGGAAAATGTCAGCATGGGTCAACAAGTGACTCGAGTAATGTCCCCGACCATGTCGAGTCGAGAAGTAGTAGA
CGGGTGTCTCGTCGATGGAGGAGGAACCACAAACGTTCTCAAATGAGAAAATCGTTGAGGCCGAAAGGTCATGGAATTCGAGTAAGGAGTGGTAGAGTGTTGGTCTATGG
TAAGCATAAAAGAAAATGTGTTGAGGTTGGGAATCATTTGAATGAGAAACATAGCTTTGGGATGTATGGATCATCCAAATGTGTGCATAAAGAAAGGAAATATAGAAGAG
GAAGGCAAAGATGATATCAGCTGGCTGGGATATGAACTCTATATTTGTGCTAAAATTCATTCTAAAAATGTCTTGATCTGTTTGTCTTTTGAGAAGGTTTTTTGCGCTTT
TTTCCCCCAACTCTCATGATTATTTTATCACTGTCCATAGTTTCTT
Protein sequence
MGRQSKSSKFNTQNFFCSARRLPAILLPSAVLHFPVLFNHYNPCLGFSLFRLISQECFVMGNVASSLASGIFSAIGKVFGSPLDFLSGRSCSSVCGSTWDFICYIENFCV
ANLLKMGIILILSYFVLLLLYLLHKIGIFGCIGRGLCRMIWTCLSSYFYAWEYCCTFMCIKLANVKRTRRRHFRRRDLEEEYESEEGKCQHGSTSDSSNVPDHVESRSSR
RVSRRWRRNHKRSQMRKSLRPKGHGIRVRSGRVLVYGKHKRKCVEVGNHLNEKHSFGMYGSSKCVHKERKYRRGRQR
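
The CDS and protein sequences can be checked against each other by translating the CDS in frame. The following is a minimal Python sketch, assuming the standard nuclear genetic code; the `translate` helper is hypothetical, and only the first 51 nucleotides of the CDS are used to keep the example short.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 51 nucleotides of the CDS sequence listed above.
cds_start = "ATGGGCCGACAGTCCAAATCCTCCAAATTTAACACTCAAAACTTCTTCTGC"
print(translate(cds_start))  # MGRQSKSSKFNTQNFFC
```

The output matches the first 17 residues of the protein sequence. To check the whole record, join the CDS lines into one string before calling `translate`.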