; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmaCh01G013790 (gene) of Cucurbita maxima (Rimu) v1.1 genome

Gene IDCmaCh01G013790
OrganismCucurbita maxima Rimu (Cucurbita maxima (Rimu) v1.1)
Descriptionprotein HAPLESS 2 isoform X1
Genome locationCma_Chr01:9775660..9778502
RNA-Seq ExpressionCmaCh01G013790
SyntenyCmaCh01G013790
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6608041.1 Protein HAPLESS 2, partial [Cucurbita argyrosperma subsp. sororia]2.2e-10995.18Show/hide
Query:  MLNLSDAMGNVASSLASGLFLALNKVFGSPLDFLSGKSCSSVCGSTWDFICYIENFCVANLLKMGMVVILAYFVLLLLYLFHKIGIFGCIGRGFCRMIWT
        MLNLSDAMGNVASSLASGLFLALNKVFGSPLDFLSGKSCSSVCGSTWDFICYIENFCVANLLKMGMVVILAYFVLLLLYLFHKIGIFGCIGRG CRMIWT
Subjt:  MLNLSDAMGNVASSLASGLFLALNKVFGSPLDFLSGKSCSSVCGSTWDFICYIENFCVANLLKMGMVVILAYFVLLLLYLFHKIGIFGCIGRGFCRMIWT

Query:  CLASYCHAWEYCCTFMCVKLASVKRT-RRCHRRRDLEEELESEEEAKHRYGSSCDSSNDSEKIESRRSERVSRKRRRSHRGSQTSKTLRPTSHGIRVRSG
        CLASYCHAWEYCCTFMC+KLASVKRT RR HRRRDLEEELESEEEAKHRYGSS DSSNDSEKIESRRS+RVSRKRRRSHRGSQTSKTLRP SHGIRVRSG
Subjt:  CLASYCHAWEYCCTFMCVKLASVKRT-RRCHRRRDLEEELESEEEAKHRYGSSCDSSNDSEKIESRRSERVSRKRRRSHRGSQTSKTLRPTSHGIRVRSG

Query:  RVLVYSKHGSSKFVQKKRKYRRGRQRQR
        RVLVYSKHGSSK VQK+RKYRRGRQ +R
Subjt:  RVLVYSKHGSSKFVQKKRKYRRGRQRQR

KAG7031669.1 Protein HAPLESS 2 [Cucurbita argyrosperma subsp. argyrosperma]4.4e-11095.61Show/hide
Query:  MLNLSDAMGNVASSLASGLFLALNKVFGSPLDFLSGKSCSSVCGSTWDFICYIENFCVANLLKMGMVVILAYFVLLLLYLFHKIGIFGCIGRGFCRMIWT
        MLNLSDAMGNVASSLASGLFLALNKVFGSPLDFLSGKSCSSVCGSTWDFICYIENFCVANLLKMGMVVILAYFVLLLLYLFHKIGIFGCIGRG CRMIWT
Subjt:  MLNLSDAMGNVASSLASGLFLALNKVFGSPLDFLSGKSCSSVCGSTWDFICYIENFCVANLLKMGMVVILAYFVLLLLYLFHKIGIFGCIGRGFCRMIWT

Query:  CLASYCHAWEYCCTFMCVKLASVKRT-RRCHRRRDLEEELESEEEAKHRYGSSCDSSNDSEKIESRRSERVSRKRRRSHRGSQTSKTLRPTSHGIRVRSG
        CLASYCHAWEYCCTFMC+KLASVKRT RR HRRRDLEEELESEEEAKHRYGSS DSSNDSEKIESRRS+RVSRKRRRSHRGSQTSKTLRPTSHGIRVRSG
Subjt:  CLASYCHAWEYCCTFMCVKLASVKRT-RRCHRRRDLEEELESEEEAKHRYGSSCDSSNDSEKIESRRSERVSRKRRRSHRGSQTSKTLRPTSHGIRVRSG

Query:  RVLVYSKHGSSKFVQKKRKYRRGRQRQR
        RVLVYSKHGSSK VQK+RKYRRGRQ +R
Subjt:  RVLVYSKHGSSKFVQKKRKYRRGRQRQR

XP_022940192.1 uncharacterized protein LOC111445893 [Cucurbita moschata]1.1e-10893.83Show/hide
Query:  MLNLSDAMGNVASSLASGLFLALNKVFGSPLDFLSGKSCSSVCGSTWDFICYIENFCVANLLKMGMVVILAYFVLLLLYLFHKIGIFGCIGRGFCRMIWT
        MLNLSDAMGNVASSLASGLFLALNKVFGSPLDFLSGKSCSSVCGSTWDFICYIENFCVANLLKMGMVVILAYFVLLLLYLFHKIGIFGCIGRG CRMIWT
Subjt:  MLNLSDAMGNVASSLASGLFLALNKVFGSPLDFLSGKSCSSVCGSTWDFICYIENFCVANLLKMGMVVILAYFVLLLLYLFHKIGIFGCIGRGFCRMIWT

Query:  CLASYCHAWEYCCTFMCVKLASVKRTRRCHRRRDLEEELESEEEAKHRYGSSCDSSNDSEKIESRRSERVSRKRRRSHRGSQTSKTLRPTSHGIRVRSGR
        CLASYCHAWEYCCTFMC+KLASVKRTRR HRRRDLEEELESEE+AK+RYGSS DSSNDS+KIESRRS+RVSRKRRRSHRGSQTSKTLRP SHGIRVRSGR
Subjt:  CLASYCHAWEYCCTFMCVKLASVKRTRRCHRRRDLEEELESEEEAKHRYGSSCDSSNDSEKIESRRSERVSRKRRRSHRGSQTSKTLRPTSHGIRVRSGR

Query:  VLVYSKHGSSKFVQKKRKYRRGRQRQR
        VLVYSKHGSSK VQK+R YRRGRQ +R
Subjt:  VLVYSKHGSSKFVQKKRKYRRGRQRQR

XP_022981050.1 uncharacterized protein LOC111480312 [Cucurbita maxima]1.2e-120100Show/hide
Query:  MLNLSDAMGNVASSLASGLFLALNKVFGSPLDFLSGKSCSSVCGSTWDFICYIENFCVANLLKMGMVVILAYFVLLLLYLFHKIGIFGCIGRGFCRMIWT
        MLNLSDAMGNVASSLASGLFLALNKVFGSPLDFLSGKSCSSVCGSTWDFICYIENFCVANLLKMGMVVILAYFVLLLLYLFHKIGIFGCIGRGFCRMIWT
Subjt:  MLNLSDAMGNVASSLASGLFLALNKVFGSPLDFLSGKSCSSVCGSTWDFICYIENFCVANLLKMGMVVILAYFVLLLLYLFHKIGIFGCIGRGFCRMIWT

Query:  CLASYCHAWEYCCTFMCVKLASVKRTRRCHRRRDLEEELESEEEAKHRYGSSCDSSNDSEKIESRRSERVSRKRRRSHRGSQTSKTLRPTSHGIRVRSGR
        CLASYCHAWEYCCTFMCVKLASVKRTRRCHRRRDLEEELESEEEAKHRYGSSCDSSNDSEKIESRRSERVSRKRRRSHRGSQTSKTLRPTSHGIRVRSGR
Subjt:  CLASYCHAWEYCCTFMCVKLASVKRTRRCHRRRDLEEELESEEEAKHRYGSSCDSSNDSEKIESRRSERVSRKRRRSHRGSQTSKTLRPTSHGIRVRSGR

Query:  VLVYSKHGSSKFVQKKRKYRRGRQRQRREYEI
        VLVYSKHGSSKFVQKKRKYRRGRQRQRREYEI
Subjt:  VLVYSKHGSSKFVQKKRKYRRGRQRQRREYEI

XP_023523489.1 protein HAPLESS 2 [Cucurbita pepo subsp. pepo]4.4e-11095.54Show/hide
Query:  MLNLSDAMGNVASSLASGLFLALNKVFGSPLDFLSGKSCSSVCGSTWDFICYIENFCVANLLKMGMVVILAYFVLLLLYLFHKIGIFGCIGRGFCRMIWT
        MLNLSDAMGNVASSLASGLFLALNKVFGSPLDFLSGKSCSSVCGSTWDFICYIENFCVANLLKMGMVVILAYFVLLLLYLFHKIGIFGCIGRG CRMIWT
Subjt:  MLNLSDAMGNVASSLASGLFLALNKVFGSPLDFLSGKSCSSVCGSTWDFICYIENFCVANLLKMGMVVILAYFVLLLLYLFHKIGIFGCIGRGFCRMIWT

Query:  CLASYCHAWEYCCTFMCVKLASVKRTRRCHRRRDLEEELESEEEAKHRYGSSCDSSNDSEKIESRRSERVSRKRRRSHRGSQTSKTLRPTSHGIRVRSGR
        CLASYCHAWEYCCTFMC+KLASVKRTRR HRRRDLEEELESEEEAKHRYGSS DSSN+SEKIESRR +RVSRK+RRSHRGSQTSKTLRPT HGIRVRSGR
Subjt:  CLASYCHAWEYCCTFMCVKLASVKRTRRCHRRRDLEEELESEEEAKHRYGSSCDSSNDSEKIESRRSERVSRKRRRSHRGSQTSKTLRPTSHGIRVRSGR

Query:  VLVYSKHGSSKFVQKKRKYRRGRQ
        VLVYSKHGSSKFVQK+RKYRRGRQ
Subjt:  VLVYSKHGSSKFVQKKRKYRRGRQ

TrEMBL top hitse value%identityAlignment
A0A0A0L1V0 Uncharacterized protein5.3e-7769.87Show/hide
Query:  MGNVASSLASGLFLALNKVFGSPLDFLSGKSCSSVCGSTWDFICYIENFCVANLLKMGMVVILAYFVLLLLYLFHKIGIFGCIGRGFCRMIWTCLASYCH
        MGNVASSLAS +F A+ K+FGSPLDFLSG+SCSSVCGSTWDFICYIENFCVANLLKMGMV IL+YFVLLLLYL HKIGIF CIGRG CRMIWTCLASY +
Subjt:  MGNVASSLASGLFLALNKVFGSPLDFLSGKSCSSVCGSTWDFICYIENFCVANLLKMGMVVILAYFVLLLLYLFHKIGIFGCIGRGFCRMIWTCLASYCH

Query:  AWEYCCTFMCVKLASVKRTRRCH-RRRDLEEELESEEEAKHRYGSSCDSSNDSEKIESRRSERVSRKRRRSHRGSQTSKTLRPTSHGIRVRSGRVLVYSK
        AWEYCC FMC+KLASVKRTRR H RRRD+EEE E  EE K R+ S+ DS+N  E +ES+ S RVS++ RR+HR SQ  K+LRP  HG+RVRSGRVLVY K
Subjt:  AWEYCCTFMCVKLASVKRTRRCH-RRRDLEEELESEEEAKHRYGSSCDSSNDSEKIESRRSERVSRKRRRSHRGSQTSKTLRPTSHGIRVRSGRVLVYSK

Query:  H--------------------GSSKFVQKKRKYRRGRQR
        H                    GSSK+V K+RKYRRGR R
Subjt:  H--------------------GSSKFVQKKRKYRRGRQR

A0A1S4DVV8 protein HAPLESS 2 isoform X11.3e-7871.13Show/hide
Query:  MGNVASSLASGLFLALNKVFGSPLDFLSGKSCSSVCGSTWDFICYIENFCVANLLKMGMVVILAYFVLLLLYLFHKIGIFGCIGRGFCRMIWTCLASYCH
        MGNVASSLAS +F A+ K+FGSPLDFLSG+SCSSVCGSTWDFICYIENFCVANL+KMGMV IL+YFVLLLLYL HKIGIFGCIGRG CRMIWTCLASY +
Subjt:  MGNVASSLASGLFLALNKVFGSPLDFLSGKSCSSVCGSTWDFICYIENFCVANLLKMGMVVILAYFVLLLLYLFHKIGIFGCIGRGFCRMIWTCLASYCH

Query:  AWEYCCTFMCVKLASVKRTRRCH-RRRDLEEELESEEEAKHRYGSSCDSSNDSEKIESRRSERVSRKRRRSHRGSQTSKTLRPTSHGIRVRSGRVLVYSK
        AWEYCC+FMC+KLASVKRTRR H RRRDLEEE ES EE K ++ S+ DSSN  E +ESR S   SR+ RR+H+ SQ  K+LRP  HG+RVRSGRVLVY K
Subjt:  AWEYCCTFMCVKLASVKRTRRCH-RRRDLEEELESEEEAKHRYGSSCDSSNDSEKIESRRSERVSRKRRRSHRGSQTSKTLRPTSHGIRVRSGRVLVYSK

Query:  H--------------------GSSKFVQKKRKYRRGRQR
        H                    GSSKFV K+RKYRRGRQR
Subjt:  H--------------------GSSKFVQKKRKYRRGRQR

A0A6J1DMF3 uncharacterized protein LOC1110213524.1e-7768.29Show/hide
Query:  MLNLSDAMGNVASSLASGLFLALNKVFGSPLDFLSGKSCSSVCGSTWDFICYIENFCVANLLKMGMVVILAYFVLLLLYLFHKIGIFGCIGRGFCRMIWT
        M NL D MGNVASS+ASG F A+ K+F SPLDFLSGKSCSSVCGSTWDFICYIENFCVANLLK+GMV+IL+ FV+LLLYL HKIGIFGCI RG CRM WT
Subjt:  MLNLSDAMGNVASSLASGLFLALNKVFGSPLDFLSGKSCSSVCGSTWDFICYIENFCVANLLKMGMVVILAYFVLLLLYLFHKIGIFGCIGRGFCRMIWT

Query:  CLASYCHAWEYCCTFMCVKLASVKRT-RRCHRRRDLEEELESEEEAKHRYGSSCDSSNDSEKIESRRSERVSRKRRRSHRGSQTSKTLRPTSHGIRVRSG
        C+ASY +AW+YCCTFMC+KL SVKRT RR HRRRDLEEE ES E  KHRYGSS DSS+  E+IE R S+R SR+ R +HRGSQ  K LRP S GIRVRSG
Subjt:  CLASYCHAWEYCCTFMCVKLASVKRT-RRCHRRRDLEEELESEEEAKHRYGSSCDSSNDSEKIESRRSERVSRKRRRSHRGSQTSKTLRPTSHGIRVRSG

Query:  RVLVYSK--------------------HGSSKFVQKKRKYRRGRQR
        R LVY K                    HGSSKFV ++ +Y+RGRQ+
Subjt:  RVLVYSK--------------------HGSSKFVQKKRKYRRGRQR

A0A6J1FIX0 uncharacterized protein LOC1114458935.3e-10993.83Show/hide
Query:  MLNLSDAMGNVASSLASGLFLALNKVFGSPLDFLSGKSCSSVCGSTWDFICYIENFCVANLLKMGMVVILAYFVLLLLYLFHKIGIFGCIGRGFCRMIWT
        MLNLSDAMGNVASSLASGLFLALNKVFGSPLDFLSGKSCSSVCGSTWDFICYIENFCVANLLKMGMVVILAYFVLLLLYLFHKIGIFGCIGRG CRMIWT
Subjt:  MLNLSDAMGNVASSLASGLFLALNKVFGSPLDFLSGKSCSSVCGSTWDFICYIENFCVANLLKMGMVVILAYFVLLLLYLFHKIGIFGCIGRGFCRMIWT

Query:  CLASYCHAWEYCCTFMCVKLASVKRTRRCHRRRDLEEELESEEEAKHRYGSSCDSSNDSEKIESRRSERVSRKRRRSHRGSQTSKTLRPTSHGIRVRSGR
        CLASYCHAWEYCCTFMC+KLASVKRTRR HRRRDLEEELESEE+AK+RYGSS DSSNDS+KIESRRS+RVSRKRRRSHRGSQTSKTLRP SHGIRVRSGR
Subjt:  CLASYCHAWEYCCTFMCVKLASVKRTRRCHRRRDLEEELESEEEAKHRYGSSCDSSNDSEKIESRRSERVSRKRRRSHRGSQTSKTLRPTSHGIRVRSGR

Query:  VLVYSKHGSSKFVQKKRKYRRGRQRQR
        VLVYSKHGSSK VQK+R YRRGRQ +R
Subjt:  VLVYSKHGSSKFVQKKRKYRRGRQRQR

A0A6J1IVE5 uncharacterized protein LOC1114803126.0e-121100Show/hide
Query:  MLNLSDAMGNVASSLASGLFLALNKVFGSPLDFLSGKSCSSVCGSTWDFICYIENFCVANLLKMGMVVILAYFVLLLLYLFHKIGIFGCIGRGFCRMIWT
        MLNLSDAMGNVASSLASGLFLALNKVFGSPLDFLSGKSCSSVCGSTWDFICYIENFCVANLLKMGMVVILAYFVLLLLYLFHKIGIFGCIGRGFCRMIWT
Subjt:  MLNLSDAMGNVASSLASGLFLALNKVFGSPLDFLSGKSCSSVCGSTWDFICYIENFCVANLLKMGMVVILAYFVLLLLYLFHKIGIFGCIGRGFCRMIWT

Query:  CLASYCHAWEYCCTFMCVKLASVKRTRRCHRRRDLEEELESEEEAKHRYGSSCDSSNDSEKIESRRSERVSRKRRRSHRGSQTSKTLRPTSHGIRVRSGR
        CLASYCHAWEYCCTFMCVKLASVKRTRRCHRRRDLEEELESEEEAKHRYGSSCDSSNDSEKIESRRSERVSRKRRRSHRGSQTSKTLRPTSHGIRVRSGR
Subjt:  CLASYCHAWEYCCTFMCVKLASVKRTRRCHRRRDLEEELESEEEAKHRYGSSCDSSNDSEKIESRRSERVSRKRRRSHRGSQTSKTLRPTSHGIRVRSGR

Query:  VLVYSKHGSSKFVQKKRKYRRGRQRQRREYEI
        VLVYSKHGSSKFVQKKRKYRRGRQRQRREYEI
Subjt:  VLVYSKHGSSKFVQKKRKYRRGRQRQRREYEI

SwissProt top hitse value%identityAlignment
F4JP36 Protein HAPLESS 29.9e-0433.33Show/hide
Query:  KVFGSPLDFLSGKSCSSVCGSTWDFICYIENFCVANLLKMGMVVILAYFVLLLLYLFHKIGIF
        K+    +DF++G +C + C S +DF C+I+  C++ ++  G+++ L     LLL+L H+ G+F
Subjt:  KVFGSPLDFLSGKSCSSVCGSTWDFICYIENFCVANLLKMGMVVILAYFVLLLLYLFHKIGIF

Arabidopsis top hitse value%identityAlignment
AT1G21722.1 unknown protein5.5e-3441.27Show/hide
Query:  MGNVASSLASGLFLALNKVFGSPLDFLSGKSCSSVCGSTWDFICYIENFCVANLLKMGMVVILAYFVLLLLYLFHKIGIFGCIGRGFCRMIWTCLASYCH
        MGNV  S  +G   ++   FGSPLDFLSGKSCSSVC S WDFICY+ENFCVANL K  +++IL+YF L  +Y+ +K+G + CI  GF +++W  ++ + +
Subjt:  MGNVASSLASGLFLALNKVFGSPLDFLSGKSCSSVCGSTWDFICYIENFCVANLLKMGMVVILAYFVLLLLYLFHKIGIFGCIGRGFCRMIWTCLASYCH

Query:  AWEYCCTFMCVKLASVKRTRRCHRRRDLEEELESEEEAKHRYGSSCDSSNDSEKIESRRSERVSRKRRRSHRGSQTSKTLRPTSHGIRV
           YCC+F C  L   KR RR    R +EE+ +             D+S+D + ++   S    R +R   +  +  K+LRP +H +RV
Subjt:  AWEYCCTFMCVKLASVKRTRRCHRRRDLEEELESEEEAKHRYGSSCDSSNDSEKIESRRSERVSRKRRRSHRGSQTSKTLRPTSHGIRV

AT1G78922.1 unknown protein2.1e-0925.91Show/hide
Query:  MGNVASSLASGLFLALNKVFGSPLDFLSGKSCSSVCGSTWDFICYIENFCVANLLKMGMVVILAYFVLLLLYLFHKIGIFGCIGRGFCRMIWTCLASYCH
        MGN     ++ +   +  +F +PL    G+SC  VC   WD  C+IE+FC+ ++ K+ ++  L + +L+ + L  K+GI  C+ +  C+M     A+Y  
Subjt:  MGNVASSLASGLFLALNKVFGSPLDFLSGKSCSSVCGSTWDFICYIENFCVANLLKMGMVVILAYFVLLLLYLFHKIGIFGCIGRGFCRMIWTCLASYCH

Query:  AWEYCCTFMCVKLASVKRTRRCHRRRDLEEELESEEEAKHRYGSSCDSSNDSEKIE-SRRSERVSRKRRRSHRGSQTS--KTLRPTSHGIRVRSGRVLVY
              + +C  L ++ R  R  +R D + E  + +       SS DS +  + I   +R  R+  K    H GS  +  + +R  S  + VR G     
Subjt:  AWEYCCTFMCVKLASVKRTRRCHRRRDLEEELESEEEAKHRYGSSCDSSNDSEKIE-SRRSERVSRKRRRSHRGSQTS--KTLRPTSHGIRVRSGRVLVY

Query:  SKHGSSKFVQKKRKYRRGRQ
           G S+ V++  +  + R+
Subjt:  SKHGSSKFVQKKRKYRRGRQ

AT4G11720.1 hapless 27.0e-0533.33Show/hide
Query:  KVFGSPLDFLSGKSCSSVCGSTWDFICYIENFCVANLLKMGMVVILAYFVLLLLYLFHKIGIF
        K+    +DF++G +C + C S +DF C+I+  C++ ++  G+++ L     LLL+L H+ G+F
Subjt:  KVFGSPLDFLSGKSCSSVCGSTWDFICYIENFCVANLLKMGMVVILAYFVLLLLYLFHKIGIF


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCTGAATCTATCCGATGCAATGGGTAATGTAGCCAGTTCATTGGCCTCTGGTTTGTTTTTGGCTCTTAACAAAGTATTTGGGTCTCCACTTGATTTTCTCTCTGGAAA
ATCCTGCAGTTCAGTGTGTGGATCAACATGGGATTTCATATGCTACATAGAAAATTTCTGCGTTGCCAATTTGCTAAAGATGGGCATGGTCGTGATCCTTGCATACTTTG
TTCTTTTACTCCTCTATTTATTCCATAAAATTGGCATATTTGGATGCATCGGTCGGGGTTTCTGCAGAATGATATGGACATGTTTAGCTTCCTATTGCCATGCATGGGAG
TACTGCTGCACTTTCATGTGTGTCAAGCTTGCCAGTGTCAAAAGAACAAGACGATGCCATAGAAGAAGAGACCTAGAAGAAGAACTCGAAAGTGAAGAAGAAGCAAAACA
TCGATATGGGTCATCATGTGATTCGAGCAACGACTCCGAAAAGATCGAGTCGAGAAGGAGCGAACGGGTGTCTCGCAAACGGAGGAGGAGCCACAGAGGTTCTCAAACGA
GTAAGACATTGAGGCCAACGAGCCATGGAATTCGAGTGAGGAGCGGTAGAGTGTTGGTCTATAGTAAGCATGGATCATCCAAGTTCGTGCAGAAAAAAAGGAAGTATAGA
AGAGGAAGACAAAGACAAAGACGAGAGTATGAGATTTGA
mRNA sequenceShow/hide mRNA sequence
ATGCTGAATCTATCCGATGCAATGGGTAATGTAGCCAGTTCATTGGCCTCTGGTTTGTTTTTGGCTCTTAACAAAGTATTTGGGTCTCCACTTGATTTTCTCTCTGGAAA
ATCCTGCAGTTCAGTGTGTGGATCAACATGGGATTTCATATGCTACATAGAAAATTTCTGCGTTGCCAATTTGCTAAAGATGGGCATGGTCGTGATCCTTGCATACTTTG
TTCTTTTACTCCTCTATTTATTCCATAAAATTGGCATATTTGGATGCATCGGTCGGGGTTTCTGCAGAATGATATGGACATGTTTAGCTTCCTATTGCCATGCATGGGAG
TACTGCTGCACTTTCATGTGTGTCAAGCTTGCCAGTGTCAAAAGAACAAGACGATGCCATAGAAGAAGAGACCTAGAAGAAGAACTCGAAAGTGAAGAAGAAGCAAAACA
TCGATATGGGTCATCATGTGATTCGAGCAACGACTCCGAAAAGATCGAGTCGAGAAGGAGCGAACGGGTGTCTCGCAAACGGAGGAGGAGCCACAGAGGTTCTCAAACGA
GTAAGACATTGAGGCCAACGAGCCATGGAATTCGAGTGAGGAGCGGTAGAGTGTTGGTCTATAGTAAGCATGGATCATCCAAGTTCGTGCAGAAAAAAAGGAAGTATAGA
AGAGGAAGACAAAGACAAAGACGAGAGTATGAGATTTGAACTTCTAACTTTTTACCGTACATAAAAATTTAAATTAGTCATATTAAATAATAATGTTATTTCCTATCTAT
TATAATTATATTAATAAATCTTTGTCCTTGTCTGGAACTTAATTAGGTACATCCCATATAGACTTAGAAC
Protein sequenceShow/hide protein sequence
MLNLSDAMGNVASSLASGLFLALNKVFGSPLDFLSGKSCSSVCGSTWDFICYIENFCVANLLKMGMVVILAYFVLLLLYLFHKIGIFGCIGRGFCRMIWTCLASYCHAWE
YCCTFMCVKLASVKRTRRCHRRRDLEEELESEEEAKHRYGSSCDSSNDSEKIESRRSERVSRKRRRSHRGSQTSKTLRPTSHGIRVRSGRVLVYSKHGSSKFVQKKRKYR
RGRQRQRREYEI