; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmoCh01G014290 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene IDCmoCh01G014290
OrganismCucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
Descriptionprotein HAPLESS 2 isoform X1
Genome locationCmo_Chr01:11182778..11185350
RNA-Seq ExpressionCmoCh01G014290
SyntenyCmoCh01G014290
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6608041.1 Protein HAPLESS 2, partial [Cucurbita argyrosperma subsp. sororia]2.1e-11297.37Show/hide
Query:  MLNLSDAMGNVASSLASGLFLALNKVFGSPLDFLSGKSCSSVCGSTWDFICYIENFCVANLLKMGMVVILAYFVLLLLYLFHKIGIFGCIGRGLCRMIWT
        MLNLSDAMGNVASSLASGLFLALNKVFGSPLDFLSGKSCSSVCGSTWDFICYIENFCVANLLKMGMVVILAYFVLLLLYLFHKIGIFGCIGRGLCRMIWT
Subjt:  MLNLSDAMGNVASSLASGLFLALNKVFGSPLDFLSGKSCSSVCGSTWDFICYIENFCVANLLKMGMVVILAYFVLLLLYLFHKIGIFGCIGRGLCRMIWT

Query:  CLASYCHAWEYCCTFMCIKLASVKRT-RRRHRRRDLEEELESEEDAKYRYGSSSDSSNDSKKIESRRSKRVSRKRRRSHRGSQTSKTLRPPSHGIRVRSG
        CLASYCHAWEYCCTFMCIKLASVKRT RRRHRRRDLEEELESEE+AK+RYGSSSDSSNDS+KIESRRSKRVSRKRRRSHRGSQTSKTLRP SHGIRVRSG
Subjt:  CLASYCHAWEYCCTFMCIKLASVKRT-RRRHRRRDLEEELESEEDAKYRYGSSSDSSNDSKKIESRRSKRVSRKRRRSHRGSQTSKTLRPPSHGIRVRSG

Query:  RVLVYSKHGSSKIVQKERNYRRGRQIRR
        RVLVYSKHGSSKIVQKER YRRGRQIRR
Subjt:  RVLVYSKHGSSKIVQKERNYRRGRQIRR

KAG7031669.1 Protein HAPLESS 2 [Cucurbita argyrosperma subsp. argyrosperma]1.6e-11297.37Show/hide
Query:  MLNLSDAMGNVASSLASGLFLALNKVFGSPLDFLSGKSCSSVCGSTWDFICYIENFCVANLLKMGMVVILAYFVLLLLYLFHKIGIFGCIGRGLCRMIWT
        MLNLSDAMGNVASSLASGLFLALNKVFGSPLDFLSGKSCSSVCGSTWDFICYIENFCVANLLKMGMVVILAYFVLLLLYLFHKIGIFGCIGRGLCRMIWT
Subjt:  MLNLSDAMGNVASSLASGLFLALNKVFGSPLDFLSGKSCSSVCGSTWDFICYIENFCVANLLKMGMVVILAYFVLLLLYLFHKIGIFGCIGRGLCRMIWT

Query:  CLASYCHAWEYCCTFMCIKLASVKRT-RRRHRRRDLEEELESEEDAKYRYGSSSDSSNDSKKIESRRSKRVSRKRRRSHRGSQTSKTLRPPSHGIRVRSG
        CLASYCHAWEYCCTFMCIKLASVKRT RRRHRRRDLEEELESEE+AK+RYGSSSDSSNDS+KIESRRSKRVSRKRRRSHRGSQTSKTLRP SHGIRVRSG
Subjt:  CLASYCHAWEYCCTFMCIKLASVKRT-RRRHRRRDLEEELESEEDAKYRYGSSSDSSNDSKKIESRRSKRVSRKRRRSHRGSQTSKTLRPPSHGIRVRSG

Query:  RVLVYSKHGSSKIVQKERNYRRGRQIRR
        RVLVYSKHGSSKIVQKER YRRGRQIRR
Subjt:  RVLVYSKHGSSKIVQKERNYRRGRQIRR

XP_022940192.1 uncharacterized protein LOC111445893 [Cucurbita moschata]4.8e-117100Show/hide
Query:  MLNLSDAMGNVASSLASGLFLALNKVFGSPLDFLSGKSCSSVCGSTWDFICYIENFCVANLLKMGMVVILAYFVLLLLYLFHKIGIFGCIGRGLCRMIWT
        MLNLSDAMGNVASSLASGLFLALNKVFGSPLDFLSGKSCSSVCGSTWDFICYIENFCVANLLKMGMVVILAYFVLLLLYLFHKIGIFGCIGRGLCRMIWT
Subjt:  MLNLSDAMGNVASSLASGLFLALNKVFGSPLDFLSGKSCSSVCGSTWDFICYIENFCVANLLKMGMVVILAYFVLLLLYLFHKIGIFGCIGRGLCRMIWT

Query:  CLASYCHAWEYCCTFMCIKLASVKRTRRRHRRRDLEEELESEEDAKYRYGSSSDSSNDSKKIESRRSKRVSRKRRRSHRGSQTSKTLRPPSHGIRVRSGR
        CLASYCHAWEYCCTFMCIKLASVKRTRRRHRRRDLEEELESEEDAKYRYGSSSDSSNDSKKIESRRSKRVSRKRRRSHRGSQTSKTLRPPSHGIRVRSGR
Subjt:  CLASYCHAWEYCCTFMCIKLASVKRTRRRHRRRDLEEELESEEDAKYRYGSSSDSSNDSKKIESRRSKRVSRKRRRSHRGSQTSKTLRPPSHGIRVRSGR

Query:  VLVYSKHGSSKIVQKERNYRRGRQIRR
        VLVYSKHGSSKIVQKERNYRRGRQIRR
Subjt:  VLVYSKHGSSKIVQKERNYRRGRQIRR

XP_022981050.1 uncharacterized protein LOC111480312 [Cucurbita maxima]3.7e-10993.83Show/hide
Query:  MLNLSDAMGNVASSLASGLFLALNKVFGSPLDFLSGKSCSSVCGSTWDFICYIENFCVANLLKMGMVVILAYFVLLLLYLFHKIGIFGCIGRGLCRMIWT
        MLNLSDAMGNVASSLASGLFLALNKVFGSPLDFLSGKSCSSVCGSTWDFICYIENFCVANLLKMGMVVILAYFVLLLLYLFHKIGIFGCIGRG CRMIWT
Subjt:  MLNLSDAMGNVASSLASGLFLALNKVFGSPLDFLSGKSCSSVCGSTWDFICYIENFCVANLLKMGMVVILAYFVLLLLYLFHKIGIFGCIGRGLCRMIWT

Query:  CLASYCHAWEYCCTFMCIKLASVKRTRRRHRRRDLEEELESEEDAKYRYGSSSDSSNDSKKIESRRSKRVSRKRRRSHRGSQTSKTLRPPSHGIRVRSGR
        CLASYCHAWEYCCTFMC+KLASVKRTRR HRRRDLEEELESEE+AK+RYGSS DSSNDS+KIESRRS+RVSRKRRRSHRGSQTSKTLRP SHGIRVRSGR
Subjt:  CLASYCHAWEYCCTFMCIKLASVKRTRRRHRRRDLEEELESEEDAKYRYGSSSDSSNDSKKIESRRSKRVSRKRRRSHRGSQTSKTLRPPSHGIRVRSGR

Query:  VLVYSKHGSSKIVQKERNYRRGRQIRR
        VLVYSKHGSSK VQK+R YRRGRQ +R
Subjt:  VLVYSKHGSSKIVQKERNYRRGRQIRR

XP_023523489.1 protein HAPLESS 2 [Cucurbita pepo subsp. pepo]8.8e-11195.58Show/hide
Query:  MLNLSDAMGNVASSLASGLFLALNKVFGSPLDFLSGKSCSSVCGSTWDFICYIENFCVANLLKMGMVVILAYFVLLLLYLFHKIGIFGCIGRGLCRMIWT
        MLNLSDAMGNVASSLASGLFLALNKVFGSPLDFLSGKSCSSVCGSTWDFICYIENFCVANLLKMGMVVILAYFVLLLLYLFHKIGIFGCIGRGLCRMIWT
Subjt:  MLNLSDAMGNVASSLASGLFLALNKVFGSPLDFLSGKSCSSVCGSTWDFICYIENFCVANLLKMGMVVILAYFVLLLLYLFHKIGIFGCIGRGLCRMIWT

Query:  CLASYCHAWEYCCTFMCIKLASVKRTRRRHRRRDLEEELESEEDAKYRYGSSSDSSNDSKKIESRRSKRVSRKRRRSHRGSQTSKTLRPPSHGIRVRSGR
        CLASYCHAWEYCCTFMCIKLASVKRTRRRHRRRDLEEELESEE+AK+RYGSSSDSSN+S+KIESRR KRVSRK+RRSHRGSQTSKTLRP  HGIRVRSGR
Subjt:  CLASYCHAWEYCCTFMCIKLASVKRTRRRHRRRDLEEELESEEDAKYRYGSSSDSSNDSKKIESRRSKRVSRKRRRSHRGSQTSKTLRPPSHGIRVRSGR

Query:  VLVYSKHGSSKIVQKERNYRRGRQIR
        VLVYSKHGSSK VQKER YRRGRQIR
Subjt:  VLVYSKHGSSKIVQKERNYRRGRQIR

TrEMBL top hitse value%identityAlignment
A0A0A0L1V0 Uncharacterized protein2.1e-7871.31Show/hide
Query:  MGNVASSLASGLFLALNKVFGSPLDFLSGKSCSSVCGSTWDFICYIENFCVANLLKMGMVVILAYFVLLLLYLFHKIGIFGCIGRGLCRMIWTCLASYCH
        MGNVASSLAS +F A+ K+FGSPLDFLSG+SCSSVCGSTWDFICYIENFCVANLLKMGMV IL+YFVLLLLYL HKIGIF CIGRGLCRMIWTCLASY +
Subjt:  MGNVASSLASGLFLALNKVFGSPLDFLSGKSCSSVCGSTWDFICYIENFCVANLLKMGMVVILAYFVLLLLYLFHKIGIFGCIGRGLCRMIWTCLASYCH

Query:  AWEYCCTFMCIKLASVKRTRRRH-RRRDLEEELESEEDAKYRYGSSSDSSNDSKKIESRRSKRVSRKRRRSHRGSQTSKTLRPPSHGIRVRSGRVLVYSK
        AWEYCC FMCIKLASVKRTRRRH RRRD+EEE E EE  K R+ S+SDS+N  + +ES+ S+RVS++ RR+HR SQ  K+LRP  HG+RVRSGRVLVY K
Subjt:  AWEYCCTFMCIKLASVKRTRRRH-RRRDLEEELESEEDAKYRYGSSSDSSNDSKKIESRRSKRVSRKRRRSHRGSQTSKTLRPPSHGIRVRSGRVLVYSK

Query:  H--------------------GSSKIVQKERNYRRGR
        H                    GSSK V KER YRRGR
Subjt:  H--------------------GSSKIVQKERNYRRGR

A0A1S4DVV8 protein HAPLESS 2 isoform X11.5e-7971.85Show/hide
Query:  MGNVASSLASGLFLALNKVFGSPLDFLSGKSCSSVCGSTWDFICYIENFCVANLLKMGMVVILAYFVLLLLYLFHKIGIFGCIGRGLCRMIWTCLASYCH
        MGNVASSLAS +F A+ K+FGSPLDFLSG+SCSSVCGSTWDFICYIENFCVANL+KMGMV IL+YFVLLLLYL HKIGIFGCIGRGLCRMIWTCLASY +
Subjt:  MGNVASSLASGLFLALNKVFGSPLDFLSGKSCSSVCGSTWDFICYIENFCVANLLKMGMVVILAYFVLLLLYLFHKIGIFGCIGRGLCRMIWTCLASYCH

Query:  AWEYCCTFMCIKLASVKRTRRRH-RRRDLEEELESEEDAKYRYGSSSDSSNDSKKIESRRSKRVSRKRRRSHRGSQTSKTLRPPSHGIRVRSGRVLVYSK
        AWEYCC+FMCIKLASVKRTRRRH RRRDLEEE ESEE  K ++ S+SDSSN  + +ESR S+  SR+ RR+H+ SQ  K+LRP  HG+RVRSGRVLVY K
Subjt:  AWEYCCTFMCIKLASVKRTRRRH-RRRDLEEELESEEDAKYRYGSSSDSSNDSKKIESRRSKRVSRKRRRSHRGSQTSKTLRPPSHGIRVRSGRVLVYSK

Query:  H--------------------GSSKIVQKERNYRRGRQ
        H                    GSSK V KER YRRGRQ
Subjt:  H--------------------GSSKIVQKERNYRRGRQ

A0A6J1DMF3 uncharacterized protein LOC1110213528.1e-7869.39Show/hide
Query:  MLNLSDAMGNVASSLASGLFLALNKVFGSPLDFLSGKSCSSVCGSTWDFICYIENFCVANLLKMGMVVILAYFVLLLLYLFHKIGIFGCIGRGLCRMIWT
        M NL D MGNVASS+ASG F A+ K+F SPLDFLSGKSCSSVCGSTWDFICYIENFCVANLLK+GMV+IL+ FV+LLLYL HKIGIFGCI RGLCRM WT
Subjt:  MLNLSDAMGNVASSLASGLFLALNKVFGSPLDFLSGKSCSSVCGSTWDFICYIENFCVANLLKMGMVVILAYFVLLLLYLFHKIGIFGCIGRGLCRMIWT

Query:  CLASYCHAWEYCCTFMCIKLASVKRT-RRRHRRRDLEEELESEEDAKYRYGSSSDSSNDSKKIESRRSKRVSRKRRRSHRGSQTSKTLRPPSHGIRVRSG
        C+ASY +AW+YCCTFMCIKL SVKRT RRRHRRRDLEEE ES E  K+RYGSSSDSS+  ++IE R S+R SR+ R +HRGSQ  K LRP S GIRVRSG
Subjt:  CLASYCHAWEYCCTFMCIKLASVKRT-RRRHRRRDLEEELESEEDAKYRYGSSSDSSNDSKKIESRRSKRVSRKRRRSHRGSQTSKTLRPPSHGIRVRSG

Query:  RVLVYSK--------------------HGSSKIVQKERNYRRGRQ
        R LVY K                    HGSSK V +E  Y+RGRQ
Subjt:  RVLVYSK--------------------HGSSKIVQKERNYRRGRQ

A0A6J1FIX0 uncharacterized protein LOC1114458932.3e-117100Show/hide
Query:  MLNLSDAMGNVASSLASGLFLALNKVFGSPLDFLSGKSCSSVCGSTWDFICYIENFCVANLLKMGMVVILAYFVLLLLYLFHKIGIFGCIGRGLCRMIWT
        MLNLSDAMGNVASSLASGLFLALNKVFGSPLDFLSGKSCSSVCGSTWDFICYIENFCVANLLKMGMVVILAYFVLLLLYLFHKIGIFGCIGRGLCRMIWT
Subjt:  MLNLSDAMGNVASSLASGLFLALNKVFGSPLDFLSGKSCSSVCGSTWDFICYIENFCVANLLKMGMVVILAYFVLLLLYLFHKIGIFGCIGRGLCRMIWT

Query:  CLASYCHAWEYCCTFMCIKLASVKRTRRRHRRRDLEEELESEEDAKYRYGSSSDSSNDSKKIESRRSKRVSRKRRRSHRGSQTSKTLRPPSHGIRVRSGR
        CLASYCHAWEYCCTFMCIKLASVKRTRRRHRRRDLEEELESEEDAKYRYGSSSDSSNDSKKIESRRSKRVSRKRRRSHRGSQTSKTLRPPSHGIRVRSGR
Subjt:  CLASYCHAWEYCCTFMCIKLASVKRTRRRHRRRDLEEELESEEDAKYRYGSSSDSSNDSKKIESRRSKRVSRKRRRSHRGSQTSKTLRPPSHGIRVRSGR

Query:  VLVYSKHGSSKIVQKERNYRRGRQIRR
        VLVYSKHGSSKIVQKERNYRRGRQIRR
Subjt:  VLVYSKHGSSKIVQKERNYRRGRQIRR

A0A6J1IVE5 uncharacterized protein LOC1114803121.8e-10993.83Show/hide
Query:  MLNLSDAMGNVASSLASGLFLALNKVFGSPLDFLSGKSCSSVCGSTWDFICYIENFCVANLLKMGMVVILAYFVLLLLYLFHKIGIFGCIGRGLCRMIWT
        MLNLSDAMGNVASSLASGLFLALNKVFGSPLDFLSGKSCSSVCGSTWDFICYIENFCVANLLKMGMVVILAYFVLLLLYLFHKIGIFGCIGRG CRMIWT
Subjt:  MLNLSDAMGNVASSLASGLFLALNKVFGSPLDFLSGKSCSSVCGSTWDFICYIENFCVANLLKMGMVVILAYFVLLLLYLFHKIGIFGCIGRGLCRMIWT

Query:  CLASYCHAWEYCCTFMCIKLASVKRTRRRHRRRDLEEELESEEDAKYRYGSSSDSSNDSKKIESRRSKRVSRKRRRSHRGSQTSKTLRPPSHGIRVRSGR
        CLASYCHAWEYCCTFMC+KLASVKRTRR HRRRDLEEELESEE+AK+RYGSS DSSNDS+KIESRRS+RVSRKRRRSHRGSQTSKTLRP SHGIRVRSGR
Subjt:  CLASYCHAWEYCCTFMCIKLASVKRTRRRHRRRDLEEELESEEDAKYRYGSSSDSSNDSKKIESRRSKRVSRKRRRSHRGSQTSKTLRPPSHGIRVRSGR

Query:  VLVYSKHGSSKIVQKERNYRRGRQIRR
        VLVYSKHGSSK VQK+R YRRGRQ +R
Subjt:  VLVYSKHGSSKIVQKERNYRRGRQIRR

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G21722.1 unknown protein6.0e-3340.43Show/hide
Query:  MGNVASSLASGLFLALNKVFGSPLDFLSGKSCSSVCGSTWDFICYIENFCVANLLKMGMVVILAYFVLLLLYLFHKIGIFGCIGRGLCRMIWTCLASYCH
        MGNV  S  +G   ++   FGSPLDFLSGKSCSSVC S WDFICY+ENFCVANL K  +++IL+YF L  +Y+ +K+G + CI  G  +++W  ++ + +
Subjt:  MGNVASSLASGLFLALNKVFGSPLDFLSGKSCSSVCGSTWDFICYIENFCVANLLKMGMVVILAYFVLLLLYLFHKIGIFGCIGRGLCRMIWTCLASYCH

Query:  AWEYCCTFMCIKLASVKRTRRRHRRRDLEEELESEEDAKYRYGSSSDSSNDSKKIESRRSKRVSRKRRRSHRGSQTSKTLRPPSHGIRV------RSGRV
           YCC+F C  L   KR RRR   R +EE+ +   D         D  +D       RSKR  RK  R        K+LRP +H +RV      RS   
Subjt:  AWEYCCTFMCIKLASVKRTRRRHRRRDLEEELESEEDAKYRYGSSSDSSNDSKKIESRRSKRVSRKRRRSHRGSQTSKTLRPPSHGIRV------RSGRV

Query:  LVYSKHGSSKI----VQKERNYRRGRQIRR
        L     G S I    V +E  + R    RR
Subjt:  LVYSKHGSSKI----VQKERNYRRGRQIRR

AT1G78922.1 unknown protein6.9e-1327.31Show/hide
Query:  MGNVASSLASGLFLALNKVFGSPLDFLSGKSCSSVCGSTWDFICYIENFCVANLLKMGMVVILAYFVLLLLYLFHKIGIFGCIGRGLCRMIWTCLASYCH
        MGN     ++ +   +  +F +PL    G+SC  VC   WD  C+IE+FC+ ++ K+ ++  L + +L+ + L  K+GI  C+ + +C+M     A+Y  
Subjt:  MGNVASSLASGLFLALNKVFGSPLDFLSGKSCSSVCGSTWDFICYIENFCVANLLKMGMVVILAYFVLLLLYLFHKIGIFGCIGRGLCRMIWTCLASYCH

Query:  AWEYCCTFMCIKLASVKRTRRRHRRRDLEEELESEEDAKYRYGS--SSDSSNDSKKIES----RRSKRVSRKRRRSHRGSQTS--KTLRPPSHGIRVRSG
              + +C  L ++ R  RR +R      L+  E   Y Y S   S SS+   +I++    +R +R+  K    H GS  +  + +R PS  + VR G
Subjt:  AWEYCCTFMCIKLASVKRTRRRHRRRDLEEELESEEDAKYRYGS--SSDSSNDSKKIES----RRSKRVSRKRRRSHRGSQTS--KTLRPPSHGIRVRSG

Query:  RVLVYSKHGSSKIVQKERNYRRGRQIR
                G S+ V++     + R+I+
Subjt:  RVLVYSKHGSSKIVQKERNYRRGRQIR

AT4G11720.1 hapless 29.0e-0533.33Show/hide
Query:  KVFGSPLDFLSGKSCSSVCGSTWDFICYIENFCVANLLKMGMVVILAYFVLLLLYLFHKIGIF
        K+    +DF++G +C + C S +DF C+I+  C++ ++  G+++ L     LLL+L H+ G+F
Subjt:  KVFGSPLDFLSGKSCSSVCGSTWDFICYIENFCVANLLKMGMVVILAYFVLLLLYLFHKIGIF


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCTGAATCTATCCGACGCAATGGGTAATGTAGCCAGTTCATTGGCCTCTGGTTTGTTTTTGGCTCTTAACAAAGTATTTGGATCTCCACTTGATTTTCTCTCTGGAAA
ATCCTGCAGTTCAGTGTGTGGATCAACATGGGATTTCATATGCTACATAGAAAATTTCTGCGTTGCCAATTTGCTAAAGATGGGCATGGTCGTGATCCTTGCATACTTTG
TTCTTTTACTCCTCTATTTATTCCATAAAATTGGCATATTTGGATGCATCGGTCGGGGTCTCTGCAGAATGATATGGACATGTTTAGCTTCCTATTGCCATGCATGGGAG
TACTGCTGCACTTTCATGTGTATCAAGCTTGCCAGTGTCAAAAGAACAAGACGACGCCATAGAAGAAGAGACCTAGAAGAAGAACTCGAAAGTGAAGAAGACGCAAAATA
TCGATATGGGTCATCAAGTGATTCGAGCAACGACTCCAAAAAGATTGAGTCGAGAAGGAGCAAACGGGTGTCTCGCAAACGGAGGAGGAGCCACAGAGGTTCTCAAACAA
GTAAGACATTGAGGCCACCGAGCCATGGAATTCGAGTGAGGAGTGGTAGAGTGTTGGTCTATAGTAAGCATGGATCATCCAAGATCGTGCAGAAAGAAAGAAACTATAGA
AGAGGAAGACAAATACGAAGGTAA
mRNA sequenceShow/hide mRNA sequence
TCCAGATCTTTCAACGCGCTGTGGCCGACGGTTATCTTTCAAGAAACCGGCGGTGTGGAAATGAGTCACCCGCCACTCTTCTTCCATATGCGATTCTTCACTTCCCCTTG
CTGTTTAAACATTCTTATCGATGAGTGTTTTGTTTTGTTTTGTTAGTGAATGCTGAATCTATCCGACGCAATGGGTAATGTAGCCAGTTCATTGGCCTCTGGTTTGTTTT
TGGCTCTTAACAAAGTATTTGGATCTCCACTTGATTTTCTCTCTGGAAAATCCTGCAGTTCAGTGTGTGGATCAACATGGGATTTCATATGCTACATAGAAAATTTCTGC
GTTGCCAATTTGCTAAAGATGGGCATGGTCGTGATCCTTGCATACTTTGTTCTTTTACTCCTCTATTTATTCCATAAAATTGGCATATTTGGATGCATCGGTCGGGGTCT
CTGCAGAATGATATGGACATGTTTAGCTTCCTATTGCCATGCATGGGAGTACTGCTGCACTTTCATGTGTATCAAGCTTGCCAGTGTCAAAAGAACAAGACGACGCCATA
GAAGAAGAGACCTAGAAGAAGAACTCGAAAGTGAAGAAGACGCAAAATATCGATATGGGTCATCAAGTGATTCGAGCAACGACTCCAAAAAGATTGAGTCGAGAAGGAGC
AAACGGGTGTCTCGCAAACGGAGGAGGAGCCACAGAGGTTCTCAAACAAGTAAGACATTGAGGCCACCGAGCCATGGAATTCGAGTGAGGAGTGGTAGAGTGTTGGTCTA
TAGTAAGCATGGATCATCCAAGATCGTGCAGAAAGAAAGAAACTATAGAAGAGGAAGACAAATACGAAGGTAAGAGATTTGAACTTCTAACTTTTTACGATACATAAAAT
TTAAATCAGTCATATTAATTTTTGT
Protein sequenceShow/hide protein sequence
MLNLSDAMGNVASSLASGLFLALNKVFGSPLDFLSGKSCSSVCGSTWDFICYIENFCVANLLKMGMVVILAYFVLLLLYLFHKIGIFGCIGRGLCRMIWTCLASYCHAWE
YCCTFMCIKLASVKRTRRRHRRRDLEEELESEEDAKYRYGSSSDSSNDSKKIESRRSKRVSRKRRRSHRGSQTSKTLRPPSHGIRVRSGRVLVYSKHGSSKIVQKERNYR
RGRQIRR