; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmaCh06G004840 (gene) of Cucurbita maxima (Rimu) v1.1 genome

Gene IDCmaCh06G004840
OrganismCucurbita maxima Rimu (Cucurbita maxima (Rimu) v1.1)
DescriptionGATA zinc finger domain-containing protein isoform 1
Genome locationCma_Chr06:2277758..2283069
RNA-Seq ExpressionCmaCh06G004840
SyntenyCmaCh06G004840
Gene Ontology termsNA
InterPro domainsIPR023213 - Chloramphenicol acetyltransferase-like domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6596578.1 hypothetical protein SDJN03_09758, partial [Cucurbita argyrosperma subsp. sororia]0.0e+0097.16Show/hide
Query:  MRTSLMTVFEDTVIDNSGRMQEEIGVNDYMGCASIHGIGPSIAVFDTIRDGQLDCVCVYPAPLHSREQMEALVENMKRFLMAAVLCTHTDAASEMSDPPA
        MRTSLMTVFEDTVIDNSGRMQEEIGVNDYMGCASIHGIGPSIAVFDT+RDGQLDCVCVYPAPLHSREQMEALVENMKRFLMAAVLCTHTDAASEMSDPPA
Subjt:  MRTSLMTVFEDTVIDNSGRMQEEIGVNDYMGCASIHGIGPSIAVFDTIRDGQLDCVCVYPAPLHSREQMEALVENMKRFLMAAVLCTHTDAASEMSDPPA

Query:  GESKSRPVGGTEYSWCRAAPGGTGTTVLGLLLSKPPDLPNLQSTLHSLQNLHPILLSKIHYDPYRRDFSFLNPPSPPLHLQILDLTATARAIASHPDADD
        GESKSRPVGGTEYSWCRAAPGGTGTTVLGLLLSKPPDL NLQSTLHSLQNLHPIL SKI YDP RRDFSFL PPSP LHLQILDLTA ARAIASHPDADD
Subjt:  GESKSRPVGGTEYSWCRAAPGGTGTTVLGLLLSKPPDLPNLQSTLHSLQNLHPILLSKIHYDPYRRDFSFLNPPSPPLHLQILDLTATARAIASHPDADD

Query:  PSVSDFHKILEHEINITTWLDPNHPSYSDTDVMFASVYTISDGQWGVFLRLHTAVCDRTAATALLRELLAAASGENEGGEFEIGDHGEIGLGIEDLIPNG
        PSVSDFHKILEHEINITTWLDPNHPSYSDTDVMFASVYTI+DGQW VFLRLHTAVCDRTAATALLRELLA ASGENEGGEFEIGDHGEIGLGIEDLIPNG
Subjt:  PSVSDFHKILEHEINITTWLDPNHPSYSDTDVMFASVYTISDGQWGVFLRLHTAVCDRTAATALLRELLAAASGENEGGEFEIGDHGEIGLGIEDLIPNG

Query:  KANKPLWARGLDMLGYSLNSFRLANLEFKDANSPRFSQMIRLKMNSDTTEKLLAGCKLRGIKVCGALAAAGLIATRCSKDLPPYHKEKYGVVTLNDCRSL
        KANKPLWARGLDMLGYSLNSFRLANLEFKDANSPRFSQMIRLKMNSDTTEKLLAGCKLRGIKVCGALAAAGLIATRCSKDLPPYHKEKYGVVTLNDCRSL
Subjt:  KANKPLWARGLDMLGYSLNSFRLANLEFKDANSPRFSQMIRLKMNSDTTEKLLAGCKLRGIKVCGALAAAGLIATRCSKDLPPYHKEKYGVVTLNDCRSL

Query:  LNPPLTTHHLGFYHSAILNTHDLSAEDTLWDVAKRCYFAFSNAKDNNKHFSDMSDLNFLMCKAIENPGLTPSSSMRTALISVFEDPIFEAFSPAQEHLGL
        LNPPLTTHHLGFYHSAILNTHD+SAEDTLWDVAKRCYFAFSNAKDNNKHFSDMSDLNFLMCKAIENPGLTPSSSMRTALISVFEDPIFEAFSPAQE+LGL
Subjt:  LNPPLTTHHLGFYHSAILNTHDLSAEDTLWDVAKRCYFAFSNAKDNNKHFSDMSDLNFLMCKAIENPGLTPSSSMRTALISVFEDPIFEAFSPAQEHLGL

Query:  HDYIGCASAHGVGPSIAFFDMIRDGQLDCACVYPFPLFSRDQMNQIVYQMKKILVGAIEVVEG
        HDYIGCASAHGVGPSIAFFDMIRDGQLDCACVYPFPLFSRDQMNQIV +MKKILVGAI+VVEG
Subjt:  HDYIGCASAHGVGPSIAFFDMIRDGQLDCACVYPFPLFSRDQMNQIVYQMKKILVGAIEVVEG

KAG7017467.1 hypothetical protein SDJN02_19332, partial [Cucurbita argyrosperma subsp. argyrosperma]0.0e+0076.96Show/hide
Query:  MEDSRSRRRVATGTETAWCRAVPGGTGTAVLALSSSTAPPNLQLLQNALNKLQNAHPVLKSKLQFSPISSTVSFVTSPTPSVQVNTFKAPETSKIINGQN
        ME S SRRRVA  TETAWCRAVPGGTGTAV+ALSSS A PNLQLLQNAL +LQN+HP+LKSKL F+PISS  SF+TSPTP VQ+ T++ PETSKI+N QN
Subjt:  MEDSRSRRRVATGTETAWCRAVPGGTGTAVLALSSSTAPPNLQLLQNALNKLQNAHPVLKSKLQFSPISSTVSFVTSPTPSVQVNTFKAPETSKIINGQN

Query:  TLLNNNHHHAISISPLQILLEHELNENTAWCNLHHSD----AAADMFFVTLYEVGSSKWVAVFRLHVAACDRTTAVSLLKELLDLMNDGGGRDKKEEMEL
         L  +   H ISISPLQI+LEHELNEN+ W  LH+SD    +AADM FV+LYEVGS KW+ VFRLHVAACDRTTAVSLL+ELL LMN GGG DK  E+EL
Subjt:  TLLNNNHHHAISISPLQILLEHELNENTAWCNLHHSD----AAADMFFVTLYEVGSSKWVAVFRLHVAACDRTTAVSLLKELLDLMNDGGGRDKKEEMEL

Query:  GMENLVPRKLAKKPLLTRGLDMISYSMNSLRLTNLKFKDAKSPRRSQVARLQMNHNQTQKILYECKRRGIKLSSAMVAAGLVAAHSSGGHSIHRHQRKYG
        GME+LVPRKLAKK +L+RGL++ISYS+NSLRLTNLKFKD KS RRSQVARLQMN  +T KIL ECK RGIKLSSAMVAAGLVA HSSG H + RHQRKYG
Subjt:  GMENLVPRKLAKKPLLTRGLDMISYSMNSLRLTNLKFKDAKSPRRSQVARLQMNHNQTQKILYECKRRGIKLSSAMVAAGLVAAHSSGGHSIHRHQRKYG

Query:  VITLIDCRRFLEPPLSTHHFGFYHAAILNSYTVRGGEDLWELAGKISSTLEASKNSNKHFTDMSDLNFLLCRAIENPSLTSSGAMRTSLMTVFEDTVIDN
        +ITLIDCRRFLEPPL +HHFGFYHAAILNSYT+RGGE+LWELA KIS+TLEASKNSNKHFTDMSDLNFLLCRA+ENPSLT SGAMRTSLMTVFEDTV+DN
Subjt:  VITLIDCRRFLEPPLSTHHFGFYHAAILNSYTVRGGEDLWELAGKISSTLEASKNSNKHFTDMSDLNFLLCRAIENPSLTSSGAMRTSLMTVFEDTVIDN

Query:  SGRMQEEIGVNDYMGCASIHGIGPSIAVFDTIRDGQLDCVCVYPAPLHSREQMEALVENMKRFLMAAV--------------------------------
        SG MQ EIG+ DYMGCAS HGIGPS+AVFDTIRDG+LDC CVYPAPLHSREQMEALV+NMK  L   V                                
Subjt:  SGRMQEEIGVNDYMGCASIHGIGPSIAVFDTIRDGQLDCVCVYPAPLHSREQMEALVENMKRFLMAAV--------------------------------

Query:  LCTHTDAASEMSDPPAGESKSRPVGGTEYSWCRAAPGGTGTTVLGLLLSKPPDLPNLQSTLHSLQNLHPILLSKIHYDPYRRDFSFLNPPSPPLHLQILD
        LCT T  AS+MSD    E K RPVGGTE+SWCRA PGGTGTTVLGLLLSKPPD+ +LQ++LH+LQNLHPIL SKIH+DP RRDFS L PPSP +HLQILD
Subjt:  LCTHTDAASEMSDPPAGESKSRPVGGTEYSWCRAAPGGTGTTVLGLLLSKPPDLPNLQSTLHSLQNLHPILLSKIHYDPYRRDFSFLNPPSPPLHLQILD

Query:  LTATARAIASHPDADDPSVSDFHKILEHEINITTWLDPNHPSYSDTDVMFASVYTISDGQWGVFLRLHTAVCDRTAATALLREL--LAAASGENEGGEFE
        L A ARAIASHPDAD+PS+SDFHKILEHEIN   WL+P+HPSYSDTDVMFA+VY +SDGQW VFL LHTA CDR AA +LLREL  L AA G+ EGG FE
Subjt:  LTATARAIASHPDADDPSVSDFHKILEHEINITTWLDPNHPSYSDTDVMFASVYTISDGQWGVFLRLHTAVCDRTAATALLREL--LAAASGENEGGEFE

Query:  IGDHGEIGLGIEDLIPNGKANKPLWARGLDMLGYSLNSFRLANLEFKDANSPRFSQMIRLKMNSDTTEKLLAGCKLRGIKVCGALAAAGLIATRCSKDLP
        IGD+GEIG GIEDLIP+GKA KPLWARGLDMLGYSLNSFR ANLEFKDA+S RFSQMIRLK+NSD T+KLLAGCK RGIK+CGAL AAGLIATRCSKDLP
Subjt:  IGDHGEIGLGIEDLIPNGKANKPLWARGLDMLGYSLNSFRLANLEFKDANSPRFSQMIRLKMNSDTTEKLLAGCKLRGIKVCGALAAAGLIATRCSKDLP

Query:  PYHKEKYGVVTLNDCRSLLNPPLTTHHLGFYHSAILNTHDLSAEDTLWDVAKRCYFAFSNAKDNNKHFSDMSDLNFLMCKAIENPGLTPSSSMRTALISV
        PY  EKY VVTL DCRSLL+PPLTTHHLGFYHSAILNTHD+SAEDTLW+VA+RCYF+FSN K+NNKHF+DMSDLNFLM KAIENP LTPSSSMRTALIS 
Subjt:  PYHKEKYGVVTLNDCRSLLNPPLTTHHLGFYHSAILNTHDLSAEDTLWDVAKRCYFAFSNAKDNNKHFSDMSDLNFLMCKAIENPGLTPSSSMRTALISV

Query:  FEDPIFEAFSPAQEHLGLHDYIGCASAHGVGPSIAFFDMIRDGQLDCACVYPFPLFSRDQMNQIVYQMKKILV-GAIEVVE
        FEDPI     PAQ+HLG+ DYIGCASAHGVGPSIA FDMIRDGQLDCACVYP PLFSRDQMN +  +MKKILV GA+EVVE
Subjt:  FEDPIFEAFSPAQEHLGLHDYIGCASAHGVGPSIAFFDMIRDGQLDCACVYPFPLFSRDQMNQIVYQMKKILV-GAIEVVE

KAG7028117.1 hypothetical protein SDJN02_09297, partial [Cucurbita argyrosperma subsp. argyrosperma]0.0e+0079.09Show/hide
Query:  MEDSRSRRRVATGTETAWCRAVPGGTGTAVLAL----SSSTAPPNLQLLQNALNKLQNAHPVLKSKLQFSPISSTVSFVTSPTPSVQVNTFKAPETSKII
        MEDSRSRRRVATGTETAWCRAVPGGTGTAVLAL    SSS+APPNLQLLQNALNKLQNAHPVLKSKL +SPISSTVSFVTSPTPSVQV TFKAPETSKII
Subjt:  MEDSRSRRRVATGTETAWCRAVPGGTGTAVLAL----SSSTAPPNLQLLQNALNKLQNAHPVLKSKLQFSPISSTVSFVTSPTPSVQVNTFKAPETSKII

Query:  NGQNTLLNNNHHHAISISPLQILLEHELNENTAWCNLHHSDAAADMFFVTLYEVGSSKWVAVFRLHVAACDRTTAVSLLKELLDLMNDGGGRDKKEEMEL
        N QNTLLNNNHHHAISISPLQILLEHELNENT W NL+ SD AADM FVTLYEVGSS WVAVFRLHVAACDRTTAVSLL+ELL LMNDGGG DKK+EMEL
Subjt:  NGQNTLLNNNHHHAISISPLQILLEHELNENTAWCNLHHSDAAADMFFVTLYEVGSSKWVAVFRLHVAACDRTTAVSLLKELLDLMNDGGGRDKKEEMEL

Query:  GMENLVPRKLAKKPLLTRGLDMISYSMNSLRLTNLKFKDAKSPRRSQVARLQMNHNQTQKILYECKRRGIKLSSAMVAAGLVAAHSSGGHSIHRHQRKYG
        GMENLVPRKLAKKPLLTRGLDMISYSMNSLRLTNLKFKD KSPRRSQV RLQ+NHNQTQKIL+                                     
Subjt:  GMENLVPRKLAKKPLLTRGLDMISYSMNSLRLTNLKFKDAKSPRRSQVARLQMNHNQTQKILYECKRRGIKLSSAMVAAGLVAAHSSGGHSIHRHQRKYG

Query:  VITLIDCRRFLEPPLSTHHFGFYHAAILNSYTVRGGEDLWELAGKISSTLEASKNSNKHFTDMSDLNFLLCRAIENPSLTSSGAMRTSLMTVFEDTVIDN
                            GFYHAAILNSYTVRGGEDLWELAGKISSTLEASKN NKHFTD+                                     
Subjt:  VITLIDCRRFLEPPLSTHHFGFYHAAILNSYTVRGGEDLWELAGKISSTLEASKNSNKHFTDMSDLNFLLCRAIENPSLTSSGAMRTSLMTVFEDTVIDN

Query:  SGRMQEEIGVNDYMGCASIHGIGPSIAVFDTIRDGQLDCVCVYPAPLHSREQMEALVENMKRFLMAAVLCTHTDAASEMSDPPAGESKSRPVGGTEYSWC
                                                                      FLMAAVLCTHTDAASEMSDPPAGESKSRPVGGTEYSWC
Subjt:  SGRMQEEIGVNDYMGCASIHGIGPSIAVFDTIRDGQLDCVCVYPAPLHSREQMEALVENMKRFLMAAVLCTHTDAASEMSDPPAGESKSRPVGGTEYSWC

Query:  RAAPGGTGTTVLGLLLSKPPDLPNLQSTLHSLQNLHPILLSKIHYDPYRRDFSFLNPPSPPLHLQILDLTATARAIASHPDADDPSVSDFHKILEHEINI
        RAAPGGTGTTVLGLLLSKPPDL NLQSTLHSLQNLHPIL SKI YDP RRDFSFL PPSP LHLQILDLTA ARAIASHPDADDPSVSDFHKILEHEINI
Subjt:  RAAPGGTGTTVLGLLLSKPPDLPNLQSTLHSLQNLHPILLSKIHYDPYRRDFSFLNPPSPPLHLQILDLTATARAIASHPDADDPSVSDFHKILEHEINI

Query:  TTWLDPNHPSYSDTDVMFASVYTISDGQWGVFLRLHTAVCDRTAATALLRELLAAASGENEGGEFEIGDHGEIGLGIEDLIPNGKANKPLWARGLDMLGY
        TTWLDPNHPSYSDTDVMFASVYTI+DGQW VFLRLHTAVCDRTAATALLRELLA ASGENEGGEFEIGDHGEIGLGIEDLIPNGKANKPLWARGLDMLGY
Subjt:  TTWLDPNHPSYSDTDVMFASVYTISDGQWGVFLRLHTAVCDRTAATALLRELLAAASGENEGGEFEIGDHGEIGLGIEDLIPNGKANKPLWARGLDMLGY

Query:  SLNSFRLANLEFKDANSPRFSQMIRLKMNSDTTEKLLAGCKLRGIKVCGALAAAGLIATRCSKDLPPYHKEKYGVVTLNDCRSLLNPPLTTHHLGFYHSA
        SLNSFRLANLEFKDANSPRFSQMIRLKMNSDTTEKLLAGCKLRGIK+CGALAAAGLIATRCSKDLPPYHKEKYGVVTLNDCRSLLNPPLTTHHLGFYHSA
Subjt:  SLNSFRLANLEFKDANSPRFSQMIRLKMNSDTTEKLLAGCKLRGIKVCGALAAAGLIATRCSKDLPPYHKEKYGVVTLNDCRSLLNPPLTTHHLGFYHSA

Query:  ILNTHDLSAEDTLWDVAKRCYFAFSNAKDNNKHFSDMSDLNFLMCKAIENPGLTPSSSMRTALISVFEDPIFEAFSPAQEHLGLHDYIGCASAHGVGPSI
        ILNTHD+SAEDTLWDVAKRCYFAFSNAKDNNKHFSDMSDLNFLMCKAIENPGLTPSSSMRTALISVFEDPIFEAFSPAQE+LGLHDYIGCASAHGVGPSI
Subjt:  ILNTHDLSAEDTLWDVAKRCYFAFSNAKDNNKHFSDMSDLNFLMCKAIENPGLTPSSSMRTALISVFEDPIFEAFSPAQEHLGLHDYIGCASAHGVGPSI

Query:  AFFDMIRDGQLDCACVYPFPLFSRDQMNQIVYQMKKILVGAIEVVEG
        AFFDMIRDGQLDCACVYPFPLFSRDQMNQIV +MKKILVGAI+VVEG
Subjt:  AFFDMIRDGQLDCACVYPFPLFSRDQMNQIVYQMKKILVGAIEVVEG

XP_023005679.1 uncharacterized protein LOC111498610 [Cucurbita maxima]1.7e-285100Show/hide
Query:  MAAVLCTHTDAASEMSDPPAGESKSRPVGGTEYSWCRAAPGGTGTTVLGLLLSKPPDLPNLQSTLHSLQNLHPILLSKIHYDPYRRDFSFLNPPSPPLHL
        MAAVLCTHTDAASEMSDPPAGESKSRPVGGTEYSWCRAAPGGTGTTVLGLLLSKPPDLPNLQSTLHSLQNLHPILLSKIHYDPYRRDFSFLNPPSPPLHL
Subjt:  MAAVLCTHTDAASEMSDPPAGESKSRPVGGTEYSWCRAAPGGTGTTVLGLLLSKPPDLPNLQSTLHSLQNLHPILLSKIHYDPYRRDFSFLNPPSPPLHL

Query:  QILDLTATARAIASHPDADDPSVSDFHKILEHEINITTWLDPNHPSYSDTDVMFASVYTISDGQWGVFLRLHTAVCDRTAATALLRELLAAASGENEGGE
        QILDLTATARAIASHPDADDPSVSDFHKILEHEINITTWLDPNHPSYSDTDVMFASVYTISDGQWGVFLRLHTAVCDRTAATALLRELLAAASGENEGGE
Subjt:  QILDLTATARAIASHPDADDPSVSDFHKILEHEINITTWLDPNHPSYSDTDVMFASVYTISDGQWGVFLRLHTAVCDRTAATALLRELLAAASGENEGGE

Query:  FEIGDHGEIGLGIEDLIPNGKANKPLWARGLDMLGYSLNSFRLANLEFKDANSPRFSQMIRLKMNSDTTEKLLAGCKLRGIKVCGALAAAGLIATRCSKD
        FEIGDHGEIGLGIEDLIPNGKANKPLWARGLDMLGYSLNSFRLANLEFKDANSPRFSQMIRLKMNSDTTEKLLAGCKLRGIKVCGALAAAGLIATRCSKD
Subjt:  FEIGDHGEIGLGIEDLIPNGKANKPLWARGLDMLGYSLNSFRLANLEFKDANSPRFSQMIRLKMNSDTTEKLLAGCKLRGIKVCGALAAAGLIATRCSKD

Query:  LPPYHKEKYGVVTLNDCRSLLNPPLTTHHLGFYHSAILNTHDLSAEDTLWDVAKRCYFAFSNAKDNNKHFSDMSDLNFLMCKAIENPGLTPSSSMRTALI
        LPPYHKEKYGVVTLNDCRSLLNPPLTTHHLGFYHSAILNTHDLSAEDTLWDVAKRCYFAFSNAKDNNKHFSDMSDLNFLMCKAIENPGLTPSSSMRTALI
Subjt:  LPPYHKEKYGVVTLNDCRSLLNPPLTTHHLGFYHSAILNTHDLSAEDTLWDVAKRCYFAFSNAKDNNKHFSDMSDLNFLMCKAIENPGLTPSSSMRTALI

Query:  SVFEDPIFEAFSPAQEHLGLHDYIGCASAHGVGPSIAFFDMIRDGQLDCACVYPFPLFSRDQMNQIVYQMKKILVGAIEVVEG
        SVFEDPIFEAFSPAQEHLGLHDYIGCASAHGVGPSIAFFDMIRDGQLDCACVYPFPLFSRDQMNQIVYQMKKILVGAIEVVEG
Subjt:  SVFEDPIFEAFSPAQEHLGLHDYIGCASAHGVGPSIAFFDMIRDGQLDCACVYPFPLFSRDQMNQIVYQMKKILVGAIEVVEG

XP_023540823.1 uncharacterized protein LOC111801084 [Cucurbita pepo subsp. pepo]2.1e-27596.89Show/hide
Query:  MAAVLCTHTDAASEMSDPPAGESKSRPVGGTEYSWCRAAPGGTGTTVLGLLLSKPPDLPNLQSTLHSLQNLHPILLSKIHYDPYRRDFSFLNPPSPPLHL
        MAAVLCTHTDAASEMSDPPAGESKSRPVGGTEYSWCRAAPGGTGTTVLGLLLSKPPDLPNLQSTLHSLQNLHPIL SKIHYDP RRDFSFL PPSP LHL
Subjt:  MAAVLCTHTDAASEMSDPPAGESKSRPVGGTEYSWCRAAPGGTGTTVLGLLLSKPPDLPNLQSTLHSLQNLHPILLSKIHYDPYRRDFSFLNPPSPPLHL

Query:  QILDLTATARAIASHPDADDPSVSDFHKILEHEINITTWLDPNHPSYSDTDVMFASVYTISDGQWGVFLRLHTAVCDRTAATALLRELLAAASGENEGGE
        QILDLTA ARAIASHPDADDPSVSDFHKILEHEINITTWLDPN+PSYSDTDVMFASVYTI+DGQW VFLRLHTAVCDRTAATALLRELLAAASGENEGGE
Subjt:  QILDLTATARAIASHPDADDPSVSDFHKILEHEINITTWLDPNHPSYSDTDVMFASVYTISDGQWGVFLRLHTAVCDRTAATALLRELLAAASGENEGGE

Query:  FEIGDHGEIGLGIEDLIPNGKANKPLWARGLDMLGYSLNSFRLANLEFKDANSPRFSQMIRLKMNSDTTEKLLAGCKLRGIKVCGALAAAGLIATRCSKD
        FEIGDHGEIGLGIEDLIPNGKANKPLWARGLDMLGYSLNSFRLANLEFKDANSPRFSQMIRLKMNSDTTEKLLAGCKLRGIKVCGALAAAGLIATRCSKD
Subjt:  FEIGDHGEIGLGIEDLIPNGKANKPLWARGLDMLGYSLNSFRLANLEFKDANSPRFSQMIRLKMNSDTTEKLLAGCKLRGIKVCGALAAAGLIATRCSKD

Query:  LPPYHKEKYGVVTLNDCRSLLNPPLTTHHLGFYHSAILNTHDLSAEDTLWDVAKRCYFAFSNAKDNNKHFSDMSDLNFLMCKAIENPGLTPSSSMRTALI
        LP YHKEKYGVVTLNDCRSLLNPPLTTHHLGFYHSAILNTHD+SAEDTLW+VAKRCYFAFSNAKDNNKHFSDMSDLNFLMCKAIENPGLTPSSSMRTALI
Subjt:  LPPYHKEKYGVVTLNDCRSLLNPPLTTHHLGFYHSAILNTHDLSAEDTLWDVAKRCYFAFSNAKDNNKHFSDMSDLNFLMCKAIENPGLTPSSSMRTALI

Query:  SVFEDPIFEAFSPAQEHLGLHDYIGCASAHGVGPSIAFFDMIRDGQLDCACVYPFPLFSRDQMNQIVYQMKKILVGAIEVVEG
        SVFEDPIFEAFSPAQE+LGLHDYIGCASAHGVGPSIAFFDMIRDGQLDCACVYPFPLFSRDQMNQIV +MKKILVGA+EVVEG
Subjt:  SVFEDPIFEAFSPAQEHLGLHDYIGCASAHGVGPSIAFFDMIRDGQLDCACVYPFPLFSRDQMNQIVYQMKKILVGAIEVVEG

TrEMBL top hitse value%identityAlignment
A0A4D8YPF4 Uncharacterized protein1.6e-24949.84Show/hide
Query:  EDSRSRRRVATGTETAWCRAVPGGTGTAVLALSSSTAPPNLQL-LQNALNKLQNAHPVLKSKLQFSPISSTVSFV-TSPTPSVQVNTFKAPETSKIINGQ
        E   S  R    TE  WCRAV  GTG  VLAL  +  PP +   L   L KLQ  HP+L +KL ++  S   SF+ T+  P   V    AP T+KI+   
Subjt:  EDSRSRRRVATGTETAWCRAVPGGTGTAVLALSSSTAPPNLQL-LQNALNKLQNAHPVLKSKLQFSPISSTVSFV-TSPTPSVQVNTFKAPETSKIINGQ

Query:  NTLLNNNHHHAISISPLQILLEHELNENTAWCNLHHSDAAADMFFVTLYEVGSSKWVAVFRLHVAACDRTTAVSLLKELLDLMNDGG---GRDKKEEMEL
                  A + SPLQ +LEHELN N+         +  ++  V +Y       V   R H + CDR TAVSLL+EL++++   G   G   + E ++
Subjt:  NTLLNNNHHHAISISPLQILLEHELNENTAWCNLHHSDAAADMFFVTLYEVGSSKWVAVFRLHVAACDRTTAVSLLKELLDLMNDGG---GRDKKEEMEL

Query:  GMENLVPRKLAKKPLLTRGLDMISYSMNSLRLTNLKFKDAKSPRRSQVARLQMNHNQTQKILYECKRRGIKLSSAMVAAGLVAAHSSGGHSIHRHQRKYG
        G+E+L+   +AKK L    LD + YS+NSLRLTNL F++ K PRRS+V RLQ+    T  IL  CK RGIK+   + AA ++AA+S+  H+     +KYG
Subjt:  GMENLVPRKLAKKPLLTRGLDMISYSMNSLRLTNLKFKDAKSPRRSQVARLQMNHNQTQKILYECKRRGIKLSSAMVAAGLVAAHSSGGHSIHRHQRKYG

Query:  VITLIDCRRFLEPPLSTHHFGFYHAAILNSYTVRGGEDLWELAGKISSTLEASKNSNKHFTDMSDLNFLLCRAIENPSLTSSGAMRTSLMTVFEDTVIDN
        V+TL DCR   +PPLS HH+GFYH+AILN + V+G E LW+LA          K SNKH +DM+DLNFL+ +AI+NP+LT+S ++RTSL+TVFED V+D+
Subjt:  VITLIDCRRFLEPPLSTHHFGFYHAAILNSYTVRGGEDLWELAGKISSTLEASKNSNKHFTDMSDLNFLLCRAIENPSLTSSGAMRTSLMTVFEDTVIDN

Query:  SGRMQEEIGVNDYMGCASIHGIGPSIAVFDTIRDGQLDCVCVYPAPLHSREQMEALVENMKRFLMAAVLCTHTDAASEMSDPPAGESKSRPVGGTEYSWC
        S  MQ  IG  D++GCAS+HG+GPSIA+FDT+R+G+LDC CVYPAPLHSREQM  LVE M +                     +G  KSR V  TE SWC
Subjt:  SGRMQEEIGVNDYMGCASIHGIGPSIAVFDTIRDGQLDCVCVYPAPLHSREQMEALVENMKRFLMAAVLCTHTDAASEMSDPPAGESKSRPVGGTEYSWC

Query:  RAAPGGTGTTVLGLLLSKPPDLPNLQSTLHSLQNLHPILLSKIHYDPYRRDFSFLNPPSPPLHLQILDLTATARAIASHPDADDPSVSDFHKILEHEINI
        RA PGGTG TVL LL SKPPDLP LQS L   Q+ HPIL SK+ +D     FS++   SP L ++  D  +TA+ + SH  +   S+     ILEHE+N 
Subjt:  RAAPGGTGTTVLGLLLSKPPDLPNLQSTLHSLQNLHPILLSKIHYDPYRRDFSFLNPPSPPLHLQILDLTATARAIASHPDADDPSVSDFHKILEHEINI

Query:  TTWLDPNHPSYSDTDVMFASVYTISDGQWGVFLRLHTAVCDRTAATALLRELLAAASGENEGGEFEIGDHG---EIGLGIEDLIPNGKANKPLWARGLDM
         +W +P+ PS SD D+  AS+Y +    W + LR+HT+VCDR AA AL+ EL+A    E EG   E  +     E+ LGIED IP G  +K  WARG+DM
Subjt:  TTWLDPNHPSYSDTDVMFASVYTISDGQWGVFLRLHTAVCDRTAATALLRELLAAASGENEGGEFEIGDHG---EIGLGIEDLIPNGKANKPLWARGLDM

Query:  LGYSLNSFRLANLEFKDANSPRFSQMIRLKMNSDTTEKLLAGCKLRGIKVCGALAAAGLIATRCSKDLPPYHKEKYGVVTLNDCRSLLNPPLTTHHLGFY
        LGYSLNSFRLANL F D  SPR S ++R++MN++ T ++L+ C    IK+  ALAAA LIA   SK  P    EKY  VTL DCRS L+P L++HH+GFY
Subjt:  LGYSLNSFRLANLEFKDANSPRFSQMIRLKMNSDTTEKLLAGCKLRGIKVCGALAAAGLIATRCSKDLPPYHKEKYGVVTLNDCRSLLNPPLTTHHLGFY

Query:  HSAILNTHDLSAEDTLWDVAKRCYFAFSNAKDNNKHFSDMSDLNFLMCKAIENPGLTPSSSMRTALISVFEDPIFEAFSPAQEHLGLHDYIGCASAHGVG
        HSAIL+THD+   + LW++AKR + +F N+K+ NKHF+DM+DLNFLMCKAIENPGLTPSSS+RT+L+SVFEDPIF+  +  +E LG  D+IGCAS HGVG
Subjt:  HSAILNTHDLSAEDTLWDVAKRCYFAFSNAKDNNKHFSDMSDLNFLMCKAIENPGLTPSSSMRTALISVFEDPIFEAFSPAQEHLGLHDYIGCASAHGVG

Query:  PSIAFFDMIRDGQLDCACVYPFPLFSRDQMNQIVYQMKKILVGAI
        PS+A FD IR+G++DCA VYPFPLFSR+QMN+ V  +K IL+  I
Subjt:  PSIAFFDMIRDGQLDCACVYPFPLFSRDQMNQIVYQMKKILVGAI

A0A6J1FZI6 uncharacterized protein LOC1114493151.3e-24694.84Show/hide
Query:  MEDSRSRRRVATGTETAWCRAVPGGTGTAVLALSSS----TAPPNLQLLQNALNKLQNAHPVLKSKLQFSPISSTVSFVTSPTPSVQVNTFKAPETSKII
        MEDSRSRRRVATGTETAWCRAVPGGTGTAVLALSSS    +APPNLQLLQNALNKLQNAHPVLKSKL +SPISSTVSFVTSPTPSVQV TFKAPETSKII
Subjt:  MEDSRSRRRVATGTETAWCRAVPGGTGTAVLALSSS----TAPPNLQLLQNALNKLQNAHPVLKSKLQFSPISSTVSFVTSPTPSVQVNTFKAPETSKII

Query:  NGQNTLLNNNHHHAISISPLQILLEHELNENTAWCNLHHSDAAADMFFVTLYEVGSSKWVAVFRLHVAACDRTTAVSLLKELLDLMNDGGGRDKKEEMEL
        N QNTLLNNNHHHAISISPLQILLEHELNENT W NLH SD AADMFFVTLYEVGSSKWVAVFRLHVAACDRTTAVSLL+ELL LMNDGG  DKK+EMEL
Subjt:  NGQNTLLNNNHHHAISISPLQILLEHELNENTAWCNLHHSDAAADMFFVTLYEVGSSKWVAVFRLHVAACDRTTAVSLLKELLDLMNDGGGRDKKEEMEL

Query:  GMENLVPRKLAKKPLLTRGLDMISYSMNSLRLTNLKFKDAKSPRRSQVARLQMNHNQTQKILYECKRRGIKLSSAMVAAGLVAAHSSGGHSIHRHQRKYG
        GMENLVPRKLAKKPLLTRGLDMISYSMNSLRLTNLKFKD KSPRRSQVARLQMNHNQTQKILYECKRRGIKLSSAMVAAGLVAAHSSGGHSIHRHQRKYG
Subjt:  GMENLVPRKLAKKPLLTRGLDMISYSMNSLRLTNLKFKDAKSPRRSQVARLQMNHNQTQKILYECKRRGIKLSSAMVAAGLVAAHSSGGHSIHRHQRKYG

Query:  VITLIDCRRFLEPPLSTHHFGFYHAAILNSYTVRGGEDLWELAGKISSTLEASKNSNKHFTDMSDLNFLLCRAIENPSLTSSGAMRTSLMTVFEDTVIDN
        VITLIDCRR LEPPLSTHHFGFYHAAILNSYTVRGGEDLWELAGKISSTLEASKN NKHFTDMSDLNFLLCRAIENPSLTSSGAMRTSLMTVFEDTVIDN
Subjt:  VITLIDCRRFLEPPLSTHHFGFYHAAILNSYTVRGGEDLWELAGKISSTLEASKNSNKHFTDMSDLNFLLCRAIENPSLTSSGAMRTSLMTVFEDTVIDN

Query:  SGRMQEEIGVNDYMGCASIHGIGPSIAVFDTIRDGQLDCVCVYPAPLHSREQMEALVENMKRFLM
        SGRMQEEIGVNDYMGCASIHGIGPSIAVFDT+RDGQLDCVCVYPAPLHSREQMEALVENMKR L+
Subjt:  SGRMQEEIGVNDYMGCASIHGIGPSIAVFDTIRDGQLDCVCVYPAPLHSREQMEALVENMKRFLM

A0A6J1G619 uncharacterized protein LOC1114512044.7e-27396.48Show/hide
Query:  MAAVLCTHTDAASEMSDPPAGESKSRPVGGTEYSWCRAAPGGTGTTVLGLLLSKPPDLPNLQSTLHSLQNLHPILLSKIHYDPYRRDFSFLNPPSPPLHL
        MAAVLCTHTDAASEMSDPPAGESKSRPVGGTEYSWCRAAPGGTGTTVLGLLLSKPPDL NLQSTLHSLQNLHPIL SKI YDP RRDFSFL PPSP LHL
Subjt:  MAAVLCTHTDAASEMSDPPAGESKSRPVGGTEYSWCRAAPGGTGTTVLGLLLSKPPDLPNLQSTLHSLQNLHPILLSKIHYDPYRRDFSFLNPPSPPLHL

Query:  QILDLTATARAIASHPDADDPSVSDFHKILEHEINITTWLDPNHPSYSDTDVMFASVYTISDGQWGVFLRLHTAVCDRTAATALLRELLAAASGENEGGE
        QILDLTA ARAIASHPDADDPSVSDFHKILEHEINITTWLDPNHPSYSDTDVMFASVYTI+DGQW VFLRLHTAVCDRTAATALLRELLAAASGENEGGE
Subjt:  QILDLTATARAIASHPDADDPSVSDFHKILEHEINITTWLDPNHPSYSDTDVMFASVYTISDGQWGVFLRLHTAVCDRTAATALLRELLAAASGENEGGE

Query:  FEIGDHGEIGLGIEDLIPNGKANKPLWARGLDMLGYSLNSFRLANLEFKDANSPRFSQMIRLKMNSDTTEKLLAGCKLRGIKVCGALAAAGLIATRCSKD
        FEI DHGEIGLGIEDLIPNGKANKPLWARGLDMLGYSLNSFRLANLEFKDANS RFSQMIRLKMNSDTTEKLLAGCKLRGIKVCGALAAAGLIATRCSKD
Subjt:  FEIGDHGEIGLGIEDLIPNGKANKPLWARGLDMLGYSLNSFRLANLEFKDANSPRFSQMIRLKMNSDTTEKLLAGCKLRGIKVCGALAAAGLIATRCSKD

Query:  LPPYHKEKYGVVTLNDCRSLLNPPLTTHHLGFYHSAILNTHDLSAEDTLWDVAKRCYFAFSNAKDNNKHFSDMSDLNFLMCKAIENPGLTPSSSMRTALI
        LPPYHKEKYGVVTLNDCRSLLNPPLTTHHLGFYHSAILNTHD+SAEDTLWDVAKRCYFAFSNAKDNNKHFSDMSDLNFLMCKAIENPGLTPSSSMRTALI
Subjt:  LPPYHKEKYGVVTLNDCRSLLNPPLTTHHLGFYHSAILNTHDLSAEDTLWDVAKRCYFAFSNAKDNNKHFSDMSDLNFLMCKAIENPGLTPSSSMRTALI

Query:  SVFEDPIFEAFSPAQEHLGLHDYIGCASAHGVGPSIAFFDMIRDGQLDCACVYPFPLFSRDQMNQIVYQMKKILVGAIEVVEG
        SVFEDPIFEAFSPAQE+LGLHDYIGCASAHGVGPSIAFFDMIRDGQLDCACVYPFPLFSR+QMNQIV +MKKILVGAI+VVEG
Subjt:  SVFEDPIFEAFSPAQEHLGLHDYIGCASAHGVGPSIAFFDMIRDGQLDCACVYPFPLFSRDQMNQIVYQMKKILVGAIEVVEG

A0A6J1KVW7 uncharacterized protein LOC1114986644.6e-26099.57Show/hide
Query:  MEDSRSRRRVATGTETAWCRAVPGGTGTAVLALSSSTAPPNLQLLQNALNKLQNAHPVLKSKLQFSPISSTVSFVTSPTPSVQVNTFKAPETSKIINGQN
        MEDSRSRRRVATGTETAWCRAVPGGTGTAVLALSSSTAPPNLQLLQNALNKLQNAHPVLKSKLQFSPISSTVSFVTSPTPSVQVNTFKAPETSKIINGQN
Subjt:  MEDSRSRRRVATGTETAWCRAVPGGTGTAVLALSSSTAPPNLQLLQNALNKLQNAHPVLKSKLQFSPISSTVSFVTSPTPSVQVNTFKAPETSKIINGQN

Query:  TLLNNNHHHAISISPLQILLEHELNENTAWCNLHHSDAAADMFFVTLYEVGSSKWVAVFRLHVAACDRTTAVSLLKELLDLMNDGGGRDKKEEMELGMEN
        TLLNNNHHHAISISPLQILLEHELNENTAWCNLHHSDAAADMFFVTLYEVGSSKWVAVFRLHVAACDRTTAVSLLKELLDLMNDGGGRDKKEEMELGMEN
Subjt:  TLLNNNHHHAISISPLQILLEHELNENTAWCNLHHSDAAADMFFVTLYEVGSSKWVAVFRLHVAACDRTTAVSLLKELLDLMNDGGGRDKKEEMELGMEN

Query:  LVPRKLAKKPLLTRGLDMISYSMNSLRLTNLKFKDAKSPRRSQVARLQMNHNQTQKILYECKRRGIKLSSAMVAAGLVAAHSSGGHSIHRHQRKYGVITL
        LVPRKLAKKPLLTRGLDMISYSMNSLRLTNLKFKDAKSPRRSQVARLQMNHNQTQKILYECKRRGIKLSSAMVAAGLVAAHSSGGHSIHRHQRKYGVITL
Subjt:  LVPRKLAKKPLLTRGLDMISYSMNSLRLTNLKFKDAKSPRRSQVARLQMNHNQTQKILYECKRRGIKLSSAMVAAGLVAAHSSGGHSIHRHQRKYGVITL

Query:  IDCRRFLEPPLSTHHFGFYHAAILNSYTVRGGEDLWELAGKISSTLEASKNSNKHFTDMSDLNFLLCRAIENPSLTSSGAMRTSLMTVFEDTVIDNSGRM
        IDCRRFLEPPLSTHHFGFYHAAILNSYTVRGGEDLWELAGKISSTLEASKNSNKHFTDMSDLNFLLCRAIENPSLTSSGAMRTSLMTVFEDTVIDNSGRM
Subjt:  IDCRRFLEPPLSTHHFGFYHAAILNSYTVRGGEDLWELAGKISSTLEASKNSNKHFTDMSDLNFLLCRAIENPSLTSSGAMRTSLMTVFEDTVIDNSGRM

Query:  QEEIGVNDYMGCASIHGIGPSIAVFDTIRDGQLDCVCVYPAPLHSREQMEALVENMKRFLM
        QEEIGVNDYMGCASIHGIGPSIAVFDTIRDGQLDCVCVYPAPLHSREQMEALVENMKR L+
Subjt:  QEEIGVNDYMGCASIHGIGPSIAVFDTIRDGQLDCVCVYPAPLHSREQMEALVENMKRFLM

A0A6J1KZY5 uncharacterized protein LOC1114986108.3e-286100Show/hide
Query:  MAAVLCTHTDAASEMSDPPAGESKSRPVGGTEYSWCRAAPGGTGTTVLGLLLSKPPDLPNLQSTLHSLQNLHPILLSKIHYDPYRRDFSFLNPPSPPLHL
        MAAVLCTHTDAASEMSDPPAGESKSRPVGGTEYSWCRAAPGGTGTTVLGLLLSKPPDLPNLQSTLHSLQNLHPILLSKIHYDPYRRDFSFLNPPSPPLHL
Subjt:  MAAVLCTHTDAASEMSDPPAGESKSRPVGGTEYSWCRAAPGGTGTTVLGLLLSKPPDLPNLQSTLHSLQNLHPILLSKIHYDPYRRDFSFLNPPSPPLHL

Query:  QILDLTATARAIASHPDADDPSVSDFHKILEHEINITTWLDPNHPSYSDTDVMFASVYTISDGQWGVFLRLHTAVCDRTAATALLRELLAAASGENEGGE
        QILDLTATARAIASHPDADDPSVSDFHKILEHEINITTWLDPNHPSYSDTDVMFASVYTISDGQWGVFLRLHTAVCDRTAATALLRELLAAASGENEGGE
Subjt:  QILDLTATARAIASHPDADDPSVSDFHKILEHEINITTWLDPNHPSYSDTDVMFASVYTISDGQWGVFLRLHTAVCDRTAATALLRELLAAASGENEGGE

Query:  FEIGDHGEIGLGIEDLIPNGKANKPLWARGLDMLGYSLNSFRLANLEFKDANSPRFSQMIRLKMNSDTTEKLLAGCKLRGIKVCGALAAAGLIATRCSKD
        FEIGDHGEIGLGIEDLIPNGKANKPLWARGLDMLGYSLNSFRLANLEFKDANSPRFSQMIRLKMNSDTTEKLLAGCKLRGIKVCGALAAAGLIATRCSKD
Subjt:  FEIGDHGEIGLGIEDLIPNGKANKPLWARGLDMLGYSLNSFRLANLEFKDANSPRFSQMIRLKMNSDTTEKLLAGCKLRGIKVCGALAAAGLIATRCSKD

Query:  LPPYHKEKYGVVTLNDCRSLLNPPLTTHHLGFYHSAILNTHDLSAEDTLWDVAKRCYFAFSNAKDNNKHFSDMSDLNFLMCKAIENPGLTPSSSMRTALI
        LPPYHKEKYGVVTLNDCRSLLNPPLTTHHLGFYHSAILNTHDLSAEDTLWDVAKRCYFAFSNAKDNNKHFSDMSDLNFLMCKAIENPGLTPSSSMRTALI
Subjt:  LPPYHKEKYGVVTLNDCRSLLNPPLTTHHLGFYHSAILNTHDLSAEDTLWDVAKRCYFAFSNAKDNNKHFSDMSDLNFLMCKAIENPGLTPSSSMRTALI

Query:  SVFEDPIFEAFSPAQEHLGLHDYIGCASAHGVGPSIAFFDMIRDGQLDCACVYPFPLFSRDQMNQIVYQMKKILVGAIEVVEG
        SVFEDPIFEAFSPAQEHLGLHDYIGCASAHGVGPSIAFFDMIRDGQLDCACVYPFPLFSRDQMNQIVYQMKKILVGAIEVVEG
Subjt:  SVFEDPIFEAFSPAQEHLGLHDYIGCASAHGVGPSIAFFDMIRDGQLDCACVYPFPLFSRDQMNQIVYQMKKILVGAIEVVEG

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G52610.1 unknown protein2.2e-13754.35Show/hide
Query:  MSDP-PAGESKSRPVGGTEYSWCRAAPGGTGTTVLGLLLSKPPDLPNLQSTLHSLQNLHPILLSKIHYDPYRRDFSFLNPPSPPLHLQI--LDLTATARA
        MS+P    +S +RPVGGTEYSWCRA  GGTG  V+ LLLS+ P L NLQ+TL  LQ  HP L S I +D     FSF+   +   H++I   D  +TA+ 
Subjt:  MSDP-PAGESKSRPVGGTEYSWCRAAPGGTGTTVLGLLLSKPPDLPNLQSTLHSLQNLHPILLSKIHYDPYRRDFSFLNPPSPPLHLQI--LDLTATARA

Query:  IASHPDADDPSVSDFHKILEHEINITTWLDPNHPSYSDTDVMFASVYTISDG--QWGVFLRLHTAVCDRTAATALLRELLAAASGENEG-GEFEIGDHGE
        I    D+DDP       ILEHE+N  TW++P+    S++ V   S+Y ++D   Q  +  RL+TA  DRTAA  LLRE +   + +  G G         
Subjt:  IASHPDADDPSVSDFHKILEHEINITTWLDPNHPSYSDTDVMFASVYTISDG--QWGVFLRLHTAVCDRTAATALLRELLAAASGENEG-GEFEIGDHGE

Query:  IGLG--IEDLIPNGKANKPLWARGLDMLGYSLNSFRLANLEFKDA-NSPRFSQMIRLKMNSDTTEKLLAGCKLRGIKVCGALAAAGLIATRCSKDLPPYH
        +GLG  IE+LIP+GK +KP WARG+D+LGYSLN+FR +NL F DA NS R SQ++RLK++ D T KL+AGCK RG+K+  ALA++ LIA   SK+LPPY 
Subjt:  IGLG--IEDLIPNGKANKPLWARGLDMLGYSLNSFRLANLEFKDA-NSPRFSQMIRLKMNSDTTEKLLAGCKLRGIKVCGALAAAGLIATRCSKDLPPYH

Query:  KEKYGVVTLNDCRSLLNPPLTTHHLGFYHSAILNTHDLSAEDTLWDVAKRCYFAFSNAKDNNKHFSDMSDLNFLMCKAIENPGLTPSSSMRTALISVFED
         EKY VVTL+DCRS+L PPLT++  GFYH+ IL+THDL+ E+ LWD+AKRCY +F+++K++NK F+DMSDLNFLMCKAIENP LTPSSS+RTA IS+FED
Subjt:  KEKYGVVTLNDCRSLLNPPLTTHHLGFYHSAILNTHDLSAEDTLWDVAKRCYFAFSNAKDNNKHFSDMSDLNFLMCKAIENPGLTPSSSMRTALISVFED

Query:  PIF-EAFSPAQEHLGLHDYIGCASAHGVGPSIAFFDMIRDGQLDCACVYPFPLFSRDQMNQIVYQMKKILV
        P+  E+  P    LG+ DYIGCAS HGVGPS+A FD +RDG+LDCA VYP PL SR+QM+ ++  MK IL+
Subjt:  PIF-EAFSPAQEHLGLHDYIGCASAHGVGPSIAFFDMIRDGQLDCACVYPFPLFSRDQMNQIVYQMKKILV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAAGATTCCAGAAGCCGCCGACGAGTTGCCACCGGCACCGAAACCGCCTGGTGCCGCGCTGTCCCAGGCGGCACCGGCACAGCGGTGCTAGCTCTATCTTCC
TCCACCGCTCCTCCAAATCTCCAACTTCTCCAAAATGCCCTCAACAAACTTCAGAATGCCCATCCTGTCCTCAAATCTAAACTCCAATTCAGCCCAATTTCGTCC
ACTGTTTCCTTCGTCACTTCTCCAACTCCTTCCGTCCAGGTCAATACGTTTAAAGCTCCAGAAACTTCCAAAATTATAAATGGCCAAAACACCCTTCTCAACAAT
AATCACCATCACGCCATTTCCATTTCTCCTCTCCAGATTCTCCTCGAACACGAACTCAACGAGAACACCGCCTGGTGTAATCTTCACCACTCCGACGCGGCGGCG
GACATGTTCTTCGTTACATTGTACGAGGTAGGCTCCAGCAAATGGGTCGCTGTGTTCCGACTACATGTTGCGGCGTGTGACCGGACCACGGCAGTGTCGCTTCTG
AAGGAGCTACTCGATCTAATGAACGACGGAGGCGGCAGAGATAAAAAAGAGGAAATGGAGTTGGGAATGGAGAACCTTGTTCCTAGAAAGTTGGCGAAGAAGCCA
TTGTTGACTCGAGGATTGGATATGATCAGCTACTCTATGAACTCGTTGAGATTAACGAATCTTAAATTTAAAGACGCTAAATCTCCCAGACGATCCCAGGTGGCG
AGGCTTCAGATGAACCACAACCAAACCCAGAAGATTCTCTATGAGTGCAAGAGGAGAGGTATAAAATTGAGTTCGGCAATGGTGGCGGCGGGGTTGGTGGCGGCT
CACAGCTCCGGTGGGCACAGCATCCACCGCCATCAACGGAAGTACGGAGTGATAACGCTTATAGATTGCCGGCGGTTTCTGGAGCCACCACTGTCAACCCACCAT
TTCGGGTTTTACCATGCTGCCATTCTGAACTCCTACACAGTAAGAGGAGGAGAAGACCTGTGGGAGCTTGCAGGGAAAATCTCATCGACATTGGAGGCTTCCAAG
AACTCAAACAAGCACTTCACCGACATGTCGGACCTGAACTTTTTGTTATGTCGAGCCATCGAGAATCCAAGTCTCACTTCGTCGGGGGCGATGAGGACGTCGTTG
ATGACGGTGTTTGAGGACACGGTGATCGACAACTCGGGTAGAATGCAGGAGGAGATCGGTGTTAATGACTACATGGGATGCGCCTCCATCCATGGAATCGGCCCC
TCCATCGCCGTGTTCGACACAATCCGAGATGGGCAGCTGGACTGTGTGTGTGTTTATCCAGCTCCATTGCACTCTAGGGAGCAAATGGAAGCTTTGGTTGAGAAC
ATGAAAAGATTTCTCATGGCGGCAGTGCTCTGTACTCACACTGATGCCGCCTCCGAAATGTCGGATCCTCCTGCCGGTGAGTCCAAGTCCCGCCCCGTCGGCGGC
ACCGAGTACAGCTGGTGCCGCGCCGCCCCCGGCGGCACTGGCACCACTGTCCTCGGCCTCCTCCTCTCAAAACCTCCCGATCTTCCCAATCTCCAGTCCACTCTC
CACTCTCTCCAAAACCTCCACCCGATTCTCCTCTCCAAAATCCACTACGATCCTTACCGACGAGACTTCTCTTTCCTCAATCCTCCTTCTCCGCCACTCCACCTC
CAGATCCTCGACCTCACAGCCACCGCACGCGCCATCGCTTCTCATCCCGATGCCGACGATCCTTCCGTCTCCGATTTCCACAAGATCCTCGAGCACGAGATCAAC
ATCACCACGTGGCTCGATCCGAACCATCCATCGTACTCTGACACCGACGTGATGTTCGCCTCTGTTTACACCATCAGCGACGGCCAATGGGGGGTTTTCCTCCGC
CTCCACACGGCAGTCTGTGACCGGACCGCCGCCACAGCGCTGTTGCGAGAGCTGCTTGCGGCGGCGAGCGGAGAAAACGAGGGCGGAGAATTCGAAATTGGGGAT
CATGGAGAGATTGGATTAGGGATTGAGGATTTAATCCCCAACGGAAAAGCGAATAAGCCTCTTTGGGCGCGTGGATTAGACATGCTTGGTTACTCGTTGAATTCG
TTTCGATTGGCGAATTTGGAGTTCAAAGATGCGAATTCTCCGAGATTTTCTCAGATGATTAGGTTGAAGATGAACTCCGATACCACCGAGAAACTTCTTGCTGGA
TGCAAATTGAGAGGCATTAAGGTGTGTGGAGCTCTGGCAGCTGCTGGATTGATTGCCACTCGTTGTTCTAAAGACCTTCCCCCTTACCACAAGGAGAAATATGGG
GTTGTTACCTTAAACGACTGTCGTTCTCTCCTTAATCCTCCCCTCACAACCCACCATCTAGGATTCTATCACTCGGCCATTTTAAACACGCATGATCTATCCGCT
GAAGACACTCTGTGGGATGTCGCAAAACGATGCTATTTTGCCTTCTCAAACGCTAAAGACAACAACAAGCATTTCTCAGACATGTCCGACTTGAACTTCCTCATG
TGTAAAGCCATTGAAAATCCTGGCCTCACTCCATCCTCGTCCATGAGAACGGCTCTGATCTCGGTGTTTGAGGATCCCATCTTTGAAGCTTTTAGTCCTGCACAG
GAACACCTCGGCTTACATGACTATATTGGTTGTGCCTCTGCGCACGGCGTTGGGCCATCGATCGCCTTCTTCGACATGATTCGCGATGGTCAGTTGGATTGTGCA
TGTGTATACCCGTTTCCTTTGTTCTCTCGAGATCAAATGAACCAAATTGTTTATCAGATGAAGAAAATTTTGGTGGGTGCTATAGAAGTAGTGGAAGGATAA
mRNA sequenceShow/hide mRNA sequence
CAAGCACAATCCCAAGGCCGCATCTCCTCAAATTTTCCGGCTGATTTATCCACATGGAAGATTCCAGAAGCCGCCGACGAGTTGCCACCGGCACCGAAACCGCCT
GGTGCCGCGCTGTCCCAGGCGGCACCGGCACAGCGGTGCTAGCTCTATCTTCCTCCACCGCTCCTCCAAATCTCCAACTTCTCCAAAATGCCCTCAACAAACTTC
AGAATGCCCATCCTGTCCTCAAATCTAAACTCCAATTCAGCCCAATTTCGTCCACTGTTTCCTTCGTCACTTCTCCAACTCCTTCCGTCCAGGTCAATACGTTTA
AAGCTCCAGAAACTTCCAAAATTATAAATGGCCAAAACACCCTTCTCAACAATAATCACCATCACGCCATTTCCATTTCTCCTCTCCAGATTCTCCTCGAACACG
AACTCAACGAGAACACCGCCTGGTGTAATCTTCACCACTCCGACGCGGCGGCGGACATGTTCTTCGTTACATTGTACGAGGTAGGCTCCAGCAAATGGGTCGCTG
TGTTCCGACTACATGTTGCGGCGTGTGACCGGACCACGGCAGTGTCGCTTCTGAAGGAGCTACTCGATCTAATGAACGACGGAGGCGGCAGAGATAAAAAAGAGG
AAATGGAGTTGGGAATGGAGAACCTTGTTCCTAGAAAGTTGGCGAAGAAGCCATTGTTGACTCGAGGATTGGATATGATCAGCTACTCTATGAACTCGTTGAGAT
TAACGAATCTTAAATTTAAAGACGCTAAATCTCCCAGACGATCCCAGGTGGCGAGGCTTCAGATGAACCACAACCAAACCCAGAAGATTCTCTATGAGTGCAAGA
GGAGAGGTATAAAATTGAGTTCGGCAATGGTGGCGGCGGGGTTGGTGGCGGCTCACAGCTCCGGTGGGCACAGCATCCACCGCCATCAACGGAAGTACGGAGTGA
TAACGCTTATAGATTGCCGGCGGTTTCTGGAGCCACCACTGTCAACCCACCATTTCGGGTTTTACCATGCTGCCATTCTGAACTCCTACACAGTAAGAGGAGGAG
AAGACCTGTGGGAGCTTGCAGGGAAAATCTCATCGACATTGGAGGCTTCCAAGAACTCAAACAAGCACTTCACCGACATGTCGGACCTGAACTTTTTGTTATGTC
GAGCCATCGAGAATCCAAGTCTCACTTCGTCGGGGGCGATGAGGACGTCGTTGATGACGGTGTTTGAGGACACGGTGATCGACAACTCGGGTAGAATGCAGGAGG
AGATCGGTGTTAATGACTACATGGGATGCGCCTCCATCCATGGAATCGGCCCCTCCATCGCCGTGTTCGACACAATCCGAGATGGGCAGCTGGACTGTGTGTGTG
TTTATCCAGCTCCATTGCACTCTAGGGAGCAAATGGAAGCTTTGGTTGAGAACATGAAAAGATTTCTCATGGCGGCAGTGCTCTGTACTCACACTGATGCCGCCT
CCGAAATGTCGGATCCTCCTGCCGGTGAGTCCAAGTCCCGCCCCGTCGGCGGCACCGAGTACAGCTGGTGCCGCGCCGCCCCCGGCGGCACTGGCACCACTGTCC
TCGGCCTCCTCCTCTCAAAACCTCCCGATCTTCCCAATCTCCAGTCCACTCTCCACTCTCTCCAAAACCTCCACCCGATTCTCCTCTCCAAAATCCACTACGATC
CTTACCGACGAGACTTCTCTTTCCTCAATCCTCCTTCTCCGCCACTCCACCTCCAGATCCTCGACCTCACAGCCACCGCACGCGCCATCGCTTCTCATCCCGATG
CCGACGATCCTTCCGTCTCCGATTTCCACAAGATCCTCGAGCACGAGATCAACATCACCACGTGGCTCGATCCGAACCATCCATCGTACTCTGACACCGACGTGA
TGTTCGCCTCTGTTTACACCATCAGCGACGGCCAATGGGGGGTTTTCCTCCGCCTCCACACGGCAGTCTGTGACCGGACCGCCGCCACAGCGCTGTTGCGAGAGC
TGCTTGCGGCGGCGAGCGGAGAAAACGAGGGCGGAGAATTCGAAATTGGGGATCATGGAGAGATTGGATTAGGGATTGAGGATTTAATCCCCAACGGAAAAGCGA
ATAAGCCTCTTTGGGCGCGTGGATTAGACATGCTTGGTTACTCGTTGAATTCGTTTCGATTGGCGAATTTGGAGTTCAAAGATGCGAATTCTCCGAGATTTTCTC
AGATGATTAGGTTGAAGATGAACTCCGATACCACCGAGAAACTTCTTGCTGGATGCAAATTGAGAGGCATTAAGGTGTGTGGAGCTCTGGCAGCTGCTGGATTGA
TTGCCACTCGTTGTTCTAAAGACCTTCCCCCTTACCACAAGGAGAAATATGGGGTTGTTACCTTAAACGACTGTCGTTCTCTCCTTAATCCTCCCCTCACAACCC
ACCATCTAGGATTCTATCACTCGGCCATTTTAAACACGCATGATCTATCCGCTGAAGACACTCTGTGGGATGTCGCAAAACGATGCTATTTTGCCTTCTCAAACG
CTAAAGACAACAACAAGCATTTCTCAGACATGTCCGACTTGAACTTCCTCATGTGTAAAGCCATTGAAAATCCTGGCCTCACTCCATCCTCGTCCATGAGAACGG
CTCTGATCTCGGTGTTTGAGGATCCCATCTTTGAAGCTTTTAGTCCTGCACAGGAACACCTCGGCTTACATGACTATATTGGTTGTGCCTCTGCGCACGGCGTTG
GGCCATCGATCGCCTTCTTCGACATGATTCGCGATGGTCAGTTGGATTGTGCATGTGTATACCCGTTTCCTTTGTTCTCTCGAGATCAAATGAACCAAATTGTTT
ATCAGATGAAGAAAATTTTGGTGGGTGCTATAGAAGTAGTGGAAGGATAAAAATTTAGCTTAGGTTTTGCAAATGTTGTTTGTGTCCCTTTTTTGTGTAATCTAA
TGGTTCACAATGGACTCATCTCGTTCGTAATTAGAAAATAAAATTTGAATTTTTTTGAGTAATTAAGAAGTGACAACAGTGACGATATGCAAAGACGTAACTAAT
TGAATGCTAGCACGTTACAATGTTGCCCGTAAA
Protein sequenceShow/hide protein sequence
MEDSRSRRRVATGTETAWCRAVPGGTGTAVLALSSSTAPPNLQLLQNALNKLQNAHPVLKSKLQFSPISSTVSFVTSPTPSVQVNTFKAPETSKIINGQNTLLNN
NHHHAISISPLQILLEHELNENTAWCNLHHSDAAADMFFVTLYEVGSSKWVAVFRLHVAACDRTTAVSLLKELLDLMNDGGGRDKKEEMELGMENLVPRKLAKKP
LLTRGLDMISYSMNSLRLTNLKFKDAKSPRRSQVARLQMNHNQTQKILYECKRRGIKLSSAMVAAGLVAAHSSGGHSIHRHQRKYGVITLIDCRRFLEPPLSTHH
FGFYHAAILNSYTVRGGEDLWELAGKISSTLEASKNSNKHFTDMSDLNFLLCRAIENPSLTSSGAMRTSLMTVFEDTVIDNSGRMQEEIGVNDYMGCASIHGIGP
SIAVFDTIRDGQLDCVCVYPAPLHSREQMEALVENMKRFLMAAVLCTHTDAASEMSDPPAGESKSRPVGGTEYSWCRAAPGGTGTTVLGLLLSKPPDLPNLQSTL
HSLQNLHPILLSKIHYDPYRRDFSFLNPPSPPLHLQILDLTATARAIASHPDADDPSVSDFHKILEHEINITTWLDPNHPSYSDTDVMFASVYTISDGQWGVFLR
LHTAVCDRTAATALLRELLAAASGENEGGEFEIGDHGEIGLGIEDLIPNGKANKPLWARGLDMLGYSLNSFRLANLEFKDANSPRFSQMIRLKMNSDTTEKLLAG
CKLRGIKVCGALAAAGLIATRCSKDLPPYHKEKYGVVTLNDCRSLLNPPLTTHHLGFYHSAILNTHDLSAEDTLWDVAKRCYFAFSNAKDNNKHFSDMSDLNFLM
CKAIENPGLTPSSSMRTALISVFEDPIFEAFSPAQEHLGLHDYIGCASAHGVGPSIAFFDMIRDGQLDCACVYPFPLFSRDQMNQIVYQMKKILVGAIEVVEG