; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc04g03530 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc04g03530
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionGOLD domain-containing protein
Genome locationchr4:2191282..2200055
RNA-Seq ExpressionMoc04g03530
SyntenyMoc04g03530
Gene Ontology termsGO:0009987 - cellular process (biological process)
GO:0005737 - cytoplasm (cellular component)
GO:0016021 - integral component of membrane (cellular component)
GO:0043231 - intracellular membrane-bounded organelle (cellular component)
GO:0003682 - chromatin binding (molecular function)
InterPro domainsIPR009038 - GOLD domain
IPR015720 - Transmembrane emp24 domain-containing protein
IPR032001 - SAWADEE domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA8524882.1 hypothetical protein F0562_011305 [Nyssa sinensis]3.7e-17850.6Show/hide
Query:  DAISCSIEFRCLDDDAWYNVAVKLEGDALRIMYCGFAVEHDNVFSANDFRSLSDLSDFEARFRPLSRQLQDCECRKVDPGMAVCASYSSRI----DDVRF
        D  +  +EFRC +DDAWY+V V L  D L + +  F    D ++ A DF++L  +  F  +FRP+S QLQD +C +V  G  VCAS+   +    DD+RF
Subjt:  DAISCSIEFRCLDDDAWYNVAVKLEGDALRIMYCGFAVEHDNVFSANDFRSLSDLSDFEARFRPLSRQLQDCECRKVDPGMAVCASYSSRI----DDVRF

Query:  YDALVEGVDYLEHSYANGEEECLCNFILFWQHGPKHGNLTIASIADMCQIQAGKINDTVLPTFFKRVREKIQA-------------RMNKGDSCQKDDCS
        YDA+VE V +  HS++  EEECLC+F+L WQHGPK G LT  + AD+C IQ+    D  +  F K  REKI+                +KG +C +++  
Subjt:  YDALVEGVDYLEHSYANGEEECLCNFILFWQHGPKHGNLTIASIADMCQIQAGKINDTVLPTFFKRVREKIQA-------------RMNKGDSCQKDDCS

Query:  PNLKPKLSFFQRMDQETRRAKRSCGTVEPCKDLQILSPRKGEAIEQDTDFGG----------MKHQYMILLENLDKGLSPVKLAKFLHGQTSILPRVYIF
         ++K  LS  Q   QE   + R  G+  P +  +    +  E I+QD DFGG          +   + IL+ENL+K LS   + +F+H QTSI P+ YI 
Subjt:  PNLKPKLSFFQRMDQETRRAKRSCGTVEPCKDLQILSPRKGEAIEQDTDFGG----------MKHQYMILLENLDKGLSPVKLAKFLHGQTSILPRVYIF

Query:  PTLSDESYARGAVVLNCGKNLDRLCGFLDNPHHVIVSSQGRPLVVTGKVARLDTFGTLAGAMVLDIDNKLCNENDGGVGVSVCSELKVVSVGTIDYSIAK
        P LS ESY RGA+VL+C K L ++  FL+NP H+I+SS+GRP  +  KV R  TF     +++ +   K  N+N  G       ELKVV +GT +Y  AK
Subjt:  PTLSDESYARGAVVLNCGKNLDRLCGFLDNPHHVIVSSQGRPLVVTGKVARLDTFGTLAGAMVLDIDNKLCNENDGGVGVSVCSELKVVSVGTIDYSIAK

Query:  QRKELFREFLSHQRRLHQRLALEESKIYCSAPLQYKLIPLPQLAASVCSSPDSFDFGEMERWAMMMAFAGVFLLSMRRSEGIRFVIDREECFSHNVQYDG
        + ++LF +FL HQ+RLH+         +CSA     ++ LP+          +   G +    ++M      LL +R + GIRFVID+EECFSHNVQY+G
Subjt:  QRKELFREFLSHQRRLHQRLALEESKIYCSAPLQYKLIPLPQLAASVCSSPDSFDFGEMERWAMMMAFAGVFLLSMRRSEGIRFVIDREECFSHNVQYDG

Query:  DTVHVSFVVIKADSPWHYGDDGVDLVIKGPSGEHLHDFRDKTSEKHEFVAHNKGIHRFCFTNKSPYHETIDFDVHVGHFSYHEQHAKDEHFNPLLDQISK
        DT+HVSFVVIKAD+PWH+GDDGVDLVIKGPSGE +HDFRDK+SEK++FV H KG++ FCFTNKSPYHETIDFDVH  HF+Y++QHAKDEHF+PLL+QISK
Subjt:  DTVHVSFVVIKADSPWHYGDDGVDLVIKGPSGEHLHDFRDKTSEKHEFVAHNKGIHRFCFTNKSPYHETIDFDVHVGHFSYHEQHAKDEHFNPLLDQISK

Query:  LEEALYNIQFEQHWLEAQTDRQAIVNDGMSRRAIHKAMFESAALVGASVLQVYLLQRLFERKLGTSRV
        LEEALYNIQFEQHWLEAQTDRQAIVN+GMS+R IHKAMFESAAL+GASVLQ+YLL+RLFERKLGTSRV
Subjt:  LEEALYNIQFEQHWLEAQTDRQAIVNDGMSRRAIHKAMFESAALVGASVLQVYLLQRLFERKLGTSRV

KAG6572216.1 Transmembrane emp24 domain-containing protein p24beta2, partial [Cucurbita argyrosperma subsp. sororia]5.5e-28375.41Show/hide
Query:  MCSAGADAISCSIEFRCLDDDAWYNVAVKLEGDALRIMYCGFAVEHDNVFSANDFRSLSDLSDFEARFRPLSRQLQDCECRKVDPGMAVCASYSSRIDDV
        M SAGA A S  +EFR  DD+AWYNV VKLEGD LRI Y  FA EHDN+F+AN FRS S+LSDFE RFRPLSRQLQD EC  +DPGM VCASYSSR DDV
Subjt:  MCSAGADAISCSIEFRCLDDDAWYNVAVKLEGDALRIMYCGFAVEHDNVFSANDFRSLSDLSDFEARFRPLSRQLQDCECRKVDPGMAVCASYSSRIDDV

Query:  RFYDALVEGVDYLEHSYANGEEECLCNFILFWQHGPKHGNLTIASIADMCQIQAGKINDTVLPTFFKRVREKIQARMNKGDSC--------------QKD
        RFYDALVEGVDYLEHSYANGEEECLCNFI+FWQ+GP  GN  IASIA++C+IQ  KIND VL TFF++VREKIQ +MN+GD+               QKD
Subjt:  RFYDALVEGVDYLEHSYANGEEECLCNFILFWQHGPKHGNLTIASIADMCQIQAGKINDTVLPTFFKRVREKIQARMNKGDSC--------------QKD

Query:  DCSPNLKPKLSFFQRMDQETRRAKRSCGTVEPCKDLQILSPRKGEAIEQDTDFGGMKHQYMILLENLDKGLSPVKLAKFLHGQTSILPRVYIFPTLSDES
        DC P LK +LSFF RM+Q+TRRAKRS G V+ C+D Q L+PRK E I+QD D GGMK+QYMILLENLDKG+SPVKLAKFLHG+T ILPRVYIFP L+ E 
Subjt:  DCSPNLKPKLSFFQRMDQETRRAKRSCGTVEPCKDLQILSPRKGEAIEQDTDFGGMKHQYMILLENLDKGLSPVKLAKFLHGQTSILPRVYIFPTLSDES

Query:  YARGAVVLNCGKNLDRLCGFLDNPHHVIVSSQGRPLVVTGKVARLDTFGTL-AGAMVLDIDNKLCNENDGGVGVSVCSELKVVSVGTIDYSIAKQRKELF
        YARGAVVLNC  NL++LC FLDNP HVI+SSQGRPLVVTG+VAR +TFGT+ AGAMVLD +NK  NE DG     VC  LKVV +GT +YS AK  KELF
Subjt:  YARGAVVLNCGKNLDRLCGFLDNPHHVIVSSQGRPLVVTGKVARLDTFGTL-AGAMVLDIDNKLCNENDGGVGVSVCSELKVVSVGTIDYSIAKQRKELF

Query:  REFLSHQRRLHQRLALEESKIYCSAPLQYKL-IPLPQLAASVCSSPDSFDFGEMERWAM----MMAFAGVFLLSMRRSEGIRFVIDREECFSHNVQYDGD
         EFLSHQRRLHQRLA+EES I   A L  K+    P +A +    P S     MERWAM    MMAFAGV LL++R+SEGIRFVIDREECFSHNVQYDGD
Subjt:  REFLSHQRRLHQRLALEESKIYCSAPLQYKL-IPLPQLAASVCSSPDSFDFGEMERWAM----MMAFAGVFLLSMRRSEGIRFVIDREECFSHNVQYDGD

Query:  TVHVSFVVIKADSPWHYGDDGVDLVIKGPSGEHLHDFRDKTSEKHEFVAHNKGIHRFCFTNKSPYHETIDFDVHVGHFSYHEQHAKDEHFNPLLDQISKL
        TVHVSFVVIK DS WHYG+DGVDLVIKGPSGEHLHDFRDK SEKHEFVAHNKG+HRFCFTNKSPYHETIDFDVHVGHFSYHEQHAKDEHFNPLLDQISKL
Subjt:  TVHVSFVVIKADSPWHYGDDGVDLVIKGPSGEHLHDFRDKTSEKHEFVAHNKGIHRFCFTNKSPYHETIDFDVHVGHFSYHEQHAKDEHFNPLLDQISKL

Query:  EEALYNIQFEQHWLEAQTDRQAIVNDGMSRRAIHKAMFESAALVGASVLQVYLLQRLFERKLGTSRV
        EEALYNIQFEQHWLEAQTDRQAIVNDGMS+RA+HKAMFESAALVGASVLQVYLLQRLFERKL TSRV
Subjt:  EEALYNIQFEQHWLEAQTDRQAIVNDGMSRRAIHKAMFESAALVGASVLQVYLLQRLFERKLGTSRV

XP_022135662.1 uncharacterized protein LOC111007565 isoform X1 [Momordica charantia]7.7e-245100Show/hide
Query:  MCSAGADAISCSIEFRCLDDDAWYNVAVKLEGDALRIMYCGFAVEHDNVFSANDFRSLSDLSDFEARFRPLSRQLQDCECRKVDPGMAVCASYSSRIDDV
        MCSAGADAISCSIEFRCLDDDAWYNVAVKLEGDALRIMYCGFAVEHDNVFSANDFRSLSDLSDFEARFRPLSRQLQDCECRKVDPGMAVCASYSSRIDDV
Subjt:  MCSAGADAISCSIEFRCLDDDAWYNVAVKLEGDALRIMYCGFAVEHDNVFSANDFRSLSDLSDFEARFRPLSRQLQDCECRKVDPGMAVCASYSSRIDDV

Query:  RFYDALVEGVDYLEHSYANGEEECLCNFILFWQHGPKHGNLTIASIADMCQIQAGKINDTVLPTFFKRVREKIQARMNKGDSCQKDDCSPNLKPKLSFFQ
        RFYDALVEGVDYLEHSYANGEEECLCNFILFWQHGPKHGNLTIASIADMCQIQAGKINDTVLPTFFKRVREKIQARMNKGDSCQKDDCSPNLKPKLSFFQ
Subjt:  RFYDALVEGVDYLEHSYANGEEECLCNFILFWQHGPKHGNLTIASIADMCQIQAGKINDTVLPTFFKRVREKIQARMNKGDSCQKDDCSPNLKPKLSFFQ

Query:  RMDQETRRAKRSCGTVEPCKDLQILSPRKGEAIEQDTDFGGMKHQYMILLENLDKGLSPVKLAKFLHGQTSILPRVYIFPTLSDESYARGAVVLNCGKNL
        RMDQETRRAKRSCGTVEPCKDLQILSPRKGEAIEQDTDFGGMKHQYMILLENLDKGLSPVKLAKFLHGQTSILPRVYIFPTLSDESYARGAVVLNCGKNL
Subjt:  RMDQETRRAKRSCGTVEPCKDLQILSPRKGEAIEQDTDFGGMKHQYMILLENLDKGLSPVKLAKFLHGQTSILPRVYIFPTLSDESYARGAVVLNCGKNL

Query:  DRLCGFLDNPHHVIVSSQGRPLVVTGKVARLDTFGTLAGAMVLDIDNKLCNENDGGVGVSVCSELKVVSVGTIDYSIAKQRKELFREFLSHQRRLHQRLA
        DRLCGFLDNPHHVIVSSQGRPLVVTGKVARLDTFGTLAGAMVLDIDNKLCNENDGGVGVSVCSELKVVSVGTIDYSIAKQRKELFREFLSHQRRLHQRLA
Subjt:  DRLCGFLDNPHHVIVSSQGRPLVVTGKVARLDTFGTLAGAMVLDIDNKLCNENDGGVGVSVCSELKVVSVGTIDYSIAKQRKELFREFLSHQRRLHQRLA

Query:  LEESKIYCSAPLQYKL
        LEESKIYCSAPLQYKL
Subjt:  LEESKIYCSAPLQYKL

XP_022135663.1 uncharacterized protein LOC111007565 isoform X2 [Momordica charantia]2.0e-22494.23Show/hide
Query:  MCSAGADAISCSIEFRCLDDDAWYNVAVKLEGDALRIMYCGFAVEHDNVFSANDFRSLSDLSDFEARFRPLSRQLQDCECRKVDPGMAVCASYSSRIDDV
        MCSAGADAISCSIEFRCLDDDAWYNVAVKLEGDALRIMYCGFAVEHDNVFSANDFRSLSDLSDFEARFRPLSRQLQDCECRKVDPGMAVCASYSSRIDDV
Subjt:  MCSAGADAISCSIEFRCLDDDAWYNVAVKLEGDALRIMYCGFAVEHDNVFSANDFRSLSDLSDFEARFRPLSRQLQDCECRKVDPGMAVCASYSSRIDDV

Query:  RFYDALVEGVDYLEHSYANGEEECLCNFILFWQHGPKHGNLTIASIADMCQIQAGKINDTVLPTFFKRVREKIQARMNKGDSCQKDDCSPNLKPKLSFFQ
        RFYDALVEG                        HGPKHGNLTIASIADMCQIQAGKINDTVLPTFFKRVREKIQARMNKGDSCQKDDCSPNLKPKLSFFQ
Subjt:  RFYDALVEGVDYLEHSYANGEEECLCNFILFWQHGPKHGNLTIASIADMCQIQAGKINDTVLPTFFKRVREKIQARMNKGDSCQKDDCSPNLKPKLSFFQ

Query:  RMDQETRRAKRSCGTVEPCKDLQILSPRKGEAIEQDTDFGGMKHQYMILLENLDKGLSPVKLAKFLHGQTSILPRVYIFPTLSDESYARGAVVLNCGKNL
        RMDQETRRAKRSCGTVEPCKDLQILSPRKGEAIEQDTDFGGMKHQYMILLENLDKGLSPVKLAKFLHGQTSILPRVYIFPTLSDESYARGAVVLNCGKNL
Subjt:  RMDQETRRAKRSCGTVEPCKDLQILSPRKGEAIEQDTDFGGMKHQYMILLENLDKGLSPVKLAKFLHGQTSILPRVYIFPTLSDESYARGAVVLNCGKNL

Query:  DRLCGFLDNPHHVIVSSQGRPLVVTGKVARLDTFGTLAGAMVLDIDNKLCNENDGGVGVSVCSELKVVSVGTIDYSIAKQRKELFREFLSHQRRLHQRLA
        DRLCGFLDNPHHVIVSSQGRPLVVTGKVARLDTFGTLAGAMVLDIDNKLCNENDGGVGVSVCSELKVVSVGTIDYSIAKQRKELFREFLSHQRRLHQRLA
Subjt:  DRLCGFLDNPHHVIVSSQGRPLVVTGKVARLDTFGTLAGAMVLDIDNKLCNENDGGVGVSVCSELKVVSVGTIDYSIAKQRKELFREFLSHQRRLHQRLA

Query:  LEESKIYCSAPLQYKL
        LEESKIYCSAPLQYKL
Subjt:  LEESKIYCSAPLQYKL

XP_038888477.1 uncharacterized protein LOC120078315 isoform X1 [Benincasa hispida]8.7e-17274Show/hide
Query:  MCSAGADAISCSIEFRCLDDDAWYNVAVKLEGDALRIMYCGFAVEHDNVFSANDFRSLSDLSDFEARFRPLSRQLQDCECRKVDPGMAVCASYSSRIDDV
        M SAG DA S  IEF+C DDDAWYNV VKLEGD LRI YC F  EHDNVF+AN FRSLS+LS+FEARFRPLSRQLQD EC  VDPGM VCASYSS+ DDV
Subjt:  MCSAGADAISCSIEFRCLDDDAWYNVAVKLEGDALRIMYCGFAVEHDNVFSANDFRSLSDLSDFEARFRPLSRQLQDCECRKVDPGMAVCASYSSRIDDV

Query:  RFYDALVEGVDYLEHSYANGEEECLCNFILFWQHGPKHGNLTIASIADMCQIQAGKINDTVLPTFFKRVREKIQARMNKGDSC--------------QKD
        RFYDALVEGVDYLEHSYANGEEECLCNFIL WQHGP  GNLTIASIA+MCQIQ  +INDTVL TFF +VREKI  RMN+GD+C              QKD
Subjt:  RFYDALVEGVDYLEHSYANGEEECLCNFILFWQHGPKHGNLTIASIADMCQIQAGKINDTVLPTFFKRVREKIQARMNKGDSC--------------QKD

Query:  DCSPNLKPKLSFFQRMDQETRRAKRSCGTVEPCKDLQILSPRKGEAIEQDTDFGGMKHQYMILLENLDKGLSPVKLAKFLHGQTSILPRVYIFPTLSDES
        +CS  LK +LSFF+RMD +TRRAKRS  T+EP +D Q LS RK E IE DTD GGMK+QYMILLENLDKGLSPVK+AKFLH QT ILPRVYIFP+L+ ES
Subjt:  DCSPNLKPKLSFFQRMDQETRRAKRSCGTVEPCKDLQILSPRKGEAIEQDTDFGGMKHQYMILLENLDKGLSPVKLAKFLHGQTSILPRVYIFPTLSDES

Query:  YARGAVVLNCGKNLDRLCGFLDNPHHVIVSSQGRPLVVTGKVARLDTFGTL-AGAMVLDIDNKLCNENDGGVGVSVCSELKVVSVGTIDYSIAKQRKELF
        YARGAVVLNC KNL+RLC FLDNP HVI+SSQGRPLVVTG++AR +TFGTL AGAMVLD +NK  NE DG     VC ELKVV VGT +Y  AK  KELF
Subjt:  YARGAVVLNCGKNLDRLCGFLDNPHHVIVSSQGRPLVVTGKVARLDTFGTL-AGAMVLDIDNKLCNENDGGVGVSVCSELKVVSVGTIDYSIAKQRKELF

Query:  REFLSHQRRLHQRLALEESKIYCSAPL
         EFLSHQRRLHQRLA+EESKIYC+  L
Subjt:  REFLSHQRRLHQRLALEESKIYCSAPL

TrEMBL top hitse value%identityAlignment
A0A5D3E5V0 SAWADEE domain-containing protein5.3e-15969.86Show/hide
Query:  MCSAGADAISCSIEFRCLDDDAWYNVAVKLEGDALRIMYCGFAVEHDNVFSANDFRSLSDLSDFEARFRPLSRQLQDCECRKVDPGMAVCASYSSRIDDV
        M SAG DA S  +EF   DDDAWYN  VKL+G  LR+ YC F+ EHDNVF A+ F+SLS+LS FEARFRP+SRQLQD EC  V PGM VCASYSSR DDV
Subjt:  MCSAGADAISCSIEFRCLDDDAWYNVAVKLEGDALRIMYCGFAVEHDNVFSANDFRSLSDLSDFEARFRPLSRQLQDCECRKVDPGMAVCASYSSRIDDV

Query:  RFYDALVEGVDYLEHSYANGEEECLCNFILFWQHGPKHGNLTIASIADMCQIQAGKINDTVLPTFFKRVREKIQARMNKGD--------------SCQKD
        RFYDALVEGVDYLEHSYANGEEECLCNFIL WQ GP  GNLTIASIA+MCQIQ  KINDTVL TFF++VREKI+ R N+G+              +CQKD
Subjt:  RFYDALVEGVDYLEHSYANGEEECLCNFILFWQHGPKHGNLTIASIADMCQIQAGKINDTVLPTFFKRVREKIQARMNKGD--------------SCQKD

Query:  DCSPNLKPKLSFFQRMDQETRRAKRSCGTVEPCKDLQILSPRKGEAIEQDTDFGGMKHQYMILLENLDKGLSPVKLAKFLHGQTSILPRVYIFPTLSDES
        DCS  LK +LSFF+RMDQETRRAKRS G VEP +D   LS RK E IEQDTD GGMK+QYMILLENLDKGL+P+KLAKFL+ +T ILPRVYIFP+L+ E 
Subjt:  DCSPNLKPKLSFFQRMDQETRRAKRSCGTVEPCKDLQILSPRKGEAIEQDTDFGGMKHQYMILLENLDKGLSPVKLAKFLHGQTSILPRVYIFPTLSDES

Query:  YARGAVVLNCGKNLDRLCGFLDNPHHVIVSSQGRPLVVTGKVARLDTFGTL-AGAMVLDIDNKLCNENDGGVGVSVCSELKVVSVGTIDYSIAKQRKELF
        YARGAVV+NC KNL RL  FLD+P HVI+SSQGRPLVVTGK+AR +TFGTL AGAMVLD +NK  NE DG        ELKVV VGT +Y  AK  KELF
Subjt:  YARGAVVLNCGKNLDRLCGFLDNPHHVIVSSQGRPLVVTGKVARLDTFGTL-AGAMVLDIDNKLCNENDGGVGVSVCSELKVVSVGTIDYSIAKQRKELF

Query:  REFLSHQRRLHQRLALEESKIYCSAPLQ
         EFL HQR L QRLA+EESKIYC+  LQ
Subjt:  REFLSHQRRLHQRLALEESKIYCSAPLQ

A0A5J5A4X0 GOLD domain-containing protein1.8e-17850.6Show/hide
Query:  DAISCSIEFRCLDDDAWYNVAVKLEGDALRIMYCGFAVEHDNVFSANDFRSLSDLSDFEARFRPLSRQLQDCECRKVDPGMAVCASYSSRI----DDVRF
        D  +  +EFRC +DDAWY+V V L  D L + +  F    D ++ A DF++L  +  F  +FRP+S QLQD +C +V  G  VCAS+   +    DD+RF
Subjt:  DAISCSIEFRCLDDDAWYNVAVKLEGDALRIMYCGFAVEHDNVFSANDFRSLSDLSDFEARFRPLSRQLQDCECRKVDPGMAVCASYSSRI----DDVRF

Query:  YDALVEGVDYLEHSYANGEEECLCNFILFWQHGPKHGNLTIASIADMCQIQAGKINDTVLPTFFKRVREKIQA-------------RMNKGDSCQKDDCS
        YDA+VE V +  HS++  EEECLC+F+L WQHGPK G LT  + AD+C IQ+    D  +  F K  REKI+                +KG +C +++  
Subjt:  YDALVEGVDYLEHSYANGEEECLCNFILFWQHGPKHGNLTIASIADMCQIQAGKINDTVLPTFFKRVREKIQA-------------RMNKGDSCQKDDCS

Query:  PNLKPKLSFFQRMDQETRRAKRSCGTVEPCKDLQILSPRKGEAIEQDTDFGG----------MKHQYMILLENLDKGLSPVKLAKFLHGQTSILPRVYIF
         ++K  LS  Q   QE   + R  G+  P +  +    +  E I+QD DFGG          +   + IL+ENL+K LS   + +F+H QTSI P+ YI 
Subjt:  PNLKPKLSFFQRMDQETRRAKRSCGTVEPCKDLQILSPRKGEAIEQDTDFGG----------MKHQYMILLENLDKGLSPVKLAKFLHGQTSILPRVYIF

Query:  PTLSDESYARGAVVLNCGKNLDRLCGFLDNPHHVIVSSQGRPLVVTGKVARLDTFGTLAGAMVLDIDNKLCNENDGGVGVSVCSELKVVSVGTIDYSIAK
        P LS ESY RGA+VL+C K L ++  FL+NP H+I+SS+GRP  +  KV R  TF     +++ +   K  N+N  G       ELKVV +GT +Y  AK
Subjt:  PTLSDESYARGAVVLNCGKNLDRLCGFLDNPHHVIVSSQGRPLVVTGKVARLDTFGTLAGAMVLDIDNKLCNENDGGVGVSVCSELKVVSVGTIDYSIAK

Query:  QRKELFREFLSHQRRLHQRLALEESKIYCSAPLQYKLIPLPQLAASVCSSPDSFDFGEMERWAMMMAFAGVFLLSMRRSEGIRFVIDREECFSHNVQYDG
        + ++LF +FL HQ+RLH+         +CSA     ++ LP+          +   G +    ++M      LL +R + GIRFVID+EECFSHNVQY+G
Subjt:  QRKELFREFLSHQRRLHQRLALEESKIYCSAPLQYKLIPLPQLAASVCSSPDSFDFGEMERWAMMMAFAGVFLLSMRRSEGIRFVIDREECFSHNVQYDG

Query:  DTVHVSFVVIKADSPWHYGDDGVDLVIKGPSGEHLHDFRDKTSEKHEFVAHNKGIHRFCFTNKSPYHETIDFDVHVGHFSYHEQHAKDEHFNPLLDQISK
        DT+HVSFVVIKAD+PWH+GDDGVDLVIKGPSGE +HDFRDK+SEK++FV H KG++ FCFTNKSPYHETIDFDVH  HF+Y++QHAKDEHF+PLL+QISK
Subjt:  DTVHVSFVVIKADSPWHYGDDGVDLVIKGPSGEHLHDFRDKTSEKHEFVAHNKGIHRFCFTNKSPYHETIDFDVHVGHFSYHEQHAKDEHFNPLLDQISK

Query:  LEEALYNIQFEQHWLEAQTDRQAIVNDGMSRRAIHKAMFESAALVGASVLQVYLLQRLFERKLGTSRV
        LEEALYNIQFEQHWLEAQTDRQAIVN+GMS+R IHKAMFESAAL+GASVLQ+YLL+RLFERKLGTSRV
Subjt:  LEEALYNIQFEQHWLEAQTDRQAIVNDGMSRRAIHKAMFESAALVGASVLQVYLLQRLFERKLGTSRV

A0A6J1C3C6 uncharacterized protein LOC111007565 isoform X13.7e-245100Show/hide
Query:  MCSAGADAISCSIEFRCLDDDAWYNVAVKLEGDALRIMYCGFAVEHDNVFSANDFRSLSDLSDFEARFRPLSRQLQDCECRKVDPGMAVCASYSSRIDDV
        MCSAGADAISCSIEFRCLDDDAWYNVAVKLEGDALRIMYCGFAVEHDNVFSANDFRSLSDLSDFEARFRPLSRQLQDCECRKVDPGMAVCASYSSRIDDV
Subjt:  MCSAGADAISCSIEFRCLDDDAWYNVAVKLEGDALRIMYCGFAVEHDNVFSANDFRSLSDLSDFEARFRPLSRQLQDCECRKVDPGMAVCASYSSRIDDV

Query:  RFYDALVEGVDYLEHSYANGEEECLCNFILFWQHGPKHGNLTIASIADMCQIQAGKINDTVLPTFFKRVREKIQARMNKGDSCQKDDCSPNLKPKLSFFQ
        RFYDALVEGVDYLEHSYANGEEECLCNFILFWQHGPKHGNLTIASIADMCQIQAGKINDTVLPTFFKRVREKIQARMNKGDSCQKDDCSPNLKPKLSFFQ
Subjt:  RFYDALVEGVDYLEHSYANGEEECLCNFILFWQHGPKHGNLTIASIADMCQIQAGKINDTVLPTFFKRVREKIQARMNKGDSCQKDDCSPNLKPKLSFFQ

Query:  RMDQETRRAKRSCGTVEPCKDLQILSPRKGEAIEQDTDFGGMKHQYMILLENLDKGLSPVKLAKFLHGQTSILPRVYIFPTLSDESYARGAVVLNCGKNL
        RMDQETRRAKRSCGTVEPCKDLQILSPRKGEAIEQDTDFGGMKHQYMILLENLDKGLSPVKLAKFLHGQTSILPRVYIFPTLSDESYARGAVVLNCGKNL
Subjt:  RMDQETRRAKRSCGTVEPCKDLQILSPRKGEAIEQDTDFGGMKHQYMILLENLDKGLSPVKLAKFLHGQTSILPRVYIFPTLSDESYARGAVVLNCGKNL

Query:  DRLCGFLDNPHHVIVSSQGRPLVVTGKVARLDTFGTLAGAMVLDIDNKLCNENDGGVGVSVCSELKVVSVGTIDYSIAKQRKELFREFLSHQRRLHQRLA
        DRLCGFLDNPHHVIVSSQGRPLVVTGKVARLDTFGTLAGAMVLDIDNKLCNENDGGVGVSVCSELKVVSVGTIDYSIAKQRKELFREFLSHQRRLHQRLA
Subjt:  DRLCGFLDNPHHVIVSSQGRPLVVTGKVARLDTFGTLAGAMVLDIDNKLCNENDGGVGVSVCSELKVVSVGTIDYSIAKQRKELFREFLSHQRRLHQRLA

Query:  LEESKIYCSAPLQYKL
        LEESKIYCSAPLQYKL
Subjt:  LEESKIYCSAPLQYKL

A0A6J1C5H6 uncharacterized protein LOC111007565 isoform X29.6e-22594.23Show/hide
Query:  MCSAGADAISCSIEFRCLDDDAWYNVAVKLEGDALRIMYCGFAVEHDNVFSANDFRSLSDLSDFEARFRPLSRQLQDCECRKVDPGMAVCASYSSRIDDV
        MCSAGADAISCSIEFRCLDDDAWYNVAVKLEGDALRIMYCGFAVEHDNVFSANDFRSLSDLSDFEARFRPLSRQLQDCECRKVDPGMAVCASYSSRIDDV
Subjt:  MCSAGADAISCSIEFRCLDDDAWYNVAVKLEGDALRIMYCGFAVEHDNVFSANDFRSLSDLSDFEARFRPLSRQLQDCECRKVDPGMAVCASYSSRIDDV

Query:  RFYDALVEGVDYLEHSYANGEEECLCNFILFWQHGPKHGNLTIASIADMCQIQAGKINDTVLPTFFKRVREKIQARMNKGDSCQKDDCSPNLKPKLSFFQ
        RFYDALVEG                        HGPKHGNLTIASIADMCQIQAGKINDTVLPTFFKRVREKIQARMNKGDSCQKDDCSPNLKPKLSFFQ
Subjt:  RFYDALVEGVDYLEHSYANGEEECLCNFILFWQHGPKHGNLTIASIADMCQIQAGKINDTVLPTFFKRVREKIQARMNKGDSCQKDDCSPNLKPKLSFFQ

Query:  RMDQETRRAKRSCGTVEPCKDLQILSPRKGEAIEQDTDFGGMKHQYMILLENLDKGLSPVKLAKFLHGQTSILPRVYIFPTLSDESYARGAVVLNCGKNL
        RMDQETRRAKRSCGTVEPCKDLQILSPRKGEAIEQDTDFGGMKHQYMILLENLDKGLSPVKLAKFLHGQTSILPRVYIFPTLSDESYARGAVVLNCGKNL
Subjt:  RMDQETRRAKRSCGTVEPCKDLQILSPRKGEAIEQDTDFGGMKHQYMILLENLDKGLSPVKLAKFLHGQTSILPRVYIFPTLSDESYARGAVVLNCGKNL

Query:  DRLCGFLDNPHHVIVSSQGRPLVVTGKVARLDTFGTLAGAMVLDIDNKLCNENDGGVGVSVCSELKVVSVGTIDYSIAKQRKELFREFLSHQRRLHQRLA
        DRLCGFLDNPHHVIVSSQGRPLVVTGKVARLDTFGTLAGAMVLDIDNKLCNENDGGVGVSVCSELKVVSVGTIDYSIAKQRKELFREFLSHQRRLHQRLA
Subjt:  DRLCGFLDNPHHVIVSSQGRPLVVTGKVARLDTFGTLAGAMVLDIDNKLCNENDGGVGVSVCSELKVVSVGTIDYSIAKQRKELFREFLSHQRRLHQRLA

Query:  LEESKIYCSAPLQYKL
        LEESKIYCSAPLQYKL
Subjt:  LEESKIYCSAPLQYKL

A0A6J1GLK5 uncharacterized protein LOC111455094 isoform X11.2e-16170.31Show/hide
Query:  MCSAGADAISCSIEFRCLDDDAWYNVAVKLEGDALRIMYCGFAVEHDNVFSANDFRSLSDLSDFEARFRPLSRQLQDCECRKVDPGMAVCASYSSRIDDV
        M SAGA A S  +EFR  DD+AWYNV VKLEGD LRI Y  FA EHDN+F+AN FRS S+LSDFE RFRPLSRQLQD EC  +DPGM VCASYSSR DDV
Subjt:  MCSAGADAISCSIEFRCLDDDAWYNVAVKLEGDALRIMYCGFAVEHDNVFSANDFRSLSDLSDFEARFRPLSRQLQDCECRKVDPGMAVCASYSSRIDDV

Query:  RFYDALVEGVDYLEHSYANGEEECLCNFILFWQHGPKHGNLTIASIADMCQIQAGKINDTVLPTFFKRVREKIQARMNKGDSC--------------QKD
        RFYDALVEGVDYLEHSYANGEEECLCNFI+FWQ+GP  GN  I+SIA++C+IQ  KIND VL TFF++VREKIQ +MN+GD+               QKD
Subjt:  RFYDALVEGVDYLEHSYANGEEECLCNFILFWQHGPKHGNLTIASIADMCQIQAGKINDTVLPTFFKRVREKIQARMNKGDSC--------------QKD

Query:  DCSPNLKPKLSFFQRMDQETRRAKRSCGTVEPCKDLQILSPRKGEAIEQDTDFGGMKHQYMILLENLDKGLSPVKLAKFLHGQTSILPRVYIFPTLSDES
        DC P LK +LSFF RM+Q+TRRAKRS G V+ C+D Q L+ RK E I+QD D GGMK+QYMILLENLDKG+SPVKLAKFLHG+T ILPRVYIFP L+ E 
Subjt:  DCSPNLKPKLSFFQRMDQETRRAKRSCGTVEPCKDLQILSPRKGEAIEQDTDFGGMKHQYMILLENLDKGLSPVKLAKFLHGQTSILPRVYIFPTLSDES

Query:  YARGAVVLNCGKNLDRLCGFLDNPHHVIVSSQGRPLVVTGKVARLDTFGTL-AGAMVLDIDNKLCNENDGGVGVSVCSELKVVSVGTIDYSIAKQRKELF
        YARGAVVLNC  NL++LC FLDNP HVI+SSQGRPLVVTG+VAR +TFGT+ AGAMVLD +NK  NE DG     VC  LKVV +GT +YS AK  KELF
Subjt:  YARGAVVLNCGKNLDRLCGFLDNPHHVIVSSQGRPLVVTGKVARLDTFGTL-AGAMVLDIDNKLCNENDGGVGVSVCSELKVVSVGTIDYSIAKQRKELF

Query:  REFLSHQRRLHQRLALEESKI
         EFLSHQRRLHQRLA+EES I
Subjt:  REFLSHQRRLHQRLALEESKI

SwissProt top hitse value%identityAlignment
Q9LIL4 Transmembrane emp24 domain-containing protein p24beta34.0e-2633.33Show/hide
Query:  VFLLSMRRSEGIRFVIDREECFSHNVQYDGDTVHVSFVVIKADSPWHYGDDGVDLVIKGPSGEHLHDFRDKTSEKHEFVAHNKGIHRFCFTNKSPYHETI
        + L S+ +   +   ++ EEC    V Y+GDTV  +FVV+  D  W     G+D  +  P+G  +   +  + +K EF A   G+++FCF N     ET+
Subjt:  VFLLSMRRSEGIRFVIDREECFSHNVQYDGDTVHVSFVVIKADSPWHYGDDGVDLVIKGPSGEHLHDFRDKTSEKHEFVAHNKGIHRFCFTNKSPYHETI

Query:  DFDVHVGHFSYHEQHAKDEHFNPLLDQISKLEEALYNIQFEQHWLEAQTDRQAIVNDGMSRRAIHKAMFESAALVGASVLQVYLLQRLFERKLGTSRV
         F +HVGH       AKDEH +P+  +I++L EAL ++  EQ +L+A+  R    N+   +R I   + E   L  AS LQV  +++LF + +  +RV
Subjt:  DFDVHVGHFSYHEQHAKDEHFNPLLDQISKLEEALYNIQFEQHWLEAQTDRQAIVNDGMSRRAIHKAMFESAALVGASVLQVYLLQRLFERKLGTSRV

Q9S7M9 Transmembrane emp24 domain-containing protein p24beta29.9e-8675.51Show/hide
Query:  LLSMRRSEGIRFVIDREECFSHNVQYDGDTVHVSFVVIKADSPWHYGDDGVDLVIKGPSGEHLHDFRDKTSEKHEFVAHNKGIHRFCFTNKSPYHETIDF
        L S + + GIRFVIDREECFSH  +Y+GDT+HVSFVVIK+DS WH+ +DGVDLVI GP+GE +HDFR++ S KH+FV   KG++RFCFTNKSPYHETIDF
Subjt:  LLSMRRSEGIRFVIDREECFSHNVQYDGDTVHVSFVVIKADSPWHYGDDGVDLVIKGPSGEHLHDFRDKTSEKHEFVAHNKGIHRFCFTNKSPYHETIDF

Query:  DVHVGHFSYHEQHAKDEHFNPLLDQISKLEEALYNIQFEQHWLEAQTDRQAIVNDGMSRRAIHKAMFESAALVGASVLQVYLLQRLFERKLGTSRV
        DV +GHF+Y++QHAKDEHF PL++QISKLEEALYNIQFEQHWLEAQTDRQAIVN+ MS+RA+HKA+FES AL+GAS LQVYLL+RLFERKLG SRV
Subjt:  DVHVGHFSYHEQHAKDEHFNPLLDQISKLEEALYNIQFEQHWLEAQTDRQAIVNDGMSRRAIHKAMFESAALVGASVLQVYLLQRLFERKLGTSRV

Arabidopsis top hitse value%identityAlignment
AT3G07680.1 emp24/gp25L/p24 family/GOLD family protein7.0e-8775.51Show/hide
Query:  LLSMRRSEGIRFVIDREECFSHNVQYDGDTVHVSFVVIKADSPWHYGDDGVDLVIKGPSGEHLHDFRDKTSEKHEFVAHNKGIHRFCFTNKSPYHETIDF
        L S + + GIRFVIDREECFSH  +Y+GDT+HVSFVVIK+DS WH+ +DGVDLVI GP+GE +HDFR++ S KH+FV   KG++RFCFTNKSPYHETIDF
Subjt:  LLSMRRSEGIRFVIDREECFSHNVQYDGDTVHVSFVVIKADSPWHYGDDGVDLVIKGPSGEHLHDFRDKTSEKHEFVAHNKGIHRFCFTNKSPYHETIDF

Query:  DVHVGHFSYHEQHAKDEHFNPLLDQISKLEEALYNIQFEQHWLEAQTDRQAIVNDGMSRRAIHKAMFESAALVGASVLQVYLLQRLFERKLGTSRV
        DV +GHF+Y++QHAKDEHF PL++QISKLEEALYNIQFEQHWLEAQTDRQAIVN+ MS+RA+HKA+FES AL+GAS LQVYLL+RLFERKLG SRV
Subjt:  DVHVGHFSYHEQHAKDEHFNPLLDQISKLEEALYNIQFEQHWLEAQTDRQAIVNDGMSRRAIHKAMFESAALVGASVLQVYLLQRLFERKLGTSRV

AT3G22845.1 emp24/gp25L/p24 family/GOLD family protein2.8e-2733.33Show/hide
Query:  VFLLSMRRSEGIRFVIDREECFSHNVQYDGDTVHVSFVVIKADSPWHYGDDGVDLVIKGPSGEHLHDFRDKTSEKHEFVAHNKGIHRFCFTNKSPYHETI
        + L S+ +   +   ++ EEC    V Y+GDTV  +FVV+  D  W     G+D  +  P+G  +   +  + +K EF A   G+++FCF N     ET+
Subjt:  VFLLSMRRSEGIRFVIDREECFSHNVQYDGDTVHVSFVVIKADSPWHYGDDGVDLVIKGPSGEHLHDFRDKTSEKHEFVAHNKGIHRFCFTNKSPYHETI

Query:  DFDVHVGHFSYHEQHAKDEHFNPLLDQISKLEEALYNIQFEQHWLEAQTDRQAIVNDGMSRRAIHKAMFESAALVGASVLQVYLLQRLFERKLGTSRV
         F +HVGH       AKDEH +P+  +I++L EAL ++  EQ +L+A+  R    N+   +R I   + E   L  AS LQV  +++LF + +  +RV
Subjt:  DFDVHVGHFSYHEQHAKDEHFNPLLDQISKLEEALYNIQFEQHWLEAQTDRQAIVNDGMSRRAIHKAMFESAALVGASVLQVYLLQRLFERKLGTSRV

AT4G25330.1 unknown protein1.4e-3138.98Show/hide
Query:  IEFRCLDDDAWYNVAVKLEGDALRIMYCGFAVEHDNVFSANDFRSLSDLSDFEARFRPLSRQLQDCECRKVDPGMAVCASYSSRIDDVRFYDALVEGVDY
        +EFR  +D+AWY V      DAL I + GF+ EHD  + A+DF++  ++ +FE RFR  S Q+QD EC KV  G  VCA++ S  ++V+FYDA+V  V+ 
Subjt:  IEFRCLDDDAWYNVAVKLEGDALRIMYCGFAVEHDNVFSANDFRSLSDLSDFEARFRPLSRQLQDCECRKVDPGMAVCASYSSRIDDVRFYDALVEGVDY

Query:  LEHSY-ANGEEECLCNFILFWQHGPKHGNLTIASIADMC-QIQAGKINDTVLPTFFKRVREKIQARMNKGDSCQKDD
         +H     G E C C+F LFW+ GP    +T A + D+C + +  +IN  V+ +F K  R K+      G++C + +
Subjt:  LEHSY-ANGEEECLCNFILFWQHGPKHGNLTIASIADMC-QIQAGKINDTVLPTFFKRVREKIQARMNKGDSCQKDD


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTGCTCGGCCGGAGCCGATGCAATCTCCTGCAGCATTGAGTTCCGATGCCTGGACGACGACGCCTGGTACAATGTCGCCGTGAAACTCGAAGGCGATGCTCTTCGAAT
CATGTACTGCGGGTTCGCGGTGGAGCACGACAATGTCTTCAGCGCCAATGATTTCCGGAGCTTATCGGACTTGAGCGACTTCGAAGCTAGGTTTCGGCCTCTGTCCAGGC
AGTTGCAGGACTGTGAGTGCCGGAAGGTGGACCCTGGCATGGCCGTCTGCGCTTCCTACTCCTCCCGAATCGATGATGTTCGGTTCTACGACGCGCTTGTGGAAGGGGTG
GATTATCTTGAACACTCTTATGCAAATGGAGAAGAAGAATGCTTATGCAACTTTATCCTTTTCTGGCAGCATGGTCCAAAACATGGGAATCTGACAATTGCCAGTATTGC
TGACATGTGTCAAATTCAAGCTGGTAAAATTAATGACACAGTGTTACCAACCTTCTTCAAGAGAGTTAGGGAGAAAATCCAAGCCAGAATGAATAAAGGTGATTCTTGTC
AGAAAGATGACTGCAGTCCAAATTTGAAGCCCAAGCTCTCCTTTTTTCAACGCATGGACCAGGAAACACGACGTGCCAAGCGTTCTTGTGGTACAGTAGAACCATGTAAA
GATCTACAGATTCTCAGTCCCAGAAAAGGTGAGGCAATCGAGCAAGATACTGATTTTGGTGGAATGAAACATCAGTACATGATTTTACTTGAGAATCTAGATAAAGGATT
GTCTCCGGTAAAACTAGCCAAATTCTTACATGGACAGACATCCATATTACCTCGAGTATACATATTCCCGACTTTATCAGATGAGTCGTATGCAAGGGGAGCTGTTGTAT
TGAATTGCGGAAAGAACCTAGATAGGTTGTGTGGTTTTTTGGATAATCCGCATCATGTCATTGTATCCTCCCAAGGAAGGCCCTTGGTTGTAACTGGGAAAGTGGCGAGG
CTCGATACTTTTGGAACATTGGCAGGGGCCATGGTGCTAGACATTGATAATAAACTTTGTAACGAAAATGATGGCGGTGTTGGTGTCAGTGTGTGTAGCGAGCTGAAGGT
AGTGAGTGTAGGAACAATCGATTATTCGATTGCGAAGCAAAGGAAGGAGTTGTTTAGGGAGTTTCTTAGCCATCAGAGGAGATTGCATCAAAGATTGGCCCTGGAGGAGA
GCAAGATATATTGCAGTGCTCCTTTGCAGTATAAACTAATCCCTCTTCCCCAATTGGCGGCAAGTGTTTGTTCGTCCCCTGATTCCTTTGATTTTGGAGAAATGGAGCGA
TGGGCTATGATGATGGCGTTTGCGGGGGTGTTTCTATTGAGCATGAGAAGATCAGAAGGGATTAGGTTTGTAATAGACAGGGAAGAATGTTTCTCTCACAATGTTCAGTA
CGATGGAGATACTGTTCATGTTTCTTTCGTCGTAATTAAGGCTGATTCCCCCTGGCACTACGGCGATGATGGAGTCGATCTTGTGATAAAGGGTCCCTCTGGTGAACATC
TTCATGATTTCCGCGACAAGACTAGTGAAAAGCATGAATTTGTTGCTCACAACAAAGGGATTCACCGATTCTGCTTCACGAACAAGTCTCCATATCACGAAACAATAGAC
TTCGATGTGCACGTCGGTCACTTTTCGTACCACGAGCAGCATGCAAAAGATGAGCATTTCAACCCTTTGCTGGATCAAATATCTAAGTTGGAGGAAGCTCTTTACAATAT
CCAGTTTGAGCAGCATTGGTTAGAGGCTCAGACCGACCGCCAGGCAATAGTGAATGATGGAATGAGCAGAAGGGCAATCCACAAGGCAATGTTTGAATCAGCAGCACTGG
TAGGGGCCAGTGTCCTCCAAGTCTACTTGCTGCAACGCTTGTTTGAAAGGAAACTTGGGACTTCCAGAGTTTAG
mRNA sequenceShow/hide mRNA sequence
ATGTGCTCGGCCGGAGCCGATGCAATCTCCTGCAGCATTGAGTTCCGATGCCTGGACGACGACGCCTGGTACAATGTCGCCGTGAAACTCGAAGGCGATGCTCTTCGAAT
CATGTACTGCGGGTTCGCGGTGGAGCACGACAATGTCTTCAGCGCCAATGATTTCCGGAGCTTATCGGACTTGAGCGACTTCGAAGCTAGGTTTCGGCCTCTGTCCAGGC
AGTTGCAGGACTGTGAGTGCCGGAAGGTGGACCCTGGCATGGCCGTCTGCGCTTCCTACTCCTCCCGAATCGATGATGTTCGGTTCTACGACGCGCTTGTGGAAGGGGTG
GATTATCTTGAACACTCTTATGCAAATGGAGAAGAAGAATGCTTATGCAACTTTATCCTTTTCTGGCAGCATGGTCCAAAACATGGGAATCTGACAATTGCCAGTATTGC
TGACATGTGTCAAATTCAAGCTGGTAAAATTAATGACACAGTGTTACCAACCTTCTTCAAGAGAGTTAGGGAGAAAATCCAAGCCAGAATGAATAAAGGTGATTCTTGTC
AGAAAGATGACTGCAGTCCAAATTTGAAGCCCAAGCTCTCCTTTTTTCAACGCATGGACCAGGAAACACGACGTGCCAAGCGTTCTTGTGGTACAGTAGAACCATGTAAA
GATCTACAGATTCTCAGTCCCAGAAAAGGTGAGGCAATCGAGCAAGATACTGATTTTGGTGGAATGAAACATCAGTACATGATTTTACTTGAGAATCTAGATAAAGGATT
GTCTCCGGTAAAACTAGCCAAATTCTTACATGGACAGACATCCATATTACCTCGAGTATACATATTCCCGACTTTATCAGATGAGTCGTATGCAAGGGGAGCTGTTGTAT
TGAATTGCGGAAAGAACCTAGATAGGTTGTGTGGTTTTTTGGATAATCCGCATCATGTCATTGTATCCTCCCAAGGAAGGCCCTTGGTTGTAACTGGGAAAGTGGCGAGG
CTCGATACTTTTGGAACATTGGCAGGGGCCATGGTGCTAGACATTGATAATAAACTTTGTAACGAAAATGATGGCGGTGTTGGTGTCAGTGTGTGTAGCGAGCTGAAGGT
AGTGAGTGTAGGAACAATCGATTATTCGATTGCGAAGCAAAGGAAGGAGTTGTTTAGGGAGTTTCTTAGCCATCAGAGGAGATTGCATCAAAGATTGGCCCTGGAGGAGA
GCAAGATATATTGCAGTGCTCCTTTGCAGTATAAACTAATCCCTCTTCCCCAATTGGCGGCAAGTGTTTGTTCGTCCCCTGATTCCTTTGATTTTGGAGAAATGGAGCGA
TGGGCTATGATGATGGCGTTTGCGGGGGTGTTTCTATTGAGCATGAGAAGATCAGAAGGGATTAGGTTTGTAATAGACAGGGAAGAATGTTTCTCTCACAATGTTCAGTA
CGATGGAGATACTGTTCATGTTTCTTTCGTCGTAATTAAGGCTGATTCCCCCTGGCACTACGGCGATGATGGAGTCGATCTTGTGATAAAGGGTCCCTCTGGTGAACATC
TTCATGATTTCCGCGACAAGACTAGTGAAAAGCATGAATTTGTTGCTCACAACAAAGGGATTCACCGATTCTGCTTCACGAACAAGTCTCCATATCACGAAACAATAGAC
TTCGATGTGCACGTCGGTCACTTTTCGTACCACGAGCAGCATGCAAAAGATGAGCATTTCAACCCTTTGCTGGATCAAATATCTAAGTTGGAGGAAGCTCTTTACAATAT
CCAGTTTGAGCAGCATTGGTTAGAGGCTCAGACCGACCGCCAGGCAATAGTGAATGATGGAATGAGCAGAAGGGCAATCCACAAGGCAATGTTTGAATCAGCAGCACTGG
TAGGGGCCAGTGTCCTCCAAGTCTACTTGCTGCAACGCTTGTTTGAAAGGAAACTTGGGACTTCCAGAGTTTAG
Protein sequenceShow/hide protein sequence
MCSAGADAISCSIEFRCLDDDAWYNVAVKLEGDALRIMYCGFAVEHDNVFSANDFRSLSDLSDFEARFRPLSRQLQDCECRKVDPGMAVCASYSSRIDDVRFYDALVEGV
DYLEHSYANGEEECLCNFILFWQHGPKHGNLTIASIADMCQIQAGKINDTVLPTFFKRVREKIQARMNKGDSCQKDDCSPNLKPKLSFFQRMDQETRRAKRSCGTVEPCK
DLQILSPRKGEAIEQDTDFGGMKHQYMILLENLDKGLSPVKLAKFLHGQTSILPRVYIFPTLSDESYARGAVVLNCGKNLDRLCGFLDNPHHVIVSSQGRPLVVTGKVAR
LDTFGTLAGAMVLDIDNKLCNENDGGVGVSVCSELKVVSVGTIDYSIAKQRKELFREFLSHQRRLHQRLALEESKIYCSAPLQYKLIPLPQLAASVCSSPDSFDFGEMER
WAMMMAFAGVFLLSMRRSEGIRFVIDREECFSHNVQYDGDTVHVSFVVIKADSPWHYGDDGVDLVIKGPSGEHLHDFRDKTSEKHEFVAHNKGIHRFCFTNKSPYHETID
FDVHVGHFSYHEQHAKDEHFNPLLDQISKLEEALYNIQFEQHWLEAQTDRQAIVNDGMSRRAIHKAMFESAALVGASVLQVYLLQRLFERKLGTSRV