; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr024629 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr024629
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionHAD superfamily, subfamily IIIB acid phosphatase
Genome locationtig00002486:1369781..1378731
RNA-Seq ExpressionSgr024629
SyntenySgr024629
Gene Ontology termsGO:0003993 - acid phosphatase activity (molecular function)
InterPro domainsIPR005519 - Acid phosphatase, class B-like
IPR010028 - Acid phosphatase, plant
IPR021916 - Protein of unknown function DUF3527
IPR023214 - HAD superfamily
IPR036412 - HAD-like superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0049214.1 uncharacterized protein E6C27_scaffold171G004540 [Cucumis melo var. makuwa]1.3e-27277.35Show/hide
Query:  MEQTSFDAQKNRRGKIGSFCDRNK-EELLENGDLDLSK-SISRGLNKRVMLPHALYLKLKQYRISKSYVHDSFFNCSIGLDYKVPKYMVTIDEKYLRRCL
        MEQTS DA K+ R KIGSFCDRNK EE LENGDL LSK SISRGLNK++MLPHALYLKLKQ RIS+SYVHDS FNC+IGLDY+VPK MVTIDEKY+RRCL
Subjt:  MEQTSFDAQKNRRGKIGSFCDRNK-EELLENGDLDLSK-SISRGLNKRVMLPHALYLKLKQYRISKSYVHDSFFNCSIGLDYKVPKYMVTIDEKYLRRCL

Query:  ELIQISALKPARCNTSINLSSVKTGALSESLSMTKLRTRGMGGLERFIIACPSAGEDGNTVISSRNLWFVGSIMGSKSMINILKSPLLHQFSITDSTSNF
        ELIQ SA K ARCN SI+LSSVKT AL+ESLS+ KLRTRGMG +ERFII CPSA EDGNTV+SS  +WFVGSIMGSKSMINILKSPLLHQ  IT+ TSN 
Subjt:  ELIQISALKPARCNTSINLSSVKTGALSESLSMTKLRTRGMGGLERFIIACPSAGEDGNTVISSRNLWFVGSIMGSKSMINILKSPLLHQFSITDSTSNF

Query:  IRMDLNDIKGFTGSNFMDSPGGVDISSLKKLENAKLTMGSHQDGSDTANERFFSTPSRNSLCSDQSSSGSASTPLCQGMLQFTWKEGSPHFIFSVDDEKE
        IRMDLNDIKGFTGSN MDSPGGV+ISSLK L NAK    SHQD SD ANERFFSTPSRNSLCSDQSSSGSAST LCQGMLQFTWK+G+P+FIFSVDDEKE
Subjt:  IRMDLNDIKGFTGSNFMDSPGGVDISSLKKLENAKLTMGSHQDGSDTANERFFSTPSRNSLCSDQSSSGSASTPLCQGMLQFTWKEGSPHFIFSVDDEKE

Query:  VYVASSSKVASTDNKALDYVYLFHSTKSGLKDHEVTNSRPCIVGKMTVSTSYSVCPNNSRISDTEFVLFGGIKNSDLEMNTSNNILKKNKVFPRKVAEVF
        VYVASSSKV S +N ALDYVYLF S KSGLKDHEV NSRPCIVGKMTVSTSY VCPNNS+I+DTEFVLFGGI+NSDLE+N SN +LKKNKVFPRKVAEVF
Subjt:  VYVASSSKVASTDNKALDYVYLFHSTKSGLKDHEVTNSRPCIVGKMTVSTSYSVCPNNSRISDTEFVLFGGIKNSDLEMNTSNNILKKNKVFPRKVAEVF

Query:  RTSNSSKQRNKPNFNRSVVIKDSCPWEPYSDKLNNVDDLICARDLPPILELAAVVVRNHLPEDRGSRVGGWGLKFLKQIEAKQTHDSPKTLGQSDCCARN
        RTSNSSKQR+ PN NRS  +KDSCPWEP SDKLN+ DDL+CARDLPP LELAA+VVR+HLPEDRGSRVGGWGLKFLKQ +AKQ ++S  T  Q+DCC RN
Subjt:  RTSNSSKQRNKPNFNRSVVIKDSCPWEPYSDKLNNVDDLICARDLPPILELAAVVVRNHLPEDRGSRVGGWGLKFLKQIEAKQTHDSPKTLGQSDCCARN

Query:  SGNCSTSMDILIPAGLHGGPRTKNGGPSSLMERWRSGGLCDCGGWDIGCPLTVLTGQSISGDNSPQADTQECRAFNIYAKMALHDELCFSDCSYKFYASD
        SG CSTSMDILIPAGLHGGPRT+NGGPS+L ERW+SGG+CDCGGWDIGCPLT+L GQS++ D   QAD QECRAFNI+AK                    
Subjt:  SGNCSTSMDILIPAGLHGGPRTKNGGPSSLMERWRSGGLCDCGGWDIGCPLTVLTGQSISGDNSPQADTQECRAFNIYAKMALHDELCFSDCSYKFYASD

Query:  NKYTTIVFQGSENGPPSWRMVNIRDGL
                 G ENGPP+ RMVNIR+GL
Subjt:  NKYTTIVFQGSENGPPSWRMVNIRDGL

TYK17344.1 uncharacterized protein E5676_scaffold434G001990 [Cucumis melo var. makuwa]1.0e-27277.35Show/hide
Query:  MEQTSFDAQKNRRGKIGSFCDRNK-EELLENGDLDLSK-SISRGLNKRVMLPHALYLKLKQYRISKSYVHDSFFNCSIGLDYKVPKYMVTIDEKYLRRCL
        MEQTS DA K+ R KIGSFCDRNK EE LENGDL LSK SISRGLNK++MLPHALYLKLKQ RIS+SYVHDS FNC+IGLDY+VPK MVTIDEKY+RRCL
Subjt:  MEQTSFDAQKNRRGKIGSFCDRNK-EELLENGDLDLSK-SISRGLNKRVMLPHALYLKLKQYRISKSYVHDSFFNCSIGLDYKVPKYMVTIDEKYLRRCL

Query:  ELIQISALKPARCNTSINLSSVKTGALSESLSMTKLRTRGMGGLERFIIACPSAGEDGNTVISSRNLWFVGSIMGSKSMINILKSPLLHQFSITDSTSNF
        ELIQ SA K ARCN SI+LSSVKT AL+ESLS+ KLRTRGMG +ERFII CPSA EDGNTV+SS  +WFVGSIMGSKSMINILKSPLLHQ  IT+ TSN 
Subjt:  ELIQISALKPARCNTSINLSSVKTGALSESLSMTKLRTRGMGGLERFIIACPSAGEDGNTVISSRNLWFVGSIMGSKSMINILKSPLLHQFSITDSTSNF

Query:  IRMDLNDIKGFTGSNFMDSPGGVDISSLKKLENAKLTMGSHQDGSDTANERFFSTPSRNSLCSDQSSSGSASTPLCQGMLQFTWKEGSPHFIFSVDDEKE
        IRMDLNDIKGFTGSN MDSPGGV+ISSLK L NAK    SHQD SD ANERFFSTPSRNSLCSDQSSSGSAST LCQGMLQFTWK+G+P+FIFSVDDEKE
Subjt:  IRMDLNDIKGFTGSNFMDSPGGVDISSLKKLENAKLTMGSHQDGSDTANERFFSTPSRNSLCSDQSSSGSASTPLCQGMLQFTWKEGSPHFIFSVDDEKE

Query:  VYVASSSKVASTDNKALDYVYLFHSTKSGLKDHEVTNSRPCIVGKMTVSTSYSVCPNNSRISDTEFVLFGGIKNSDLEMNTSNNILKKNKVFPRKVAEVF
        VYVASSSKV S +N ALDYVYLF S KSGLKDHEV NSRPCIVGKMTVSTSY VCPNNS+I+DTEFVLFGGI+NSDLE+N SN +LKKNKVFPRKVAEVF
Subjt:  VYVASSSKVASTDNKALDYVYLFHSTKSGLKDHEVTNSRPCIVGKMTVSTSYSVCPNNSRISDTEFVLFGGIKNSDLEMNTSNNILKKNKVFPRKVAEVF

Query:  RTSNSSKQRNKPNFNRSVVIKDSCPWEPYSDKLNNVDDLICARDLPPILELAAVVVRNHLPEDRGSRVGGWGLKFLKQIEAKQTHDSPKTLGQSDCCARN
        RTSNSSKQR+ PN NRS  +KDSCPWEP SDKLN+ DDL+CARDLPP LELAA+VVR+HLPEDRGSRVGGWGLKFLKQ +AKQ ++S  T  Q+DCC RN
Subjt:  RTSNSSKQRNKPNFNRSVVIKDSCPWEPYSDKLNNVDDLICARDLPPILELAAVVVRNHLPEDRGSRVGGWGLKFLKQIEAKQTHDSPKTLGQSDCCARN

Query:  SGNCSTSMDILIPAGLHGGPRTKNGGPSSLMERWRSGGLCDCGGWDIGCPLTVLTGQSISGDNSPQADTQECRAFNIYAKMALHDELCFSDCSYKFYASD
        SG CSTSMDILIPAGLHGGPRT+NGGPS+L ERW+SGG+CDCGGWDIGCPLT+L GQS++ D   QAD QECRAFNI+AK                    
Subjt:  SGNCSTSMDILIPAGLHGGPRTKNGGPSSLMERWRSGGLCDCGGWDIGCPLTVLTGQSISGDNSPQADTQECRAFNIYAKMALHDELCFSDCSYKFYASD

Query:  NKYTTIVFQGSENGPPSWRMVNIRDGL
                 G ENGPP+ RMVNIR+GL
Subjt:  NKYTTIVFQGSENGPPSWRMVNIRDGL

XP_004134399.1 uncharacterized protein LOC101209831 [Cucumis sativus]1.0e-27778.63Show/hide
Query:  MEQTSFDAQKNRRGKIGSFCDRNK-EELLENGDLDLSK-SISRGLNKRVMLPHALYLKLKQYRISKSYVHDSFFNCSIGLDYKVPKYMVTIDEKYLRRCL
        MEQTS DA+KN R KIGSFC RNK EE LENGDL LSK SISRGLNK++MLPHALYLKLKQ RIS+SYVHDSFFNC+IGLDYKVPKYMVTIDEKYLRRCL
Subjt:  MEQTSFDAQKNRRGKIGSFCDRNK-EELLENGDLDLSK-SISRGLNKRVMLPHALYLKLKQYRISKSYVHDSFFNCSIGLDYKVPKYMVTIDEKYLRRCL

Query:  ELIQISALKPARCNTSINLSSVKTGALSESLSMTKLRTRGMGGLERFIIACPSAGEDGNTVISSRNLWFVGSIMGSKSMINILKSPLLHQFSITDSTSNF
        ELIQ SALK ARCN SI+LSSVKTGAL+ESLS+ KL+TRGM  +ERFII CPSAGED NTV+SS  +WFVGSIMGSKSMINILKSPLLHQ  IT+ TSN 
Subjt:  ELIQISALKPARCNTSINLSSVKTGALSESLSMTKLRTRGMGGLERFIIACPSAGEDGNTVISSRNLWFVGSIMGSKSMINILKSPLLHQFSITDSTSNF

Query:  IRMDLNDIKGFTGSNFMDSPGGVDISSLKKLENAKLTMGSHQDGSDTANERFFSTPSRNSLCSDQSSSGSASTPLCQGMLQFTWKEGSPHFIFSVDDEKE
        IRMDLNDIKGFTGSNFMDSPG VDISSLK L+N K    SHQDGSD ANERFFSTPSRNSLCSDQSSSGSAST LCQGMLQFTWK+GSP+FIFSVDDEKE
Subjt:  IRMDLNDIKGFTGSNFMDSPGGVDISSLKKLENAKLTMGSHQDGSDTANERFFSTPSRNSLCSDQSSSGSASTPLCQGMLQFTWKEGSPHFIFSVDDEKE

Query:  VYVASSSKVASTDNKALDYVYLFHSTKSGLKDHEVTNSRPCIVGKMTVSTSYSVCPNNSRISDTEFVLFGGIKNSDLEMNTSNNILKKNKVFPRKVAEVF
        VYVASSSKV S DN ALDYVYLF S KSGLKDHEV NSRPCIVGKMTVSTSY VC NNS+I+DTEFVLFGGI+NSDLE++ SN +LKKNKVFPRKVAEVF
Subjt:  VYVASSSKVASTDNKALDYVYLFHSTKSGLKDHEVTNSRPCIVGKMTVSTSYSVCPNNSRISDTEFVLFGGIKNSDLEMNTSNNILKKNKVFPRKVAEVF

Query:  RTSNSSKQRNKPNFNRSVVIKDSCPWEPYSDKLNNVDDLICARDLPPILELAAVVVRNHLPEDRGSRVGGWGLKFLKQIEAKQTHDSPKTLGQSDCCARN
        RTSNSSK RN PN NRS V+KD CPWEPYSDKLN+ DDLICARDLPP LELAA+VVR+HLPED GSRVGGWGLKFLKQ +AKQT++S  T  Q+DCC RN
Subjt:  RTSNSSKQRNKPNFNRSVVIKDSCPWEPYSDKLNNVDDLICARDLPPILELAAVVVRNHLPEDRGSRVGGWGLKFLKQIEAKQTHDSPKTLGQSDCCARN

Query:  SGNCSTSMDILIPAGLHGGPRTKNGGPSSLMERWRSGGLCDCGGWDIGCPLTVLTGQSISGDNSPQADTQECRAFNIYAKMALHDELCFSDCSYKFYASD
        SG CSTSMDILIPAGLHGGPRT+NGGPS+L ERW+SGG+CDCGGWDIGCPLT+L GQS++ D   QADTQECRAFNI+AK                    
Subjt:  SGNCSTSMDILIPAGLHGGPRTKNGGPSSLMERWRSGGLCDCGGWDIGCPLTVLTGQSISGDNSPQADTQECRAFNIYAKMALHDELCFSDCSYKFYASD

Query:  NKYTTIVFQGSENGPPSWRMVNIRDGL
                 G EN PP+ RMVNIRDGL
Subjt:  NKYTTIVFQGSENGPPSWRMVNIRDGL

XP_022146763.1 uncharacterized protein LOC111015891 [Momordica charantia]3.4e-28480.86Show/hide
Query:  MEQTSFDAQKNRRGKIGSFCDRNKE-ELLENGDLDLSKSISRGLNKRVMLPHALYLKLKQYRISKSYVHDSFFNCSIGLDYKVPKYMVTIDEKYLRRCLE
        MEQTSFD +K+++GKI SFCDRNK+ E+LENGDL LSKS+SRGLNKRVMLP ALYLKLKQ+RISK+YVHDSFFNCSIGLD KVPKYMVTIDEKYLRRCLE
Subjt:  MEQTSFDAQKNRRGKIGSFCDRNKE-ELLENGDLDLSKSISRGLNKRVMLPHALYLKLKQYRISKSYVHDSFFNCSIGLDYKVPKYMVTIDEKYLRRCLE

Query:  LIQISALKPARCNTSINLSSVKTGALSESLS-MTKLRTRGMGGLERFIIACPSAGEDGNTVISSRNLWFVGSIMGSKSMINILKSPLLHQFSITDSTSNF
        LIQIS+ KPARCN SINLSS K GALS SLS M+KLRTRGM  LERF++ CPS  EDGN V+SS  LW+VGSIM S+SMINILKSPLLHQF ITDSTSNF
Subjt:  LIQISALKPARCNTSINLSSVKTGALSESLS-MTKLRTRGMGGLERFIIACPSAGEDGNTVISSRNLWFVGSIMGSKSMINILKSPLLHQFSITDSTSNF

Query:  IRMDLNDIKGFTGSNFMDSPGGVDISSLKKLENAKLTMGSHQDGSDTANERFFSTPSRNSLCSDQSSSGSASTPLCQGMLQFTWKEGSPHFIFSVDDEKE
        IRMDLNDIKGFTGS F DS GGVDISSLKKLENAK T  SHQDGSDTANERFF TPSR SLCSDQSSSGSAS PLCQGMLQFTWKEGSPHFIFSVDDEKE
Subjt:  IRMDLNDIKGFTGSNFMDSPGGVDISSLKKLENAKLTMGSHQDGSDTANERFFSTPSRNSLCSDQSSSGSASTPLCQGMLQFTWKEGSPHFIFSVDDEKE

Query:  VYVASSSKVASTDNKALDYVYLFHSTKSGLKDHEVTNSRPCIVGKMTVSTSYSVCPNNSRISDTEFVLFGGIKNSDLEMNTSNNILKKNKVFPRKVAEVF
        VYVASSSKVAS DNKALDYVY FHS KS LKDHE  +SRP IVGKMTVSTSYSVC NNSRI+DTEFVLFGGI+NSDLE+NTSNNILKKNKVFPRKVAEVF
Subjt:  VYVASSSKVASTDNKALDYVYLFHSTKSGLKDHEVTNSRPCIVGKMTVSTSYSVCPNNSRISDTEFVLFGGIKNSDLEMNTSNNILKKNKVFPRKVAEVF

Query:  RTSNSSKQRNKPNFNRSVVIKDSCPWEPYSDKLNNVDDLICARDLPPILELAAVVVRNHLPEDRGSRVGGWGLKFLKQIEAKQTHDSPKTLGQSDCCARN
        RTSNSSK R+ PN NRS VIK+SCPWEPYSDKLNNVDDLICARDLPP LELAAV+VR+HLPEDRGSR GGWGLKFLKQIE+KQT DSPKT  Q+DCC R+
Subjt:  RTSNSSKQRNKPNFNRSVVIKDSCPWEPYSDKLNNVDDLICARDLPPILELAAVVVRNHLPEDRGSRVGGWGLKFLKQIEAKQTHDSPKTLGQSDCCARN

Query:  SGNCSTSMDILIPAGLHGGPRTKNGGPSSLMERWRSGGLCDCGGWDIGCPLTVLTGQSISGDNSPQADTQECRAFNIYAKMALHDELCFSDCSYKFYASD
        SG CSTSMDILIPAGLHGGPRTKNGGPSSL ERWRSGGLCDCGGWDIGCP+TVL GQS + D  PQAD QECRAFNI+AK                    
Subjt:  SGNCSTSMDILIPAGLHGGPRTKNGGPSSLMERWRSGGLCDCGGWDIGCPLTVLTGQSISGDNSPQADTQECRAFNIYAKMALHDELCFSDCSYKFYASD

Query:  NKYTTIVFQGSENGPPSWRMVNIRDGL
                 GSE+GPP+ RMVNIRD L
Subjt:  NKYTTIVFQGSENGPPSWRMVNIRDGL

XP_038904737.1 uncharacterized protein LOC120091019 [Benincasa hispida]1.8e-28279.74Show/hide
Query:  MEQTSFDAQKNRRGKIGSFCDRNK-EELLENGDLDLSK-SISRGLNKRVMLPHALYLKLKQYRISKSYVHDSFFNCSIGLDYKVPKYMVTIDEKYLRRCL
        MEQTS DA+K+RR KIGSFCDR+K EELLEN DL LSK SISRGLNK+V LPHALYLKLKQYRISKS+VHDS FNC+IGLD+KVPKYMVTIDEKYLRRCL
Subjt:  MEQTSFDAQKNRRGKIGSFCDRNK-EELLENGDLDLSK-SISRGLNKRVMLPHALYLKLKQYRISKSYVHDSFFNCSIGLDYKVPKYMVTIDEKYLRRCL

Query:  ELIQISALKPARCNTSINLSSVKTGALSESLSMTKLRTRGMGGLERFIIACPSAGEDGNTVISSRNLWFVGSIMGSKSMINILKSPLLHQFSITDSTSNF
        ELIQISA KPARCN SI+ SSVKTGAL+ESLS  KLRTRGMG +ERFIIACPS GEDGNTV+SS  LW+VGSIMGSKSMINILKSPLLHQ  IT+ TSN 
Subjt:  ELIQISALKPARCNTSINLSSVKTGALSESLSMTKLRTRGMGGLERFIIACPSAGEDGNTVISSRNLWFVGSIMGSKSMINILKSPLLHQFSITDSTSNF

Query:  IRMDLNDIKGFTGSNFMDSPGGVDISSLKKLENAKLTMGSHQDGSDTANERFFSTPSRNSLCSDQSSSGSASTPLCQGMLQFTWKEGSPHFIFSVDDEKE
        IRMDLNDIKGFTGSNFMDSP GVDISSLKKL+NAK   G+ QD SD ANERFFSTPSRNSLCSDQSSS SASTPLCQGMLQFTWK+GSP+FIFSVDDEKE
Subjt:  IRMDLNDIKGFTGSNFMDSPGGVDISSLKKLENAKLTMGSHQDGSDTANERFFSTPSRNSLCSDQSSSGSASTPLCQGMLQFTWKEGSPHFIFSVDDEKE

Query:  VYVASSSKVASTDNKALDYVYLFHSTKSGLKDHEVTNSRPCIVGKMTVSTSYSVCPNNSRISDTEFVLFGGIKNSDLEMNTSNNILKKNKVFPRKVAEVF
        VYVASS KV S DN ++DYVYLF STKS LKDHEV N RP IVGKMTVSTSYSVCPNNS+I+DTEFVLFGGI+NSDLE+N SN ILKKNKVFPRKVAEVF
Subjt:  VYVASSSKVASTDNKALDYVYLFHSTKSGLKDHEVTNSRPCIVGKMTVSTSYSVCPNNSRISDTEFVLFGGIKNSDLEMNTSNNILKKNKVFPRKVAEVF

Query:  RTSNSSKQRNKPNFNRSVVIKDSCPWEPYSDKLNNVDDLICARDLPPILELAAVVVRNHLPEDRGSRVGGWGLKFLKQIEAKQTHDSPKTLGQSDCCARN
        RTSNSSKQR+ PN NRS V+KDSCPWEPY+DKLN+VDDLICARDLPP LELAA+VVR+HLPEDRGSRVGGWGLKFLKQ++AKQT+DS +T  Q+DCCARN
Subjt:  RTSNSSKQRNKPNFNRSVVIKDSCPWEPYSDKLNNVDDLICARDLPPILELAAVVVRNHLPEDRGSRVGGWGLKFLKQIEAKQTHDSPKTLGQSDCCARN

Query:  SGNCSTSMDILIPAGLHGGPRTKNGGPSSLMERWRSGGLCDCGGWDIGCPLTVLTGQSISGDNSPQADTQECRAFNIYAKMALHDELCFSDCSYKFYASD
        SG CSTSMDILIPAGLHGGPRT+NGGPS+L ERWRSGGLCDCGGWDIGCPLT+  GQS++GD   QADTQECRAFNI+ K                    
Subjt:  SGNCSTSMDILIPAGLHGGPRTKNGGPSSLMERWRSGGLCDCGGWDIGCPLTVLTGQSISGDNSPQADTQECRAFNIYAKMALHDELCFSDCSYKFYASD

Query:  NKYTTIVFQGSENGPPSWRMVNIRDGL
                 G ENGPP+ RMVNIRDGL
Subjt:  NKYTTIVFQGSENGPPSWRMVNIRDGL

TrEMBL top hitse value%identityAlignment
A0A0A0L4A7 Uncharacterized protein5.1e-27878.63Show/hide
Query:  MEQTSFDAQKNRRGKIGSFCDRNK-EELLENGDLDLSK-SISRGLNKRVMLPHALYLKLKQYRISKSYVHDSFFNCSIGLDYKVPKYMVTIDEKYLRRCL
        MEQTS DA+KN R KIGSFC RNK EE LENGDL LSK SISRGLNK++MLPHALYLKLKQ RIS+SYVHDSFFNC+IGLDYKVPKYMVTIDEKYLRRCL
Subjt:  MEQTSFDAQKNRRGKIGSFCDRNK-EELLENGDLDLSK-SISRGLNKRVMLPHALYLKLKQYRISKSYVHDSFFNCSIGLDYKVPKYMVTIDEKYLRRCL

Query:  ELIQISALKPARCNTSINLSSVKTGALSESLSMTKLRTRGMGGLERFIIACPSAGEDGNTVISSRNLWFVGSIMGSKSMINILKSPLLHQFSITDSTSNF
        ELIQ SALK ARCN SI+LSSVKTGAL+ESLS+ KL+TRGM  +ERFII CPSAGED NTV+SS  +WFVGSIMGSKSMINILKSPLLHQ  IT+ TSN 
Subjt:  ELIQISALKPARCNTSINLSSVKTGALSESLSMTKLRTRGMGGLERFIIACPSAGEDGNTVISSRNLWFVGSIMGSKSMINILKSPLLHQFSITDSTSNF

Query:  IRMDLNDIKGFTGSNFMDSPGGVDISSLKKLENAKLTMGSHQDGSDTANERFFSTPSRNSLCSDQSSSGSASTPLCQGMLQFTWKEGSPHFIFSVDDEKE
        IRMDLNDIKGFTGSNFMDSPG VDISSLK L+N K    SHQDGSD ANERFFSTPSRNSLCSDQSSSGSAST LCQGMLQFTWK+GSP+FIFSVDDEKE
Subjt:  IRMDLNDIKGFTGSNFMDSPGGVDISSLKKLENAKLTMGSHQDGSDTANERFFSTPSRNSLCSDQSSSGSASTPLCQGMLQFTWKEGSPHFIFSVDDEKE

Query:  VYVASSSKVASTDNKALDYVYLFHSTKSGLKDHEVTNSRPCIVGKMTVSTSYSVCPNNSRISDTEFVLFGGIKNSDLEMNTSNNILKKNKVFPRKVAEVF
        VYVASSSKV S DN ALDYVYLF S KSGLKDHEV NSRPCIVGKMTVSTSY VC NNS+I+DTEFVLFGGI+NSDLE++ SN +LKKNKVFPRKVAEVF
Subjt:  VYVASSSKVASTDNKALDYVYLFHSTKSGLKDHEVTNSRPCIVGKMTVSTSYSVCPNNSRISDTEFVLFGGIKNSDLEMNTSNNILKKNKVFPRKVAEVF

Query:  RTSNSSKQRNKPNFNRSVVIKDSCPWEPYSDKLNNVDDLICARDLPPILELAAVVVRNHLPEDRGSRVGGWGLKFLKQIEAKQTHDSPKTLGQSDCCARN
        RTSNSSK RN PN NRS V+KD CPWEPYSDKLN+ DDLICARDLPP LELAA+VVR+HLPED GSRVGGWGLKFLKQ +AKQT++S  T  Q+DCC RN
Subjt:  RTSNSSKQRNKPNFNRSVVIKDSCPWEPYSDKLNNVDDLICARDLPPILELAAVVVRNHLPEDRGSRVGGWGLKFLKQIEAKQTHDSPKTLGQSDCCARN

Query:  SGNCSTSMDILIPAGLHGGPRTKNGGPSSLMERWRSGGLCDCGGWDIGCPLTVLTGQSISGDNSPQADTQECRAFNIYAKMALHDELCFSDCSYKFYASD
        SG CSTSMDILIPAGLHGGPRT+NGGPS+L ERW+SGG+CDCGGWDIGCPLT+L GQS++ D   QADTQECRAFNI+AK                    
Subjt:  SGNCSTSMDILIPAGLHGGPRTKNGGPSSLMERWRSGGLCDCGGWDIGCPLTVLTGQSISGDNSPQADTQECRAFNIYAKMALHDELCFSDCSYKFYASD

Query:  NKYTTIVFQGSENGPPSWRMVNIRDGL
                 G EN PP+ RMVNIRDGL
Subjt:  NKYTTIVFQGSENGPPSWRMVNIRDGL

A0A1S3AX31 uncharacterized protein LOC1034835542.4e-27277.35Show/hide
Query:  MEQTSFDAQKNRRGKIGSFCDRNK-EELLENGDLDLSK-SISRGLNKRVMLPHALYLKLKQYRISKSYVHDSFFNCSIGLDYKVPKYMVTIDEKYLRRCL
        MEQTS DA K+ R KIGSFCDRNK EE LENGDL LSK SISRGLNK++MLPHALYLKLKQ RIS+SYVHDS FNC+IGLDY+VPK MVTIDEKY+RRCL
Subjt:  MEQTSFDAQKNRRGKIGSFCDRNK-EELLENGDLDLSK-SISRGLNKRVMLPHALYLKLKQYRISKSYVHDSFFNCSIGLDYKVPKYMVTIDEKYLRRCL

Query:  ELIQISALKPARCNTSINLSSVKTGALSESLSMTKLRTRGMGGLERFIIACPSAGEDGNTVISSRNLWFVGSIMGSKSMINILKSPLLHQFSITDSTSNF
        ELIQ SA K ARCN SI+LSSVKT AL+ESLS+ KLRTRGMG +ERFII CPSA EDGNTV+SS  +WFVGSIMGSKSMINILKSPLLHQ  IT+ TSN 
Subjt:  ELIQISALKPARCNTSINLSSVKTGALSESLSMTKLRTRGMGGLERFIIACPSAGEDGNTVISSRNLWFVGSIMGSKSMINILKSPLLHQFSITDSTSNF

Query:  IRMDLNDIKGFTGSNFMDSPGGVDISSLKKLENAKLTMGSHQDGSDTANERFFSTPSRNSLCSDQSSSGSASTPLCQGMLQFTWKEGSPHFIFSVDDEKE
        IRMDLNDIKGFTGSN MDSPGGV+ISSLK L NAK    SHQD SD ANERFFSTPSRNSLCSDQSSSGSAST LCQGMLQFTWK+G+P+FIFSVDDEKE
Subjt:  IRMDLNDIKGFTGSNFMDSPGGVDISSLKKLENAKLTMGSHQDGSDTANERFFSTPSRNSLCSDQSSSGSASTPLCQGMLQFTWKEGSPHFIFSVDDEKE

Query:  VYVASSSKVASTDNKALDYVYLFHSTKSGLKDHEVTNSRPCIVGKMTVSTSYSVCPNNSRISDTEFVLFGGIKNSDLEMNTSNNILKKNKVFPRKVAEVF
        VYVASSSKV S +N ALDYVYLF S KSGLKDHEV NSRPCIVGKMTVSTSY VCPNNS+I+DTEFVLFGGI+NSDLE+N SN +LKKNKVFPRKVAEVF
Subjt:  VYVASSSKVASTDNKALDYVYLFHSTKSGLKDHEVTNSRPCIVGKMTVSTSYSVCPNNSRISDTEFVLFGGIKNSDLEMNTSNNILKKNKVFPRKVAEVF

Query:  RTSNSSKQRNKPNFNRSVVIKDSCPWEPYSDKLNNVDDLICARDLPPILELAAVVVRNHLPEDRGSRVGGWGLKFLKQIEAKQTHDSPKTLGQSDCCARN
        RTSNSSKQR+  N NRS  +KDSCPWEP SDKLN+ DDL+CARDLPP LELAA+VVR+HLPEDRGSRVGGWGLKFLKQ +AKQ ++S  T  Q+DCC RN
Subjt:  RTSNSSKQRNKPNFNRSVVIKDSCPWEPYSDKLNNVDDLICARDLPPILELAAVVVRNHLPEDRGSRVGGWGLKFLKQIEAKQTHDSPKTLGQSDCCARN

Query:  SGNCSTSMDILIPAGLHGGPRTKNGGPSSLMERWRSGGLCDCGGWDIGCPLTVLTGQSISGDNSPQADTQECRAFNIYAKMALHDELCFSDCSYKFYASD
        SG CSTSMDILIPAGLHGGPRT+NGGPS+L ERW+SGG+CDCGGWDIGCPLT+L GQS++ D   QAD QECRAFNI+AK                    
Subjt:  SGNCSTSMDILIPAGLHGGPRTKNGGPSSLMERWRSGGLCDCGGWDIGCPLTVLTGQSISGDNSPQADTQECRAFNIYAKMALHDELCFSDCSYKFYASD

Query:  NKYTTIVFQGSENGPPSWRMVNIRDGL
                 G ENGPP+ RMVNIRDGL
Subjt:  NKYTTIVFQGSENGPPSWRMVNIRDGL

A0A5A7U1N3 Uncharacterized protein6.4e-27377.35Show/hide
Query:  MEQTSFDAQKNRRGKIGSFCDRNK-EELLENGDLDLSK-SISRGLNKRVMLPHALYLKLKQYRISKSYVHDSFFNCSIGLDYKVPKYMVTIDEKYLRRCL
        MEQTS DA K+ R KIGSFCDRNK EE LENGDL LSK SISRGLNK++MLPHALYLKLKQ RIS+SYVHDS FNC+IGLDY+VPK MVTIDEKY+RRCL
Subjt:  MEQTSFDAQKNRRGKIGSFCDRNK-EELLENGDLDLSK-SISRGLNKRVMLPHALYLKLKQYRISKSYVHDSFFNCSIGLDYKVPKYMVTIDEKYLRRCL

Query:  ELIQISALKPARCNTSINLSSVKTGALSESLSMTKLRTRGMGGLERFIIACPSAGEDGNTVISSRNLWFVGSIMGSKSMINILKSPLLHQFSITDSTSNF
        ELIQ SA K ARCN SI+LSSVKT AL+ESLS+ KLRTRGMG +ERFII CPSA EDGNTV+SS  +WFVGSIMGSKSMINILKSPLLHQ  IT+ TSN 
Subjt:  ELIQISALKPARCNTSINLSSVKTGALSESLSMTKLRTRGMGGLERFIIACPSAGEDGNTVISSRNLWFVGSIMGSKSMINILKSPLLHQFSITDSTSNF

Query:  IRMDLNDIKGFTGSNFMDSPGGVDISSLKKLENAKLTMGSHQDGSDTANERFFSTPSRNSLCSDQSSSGSASTPLCQGMLQFTWKEGSPHFIFSVDDEKE
        IRMDLNDIKGFTGSN MDSPGGV+ISSLK L NAK    SHQD SD ANERFFSTPSRNSLCSDQSSSGSAST LCQGMLQFTWK+G+P+FIFSVDDEKE
Subjt:  IRMDLNDIKGFTGSNFMDSPGGVDISSLKKLENAKLTMGSHQDGSDTANERFFSTPSRNSLCSDQSSSGSASTPLCQGMLQFTWKEGSPHFIFSVDDEKE

Query:  VYVASSSKVASTDNKALDYVYLFHSTKSGLKDHEVTNSRPCIVGKMTVSTSYSVCPNNSRISDTEFVLFGGIKNSDLEMNTSNNILKKNKVFPRKVAEVF
        VYVASSSKV S +N ALDYVYLF S KSGLKDHEV NSRPCIVGKMTVSTSY VCPNNS+I+DTEFVLFGGI+NSDLE+N SN +LKKNKVFPRKVAEVF
Subjt:  VYVASSSKVASTDNKALDYVYLFHSTKSGLKDHEVTNSRPCIVGKMTVSTSYSVCPNNSRISDTEFVLFGGIKNSDLEMNTSNNILKKNKVFPRKVAEVF

Query:  RTSNSSKQRNKPNFNRSVVIKDSCPWEPYSDKLNNVDDLICARDLPPILELAAVVVRNHLPEDRGSRVGGWGLKFLKQIEAKQTHDSPKTLGQSDCCARN
        RTSNSSKQR+ PN NRS  +KDSCPWEP SDKLN+ DDL+CARDLPP LELAA+VVR+HLPEDRGSRVGGWGLKFLKQ +AKQ ++S  T  Q+DCC RN
Subjt:  RTSNSSKQRNKPNFNRSVVIKDSCPWEPYSDKLNNVDDLICARDLPPILELAAVVVRNHLPEDRGSRVGGWGLKFLKQIEAKQTHDSPKTLGQSDCCARN

Query:  SGNCSTSMDILIPAGLHGGPRTKNGGPSSLMERWRSGGLCDCGGWDIGCPLTVLTGQSISGDNSPQADTQECRAFNIYAKMALHDELCFSDCSYKFYASD
        SG CSTSMDILIPAGLHGGPRT+NGGPS+L ERW+SGG+CDCGGWDIGCPLT+L GQS++ D   QAD QECRAFNI+AK                    
Subjt:  SGNCSTSMDILIPAGLHGGPRTKNGGPSSLMERWRSGGLCDCGGWDIGCPLTVLTGQSISGDNSPQADTQECRAFNIYAKMALHDELCFSDCSYKFYASD

Query:  NKYTTIVFQGSENGPPSWRMVNIRDGL
                 G ENGPP+ RMVNIR+GL
Subjt:  NKYTTIVFQGSENGPPSWRMVNIRDGL

A0A5D3D256 Uncharacterized protein4.9e-27377.35Show/hide
Query:  MEQTSFDAQKNRRGKIGSFCDRNK-EELLENGDLDLSK-SISRGLNKRVMLPHALYLKLKQYRISKSYVHDSFFNCSIGLDYKVPKYMVTIDEKYLRRCL
        MEQTS DA K+ R KIGSFCDRNK EE LENGDL LSK SISRGLNK++MLPHALYLKLKQ RIS+SYVHDS FNC+IGLDY+VPK MVTIDEKY+RRCL
Subjt:  MEQTSFDAQKNRRGKIGSFCDRNK-EELLENGDLDLSK-SISRGLNKRVMLPHALYLKLKQYRISKSYVHDSFFNCSIGLDYKVPKYMVTIDEKYLRRCL

Query:  ELIQISALKPARCNTSINLSSVKTGALSESLSMTKLRTRGMGGLERFIIACPSAGEDGNTVISSRNLWFVGSIMGSKSMINILKSPLLHQFSITDSTSNF
        ELIQ SA K ARCN SI+LSSVKT AL+ESLS+ KLRTRGMG +ERFII CPSA EDGNTV+SS  +WFVGSIMGSKSMINILKSPLLHQ  IT+ TSN 
Subjt:  ELIQISALKPARCNTSINLSSVKTGALSESLSMTKLRTRGMGGLERFIIACPSAGEDGNTVISSRNLWFVGSIMGSKSMINILKSPLLHQFSITDSTSNF

Query:  IRMDLNDIKGFTGSNFMDSPGGVDISSLKKLENAKLTMGSHQDGSDTANERFFSTPSRNSLCSDQSSSGSASTPLCQGMLQFTWKEGSPHFIFSVDDEKE
        IRMDLNDIKGFTGSN MDSPGGV+ISSLK L NAK    SHQD SD ANERFFSTPSRNSLCSDQSSSGSAST LCQGMLQFTWK+G+P+FIFSVDDEKE
Subjt:  IRMDLNDIKGFTGSNFMDSPGGVDISSLKKLENAKLTMGSHQDGSDTANERFFSTPSRNSLCSDQSSSGSASTPLCQGMLQFTWKEGSPHFIFSVDDEKE

Query:  VYVASSSKVASTDNKALDYVYLFHSTKSGLKDHEVTNSRPCIVGKMTVSTSYSVCPNNSRISDTEFVLFGGIKNSDLEMNTSNNILKKNKVFPRKVAEVF
        VYVASSSKV S +N ALDYVYLF S KSGLKDHEV NSRPCIVGKMTVSTSY VCPNNS+I+DTEFVLFGGI+NSDLE+N SN +LKKNKVFPRKVAEVF
Subjt:  VYVASSSKVASTDNKALDYVYLFHSTKSGLKDHEVTNSRPCIVGKMTVSTSYSVCPNNSRISDTEFVLFGGIKNSDLEMNTSNNILKKNKVFPRKVAEVF

Query:  RTSNSSKQRNKPNFNRSVVIKDSCPWEPYSDKLNNVDDLICARDLPPILELAAVVVRNHLPEDRGSRVGGWGLKFLKQIEAKQTHDSPKTLGQSDCCARN
        RTSNSSKQR+ PN NRS  +KDSCPWEP SDKLN+ DDL+CARDLPP LELAA+VVR+HLPEDRGSRVGGWGLKFLKQ +AKQ ++S  T  Q+DCC RN
Subjt:  RTSNSSKQRNKPNFNRSVVIKDSCPWEPYSDKLNNVDDLICARDLPPILELAAVVVRNHLPEDRGSRVGGWGLKFLKQIEAKQTHDSPKTLGQSDCCARN

Query:  SGNCSTSMDILIPAGLHGGPRTKNGGPSSLMERWRSGGLCDCGGWDIGCPLTVLTGQSISGDNSPQADTQECRAFNIYAKMALHDELCFSDCSYKFYASD
        SG CSTSMDILIPAGLHGGPRT+NGGPS+L ERW+SGG+CDCGGWDIGCPLT+L GQS++ D   QAD QECRAFNI+AK                    
Subjt:  SGNCSTSMDILIPAGLHGGPRTKNGGPSSLMERWRSGGLCDCGGWDIGCPLTVLTGQSISGDNSPQADTQECRAFNIYAKMALHDELCFSDCSYKFYASD

Query:  NKYTTIVFQGSENGPPSWRMVNIRDGL
                 G ENGPP+ RMVNIR+GL
Subjt:  NKYTTIVFQGSENGPPSWRMVNIRDGL

A0A6J1CZF1 uncharacterized protein LOC1110158911.6e-28480.86Show/hide
Query:  MEQTSFDAQKNRRGKIGSFCDRNKE-ELLENGDLDLSKSISRGLNKRVMLPHALYLKLKQYRISKSYVHDSFFNCSIGLDYKVPKYMVTIDEKYLRRCLE
        MEQTSFD +K+++GKI SFCDRNK+ E+LENGDL LSKS+SRGLNKRVMLP ALYLKLKQ+RISK+YVHDSFFNCSIGLD KVPKYMVTIDEKYLRRCLE
Subjt:  MEQTSFDAQKNRRGKIGSFCDRNKE-ELLENGDLDLSKSISRGLNKRVMLPHALYLKLKQYRISKSYVHDSFFNCSIGLDYKVPKYMVTIDEKYLRRCLE

Query:  LIQISALKPARCNTSINLSSVKTGALSESLS-MTKLRTRGMGGLERFIIACPSAGEDGNTVISSRNLWFVGSIMGSKSMINILKSPLLHQFSITDSTSNF
        LIQIS+ KPARCN SINLSS K GALS SLS M+KLRTRGM  LERF++ CPS  EDGN V+SS  LW+VGSIM S+SMINILKSPLLHQF ITDSTSNF
Subjt:  LIQISALKPARCNTSINLSSVKTGALSESLS-MTKLRTRGMGGLERFIIACPSAGEDGNTVISSRNLWFVGSIMGSKSMINILKSPLLHQFSITDSTSNF

Query:  IRMDLNDIKGFTGSNFMDSPGGVDISSLKKLENAKLTMGSHQDGSDTANERFFSTPSRNSLCSDQSSSGSASTPLCQGMLQFTWKEGSPHFIFSVDDEKE
        IRMDLNDIKGFTGS F DS GGVDISSLKKLENAK T  SHQDGSDTANERFF TPSR SLCSDQSSSGSAS PLCQGMLQFTWKEGSPHFIFSVDDEKE
Subjt:  IRMDLNDIKGFTGSNFMDSPGGVDISSLKKLENAKLTMGSHQDGSDTANERFFSTPSRNSLCSDQSSSGSASTPLCQGMLQFTWKEGSPHFIFSVDDEKE

Query:  VYVASSSKVASTDNKALDYVYLFHSTKSGLKDHEVTNSRPCIVGKMTVSTSYSVCPNNSRISDTEFVLFGGIKNSDLEMNTSNNILKKNKVFPRKVAEVF
        VYVASSSKVAS DNKALDYVY FHS KS LKDHE  +SRP IVGKMTVSTSYSVC NNSRI+DTEFVLFGGI+NSDLE+NTSNNILKKNKVFPRKVAEVF
Subjt:  VYVASSSKVASTDNKALDYVYLFHSTKSGLKDHEVTNSRPCIVGKMTVSTSYSVCPNNSRISDTEFVLFGGIKNSDLEMNTSNNILKKNKVFPRKVAEVF

Query:  RTSNSSKQRNKPNFNRSVVIKDSCPWEPYSDKLNNVDDLICARDLPPILELAAVVVRNHLPEDRGSRVGGWGLKFLKQIEAKQTHDSPKTLGQSDCCARN
        RTSNSSK R+ PN NRS VIK+SCPWEPYSDKLNNVDDLICARDLPP LELAAV+VR+HLPEDRGSR GGWGLKFLKQIE+KQT DSPKT  Q+DCC R+
Subjt:  RTSNSSKQRNKPNFNRSVVIKDSCPWEPYSDKLNNVDDLICARDLPPILELAAVVVRNHLPEDRGSRVGGWGLKFLKQIEAKQTHDSPKTLGQSDCCARN

Query:  SGNCSTSMDILIPAGLHGGPRTKNGGPSSLMERWRSGGLCDCGGWDIGCPLTVLTGQSISGDNSPQADTQECRAFNIYAKMALHDELCFSDCSYKFYASD
        SG CSTSMDILIPAGLHGGPRTKNGGPSSL ERWRSGGLCDCGGWDIGCP+TVL GQS + D  PQAD QECRAFNI+AK                    
Subjt:  SGNCSTSMDILIPAGLHGGPRTKNGGPSSLMERWRSGGLCDCGGWDIGCPLTVLTGQSISGDNSPQADTQECRAFNIYAKMALHDELCFSDCSYKFYASD

Query:  NKYTTIVFQGSENGPPSWRMVNIRDGL
                 GSE+GPP+ RMVNIRD L
Subjt:  NKYTTIVFQGSENGPPSWRMVNIRDGL

SwissProt top hitse value%identityAlignment
O49195 Vegetative storage protein 12.6e-3741.26Show/hide
Query:  KNGLKISLKNYCESWRLNVELHNIRSFQVVPEECVSYIGKY-VTSTQYRVDSERTIEECIVYLTKACSLKGDGTDAWIFDVDDTLISTVPYYKKNQYGGK
        K GL I+  N C SW L VE  NI +F  VP  C +Y+  Y +TS QY+ DS+   +E   Y  K  +LK D  + WIFD+DDTL+S++PYY K  YG +
Subjt:  KNGLKISLKNYCESWRLNVELHNIRSFQVVPEECVSYIGKY-VTSTQYRVDSERTIEECIVYLTKACSLKGDGTDAWIFDVDDTLISTVPYYKKNQYGGK

Query:  KLNLTDLEAWM-SKANAPVLGHTLRLFNLLKARGVDIILISARREGLRSATIENLVQVGYHGWTNLILRSPEDETKEVEKYKAEVRRRLVNGGYRIWGIV
                +W+ S  + P L  TL L+  L   G++ I+IS R + L   T+ENL  VG   W +LIL+    +  +V  YK++VR  LV  GY I G +
Subjt:  KLNLTDLEAWM-SKANAPVLGHTLRLFNLLKARGVDIILISARREGLRSATIENLVQVGYHGWTNLILRSPEDETKEVEKYKAEVRRRLVNGGYRIWGIV

Query:  GDQYSSIEGSPSGKRTFKLPNPI
        GDQ++ +     G R FKLPNP+
Subjt:  GDQYSSIEGSPSGKRTFKLPNPI

P10742 Stem 31 kDa glycoprotein (Fragment)6.4e-4443.13Show/hide
Query:  CESWRLNVELHNIRSFQVVPEECVSYIGKYVTSTQYRVDSERTIEECIVYLTKACSLKGDGTDAWIFDVDDTLISTVPYYKKNQYGGKKLNLTDLEAWMS
        C SWRL VE HNI  F+ +PEECV    +Y+   QYR DS+   ++   Y   A  L+    D ++F +D T++S +PYYKK+ YG +K N T  + W++
Subjt:  CESWRLNVELHNIRSFQVVPEECVSYIGKYVTSTQYRVDSERTIEECIVYLTKACSLKGDGTDAWIFDVDDTLISTVPYYKKNQYGGKKLNLTDLEAWMS

Query:  KANAPVLGHTLRLFNLLKARGVDIILISARREGLRSATIENLVQVGYHGWTNLILRSPED-ETKEVEKYKAEVRRRLVNGGYRIWGIVGDQYSSIEGSPS
        K NAP L  TL+ +N L + G  II +S R    ++ T  NL + GYH W  LIL+ P+D  T     YK   R +L+  GY I GI+GDQ+S + G   
Subjt:  KANAPVLGHTLRLFNLLKARGVDIILISARREGLRSATIENLVQVGYHGWTNLILRSPED-ETKEVEKYKAEVRRRLVNGGYRIWGIVGDQYSSIEGSPS

Query:  GK-RTFKLPNP
        G+ RTFKLPNP
Subjt:  GK-RTFKLPNP

P10743 Stem 31 kDa glycoprotein1.5e-4040.28Show/hide
Query:  CESWRLNVELHNIRSFQVVPEECVSYIGKYVTSTQYRVDSERTIEECIVYLTKACSLKGDGTDAWIFDVDDTLISTVPYYKKNQYGGKKLNLTDLEAWMS
        C S+RL VE HNIR+F+ +PEECV     Y+   Q+R DS+   ++   Y   A   +    D +IF +D+T++S +PYY+K+ YG ++ N T  + W++
Subjt:  CESWRLNVELHNIRSFQVVPEECVSYIGKYVTSTQYRVDSERTIEECIVYLTKACSLKGDGTDAWIFDVDDTLISTVPYYKKNQYGGKKLNLTDLEAWMS

Query:  KANAPVLGHTLRLFNLLKARGVDIILISARREGLRSATIENLVQVGYHGWTNLILRSPEDETKEVEKYKAEVRRRLVNGGYRIWGIVGDQYSSIEGSPSG
        K +AP L  TL+ +N L + G  I+ +S R     + T  NL + G+H W  LIL+ P   T     YK+ +R  L+  GYRI GI+GDQ+S + G   G
Subjt:  KANAPVLGHTLRLFNLLKARGVDIILISARREGLRSATIENLVQVGYHGWTNLILRSPEDETKEVEKYKAEVRRRLVNGGYRIWGIVGDQYSSIEGSPSG

Query:  K-RTFKLPNPI
        + RTFKLPNP+
Subjt:  K-RTFKLPNPI

P15490 Stem 28 kDa glycoprotein2.9e-4442.92Show/hide
Query:  CESWRLNVELHNIRSFQVVPEECVSYIGKYVTSTQYRVDSERTIEECIVYLTKACSLKGDGTDAWIFDVDDTLISTVPYYKKNQYGGKKLNLTDLEAWMS
        C SWRL VE HNI  F+ +PEECV    +Y+   QYR DS+   ++   Y   A  L+    D ++F +D T++S +PYYKK+ YG +K N T  + W++
Subjt:  CESWRLNVELHNIRSFQVVPEECVSYIGKYVTSTQYRVDSERTIEECIVYLTKACSLKGDGTDAWIFDVDDTLISTVPYYKKNQYGGKKLNLTDLEAWMS

Query:  KANAPVLGHTLRLFNLLKARGVDIILISARREGLRSATIENLVQVGYHGWTNLILRSPED-ETKEVEKYKAEVRRRLVNGGYRIWGIVGDQYSSIEGSPS
        K NAP L  TL+ +N L + G  II +S R    ++ T  NL + GYH W  LIL+ P+D  T     YK   R +L+  GY I GI+GDQ+S + G   
Subjt:  KANAPVLGHTLRLFNLLKARGVDIILISARREGLRSATIENLVQVGYHGWTNLILRSPED-ETKEVEKYKAEVRRRLVNGGYRIWGIVGDQYSSIEGSPS

Query:  GK-RTFKLPNPI
        G+ RTFKLPNP+
Subjt:  GK-RTFKLPNPI

P27061 Acid phosphatase 12.9e-4439.52Show/hide
Query:  CESWRLNVELHNIRSFQVVPEECVSYIGKYVTSTQYRVDSERTIEECIVYLTKACSLKGDGTDAWIFDVDDTLISTVPYYKKNQYGGKKLNLTDLEAWMS
        C +WR  VE +N+  ++ +PEEC  Y+ +Y+    Y+++ +R  +E   Y  K+  L  DG D WIFDVD+TL+S +PYY  ++YG +  +  + + W+ 
Subjt:  CESWRLNVELHNIRSFQVVPEECVSYIGKYVTSTQYRVDSERTIEECIVYLTKACSLKGDGTDAWIFDVDDTLISTVPYYKKNQYGGKKLNLTDLEAWMS

Query:  KANAPVLGHTLRLFNLLKARGVDIILISARREGLRSATIENLVQVGYHGWTNLILRSPEDETKEVEKYKAEVRRRLVNGGYRIWGIVGDQYSSIEGSPSG
           AP LG +L+L+  +   G  + L++ R E  RS T+ENL+  G+H W  LILR  +D  K    YK+E R  +V  G+RI G  GDQ+S + GS   
Subjt:  KANAPVLGHTLRLFNLLKARGVDIILISARREGLRSATIENLVQVGYHGWTNLILRSPEDETKEVEKYKAEVRRRLVNGGYRIWGIVGDQYSSIEGSPSG

Query:  KRTFKLPNPI
         R+FKLPNP+
Subjt:  KRTFKLPNPI

Arabidopsis top hitse value%identityAlignment
AT1G04040.1 HAD superfamily, subfamily IIIB acid phosphatase1.0e-7653.97Show/hide
Query:  VSVTAAADWNILNQ-------RSKN-----GLKISLKNYCESWRLNVELHNIRSFQVVPEECVSYIGKYVTSTQYRVDSERTIEECIVYLTKACSLKG--
        V V +A DWNILNQ        S+N     G+K  LK YCESWR+NVE+HNIR F VVP+ECVS+I  Y+TS+QY+ D  RT++E I++    C  K   
Subjt:  VSVTAAADWNILNQ-------RSKN-----GLKISLKNYCESWRLNVELHNIRSFQVVPEECVSYIGKYVTSTQYRVDSERTIEECIVYLTKACSLKG--

Query:  DGTDAWIFDVDDTLISTVPYYKKNQ-YGGKKLNLTDLEAWMSKANAPVLGHTLRLFNLLKARGVDIILISARREGLRSATIENLVQVGYHGWTNLILRSP
        DG DAWIFD+DDTL+ST+PY+KKN  +GG+KLN T  E W+ K  AP + H  +L++ ++ RG+ I LIS+R+E LRSAT++NL+Q GY+GW+NL+LR  
Subjt:  DGTDAWIFDVDDTLISTVPYYKKNQ-YGGKKLNLTDLEAWMSKANAPVLGHTLRLFNLLKARGVDIILISARREGLRSATIENLVQVGYHGWTNLILRSP

Query:  EDETKEVEKYKAEVRRRLVNGGYRIWGIVGDQYSSIEGSPSGKRTFKLPNPI
        ED+ KEV++YK+E R+ L++ GYR+WG++GDQ+SS  G P  +RTFKLPN I
Subjt:  EDETKEVEKYKAEVRRRLVNGGYRIWGIVGDQYSSIEGSPSGKRTFKLPNPI

AT2G33360.1 Protein of unknown function (DUF3527)1.3e-8133.76Show/hide
Query:  DRNKEELLENGDLDLSKSISRGLNKRVMLPHALYLKLKQ----------YRISKSYVHDSFFNCSIGLDYKVPKYMVTIDEKYLRRCLELIQISALKPAR
        D  K++L     L++ +++S G N  +   + L  K +Q           +I K+    SF   +   D K+P+ +V++DEKYLRRCL+LI ISA K A 
Subjt:  DRNKEELLENGDLDLSKSISRGLNKRVMLPHALYLKLKQ----------YRISKSYVHDSFFNCSIGLDYKVPKYMVTIDEKYLRRCLELIQISALKPAR

Query:  CNTSINLSSVKTGALSESLSMTKLRTRGMGGLERFIIACPSAGEDGNTVISSRNLWFVGSIMGSKSMINILKSPLLHQFSITDSTSNFIRMDLNDIKGFT
        C+ S+NL   K    S    +  +    +    R +   P   + G+ VIS        +I+G K +  +L  P LH     D   N    ++   +   
Subjt:  CNTSINLSSVKTGALSESLSMTKLRTRGMGGLERFIIACPSAGEDGNTVISSRNLWFVGSIMGSKSMINILKSPLLHQFSITDSTSNFIRMDLNDIKGFT

Query:  GSNFMDSPGGVDISSLKKLENAKLTMGSHQDGSDTANERFFSTPSRNSLCSDQSSSG-SASTPLCQGMLQFTWKEG-SPHFIFSVDDEKEVYVASSSKV-
        G + + S   V     K   +++  +  H            ST S +S  S+QSSS  S S+ + QG LQFT K+  +PHF+FS+DD+KE+YVAS S   
Subjt:  GSNFMDSPGGVDISSLKKLENAKLTMGSHQDGSDTANERFFSTPSRNSLCSDQSSSG-SASTPLCQGMLQFTWKEG-SPHFIFSVDDEKEVYVASSSKV-

Query:  --ASTDNKALDYVYLFHSTKSGLKDHEVTNSRP-CIVGKMTVSTSYSVCPNNSRISDTEFVLFGGIKNSDLEMNTSNNILKKNKVFPRKVAEVFR-TSNS
          +  D  +LDY YL H  K          S P  +VGK+ VST +SV   N +  + +FVLF    N  L  +     ++KN+  P+KV +  + T  +
Subjt:  --ASTDNKALDYVYLFHSTKSGLKDHEVTNSRP-CIVGKMTVSTSYSVCPNNSRISDTEFVLFGGIKNSDLEMNTSNNILKKNKVFPRKVAEVFR-TSNS

Query:  SKQRNKPNFNRSVVIKDSCPWEPYSDKLNNVDDL-ICARDLPPILELAAVVVRNHLP-----EDRGSRVGGWGLKFLKQIEAKQTHDSPKTLGQSDCCAR
        S+QR+   F+R+  I D C WEP+ +  ++++ + +   DLPP LE +AVVVR   P     E+   +VGGWG+KFLK+I   +T D+ K       C++
Subjt:  SKQRNKPNFNRSVVIKDSCPWEPYSDKLNNVDDL-ICARDLPPILELAAVVVRNHLP-----EDRGSRVGGWGLKFLKQIEAKQTHDSPKTLGQSDCCAR

Query:  NSGNCSTSMDILIPAGLHGGPRTKNGGPSSLMERWRSGGLCDCGGWDIGCPLTVLTGQSISGDNSPQADTQECRAFNIYAKMALHDELCFSDCSYKFYAS
        +    STS+D++IP G+HGGPR +NGGPSSL++RW+SGG CDC GWD+GCPLTVL GQ+    +  Q     C  F ++                     
Subjt:  NSGNCSTSMDILIPAGLHGGPRTKNGGPSSLMERWRSGGLCDCGGWDIGCPLTVLTGQSISGDNSPQADTQECRAFNIYAKMALHDELCFSDCSYKFYAS

Query:  DNKYTTIVFQGSENGPPSWRMVNIRDGL
                 +G   G P  R++N+RDGL
Subjt:  DNKYTTIVFQGSENGPPSWRMVNIRDGL

AT2G33360.2 Protein of unknown function (DUF3527)2.2e-6338.63Show/hide
Query:  LCQGMLQFTWKEG-SPHFIFSVDDEKEVYVASSSKV---ASTDNKALDYVYLFHSTKSGLKDHEVTNSRP-CIVGKMTVSTSYSVCPNNSRISDTEFVLF
        + QG LQFT K+  +PHF+FS+DD+KE+YVAS S     +  D  +LDY YL H  K          S P  +VGK+ VST +SV   N +  + +FVLF
Subjt:  LCQGMLQFTWKEG-SPHFIFSVDDEKEVYVASSSKV---ASTDNKALDYVYLFHSTKSGLKDHEVTNSRP-CIVGKMTVSTSYSVCPNNSRISDTEFVLF

Query:  GGIKNSDLEMNTSNNILKKNKVFPRKVAEVFR-TSNSSKQRNKPNFNRSVVIKDSCPWEPYSDKLNNVDDL-ICARDLPPILELAAVVVRNHLP-----E
            N  L  +     ++KN+  P+KV +  + T  +S+QR+   F+R+  I D C WEP+ +  ++++ + +   DLPP LE +AVVVR   P     E
Subjt:  GGIKNSDLEMNTSNNILKKNKVFPRKVAEVFR-TSNSSKQRNKPNFNRSVVIKDSCPWEPYSDKLNNVDDL-ICARDLPPILELAAVVVRNHLP-----E

Query:  DRGSRVGGWGLKFLKQIEAKQTHDSPKTLGQSDCCARNSGNCSTSMDILIPAGLHGGPRTKNGGPSSLMERWRSGGLCDCGGWDIGCPLTVLTGQSISGD
        +   +VGGWG+KFLK+I   +T D+ K       C+++    STS+D++IP G+HGGPR +NGGPSSL++RW+SGG CDC GWD+GCPLTVL GQ+    
Subjt:  DRGSRVGGWGLKFLKQIEAKQTHDSPKTLGQSDCCARNSGNCSTSMDILIPAGLHGGPRTKNGGPSSLMERWRSGGLCDCGGWDIGCPLTVLTGQSISGD

Query:  NSPQADTQECRAFNIYAKMALHDELCFSDCSYKFYASDNKYTTIVFQGSENGPPSWRMVNIRDGL
        +  Q     C  F ++                              +G   G P  R++N+RDGL
Subjt:  NSPQADTQECRAFNIYAKMALHDELCFSDCSYKFYASDNKYTTIVFQGSENGPPSWRMVNIRDGL

AT2G33360.2 Protein of unknown function (DUF3527)1.5e-0332.26Show/hide
Query:  DRNKEELLENGDLDLSKSISRGLNKRVMLPHALYLKLKQ----------YRISKSYVHDSFFNCSIGLDYKVPKYMVTIDEKYLRRCLELIQI
        D  K++L     L++ +++S G N  +   + L  K +Q           +I K+    SF   +   D K+P+ +V++DEKYLRRCL+LI I
Subjt:  DRNKEELLENGDLDLSKSISRGLNKRVMLPHALYLKLKQ----------YRISKSYVHDSFFNCSIGLDYKVPKYMVTIDEKYLRRCLELIQI

AT2G38600.1 HAD superfamily, subfamily IIIB acid phosphatase4.8e-4741.78Show/hide
Query:  NYCESWRLNVELHNIRSFQVVPEECVSYIGKYVTSTQYRVDSERTIEECIVYLTKACSLKGDGTDAWIFDVDDTLISTVPYYKKNQYGGKKLNLTDLEAW
        +YC SWRL VE +N+R++++VP +C+ Y+  Y+ + QY  D + T+++  VYL +   L GDG DAWI DVDDT  S V YY+  +YG    + T    W
Subjt:  NYCESWRLNVELHNIRSFQVVPEECVSYIGKYVTSTQYRVDSERTIEECIVYLTKACSLKGDGTDAWIFDVDDTLISTVPYYKKNQYGGKKLNLTDLEAW

Query:  MSKANAPVLGHTLRLFNLLKARGVDIILISAR-REGLRSATIENLVQVGYHGWTNLILRSPEDETKEVEKYKAEVRRRLVNGGYRIWGIVGDQYSSIEGS
          K  +P +   L LF  L   G  + L++ R  E LR AT+ENL   G+ G+  LI+R+ +++ +    YK  +R+ ++  GYRIWG VGDQ+S ++G 
Subjt:  MSKANAPVLGHTLRLFNLLKARGVDIILISAR-REGLRSATIENLVQVGYHGWTNLILRSPEDETKEVEKYKAEVRRRLVNGGYRIWGIVGDQYSSIEGS

Query:  PSGKRTFKLPNPI
         SG RTFK+PNP+
Subjt:  PSGKRTFKLPNPI

AT5G44020.1 HAD superfamily, subfamily IIIB acid phosphatase1.7e-7150.6Show/hide
Query:  VTAAADWNILNQ---------RSKNGLKI----SLKNYCESWRLNVELHNIRSFQVVPEECVSYIGKYVTSTQYRVDSERTIEECIVYLTKACSLKG--D
        + +A DWNIL+Q          S+ G+      +L  YCESWR+NVEL+NIR F+VVP+ECV ++ KY+TS+QY  D ER ++E I+YL K C  K   D
Subjt:  VTAAADWNILNQ---------RSKNGLKI----SLKNYCESWRLNVELHNIRSFQVVPEECVSYIGKYVTSTQYRVDSERTIEECIVYLTKACSLKG--D

Query:  GTDAWIFDVDDTLISTVPYYKKNQ-YGGKKLNLTDLEAWMSKANAPVLGHTLRLFNLLKARGVDIILISARREGLRSATIENLVQVGYHGWTNLILRSPE
        G DAWIFD+DDTL+ST+PY+K N  +GG++LN T  E W +   AP + H ++L++ ++ RG  I LIS+R+E LRSAT+ENL++ GYH W+NL+LR  +
Subjt:  GTDAWIFDVDDTLISTVPYYKKNQ-YGGKKLNLTDLEAWMSKANAPVLGHTLRLFNLLKARGVDIILISARREGLRSATIENLVQVGYHGWTNLILRSPE

Query:  DETKEVEKYKAEVRRRLVNGGYRIWGIVGDQYSSIEGSPSGKRTFKLPNPI
        DE K V +YKA++R  L + GYR+WG++G Q++S  G P  KRTFKLPN I
Subjt:  DETKEVEKYKAEVRRRLVNGGYRIWGIVGDQYSSIEGSPSGKRTFKLPNPI


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAACAAACGTCTTTTGATGCTCAGAAAAATCGACGAGGTAAAATTGGTTCTTTCTGTGACAGAAACAAGGAAGAGTTGTTAGAAAATGGTGATTTGGATCTGTCAAA
GTCAATATCAAGAGGGTTGAATAAGAGGGTAATGCTGCCTCATGCATTATATCTGAAGTTGAAGCAATACCGAATCAGTAAAAGCTACGTTCATGATTCGTTTTTTAACT
GTAGTATAGGTTTAGACTATAAGGTTCCGAAGTATATGGTGACAATAGATGAGAAATATCTTCGACGTTGCTTGGAATTGATTCAGATTAGTGCATTGAAGCCTGCACGA
TGCAACACATCTATAAACTTAAGCTCTGTGAAGACAGGTGCTTTGTCTGAAAGCTTGAGCATGACTAAATTAAGAACAAGAGGGATGGGTGGTTTGGAAAGGTTCATTAT
CGCGTGCCCTTCAGCAGGCGAGGATGGCAATACGGTCATAAGCTCGAGAAACCTGTGGTTTGTAGGTTCAATTATGGGGAGCAAGAGCATGATCAACATATTGAAGAGTC
CATTATTACATCAATTCAGTATCACAGACAGCACTTCAAACTTCATAAGAATGGACTTGAATGATATCAAAGGCTTCACAGGCAGCAATTTTATGGACTCTCCCGGTGGT
GTGGATATATCTTCGTTGAAGAAACTAGAAAACGCAAAACTGACAATGGGAAGCCATCAAGATGGATCTGATACTGCAAATGAGAGGTTTTTCTCCACACCCAGCAGAAA
TTCTCTATGTTCTGACCAGTCTTCTTCGGGTTCGGCTTCCACACCTCTTTGTCAGGGTATGCTTCAGTTTACATGGAAGGAAGGAAGTCCTCACTTCATCTTCTCCGTAG
ATGACGAGAAGGAAGTCTATGTGGCTAGCTCATCGAAGGTTGCATCAACTGATAATAAGGCTTTGGATTATGTTTACCTGTTCCATTCCACAAAGAGCGGCCTAAAGGAT
CATGAAGTTACAAATAGCAGACCCTGTATTGTTGGTAAGATGACGGTGTCGACTTCTTATAGTGTCTGTCCAAATAATTCGAGAATCTCGGATACTGAATTTGTGTTGTT
TGGTGGTATAAAAAATTCTGACTTAGAGATGAACACTTCAAACAATATCCTTAAGAAAAATAAGGTTTTCCCAAGAAAGGTAGCCGAAGTATTTAGGACAAGTAACTCAT
CTAAACAGAGAAACAAACCCAATTTCAATAGATCAGTTGTCATAAAGGATTCTTGTCCTTGGGAACCATACTCAGATAAACTCAACAACGTTGATGACTTAATTTGTGCT
CGTGACCTTCCACCCATTCTTGAATTGGCTGCCGTTGTTGTGAGAAACCATCTCCCGGAAGATCGTGGCAGTAGAGTTGGAGGTTGGGGTTTAAAGTTTCTCAAACAGAT
CGAGGCCAAACAAACACACGATTCACCAAAAACCTTAGGGCAGTCTGATTGTTGTGCTCGAAACAGTGGAAACTGTTCAACAAGCATGGATATTCTTATCCCTGCTGGTC
TTCATGGCGGGCCGAGAACAAAAAATGGCGGTCCTTCCAGCCTCATGGAACGGTGGAGATCGGGCGGACTCTGTGATTGTGGTGGCTGGGACATCGGATGTCCTCTAACA
GTACTTACAGGCCAATCAATCAGCGGAGATAATTCACCCCAAGCAGATACGCAGGAGTGCAGGGCATTTAACATATATGCTAAGATGGCATTGCATGACGAGTTATGTTT
TAGCGACTGTAGTTATAAATTCTACGCATCGGATAACAAGTATACGACTATTGTGTTTCAGGGTTCAGAGAACGGTCCTCCATCATGGAGGATGGTGAACATCCGCGATG
GCTTGCTGGAGATGGTTGTCACTTCGTTTGTCAATCACCCATTTAAAATAATCCAAAGATTTCATCCTGAAGAGAACCTGAACCAAGTCTCCATTGAATTGGATGCCGCC
ACTCTGGCCTGTCCCGCATGTGATTTGAGGCCAGCCTGCGTAAGTGTTACAGCGGCTGCAGACTGGAACATTCTGAACCAGAGGAGCAAAAATGGCCTCAAAATCAGCCT
GAAAAACTACTGCGAAAGCTGGAGATTGAACGTCGAGCTCCACAACATCCGAAGCTTCCAGGTGGTGCCGGAAGAATGCGTGAGTTACATCGGCAAATACGTCACGTCGA
CTCAGTACAGAGTGGACTCCGAGAGGACGATCGAGGAGTGCATCGTCTACCTCACCAAGGCCTGCTCCCTCAAAGGCGACGGCACCGATGCGTGGATCTTCGATGTCGAC
GATACCCTCATATCCACTGTGCCGTATTACAAGAAAAATCAATATGGGGGTAAGAAACTGAACCTGACGGATTTGGAGGCGTGGATGAGCAAAGCCAATGCACCAGTTTT
GGGGCACACGCTGAGGCTTTTCAATCTGCTGAAAGCGAGAGGGGTGGACATCATTTTGATTTCTGCGAGAAGGGAAGGTCTCAGATCAGCAACCATTGAGAATCTTGTTC
AGGTTGGCTACCATGGATGGACCAATCTCATCTTAAGGAGTCCTGAAGATGAGACGAAGGAAGTGGAGAAATACAAGGCAGAAGTGAGAAGGCGATTGGTGAATGGAGGG
TATCGCATTTGGGGGATTGTGGGAGATCAATACAGCAGCATTGAGGGAAGTCCAAGTGGAAAAAGGACATTCAAACTTCCAAACCCAATCCCCCGGGAAGTCGACAGTGA
GCCGGTGAGGAGGAAAACCTCAGCAAGTCGACGGTGTATTCGCAAATTCTCTGGGGGCGATCTCCCAATTCCTGCAACGCGCATCAATCCTCCAGCCTCCAACGCTCGAA
GACCATCAAATTGCGTTAATTCTTCAGCTGCCCAAGTCCAGAAGCGAGCAGACCCAGCTTTGCCAAAAAGCCTTACTTGA
mRNA sequenceShow/hide mRNA sequence
ATGGAACAAACGTCTTTTGATGCTCAGAAAAATCGACGAGGTAAAATTGGTTCTTTCTGTGACAGAAACAAGGAAGAGTTGTTAGAAAATGGTGATTTGGATCTGTCAAA
GTCAATATCAAGAGGGTTGAATAAGAGGGTAATGCTGCCTCATGCATTATATCTGAAGTTGAAGCAATACCGAATCAGTAAAAGCTACGTTCATGATTCGTTTTTTAACT
GTAGTATAGGTTTAGACTATAAGGTTCCGAAGTATATGGTGACAATAGATGAGAAATATCTTCGACGTTGCTTGGAATTGATTCAGATTAGTGCATTGAAGCCTGCACGA
TGCAACACATCTATAAACTTAAGCTCTGTGAAGACAGGTGCTTTGTCTGAAAGCTTGAGCATGACTAAATTAAGAACAAGAGGGATGGGTGGTTTGGAAAGGTTCATTAT
CGCGTGCCCTTCAGCAGGCGAGGATGGCAATACGGTCATAAGCTCGAGAAACCTGTGGTTTGTAGGTTCAATTATGGGGAGCAAGAGCATGATCAACATATTGAAGAGTC
CATTATTACATCAATTCAGTATCACAGACAGCACTTCAAACTTCATAAGAATGGACTTGAATGATATCAAAGGCTTCACAGGCAGCAATTTTATGGACTCTCCCGGTGGT
GTGGATATATCTTCGTTGAAGAAACTAGAAAACGCAAAACTGACAATGGGAAGCCATCAAGATGGATCTGATACTGCAAATGAGAGGTTTTTCTCCACACCCAGCAGAAA
TTCTCTATGTTCTGACCAGTCTTCTTCGGGTTCGGCTTCCACACCTCTTTGTCAGGGTATGCTTCAGTTTACATGGAAGGAAGGAAGTCCTCACTTCATCTTCTCCGTAG
ATGACGAGAAGGAAGTCTATGTGGCTAGCTCATCGAAGGTTGCATCAACTGATAATAAGGCTTTGGATTATGTTTACCTGTTCCATTCCACAAAGAGCGGCCTAAAGGAT
CATGAAGTTACAAATAGCAGACCCTGTATTGTTGGTAAGATGACGGTGTCGACTTCTTATAGTGTCTGTCCAAATAATTCGAGAATCTCGGATACTGAATTTGTGTTGTT
TGGTGGTATAAAAAATTCTGACTTAGAGATGAACACTTCAAACAATATCCTTAAGAAAAATAAGGTTTTCCCAAGAAAGGTAGCCGAAGTATTTAGGACAAGTAACTCAT
CTAAACAGAGAAACAAACCCAATTTCAATAGATCAGTTGTCATAAAGGATTCTTGTCCTTGGGAACCATACTCAGATAAACTCAACAACGTTGATGACTTAATTTGTGCT
CGTGACCTTCCACCCATTCTTGAATTGGCTGCCGTTGTTGTGAGAAACCATCTCCCGGAAGATCGTGGCAGTAGAGTTGGAGGTTGGGGTTTAAAGTTTCTCAAACAGAT
CGAGGCCAAACAAACACACGATTCACCAAAAACCTTAGGGCAGTCTGATTGTTGTGCTCGAAACAGTGGAAACTGTTCAACAAGCATGGATATTCTTATCCCTGCTGGTC
TTCATGGCGGGCCGAGAACAAAAAATGGCGGTCCTTCCAGCCTCATGGAACGGTGGAGATCGGGCGGACTCTGTGATTGTGGTGGCTGGGACATCGGATGTCCTCTAACA
GTACTTACAGGCCAATCAATCAGCGGAGATAATTCACCCCAAGCAGATACGCAGGAGTGCAGGGCATTTAACATATATGCTAAGATGGCATTGCATGACGAGTTATGTTT
TAGCGACTGTAGTTATAAATTCTACGCATCGGATAACAAGTATACGACTATTGTGTTTCAGGGTTCAGAGAACGGTCCTCCATCATGGAGGATGGTGAACATCCGCGATG
GCTTGCTGGAGATGGTTGTCACTTCGTTTGTCAATCACCCATTTAAAATAATCCAAAGATTTCATCCTGAAGAGAACCTGAACCAAGTCTCCATTGAATTGGATGCCGCC
ACTCTGGCCTGTCCCGCATGTGATTTGAGGCCAGCCTGCGTAAGTGTTACAGCGGCTGCAGACTGGAACATTCTGAACCAGAGGAGCAAAAATGGCCTCAAAATCAGCCT
GAAAAACTACTGCGAAAGCTGGAGATTGAACGTCGAGCTCCACAACATCCGAAGCTTCCAGGTGGTGCCGGAAGAATGCGTGAGTTACATCGGCAAATACGTCACGTCGA
CTCAGTACAGAGTGGACTCCGAGAGGACGATCGAGGAGTGCATCGTCTACCTCACCAAGGCCTGCTCCCTCAAAGGCGACGGCACCGATGCGTGGATCTTCGATGTCGAC
GATACCCTCATATCCACTGTGCCGTATTACAAGAAAAATCAATATGGGGGTAAGAAACTGAACCTGACGGATTTGGAGGCGTGGATGAGCAAAGCCAATGCACCAGTTTT
GGGGCACACGCTGAGGCTTTTCAATCTGCTGAAAGCGAGAGGGGTGGACATCATTTTGATTTCTGCGAGAAGGGAAGGTCTCAGATCAGCAACCATTGAGAATCTTGTTC
AGGTTGGCTACCATGGATGGACCAATCTCATCTTAAGGAGTCCTGAAGATGAGACGAAGGAAGTGGAGAAATACAAGGCAGAAGTGAGAAGGCGATTGGTGAATGGAGGG
TATCGCATTTGGGGGATTGTGGGAGATCAATACAGCAGCATTGAGGGAAGTCCAAGTGGAAAAAGGACATTCAAACTTCCAAACCCAATCCCCCGGGAAGTCGACAGTGA
GCCGGTGAGGAGGAAAACCTCAGCAAGTCGACGGTGTATTCGCAAATTCTCTGGGGGCGATCTCCCAATTCCTGCAACGCGCATCAATCCTCCAGCCTCCAACGCTCGAA
GACCATCAAATTGCGTTAATTCTTCAGCTGCCCAAGTCCAGAAGCGAGCAGACCCAGCTTTGCCAAAAAGCCTTACTTGA
Protein sequenceShow/hide protein sequence
MEQTSFDAQKNRRGKIGSFCDRNKEELLENGDLDLSKSISRGLNKRVMLPHALYLKLKQYRISKSYVHDSFFNCSIGLDYKVPKYMVTIDEKYLRRCLELIQISALKPAR
CNTSINLSSVKTGALSESLSMTKLRTRGMGGLERFIIACPSAGEDGNTVISSRNLWFVGSIMGSKSMINILKSPLLHQFSITDSTSNFIRMDLNDIKGFTGSNFMDSPGG
VDISSLKKLENAKLTMGSHQDGSDTANERFFSTPSRNSLCSDQSSSGSASTPLCQGMLQFTWKEGSPHFIFSVDDEKEVYVASSSKVASTDNKALDYVYLFHSTKSGLKD
HEVTNSRPCIVGKMTVSTSYSVCPNNSRISDTEFVLFGGIKNSDLEMNTSNNILKKNKVFPRKVAEVFRTSNSSKQRNKPNFNRSVVIKDSCPWEPYSDKLNNVDDLICA
RDLPPILELAAVVVRNHLPEDRGSRVGGWGLKFLKQIEAKQTHDSPKTLGQSDCCARNSGNCSTSMDILIPAGLHGGPRTKNGGPSSLMERWRSGGLCDCGGWDIGCPLT
VLTGQSISGDNSPQADTQECRAFNIYAKMALHDELCFSDCSYKFYASDNKYTTIVFQGSENGPPSWRMVNIRDGLLEMVVTSFVNHPFKIIQRFHPEENLNQVSIELDAA
TLACPACDLRPACVSVTAAADWNILNQRSKNGLKISLKNYCESWRLNVELHNIRSFQVVPEECVSYIGKYVTSTQYRVDSERTIEECIVYLTKACSLKGDGTDAWIFDVD
DTLISTVPYYKKNQYGGKKLNLTDLEAWMSKANAPVLGHTLRLFNLLKARGVDIILISARREGLRSATIENLVQVGYHGWTNLILRSPEDETKEVEKYKAEVRRRLVNGG
YRIWGIVGDQYSSIEGSPSGKRTFKLPNPIPREVDSEPVRRKTSASRRCIRKFSGGDLPIPATRINPPASNARRPSNCVNSSAAQVQKRADPALPKSLT