CuGenDBv2

Sgr017906 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene ID: Sgr017906
Organism: Siraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Description: DNA-dependent metalloprotease WSS1 isoform X2
Genome location: tig00153057:371692..384915
RNA-Seq Expression: Sgr017906
Synteny: Sgr017906
Gene Ontology terms:
GO:0006508 - proteolysis (biological process)
GO:0030001 - metal ion transport (biological process)
GO:0055085 - transmembrane transport (biological process)
GO:0016020 - membrane (cellular component)
GO:0008237 - metallopeptidase activity (molecular function)
GO:0046872 - metal ion binding (molecular function)
GO:0046873 - metal ion transmembrane transporter activity (molecular function)
InterPro domains:
IPR001876 - Zinc finger, RanBP2-type
IPR003689 - Zinc/iron permease
IPR013536 - WLM domain
IPR036443 - Zinc finger, RanBP2-type superfamily
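
The genome location field of this record has the form contig:start..end. A minimal sketch of splitting such a string and computing the locus span follows. The helper name `parse_location` is hypothetical (it is not a CuGenDB function), and the span calculation assumes 1-based inclusive coordinates, which this page does not state.

```python
import re

# Hypothetical helper, not part of CuGenDB: splits "contig:start..end".
LOCATION_RE = re.compile(r"^(?P<contig>[^:]+):(?P<start>\d+)\.\.(?P<end>\d+)$")


def parse_location(text):
    """Return (contig, start, end) parsed from a 'contig:start..end' string."""
    match = LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location: {text!r}")
    return match["contig"], int(match["start"]), int(match["end"])


contig, start, end = parse_location("tig00153057:371692..384915")
# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end - start + 1
print(contig, start, end, span)
```

Under that coordinate assumption the locus covers both the transporter-like and the WSS1-like regions seen in the homology hits; if the coordinates are 0-based or half-open, the span differs by one.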


Homology
GenBank top hits (columns: e value, % identity, alignment)
CAD5315675.1 unnamed protein product [Arabidopsis thaliana] (e value: 1.1e-215, % identity: 47.14)
Query:  MARSLLLLSIFLFLVSSAAPH-SGHSDDGDDADHTAAADGAPNLRSKPLLLVKIGCLILIFVGTFIPGISPYFFKWNDGFLVLGTQFAGGVFLGTAMMHF
        M+RSL+   +FL LV     H +G   D D+A H  ++D    L+SK L+ VKI CL++IFV TFI G+SPYF KW+ GFLVLGTQFAGGVFL TA+MHF
Subjt:  MARSLLLLSIFLFLVSSAAPH-SGHSDDGDDADHTAAADGAPNLRSKPLLLVKIGCLILIFVGTFIPGISPYFFKWNDGFLVLGTQFAGGVFLGTAMMHF

Query:  LSDANETFEDL--------TDKAYPFAFMLACVGYLMTMAADCVISHLYRKQSADSSVGVHGRDVELQGASTSPPKFQVPNGSSSHHPNPALTTMSSFGD
        LSDA+ETF DL           AYPFA+MLAC G+++TM AD VI+H+Y K            D+ELQG   S  +              + TT +S GD
Subjt:  LSDANETFEDL--------TDKAYPFAFMLACVGYLMTMAADCVISHLYRKQSADSSVGVHGRDVELQGASTSPPKFQVPNGSSSHHPNPALTTMSSFGD

Query:  SILLIVALCFHSVFEGIAIGVAETEADAWKALWTISLHKVFAAIAMGIALLRMIPNRPFFSCVVYAFAFAISSPIGIAIGIIIDATTQGAVADWIFAISM
        SILLIVALCFHSVFEGIAIG++ET++DAW+ALWTI+LHK+FAAIAMGIALLRMIP+RP FS + Y+FAFAISSPIG+AIGI+IDATTQG++ADWIFA+SM
Subjt:  SILLIVALCFHSVFEGIAIGVAETEADAWKALWTISLHKVFAAIAMGIALLRMIPNRPFFSCVVYAFAFAISSPIGIAIGIIIDATTQGAVADWIFAISM

Query:  GLACGVFIYVSVNHLLSKGYTPKDAVLVDNPNYKFLAVLLGIGKGVSLPIHWPADGRKRVAPIPAPMEAKTSDALHRSVAPAIFFLRSITFSVSSTISSS
         LACGVF+YVSVNHLL+KGY P   V VD P YKFLAVL G+                                                          
Subjt:  GLACGVFIYVSVNHLLSKGYTPKDAVLVDNPNYKFLAVLLGIGKGVSLPIHWPADGRKRVAPIPAPMEAKTSDALHRSVAPAIFFLRSITFSVSSTISSS

Query:  LDFTFAEMEQQSLYAALLVRRSKSPHRHHLLTLHEAPKEAYKTSETTSRLSYRASSIEKQRIMALIQVCTPWYCIEENHAERCSLMHSVDLRGSPQSHLH
                                                                     ++A++ +C                               
Subjt:  LDFTFAEMEQQSLYAALLVRRSKSPHRHHLLTLHEAPKEAYKTSETTSRLSYRASSIEKQRIMALIQVCTPWYCIEENHAERCSLMHSVDLRGSPQSHLH

Query:  RHHNEQPESHNYCSMTSQDKFSHLALAAWWFPKFFFRSMDLGDLNKVWEVKALK-KAGEKEAKEILERIAKQVQPIMRRHKWRVKVLSEFCPKNPALLGL
                                        +    S +L DLNKVWE+KALK K  E EA++ILE++A QVQPIM R KWRVK+LSEFCP NP LLG+
Subjt:  RHHNEQPESHNYCSMTSQDKFSHLALAAWWFPKFFFRSMDLGDLNKVWEVKALK-KAGEKEAKEILERIAKQVQPIMRRHKWRVKVLSEFCPKNPALLGL

Query:  NVGRGIHVKLRLRRPNRDGDFFPFNQVLDTMLHELCHNLHGPHNASFYKLWDELRKECEELMAKGISGSAQGFDLPGRRLGGNSRQPPLSSLRKSALAAA
        NV RG+ VKLRLRR N D DF  ++++LDTMLHELCHN HGPHNASFYKLWDELRKECEELM+KGI+G+ QGFD+PG+RLGG SRQP LS LR +A  AA
Subjt:  NVGRGIHVKLRLRRPNRDGDFFPFNQVLDTMLHELCHNLHGPHNASFYKLWDELRKECEELMAKGISGSAQGFDLPGRRLGGNSRQPPLSSLRKSALAAA

Query:  EGRRRLGSLLPSGPNRLGGDSNIMVALSPIQAAAMAAERRLQDDIWCAS-SQEIPVDEECCSDFPSEAVHFSQAGKSGPSSNLGNGEDALHQKRSRDSER
        E R R G+LLPSGP RLGGDS+IM  LSPIQAAAMAAERRL DDIWC S S +   DEE  SD   E V   +   S       NG+    ++ +  S  
Subjt:  EGRRRLGSLLPSGPNRLGGDSNIMVALSPIQAAAMAAERRLQDDIWCAS-SQEIPVDEECCSDFPSEAVHFSQAGKSGPSSNLGNGEDALHQKRSRDSER

Query:  NSTNKSSNGHLKPEFVDLSKDVLIPGSTADYDAESNKRHKM-SDRVP-----FPQSCAETSSIDLPCASSNLMPSHDGTLHPGELSMWECGNCTLLNPAL
        +S   SS+     + +DL+++         ++    KR+++  D+ P      P +    SSI LP  S N   S +      E +MWEC  CTLLNP+L
Subjt:  NSTNKSSNGHLKPEFVDLSKDVLIPGSTADYDAESNKRHKM-SDRVP-----FPQSCAETSSIDLPCASSNLMPSHDGTLHPGELSMWECGNCTLLNPAL

Query:  APICELCCSQKPKDADTKYKFWSCKFCTLENSVKLEKCSACGQWRYSHGQPVSTRGPNLGT
        APICELC + KPK+ + K+K WSCKFCTLEN VKLEKC ACGQWRYS+G P+ST  PN+GT
Subjt:  APICELCCSQKPKDADTKYKFWSCKFCTLENSVKLEKCSACGQWRYSHGQPVSTRGPNLGT

KAG6607757.1 hypothetical protein SDJN03_01099, partial [Cucurbita argyrosperma subsp. sororia] (e value: 9.9e-217, % identity: 87.12)
Query:  AAWWFPKFFFRSMDLGDLNKVWEVKALKKAGEKEAKEILERIAKQVQPIMRRHKWRVKVLSEFCPKNPALLGLNVGRGIHVKLRLRRPNRDGDFFPFNQV
        AA  F +FFFR M++GDLNK+WE+KALKKAGEKEAKEILERIAKQVQPIMR+ KWRVKVLSEFCPKNPALLGLNVGRGIHVKLRLRRPNRDGDFFPFNQV
Subjt:  AAWWFPKFFFRSMDLGDLNKVWEVKALKKAGEKEAKEILERIAKQVQPIMRRHKWRVKVLSEFCPKNPALLGLNVGRGIHVKLRLRRPNRDGDFFPFNQV

Query:  LDTMLHELCHNLHGPHNASFYKLWDELRKECEELMAKGISGSAQGFDLPGRRLGGNSRQPPLSSLRKSALAAAEGRRRLGSLLPSGPNRLGGDSNIMVAL
        LDTMLHELCHNL  PHNASFYKLWDELRKECEELMAKGISGSAQGF+LPGRRLGG S+QPPLSSLRKSALAAAEGR+RLGSLLPSGPNRLGGDSNIMVAL
Subjt:  LDTMLHELCHNLHGPHNASFYKLWDELRKECEELMAKGISGSAQGFDLPGRRLGGNSRQPPLSSLRKSALAAAEGRRRLGSLLPSGPNRLGGDSNIMVAL

Query:  SPIQAAAMAAERRLQDDIWCASSQEIPVDEECCSDFPSEAVHFSQAGKSGPSSNLGNGEDALHQKRSRDSERNSTNKSSNGHLKPEFVDLSKDVLIPGST
        SP+QAAAMAAERRLQDDIWCASSQE+PVDEECCSDFPS+ VH S AGKSGPSSNL N  D LHQKR+R+SE+ S+NKSS GHLKP FVDLS+DVLIPGS+
Subjt:  SPIQAAAMAAERRLQDDIWCASSQEIPVDEECCSDFPSEAVHFSQAGKSGPSSNLGNGEDALHQKRSRDSERNSTNKSSNGHLKPEFVDLSKDVLIPGST

Query:  ADYDAESNKRHKMSDRVPFPQSCAETSSIDLPCASSNLMPSHDGTLHPGELSMWECGNCTLLNPALAPICELCCSQKPKDADTKYKFWSCKFCTLENSVK
        A YD E NKR+KMSDRV FPQ CAETS+ID  C+SSN MP HDGTLHP E SMWECGNCTLLNP LAPICELC SQK KDADTKY+FWSCKFCTLENSVK
Subjt:  ADYDAESNKRHKMSDRVPFPQSCAETSSIDLPCASSNLMPSHDGTLHPGELSMWECGNCTLLNPALAPICELCCSQKPKDADTKYKFWSCKFCTLENSVK

Query:  LEKCSACGQWRYSHGQPVSTRGPNLGT
        LEKCSACGQWRYSHGQP STRGPNLGT
Subjt:  LEKCSACGQWRYSHGQPVSTRGPNLGT

KAG7037334.1 WSS1 [Cucurbita argyrosperma subsp. argyrosperma] (e value: 8.7e-213, % identity: 88.19)
Query:  MDLGDLNKVWEVKALKKAGEKEAKEILERIAKQVQPIMRRHKWRVKVLSEFCPKNPALLGLNVGRGIHVKLRLRRPNRDGDFFPFNQVLDTMLHELCHNL
        M++GDLNKVWE+KALKKAGEKEAKEILERIAKQVQPIMR+ KWRVKVLSEFCPKNPALLGLNVGRGIHVKLRLRRPNRDGDFFPFNQVLDTMLHELCHNL
Subjt:  MDLGDLNKVWEVKALKKAGEKEAKEILERIAKQVQPIMRRHKWRVKVLSEFCPKNPALLGLNVGRGIHVKLRLRRPNRDGDFFPFNQVLDTMLHELCHNL

Query:  HGPHNASFYKLWDELRKECEELMAKGISGSAQGFDLPGRRLGGNSRQPPLSSLRKSALAAAEGRRRLGSLLPSGPNRLGGDSNIMVALSPIQAAAMAAER
          PHNASFYKLWDELRKECEELMAKGISGSAQGF+LPGRRLGG S+Q PLSSLRKSALAAAEGR+RLGSLLPSGPNRLGGDSNIMVALSP+QAAAMAAER
Subjt:  HGPHNASFYKLWDELRKECEELMAKGISGSAQGFDLPGRRLGGNSRQPPLSSLRKSALAAAEGRRRLGSLLPSGPNRLGGDSNIMVALSPIQAAAMAAER

Query:  RLQDDIWCASSQEIPVDEECCSDFPSEAVHFSQAGKSGPSSNLGNGEDALHQKRSRDSERNSTNKSSNGHLKPEFVDLSKDVLIPGSTADYDAESNKRHK
        RLQDDIWCASSQE+PVDEECCSDFPSE  H S AGKSGPSSNL N  DALHQKR+R+SE+ S+NKSS GHLKP FVDLS+DVLIPGS+A YD E NKR+K
Subjt:  RLQDDIWCASSQEIPVDEECCSDFPSEAVHFSQAGKSGPSSNLGNGEDALHQKRSRDSERNSTNKSSNGHLKPEFVDLSKDVLIPGSTADYDAESNKRHK

Query:  MSDRVPFPQSCAETSSIDLPCASSNLMPSHDGTLHPGELSMWECGNCTLLNPALAPICELCCSQKPKDADTKYKFWSCKFCTLENSVKLEKCSACGQWRY
        MSDRV FPQ CAETS+ID  C+SSN MP HDGTLHP E SMWECGNCTLLNP LAPICELC SQK KDADTKY+FWSCKFCTLENSVKLEKCSACGQWRY
Subjt:  MSDRVPFPQSCAETSSIDLPCASSNLMPSHDGTLHPGELSMWECGNCTLLNPALAPICELCCSQKPKDADTKYKFWSCKFCTLENSVKLEKCSACGQWRY

Query:  SHGQPVSTRGPNLGT
        SHGQP STRGPNLGT
Subjt:  SHGQPVSTRGPNLGT

XP_023525391.1 uncharacterized protein LOC111789010 [Cucurbita pepo subsp. pepo] (e value: 7.1e-215, % identity: 88.92)
Query:  MDLGDLNKVWEVKALKKAGEKEAKEILERIAKQVQPIMRRHKWRVKVLSEFCPKNPALLGLNVGRGIHVKLRLRRPNRDGDFFPFNQVLDTMLHELCHNL
        MD+GDLNKVWE+KALKKAGEKEAKEILERIAKQVQPIMR+ KWRVKVLSEFCPKNPALLGLNVGRGIHVKLRLRRPNRDGDFFPFNQVLDTMLHELCHNL
Subjt:  MDLGDLNKVWEVKALKKAGEKEAKEILERIAKQVQPIMRRHKWRVKVLSEFCPKNPALLGLNVGRGIHVKLRLRRPNRDGDFFPFNQVLDTMLHELCHNL

Query:  HGPHNASFYKLWDELRKECEELMAKGISGSAQGFDLPGRRLGGNSRQPPLSSLRKSALAAAEGRRRLGSLLPSGPNRLGGDSNIMVALSPIQAAAMAAER
          PHNA+FYKLWDELRKECEELMAKGISGSAQGF+LPGRRLGG S+QPPLSSLRKSALAAAEGR+RLGSLLPSGPNRLGGDSNIMVALSP+QAAAMAAER
Subjt:  HGPHNASFYKLWDELRKECEELMAKGISGSAQGFDLPGRRLGGNSRQPPLSSLRKSALAAAEGRRRLGSLLPSGPNRLGGDSNIMVALSPIQAAAMAAER

Query:  RLQDDIWCASSQEIPVDEECCSDFPSEAVHFSQAGKSGPSSNLGNGEDALHQKRSRDSERNSTNKSSNGHLKPEFVDLSKDVLIPGSTADYDAESNKRHK
        RLQDDIWCASSQE+PVDEECCSDFPSE  H S AGKSGPSSNL N  DALHQKR+R+SE++S  KSS GHLKP+FVDLS+DVLIPGS+A YD ESNKRHK
Subjt:  RLQDDIWCASSQEIPVDEECCSDFPSEAVHFSQAGKSGPSSNLGNGEDALHQKRSRDSERNSTNKSSNGHLKPEFVDLSKDVLIPGSTADYDAESNKRHK

Query:  MSDRVPFPQSCAETSSIDLPCASSNLMPSHDGTLHPGELSMWECGNCTLLNPALAPICELCCSQKPKDADTKYKFWSCKFCTLENSVKLEKCSACGQWRY
        MSDRV FPQ CAETSSID  C+SSN MP HDGTLHP E SMWECGNCTLLNP LAPICELC SQK KDADTKY+FWSCKFCTLENSVKLEKCSACGQWRY
Subjt:  MSDRVPFPQSCAETSSIDLPCASSNLMPSHDGTLHPGELSMWECGNCTLLNPALAPICELCCSQKPKDADTKYKFWSCKFCTLENSVKLEKCSACGQWRY

Query:  SHGQPVSTRGPNLGT
        SHGQP STRGPNLGT
Subjt:  SHGQPVSTRGPNLGT

XP_038898345.1 uncharacterized protein LOC120086023 [Benincasa hispida] (e value: 1.3e-219, % identity: 90.36)
Query:  MDLGDLNKVWEVKALKKAGEKEAKEILERIAKQVQPIMRRHKWRVKVLSEFCPKNPALLGLNVGRGIHVKLRLRRPNRDGDFFPFNQVLDTMLHELCHNL
        MD+GDLNKVWE+KALKKAGEKEAKEILERIAKQVQPIMRRHKWRVKVLSEFCPKNPALLGLNVGRGIHVKLRLRRPNRDGDF+PFNQVLDTMLHELCHNL
Subjt:  MDLGDLNKVWEVKALKKAGEKEAKEILERIAKQVQPIMRRHKWRVKVLSEFCPKNPALLGLNVGRGIHVKLRLRRPNRDGDFFPFNQVLDTMLHELCHNL

Query:  HGPHNASFYKLWDELRKECEELMAKGISGSAQGFDLPGRRLGGNSRQPPLSSLRKSALAAAEGRRRLGSLLPSGPNRLGGDSNIMVALSPIQAAAMAAER
        HGPHNA+FYKLWDELRKECEELMAKGISG+AQGFDLPGRRLGGN RQPPLSSL KS+LAAAEGRR LGSLLPSGPNRLGGDSNIMVALSP+QAAAMAAER
Subjt:  HGPHNASFYKLWDELRKECEELMAKGISGSAQGFDLPGRRLGGNSRQPPLSSLRKSALAAAEGRRRLGSLLPSGPNRLGGDSNIMVALSPIQAAAMAAER

Query:  RLQDDIWCASSQEIPVDEECCSDFPSEAVHFSQAGKSGPSSNLGNGEDALHQKRSRDSERNSTNKSSNGHLKPEFVDLSKDVLIPGSTADYDAESNKRHK
        RLQDDIWCASSQE+PVDE+CC DFPSEA HFSQAGKSGP SNL NG DAL QKRSR+SER S+NKSSNGHLKP+FVDLSKD +IPGS+ADY AESNKRHK
Subjt:  RLQDDIWCASSQEIPVDEECCSDFPSEAVHFSQAGKSGPSSNLGNGEDALHQKRSRDSERNSTNKSSNGHLKPEFVDLSKDVLIPGSTADYDAESNKRHK

Query:  MSDRVPFPQSCAETSSIDLPCASSNLMPSHDGTLHPGELSMWECGNCTLLNPALAPICELCCSQKPKDADTKYKFWSCKFCTLENSVKLEKCSACGQWRY
        M DRV FP+S AETSSIDL  +SSNLMPSHDGT+HPGELSMWECGNCTLLNP LAP+CELC SQKPKDADT+YKFWSCKFCTLENSVKLEKCSAC QWRY
Subjt:  MSDRVPFPQSCAETSSIDLPCASSNLMPSHDGTLHPGELSMWECGNCTLLNPALAPICELCCSQKPKDADTKYKFWSCKFCTLENSVKLEKCSACGQWRY

Query:  SHGQPVSTRGPNLGT
        SHGQPVST GPNLGT
Subjt:  SHGQPVSTRGPNLGT
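
In the alignment blocks above, the unlabeled middle line marks each column: a letter for an identical residue, `+` for a conservative substitution, and a space for a mismatch or gap. The following is a minimal sketch of counting those columns for a single block; the function name `midline_stats` is hypothetical, and the two inputs are short fragments copied from the Benincasa hispida hit. The % identity reported for a hit is computed over the whole alignment, so a per-block count like this is only illustrative.

```python
def midline_stats(query, midline):
    """Count identical and conservative ('+') columns in one alignment block.

    Returns (identical, positives, block_length).
    """
    if len(query) != len(midline):
        raise ValueError("query and midline must have the same length")
    # A letter on the midline marks an identical residue in that column.
    identical = sum(1 for m in midline if m.isalpha())
    # '+' marks a conservative substitution (positive score, not identical).
    positives = sum(1 for m in midline if m == "+")
    return identical, positives, len(query)


# Final block of the hit: one mismatch (R in the query, blank midline).
print(midline_stats("SHGQPVSTRGPNLGT", "SHGQPVST GPNLGT"))
# First 12 columns of the hit: two conservative substitutions.
print(midline_stats("MDLGDLNKVWEV", "MD+GDLNKVWE+"))
```

Summing the identical counts over every block of a hit and dividing by the total alignment length would give a percentage comparable to the % identity column, provided gap columns are counted the same way the search tool counts them.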

TrEMBL top hits (columns: e value, % identity, alignment)
A0A0A0K3J6 Uncharacterized protein (e value: 3.0e-211, % identity: 86.51)
Query:  MDLGDLNKVWEVKALKKAGEKEAKEILERIAKQVQPIMRRHKWRVKVLSEFCPKNPALLGLNVGRGIHVKLRLRRPNRDGDFFPFNQVLDTMLHELCHNL
        MD+GDLNKVWE+KALKKAGEKEAK++LERIAKQVQPIMR+HKWRVKVLSEFCPKNPALLGLNVGRGIHVKLRLRRPNRDGDFFPFNQVLDTMLHELCHNL
Subjt:  MDLGDLNKVWEVKALKKAGEKEAKEILERIAKQVQPIMRRHKWRVKVLSEFCPKNPALLGLNVGRGIHVKLRLRRPNRDGDFFPFNQVLDTMLHELCHNL

Query:  HGPHNASFYKLWDELRKECEELMAKGISGSAQGFDLPGRRLGGNSRQPPLSSLRKSALAAAEGRRRLGSLLPSGPNRLGGDSNIMVALSPIQAAAMAAER
        HGPHNA+FYKLWDELRKECEEL+AKG+SG+AQGFDLPGRRLGGN RQP LSSLRKS+LAAAEGRRRLGSLLPSGPNRLGGDSNIMVALSP+QAAAMAAER
Subjt:  HGPHNASFYKLWDELRKECEELMAKGISGSAQGFDLPGRRLGGNSRQPPLSSLRKSALAAAEGRRRLGSLLPSGPNRLGGDSNIMVALSPIQAAAMAAER

Query:  RLQDDIWCASSQEIPVDEECCSDFPSEAVHFSQAGKSGPSSNLGNGEDALHQKRSRDSERNSTNKSSNGHLKPEFVDLSKDVLIPGSTADYDAESNKRHK
        RLQDDIWCAS Q +PVDE+CC  FPSEA H SQAGKSGP  NL    DALHQKR R+SER S NKSSNG L+P+FVDLSKD  IPGS+ADY AESNKRHK
Subjt:  RLQDDIWCASSQEIPVDEECCSDFPSEAVHFSQAGKSGPSSNLGNGEDALHQKRSRDSERNSTNKSSNGHLKPEFVDLSKDVLIPGSTADYDAESNKRHK

Query:  MSDRVPFPQSCAETSSIDLPCASSNLMPSHDGTLHPGELSMWECGNCTLLNPALAPICELCCSQKPKDADTKYKFWSCKFCTLENSVKLEKCSACGQWRY
        + DR+ FPQS AETSSIDL C+SSNLM  +DGT+HPGELSMWECGNCTLLNP LAPICELC SQKP D+DT+YKFWSCKFCTLENSVKLEKC+AC QWRY
Subjt:  MSDRVPFPQSCAETSSIDLPCASSNLMPSHDGTLHPGELSMWECGNCTLLNPALAPICELCCSQKPKDADTKYKFWSCKFCTLENSVKLEKCSACGQWRY

Query:  SHGQPVSTRGPNLGT
        SHGQPVSTRGPNLGT
Subjt:  SHGQPVSTRGPNLGT

A0A6J1CDP8 uncharacterized protein LOC111010596 isoform X1 (e value: 1.0e-211, % identity: 88.49)
Query:  MDLGDLNKVWEVKAL-KKAGEKEAKEILERIAKQVQPIMRRHKWRVKVLSEFCPKNPALLGLNVGRGIHVKLRLRRPNRDGDFFPFNQVLDTMLHELCHN
        MD+GDLNKVWE+KAL KKAGEKEA+EILERIAKQVQPIMRRHKWRVKVLSEFCPKN ALLGLNVGRGIHVKLRLRRPNRD DF PFNQVLDTMLHELCHN
Subjt:  MDLGDLNKVWEVKAL-KKAGEKEAKEILERIAKQVQPIMRRHKWRVKVLSEFCPKNPALLGLNVGRGIHVKLRLRRPNRDGDFFPFNQVLDTMLHELCHN

Query:  LHGPHNASFYKLWDELRKECEELMAKGISGSAQGFDLPGRRLGGNSRQPPLSSLRKSALAAAEGRRRLGSLLPSGPNRLGGDSNIMVALSPIQAAAMAAE
        LHGPHNA+FYKLWDELRKECEELMAKGISG+AQGFDL GRRLGGNSRQPPLSSLRKSALAAAEGRRRLGSLLPSGP RLGGDSNIMVALSP+QAAAMAAE
Subjt:  LHGPHNASFYKLWDELRKECEELMAKGISGSAQGFDLPGRRLGGNSRQPPLSSLRKSALAAAEGRRRLGSLLPSGPNRLGGDSNIMVALSPIQAAAMAAE

Query:  RRLQDDIWCASSQEIPVDEECCSDFPSEAVHFSQAGKSGPSSNLGNGEDALHQKRSRDSERNSTNKSSNGHLKPEFVDLSKDVLIPGSTADYDAESNKRH
        RRLQDDIWCAS QEIPVDEECC D PSEAVH  +AGK GPSSNL NG DALH KR RD   +STNK+SNGHLKP+FVDLS+DV I GSTADYDAESNKR 
Subjt:  RRLQDDIWCASSQEIPVDEECCSDFPSEAVHFSQAGKSGPSSNLGNGEDALHQKRSRDSERNSTNKSSNGHLKPEFVDLSKDVLIPGSTADYDAESNKRH

Query:  KMSDRVPFPQSCAETSSIDLPCASSNLMPSHDGT-LHPGELSMWECGNCTLLNPALAPICELCCSQKPKDADTKYKFWSCKFCTLENSVKLEKCSACGQW
        KMS+RVPFPQSCAETSS  L C+SSNLM SHDGT  HPGELSMWECGNCTLLNP LAP+CELC SQKPKDADTKY+ WSCKFCTLENSVKLEKCSAC QW
Subjt:  KMSDRVPFPQSCAETSSIDLPCASSNLMPSHDGT-LHPGELSMWECGNCTLLNPALAPICELCCSQKPKDADTKYKFWSCKFCTLENSVKLEKCSACGQW

Query:  RYSHGQPVSTRGPNLGT
        RYSHGQPVSTR PNLGT
Subjt:  RYSHGQPVSTRGPNLGT

A0A6J1FKX5 uncharacterized protein LOC111446189 isoform X1 (e value: 5.5e-213, % identity: 87.95)
Query:  MDLGDLNKVWEVKALKKAGEKEAKEILERIAKQVQPIMRRHKWRVKVLSEFCPKNPALLGLNVGRGIHVKLRLRRPNRDGDFFPFNQVLDTMLHELCHNL
        M++GDLNKVWE+KALKKAGEKEAKEILERIAKQVQPIMR+ KWRVKVLSEFCPKNPALLGLNVGRGIHVKLRLRRPNRDGDFFPFNQVLDTMLHELCHNL
Subjt:  MDLGDLNKVWEVKALKKAGEKEAKEILERIAKQVQPIMRRHKWRVKVLSEFCPKNPALLGLNVGRGIHVKLRLRRPNRDGDFFPFNQVLDTMLHELCHNL

Query:  HGPHNASFYKLWDELRKECEELMAKGISGSAQGFDLPGRRLGGNSRQPPLSSLRKSALAAAEGRRRLGSLLPSGPNRLGGDSNIMVALSPIQAAAMAAER
          PHNASFYKLWDELRKECEELMAKGISGSAQGF+LPGRRLGG S+QPPLSSLRKSALAAAEGR+RLGSLLPSGPNRLGGDSNIMVALSP+QAAAMAAER
Subjt:  HGPHNASFYKLWDELRKECEELMAKGISGSAQGFDLPGRRLGGNSRQPPLSSLRKSALAAAEGRRRLGSLLPSGPNRLGGDSNIMVALSPIQAAAMAAER

Query:  RLQDDIWCASSQEIPVDEECCSDFPSEAVHFSQAGKSGPSSNLGNGEDALHQKRSRDSERNSTNKSSNGHLKPEFVDLSKDVLIPGSTADYDAESNKRHK
        RLQDDIWCASSQE+PVDEECCSDFPS+ VH S AGKSGPSSNL N  D LHQKR+R+SE+ S+NKSS GHLKP FVDLS+DVLIPGS+A YD E NKR+K
Subjt:  RLQDDIWCASSQEIPVDEECCSDFPSEAVHFSQAGKSGPSSNLGNGEDALHQKRSRDSERNSTNKSSNGHLKPEFVDLSKDVLIPGSTADYDAESNKRHK

Query:  MSDRVPFPQSCAETSSIDLPCASSNLMPSHDGTLHPGELSMWECGNCTLLNPALAPICELCCSQKPKDADTKYKFWSCKFCTLENSVKLEKCSACGQWRY
        MSDRV FPQ CAETS+ID  C+SSN MP HDGTLHP E SMWECGNCTLLNP LAPICELC SQK KDADTKY+FWSCKFCTLENSVKLEKC ACGQWRY
Subjt:  MSDRVPFPQSCAETSSIDLPCASSNLMPSHDGTLHPGELSMWECGNCTLLNPALAPICELCCSQKPKDADTKYKFWSCKFCTLENSVKLEKCSACGQWRY

Query:  SHGQPVSTRGPNLGT
        SHGQP STRGPNLGT
Subjt:  SHGQPVSTRGPNLGT

A0A6J1IU34 uncharacterized protein LOC111480590 isoform X1 (e value: 1.4e-211, % identity: 87.47)
Query:  MDLGDLNKVWEVKALKKAGEKEAKEILERIAKQVQPIMRRHKWRVKVLSEFCPKNPALLGLNVGRGIHVKLRLRRPNRDGDFFPFNQVLDTMLHELCHNL
        MD+GDLNKVWE+KALKKAGEKEAKEILERIAKQVQPIMR+ KWRVKVLSEFCPKNPALLGLNVGRGIHVKLRLRRPNRDGDFFPFNQVLDTMLHELCHNL
Subjt:  MDLGDLNKVWEVKALKKAGEKEAKEILERIAKQVQPIMRRHKWRVKVLSEFCPKNPALLGLNVGRGIHVKLRLRRPNRDGDFFPFNQVLDTMLHELCHNL

Query:  HGPHNASFYKLWDELRKECEELMAKGISGSAQGFDLPGRRLGGNSRQPPLSSLRKSALAAAEGRRRLGSLLPSGPNRLGGDSNIMVALSPIQAAAMAAER
          PHNA+FYKLWDELRKECEELMAKGISGSAQGF+LPGRRLGG S+QPPLSSLRKSALAAAEGR+RLGSLLPSGPNRLGGDSNIM ALSP+QAAAMAAER
Subjt:  HGPHNASFYKLWDELRKECEELMAKGISGSAQGFDLPGRRLGGNSRQPPLSSLRKSALAAAEGRRRLGSLLPSGPNRLGGDSNIMVALSPIQAAAMAAER

Query:  RLQDDIWCASSQEIPVDEECCSDFPSEAVHFSQAGKSGPSSNLGNGEDALHQKRSRDSERNSTNKSSNGHLKPEFVDLSKDVLIPGSTADYDAESNKRHK
        RLQDDIWCASSQE+PVDEECCSDFPSE  H S AGKSGPSSNL    DALHQKR+R+SE+ S+N SS GHL+P+FVDLS+DVLIPGS+A YD E  KR+K
Subjt:  RLQDDIWCASSQEIPVDEECCSDFPSEAVHFSQAGKSGPSSNLGNGEDALHQKRSRDSERNSTNKSSNGHLKPEFVDLSKDVLIPGSTADYDAESNKRHK

Query:  MSDRVPFPQSCAETSSIDLPCASSNLMPSHDGTLHPGELSMWECGNCTLLNPALAPICELCCSQKPKDADTKYKFWSCKFCTLENSVKLEKCSACGQWRY
        +SDRV FPQ CAETS+ID PC SSN MP HDGTLHP E SMWECGNCTLLNP LAPICELC SQK KDADTKY+FWSCKFCTLENSVKLEKCSACGQWRY
Subjt:  MSDRVPFPQSCAETSSIDLPCASSNLMPSHDGTLHPGELSMWECGNCTLLNPALAPICELCCSQKPKDADTKYKFWSCKFCTLENSVKLEKCSACGQWRY

Query:  SHGQPVSTRGPNLGT
        SHGQPVSTRGPNLGT
Subjt:  SHGQPVSTRGPNLGT

A0A7G2DYC7 (thale cress) hypothetical protein (e value: 5.3e-216, % identity: 47.14)
Query:  MARSLLLLSIFLFLVSSAAPH-SGHSDDGDDADHTAAADGAPNLRSKPLLLVKIGCLILIFVGTFIPGISPYFFKWNDGFLVLGTQFAGGVFLGTAMMHF
        M+RSL+   +FL LV     H +G   D D+A H  ++D    L+SK L+ VKI CL++IFV TFI G+SPYF KW+ GFLVLGTQFAGGVFL TA+MHF
Subjt:  MARSLLLLSIFLFLVSSAAPH-SGHSDDGDDADHTAAADGAPNLRSKPLLLVKIGCLILIFVGTFIPGISPYFFKWNDGFLVLGTQFAGGVFLGTAMMHF

Query:  LSDANETFEDL--------TDKAYPFAFMLACVGYLMTMAADCVISHLYRKQSADSSVGVHGRDVELQGASTSPPKFQVPNGSSSHHPNPALTTMSSFGD
        LSDA+ETF DL           AYPFA+MLAC G+++TM AD VI+H+Y K            D+ELQG   S  +              + TT +S GD
Subjt:  LSDANETFEDL--------TDKAYPFAFMLACVGYLMTMAADCVISHLYRKQSADSSVGVHGRDVELQGASTSPPKFQVPNGSSSHHPNPALTTMSSFGD

Query:  SILLIVALCFHSVFEGIAIGVAETEADAWKALWTISLHKVFAAIAMGIALLRMIPNRPFFSCVVYAFAFAISSPIGIAIGIIIDATTQGAVADWIFAISM
        SILLIVALCFHSVFEGIAIG++ET++DAW+ALWTI+LHK+FAAIAMGIALLRMIP+RP FS + Y+FAFAISSPIG+AIGI+IDATTQG++ADWIFA+SM
Subjt:  SILLIVALCFHSVFEGIAIGVAETEADAWKALWTISLHKVFAAIAMGIALLRMIPNRPFFSCVVYAFAFAISSPIGIAIGIIIDATTQGAVADWIFAISM

Query:  GLACGVFIYVSVNHLLSKGYTPKDAVLVDNPNYKFLAVLLGIGKGVSLPIHWPADGRKRVAPIPAPMEAKTSDALHRSVAPAIFFLRSITFSVSSTISSS
         LACGVF+YVSVNHLL+KGY P   V VD P YKFLAVL G+                                                          
Subjt:  GLACGVFIYVSVNHLLSKGYTPKDAVLVDNPNYKFLAVLLGIGKGVSLPIHWPADGRKRVAPIPAPMEAKTSDALHRSVAPAIFFLRSITFSVSSTISSS

Query:  LDFTFAEMEQQSLYAALLVRRSKSPHRHHLLTLHEAPKEAYKTSETTSRLSYRASSIEKQRIMALIQVCTPWYCIEENHAERCSLMHSVDLRGSPQSHLH
                                                                     ++A++ +C                               
Subjt:  LDFTFAEMEQQSLYAALLVRRSKSPHRHHLLTLHEAPKEAYKTSETTSRLSYRASSIEKQRIMALIQVCTPWYCIEENHAERCSLMHSVDLRGSPQSHLH

Query:  RHHNEQPESHNYCSMTSQDKFSHLALAAWWFPKFFFRSMDLGDLNKVWEVKALK-KAGEKEAKEILERIAKQVQPIMRRHKWRVKVLSEFCPKNPALLGL
                                        +    S +L DLNKVWE+KALK K  E EA++ILE++A QVQPIM R KWRVK+LSEFCP NP LLG+
Subjt:  RHHNEQPESHNYCSMTSQDKFSHLALAAWWFPKFFFRSMDLGDLNKVWEVKALK-KAGEKEAKEILERIAKQVQPIMRRHKWRVKVLSEFCPKNPALLGL

Query:  NVGRGIHVKLRLRRPNRDGDFFPFNQVLDTMLHELCHNLHGPHNASFYKLWDELRKECEELMAKGISGSAQGFDLPGRRLGGNSRQPPLSSLRKSALAAA
        NV RG+ VKLRLRR N D DF  ++++LDTMLHELCHN HGPHNASFYKLWDELRKECEELM+KGI+G+ QGFD+PG+RLGG SRQP LS LR +A  AA
Subjt:  NVGRGIHVKLRLRRPNRDGDFFPFNQVLDTMLHELCHNLHGPHNASFYKLWDELRKECEELMAKGISGSAQGFDLPGRRLGGNSRQPPLSSLRKSALAAA

Query:  EGRRRLGSLLPSGPNRLGGDSNIMVALSPIQAAAMAAERRLQDDIWCAS-SQEIPVDEECCSDFPSEAVHFSQAGKSGPSSNLGNGEDALHQKRSRDSER
        E R R G+LLPSGP RLGGDS+IM  LSPIQAAAMAAERRL DDIWC S S +   DEE  SD   E V   +   S       NG+    ++ +  S  
Subjt:  EGRRRLGSLLPSGPNRLGGDSNIMVALSPIQAAAMAAERRLQDDIWCAS-SQEIPVDEECCSDFPSEAVHFSQAGKSGPSSNLGNGEDALHQKRSRDSER

Query:  NSTNKSSNGHLKPEFVDLSKDVLIPGSTADYDAESNKRHKM-SDRVP-----FPQSCAETSSIDLPCASSNLMPSHDGTLHPGELSMWECGNCTLLNPAL
        +S   SS+     + +DL+++         ++    KR+++  D+ P      P +    SSI LP  S N   S +      E +MWEC  CTLLNP+L
Subjt:  NSTNKSSNGHLKPEFVDLSKDVLIPGSTADYDAESNKRHKM-SDRVP-----FPQSCAETSSIDLPCASSNLMPSHDGTLHPGELSMWECGNCTLLNPAL

Query:  APICELCCSQKPKDADTKYKFWSCKFCTLENSVKLEKCSACGQWRYSHGQPVSTRGPNLGT
        APICELC + KPK+ + K+K WSCKFCTLEN VKLEKC ACGQWRYS+G P+ST  PN+GT
Subjt:  APICELCCSQKPKDADTKYKFWSCKFCTLENSVKLEKCSACGQWRYSHGQPVSTRGPNLGT

SwissProt top hits (columns: e value, % identity, alignment)
P38838 DNA-dependent metalloprotease WSS1 (e value: 1.3e-22, % identity: 35.15)
Query:  KAGEKEAKEILERIAKQVQPIMRRHKWRVKVLSEFCPKNPALLGLNVGRGIHVKLRLRRPNRDGDFFPFNQVLDTMLHELCHNLHGPHNASFYKLWDELR
        K  +++A  +++ IA +V  +M+ + ++V  L EF P++  LLG+NV  G  + LRLR    +  F P   ++ TMLHEL HNL GPH+  FY   DEL 
Subjt:  KAGEKEAKEILERIAKQVQPIMRRHKWRVKVLSEFCPKNPALLGLNVGRGIHVKLRLRRPNRDGDFFPFNQVLDTMLHELCHNLHGPHNASFYKLWDELR

Query:  KECEELMAKGISGSAQGFDLPGRRLGG-----NSRQPPLSSLRKSALAAAEGRR-RLGSLLPSGPNRLGGDSNIMVALSPIQAAAMAAERRLQDDIWCAS
             +  +G+  +  G    G+RLGG     ++R P       + +    G+  +LGSL P       G S+I    SP + AA AAERR +DD WC  
Subjt:  KECEELMAKGISGSAQGFDLPGRRLGG-----NSRQPPLSSLRKSALAAAEGRR-RLGSLLPSGPNRLGGDSNIMVALSPIQAAAMAAERRLQDDIWCAS

Query:  SQ
        ++
Subjt:  SQ

Q852F6 Zinc transporter 2 (e value: 5.8e-103, % identity: 60.24)
Query:  SSAAPHSGHSDDGD-DADHTAAADGAPNLRSKPLLLVKIGCLILIFVGTFIPGISPYFFKWNDGFLVLGTQFAGGVFLGTAMMHFLSDANETFED-LTDK
        ++A  H G    GD DAD     +G P+LR++ L+  K+ CL ++F GT   G+SPYF +WND FL LGTQFAGGVFLGTAMMHFL+DANETF D L   
Subjt:  SSAAPHSGHSDDGD-DADHTAAADGAPNLRSKPLLLVKIGCLILIFVGTFIPGISPYFFKWNDGFLVLGTQFAGGVFLGTAMMHFLSDANETFED-LTDK

Query:  AYPFAFMLACVGYLMTMAADCVISHLYRK------QSADSSVGV-HGRDVELQGASTSPPKFQVPNGSSSHHPNPALTTMSSFGDSILLIVALCFHSVFE
        AYPFAFMLAC GY++TM ADC IS +  +       +A +  G+  G+     G ++ PP       +  H     L   S+ GDS+LLI ALCFHSVFE
Subjt:  AYPFAFMLACVGYLMTMAADCVISHLYRK------QSADSSVGV-HGRDVELQGASTSPPKFQVPNGSSSHHPNPALTTMSSFGDSILLIVALCFHSVFE

Query:  GIAIGVAETEADAWKALWTISLHKVFAAIAMGIALLRMIPNRPFFSCVVYAFAFAISSPIGIAIGIIIDATTQGAVADWIFAISMGLACGVFIYVSVNHL
        GIAIGVAET+ADAWKALWTISLHK+FAAIAMGIALLRM+P+RPF SC  YAFAFA+SSP+G+ IGI+IDATTQG VADWIFA+SMGLA G+FIYVS+NHL
Subjt:  GIAIGVAETEADAWKALWTISLHKVFAAIAMGIALLRMIPNRPFFSCVVYAFAFAISSPIGIAIGIIIDATTQGAVADWIFAISMGLACGVFIYVSVNHL

Query:  LSKGYTPKDAVLVDNPNYKFLAVLLGI
        LSKGYTP   V  D P  + LAV+LG+
Subjt:  LSKGYTPKDAVLVDNPNYKFLAVLLGI

Q94DG6 Zinc transporter 1 (e value: 3.6e-92, % identity: 53.53)
Query:  MARSLLLLSIFLFLVSSAAPHSGHS--DDGD-DADHTAAADGAPNLRSKPLLLVKIGCLILIFVGTFIPGISPYFFKWNDGFLVLGTQFAGGVFLGTAMM
        M  S LL+++ L    S    SGH   +DGD   D  A    +  +RSK L+ VK+ CL+++ V TF  G+SPYF++WN+ FL+LGTQFA GVFLGTA+M
Subjt:  MARSLLLLSIFLFLVSSAAPHSGHS--DDGD-DADHTAAADGAPNLRSKPLLLVKIGCLILIFVGTFIPGISPYFFKWNDGFLVLGTQFAGGVFLGTAMM

Query:  HFLSDANETFEDLTDKAYPFAFMLACVGYLMTMAADCVISHLYRKQSA----DSSVGVHGRDVELQGASTSPPKFQVPNGSSSHHPNPALTTMSSFGDSI
        HFL+D+  TF+ LT   YPF+FML CVG+L+TM +D VI+ + R+ +A    D+ V    +  + +GA  S  +      ++  HP   L   SSF D++
Subjt:  HFLSDANETFEDLTDKAYPFAFMLACVGYLMTMAADCVISHLYRKQSA----DSSVGVHGRDVELQGASTSPPKFQVPNGSSSHHPNPALTTMSSFGDSI

Query:  LLIVALCFHSVFEGIAIGVAETEADAWKALWTISLHKVFAAIAMGIALLRMIPNRPFFSCVVYAFAFAISSPIGIAIGIIIDATTQGAVADWIFAISMGL
        LLIVALCFHSVFEGIAIGV+ ++++AW+ LWTI LHK+FAA+AMGIALLRMIP RPF   VVY+ AFA+SSP+G+ IGI IDAT+QG  ADW +AISMGL
Subjt:  LLIVALCFHSVFEGIAIGVAETEADAWKALWTISLHKVFAAIAMGIALLRMIPNRPFFSCVVYAFAFAISSPIGIAIGIIIDATTQGAVADWIFAISMGL

Query:  ACGVFIYVSVNHLLSKGYTPKDAVLVDNPNYKFLAVLLGI
        A GVFIYV++NHL++KGY P      D P +KFLAVLLG+
Subjt:  ACGVFIYVSVNHLLSKGYTPKDAVLVDNPNYKFLAVLLGI

Q94EG9 Zinc transporter 11 (e value: 1.5e-106, % identity: 60.53)
Query:  MARSLLLLSIFLFLVSSAAPH-SGHSDDGDDADHTAAADGAPNLRSKPLLLVKIGCLILIFVGTFIPGISPYFFKWNDGFLVLGTQFAGGVFLGTAMMHF
        M+RSL+   +FL LV     H +G   D D+A H  ++D    L+SK L+ VKI CL++IFV TFI G+SPYF KW+ GFLVLGTQFAGGVFL TA+MHF
Subjt:  MARSLLLLSIFLFLVSSAAPH-SGHSDDGDDADHTAAADGAPNLRSKPLLLVKIGCLILIFVGTFIPGISPYFFKWNDGFLVLGTQFAGGVFLGTAMMHF

Query:  LSDANETFEDL--------TDKAYPFAFMLACVGYLMTMAADCVISHLYRKQSADSSVGVHGRDVELQGASTSPPKFQVPNGSSSHHPNPALTTMSSFGD
        LSDA+ETF  L           AYPFA+MLAC G+++TM AD VI+H+Y K            D+ELQG   S  +              + TT +S GD
Subjt:  LSDANETFEDL--------TDKAYPFAFMLACVGYLMTMAADCVISHLYRKQSADSSVGVHGRDVELQGASTSPPKFQVPNGSSSHHPNPALTTMSSFGD

Query:  SILLIVALCFHSVFEGIAIGVAETEADAWKALWTISLHKVFAAIAMGIALLRMIPNRPFFSCVVYAFAFAISSPIGIAIGIIIDATTQGAVADWIFAISM
        SILLIVALCFHSVFEGIAIG++ET++DAW+ALWTI+LHK+FAAIAMGIALLRMIP+RP FS + Y+FAFAISSPIG+AIGI+IDATTQG++ADWIFA+SM
Subjt:  SILLIVALCFHSVFEGIAIGVAETEADAWKALWTISLHKVFAAIAMGIALLRMIPNRPFFSCVVYAFAFAISSPIGIAIGIIIDATTQGAVADWIFAISM

Query:  GLACGVFIYVSVNHLLSKGYTPKDAVLVDNPNYKFLAVLLGI
         LACGVF+YVSVNHLL+KGY P   V VD P YKFLAVL G+
Subjt:  GLACGVFIYVSVNHLLSKGYTPKDAVLVDNPNYKFLAVLLGI

Q9LTH9 Zinc transporter 2 (e value: 1.1e-93, % identity: 54.28)
Query:  SLLLLSIFLFLVSSAAPHSGHSDDGDDADHTAAADGAP------NLRSKPLLLVKIGCLILIFVGTFIPGISPYFFKWNDGFLVLGTQFAGGVFLGTAMM
        +L  LSI     S    H G  DDGD+ + T     A       NLRSK L+LVKI C+I++F  TF+ G+SPYF++WN+ FL+LGTQF+GG+FL TA++
Subjt:  SLLLLSIFLFLVSSAAPHSGHSDDGDDADHTAAADGAP------NLRSKPLLLVKIGCLILIFVGTFIPGISPYFFKWNDGFLVLGTQFAGGVFLGTAMM

Query:  HFLSDANETFEDLTDKAYPFAFMLACVGYLMTMAADCVISHL---YRKQSADSSVGVHGRDVELQGASTSPPKFQVPNGSSSHHPNPALTTMSSFGDSIL
        HFLSDANETF  L  K YP+AFMLA  GY +TM AD  ++ +          +SVG    D ++  A     + ++ +G      + AL   S FGD+ L
Subjt:  HFLSDANETFEDLTDKAYPFAFMLACVGYLMTMAADCVISHL---YRKQSADSSVGVHGRDVELQGASTSPPKFQVPNGSSSHHPNPALTTMSSFGDSIL

Query:  LIVALCFHSVFEGIAIGVAETEADAWKALWTISLHKVFAAIAMGIALLRMIPNRPFFSCVVYAFAFAISSPIGIAIGIIIDATTQGAVADWIFAISMGLA
        LI ALCFHS+FEGIAIG+++T++DAW+ LWTISLHKVFAA+AMGIALL++IP RPFF  VVY+FAF ISSPIG+ IGI I+AT+QGA  DW +AISMGLA
Subjt:  LIVALCFHSVFEGIAIGVAETEADAWKALWTISLHKVFAAIAMGIALLRMIPNRPFFSCVVYAFAFAISSPIGIAIGIIIDATTQGAVADWIFAISMGLA

Query:  CGVFIYVSVNHLLSKGYTPKDAVLVDNPNYKFLAVLLGI
        CGVF+YV+VNHL+SKGY P++    D P YKF+AV LG+
Subjt:  CGVFIYVSVNHLLSKGYTPKDAVLVDNPNYKFLAVLLGI

Arabidopsis top hits (columns: e value, % identity, alignment)
AT1G55910.1 zinc transporter 11 precursor (e value: 1.1e-107, % identity: 60.53)
Query:  MARSLLLLSIFLFLVSSAAPH-SGHSDDGDDADHTAAADGAPNLRSKPLLLVKIGCLILIFVGTFIPGISPYFFKWNDGFLVLGTQFAGGVFLGTAMMHF
        M+RSL+   +FL LV     H +G   D D+A H  ++D    L+SK L+ VKI CL++IFV TFI G+SPYF KW+ GFLVLGTQFAGGVFL TA+MHF
Subjt:  MARSLLLLSIFLFLVSSAAPH-SGHSDDGDDADHTAAADGAPNLRSKPLLLVKIGCLILIFVGTFIPGISPYFFKWNDGFLVLGTQFAGGVFLGTAMMHF

Query:  LSDANETFEDL--------TDKAYPFAFMLACVGYLMTMAADCVISHLYRKQSADSSVGVHGRDVELQGASTSPPKFQVPNGSSSHHPNPALTTMSSFGD
        LSDA+ETF  L           AYPFA+MLAC G+++TM AD VI+H+Y K            D+ELQG   S  +              + TT +S GD
Subjt:  LSDANETFEDL--------TDKAYPFAFMLACVGYLMTMAADCVISHLYRKQSADSSVGVHGRDVELQGASTSPPKFQVPNGSSSHHPNPALTTMSSFGD

Query:  SILLIVALCFHSVFEGIAIGVAETEADAWKALWTISLHKVFAAIAMGIALLRMIPNRPFFSCVVYAFAFAISSPIGIAIGIIIDATTQGAVADWIFAISM
        SILLIVALCFHSVFEGIAIG++ET++DAW+ALWTI+LHK+FAAIAMGIALLRMIP+RP FS + Y+FAFAISSPIG+AIGI+IDATTQG++ADWIFA+SM
Subjt:  SILLIVALCFHSVFEGIAIGVAETEADAWKALWTISLHKVFAAIAMGIALLRMIPNRPFFSCVVYAFAFAISSPIGIAIGIIIDATTQGAVADWIFAISM

Query:  GLACGVFIYVSVNHLLSKGYTPKDAVLVDNPNYKFLAVLLGI
         LACGVF+YVSVNHLL+KGY P   V VD P YKFLAVL G+
Subjt:  GLACGVFIYVSVNHLLSKGYTPKDAVLVDNPNYKFLAVLLGI

AT1G55915.1 zinc ion binding (e value: 1.2e-124, % identity: 57.31)
Query:  SMDLGDLNKVWEVKALK-KAGEKEAKEILERIAKQVQPIMRRHKWRVKVLSEFCPKNPALLGLNVGRGIHVKLRLRRPNRDGDFFPFNQVLDTMLHELCH
        S +L DLNKVWE+KALK K  E EA++ILE++A QVQPIM R KWRVK+LSEFCP NP LLG+NV RG+ VKLRLRR N D DF  ++++LDTMLHELCH
Subjt:  SMDLGDLNKVWEVKALK-KAGEKEAKEILERIAKQVQPIMRRHKWRVKVLSEFCPKNPALLGLNVGRGIHVKLRLRRPNRDGDFFPFNQVLDTMLHELCH

Query:  NLHGPHNASFYKLWDELRKECEELMAKGISGSAQGFDLPGRRLGGNSRQPPLSSLRKSALAAAEGRRRLGSLLPSGPNRLGGDSNIMVALSPIQAAAMAA
        N HGPHNASFYKLWDELRKECEELM+KGI+G+ QGFD+PG+RLGG SRQP LS LR +A  AAE R R G+LLPSGP RLGGDS+IM  LSPIQAAAMAA
Subjt:  NLHGPHNASFYKLWDELRKECEELMAKGISGSAQGFDLPGRRLGGNSRQPPLSSLRKSALAAAEGRRRLGSLLPSGPNRLGGDSNIMVALSPIQAAAMAA

Query:  ERRLQDDIWCAS-SQEIPVDEECCSDFPSEAVHFSQAGKSGPSSNLGNGEDALHQKRSRDSERNSTNKSSNGHLKPEFVDLSKDVLIPGSTADYDAESNK
        ERRL DDIWC S S +   DEE  SD   E V   +   S       NG+    ++ +  S  +S   SS+     + +DL+++         ++    K
Subjt:  ERRLQDDIWCAS-SQEIPVDEECCSDFPSEAVHFSQAGKSGPSSNLGNGEDALHQKRSRDSERNSTNKSSNGHLKPEFVDLSKDVLIPGSTADYDAESNK

Query:  RHKM-SDRVP-----FPQSCAETSSIDLPCASSNLMPSHDGTLHPGELSMWECGNCTLLNPALAPICELCCSQKPKDADTKYKFWSCKFCTLENSVKLEK
        R++   D+ P      P +    SSI LP  S N   S +      E +MWEC  CTLLNP+LAPICELC + KPK+ + K+K WSCKFCTLEN VKLEK
Subjt:  RHKM-SDRVP-----FPQSCAETSSIDLPCASSNLMPSHDGTLHPGELSMWECGNCTLLNPALAPICELCCSQKPKDADTKYKFWSCKFCTLENSVKLEK

Query:  CSACGQWRYSHGQPVSTRGPNLGT
        C ACGQWRYS+G P+ST  PN+GT
Subjt:  CSACGQWRYSHGQPVSTRGPNLGT

AT4G19680.2 iron regulated transporter 2 (e value: 1.6e-10, % identity: 25.68)
Query:  LLLSIFLFLVS---SAAPHSGHSDDGDDADHTAAADGAPNLRSKPLLLVKIGCLILIFVGTFIPGISPYFFKW-------NDGFLVLGTQFAGGVFLGTA
        +LL +F F VS   S AP   H D G D          P +     L +KI  ++ I   + I   SP F ++        +GF+++   F+ G+ LGT 
Subjt:  LLLSIFLFLVS---SAAPHSGHSDDGDDADHTAAADGAPNLRSKPLLLVKIGCLILIFVGTFIPGISPYFFKW-------NDGFLVLGTQFAGGVFLGTA

Query:  MMHFLSDANETFEDLTDKA--------YPFAFMLACVGYLMTMAADCVISHLYRKQSADSSVGVHGRDVELQGASTSPPKFQVPNGSSSHHPNPALTTMS
         MH L D   +FE L+ K         +PFA  +A +  L+T+A D + + LY  +++   V      ++ + A        +   + SH     L T  
Subjt:  MMHFLSDANETFEDLTDKA--------YPFAFMLACVGYLMTMAADCVISHLYRKQSADSSVGVHGRDVELQGASTSPPKFQVPNGSSSHHPNPALTTMS

Query:  SFGD-------SILLIVALCFHSVFEGIAIGVAETEADAWKALWTISLHKVFAAIAMGIALLRM-IPNRPFFSCVVYAFAFAISSPIGIAIGIIIDATTQ
          G        +++L V + FHSV  G+++G           +  +  H +F  I +G  +L+    N   F   + AF F  ++P GI +GI + +  +
Subjt:  SFGD-------SILLIVALCFHSVFEGIAIGVAETEADAWKALWTISLHKVFAAIAMGIALLRM-IPNRPFFSCVVYAFAFAISSPIGIAIGIIIDATTQ

Query:  GAVADWIFAISMGLAC--GVFIYVSVNHLLS
              +  I +  AC  G+ IY+++  LL+
Subjt:  GAVADWIFAISMGLAC--GVFIYVSVNHLLS

AT4G19690.2 iron-regulated transporter 1 (e value: 3.2e-11, % identity: 25.14)
Query:  SLLLLSIFLFLVSSAAPHSGHSDDGDDADHTAAADGAPNLRSKPLLLVKIGCLILIFVGTFIPGISPYFFKWNDGFL-------VLGTQFAGGVFLGTAM
        +LL+ +IFL L+  +   S  +    +   + +A+  P +     L +K+  + +I + + I G+    F  N  FL        +   FA G+ LGT  
Subjt:  SLLLLSIFLFLVSSAAPHSGHSDDGDDADHTAAADGAPNLRSKPLLLVKIGCLILIFVGTFIPGISPYFFKWNDGFL-------VLGTQFAGGVFLGTAM

Query:  MHFLSDANE-----TFEDLTDKAYPFAFMLACVGYLMTMAADCVISHLYRKQSADSSVGV--HGRDVELQGASTSPPKFQVPNGSSSHHPNPALTTMSSF
        MH L D+ E       E+     +PF+  LA +  L+T+A D + + LY  ++A   VG+  HG      G    P              N  L      
Subjt:  MHFLSDANE-----TFEDLTDKAYPFAFMLACVGYLMTMAADCVISHLYRKQSADSSVGV--HGRDVELQGASTSPPKFQVPNGSSSHHPNPALTTMSSF

Query:  GDSILLIVALCFHSVFEGIAIGVAETEADAWKALWTISLHKVFAAIAMGIALLRM-IPNRPFFSCVVYAFAFAISSPIGIAIGIIIDATTQGAVADWIFA
          +++L + +  HSV  G+++G           +  +  H++F  + +G  +L+    N   F   V AF FA+++P GIA+GI +    Q      +  
Subjt:  GDSILLIVALCFHSVFEGIAIGVAETEADAWKALWTISLHKVFAAIAMGIALLRM-IPNRPFFSCVVYAFAFAISSPIGIAIGIIIDATTQGAVADWIFA

Query:  ISMGLAC--GVFIYVSVNHLLSKGYT-PKDAVLVDNPNYKFLAVLLGIGKGVSLPIHW
        + +  AC  G+ IY+++  LL+  +  PK    +       +A LLG G G+S+   W
Subjt:  ISMGLAC--GVFIYVSVNHLLSKGYT-PKDAVLVDNPNYKFLAVLLGIGKGVSLPIHW

AT5G59520.1 ZRT/IRT-like protein 2 (e value: 7.9e-95, % identity: 54.28)
Query:  SLLLLSIFLFLVSSAAPHSGHSDDGDDADHTAAADGAP------NLRSKPLLLVKIGCLILIFVGTFIPGISPYFFKWNDGFLVLGTQFAGGVFLGTAMM
        +L  LSI     S    H G  DDGD+ + T     A       NLRSK L+LVKI C+I++F  TF+ G+SPYF++WN+ FL+LGTQF+GG+FL TA++
Subjt:  SLLLLSIFLFLVSSAAPHSGHSDDGDDADHTAAADGAP------NLRSKPLLLVKIGCLILIFVGTFIPGISPYFFKWNDGFLVLGTQFAGGVFLGTAMM

Query:  HFLSDANETFEDLTDKAYPFAFMLACVGYLMTMAADCVISHL---YRKQSADSSVGVHGRDVELQGASTSPPKFQVPNGSSSHHPNPALTTMSSFGDSIL
        HFLSDANETF  L  K YP+AFMLA  GY +TM AD  ++ +          +SVG    D ++  A     + ++ +G      + AL   S FGD+ L
Subjt:  HFLSDANETFEDLTDKAYPFAFMLACVGYLMTMAADCVISHL---YRKQSADSSVGVHGRDVELQGASTSPPKFQVPNGSSSHHPNPALTTMSSFGDSIL

Query:  LIVALCFHSVFEGIAIGVAETEADAWKALWTISLHKVFAAIAMGIALLRMIPNRPFFSCVVYAFAFAISSPIGIAIGIIIDATTQGAVADWIFAISMGLA
        LI ALCFHS+FEGIAIG+++T++DAW+ LWTISLHKVFAA+AMGIALL++IP RPFF  VVY+FAF ISSPIG+ IGI I+AT+QGA  DW +AISMGLA
Subjt:  LIVALCFHSVFEGIAIGVAETEADAWKALWTISLHKVFAAIAMGIALLRMIPNRPFFSCVVYAFAFAISSPIGIAIGIIIDATTQGAVADWIFAISMGLA

Query:  CGVFIYVSVNHLLSKGYTPKDAVLVDNPNYKFLAVLLGI
        CGVF+YV+VNHL+SKGY P++    D P YKF+AV LG+
Subjt:  CGVFIYVSVNHLLSKGYTPKDAVLVDNPNYKFLAVLLGI


Sequences
CDS sequence
ATGGCCCGCTCTCTTCTCCTCCTCTCCATCTTCCTCTTCCTCGTCTCCTCCGCCGCCCCTCACAGCGGCCACAGCGACGACGGCGACGATGCTGACCACACCGCTGCAGC
CGACGGCGCCCCCAACCTCCGTTCCAAGCCGCTCCTGCTCGTCAAGATCGGCTGCCTGATTTTGATCTTTGTCGGCACTTTCATCCCCGGAATTTCCCCCTACTTCTTCA
AATGGAACGACGGGTTTCTTGTCCTCGGGACTCAGTTCGCCGGCGGCGTCTTCCTCGGTACTGCGATGATGCATTTCCTCAGCGATGCGAACGAGACGTTCGAGGATTTG
ACTGATAAGGCGTACCCGTTCGCGTTCATGCTCGCCTGCGTAGGGTATTTGATGACCATGGCGGCTGATTGTGTGATTTCGCATTTGTATCGCAAGCAGAGTGCTGATTC
TTCGGTCGGTGTCCATGGCCGTGACGTTGAGCTTCAAGGAGCTTCAACTTCACCCCCAAAGTTTCAGGTCCCAAATGGCAGCAGCAGCCATCATCCAAACCCAGCTCTCA
CAACAATGAGTTCATTTGGGGACAGCATCTTACTCATTGTTGCATTGTGCTTTCACTCCGTCTTCGAAGGCATCGCCATCGGTGTTGCAGAGACCGAAGCCGATGCCTGG
AAAGCCCTGTGGACGATCTCCCTCCACAAGGTCTTTGCAGCAATTGCCATGGGCATCGCACTCCTCCGCATGATCCCAAACCGCCCCTTCTTTTCATGCGTTGTTTATGC
CTTTGCTTTTGCCATCTCGAGCCCGATTGGTATCGCCATTGGAATTATAATCGACGCCACAACACAAGGGGCTGTTGCAGATTGGATCTTTGCCATCTCAATGGGGCTGG
CTTGTGGAGTGTTCATTTATGTATCAGTGAACCACTTGCTCTCAAAGGGATACACACCCAAGGATGCAGTTTTGGTTGACAATCCCAATTACAAGTTTCTTGCAGTGCTT
TTAGGCATTGGGAAGGGTGTGAGCTTACCGATCCACTGGCCTGCTGATGGGCGGAAGAGGGTGGCGCCGATACCTGCTCCAATGGAAGCAAAAACTAGTGATGCGCTACA
CCGAAGCGTAGCGCCTGCAATCTTCTTCCTGAGAAGCATAACTTTCTCGGTTTCGTCAACTATATCATCTTCTTTGGATTTTACTTTTGCGGAAATGGAGCAACAAAGTC
TATATGCTGCTCTTCTTGTGCGTCGTTCAAAAAGTCCTCATCGTCATCATCTTCTAACTTTGCATGAGGCTCCAAAAGAGGCATATAAAACATCAGAAACGACCTCAAGG
TTGTCATACCGAGCGTCCTCCATTGAAAAGCAGAGAATAATGGCTTTAATTCAAGTTTGTACTCCTTGGTACTGTATTGAAGAGAATCATGCAGAAAGATGTTCCTTGAT
GCACTCTGTAGATTTAAGAGGTAGTCCTCAGTCACATCTGCACCGGCATCACAATGAGCAACCGGAATCCCATAATTACTGTAGCATGACCAGCCAGGATAAGTTCTCCC
ACTTAGCCCTAGCCGCTTGGTGGTTCCCCAAGTTTTTCTTTCGTAGCATGGATTTGGGCGACCTTAACAAAGTTTGGGAAGTTAAAGCCCTGAAGAAGGCTGGGGAGAAA
GAAGCGAAGGAGATTCTGGAGAGAATTGCCAAACAAGTCCAACCCATTATGCGTAGACACAAATGGCGAGTCAAGGTTCTTTCAGAATTCTGCCCAAAAAATCCAGCACT
TTTAGGATTAAATGTGGGACGTGGTATTCATGTGAAATTGAGGCTTCGAAGGCCAAATAGAGATGGAGATTTCTTCCCCTTCAATCAAGTTTTGGATACAATGCTACATG
AACTTTGCCACAATCTCCATGGTCCTCACAATGCCAGTTTCTACAAGCTTTGGGATGAACTTAGAAAGGAATGTGAGGAGTTGATGGCTAAGGGAATTAGTGGCTCAGCA
CAAGGATTTGATCTCCCAGGGAGGCGTTTGGGTGGTAATTCGCGACAACCCCCGCTTTCTTCCCTCCGCAAATCTGCCCTAGCAGCTGCAGAAGGAAGAAGACGTTTGGG
ATCTCTACTTCCGTCTGGACCTAATCGGCTTGGTGGTGATAGTAACATCATGGTTGCTCTAAGTCCCATACAAGCAGCTGCAATGGCTGCAGAAAGGAGGCTACAGGATG
ATATTTGGTGTGCTTCATCTCAAGAAATTCCTGTGGATGAGGAATGTTGCTCTGATTTTCCATCAGAAGCAGTTCATTTCTCCCAAGCAGGTAAATCGGGGCCCTCTAGC
AATTTAGGTAATGGTGAAGATGCATTACACCAGAAAAGAAGTCGTGATTCAGAAAGGAATTCTACTAACAAGTCTTCCAATGGTCATTTGAAACCTGAATTTGTTGATTT
ATCCAAAGATGTTTTAATCCCTGGCTCCACTGCTGACTATGATGCAGAGTCAAATAAGCGACATAAAATGTCAGATAGAGTTCCATTTCCTCAATCTTGTGCAGAAACTA
GCTCAATAGATTTGCCCTGTGCATCATCTAATTTGATGCCAAGTCATGATGGAACTCTTCATCCTGGAGAACTTTCCATGTGGGAGTGTGGAAATTGCACCTTACTGAAT
CCAGCACTAGCTCCAATATGTGAGCTTTGTTGTTCACAGAAGCCAAAAGATGCTGATACCAAGTACAAATTCTGGTCATGTAAATTCTGCACCTTAGAAAACAGTGTGAA
GTTGGAGAAATGCTCGGCATGTGGTCAATGGAGATATTCTCATGGCCAGCCGGTGTCGACTCGAGGACCGAATCTCGGCACTTGA
mRNA sequence
ATGGCCCGCTCTCTTCTCCTCCTCTCCATCTTCCTCTTCCTCGTCTCCTCCGCCGCCCCTCACAGCGGCCACAGCGACGACGGCGACGATGCTGACCACACCGCTGCAGC
CGACGGCGCCCCCAACCTCCGTTCCAAGCCGCTCCTGCTCGTCAAGATCGGCTGCCTGATTTTGATCTTTGTCGGCACTTTCATCCCCGGAATTTCCCCCTACTTCTTCA
AATGGAACGACGGGTTTCTTGTCCTCGGGACTCAGTTCGCCGGCGGCGTCTTCCTCGGTACTGCGATGATGCATTTCCTCAGCGATGCGAACGAGACGTTCGAGGATTTG
ACTGATAAGGCGTACCCGTTCGCGTTCATGCTCGCCTGCGTAGGGTATTTGATGACCATGGCGGCTGATTGTGTGATTTCGCATTTGTATCGCAAGCAGAGTGCTGATTC
TTCGGTCGGTGTCCATGGCCGTGACGTTGAGCTTCAAGGAGCTTCAACTTCACCCCCAAAGTTTCAGGTCCCAAATGGCAGCAGCAGCCATCATCCAAACCCAGCTCTCA
CAACAATGAGTTCATTTGGGGACAGCATCTTACTCATTGTTGCATTGTGCTTTCACTCCGTCTTCGAAGGCATCGCCATCGGTGTTGCAGAGACCGAAGCCGATGCCTGG
AAAGCCCTGTGGACGATCTCCCTCCACAAGGTCTTTGCAGCAATTGCCATGGGCATCGCACTCCTCCGCATGATCCCAAACCGCCCCTTCTTTTCATGCGTTGTTTATGC
CTTTGCTTTTGCCATCTCGAGCCCGATTGGTATCGCCATTGGAATTATAATCGACGCCACAACACAAGGGGCTGTTGCAGATTGGATCTTTGCCATCTCAATGGGGCTGG
CTTGTGGAGTGTTCATTTATGTATCAGTGAACCACTTGCTCTCAAAGGGATACACACCCAAGGATGCAGTTTTGGTTGACAATCCCAATTACAAGTTTCTTGCAGTGCTT
TTAGGCATTGGGAAGGGTGTGAGCTTACCGATCCACTGGCCTGCTGATGGGCGGAAGAGGGTGGCGCCGATACCTGCTCCAATGGAAGCAAAAACTAGTGATGCGCTACA
CCGAAGCGTAGCGCCTGCAATCTTCTTCCTGAGAAGCATAACTTTCTCGGTTTCGTCAACTATATCATCTTCTTTGGATTTTACTTTTGCGGAAATGGAGCAACAAAGTC
TATATGCTGCTCTTCTTGTGCGTCGTTCAAAAAGTCCTCATCGTCATCATCTTCTAACTTTGCATGAGGCTCCAAAAGAGGCATATAAAACATCAGAAACGACCTCAAGG
TTGTCATACCGAGCGTCCTCCATTGAAAAGCAGAGAATAATGGCTTTAATTCAAGTTTGTACTCCTTGGTACTGTATTGAAGAGAATCATGCAGAAAGATGTTCCTTGAT
GCACTCTGTAGATTTAAGAGGTAGTCCTCAGTCACATCTGCACCGGCATCACAATGAGCAACCGGAATCCCATAATTACTGTAGCATGACCAGCCAGGATAAGTTCTCCC
ACTTAGCCCTAGCCGCTTGGTGGTTCCCCAAGTTTTTCTTTCGTAGCATGGATTTGGGCGACCTTAACAAAGTTTGGGAAGTTAAAGCCCTGAAGAAGGCTGGGGAGAAA
GAAGCGAAGGAGATTCTGGAGAGAATTGCCAAACAAGTCCAACCCATTATGCGTAGACACAAATGGCGAGTCAAGGTTCTTTCAGAATTCTGCCCAAAAAATCCAGCACT
TTTAGGATTAAATGTGGGACGTGGTATTCATGTGAAATTGAGGCTTCGAAGGCCAAATAGAGATGGAGATTTCTTCCCCTTCAATCAAGTTTTGGATACAATGCTACATG
AACTTTGCCACAATCTCCATGGTCCTCACAATGCCAGTTTCTACAAGCTTTGGGATGAACTTAGAAAGGAATGTGAGGAGTTGATGGCTAAGGGAATTAGTGGCTCAGCA
CAAGGATTTGATCTCCCAGGGAGGCGTTTGGGTGGTAATTCGCGACAACCCCCGCTTTCTTCCCTCCGCAAATCTGCCCTAGCAGCTGCAGAAGGAAGAAGACGTTTGGG
ATCTCTACTTCCGTCTGGACCTAATCGGCTTGGTGGTGATAGTAACATCATGGTTGCTCTAAGTCCCATACAAGCAGCTGCAATGGCTGCAGAAAGGAGGCTACAGGATG
ATATTTGGTGTGCTTCATCTCAAGAAATTCCTGTGGATGAGGAATGTTGCTCTGATTTTCCATCAGAAGCAGTTCATTTCTCCCAAGCAGGTAAATCGGGGCCCTCTAGC
AATTTAGGTAATGGTGAAGATGCATTACACCAGAAAAGAAGTCGTGATTCAGAAAGGAATTCTACTAACAAGTCTTCCAATGGTCATTTGAAACCTGAATTTGTTGATTT
ATCCAAAGATGTTTTAATCCCTGGCTCCACTGCTGACTATGATGCAGAGTCAAATAAGCGACATAAAATGTCAGATAGAGTTCCATTTCCTCAATCTTGTGCAGAAACTA
GCTCAATAGATTTGCCCTGTGCATCATCTAATTTGATGCCAAGTCATGATGGAACTCTTCATCCTGGAGAACTTTCCATGTGGGAGTGTGGAAATTGCACCTTACTGAAT
CCAGCACTAGCTCCAATATGTGAGCTTTGTTGTTCACAGAAGCCAAAAGATGCTGATACCAAGTACAAATTCTGGTCATGTAAATTCTGCACCTTAGAAAACAGTGTGAA
GTTGGAGAAATGCTCGGCATGTGGTCAATGGAGATATTCTCATGGCCAGCCGGTGTCGACTCGAGGACCGAATCTCGGCACTTGA
Protein sequence
MARSLLLLSIFLFLVSSAAPHSGHSDDGDDADHTAAADGAPNLRSKPLLLVKIGCLILIFVGTFIPGISPYFFKWNDGFLVLGTQFAGGVFLGTAMMHFLSDANETFEDL
TDKAYPFAFMLACVGYLMTMAADCVISHLYRKQSADSSVGVHGRDVELQGASTSPPKFQVPNGSSSHHPNPALTTMSSFGDSILLIVALCFHSVFEGIAIGVAETEADAW
KALWTISLHKVFAAIAMGIALLRMIPNRPFFSCVVYAFAFAISSPIGIAIGIIIDATTQGAVADWIFAISMGLACGVFIYVSVNHLLSKGYTPKDAVLVDNPNYKFLAVL
LGIGKGVSLPIHWPADGRKRVAPIPAPMEAKTSDALHRSVAPAIFFLRSITFSVSSTISSSLDFTFAEMEQQSLYAALLVRRSKSPHRHHLLTLHEAPKEAYKTSETTSR
LSYRASSIEKQRIMALIQVCTPWYCIEENHAERCSLMHSVDLRGSPQSHLHRHHNEQPESHNYCSMTSQDKFSHLALAAWWFPKFFFRSMDLGDLNKVWEVKALKKAGEK
EAKEILERIAKQVQPIMRRHKWRVKVLSEFCPKNPALLGLNVGRGIHVKLRLRRPNRDGDFFPFNQVLDTMLHELCHNLHGPHNASFYKLWDELRKECEELMAKGISGSA
QGFDLPGRRLGGNSRQPPLSSLRKSALAAAEGRRRLGSLLPSGPNRLGGDSNIMVALSPIQAAAMAAERRLQDDIWCASSQEIPVDEECCSDFPSEAVHFSQAGKSGPSS
NLGNGEDALHQKRSRDSERNSTNKSSNGHLKPEFVDLSKDVLIPGSTADYDAESNKRHKMSDRVPFPQSCAETSSIDLPCASSNLMPSHDGTLHPGELSMWECGNCTLLN
PALAPICELCCSQKPKDADTKYKFWSCKFCTLENSVKLEKCSACGQWRYSHGQPVSTRGPNLGT
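
The CDS and protein listings above should correspond under the standard genetic code: the CDS begins ATGGCCCGC... and ends in a TGA stop codon, while the protein begins MARSLL and ends in LGT. The following is a minimal sketch for checking this, not part of the database record. The names `CODON_TABLE` and `translate` are hypothetical, and the use of the standard nuclear code (NCBI translation table 1) is an assumption.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), with codons
# enumerated in T, C, A, G order. Assumed to apply to this nuclear gene.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    # Drop whitespace and line breaks so a wrapped listing can be pasted in.
    cds = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        # Codons with ambiguous bases are rendered as X.
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 18 bases of the CDS listed above.
print(translate("ATGGCCCGCTCTCTTCTC"))
```

Pasting the full CDS listing into `translate` and comparing the result with the protein listing (line breaks removed) would confirm that the two are consistent.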