; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lsi02G015460 (gene) of Bottle gourd (USVL1VR-Ls) v1 genome

Gene IDLsi02G015460
OrganismLagenaria siceraria USVL1VR-Ls (Bottle gourd (USVL1VR-Ls) v1)
DescriptionWAT1-related protein
Genome locationchr02:21316877..21323387
RNA-Seq ExpressionLsi02G015460
SyntenyLsi02G015460
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
GO:0022857 - transmembrane transporter activity (molecular function)
InterPro domainsIPR030184 - WAT1-related protein


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004142470.3 WAT1-related protein At5g40210 isoform X1 [Cucumis sativus]1.5e-9762.53Show/hide
Query:  MESGMTLSAMIMVEIMDVISTTLSKAAMSKGMNNLVF------------------------NAAPLSFSMILGFFLLGLNGSVGQIMAYTGIKYSSPALL
        ME  M  SAMIMVEIMDVI++TLSKAAMSKGMNNLVF                          APLSF MILGF LLGLNGSVGQ+MAYTGIKYSSP LL
Subjt:  MESGMTLSAMIMVEIMDVISTTLSKAAMSKGMNNLVF------------------------NAAPLSFSMILGFFLLGLNGSVGQIMAYTGIKYSSPALL

Query:  SAMSNLIPIFTFLLAVVFRMEKVDLKRSSGKAKCVGTILAVSGASLITLYKGPLLILITSSSKSSVKQEDD---HDLLLSQNSNW---------------
        SA+SNLIPIFTFLLA++FRMEKVDL+RSSGKA CVGTILAVSGASLITL+KGPLL+   SSS S  KQE+D   H +LLS +S+W               
Subjt:  SAMSNLIPIFTFLLAVVFRMEKVDLKRSSGKAKCVGTILAVSGASLITLYKGPLLILITSSSKSSVKQEDD---HDLLLSQNSNW---------------

Query:  ------------YPNKKMTNVFFFTLSMTVQTAAFTVIVEKNPTIWQLRPDIEMVTI----IFSVRKTMTLTHKSKIKGPVYVAMFKPLGMVVAIPLVVI
                    YP KK+TN+FFFTLSM VQTA F ++VEKN T W+L+PDIEMVTI    I  V +        + KGP+YV MFKPLGMVVAIPLVV 
Subjt:  ------------YPNKKMTNVFFFTLSMTVQTAAFTVIVEKNPTIWQLRPDIEMVTI----IFSVRKTMTLTHKSKIKGPVYVAMFKPLGMVVAIPLVVI

Query:  FLHEPLYLGSVMGSIVIGCGFYSVIWGQFKQLDSPIALHYTSHSQSPCAFESPSASLLLHRHS
        FLHEPLYLGSV+GSIVIGCGFY VIWGQ K+LD  + L   SHSQS    ESPSA LL H+HS
Subjt:  FLHEPLYLGSVMGSIVIGCGFYSVIWGQFKQLDSPIALHYTSHSQSPCAFESPSASLLLHRHS

XP_008458741.1 PREDICTED: WAT1-related protein At3g28050-like isoform X1 [Cucumis melo]1.3e-9863.84Show/hide
Query:  MESGMTLSAMIMVEIMDVISTTLSKAAMSKGMNNLVF------------------------NAAPLSFSMILGFFLLGLNGSVGQIMAYTGIKYSSPALL
        ME  M  SAMIMVEIMDVI++TLSKAAMSKGMNNLVF                         AAPLS  MILGF LLGLNGSVGQI+AYTGIKYSSPALL
Subjt:  MESGMTLSAMIMVEIMDVISTTLSKAAMSKGMNNLVF------------------------NAAPLSFSMILGFFLLGLNGSVGQIMAYTGIKYSSPALL

Query:  SAMSNLIPIFTFLLAVVFRMEKVDLKRSSGKAKCVGTILAVSGASLITLYKGPLLI-LITSSSKSSVKQEDDH---DLLLSQNSNW--------------
        SA+SNLIPIFTFLLA++FRMEKVDL+RSSGKA CVGTILAVSGASLITLYKGPLL+  I+SS+   +KQE+D     LL S NSNW              
Subjt:  SAMSNLIPIFTFLLAVVFRMEKVDLKRSSGKAKCVGTILAVSGASLITLYKGPLLI-LITSSSKSSVKQEDDH---DLLLSQNSNW--------------

Query:  -------------YPNKKMTNVFFFTLSMTVQTAAFTVIVEKNPTIWQLRPDIEMVTIIFS-----VRKTMTLTHKSKIKGPVYVAMFKPLGMVVAIPLV
                     YP KK+TN FFFTLSM VQTAAF ++VEKN T W+L+PDIEMVTI  S     VR  + +    + +GP+YV MFKPLGMVVAIPLV
Subjt:  -------------YPNKKMTNVFFFTLSMTVQTAAFTVIVEKNPTIWQLRPDIEMVTIIFS-----VRKTMTLTHKSKIKGPVYVAMFKPLGMVVAIPLV

Query:  VIFLHEPLYLGSVMGSIVIGCGFYSVIWGQFKQLDSPIALHYTSHSQSPCAFESPSASLLLHRHS
        V FLHEPLYLGSV+GSIVIGCGFY VIWGQ KQLD  + L  TSHSQS  AFESPSA LL  +HS
Subjt:  VIFLHEPLYLGSVMGSIVIGCGFYSVIWGQFKQLDSPIALHYTSHSQSPCAFESPSASLLLHRHS

XP_031741104.1 WAT1-related protein At3g28050 isoform X2 [Cucumis sativus]6.1e-9964.67Show/hide
Query:  MESGMTLSAMIMVEIMDVISTTLSKAAMSKGMNNLVF------------------------NAAPLSFSMILGFFLLGLNGSVGQIMAYTGIKYSSPALL
        ME  M  SAMIMVEIMDVI++TLSKAAMSKGMNNLVF                          APLSF MILGF LLGLNGSVGQ+MAYTGIKYSSP LL
Subjt:  MESGMTLSAMIMVEIMDVISTTLSKAAMSKGMNNLVF------------------------NAAPLSFSMILGFFLLGLNGSVGQIMAYTGIKYSSPALL

Query:  SAMSNLIPIFTFLLAVVFRMEKVDLKRSSGKAKCVGTILAVSGASLITLYKGPLLILITSSSKSSVKQEDD---HDLLLSQNSNW---------------
        SA+SNLIPIFTFLLA++FRMEKVDL+RSSGKA CVGTILAVSGASLITL+KGPLL+   SSS S  KQE+D   H +LLS +S+W               
Subjt:  SAMSNLIPIFTFLLAVVFRMEKVDLKRSSGKAKCVGTILAVSGASLITLYKGPLLILITSSSKSSVKQEDD---HDLLLSQNSNW---------------

Query:  YPNKKMTNVFFFTLSMTVQTAAFTVIVEKNPTIWQLRPDIEMVTI----IFSVRKTMTLTHKSKIKGPVYVAMFKPLGMVVAIPLVVIFLHEPLYLGSVM
        YP KK+TN+FFFTLSM VQTA F ++VEKN T W+L+PDIEMVTI    I  V +        + KGP+YV MFKPLGMVVAIPLVV FLHEPLYLGSV+
Subjt:  YPNKKMTNVFFFTLSMTVQTAAFTVIVEKNPTIWQLRPDIEMVTI----IFSVRKTMTLTHKSKIKGPVYVAMFKPLGMVVAIPLVVIFLHEPLYLGSVM

Query:  GSIVIGCGFYSVIWGQFKQLDSPIALHYTSHSQSPCAFESPSASLLLHRHS
        GSIVIGCGFY VIWGQ K+LD  + L   SHSQS    ESPSA LL H+HS
Subjt:  GSIVIGCGFYSVIWGQFKQLDSPIALHYTSHSQSPCAFESPSASLLLHRHS

XP_038889702.1 WAT1-related protein At3g28050-like isoform X1 [Benincasa hispida]3.8e-10165.73Show/hide
Query:  MESGMTLSAMIMVEIMDVISTTLSKAAMSKGMNNLVF------------------------NAAPLSFSMILGFFLLGLNGSVGQIMAYTGIKYSSPALL
        ME  MT +AMIMVEIMDVI++TLSKAAMSKGMN LVF                         A PLSF MILGF LLGLNGSVGQIMAYTGIKYSSP LL
Subjt:  MESGMTLSAMIMVEIMDVISTTLSKAAMSKGMNNLVF------------------------NAAPLSFSMILGFFLLGLNGSVGQIMAYTGIKYSSPALL

Query:  SAMSNLIPIFTFLLAVVFRMEKVDLKRSSGKAKCVGTILAVSGASLITLYKGPLLILITSSSKSSVKQEDDHDLLLSQNSNW------------------
        SA+SNLIPIFTFLLA +FRMEKVDL+RSSGKAKCVGTILAVSG SLITLYKGPLLI + SSSKS VKQ+DD  +LLS NSNW                  
Subjt:  SAMSNLIPIFTFLLAVVFRMEKVDLKRSSGKAKCVGTILAVSGASLITLYKGPLLILITSSSKSSVKQEDDHDLLLSQNSNW------------------

Query:  ---------YPNKKMTNVFFFTLSMTVQTAAFTVIVEKNPTIWQLRPDIEMVTIIFS-----VRKTMTLTHKSKIKGPVYVAMFKPLGMVVAIPLVVIFL
                 YP KK+TN+FFF+LSM VQTAAFT IVE NP +WQ+RPDIEMVTII S     VR  + +    + KGP+YV MFKPLGMVVAIPLVV FL
Subjt:  ---------YPNKKMTNVFFFTLSMTVQTAAFTVIVEKNPTIWQLRPDIEMVTIIFS-----VRKTMTLTHKSKIKGPVYVAMFKPLGMVVAIPLVVIFL

Query:  HEPLYLGSVMGSIVIGCGFYSVIWGQFKQLDSPIALHYTSHSQSPCAFESPSASLL
         EPLYLGSVMGSIVI CGFYSVIWGQ KQ D  I    TSHSQS  A ESPS  LL
Subjt:  HEPLYLGSVMGSIVIGCGFYSVIWGQFKQLDSPIALHYTSHSQSPCAFESPSASLL

XP_038889703.1 WAT1-related protein At3g28050-like isoform X2 [Benincasa hispida]2.8e-9666.77Show/hide
Query:  MESGMTLSAMIMVEIMDVISTTLSKAAMSKGMNNLVF--NAAPLSFSMILGFFLLGLNGSVGQIMAYTGIKYSSPALLSAMSNLIPIFTFLLAVVFRMEK
        ME  MT +AMIMVEIMDVI++TLSKAAMSKGMN LVF   +  L+  + L F LL  + SVGQIMAYTGIKYSSP LLSA+SNLIPIFTFLLA +FRMEK
Subjt:  MESGMTLSAMIMVEIMDVISTTLSKAAMSKGMNNLVF--NAAPLSFSMILGFFLLGLNGSVGQIMAYTGIKYSSPALLSAMSNLIPIFTFLLAVVFRMEK

Query:  VDLKRSSGKAKCVGTILAVSGASLITLYKGPLLILITSSSKSSVKQEDDHDLLLSQNSNW---------------------------YPNKKMTNVFFFT
        VDL+RSSGKAKCVGTILAVSG SLITLYKGPLLI + SSSKS VKQ+DD  +LLS NSNW                           YP KK+TN+FFF+
Subjt:  VDLKRSSGKAKCVGTILAVSGASLITLYKGPLLILITSSSKSSVKQEDDHDLLLSQNSNW---------------------------YPNKKMTNVFFFT

Query:  LSMTVQTAAFTVIVEKNPTIWQLRPDIEMVTIIFS-----VRKTMTLTHKSKIKGPVYVAMFKPLGMVVAIPLVVIFLHEPLYLGSVMGSIVIGCGFYSV
        LSM VQTAAFT IVE NP +WQ+RPDIEMVTII S     VR  + +    + KGP+YV MFKPLGMVVAIPLVV FL EPLYLGSVMGSIVI CGFYSV
Subjt:  LSMTVQTAAFTVIVEKNPTIWQLRPDIEMVTIIFS-----VRKTMTLTHKSKIKGPVYVAMFKPLGMVVAIPLVVIFLHEPLYLGSVMGSIVIGCGFYSV

Query:  IWGQFKQLDSPIALHYTSHSQSPCAFESPSASLL
        IWGQ KQ D  I    TSHSQS  A ESPS  LL
Subjt:  IWGQFKQLDSPIALHYTSHSQSPCAFESPSASLL

TrEMBL top hitse value%identityAlignment
A0A0A0KRZ3 WAT1-related protein3.4e-9562.5Show/hide
Query:  MVEIMDVISTTLSKAAMSKGMNNLVF------------------------NAAPLSFSMILGFFLLGLNGSVGQIMAYTGIKYSSPALLSAMSNLIPIFT
        MVEIMDVI++TLSKAAMSKGMNNLVF                          APLSF MILGF LLGLNGSVGQ+MAYTGIKYSSP LLSA+SNLIPIFT
Subjt:  MVEIMDVISTTLSKAAMSKGMNNLVF------------------------NAAPLSFSMILGFFLLGLNGSVGQIMAYTGIKYSSPALLSAMSNLIPIFT

Query:  FLLAVVFRMEKVDLKRSSGKAKCVGTILAVSGASLITLYKGPLLILITSSSKSSVKQEDD---HDLLLSQNSNW--------------------------
        FLLA++FRMEKVDL+RSSGKA CVGTILAVSGASLITL+KGPLL+   SSS S  KQE+D   H +LLS +S+W                          
Subjt:  FLLAVVFRMEKVDLKRSSGKAKCVGTILAVSGASLITLYKGPLLILITSSSKSSVKQEDD---HDLLLSQNSNW--------------------------

Query:  -YPNKKMTNVFFFTLSMTVQTAAFTVIVEKNPTIWQLRPDIEMVTI----IFSVRKTMTLTHKSKIKGPVYVAMFKPLGMVVAIPLVVIFLHEPLYLGSV
         YP KK+TN+FFFTLSM VQTA F ++VEKN T W+L+PDIEMVTI    I  V +        + KGP+YV MFKPLGMVVAIPLVV FLHEPLYLGSV
Subjt:  -YPNKKMTNVFFFTLSMTVQTAAFTVIVEKNPTIWQLRPDIEMVTI----IFSVRKTMTLTHKSKIKGPVYVAMFKPLGMVVAIPLVVIFLHEPLYLGSV

Query:  MGSIVIGCGFYSVIWGQFKQLDSPIALHYTSHSQSPCAFESPSASLLLHRHS
        +GSIVIGCGFY VIWGQ K+LD  + L   SHSQS    ESPSA LL H+HS
Subjt:  MGSIVIGCGFYSVIWGQFKQLDSPIALHYTSHSQSPCAFESPSASLLLHRHS

A0A1S3C948 WAT1-related protein6.5e-9963.84Show/hide
Query:  MESGMTLSAMIMVEIMDVISTTLSKAAMSKGMNNLVF------------------------NAAPLSFSMILGFFLLGLNGSVGQIMAYTGIKYSSPALL
        ME  M  SAMIMVEIMDVI++TLSKAAMSKGMNNLVF                         AAPLS  MILGF LLGLNGSVGQI+AYTGIKYSSPALL
Subjt:  MESGMTLSAMIMVEIMDVISTTLSKAAMSKGMNNLVF------------------------NAAPLSFSMILGFFLLGLNGSVGQIMAYTGIKYSSPALL

Query:  SAMSNLIPIFTFLLAVVFRMEKVDLKRSSGKAKCVGTILAVSGASLITLYKGPLLI-LITSSSKSSVKQEDDH---DLLLSQNSNW--------------
        SA+SNLIPIFTFLLA++FRMEKVDL+RSSGKA CVGTILAVSGASLITLYKGPLL+  I+SS+   +KQE+D     LL S NSNW              
Subjt:  SAMSNLIPIFTFLLAVVFRMEKVDLKRSSGKAKCVGTILAVSGASLITLYKGPLLI-LITSSSKSSVKQEDDH---DLLLSQNSNW--------------

Query:  -------------YPNKKMTNVFFFTLSMTVQTAAFTVIVEKNPTIWQLRPDIEMVTIIFS-----VRKTMTLTHKSKIKGPVYVAMFKPLGMVVAIPLV
                     YP KK+TN FFFTLSM VQTAAF ++VEKN T W+L+PDIEMVTI  S     VR  + +    + +GP+YV MFKPLGMVVAIPLV
Subjt:  -------------YPNKKMTNVFFFTLSMTVQTAAFTVIVEKNPTIWQLRPDIEMVTIIFS-----VRKTMTLTHKSKIKGPVYVAMFKPLGMVVAIPLV

Query:  VIFLHEPLYLGSVMGSIVIGCGFYSVIWGQFKQLDSPIALHYTSHSQSPCAFESPSASLLLHRHS
        V FLHEPLYLGSV+GSIVIGCGFY VIWGQ KQLD  + L  TSHSQS  AFESPSA LL  +HS
Subjt:  VIFLHEPLYLGSVMGSIVIGCGFYSVIWGQFKQLDSPIALHYTSHSQSPCAFESPSASLLLHRHS

A0A6J1BWJ6 WAT1-related protein2.7e-7649.86Show/hide
Query:  MESGMTLSAMIMVEIMDVISTTLSKAAMSKGMNNLVF----------------------NAAPLSFSMILGFFLLGLNGSVGQIMAYTGIKYSSPALLSA
        + +G+   AMIMVE  DV+++TLSKAAM+KG++NLV                          PLS S+ILGFFLLG  GSVGQ+++YTGIKYSSPAL SA
Subjt:  MESGMTLSAMIMVEIMDVISTTLSKAAMSKGMNNLVF----------------------NAAPLSFSMILGFFLLGLNGSVGQIMAYTGIKYSSPALLSA

Query:  MSNLIPIFTFLLAVVFRMEKVDLKRSSGKAKCVGTILAVSGASLITLYKGPLLILITSSSKSSVKQEDDHDLLLSQNSNW--------------------
        M NLIPI TFLLAVVFRME+ DLK +S KAKCVGTIL V+GAS++TLYKGP+LI+  SSS SS        +   Q SNW                    
Subjt:  MSNLIPIFTFLLAVVFRMEKVDLKRSSGKAKCVGTILAVSGASLITLYKGPLLILITSSSKSSVKQEDDHDLLLSQNSNW--------------------

Query:  -------YPNKKMTNVFFFTLSMTVQTAAFTVIVEKNPTIWQLRPDIEMVTIIF-----SVRKTMTLTHKSKIKGPVYVAMFKPLGMVVAIPLVVIFLHE
               YP KKMTNVFFF   +T+QTAAF V++E +P  WQ+RPDI+M+ I+F     SV +    T   + KGP+YV MFKPLGMV+A+     FLH+
Subjt:  -------YPNKKMTNVFFFTLSMTVQTAAFTVIVEKNPTIWQLRPDIEMVTIIF-----SVRKTMTLTHKSKIKGPVYVAMFKPLGMVVAIPLVVIFLHE

Query:  PLYLGSVMGSIVIGCGFYSVIWGQFKQLDSPIALHYTSHSQSPCAFESPSASLLLHRHS
         L+LGSVMGS+VIGCGFY+V+WGQ K+           H+  P   E+ S+S  L  HS
Subjt:  PLYLGSVMGSIVIGCGFYSVIWGQFKQLDSPIALHYTSHSQSPCAFESPSASLLLHRHS

A0A6J1FGV1 WAT1-related protein2.4e-8556.67Show/hide
Query:  MESGMTLSAMIMVEIMDVISTTLSKAAMSKGMNNLVF----NA-------------------APLSFSMILGFFLLGLNGSVGQIMAYTGIKYSSPALLS
        ME+ M  +AMIMVE  DVI +TL K AM+KGMNNLVF    NA                   APLSFSMIL FFLLGLNGSVG+++A TGI YSSP LLS
Subjt:  MESGMTLSAMIMVEIMDVISTTLSKAAMSKGMNNLVF----NA-------------------APLSFSMILGFFLLGLNGSVGQIMAYTGIKYSSPALLS

Query:  AMSNLIPIFTFLLAVVFRMEKVDLKRSSGKAKCVGTILAVSGASLITLYKGPLLILITSSSKSSVKQEDDHDLLLSQNSNW-------------------
        AM+NLIPIFT  LAV+FRME++D KRSSGKAKC+GTI+AVSGA LITLYKGP+LI+   SS  SV QE    + LSQ  NW                   
Subjt:  AMSNLIPIFTFLLAVVFRMEKVDLKRSSGKAKCVGTILAVSGASLITLYKGPLLILITSSSKSSVKQEDDHDLLLSQNSNW-------------------

Query:  --------YPNKKMTNVFFFTLSMTVQTAAFTVIVEKNPTIWQLRPDIEMVTIIF-----SVRKTMTLTHKSKIKGPVYVAMFKPLGMVVAIPLVVIFLH
                YPNKK+T+VFFFT  +TVQTAAF V ++ NPT+WQ+RPDIEMVTI+F     S+ +T       + KGPV+VAMFKPLGMV+A+ L V FL 
Subjt:  --------YPNKKMTNVFFFTLSMTVQTAAFTVIVEKNPTIWQLRPDIEMVTIIF-----SVRKTMTLTHKSKIKGPVYVAMFKPLGMVVAIPLVVIFLH

Query:  EPLYLGSVMGSIVIGCGFYSVIWGQFKQLDSPIALHYTSHSQSPCAFESPSASLLLHRHS
        E L LGSV+GS+VIGCGFYSVIWGQ KQL+  + L        P   ESPSASLL H  S
Subjt:  EPLYLGSVMGSIVIGCGFYSVIWGQFKQLDSPIALHYTSHSQSPCAFESPSASLLLHRHS

A0A6J1FNQ6 WAT1-related protein2.4e-8556.67Show/hide
Query:  MESGMTLSAMIMVEIMDVISTTLSKAAMSKGMNNLVF----NA-------------------APLSFSMILGFFLLGLNGSVGQIMAYTGIKYSSPALLS
        ME+ M  +AMIMVE  DVI +TL K AM+KGMNNLVF    NA                   APLSFSMIL FFLLGLNGSVG+++A TGI YSSP LLS
Subjt:  MESGMTLSAMIMVEIMDVISTTLSKAAMSKGMNNLVF----NA-------------------APLSFSMILGFFLLGLNGSVGQIMAYTGIKYSSPALLS

Query:  AMSNLIPIFTFLLAVVFRMEKVDLKRSSGKAKCVGTILAVSGASLITLYKGPLLILITSSSKSSVKQEDDHDLLLSQNSNW-------------------
        AM+NLIPIFT  LAV+FRME++D KRSSGKAKC+GTI+AVSGA LITLYKGP+LI+   SS  SV QE    + LSQ  NW                   
Subjt:  AMSNLIPIFTFLLAVVFRMEKVDLKRSSGKAKCVGTILAVSGASLITLYKGPLLILITSSSKSSVKQEDDHDLLLSQNSNW-------------------

Query:  --------YPNKKMTNVFFFTLSMTVQTAAFTVIVEKNPTIWQLRPDIEMVTIIF-----SVRKTMTLTHKSKIKGPVYVAMFKPLGMVVAIPLVVIFLH
                YPNKK+T+VFFFT  +TVQTAAF V ++ NPT+WQ+RPDIEMVTI+F     S+ +T       + KGPV+VAMFKPLGMV+A+ L V FL 
Subjt:  --------YPNKKMTNVFFFTLSMTVQTAAFTVIVEKNPTIWQLRPDIEMVTIIF-----SVRKTMTLTHKSKIKGPVYVAMFKPLGMVVAIPLVVIFLH

Query:  EPLYLGSVMGSIVIGCGFYSVIWGQFKQLDSPIALHYTSHSQSPCAFESPSASLLLHRHS
        E L LGSV+GS+VIGCGFYSVIWGQ KQL+  + L        P   ESPSASLL H  S
Subjt:  EPLYLGSVMGSIVIGCGFYSVIWGQFKQLDSPIALHYTSHSQSPCAFESPSASLLLHRHS

SwissProt top hitse value%identityAlignment
F4JK59 WAT1-related protein At4g155408.9e-3737.01Show/hide
Query:  SAMIMVEIMDVISTTLSKAAMSKGMNNLVF--------NAAPLSFSMILG---------------FFLLGLNGSVGQIMAYTGIKYSSPALLSAMSNLIP
        +AMI +E   V S+ L KAA  +G +  VF            L  S+I G                FLL L G   ++    GI+YSSP L SA+SNL P
Subjt:  SAMIMVEIMDVISTTLSKAAMSKGMNNLVF--------NAAPLSFSMILG---------------FFLLGLNGSVGQIMAYTGIKYSSPALLSAMSNLIP

Query:  IFTFLLAVVFRMEKVDLKRSSGKAKCVGTILAVSGASLITLYKGPLLILITSSSKSSVKQEDDHDLLLSQN----SNW----------YPNKKMTNVFFF
         FTF+LA+ FRME+V L+ S+ +AK +GTI+++SGA +I LYKGP L L+ +S  S         LLL       S W          YP +++  VF +
Subjt:  IFTFLLAVVFRMEKVDLKRSSGKAKCVGTILAVSGASLITLYKGPLLILITSSSKSSVKQEDDHDLLLSQN----SNW----------YPNKKMTNVFFF

Query:  TLSMTVQTAAFTVIVEKNPTIWQLRPDIEMVTIIFSVRKTMTL-----THKSKIKGPVYVAMFKPLGMVVAIPLVVIFLHEPLYLGSVMGSIVIGCGFYS
         L  T+ +    ++VEK+   WQL+P   + ++I+S     +L     T    +KGPVY+++FKPL + +A+ +  IFL + L+LGSV+GS+++  GFY+
Subjt:  TLSMTVQTAAFTVIVEKNPTIWQLRPDIEMVTIIFSVRKTMTL-----THKSKIKGPVYVAMFKPLGMVVAIPLVVIFLHEPLYLGSVMGSIVIGCGFYS

Query:  VIWGQFKQ
        VIWG+ ++
Subjt:  VIWGQFKQ

Q945L4 WAT1-related protein At5g402101.5e-3937.9Show/hide
Query:  GMTLSAMIMVEIMDVISTTLSKAAMSKGMNNLVF-----------------------NAAPLSFSMILGFFLLGLNGSVGQIMAYTGIKYSSPALLSAMS
        G  L+AM++ E  +V   TL KAA SKG++  V                        +  PL+FS++    +LGL  S  QI+ Y GIKYSSP L SAMS
Subjt:  GMTLSAMIMVEIMDVISTTLSKAAMSKGMNNLVF-----------------------NAAPLSFSMILGFFLLGLNGSVGQIMAYTGIKYSSPALLSAMS

Query:  NLIPIFTFLLAVVFRMEKVDLKRSSGKAKCVGTILAVSGASLITLYKGPLL------------ILITSSSKSSVKQEDDHDLLLSQNSNWYPNKKMTNVF
        N+ P FTF+LAVVFRME + L + S  AK +GTIL++ GA ++TLY GP+L            +L       SV       L+++     YP+  +  + 
Subjt:  NLIPIFTFLLAVVFRMEKVDLKRSSGKAKCVGTILAVSGASLITLYKGPLL------------ILITSSSKSSVKQEDDHDLLLSQNSNWYPNKKMTNVF

Query:  FFTLSMTVQTAAFTVIVEK-NPTIWQLRPDIEMVTII--------FSVRKTMTLTHKSKIKGPVYVAMFKPLGMVVAIPLVVIFLHEPLYLGSVMGSIVI
           + + V  A  +++ EK NP  W +R DI ++T++        + V  T  ++H    KGPVY++MFKPL +++A     IFL E LYLGSVMG I+I
Subjt:  FFTLSMTVQTAAFTVIVEK-NPTIWQLRPDIEMVTII--------FSVRKTMTLTHKSKIKGPVYVAMFKPLGMVVAIPLVVIFLHEPLYLGSVMGSIVI

Query:  GCGFYSVIWGQFKQ
          GFY V+WG+ K+
Subjt:  GCGFYSVIWGQFKQ

Q94JU2 WAT1-related protein At3g280505.4e-4236.05Show/hide
Query:  MTLSAMIMVEIMDVISTTLSKAAMSKGMNNLVF-----------------------NAAPLSFSMILGFFLLGLNGSVGQIMAYTGIKYSSPALLSAMSN
        + ++A++++E  +V   TL KAA  KGM+  VF                          P++FS++    LLG+ G    IM YTGI YSSP L SA+SN
Subjt:  MTLSAMIMVEIMDVISTTLSKAAMSKGMNNLVF-----------------------NAAPLSFSMILGFFLLGLNGSVGQIMAYTGIKYSSPALLSAMSN

Query:  LIPIFTFLLAVVFRMEKVDLKRSSGKAKCVGTILAVSGASLITLYKGPLLILITSSSKSSVKQEDDHDLLL-------------------SQNSNWYPNK
        L P FTFLLAVVFRME V  KR+S  AK +GT++++ GA ++TLY GP++I  +  S S   Q  + + +L                   +Q    YP  
Subjt:  LIPIFTFLLAVVFRMEKVDLKRSSGKAKCVGTILAVSGASLITLYKGPLLILITSSSKSSVKQEDDHDLLL-------------------SQNSNWYPNK

Query:  KMTNVFFFTLSMTVQTAAFTVIVEKNPT-IWQLRPDIEMVTIIFS------VRKTMTLTHKSKIKGPVYVAMFKPLGMVVAIPLVVIFLHEPLYLGSVMG
        + T V F+++ ++  TA  T+  E N    W+++P+I +V+I+ S      +  T+  T   +IKGP++VAMFKPL + +A+ + VIFL + LY+GS++G
Subjt:  KMTNVFFFTLSMTVQTAAFTVIVEKNPT-IWQLRPDIEMVTIIFS------VRKTMTLTHKSKIKGPVYVAMFKPLGMVVAIPLVVIFLHEPLYLGSVMG

Query:  SIVIGCGFYSVIWGQFKQLDSPIALHYTSHSQSPCA-FESPSAS
        + VI  GFY+V+WG+ K++      +  +H ++  A  +SPS S
Subjt:  SIVIGCGFYSVIWGQFKQLDSPIALHYTSHSQSPCA-FESPSAS

Q9FL08 WAT1-related protein At5g402401.1e-3937.07Show/hide
Query:  SAMIMVEIMDVISTTLSKAAMSKGMNNLVF--------NAAPLSFSMILG---------------FFLLGLNGSVGQIMAYTGIKYSSPALLSAMSNLIP
        +AM  VE   V S TL KAA  +G++  VF            L  S+I G                FLLGL G + QI    GI YSSP L SA+SNL P
Subjt:  SAMIMVEIMDVISTTLSKAAMSKGMNNLVF--------NAAPLSFSMILG---------------FFLLGLNGSVGQIMAYTGIKYSSPALLSAMSNLIP

Query:  IFTFLLAVVFRMEKVDLKRSSGKAKCVGTILAVSGASLITLYKGPLLILITSSSKSSVKQEDDHDLLLSQNSNW--------------------------
         FTF LAV+FRME+V L+ S+ +AK +G IL++SGA ++ LYKGP  +L ++S  + +     H  L S  S+W                          
Subjt:  IFTFLLAVVFRMEKVDLKRSSGKAKCVGTILAVSGASLITLYKGPLLILITSSSKSSVKQEDDHDLLLSQNSNW--------------------------

Query:  -YPNKKMTNVFFFTLSMTVQTAAFTVIVEKNPTIWQLRPDIEMVTIIF-----SVRKTMTLTHKSKIKGPVYVAMFKPLGMVVAIPLVVIFLHEPLYLGS
         YP +++T VFF+ L  T+ +    +  E N T W L+PDI +  II+     S+   +T T    +KGPVY+++F+PL + +A+ +  IFL + L+LGS
Subjt:  -YPNKKMTNVFFFTLSMTVQTAAFTVIVEKNPTIWQLRPDIEMVTIIF-----SVRKTMTLTHKSKIKGPVYVAMFKPLGMVVAIPLVVIFLHEPLYLGS

Query:  VMGSIVIGCGFYSVIWGQFKQ
        V+GS+++  GFY+VIWG+ ++
Subjt:  VMGSIVIGCGFYSVIWGQFKQ

Q9LRS5 WAT1-related protein At3g281008.9e-3734.98Show/hide
Query:  LSAMIMVEIMDVISTTLSKAAMSKGMNNLVF-----------------------NAAPLSFSMILGFFLLGLNGSVGQIMAYTGIKYSSPALLSAMSNLI
        L+AM+  E   V  +TL K A SKG+N   F                       +  PLS S++    LLGL GS+  I  Y GI+YSSP L SA+SN+ 
Subjt:  LSAMIMVEIMDVISTTLSKAAMSKGMNNLVF-----------------------NAAPLSFSMILGFFLLGLNGSVGQIMAYTGIKYSSPALLSAMSNLI

Query:  PIFTFLLAVVFRMEKVDLKRSSGKAKCVGTILAVSGASLITLYKGPLLILITSSSKSSVKQEDDHDLLLSQNSNW-------------------------
        P  TF+LA++FRMEKV  K  S  AK +GTIL++ GA ++ LY GP + + +S    + +Q      L S NS+W                         
Subjt:  PIFTFLLAVVFRMEKVDLKRSSGKAKCVGTILAVSGASLITLYKGPLLILITSSSKSSVKQEDDHDLLLSQNSNW-------------------------

Query:  --YPNKKMTNVFFFTLSMTVQTAAFTVIVEK-NPTIWQLRPDIEMVTI--------IFSVRKTMTLTHKSKIKGPVYVAMFKPLGMVVAIPLVVIFLHEP
          YP    T  F + +S+++ T+   ++VEK NP++W +R DI ++TI        ++ V  + T+ H    KGP+Y+A+FKPL +++A+ +  +FL++ 
Subjt:  --YPNKKMTNVFFFTLSMTVQTAAFTVIVEK-NPTIWQLRPDIEMVTI--------IFSVRKTMTLTHKSKIKGPVYVAMFKPLGMVVAIPLVVIFLHEP

Query:  LYLGSVMGSIVIGCGFYSVIWGQ
        LYLG ++G ++I  GFY+V+WG+
Subjt:  LYLGSVMGSIVIGCGFYSVIWGQ

Arabidopsis top hitse value%identityAlignment
AT3G28050.1 nodulin MtN21 /EamA-like transporter family protein3.8e-4336.05Show/hide
Query:  MTLSAMIMVEIMDVISTTLSKAAMSKGMNNLVF-----------------------NAAPLSFSMILGFFLLGLNGSVGQIMAYTGIKYSSPALLSAMSN
        + ++A++++E  +V   TL KAA  KGM+  VF                          P++FS++    LLG+ G    IM YTGI YSSP L SA+SN
Subjt:  MTLSAMIMVEIMDVISTTLSKAAMSKGMNNLVF-----------------------NAAPLSFSMILGFFLLGLNGSVGQIMAYTGIKYSSPALLSAMSN

Query:  LIPIFTFLLAVVFRMEKVDLKRSSGKAKCVGTILAVSGASLITLYKGPLLILITSSSKSSVKQEDDHDLLL-------------------SQNSNWYPNK
        L P FTFLLAVVFRME V  KR+S  AK +GT++++ GA ++TLY GP++I  +  S S   Q  + + +L                   +Q    YP  
Subjt:  LIPIFTFLLAVVFRMEKVDLKRSSGKAKCVGTILAVSGASLITLYKGPLLILITSSSKSSVKQEDDHDLLL-------------------SQNSNWYPNK

Query:  KMTNVFFFTLSMTVQTAAFTVIVEKNPT-IWQLRPDIEMVTIIFS------VRKTMTLTHKSKIKGPVYVAMFKPLGMVVAIPLVVIFLHEPLYLGSVMG
        + T V F+++ ++  TA  T+  E N    W+++P+I +V+I+ S      +  T+  T   +IKGP++VAMFKPL + +A+ + VIFL + LY+GS++G
Subjt:  KMTNVFFFTLSMTVQTAAFTVIVEKNPT-IWQLRPDIEMVTIIFS------VRKTMTLTHKSKIKGPVYVAMFKPLGMVVAIPLVVIFLHEPLYLGSVMG

Query:  SIVIGCGFYSVIWGQFKQLDSPIALHYTSHSQSPCA-FESPSAS
        + VI  GFY+V+WG+ K++      +  +H ++  A  +SPS S
Subjt:  SIVIGCGFYSVIWGQFKQLDSPIALHYTSHSQSPCA-FESPSAS

AT3G28100.1 nodulin MtN21 /EamA-like transporter family protein6.4e-3834.98Show/hide
Query:  LSAMIMVEIMDVISTTLSKAAMSKGMNNLVF-----------------------NAAPLSFSMILGFFLLGLNGSVGQIMAYTGIKYSSPALLSAMSNLI
        L+AM+  E   V  +TL K A SKG+N   F                       +  PLS S++    LLGL GS+  I  Y GI+YSSP L SA+SN+ 
Subjt:  LSAMIMVEIMDVISTTLSKAAMSKGMNNLVF-----------------------NAAPLSFSMILGFFLLGLNGSVGQIMAYTGIKYSSPALLSAMSNLI

Query:  PIFTFLLAVVFRMEKVDLKRSSGKAKCVGTILAVSGASLITLYKGPLLILITSSSKSSVKQEDDHDLLLSQNSNW-------------------------
        P  TF+LA++FRMEKV  K  S  AK +GTIL++ GA ++ LY GP + + +S    + +Q      L S NS+W                         
Subjt:  PIFTFLLAVVFRMEKVDLKRSSGKAKCVGTILAVSGASLITLYKGPLLILITSSSKSSVKQEDDHDLLLSQNSNW-------------------------

Query:  --YPNKKMTNVFFFTLSMTVQTAAFTVIVEK-NPTIWQLRPDIEMVTI--------IFSVRKTMTLTHKSKIKGPVYVAMFKPLGMVVAIPLVVIFLHEP
          YP    T  F + +S+++ T+   ++VEK NP++W +R DI ++TI        ++ V  + T+ H    KGP+Y+A+FKPL +++A+ +  +FL++ 
Subjt:  --YPNKKMTNVFFFTLSMTVQTAAFTVIVEK-NPTIWQLRPDIEMVTI--------IFSVRKTMTLTHKSKIKGPVYVAMFKPLGMVVAIPLVVIFLHEP

Query:  LYLGSVMGSIVIGCGFYSVIWGQ
        LYLG ++G ++I  GFY+V+WG+
Subjt:  LYLGSVMGSIVIGCGFYSVIWGQ

AT5G40210.1 nodulin MtN21 /EamA-like transporter family protein1.0e-4037.9Show/hide
Query:  GMTLSAMIMVEIMDVISTTLSKAAMSKGMNNLVF-----------------------NAAPLSFSMILGFFLLGLNGSVGQIMAYTGIKYSSPALLSAMS
        G  L+AM++ E  +V   TL KAA SKG++  V                        +  PL+FS++    +LGL  S  QI+ Y GIKYSSP L SAMS
Subjt:  GMTLSAMIMVEIMDVISTTLSKAAMSKGMNNLVF-----------------------NAAPLSFSMILGFFLLGLNGSVGQIMAYTGIKYSSPALLSAMS

Query:  NLIPIFTFLLAVVFRMEKVDLKRSSGKAKCVGTILAVSGASLITLYKGPLL------------ILITSSSKSSVKQEDDHDLLLSQNSNWYPNKKMTNVF
        N+ P FTF+LAVVFRME + L + S  AK +GTIL++ GA ++TLY GP+L            +L       SV       L+++     YP+  +  + 
Subjt:  NLIPIFTFLLAVVFRMEKVDLKRSSGKAKCVGTILAVSGASLITLYKGPLL------------ILITSSSKSSVKQEDDHDLLLSQNSNWYPNKKMTNVF

Query:  FFTLSMTVQTAAFTVIVEK-NPTIWQLRPDIEMVTII--------FSVRKTMTLTHKSKIKGPVYVAMFKPLGMVVAIPLVVIFLHEPLYLGSVMGSIVI
           + + V  A  +++ EK NP  W +R DI ++T++        + V  T  ++H    KGPVY++MFKPL +++A     IFL E LYLGSVMG I+I
Subjt:  FFTLSMTVQTAAFTVIVEK-NPTIWQLRPDIEMVTII--------FSVRKTMTLTHKSKIKGPVYVAMFKPLGMVVAIPLVVIFLHEPLYLGSVMGSIVI

Query:  GCGFYSVIWGQFKQ
          GFY V+WG+ K+
Subjt:  GCGFYSVIWGQFKQ

AT5G40240.1 nodulin MtN21 /EamA-like transporter family protein8.0e-4137.07Show/hide
Query:  SAMIMVEIMDVISTTLSKAAMSKGMNNLVF--------NAAPLSFSMILG---------------FFLLGLNGSVGQIMAYTGIKYSSPALLSAMSNLIP
        +AM  VE   V S TL KAA  +G++  VF            L  S+I G                FLLGL G + QI    GI YSSP L SA+SNL P
Subjt:  SAMIMVEIMDVISTTLSKAAMSKGMNNLVF--------NAAPLSFSMILG---------------FFLLGLNGSVGQIMAYTGIKYSSPALLSAMSNLIP

Query:  IFTFLLAVVFRMEKVDLKRSSGKAKCVGTILAVSGASLITLYKGPLLILITSSSKSSVKQEDDHDLLLSQNSNW--------------------------
         FTF LAV+FRME+V L+ S+ +AK +G IL++SGA ++ LYKGP  +L ++S  + +     H  L S  S+W                          
Subjt:  IFTFLLAVVFRMEKVDLKRSSGKAKCVGTILAVSGASLITLYKGPLLILITSSSKSSVKQEDDHDLLLSQNSNW--------------------------

Query:  -YPNKKMTNVFFFTLSMTVQTAAFTVIVEKNPTIWQLRPDIEMVTIIF-----SVRKTMTLTHKSKIKGPVYVAMFKPLGMVVAIPLVVIFLHEPLYLGS
         YP +++T VFF+ L  T+ +    +  E N T W L+PDI +  II+     S+   +T T    +KGPVY+++F+PL + +A+ +  IFL + L+LGS
Subjt:  -YPNKKMTNVFFFTLSMTVQTAAFTVIVEKNPTIWQLRPDIEMVTIIF-----SVRKTMTLTHKSKIKGPVYVAMFKPLGMVVAIPLVVIFLHEPLYLGS

Query:  VMGSIVIGCGFYSVIWGQFKQ
        V+GS+++  GFY+VIWG+ ++
Subjt:  VMGSIVIGCGFYSVIWGQFKQ

AT5G40240.2 nodulin MtN21 /EamA-like transporter family protein8.0e-4137.07Show/hide
Query:  SAMIMVEIMDVISTTLSKAAMSKGMNNLVF--------NAAPLSFSMILG---------------FFLLGLNGSVGQIMAYTGIKYSSPALLSAMSNLIP
        +AM  VE   V S TL KAA  +G++  VF            L  S+I G                FLLGL G + QI    GI YSSP L SA+SNL P
Subjt:  SAMIMVEIMDVISTTLSKAAMSKGMNNLVF--------NAAPLSFSMILG---------------FFLLGLNGSVGQIMAYTGIKYSSPALLSAMSNLIP

Query:  IFTFLLAVVFRMEKVDLKRSSGKAKCVGTILAVSGASLITLYKGPLLILITSSSKSSVKQEDDHDLLLSQNSNW--------------------------
         FTF LAV+FRME+V L+ S+ +AK +G IL++SGA ++ LYKGP  +L ++S  + +     H  L S  S+W                          
Subjt:  IFTFLLAVVFRMEKVDLKRSSGKAKCVGTILAVSGASLITLYKGPLLILITSSSKSSVKQEDDHDLLLSQNSNW--------------------------

Query:  -YPNKKMTNVFFFTLSMTVQTAAFTVIVEKNPTIWQLRPDIEMVTIIF-----SVRKTMTLTHKSKIKGPVYVAMFKPLGMVVAIPLVVIFLHEPLYLGS
         YP +++T VFF+ L  T+ +    +  E N T W L+PDI +  II+     S+   +T T    +KGPVY+++F+PL + +A+ +  IFL + L+LGS
Subjt:  -YPNKKMTNVFFFTLSMTVQTAAFTVIVEKNPTIWQLRPDIEMVTIIF-----SVRKTMTLTHKSKIKGPVYVAMFKPLGMVVAIPLVVIFLHEPLYLGS

Query:  VMGSIVIGCGFYSVIWGQFKQ
        V+GS+++  GFY+VIWG+ ++
Subjt:  VMGSIVIGCGFYSVIWGQFKQ


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAGAGTGGCATGACGTTGAGTGCGATGATAATGGTGGAAATTATGGATGTTATCTCAACAACACTAAGCAAAGCCGCCATGTCCAAAGGAATGAACAACTTGGTTTT
CAATGCGGCTCCTCTATCTTTCTCTATGATCCTTGGCTTTTTTCTCCTAGGGCTTAACGGGAGTGTGGGACAGATAATGGCATATACAGGCATAAAGTACAGTTCTCCAG
CTCTTTTATCAGCAATGTCAAATCTCATCCCCATTTTCACCTTTCTTCTTGCTGTTGTTTTCAGAATGGAGAAGGTTGATTTGAAGAGAAGCAGTGGGAAAGCCAAATGT
GTGGGAACCATTTTAGCTGTTTCAGGAGCTTCCTTAATAACTCTGTACAAAGGGCCACTCTTAATCTTAATCACGTCTTCTTCTAAGTCCTCTGTGAAACAAGAAGATGA
TCATGATTTACTACTCTCTCAGAATTCAAACTGGTACCCTAACAAAAAAATGACGAATGTGTTCTTCTTCACATTATCAATGACAGTCCAAACTGCAGCCTTCACCGTCA
TCGTGGAAAAAAACCCAACTATTTGGCAACTCCGACCCGACATTGAAATGGTCACCATCATATTCTCGGTAAGAAAAACGATGACACTAACACACAAATCCAAAATTAAG
GGCCCAGTTTATGTTGCAATGTTCAAGCCCCTTGGCATGGTCGTCGCTATCCCCTTGGTTGTCATCTTTCTTCATGAACCGCTTTATCTTGGCAGTGTGATGGGTTCGAT
TGTGATTGGGTGTGGGTTTTATAGTGTGATATGGGGTCAGTTTAAACAACTTGACTCGCCCATCGCCTTACATTATACTTCTCACTCTCAATCGCCCTGCGCCTTTGAAT
CACCATCTGCCTCGCTTTTGCTTCACCGCCACTCG
mRNA sequenceShow/hide mRNA sequence
AGGAATTTTAGAAGGAAGAAAGGAAGAAGAAGAAGAAGAAGAAATTATGGAGAGTGGCATGACGTTGAGTGCGATGATAATGGTGGAAATTATGGATGTTATCTCAACAA
CACTAAGCAAAGCCGCCATGTCCAAAGGAATGAACAACTTGGTTTTCAATGCGGCTCCTCTATCTTTCTCTATGATCCTTGGCTTTTTTCTCCTAGGGCTTAACGGGAGT
GTGGGACAGATAATGGCATATACAGGCATAAAGTACAGTTCTCCAGCTCTTTTATCAGCAATGTCAAATCTCATCCCCATTTTCACCTTTCTTCTTGCTGTTGTTTTCAG
AATGGAGAAGGTTGATTTGAAGAGAAGCAGTGGGAAAGCCAAATGTGTGGGAACCATTTTAGCTGTTTCAGGAGCTTCCTTAATAACTCTGTACAAAGGGCCACTCTTAA
TCTTAATCACGTCTTCTTCTAAGTCCTCTGTGAAACAAGAAGATGATCATGATTTACTACTCTCTCAGAATTCAAACTGGTACCCTAACAAAAAAATGACGAATGTGTTC
TTCTTCACATTATCAATGACAGTCCAAACTGCAGCCTTCACCGTCATCGTGGAAAAAAACCCAACTATTTGGCAACTCCGACCCGACATTGAAATGGTCACCATCATATT
CTCGGTAAGAAAAACGATGACACTAACACACAAATCCAAAATTAAGGGCCCAGTTTATGTTGCAATGTTCAAGCCCCTTGGCATGGTCGTCGCTATCCCCTTGGTTGTCA
TCTTTCTTCATGAACCGCTTTATCTTGGCAGTGTGATGGGTTCGATTGTGATTGGGTGTGGGTTTTATAGTGTGATATGGGGTCAGTTTAAACAACTTGACTCGCCCATC
GCCTTACATTATACTTCTCACTCTCAATCGCCCTGCGCCTTTGAATCACCATCTGCCTCGCTTTTGCTTCACCGCCACTCG
Protein sequenceShow/hide protein sequence
MESGMTLSAMIMVEIMDVISTTLSKAAMSKGMNNLVFNAAPLSFSMILGFFLLGLNGSVGQIMAYTGIKYSSPALLSAMSNLIPIFTFLLAVVFRMEKVDLKRSSGKAKC
VGTILAVSGASLITLYKGPLLILITSSSKSSVKQEDDHDLLLSQNSNWYPNKKMTNVFFFTLSMTVQTAAFTVIVEKNPTIWQLRPDIEMVTIIFSVRKTMTLTHKSKIK
GPVYVAMFKPLGMVVAIPLVVIFLHEPLYLGSVMGSIVIGCGFYSVIWGQFKQLDSPIALHYTSHSQSPCAFESPSASLLLHRHS