; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lsi02G015450 (gene) of Bottle gourd (USVL1VR-Ls) v1 genome

Gene IDLsi02G015450
OrganismLagenaria siceraria USVL1VR-Ls (Bottle gourd (USVL1VR-Ls) v1)
DescriptionWAT1-related protein
Genome locationchr02:21306261..21314249
RNA-Seq ExpressionLsi02G015450
SyntenyLsi02G015450
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
GO:0022857 - transmembrane transporter activity (molecular function)
InterPro domainsIPR000620 - EamA domain
IPR030184 - WAT1-related protein


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008458741.1 PREDICTED: WAT1-related protein At3g28050-like isoform X1 [Cucumis melo]1.7e-9662.91Show/hide
Query:  MESGMTLSAMIMVEIMDVISTTLSKAAMSKGMNNLVFNVYSNALATFLLLPFLLLSRSRDRQAAPLSFSMILGFFLLGLNGSVGQIMAYTGIKYSSPALL
        ME  M  SAMIMVEIMDVI++TLSKAAMSKGMNNLVF VYSN+L+TF+ LPFLL S  RD+QAAPLS  MILGF LLGLNGSVGQI+AYTGIKYSSPALL
Subjt:  MESGMTLSAMIMVEIMDVISTTLSKAAMSKGMNNLVFNVYSNALATFLLLPFLLLSRSRDRQAAPLSFSMILGFFLLGLNGSVGQIMAYTGIKYSSPALL

Query:  SAMSNLIPIFTFLLAVVFRMEKVDLKRSSGKAKCVGTILAVSGASLITLYKGPLLI--MSSSKSF-VKQEDD----HSLLSQNSNW--------------
        SA+SNLIPIFTFLLA++FRMEKVDL+RSSGKA CVGTILAVSGASLITLYKGPLL+  +SSS SF +KQE+D    H L S NSNW              
Subjt:  SAMSNLIPIFTFLLAVVFRMEKVDLKRSSGKAKCVGTILAVSGASLITLYKGPLLI--MSSSKSF-VKQEDD----HSLLSQNSNW--------------

Query:  -------TWFVAKYPNKKMTNVFFFTLSVTLQTATFTVIVEKNPIVWQLRPDIEMVTIIVSS----TNIRFH---HKLRGD-YKERF------------L
               TWFV KYP KK+TN FFFTLS+ +QTA F ++VEKN   W+L+PDIEMVTI  S       I  H    + RG  Y   F            +
Subjt:  -------TWFVAKYPNKKMTNVFFFTLSVTLQTATFTVIVEKNPIVWQLRPDIEMVTIIVSS----TNIRFH---HKLRGD-YKERF------------L

Query:  DFL-------IVMGSIVIGCGFYSVIWGQFKQLDSPIALPYTSHSQSPCAFESPSASLLLHRHS
         FL        V+GSIVIGCGFY VIWGQ KQLD  + L  TSHSQS  AFESPSA LL  +HS
Subjt:  DFL-------IVMGSIVIGCGFYSVIWGQFKQLDSPIALPYTSHSQSPCAFESPSASLLLHRHS

XP_011655969.1 WAT1-related protein At3g28050 isoform X1 [Cucumis sativus]8.9e-9866.47Show/hide
Query:  MESGMTLSAMIMVEIMDVISTTLSKAAMSKGMNNLVFNVYSNALATFLLLPFLLLSRSRDRQAAPLSFSMILGFFLLGLNGSVGQIMAYTGIKYSSPALL
        ME  MT SAMIMVEIM VIS+TL KAAMSKGMNNLVF VYSNALATFLLLPFLLLS SRDRQAAPLSFSMI  FFLLGL GSVGQIMAYTGIKYSS  LL
Subjt:  MESGMTLSAMIMVEIMDVISTTLSKAAMSKGMNNLVFNVYSNALATFLLLPFLLLSRSRDRQAAPLSFSMILGFFLLGLNGSVGQIMAYTGIKYSSPALL

Query:  SAMSNLIPIFTFLLAVVFRMEKVDLKRSSGKAKCVGTILAVSGASLITLYKGPLLI-MSSSKSFVKQEDD--HSL-LSQNSNW-----------------
        SA+SNLIPIFTFLLA++FRMEKVDL+RSSGKAKCVGTILAV G SLITLYKGPLLI  SSS SFVK EDD  H L LS NSNW                 
Subjt:  SAMSNLIPIFTFLLAVVFRMEKVDLKRSSGKAKCVGTILAVSGASLITLYKGPLLI-MSSSKSFVKQEDD--HSL-LSQNSNW-----------------

Query:  ----TWFVAKYPNKKMTNVFFFTLSVTLQTATFTVIVEKNPIVWQLRPDIEMVTIIVSS-----TNIRFH---HKLRGD--------------------Y
            TWFV KYP+KKMTNVFFFTLSVT+QTA FT I+E+NPIVWQL+PDI MV+II S+      +I  H    + +G                     +
Subjt:  ----TWFVAKYPNKKMTNVFFFTLSVTLQTATFTVIVEKNPIVWQLRPDIEMVTIIVSS-----TNIRFH---HKLRGD--------------------Y

Query:  KERFLDFLIVMGSIVIGCGFYSVIWGQFKQLDSPIALPYTSHS
            L    VMGSIVIGCGFYSVIWGQ KQLD  + LP +S S
Subjt:  KERFLDFLIVMGSIVIGCGFYSVIWGQFKQLDSPIALPYTSHS

XP_011655971.1 WAT1-related protein At3g28050 isoform X2 [Cucumis sativus]9.8e-9766.86Show/hide
Query:  MESGMTLSAMIMVEIMDVISTTLSKAAMSKGMNNLVFNVYSNALATFLLLPFLLLSRSRDRQAAPLSFSMILGFFLLGLNGSVGQIMAYTGIKYSSPALL
        ME  MT SAMIMVEIM VIS+TL KAAMSKGMNNLVF VYSNALATFLLLPFLLLS SRDRQAAPLSFSMI  FFLLGL GSVGQIMAYTGIKYSS  LL
Subjt:  MESGMTLSAMIMVEIMDVISTTLSKAAMSKGMNNLVFNVYSNALATFLLLPFLLLSRSRDRQAAPLSFSMILGFFLLGLNGSVGQIMAYTGIKYSSPALL

Query:  SAMSNLIPIFTFLLAVVFRMEKVDLKRSSGKAKCVGTILAVSGASLITLYKGPLLI-MSSSKSFVKQEDD--HSL-LSQNSNW-----------------
        SA+SNLIPIFTFLLA++FRMEKVDL+RSSGKAKCVGTILAV G SLITLYKGPLLI  SSS SFVK EDD  H L LS NSNW                 
Subjt:  SAMSNLIPIFTFLLAVVFRMEKVDLKRSSGKAKCVGTILAVSGASLITLYKGPLLI-MSSSKSFVKQEDD--HSL-LSQNSNW-----------------

Query:  ----TWFVAKYPNKKMTNVFFFTLSVTLQTATFTVIVEKNPIVWQLRPDIEMVTIIVSSTNIRFH---HKLRGD--------------------YKERFL
            TWFV KYP+KKMTNVFFFTLSVT+QTA FT I+E+NPIVWQL+PDI M  I  S  +I  H    + +G                     +    L
Subjt:  ----TWFVAKYPNKKMTNVFFFTLSVTLQTATFTVIVEKNPIVWQLRPDIEMVTIIVSSTNIRFH---HKLRGD--------------------YKERFL

Query:  DFLIVMGSIVIGCGFYSVIWGQFKQLDSPIALPYTSHS
            VMGSIVIGCGFYSVIWGQ KQLD  + LP +S S
Subjt:  DFLIVMGSIVIGCGFYSVIWGQFKQLDSPIALPYTSHS

XP_031741104.1 WAT1-related protein At3g28050 isoform X2 [Cucumis sativus]1.1e-9562.96Show/hide
Query:  MESGMTLSAMIMVEIMDVISTTLSKAAMSKGMNNLVFNVYSNALATFLLLPFLLLSRSRDRQAAPLSFSMILGFFLLGLNGSVGQIMAYTGIKYSSPALL
        ME  M  SAMIMVEIMDVI++TLSKAAMSKGMNNLVF VYSN+L+TF+ LPFLL S  RD+Q APLSF MILGF LLGLNGSVGQ+MAYTGIKYSSP LL
Subjt:  MESGMTLSAMIMVEIMDVISTTLSKAAMSKGMNNLVFNVYSNALATFLLLPFLLLSRSRDRQAAPLSFSMILGFFLLGLNGSVGQIMAYTGIKYSSPALL

Query:  SAMSNLIPIFTFLLAVVFRMEKVDLKRSSGKAKCVGTILAVSGASLITLYKGPLLI--MSSSKSFVKQEDD----HSLLSQNSNW---------TWFVAK
        SA+SNLIPIFTFLLA++FRMEKVDL+RSSGKA CVGTILAVSGASLITL+KGPLL+  +SSS SF KQE+D    H LLS +S+W         TWFV K
Subjt:  SAMSNLIPIFTFLLAVVFRMEKVDLKRSSGKAKCVGTILAVSGASLITLYKGPLLI--MSSSKSFVKQEDD----HSLLSQNSNW---------TWFVAK

Query:  YPNKKMTNVFFFTLSVTLQTATFTVIVEKNPIVWQLRPDIEMVTI----IVSSTNIRFH----HKLRGDYKERF------------LDFL-------IVM
        YP KK+TN+FFFTLS+ +QTA F ++VEKN   W+L+PDIEMVTI    I     I  H     +    Y   F            + FL        V+
Subjt:  YPNKKMTNVFFFTLSVTLQTATFTVIVEKNPIVWQLRPDIEMVTI----IVSSTNIRFH----HKLRGDYKERF------------LDFL-------IVM

Query:  GSIVIGCGFYSVIWGQFKQLDSPIALPYTSHSQSPCAFESPSASLLLHRHS
        GSIVIGCGFY VIWGQ K+LD  + L   SHSQS    ESPSA LL H+HS
Subjt:  GSIVIGCGFYSVIWGQFKQLDSPIALPYTSHSQSPCAFESPSASLLLHRHS

XP_038889702.1 WAT1-related protein At3g28050-like isoform X1 [Benincasa hispida]1.4e-10366.2Show/hide
Query:  MESGMTLSAMIMVEIMDVISTTLSKAAMSKGMNNLVFNVYSNALATFLLLPFLLLSRSRDRQAAPLSFSMILGFFLLGLNGSVGQIMAYTGIKYSSPALL
        ME  MT +AMIMVEIMDVI++TLSKAAMSKGMN LVF VYSNALATFL LPFLLLSRSRD++A PLSF MILGF LLGLNGSVGQIMAYTGIKYSSP LL
Subjt:  MESGMTLSAMIMVEIMDVISTTLSKAAMSKGMNNLVFNVYSNALATFLLLPFLLLSRSRDRQAAPLSFSMILGFFLLGLNGSVGQIMAYTGIKYSSPALL

Query:  SAMSNLIPIFTFLLAVVFRMEKVDLKRSSGKAKCVGTILAVSGASLITLYKGPLLI-MSSSKSFVK-QEDDHSLLSQNSNW-------------------
        SA+SNLIPIFTFLLA +FRMEKVDL+RSSGKAKCVGTILAVSG SLITLYKGPLLI +SSSKSFVK Q+DDH LLS NSNW                   
Subjt:  SAMSNLIPIFTFLLAVVFRMEKVDLKRSSGKAKCVGTILAVSGASLITLYKGPLLI-MSSSKSFVK-QEDDHSLLSQNSNW-------------------

Query:  --TWFVAKYPNKKMTNVFFFTLSVTLQTATFTVIVEKNPIVWQLRPDIEMVTIIVSS----TNIRFH-----------------------HKLRGDYKER
          TWFV KYP KK+TN+FFF+LS+ +QTA FT IVE NP+VWQ+RPDIEMVTIIVS       I  H                         L   + + 
Subjt:  --TWFVAKYPNKKMTNVFFFTLSVTLQTATFTVIVEKNPIVWQLRPDIEMVTIIVSS----TNIRFH-----------------------HKLRGDYKER

Query:  FLDFLIVMGSIVIGCGFYSVIWGQFKQLDSPIALP-YTSHSQSPCAFESPSASLL
         L    VMGSIVI CGFYSVIWGQ KQ D  + LP  TSHSQS  A ESPS  LL
Subjt:  FLDFLIVMGSIVIGCGFYSVIWGQFKQLDSPIALP-YTSHSQSPCAFESPSASLL

TrEMBL top hitse value%identityAlignment
A0A0A0KRZ3 WAT1-related protein6.0e-9260.8Show/hide
Query:  MVEIMDVISTTLSKAAMSKGMNNLVFNVYSNALATFLLLPFLLLSRSRDRQAAPLSFSMILGFFLLGLNGSVGQIMAYTGIKYSSPALLSAMSNLIPIFT
        MVEIMDVI++TLSKAAMSKGMNNLVF VYSN+L+TF+ LPFLL S  RD+Q APLSF MILGF LLGLNGSVGQ+MAYTGIKYSSP LLSA+SNLIPIFT
Subjt:  MVEIMDVISTTLSKAAMSKGMNNLVFNVYSNALATFLLLPFLLLSRSRDRQAAPLSFSMILGFFLLGLNGSVGQIMAYTGIKYSSPALLSAMSNLIPIFT

Query:  FLLAVVFRMEKVDLKRSSGKAKCVGTILAVSGASLITLYKGPLLI--MSSSKSFVKQEDD----HSLLSQNSNW---------------------TWFVA
        FLLA++FRMEKVDL+RSSGKA CVGTILAVSGASLITL+KGPLL+  +SSS SF KQE+D    H LLS +S+W                     TWFV 
Subjt:  FLLAVVFRMEKVDLKRSSGKAKCVGTILAVSGASLITLYKGPLLI--MSSSKSFVKQEDD----HSLLSQNSNW---------------------TWFVA

Query:  KYPNKKMTNVFFFTLSVTLQTATFTVIVEKNPIVWQLRPDIEMVTI----IVSSTNIRFH----HKLRGDYKERF------------LDFL-------IV
        KYP KK+TN+FFFTLS+ +QTA F ++VEKN   W+L+PDIEMVTI    I     I  H     +    Y   F            + FL        V
Subjt:  KYPNKKMTNVFFFTLSVTLQTATFTVIVEKNPIVWQLRPDIEMVTI----IVSSTNIRFH----HKLRGDYKERF------------LDFL-------IV

Query:  MGSIVIGCGFYSVIWGQFKQLDSPIALPYTSHSQSPCAFESPSASLLLHRHS
        +GSIVIGCGFY VIWGQ K+LD  + L   SHSQS    ESPSA LL H+HS
Subjt:  MGSIVIGCGFYSVIWGQFKQLDSPIALPYTSHSQSPCAFESPSASLLLHRHS

A0A1S3C948 WAT1-related protein8.1e-9762.91Show/hide
Query:  MESGMTLSAMIMVEIMDVISTTLSKAAMSKGMNNLVFNVYSNALATFLLLPFLLLSRSRDRQAAPLSFSMILGFFLLGLNGSVGQIMAYTGIKYSSPALL
        ME  M  SAMIMVEIMDVI++TLSKAAMSKGMNNLVF VYSN+L+TF+ LPFLL S  RD+QAAPLS  MILGF LLGLNGSVGQI+AYTGIKYSSPALL
Subjt:  MESGMTLSAMIMVEIMDVISTTLSKAAMSKGMNNLVFNVYSNALATFLLLPFLLLSRSRDRQAAPLSFSMILGFFLLGLNGSVGQIMAYTGIKYSSPALL

Query:  SAMSNLIPIFTFLLAVVFRMEKVDLKRSSGKAKCVGTILAVSGASLITLYKGPLLI--MSSSKSF-VKQEDD----HSLLSQNSNW--------------
        SA+SNLIPIFTFLLA++FRMEKVDL+RSSGKA CVGTILAVSGASLITLYKGPLL+  +SSS SF +KQE+D    H L S NSNW              
Subjt:  SAMSNLIPIFTFLLAVVFRMEKVDLKRSSGKAKCVGTILAVSGASLITLYKGPLLI--MSSSKSF-VKQEDD----HSLLSQNSNW--------------

Query:  -------TWFVAKYPNKKMTNVFFFTLSVTLQTATFTVIVEKNPIVWQLRPDIEMVTIIVSS----TNIRFH---HKLRGD-YKERF------------L
               TWFV KYP KK+TN FFFTLS+ +QTA F ++VEKN   W+L+PDIEMVTI  S       I  H    + RG  Y   F            +
Subjt:  -------TWFVAKYPNKKMTNVFFFTLSVTLQTATFTVIVEKNPIVWQLRPDIEMVTIIVSS----TNIRFH---HKLRGD-YKERF------------L

Query:  DFL-------IVMGSIVIGCGFYSVIWGQFKQLDSPIALPYTSHSQSPCAFESPSASLLLHRHS
         FL        V+GSIVIGCGFY VIWGQ KQLD  + L  TSHSQS  AFESPSA LL  +HS
Subjt:  DFL-------IVMGSIVIGCGFYSVIWGQFKQLDSPIALPYTSHSQSPCAFESPSASLLLHRHS

A0A1S4E259 WAT1-related protein3.0e-8370.11Show/hide
Query:  MESGMTLSAMIMVEIMDVISTTLSKAAMSKGMNNLVFNVYSNALATFLLLPFLLLSRSRDRQAAPLSFSMILGFFLLGLNGSVGQIMAYTGIKYSSPALL
        ME  M  SAMIMVEIMDVI++TLSKAAMSKGMNNLVF VYSN+L+TF+ LPFLL S  RD+QAAPLS  MILGF LLGLNGSVGQI+AYTGIKYSSPALL
Subjt:  MESGMTLSAMIMVEIMDVISTTLSKAAMSKGMNNLVFNVYSNALATFLLLPFLLLSRSRDRQAAPLSFSMILGFFLLGLNGSVGQIMAYTGIKYSSPALL

Query:  SAMSNLIPIFTFLLAVVFRMEKVDLKRSSGKAKCVGTILAVSGASLITLYKGPLLI--MSSSKSF-VKQEDD----HSLLSQNSNW--------------
        SA+SNLIPIFTFLLA++FRMEKVDL+RSSGKA CVGTILAVSGASLITLYKGPLL+  +SSS SF +KQE+D    H L S NSNW              
Subjt:  SAMSNLIPIFTFLLAVVFRMEKVDLKRSSGKAKCVGTILAVSGASLITLYKGPLLI--MSSSKSF-VKQEDD----HSLLSQNSNW--------------

Query:  -------TWFVAKYPNKKMTNVFFFTLSVTLQTATFTVIVEKNPIVWQLRPDIEMVTIIVS
               TWFV KYP KK+TN FFFTLS+ +QTA F ++VEKN   W+L+PDIEMVTI  S
Subjt:  -------TWFVAKYPNKKMTNVFFFTLSVTLQTATFTVIVEKNPIVWQLRPDIEMVTIIVS

A0A6J1FGV1 WAT1-related protein2.2e-8656.7Show/hide
Query:  MESGMTLSAMIMVEIMDVISTTLSKAAMSKGMNNLVFNVYSNALATFLLLPFLLLSRSRDRQAAPLSFSMILGFFLLGLNGSVGQIMAYTGIKYSSPALL
        ME+ M  +AMIMVE  DVI +TL K AM+KGMNNLVF VYSNALATFLLLPFLL S SR+   APLSFSMIL FFLLGLNGSVG+++A TGI YSSP LL
Subjt:  MESGMTLSAMIMVEIMDVISTTLSKAAMSKGMNNLVFNVYSNALATFLLLPFLLLSRSRDRQAAPLSFSMILGFFLLGLNGSVGQIMAYTGIKYSSPALL

Query:  SAMSNLIPIFTFLLAVVFRMEKVDLKRSSGKAKCVGTILAVSGASLITLYKGPLLIMSSSKSFVKQEDDHSLLSQNSNW---------------------
        SAM+NLIPIFT  LAV+FRME++D KRSSGKAKC+GTI+AVSGA LITLYKGP+LIMSSS+S V QE     LSQ  NW                     
Subjt:  SAMSNLIPIFTFLLAVVFRMEKVDLKRSSGKAKCVGTILAVSGASLITLYKGPLLIMSSSKSFVKQEDDHSLLSQNSNW---------------------

Query:  TWFVAKYPNKKMTNVFFFTLSVTLQTATFTVIVEKNPIVWQLRPDIEMVTIIVSS----------------------------TNIRFHHKLRGDYKERF
        TWFV+ YPNKK+T+VFFFT  VT+QTA F V ++ NP VWQ+RPDIEMVTI+ S+                              +     L   +    
Subjt:  TWFVAKYPNKKMTNVFFFTLSVTLQTATFTVIVEKNPIVWQLRPDIEMVTIIVSS----------------------------TNIRFHHKLRGDYKERF

Query:  LDFLIVMGSIVIGCGFYSVIWGQFKQLDSPIALPYTSHSQSPCAFESPSASLLLHRHS
        L    V+GS+VIGCGFYSVIWGQ KQL+  + LP       P   ESPSASLL H  S
Subjt:  LDFLIVMGSIVIGCGFYSVIWGQFKQLDSPIALPYTSHSQSPCAFESPSASLLLHRHS

A0A6J1FNQ6 WAT1-related protein2.2e-8656.7Show/hide
Query:  MESGMTLSAMIMVEIMDVISTTLSKAAMSKGMNNLVFNVYSNALATFLLLPFLLLSRSRDRQAAPLSFSMILGFFLLGLNGSVGQIMAYTGIKYSSPALL
        ME+ M  +AMIMVE  DVI +TL K AM+KGMNNLVF VYSNALATFLLLPFLL S SR+   APLSFSMIL FFLLGLNGSVG+++A TGI YSSP LL
Subjt:  MESGMTLSAMIMVEIMDVISTTLSKAAMSKGMNNLVFNVYSNALATFLLLPFLLLSRSRDRQAAPLSFSMILGFFLLGLNGSVGQIMAYTGIKYSSPALL

Query:  SAMSNLIPIFTFLLAVVFRMEKVDLKRSSGKAKCVGTILAVSGASLITLYKGPLLIMSSSKSFVKQEDDHSLLSQNSNW---------------------
        SAM+NLIPIFT  LAV+FRME++D KRSSGKAKC+GTI+AVSGA LITLYKGP+LIMSSS+S V QE     LSQ  NW                     
Subjt:  SAMSNLIPIFTFLLAVVFRMEKVDLKRSSGKAKCVGTILAVSGASLITLYKGPLLIMSSSKSFVKQEDDHSLLSQNSNW---------------------

Query:  TWFVAKYPNKKMTNVFFFTLSVTLQTATFTVIVEKNPIVWQLRPDIEMVTIIVSS----------------------------TNIRFHHKLRGDYKERF
        TWFV+ YPNKK+T+VFFFT  VT+QTA F V ++ NP VWQ+RPDIEMVTI+ S+                              +     L   +    
Subjt:  TWFVAKYPNKKMTNVFFFTLSVTLQTATFTVIVEKNPIVWQLRPDIEMVTIIVSS----------------------------TNIRFHHKLRGDYKERF

Query:  LDFLIVMGSIVIGCGFYSVIWGQFKQLDSPIALPYTSHSQSPCAFESPSASLLLHRHS
        L    V+GS+VIGCGFYSVIWGQ KQL+  + LP       P   ESPSASLL H  S
Subjt:  LDFLIVMGSIVIGCGFYSVIWGQFKQLDSPIALPYTSHSQSPCAFESPSASLLLHRHS

SwissProt top hitse value%identityAlignment
Q8VYZ7 WAT1-related protein At3g280703.6e-3336.16Show/hide
Query:  LSAMIMVEIMDVISTTLSKAAMSKGMNNLVFNVYSNALATFLLLPFLLLSRSRDRQAAPLSFSMILGFFLLGLNGSVGQIMAYTGIKYSSPALLSAMSNL
        L+AM++VE   V  +TL K A SKG+N   F  YS  LA+ LLLP L  + +R     PLS S++    LLG  GS+  I  Y GI+YSSP L SA++N+
Subjt:  LSAMIMVEIMDVISTTLSKAAMSKGMNNLVFNVYSNALATFLLLPFLLLSRSRDRQAAPLSFSMILGFFLLGLNGSVGQIMAYTGIKYSSPALLSAMSNL

Query:  IPIFTFLLAVVFRMEKVDLKRSSGKAKCVGTILAVSGASLITLYKGPLLIMSSSKSFVK-QEDDHSLLSQNSNW---------------------TWFVA
         P  TF+LA++FRMEKV  K  S  AK +GTIL++ GA ++  Y GP + ++SS  +V  ++    L S NS+W                        ++
Subjt:  IPIFTFLLAVVFRMEKVDLKRSSGKAKCVGTILAVSGASLITLYKGPLLIMSSSKSFVK-QEDDHSLLSQNSNW---------------------TWFVA

Query:  KYPNKKMTNVFFFTLSVTLQTATFTVIVEK-NPIVWQLRPDIEMVTI----IVSSTNIRFH-----HK---LRGDYKERFLDFLIVMG------SIVIGC
         YP     + F +T+ V++ T+T  ++VEK NP VW +  DI ++TI    IV+S     H     HK       +K   +   +VMG      S+ +GC
Subjt:  KYPNKKMTNVFFFTLSVTLQTATFTVIVEK-NPIVWQLRPDIEMVTI----IVSSTNIRFH-----HK---LRGDYKERFLDFLIVMG------SIVIGC

Query:  ---------GFYSVIWGQ
                 GFY+V+WG+
Subjt:  ---------GFYSVIWGQ

Q945L4 WAT1-related protein At5g402103.2e-3434.62Show/hide
Query:  GMTLSAMIMVEIMDVISTTLSKAAMSKGMNNLVFNVYSNALATFLLLPFLLLSRSRDRQAAPLSFSMILGFFLLGLNGSVGQIMAYTGIKYSSPALLSAM
        G  L+AM++ E  +V   TL KAA SKG++  V  VYS    + LLLP    S  R R   PL+FS++    +LGL  S  QI+ Y GIKYSSP L SAM
Subjt:  GMTLSAMIMVEIMDVISTTLSKAAMSKGMNNLVFNVYSNALATFLLLPFLLLSRSRDRQAAPLSFSMILGFFLLGLNGSVGQIMAYTGIKYSSPALLSAM

Query:  SNLIPIFTFLLAVVFRMEKVDLKRSSGKAKCVGTILAVSGASLITLYKGPLLIMSSSK----------SFVKQEDDHSLLSQNSNWTWFVAKYPNKKMTN
        SN+ P FTF+LAVVFRME + L + S  AK +GTIL++ GA ++TLY GP+L+ S S            ++     + +++        + +YP+  +  
Subjt:  SNLIPIFTFLLAVVFRMEKVDLKRSSGKAKCVGTILAVSGASLITLYKGPLLIMSSSK----------SFVKQEDDHSLLSQNSNWTWFVAKYPNKKMTN

Query:  VFFFTLSVTLQTATFTVIVEK-NPIVWQLRPDIEMVTII---VSSTNIRFHHKLRGDYKE-------RFLDFLI-----------------VMGSIVIGC
        +    + + +  A  +++ EK NP  W +R DI ++T++   + ++     H     +K        + L  LI                 VMG I+I  
Subjt:  VFFFTLSVTLQTATFTVIVEK-NPIVWQLRPDIEMVTII---VSSTNIRFHHKLRGDYKE-------RFLDFLI-----------------VMGSIVIGC

Query:  GFYSVIWGQFKQ
        GFY V+WG+ K+
Subjt:  GFYSVIWGQFKQ

Q94JU2 WAT1-related protein At3g280501.3e-3835.8Show/hide
Query:  MTLSAMIMVEIMDVISTTLSKAAMSKGMNNLVFNVYSNALATFLLLPFLLLSRSRDRQAAPLSFSMILGFFLLGLNGSVGQIMAYTGIKYSSPALLSAMS
        + ++A++++E  +V   TL KAA  KGM+  VF VYS  LA  LLLP L  S  R R   P++FS++    LLG+ G    IM YTGI YSSP L SA+S
Subjt:  MTLSAMIMVEIMDVISTTLSKAAMSKGMNNLVFNVYSNALATFLLLPFLLLSRSRDRQAAPLSFSMILGFFLLGLNGSVGQIMAYTGIKYSSPALLSAMS

Query:  NLIPIFTFLLAVVFRMEKVDLKRSSGKAKCVGTILAVSGASLITLYKGPLLIMSSSKSFVKQEDDHSLLSQNSNW---------------TWFVA-----
        NL P FTFLLAVVFRME V  KR+S  AK +GT++++ GA ++TLY GP++I  S  S   +       S N NW                W++      
Subjt:  NLIPIFTFLLAVVFRMEKVDLKRSSGKAKCVGTILAVSGASLITLYKGPLLIMSSSKSFVKQEDDHSLLSQNSNW---------------TWFVA-----

Query:  -KYPNKKMTNVFFFTLSVTLQTATFTVIVEKNPI-VWQLRPDIEMVTII--------VSSTNIRFHHKLRGD-----YKERFLDFLIVMGSI--------
         +YP  + T V F+++ V+  TA  T+  E N +  W+++P+I +V+I+        +++T   +  +++G      +K   +   + MG I        
Subjt:  -KYPNKKMTNVFFFTLSVTLQTATFTVIVEKNPI-VWQLRPDIEMVTII--------VSSTNIRFHHKLRGD-----YKERFLDFLIVMGSI--------

Query:  -------VIGCGFYSVIWGQFKQL
               VI  GFY+V+WG+ K++
Subjt:  -------VIGCGFYSVIWGQFKQL

Q9FL08 WAT1-related protein At5g402407.2e-3436.42Show/hide
Query:  SAMIMVEIMDVISTTLSKAAMSKGMNNLVFNVYSNALATFLLLPF-LLLSRSRDRQAA--PLSFSMILGFFLLGLNGSVGQIMAYTGIKYSSPALLSAMS
        +AM  VE   V S TL KAA  +G++  VF  YS  ++T LLLP  ++  RSR   AA  PL F +    FLLGL G + QI    GI YSSP L SA+S
Subjt:  SAMIMVEIMDVISTTLSKAAMSKGMNNLVFNVYSNALATFLLLPF-LLLSRSRDRQAA--PLSFSMILGFFLLGLNGSVGQIMAYTGIKYSSPALLSAMS

Query:  NLIPIFTFLLAVVFRMEKVDLKRSSGKAKCVGTILAVSGASLITLYKGPLLIMSSSKSFVKQED--DHSLLSQNSNW---------------TWFVAK--
        NL P FTF LAV+FRME+V L+ S+ +AK +G IL++SGA ++ LYKGP ++ S+S + V         L S  S+W                W++ +  
Subjt:  NLIPIFTFLLAVVFRMEKVDLKRSSGKAKCVGTILAVSGASLITLYKGPLLIMSSSKSFVKQED--DHSLLSQNSNW---------------TWFVAK--

Query:  ----YPNKKMTNVFFFTLSVTLQTATFTVIVEKNPIVWQLRPDIEMVTIIVSSTNIRFHH--------KLRGD-YKERFLDFLI----------------
            YP +++T VFF+ L  TL +    +  E N   W L+PDI +  II S   +             L+G  Y   F    I                
Subjt:  ----YPNKKMTNVFFFTLSVTLQTATFTVIVEKNPIVWQLRPDIEMVTIIVSSTNIRFHH--------KLRGD-YKERFLDFLI----------------

Query:  ---VMGSIVIGCGFYSVIWGQFKQ
           V+GS+++  GFY+VIWG+ ++
Subjt:  ---VMGSIVIGCGFYSVIWGQFKQ

Q9LRS5 WAT1-related protein At3g281007.2e-3436.48Show/hide
Query:  LSAMIMVEIMDVISTTLSKAAMSKGMNNLVFNVYSNALATFLLLPFLLLSRSRDRQAAPLSFSMILGFFLLGLNGSVGQIMAYTGIKYSSPALLSAMSNL
        L+AM+  E   V  +TL K A SKG+N   F  YS  LA+ LLLP L  +  R R   PLS S++    LLGL GS+  I  Y GI+YSSP L SA+SN+
Subjt:  LSAMIMVEIMDVISTTLSKAAMSKGMNNLVFNVYSNALATFLLLPFLLLSRSRDRQAAPLSFSMILGFFLLGLNGSVGQIMAYTGIKYSSPALLSAMSNL

Query:  IPIFTFLLAVVFRMEKVDLKRSSGKAKCVGTILAVSGASLITLYKGPLLIMSSSKSFVK-QEDDHSLLSQNSNW---------------------TWFVA
         P  TF+LA++FRMEKV  K  S  AK +GTIL++ GA ++ LY GP + ++SS  ++  ++    L S NS+W                        ++
Subjt:  IPIFTFLLAVVFRMEKVDLKRSSGKAKCVGTILAVSGASLITLYKGPLLIMSSSKSFVK-QEDDHSLLSQNSNW---------------------TWFVA

Query:  KYPNKKMTNVFFFTLSVTLQTATFTVIVEK-NPIVWQLRPDIEMVTI----IVSSTNIRFH-----HK---LRGDYKERFLDFLIVMG------SIVIGC
         YP    T  F + +SV++ T+   ++VEK NP VW +R DI ++TI    I++S     H     HK       +K   +   +VM       S+ +GC
Subjt:  KYPNKKMTNVFFFTLSVTLQTATFTVIVEK-NPIVWQLRPDIEMVTI----IVSSTNIRFH-----HK---LRGDYKERFLDFLIVMG------SIVIGC

Query:  ---------GFYSVIWGQ
                 GFY+V+WG+
Subjt:  ---------GFYSVIWGQ

Arabidopsis top hitse value%identityAlignment
AT3G28050.1 nodulin MtN21 /EamA-like transporter family protein9.0e-4035.8Show/hide
Query:  MTLSAMIMVEIMDVISTTLSKAAMSKGMNNLVFNVYSNALATFLLLPFLLLSRSRDRQAAPLSFSMILGFFLLGLNGSVGQIMAYTGIKYSSPALLSAMS
        + ++A++++E  +V   TL KAA  KGM+  VF VYS  LA  LLLP L  S  R R   P++FS++    LLG+ G    IM YTGI YSSP L SA+S
Subjt:  MTLSAMIMVEIMDVISTTLSKAAMSKGMNNLVFNVYSNALATFLLLPFLLLSRSRDRQAAPLSFSMILGFFLLGLNGSVGQIMAYTGIKYSSPALLSAMS

Query:  NLIPIFTFLLAVVFRMEKVDLKRSSGKAKCVGTILAVSGASLITLYKGPLLIMSSSKSFVKQEDDHSLLSQNSNW---------------TWFVA-----
        NL P FTFLLAVVFRME V  KR+S  AK +GT++++ GA ++TLY GP++I  S  S   +       S N NW                W++      
Subjt:  NLIPIFTFLLAVVFRMEKVDLKRSSGKAKCVGTILAVSGASLITLYKGPLLIMSSSKSFVKQEDDHSLLSQNSNW---------------TWFVA-----

Query:  -KYPNKKMTNVFFFTLSVTLQTATFTVIVEKNPI-VWQLRPDIEMVTII--------VSSTNIRFHHKLRGD-----YKERFLDFLIVMGSI--------
         +YP  + T V F+++ V+  TA  T+  E N +  W+++P+I +V+I+        +++T   +  +++G      +K   +   + MG I        
Subjt:  -KYPNKKMTNVFFFTLSVTLQTATFTVIVEKNPI-VWQLRPDIEMVTII--------VSSTNIRFHHKLRGD-----YKERFLDFLIVMGSI--------

Query:  -------VIGCGFYSVIWGQFKQL
               VI  GFY+V+WG+ K++
Subjt:  -------VIGCGFYSVIWGQFKQL

AT3G28100.1 nodulin MtN21 /EamA-like transporter family protein5.1e-3536.48Show/hide
Query:  LSAMIMVEIMDVISTTLSKAAMSKGMNNLVFNVYSNALATFLLLPFLLLSRSRDRQAAPLSFSMILGFFLLGLNGSVGQIMAYTGIKYSSPALLSAMSNL
        L+AM+  E   V  +TL K A SKG+N   F  YS  LA+ LLLP L  +  R R   PLS S++    LLGL GS+  I  Y GI+YSSP L SA+SN+
Subjt:  LSAMIMVEIMDVISTTLSKAAMSKGMNNLVFNVYSNALATFLLLPFLLLSRSRDRQAAPLSFSMILGFFLLGLNGSVGQIMAYTGIKYSSPALLSAMSNL

Query:  IPIFTFLLAVVFRMEKVDLKRSSGKAKCVGTILAVSGASLITLYKGPLLIMSSSKSFVK-QEDDHSLLSQNSNW---------------------TWFVA
         P  TF+LA++FRMEKV  K  S  AK +GTIL++ GA ++ LY GP + ++SS  ++  ++    L S NS+W                        ++
Subjt:  IPIFTFLLAVVFRMEKVDLKRSSGKAKCVGTILAVSGASLITLYKGPLLIMSSSKSFVK-QEDDHSLLSQNSNW---------------------TWFVA

Query:  KYPNKKMTNVFFFTLSVTLQTATFTVIVEK-NPIVWQLRPDIEMVTI----IVSSTNIRFH-----HK---LRGDYKERFLDFLIVMG------SIVIGC
         YP    T  F + +SV++ T+   ++VEK NP VW +R DI ++TI    I++S     H     HK       +K   +   +VM       S+ +GC
Subjt:  KYPNKKMTNVFFFTLSVTLQTATFTVIVEK-NPIVWQLRPDIEMVTI----IVSSTNIRFH-----HK---LRGDYKERFLDFLIVMG------SIVIGC

Query:  ---------GFYSVIWGQ
                 GFY+V+WG+
Subjt:  ---------GFYSVIWGQ

AT5G40210.1 nodulin MtN21 /EamA-like transporter family protein2.3e-3534.62Show/hide
Query:  GMTLSAMIMVEIMDVISTTLSKAAMSKGMNNLVFNVYSNALATFLLLPFLLLSRSRDRQAAPLSFSMILGFFLLGLNGSVGQIMAYTGIKYSSPALLSAM
        G  L+AM++ E  +V   TL KAA SKG++  V  VYS    + LLLP    S  R R   PL+FS++    +LGL  S  QI+ Y GIKYSSP L SAM
Subjt:  GMTLSAMIMVEIMDVISTTLSKAAMSKGMNNLVFNVYSNALATFLLLPFLLLSRSRDRQAAPLSFSMILGFFLLGLNGSVGQIMAYTGIKYSSPALLSAM

Query:  SNLIPIFTFLLAVVFRMEKVDLKRSSGKAKCVGTILAVSGASLITLYKGPLLIMSSSK----------SFVKQEDDHSLLSQNSNWTWFVAKYPNKKMTN
        SN+ P FTF+LAVVFRME + L + S  AK +GTIL++ GA ++TLY GP+L+ S S            ++     + +++        + +YP+  +  
Subjt:  SNLIPIFTFLLAVVFRMEKVDLKRSSGKAKCVGTILAVSGASLITLYKGPLLIMSSSK----------SFVKQEDDHSLLSQNSNWTWFVAKYPNKKMTN

Query:  VFFFTLSVTLQTATFTVIVEK-NPIVWQLRPDIEMVTII---VSSTNIRFHHKLRGDYKE-------RFLDFLI-----------------VMGSIVIGC
        +    + + +  A  +++ EK NP  W +R DI ++T++   + ++     H     +K        + L  LI                 VMG I+I  
Subjt:  VFFFTLSVTLQTATFTVIVEK-NPIVWQLRPDIEMVTII---VSSTNIRFHHKLRGDYKE-------RFLDFLI-----------------VMGSIVIGC

Query:  GFYSVIWGQFKQ
        GFY V+WG+ K+
Subjt:  GFYSVIWGQFKQ

AT5G40240.1 nodulin MtN21 /EamA-like transporter family protein5.1e-3536.42Show/hide
Query:  SAMIMVEIMDVISTTLSKAAMSKGMNNLVFNVYSNALATFLLLPF-LLLSRSRDRQAA--PLSFSMILGFFLLGLNGSVGQIMAYTGIKYSSPALLSAMS
        +AM  VE   V S TL KAA  +G++  VF  YS  ++T LLLP  ++  RSR   AA  PL F +    FLLGL G + QI    GI YSSP L SA+S
Subjt:  SAMIMVEIMDVISTTLSKAAMSKGMNNLVFNVYSNALATFLLLPF-LLLSRSRDRQAA--PLSFSMILGFFLLGLNGSVGQIMAYTGIKYSSPALLSAMS

Query:  NLIPIFTFLLAVVFRMEKVDLKRSSGKAKCVGTILAVSGASLITLYKGPLLIMSSSKSFVKQED--DHSLLSQNSNW---------------TWFVAK--
        NL P FTF LAV+FRME+V L+ S+ +AK +G IL++SGA ++ LYKGP ++ S+S + V         L S  S+W                W++ +  
Subjt:  NLIPIFTFLLAVVFRMEKVDLKRSSGKAKCVGTILAVSGASLITLYKGPLLIMSSSKSFVKQED--DHSLLSQNSNW---------------TWFVAK--

Query:  ----YPNKKMTNVFFFTLSVTLQTATFTVIVEKNPIVWQLRPDIEMVTIIVSSTNIRFHH--------KLRGD-YKERFLDFLI----------------
            YP +++T VFF+ L  TL +    +  E N   W L+PDI +  II S   +             L+G  Y   F    I                
Subjt:  ----YPNKKMTNVFFFTLSVTLQTATFTVIVEKNPIVWQLRPDIEMVTIIVSSTNIRFHH--------KLRGD-YKERFLDFLI----------------

Query:  ---VMGSIVIGCGFYSVIWGQFKQ
           V+GS+++  GFY+VIWG+ ++
Subjt:  ---VMGSIVIGCGFYSVIWGQFKQ

AT5G40240.2 nodulin MtN21 /EamA-like transporter family protein5.1e-3536.42Show/hide
Query:  SAMIMVEIMDVISTTLSKAAMSKGMNNLVFNVYSNALATFLLLPF-LLLSRSRDRQAA--PLSFSMILGFFLLGLNGSVGQIMAYTGIKYSSPALLSAMS
        +AM  VE   V S TL KAA  +G++  VF  YS  ++T LLLP  ++  RSR   AA  PL F +    FLLGL G + QI    GI YSSP L SA+S
Subjt:  SAMIMVEIMDVISTTLSKAAMSKGMNNLVFNVYSNALATFLLLPF-LLLSRSRDRQAA--PLSFSMILGFFLLGLNGSVGQIMAYTGIKYSSPALLSAMS

Query:  NLIPIFTFLLAVVFRMEKVDLKRSSGKAKCVGTILAVSGASLITLYKGPLLIMSSSKSFVKQED--DHSLLSQNSNW---------------TWFVAK--
        NL P FTF LAV+FRME+V L+ S+ +AK +G IL++SGA ++ LYKGP ++ S+S + V         L S  S+W                W++ +  
Subjt:  NLIPIFTFLLAVVFRMEKVDLKRSSGKAKCVGTILAVSGASLITLYKGPLLIMSSSKSFVKQED--DHSLLSQNSNW---------------TWFVAK--

Query:  ----YPNKKMTNVFFFTLSVTLQTATFTVIVEKNPIVWQLRPDIEMVTIIVSSTNIRFHH--------KLRGD-YKERFLDFLI----------------
            YP +++T VFF+ L  TL +    +  E N   W L+PDI +  II S   +             L+G  Y   F    I                
Subjt:  ----YPNKKMTNVFFFTLSVTLQTATFTVIVEKNPIVWQLRPDIEMVTIIVSSTNIRFHH--------KLRGD-YKERFLDFLI----------------

Query:  ---VMGSIVIGCGFYSVIWGQFKQ
           V+GS+++  GFY+VIWG+ ++
Subjt:  ---VMGSIVIGCGFYSVIWGQFKQ


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAGAGTGGCATGACGTTGAGTGCGATGATAATGGTGGAAATTATGGATGTTATCTCAACAACACTAAGCAAAGCCGCCATGTCCAAAGGAATGAACAACTTGGTTTT
CAATGTTTATTCTAACGCTCTTGCCACTTTCCTTCTTCTTCCCTTTCTTCTCCTTTCTCGATCCAGAGATAGACAGGCGGCTCCTCTATCTTTCTCTATGATCCTTGGCT
TTTTTCTCCTAGGGCTTAACGGGAGTGTGGGACAGATAATGGCATATACAGGCATAAAGTACAGTTCTCCAGCTCTTTTATCAGCAATGTCAAATCTCATCCCCATTTTC
ACCTTTCTTCTTGCTGTTGTTTTCAGAATGGAGAAGGTTGATTTGAAGAGAAGCAGTGGGAAAGCCAAATGTGTGGGAACCATTTTAGCTGTTTCAGGAGCTTCCTTAAT
AACTCTGTACAAAGGGCCACTCTTAATCATGTCTTCTTCCAAGTCCTTTGTGAAACAAGAAGATGATCATTCACTACTCTCTCAGAATTCAAATTGGACATGGTTTGTTG
CGAAGTACCCGAACAAAAAGATGACAAACGTGTTCTTCTTCACATTATCTGTGACGCTCCAAACTGCAACCTTCACCGTCATCGTTGAAAAAAACCCAATTGTTTGGCAA
CTCCGACCCGATATCGAAATGGTCACCATCATAGTCTCGTCGACAAACATAAGGTTTCATCACAAATTAAGAGGAGATTACAAAGAGAGATTTCTTGATTTTTTAATTGT
GATGGGTTCGATTGTGATTGGGTGTGGGTTTTACAGTGTGATATGGGGTCAGTTTAAACAACTTGACTCGCCCATCGCCTTACCTTATACTTCTCACTCTCAATCGCCCT
GCGCCTTTGAATCACCATCTGCCTCGCTTTTGCTTCACCGCCACTCTTAA
mRNA sequenceShow/hide mRNA sequence
TTTATGATGGAATGAGCTCATCTCAGAGGAATTTTAGAAGGAAGAAAGGAAGAAGAAGAAGAAGAAGAAATTATGGAGAGTGGCATGACGTTGAGTGCGATGATAATGGT
GGAAATTATGGATGTTATCTCAACAACACTAAGCAAAGCCGCCATGTCCAAAGGAATGAACAACTTGGTTTTCAATGTTTATTCTAACGCTCTTGCCACTTTCCTTCTTC
TTCCCTTTCTTCTCCTTTCTCGATCCAGAGATAGACAGGCGGCTCCTCTATCTTTCTCTATGATCCTTGGCTTTTTTCTCCTAGGGCTTAACGGGAGTGTGGGACAGATA
ATGGCATATACAGGCATAAAGTACAGTTCTCCAGCTCTTTTATCAGCAATGTCAAATCTCATCCCCATTTTCACCTTTCTTCTTGCTGTTGTTTTCAGAATGGAGAAGGT
TGATTTGAAGAGAAGCAGTGGGAAAGCCAAATGTGTGGGAACCATTTTAGCTGTTTCAGGAGCTTCCTTAATAACTCTGTACAAAGGGCCACTCTTAATCATGTCTTCTT
CCAAGTCCTTTGTGAAACAAGAAGATGATCATTCACTACTCTCTCAGAATTCAAATTGGACATGGTTTGTTGCGAAGTACCCGAACAAAAAGATGACAAACGTGTTCTTC
TTCACATTATCTGTGACGCTCCAAACTGCAACCTTCACCGTCATCGTTGAAAAAAACCCAATTGTTTGGCAACTCCGACCCGATATCGAAATGGTCACCATCATAGTCTC
GTCGACAAACATAAGGTTTCATCACAAATTAAGAGGAGATTACAAAGAGAGATTTCTTGATTTTTTAATTGTGATGGGTTCGATTGTGATTGGGTGTGGGTTTTACAGTG
TGATATGGGGTCAGTTTAAACAACTTGACTCGCCCATCGCCTTACCTTATACTTCTCACTCTCAATCGCCCTGCGCCTTTGAATCACCATCTGCCTCGCTTTTGCTTCAC
CGCCACTCTTAAAGCACCATCCATGGCCATTTGGATGTGGAACCAACTTTGAGGTTTGTGCGCAGAGATCAAGATGGGGGGAGTGGTAGATACTCGATCGAACCCATATA
TAATTGTTATTTTATTTTATTTTTTCTTTTATAAAATGTGTTGTAGCTAAAGTTGAAAATGTTGATGTGGATAATAACATCGAATATCTTAATTTTATAAAAATGTCAAA
TGAAATGTTGATATTGATAGATATTTTTTTAAAAATTATAAAAACAAAAAATAAAGAAAAAATATTATAAATAGGAAAAAATATCAGACTATTTTAAAATATAAAGAAAT
TTCACTGTCTATTGCGGGGCCTATCGCTCAAGTGATAGTGAAATTTTTCTATATTTATAAATAGTATGACTCATTTTTCTATATTTAAAAACAACCCAAAAATAAATTTA
AACAAGTTA
Protein sequenceShow/hide protein sequence
MESGMTLSAMIMVEIMDVISTTLSKAAMSKGMNNLVFNVYSNALATFLLLPFLLLSRSRDRQAAPLSFSMILGFFLLGLNGSVGQIMAYTGIKYSSPALLSAMSNLIPIF
TFLLAVVFRMEKVDLKRSSGKAKCVGTILAVSGASLITLYKGPLLIMSSSKSFVKQEDDHSLLSQNSNWTWFVAKYPNKKMTNVFFFTLSVTLQTATFTVIVEKNPIVWQ
LRPDIEMVTIIVSSTNIRFHHKLRGDYKERFLDFLIVMGSIVIGCGFYSVIWGQFKQLDSPIALPYTSHSQSPCAFESPSASLLLHRHS