CuGenDBv2

Sgr026380 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene ID: Sgr026380
Organism: Siraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Description: WAT1-related protein
Genome location: tig00153031:4749851..4757319
RNA-Seq Expression: Sgr026380
Synteny: Sgr026380
Gene Ontology terms:
  GO:0055085 - transmembrane transport (biological process)
  GO:0016021 - integral component of membrane (cellular component)
  GO:0022857 - transmembrane transporter activity (molecular function)
InterPro domains:
  IPR000620 - EamA domain
  IPR030184 - WAT1-related protein
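
The genome location field packs the contig name and both coordinates into one string. A minimal sketch of splitting it and computing the locus span follows; the helper name `parse_location` is hypothetical, and the span calculation assumes the coordinates are 1-based and inclusive, which this record does not state.

```python
import re


def parse_location(location: str):
    """Split 'contig:start..end' into (contig, start, end).

    Hypothetical helper for illustration; not part of any database API.
    """
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location string: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


contig, start, end = parse_location("tig00153031:4749851..4757319")

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end - start + 1
print(contig, start, end, span)
```

Under that assumption the locus covers 7469 bp on contig tig00153031.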


Homology
GenBank top hits (columns: e-value, % identity, alignment)
XP_022132567.1 WAT1-related protein At5g64700-like [Momordica charantia]  e-value: 3.2e-159  identity: 84.49%
Query:  FFLLVSS-----IWFVLQILLAVMSLLSKAAFASGMNSFVFVFYRQAAGAVFFLPLMM----KESRSLSLMDFLKIFMISLIGMTIGFNAYGVAVDYTSA
        FFL + S     +  + QI LA MSLLSKAAFASGMNSFVFVFYRQAAGAVFFLP+MM    KE R LSLMDFLKIFMISLIGMTIGFNAYGVAVDYTSA
Subjt:  FFLLVSS-----IWFVLQILLAVMSLLSKAAFASGMNSFVFVFYRQAAGAVFFLPLMM----KESRSLSLMDFLKIFMISLIGMTIGFNAYGVAVDYTSA

Query:  SLGAAAFNCLPVTTFLFALILRMEKVNLRTVAGMAKTVGILICIGGVATLAFYKGPYLKPLINHHLFEYPKSQAHQAHASSSKTWIIGCFLLLLSSISWG
        +LGAAAFNCLPVTTFLFALILRMEKVN+RTVAGMAK+VGIL+CIGGVATLAFYKGPYLKPLINHHLF+Y K QAHQAHASS KTWIIGCFLL +SSISWG
Subjt:  SLGAAAFNCLPVTTFLFALILRMEKVNLRTVAGMAKTVGILICIGGVATLAFYKGPYLKPLINHHLFEYPKSQAHQAHASSSKTWIIGCFLLLLSSISWG

Query:  SWFVLQAHFLKTYSSPLVFTSHQTMLSTVQSFVIAIAMERNPSEWKLSWNIRLIAVLYCGILVAVLSNILQCWVIKEKGPVFQAMTTPLNVIVTIIGSEL
         WFVLQAHFLKTY SPLVF SHQTMLSTVQSFV+AIAMERNPSEWKLSWNIRLIAVLYCGILV V+SN LQCWV+KEKGPVFQAMTTPLNVIVTIIGSEL
Subjt:  SWFVLQAHFLKTYSSPLVFTSHQTMLSTVQSFVIAIAMERNPSEWKLSWNIRLIAVLYCGILVAVLSNILQCWVIKEKGPVFQAMTTPLNVIVTIIGSEL

Query:  LLGEGIHLGSLVGAILLVVSLYSVLWGKSKELNIINGDCNQSSV-PAEAKDLSEMRSPAEP
        +LGEGIHLGSL+GAILLV SLY VLWGKSKELNI++ + NQ +V PAEA++LSEMRSP +P
Subjt:  LLGEGIHLGSLVGAILLVVSLYSVLWGKSKELNIINGDCNQSSV-PAEAKDLSEMRSPAEP

XP_022962748.1 WAT1-related protein At5g64700-like [Cucurbita moschata]  e-value: 2.6e-145  identity: 80.76%
Query:  QILLAVMSLLSKAAFASGMNSFVFVFYRQAAGAVFFLPLMM----KESRSLSLMDFLKIFMISLIGMTIGFNAYGVAVDYTSASLGAAAFNCLPVTTFLF
        QILLA MSLLSKAAFASGMNSFVFVFYRQAAGAVF+LPL+M    KE R LSL +F KIF+ISLIGMTIGFNAYGVAVDYTSA+LGAAAFNCLPVTTFLF
Subjt:  QILLAVMSLLSKAAFASGMNSFVFVFYRQAAGAVFFLPLMM----KESRSLSLMDFLKIFMISLIGMTIGFNAYGVAVDYTSASLGAAAFNCLPVTTFLF

Query:  ALILRMEKVNLRTVAGMAKTVGILICIGGVATLAFYKGPYLKPLINHHLFEYPKSQAHQAHASSSKTWIIGCFLLLLSSISWGSWFVLQAHFLKTYSSPL
        A++LRMEKV LRTVAGMAK  GIL+CIGGV TLAFYKGPYLKPL+NHHLF++ KSQ H+ H SS+KTWIIGCFLL LSSISWG WFVLQAHFLKTY SPL
Subjt:  ALILRMEKVNLRTVAGMAKTVGILICIGGVATLAFYKGPYLKPLINHHLFEYPKSQAHQAHASSSKTWIIGCFLLLLSSISWGSWFVLQAHFLKTYSSPL

Query:  VFTSHQTMLSTVQSFVIAIAMERNPSEWKLSWNIRLIAVLYCGILVAVLSNILQCWVIKEKGPVFQAMTTPLNVIVTIIGSELLLGEGIHLGSLVGAILL
         F S+QT+LST QSFVIAI MERNPSEWKL WNIRL+AVLYCGILV V+SN LQCWVIKEKGPVFQAMTTPLNVI TIIGSELLLGEGI+LGSL+GA+LL
Subjt:  VFTSHQTMLSTVQSFVIAIAMERNPSEWKLSWNIRLIAVLYCGILVAVLSNILQCWVIKEKGPVFQAMTTPLNVIVTIIGSELLLGEGIHLGSLVGAILL

Query:  VVSLYSVLWGKSKELNIINGDCNQSSVPAEAKDLSEMRSPAEP
        VVSLYSVLWGKSKELN+I+ D NQ       +++ EM SP +P
Subjt:  VVSLYSVLWGKSKELNIINGDCNQSSVPAEAKDLSEMRSPAEP

XP_023003479.1 WAT1-related protein At5g64700-like [Cucurbita maxima]  e-value: 2.6e-145  identity: 81.05%
Query:  QILLAVMSLLSKAAFASGMNSFVFVFYRQAAGAVFFLPLMM----KESRSLSLMDFLKIFMISLIGMTIGFNAYGVAVDYTSASLGAAAFNCLPVTTFLF
        QILLA MSLLSKAAFASGMNSFVFVFYRQAAGAVF+LPL+M    KE R LSL +F KIF ISLIGMTIGFNAYGVAVDYTSA+LGAAAFNCLPVTTFLF
Subjt:  QILLAVMSLLSKAAFASGMNSFVFVFYRQAAGAVFFLPLMM----KESRSLSLMDFLKIFMISLIGMTIGFNAYGVAVDYTSASLGAAAFNCLPVTTFLF

Query:  ALILRMEKVNLRTVAGMAKTVGILICIGGVATLAFYKGPYLKPLINHHLFEYPKSQAHQAHASSSKTWIIGCFLLLLSSISWGSWFVLQAHFLKTYSSPL
        A++LRMEKV LRTVAGMAK  GIL+CIGGV TLAFYKGPYLKPLINHHLF++ KSQ H+ H SS+KTWIIGCFLL LSSISWG WFVLQAHFLKTY SPL
Subjt:  ALILRMEKVNLRTVAGMAKTVGILICIGGVATLAFYKGPYLKPLINHHLFEYPKSQAHQAHASSSKTWIIGCFLLLLSSISWGSWFVLQAHFLKTYSSPL

Query:  VFTSHQTMLSTVQSFVIAIAMERNPSEWKLSWNIRLIAVLYCGILVAVLSNILQCWVIKEKGPVFQAMTTPLNVIVTIIGSELLLGEGIHLGSLVGAILL
         F S+QT+LST QSFVIAIAMERNPSEWKL WNIRL+AVLYCGILV V+SN LQCWVIKEKGPVFQAMTTPLNVI TIIGSELLLGEGI+LGSL+GA+LL
Subjt:  VFTSHQTMLSTVQSFVIAIAMERNPSEWKLSWNIRLIAVLYCGILVAVLSNILQCWVIKEKGPVFQAMTTPLNVIVTIIGSELLLGEGIHLGSLVGAILL

Query:  VVSLYSVLWGKSKELNIINGDCNQSSVPAEAKDLSEMRSPAEP
        VVSLYSVLWGKSKELN+I+ D N+       +++ EM SP +P
Subjt:  VVSLYSVLWGKSKELNIINGDCNQSSVPAEAKDLSEMRSPAEP

XP_023517581.1 WAT1-related protein At5g64700-like [Cucurbita pepo subsp. pepo]  e-value: 2.6e-145  identity: 80.76%
Query:  QILLAVMSLLSKAAFASGMNSFVFVFYRQAAGAVFFLPLMM----KESRSLSLMDFLKIFMISLIGMTIGFNAYGVAVDYTSASLGAAAFNCLPVTTFLF
        QILLA MSL+SKAAFASGMNSFVFVFYRQAAGAVF+LPL+M    KE R LSL +F KIF+ISLIGMTIGFNAYGVAVDYTSA+LGAAAFNCLPVTTFLF
Subjt:  QILLAVMSLLSKAAFASGMNSFVFVFYRQAAGAVFFLPLMM----KESRSLSLMDFLKIFMISLIGMTIGFNAYGVAVDYTSASLGAAAFNCLPVTTFLF

Query:  ALILRMEKVNLRTVAGMAKTVGILICIGGVATLAFYKGPYLKPLINHHLFEYPKSQAHQAHASSSKTWIIGCFLLLLSSISWGSWFVLQAHFLKTYSSPL
        A++LRMEKV LRTVAGMAK  GIL+CIGGV TLAFYKGPYLKPLINHHLF++ KSQ H+ H SS+KTWIIGCFLL LSSISWG WFVLQAHFLKTY SPL
Subjt:  ALILRMEKVNLRTVAGMAKTVGILICIGGVATLAFYKGPYLKPLINHHLFEYPKSQAHQAHASSSKTWIIGCFLLLLSSISWGSWFVLQAHFLKTYSSPL

Query:  VFTSHQTMLSTVQSFVIAIAMERNPSEWKLSWNIRLIAVLYCGILVAVLSNILQCWVIKEKGPVFQAMTTPLNVIVTIIGSELLLGEGIHLGSLVGAILL
         F S+QT+LST QSFVIAI MERNPSEWKL WNIRL+AVLYCGILV V+SN LQCWVIKEKGPVFQAMTTPLNVI TIIGSELLLGEGI+LGSL+GA+LL
Subjt:  VFTSHQTMLSTVQSFVIAIAMERNPSEWKLSWNIRLIAVLYCGILVAVLSNILQCWVIKEKGPVFQAMTTPLNVIVTIIGSELLLGEGIHLGSLVGAILL

Query:  VVSLYSVLWGKSKELNIINGDCNQSSVPAEAKDLSEMRSPAEP
        VVSLYSVLWGKSKELN+I+ D N+ +      ++ EMRSP +P
Subjt:  VVSLYSVLWGKSKELNIINGDCNQSSVPAEAKDLSEMRSPAEP

XP_038883836.1 WAT1-related protein At5g64700-like [Benincasa hispida]  e-value: 1.1e-146  identity: 79.78%
Query:  FFLLVSSIWFVLQILLAVMSLLSKAAFASGMNSFVFVFYRQAAGAVFFLPLMM----KESRSLSLMDFLKIFMISLIGMTIGFNAYGVAVDYTSASLGAA
        +FL VS+     QI LA MSLLSKAAFASGMN+FVFVFYRQAAGAVFFLPLM     KESRSLSL DFLKIF ISLIGMTIGFNAYGVAVDYTSA+LGAA
Subjt:  FFLLVSSIWFVLQILLAVMSLLSKAAFASGMNSFVFVFYRQAAGAVFFLPLMM----KESRSLSLMDFLKIFMISLIGMTIGFNAYGVAVDYTSASLGAA

Query:  AFNCLPVTTFLFALILRMEKVNLRTVAGMAKTVGILICIGGVATLAFYKGPYLKPLINHHLFEYPKSQAHQAHASSSKTWIIGCFLLLLSSISWGSWFVL
        AFNCLPVTTFLFA++LRMEKV LRTVAGMAK  GIL+CIGGV TLAFYKGPYLKPLINHHL  + KS AHQ H+ SS+TWIIGCFLL +SSISWG WFVL
Subjt:  AFNCLPVTTFLFALILRMEKVNLRTVAGMAKTVGILICIGGVATLAFYKGPYLKPLINHHLFEYPKSQAHQAHASSSKTWIIGCFLLLLSSISWGSWFVL

Query:  QAHFLKTYSSPLVFTSHQTMLSTVQSFVIAIAMERNPSEWKLSWNIRLIAVLYCGILVAVLSNILQCWVIKEKGPVFQAMTTPLNVIVTIIGSELLLGEG
        QAHFLKTY SPL F S+QT+LS  QSFVIAIAMER+PSEWKL WNIRL+AV+YCG+LV V+SN LQCWVIKEKGPVFQAMTTPLNVI TIIGSELLLGEG
Subjt:  QAHFLKTYSSPLVFTSHQTMLSTVQSFVIAIAMERNPSEWKLSWNIRLIAVLYCGILVAVLSNILQCWVIKEKGPVFQAMTTPLNVIVTIIGSELLLGEG

Query:  IHLGSLVGAILLVVSLYSVLWGKSKELNII---NGDCNQSSV---PAEAKDLSEMRSPAEP
        I+LGSL+GAILLVVSLYSVLWGKSKELN+I   N + N S V   P   KDLSEMR  AEP
Subjt:  IHLGSLVGAILLVVSLYSVLWGKSKELNII---NGDCNQSSV---PAEAKDLSEMRSPAEP

TrEMBL top hits (columns: e-value, % identity, alignment)
A0A0A0KHV4 WAT1-related protein  e-value: 6.1e-132  identity: 76.56%
Query:  MSLLSKAAFASGMNSFVFVFYRQAAGAVFFLPLMMKESRSLSLMDFLKIFMISLIGMTIGFNAYGVAVDYTSASLGAAAFNCLPVTTFLFALILRMEKVN
        MSLLSKAAFASGMN+FV                  KESRSLSL DFLKIFMISLIGMTIGFNAYGVAVDYTSA+LGAAAFNCLPVTTFLFA++LRMEKVN
Subjt:  MSLLSKAAFASGMNSFVFVFYRQAAGAVFFLPLMMKESRSLSLMDFLKIFMISLIGMTIGFNAYGVAVDYTSASLGAAAFNCLPVTTFLFALILRMEKVN

Query:  LRTVAGMAKTVGILICIGGVATLAFYKGPYLKPLINHHLFEYPKSQAHQAHASSSKTWIIGCFLLLLSSISWGSWFVLQAHFLKTYSSPLVFTSHQTMLS
        LR VAG+AK +GILICIGGV TLAFYKGPYLKPLINHHL ++ KS  +  H+SSSKTWIIGCFLL +SSISWG WFVLQA+FLKTY SPL F S+QT+LS
Subjt:  LRTVAGMAKTVGILICIGGVATLAFYKGPYLKPLINHHLFEYPKSQAHQAHASSSKTWIIGCFLLLLSSISWGSWFVLQAHFLKTYSSPLVFTSHQTMLS

Query:  TVQSFVIAIAMERNPSEWKLSWNIRLIAVLYCGILVAVLSNILQCWVIKEKGPVFQAMTTPLNVIVTIIGSELLLGEGIHLGSLVGAILLVVSLYSVLWG
          QSFVIAIAMER+PSEWKL WNIRL+AV+YCG+LV V+SN LQCWVIKEKGPVFQAMTTPLNVI TIIGSELLLGEGI+LGSL+GAILLV+SLYSVLWG
Subjt:  TVQSFVIAIAMERNPSEWKLSWNIRLIAVLYCGILVAVLSNILQCWVIKEKGPVFQAMTTPLNVIVTIIGSELLLGEGIHLGSLVGAILLVVSLYSVLWG

Query:  KSKELNIINGDC-NQSSV---PAEAKDLSEMRSPAEP
        K+KEL++ + D  NQ++V   P   KDLSEMR  AEP
Subjt:  KSKELNIINGDC-NQSSV---PAEAKDLSEMRSPAEP

A0A1S3B1Z4 WAT1-related protein  e-value: 9.0e-144  identity: 79.83%
Query:  QILLAVMSLLSKAAFASGMNSFVFVFYRQAAGAVFFLPLMM----KESRSLSLMDFLKIFMISLIGMTIGFNAYGVAVDYTSASLGAAAFNCLPVTTFLF
        Q+ LA MSLLSKAAFASGMN+FVFVFYRQAAGAVFFLPL+     KESRSLSL DFLKIF+ISLIGMT+GFNAYGVAVDYTSA+LGAAAFNCLPVTTFLF
Subjt:  QILLAVMSLLSKAAFASGMNSFVFVFYRQAAGAVFFLPLMM----KESRSLSLMDFLKIFMISLIGMTIGFNAYGVAVDYTSASLGAAAFNCLPVTTFLF

Query:  ALILRMEKVNLRTVAGMAKTVGILICIGGVATLAFYKGPYLKPLINHHLFEYPKSQAHQAHASSSKTWIIGCFLLLLSSISWGSWFVLQAHFLKTYSSPL
        A++LRMEKVNLR VAG+AK  GILICIGGV TLAFYKGPYLKPLINHHL +  KS  +  H+SSSKTWIIGCFLL +SSISWG WFVLQA+FLKTY SPL
Subjt:  ALILRMEKVNLRTVAGMAKTVGILICIGGVATLAFYKGPYLKPLINHHLFEYPKSQAHQAHASSSKTWIIGCFLLLLSSISWGSWFVLQAHFLKTYSSPL

Query:  VFTSHQTMLSTVQSFVIAIAMERNPSEWKLSWNIRLIAVLYCGILVAVLSNILQCWVIKEKGPVFQAMTTPLNVIVTIIGSELLLGEGIHLGSLVGAILL
         F S+QT+LS  QSFVIAIAMER+PSEWKL WNIRL+AV+YCG+LV V+SN LQCWVIKEKGPVFQAMTTPLNVI TIIGSELLLGEGI+LGSL+GAILL
Subjt:  VFTSHQTMLSTVQSFVIAIAMERNPSEWKLSWNIRLIAVLYCGILVAVLSNILQCWVIKEKGPVFQAMTTPLNVIVTIIGSELLLGEGIHLGSLVGAILL

Query:  VVSLYSVLWGKSKELNIINGDCN-QSSV---PAEAKDLSEMRSPAEP
        V+SLYSVLWGKSKELN+++ D N Q++V   P   KDLSEMR  AEP
Subjt:  VVSLYSVLWGKSKELNIINGDCN-QSSV---PAEAKDLSEMRSPAEP

A0A6J1BU68 WAT1-related protein  e-value: 1.5e-159  identity: 84.49%
Query:  FFLLVSS-----IWFVLQILLAVMSLLSKAAFASGMNSFVFVFYRQAAGAVFFLPLMM----KESRSLSLMDFLKIFMISLIGMTIGFNAYGVAVDYTSA
        FFL + S     +  + QI LA MSLLSKAAFASGMNSFVFVFYRQAAGAVFFLP+MM    KE R LSLMDFLKIFMISLIGMTIGFNAYGVAVDYTSA
Subjt:  FFLLVSS-----IWFVLQILLAVMSLLSKAAFASGMNSFVFVFYRQAAGAVFFLPLMM----KESRSLSLMDFLKIFMISLIGMTIGFNAYGVAVDYTSA

Query:  SLGAAAFNCLPVTTFLFALILRMEKVNLRTVAGMAKTVGILICIGGVATLAFYKGPYLKPLINHHLFEYPKSQAHQAHASSSKTWIIGCFLLLLSSISWG
        +LGAAAFNCLPVTTFLFALILRMEKVN+RTVAGMAK+VGIL+CIGGVATLAFYKGPYLKPLINHHLF+Y K QAHQAHASS KTWIIGCFLL +SSISWG
Subjt:  SLGAAAFNCLPVTTFLFALILRMEKVNLRTVAGMAKTVGILICIGGVATLAFYKGPYLKPLINHHLFEYPKSQAHQAHASSSKTWIIGCFLLLLSSISWG

Query:  SWFVLQAHFLKTYSSPLVFTSHQTMLSTVQSFVIAIAMERNPSEWKLSWNIRLIAVLYCGILVAVLSNILQCWVIKEKGPVFQAMTTPLNVIVTIIGSEL
         WFVLQAHFLKTY SPLVF SHQTMLSTVQSFV+AIAMERNPSEWKLSWNIRLIAVLYCGILV V+SN LQCWV+KEKGPVFQAMTTPLNVIVTIIGSEL
Subjt:  SWFVLQAHFLKTYSSPLVFTSHQTMLSTVQSFVIAIAMERNPSEWKLSWNIRLIAVLYCGILVAVLSNILQCWVIKEKGPVFQAMTTPLNVIVTIIGSEL

Query:  LLGEGIHLGSLVGAILLVVSLYSVLWGKSKELNIINGDCNQSSV-PAEAKDLSEMRSPAEP
        +LGEGIHLGSL+GAILLV SLY VLWGKSKELNI++ + NQ +V PAEA++LSEMRSP +P
Subjt:  LLGEGIHLGSLVGAILLVVSLYSVLWGKSKELNIINGDCNQSSV-PAEAKDLSEMRSPAEP

A0A6J1HFP6 WAT1-related protein  e-value: 1.3e-145  identity: 80.76%
Query:  QILLAVMSLLSKAAFASGMNSFVFVFYRQAAGAVFFLPLMM----KESRSLSLMDFLKIFMISLIGMTIGFNAYGVAVDYTSASLGAAAFNCLPVTTFLF
        QILLA MSLLSKAAFASGMNSFVFVFYRQAAGAVF+LPL+M    KE R LSL +F KIF+ISLIGMTIGFNAYGVAVDYTSA+LGAAAFNCLPVTTFLF
Subjt:  QILLAVMSLLSKAAFASGMNSFVFVFYRQAAGAVFFLPLMM----KESRSLSLMDFLKIFMISLIGMTIGFNAYGVAVDYTSASLGAAAFNCLPVTTFLF

Query:  ALILRMEKVNLRTVAGMAKTVGILICIGGVATLAFYKGPYLKPLINHHLFEYPKSQAHQAHASSSKTWIIGCFLLLLSSISWGSWFVLQAHFLKTYSSPL
        A++LRMEKV LRTVAGMAK  GIL+CIGGV TLAFYKGPYLKPL+NHHLF++ KSQ H+ H SS+KTWIIGCFLL LSSISWG WFVLQAHFLKTY SPL
Subjt:  ALILRMEKVNLRTVAGMAKTVGILICIGGVATLAFYKGPYLKPLINHHLFEYPKSQAHQAHASSSKTWIIGCFLLLLSSISWGSWFVLQAHFLKTYSSPL

Query:  VFTSHQTMLSTVQSFVIAIAMERNPSEWKLSWNIRLIAVLYCGILVAVLSNILQCWVIKEKGPVFQAMTTPLNVIVTIIGSELLLGEGIHLGSLVGAILL
         F S+QT+LST QSFVIAI MERNPSEWKL WNIRL+AVLYCGILV V+SN LQCWVIKEKGPVFQAMTTPLNVI TIIGSELLLGEGI+LGSL+GA+LL
Subjt:  VFTSHQTMLSTVQSFVIAIAMERNPSEWKLSWNIRLIAVLYCGILVAVLSNILQCWVIKEKGPVFQAMTTPLNVIVTIIGSELLLGEGIHLGSLVGAILL

Query:  VVSLYSVLWGKSKELNIINGDCNQSSVPAEAKDLSEMRSPAEP
        VVSLYSVLWGKSKELN+I+ D NQ       +++ EM SP +P
Subjt:  VVSLYSVLWGKSKELNIINGDCNQSSVPAEAKDLSEMRSPAEP

A0A6J1KTF8 WAT1-related protein  e-value: 1.3e-145  identity: 81.05%
Query:  QILLAVMSLLSKAAFASGMNSFVFVFYRQAAGAVFFLPLMM----KESRSLSLMDFLKIFMISLIGMTIGFNAYGVAVDYTSASLGAAAFNCLPVTTFLF
        QILLA MSLLSKAAFASGMNSFVFVFYRQAAGAVF+LPL+M    KE R LSL +F KIF ISLIGMTIGFNAYGVAVDYTSA+LGAAAFNCLPVTTFLF
Subjt:  QILLAVMSLLSKAAFASGMNSFVFVFYRQAAGAVFFLPLMM----KESRSLSLMDFLKIFMISLIGMTIGFNAYGVAVDYTSASLGAAAFNCLPVTTFLF

Query:  ALILRMEKVNLRTVAGMAKTVGILICIGGVATLAFYKGPYLKPLINHHLFEYPKSQAHQAHASSSKTWIIGCFLLLLSSISWGSWFVLQAHFLKTYSSPL
        A++LRMEKV LRTVAGMAK  GIL+CIGGV TLAFYKGPYLKPLINHHLF++ KSQ H+ H SS+KTWIIGCFLL LSSISWG WFVLQAHFLKTY SPL
Subjt:  ALILRMEKVNLRTVAGMAKTVGILICIGGVATLAFYKGPYLKPLINHHLFEYPKSQAHQAHASSSKTWIIGCFLLLLSSISWGSWFVLQAHFLKTYSSPL

Query:  VFTSHQTMLSTVQSFVIAIAMERNPSEWKLSWNIRLIAVLYCGILVAVLSNILQCWVIKEKGPVFQAMTTPLNVIVTIIGSELLLGEGIHLGSLVGAILL
         F S+QT+LST QSFVIAIAMERNPSEWKL WNIRL+AVLYCGILV V+SN LQCWVIKEKGPVFQAMTTPLNVI TIIGSELLLGEGI+LGSL+GA+LL
Subjt:  VFTSHQTMLSTVQSFVIAIAMERNPSEWKLSWNIRLIAVLYCGILVAVLSNILQCWVIKEKGPVFQAMTTPLNVIVTIIGSELLLGEGIHLGSLVGAILL

Query:  VVSLYSVLWGKSKELNIINGDCNQSSVPAEAKDLSEMRSPAEP
        VVSLYSVLWGKSKELN+I+ D N+       +++ EM SP +P
Subjt:  VVSLYSVLWGKSKELNIINGDCNQSSVPAEAKDLSEMRSPAEP

SwissProt top hits (columns: e-value, % identity, alignment)
Q6NMB7 WAT1-related protein At1g43650  e-value: 2.0e-55  identity: 42.45%
Query:  LQILLAVMSLLSKAAFASGMNSFVFVFYRQAAGAVFFLP----LMMKESRSLSLMDFLKIFMISLIGMTIGFNAYGVAVDYTSASLGAAAFNCLPVTTFL
        +QI+ A M LLSK A + G N FVFVFYRQA  A+   P    L   +S  LS +  LKIF ISL G+T+  N Y VA++ T+A+  AA  N +P  TF+
Subjt:  LQILLAVMSLLSKAAFASGMNSFVFVFYRQAAGAVFFLP----LMMKESRSLSLMDFLKIFMISLIGMTIGFNAYGVAVDYTSASLGAAAFNCLPVTTFL

Query:  FALILRMEKVNLRTVAGMAKTVGILICIGGVATLAFYKGPYLKPLINHHLFEYPKSQAHQAHASSSKTWIIGCFLLLLSSISWGSWFVLQAHFLKTYSSP
         AL+ R+E V L+   G+AK  G ++ + G    AF KGP    LINH    Y  S        S+K  + G   +L ++  W  W ++Q+  +K Y + 
Subjt:  FALILRMEKVNLRTVAGMAKTVGILICIGGVATLAFYKGPYLKPLINHHLFEYPKSQAHQAHASSSKTWIIGCFLLLLSSISWGSWFVLQAHFLKTYSSP

Query:  LVFTSHQTMLSTVQSFVIAIAMERNPSEWKLSWNIRLIAVLYCGILVAVLSNILQCWVIKEKGPVFQAMTTPLNVIVTIIGSELLLGEGIHLGSLVGAIL
        L   + Q + S +QS V A+A+ RNPS WK+ + + L+++ YCGI+V  L+  LQ W I++KGPVF A+ TPL +I+T I S  L  E  +LGS+ GA+L
Subjt:  LVFTSHQTMLSTVQSFVIAIAMERNPSEWKLSWNIRLIAVLYCGILVAVLSNILQCWVIKEKGPVFQAMTTPLNVIVTIIGSELLLGEGIHLGSLVGAIL

Query:  LVVSLYSVLWGKSKELNI
        LV  LY  LWGK+KE  I
Subjt:  LVVSLYSVLWGKSKELNI

Q9FGG3 WAT1-related protein At5g64700  e-value: 1.1e-77  identity: 48.9%
Query:  VLQILLAVMSLLSKAAFASGMNSFVFVFYRQAAGAVFFLPLMM----KESRSLSLMDFLKIFMISLIGMTIGFNAYGVAVDYTSASLGAAAFNCLPVTTF
        ++Q++  +M L+SKA F  GMN+FVFVFYRQA   +F  PL      K +  LS + F+KIFM+SL G+T+  +  G+A+ YTSA+L AA    LP  TF
Subjt:  VLQILLAVMSLLSKAAFASGMNSFVFVFYRQAAGAVFFLPLMM----KESRSLSLMDFLKIFMISLIGMTIGFNAYGVAVDYTSASLGAAAFNCLPVTTF

Query:  LFALILRMEKVNLRTVAGMAKTVGILICIGGVATLAFYKGPYLK-PLINH--HLFEYPKSQAHQAHASSSKTWIIGCFLLLLSSISWGSWFVLQAHFLKT
          AL+  ME++ ++++ G AK VGI +C+GGV  LA YKGP LK PL  H  H  E+P        +  S +W+ GC L++ S+I WG W VLQ   LK 
Subjt:  LFALILRMEKVNLRTVAGMAKTVGILICIGGVATLAFYKGPYLK-PLINH--HLFEYPKSQAHQAHASSSKTWIIGCFLLLLSSISWGSWFVLQAHFLKT

Query:  YSSPLVFTSHQTMLSTVQSFVIAIAMERNPSEWKLSWNIRLIAVLYCGILVAVLSNILQCWVIKEKGPVFQAMTTPLNVIVTIIGSELLLGEGIHLGSLV
        Y S L FT+   +LS++QSFVIAIA+ER+ S WKL WN+RL+AV+YCG +V  ++  LQ WVI+++GPVF +M TPL+++ T++ S +LL E I LGS+V
Subjt:  YSSPLVFTSHQTMLSTVQSFVIAIAMERNPSEWKLSWNIRLIAVLYCGILVAVLSNILQCWVIKEKGPVFQAMTTPLNVIVTIIGSELLLGEGIHLGSLV

Query:  GAILLVVSLYSVLWGKSKE
        G +LL++ LY VLWGKS+E
Subjt:  GAILLVVSLYSVLWGKSKE

Q9FL41 WAT1-related protein At5g07050  e-value: 3.2e-45  identity: 33.13%
Query:  FLLVSSIWFV---LQILLAVMSLLSKAAFASGMNSFVFVFYRQAAGAVFFLPLMM----KESRSLSLMDFLKIFMISLIGMTIGFNAYGVAVDYTSASLG
        FL  S  +F    LQ   A M++++K +  +GM+ +V V YR A       P       K    ++   F+++F++ L+G  I  N Y + + YTS +  
Subjt:  FLLVSSIWFV---LQILLAVMSLLSKAAFASGMNSFVFVFYRQAAGAVFFLPLMM----KESRSLSLMDFLKIFMISLIGMTIGFNAYGVAVDYTSASLG

Query:  AAAFNCLPVTTFLFALILRMEKVNLRTVAGMAKTVGILICIGGVATLAFYKGPYLKPLINHHLFEYPKSQAH---QAHASSSKTWIIGCFLLLLSSISWG
         A  N LP  TF+ A++ RME ++L+ +   AK  G ++ + G   +  YKGP ++     ++     S A+     ++SS K ++ G  LL+ ++++W 
Subjt:  AAAFNCLPVTTFLFALILRMEKVNLRTVAGMAKTVGILICIGGVATLAFYKGPYLKPLINHHLFEYPKSQAH---QAHASSSKTWIIGCFLLLLSSISWG

Query:  SWFVLQAHFLKTYSS-PLVFTSHQTMLSTVQSFVIAIAMERNPSEWKLSWNIRLIAVLYCGILVAVLSNILQCWVIKEKGPVFQAMTTPLNVIVTIIGSE
        S FVLQA  LKTY+   L  T+    + T+Q+  +   ME NPS W++ W++ L+A  Y GI+ + +S  +Q  V+K++GPVF    +PL +++  +   
Subjt:  SWFVLQAHFLKTYSS-PLVFTSHQTMLSTVQSFVIAIAMERNPSEWKLSWNIRLIAVLYCGILVAVLSNILQCWVIKEKGPVFQAMTTPLNVIVTIIGSE

Query:  LLLGEGIHLGSLVGAILLVVSLYSVLWGKSKE
         +L E I LG ++GA+L+V+ LY+VLWGK KE
Subjt:  LLLGEGIHLGSLVGAILLVVSLYSVLWGKSKE

Q9M0B8 WAT1-related protein At4g30420  e-value: 1.7e-46  identity: 32.64%
Query:  VLQILLAVMSLLSKAAFASGMNSFVFVFYRQAAGAVFFLPLMMKESR-------SLSLMDFLKIFMISLIGMTIGFNAYGVAVDYTSASLGAAAFNCLPV
        ++Q+  A ++L ++A    G++  VF+ YRQA   +F  P +    R       SL L  F  IF++SLIG+TI  N Y   +  TS+S+G+A  N +P 
Subjt:  VLQILLAVMSLLSKAAFASGMNSFVFVFYRQAAGAVFFLPLMMKESR-------SLSLMDFLKIFMISLIGMTIGFNAYGVAVDYTSASLGAAAFNCLPV

Query:  TTFLFALILRMEKVNLRTVAGMAKTVGILICIGGVATLAFYKGPYLKPLINHHLFEYPKSQAHQAHASSSKTWIIGCFLLLLSSISWGSWFVLQAHFLKT
         TFL + +   EK+NLR + G+AK  G ++C+ G  ++   +GP    ++N      P +++   H     TW+IGC  L  S++ W  W +LQ      
Subjt:  TTFLFALILRMEKVNLRTVAGMAKTVGILICIGGVATLAFYKGPYLKPLINHHLFEYPKSQAHQAHASSSKTWIIGCFLLLLSSISWGSWFVLQAHFLKT

Query:  YSSPLVFTSHQTMLSTVQSFVIAIAMERNPSEWKLSWNIRLIAVLYCGILVAVLSNILQCWVIKEKGPVFQAMTTPLNVIVTIIGSELLLGEGIHLGSLV
        Y   L  ++   +  T+Q  V+   +E++P+ W L         LY GI  + LS  +Q W I ++GPVF A+  PL  ++  I + L   E I+ GSL+
Subjt:  YSSPLVFTSHQTMLSTVQSFVIAIAMERNPSEWKLSWNIRLIAVLYCGILVAVLSNILQCWVIKEKGPVFQAMTTPLNVIVTIIGSELLLGEGIHLGSLV

Query:  GAILLVVSLYSVLWGKSKELNIINGDCNQSSVPAEAK
        G + +++ LY+VLWGK+K++ ++N D   +   +E K
Subjt:  GAILLVVSLYSVLWGKSKELNIINGDCNQSSVPAEAK

Q9SUD5 WAT1-related protein At4g28040  e-value: 7.8e-44  identity: 33.12%
Query:  VLQILLAVMSLLSKAAFASGMNSFVFVFYRQAAGAVFFLPLMM-----KESR-SLSLMDFLKIFMISLIGMTIGFNAYGVAVDYTSASLGAAAFNCLPVT
        +LQ   A ++L +KAAF  G+N  VFV YRQA   +F  P+       KE++ SL +  F  + + ++IG+T+  NAY   +D +S+S+  A  N +P  
Subjt:  VLQILLAVMSLLSKAAFASGMNSFVFVFYRQAAGAVFFLPLMM-----KESR-SLSLMDFLKIFMISLIGMTIGFNAYGVAVDYTSASLGAAAFNCLPVT

Query:  TFLFALILRMEKVNLRTVAGMAKTVGILICIGGVATLAFYKGP-YLKPLINHHLFEYPKSQAHQAHASSSKTWIIGCFLLLLSSISWGSWFVLQAHFLKT
        TF+ ++I+  E +  R++  +AK +G  +C+GG   + F +GP  L  L+N                  +  W++GCF LL+S+ +W  W +LQ      
Subjt:  TFLFALILRMEKVNLRTVAGMAKTVGILICIGGVATLAFYKGP-YLKPLINHHLFEYPKSQAHQAHASSSKTWIIGCFLLLLSSISWGSWFVLQAHFLKT

Query:  YSSPLVFTSHQTMLSTVQSFVIAIAM-ERNPSEWKLSWNIRLIAVLYCGILVAVLSNILQCWVIKEKGPVFQAMTTPLNVIVTIIGSELLLGEGIHLGSL
            L  ++    ++T+ SF++A+A+   +   WKL   ++L   +Y G  +A+ S  LQ W++ +KGPVF A+  PL+ ++      L L E  +LGSL
Subjt:  YSSPLVFTSHQTMLSTVQSFVIAIAM-ERNPSEWKLSWNIRLIAVLYCGILVAVLSNILQCWVIKEKGPVFQAMTTPLNVIVTIIGSELLLGEGIHLGSL

Query:  VGAILLVVSLYSVLWGKSKE
        +GA+ +++ LY VLWGKS++
Subjt:  VGAILLVVSLYSVLWGKSKE

Arabidopsis top hits (columns: e-value, % identity, alignment)
AT1G43650.1 nodulin MtN21 /EamA-like transporter family protein  e-value: 1.4e-56  identity: 42.45%
Query:  LQILLAVMSLLSKAAFASGMNSFVFVFYRQAAGAVFFLP----LMMKESRSLSLMDFLKIFMISLIGMTIGFNAYGVAVDYTSASLGAAAFNCLPVTTFL
        +QI+ A M LLSK A + G N FVFVFYRQA  A+   P    L   +S  LS +  LKIF ISL G+T+  N Y VA++ T+A+  AA  N +P  TF+
Subjt:  LQILLAVMSLLSKAAFASGMNSFVFVFYRQAAGAVFFLP----LMMKESRSLSLMDFLKIFMISLIGMTIGFNAYGVAVDYTSASLGAAAFNCLPVTTFL

Query:  FALILRMEKVNLRTVAGMAKTVGILICIGGVATLAFYKGPYLKPLINHHLFEYPKSQAHQAHASSSKTWIIGCFLLLLSSISWGSWFVLQAHFLKTYSSP
         AL+ R+E V L+   G+AK  G ++ + G    AF KGP    LINH    Y  S        S+K  + G   +L ++  W  W ++Q+  +K Y + 
Subjt:  FALILRMEKVNLRTVAGMAKTVGILICIGGVATLAFYKGPYLKPLINHHLFEYPKSQAHQAHASSSKTWIIGCFLLLLSSISWGSWFVLQAHFLKTYSSP

Query:  LVFTSHQTMLSTVQSFVIAIAMERNPSEWKLSWNIRLIAVLYCGILVAVLSNILQCWVIKEKGPVFQAMTTPLNVIVTIIGSELLLGEGIHLGSLVGAIL
        L   + Q + S +QS V A+A+ RNPS WK+ + + L+++ YCGI+V  L+  LQ W I++KGPVF A+ TPL +I+T I S  L  E  +LGS+ GA+L
Subjt:  LVFTSHQTMLSTVQSFVIAIAMERNPSEWKLSWNIRLIAVLYCGILVAVLSNILQCWVIKEKGPVFQAMTTPLNVIVTIIGSELLLGEGIHLGSLVGAIL

Query:  LVVSLYSVLWGKSKELNI
        LV  LY  LWGK+KE  I
Subjt:  LVVSLYSVLWGKSKELNI

AT4G28040.1 nodulin MtN21 /EamA-like transporter family protein  e-value: 5.6e-45  identity: 33.12%
Query:  VLQILLAVMSLLSKAAFASGMNSFVFVFYRQAAGAVFFLPLMM-----KESR-SLSLMDFLKIFMISLIGMTIGFNAYGVAVDYTSASLGAAAFNCLPVT
        +LQ   A ++L +KAAF  G+N  VFV YRQA   +F  P+       KE++ SL +  F  + + ++IG+T+  NAY   +D +S+S+  A  N +P  
Subjt:  VLQILLAVMSLLSKAAFASGMNSFVFVFYRQAAGAVFFLPLMM-----KESR-SLSLMDFLKIFMISLIGMTIGFNAYGVAVDYTSASLGAAAFNCLPVT

Query:  TFLFALILRMEKVNLRTVAGMAKTVGILICIGGVATLAFYKGP-YLKPLINHHLFEYPKSQAHQAHASSSKTWIIGCFLLLLSSISWGSWFVLQAHFLKT
        TF+ ++I+  E +  R++  +AK +G  +C+GG   + F +GP  L  L+N                  +  W++GCF LL+S+ +W  W +LQ      
Subjt:  TFLFALILRMEKVNLRTVAGMAKTVGILICIGGVATLAFYKGP-YLKPLINHHLFEYPKSQAHQAHASSSKTWIIGCFLLLLSSISWGSWFVLQAHFLKT

Query:  YSSPLVFTSHQTMLSTVQSFVIAIAM-ERNPSEWKLSWNIRLIAVLYCGILVAVLSNILQCWVIKEKGPVFQAMTTPLNVIVTIIGSELLLGEGIHLGSL
            L  ++    ++T+ SF++A+A+   +   WKL   ++L   +Y G  +A+ S  LQ W++ +KGPVF A+  PL+ ++      L L E  +LGSL
Subjt:  YSSPLVFTSHQTMLSTVQSFVIAIAM-ERNPSEWKLSWNIRLIAVLYCGILVAVLSNILQCWVIKEKGPVFQAMTTPLNVIVTIIGSELLLGEGIHLGSL

Query:  VGAILLVVSLYSVLWGKSKE
        +GA+ +++ LY VLWGKS++
Subjt:  VGAILLVVSLYSVLWGKSKE

AT4G30420.1 nodulin MtN21 /EamA-like transporter family protein  e-value: 1.2e-47  identity: 32.64%
Query:  VLQILLAVMSLLSKAAFASGMNSFVFVFYRQAAGAVFFLPLMMKESR-------SLSLMDFLKIFMISLIGMTIGFNAYGVAVDYTSASLGAAAFNCLPV
        ++Q+  A ++L ++A    G++  VF+ YRQA   +F  P +    R       SL L  F  IF++SLIG+TI  N Y   +  TS+S+G+A  N +P 
Subjt:  VLQILLAVMSLLSKAAFASGMNSFVFVFYRQAAGAVFFLPLMMKESR-------SLSLMDFLKIFMISLIGMTIGFNAYGVAVDYTSASLGAAAFNCLPV

Query:  TTFLFALILRMEKVNLRTVAGMAKTVGILICIGGVATLAFYKGPYLKPLINHHLFEYPKSQAHQAHASSSKTWIIGCFLLLLSSISWGSWFVLQAHFLKT
         TFL + +   EK+NLR + G+AK  G ++C+ G  ++   +GP    ++N      P +++   H     TW+IGC  L  S++ W  W +LQ      
Subjt:  TTFLFALILRMEKVNLRTVAGMAKTVGILICIGGVATLAFYKGPYLKPLINHHLFEYPKSQAHQAHASSSKTWIIGCFLLLLSSISWGSWFVLQAHFLKT

Query:  YSSPLVFTSHQTMLSTVQSFVIAIAMERNPSEWKLSWNIRLIAVLYCGILVAVLSNILQCWVIKEKGPVFQAMTTPLNVIVTIIGSELLLGEGIHLGSLV
        Y   L  ++   +  T+Q  V+   +E++P+ W L         LY GI  + LS  +Q W I ++GPVF A+  PL  ++  I + L   E I+ GSL+
Subjt:  YSSPLVFTSHQTMLSTVQSFVIAIAMERNPSEWKLSWNIRLIAVLYCGILVAVLSNILQCWVIKEKGPVFQAMTTPLNVIVTIIGSELLLGEGIHLGSLV

Query:  GAILLVVSLYSVLWGKSKELNIINGDCNQSSVPAEAK
        G + +++ LY+VLWGK+K++ ++N D   +   +E K
Subjt:  GAILLVVSLYSVLWGKSKELNIINGDCNQSSVPAEAK

AT5G07050.1 nodulin MtN21 /EamA-like transporter family protein  e-value: 2.3e-46  identity: 33.13%
Query:  FLLVSSIWFV---LQILLAVMSLLSKAAFASGMNSFVFVFYRQAAGAVFFLPLMM----KESRSLSLMDFLKIFMISLIGMTIGFNAYGVAVDYTSASLG
        FL  S  +F    LQ   A M++++K +  +GM+ +V V YR A       P       K    ++   F+++F++ L+G  I  N Y + + YTS +  
Subjt:  FLLVSSIWFV---LQILLAVMSLLSKAAFASGMNSFVFVFYRQAAGAVFFLPLMM----KESRSLSLMDFLKIFMISLIGMTIGFNAYGVAVDYTSASLG

Query:  AAAFNCLPVTTFLFALILRMEKVNLRTVAGMAKTVGILICIGGVATLAFYKGPYLKPLINHHLFEYPKSQAH---QAHASSSKTWIIGCFLLLLSSISWG
         A  N LP  TF+ A++ RME ++L+ +   AK  G ++ + G   +  YKGP ++     ++     S A+     ++SS K ++ G  LL+ ++++W 
Subjt:  AAAFNCLPVTTFLFALILRMEKVNLRTVAGMAKTVGILICIGGVATLAFYKGPYLKPLINHHLFEYPKSQAH---QAHASSSKTWIIGCFLLLLSSISWG

Query:  SWFVLQAHFLKTYSS-PLVFTSHQTMLSTVQSFVIAIAMERNPSEWKLSWNIRLIAVLYCGILVAVLSNILQCWVIKEKGPVFQAMTTPLNVIVTIIGSE
        S FVLQA  LKTY+   L  T+    + T+Q+  +   ME NPS W++ W++ L+A  Y GI+ + +S  +Q  V+K++GPVF    +PL +++  +   
Subjt:  SWFVLQAHFLKTYSS-PLVFTSHQTMLSTVQSFVIAIAMERNPSEWKLSWNIRLIAVLYCGILVAVLSNILQCWVIKEKGPVFQAMTTPLNVIVTIIGSE

Query:  LLLGEGIHLGSLVGAILLVVSLYSVLWGKSKE
         +L E I LG ++GA+L+V+ LY+VLWGK KE
Subjt:  LLLGEGIHLGSLVGAILLVVSLYSVLWGKSKE

AT5G64700.1 nodulin MtN21 /EamA-like transporter family protein  e-value: 7.7e-79  identity: 48.9%
Query:  VLQILLAVMSLLSKAAFASGMNSFVFVFYRQAAGAVFFLPLMM----KESRSLSLMDFLKIFMISLIGMTIGFNAYGVAVDYTSASLGAAAFNCLPVTTF
        ++Q++  +M L+SKA F  GMN+FVFVFYRQA   +F  PL      K +  LS + F+KIFM+SL G+T+  +  G+A+ YTSA+L AA    LP  TF
Subjt:  VLQILLAVMSLLSKAAFASGMNSFVFVFYRQAAGAVFFLPLMM----KESRSLSLMDFLKIFMISLIGMTIGFNAYGVAVDYTSASLGAAAFNCLPVTTF

Query:  LFALILRMEKVNLRTVAGMAKTVGILICIGGVATLAFYKGPYLK-PLINH--HLFEYPKSQAHQAHASSSKTWIIGCFLLLLSSISWGSWFVLQAHFLKT
          AL+  ME++ ++++ G AK VGI +C+GGV  LA YKGP LK PL  H  H  E+P        +  S +W+ GC L++ S+I WG W VLQ   LK 
Subjt:  LFALILRMEKVNLRTVAGMAKTVGILICIGGVATLAFYKGPYLK-PLINH--HLFEYPKSQAHQAHASSSKTWIIGCFLLLLSSISWGSWFVLQAHFLKT

Query:  YSSPLVFTSHQTMLSTVQSFVIAIAMERNPSEWKLSWNIRLIAVLYCGILVAVLSNILQCWVIKEKGPVFQAMTTPLNVIVTIIGSELLLGEGIHLGSLV
        Y S L FT+   +LS++QSFVIAIA+ER+ S WKL WN+RL+AV+YCG +V  ++  LQ WVI+++GPVF +M TPL+++ T++ S +LL E I LGS+V
Subjt:  YSSPLVFTSHQTMLSTVQSFVIAIAMERNPSEWKLSWNIRLIAVLYCGILVAVLSNILQCWVIKEKGPVFQAMTTPLNVIVTIIGSELLLGEGIHLGSLV

Query:  GAILLVVSLYSVLWGKSKE
        G +LL++ LY VLWGKS+E
Subjt:  GAILLVVSLYSVLWGKSKE
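
In the alignment blocks above, the unlabeled middle line marks each column: a letter for an identical residue, `+` for a conservative substitution, and a space for a mismatch or gap. As a rough sketch of how a block can be tallied, the snippet below counts those symbols for the final block of the AT5G64700.1 hit. The function name `midline_counts` is hypothetical, and the code assumes the midline has been stripped of its left indent and still spans the full block width.

```python
def midline_counts(midline: str):
    """Return (identities, positives, columns) for one alignment midline.

    Hypothetical helper: letters count as identities, '+' as conservative
    substitutions, and positives = identities + '+' columns.
    """
    identities = sum(ch.isalpha() for ch in midline)
    positives = identities + midline.count("+")
    return identities, positives, len(midline)


# Midline of the last AT5G64700.1 block (query: GAILLVVSLYSVLWGKSKE).
midline = "G +LL++ LY VLWGKS+E"
identities, positives, columns = midline_counts(midline)
print(f"{identities}/{columns} identical, {positives}/{columns} positive")
```

Summing these counts over every block of a hit and dividing by the total number of aligned columns would give a percentage comparable to the identity column, provided no columns were lost in extraction.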


Sequences
CDS sequence
ATGGAGAAGGAGAAGATAAAGAAAGCAGCAGCAGCCAGCAGGGATTGCAAAGGAGGTGGGTATAACGTTAATGGTGTGGGCATTCTTAGTTTTTATAAAGGCCCATTTTT
TAAACCAGTTTTCACTCACCATCGAGGAAACTCAAGCTCACCGGCCGGCGCCGGAAAAACATGGACGATCGGTTGTTTCTTCCTACTCGTCTCCAGCATTTGGTTCGTTC
TCCAGATTCTGCTGGCCGTCATGAGTCTGCTGTCCAAAGCAGCGTTTGCTTCCGGCATGAACAGCTTCGTGTTTGTCTTCTACAGGCAAGCTGCTGGAGCTGTTTTCTTC
CTCCCTCTGATGATGAAAGAGAGCCGATCGCTTTCTCTTATGGATTTCTTGAAGATTTTCATGATTTCGTTAATAGGGATGACTATTGGATTCAATGCCTATGGTGTGGC
TGTTGATTATACATCTGCAAGTTTGGGTGCTGCTGCATTTAATTGCCTCCCTGTCACAACATTTCTCTTTGCTCTTATATTAAGAATGGAGAAAGTAAACTTGAGAACAG
TGGCTGGAATGGCAAAAACTGTTGGAATATTAATTTGCATTGGAGGAGTGGCCACACTTGCTTTCTACAAAGGCCCTTATTTGAAGCCTCTCATCAATCATCACCTTTTT
GAATATCCCAAAAGCCAAGCCCACCAAGCTCATGCCTCCTCTTCAAAAACATGGATTATTGGCTGTTTCCTTCTGTTACTCTCCAGCATTTCCTGGGGTTCGTGGTTTGT
GCTTCAGGCCCATTTTCTCAAGACTTATTCATCACCGCTCGTGTTCACAAGTCACCAAACAATGTTAAGCACAGTCCAATCTTTTGTAATCGCCATTGCAATGGAAAGGA
ACCCTTCTGAGTGGAAGTTGAGCTGGAACATTAGGCTGATTGCTGTACTTTACTGCGGAATTCTTGTAGCTGTTCTTTCGAATATCTTGCAATGTTGGGTGATCAAAGAG
AAGGGGCCAGTCTTCCAAGCCATGACGACGCCGTTGAATGTCATCGTCACTATCATTGGTTCTGAGTTGCTGTTGGGCGAGGGCATCCACTTGGGAAGTCTTGTCGGTGC
AATCTTGTTGGTGGTGAGCCTTTACAGTGTATTATGGGGTAAAAGCAAAGAGCTGAACATTATTAATGGAGATTGTAATCAATCATCTGTTCCAGCTGAAGCAAAAGACT
TGTCGGAGATGAGATCACCTGCTGAGCCATAG
mRNA sequence
ATGGAGAAGGAGAAGATAAAGAAAGCAGCAGCAGCCAGCAGGGATTGCAAAGGAGGTGGGTATAACGTTAATGGTGTGGGCATTCTTAGTTTTTATAAAGGCCCATTTTT
TAAACCAGTTTTCACTCACCATCGAGGAAACTCAAGCTCACCGGCCGGCGCCGGAAAAACATGGACGATCGGTTGTTTCTTCCTACTCGTCTCCAGCATTTGGTTCGTTC
TCCAGATTCTGCTGGCCGTCATGAGTCTGCTGTCCAAAGCAGCGTTTGCTTCCGGCATGAACAGCTTCGTGTTTGTCTTCTACAGGCAAGCTGCTGGAGCTGTTTTCTTC
CTCCCTCTGATGATGAAAGAGAGCCGATCGCTTTCTCTTATGGATTTCTTGAAGATTTTCATGATTTCGTTAATAGGGATGACTATTGGATTCAATGCCTATGGTGTGGC
TGTTGATTATACATCTGCAAGTTTGGGTGCTGCTGCATTTAATTGCCTCCCTGTCACAACATTTCTCTTTGCTCTTATATTAAGAATGGAGAAAGTAAACTTGAGAACAG
TGGCTGGAATGGCAAAAACTGTTGGAATATTAATTTGCATTGGAGGAGTGGCCACACTTGCTTTCTACAAAGGCCCTTATTTGAAGCCTCTCATCAATCATCACCTTTTT
GAATATCCCAAAAGCCAAGCCCACCAAGCTCATGCCTCCTCTTCAAAAACATGGATTATTGGCTGTTTCCTTCTGTTACTCTCCAGCATTTCCTGGGGTTCGTGGTTTGT
GCTTCAGGCCCATTTTCTCAAGACTTATTCATCACCGCTCGTGTTCACAAGTCACCAAACAATGTTAAGCACAGTCCAATCTTTTGTAATCGCCATTGCAATGGAAAGGA
ACCCTTCTGAGTGGAAGTTGAGCTGGAACATTAGGCTGATTGCTGTACTTTACTGCGGAATTCTTGTAGCTGTTCTTTCGAATATCTTGCAATGTTGGGTGATCAAAGAG
AAGGGGCCAGTCTTCCAAGCCATGACGACGCCGTTGAATGTCATCGTCACTATCATTGGTTCTGAGTTGCTGTTGGGCGAGGGCATCCACTTGGGAAGTCTTGTCGGTGC
AATCTTGTTGGTGGTGAGCCTTTACAGTGTATTATGGGGTAAAAGCAAAGAGCTGAACATTATTAATGGAGATTGTAATCAATCATCTGTTCCAGCTGAAGCAAAAGACT
TGTCGGAGATGAGATCACCTGCTGAGCCATAG
Protein sequence
MEKEKIKKAAAASRDCKGGGYNVNGVGILSFYKGPFFKPVFTHHRGNSSSPAGAGKTWTIGCFFLLVSSIWFVLQILLAVMSLLSKAAFASGMNSFVFVFYRQAAGAVFF
LPLMMKESRSLSLMDFLKIFMISLIGMTIGFNAYGVAVDYTSASLGAAAFNCLPVTTFLFALILRMEKVNLRTVAGMAKTVGILICIGGVATLAFYKGPYLKPLINHHLF
EYPKSQAHQAHASSSKTWIIGCFLLLLSSISWGSWFVLQAHFLKTYSSPLVFTSHQTMLSTVQSFVIAIAMERNPSEWKLSWNIRLIAVLYCGILVAVLSNILQCWVIKE
KGPVFQAMTTPLNVIVTIIGSELLLGEGIHLGSLVGAILLVVSLYSVLWGKSKELNIINGDCNQSSVPAEAKDLSEMRSPAEP
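
The protein sequence corresponds to the CDS read codon by codon. The sketch below illustrates that relationship on the first 21 and last 9 nucleotides of the CDS listed above, assuming the standard genetic code; the `translate` helper is written here for illustration and is not taken from the database.

```python
# Standard genetic code, codons enumerated with bases in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate an in-frame coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for pos in range(0, len(cds), 3):
        amino_acid = CODON_TABLE[cds[pos:pos + 3]]
        if amino_acid == "*":  # stop codon ends translation
            break
        protein.append(amino_acid)
    return "".join(protein)


start_of_cds = translate("ATGGAGAAGGAGAAGATAAAG")  # first 7 codons of the CDS
end_of_cds = translate("GAGCCATAG")                # last 3 codons, including the stop
print(start_of_cds, end_of_cds)
```

These fragments translate to MEKEKIK and EP, matching the start and end of the protein sequence above.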