; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0029871 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0029871
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionWAT1-related protein
Genome locationchr8:42751522..42755419
RNA-Seq ExpressionLag0029871
SyntenyLag0029871
Gene Ontology termsGO:0055085 - transmembrane transport (biological process)
GO:0005886 - plasma membrane (cellular component)
GO:0016021 - integral component of membrane (cellular component)
GO:0022857 - transmembrane transporter activity (molecular function)
InterPro domainsIPR000620 - EamA domain
IPR030184 - WAT1-related protein


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6602357.1 WAT1-related protein, partial [Cucurbita argyrosperma subsp. sororia]8.2e-12174.35Show/hide
Query:  MKKLSGAL---KPYLAAILLRFLASGLIVIAKIALNQGMSPQKKKTRDDLVHLCKDSTAGLYGVRIFSFLFNK----------FGSAL----SSSVVIPN
        MK+LSGAL   KPYLAAILLRF+A+GLIVIAKIALN+GMSPQ       +  L +   A +  V  F+ LF++          FG  L      SVVIPN
Subjt:  MKKLSGAL---KPYLAAILLRFLASGLIVIAKIALNQGMSPQKKKTRDDLVHLCKDSTAGLYGVRIFSFLFNK----------FGSAL----SSSVVIPN

Query:  TYFTGLKYVTPTFSIAMSNAVPALSFFFAWIFRMEKVDIRRISSQAKILGTAVTVGGAMVMTFVRGTNLRFPWTKG-DSHNHSSNAASNQD-SFKGAILV
        TYFTGLK+V+PTFSIAMSNAVPALSFFFAWIFRMEKVDIRR+SS AKILGTAVTVGGAM M FVR   LRF WTKG DS +HSS AA+NQD  FKG ILV
Subjt:  TYFTGLKYVTPTFSIAMSNAVPALSFFFAWIFRMEKVDIRRISSQAKILGTAVTVGGAMVMTFVRGTNLRFPWTKG-DSHNHSSNAASNQD-SFKGAILV

Query:  TVSCLCSSVSCILQAIVLKSYPVGLSVTVLVCLVGVVEGTVIALAMEWKNPAAWSIHFDFQLLAILYAGIIISGLSYYIQGVVMEEKGPVFLTAFFPLST
        T+SC+ SS+SC+LQAIVLKSYPVGLS+TVLV LVGVVEG+VIALAMEW NPAAWSIHFDFQLLAILYAGI+ISGLSYY+QGVVME KG VFLTAFFPLST
Subjt:  TVSCLCSSVSCILQAIVLKSYPVGLSVTVLVCLVGVVEGTVIALAMEWKNPAAWSIHFDFQLLAILYAGIIISGLSYYIQGVVMEEKGPVFLTAFFPLST

Query:  VIVAIISSFAVSEILSLGKVVGAMIIIIGLYLVLWGKAKDPAIDKAA
        VIVAIISSFAVSEILSLGK+VGA+III+GLYL+LW K KD A+DKAA
Subjt:  VIVAIISSFAVSEILSLGKVVGAMIIIIGLYLVLWGKAKDPAIDKAA

XP_008463485.1 PREDICTED: WAT1-related protein At2g39510-like isoform X1 [Cucumis melo]5.3e-11267.12Show/hide
Query:  MKKLSGAL----KPYLAAILLRFLASGLIVIAKIALNQGMSPQKKKTRDDLVHLCKDSTAGLYGVRIFSFLFNK-----------FGSAL----SSSVVI
        MKKL+  L    K YL  I++R  ASGLIVIAK+ALN GMSPQ       +  L +   A +  V  FSFL  +           F   L      SVVI
Subjt:  MKKLSGAL----KPYLAAILLRFLASGLIVIAKIALNQGMSPQKKKTRDDLVHLCKDSTAGLYGVRIFSFLFNK-----------FGSAL----SSSVVI

Query:  PNTYFTGLKYVTPTFSIAMSNAVPALSFFFAWIFRMEKVDIRRISSQAKILGTAVTVGGAMVMTFVRGTNLRFPWTKGDSHNH-----SSNAASNQDSFK
         NTYFTGLKYVTPTFSIAMSNAVPALSFFFA +FRMEK+D+ R SSQAKILGTAV+VGGAM+MTFV G  LRFPWT G ++ H     SS   +N+DSFK
Subjt:  PNTYFTGLKYVTPTFSIAMSNAVPALSFFFAWIFRMEKVDIRRISSQAKILGTAVTVGGAMVMTFVRGTNLRFPWTKGDSHNH-----SSNAASNQDSFK

Query:  GAILVTVSCLCSSVSCILQAIVLKSYPVGLSVTVLVCLVGVVEGTVIALAMEWKNPAAWSIHFDFQLLAILYAGIIISGLSYYIQGVVMEEKGPVFLTAF
        G ILVT+SCLC+SVSCILQAIVLKSYP+GL VT +VC+VGVVEG V+ALAMEW NP+ WSIHFDFQLLAILYAGIIISG SYYIQGVVMEEKGPVFLTAF
Subjt:  GAILVTVSCLCSSVSCILQAIVLKSYPVGLSVTVLVCLVGVVEGTVIALAMEWKNPAAWSIHFDFQLLAILYAGIIISGLSYYIQGVVMEEKGPVFLTAF

Query:  FPLSTVIVAIISSFAVSEILSLGKVVGAMIIIIGLYLVLWGKAKDPAIDKAAKV--DATTRREGS
        FPL+TV+VAIISSFA+SEILSLGKVVGAM+II GLYL LWGK KD  ID A +   DA T  E S
Subjt:  FPLSTVIVAIISSFAVSEILSLGKVVGAMIIIIGLYLVLWGKAKDPAIDKAAKV--DATTRREGS

XP_022961609.1 WAT1-related protein At2g39510-like [Cucurbita moschata]2.6e-11971.83Show/hide
Query:  MKKLSGAL---KPYLAAILLRFLASGLIVIAKIALNQGMSPQKKKTRDDLVHLCKDSTAGLYGVRIFSFLFNK----------FGSAL----SSSVVIPN
        MK+LSG L   KPYLAAILLRF+A+GL+VIAKIALN+GMSPQ       +  L +   A +  V  F+ LF++          F   L      SVVIPN
Subjt:  MKKLSGAL---KPYLAAILLRFLASGLIVIAKIALNQGMSPQKKKTRDDLVHLCKDSTAGLYGVRIFSFLFNK----------FGSAL----SSSVVIPN

Query:  TYFTGLKYVTPTFSIAMSNAVPALSFFFAWIFRMEKVDIRRISSQAKILGTAVTVGGAMVMTFVRGTNLRFPWTKG-DSHNHSSNAASNQD-SFKGAILV
        TYFTGLK+V+PTFSIAMSNAVPALSFFFAWIFRMEKVDIRR+SS AKI+GTAVTVGGAM M FVR   LRF WTKG DS +HSS AA+NQD  FKG ILV
Subjt:  TYFTGLKYVTPTFSIAMSNAVPALSFFFAWIFRMEKVDIRRISSQAKILGTAVTVGGAMVMTFVRGTNLRFPWTKG-DSHNHSSNAASNQD-SFKGAILV

Query:  TVSCLCSSVSCILQAIVLKSYPVGLSVTVLVCLVGVVEGTVIALAMEWKNPAAWSIHFDFQLLAILYAGIIISGLSYYIQGVVMEEKGPVFLTAFFPLST
        T+SC+ SS+SC+LQAIVLKSYPVGLS+TVLV LVGVVEG+VIALAMEW NPAAWSIHFDFQLLAILYAGI+ISGLSYY+QGVVME KG VFLTAFFPLST
Subjt:  TVSCLCSSVSCILQAIVLKSYPVGLSVTVLVCLVGVVEGTVIALAMEWKNPAAWSIHFDFQLLAILYAGIIISGLSYYIQGVVMEEKGPVFLTAFFPLST

Query:  VIVAIISSFAVSEILSLGKVVGAMIIIIGLYLVLWGKAKDPAIDKAAKVDATTRR
        VIVAIISSFAVSEILSLGK+VGA+III+GLYL+LW K KD A+DKAA   +   R
Subjt:  VIVAIISSFAVSEILSLGKVVGAMIIIIGLYLVLWGKAKDPAIDKAAKVDATTRR

XP_023534785.1 WAT1-related protein At2g39510-like [Cucurbita pepo subsp. pepo]2.6e-11973.78Show/hide
Query:  MKKLSGAL---KPYLAAILLRFLASGLIVIAKIALNQGMSPQKKKTRDDLVHLCKDSTAGLYGVRIFSFLFNK----------FGSAL----SSSVVIPN
        MK+LSGAL   KPYLAAILLRF+A+GLIVIAKIALN+GMSPQ       +  L +   A +  V  F+ LF++          FG  L      SVVIPN
Subjt:  MKKLSGAL---KPYLAAILLRFLASGLIVIAKIALNQGMSPQKKKTRDDLVHLCKDSTAGLYGVRIFSFLFNK----------FGSAL----SSSVVIPN

Query:  TYFTGLKYVTPTFSIAMSNAVPALSFFFAWIFRMEKVDIRRISSQAKILGTAVTVGGAMVMTFVRGTNLRFPWTKG-DSHNHSSNAASNQD-SFKGAILV
        TYFTGLK+V+PTFSIAMSNAVPALSFFFAWIFRMEKVDIRR+SS AKILGTAVTVGGAM M FVR   L F WTKG DS +HSS  A+NQD  FKG ILV
Subjt:  TYFTGLKYVTPTFSIAMSNAVPALSFFFAWIFRMEKVDIRRISSQAKILGTAVTVGGAMVMTFVRGTNLRFPWTKG-DSHNHSSNAASNQD-SFKGAILV

Query:  TVSCLCSSVSCILQAIVLKSYPVGLSVTVLVCLVGVVEGTVIALAMEWKNPAAWSIHFDFQLLAILYAGIIISGLSYYIQGVVMEEKGPVFLTAFFPLST
        T+SC+ SS+SC+LQAIVLKSYPVGLS+TVLV LVGVVEG+VIALAMEW NPAAWSIHFDFQLLAILYAGI+ISGLSYY+QGVVME KG VFLTAFFPLST
Subjt:  TVSCLCSSVSCILQAIVLKSYPVGLSVTVLVCLVGVVEGTVIALAMEWKNPAAWSIHFDFQLLAILYAGIIISGLSYYIQGVVMEEKGPVFLTAFFPLST

Query:  VIVAIISSFAVSEILSLGKVVGAMIIIIGLYLVLWGKAKDPAIDKAA
        VIVAIISSFAVSEILSLGK+VGA+III+GLYL+LW K KD A+DKAA
Subjt:  VIVAIISSFAVSEILSLGKVVGAMIIIIGLYLVLWGKAKDPAIDKAA

XP_031741156.1 WAT1-related protein At2g39510 isoform X1 [Cucumis sativus]8.8e-11568.22Show/hide
Query:  MKKLSGAL----KPYLAAILLRFLASGLIVIAKIALNQGMSPQKKKTRDDLVHLCKDSTAGLYGVRIFSFLFNK-----------FGSAL----SSSVVI
        MKKLS  L    K YL  I++R  ASGLIVIAKIALN GMSPQ       +  L +   A +  V  F FL  +           F   +      SVV+
Subjt:  MKKLSGAL----KPYLAAILLRFLASGLIVIAKIALNQGMSPQKKKTRDDLVHLCKDSTAGLYGVRIFSFLFNK-----------FGSAL----SSSVVI

Query:  PNTYFTGLKYVTPTFSIAMSNAVPALSFFFAWIFRMEKVDIRRISSQAKILGTAVTVGGAMVMTFVRGTNLRFPWTKG--DSHNH---SSNAASNQDSFK
         NTYFTGLKYVTPTFSIAMSNAVPALSFFFAW+FRMEKVD+RR+SS AKILGTAV VGGAM+MTFV G  LRFPWT    +SHN+   SS   +N+DSFK
Subjt:  PNTYFTGLKYVTPTFSIAMSNAVPALSFFFAWIFRMEKVDIRRISSQAKILGTAVTVGGAMVMTFVRGTNLRFPWTKG--DSHNH---SSNAASNQDSFK

Query:  GAILVTVSCLCSSVSCILQAIVLKSYPVGLSVTVLVCLVGVVEGTVIALAMEWKNPAAWSIHFDFQLLAILYAGIIISGLSYYIQGVVMEEKGPVFLTAF
        G +LVT+SCLC+SVSCILQAIVLKSYP+GL VT LVC+VGVVEGTVIA+ MEW NP+ WSIHFDFQLLAILYAGIIISG SYYIQGVVMEEKGPVF TAF
Subjt:  GAILVTVSCLCSSVSCILQAIVLKSYPVGLSVTVLVCLVGVVEGTVIALAMEWKNPAAWSIHFDFQLLAILYAGIIISGLSYYIQGVVMEEKGPVFLTAF

Query:  FPLSTVIVAIISSFAVSEILSLGKVVGAMIIIIGLYLVLWGKAKDPAIDKAAKV--DATTRREGS
        FPLST++VAIISSFA+SEILS GKVVGA++IIIGLYL LWGK KD A+DKAA+   DATT RE S
Subjt:  FPLSTVIVAIISSFAVSEILSLGKVVGAMIIIIGLYLVLWGKAKDPAIDKAAKV--DATTRREGS

TrEMBL top hitse value%identityAlignment
A0A0A0KNS0 WAT1-related protein4.7e-10664.74Show/hide
Query:  MKKLSGAL----KPYLAAILLRFLASGLIVIAKIALNQGMSPQKKKTRDDLVHLCKDSTAGLYGVRIFSFLFNKFGS---------------ALSSSVVI
        MKKLS       K YL  I +R   SGLIVIAKIALN GMSPQ       +  L +   A +  V  F FL  + G                    SVVI
Subjt:  MKKLSGAL----KPYLAAILLRFLASGLIVIAKIALNQGMSPQKKKTRDDLVHLCKDSTAGLYGVRIFSFLFNKFGS---------------ALSSSVVI

Query:  PNTYFTGLKYVTPTFSIAMSNAVPALSFFFAWIFRMEKVDIRRISSQAKILGTAVTVGGAMVMTFVRGTNLRFPWTK--GDSHNHSSNAAS---NQDSFK
         NTYFTGLKYVTPTFS AMSN +PALSFFFAWIF MEKVDIRR SSQ KI+GTAVTVGGAM+MTFV G   RFPWT      HNHSS   S   NQDSFK
Subjt:  PNTYFTGLKYVTPTFSIAMSNAVPALSFFFAWIFRMEKVDIRRISSQAKILGTAVTVGGAMVMTFVRGTNLRFPWTK--GDSHNHSSNAAS---NQDSFK

Query:  GAILVTVSCLCSSVSCILQAIVLKSYPVGLSVTVLVCLVGVVEGTVIALAMEWKNPAAWSIHFDFQLLAILYAGIIISGLSYYIQGVVMEEKGPVFLTAF
        G ILVT++ L +SVSCI+QAIVLKSYP+GL VT +VC+VGVVEGTV+ALA EW NP  WSIHFDFQLLA LYAGI++SG SY+IQGVV+E KGPVFLT F
Subjt:  GAILVTVSCLCSSVSCILQAIVLKSYPVGLSVTVLVCLVGVVEGTVIALAMEWKNPAAWSIHFDFQLLAILYAGIIISGLSYYIQGVVMEEKGPVFLTAF

Query:  FPLSTVIVAIISSFAVSEILSLGKVVGAMIIIIGLYLVLWGKAKDPAID-KAAK-VDATTRRE
        FPLST+IVAIISSFA+SE+LSLGKV+GA++IIIGLYLVLWGK KD AI+ KAA+ +D  T RE
Subjt:  FPLSTVIVAIISSFAVSEILSLGKVVGAMIIIIGLYLVLWGKAKDPAID-KAAK-VDATTRRE

A0A1S3CJT7 WAT1-related protein2.6e-11267.12Show/hide
Query:  MKKLSGAL----KPYLAAILLRFLASGLIVIAKIALNQGMSPQKKKTRDDLVHLCKDSTAGLYGVRIFSFLFNK-----------FGSAL----SSSVVI
        MKKL+  L    K YL  I++R  ASGLIVIAK+ALN GMSPQ       +  L +   A +  V  FSFL  +           F   L      SVVI
Subjt:  MKKLSGAL----KPYLAAILLRFLASGLIVIAKIALNQGMSPQKKKTRDDLVHLCKDSTAGLYGVRIFSFLFNK-----------FGSAL----SSSVVI

Query:  PNTYFTGLKYVTPTFSIAMSNAVPALSFFFAWIFRMEKVDIRRISSQAKILGTAVTVGGAMVMTFVRGTNLRFPWTKGDSHNH-----SSNAASNQDSFK
         NTYFTGLKYVTPTFSIAMSNAVPALSFFFA +FRMEK+D+ R SSQAKILGTAV+VGGAM+MTFV G  LRFPWT G ++ H     SS   +N+DSFK
Subjt:  PNTYFTGLKYVTPTFSIAMSNAVPALSFFFAWIFRMEKVDIRRISSQAKILGTAVTVGGAMVMTFVRGTNLRFPWTKGDSHNH-----SSNAASNQDSFK

Query:  GAILVTVSCLCSSVSCILQAIVLKSYPVGLSVTVLVCLVGVVEGTVIALAMEWKNPAAWSIHFDFQLLAILYAGIIISGLSYYIQGVVMEEKGPVFLTAF
        G ILVT+SCLC+SVSCILQAIVLKSYP+GL VT +VC+VGVVEG V+ALAMEW NP+ WSIHFDFQLLAILYAGIIISG SYYIQGVVMEEKGPVFLTAF
Subjt:  GAILVTVSCLCSSVSCILQAIVLKSYPVGLSVTVLVCLVGVVEGTVIALAMEWKNPAAWSIHFDFQLLAILYAGIIISGLSYYIQGVVMEEKGPVFLTAF

Query:  FPLSTVIVAIISSFAVSEILSLGKVVGAMIIIIGLYLVLWGKAKDPAIDKAAKV--DATTRREGS
        FPL+TV+VAIISSFA+SEILSLGKVVGAM+II GLYL LWGK KD  ID A +   DA T  E S
Subjt:  FPLSTVIVAIISSFAVSEILSLGKVVGAMIIIIGLYLVLWGKAKDPAIDKAAKV--DATTRREGS

A0A6J1BWE1 WAT1-related protein4.0e-10565.48Show/hide
Query:  KPYLAAILLRFLASGLIVIAKIALNQGMSPQKKKTRDDLVHLCKDSTAGLYGVRIFSFLFNK----------FGSAL----SSSVVIPNTYFTGLKYVTP
        KPYLAAILLRFL++ L+V AKIALN GMSP         V+           V  F+  F++          F   L      S+VIPN YF+ LKYVTP
Subjt:  KPYLAAILLRFLASGLIVIAKIALNQGMSPQKKKTRDDLVHLCKDSTAGLYGVRIFSFLFNK----------FGSAL----SSSVVIPNTYFTGLKYVTP

Query:  TFSIAMSNAVPALSFFFAWIFRMEKVDIRRISSQAKILGTAVTVGGAMVMTFVRGTNLRFPWTKGDS-HNHSSN---AASNQDSFKGAILVTVSCLCSSV
        TFSIAMSN VPA SFFFA IFRMEKVDIRR+SSQAKILGT V VGGAM+MTF  G NLRFPWTK +  HNHS++   + ++QDS KG IL T++C+CSSV
Subjt:  TFSIAMSNAVPALSFFFAWIFRMEKVDIRRISSQAKILGTAVTVGGAMVMTFVRGTNLRFPWTKGDS-HNHSSN---AASNQDSFKGAILVTVSCLCSSV

Query:  SCILQAIVLKSYPVGLSVTVLVCLVGVVEGTVIALAMEWKNPAAWSIHFDFQLLAILYAGIIISGLSYYIQGVVMEEKGPVFLTAFFPLSTVIVAIISSF
        SCILQA+VLKSYPVGLS+TVLV LVGVVEG ++ALAMEW+N  AWS+HFDFQLL ILYAGII+SGLSYY++G V++EKGPVF TAF PLSTVIVAIISSF
Subjt:  SCILQAIVLKSYPVGLSVTVLVCLVGVVEGTVIALAMEWKNPAAWSIHFDFQLLAILYAGIIISGLSYYIQGVVMEEKGPVFLTAFFPLSTVIVAIISSF

Query:  AVSEILSLGKVVGAMIIIIGLYLVLWGKAKDPAIDK
        A+SE+ SLGK++G ++IIIGLYLV+WGK KDPAID+
Subjt:  AVSEILSLGKVVGAMIIIIGLYLVLWGKAKDPAIDK

A0A6J1BZJ0 WAT1-related protein1.5e-10768.09Show/hide
Query:  KLSGAL---KPYLAAILLRFLASGLIVIAKIALNQGMSPQKKKTRDDLVHLCKDSTAGLYGVRIFSFLFNK----------FGSAL----SSSVVIPNTY
        KLS  L   KPYLAAILLRFL + L+VIAKIALN GMSP         V+           V  F+ L ++          F   L       VV+PN Y
Subjt:  KLSGAL---KPYLAAILLRFLASGLIVIAKIALNQGMSPQKKKTRDDLVHLCKDSTAGLYGVRIFSFLFNK----------FGSAL----SSSVVIPNTY

Query:  FTGLKYVTPTFSIAMSNAVPALSFFFAWIFRMEKVDIRRISSQAKILGTAVTVGGAMVMTFVRGTNLRFPWTK-GDSHN--HSSNAAS----NQDSFKGA
        F+GLKYVTPTFSIAMSNA PALSFFFA   RMEKVDIRR SSQ KILGT V VGGAM+MTF RG NLRFPWTK  DSHN  HS+N+ S    +QDSFKG 
Subjt:  FTGLKYVTPTFSIAMSNAVPALSFFFAWIFRMEKVDIRRISSQAKILGTAVTVGGAMVMTFVRGTNLRFPWTK-GDSHN--HSSNAAS----NQDSFKGA

Query:  ILVTVSCLCSSVSCILQAIVLKSYPVGLSVTVLVCLVGVVEGTVIALAMEWKNPAAWSIHFDFQLLAILYAGIIISGLSYYIQGVVMEEKGPVFLTAFFP
        ILVT+SCLCSS+SCILQA+VLKSYPVGLSVTVLV LVGVV G V ALAMEW N  AW+IHFDFQLLAILYAG IISGLSYY+QGVVMEEKGPVFLTAF P
Subjt:  ILVTVSCLCSSVSCILQAIVLKSYPVGLSVTVLVCLVGVVEGTVIALAMEWKNPAAWSIHFDFQLLAILYAGIIISGLSYYIQGVVMEEKGPVFLTAFFP

Query:  LSTVIVAIISSFAVSEILSLGKVVGAMIIIIGLYLVLWGKAKDPAI-DKAA
        L TV+VAIISSFAVSEILSL KV+G ++I++GLYLV+WGK KD AI DKAA
Subjt:  LSTVIVAIISSFAVSEILSLGKVVGAMIIIIGLYLVLWGKAKDPAI-DKAA

A0A6J1HEK0 WAT1-related protein1.3e-11971.83Show/hide
Query:  MKKLSGAL---KPYLAAILLRFLASGLIVIAKIALNQGMSPQKKKTRDDLVHLCKDSTAGLYGVRIFSFLFNK----------FGSAL----SSSVVIPN
        MK+LSG L   KPYLAAILLRF+A+GL+VIAKIALN+GMSPQ       +  L +   A +  V  F+ LF++          F   L      SVVIPN
Subjt:  MKKLSGAL---KPYLAAILLRFLASGLIVIAKIALNQGMSPQKKKTRDDLVHLCKDSTAGLYGVRIFSFLFNK----------FGSAL----SSSVVIPN

Query:  TYFTGLKYVTPTFSIAMSNAVPALSFFFAWIFRMEKVDIRRISSQAKILGTAVTVGGAMVMTFVRGTNLRFPWTKG-DSHNHSSNAASNQD-SFKGAILV
        TYFTGLK+V+PTFSIAMSNAVPALSFFFAWIFRMEKVDIRR+SS AKI+GTAVTVGGAM M FVR   LRF WTKG DS +HSS AA+NQD  FKG ILV
Subjt:  TYFTGLKYVTPTFSIAMSNAVPALSFFFAWIFRMEKVDIRRISSQAKILGTAVTVGGAMVMTFVRGTNLRFPWTKG-DSHNHSSNAASNQD-SFKGAILV

Query:  TVSCLCSSVSCILQAIVLKSYPVGLSVTVLVCLVGVVEGTVIALAMEWKNPAAWSIHFDFQLLAILYAGIIISGLSYYIQGVVMEEKGPVFLTAFFPLST
        T+SC+ SS+SC+LQAIVLKSYPVGLS+TVLV LVGVVEG+VIALAMEW NPAAWSIHFDFQLLAILYAGI+ISGLSYY+QGVVME KG VFLTAFFPLST
Subjt:  TVSCLCSSVSCILQAIVLKSYPVGLSVTVLVCLVGVVEGTVIALAMEWKNPAAWSIHFDFQLLAILYAGIIISGLSYYIQGVVMEEKGPVFLTAFFPLST

Query:  VIVAIISSFAVSEILSLGKVVGAMIIIIGLYLVLWGKAKDPAIDKAAKVDATTRR
        VIVAIISSFAVSEILSLGK+VGA+III+GLYL+LW K KD A+DKAA   +   R
Subjt:  VIVAIISSFAVSEILSLGKVVGAMIIIIGLYLVLWGKAKDPAIDKAAKVDATTRR

SwissProt top hitse value%identityAlignment
F4IQX1 WAT1-related protein At2g374507.6e-6141.44Show/hide
Query:  PYLAAILLRFLASGLIVIAKIALNQGMSPQKKKTRDDLVHLCKDSTAGLYGVRIFSFLFNKFGSALSSSVVIPNTYFTGLKYVTPTFSIAMSNAVPALSF
        P++  +LL+   +G+ ++ K  LN+GMS                    +Y   + + +   F     + V+  N +  G+KY T TF+IA+ N +PA++F
Subjt:  PYLAAILLRFLASGLIVIAKIALNQGMSPQKKKTRDDLVHLCKDSTAGLYGVRIFSFLFNKFGSALSSSVVIPNTYFTGLKYVTPTFSIAMSNAVPALSF

Query:  FFAWIFRMEKVDIRRISSQAKILGTAVTVGGAMVMTFVRGTNLRFPWTKGDSHNHSSNAASNQDSFKGAILVTVSCLCSSVSCILQAIVLKSYPVGLSVT
          A IFR+E V  + I S AK++GT  TVGG MVMT V+G  L   WTKG S  ++        S KGA+LVT+ C   +   ILQAI LK+YP  LS+ 
Subjt:  FFAWIFRMEKVDIRRISSQAKILGTAVTVGGAMVMTFVRGTNLRFPWTKGDSHNHSSNAASNQDSFKGAILVTVSCLCSSVSCILQAIVLKSYPVGLSVT

Query:  VLVCLVGVVEGTVIALAMEWKNPAAWSIHFDFQLLAILYAGIIISGLSYYIQGVVMEEKGPVFLTAFFPLSTVIVAIISSFAVSEILSLGKVVGAMIIII
          +CL+G +EG V+AL ME  NP+ W+I +D +LL I Y+GI+ S L YYI GVVM+ +GPVF+TAF PL  ++VAI+SS    E + LG+ +GA +I +
Subjt:  VLVCLVGVVEGTVIALAMEWKNPAAWSIHFDFQLLAILYAGIIISGLSYYIQGVVMEEKGPVFLTAFFPLSTVIVAIISSFAVSEILSLGKVVGAMIIII

Query:  GLYLVLWGKAKDPAIDKAAKVD-----ATTRRE
        GLYLV+WGKAKD       ++D     ATT ++
Subjt:  GLYLVLWGKAKDPAIDKAAKVD-----ATTRRE

O80638 WAT1-related protein At2g395109.2e-7546.36Show/hide
Query:  KPYLAAILLRFLASGLIVIAKIALNQGMSPQKKKTRDDLVHLCKDSTAGLYGVR------IFSFLFNKFGSALSSSVVIPNTYFTGLKYVTPTFSIAMSN
        KP++  + L+F  +GL +IAK ALNQGMSP    +   +V     +    +  R        S  F      L    +  N Y+TG+KY + TF+ AM+N
Subjt:  KPYLAAILLRFLASGLIVIAKIALNQGMSPQKKKTRDDLVHLCKDSTAGLYGVR------IFSFLFNKFGSALSSSVVIPNTYFTGLKYVTPTFSIAMSN

Query:  AVPALSFFFAWIFRMEKVDIRRISSQAKILGTAVTVGGAMVMTFVRGTNLRFPWTK-GDSHNHSSNAASNQDSFKGAILVTVSCLCSSVSCILQAIVLKS
         +PA +F  AWIFR+EKV++++I SQAKILGT VTVGGAM+MT V+G  +  PW    D H  SSN    QD  KGA L+ + C+C +    LQAI LKS
Subjt:  AVPALSFFFAWIFRMEKVDIRRISSQAKILGTAVTVGGAMVMTFVRGTNLRFPWTK-GDSHNHSSNAASNQDSFKGAILVTVSCLCSSVSCILQAIVLKS

Query:  YPVGLSVTVLVCLVGVVEGTVIALAMEWKNPAAWSIHFDFQLLAILYAGIIISGLSYYIQGVVMEEKGPVFLTAFFPLSTVIVAIISSFAVSEILSLGKV
        YPV LS+T  +C +G +E T++AL +E  NP+AW+IH D +LLA +Y G+I SG+ YY+QGV+M+ +GPVF+TAF PLS VIVAI+ S  ++E++ LG++
Subjt:  YPVGLSVTVLVCLVGVVEGTVIALAMEWKNPAAWSIHFDFQLLAILYAGIIISGLSYYIQGVVMEEKGPVFLTAFFPLSTVIVAIISSFAVSEILSLGKV

Query:  VGAMIIIIGLYLVLWGKAKDPAIDKAAKVD
        +GA++I++GLY VLWGK+KD      + +D
Subjt:  VGAMIIIIGLYLVLWGKAKDPAIDKAAKVD

Q9FL41 WAT1-related protein At5g070501.9e-5941.57Show/hide
Query:  KPYLAAILLRFLASGLIVIAKIALNQGMSPQKKKTRDDLVHLCKDSTAGLYGVR------IFSFLFNKFGSALSSSVVIPNTYFTGLKYVTPTFSIAMSN
        KPY A I L+F  +G+ +I KI+LN GMS          +     +    +  R       FS     F   L   V+  N Y+ GLKY +PTFS AMSN
Subjt:  KPYLAAILLRFLASGLIVIAKIALNQGMSPQKKKTRDDLVHLCKDSTAGLYGVR------IFSFLFNKFGSALSSSVVIPNTYFTGLKYVTPTFSIAMSN

Query:  AVPALSFFFAWIFRMEKVDIRRISSQAKILGTAVTVGGAMVMTFVRGTNLRFPWTK-----GDSH---NHSSNAASNQDSFKGAILVTVSCLCSSVSCIL
         +PA++F  A +FRME +D++++  QAKI GT VTV GAM+MT  +G  +   WTK       SH     S N++S+++  KG+IL+  + L  +   +L
Subjt:  AVPALSFFFAWIFRMEKVDIRRISSQAKILGTAVTVGGAMVMTFVRGTNLRFPWTK-----GDSH---NHSSNAASNQDSFKGAILVTVSCLCSSVSCIL

Query:  QAIVLKSYPV-GLSVTVLVCLVGVVEGTVIALAMEWKNPAAWSIHFDFQLLAILYAGIIISGLSYYIQGVVMEEKGPVFLTAFFPLSTVIVAIISSFAVS
        QA +LK+Y    LS+T L+C +G ++   +   ME  NP+AW I +D  LLA  Y+GI+ S +SYY+QG+VM+++GPVF TAF PL  VIVA++ SF ++
Subjt:  QAIVLKSYPV-GLSVTVLVCLVGVVEGTVIALAMEWKNPAAWSIHFDFQLLAILYAGIIISGLSYYIQGVVMEEKGPVFLTAFFPLSTVIVAIISSFAVS

Query:  EILSLGKVVGAMIIIIGLYLVLWGKAKD--PAIDKAAKVDATTR
        E + LG V+GA++I+IGLY VLWGK K+    I + AK+D+ ++
Subjt:  EILSLGKVVGAMIIIIGLYLVLWGKAKD--PAIDKAAKVDATTR

Q9FNA5 WAT1-related protein At5g136702.7e-5840.87Show/hide
Query:  KPYLAAILLRFLASGLIVIAKIALNQGMSPQKKKTRDDLVHLCKDSTAGLYGVR------IFSFLFNKFGSALSSSVVIPNTYFTGLKYVTPTFSIAMSN
        +P++A + ++ L + + ++AK+ALN+GMSP         V     +   L   R       F  L      +L   VV  N Y++G+K  T TF+ A+ N
Subjt:  KPYLAAILLRFLASGLIVIAKIALNQGMSPQKKKTRDDLVHLCKDSTAGLYGVR------IFSFLFNKFGSALSSSVVIPNTYFTGLKYVTPTFSIAMSN

Query:  AVPALSFFFAWIFRMEKVDIRRISSQAKILGTAVTVGGAMVMTFVRGTNLRFPWTKG----DSHNHSSNAASNQDSFKGAILVTVSCLCSSVSCILQAIV
        A+PA++F  A +F++EKV I R  SQAK++GT V +GGAM+MTFV+G  +  PWT      + H H+       D  +G+I++  SC   S   ILQA +
Subjt:  AVPALSFFFAWIFRMEKVDIRRISSQAKILGTAVTVGGAMVMTFVRGTNLRFPWTKG----DSHNHSSNAASNQDSFKGAILVTVSCLCSSVSCILQAIV

Query:  LKSYPVGLSVTVLVCLVGVVEGTVIALAMEWKNPAAWSIHFDFQLLAILYAGIIISGLSYYIQGVVMEEKGPVFLTAFFPLSTVIVAIISSFAVSEILSL
        L  Y   LS+T L+C++G++E TV+ L  E KN + W I+ D  LLA +Y G ++SGL+YY+ G   +E+GPVF++AF PLS V+VAI+S+F   E + +
Subjt:  LKSYPVGLSVTVLVCLVGVVEGTVIALAMEWKNPAAWSIHFDFQLLAILYAGIIISGLSYYIQGVVMEEKGPVFLTAFFPLSTVIVAIISSFAVSEILSL

Query:  GKVVGAMIIIIGLYLVLWGKAKD
        G+V+G+++I+IG+YLVLWGK+KD
Subjt:  GKVVGAMIIIIGLYLVLWGKAKD

Q9ZUS1 WAT1-related protein At2g374603.4e-6140.65Show/hide
Query:  KPYLAAILLRFLASGLIVIAKIALNQGMSPQKKKTRDDLVHLCKDSTAGLYGVRIFSFLFNKFGS--------------ALSSSVVIPNTYFTGLKYVTP
        +P+++ ++L+   +G+ +++K  LN+GMS        + V +          +  F+F F+K                  L   V+  N Y+ G+KY T 
Subjt:  KPYLAAILLRFLASGLIVIAKIALNQGMSPQKKKTRDDLVHLCKDSTAGLYGVRIFSFLFNKFGS--------------ALSSSVVIPNTYFTGLKYVTP

Query:  TFSIAMSNAVPALSFFFAWIFRMEKVDIRRISSQAKILGTAVTVGGAMVMTFVRGTNLRFPWTKGDSHNHSSNAASNQDSFKGAILVTVSCLCSSVSCIL
        TF+ AM N +PA++F  A+IF +E+V +R I S  K++GT  TVGGAM+MT V+G  L   WTKG S  H++       + KGA+LVT+ C   +   IL
Subjt:  TFSIAMSNAVPALSFFFAWIFRMEKVDIRRISSQAKILGTAVTVGGAMVMTFVRGTNLRFPWTKGDSHNHSSNAASNQDSFKGAILVTVSCLCSSVSCIL

Query:  QAIVLKSYPVGLSVTVLVCLVGVVEGTVIALAMEWKNPAAWSIHFDFQLLAILYAGIIISGLSYYIQGVVMEEKGPVFLTAFFPLSTVIVAIISSFAVSE
        QAI L++YP  LS+T  +CL+G +EGT +AL ME  NP+AW+I +D +LL   Y+GI+ S L+YY+ GVVM+ +GPVF+TAF PL  +IVAI+S+   +E
Subjt:  QAIVLKSYPVGLSVTVLVCLVGVVEGTVIALAMEWKNPAAWSIHFDFQLLAILYAGIIISGLSYYIQGVVMEEKGPVFLTAFFPLSTVIVAIISSFAVSE

Query:  ILSLGKVVGAMIIIIGLYLVLWGKAKDPAIDKAAKVD
         + LG+V+GA++I  GLYLV+WGK KD   +   ++D
Subjt:  ILSLGKVVGAMIIIIGLYLVLWGKAKDPAIDKAAKVD

Arabidopsis top hitse value%identityAlignment
AT2G37450.2 nodulin MtN21 /EamA-like transporter family protein5.4e-6241.44Show/hide
Query:  PYLAAILLRFLASGLIVIAKIALNQGMSPQKKKTRDDLVHLCKDSTAGLYGVRIFSFLFNKFGSALSSSVVIPNTYFTGLKYVTPTFSIAMSNAVPALSF
        P++  +LL+   +G+ ++ K  LN+GMS                    +Y   + + +   F     + V+  N +  G+KY T TF+IA+ N +PA++F
Subjt:  PYLAAILLRFLASGLIVIAKIALNQGMSPQKKKTRDDLVHLCKDSTAGLYGVRIFSFLFNKFGSALSSSVVIPNTYFTGLKYVTPTFSIAMSNAVPALSF

Query:  FFAWIFRMEKVDIRRISSQAKILGTAVTVGGAMVMTFVRGTNLRFPWTKGDSHNHSSNAASNQDSFKGAILVTVSCLCSSVSCILQAIVLKSYPVGLSVT
          A IFR+E V  + I S AK++GT  TVGG MVMT V+G  L   WTKG S  ++        S KGA+LVT+ C   +   ILQAI LK+YP  LS+ 
Subjt:  FFAWIFRMEKVDIRRISSQAKILGTAVTVGGAMVMTFVRGTNLRFPWTKGDSHNHSSNAASNQDSFKGAILVTVSCLCSSVSCILQAIVLKSYPVGLSVT

Query:  VLVCLVGVVEGTVIALAMEWKNPAAWSIHFDFQLLAILYAGIIISGLSYYIQGVVMEEKGPVFLTAFFPLSTVIVAIISSFAVSEILSLGKVVGAMIIII
          +CL+G +EG V+AL ME  NP+ W+I +D +LL I Y+GI+ S L YYI GVVM+ +GPVF+TAF PL  ++VAI+SS    E + LG+ +GA +I +
Subjt:  VLVCLVGVVEGTVIALAMEWKNPAAWSIHFDFQLLAILYAGIIISGLSYYIQGVVMEEKGPVFLTAFFPLSTVIVAIISSFAVSEILSLGKVVGAMIIII

Query:  GLYLVLWGKAKDPAIDKAAKVD-----ATTRRE
        GLYLV+WGKAKD       ++D     ATT ++
Subjt:  GLYLVLWGKAKDPAIDKAAKVD-----ATTRRE

AT2G37460.1 nodulin MtN21 /EamA-like transporter family protein2.4e-6240.65Show/hide
Query:  KPYLAAILLRFLASGLIVIAKIALNQGMSPQKKKTRDDLVHLCKDSTAGLYGVRIFSFLFNKFGS--------------ALSSSVVIPNTYFTGLKYVTP
        +P+++ ++L+   +G+ +++K  LN+GMS        + V +          +  F+F F+K                  L   V+  N Y+ G+KY T 
Subjt:  KPYLAAILLRFLASGLIVIAKIALNQGMSPQKKKTRDDLVHLCKDSTAGLYGVRIFSFLFNKFGS--------------ALSSSVVIPNTYFTGLKYVTP

Query:  TFSIAMSNAVPALSFFFAWIFRMEKVDIRRISSQAKILGTAVTVGGAMVMTFVRGTNLRFPWTKGDSHNHSSNAASNQDSFKGAILVTVSCLCSSVSCIL
        TF+ AM N +PA++F  A+IF +E+V +R I S  K++GT  TVGGAM+MT V+G  L   WTKG S  H++       + KGA+LVT+ C   +   IL
Subjt:  TFSIAMSNAVPALSFFFAWIFRMEKVDIRRISSQAKILGTAVTVGGAMVMTFVRGTNLRFPWTKGDSHNHSSNAASNQDSFKGAILVTVSCLCSSVSCIL

Query:  QAIVLKSYPVGLSVTVLVCLVGVVEGTVIALAMEWKNPAAWSIHFDFQLLAILYAGIIISGLSYYIQGVVMEEKGPVFLTAFFPLSTVIVAIISSFAVSE
        QAI L++YP  LS+T  +CL+G +EGT +AL ME  NP+AW+I +D +LL   Y+GI+ S L+YY+ GVVM+ +GPVF+TAF PL  +IVAI+S+   +E
Subjt:  QAIVLKSYPVGLSVTVLVCLVGVVEGTVIALAMEWKNPAAWSIHFDFQLLAILYAGIIISGLSYYIQGVVMEEKGPVFLTAFFPLSTVIVAIISSFAVSE

Query:  ILSLGKVVGAMIIIIGLYLVLWGKAKDPAIDKAAKVD
         + LG+V+GA++I  GLYLV+WGK KD   +   ++D
Subjt:  ILSLGKVVGAMIIIIGLYLVLWGKAKDPAIDKAAKVD

AT2G39510.1 nodulin MtN21 /EamA-like transporter family protein6.6e-7646.36Show/hide
Query:  KPYLAAILLRFLASGLIVIAKIALNQGMSPQKKKTRDDLVHLCKDSTAGLYGVR------IFSFLFNKFGSALSSSVVIPNTYFTGLKYVTPTFSIAMSN
        KP++  + L+F  +GL +IAK ALNQGMSP    +   +V     +    +  R        S  F      L    +  N Y+TG+KY + TF+ AM+N
Subjt:  KPYLAAILLRFLASGLIVIAKIALNQGMSPQKKKTRDDLVHLCKDSTAGLYGVR------IFSFLFNKFGSALSSSVVIPNTYFTGLKYVTPTFSIAMSN

Query:  AVPALSFFFAWIFRMEKVDIRRISSQAKILGTAVTVGGAMVMTFVRGTNLRFPWTK-GDSHNHSSNAASNQDSFKGAILVTVSCLCSSVSCILQAIVLKS
         +PA +F  AWIFR+EKV++++I SQAKILGT VTVGGAM+MT V+G  +  PW    D H  SSN    QD  KGA L+ + C+C +    LQAI LKS
Subjt:  AVPALSFFFAWIFRMEKVDIRRISSQAKILGTAVTVGGAMVMTFVRGTNLRFPWTK-GDSHNHSSNAASNQDSFKGAILVTVSCLCSSVSCILQAIVLKS

Query:  YPVGLSVTVLVCLVGVVEGTVIALAMEWKNPAAWSIHFDFQLLAILYAGIIISGLSYYIQGVVMEEKGPVFLTAFFPLSTVIVAIISSFAVSEILSLGKV
        YPV LS+T  +C +G +E T++AL +E  NP+AW+IH D +LLA +Y G+I SG+ YY+QGV+M+ +GPVF+TAF PLS VIVAI+ S  ++E++ LG++
Subjt:  YPVGLSVTVLVCLVGVVEGTVIALAMEWKNPAAWSIHFDFQLLAILYAGIIISGLSYYIQGVVMEEKGPVFLTAFFPLSTVIVAIISSFAVSEILSLGKV

Query:  VGAMIIIIGLYLVLWGKAKDPAIDKAAKVD
        +GA++I++GLY VLWGK+KD      + +D
Subjt:  VGAMIIIIGLYLVLWGKAKDPAIDKAAKVD

AT5G07050.1 nodulin MtN21 /EamA-like transporter family protein1.3e-6041.57Show/hide
Query:  KPYLAAILLRFLASGLIVIAKIALNQGMSPQKKKTRDDLVHLCKDSTAGLYGVR------IFSFLFNKFGSALSSSVVIPNTYFTGLKYVTPTFSIAMSN
        KPY A I L+F  +G+ +I KI+LN GMS          +     +    +  R       FS     F   L   V+  N Y+ GLKY +PTFS AMSN
Subjt:  KPYLAAILLRFLASGLIVIAKIALNQGMSPQKKKTRDDLVHLCKDSTAGLYGVR------IFSFLFNKFGSALSSSVVIPNTYFTGLKYVTPTFSIAMSN

Query:  AVPALSFFFAWIFRMEKVDIRRISSQAKILGTAVTVGGAMVMTFVRGTNLRFPWTK-----GDSH---NHSSNAASNQDSFKGAILVTVSCLCSSVSCIL
         +PA++F  A +FRME +D++++  QAKI GT VTV GAM+MT  +G  +   WTK       SH     S N++S+++  KG+IL+  + L  +   +L
Subjt:  AVPALSFFFAWIFRMEKVDIRRISSQAKILGTAVTVGGAMVMTFVRGTNLRFPWTK-----GDSH---NHSSNAASNQDSFKGAILVTVSCLCSSVSCIL

Query:  QAIVLKSYPV-GLSVTVLVCLVGVVEGTVIALAMEWKNPAAWSIHFDFQLLAILYAGIIISGLSYYIQGVVMEEKGPVFLTAFFPLSTVIVAIISSFAVS
        QA +LK+Y    LS+T L+C +G ++   +   ME  NP+AW I +D  LLA  Y+GI+ S +SYY+QG+VM+++GPVF TAF PL  VIVA++ SF ++
Subjt:  QAIVLKSYPV-GLSVTVLVCLVGVVEGTVIALAMEWKNPAAWSIHFDFQLLAILYAGIIISGLSYYIQGVVMEEKGPVFLTAFFPLSTVIVAIISSFAVS

Query:  EILSLGKVVGAMIIIIGLYLVLWGKAKD--PAIDKAAKVDATTR
        E + LG V+GA++I+IGLY VLWGK K+    I + AK+D+ ++
Subjt:  EILSLGKVVGAMIIIIGLYLVLWGKAKD--PAIDKAAKVDATTR

AT5G13670.1 nodulin MtN21 /EamA-like transporter family protein1.9e-5940.87Show/hide
Query:  KPYLAAILLRFLASGLIVIAKIALNQGMSPQKKKTRDDLVHLCKDSTAGLYGVR------IFSFLFNKFGSALSSSVVIPNTYFTGLKYVTPTFSIAMSN
        +P++A + ++ L + + ++AK+ALN+GMSP         V     +   L   R       F  L      +L   VV  N Y++G+K  T TF+ A+ N
Subjt:  KPYLAAILLRFLASGLIVIAKIALNQGMSPQKKKTRDDLVHLCKDSTAGLYGVR------IFSFLFNKFGSALSSSVVIPNTYFTGLKYVTPTFSIAMSN

Query:  AVPALSFFFAWIFRMEKVDIRRISSQAKILGTAVTVGGAMVMTFVRGTNLRFPWTKG----DSHNHSSNAASNQDSFKGAILVTVSCLCSSVSCILQAIV
        A+PA++F  A +F++EKV I R  SQAK++GT V +GGAM+MTFV+G  +  PWT      + H H+       D  +G+I++  SC   S   ILQA +
Subjt:  AVPALSFFFAWIFRMEKVDIRRISSQAKILGTAVTVGGAMVMTFVRGTNLRFPWTKG----DSHNHSSNAASNQDSFKGAILVTVSCLCSSVSCILQAIV

Query:  LKSYPVGLSVTVLVCLVGVVEGTVIALAMEWKNPAAWSIHFDFQLLAILYAGIIISGLSYYIQGVVMEEKGPVFLTAFFPLSTVIVAIISSFAVSEILSL
        L  Y   LS+T L+C++G++E TV+ L  E KN + W I+ D  LLA +Y G ++SGL+YY+ G   +E+GPVF++AF PLS V+VAI+S+F   E + +
Subjt:  LKSYPVGLSVTVLVCLVGVVEGTVIALAMEWKNPAAWSIHFDFQLLAILYAGIIISGLSYYIQGVVMEEKGPVFLTAFFPLSTVIVAIISSFAVSEILSL

Query:  GKVVGAMIIIIGLYLVLWGKAKD
        G+V+G+++I+IG+YLVLWGK+KD
Subjt:  GKVVGAMIIIIGLYLVLWGKAKD


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAGAAATTGAGTGGCGCTTTGAAGCCATATTTAGCAGCCATTCTTCTGAGATTTCTTGCTTCTGGTTTGATCGTAATAGCCAAGATTGCGTTGAACCAAGGCATGAG
CCCTCAGAAAAAGAAGACCAGAGATGACTTGGTGCATCTTTGCAAAGATTCTACTGCTGGGCTCTATGGAGTGAGAATTTTTTCTTTTCTGTTTAATAAGTTTGGTAGTG
CACTTTCAAGCTCTGTGGTTATCCCTAACACATATTTCACTGGTTTGAAATATGTAACTCCAACTTTCTCCATTGCTATGTCCAATGCTGTTCCTGCCCTTTCCTTCTTC
TTTGCTTGGATTTTCAGGATGGAGAAGGTTGATATTAGGAGAATTTCATCTCAGGCAAAGATATTGGGAACTGCAGTGACAGTGGGAGGAGCAATGGTAATGACCTTTGT
AAGAGGAACAAACTTGAGATTTCCTTGGACAAAAGGAGATTCCCATAATCACTCTTCAAATGCTGCAAGCAACCAAGATTCTTTCAAGGGTGCCATTTTGGTCACAGTTT
CTTGTCTTTGCTCTTCTGTTTCATGCATTCTACAGGCAATTGTGTTGAAATCATATCCTGTGGGGCTGAGTGTAACAGTTTTGGTGTGCTTGGTGGGAGTTGTGGAAGGC
ACTGTGATAGCTTTAGCAATGGAATGGAAGAACCCTGCTGCCTGGTCAATTCATTTTGATTTTCAACTCTTGGCCATCCTTTATGCTGGAATAATAATTTCAGGGCTTTC
GTACTACATTCAAGGAGTGGTAATGGAGGAAAAGGGACCTGTGTTTTTGACAGCATTTTTTCCTCTAAGCACTGTCATTGTTGCAATCATAAGCTCTTTTGCAGTCTCTG
AGATATTGTCCTTGGGAAAGGTTGTGGGAGCAATGATCATAATAATAGGTCTTTATTTGGTTCTGTGGGGCAAAGCCAAAGATCCCGCCATCGACAAGGCAGCTAAAGTA
GACGCGACGACAAGGAGGGAAGGTTCCGTTTGA
mRNA sequenceShow/hide mRNA sequence
ATGAAGAAATTGAGTGGCGCTTTGAAGCCATATTTAGCAGCCATTCTTCTGAGATTTCTTGCTTCTGGTTTGATCGTAATAGCCAAGATTGCGTTGAACCAAGGCATGAG
CCCTCAGAAAAAGAAGACCAGAGATGACTTGGTGCATCTTTGCAAAGATTCTACTGCTGGGCTCTATGGAGTGAGAATTTTTTCTTTTCTGTTTAATAAGTTTGGTAGTG
CACTTTCAAGCTCTGTGGTTATCCCTAACACATATTTCACTGGTTTGAAATATGTAACTCCAACTTTCTCCATTGCTATGTCCAATGCTGTTCCTGCCCTTTCCTTCTTC
TTTGCTTGGATTTTCAGGATGGAGAAGGTTGATATTAGGAGAATTTCATCTCAGGCAAAGATATTGGGAACTGCAGTGACAGTGGGAGGAGCAATGGTAATGACCTTTGT
AAGAGGAACAAACTTGAGATTTCCTTGGACAAAAGGAGATTCCCATAATCACTCTTCAAATGCTGCAAGCAACCAAGATTCTTTCAAGGGTGCCATTTTGGTCACAGTTT
CTTGTCTTTGCTCTTCTGTTTCATGCATTCTACAGGCAATTGTGTTGAAATCATATCCTGTGGGGCTGAGTGTAACAGTTTTGGTGTGCTTGGTGGGAGTTGTGGAAGGC
ACTGTGATAGCTTTAGCAATGGAATGGAAGAACCCTGCTGCCTGGTCAATTCATTTTGATTTTCAACTCTTGGCCATCCTTTATGCTGGAATAATAATTTCAGGGCTTTC
GTACTACATTCAAGGAGTGGTAATGGAGGAAAAGGGACCTGTGTTTTTGACAGCATTTTTTCCTCTAAGCACTGTCATTGTTGCAATCATAAGCTCTTTTGCAGTCTCTG
AGATATTGTCCTTGGGAAAGGTTGTGGGAGCAATGATCATAATAATAGGTCTTTATTTGGTTCTGTGGGGCAAAGCCAAAGATCCCGCCATCGACAAGGCAGCTAAAGTA
GACGCGACGACAAGGAGGGAAGGTTCCGTTTGA
Protein sequenceShow/hide protein sequence
MKKLSGALKPYLAAILLRFLASGLIVIAKIALNQGMSPQKKKTRDDLVHLCKDSTAGLYGVRIFSFLFNKFGSALSSSVVIPNTYFTGLKYVTPTFSIAMSNAVPALSFF
FAWIFRMEKVDIRRISSQAKILGTAVTVGGAMVMTFVRGTNLRFPWTKGDSHNHSSNAASNQDSFKGAILVTVSCLCSSVSCILQAIVLKSYPVGLSVTVLVCLVGVVEG
TVIALAMEWKNPAAWSIHFDFQLLAILYAGIIISGLSYYIQGVVMEEKGPVFLTAFFPLSTVIVAIISSFAVSEILSLGKVVGAMIIIIGLYLVLWGKAKDPAIDKAAKV
DATTRREGSV