CuGenDBv2

Tan0020451 (gene) of Snake gourd v1 genome

Gene ID: Tan0020451
Organism: Trichosanthes anguina (Snake gourd v1)
Description: WAT1-related protein
Genome location: LG01:540259..542817
RNA-Seq Expression: Tan0020451
Synteny: Tan0020451
Gene Ontology terms:
  GO:0055085 - transmembrane transport (biological process)
  GO:0016021 - integral component of membrane (cellular component)
  GO:0022857 - transmembrane transporter activity (molecular function)
InterPro domains:
  IPR000620 - EamA domain
  IPR030184 - WAT1-related protein


Homology
GenBank top hits (columns: e value, % identity, alignment)
KAG6576614.1 WAT1-related protein, partial [Cucurbita argyrosperma subsp. sororia] | e value: 9.3e-147 | % identity: 75.66
Query:  GVNKVEKLFEASRPVLAMLLVQIFASGMQLLSRVILNHGTFIFALMSYRHLVAALCVAPFAFFFEIR--------------------ITAAMGLYYYGLR
        GVN+++KL +ASRP+LAML VQIFA+GMQLLS+VILNHGTF+FALM+YRHLVAALCVAPFAFFFE R                    ITAAMGLYYYGLR
Subjt:  GVNKVEKLFEASRPVLAMLLVQIFASGMQLLSRVILNHGTFIFALMSYRHLVAALCVAPFAFFFEIR--------------------ITAAMGLYYYGLR

Query:  DTTATYATNFLNLIPVVTFVISSILRMEKVSLNRRAGKVKIMGAILCVGGALIMSLYRGKGFHIGHD-HHVAQKQNHDVAVDEA-HWGRGTLLLLGSCFS
        DTTATYATNFLNLIPVVTFVISS+LR+EKVSL RRAG++ ++GAILCVGG +I S+YRGKGFHIGH   HV    N++ + +   HWGRGTLLLLGSCFS
Subjt:  DTTATYATNFLNLIPVVTFVISSILRMEKVSLNRRAGKVKIMGAILCVGGALIMSLYRGKGFHIGHD-HHVAQKQNHDVAVDEA-HWGRGTLLLLGSCFS

Query:  YATWFVVQVKLLKLLPSKYFATMLTCVIACIQSTLLGFCLHTNTNNNYKAAWKLGWDLQLLTILYSGALATAATFCLMTWAISIKGPTFPPMFNPLTLIF
        Y+TWFVVQVKLLKL PSKY ATMLTCVIACIQSTLLG CL TN      A+WKLGWDLQLLTILYSGALATAATFCLMTWAIS++GPTFPPMFNPLTLIF
Subjt:  YATWFVVQVKLLKLLPSKYFATMLTCVIACIQSTLLGFCLHTNTNNNYKAAWKLGWDLQLLTILYSGALATAATFCLMTWAISIKGPTFPPMFNPLTLIF

Query:  VAISEAIIFGEEIRVGNILGTAVMVTGLYCFLWGKTKEMKKSSHLPR--AAAVAVEAATSTSEPAPLQHSASAAVVPS
        VAISE II GEEI+VG++LGT VMV GLYCFLWGKTKEMKKS+HLPR  AAA+A+EAAT+TSEPAPL    SAAVVP+
Subjt:  VAISEAIIFGEEIRVGNILGTAVMVTGLYCFLWGKTKEMKKSSHLPR--AAAVAVEAATSTSEPAPLQHSASAAVVPS

XP_022140811.1 WAT1-related protein At1g43650-like isoform X1 [Momordica charantia] | e value: 8.7e-145 | % identity: 76.66
Query:  LFEASRPVLAMLLVQIFASGMQLLSRVILNHGTFIFALMSYRHLVAALCVAPFAFFFEIR---------------------ITAAMGLYYYGLRDTTATY
        LF ASRPVLAMLLVQ+FA+GMQLLS++ILN GTFIFALM+YRH+VAALCVAPFAFFF+ R                     ITAAMG+YYYGLRDTTATY
Subjt:  LFEASRPVLAMLLVQIFASGMQLLSRVILNHGTFIFALMSYRHLVAALCVAPFAFFFEIR---------------------ITAAMGLYYYGLRDTTATY

Query:  ATNFLNLIPVVTFVISSILRMEKVSLNRRAGKVKIMGAILCVGGALIMSLYRGKGFHIGHDHHVAQKQN--------HDVAVD-EAHWGRGTLLLLGSCF
        ATNFLNLIPVVTFVISS+L MEKVS+ RRAGKVKI+GAILCVGGALI   Y+GKGFHIGH H +A  ++        +++  D EAHWGRGTLLLLGSCF
Subjt:  ATNFLNLIPVVTFVISSILRMEKVSLNRRAGKVKIMGAILCVGGALIMSLYRGKGFHIGHDHHVAQKQN--------HDVAVD-EAHWGRGTLLLLGSCF

Query:  SYATWFVVQVKLLKLLPSKYFATMLTCVIACIQSTLLGFCLHTNTNNNYKAAWKLGWDLQLLTILYSGALATAATFCLMTWAISIKGPTFPPMFNPLTLI
         YA WFVVQVKLLKL PSKY ATMLTCVIACIQST LG CL  N     KAAW LGWDLQLLTILYSGALATAATFCLMTWAISI+GPTFPPMFNP+TLI
Subjt:  SYATWFVVQVKLLKLLPSKYFATMLTCVIACIQSTLLGFCLHTNTNNNYKAAWKLGWDLQLLTILYSGALATAATFCLMTWAISIKGPTFPPMFNPLTLI

Query:  FVAISEAIIFGEEIRVGNILGTAVMVTGLYCFLWGKTKEMKKSSHLPRAAAVAVEAATSTSEPAPLQHSASAAVVPS
         VAISE II GEEIRVGNI+GTAVMV GLYCFLWGKTKEMKKSSHLPRAAA AVEAAT+TSEPAPLQ   SAAVVPS
Subjt:  FVAISEAIIFGEEIRVGNILGTAVMVTGLYCFLWGKTKEMKKSSHLPRAAAVAVEAATSTSEPAPLQHSASAAVVPS

XP_022922848.1 WAT1-related protein At5g64700-like isoform X1 [Cucurbita moschata] | e value: 2.7e-146 | % identity: 75.4
Query:  GVNKVEKLFEASRPVLAMLLVQIFASGMQLLSRVILNHGTFIFALMSYRHLVAALCVAPFAFFFEIR--------------------ITAAMGLYYYGLR
        GVN+++KL +ASRP+LAML VQIFA+GMQLLS+VILNHGTF+FALM+YRHLVAALCVAPFAFFFE R                    ITAAMGLYYYGLR
Subjt:  GVNKVEKLFEASRPVLAMLLVQIFASGMQLLSRVILNHGTFIFALMSYRHLVAALCVAPFAFFFEIR--------------------ITAAMGLYYYGLR

Query:  DTTATYATNFLNLIPVVTFVISSILRMEKVSLNRRAGKVKIMGAILCVGGALIMSLYRGKGFHIGHD-HHVAQKQNHDVAVDEA-HWGRGTLLLLGSCFS
        DTTATYATNFLNLIPVVTFVISS+LR+EKVSL RRAG++ ++GAILCVGG +I S+YRGKGFHIGH   HV    N++ + +   HWGRGTLLLLGSCFS
Subjt:  DTTATYATNFLNLIPVVTFVISSILRMEKVSLNRRAGKVKIMGAILCVGGALIMSLYRGKGFHIGHD-HHVAQKQNHDVAVDEA-HWGRGTLLLLGSCFS

Query:  YATWFVVQVKLLKLLPSKYFATMLTCVIACIQSTLLGFCLHTNTNNNYKAAWKLGWDLQLLTILYSGALATAATFCLMTWAISIKGPTFPPMFNPLTLIF
        Y+TWFVVQVKLLKL PSKY ATMLTCVIACIQSTLLG CL TN      A+WKLGWDLQLLTILYSGALATAATFCLMTWAIS++GPTFPPMFNPLTLIF
Subjt:  YATWFVVQVKLLKLLPSKYFATMLTCVIACIQSTLLGFCLHTNTNNNYKAAWKLGWDLQLLTILYSGALATAATFCLMTWAISIKGPTFPPMFNPLTLIF

Query:  VAISEAIIFGEEIRVGNILGTAVMVTGLYCFLWGKTKEMKKSSHLPR--AAAVAVEAATSTSEPAPLQHSASAAVVPS
        VAISE II GEEI+VG++LGT VMV GLYCFLWGKTKE+KKS+HLPR  AAA+A+EAAT+TSEPAPL    SAAVVP+
Subjt:  VAISEAIIFGEEIRVGNILGTAVMVTGLYCFLWGKTKEMKKSSHLPR--AAAVAVEAATSTSEPAPLQHSASAAVVPS

XP_022984779.1 WAT1-related protein At5g64700-like isoform X1 [Cucurbita maxima] | e value: 1.1e-144 | % identity: 75.07
Query:  GVNKVEKLFEASRPVLAMLLVQIFASGMQLLSRVILNHGTFIFALMSYRHLVAALCVAPFAFFFE-------------------IRITAAMGLYYYGLRD
        GVN+++KL +ASRP+LAML VQIFA+GMQLLS+VILNHGTF+FALM+YRHLVAALCVAPFAFF                       ITAAMGLYYYGLRD
Subjt:  GVNKVEKLFEASRPVLAMLLVQIFASGMQLLSRVILNHGTFIFALMSYRHLVAALCVAPFAFFFE-------------------IRITAAMGLYYYGLRD

Query:  TTATYATNFLNLIPVVTFVISSILRMEKVSLNRRAGKVKIMGAILCVGGALIMSLYRGKGFHIGHD-HHVAQKQNHDVAVDEA-HWGRGTLLLLGSCFSY
        TTATYATNFLNLIPVVTFVISS+LR+EKVSL RRAG++ ++GAILCVGG +I S+YRGKGFHIGH   HV    N++ + +   HWGRGTLLLLGSCFSY
Subjt:  TTATYATNFLNLIPVVTFVISSILRMEKVSLNRRAGKVKIMGAILCVGGALIMSLYRGKGFHIGHD-HHVAQKQNHDVAVDEA-HWGRGTLLLLGSCFSY

Query:  ATWFVVQVKLLKLLPSKYFATMLTCVIACIQSTLLGFCLHTNTNNNYKAAWKLGWDLQLLTILYSGALATAATFCLMTWAISIKGPTFPPMFNPLTLIFV
        +TWFVVQVKLLKL PSKY ATMLTCVIACIQSTLLG CL TN      A+WKLGWDLQLLTILYSGALATAATFCLMTWAIS++GPTFPPMFNPLTLIFV
Subjt:  ATWFVVQVKLLKLLPSKYFATMLTCVIACIQSTLLGFCLHTNTNNNYKAAWKLGWDLQLLTILYSGALATAATFCLMTWAISIKGPTFPPMFNPLTLIFV

Query:  AISEAIIFGEEIRVGNILGTAVMVTGLYCFLWGKTKEMKKSSHLPR--AAAVAVEAATSTSEPAPLQHSASAAVVPS
        AISE II GEEI+VG++LGT VMV GLYCFLWGKTKEMKKS+HLPR  AAA+A+EAAT+TSEPAPL    SAAVVP+
Subjt:  AISEAIIFGEEIRVGNILGTAVMVTGLYCFLWGKTKEMKKSSHLPR--AAAVAVEAATSTSEPAPLQHSASAAVVPS

XP_023553116.1 WAT1-related protein At5g64700-like [Cucurbita pepo subsp. pepo] | e value: 4.2e-147 | % identity: 75.66
Query:  GVNKVEKLFEASRPVLAMLLVQIFASGMQLLSRVILNHGTFIFALMSYRHLVAALCVAPFAFFFEIR--------------------ITAAMGLYYYGLR
        GVN+++KL +ASRP+LAML VQIFA+GMQLLS+VILNHGTF+FALM+YRHLVAALCVAPFAFFFE R                    ITAAMGLYYYGLR
Subjt:  GVNKVEKLFEASRPVLAMLLVQIFASGMQLLSRVILNHGTFIFALMSYRHLVAALCVAPFAFFFEIR--------------------ITAAMGLYYYGLR

Query:  DTTATYATNFLNLIPVVTFVISSILRMEKVSLNRRAGKVKIMGAILCVGGALIMSLYRGKGFHIG-HDHHVAQKQNHDVAVDEA-HWGRGTLLLLGSCFS
        DTTATYATNFLNLIPVVTFVISS+LR+EKVSL RRAG++ ++GAILCVGG +I S+YRGKGFHIG HD HV    N++ + +   HWGRGTLLLLGSCFS
Subjt:  DTTATYATNFLNLIPVVTFVISSILRMEKVSLNRRAGKVKIMGAILCVGGALIMSLYRGKGFHIG-HDHHVAQKQNHDVAVDEA-HWGRGTLLLLGSCFS

Query:  YATWFVVQVKLLKLLPSKYFATMLTCVIACIQSTLLGFCLHTNTNNNYKAAWKLGWDLQLLTILYSGALATAATFCLMTWAISIKGPTFPPMFNPLTLIF
        Y+TWFVVQVKLLKL PSKY ATMLTCVIACIQSTLLG CL TN      A+WKLGWDLQLLTILYSGALATAATFCLMTWAIS++GPTFPPMFNPLTLIF
Subjt:  YATWFVVQVKLLKLLPSKYFATMLTCVIACIQSTLLGFCLHTNTNNNYKAAWKLGWDLQLLTILYSGALATAATFCLMTWAISIKGPTFPPMFNPLTLIF

Query:  VAISEAIIFGEEIRVGNILGTAVMVTGLYCFLWGKTKEMKKSSHLPR--AAAVAVEAATSTSEPAPLQHSASAAVVPS
        V ISE II GEEI+VG++LGT VMV GLYCFLWGKTKEMKKS+HLPR  AAA+A+EAAT+TSEPAPL    SAAVVP+
Subjt:  VAISEAIIFGEEIRVGNILGTAVMVTGLYCFLWGKTKEMKKSSHLPR--AAAVAVEAATSTSEPAPLQHSASAAVVPS

TrEMBL top hits (columns: e value, % identity, alignment)
A0A6J1CH56 WAT1-related protein | e value: 4.2e-145 | % identity: 76.66
Query:  LFEASRPVLAMLLVQIFASGMQLLSRVILNHGTFIFALMSYRHLVAALCVAPFAFFFEIR---------------------ITAAMGLYYYGLRDTTATY
        LF ASRPVLAMLLVQ+FA+GMQLLS++ILN GTFIFALM+YRH+VAALCVAPFAFFF+ R                     ITAAMG+YYYGLRDTTATY
Subjt:  LFEASRPVLAMLLVQIFASGMQLLSRVILNHGTFIFALMSYRHLVAALCVAPFAFFFEIR---------------------ITAAMGLYYYGLRDTTATY

Query:  ATNFLNLIPVVTFVISSILRMEKVSLNRRAGKVKIMGAILCVGGALIMSLYRGKGFHIGHDHHVAQKQN--------HDVAVD-EAHWGRGTLLLLGSCF
        ATNFLNLIPVVTFVISS+L MEKVS+ RRAGKVKI+GAILCVGGALI   Y+GKGFHIGH H +A  ++        +++  D EAHWGRGTLLLLGSCF
Subjt:  ATNFLNLIPVVTFVISSILRMEKVSLNRRAGKVKIMGAILCVGGALIMSLYRGKGFHIGHDHHVAQKQN--------HDVAVD-EAHWGRGTLLLLGSCF

Query:  SYATWFVVQVKLLKLLPSKYFATMLTCVIACIQSTLLGFCLHTNTNNNYKAAWKLGWDLQLLTILYSGALATAATFCLMTWAISIKGPTFPPMFNPLTLI
         YA WFVVQVKLLKL PSKY ATMLTCVIACIQST LG CL  N     KAAW LGWDLQLLTILYSGALATAATFCLMTWAISI+GPTFPPMFNP+TLI
Subjt:  SYATWFVVQVKLLKLLPSKYFATMLTCVIACIQSTLLGFCLHTNTNNNYKAAWKLGWDLQLLTILYSGALATAATFCLMTWAISIKGPTFPPMFNPLTLI

Query:  FVAISEAIIFGEEIRVGNILGTAVMVTGLYCFLWGKTKEMKKSSHLPRAAAVAVEAATSTSEPAPLQHSASAAVVPS
         VAISE II GEEIRVGNI+GTAVMV GLYCFLWGKTKEMKKSSHLPRAAA AVEAAT+TSEPAPLQ   SAAVVPS
Subjt:  FVAISEAIIFGEEIRVGNILGTAVMVTGLYCFLWGKTKEMKKSSHLPRAAAVAVEAATSTSEPAPLQHSASAAVVPS

A0A6J1CI51 WAT1-related protein | e value: 1.2e-144 | % identity: 76.6
Query:  LFEASRPVLAMLLVQIFASGMQLLSRVILNHGTFIFALMSYRHLVAALCVAPFAFFFE--------------------IRITAAMGLYYYGLRDTTATYA
        LF ASRPVLAMLLVQ+FA+GMQLLS++ILN GTFIFALM+YRH+VAALCVAPFAFFF+                      ITAAMG+YYYGLRDTTATYA
Subjt:  LFEASRPVLAMLLVQIFASGMQLLSRVILNHGTFIFALMSYRHLVAALCVAPFAFFFE--------------------IRITAAMGLYYYGLRDTTATYA

Query:  TNFLNLIPVVTFVISSILRMEKVSLNRRAGKVKIMGAILCVGGALIMSLYRGKGFHIGHDHHVAQKQN--------HDVAVD-EAHWGRGTLLLLGSCFS
        TNFLNLIPVVTFVISS+L MEKVS+ RRAGKVKI+GAILCVGGALI   Y+GKGFHIGH H +A  ++        +++  D EAHWGRGTLLLLGSCF 
Subjt:  TNFLNLIPVVTFVISSILRMEKVSLNRRAGKVKIMGAILCVGGALIMSLYRGKGFHIGHDHHVAQKQN--------HDVAVD-EAHWGRGTLLLLGSCFS

Query:  YATWFVVQVKLLKLLPSKYFATMLTCVIACIQSTLLGFCLHTNTNNNYKAAWKLGWDLQLLTILYSGALATAATFCLMTWAISIKGPTFPPMFNPLTLIF
        YA WFVVQVKLLKL PSKY ATMLTCVIACIQST LG CL  N     KAAW LGWDLQLLTILYSGALATAATFCLMTWAISI+GPTFPPMFNP+TLI 
Subjt:  YATWFVVQVKLLKLLPSKYFATMLTCVIACIQSTLLGFCLHTNTNNNYKAAWKLGWDLQLLTILYSGALATAATFCLMTWAISIKGPTFPPMFNPLTLIF

Query:  VAISEAIIFGEEIRVGNILGTAVMVTGLYCFLWGKTKEMKKSSHLPRAAAVAVEAATSTSEPAPLQHSASAAVVPS
        VAISE II GEEIRVGNI+GTAVMV GLYCFLWGKTKEMKKSSHLPRAAA AVEAAT+TSEPAPLQ   SAAVVPS
Subjt:  VAISEAIIFGEEIRVGNILGTAVMVTGLYCFLWGKTKEMKKSSHLPRAAAVAVEAATSTSEPAPLQHSASAAVVPS

A0A6J1E4N2 WAT1-related protein | e value: 2.0e-139 | % identity: 75.9
Query:  MLLVQIFASGMQLLSRVILNHGTFIFALMSYRHLVAALCVAPFAFFFEIR--------------------ITAAMGLYYYGLRDTTATYATNFLNLIPVV
        ML VQIFA+GMQLLS+VILNHGTF+FALM+YRHLVAALCVAPFAFFFE R                    ITAAMGLYYYGLRDTTATYATNFLNLIPVV
Subjt:  MLLVQIFASGMQLLSRVILNHGTFIFALMSYRHLVAALCVAPFAFFFEIR--------------------ITAAMGLYYYGLRDTTATYATNFLNLIPVV

Query:  TFVISSILRMEKVSLNRRAGKVKIMGAILCVGGALIMSLYRGKGFHIGHD-HHVAQKQNHDVAVDEA-HWGRGTLLLLGSCFSYATWFVVQVKLLKLLPS
        TFVISS+LR+EKVSL RRAG++ ++GAILCVGG +I S+YRGKGFHIGH   HV    N++ + +   HWGRGTLLLLGSCFSY+TWFVVQVKLLKL PS
Subjt:  TFVISSILRMEKVSLNRRAGKVKIMGAILCVGGALIMSLYRGKGFHIGHD-HHVAQKQNHDVAVDEA-HWGRGTLLLLGSCFSYATWFVVQVKLLKLLPS

Query:  KYFATMLTCVIACIQSTLLGFCLHTNTNNNYKAAWKLGWDLQLLTILYSGALATAATFCLMTWAISIKGPTFPPMFNPLTLIFVAISEAIIFGEEIRVGN
        KY ATMLTCVIACIQSTLLG CL TN      A+WKLGWDLQLLTILYSGALATAATFCLMTWAIS++GPTFPPMFNPLTLIFVAISE II GEEI+VG+
Subjt:  KYFATMLTCVIACIQSTLLGFCLHTNTNNNYKAAWKLGWDLQLLTILYSGALATAATFCLMTWAISIKGPTFPPMFNPLTLIFVAISEAIIFGEEIRVGN

Query:  ILGTAVMVTGLYCFLWGKTKEMKKSSHLPR--AAAVAVEAATSTSEPAPLQHSASAAVVPS
        +LGT VMV GLYCFLWGKTKE+KKS+HLPR  AAA+A+EAAT+TSEPAPL    SAAVVP+
Subjt:  ILGTAVMVTGLYCFLWGKTKEMKKSSHLPR--AAAVAVEAATSTSEPAPLQHSASAAVVPS

A0A6J1E7Y3 WAT1-related protein | e value: 1.3e-146 | % identity: 75.4
Query:  GVNKVEKLFEASRPVLAMLLVQIFASGMQLLSRVILNHGTFIFALMSYRHLVAALCVAPFAFFFEIR--------------------ITAAMGLYYYGLR
        GVN+++KL +ASRP+LAML VQIFA+GMQLLS+VILNHGTF+FALM+YRHLVAALCVAPFAFFFE R                    ITAAMGLYYYGLR
Subjt:  GVNKVEKLFEASRPVLAMLLVQIFASGMQLLSRVILNHGTFIFALMSYRHLVAALCVAPFAFFFEIR--------------------ITAAMGLYYYGLR

Query:  DTTATYATNFLNLIPVVTFVISSILRMEKVSLNRRAGKVKIMGAILCVGGALIMSLYRGKGFHIGHD-HHVAQKQNHDVAVDEA-HWGRGTLLLLGSCFS
        DTTATYATNFLNLIPVVTFVISS+LR+EKVSL RRAG++ ++GAILCVGG +I S+YRGKGFHIGH   HV    N++ + +   HWGRGTLLLLGSCFS
Subjt:  DTTATYATNFLNLIPVVTFVISSILRMEKVSLNRRAGKVKIMGAILCVGGALIMSLYRGKGFHIGHD-HHVAQKQNHDVAVDEA-HWGRGTLLLLGSCFS

Query:  YATWFVVQVKLLKLLPSKYFATMLTCVIACIQSTLLGFCLHTNTNNNYKAAWKLGWDLQLLTILYSGALATAATFCLMTWAISIKGPTFPPMFNPLTLIF
        Y+TWFVVQVKLLKL PSKY ATMLTCVIACIQSTLLG CL TN      A+WKLGWDLQLLTILYSGALATAATFCLMTWAIS++GPTFPPMFNPLTLIF
Subjt:  YATWFVVQVKLLKLLPSKYFATMLTCVIACIQSTLLGFCLHTNTNNNYKAAWKLGWDLQLLTILYSGALATAATFCLMTWAISIKGPTFPPMFNPLTLIF

Query:  VAISEAIIFGEEIRVGNILGTAVMVTGLYCFLWGKTKEMKKSSHLPR--AAAVAVEAATSTSEPAPLQHSASAAVVPS
        VAISE II GEEI+VG++LGT VMV GLYCFLWGKTKE+KKS+HLPR  AAA+A+EAAT+TSEPAPL    SAAVVP+
Subjt:  VAISEAIIFGEEIRVGNILGTAVMVTGLYCFLWGKTKEMKKSSHLPR--AAAVAVEAATSTSEPAPLQHSASAAVVPS

A0A6J1JBI3 WAT1-related protein | e value: 5.5e-145 | % identity: 75.07
Query:  GVNKVEKLFEASRPVLAMLLVQIFASGMQLLSRVILNHGTFIFALMSYRHLVAALCVAPFAFFFE-------------------IRITAAMGLYYYGLRD
        GVN+++KL +ASRP+LAML VQIFA+GMQLLS+VILNHGTF+FALM+YRHLVAALCVAPFAFF                       ITAAMGLYYYGLRD
Subjt:  GVNKVEKLFEASRPVLAMLLVQIFASGMQLLSRVILNHGTFIFALMSYRHLVAALCVAPFAFFFE-------------------IRITAAMGLYYYGLRD

Query:  TTATYATNFLNLIPVVTFVISSILRMEKVSLNRRAGKVKIMGAILCVGGALIMSLYRGKGFHIGHD-HHVAQKQNHDVAVDEA-HWGRGTLLLLGSCFSY
        TTATYATNFLNLIPVVTFVISS+LR+EKVSL RRAG++ ++GAILCVGG +I S+YRGKGFHIGH   HV    N++ + +   HWGRGTLLLLGSCFSY
Subjt:  TTATYATNFLNLIPVVTFVISSILRMEKVSLNRRAGKVKIMGAILCVGGALIMSLYRGKGFHIGHD-HHVAQKQNHDVAVDEA-HWGRGTLLLLGSCFSY

Query:  ATWFVVQVKLLKLLPSKYFATMLTCVIACIQSTLLGFCLHTNTNNNYKAAWKLGWDLQLLTILYSGALATAATFCLMTWAISIKGPTFPPMFNPLTLIFV
        +TWFVVQVKLLKL PSKY ATMLTCVIACIQSTLLG CL TN      A+WKLGWDLQLLTILYSGALATAATFCLMTWAIS++GPTFPPMFNPLTLIFV
Subjt:  ATWFVVQVKLLKLLPSKYFATMLTCVIACIQSTLLGFCLHTNTNNNYKAAWKLGWDLQLLTILYSGALATAATFCLMTWAISIKGPTFPPMFNPLTLIFV

Query:  AISEAIIFGEEIRVGNILGTAVMVTGLYCFLWGKTKEMKKSSHLPR--AAAVAVEAATSTSEPAPLQHSASAAVVPS
        AISE II GEEI+VG++LGT VMV GLYCFLWGKTKEMKKS+HLPR  AAA+A+EAAT+TSEPAPL    SAAVVP+
Subjt:  AISEAIIFGEEIRVGNILGTAVMVTGLYCFLWGKTKEMKKSSHLPR--AAAVAVEAATSTSEPAPLQHSASAAVVPS

SwissProt top hits (columns: e value, % identity, alignment)
Q8GXB4 WAT1-related protein At1g09380 | e value: 3.6e-45 | % identity: 32.73
Query:  PVLAMLLVQIFASGMQLLSRVILNHGTFIFALMSYRHLVAALCVAPFAFFFEIR-----------------ITAAMG---LYYYGLRDTTATYATNFLNL
        P LAM+LVQI  +GM + S++ +  G     L++YR + A +   P AFF E +                 IT A G   LY+ GL++++ T A    NL
Subjt:  PVLAMLLVQIFASGMQLLSRVILNHGTFIFALMSYRHLVAALCVAPFAFFFEIR-----------------ITAAMG---LYYYGLRDTTATYATNFLNL

Query:  IPVVTFVISSILRMEKVSLNRRAGKVKIMGAILCVGGALIMSLYRGKGFHIGHDH-HVAQKQN---HDVAVDEAHWGRGTLLLLGSCFSYATWFVVQVKL
        +P VTF++++I R E V + + +G+ K++G ++CV GA+++S Y G    IG    H A  +N   H  +   +++  G  L++ +  S+A WF++Q K+
Subjt:  IPVVTFVISSILRMEKVSLNRRAGKVKIMGAILCVGGALIMSLYRGKGFHIGHDH-HVAQKQN---HDVAVDEAHWGRGTLLLLGSCFSYATWFVVQVKL

Query:  LKLLPSKYFATMLTCVIACIQSTLLGFCLHTNTNNNYKAAWKLGWDLQLLTILYSGALATAATFCLMTWAISIKGPTFPPMFNPLTLIFVAISEAIIFGE
         +   + Y +T+L C++  IQ   +        +++  + W L   L+ ++ LY+G +A+A  FCLM+WA+  KGP +  +F+PL L+ VAI    +  E
Subjt:  LKLLPSKYFATMLTCVIACIQSTLLGFCLHTNTNNNYKAAWKLGWDLQLLTILYSGALATAATFCLMTWAISIKGPTFPPMFNPLTLIFVAISEAIIFGE

Query:  EIRVGNILGTAVMVTGLYCFLWGKTKEMKK
        ++  G  +G+A++V GLY  LWGK +E+ +
Subjt:  EIRVGNILGTAVMVTGLYCFLWGKTKEMKK

Q9FGG3 WAT1-related protein At5g64700 | e value: 4.3e-46 | % identity: 31.56
Query:  EASRPVLAMLLVQIFASGMQLLSRVILNHGTFIFALMSYRHLVAALCVAPFAFFFEIR--------------------ITAAMGLYYYGLRDTTATYATN
        E+ +P L + ++Q+  + M L+S+ + N G   F  + YR   A + +AP AFFFE +                    +T ++ L    L  T+AT A  
Subjt:  EASRPVLAMLLVQIFASGMQLLSRVILNHGTFIFALMSYRHLVAALCVAPFAFFFEIR--------------------ITAAMGLYYYGLRDTTATYATN

Query:  FLNLIPVVTFVISSILRMEKVSLNRRAGKVKIMGAILCVGGALIMSLYRG--------KGFHIGHDHHVAQKQNHDVAVDEAHWGRGTLLLLGSCFSYAT
            +P +TF ++ +  ME++ +    G  K++G  +C+GG +I+++Y+G          F+ G +H       H V+     W +G +L++ S   +  
Subjt:  FLNLIPVVTFVISSILRMEKVSLNRRAGKVKIMGAILCVGGALIMSLYRG--------KGFHIGHDHHVAQKQNHDVAVDEAHWGRGTLLLLGSCFSYAT

Query:  WFVVQVKLLKLLPSKYFATMLTCVIACIQSTLLGFCLHTNTNNNYKAAWKLGWDLQLLTILYSGALATAATFCLMTWAISIKGPTFPPMFNPLTLIFVAI
        W V+Q ++LK+ PSK + T L C+++ IQS ++   L  +      +AWKLGW+L+L+ ++Y G + T   + L +W I  +GP F  MF PL+L+F  +
Subjt:  WFVVQVKLLKLLPSKYFATMLTCVIACIQSTLLGFCLHTNTNNNYKAAWKLGWDLQLLTILYSGALATAATFCLMTWAISIKGPTFPPMFNPLTLIFVAI

Query:  SEAIIFGEEIRVGNILGTAVMVTGLYCFLWGKTKEMKKS
        S AI+  E I +G+I+G  +++ GLYC LWGK++E K S
Subjt:  SEAIIFGEEIRVGNILGTAVMVTGLYCFLWGKTKEMKKS

Q9LPF1 WAT1-related protein At1g44800 | e value: 6.9e-44 | % identity: 31.19
Query:  EASRPVLAMLLVQIFASGMQLLSRVILNHGTFIFALMSYRHLVAALCVAPFAFFFEIRITAAM--------------------GLYYYGLRDTTATYATN
        E  +P+LA++ +Q   +GM +++ V   HG   + L +YRH+VA + +APFA  FE +I   M                     LYY GL++T+A+Y + 
Subjt:  EASRPVLAMLLVQIFASGMQLLSRVILNHGTFIFALMSYRHLVAALCVAPFAFFFEIRITAAM--------------------GLYYYGLRDTTATYATN

Query:  FLNLIPVVTFVISSILRMEKVSLNRRAGKVKIMGAILCVGGALIMSLYRGKGFHIGHDHHVAQKQNHDVAVDEAHWGRGTLLLLGSCFSYATWFVVQVKL
        F N +P VTF+++ I R+E V+  +     K++G ++ VGGA+IM+LY+G    I    H +            HW  GT+ ++GS  ++A +F++Q   
Subjt:  FLNLIPVVTFVISSILRMEKVSLNRRAGKVKIMGAILCVGGALIMSLYRGKGFHIGHDHHVAQKQNHDVAVDEAHWGRGTLLLLGSCFSYATWFVVQVKL

Query:  LKLLPSKYFATMLTCVIACIQSTLLGFCLHTNTNNNYKAAWKLGWDLQLLTILYSGALATAATFCLMTWAISIKGPTFPPMFNPLTLIFVAISEAIIFGE
        LK+ P++     L C I  I + +    +  +      +AWK+G D   L  +YSG + +   + + +  I  +GP F   F+P+ +I  A   A++  E
Subjt:  LKLLPSKYFATMLTCVIACIQSTLLGFCLHTNTNNNYKAAWKLGWDLQLLTILYSGALATAATFCLMTWAISIKGPTFPPMFNPLTLIFVAISEAIIFGE

Query:  EIRVGNILGTAVMVTGLYCFLWGKTKE
        +I +G+I+G   +V GLY  +WGK+K+
Subjt:  EIRVGNILGTAVMVTGLYCFLWGKTKE

Q9LV20 WAT1-related protein At3g18200 | e value: 1.8e-44 | % identity: 33.23
Query:  EASRPVLAMLLVQIFASGMQLLSRVILNHGTFIFALMSYRHLVAALCVAPFAFFFE--------------------IRITAAMGLYYYGLRDTTATYATN
        E  + V+A++ +Q   +G  ++SRV LN G        YR+L+A L + PFA+FFE                    I ITA  G Y  GL   T T+A+ 
Subjt:  EASRPVLAMLLVQIFASGMQLLSRVILNHGTFIFALMSYRHLVAALCVAPFAFFFE--------------------IRITAAMGLYYYGLRDTTATYATN

Query:  FLNLIPVVTFVISSILRMEKVSLNRRAGKVKIMGAILCVGGALIMSLYRG-----KGFHIGHDHHVAQKQNHDVAVDEAHWGRGTLLLLGSCFSYATWFV
          N +P +TF+++  LR+E + L R+ G  K++G ++ +GGA +++LYRG     +G ++  +  V    +H + +       G L L+G C S+A W V
Subjt:  FLNLIPVVTFVISSILRMEKVSLNRRAGKVKIMGAILCVGGALIMSLYRG-----KGFHIGHDHHVAQKQNHDVAVDEAHWGRGTLLLLGSCFSYATWFV

Query:  VQVKLLKLLPSKYFATMLTCVIACIQSTLLGFCLHTNTNNNYKAAWKLGWDLQLLTILYSGALATAATFCLMTWAISIKGPTFPPMFNPLTLIFVAISEA
        +Q  +LK  P+K   T  TC    IQ  ++   + T+ NN    +W+     +L TILY+G +A+     L TW I   GP F  +F PL  + VA    
Subjt:  VQVKLLKLLPSKYFATMLTCVIACIQSTLLGFCLHTNTNNNYKAAWKLGWDLQLLTILYSGALATAATFCLMTWAISIKGPTFPPMFNPLTLIFVAISEA

Query:  IIFGEEIRVGNILGTAVMVTGLYCFLWGKTKEMK
        +I G+++  G I+G   ++ GLY  LWGK +E K
Subjt:  IIFGEEIRVGNILGTAVMVTGLYCFLWGKTKEMK

Q9ZUS1 WAT1-related protein At2g37460 | e value: 2.3e-47 | % identity: 33.14
Query:  KVEKLFEASRPVLAMLLVQIFASGMQLLSRVILNHGTFIFALMSYRHLVAALCVAPFAFFFEIRITAAM--------------------GLYYYGLRDTT
        K     E +RP ++M+++Q+  +GM +LS+ +LN G   + L+ YRH VA + +APFAF+F+ ++   M                     LYY G++ TT
Subjt:  KVEKLFEASRPVLAMLLVQIFASGMQLLSRVILNHGTFIFALMSYRHLVAALCVAPFAFFFEIRITAAM--------------------GLYYYGLRDTT

Query:  ATYATNFLNLIPVVTFVISSILRMEKVSLNRRAGKVKIMGAILCVGGALIMSLYRGKGFHIGHDHHVAQKQNHDVAVDEAHWG-RGTLLLLGSCFSYATW
        AT+AT   N++P +TFV++ I  +E+V L       K++G +  VGGA+IM+L +G    +     V+    H+ A  + H   +G +L+   CFSYA +
Subjt:  ATYATNFLNLIPVVTFVISSILRMEKVSLNRRAGKVKIMGAILCVGGALIMSLYRGKGFHIGHDHHVAQKQNHDVAVDEAHWG-RGTLLLLGSCFSYATW

Query:  FVVQVKLLKLLPSKYFATMLTCVIACIQSTLLGFCLHTNTNNNYKAAWKLGWDLQLLTILYSGALATAATFCLMTWAISIKGPTFPPMFNPLTLIFVAIS
         ++Q   L+  P++   T   C++  I+ T +   +         +AW +GWD +LLT  YSG + +A  + +    +  +GP F   F+PL +I VAI 
Subjt:  FVVQVKLLKLLPSKYFATMLTCVIACIQSTLLGFCLHTNTNNNYKAAWKLGWDLQLLTILYSGALATAATFCLMTWAISIKGPTFPPMFNPLTLIFVAIS

Query:  EAIIFGEEIRVGNILGTAVMVTGLYCFLWGKTKEMKKSSHL
          IIF E++ +G +LG  V+  GLY  +WGK K+ K +S L
Subjt:  EAIIFGEEIRVGNILGTAVMVTGLYCFLWGKTKEMKKSSHL

Arabidopsis top hits (columns: e value, % identity, alignment)
AT1G09380.1 nodulin MtN21 /EamA-like transporter family protein | e value: 2.6e-46 | % identity: 32.73
Query:  PVLAMLLVQIFASGMQLLSRVILNHGTFIFALMSYRHLVAALCVAPFAFFFEIR-----------------ITAAMG---LYYYGLRDTTATYATNFLNL
        P LAM+LVQI  +GM + S++ +  G     L++YR + A +   P AFF E +                 IT A G   LY+ GL++++ T A    NL
Subjt:  PVLAMLLVQIFASGMQLLSRVILNHGTFIFALMSYRHLVAALCVAPFAFFFEIR-----------------ITAAMG---LYYYGLRDTTATYATNFLNL

Query:  IPVVTFVISSILRMEKVSLNRRAGKVKIMGAILCVGGALIMSLYRGKGFHIGHDH-HVAQKQN---HDVAVDEAHWGRGTLLLLGSCFSYATWFVVQVKL
        +P VTF++++I R E V + + +G+ K++G ++CV GA+++S Y G    IG    H A  +N   H  +   +++  G  L++ +  S+A WF++Q K+
Subjt:  IPVVTFVISSILRMEKVSLNRRAGKVKIMGAILCVGGALIMSLYRGKGFHIGHDH-HVAQKQN---HDVAVDEAHWGRGTLLLLGSCFSYATWFVVQVKL

Query:  LKLLPSKYFATMLTCVIACIQSTLLGFCLHTNTNNNYKAAWKLGWDLQLLTILYSGALATAATFCLMTWAISIKGPTFPPMFNPLTLIFVAISEAIIFGE
         +   + Y +T+L C++  IQ   +        +++  + W L   L+ ++ LY+G +A+A  FCLM+WA+  KGP +  +F+PL L+ VAI    +  E
Subjt:  LKLLPSKYFATMLTCVIACIQSTLLGFCLHTNTNNNYKAAWKLGWDLQLLTILYSGALATAATFCLMTWAISIKGPTFPPMFNPLTLIFVAISEAIIFGE

Query:  EIRVGNILGTAVMVTGLYCFLWGKTKEMKK
        ++  G  +G+A++V GLY  LWGK +E+ +
Subjt:  EIRVGNILGTAVMVTGLYCFLWGKTKEMKK

AT1G44800.1 nodulin MtN21 /EamA-like transporter family protein | e value: 4.9e-45 | % identity: 31.19
Query:  EASRPVLAMLLVQIFASGMQLLSRVILNHGTFIFALMSYRHLVAALCVAPFAFFFEIRITAAM--------------------GLYYYGLRDTTATYATN
        E  +P+LA++ +Q   +GM +++ V   HG   + L +YRH+VA + +APFA  FE +I   M                     LYY GL++T+A+Y + 
Subjt:  EASRPVLAMLLVQIFASGMQLLSRVILNHGTFIFALMSYRHLVAALCVAPFAFFFEIRITAAM--------------------GLYYYGLRDTTATYATN

Query:  FLNLIPVVTFVISSILRMEKVSLNRRAGKVKIMGAILCVGGALIMSLYRGKGFHIGHDHHVAQKQNHDVAVDEAHWGRGTLLLLGSCFSYATWFVVQVKL
        F N +P VTF+++ I R+E V+  +     K++G ++ VGGA+IM+LY+G    I    H +            HW  GT+ ++GS  ++A +F++Q   
Subjt:  FLNLIPVVTFVISSILRMEKVSLNRRAGKVKIMGAILCVGGALIMSLYRGKGFHIGHDHHVAQKQNHDVAVDEAHWGRGTLLLLGSCFSYATWFVVQVKL

Query:  LKLLPSKYFATMLTCVIACIQSTLLGFCLHTNTNNNYKAAWKLGWDLQLLTILYSGALATAATFCLMTWAISIKGPTFPPMFNPLTLIFVAISEAIIFGE
        LK+ P++     L C I  I + +    +  +      +AWK+G D   L  +YSG + +   + + +  I  +GP F   F+P+ +I  A   A++  E
Subjt:  LKLLPSKYFATMLTCVIACIQSTLLGFCLHTNTNNNYKAAWKLGWDLQLLTILYSGALATAATFCLMTWAISIKGPTFPPMFNPLTLIFVAISEAIIFGE

Query:  EIRVGNILGTAVMVTGLYCFLWGKTKE
        +I +G+I+G   +V GLY  +WGK+K+
Subjt:  EIRVGNILGTAVMVTGLYCFLWGKTKE

AT2G37460.1 nodulin MtN21 /EamA-like transporter family protein | e value: 1.6e-48 | % identity: 33.14
Query:  KVEKLFEASRPVLAMLLVQIFASGMQLLSRVILNHGTFIFALMSYRHLVAALCVAPFAFFFEIRITAAM--------------------GLYYYGLRDTT
        K     E +RP ++M+++Q+  +GM +LS+ +LN G   + L+ YRH VA + +APFAF+F+ ++   M                     LYY G++ TT
Subjt:  KVEKLFEASRPVLAMLLVQIFASGMQLLSRVILNHGTFIFALMSYRHLVAALCVAPFAFFFEIRITAAM--------------------GLYYYGLRDTT

Query:  ATYATNFLNLIPVVTFVISSILRMEKVSLNRRAGKVKIMGAILCVGGALIMSLYRGKGFHIGHDHHVAQKQNHDVAVDEAHWG-RGTLLLLGSCFSYATW
        AT+AT   N++P +TFV++ I  +E+V L       K++G +  VGGA+IM+L +G    +     V+    H+ A  + H   +G +L+   CFSYA +
Subjt:  ATYATNFLNLIPVVTFVISSILRMEKVSLNRRAGKVKIMGAILCVGGALIMSLYRGKGFHIGHDHHVAQKQNHDVAVDEAHWG-RGTLLLLGSCFSYATW

Query:  FVVQVKLLKLLPSKYFATMLTCVIACIQSTLLGFCLHTNTNNNYKAAWKLGWDLQLLTILYSGALATAATFCLMTWAISIKGPTFPPMFNPLTLIFVAIS
         ++Q   L+  P++   T   C++  I+ T +   +         +AW +GWD +LLT  YSG + +A  + +    +  +GP F   F+PL +I VAI 
Subjt:  FVVQVKLLKLLPSKYFATMLTCVIACIQSTLLGFCLHTNTNNNYKAAWKLGWDLQLLTILYSGALATAATFCLMTWAISIKGPTFPPMFNPLTLIFVAIS

Query:  EAIIFGEEIRVGNILGTAVMVTGLYCFLWGKTKEMKKSSHL
          IIF E++ +G +LG  V+  GLY  +WGK K+ K +S L
Subjt:  EAIIFGEEIRVGNILGTAVMVTGLYCFLWGKTKEMKKSSHL

AT3G18200.1 nodulin MtN21 /EamA-like transporter family protein | e value: 1.3e-45 | % identity: 33.23
Query:  EASRPVLAMLLVQIFASGMQLLSRVILNHGTFIFALMSYRHLVAALCVAPFAFFFE--------------------IRITAAMGLYYYGLRDTTATYATN
        E  + V+A++ +Q   +G  ++SRV LN G        YR+L+A L + PFA+FFE                    I ITA  G Y  GL   T T+A+ 
Subjt:  EASRPVLAMLLVQIFASGMQLLSRVILNHGTFIFALMSYRHLVAALCVAPFAFFFE--------------------IRITAAMGLYYYGLRDTTATYATN

Query:  FLNLIPVVTFVISSILRMEKVSLNRRAGKVKIMGAILCVGGALIMSLYRG-----KGFHIGHDHHVAQKQNHDVAVDEAHWGRGTLLLLGSCFSYATWFV
          N +P +TF+++  LR+E + L R+ G  K++G ++ +GGA +++LYRG     +G ++  +  V    +H + +       G L L+G C S+A W V
Subjt:  FLNLIPVVTFVISSILRMEKVSLNRRAGKVKIMGAILCVGGALIMSLYRG-----KGFHIGHDHHVAQKQNHDVAVDEAHWGRGTLLLLGSCFSYATWFV

Query:  VQVKLLKLLPSKYFATMLTCVIACIQSTLLGFCLHTNTNNNYKAAWKLGWDLQLLTILYSGALATAATFCLMTWAISIKGPTFPPMFNPLTLIFVAISEA
        +Q  +LK  P+K   T  TC    IQ  ++   + T+ NN    +W+     +L TILY+G +A+     L TW I   GP F  +F PL  + VA    
Subjt:  VQVKLLKLLPSKYFATMLTCVIACIQSTLLGFCLHTNTNNNYKAAWKLGWDLQLLTILYSGALATAATFCLMTWAISIKGPTFPPMFNPLTLIFVAISEA

Query:  IIFGEEIRVGNILGTAVMVTGLYCFLWGKTKEMK
        +I G+++  G I+G   ++ GLY  LWGK +E K
Subjt:  IIFGEEIRVGNILGTAVMVTGLYCFLWGKTKEMK

AT5G64700.1 nodulin MtN21 /EamA-like transporter family protein | e value: 3.1e-47 | % identity: 31.56
Query:  EASRPVLAMLLVQIFASGMQLLSRVILNHGTFIFALMSYRHLVAALCVAPFAFFFEIR--------------------ITAAMGLYYYGLRDTTATYATN
        E+ +P L + ++Q+  + M L+S+ + N G   F  + YR   A + +AP AFFFE +                    +T ++ L    L  T+AT A  
Subjt:  EASRPVLAMLLVQIFASGMQLLSRVILNHGTFIFALMSYRHLVAALCVAPFAFFFEIR--------------------ITAAMGLYYYGLRDTTATYATN

Query:  FLNLIPVVTFVISSILRMEKVSLNRRAGKVKIMGAILCVGGALIMSLYRG--------KGFHIGHDHHVAQKQNHDVAVDEAHWGRGTLLLLGSCFSYAT
            +P +TF ++ +  ME++ +    G  K++G  +C+GG +I+++Y+G          F+ G +H       H V+     W +G +L++ S   +  
Subjt:  FLNLIPVVTFVISSILRMEKVSLNRRAGKVKIMGAILCVGGALIMSLYRG--------KGFHIGHDHHVAQKQNHDVAVDEAHWGRGTLLLLGSCFSYAT

Query:  WFVVQVKLLKLLPSKYFATMLTCVIACIQSTLLGFCLHTNTNNNYKAAWKLGWDLQLLTILYSGALATAATFCLMTWAISIKGPTFPPMFNPLTLIFVAI
        W V+Q ++LK+ PSK + T L C+++ IQS ++   L  +      +AWKLGW+L+L+ ++Y G + T   + L +W I  +GP F  MF PL+L+F  +
Subjt:  WFVVQVKLLKLLPSKYFATMLTCVIACIQSTLLGFCLHTNTNNNYKAAWKLGWDLQLLTILYSGALATAATFCLMTWAISIKGPTFPPMFNPLTLIFVAI

Query:  SEAIIFGEEIRVGNILGTAVMVTGLYCFLWGKTKEMKKS
        S AI+  E I +G+I+G  +++ GLYC LWGK++E K S
Subjt:  SEAIIFGEEIRVGNILGTAVMVTGLYCFLWGKTKEMKKS


Sequences
CDS sequence
ATGGGTGGTGTGAATAAAGTTGAGAAATTGTTCGAGGCCTCTCGCCCCGTTCTCGCCATGTTGCTCGTCCAAATTTTTGCCAGCGGAATGCAACTTCTTAGCAGAGTTAT
TCTCAATCATGGCACCTTCATTTTTGCACTCATGTCCTACCGTCATCTCGTCGCCGCTCTATGCGTTGCACCCTTTGCTTTCTTCTTCGAAATTAGAATAACAGCGGCGA
TGGGGCTGTACTACTATGGTCTACGAGACACGACAGCTACTTATGCTACGAACTTCCTGAACTTGATTCCCGTAGTGACATTTGTCATATCCTCCATCCTTAGGATGGAA
AAAGTGAGCTTGAATAGAAGGGCAGGGAAAGTAAAGATAATGGGGGCAATTTTGTGCGTTGGGGGAGCATTAATTATGAGTCTATACAGAGGAAAAGGATTCCACATTGG
TCATGATCATCATGTTGCCCAGAAGCAGAATCATGATGTCGCAGTCGATGAGGCTCACTGGGGACGAGGCACCCTCCTGCTTCTTGGAAGTTGCTTCTCCTATGCTACTT
GGTTTGTTGTCCAAGTGAAGTTGCTCAAACTGCTTCCATCTAAGTATTTCGCCACCATGCTAACATGTGTCATAGCATGCATTCAGTCAACACTGCTTGGCTTCTGCCTC
CACACCAACACCAACAACAACTACAAGGCCGCTTGGAAGTTGGGTTGGGATCTGCAGCTTCTCACCATTTTATACTCGGGAGCATTGGCGACCGCAGCTACTTTTTGCTT
GATGACATGGGCAATTTCAATCAAAGGACCCACTTTCCCTCCCATGTTCAATCCCCTCACTCTAATTTTTGTGGCAATCTCAGAAGCCATCATATTTGGCGAGGAGATTA
GAGTGGGCAACATTTTGGGGACGGCTGTGATGGTAACGGGGCTCTACTGTTTCCTGTGGGGTAAGACAAAGGAGATGAAGAAATCATCGCATCTCCCGAGAGCAGCTGCG
GTTGCAGTTGAAGCAGCAACATCAACTTCAGAACCTGCACCACTGCAGCATTCAGCATCAGCAGCTGTAGTGCCAAGCACTGAACGACTGCAATTTTAA
mRNA sequence
ATGGGTGGTGTGAATAAAGTTGAGAAATTGTTCGAGGCCTCTCGCCCCGTTCTCGCCATGTTGCTCGTCCAAATTTTTGCCAGCGGAATGCAACTTCTTAGCAGAGTTAT
TCTCAATCATGGCACCTTCATTTTTGCACTCATGTCCTACCGTCATCTCGTCGCCGCTCTATGCGTTGCACCCTTTGCTTTCTTCTTCGAAATTAGAATAACAGCGGCGA
TGGGGCTGTACTACTATGGTCTACGAGACACGACAGCTACTTATGCTACGAACTTCCTGAACTTGATTCCCGTAGTGACATTTGTCATATCCTCCATCCTTAGGATGGAA
AAAGTGAGCTTGAATAGAAGGGCAGGGAAAGTAAAGATAATGGGGGCAATTTTGTGCGTTGGGGGAGCATTAATTATGAGTCTATACAGAGGAAAAGGATTCCACATTGG
TCATGATCATCATGTTGCCCAGAAGCAGAATCATGATGTCGCAGTCGATGAGGCTCACTGGGGACGAGGCACCCTCCTGCTTCTTGGAAGTTGCTTCTCCTATGCTACTT
GGTTTGTTGTCCAAGTGAAGTTGCTCAAACTGCTTCCATCTAAGTATTTCGCCACCATGCTAACATGTGTCATAGCATGCATTCAGTCAACACTGCTTGGCTTCTGCCTC
CACACCAACACCAACAACAACTACAAGGCCGCTTGGAAGTTGGGTTGGGATCTGCAGCTTCTCACCATTTTATACTCGGGAGCATTGGCGACCGCAGCTACTTTTTGCTT
GATGACATGGGCAATTTCAATCAAAGGACCCACTTTCCCTCCCATGTTCAATCCCCTCACTCTAATTTTTGTGGCAATCTCAGAAGCCATCATATTTGGCGAGGAGATTA
GAGTGGGCAACATTTTGGGGACGGCTGTGATGGTAACGGGGCTCTACTGTTTCCTGTGGGGTAAGACAAAGGAGATGAAGAAATCATCGCATCTCCCGAGAGCAGCTGCG
GTTGCAGTTGAAGCAGCAACATCAACTTCAGAACCTGCACCACTGCAGCATTCAGCATCAGCAGCTGTAGTGCCAAGCACTGAACGACTGCAATTTTAA
Protein sequence
MGGVNKVEKLFEASRPVLAMLLVQIFASGMQLLSRVILNHGTFIFALMSYRHLVAALCVAPFAFFFEIRITAAMGLYYYGLRDTTATYATNFLNLIPVVTFVISSILRME
KVSLNRRAGKVKIMGAILCVGGALIMSLYRGKGFHIGHDHHVAQKQNHDVAVDEAHWGRGTLLLLGSCFSYATWFVVQVKLLKLLPSKYFATMLTCVIACIQSTLLGFCL
HTNTNNNYKAAWKLGWDLQLLTILYSGALATAATFCLMTWAISIKGPTFPPMFNPLTLIFVAISEAIIFGEEIRVGNILGTAVMVTGLYCFLWGKTKEMKKSSHLPRAAA
VAVEAATSTSEPAPLQHSASAAVVPSTERLQF