; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr016955 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr016955
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionWAT1-related protein At3g02690, chloroplastic
Genome locationtig00153016:971177..980417
RNA-Seq ExpressionSgr016955
SyntenySgr016955
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR000620 - EamA domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6579056.1 WAT1-related protein, chloroplastic, partial [Cucurbita argyrosperma subsp. sororia]1.2e-15074.15Show/hide
Query:  MAGCWPSICSYLPTPTLTTVSPKLSSSSSPLNSRPLLHFSFSRQV---RISSAAPILRRRRVLYHLNGRRYGPENDWFRADYISIPVVSCTRSGADTELD
        MAGC       L T TLTT S +L SSSSP N RPL  FSF RQ+    ISS APILRRRRV + L  RRY  E  WFR DY++IPV++CTRSGADT+LD
Subjt:  MAGCWPSICSYLPTPTLTTVSPKLSSSSSPLNSRPLLHFSFSRQV---RISSAAPILRRRRVLYHLNGRRYGPENDWFRADYISIPVVSCTRSGADTELD

Query:  LLESFDCVGTAQDVECVVSLTDEDPRSSIGQPLELENSGVDGDGSVALLAAAGKAWEFAVLVSPFFFWGTAMVAMKEVLPRSGPFFVSAFRLIPAGFLLI
          ES DCVGTAQDVECVVS TDE+    +G   + +N   DGDGSVA+L    KAWEFAVLVSPFFFWGTAMVAMKEVLPRSGPFFVSAFRL+PAGFLLI
Subjt:  LLESFDCVGTAQDVECVVSLTDEDPRSSIGQPLELENSGVDGDGSVALLAAAGKAWEFAVLVSPFFFWGTAMVAMKEVLPRSGPFFVSAFRLIPAGFLLI

Query:  AFAAFRGRSFPSGFSAWLSIVLFALVDATSFQGFLAQGLQRTSAGLGSVCFIFASYASLHRHLSASLQISEIVAVLAALLFGESISFIGAAGLVLGVLGL
        AFAAFRGR FPSGFSAW+SI+LFALVDAT FQGFLAQGLQRTSAGLGSV             +  S  ++  VAVLAA LFGESI  +GAAGLVLGVLGL
Subjt:  AFAAFRGRSFPSGFSAWLSIVLFALVDATSFQGFLAQGLQRTSAGLGSVCFIFASYASLHRHLSASLQISEIVAVLAALLFGESISFIGAAGLVLGVLGL

Query:  LLLEVPSLALDASSFSLWGSGEWWMFLAAQSMAVGTVMVRWVSKYSDPVMATGWHMVIGGFPLLMICILNHDPAVSGNLKDFSTNDILALLYASIFGSAI
        LLLEVPSL LDASSFSLWGSGEWWMFLAAQSMAVGTVMVRWVSKYSDP+MATGWHMVIGG PLL IC LNH+PAVSG+L+DF+TNDILAL YASIFGSA+
Subjt:  LLLEVPSLALDASSFSLWGSGEWWMFLAAQSMAVGTVMVRWVSKYSDPVMATGWHMVIGGFPLLMICILNHDPAVSGNLKDFSTNDILALLYASIFGSAI

Query:  SYGSFFYSATKGEI
        SYGSFFYSATKG +
Subjt:  SYGSFFYSATKGEI

XP_022141445.1 WAT1-related protein At3g02690, chloroplastic [Momordica charantia]2.0e-15876.14Show/hide
Query:  MAGCWPSICSYLPTPTLTTVSPKLSSSSSPLNSRPLLHFSFSRQVRI----SSAAPILRRRRVLYHLNGRRYGPENDWFRADYISIPVVSCTRSGADTEL
        MAG WPS  S LPTP+               NSRPLLHFSFSRQ+ +    SS API+ RR   +  +  RYG  NDWFRADY+ IPVV+CTRSGADTEL
Subjt:  MAGCWPSICSYLPTPTLTTVSPKLSSSSSPLNSRPLLHFSFSRQVRI----SSAAPILRRRRVLYHLNGRRYGPENDWFRADYISIPVVSCTRSGADTEL

Query:  DLLESFDCVGTAQDVECVVSLTDEDPRSSIGQPLELENSGVDGDGSVALLAAAGKAWEFAVLVSPFFFWGTAMVAMKEVLPRSGPFFVSAFRLIPAGFLL
        DL ES DCVGTAQDVECVVS  DEDPRSSI        + VDGDGS+A+L   GKAWEFAVLVSPFFFWGTAMVAMKEVLPRSGPFFVSAFRLIPAGFLL
Subjt:  DLLESFDCVGTAQDVECVVSLTDEDPRSSIGQPLELENSGVDGDGSVALLAAAGKAWEFAVLVSPFFFWGTAMVAMKEVLPRSGPFFVSAFRLIPAGFLL

Query:  IAFAAFRGRSFPSGFSAWLSIVLFALVDATSFQGFLAQGLQRTSAGLGSVCFIFASYASLHRHLSASLQISEIVAVLAALLFGESISFIGAAGLVLGVLG
        IAFAAFRGR FPSGFSAW+SI LFALVDATSFQGFLAQGLQRTSAGLGSV             +  S  ++  VAVLAALLFGESIS IGAAGL+LGVLG
Subjt:  IAFAAFRGRSFPSGFSAWLSIVLFALVDATSFQGFLAQGLQRTSAGLGSVCFIFASYASLHRHLSASLQISEIVAVLAALLFGESISFIGAAGLVLGVLG

Query:  LLLLEVPSLALDASSFSLWGSGEWWMFLAAQSMAVGTVMVRWVSKYSDPVMATGWHMVIGGFPLLMICILNHDPAVSGNLKDFSTNDILALLYASIFGSA
        LLLLEVPSLALDAS+FSLWGSGEWWMFLAAQSMAVGTVMVRWVSKYSDP+MATGWHMVIGG PLL ICILNHDPA+SG+LKDF+TNDILALLYASIFGSA
Subjt:  LLLLEVPSLALDASSFSLWGSGEWWMFLAAQSMAVGTVMVRWVSKYSDPVMATGWHMVIGGFPLLMICILNHDPAVSGNLKDFSTNDILALLYASIFGSA

Query:  ISYGSFFYSATKGEI
        ISYGSFFYSATKG +
Subjt:  ISYGSFFYSATKGEI

XP_022993106.1 WAT1-related protein At3g02690, chloroplastic [Cucurbita maxima]1.0e-14973.91Show/hide
Query:  MAGCWPSICSYLPTPTLTTVSPKLSSSSSPLNSRPLLHFSFSRQV---RISSAAPILRRRRVLYHLNGRRYGPENDWFRADYISIPVVSCTRSGADTELD
        MAGC       L T TLTT SP+L SS SP N RPL  FSF RQ+    ISS APILRRRRV + L  RRY  E   FR DY++IPV +CTRSGADT+LD
Subjt:  MAGCWPSICSYLPTPTLTTVSPKLSSSSSPLNSRPLLHFSFSRQV---RISSAAPILRRRRVLYHLNGRRYGPENDWFRADYISIPVVSCTRSGADTELD

Query:  LLESFDCVGTAQDVECVVSLTDEDPRSSIGQPLELENSGVDGDGSVALLAAAGKAWEFAVLVSPFFFWGTAMVAMKEVLPRSGPFFVSAFRLIPAGFLLI
          ES DCVGTAQDVECVVS TDE+    +G   + +N   DGDGSVA+L    KAWEFAVLVSPFFFWGTAMVAMKEVLPRSGPFFVSAFRL+PAGFLLI
Subjt:  LLESFDCVGTAQDVECVVSLTDEDPRSSIGQPLELENSGVDGDGSVALLAAAGKAWEFAVLVSPFFFWGTAMVAMKEVLPRSGPFFVSAFRLIPAGFLLI

Query:  AFAAFRGRSFPSGFSAWLSIVLFALVDATSFQGFLAQGLQRTSAGLGSVCFIFASYASLHRHLSASLQISEIVAVLAALLFGESISFIGAAGLVLGVLGL
        AFAAFRGR FPSGFSAW+SI+LFALVDAT FQGFLAQGLQRTSAGLGSV             +  S  ++  VAVLAA LFGESI  +GAAGLVLGVLGL
Subjt:  AFAAFRGRSFPSGFSAWLSIVLFALVDATSFQGFLAQGLQRTSAGLGSVCFIFASYASLHRHLSASLQISEIVAVLAALLFGESISFIGAAGLVLGVLGL

Query:  LLLEVPSLALDASSFSLWGSGEWWMFLAAQSMAVGTVMVRWVSKYSDPVMATGWHMVIGGFPLLMICILNHDPAVSGNLKDFSTNDILALLYASIFGSAI
        +LLEVPSL LDASSFSLWGSGEWWMFLAAQSMAVGTVMVRWVSKYSDP+MATGWHMVIGG PLLMIC LNH+PAVSG+L+DF+TNDILAL YASIFGSA+
Subjt:  LLLEVPSLALDASSFSLWGSGEWWMFLAAQSMAVGTVMVRWVSKYSDPVMATGWHMVIGGFPLLMICILNHDPAVSGNLKDFSTNDILALLYASIFGSAI

Query:  SYGSFFYSATKGEI
        SYGSFFYSATKG +
Subjt:  SYGSFFYSATKGEI

XP_023550684.1 WAT1-related protein At3g02690, chloroplastic [Cucurbita pepo subsp. pepo]1.3e-14973.91Show/hide
Query:  MAGCWPSICSYLPTPTLTTVSPKLSSSSSPLNSRPLLHFSFSRQV---RISSAAPILRRRRVLYHLNGRRYGPENDWFRADYISIPVVSCTRSGADTELD
        MAGC       L T TLTT SP+L SSSSP N RPL  FSF RQ+    ISS APILRRRRV + L  RRY  E   FR DY++IPV +CTRSGADT+LD
Subjt:  MAGCWPSICSYLPTPTLTTVSPKLSSSSSPLNSRPLLHFSFSRQV---RISSAAPILRRRRVLYHLNGRRYGPENDWFRADYISIPVVSCTRSGADTELD

Query:  LLESFDCVGTAQDVECVVSLTDEDPRSSIGQPLELENSGVDGDGSVALLAAAGKAWEFAVLVSPFFFWGTAMVAMKEVLPRSGPFFVSAFRLIPAGFLLI
          ES DCVGTAQDVECVVS TDE+    +G   + +N   DGDGSVA+L    KAWEFAVLVSPFFFWGTAMVAMKEVLPRSGPFFVSAFRL+PAGFLLI
Subjt:  LLESFDCVGTAQDVECVVSLTDEDPRSSIGQPLELENSGVDGDGSVALLAAAGKAWEFAVLVSPFFFWGTAMVAMKEVLPRSGPFFVSAFRLIPAGFLLI

Query:  AFAAFRGRSFPSGFSAWLSIVLFALVDATSFQGFLAQGLQRTSAGLGSVCFIFASYASLHRHLSASLQISEIVAVLAALLFGESISFIGAAGLVLGVLGL
        AFAAFRGR FPSGFSAW+SI+LFALVDAT FQGFLAQGLQRTSAGLGSV             +  S  ++  VA+LAA LFGESI  +GAAGLVLGVLGL
Subjt:  AFAAFRGRSFPSGFSAWLSIVLFALVDATSFQGFLAQGLQRTSAGLGSVCFIFASYASLHRHLSASLQISEIVAVLAALLFGESISFIGAAGLVLGVLGL

Query:  LLLEVPSLALDASSFSLWGSGEWWMFLAAQSMAVGTVMVRWVSKYSDPVMATGWHMVIGGFPLLMICILNHDPAVSGNLKDFSTNDILALLYASIFGSAI
        LLLEVPSL LDASSFSLWGSGEWWMFLAAQSMAVGTVMVRWVSKYSDP+MATGWHMVIGG PLL IC LNH+PAVSG+L+DF+TNDILAL YASIFGSA+
Subjt:  LLLEVPSLALDASSFSLWGSGEWWMFLAAQSMAVGTVMVRWVSKYSDPVMATGWHMVIGGFPLLMICILNHDPAVSGNLKDFSTNDILALLYASIFGSAI

Query:  SYGSFFYSATKGEI
        SYGSFFYSATKG +
Subjt:  SYGSFFYSATKGEI

XP_038882098.1 WAT1-related protein At3g02690, chloroplastic [Benincasa hispida]8.1e-15575.12Show/hide
Query:  MAGCWPSICSYLPTPTLTTVSPKLSSSSSPLNSRPLLHFSFSRQV---RISSAAPILRRRRVLYHLNGRRYGPENDWFRADYISIPVVSCTRSGADTELD
        MAGC   + +  PTPT          S+SP NSRPLLHFSF+RQ+    ISSAAPILRRR   +HL   RYG EN  FR DY++IPV +CTRSGADTELD
Subjt:  MAGCWPSICSYLPTPTLTTVSPKLSSSSSPLNSRPLLHFSFSRQV---RISSAAPILRRRRVLYHLNGRRYGPENDWFRADYISIPVVSCTRSGADTELD

Query:  LLESFDCVGTAQDVECVVSLTDEDPRSSIGQPLELENSGVDGDGSVALLAAAGKAWEFAVLVSPFFFWGTAMVAMKEVLPRSGPFFVSAFRLIPAGFLLI
        L ES DCVGTAQDVECV+S T EDP SS+   +   +S  DGDGSVA+L    KAWEFAVLVSPFFFWGTAMVAMKEVLPRSGPFFVSAFRLIPAGFLLI
Subjt:  LLESFDCVGTAQDVECVVSLTDEDPRSSIGQPLELENSGVDGDGSVALLAAAGKAWEFAVLVSPFFFWGTAMVAMKEVLPRSGPFFVSAFRLIPAGFLLI

Query:  AFAAFRGRSFPSGFSAWLSIVLFALVDATSFQGFLAQGLQRTSAGLGSVCFIFASYASLHRHLSASLQISEIVAVLAALLFGESISFIGAAGLVLGVLGL
        AFAAFRGR FPSGFSAW+SI+LFALVDAT FQGFLAQGLQRTSAGLGSV             +  S  ++  VAVLAA LFGESI  +GAAGLVLGVLGL
Subjt:  AFAAFRGRSFPSGFSAWLSIVLFALVDATSFQGFLAQGLQRTSAGLGSVCFIFASYASLHRHLSASLQISEIVAVLAALLFGESISFIGAAGLVLGVLGL

Query:  LLLEVPSLALDASSFSLWGSGEWWMFLAAQSMAVGTVMVRWVSKYSDPVMATGWHMVIGGFPLLMICILNHDPAVSGNLKDFSTNDILALLYASIFGSAI
        LLLEVPSL  DA+SFSLWGSGEWWMFLAAQSMAVGTVMVRWVSKYSDP+MATGWHMVIGG PLLMICILNHDPAVSG+LKDF+TNDILALLYASIFGSA+
Subjt:  LLLEVPSLALDASSFSLWGSGEWWMFLAAQSMAVGTVMVRWVSKYSDPVMATGWHMVIGGFPLLMICILNHDPAVSGNLKDFSTNDILALLYASIFGSAI

Query:  SYGSFFYSATKGEI
        SYGSFFYSATKG +
Subjt:  SYGSFFYSATKGEI

TrEMBL top hitse value%identityAlignment
A0A0A0KQD4 Uncharacterized protein1.3e-14773.06Show/hide
Query:  MAGCWPSICSYLPTPTLTTVSPKLSSSSSPLNSRPLLHFSFSRQVRISSAAPILRRRRVLYHLNGRRYGPENDWFRADYISIPVVSCTRSGADTELDLLE
        MAGC       L T TLT  SP         NS P  H        ISSAAPIL RRR+ + L  R    EN  FR  Y++IPV +CTRSG DTELD  E
Subjt:  MAGCWPSICSYLPTPTLTTVSPKLSSSSSPLNSRPLLHFSFSRQVRISSAAPILRRRRVLYHLNGRRYGPENDWFRADYISIPVVSCTRSGADTELDLLE

Query:  SFDCVGTAQDVECVVSLTDEDPRSSIGQPLELE-NSGVDGDGSVALLAAAGKAWEFAVLVSPFFFWGTAMVAMKEVLPRSGPFFVSAFRLIPAGFLLIAF
        S DCVGTAQDVECVVS  DEDP SSIG PL+L  +S   GDGSVA+L    KAWEFAVLVSPFFFWGTAMVAMKEVLPRSGPFFVSAFRLIPAGFLLIAF
Subjt:  SFDCVGTAQDVECVVSLTDEDPRSSIGQPLELE-NSGVDGDGSVALLAAAGKAWEFAVLVSPFFFWGTAMVAMKEVLPRSGPFFVSAFRLIPAGFLLIAF

Query:  AAFRGRSFPSGFSAWLSIVLFALVDATSFQGFLAQGLQRTSAGLGSVCFIFASYASLHRHLSASLQISEIVAVLAALLFGESISFIGAAGLVLGVLGLLL
        AAFRGR FPSGFSAW+SI+LFALVDAT FQGFLAQGLQRTSAGLGSV             +  S  ++  VAVLAA LFGES+  +GAAGLVLGVLGLLL
Subjt:  AAFRGRSFPSGFSAWLSIVLFALVDATSFQGFLAQGLQRTSAGLGSVCFIFASYASLHRHLSASLQISEIVAVLAALLFGESISFIGAAGLVLGVLGLLL

Query:  LEVPSLALDASSFSLWGSGEWWMFLAAQSMAVGTVMVRWVSKYSDPVMATGWHMVIGGFPLLMICILNHDPAVSGNLKDFSTNDILALLYASIFGSAISY
        LEVPSL  DA+SFSLWGSGEWWMFLAAQSMAVGTVMVRWVSKYSDP+MATGWHMVIGG PLLMICILNHDPAVSG+LKDF+TNDILALLYASIFGSA+SY
Subjt:  LEVPSLALDASSFSLWGSGEWWMFLAAQSMAVGTVMVRWVSKYSDPVMATGWHMVIGGFPLLMICILNHDPAVSGNLKDFSTNDILALLYASIFGSAISY

Query:  GSFFYSATKGEI
        GSFFYSATKG +
Subjt:  GSFFYSATKGEI

A0A1S3CK55 WAT1-related protein At3g02690, chloroplastic7.4e-14674.31Show/hide
Query:  LPTPTLTTVSPKLSSSSSPLNSRPLLHFSFSRQVRISSAAPILRRRRVLYHLNGRRYGPENDWFRADYISIPVVSCTRSGADTELDLLESFDCVGTAQDV
        L T TLT  SP    S+SP          F R   ISSA PILRRR       G RY  EN  FR  Y++IPV +CTRSG DTELD  ES DCVGTAQDV
Subjt:  LPTPTLTTVSPKLSSSSSPLNSRPLLHFSFSRQVRISSAAPILRRRRVLYHLNGRRYGPENDWFRADYISIPVVSCTRSGADTELDLLESFDCVGTAQDV

Query:  ECVVSLTDEDPRSSIGQPLELE-NSGVDGDGSVALLAAAGKAWEFAVLVSPFFFWGTAMVAMKEVLPRSGPFFVSAFRLIPAGFLLIAFAAFRGRSFPSG
        ECVVS TDEDP SSIG PLEL  +S   G GSVA+L    KAWEFAVLVSPFFFWGTAMVAMKEVLPRSGPFFVSAFRLIPAGFLL+AFAAFRGR FPSG
Subjt:  ECVVSLTDEDPRSSIGQPLELE-NSGVDGDGSVALLAAAGKAWEFAVLVSPFFFWGTAMVAMKEVLPRSGPFFVSAFRLIPAGFLLIAFAAFRGRSFPSG

Query:  FSAWLSIVLFALVDATSFQGFLAQGLQRTSAGLGSVCFIFASYASLHRHLSASLQISEIVAVLAALLFGESISFIGAAGLVLGVLGLLLLEVPSLALDAS
        FSAW+SI+LFALVDAT FQGFLAQGLQRTSAGLGSV             +  S  ++  VAVLAA LFGES+  IGAAGLVLGV GLLLLEVPSL  DA+
Subjt:  FSAWLSIVLFALVDATSFQGFLAQGLQRTSAGLGSVCFIFASYASLHRHLSASLQISEIVAVLAALLFGESISFIGAAGLVLGVLGLLLLEVPSLALDAS

Query:  SFSLWGSGEWWMFLAAQSMAVGTVMVRWVSKYSDPVMATGWHMVIGGFPLLMICILNHDPAVSGNLKDFSTNDILALLYASIFGSAISYGSFFYSATKGE
        SFSLWGSGEWWMFLAAQSMAVGTVMVRWVSKYSDP+MATGWHMVIGG PLLMICILNHDPAVSG+LKDF+TNDILALLYASIFGSA+SYGSFFYSATKG 
Subjt:  SFSLWGSGEWWMFLAAQSMAVGTVMVRWVSKYSDPVMATGWHMVIGGFPLLMICILNHDPAVSGNLKDFSTNDILALLYASIFGSAISYGSFFYSATKGE

Query:  I
        +
Subjt:  I

A0A6J1CI41 WAT1-related protein At3g02690, chloroplastic9.9e-15976.14Show/hide
Query:  MAGCWPSICSYLPTPTLTTVSPKLSSSSSPLNSRPLLHFSFSRQVRI----SSAAPILRRRRVLYHLNGRRYGPENDWFRADYISIPVVSCTRSGADTEL
        MAG WPS  S LPTP+               NSRPLLHFSFSRQ+ +    SS API+ RR   +  +  RYG  NDWFRADY+ IPVV+CTRSGADTEL
Subjt:  MAGCWPSICSYLPTPTLTTVSPKLSSSSSPLNSRPLLHFSFSRQVRI----SSAAPILRRRRVLYHLNGRRYGPENDWFRADYISIPVVSCTRSGADTEL

Query:  DLLESFDCVGTAQDVECVVSLTDEDPRSSIGQPLELENSGVDGDGSVALLAAAGKAWEFAVLVSPFFFWGTAMVAMKEVLPRSGPFFVSAFRLIPAGFLL
        DL ES DCVGTAQDVECVVS  DEDPRSSI        + VDGDGS+A+L   GKAWEFAVLVSPFFFWGTAMVAMKEVLPRSGPFFVSAFRLIPAGFLL
Subjt:  DLLESFDCVGTAQDVECVVSLTDEDPRSSIGQPLELENSGVDGDGSVALLAAAGKAWEFAVLVSPFFFWGTAMVAMKEVLPRSGPFFVSAFRLIPAGFLL

Query:  IAFAAFRGRSFPSGFSAWLSIVLFALVDATSFQGFLAQGLQRTSAGLGSVCFIFASYASLHRHLSASLQISEIVAVLAALLFGESISFIGAAGLVLGVLG
        IAFAAFRGR FPSGFSAW+SI LFALVDATSFQGFLAQGLQRTSAGLGSV             +  S  ++  VAVLAALLFGESIS IGAAGL+LGVLG
Subjt:  IAFAAFRGRSFPSGFSAWLSIVLFALVDATSFQGFLAQGLQRTSAGLGSVCFIFASYASLHRHLSASLQISEIVAVLAALLFGESISFIGAAGLVLGVLG

Query:  LLLLEVPSLALDASSFSLWGSGEWWMFLAAQSMAVGTVMVRWVSKYSDPVMATGWHMVIGGFPLLMICILNHDPAVSGNLKDFSTNDILALLYASIFGSA
        LLLLEVPSLALDAS+FSLWGSGEWWMFLAAQSMAVGTVMVRWVSKYSDP+MATGWHMVIGG PLL ICILNHDPA+SG+LKDF+TNDILALLYASIFGSA
Subjt:  LLLLEVPSLALDASSFSLWGSGEWWMFLAAQSMAVGTVMVRWVSKYSDPVMATGWHMVIGGFPLLMICILNHDPAVSGNLKDFSTNDILALLYASIFGSA

Query:  ISYGSFFYSATKGEI
        ISYGSFFYSATKG +
Subjt:  ISYGSFFYSATKGEI

A0A6J1FGD0 WAT1-related protein At3g02690, chloroplastic7.1e-14973.67Show/hide
Query:  MAGCWPSICSYLPTPTLTTVSPKLSSSSSPLNSRPLLHFSFSRQV---RISSAAPILRRRRVLYHLNGRRYGPENDWFRADYISIPVVSCTRSGADTELD
        MAGC       L T TLTT S +L SSSSP N RPL  FSF RQ+    ISS APILRRRRV + L  RRY  E   FR DY++IP ++CTRSGADT+LD
Subjt:  MAGCWPSICSYLPTPTLTTVSPKLSSSSSPLNSRPLLHFSFSRQV---RISSAAPILRRRRVLYHLNGRRYGPENDWFRADYISIPVVSCTRSGADTELD

Query:  LLESFDCVGTAQDVECVVSLTDEDPRSSIGQPLELENSGVDGDGSVALLAAAGKAWEFAVLVSPFFFWGTAMVAMKEVLPRSGPFFVSAFRLIPAGFLLI
          ES DCVGTAQDVECVVS TDE+    +G   + +N   DGDGSVA+L    KAWEFAVLVSPFFFWGTAMVAMKEVLPRSGPFFVSAFRL+PAGFLLI
Subjt:  LLESFDCVGTAQDVECVVSLTDEDPRSSIGQPLELENSGVDGDGSVALLAAAGKAWEFAVLVSPFFFWGTAMVAMKEVLPRSGPFFVSAFRLIPAGFLLI

Query:  AFAAFRGRSFPSGFSAWLSIVLFALVDATSFQGFLAQGLQRTSAGLGSVCFIFASYASLHRHLSASLQISEIVAVLAALLFGESISFIGAAGLVLGVLGL
        AFAAFRGR FPSGFSAW+SI+LFALVDAT FQGFLAQGLQRTSAGLGSV             +  S  ++  VAVLAA LFGESI  +GAAGLVLGVLGL
Subjt:  AFAAFRGRSFPSGFSAWLSIVLFALVDATSFQGFLAQGLQRTSAGLGSVCFIFASYASLHRHLSASLQISEIVAVLAALLFGESISFIGAAGLVLGVLGL

Query:  LLLEVPSLALDASSFSLWGSGEWWMFLAAQSMAVGTVMVRWVSKYSDPVMATGWHMVIGGFPLLMICILNHDPAVSGNLKDFSTNDILALLYASIFGSAI
        LLLEVPSL LDASSFSLWGSGEWWMFLAAQSMAVGTVMVRWVSKYSDP+MATGWHMVIGG PLL IC LNH+PAVSG+L+DF+TNDILAL YASIFGSA+
Subjt:  LLLEVPSLALDASSFSLWGSGEWWMFLAAQSMAVGTVMVRWVSKYSDPVMATGWHMVIGGFPLLMICILNHDPAVSGNLKDFSTNDILALLYASIFGSAI

Query:  SYGSFFYSATKGEI
        SYGSFFYSATKG +
Subjt:  SYGSFFYSATKGEI

A0A6J1JVE3 WAT1-related protein At3g02690, chloroplastic4.9e-15073.91Show/hide
Query:  MAGCWPSICSYLPTPTLTTVSPKLSSSSSPLNSRPLLHFSFSRQV---RISSAAPILRRRRVLYHLNGRRYGPENDWFRADYISIPVVSCTRSGADTELD
        MAGC       L T TLTT SP+L SS SP N RPL  FSF RQ+    ISS APILRRRRV + L  RRY  E   FR DY++IPV +CTRSGADT+LD
Subjt:  MAGCWPSICSYLPTPTLTTVSPKLSSSSSPLNSRPLLHFSFSRQV---RISSAAPILRRRRVLYHLNGRRYGPENDWFRADYISIPVVSCTRSGADTELD

Query:  LLESFDCVGTAQDVECVVSLTDEDPRSSIGQPLELENSGVDGDGSVALLAAAGKAWEFAVLVSPFFFWGTAMVAMKEVLPRSGPFFVSAFRLIPAGFLLI
          ES DCVGTAQDVECVVS TDE+    +G   + +N   DGDGSVA+L    KAWEFAVLVSPFFFWGTAMVAMKEVLPRSGPFFVSAFRL+PAGFLLI
Subjt:  LLESFDCVGTAQDVECVVSLTDEDPRSSIGQPLELENSGVDGDGSVALLAAAGKAWEFAVLVSPFFFWGTAMVAMKEVLPRSGPFFVSAFRLIPAGFLLI

Query:  AFAAFRGRSFPSGFSAWLSIVLFALVDATSFQGFLAQGLQRTSAGLGSVCFIFASYASLHRHLSASLQISEIVAVLAALLFGESISFIGAAGLVLGVLGL
        AFAAFRGR FPSGFSAW+SI+LFALVDAT FQGFLAQGLQRTSAGLGSV             +  S  ++  VAVLAA LFGESI  +GAAGLVLGVLGL
Subjt:  AFAAFRGRSFPSGFSAWLSIVLFALVDATSFQGFLAQGLQRTSAGLGSVCFIFASYASLHRHLSASLQISEIVAVLAALLFGESISFIGAAGLVLGVLGL

Query:  LLLEVPSLALDASSFSLWGSGEWWMFLAAQSMAVGTVMVRWVSKYSDPVMATGWHMVIGGFPLLMICILNHDPAVSGNLKDFSTNDILALLYASIFGSAI
        +LLEVPSL LDASSFSLWGSGEWWMFLAAQSMAVGTVMVRWVSKYSDP+MATGWHMVIGG PLLMIC LNH+PAVSG+L+DF+TNDILAL YASIFGSA+
Subjt:  LLLEVPSLALDASSFSLWGSGEWWMFLAAQSMAVGTVMVRWVSKYSDPVMATGWHMVIGGFPLLMICILNHDPAVSGNLKDFSTNDILALLYASIFGSAI

Query:  SYGSFFYSATKGEI
        SYGSFFYSATKG +
Subjt:  SYGSFFYSATKGEI

SwissProt top hitse value%identityAlignment
P74436 Uncharacterized transporter sll03557.3e-5047.01Show/hide
Query:  LVSPFFFWGTAMVAMKEVLPRSGPFFVSAFRLIPAGFLLIAFAAFRGRSFPSGFSAWLSIVLFALVDATSFQGFLAQGLQRTSAGLGSVCFIFASYASLH
        L++PFF WGTAMVAMK VL  + PFFV+  RLIPAG L++ +A  + R  P  +  W  I+LFALVD T FQGFLAQGL+RT AGLGSV       A   
Subjt:  LVSPFFFWGTAMVAMKEVLPRSGPFFVSAFRLIPAGFLLIAFAAFRGRSFPSGFSAWLSIVLFALVDATSFQGFLAQGLQRTSAGLGSVCFIFASYASLH

Query:  RHLSASLQISEIVAVLAALLFGESISFIGAAGLVLGVLGLLLLEVP-----------SLALDASSFSLWGSGEWWMFLAAQSMAVGTVMVRWVSKYSDPV
                    VA+L++ LF E I  IG  GL+LGV G+ L+ +P            L+++ S  +L  SGE WM LA+ SMAVGTV++ +VS+  DPV
Subjt:  RHLSASLQISEIVAVLAALLFGESISFIGAAGLVLGVLGLLLLEVP-----------SLALDASSFSLWGSGEWWMFLAAQSMAVGTVMVRWVSKYSDPV

Query:  MATGWHMVIGGFPLLMICIL-NHDPAVSGNLKDFSTNDILALLYASIFGSAISYGSFFYSATKGEIVA
        +ATGWHM+IGG PLL I ++ + +P  + +L  +       L YA++FGSAI+YG FFY A+KG + +
Subjt:  MATGWHMVIGGFPLLMICIL-NHDPAVSGNLKDFSTNDILALLYASIFGSAISYGSFFYSATKGEIVA

Q93V85 WAT1-related protein At3g02690, chloroplastic1.7e-9954.25Show/hide
Query:  TTVSPKLSSSSSPLNSRPLLHFSFSRQVRIS---SAAPILRRRRV--LYHLNGRRYGPENDWFRADYISIPVVSCTRSGADTE-LDLLESFDCVGTAQDV
        + ++   SSSSS   + P    S +R+  +S   ++   LR  R    ++L  RR          D +     +   S  +TE      S DCVG   DV
Subjt:  TTVSPKLSSSSSPLNSRPLLHFSFSRQVRIS---SAAPILRRRRV--LYHLNGRRYGPENDWFRADYISIPVVSCTRSGADTE-LDLLESFDCVGTAQDV

Query:  ECVVSLTDEDPRSSIGQPLELENSGVDGDGSVALLAAAGKAWEFAVLVSPFFFWGTAMVAMKEVLPRSGPFFVSAFRLIPAGFLLIAFAAFRGRSFPSGF
        ECV +  DE+ RS          SG+       L    G   E+ VL+SPFFFWGTAMVAMKEVLP +GPFFV+AFRLIPAG LL+AFA ++GR  P G 
Subjt:  ECVVSLTDEDPRSSIGQPLELENSGVDGDGSVALLAAAGKAWEFAVLVSPFFFWGTAMVAMKEVLPRSGPFFVSAFRLIPAGFLLIAFAAFRGRSFPSGF

Query:  SAWLSIVLFALVDATSFQGFLAQGLQRTSAGLGSVCFIFASYASLHRHLSASLQISEIVAVLAALLFGESISFIGAAGLVLGVLGLLLLEVPSLALDASS
        +AW SI LFALVDAT FQGFLAQGLQRTSAGLGSV             +  S  ++  VAVLA+ LFGESI  + A GL+LGV GLLLLEVPS+  D ++
Subjt:  SAWLSIVLFALVDATSFQGFLAQGLQRTSAGLGSVCFIFASYASLHRHLSASLQISEIVAVLAALLFGESISFIGAAGLVLGVLGLLLLEVPSLALDASS

Query:  FSLWGSGEWWMFLAAQSMAVGTVMVRWVSKYSDPVMATGWHMVIGGFPLLMICILNHDPAVSGNLKDFSTNDILALLYASIFGSAISYGSFFYSATKGEI
        FSLWGSGEWWM LAAQSMA+GTVMVRWVSKYSDP+MATGWHMVIGG PLL I ++NHDP  +G+L+D STND++ALLY SIFGSA+SYG +FYSATKG +
Subjt:  FSLWGSGEWWMFLAAQSMAVGTVMVRWVSKYSDPVMATGWHMVIGGFPLLMICILNHDPAVSGNLKDFSTNDILALLYASIFGSAISYGSFFYSATKGEI

Arabidopsis top hitse value%identityAlignment
AT3G02690.1 nodulin MtN21 /EamA-like transporter family protein1.2e-10054.25Show/hide
Query:  TTVSPKLSSSSSPLNSRPLLHFSFSRQVRIS---SAAPILRRRRV--LYHLNGRRYGPENDWFRADYISIPVVSCTRSGADTE-LDLLESFDCVGTAQDV
        + ++   SSSSS   + P    S +R+  +S   ++   LR  R    ++L  RR          D +     +   S  +TE      S DCVG   DV
Subjt:  TTVSPKLSSSSSPLNSRPLLHFSFSRQVRIS---SAAPILRRRRV--LYHLNGRRYGPENDWFRADYISIPVVSCTRSGADTE-LDLLESFDCVGTAQDV

Query:  ECVVSLTDEDPRSSIGQPLELENSGVDGDGSVALLAAAGKAWEFAVLVSPFFFWGTAMVAMKEVLPRSGPFFVSAFRLIPAGFLLIAFAAFRGRSFPSGF
        ECV +  DE+ RS          SG+       L    G   E+ VL+SPFFFWGTAMVAMKEVLP +GPFFV+AFRLIPAG LL+AFA ++GR  P G 
Subjt:  ECVVSLTDEDPRSSIGQPLELENSGVDGDGSVALLAAAGKAWEFAVLVSPFFFWGTAMVAMKEVLPRSGPFFVSAFRLIPAGFLLIAFAAFRGRSFPSGF

Query:  SAWLSIVLFALVDATSFQGFLAQGLQRTSAGLGSVCFIFASYASLHRHLSASLQISEIVAVLAALLFGESISFIGAAGLVLGVLGLLLLEVPSLALDASS
        +AW SI LFALVDAT FQGFLAQGLQRTSAGLGSV             +  S  ++  VAVLA+ LFGESI  + A GL+LGV GLLLLEVPS+  D ++
Subjt:  SAWLSIVLFALVDATSFQGFLAQGLQRTSAGLGSVCFIFASYASLHRHLSASLQISEIVAVLAALLFGESISFIGAAGLVLGVLGLLLLEVPSLALDASS

Query:  FSLWGSGEWWMFLAAQSMAVGTVMVRWVSKYSDPVMATGWHMVIGGFPLLMICILNHDPAVSGNLKDFSTNDILALLYASIFGSAISYGSFFYSATKGEI
        FSLWGSGEWWM LAAQSMA+GTVMVRWVSKYSDP+MATGWHMVIGG PLL I ++NHDP  +G+L+D STND++ALLY SIFGSA+SYG +FYSATKG +
Subjt:  FSLWGSGEWWMFLAAQSMAVGTVMVRWVSKYSDPVMATGWHMVIGGFPLLMICILNHDPAVSGNLKDFSTNDILALLYASIFGSAISYGSFFYSATKGEI


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGGGGTGTTGGCCCTCCATCTGTTCCTATTTACCAACTCCCACTCTTACCACCGTCTCTCCCAAGCTTTCCTCCTCTTCTTCTCCATTGAACTCCCGCCCTCTCCT
TCATTTCAGTTTCAGCAGACAAGTTCGTATTTCCTCCGCTGCTCCAATCCTTCGGCGACGACGAGTACTGTATCATTTGAATGGCAGAAGGTACGGCCCCGAAAATGACT
GGTTTCGTGCTGATTATATATCAATACCAGTGGTCAGTTGTACTAGAAGTGGCGCAGATACTGAATTGGATTTGTTGGAGTCCTTTGATTGCGTGGGGACTGCCCAGGAT
GTGGAGTGCGTGGTTTCCCTGACTGATGAAGATCCTCGATCTTCAATTGGGCAGCCACTAGAATTAGAAAATTCCGGTGTTGATGGTGATGGCTCTGTGGCACTCTTGGC
TGCTGCAGGAAAAGCCTGGGAGTTCGCGGTGTTGGTGTCGCCATTTTTCTTTTGGGGTACGGCTATGGTGGCCATGAAGGAGGTGCTTCCAAGGTCTGGTCCCTTTTTTG
TTTCCGCCTTTCGTCTTATACCTGCCGGTTTCCTTTTGATTGCCTTTGCTGCTTTCCGCGGTCGCTCCTTTCCCTCTGGTTTTTCTGCTTGGCTTTCCATCGTTCTCTTT
GCCCTAGTCGACGCTACATCATTTCAGGGCTTTCTTGCTCAAGGCTTGCAAAGGACATCCGCAGGCTTGGGCAGTGTATGTTTCATTTTTGCATCTTATGCCTCGTTACA
TAGACATCTTTCTGCTTCATTGCAAATTTCTGAAATTGTGGCGGTGCTTGCAGCTTTATTATTTGGTGAGTCCATCAGTTTCATTGGAGCTGCTGGACTTGTACTTGGTG
TTTTAGGACTTTTACTTCTTGAGGTTCCTTCACTTGCTTTAGATGCAAGTAGCTTTTCGTTATGGGGAAGCGGAGAGTGGTGGATGTTTCTAGCTGCACAGAGCATGGCA
GTGGGTACTGTCATGGTCCGCTGGGTTTCCAAGTACTCTGATCCTGTTATGGCAACTGGATGGCACATGGTGATTGGTGGTTTCCCGCTTCTGATGATCTGTATCCTTAA
TCACGATCCTGCCGTCAGTGGGAATCTTAAGGATTTTTCAACAAATGATATACTGGCGCTCCTTTATGCATCCATTTTTGGAAGTGCTATTAGCTACGGTTCATTCTTCT
ATAGTGCAACGAAAGGAGAAATTGTTGCAAGTTTGCAACTGAATACAAATGTAGAAGGAAAATTCTGTTTTTCACAGCATAGGAGAAGAATAGAGAAGGACAACAAAACG
AAAGAAGCAAATGGCAAAGGAGAGATCTATACAATTCTTCTAATCAGGTGGAGCCAGTGGCTGCAACAGGAATTGTATGGAGAACTTTACTGGCACTGCCAAGTCTGCCA
CGGCCAATCCCAGGTCTTTGGGAATCGGAGTCTCTCCTTCTCTCGGTCGTTCTTTGCCTGGTCTCGTCCACGTCGATCCCTTGTTTTGATTGCAAGCTACTCCCAGAGCC
TGGATTCGGATGGCCTTCCCCTCTGGTGTTCTGTCTTGATGTCCAGGAGTCAGTTGAATCAGCTGAGTTGGGTTCCCATGAAGCAGTTGAAATGCCCCAGTGGTGCTGTT
GAACACTGCCCACGAAACTGTTGCTACCAGAGGATTCCTGATCATCTGTGTCTCCCCCTCCAGCAACCACAGGGCCACCCCCCCTTCTCACCGTCACATTGGTATGTTGG
TGGATTGCTTCTATCCTCAGAGAAAGCACAAGAGAAATTTGAACTTGTAGTGGAATTGAGATCAGCAACTGCTTCAACTATGGAACAGGCCTTGGTTACACAAGCTGGAA
GAGAAACTGGTTCTCTAGATTGCCTTGCTGATCTGCTCCTTGCCAATCCCTGGGGTTTCTGGTGTTCCACCACAACTACCTCTGTGTTTGACAATGGTGTTGTGGAGGCG
CGGTGGTGGCTTTGCTACTATTCACTTGAGAAACTATTACTTTGTGACAACTTTTGGAATCAGTTTTTGGTTTCTGTGTCAAAAGATTTTAAGTTTCCTTTTTGCTTCTG
TTGTGAATCCTGTTGTGCGGCTGTGAGTTGGGGTCAATCTCACTCAATGGGTTTCTTCTGTGATGAAGGAATCCTTGAACCGGCCACGGCCGCCGCCTCGCCGACGTTTC
TTTTAACCGATATCCTCTTGACAGTGGCCACAGTCACTAAATCATTGGTACCACCACAACCACCAGCGACATTGTTGCTCTTGTCCATTTCCACATGCAGAAGTTTCCGC
CGACCGTCGGCCAGGGGACCGGCTGACACGCCGCTCTCGGCTGCTGGAGCGTTGGTTTTGGTCCCTTTCTCTACTTGGGGTCCTCCTCCTCCCTTGAGAGGAAGAGGAAT
GGCGTCGACCTCCTCTTTGGTGCAGCTCGAAGTTCTCACTACACCCACTTTCAAACTTTCACCCACTGCTCCGGATTCCAAAATTTCACAGGAAGAAGACGACATGGTAG
GAGTAGCAGCAGAGAAGACGGCTGCATTTCCCTCAGTCGGCGGTGGCTGGCTGGAGTTCTTGAGGCTGGGAAATGGGGATGAGGTTGCAGCCATTACAAGAGCTGGGATC
TGGAGGGGACCCAACACCAGAAGTCCTCCAAGAGTGAGGGATGTGGAGCGAGTTCAGCTAGAGAGAGAGAGATTGGGTGGTTTGAGGTTTGAAACTGAGAGCAGTGAAGG
AAAGAGATAA
mRNA sequenceShow/hide mRNA sequence
ATGGCGGGGTGTTGGCCCTCCATCTGTTCCTATTTACCAACTCCCACTCTTACCACCGTCTCTCCCAAGCTTTCCTCCTCTTCTTCTCCATTGAACTCCCGCCCTCTCCT
TCATTTCAGTTTCAGCAGACAAGTTCGTATTTCCTCCGCTGCTCCAATCCTTCGGCGACGACGAGTACTGTATCATTTGAATGGCAGAAGGTACGGCCCCGAAAATGACT
GGTTTCGTGCTGATTATATATCAATACCAGTGGTCAGTTGTACTAGAAGTGGCGCAGATACTGAATTGGATTTGTTGGAGTCCTTTGATTGCGTGGGGACTGCCCAGGAT
GTGGAGTGCGTGGTTTCCCTGACTGATGAAGATCCTCGATCTTCAATTGGGCAGCCACTAGAATTAGAAAATTCCGGTGTTGATGGTGATGGCTCTGTGGCACTCTTGGC
TGCTGCAGGAAAAGCCTGGGAGTTCGCGGTGTTGGTGTCGCCATTTTTCTTTTGGGGTACGGCTATGGTGGCCATGAAGGAGGTGCTTCCAAGGTCTGGTCCCTTTTTTG
TTTCCGCCTTTCGTCTTATACCTGCCGGTTTCCTTTTGATTGCCTTTGCTGCTTTCCGCGGTCGCTCCTTTCCCTCTGGTTTTTCTGCTTGGCTTTCCATCGTTCTCTTT
GCCCTAGTCGACGCTACATCATTTCAGGGCTTTCTTGCTCAAGGCTTGCAAAGGACATCCGCAGGCTTGGGCAGTGTATGTTTCATTTTTGCATCTTATGCCTCGTTACA
TAGACATCTTTCTGCTTCATTGCAAATTTCTGAAATTGTGGCGGTGCTTGCAGCTTTATTATTTGGTGAGTCCATCAGTTTCATTGGAGCTGCTGGACTTGTACTTGGTG
TTTTAGGACTTTTACTTCTTGAGGTTCCTTCACTTGCTTTAGATGCAAGTAGCTTTTCGTTATGGGGAAGCGGAGAGTGGTGGATGTTTCTAGCTGCACAGAGCATGGCA
GTGGGTACTGTCATGGTCCGCTGGGTTTCCAAGTACTCTGATCCTGTTATGGCAACTGGATGGCACATGGTGATTGGTGGTTTCCCGCTTCTGATGATCTGTATCCTTAA
TCACGATCCTGCCGTCAGTGGGAATCTTAAGGATTTTTCAACAAATGATATACTGGCGCTCCTTTATGCATCCATTTTTGGAAGTGCTATTAGCTACGGTTCATTCTTCT
ATAGTGCAACGAAAGGAGAAATTGTTGCAAGTTTGCAACTGAATACAAATGTAGAAGGAAAATTCTGTTTTTCACAGCATAGGAGAAGAATAGAGAAGGACAACAAAACG
AAAGAAGCAAATGGCAAAGGAGAGATCTATACAATTCTTCTAATCAGGTGGAGCCAGTGGCTGCAACAGGAATTGTATGGAGAACTTTACTGGCACTGCCAAGTCTGCCA
CGGCCAATCCCAGGTCTTTGGGAATCGGAGTCTCTCCTTCTCTCGGTCGTTCTTTGCCTGGTCTCGTCCACGTCGATCCCTTGTTTTGATTGCAAGCTACTCCCAGAGCC
TGGATTCGGATGGCCTTCCCCTCTGGTGTTCTGTCTTGATGTCCAGGAGTCAGTTGAATCAGCTGAGTTGGGTTCCCATGAAGCAGTTGAAATGCCCCAGTGGTGCTGTT
GAACACTGCCCACGAAACTGTTGCTACCAGAGGATTCCTGATCATCTGTGTCTCCCCCTCCAGCAACCACAGGGCCACCCCCCCTTCTCACCGTCACATTGGTATGTTGG
TGGATTGCTTCTATCCTCAGAGAAAGCACAAGAGAAATTTGAACTTGTAGTGGAATTGAGATCAGCAACTGCTTCAACTATGGAACAGGCCTTGGTTACACAAGCTGGAA
GAGAAACTGGTTCTCTAGATTGCCTTGCTGATCTGCTCCTTGCCAATCCCTGGGGTTTCTGGTGTTCCACCACAACTACCTCTGTGTTTGACAATGGTGTTGTGGAGGCG
CGGTGGTGGCTTTGCTACTATTCACTTGAGAAACTATTACTTTGTGACAACTTTTGGAATCAGTTTTTGGTTTCTGTGTCAAAAGATTTTAAGTTTCCTTTTTGCTTCTG
TTGTGAATCCTGTTGTGCGGCTGTGAGTTGGGGTCAATCTCACTCAATGGGTTTCTTCTGTGATGAAGGAATCCTTGAACCGGCCACGGCCGCCGCCTCGCCGACGTTTC
TTTTAACCGATATCCTCTTGACAGTGGCCACAGTCACTAAATCATTGGTACCACCACAACCACCAGCGACATTGTTGCTCTTGTCCATTTCCACATGCAGAAGTTTCCGC
CGACCGTCGGCCAGGGGACCGGCTGACACGCCGCTCTCGGCTGCTGGAGCGTTGGTTTTGGTCCCTTTCTCTACTTGGGGTCCTCCTCCTCCCTTGAGAGGAAGAGGAAT
GGCGTCGACCTCCTCTTTGGTGCAGCTCGAAGTTCTCACTACACCCACTTTCAAACTTTCACCCACTGCTCCGGATTCCAAAATTTCACAGGAAGAAGACGACATGGTAG
GAGTAGCAGCAGAGAAGACGGCTGCATTTCCCTCAGTCGGCGGTGGCTGGCTGGAGTTCTTGAGGCTGGGAAATGGGGATGAGGTTGCAGCCATTACAAGAGCTGGGATC
TGGAGGGGACCCAACACCAGAAGTCCTCCAAGAGTGAGGGATGTGGAGCGAGTTCAGCTAGAGAGAGAGAGATTGGGTGGTTTGAGGTTTGAAACTGAGAGCAGTGAAGG
AAAGAGATAA
Protein sequenceShow/hide protein sequence
MAGCWPSICSYLPTPTLTTVSPKLSSSSSPLNSRPLLHFSFSRQVRISSAAPILRRRRVLYHLNGRRYGPENDWFRADYISIPVVSCTRSGADTELDLLESFDCVGTAQD
VECVVSLTDEDPRSSIGQPLELENSGVDGDGSVALLAAAGKAWEFAVLVSPFFFWGTAMVAMKEVLPRSGPFFVSAFRLIPAGFLLIAFAAFRGRSFPSGFSAWLSIVLF
ALVDATSFQGFLAQGLQRTSAGLGSVCFIFASYASLHRHLSASLQISEIVAVLAALLFGESISFIGAAGLVLGVLGLLLLEVPSLALDASSFSLWGSGEWWMFLAAQSMA
VGTVMVRWVSKYSDPVMATGWHMVIGGFPLLMICILNHDPAVSGNLKDFSTNDILALLYASIFGSAISYGSFFYSATKGEIVASLQLNTNVEGKFCFSQHRRRIEKDNKT
KEANGKGEIYTILLIRWSQWLQQELYGELYWHCQVCHGQSQVFGNRSLSFSRSFFAWSRPRRSLVLIASYSQSLDSDGLPLWCSVLMSRSQLNQLSWVPMKQLKCPSGAV
EHCPRNCCYQRIPDHLCLPLQQPQGHPPFSPSHWYVGGLLLSSEKAQEKFELVVELRSATASTMEQALVTQAGRETGSLDCLADLLLANPWGFWCSTTTTSVFDNGVVEA
RWWLCYYSLEKLLLCDNFWNQFLVSVSKDFKFPFCFCCESCCAAVSWGQSHSMGFFCDEGILEPATAAASPTFLLTDILLTVATVTKSLVPPQPPATLLLLSISTCRSFR
RPSARGPADTPLSAAGALVLVPFSTWGPPPPLRGRGMASTSSLVQLEVLTTPTFKLSPTAPDSKISQEEDDMVGVAAEKTAAFPSVGGGWLEFLRLGNGDEVAAITRAGI
WRGPNTRSPPRVRDVERVQLERERLGGLRFETESSEGKR