; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0004791 (gene) of Snake gourd v1 genome

Gene IDTan0004791
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionWAT1-related protein At3g02690, chloroplastic
Genome locationLG11:13238828..13248477
RNA-Seq ExpressionTan0004791
SyntenyTan0004791
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR000620 - EamA domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6579056.1 WAT1-related protein, chloroplastic, partial [Cucurbita argyrosperma subsp. sororia]8.6e-19284.81Show/hide
Query:  MAGCLATATLTPT-------SPSNCRRPLLHFSFSRPILAPDISSAA---LRRRVPFRLRFRRYGENNWFRADYVAV---NCTRSGADTEFDLSESIDCV
        MAGCLAT TLT T       SPSNC RPL  FSF R I+   ISS A    RRRVPF+L FRRY E  WFR DYVA+   NCTRSGADT+ D +ESIDCV
Subjt:  MAGCLATATLTPT-------SPSNCRRPLLHFSFSRPILAPDISSAA---LRRRVPFRLRFRRYGENNWFRADYVAV---NCTRSGADTEFDLSESIDCV

Query:  GTAQDVECVVSPTDEDSLSSIGKPLELGLS---DVGGVDGSVAVLEKAWEFAVLVSPFFFWGTAMVAMKEVLPRSGPFFVSAFRLIPAGFLLIAFAALRG
        GTAQDVECVVS TDE          ELG+S   D  G DGSVAVLEKAWEFAVLVSPFFFWGTAMVAMKEVLPRSGPFFVSAFRL+PAGFLLIAFAA RG
Subjt:  GTAQDVECVVSPTDEDSLSSIGKPLELGLS---DVGGVDGSVAVLEKAWEFAVLVSPFFFWGTAMVAMKEVLPRSGPFFVSAFRLIPAGFLLIAFAALRG

Query:  RPFPSGFSAWISILLFALVDATSFQGFLAQGLQRTSAGLGSVIIDSQPLTVAVLAAFLFGESISLVGAAGLVLGVLGLLLLEVPSLSLDASSFSLWGSGE
        RPFPSGFSAWISILLFALVDAT FQGFLAQGLQRTSAGLGSVIIDSQPLTVAVLAAFLFGESI LVGAAGLVLGVLGLLLLEVPSL+LDASSFSLWGSGE
Subjt:  RPFPSGFSAWISILLFALVDATSFQGFLAQGLQRTSAGLGSVIIDSQPLTVAVLAAFLFGESISLVGAAGLVLGVLGLLLLEVPSLSLDASSFSLWGSGE

Query:  WWMFLAAQSMAVGTVMVRWVSKYSDPIMATGWHMVIGGLPLLMICFLNHDPAVSGSLKDFTTNDILALLYASIFGSAVSYGSFFYSATKGSLTKLSSLTF
        WWMFLAAQSMAVGTVMVRWVSKYSDPIMATGWHMVIGGLPLL ICFLNH+PAVSGSL+DFTTNDILAL YASIFGSAVSYGSFFYSATKGSLTKLSSLTF
Subjt:  WWMFLAAQSMAVGTVMVRWVSKYSDPIMATGWHMVIGGLPLLMICFLNHDPAVSGSLKDFTTNDILALLYASIFGSAVSYGSFFYSATKGSLTKLSSLTF

Query:  LTPMFASVFGFLYLGETFSPIQLVGAVVTVIAIYVVNYGSS
        LTPMFAS+FGFLYLGETFSPIQLVGAVVTV++IYVVNYGSS
Subjt:  LTPMFASVFGFLYLGETFSPIQLVGAVVTVIAIYVVNYGSS

XP_004140354.1 WAT1-related protein At3g02690, chloroplastic isoform X3 [Cucumis sativus]1.2e-19687.44Show/hide
Query:  MAGCLATATLTPTSPSNCRRPLLHFSFSRPILAPDISSAA--LRRRVPFRLRFR-RYGENNWFRADYVAV---NCTRSGADTEFDLSESIDCVGTAQDVE
        MAGCLATATLTPTSPSN   P  H        AP ISSAA  LRRR+PF+L FR RY EN+ FR  YVA+   NCTRSG DTE D +ESIDCVGTAQDVE
Subjt:  MAGCLATATLTPTSPSNCRRPLLHFSFSRPILAPDISSAA--LRRRVPFRLRFR-RYGENNWFRADYVAV---NCTRSGADTEFDLSESIDCVGTAQDVE

Query:  CVVSPTDEDSLSSIGKPLELGLSDVGGVDGSVAVLEKAWEFAVLVSPFFFWGTAMVAMKEVLPRSGPFFVSAFRLIPAGFLLIAFAALRGRPFPSGFSAW
        CVVSP DED  SSIG PL+LG+S     DGSVAVLEKAWEFAVLVSPFFFWGTAMVAMKEVLPRSGPFFVSAFRLIPAGFLLIAFAA RGRPFPSGFSAW
Subjt:  CVVSPTDEDSLSSIGKPLELGLSDVGGVDGSVAVLEKAWEFAVLVSPFFFWGTAMVAMKEVLPRSGPFFVSAFRLIPAGFLLIAFAALRGRPFPSGFSAW

Query:  ISILLFALVDATSFQGFLAQGLQRTSAGLGSVIIDSQPLTVAVLAAFLFGESISLVGAAGLVLGVLGLLLLEVPSLSLDASSFSLWGSGEWWMFLAAQSM
        ISI+LFALVDAT FQGFLAQGLQRTSAGLGSVIIDSQPLTVAVLAAFLFGES+ LVGAAGLVLGVLGLLLLEVPSL+ DA+SFSLWGSGEWWMFLAAQSM
Subjt:  ISILLFALVDATSFQGFLAQGLQRTSAGLGSVIIDSQPLTVAVLAAFLFGESISLVGAAGLVLGVLGLLLLEVPSLSLDASSFSLWGSGEWWMFLAAQSM

Query:  AVGTVMVRWVSKYSDPIMATGWHMVIGGLPLLMICFLNHDPAVSGSLKDFTTNDILALLYASIFGSAVSYGSFFYSATKGSLTKLSSLTFLTPMFASVFG
        AVGTVMVRWVSKYSDPIMATGWHMVIGGLPLLMIC LNHDPAVSGSLKDFTTNDILALLYASIFGSAVSYGSFFYSATKGSLTKLSSLTFLTPMFASVFG
Subjt:  AVGTVMVRWVSKYSDPIMATGWHMVIGGLPLLMICFLNHDPAVSGSLKDFTTNDILALLYASIFGSAVSYGSFFYSATKGSLTKLSSLTFLTPMFASVFG

Query:  FLYLGETFSPIQLVGAVVTVIAIYVVNYGS
        FLYLGETFSPIQLVGAVVTV+AIYVVNYGS
Subjt:  FLYLGETFSPIQLVGAVVTVIAIYVVNYGS

XP_008463172.1 PREDICTED: WAT1-related protein At3g02690, chloroplastic [Cucumis melo]1.6e-19386.11Show/hide
Query:  MAGCLATATLTPTSPSNCRRPLLHFSFSRPIL-APDISSA--ALRRRVPFRLRF-RRYGENNWFRADYVAV---NCTRSGADTEFDLSESIDCVGTAQDV
        MA CLATATLTPTSPSN          S P   AP ISSA   LRRR+PFRL F  RY EN  FR  YVA+   NCTRSG DTE D +ESIDCVGTAQDV
Subjt:  MAGCLATATLTPTSPSNCRRPLLHFSFSRPIL-APDISSA--ALRRRVPFRLRF-RRYGENNWFRADYVAV---NCTRSGADTEFDLSESIDCVGTAQDV

Query:  ECVVSPTDEDSLSSIGKPLELGLSDVGGVDGSVAVLEKAWEFAVLVSPFFFWGTAMVAMKEVLPRSGPFFVSAFRLIPAGFLLIAFAALRGRPFPSGFSA
        ECVVSPTDED  SSIG PLELG+S      GSVAVLEKAWEFAVLVSPFFFWGTAMVAMKEVLPRSGPFFVSAFRLIPAGFLL+AFAA RGRPFPSGFSA
Subjt:  ECVVSPTDEDSLSSIGKPLELGLSDVGGVDGSVAVLEKAWEFAVLVSPFFFWGTAMVAMKEVLPRSGPFFVSAFRLIPAGFLLIAFAALRGRPFPSGFSA

Query:  WISILLFALVDATSFQGFLAQGLQRTSAGLGSVIIDSQPLTVAVLAAFLFGESISLVGAAGLVLGVLGLLLLEVPSLSLDASSFSLWGSGEWWMFLAAQS
        WISI+LFALVDAT FQGFLAQGLQRTSAGLGSVIIDSQPLTVAVLAAFLFGES+ L+GAAGLVLGV GLLLLEVPSL+ DA+SFSLWGSGEWWMFLAAQS
Subjt:  WISILLFALVDATSFQGFLAQGLQRTSAGLGSVIIDSQPLTVAVLAAFLFGESISLVGAAGLVLGVLGLLLLEVPSLSLDASSFSLWGSGEWWMFLAAQS

Query:  MAVGTVMVRWVSKYSDPIMATGWHMVIGGLPLLMICFLNHDPAVSGSLKDFTTNDILALLYASIFGSAVSYGSFFYSATKGSLTKLSSLTFLTPMFASVF
        MAVGTVMVRWVSKYSDPIMATGWHMVIGGLPLLMIC LNHDPAVSGSLKDFTTNDILALLYASIFGSAVSYGSFFYSATKGSLTKLSSLTFLTPMFASVF
Subjt:  MAVGTVMVRWVSKYSDPIMATGWHMVIGGLPLLMICFLNHDPAVSGSLKDFTTNDILALLYASIFGSAVSYGSFFYSATKGSLTKLSSLTFLTPMFASVF

Query:  GFLYLGETFSPIQLVGAVVTVIAIYVVNYGSS
        GFLYLGETFSPIQLVGAVVTV+AIY VNYGSS
Subjt:  GFLYLGETFSPIQLVGAVVTVIAIYVVNYGSS

XP_022993106.1 WAT1-related protein At3g02690, chloroplastic [Cucurbita maxima]4.3e-19184.81Show/hide
Query:  MAGCLATATLTPT-------SPSNCRRPLLHFSFSRPILAPDISSAA---LRRRVPFRLRFRRYGENNWFRADYVAV---NCTRSGADTEFDLSESIDCV
        MAGCLATATLT T       SPSNC RPL  FSF R I+   ISS A    RRRVPF+L FRRY E   FR DYVA+   NCTRSGADT+ D +ESIDCV
Subjt:  MAGCLATATLTPT-------SPSNCRRPLLHFSFSRPILAPDISSAA---LRRRVPFRLRFRRYGENNWFRADYVAV---NCTRSGADTEFDLSESIDCV

Query:  GTAQDVECVVSPTDEDSLSSIGKPLELGLS---DVGGVDGSVAVLEKAWEFAVLVSPFFFWGTAMVAMKEVLPRSGPFFVSAFRLIPAGFLLIAFAALRG
        GTAQDVECVVS TDE          ELG+S   D  G DGSVAVLEKAWEFAVLVSPFFFWGTAMVAMKEVLPRSGPFFVSAFRL+PAGFLLIAFAA RG
Subjt:  GTAQDVECVVSPTDEDSLSSIGKPLELGLS---DVGGVDGSVAVLEKAWEFAVLVSPFFFWGTAMVAMKEVLPRSGPFFVSAFRLIPAGFLLIAFAALRG

Query:  RPFPSGFSAWISILLFALVDATSFQGFLAQGLQRTSAGLGSVIIDSQPLTVAVLAAFLFGESISLVGAAGLVLGVLGLLLLEVPSLSLDASSFSLWGSGE
        RPFPSGFSAWISILLFALVDAT FQGFLAQGLQRTSAGLGSVIIDSQPLTVAVLAAFLFGESI LVGAAGLVLGVLGL+LLEVPSL+LDASSFSLWGSGE
Subjt:  RPFPSGFSAWISILLFALVDATSFQGFLAQGLQRTSAGLGSVIIDSQPLTVAVLAAFLFGESISLVGAAGLVLGVLGLLLLEVPSLSLDASSFSLWGSGE

Query:  WWMFLAAQSMAVGTVMVRWVSKYSDPIMATGWHMVIGGLPLLMICFLNHDPAVSGSLKDFTTNDILALLYASIFGSAVSYGSFFYSATKGSLTKLSSLTF
        WWMFLAAQSMAVGTVMVRWVSKYSDPIMATGWHMVIGGLPLLMICFLNH+PAVSGSL+DFTTNDILAL YASIFGSAVSYGSFFYSATKGSLTKLSSLTF
Subjt:  WWMFLAAQSMAVGTVMVRWVSKYSDPIMATGWHMVIGGLPLLMICFLNHDPAVSGSLKDFTTNDILALLYASIFGSAVSYGSFFYSATKGSLTKLSSLTF

Query:  LTPMFASVFGFLYLGETFSPIQLVGAVVTVIAIYVVNYGSS
        LTPMFAS+FGFLYLGETFSPIQLVGAVVTV++IYVVNYGSS
Subjt:  LTPMFASVFGFLYLGETFSPIQLVGAVVTVIAIYVVNYGSS

XP_038882098.1 WAT1-related protein At3g02690, chloroplastic [Benincasa hispida]1.4e-19787.73Show/hide
Query:  MAGCLATATLTP--TSPSNCRRPLLHFSFSRPILAPDISSAA--LRRRVPFRLRFRRYGENNWFRADYVAV---NCTRSGADTEFDLSESIDCVGTAQDV
        MAGCLATAT TP  TSPSN  RPLLHFSF+R IL P ISSAA  LRRR PF L F RYGEN  FR DYVA+   NCTRSGADTE DL+ESIDCVGTAQDV
Subjt:  MAGCLATATLTP--TSPSNCRRPLLHFSFSRPILAPDISSAA--LRRRVPFRLRFRRYGENNWFRADYVAV---NCTRSGADTEFDLSESIDCVGTAQDV

Query:  ECVVSPTDEDSLSSIGKPLELGLSDVGGVDGSVAVLEKAWEFAVLVSPFFFWGTAMVAMKEVLPRSGPFFVSAFRLIPAGFLLIAFAALRGRPFPSGFSA
        ECV+S T ED  SS+    E G+S     DGSVAVLEKAWEFAVLVSPFFFWGTAMVAMKEVLPRSGPFFVSAFRLIPAGFLLIAFAA RGRPFPSGFSA
Subjt:  ECVVSPTDEDSLSSIGKPLELGLSDVGGVDGSVAVLEKAWEFAVLVSPFFFWGTAMVAMKEVLPRSGPFFVSAFRLIPAGFLLIAFAALRGRPFPSGFSA

Query:  WISILLFALVDATSFQGFLAQGLQRTSAGLGSVIIDSQPLTVAVLAAFLFGESISLVGAAGLVLGVLGLLLLEVPSLSLDASSFSLWGSGEWWMFLAAQS
        WISI+LFALVDAT FQGFLAQGLQRTSAGLGSVIIDSQPLTVAVLAAFLFGESI LVGAAGLVLGVLGLLLLEVPSL+ DA+SFSLWGSGEWWMFLAAQS
Subjt:  WISILLFALVDATSFQGFLAQGLQRTSAGLGSVIIDSQPLTVAVLAAFLFGESISLVGAAGLVLGVLGLLLLEVPSLSLDASSFSLWGSGEWWMFLAAQS

Query:  MAVGTVMVRWVSKYSDPIMATGWHMVIGGLPLLMICFLNHDPAVSGSLKDFTTNDILALLYASIFGSAVSYGSFFYSATKGSLTKLSSLTFLTPMFASVF
        MAVGTVMVRWVSKYSDPIMATGWHMVIGGLPLLMIC LNHDPAVSGSLKDFTTNDILALLYASIFGSAVSYGSFFYSATKGSLTKLSSLTFLTPMFAS F
Subjt:  MAVGTVMVRWVSKYSDPIMATGWHMVIGGLPLLMICFLNHDPAVSGSLKDFTTNDILALLYASIFGSAVSYGSFFYSATKGSLTKLSSLTFLTPMFASVF

Query:  GFLYLGETFSPIQLVGAVVTVIAIYVVNYGSS
        GFLYLGETFSPIQLVGAVVTV+AIYVVNYG++
Subjt:  GFLYLGETFSPIQLVGAVVTVIAIYVVNYGSS

TrEMBL top hitse value%identityAlignment
A0A0A0KQD4 Uncharacterized protein5.6e-19787.44Show/hide
Query:  MAGCLATATLTPTSPSNCRRPLLHFSFSRPILAPDISSAA--LRRRVPFRLRFR-RYGENNWFRADYVAV---NCTRSGADTEFDLSESIDCVGTAQDVE
        MAGCLATATLTPTSPSN   P  H        AP ISSAA  LRRR+PF+L FR RY EN+ FR  YVA+   NCTRSG DTE D +ESIDCVGTAQDVE
Subjt:  MAGCLATATLTPTSPSNCRRPLLHFSFSRPILAPDISSAA--LRRRVPFRLRFR-RYGENNWFRADYVAV---NCTRSGADTEFDLSESIDCVGTAQDVE

Query:  CVVSPTDEDSLSSIGKPLELGLSDVGGVDGSVAVLEKAWEFAVLVSPFFFWGTAMVAMKEVLPRSGPFFVSAFRLIPAGFLLIAFAALRGRPFPSGFSAW
        CVVSP DED  SSIG PL+LG+S     DGSVAVLEKAWEFAVLVSPFFFWGTAMVAMKEVLPRSGPFFVSAFRLIPAGFLLIAFAA RGRPFPSGFSAW
Subjt:  CVVSPTDEDSLSSIGKPLELGLSDVGGVDGSVAVLEKAWEFAVLVSPFFFWGTAMVAMKEVLPRSGPFFVSAFRLIPAGFLLIAFAALRGRPFPSGFSAW

Query:  ISILLFALVDATSFQGFLAQGLQRTSAGLGSVIIDSQPLTVAVLAAFLFGESISLVGAAGLVLGVLGLLLLEVPSLSLDASSFSLWGSGEWWMFLAAQSM
        ISI+LFALVDAT FQGFLAQGLQRTSAGLGSVIIDSQPLTVAVLAAFLFGES+ LVGAAGLVLGVLGLLLLEVPSL+ DA+SFSLWGSGEWWMFLAAQSM
Subjt:  ISILLFALVDATSFQGFLAQGLQRTSAGLGSVIIDSQPLTVAVLAAFLFGESISLVGAAGLVLGVLGLLLLEVPSLSLDASSFSLWGSGEWWMFLAAQSM

Query:  AVGTVMVRWVSKYSDPIMATGWHMVIGGLPLLMICFLNHDPAVSGSLKDFTTNDILALLYASIFGSAVSYGSFFYSATKGSLTKLSSLTFLTPMFASVFG
        AVGTVMVRWVSKYSDPIMATGWHMVIGGLPLLMIC LNHDPAVSGSLKDFTTNDILALLYASIFGSAVSYGSFFYSATKGSLTKLSSLTFLTPMFASVFG
Subjt:  AVGTVMVRWVSKYSDPIMATGWHMVIGGLPLLMICFLNHDPAVSGSLKDFTTNDILALLYASIFGSAVSYGSFFYSATKGSLTKLSSLTFLTPMFASVFG

Query:  FLYLGETFSPIQLVGAVVTVIAIYVVNYGS
        FLYLGETFSPIQLVGAVVTV+AIYVVNYGS
Subjt:  FLYLGETFSPIQLVGAVVTVIAIYVVNYGS

A0A1S3CK55 WAT1-related protein At3g02690, chloroplastic7.6e-19486.11Show/hide
Query:  MAGCLATATLTPTSPSNCRRPLLHFSFSRPIL-APDISSA--ALRRRVPFRLRF-RRYGENNWFRADYVAV---NCTRSGADTEFDLSESIDCVGTAQDV
        MA CLATATLTPTSPSN          S P   AP ISSA   LRRR+PFRL F  RY EN  FR  YVA+   NCTRSG DTE D +ESIDCVGTAQDV
Subjt:  MAGCLATATLTPTSPSNCRRPLLHFSFSRPIL-APDISSA--ALRRRVPFRLRF-RRYGENNWFRADYVAV---NCTRSGADTEFDLSESIDCVGTAQDV

Query:  ECVVSPTDEDSLSSIGKPLELGLSDVGGVDGSVAVLEKAWEFAVLVSPFFFWGTAMVAMKEVLPRSGPFFVSAFRLIPAGFLLIAFAALRGRPFPSGFSA
        ECVVSPTDED  SSIG PLELG+S      GSVAVLEKAWEFAVLVSPFFFWGTAMVAMKEVLPRSGPFFVSAFRLIPAGFLL+AFAA RGRPFPSGFSA
Subjt:  ECVVSPTDEDSLSSIGKPLELGLSDVGGVDGSVAVLEKAWEFAVLVSPFFFWGTAMVAMKEVLPRSGPFFVSAFRLIPAGFLLIAFAALRGRPFPSGFSA

Query:  WISILLFALVDATSFQGFLAQGLQRTSAGLGSVIIDSQPLTVAVLAAFLFGESISLVGAAGLVLGVLGLLLLEVPSLSLDASSFSLWGSGEWWMFLAAQS
        WISI+LFALVDAT FQGFLAQGLQRTSAGLGSVIIDSQPLTVAVLAAFLFGES+ L+GAAGLVLGV GLLLLEVPSL+ DA+SFSLWGSGEWWMFLAAQS
Subjt:  WISILLFALVDATSFQGFLAQGLQRTSAGLGSVIIDSQPLTVAVLAAFLFGESISLVGAAGLVLGVLGLLLLEVPSLSLDASSFSLWGSGEWWMFLAAQS

Query:  MAVGTVMVRWVSKYSDPIMATGWHMVIGGLPLLMICFLNHDPAVSGSLKDFTTNDILALLYASIFGSAVSYGSFFYSATKGSLTKLSSLTFLTPMFASVF
        MAVGTVMVRWVSKYSDPIMATGWHMVIGGLPLLMIC LNHDPAVSGSLKDFTTNDILALLYASIFGSAVSYGSFFYSATKGSLTKLSSLTFLTPMFASVF
Subjt:  MAVGTVMVRWVSKYSDPIMATGWHMVIGGLPLLMICFLNHDPAVSGSLKDFTTNDILALLYASIFGSAVSYGSFFYSATKGSLTKLSSLTFLTPMFASVF

Query:  GFLYLGETFSPIQLVGAVVTVIAIYVVNYGSS
        GFLYLGETFSPIQLVGAVVTV+AIY VNYGSS
Subjt:  GFLYLGETFSPIQLVGAVVTVIAIYVVNYGSS

A0A6J1CI41 WAT1-related protein At3g02690, chloroplastic5.6e-18984.49Show/hide
Query:  MAGCLATATLTPTSPSNCRRPLLHFSFSRPIL--APDISSAAL-RRRVPFRLRFR-RYGENNWFRADYV---AVNCTRSGADTEFDLSESIDCVGTAQDV
        MAG   +A+ +  +PSN  RPLLHFSFSR +L  +P+ S A +  RRV F      RYG N+WFRADYV    VNCTRSGADTE DLSES+DCVGTAQDV
Subjt:  MAGCLATATLTPTSPSNCRRPLLHFSFSRPIL--APDISSAAL-RRRVPFRLRFR-RYGENNWFRADYV---AVNCTRSGADTEFDLSESIDCVGTAQDV

Query:  ECVVSPTDEDSLSSIGKPLELGLSDVGGVDGSVAVLEKAWEFAVLVSPFFFWGTAMVAMKEVLPRSGPFFVSAFRLIPAGFLLIAFAALRGRPFPSGFSA
        ECVVSP DED  SSI        +DV G DGS+AVL KAWEFAVLVSPFFFWGTAMVAMKEVLPRSGPFFVSAFRLIPAGFLLIAFAA RGRPFPSGFSA
Subjt:  ECVVSPTDEDSLSSIGKPLELGLSDVGGVDGSVAVLEKAWEFAVLVSPFFFWGTAMVAMKEVLPRSGPFFVSAFRLIPAGFLLIAFAALRGRPFPSGFSA

Query:  WISILLFALVDATSFQGFLAQGLQRTSAGLGSVIIDSQPLTVAVLAAFLFGESISLVGAAGLVLGVLGLLLLEVPSLSLDASSFSLWGSGEWWMFLAAQS
        WISI LFALVDATSFQGFLAQGLQRTSAGLGSVIIDSQPLTVAVLAA LFGESISL+GAAGL+LGVLGLLLLEVPSL+LDAS+FSLWGSGEWWMFLAAQS
Subjt:  WISILLFALVDATSFQGFLAQGLQRTSAGLGSVIIDSQPLTVAVLAAFLFGESISLVGAAGLVLGVLGLLLLEVPSLSLDASSFSLWGSGEWWMFLAAQS

Query:  MAVGTVMVRWVSKYSDPIMATGWHMVIGGLPLLMICFLNHDPAVSGSLKDFTTNDILALLYASIFGSAVSYGSFFYSATKGSLTKLSSLTFLTPMFASVF
        MAVGTVMVRWVSKYSDPIMATGWHMVIGGLPLL IC LNHDPA+SGSLKDFTTNDILALLYASIFGSA+SYGSFFYSATKGSLTKLSSLTFLTPMFASVF
Subjt:  MAVGTVMVRWVSKYSDPIMATGWHMVIGGLPLLMICFLNHDPAVSGSLKDFTTNDILALLYASIFGSAVSYGSFFYSATKGSLTKLSSLTFLTPMFASVF

Query:  GFLYLGETFSPIQLVGAVVTVIAIYVVNYGSS
        GFLYLGETFSPIQLVGAVVTV+AIYVVNYGSS
Subjt:  GFLYLGETFSPIQLVGAVVTVIAIYVVNYGSS

A0A6J1FGD0 WAT1-related protein At3g02690, chloroplastic1.7e-19084.58Show/hide
Query:  MAGCLATATLTPT-------SPSNCRRPLLHFSFSRPILAPDISSAA---LRRRVPFRLRFRRYGENNWFRADYVAV---NCTRSGADTEFDLSESIDCV
        MAGCLAT TLT T       SPSNC RPL  FSF R I+   ISS A    RRRVPF+L FRRY E   FR DYVA+   NCTRSGADT+ D +ESIDCV
Subjt:  MAGCLATATLTPT-------SPSNCRRPLLHFSFSRPILAPDISSAA---LRRRVPFRLRFRRYGENNWFRADYVAV---NCTRSGADTEFDLSESIDCV

Query:  GTAQDVECVVSPTDEDSLSSIGKPLELGLS---DVGGVDGSVAVLEKAWEFAVLVSPFFFWGTAMVAMKEVLPRSGPFFVSAFRLIPAGFLLIAFAALRG
        GTAQDVECVVS TDE          ELG+S   D  G DGSVAVLEKAWEFAVLVSPFFFWGTAMVAMKEVLPRSGPFFVSAFRL+PAGFLLIAFAA RG
Subjt:  GTAQDVECVVSPTDEDSLSSIGKPLELGLS---DVGGVDGSVAVLEKAWEFAVLVSPFFFWGTAMVAMKEVLPRSGPFFVSAFRLIPAGFLLIAFAALRG

Query:  RPFPSGFSAWISILLFALVDATSFQGFLAQGLQRTSAGLGSVIIDSQPLTVAVLAAFLFGESISLVGAAGLVLGVLGLLLLEVPSLSLDASSFSLWGSGE
        RPFPSGFSAWISILLFALVDAT FQGFLAQGLQRTSAGLGSVIIDSQPLTVAVLAAFLFGESI LVGAAGLVLGVLGLLLLEVPSL+LDASSFSLWGSGE
Subjt:  RPFPSGFSAWISILLFALVDATSFQGFLAQGLQRTSAGLGSVIIDSQPLTVAVLAAFLFGESISLVGAAGLVLGVLGLLLLEVPSLSLDASSFSLWGSGE

Query:  WWMFLAAQSMAVGTVMVRWVSKYSDPIMATGWHMVIGGLPLLMICFLNHDPAVSGSLKDFTTNDILALLYASIFGSAVSYGSFFYSATKGSLTKLSSLTF
        WWMFLAAQSMAVGTVMVRWVSKYSDPIMATGWHMVIGGLPLL ICFLNH+PAVSGSL+DFTTNDILAL YASIFGSAVSYGSFFYSATKGSLTKLSSLTF
Subjt:  WWMFLAAQSMAVGTVMVRWVSKYSDPIMATGWHMVIGGLPLLMICFLNHDPAVSGSLKDFTTNDILALLYASIFGSAVSYGSFFYSATKGSLTKLSSLTF

Query:  LTPMFASVFGFLYLGETFSPIQLVGAVVTVIAIYVVNYGSS
        LTPMFAS+FGFLYLGETFSPIQLVGAVVTV++IYVVNYGSS
Subjt:  LTPMFASVFGFLYLGETFSPIQLVGAVVTVIAIYVVNYGSS

A0A6J1JVE3 WAT1-related protein At3g02690, chloroplastic2.1e-19184.81Show/hide
Query:  MAGCLATATLTPT-------SPSNCRRPLLHFSFSRPILAPDISSAA---LRRRVPFRLRFRRYGENNWFRADYVAV---NCTRSGADTEFDLSESIDCV
        MAGCLATATLT T       SPSNC RPL  FSF R I+   ISS A    RRRVPF+L FRRY E   FR DYVA+   NCTRSGADT+ D +ESIDCV
Subjt:  MAGCLATATLTPT-------SPSNCRRPLLHFSFSRPILAPDISSAA---LRRRVPFRLRFRRYGENNWFRADYVAV---NCTRSGADTEFDLSESIDCV

Query:  GTAQDVECVVSPTDEDSLSSIGKPLELGLS---DVGGVDGSVAVLEKAWEFAVLVSPFFFWGTAMVAMKEVLPRSGPFFVSAFRLIPAGFLLIAFAALRG
        GTAQDVECVVS TDE          ELG+S   D  G DGSVAVLEKAWEFAVLVSPFFFWGTAMVAMKEVLPRSGPFFVSAFRL+PAGFLLIAFAA RG
Subjt:  GTAQDVECVVSPTDEDSLSSIGKPLELGLS---DVGGVDGSVAVLEKAWEFAVLVSPFFFWGTAMVAMKEVLPRSGPFFVSAFRLIPAGFLLIAFAALRG

Query:  RPFPSGFSAWISILLFALVDATSFQGFLAQGLQRTSAGLGSVIIDSQPLTVAVLAAFLFGESISLVGAAGLVLGVLGLLLLEVPSLSLDASSFSLWGSGE
        RPFPSGFSAWISILLFALVDAT FQGFLAQGLQRTSAGLGSVIIDSQPLTVAVLAAFLFGESI LVGAAGLVLGVLGL+LLEVPSL+LDASSFSLWGSGE
Subjt:  RPFPSGFSAWISILLFALVDATSFQGFLAQGLQRTSAGLGSVIIDSQPLTVAVLAAFLFGESISLVGAAGLVLGVLGLLLLEVPSLSLDASSFSLWGSGE

Query:  WWMFLAAQSMAVGTVMVRWVSKYSDPIMATGWHMVIGGLPLLMICFLNHDPAVSGSLKDFTTNDILALLYASIFGSAVSYGSFFYSATKGSLTKLSSLTF
        WWMFLAAQSMAVGTVMVRWVSKYSDPIMATGWHMVIGGLPLLMICFLNH+PAVSGSL+DFTTNDILAL YASIFGSAVSYGSFFYSATKGSLTKLSSLTF
Subjt:  WWMFLAAQSMAVGTVMVRWVSKYSDPIMATGWHMVIGGLPLLMICFLNHDPAVSGSLKDFTTNDILALLYASIFGSAVSYGSFFYSATKGSLTKLSSLTF

Query:  LTPMFASVFGFLYLGETFSPIQLVGAVVTVIAIYVVNYGSS
        LTPMFAS+FGFLYLGETFSPIQLVGAVVTV++IYVVNYGSS
Subjt:  LTPMFASVFGFLYLGETFSPIQLVGAVVTVIAIYVVNYGSS

SwissProt top hitse value%identityAlignment
O32256 Uncharacterized transporter YvbV8.0e-1528.62Show/hide
Query:  WGTAMVAMKEVLPRSGPFFVSAFRLIPAGFLLIAFAALRGRPFPSGFSAWISILLFALVDATSFQGFLAQGLQRTSAGLGSVIIDSQPLTVAVLAAFLFG
        WG      K  L  S P   +  R +  G LL+  A  R          W   L+ AL++ T F G    GL    AGL S I+  QP+ + V +    G
Subjt:  WGTAMVAMKEVLPRSGPFFVSAFRLIPAGFLLIAFAALRGRPFPSGFSAWISILLFALVDATSFQGFLAQGLQRTSAGLGSVIIDSQPLTVAVLAAFLFG

Query:  ESISLVGAAGLVLGVLGLLLLEVPSLSLDASSFSLWGSGEWWMFLAAQSMAVGTVMVRWVSKYSDPIMATGWHMVIGGLPLLMICFLNHDPAVSGSLKDF
        ES+ ++   GL+LG  G+ ++         S       G      +A S A+GTV ++      D I      + IG + LL+  F       S S   +
Subjt:  ESISLVGAAGLVLGVLGLLLLEVPSLSLDASSFSLWGSGEWWMFLAAQSMAVGTVMVRWVSKYSDPIMATGWHMVIGGLPLLMICFLNHDPAVSGSLKDF

Query:  TTNDILALLYASIFGSAVSYGSFFYSATKGSLTKLSSLTFLTPMFASVFGFLYLGETFSPIQLVGAVVTVIAIYVVNYGSSGQ
        T   I +LL+ S+F  A+ +  FF     G  +K++S TFL P+ + V   ++L E  +   L G ++ V +I +VN  S  Q
Subjt:  TTNDILALLYASIFGSAVSYGSFFYSATKGSLTKLSSLTFLTPMFASVFGFLYLGETFSPIQLVGAVVTVIAIYVVNYGSSGQ

O34416 Uncharacterized transporter YoaV2.6e-1324.38Show/hide
Query:  LVSPFFFWGTAMVAMKEVLPRSGPFFVSAFRLIPAGFLLIAFAALRGRPFPSGFSAWISILLFALVDATSFQGFLAQGLQRTSAGLGSVIIDSQPLTVAV
        ++S    WG   VAMK  +    P   S  RL      L     ++ +          S ++ +L+    + G L  G+Q   +G  SV++ + P+ V V
Subjt:  LVSPFFFWGTAMVAMKEVLPRSGPFFVSAFRLIPAGFLLIAFAALRGRPFPSGFSAWISILLFALVDATSFQGFLAQGLQRTSAGLGSVIIDSQPLTVAV

Query:  LAAFLFGESISLVGAAGLVLGVLGLL-LLEVPSLSLDASSFSLWGSGEWWMFLAAQSMAVGTVMVRWVSKYSDPIMATGWHMVIGGLPLLMICFLNHDPA
        ++ F   E +++    GLV G+ GLL +     L++D S+      GE  + +AA S  +  V  +   K+ D I    WH+++G + LL+  F+    A
Subjt:  LAAFLFGESISLVGAAGLVLGVLGLL-LLEVPSLSLDASSFSLWGSGEWWMFLAAQSMAVGTVMVRWVSKYSDPIMATGWHMVIGGLPLLMICFLNHDPA

Query:  VSGSLKDFTTNDILALLYASIFGSAVSYGSFFYSATKGSLTKLSSLTFLTPMFASVFGFLYLGETFSPIQLVGAVVTVIAIYV
        V  +  ++T   + +LL+  +  +  ++  +F+   +   +K S      P+ A  FG+L L E  +   ++GA++    I++
Subjt:  VSGSLKDFTTNDILALLYASIFGSAVSYGSFFYSATKGSLTKLSSLTFLTPMFASVFGFLYLGETFSPIQLVGAVVTVIAIYV

P42194 Protein PecM9.7e-1328.98Show/hide
Query:  WGTAMVAMKEVLPRSGPFFVSAFRLIPAGFLLIAFAALRGRPFPSGFSAWISILLFALVDATSFQGFLAQGLQRTSAGLGSVIIDSQPLTVAVLAAFLFG
        WGT      + LP   P   +  R +PAG +LI      G+  P     W   +L AL +   F   L     R   G+ +++   QPL V +L+  L  
Subjt:  WGTAMVAMKEVLPRSGPFFVSAFRLIPAGFLLIAFAALRGRPFPSGFSAWISILLFALVDATSFQGFLAQGLQRTSAGLGSVIIDSQPLTVAVLAAFLFG

Query:  ESISLVGAAGLVLGVLGL-LLLEVPSLSLDASSFSLWGSGEWWMFLAAQSMAVGTVMVRWVSKYSDP-----IMATGWHMVIGGLPLLMICFLNHDPAVS
        + +        V G +G+ LL+ +P   L+        +G     LA  SMA G V+ +   K+  P     +  TGW +  GGL +L +  L       
Subjt:  ESISLVGAAGLVLGVLGL-LLLEVPSLSLDASSFSLWGSGEWWMFLAAQSMAVGTVMVRWVSKYSDP-----IMATGWHMVIGGLPLLMICFLNHDPAVS

Query:  GSLKDFTT-NDILALLYASIFGSAVSYGSFFYSATKGSLTKLSSLTFLTPMFASVFGFLYLGETFSPIQLVGAVVTVIAIYVV
          L D  T  ++   LY +I GS ++Y  +F      S   +S L FL+P+ A + GFL+L +  S  QLVG V    A+ +V
Subjt:  GSLKDFTT-NDILALLYASIFGSAVSYGSFFYSATKGSLTKLSSLTFLTPMFASVFGFLYLGETFSPIQLVGAVVTVIAIYVV

P74436 Uncharacterized transporter sll03551.0e-7052.36Show/hide
Query:  LVSPFFFWGTAMVAMKEVLPRSGPFFVSAFRLIPAGFLLIAFAALRGRPFPSGFSAWISILLFALVDATSFQGFLAQGLQRTSAGLGSVIIDSQPLTVAV
        L++PFF WGTAMVAMK VL  + PFFV+  RLIPAG L++ +A  + RP P  +  W  I+LFALVD T FQGFLAQGL+RT AGLGSVIIDSQP+ VA+
Subjt:  LVSPFFFWGTAMVAMKEVLPRSGPFFVSAFRLIPAGFLLIAFAALRGRPFPSGFSAWISILLFALVDATSFQGFLAQGLQRTSAGLGSVIIDSQPLTVAV

Query:  LAAFLFGESISLVGAAGLVLGVLGLLLLEVP-----------SLSLDASSFSLWGSGEWWMFLAAQSMAVGTVMVRWVSKYSDPIMATGWHMVIGGLPLL
        L+++LF E I  +G  GL+LGV G+ L+ +P            LS++ S  +L  SGE WM LA+ SMAVGTV++ +VS+  DP++ATGWHM+IGGLPLL
Subjt:  LAAFLFGESISLVGAAGLVLGVLGLLLLEVP-----------SLSLDASSFSLWGSGEWWMFLAAQSMAVGTVMVRWVSKYSDPIMATGWHMVIGGLPLL

Query:  MICFL-NHDPAVSGSLKDFTTNDILALLYASIFGSAVSYGSFFYSATKGSLTKLSSLTFLTPMFASVFGFLYLGETFSPIQLVGAVVTVIAIYVVN
         I  + + +P  +  L  +       L YA++FGSA++YG FFY A+KG+LT LSSLTFLTP+FA  F  L L E  S +Q +G   T+++IY++N
Subjt:  MICFL-NHDPAVSGSLKDFTTNDILALLYASIFGSAVSYGSFFYSATKGSLTKLSSLTFLTPMFASVFGFLYLGETFSPIQLVGAVVTVIAIYVVN

Q93V85 WAT1-related protein At3g02690, chloroplastic4.2e-13369.78Show/hide
Query:  DYVAVNCTRSGADTE----FDLSESIDCVGTAQDVECVVSPTDEDSLSSIGKPLELGLSDVGGVDGSVAVLEKAWEFAVLVSPFFFWGTAMVAMKEVLPR
        D V    T S   TE       S S+DCVG   DVECV +  DE++ SS           + G +G+        E+ VL+SPFFFWGTAMVAMKEVLP 
Subjt:  DYVAVNCTRSGADTE----FDLSESIDCVGTAQDVECVVSPTDEDSLSSIGKPLELGLSDVGGVDGSVAVLEKAWEFAVLVSPFFFWGTAMVAMKEVLPR

Query:  SGPFFVSAFRLIPAGFLLIAFAALRGRPFPSGFSAWISILLFALVDATSFQGFLAQGLQRTSAGLGSVIIDSQPLTVAVLAAFLFGESISLVGAAGLVLG
        +GPFFV+AFRLIPAG LL+AFA  +GRP P G +AW SI LFALVDAT FQGFLAQGLQRTSAGLGSVIIDSQPLTVAVLA+FLFGESI +V A GL+LG
Subjt:  SGPFFVSAFRLIPAGFLLIAFAALRGRPFPSGFSAWISILLFALVDATSFQGFLAQGLQRTSAGLGSVIIDSQPLTVAVLAAFLFGESISLVGAAGLVLG

Query:  VLGLLLLEVPSLSLDASSFSLWGSGEWWMFLAAQSMAVGTVMVRWVSKYSDPIMATGWHMVIGGLPLLMICFLNHDPAVSGSLKDFTTNDILALLYASIF
        V GLLLLEVPS++ D ++FSLWGSGEWWM LAAQSMA+GTVMVRWVSKYSDPIMATGWHMVIGGLPLL I  +NHDP  +GSL+D +TND++ALLY SIF
Subjt:  VLGLLLLEVPSLSLDASSFSLWGSGEWWMFLAAQSMAVGTVMVRWVSKYSDPIMATGWHMVIGGLPLLMICFLNHDPAVSGSLKDFTTNDILALLYASIF

Query:  GSAVSYGSFFYSATKGSLTKLSSLTFLTPMFASVFGFLYLGETFSPIQLVGAVVTVIAIYVVNY
        GSAVSYG +FYSATKGSLTKLSSLTFLTPMFAS+FG+LYL ETFS +QLVGA VT++AIY+VN+
Subjt:  GSAVSYGSFFYSATKGSLTKLSSLTFLTPMFASVFGFLYLGETFSPIQLVGAVVTVIAIYVVNY

Arabidopsis top hitse value%identityAlignment
AT3G02690.1 nodulin MtN21 /EamA-like transporter family protein3.0e-13469.78Show/hide
Query:  DYVAVNCTRSGADTE----FDLSESIDCVGTAQDVECVVSPTDEDSLSSIGKPLELGLSDVGGVDGSVAVLEKAWEFAVLVSPFFFWGTAMVAMKEVLPR
        D V    T S   TE       S S+DCVG   DVECV +  DE++ SS           + G +G+        E+ VL+SPFFFWGTAMVAMKEVLP 
Subjt:  DYVAVNCTRSGADTE----FDLSESIDCVGTAQDVECVVSPTDEDSLSSIGKPLELGLSDVGGVDGSVAVLEKAWEFAVLVSPFFFWGTAMVAMKEVLPR

Query:  SGPFFVSAFRLIPAGFLLIAFAALRGRPFPSGFSAWISILLFALVDATSFQGFLAQGLQRTSAGLGSVIIDSQPLTVAVLAAFLFGESISLVGAAGLVLG
        +GPFFV+AFRLIPAG LL+AFA  +GRP P G +AW SI LFALVDAT FQGFLAQGLQRTSAGLGSVIIDSQPLTVAVLA+FLFGESI +V A GL+LG
Subjt:  SGPFFVSAFRLIPAGFLLIAFAALRGRPFPSGFSAWISILLFALVDATSFQGFLAQGLQRTSAGLGSVIIDSQPLTVAVLAAFLFGESISLVGAAGLVLG

Query:  VLGLLLLEVPSLSLDASSFSLWGSGEWWMFLAAQSMAVGTVMVRWVSKYSDPIMATGWHMVIGGLPLLMICFLNHDPAVSGSLKDFTTNDILALLYASIF
        V GLLLLEVPS++ D ++FSLWGSGEWWM LAAQSMA+GTVMVRWVSKYSDPIMATGWHMVIGGLPLL I  +NHDP  +GSL+D +TND++ALLY SIF
Subjt:  VLGLLLLEVPSLSLDASSFSLWGSGEWWMFLAAQSMAVGTVMVRWVSKYSDPIMATGWHMVIGGLPLLMICFLNHDPAVSGSLKDFTTNDILALLYASIF

Query:  GSAVSYGSFFYSATKGSLTKLSSLTFLTPMFASVFGFLYLGETFSPIQLVGAVVTVIAIYVVNY
        GSAVSYG +FYSATKGSLTKLSSLTFLTPMFAS+FG+LYL ETFS +QLVGA VT++AIY+VN+
Subjt:  GSAVSYGSFFYSATKGSLTKLSSLTFLTPMFASVFGFLYLGETFSPIQLVGAVVTVIAIYVVNY


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGGGCTGTTTAGCCACTGCCACTCTTACACCCACTTCTCCATCTAACTGTCGCCGCCCTCTTCTTCATTTCAGTTTCAGCAGACCAATTCTCGCACCGGACATTTC
CTCCGCCGCTCTCCGGCGACGTGTGCCGTTTCGTTTACGTTTCAGAAGGTACGGCGAGAATAACTGGTTTCGTGCTGATTATGTTGCGGTGAATTGCACCAGAAGTGGCG
CGGATACTGAATTCGATTTGTCTGAGTCCATCGATTGTGTGGGGACTGCTCAGGATGTGGAGTGTGTGGTTTCCCCAACTGATGAAGATTCTTTATCTTCAATTGGGAAG
CCATTGGAATTAGGGCTTTCTGATGTTGGTGGTGTTGATGGTTCTGTGGCGGTGTTGGAGAAAGCCTGGGAGTTCGCGGTGTTGGTTTCGCCGTTTTTCTTTTGGGGTAC
GGCTATGGTGGCGATGAAGGAGGTGCTTCCAAGGTCTGGTCCTTTTTTCGTTTCCGCCTTTCGCCTTATACCGGCCGGTTTCCTTTTGATTGCCTTTGCTGCTTTACGCG
GTCGTCCCTTTCCCTCGGGTTTTTCTGCTTGGATTTCCATCCTTCTCTTTGCTCTTGTCGACGCTACATCATTTCAGGGCTTTCTTGCTCAAGGCTTGCAGAGGACATCC
GCAGGCTTGGGCAGTGTAATAATTGATTCTCAACCATTAACCGTGGCAGTGCTTGCAGCTTTCTTATTTGGAGAGTCCATCAGTTTAGTTGGAGCTGCTGGCCTTGTACT
TGGTGTTTTAGGACTTTTACTTCTTGAGGTTCCTTCACTTTCTTTAGATGCAAGTAGCTTTTCGTTATGGGGAAGTGGAGAGTGGTGGATGTTTCTAGCTGCACAGAGCA
TGGCAGTAGGTACTGTCATGGTCCGCTGGGTTTCCAAGTACTCTGATCCTATTATGGCAACTGGATGGCACATGGTGATTGGTGGTCTCCCGCTTTTGATGATCTGTTTC
CTTAATCATGATCCTGCCGTCAGTGGAAGCCTCAAGGATTTTACAACAAATGATATACTGGCGCTCCTTTATGCATCCATTTTTGGAAGTGCTGTTAGCTACGGTTCATT
CTTCTATAGTGCAACTAAAGGTAGTTTGACAAAGCTCAGCTCTCTCACCTTTCTCACTCCAATGTTTGCTTCAGTTTTTGGGTTTCTATATTTGGGAGAGACATTCTCAC
CTATTCAACTGGTTGGAGCCGTCGTCACTGTGATCGCTATATACGTGGTCAACTATGGCAGTAGTGGTCAACTATGGCAGTAG
mRNA sequenceShow/hide mRNA sequence
ATGGCGGGCTGTTTAGCCACTGCCACTCTTACACCCACTTCTCCATCTAACTGTCGCCGCCCTCTTCTTCATTTCAGTTTCAGCAGACCAATTCTCGCACCGGACATTTC
CTCCGCCGCTCTCCGGCGACGTGTGCCGTTTCGTTTACGTTTCAGAAGGTACGGCGAGAATAACTGGTTTCGTGCTGATTATGTTGCGGTGAATTGCACCAGAAGTGGCG
CGGATACTGAATTCGATTTGTCTGAGTCCATCGATTGTGTGGGGACTGCTCAGGATGTGGAGTGTGTGGTTTCCCCAACTGATGAAGATTCTTTATCTTCAATTGGGAAG
CCATTGGAATTAGGGCTTTCTGATGTTGGTGGTGTTGATGGTTCTGTGGCGGTGTTGGAGAAAGCCTGGGAGTTCGCGGTGTTGGTTTCGCCGTTTTTCTTTTGGGGTAC
GGCTATGGTGGCGATGAAGGAGGTGCTTCCAAGGTCTGGTCCTTTTTTCGTTTCCGCCTTTCGCCTTATACCGGCCGGTTTCCTTTTGATTGCCTTTGCTGCTTTACGCG
GTCGTCCCTTTCCCTCGGGTTTTTCTGCTTGGATTTCCATCCTTCTCTTTGCTCTTGTCGACGCTACATCATTTCAGGGCTTTCTTGCTCAAGGCTTGCAGAGGACATCC
GCAGGCTTGGGCAGTGTAATAATTGATTCTCAACCATTAACCGTGGCAGTGCTTGCAGCTTTCTTATTTGGAGAGTCCATCAGTTTAGTTGGAGCTGCTGGCCTTGTACT
TGGTGTTTTAGGACTTTTACTTCTTGAGGTTCCTTCACTTTCTTTAGATGCAAGTAGCTTTTCGTTATGGGGAAGTGGAGAGTGGTGGATGTTTCTAGCTGCACAGAGCA
TGGCAGTAGGTACTGTCATGGTCCGCTGGGTTTCCAAGTACTCTGATCCTATTATGGCAACTGGATGGCACATGGTGATTGGTGGTCTCCCGCTTTTGATGATCTGTTTC
CTTAATCATGATCCTGCCGTCAGTGGAAGCCTCAAGGATTTTACAACAAATGATATACTGGCGCTCCTTTATGCATCCATTTTTGGAAGTGCTGTTAGCTACGGTTCATT
CTTCTATAGTGCAACTAAAGGTAGTTTGACAAAGCTCAGCTCTCTCACCTTTCTCACTCCAATGTTTGCTTCAGTTTTTGGGTTTCTATATTTGGGAGAGACATTCTCAC
CTATTCAACTGGTTGGAGCCGTCGTCACTGTGATCGCTATATACGTGGTCAACTATGGCAGTAGTGGTCAACTATGGCAGTAG
Protein sequenceShow/hide protein sequence
MAGCLATATLTPTSPSNCRRPLLHFSFSRPILAPDISSAALRRRVPFRLRFRRYGENNWFRADYVAVNCTRSGADTEFDLSESIDCVGTAQDVECVVSPTDEDSLSSIGK
PLELGLSDVGGVDGSVAVLEKAWEFAVLVSPFFFWGTAMVAMKEVLPRSGPFFVSAFRLIPAGFLLIAFAALRGRPFPSGFSAWISILLFALVDATSFQGFLAQGLQRTS
AGLGSVIIDSQPLTVAVLAAFLFGESISLVGAAGLVLGVLGLLLLEVPSLSLDASSFSLWGSGEWWMFLAAQSMAVGTVMVRWVSKYSDPIMATGWHMVIGGLPLLMICF
LNHDPAVSGSLKDFTTNDILALLYASIFGSAVSYGSFFYSATKGSLTKLSSLTFLTPMFASVFGFLYLGETFSPIQLVGAVVTVIAIYVVNYGSSGQLWQ