CuGenDBv2

Moc04g30140 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc04g30140
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: WAT1-related protein At3g02690, chloroplastic
Genome location: chr4:22492957..22497298
RNA-Seq Expression: Moc04g30140
Synteny: Moc04g30140
Gene Ontology terms: GO:0016021 - integral component of membrane (cellular component)
InterPro domains: IPR000620 - EamA domain
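
The genome location field above can be split into its parts programmatically. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (the page does not state the convention); `GenomeLocation` and `parse_location` are hypothetical helper names, not part of CuGenDB.

```python
import re
from typing import NamedTuple


class GenomeLocation(NamedTuple):
    """A chromosome interval, assumed 1-based and inclusive."""
    chrom: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Inclusive coordinates: both end points count toward the span.
        return self.end - self.start + 1


def parse_location(text: str) -> GenomeLocation:
    """Parse a string of the form 'chr4:22492957..22497298'."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    chrom, start, end = match.groups()
    return GenomeLocation(chrom, int(start), int(end))


loc = parse_location("chr4:22492957..22497298")
print(loc.chrom, loc.start, loc.end, loc.length)
```

Under the inclusive-coordinate assumption, the gene spans 4342 bp on chr4, which is longer than the 1359-nt CDS listed further down.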


Homology
GenBank top hits | e value | % identity
KAG6579056.1 WAT1-related protein, chloroplastic, partial [Cucurbita argyrosperma subsp. sororia] | 2.0e-183 | 83.81
Query:  SASSSLPTPSNSRPLLHFSFSRQLLLASPNSSPAPIV-HRRVGFHFSSTGRYGHNDWFRADYVPIPVVNCTRSGADTELDLSESVDCVGTAQDVECVVSP
        S   S  +PSN RPL  FSF RQ ++    SS API+  RRV F      RY    WFR DYV IPV+NCTRSGADT+LD +ES+DCVGTAQDVECVVS 
Subjt:  SASSSLPTPSNSRPLLHFSFSRQLLLASPNSSPAPIV-HRRVGFHFSSTGRYGHNDWFRADYVPIPVVNCTRSGADTELDLSESVDCVGTAQDVECVVSP

Query:  ADED-PRSSIADVDGDGSLAVLGKAWEFAVLVSPFFFWGTAMVAMKEVLPRSGPFFVSAFRLIPAGFLLIAFAAFRGRPFPSGFSAWISIALFALVDATS
         DE+   SS  D DGDGS+AVL KAWEFAVLVSPFFFWGTAMVAMKEVLPRSGPFFVSAFRL+PAGFLLIAFAAFRGRPFPSGFSAWISI LFALVDAT 
Subjt:  ADED-PRSSIADVDGDGSLAVLGKAWEFAVLVSPFFFWGTAMVAMKEVLPRSGPFFVSAFRLIPAGFLLIAFAAFRGRPFPSGFSAWISIALFALVDATS

Query:  FQGFLAQGLQRTSAGLGSVIIDSQPLTVAVLAALLFGESISLIGAAGLLLGVLGLLLLEVPSLALDASTFSLWGSGEWWMFLAAQSMAVGTVMVRWVSKY
        FQGFLAQGLQRTSAGLGSVIIDSQPLTVAVLAA LFGESI L+GAAGL+LGVLGLLLLEVPSL LDAS+FSLWGSGEWWMFLAAQSMAVGTVMVRWVSKY
Subjt:  FQGFLAQGLQRTSAGLGSVIIDSQPLTVAVLAALLFGESISLIGAAGLLLGVLGLLLLEVPSLALDASTFSLWGSGEWWMFLAAQSMAVGTVMVRWVSKY

Query:  SDPIMATGWHMVIGGLPLLAICILNHDPAISGSLKDFTTNDILALLYASIFGSAISYGSFFYSATKGSLTKLSSLTFLTPMFASVFGFLYLGETFSPIQL
        SDPIMATGWHMVIGGLPLLAIC LNH+PA+SGSL+DFTTNDILAL YASIFGSA+SYGSFFYSATKGSLTKLSSLTFLTPMFAS+FGFLYLGETFSPIQL
Subjt:  SDPIMATGWHMVIGGLPLLAICILNHDPAISGSLKDFTTNDILALLYASIFGSAISYGSFFYSATKGSLTKLSSLTFLTPMFASVFGFLYLGETFSPIQL

Query:  VGAVVTVVAIYVVNYGSSVE
        VGAVVTVV+IYVVNYGSS+E
Subjt:  VGAVVTVVAIYVVNYGSSVE
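
The middle row of each alignment block follows the usual BLAST convention: a letter marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. As extracted here, the `Subjt:` rows appear to repeat the query, so the sketch below works from the query and middle rows only. `midline_stats` is a hypothetical helper, and the figures it returns describe a single displayed segment, not the whole-hit identity reported in the table.

```python
def midline_stats(query: str, midline: str) -> tuple[int, int, int]:
    """Return (identical, positive, columns) for one alignment segment.

    `query` and `midline` are the residue strings with the
    'Query:  ' prefix and the matching indentation already removed.
    """
    columns = len(query)
    # Trailing spaces in the midline are often stripped; pad them back.
    midline = midline.ljust(columns)[:columns]
    identical = sum(1 for ch in midline if ch.isalpha())
    positive = identical + midline.count("+")
    return identical, positive, columns


# Last segment of the first GenBank hit above.
query = "VGAVVTVVAIYVVNYGSSVE"
midline = "VGAVVTVV+IYVVNYGSS+E"
identical, positive, columns = midline_stats(query, midline)
print(f"{identical}/{columns} identical, {positive}/{columns} positive")
```

For this 20-column segment the sketch counts 18 identical positions and 20 positives.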

XP_004140354.1 WAT1-related protein At3g02690, chloroplastic isoform X3 [Cucumis sativus] | 1.5e-186 | 81.76
Query:  MAGFWPSASSSLPTPSNSRPLLHFSFSRQLLLASPNSSPAPIVHRRVGFHFSSTGRYGHNDWFRADYVPIPVVNCTRSGADTELDLSESVDCVGTAQDVE
        MAG   +A+ +  +PSNS P  H         A P SS API+ RR+ F      RY  N  FR  YV IPV NCTRSG DTELD +ES+DCVGTAQDVE
Subjt:  MAGFWPSASSSLPTPSNSRPLLHFSFSRQLLLASPNSSPAPIVHRRVGFHFSSTGRYGHNDWFRADYVPIPVVNCTRSGADTELDLSESVDCVGTAQDVE

Query:  CVVSPADEDPRSSI---------ADVDGDGSLAVLGKAWEFAVLVSPFFFWGTAMVAMKEVLPRSGPFFVSAFRLIPAGFLLIAFAAFRGRPFPSGFSAW
        CVVSP DEDP SSI         +D  GDGS+AVL KAWEFAVLVSPFFFWGTAMVAMKEVLPRSGPFFVSAFRLIPAGFLLIAFAAFRGRPFPSGFSAW
Subjt:  CVVSPADEDPRSSI---------ADVDGDGSLAVLGKAWEFAVLVSPFFFWGTAMVAMKEVLPRSGPFFVSAFRLIPAGFLLIAFAAFRGRPFPSGFSAW

Query:  ISIALFALVDATSFQGFLAQGLQRTSAGLGSVIIDSQPLTVAVLAALLFGESISLIGAAGLLLGVLGLLLLEVPSLALDASTFSLWGSGEWWMFLAAQSM
        ISI LFALVDAT FQGFLAQGLQRTSAGLGSVIIDSQPLTVAVLAA LFGES+ L+GAAGL+LGVLGLLLLEVPSL  DA++FSLWGSGEWWMFLAAQSM
Subjt:  ISIALFALVDATSFQGFLAQGLQRTSAGLGSVIIDSQPLTVAVLAALLFGESISLIGAAGLLLGVLGLLLLEVPSLALDASTFSLWGSGEWWMFLAAQSM

Query:  AVGTVMVRWVSKYSDPIMATGWHMVIGGLPLLAICILNHDPAISGSLKDFTTNDILALLYASIFGSAISYGSFFYSATKGSLTKLSSLTFLTPMFASVFG
        AVGTVMVRWVSKYSDPIMATGWHMVIGGLPLL ICILNHDPA+SGSLKDFTTNDILALLYASIFGSA+SYGSFFYSATKGSLTKLSSLTFLTPMFASVFG
Subjt:  AVGTVMVRWVSKYSDPIMATGWHMVIGGLPLLAICILNHDPAISGSLKDFTTNDILALLYASIFGSAISYGSFFYSATKGSLTKLSSLTFLTPMFASVFG

Query:  FLYLGETFSPIQLVGAVVTVVAIYVVNYGSSVE
        FLYLGETFSPIQLVGAVVTVVAIYVVNYGS +E
Subjt:  FLYLGETFSPIQLVGAVVTVVAIYVVNYGSSVE

XP_008463172.1 PREDICTED: WAT1-related protein At3g02690, chloroplastic [Cucumis melo] | 6.5e-182 | 80.37
Query:  MAGFWPSASSSLPTPSNSRPLLHFSFSRQLLLASPNSSPAPIVHRRVGFHFSSTGRYGHNDWFRADYVPIPVVNCTRSGADTELDLSESVDCVGTAQDVE
        MA    +A+ +  +PSNS P            A   SS  PI+ RR+ F      RY  N  FR  YV IPV NCTRSG DTELD +ES+DCVGTAQDVE
Subjt:  MAGFWPSASSSLPTPSNSRPLLHFSFSRQLLLASPNSSPAPIVHRRVGFHFSSTGRYGHNDWFRADYVPIPVVNCTRSGADTELDLSESVDCVGTAQDVE

Query:  CVVSPADEDPRSSI---------ADVDGDGSLAVLGKAWEFAVLVSPFFFWGTAMVAMKEVLPRSGPFFVSAFRLIPAGFLLIAFAAFRGRPFPSGFSAW
        CVVSP DEDP SSI         +D  G GS+AVL KAWEFAVLVSPFFFWGTAMVAMKEVLPRSGPFFVSAFRLIPAGFLL+AFAAFRGRPFPSGFSAW
Subjt:  CVVSPADEDPRSSI---------ADVDGDGSLAVLGKAWEFAVLVSPFFFWGTAMVAMKEVLPRSGPFFVSAFRLIPAGFLLIAFAAFRGRPFPSGFSAW

Query:  ISIALFALVDATSFQGFLAQGLQRTSAGLGSVIIDSQPLTVAVLAALLFGESISLIGAAGLLLGVLGLLLLEVPSLALDASTFSLWGSGEWWMFLAAQSM
        ISI LFALVDAT FQGFLAQGLQRTSAGLGSVIIDSQPLTVAVLAA LFGES+ LIGAAGL+LGV GLLLLEVPSL  DA++FSLWGSGEWWMFLAAQSM
Subjt:  ISIALFALVDATSFQGFLAQGLQRTSAGLGSVIIDSQPLTVAVLAALLFGESISLIGAAGLLLGVLGLLLLEVPSLALDASTFSLWGSGEWWMFLAAQSM

Query:  AVGTVMVRWVSKYSDPIMATGWHMVIGGLPLLAICILNHDPAISGSLKDFTTNDILALLYASIFGSAISYGSFFYSATKGSLTKLSSLTFLTPMFASVFG
        AVGTVMVRWVSKYSDPIMATGWHMVIGGLPLL ICILNHDPA+SGSLKDFTTNDILALLYASIFGSA+SYGSFFYSATKGSLTKLSSLTFLTPMFASVFG
Subjt:  AVGTVMVRWVSKYSDPIMATGWHMVIGGLPLLAICILNHDPAISGSLKDFTTNDILALLYASIFGSAISYGSFFYSATKGSLTKLSSLTFLTPMFASVFG

Query:  FLYLGETFSPIQLVGAVVTVVAIYVVNYGSSVE
        FLYLGETFSPIQLVGAVVTVVAIY VNYGSS+E
Subjt:  FLYLGETFSPIQLVGAVVTVVAIYVVNYGSSVE

XP_022141445.1 WAT1-related protein At3g02690, chloroplastic [Momordica charantia] | 1.0e-248 | 100
Query:  MALATAVSSPPNAFLLQPQRVTVAERTKMAGFWPSASSSLPTPSNSRPLLHFSFSRQLLLASPNSSPAPIVHRRVGFHFSSTGRYGHNDWFRADYVPIPV
        MALATAVSSPPNAFLLQPQRVTVAERTKMAGFWPSASSSLPTPSNSRPLLHFSFSRQLLLASPNSSPAPIVHRRVGFHFSSTGRYGHNDWFRADYVPIPV
Subjt:  MALATAVSSPPNAFLLQPQRVTVAERTKMAGFWPSASSSLPTPSNSRPLLHFSFSRQLLLASPNSSPAPIVHRRVGFHFSSTGRYGHNDWFRADYVPIPV

Query:  VNCTRSGADTELDLSESVDCVGTAQDVECVVSPADEDPRSSIADVDGDGSLAVLGKAWEFAVLVSPFFFWGTAMVAMKEVLPRSGPFFVSAFRLIPAGFL
        VNCTRSGADTELDLSESVDCVGTAQDVECVVSPADEDPRSSIADVDGDGSLAVLGKAWEFAVLVSPFFFWGTAMVAMKEVLPRSGPFFVSAFRLIPAGFL
Subjt:  VNCTRSGADTELDLSESVDCVGTAQDVECVVSPADEDPRSSIADVDGDGSLAVLGKAWEFAVLVSPFFFWGTAMVAMKEVLPRSGPFFVSAFRLIPAGFL

Query:  LIAFAAFRGRPFPSGFSAWISIALFALVDATSFQGFLAQGLQRTSAGLGSVIIDSQPLTVAVLAALLFGESISLIGAAGLLLGVLGLLLLEVPSLALDAS
        LIAFAAFRGRPFPSGFSAWISIALFALVDATSFQGFLAQGLQRTSAGLGSVIIDSQPLTVAVLAALLFGESISLIGAAGLLLGVLGLLLLEVPSLALDAS
Subjt:  LIAFAAFRGRPFPSGFSAWISIALFALVDATSFQGFLAQGLQRTSAGLGSVIIDSQPLTVAVLAALLFGESISLIGAAGLLLGVLGLLLLEVPSLALDAS

Query:  TFSLWGSGEWWMFLAAQSMAVGTVMVRWVSKYSDPIMATGWHMVIGGLPLLAICILNHDPAISGSLKDFTTNDILALLYASIFGSAISYGSFFYSATKGS
        TFSLWGSGEWWMFLAAQSMAVGTVMVRWVSKYSDPIMATGWHMVIGGLPLLAICILNHDPAISGSLKDFTTNDILALLYASIFGSAISYGSFFYSATKGS
Subjt:  TFSLWGSGEWWMFLAAQSMAVGTVMVRWVSKYSDPIMATGWHMVIGGLPLLAICILNHDPAISGSLKDFTTNDILALLYASIFGSAISYGSFFYSATKGS

Query:  LTKLSSLTFLTPMFASVFGFLYLGETFSPIQLVGAVVTVVAIYVVNYGSSVE
        LTKLSSLTFLTPMFASVFGFLYLGETFSPIQLVGAVVTVVAIYVVNYGSSVE
Subjt:  LTKLSSLTFLTPMFASVFGFLYLGETFSPIQLVGAVVTVVAIYVVNYGSSVE

XP_038882098.1 WAT1-related protein At3g02690, chloroplastic [Benincasa hispida] | 2.6e-191 | 84.3
Query:  MAGFWPSASSSLPT---PSNSRPLLHFSFSRQLLLASPN-SSPAPIVHRRVGFHFSSTGRYGHNDWFRADYVPIPVVNCTRSGADTELDLSESVDCVGTA
        MAG   +A+ + PT   PSNSRPLLHFSF+RQ+L  +P+ SS API+ RR  FH   T RYG N  FR DYV IPV NCTRSGADTELDL+ES+DCVGTA
Subjt:  MAGFWPSASSSLPT---PSNSRPLLHFSFSRQLLLASPN-SSPAPIVHRRVGFHFSSTGRYGHNDWFRADYVPIPVVNCTRSGADTELDLSESVDCVGTA

Query:  QDVECVVSPADEDPRSSI-----ADVDGDGSLAVLGKAWEFAVLVSPFFFWGTAMVAMKEVLPRSGPFFVSAFRLIPAGFLLIAFAAFRGRPFPSGFSAW
        QDVECV+S   EDP SS+     ++ DGDGS+AVL KAWEFAVLVSPFFFWGTAMVAMKEVLPRSGPFFVSAFRLIPAGFLLIAFAAFRGRPFPSGFSAW
Subjt:  QDVECVVSPADEDPRSSI-----ADVDGDGSLAVLGKAWEFAVLVSPFFFWGTAMVAMKEVLPRSGPFFVSAFRLIPAGFLLIAFAAFRGRPFPSGFSAW

Query:  ISIALFALVDATSFQGFLAQGLQRTSAGLGSVIIDSQPLTVAVLAALLFGESISLIGAAGLLLGVLGLLLLEVPSLALDASTFSLWGSGEWWMFLAAQSM
        ISI LFALVDAT FQGFLAQGLQRTSAGLGSVIIDSQPLTVAVLAA LFGESI L+GAAGL+LGVLGLLLLEVPSL  DA++FSLWGSGEWWMFLAAQSM
Subjt:  ISIALFALVDATSFQGFLAQGLQRTSAGLGSVIIDSQPLTVAVLAALLFGESISLIGAAGLLLGVLGLLLLEVPSLALDASTFSLWGSGEWWMFLAAQSM

Query:  AVGTVMVRWVSKYSDPIMATGWHMVIGGLPLLAICILNHDPAISGSLKDFTTNDILALLYASIFGSAISYGSFFYSATKGSLTKLSSLTFLTPMFASVFG
        AVGTVMVRWVSKYSDPIMATGWHMVIGGLPLL ICILNHDPA+SGSLKDFTTNDILALLYASIFGSA+SYGSFFYSATKGSLTKLSSLTFLTPMFAS FG
Subjt:  AVGTVMVRWVSKYSDPIMATGWHMVIGGLPLLAICILNHDPAISGSLKDFTTNDILALLYASIFGSAISYGSFFYSATKGSLTKLSSLTFLTPMFASVFG

Query:  FLYLGETFSPIQLVGAVVTVVAIYVVNYGSSVE
        FLYLGETFSPIQLVGAVVTVVAIYVVNYG+++E
Subjt:  FLYLGETFSPIQLVGAVVTVVAIYVVNYGSSVE

TrEMBL top hits | e value | % identity
A0A0A0KQD4 Uncharacterized protein | 7.2e-187 | 81.76
Query:  MAGFWPSASSSLPTPSNSRPLLHFSFSRQLLLASPNSSPAPIVHRRVGFHFSSTGRYGHNDWFRADYVPIPVVNCTRSGADTELDLSESVDCVGTAQDVE
        MAG   +A+ +  +PSNS P  H         A P SS API+ RR+ F      RY  N  FR  YV IPV NCTRSG DTELD +ES+DCVGTAQDVE
Subjt:  MAGFWPSASSSLPTPSNSRPLLHFSFSRQLLLASPNSSPAPIVHRRVGFHFSSTGRYGHNDWFRADYVPIPVVNCTRSGADTELDLSESVDCVGTAQDVE

Query:  CVVSPADEDPRSSI---------ADVDGDGSLAVLGKAWEFAVLVSPFFFWGTAMVAMKEVLPRSGPFFVSAFRLIPAGFLLIAFAAFRGRPFPSGFSAW
        CVVSP DEDP SSI         +D  GDGS+AVL KAWEFAVLVSPFFFWGTAMVAMKEVLPRSGPFFVSAFRLIPAGFLLIAFAAFRGRPFPSGFSAW
Subjt:  CVVSPADEDPRSSI---------ADVDGDGSLAVLGKAWEFAVLVSPFFFWGTAMVAMKEVLPRSGPFFVSAFRLIPAGFLLIAFAAFRGRPFPSGFSAW

Query:  ISIALFALVDATSFQGFLAQGLQRTSAGLGSVIIDSQPLTVAVLAALLFGESISLIGAAGLLLGVLGLLLLEVPSLALDASTFSLWGSGEWWMFLAAQSM
        ISI LFALVDAT FQGFLAQGLQRTSAGLGSVIIDSQPLTVAVLAA LFGES+ L+GAAGL+LGVLGLLLLEVPSL  DA++FSLWGSGEWWMFLAAQSM
Subjt:  ISIALFALVDATSFQGFLAQGLQRTSAGLGSVIIDSQPLTVAVLAALLFGESISLIGAAGLLLGVLGLLLLEVPSLALDASTFSLWGSGEWWMFLAAQSM

Query:  AVGTVMVRWVSKYSDPIMATGWHMVIGGLPLLAICILNHDPAISGSLKDFTTNDILALLYASIFGSAISYGSFFYSATKGSLTKLSSLTFLTPMFASVFG
        AVGTVMVRWVSKYSDPIMATGWHMVIGGLPLL ICILNHDPA+SGSLKDFTTNDILALLYASIFGSA+SYGSFFYSATKGSLTKLSSLTFLTPMFASVFG
Subjt:  AVGTVMVRWVSKYSDPIMATGWHMVIGGLPLLAICILNHDPAISGSLKDFTTNDILALLYASIFGSAISYGSFFYSATKGSLTKLSSLTFLTPMFASVFG

Query:  FLYLGETFSPIQLVGAVVTVVAIYVVNYGSSVE
        FLYLGETFSPIQLVGAVVTVVAIYVVNYGS +E
Subjt:  FLYLGETFSPIQLVGAVVTVVAIYVVNYGSSVE

A0A1S3CK55 WAT1-related protein At3g02690, chloroplastic | 3.1e-182 | 80.37
Query:  MAGFWPSASSSLPTPSNSRPLLHFSFSRQLLLASPNSSPAPIVHRRVGFHFSSTGRYGHNDWFRADYVPIPVVNCTRSGADTELDLSESVDCVGTAQDVE
        MA    +A+ +  +PSNS P            A   SS  PI+ RR+ F      RY  N  FR  YV IPV NCTRSG DTELD +ES+DCVGTAQDVE
Subjt:  MAGFWPSASSSLPTPSNSRPLLHFSFSRQLLLASPNSSPAPIVHRRVGFHFSSTGRYGHNDWFRADYVPIPVVNCTRSGADTELDLSESVDCVGTAQDVE

Query:  CVVSPADEDPRSSI---------ADVDGDGSLAVLGKAWEFAVLVSPFFFWGTAMVAMKEVLPRSGPFFVSAFRLIPAGFLLIAFAAFRGRPFPSGFSAW
        CVVSP DEDP SSI         +D  G GS+AVL KAWEFAVLVSPFFFWGTAMVAMKEVLPRSGPFFVSAFRLIPAGFLL+AFAAFRGRPFPSGFSAW
Subjt:  CVVSPADEDPRSSI---------ADVDGDGSLAVLGKAWEFAVLVSPFFFWGTAMVAMKEVLPRSGPFFVSAFRLIPAGFLLIAFAAFRGRPFPSGFSAW

Query:  ISIALFALVDATSFQGFLAQGLQRTSAGLGSVIIDSQPLTVAVLAALLFGESISLIGAAGLLLGVLGLLLLEVPSLALDASTFSLWGSGEWWMFLAAQSM
        ISI LFALVDAT FQGFLAQGLQRTSAGLGSVIIDSQPLTVAVLAA LFGES+ LIGAAGL+LGV GLLLLEVPSL  DA++FSLWGSGEWWMFLAAQSM
Subjt:  ISIALFALVDATSFQGFLAQGLQRTSAGLGSVIIDSQPLTVAVLAALLFGESISLIGAAGLLLGVLGLLLLEVPSLALDASTFSLWGSGEWWMFLAAQSM

Query:  AVGTVMVRWVSKYSDPIMATGWHMVIGGLPLLAICILNHDPAISGSLKDFTTNDILALLYASIFGSAISYGSFFYSATKGSLTKLSSLTFLTPMFASVFG
        AVGTVMVRWVSKYSDPIMATGWHMVIGGLPLL ICILNHDPA+SGSLKDFTTNDILALLYASIFGSA+SYGSFFYSATKGSLTKLSSLTFLTPMFASVFG
Subjt:  AVGTVMVRWVSKYSDPIMATGWHMVIGGLPLLAICILNHDPAISGSLKDFTTNDILALLYASIFGSAISYGSFFYSATKGSLTKLSSLTFLTPMFASVFG

Query:  FLYLGETFSPIQLVGAVVTVVAIYVVNYGSSVE
        FLYLGETFSPIQLVGAVVTVVAIY VNYGSS+E
Subjt:  FLYLGETFSPIQLVGAVVTVVAIYVVNYGSSVE

A0A6J1CI41 WAT1-related protein At3g02690, chloroplastic | 5.1e-249 | 100
Query:  MALATAVSSPPNAFLLQPQRVTVAERTKMAGFWPSASSSLPTPSNSRPLLHFSFSRQLLLASPNSSPAPIVHRRVGFHFSSTGRYGHNDWFRADYVPIPV
        MALATAVSSPPNAFLLQPQRVTVAERTKMAGFWPSASSSLPTPSNSRPLLHFSFSRQLLLASPNSSPAPIVHRRVGFHFSSTGRYGHNDWFRADYVPIPV
Subjt:  MALATAVSSPPNAFLLQPQRVTVAERTKMAGFWPSASSSLPTPSNSRPLLHFSFSRQLLLASPNSSPAPIVHRRVGFHFSSTGRYGHNDWFRADYVPIPV

Query:  VNCTRSGADTELDLSESVDCVGTAQDVECVVSPADEDPRSSIADVDGDGSLAVLGKAWEFAVLVSPFFFWGTAMVAMKEVLPRSGPFFVSAFRLIPAGFL
        VNCTRSGADTELDLSESVDCVGTAQDVECVVSPADEDPRSSIADVDGDGSLAVLGKAWEFAVLVSPFFFWGTAMVAMKEVLPRSGPFFVSAFRLIPAGFL
Subjt:  VNCTRSGADTELDLSESVDCVGTAQDVECVVSPADEDPRSSIADVDGDGSLAVLGKAWEFAVLVSPFFFWGTAMVAMKEVLPRSGPFFVSAFRLIPAGFL

Query:  LIAFAAFRGRPFPSGFSAWISIALFALVDATSFQGFLAQGLQRTSAGLGSVIIDSQPLTVAVLAALLFGESISLIGAAGLLLGVLGLLLLEVPSLALDAS
        LIAFAAFRGRPFPSGFSAWISIALFALVDATSFQGFLAQGLQRTSAGLGSVIIDSQPLTVAVLAALLFGESISLIGAAGLLLGVLGLLLLEVPSLALDAS
Subjt:  LIAFAAFRGRPFPSGFSAWISIALFALVDATSFQGFLAQGLQRTSAGLGSVIIDSQPLTVAVLAALLFGESISLIGAAGLLLGVLGLLLLEVPSLALDAS

Query:  TFSLWGSGEWWMFLAAQSMAVGTVMVRWVSKYSDPIMATGWHMVIGGLPLLAICILNHDPAISGSLKDFTTNDILALLYASIFGSAISYGSFFYSATKGS
        TFSLWGSGEWWMFLAAQSMAVGTVMVRWVSKYSDPIMATGWHMVIGGLPLLAICILNHDPAISGSLKDFTTNDILALLYASIFGSAISYGSFFYSATKGS
Subjt:  TFSLWGSGEWWMFLAAQSMAVGTVMVRWVSKYSDPIMATGWHMVIGGLPLLAICILNHDPAISGSLKDFTTNDILALLYASIFGSAISYGSFFYSATKGS

Query:  LTKLSSLTFLTPMFASVFGFLYLGETFSPIQLVGAVVTVVAIYVVNYGSSVE
        LTKLSSLTFLTPMFASVFGFLYLGETFSPIQLVGAVVTVVAIYVVNYGSSVE
Subjt:  LTKLSSLTFLTPMFASVFGFLYLGETFSPIQLVGAVVTVVAIYVVNYGSSVE

A0A6J1FGD0 WAT1-related protein At3g02690, chloroplastic | 1.2e-181 | 83.33
Query:  SASSSLPTPSNSRPLLHFSFSRQLLLASPNSSPAPIV-HRRVGFHFSSTGRYGHNDWFRADYVPIPVVNCTRSGADTELDLSESVDCVGTAQDVECVVSP
        S   S  +PSN RPL  FSF RQ ++    SS API+  RRV F      RY     FR DYV IP +NCTRSGADT+LD +ES+DCVGTAQDVECVVS 
Subjt:  SASSSLPTPSNSRPLLHFSFSRQLLLASPNSSPAPIV-HRRVGFHFSSTGRYGHNDWFRADYVPIPVVNCTRSGADTELDLSESVDCVGTAQDVECVVSP

Query:  ADED-PRSSIADVDGDGSLAVLGKAWEFAVLVSPFFFWGTAMVAMKEVLPRSGPFFVSAFRLIPAGFLLIAFAAFRGRPFPSGFSAWISIALFALVDATS
         DE+   SS  D DGDGS+AVL KAWEFAVLVSPFFFWGTAMVAMKEVLPRSGPFFVSAFRL+PAGFLLIAFAAFRGRPFPSGFSAWISI LFALVDAT 
Subjt:  ADED-PRSSIADVDGDGSLAVLGKAWEFAVLVSPFFFWGTAMVAMKEVLPRSGPFFVSAFRLIPAGFLLIAFAAFRGRPFPSGFSAWISIALFALVDATS

Query:  FQGFLAQGLQRTSAGLGSVIIDSQPLTVAVLAALLFGESISLIGAAGLLLGVLGLLLLEVPSLALDASTFSLWGSGEWWMFLAAQSMAVGTVMVRWVSKY
        FQGFLAQGLQRTSAGLGSVIIDSQPLTVAVLAA LFGESI L+GAAGL+LGVLGLLLLEVPSL LDAS+FSLWGSGEWWMFLAAQSMAVGTVMVRWVSKY
Subjt:  FQGFLAQGLQRTSAGLGSVIIDSQPLTVAVLAALLFGESISLIGAAGLLLGVLGLLLLEVPSLALDASTFSLWGSGEWWMFLAAQSMAVGTVMVRWVSKY

Query:  SDPIMATGWHMVIGGLPLLAICILNHDPAISGSLKDFTTNDILALLYASIFGSAISYGSFFYSATKGSLTKLSSLTFLTPMFASVFGFLYLGETFSPIQL
        SDPIMATGWHMVIGGLPLLAIC LNH+PA+SGSL+DFTTNDILAL YASIFGSA+SYGSFFYSATKGSLTKLSSLTFLTPMFAS+FGFLYLGETFSPIQL
Subjt:  SDPIMATGWHMVIGGLPLLAICILNHDPAISGSLKDFTTNDILALLYASIFGSAISYGSFFYSATKGSLTKLSSLTFLTPMFASVFGFLYLGETFSPIQL

Query:  VGAVVTVVAIYVVNYGSSVE
        VGAVVTVV+IYVVNYGSS+E
Subjt:  VGAVVTVVAIYVVNYGSSVE

A0A6J1JVE3 WAT1-related protein At3g02690, chloroplastic | 5.9e-181 | 84.02
Query:  TPSNSRPLLHFSFSRQLLLASPNSSPAPIV-HRRVGFHFSSTGRYGHNDWFRADYVPIPVVNCTRSGADTELDLSESVDCVGTAQDVECVVSPADED-PR
        +PSN RPL  FSF RQ ++    SS API+  RRV F      RY     FR DYV IPV NCTRSGADT+LD +ES+DCVGTAQDVECVVS  DE+   
Subjt:  TPSNSRPLLHFSFSRQLLLASPNSSPAPIV-HRRVGFHFSSTGRYGHNDWFRADYVPIPVVNCTRSGADTELDLSESVDCVGTAQDVECVVSPADED-PR

Query:  SSIADVDGDGSLAVLGKAWEFAVLVSPFFFWGTAMVAMKEVLPRSGPFFVSAFRLIPAGFLLIAFAAFRGRPFPSGFSAWISIALFALVDATSFQGFLAQ
        SS  D DGDGS+AVL KAWEFAVLVSPFFFWGTAMVAMKEVLPRSGPFFVSAFRL+PAGFLLIAFAAFRGRPFPSGFSAWISI LFALVDAT FQGFLAQ
Subjt:  SSIADVDGDGSLAVLGKAWEFAVLVSPFFFWGTAMVAMKEVLPRSGPFFVSAFRLIPAGFLLIAFAAFRGRPFPSGFSAWISIALFALVDATSFQGFLAQ

Query:  GLQRTSAGLGSVIIDSQPLTVAVLAALLFGESISLIGAAGLLLGVLGLLLLEVPSLALDASTFSLWGSGEWWMFLAAQSMAVGTVMVRWVSKYSDPIMAT
        GLQRTSAGLGSVIIDSQPLTVAVLAA LFGESI L+GAAGL+LGVLGL+LLEVPSL LDAS+FSLWGSGEWWMFLAAQSMAVGTVMVRWVSKYSDPIMAT
Subjt:  GLQRTSAGLGSVIIDSQPLTVAVLAALLFGESISLIGAAGLLLGVLGLLLLEVPSLALDASTFSLWGSGEWWMFLAAQSMAVGTVMVRWVSKYSDPIMAT

Query:  GWHMVIGGLPLLAICILNHDPAISGSLKDFTTNDILALLYASIFGSAISYGSFFYSATKGSLTKLSSLTFLTPMFASVFGFLYLGETFSPIQLVGAVVTV
        GWHMVIGGLPLL IC LNH+PA+SGSL+DFTTNDILAL YASIFGSA+SYGSFFYSATKGSLTKLSSLTFLTPMFAS+FGFLYLGETFSPIQLVGAVVTV
Subjt:  GWHMVIGGLPLLAICILNHDPAISGSLKDFTTNDILALLYASIFGSAISYGSFFYSATKGSLTKLSSLTFLTPMFASVFGFLYLGETFSPIQLVGAVVTV

Query:  VAIYVVNYGSSVE
        V+IYVVNYGSS+E
Subjt:  VAIYVVNYGSSVE

SwissProt top hits | e value | % identity
O29740 Uncharacterized transporter AF_0510 | 1.8e-09 | 25.87
Query:  VLVSPFFFWGTAMVAMKEVLPRSGPFFVSAFRLIPAGFLLIAFAAF-RGRPFPSGFSAWISIALFALVDATSFQGFLAQGLQRTSAGLGSVIIDSQPLTV
        +L      W  + + +K  L    PF ++ +R + A  LL+A+  + RG   PSG S W+ +++ AL   T    F    L+ T+A   S++I++  + V
Subjt:  VLVSPFFFWGTAMVAMKEVLPRSGPFFVSAFRLIPAGFLLIAFAAF-RGRPFPSGFSAWISIALFALVDATSFQGFLAQGLQRTSAGLGSVIIDSQPLTV

Query:  AVLAALLFGESISLIGAAGLLLGVLGLLLLEVPSLALDASTFSLWGSGEWWMFLAAQSMAVGTVMVRWVSKYSDPIMATGWHMVIGGLPLLAICILNHDP
        A L  L+ GE+ +    AG+ L   G++L+         S+ +++  G+  M +     AV TV+   +    D    T +   +G + L+   ++    
Subjt:  AVLAALLFGESISLIGAAGLLLGVLGLLLLEVPSLALDASTFSLWGSGEWWMFLAAQSMAVGTVMVRWVSKYSDPIMATGWHMVIGGLPLLAICILNHDP

Query:  AISGSLKDFTTN--DILALLYASIFGSAISYGSFFYSATKGSLTKLSSLTFLTPMFASVFGFLYLGETFSPIQLVGAVVTVVAIYV
          SG     T N   + ALLY SI  S  +Y  ++Y+ T    T ++   +L P+F ++F F  L E       +G ++T+  +Y+
Subjt:  AISGSLKDFTTN--DILALLYASIFGSAISYGSFFYSATKGSLTKLSSLTFLTPMFASVFGFLYLGETFSPIQLVGAVVTVVAIYV

O32256 Uncharacterized transporter YvbV | 1.7e-12 | 27.92
Query:  WGTAMVAMKEVLPRSGPFFVSAFRLIPAGFLLIAFAAFRGRPFPSGFSAWISIALFALVDATSFQGFLAQGLQRTSAGLGSVIIDSQPLTVAVLAALLFG
        WG      K  L  S P   +  R +  G LL+  A  R          W    + AL++ T F G    GL    AGL S I+  QP+ + V + L  G
Subjt:  WGTAMVAMKEVLPRSGPFFVSAFRLIPAGFLLIAFAAFRGRPFPSGFSAWISIALFALVDATSFQGFLAQGLQRTSAGLGSVIIDSQPLTVAVLAALLFG

Query:  ESISLIGAAGLLLGVLGLLLLEVPSLALDASTFSLWGSGEWWMFLAAQSMAVGTVMVRWVSKYSDPIMATGWHMVIGGLPLLAICILNHDPAISGSLKDF
        ES+ ++   GL+LG  G+ ++         S       G      +A S A+GTV ++      D I      + IG + LL           S S   +
Subjt:  ESISLIGAAGLLLGVLGLLLLEVPSLALDASTFSLWGSGEWWMFLAAQSMAVGTVMVRWVSKYSDPIMATGWHMVIGGLPLLAICILNHDPAISGSLKDF

Query:  TTNDILALLYASIFGSAISYGSFFYSATKGSLTKLSSLTFLTPMFASVFGFLYLGETFSPIQLVGAVVTVVAIYVVNYGSSVE
        T   I +LL+ S+F  A+ +  FF     G  +K++S TFL P+ + V   ++L E  +   L G ++ V +I +VN  S  +
Subjt:  TTNDILALLYASIFGSAISYGSFFYSATKGSLTKLSSLTFLTPMFASVFGFLYLGETFSPIQLVGAVVTVVAIYVVNYGSSVE

P42194 Protein PecM | 5.1e-12 | 28.98
Query:  WGTAMVAMKEVLPRSGPFFVSAFRLIPAGFLLIAFAAFRGRPFPSGFSAWISIALFALVDATSFQGFLAQGLQRTSAGLGSVIIDSQPLTVAVLAALLFG
        WGT      + LP   P   +  R +PAG +LI      G+  P     W    L AL +   F   L     R   G+ +++   QPL V +L+ LL  
Subjt:  WGTAMVAMKEVLPRSGPFFVSAFRLIPAGFLLIAFAAFRGRPFPSGFSAWISIALFALVDATSFQGFLAQGLQRTSAGLGSVIIDSQPLTVAVLAALLFG

Query:  ESISLIGAAGLLLGVLGL-LLLEVPSLALDASTFSLWGSGEWWMFLAAQSMAVGTVMVRWVSKYSDP-----IMATGWHMVIGGLPLLAICILNHDPAIS
        + +        + G +G+ LL+ +P   L+        +G     LA  SMA G V+ +   K+  P     +  TGW +  GGL +L + +L       
Subjt:  ESISLIGAAGLLLGVLGL-LLLEVPSLALDASTFSLWGSGEWWMFLAAQSMAVGTVMVRWVSKYSDP-----IMATGWHMVIGGLPLLAICILNHDPAIS

Query:  GSLKDFTT-NDILALLYASIFGSAISYGSFFYSATKGSLTKLSSLTFLTPMFASVFGFLYLGETFSPIQLVGAVVTVVAIYVV
          L D  T  ++   LY +I GS ++Y  +F      S   +S L FL+P+ A + GFL+L +  S  QLVG V    A+ +V
Subjt:  GSLKDFTT-NDILALLYASIFGSAISYGSFFYSATKGSLTKLSSLTFLTPMFASVFGFLYLGETFSPIQLVGAVVTVVAIYVV

P74436 Uncharacterized transporter sll0355 | 7.0e-70 | 53.72
Query:  LVSPFFFWGTAMVAMKEVLPRSGPFFVSAFRLIPAGFLLIAFAAFRGRPFPSGFSAWISIALFALVDATSFQGFLAQGLQRTSAGLGSVIIDSQPLTVAV
        L++PFF WGTAMVAMK VL  + PFFV+  RLIPAG L++ +A  + RP P  +  W  I LFALVD T FQGFLAQGL+RT AGLGSVIIDSQP+ VA+
Subjt:  LVSPFFFWGTAMVAMKEVLPRSGPFFVSAFRLIPAGFLLIAFAAFRGRPFPSGFSAWISIALFALVDATSFQGFLAQGLQRTSAGLGSVIIDSQPLTVAV

Query:  LAALLFGESISLIGAAGLLLGVLGLLLLEVP-----------SLALDASTFSLWGSGEWWMFLAAQSMAVGTVMVRWVSKYSDPIMATGWHMVIGGLPLL
        L++ LF E I  IG  GLLLGV G+ L+ +P            L+++ S  +L  SGE WM LA+ SMAVGTV++ +VS+  DP++ATGWHM+IGGLPLL
Subjt:  LAALLFGESISLIGAAGLLLGVLGLLLLEVP-----------SLALDASTFSLWGSGEWWMFLAAQSMAVGTVMVRWVSKYSDPIMATGWHMVIGGLPLL

Query:  AICIL-NHDPAISGSLKDFTTNDILALLYASIFGSAISYGSFFYSATKGSLTKLSSLTFLTPMFASVFGFLYLGETFSPIQLVGAVVTVVAIYVVN
        AI ++ + +P  +  L  +       L YA++FGSAI+YG FFY A+KG+LT LSSLTFLTP+FA  F  L L E  S +Q +G   T+V+IY++N
Subjt:  AICIL-NHDPAISGSLKDFTTNDILALLYASIFGSAISYGSFFYSATKGSLTKLSSLTFLTPMFASVFGFLYLGETFSPIQLVGAVVTVVAIYVVN

Q93V85 WAT1-related protein At3g02690, chloroplastic | 4.3e-136 | 64.13
Query:  WP-----SASSSLPTPSNSRPLLHFSFSRQLLLASPNSSPAPIVHRRVGFHFSSTGRYGHNDWFRADYVPIPVVNCTRSGADTE-LDLSESVDCVGTAQD
        WP     ++SSS  +   + P    S +R+  L+  N+S   + H R     +   R         D V         S  +TE    S SVDCVG   D
Subjt:  WP-----SASSSLPTPSNSRPLLHFSFSRQLLLASPNSSPAPIVHRRVGFHFSSTGRYGHNDWFRADYVPIPVVNCTRSGADTE-LDLSESVDCVGTAQD

Query:  VECVVSPADEDPRSSIADVDGDGSLAVLGKAWEFAVLVSPFFFWGTAMVAMKEVLPRSGPFFVSAFRLIPAGFLLIAFAAFRGRPFPSGFSAWISIALFA
        VECV +  DE+ RSS     G+G+        E+ VL+SPFFFWGTAMVAMKEVLP +GPFFV+AFRLIPAG LL+AFA ++GRP P G +AW SIALFA
Subjt:  VECVVSPADEDPRSSIADVDGDGSLAVLGKAWEFAVLVSPFFFWGTAMVAMKEVLPRSGPFFVSAFRLIPAGFLLIAFAAFRGRPFPSGFSAWISIALFA

Query:  LVDATSFQGFLAQGLQRTSAGLGSVIIDSQPLTVAVLAALLFGESISLIGAAGLLLGVLGLLLLEVPSLALDASTFSLWGSGEWWMFLAAQSMAVGTVMV
        LVDAT FQGFLAQGLQRTSAGLGSVIIDSQPLTVAVLA+ LFGESI ++ A GLLLGV GLLLLEVPS+  D + FSLWGSGEWWM LAAQSMA+GTVMV
Subjt:  LVDATSFQGFLAQGLQRTSAGLGSVIIDSQPLTVAVLAALLFGESISLIGAAGLLLGVLGLLLLEVPSLALDASTFSLWGSGEWWMFLAAQSMAVGTVMV

Query:  RWVSKYSDPIMATGWHMVIGGLPLLAICILNHDPAISGSLKDFTTNDILALLYASIFGSAISYGSFFYSATKGSLTKLSSLTFLTPMFASVFGFLYLGET
        RWVSKYSDPIMATGWHMVIGGLPLLAI ++NHDP  +GSL+D +TND++ALLY SIFGSA+SYG +FYSATKGSLTKLSSLTFLTPMFAS+FG+LYL ET
Subjt:  RWVSKYSDPIMATGWHMVIGGLPLLAICILNHDPAISGSLKDFTTNDILALLYASIFGSAISYGSFFYSATKGSLTKLSSLTFLTPMFASVFGFLYLGET

Query:  FSPIQLVGAVVTVVAIYVVNY
        FS +QLVGA VT+VAIY+VN+
Subjt:  FSPIQLVGAVVTVVAIYVVNY

Arabidopsis top hits | e value | % identity
AT3G02690.1 nodulin MtN21 /EamA-like transporter family protein | 3.0e-137 | 64.13
Query:  WP-----SASSSLPTPSNSRPLLHFSFSRQLLLASPNSSPAPIVHRRVGFHFSSTGRYGHNDWFRADYVPIPVVNCTRSGADTE-LDLSESVDCVGTAQD
        WP     ++SSS  +   + P    S +R+  L+  N+S   + H R     +   R         D V         S  +TE    S SVDCVG   D
Subjt:  WP-----SASSSLPTPSNSRPLLHFSFSRQLLLASPNSSPAPIVHRRVGFHFSSTGRYGHNDWFRADYVPIPVVNCTRSGADTE-LDLSESVDCVGTAQD

Query:  VECVVSPADEDPRSSIADVDGDGSLAVLGKAWEFAVLVSPFFFWGTAMVAMKEVLPRSGPFFVSAFRLIPAGFLLIAFAAFRGRPFPSGFSAWISIALFA
        VECV +  DE+ RSS     G+G+        E+ VL+SPFFFWGTAMVAMKEVLP +GPFFV+AFRLIPAG LL+AFA ++GRP P G +AW SIALFA
Subjt:  VECVVSPADEDPRSSIADVDGDGSLAVLGKAWEFAVLVSPFFFWGTAMVAMKEVLPRSGPFFVSAFRLIPAGFLLIAFAAFRGRPFPSGFSAWISIALFA

Query:  LVDATSFQGFLAQGLQRTSAGLGSVIIDSQPLTVAVLAALLFGESISLIGAAGLLLGVLGLLLLEVPSLALDASTFSLWGSGEWWMFLAAQSMAVGTVMV
        LVDAT FQGFLAQGLQRTSAGLGSVIIDSQPLTVAVLA+ LFGESI ++ A GLLLGV GLLLLEVPS+  D + FSLWGSGEWWM LAAQSMA+GTVMV
Subjt:  LVDATSFQGFLAQGLQRTSAGLGSVIIDSQPLTVAVLAALLFGESISLIGAAGLLLGVLGLLLLEVPSLALDASTFSLWGSGEWWMFLAAQSMAVGTVMV

Query:  RWVSKYSDPIMATGWHMVIGGLPLLAICILNHDPAISGSLKDFTTNDILALLYASIFGSAISYGSFFYSATKGSLTKLSSLTFLTPMFASVFGFLYLGET
        RWVSKYSDPIMATGWHMVIGGLPLLAI ++NHDP  +GSL+D +TND++ALLY SIFGSA+SYG +FYSATKGSLTKLSSLTFLTPMFAS+FG+LYL ET
Subjt:  RWVSKYSDPIMATGWHMVIGGLPLLAICILNHDPAISGSLKDFTTNDILALLYASIFGSAISYGSFFYSATKGSLTKLSSLTFLTPMFASVFGFLYLGET

Query:  FSPIQLVGAVVTVVAIYVVNY
        FS +QLVGA VT+VAIY+VN+
Subjt:  FSPIQLVGAVVTVVAIYVVNY


Sequences
CDS sequence
ATGGCGTTGGCTACGGCGGTGTCCTCGCCGCCAAATGCATTCCTTCTTCAACCACAGCGAGTAACTGTAGCAGAGAGAACAAAAATGGCGGGGTTTTGGCCCTCCGCCTC
TTCCTCTTTACCAACTCCATCCAACTCCCGCCCTCTTCTTCATTTCAGTTTCAGCAGACAACTTCTTCTTGCATCGCCCAATTCCTCCCCTGCTCCAATCGTTCACCGGC
GAGTAGGGTTTCACTTCAGCAGCACCGGAAGGTACGGCCACAATGACTGGTTCCGTGCCGATTATGTACCAATTCCCGTGGTGAATTGCACCAGAAGTGGGGCAGATACT
GAATTGGATTTGTCGGAGTCCGTTGATTGCGTGGGGACTGCCCAGGATGTGGAGTGCGTTGTTTCCCCGGCCGACGAAGATCCTCGATCTTCGATTGCCGATGTCGATGG
TGATGGCTCTTTGGCAGTCTTGGGGAAAGCCTGGGAGTTCGCGGTGTTGGTGTCGCCATTTTTCTTCTGGGGTACGGCTATGGTGGCCATGAAGGAGGTGCTTCCGAGGT
CTGGTCCTTTCTTTGTTTCCGCCTTTCGTCTTATACCTGCCGGTTTCCTTTTGATTGCCTTTGCTGCTTTCCGCGGTCGCCCCTTTCCCTCTGGTTTTTCTGCTTGGATT
TCCATCGCTCTCTTTGCTCTAGTCGACGCTACATCATTTCAGGGCTTTCTTGCTCAAGGCTTGCAGAGGACATCCGCAGGCTTGGGCAGTGTAATAATTGATTCTCAACC
ATTAACCGTAGCGGTGCTTGCAGCCTTATTATTTGGTGAATCCATCAGTTTGATTGGAGCCGCTGGACTTTTACTTGGTGTTTTAGGACTTTTGCTTCTTGAGGTTCCTT
CACTTGCTTTAGATGCAAGTACCTTTTCGTTATGGGGAAGTGGAGAGTGGTGGATGTTTCTTGCTGCACAGAGCATGGCCGTGGGTACTGTCATGGTCCGCTGGGTTTCC
AAGTACTCTGATCCTATTATGGCAACTGGATGGCACATGGTGATTGGTGGTCTCCCGCTTTTGGCGATCTGTATCCTTAATCATGATCCTGCCATCAGTGGGAGTCTGAA
GGATTTTACAACAAATGATATACTGGCACTCCTTTATGCATCAATTTTTGGAAGTGCTATTAGCTACGGTTCATTCTTCTATAGTGCAACAAAAGGTAGTTTGACAAAGC
TCAGCTCTCTCACCTTTCTCACTCCAATGTTTGCTTCAGTTTTTGGGTTTCTATATTTGGGAGAGACATTCTCACCTATTCAACTGGTTGGAGCTGTTGTTACCGTGGTC
GCTATATACGTGGTCAACTACGGCAGTAGTGTGGAATGA
mRNA sequence
ATGGCGTTGGCTACGGCGGTGTCCTCGCCGCCAAATGCATTCCTTCTTCAACCACAGCGAGTAACTGTAGCAGAGAGAACAAAAATGGCGGGGTTTTGGCCCTCCGCCTC
TTCCTCTTTACCAACTCCATCCAACTCCCGCCCTCTTCTTCATTTCAGTTTCAGCAGACAACTTCTTCTTGCATCGCCCAATTCCTCCCCTGCTCCAATCGTTCACCGGC
GAGTAGGGTTTCACTTCAGCAGCACCGGAAGGTACGGCCACAATGACTGGTTCCGTGCCGATTATGTACCAATTCCCGTGGTGAATTGCACCAGAAGTGGGGCAGATACT
GAATTGGATTTGTCGGAGTCCGTTGATTGCGTGGGGACTGCCCAGGATGTGGAGTGCGTTGTTTCCCCGGCCGACGAAGATCCTCGATCTTCGATTGCCGATGTCGATGG
TGATGGCTCTTTGGCAGTCTTGGGGAAAGCCTGGGAGTTCGCGGTGTTGGTGTCGCCATTTTTCTTCTGGGGTACGGCTATGGTGGCCATGAAGGAGGTGCTTCCGAGGT
CTGGTCCTTTCTTTGTTTCCGCCTTTCGTCTTATACCTGCCGGTTTCCTTTTGATTGCCTTTGCTGCTTTCCGCGGTCGCCCCTTTCCCTCTGGTTTTTCTGCTTGGATT
TCCATCGCTCTCTTTGCTCTAGTCGACGCTACATCATTTCAGGGCTTTCTTGCTCAAGGCTTGCAGAGGACATCCGCAGGCTTGGGCAGTGTAATAATTGATTCTCAACC
ATTAACCGTAGCGGTGCTTGCAGCCTTATTATTTGGTGAATCCATCAGTTTGATTGGAGCCGCTGGACTTTTACTTGGTGTTTTAGGACTTTTGCTTCTTGAGGTTCCTT
CACTTGCTTTAGATGCAAGTACCTTTTCGTTATGGGGAAGTGGAGAGTGGTGGATGTTTCTTGCTGCACAGAGCATGGCCGTGGGTACTGTCATGGTCCGCTGGGTTTCC
AAGTACTCTGATCCTATTATGGCAACTGGATGGCACATGGTGATTGGTGGTCTCCCGCTTTTGGCGATCTGTATCCTTAATCATGATCCTGCCATCAGTGGGAGTCTGAA
GGATTTTACAACAAATGATATACTGGCACTCCTTTATGCATCAATTTTTGGAAGTGCTATTAGCTACGGTTCATTCTTCTATAGTGCAACAAAAGGTAGTTTGACAAAGC
TCAGCTCTCTCACCTTTCTCACTCCAATGTTTGCTTCAGTTTTTGGGTTTCTATATTTGGGAGAGACATTCTCACCTATTCAACTGGTTGGAGCTGTTGTTACCGTGGTC
GCTATATACGTGGTCAACTACGGCAGTAGTGTGGAATGA
Protein sequence
MALATAVSSPPNAFLLQPQRVTVAERTKMAGFWPSASSSLPTPSNSRPLLHFSFSRQLLLASPNSSPAPIVHRRVGFHFSSTGRYGHNDWFRADYVPIPVVNCTRSGADT
ELDLSESVDCVGTAQDVECVVSPADEDPRSSIADVDGDGSLAVLGKAWEFAVLVSPFFFWGTAMVAMKEVLPRSGPFFVSAFRLIPAGFLLIAFAAFRGRPFPSGFSAWI
SIALFALVDATSFQGFLAQGLQRTSAGLGSVIIDSQPLTVAVLAALLFGESISLIGAAGLLLGVLGLLLLEVPSLALDASTFSLWGSGEWWMFLAAQSMAVGTVMVRWVS
KYSDPIMATGWHMVIGGLPLLAICILNHDPAISGSLKDFTTNDILALLYASIFGSAISYGSFFYSATKGSLTKLSSLTFLTPMFASVFGFLYLGETFSPIQLVGAVVTVV
AIYVVNYGSSVE
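
The protein sequence can be cross-checked against the CDS by translating it. The sketch below assumes the standard nuclear genetic code (NCBI translation table 1) and an input made up only of A, C, G and T; `translate` is a hypothetical helper rather than anything CuGenDB provides. Only the first 30 nucleotides of the CDS are used here to keep the example short.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]  # KeyError on ambiguous bases such as N
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 nucleotides of the CDS listed above.
cds_start = "ATGGCGTTGGCTACGGCGGTGTCCTCGCCG"
print(translate(cds_start))
```

This prefix translates to `MALATAVSSP`, matching the first ten residues of the protein sequence. Passing the full CDS (with its line breaks) to `translate` should allow the same comparison over the whole record.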