CuGenDBv2

MS001693 (gene) of Bitter gourd (TR) v1 genome

Gene ID: MS001693
Organism: Momordica charantia cv. TR (Bitter gourd (TR) v1)
Description: DUF21 domain-containing protein
Genome location: scaffold571:289090..292966
RNA-Seq Expression: MS001693
Synteny: MS001693
Gene Ontology terms:
GO:0010960 - magnesium ion homeostasis (biological process)
GO:0016021 - integral component of membrane (cellular component)
InterPro domains:
IPR002550 - CNNM, transmembrane domain
IPR044751 - Ion transporter-like, CBS domain
IPR045095 - Ancient conserved domain protein family
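
The genome location in this record has the form `seqid:start..end`. As a minimal sketch, assuming the coordinates are 1-based and inclusive (the page does not state the convention), the string can be split and the gene span computed as follows. The helper name `parse_location` is hypothetical and not part of CuGenDB.

```python
import re

# Hypothetical helper for illustration; not a CuGenDB API.
LOCATION_RE = re.compile(r"^(?P<seqid>[^:]+):(?P<start>\d+)\.\.(?P<end>\d+)$")

def parse_location(location):
    """Split 'seqid:start..end' into (seqid, start, end, length)."""
    match = LOCATION_RE.match(location.strip())
    if match is None:
        raise ValueError(f"unrecognised location: {location!r}")
    start = int(match.group("start"))
    end = int(match.group("end"))
    # Assumption: coordinates are 1-based and inclusive.
    length = end - start + 1
    return match.group("seqid"), start, end, length

seqid, start, end, length = parse_location("scaffold571:289090..292966")
print(seqid, start, end, length)  # scaffold571 289090 292966 3877
```

Under a 0-based, half-open convention the length would instead be `end - start`, so the `+ 1` should be checked against the source annotation before reuse.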


Homology
GenBank top hits (e value, % identity, alignment)
KAG6575016.1 DUF21 domain-containing protein, partial [Cucurbita argyrosperma subsp. sororia] | e value: 6.0e-239 | % identity: 94.27
Query:  MHFINAVMLTRMLTRNSGLESDAGEIPFGSLLWFTYAGISCVFVLFAGIMSGLTLGLMSLGLVDLEILQRSGTPEEKKQAAAILPVVQKQHQLLVTLLLC
        MHF+NAVMLTRMLTRN GLES+AGEIPFGSLLW TYAGISCV VLFAGIMSGLTLGLMSLGLVDLEILQRSGT EEKKQAAAILPVVQKQHQLLVTLLLC
Subjt:  MHFINAVMLTRMLTRNSGLESDAGEIPFGSLLWFTYAGISCVFVLFAGIMSGLTLGLMSLGLVDLEILQRSGTPEEKKQAAAILPVVQKQHQLLVTLLLC

Query:  NAVAMEALPIYLDKLFNQYVAIILSVTFVLAFGEVIPQAICTRYGLAVGANFVCLVRILMVICYPIAYPIGKILDCLLGHNEALFRRAQLKALVSIHSLE
        NAVAMEALPIYLDKLFNQY+AIILSVTFVLAFGEVIPQAICTRYGLAVGANFVCLVR+LMVICYPIAYPIGK+LDCLLGHNEALFRRAQLKALVSIHSLE
Subjt:  NAVAMEALPIYLDKLFNQYVAIILSVTFVLAFGEVIPQAICTRYGLAVGANFVCLVRILMVICYPIAYPIGKILDCLLGHNEALFRRAQLKALVSIHSLE

Query:  AGKGGELTHDETTIISGALDLTEKTAEEAMTPIESTFSLDVNSKLDWEAMGKVLARGHSRVPVYSGNPKNIIGLLLVKSLLTVRPETETPVSAVSIRRIP
        AGKGGELTHDETTIISGALDLTEKTAEEAMTPIESTFSLDVNSKLDWEAMGK+LARGHSRVPVY+GNPKNIIGLLLVKSLLTVRPETETPVSAVSIRRIP
Subjt:  AGKGGELTHDETTIISGALDLTEKTAEEAMTPIESTFSLDVNSKLDWEAMGKVLARGHSRVPVYSGNPKNIIGLLLVKSLLTVRPETETPVSAVSIRRIP

Query:  RVPSDMPLYDILNEFQKGSSHMAAVVKVKGKNKTLLPTLDGEEFEENKVSGTESQLTTPLLHKHDENSDNVVVDIDRTSKNSGISRQSSYRRNDASSING
        RVPSDMPLYDILNEFQKGSSHMAAVVKVKGKNK L PTL GEE EENK+SG +SQLTTPLL KHDENSD+VV+DIDRTSK SG+SRQ SYRRNDASSING
Subjt:  RVPSDMPLYDILNEFQKGSSHMAAVVKVKGKNKTLLPTLDGEEFEENKVSGTESQLTTPLLHKHDENSDNVVVDIDRTSKNSGISRQSSYRRNDASSING

Query:  LNHSSEDIEDGEVIGIITLEDVFEELLQEEIVDETDEYVDVHKRIRVAAAAAASSMARAPSIRRLTAQKVA
        L+H SEDIEDGEVIGIITLEDVFEELLQEEIVDETDEYVDVHKRIRVAAAAAASSMARAPSIRRLTAQK A
Subjt:  LNHSSEDIEDGEVIGIITLEDVFEELLQEEIVDETDEYVDVHKRIRVAAAAAASSMARAPSIRRLTAQKVA

XP_004150321.1 DUF21 domain-containing protein At4g14240 [Cucumis sativus] | e value: 1.5e-242 | % identity: 96.38
Query:  MHFINAVMLTRMLTRNSGLESDAGEIPFGSLLWFTYAGISCVFVLFAGIMSGLTLGLMSLGLVDLEILQRSGTPEEKKQAAAILPVVQKQHQLLVTLLLC
        MHFINAVMLTRMLTRNSGL+SDAGEIPFGSLLWFTYAGISCVFVLFAGIMSGLTLGLMSLGLVDLEILQRSGT  EKKQAAAILPVVQKQHQLLVTLLLC
Subjt:  MHFINAVMLTRMLTRNSGLESDAGEIPFGSLLWFTYAGISCVFVLFAGIMSGLTLGLMSLGLVDLEILQRSGTPEEKKQAAAILPVVQKQHQLLVTLLLC

Query:  NAVAMEALPIYLDKLFNQYVAIILSVTFVLAFGEVIPQAICTRYGLAVGANFVCLVRILMVICYPIAYPIGKILDCLLGHNEALFRRAQLKALVSIHSLE
        NAVAMEALPIYLDKLFNQYVAIILSVTFVLAFGEVIPQAICTRYGLAVGANFVCLVRILMVICYPIAYPIGKILDCLLGHNEALFRRAQLKALVSIHSLE
Subjt:  NAVAMEALPIYLDKLFNQYVAIILSVTFVLAFGEVIPQAICTRYGLAVGANFVCLVRILMVICYPIAYPIGKILDCLLGHNEALFRRAQLKALVSIHSLE

Query:  AGKGGELTHDETTIISGALDLTEKTAEEAMTPIESTFSLDVNSKLDWEAMGKVLARGHSRVPVYSGNPKNIIGLLLVKSLLTVRPETETPVSAVSIRRIP
        AGKGGELTHDETTIISGALDLTEKTAEEAMTPIESTFSLDVNSKLDWEAMGK+LARGHSRVPVYSGNPKNIIGLLLVKSLLTVRPETETPVSAVSIRRIP
Subjt:  AGKGGELTHDETTIISGALDLTEKTAEEAMTPIESTFSLDVNSKLDWEAMGKVLARGHSRVPVYSGNPKNIIGLLLVKSLLTVRPETETPVSAVSIRRIP

Query:  RVPSDMPLYDILNEFQKGSSHMAAVVKVKGKNKTLLPTLDGEEFEENKVSGTESQLTTPLLHKHDENSDNVVVDIDRTSKNSGISRQSSYRRNDASSING
        RVPSDMPLYDILNEFQKGSSHMAAVVKVKGKNK L PTLDGEEFE+NK SGTESQLT PLL KHDENSD+VV+DIDRTSK S ISRQ SYRRNDASSING
Subjt:  RVPSDMPLYDILNEFQKGSSHMAAVVKVKGKNKTLLPTLDGEEFEENKVSGTESQLTTPLLHKHDENSDNVVVDIDRTSKNSGISRQSSYRRNDASSING

Query:  LNHSSEDIEDGEVIGIITLEDVFEELLQEEIVDETDEYVDVHKRIRVAAAAAASSMARAPSIRRLTAQK
         +HSSEDIEDGEVIGIITLEDVFEELLQEEIVDETDEYVDVHKRIRVAAAAAASSMARAPSIRRLTAQK
Subjt:  LNHSSEDIEDGEVIGIITLEDVFEELLQEEIVDETDEYVDVHKRIRVAAAAAASSMARAPSIRRLTAQK

XP_008458553.1 PREDICTED: DUF21 domain-containing protein At4g14240-like [Cucumis melo] | e value: 5.3e-243 | % identity: 96.38
Query:  MHFINAVMLTRMLTRNSGLESDAGEIPFGSLLWFTYAGISCVFVLFAGIMSGLTLGLMSLGLVDLEILQRSGTPEEKKQAAAILPVVQKQHQLLVTLLLC
        MHFINAVMLTRMLTR+SGLESDAGEIPFGSLLWFTYAGISCVFVLFAGIMSGLTLGLMSLGLVDLEILQRSGT  EKKQAAAILPVVQKQHQLLVTLLLC
Subjt:  MHFINAVMLTRMLTRNSGLESDAGEIPFGSLLWFTYAGISCVFVLFAGIMSGLTLGLMSLGLVDLEILQRSGTPEEKKQAAAILPVVQKQHQLLVTLLLC

Query:  NAVAMEALPIYLDKLFNQYVAIILSVTFVLAFGEVIPQAICTRYGLAVGANFVCLVRILMVICYPIAYPIGKILDCLLGHNEALFRRAQLKALVSIHSLE
        NAVAMEALPIYLDKLFNQYVAIILSVTFVLAFGEVIPQAICTRYGLAVGANFVCLVRILMVICYPIAYPIGKILDCLLGHNEALFRRAQLKALVSIHSLE
Subjt:  NAVAMEALPIYLDKLFNQYVAIILSVTFVLAFGEVIPQAICTRYGLAVGANFVCLVRILMVICYPIAYPIGKILDCLLGHNEALFRRAQLKALVSIHSLE

Query:  AGKGGELTHDETTIISGALDLTEKTAEEAMTPIESTFSLDVNSKLDWEAMGKVLARGHSRVPVYSGNPKNIIGLLLVKSLLTVRPETETPVSAVSIRRIP
        AGKGGELTHDETTIISGALDLTEKTAEEAMTPIESTFSLDVNSKLDWEAMGK+LARGHSRVPVYSGNPKNIIGLLLVKSLLTVRPETETPVSAVSIRRIP
Subjt:  AGKGGELTHDETTIISGALDLTEKTAEEAMTPIESTFSLDVNSKLDWEAMGKVLARGHSRVPVYSGNPKNIIGLLLVKSLLTVRPETETPVSAVSIRRIP

Query:  RVPSDMPLYDILNEFQKGSSHMAAVVKVKGKNKTLLPTLDGEEFEENKVSGTESQLTTPLLHKHDENSDNVVVDIDRTSKNSGISRQSSYRRNDASSING
        RVPSDMPLYDILNEFQKGSSHMAAVVKVKGK+K L PTLDGEEFE+NK SG ESQLT PLL KHDENSD+VV+DIDRTSK SGISRQ SYRRNDASSING
Subjt:  RVPSDMPLYDILNEFQKGSSHMAAVVKVKGKNKTLLPTLDGEEFEENKVSGTESQLTTPLLHKHDENSDNVVVDIDRTSKNSGISRQSSYRRNDASSING

Query:  LNHSSEDIEDGEVIGIITLEDVFEELLQEEIVDETDEYVDVHKRIRVAAAAAASSMARAPSIRRLTAQK
        L+HSSEDIEDGEVIGIITLEDVFEELLQEEIVDETDEYVDVHKRIRVAAAAAASSMARAPSIRRLTAQK
Subjt:  LNHSSEDIEDGEVIGIITLEDVFEELLQEEIVDETDEYVDVHKRIRVAAAAAASSMARAPSIRRLTAQK

XP_022138663.1 DUF21 domain-containing protein At4g14240-like [Momordica charantia] | e value: 4.3e-253 | % identity: 100
Query:  MHFINAVMLTRMLTRNSGLESDAGEIPFGSLLWFTYAGISCVFVLFAGIMSGLTLGLMSLGLVDLEILQRSGTPEEKKQAAAILPVVQKQHQLLVTLLLC
        MHFINAVMLTRMLTRNSGLESDAGEIPFGSLLWFTYAGISCVFVLFAGIMSGLTLGLMSLGLVDLEILQRSGTPEEKKQAAAILPVVQKQHQLLVTLLLC
Subjt:  MHFINAVMLTRMLTRNSGLESDAGEIPFGSLLWFTYAGISCVFVLFAGIMSGLTLGLMSLGLVDLEILQRSGTPEEKKQAAAILPVVQKQHQLLVTLLLC

Query:  NAVAMEALPIYLDKLFNQYVAIILSVTFVLAFGEVIPQAICTRYGLAVGANFVCLVRILMVICYPIAYPIGKILDCLLGHNEALFRRAQLKALVSIHSLE
        NAVAMEALPIYLDKLFNQYVAIILSVTFVLAFGEVIPQAICTRYGLAVGANFVCLVRILMVICYPIAYPIGKILDCLLGHNEALFRRAQLKALVSIHSLE
Subjt:  NAVAMEALPIYLDKLFNQYVAIILSVTFVLAFGEVIPQAICTRYGLAVGANFVCLVRILMVICYPIAYPIGKILDCLLGHNEALFRRAQLKALVSIHSLE

Query:  AGKGGELTHDETTIISGALDLTEKTAEEAMTPIESTFSLDVNSKLDWEAMGKVLARGHSRVPVYSGNPKNIIGLLLVKSLLTVRPETETPVSAVSIRRIP
        AGKGGELTHDETTIISGALDLTEKTAEEAMTPIESTFSLDVNSKLDWEAMGKVLARGHSRVPVYSGNPKNIIGLLLVKSLLTVRPETETPVSAVSIRRIP
Subjt:  AGKGGELTHDETTIISGALDLTEKTAEEAMTPIESTFSLDVNSKLDWEAMGKVLARGHSRVPVYSGNPKNIIGLLLVKSLLTVRPETETPVSAVSIRRIP

Query:  RVPSDMPLYDILNEFQKGSSHMAAVVKVKGKNKTLLPTLDGEEFEENKVSGTESQLTTPLLHKHDENSDNVVVDIDRTSKNSGISRQSSYRRNDASSING
        RVPSDMPLYDILNEFQKGSSHMAAVVKVKGKNKTLLPTLDGEEFEENKVSGTESQLTTPLLHKHDENSDNVVVDIDRTSKNSGISRQSSYRRNDASSING
Subjt:  RVPSDMPLYDILNEFQKGSSHMAAVVKVKGKNKTLLPTLDGEEFEENKVSGTESQLTTPLLHKHDENSDNVVVDIDRTSKNSGISRQSSYRRNDASSING

Query:  LNHSSEDIEDGEVIGIITLEDVFEELLQEEIVDETDEYVDVHKRIRVAAAAAASSMARAPSIRRLTAQK
        LNHSSEDIEDGEVIGIITLEDVFEELLQEEIVDETDEYVDVHKRIRVAAAAAASSMARAPSIRRLTAQK
Subjt:  LNHSSEDIEDGEVIGIITLEDVFEELLQEEIVDETDEYVDVHKRIRVAAAAAASSMARAPSIRRLTAQK

XP_038875376.1 DUF21 domain-containing protein At4g14240-like [Benincasa hispida] | e value: 1.8e-243 | % identity: 96.18
Query:  MHFINAVMLTRMLTRNSGLESDAGEIPFGSLLWFTYAGISCVFVLFAGIMSGLTLGLMSLGLVDLEILQRSGTPEEKKQAAAILPVVQKQHQLLVTLLLC
        MHF+NAVMLTRMLTRNSGLES+AG IPFGSLLWFTYAGISCVFVLFAGIMSGLTLGLMSLGLVDLEILQRSGTP EKKQAAAILPVVQKQHQLLVTLLLC
Subjt:  MHFINAVMLTRMLTRNSGLESDAGEIPFGSLLWFTYAGISCVFVLFAGIMSGLTLGLMSLGLVDLEILQRSGTPEEKKQAAAILPVVQKQHQLLVTLLLC

Query:  NAVAMEALPIYLDKLFNQYVAIILSVTFVLAFGEVIPQAICTRYGLAVGANFVCLVRILMVICYPIAYPIGKILDCLLGHNEALFRRAQLKALVSIHSLE
        NA+AMEALPIYLDKLFNQYVAIILSVTFVLAFGEVIPQAICTRYGLAVGANFVCLVRILMVICYPIAYPIGKILDCLLGHNEALFRRAQLKALVSIHSLE
Subjt:  NAVAMEALPIYLDKLFNQYVAIILSVTFVLAFGEVIPQAICTRYGLAVGANFVCLVRILMVICYPIAYPIGKILDCLLGHNEALFRRAQLKALVSIHSLE

Query:  AGKGGELTHDETTIISGALDLTEKTAEEAMTPIESTFSLDVNSKLDWEAMGKVLARGHSRVPVYSGNPKNIIGLLLVKSLLTVRPETETPVSAVSIRRIP
        AGKGGELTHDETTIISGALDLTEKTAEEAMTPIESTFSLDVNSKLDWEAMGK+LARGHSRVPVYSGNPKNIIGLLLVKSLLTVRPETETPVSAVSIRRIP
Subjt:  AGKGGELTHDETTIISGALDLTEKTAEEAMTPIESTFSLDVNSKLDWEAMGKVLARGHSRVPVYSGNPKNIIGLLLVKSLLTVRPETETPVSAVSIRRIP

Query:  RVPSDMPLYDILNEFQKGSSHMAAVVKVKGKNKTLLPTLDGEEFEENKVSGTESQLTTPLLHKHDENSDNVVVDIDRTSKNSGISRQSSYRRNDASSING
        RVPSDMPLYDILNEFQKGSSHMAAVVKVKGK+K L PTLDGEE EENKVSGTESQLT PLL KHDENSD+VV+DIDRTSK SGISRQ SYRRNDASSING
Subjt:  RVPSDMPLYDILNEFQKGSSHMAAVVKVKGKNKTLLPTLDGEEFEENKVSGTESQLTTPLLHKHDENSDNVVVDIDRTSKNSGISRQSSYRRNDASSING

Query:  LNHSSEDIEDGEVIGIITLEDVFEELLQEEIVDETDEYVDVHKRIRVAAAAAASSMARAPSIRRLTAQKVA
        L+HSSEDIEDGEVIGIITLEDVFEELLQEEIVDETDEYVDVHKRIRVAAAAAASSMARAPSIRRLTAQK A
Subjt:  LNHSSEDIEDGEVIGIITLEDVFEELLQEEIVDETDEYVDVHKRIRVAAAAAASSMARAPSIRRLTAQKVA
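
In the alignments above, the unlabeled middle row marks each column: a letter for an identical residue, `+` for a conservative substitution, and a space for a mismatch or gap. As a rough sketch, assuming that BLAST-style convention applies here and that % identity is identical columns divided by aligned columns (the page does not say how its values were computed), one alignment block can be tallied as below. The name `midline_stats` is hypothetical.

```python
def midline_stats(query, midline):
    """Count identities and positives in one alignment block."""
    # Trailing spaces of the middle row are often lost in copied text,
    # so pad (or trim) it to the length of the query row.
    midline = midline.ljust(len(query))[:len(query)]
    identities = sum(ch.isalpha() for ch in midline)
    positives = identities + midline.count("+")
    return identities, positives, len(query)

# First ten columns of the KAG6575016.1 alignment shown above.
ident, pos, length = midline_stats("MHFINAVMLT", "MHF+NAVMLT")
print(ident, pos, length, round(100 * ident / length, 2))  # 9 10 10 90.0
```

Summing the counts over all blocks of a hit before dividing gives a per-hit figure that can be compared with the listed % identity.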

TrEMBL top hits (e value, % identity, alignment)
A0A0A0KFQ9 Uncharacterized protein | e value: 7.4e-243 | % identity: 96.38
Query:  MHFINAVMLTRMLTRNSGLESDAGEIPFGSLLWFTYAGISCVFVLFAGIMSGLTLGLMSLGLVDLEILQRSGTPEEKKQAAAILPVVQKQHQLLVTLLLC
        MHFINAVMLTRMLTRNSGL+SDAGEIPFGSLLWFTYAGISCVFVLFAGIMSGLTLGLMSLGLVDLEILQRSGT  EKKQAAAILPVVQKQHQLLVTLLLC
Subjt:  MHFINAVMLTRMLTRNSGLESDAGEIPFGSLLWFTYAGISCVFVLFAGIMSGLTLGLMSLGLVDLEILQRSGTPEEKKQAAAILPVVQKQHQLLVTLLLC

Query:  NAVAMEALPIYLDKLFNQYVAIILSVTFVLAFGEVIPQAICTRYGLAVGANFVCLVRILMVICYPIAYPIGKILDCLLGHNEALFRRAQLKALVSIHSLE
        NAVAMEALPIYLDKLFNQYVAIILSVTFVLAFGEVIPQAICTRYGLAVGANFVCLVRILMVICYPIAYPIGKILDCLLGHNEALFRRAQLKALVSIHSLE
Subjt:  NAVAMEALPIYLDKLFNQYVAIILSVTFVLAFGEVIPQAICTRYGLAVGANFVCLVRILMVICYPIAYPIGKILDCLLGHNEALFRRAQLKALVSIHSLE

Query:  AGKGGELTHDETTIISGALDLTEKTAEEAMTPIESTFSLDVNSKLDWEAMGKVLARGHSRVPVYSGNPKNIIGLLLVKSLLTVRPETETPVSAVSIRRIP
        AGKGGELTHDETTIISGALDLTEKTAEEAMTPIESTFSLDVNSKLDWEAMGK+LARGHSRVPVYSGNPKNIIGLLLVKSLLTVRPETETPVSAVSIRRIP
Subjt:  AGKGGELTHDETTIISGALDLTEKTAEEAMTPIESTFSLDVNSKLDWEAMGKVLARGHSRVPVYSGNPKNIIGLLLVKSLLTVRPETETPVSAVSIRRIP

Query:  RVPSDMPLYDILNEFQKGSSHMAAVVKVKGKNKTLLPTLDGEEFEENKVSGTESQLTTPLLHKHDENSDNVVVDIDRTSKNSGISRQSSYRRNDASSING
        RVPSDMPLYDILNEFQKGSSHMAAVVKVKGKNK L PTLDGEEFE+NK SGTESQLT PLL KHDENSD+VV+DIDRTSK S ISRQ SYRRNDASSING
Subjt:  RVPSDMPLYDILNEFQKGSSHMAAVVKVKGKNKTLLPTLDGEEFEENKVSGTESQLTTPLLHKHDENSDNVVVDIDRTSKNSGISRQSSYRRNDASSING

Query:  LNHSSEDIEDGEVIGIITLEDVFEELLQEEIVDETDEYVDVHKRIRVAAAAAASSMARAPSIRRLTAQK
         +HSSEDIEDGEVIGIITLEDVFEELLQEEIVDETDEYVDVHKRIRVAAAAAASSMARAPSIRRLTAQK
Subjt:  LNHSSEDIEDGEVIGIITLEDVFEELLQEEIVDETDEYVDVHKRIRVAAAAAASSMARAPSIRRLTAQK

A0A1S3C882 DUF21 domain-containing protein At4g14240-like | e value: 2.6e-243 | % identity: 96.38
Query:  MHFINAVMLTRMLTRNSGLESDAGEIPFGSLLWFTYAGISCVFVLFAGIMSGLTLGLMSLGLVDLEILQRSGTPEEKKQAAAILPVVQKQHQLLVTLLLC
        MHFINAVMLTRMLTR+SGLESDAGEIPFGSLLWFTYAGISCVFVLFAGIMSGLTLGLMSLGLVDLEILQRSGT  EKKQAAAILPVVQKQHQLLVTLLLC
Subjt:  MHFINAVMLTRMLTRNSGLESDAGEIPFGSLLWFTYAGISCVFVLFAGIMSGLTLGLMSLGLVDLEILQRSGTPEEKKQAAAILPVVQKQHQLLVTLLLC

Query:  NAVAMEALPIYLDKLFNQYVAIILSVTFVLAFGEVIPQAICTRYGLAVGANFVCLVRILMVICYPIAYPIGKILDCLLGHNEALFRRAQLKALVSIHSLE
        NAVAMEALPIYLDKLFNQYVAIILSVTFVLAFGEVIPQAICTRYGLAVGANFVCLVRILMVICYPIAYPIGKILDCLLGHNEALFRRAQLKALVSIHSLE
Subjt:  NAVAMEALPIYLDKLFNQYVAIILSVTFVLAFGEVIPQAICTRYGLAVGANFVCLVRILMVICYPIAYPIGKILDCLLGHNEALFRRAQLKALVSIHSLE

Query:  AGKGGELTHDETTIISGALDLTEKTAEEAMTPIESTFSLDVNSKLDWEAMGKVLARGHSRVPVYSGNPKNIIGLLLVKSLLTVRPETETPVSAVSIRRIP
        AGKGGELTHDETTIISGALDLTEKTAEEAMTPIESTFSLDVNSKLDWEAMGK+LARGHSRVPVYSGNPKNIIGLLLVKSLLTVRPETETPVSAVSIRRIP
Subjt:  AGKGGELTHDETTIISGALDLTEKTAEEAMTPIESTFSLDVNSKLDWEAMGKVLARGHSRVPVYSGNPKNIIGLLLVKSLLTVRPETETPVSAVSIRRIP

Query:  RVPSDMPLYDILNEFQKGSSHMAAVVKVKGKNKTLLPTLDGEEFEENKVSGTESQLTTPLLHKHDENSDNVVVDIDRTSKNSGISRQSSYRRNDASSING
        RVPSDMPLYDILNEFQKGSSHMAAVVKVKGK+K L PTLDGEEFE+NK SG ESQLT PLL KHDENSD+VV+DIDRTSK SGISRQ SYRRNDASSING
Subjt:  RVPSDMPLYDILNEFQKGSSHMAAVVKVKGKNKTLLPTLDGEEFEENKVSGTESQLTTPLLHKHDENSDNVVVDIDRTSKNSGISRQSSYRRNDASSING

Query:  LNHSSEDIEDGEVIGIITLEDVFEELLQEEIVDETDEYVDVHKRIRVAAAAAASSMARAPSIRRLTAQK
        L+HSSEDIEDGEVIGIITLEDVFEELLQEEIVDETDEYVDVHKRIRVAAAAAASSMARAPSIRRLTAQK
Subjt:  LNHSSEDIEDGEVIGIITLEDVFEELLQEEIVDETDEYVDVHKRIRVAAAAAASSMARAPSIRRLTAQK

A0A6J1CDN3 DUF21 domain-containing protein At4g14240-like | e value: 2.1e-253 | % identity: 100
Query:  MHFINAVMLTRMLTRNSGLESDAGEIPFGSLLWFTYAGISCVFVLFAGIMSGLTLGLMSLGLVDLEILQRSGTPEEKKQAAAILPVVQKQHQLLVTLLLC
        MHFINAVMLTRMLTRNSGLESDAGEIPFGSLLWFTYAGISCVFVLFAGIMSGLTLGLMSLGLVDLEILQRSGTPEEKKQAAAILPVVQKQHQLLVTLLLC
Subjt:  MHFINAVMLTRMLTRNSGLESDAGEIPFGSLLWFTYAGISCVFVLFAGIMSGLTLGLMSLGLVDLEILQRSGTPEEKKQAAAILPVVQKQHQLLVTLLLC

Query:  NAVAMEALPIYLDKLFNQYVAIILSVTFVLAFGEVIPQAICTRYGLAVGANFVCLVRILMVICYPIAYPIGKILDCLLGHNEALFRRAQLKALVSIHSLE
        NAVAMEALPIYLDKLFNQYVAIILSVTFVLAFGEVIPQAICTRYGLAVGANFVCLVRILMVICYPIAYPIGKILDCLLGHNEALFRRAQLKALVSIHSLE
Subjt:  NAVAMEALPIYLDKLFNQYVAIILSVTFVLAFGEVIPQAICTRYGLAVGANFVCLVRILMVICYPIAYPIGKILDCLLGHNEALFRRAQLKALVSIHSLE

Query:  AGKGGELTHDETTIISGALDLTEKTAEEAMTPIESTFSLDVNSKLDWEAMGKVLARGHSRVPVYSGNPKNIIGLLLVKSLLTVRPETETPVSAVSIRRIP
        AGKGGELTHDETTIISGALDLTEKTAEEAMTPIESTFSLDVNSKLDWEAMGKVLARGHSRVPVYSGNPKNIIGLLLVKSLLTVRPETETPVSAVSIRRIP
Subjt:  AGKGGELTHDETTIISGALDLTEKTAEEAMTPIESTFSLDVNSKLDWEAMGKVLARGHSRVPVYSGNPKNIIGLLLVKSLLTVRPETETPVSAVSIRRIP

Query:  RVPSDMPLYDILNEFQKGSSHMAAVVKVKGKNKTLLPTLDGEEFEENKVSGTESQLTTPLLHKHDENSDNVVVDIDRTSKNSGISRQSSYRRNDASSING
        RVPSDMPLYDILNEFQKGSSHMAAVVKVKGKNKTLLPTLDGEEFEENKVSGTESQLTTPLLHKHDENSDNVVVDIDRTSKNSGISRQSSYRRNDASSING
Subjt:  RVPSDMPLYDILNEFQKGSSHMAAVVKVKGKNKTLLPTLDGEEFEENKVSGTESQLTTPLLHKHDENSDNVVVDIDRTSKNSGISRQSSYRRNDASSING

Query:  LNHSSEDIEDGEVIGIITLEDVFEELLQEEIVDETDEYVDVHKRIRVAAAAAASSMARAPSIRRLTAQK
        LNHSSEDIEDGEVIGIITLEDVFEELLQEEIVDETDEYVDVHKRIRVAAAAAASSMARAPSIRRLTAQK
Subjt:  LNHSSEDIEDGEVIGIITLEDVFEELLQEEIVDETDEYVDVHKRIRVAAAAAASSMARAPSIRRLTAQK

A0A6J1KVA5 DUF21 domain-containing protein At4g14240-like isoform X2 | e value: 6.5e-239 | % identity: 94.06
Query:  MHFINAVMLTRMLTRNSGLESDAGEIPFGSLLWFTYAGISCVFVLFAGIMSGLTLGLMSLGLVDLEILQRSGTPEEKKQAAAILPVVQKQHQLLVTLLLC
        MHF+NAVMLTRMLTRN GLES+AGEIPFGSLLW TYAGISCV VLFAGIMSGLTLGLMSLGLVDLEILQRSGT EEKKQAAAILPVVQKQHQLLVTLLLC
Subjt:  MHFINAVMLTRMLTRNSGLESDAGEIPFGSLLWFTYAGISCVFVLFAGIMSGLTLGLMSLGLVDLEILQRSGTPEEKKQAAAILPVVQKQHQLLVTLLLC

Query:  NAVAMEALPIYLDKLFNQYVAIILSVTFVLAFGEVIPQAICTRYGLAVGANFVCLVRILMVICYPIAYPIGKILDCLLGHNEALFRRAQLKALVSIHSLE
        NAVAMEALPIYLDKLFNQY+AIILSVTFVLAFGEVIPQAICTRYGLAVGANFVCLVR+LMVICYPIAYPIGK+LDCLLGHNEALFRRAQLKALVSIHSLE
Subjt:  NAVAMEALPIYLDKLFNQYVAIILSVTFVLAFGEVIPQAICTRYGLAVGANFVCLVRILMVICYPIAYPIGKILDCLLGHNEALFRRAQLKALVSIHSLE

Query:  AGKGGELTHDETTIISGALDLTEKTAEEAMTPIESTFSLDVNSKLDWEAMGKVLARGHSRVPVYSGNPKNIIGLLLVKSLLTVRPETETPVSAVSIRRIP
        AGKGGELTHDETTIISGALDLTEKTAEEAMTPIESTFSLDVNSKLDWEAMGK+LARGHSRVPVY+GNPKNIIGLLLVKSLLTVRPETETPVSAVSIRRIP
Subjt:  AGKGGELTHDETTIISGALDLTEKTAEEAMTPIESTFSLDVNSKLDWEAMGKVLARGHSRVPVYSGNPKNIIGLLLVKSLLTVRPETETPVSAVSIRRIP

Query:  RVPSDMPLYDILNEFQKGSSHMAAVVKVKGKNKTLLPTLDGEEFEENKVSGTESQLTTPLLHKHDENSDNVVVDIDRTSKNSGISRQSSYRRNDASSING
        RVPSDMPLYDILNEFQKGSSHMAAVVKVKGKNK L PTL GEE EENK+SG +SQLTTPLL KHDENSD+VV+DIDRTSK SG+SRQ SYRRNDASSING
Subjt:  RVPSDMPLYDILNEFQKGSSHMAAVVKVKGKNKTLLPTLDGEEFEENKVSGTESQLTTPLLHKHDENSDNVVVDIDRTSKNSGISRQSSYRRNDASSING

Query:  LNHSSEDIEDGEVIGIITLEDVFEELLQEEIVDETDEYVDVHKRIRVAAAAAASSMARAPSIRRLTAQKVA
        ++H SEDIEDGEVIGIITLEDVFEELLQEEIVDETDEYVDVHKRIRVAAAAAASSMARAPSIRRLTAQK A
Subjt:  LNHSSEDIEDGEVIGIITLEDVFEELLQEEIVDETDEYVDVHKRIRVAAAAAASSMARAPSIRRLTAQKVA

A0A6J1KZJ2 DUF21 domain-containing protein At4g14240-like isoform X1 | e value: 6.5e-239 | % identity: 94.06
Query:  MHFINAVMLTRMLTRNSGLESDAGEIPFGSLLWFTYAGISCVFVLFAGIMSGLTLGLMSLGLVDLEILQRSGTPEEKKQAAAILPVVQKQHQLLVTLLLC
        MHF+NAVMLTRMLTRN GLES+AGEIPFGSLLW TYAGISCV VLFAGIMSGLTLGLMSLGLVDLEILQRSGT EEKKQAAAILPVVQKQHQLLVTLLLC
Subjt:  MHFINAVMLTRMLTRNSGLESDAGEIPFGSLLWFTYAGISCVFVLFAGIMSGLTLGLMSLGLVDLEILQRSGTPEEKKQAAAILPVVQKQHQLLVTLLLC

Query:  NAVAMEALPIYLDKLFNQYVAIILSVTFVLAFGEVIPQAICTRYGLAVGANFVCLVRILMVICYPIAYPIGKILDCLLGHNEALFRRAQLKALVSIHSLE
        NAVAMEALPIYLDKLFNQY+AIILSVTFVLAFGEVIPQAICTRYGLAVGANFVCLVR+LMVICYPIAYPIGK+LDCLLGHNEALFRRAQLKALVSIHSLE
Subjt:  NAVAMEALPIYLDKLFNQYVAIILSVTFVLAFGEVIPQAICTRYGLAVGANFVCLVRILMVICYPIAYPIGKILDCLLGHNEALFRRAQLKALVSIHSLE

Query:  AGKGGELTHDETTIISGALDLTEKTAEEAMTPIESTFSLDVNSKLDWEAMGKVLARGHSRVPVYSGNPKNIIGLLLVKSLLTVRPETETPVSAVSIRRIP
        AGKGGELTHDETTIISGALDLTEKTAEEAMTPIESTFSLDVNSKLDWEAMGK+LARGHSRVPVY+GNPKNIIGLLLVKSLLTVRPETETPVSAVSIRRIP
Subjt:  AGKGGELTHDETTIISGALDLTEKTAEEAMTPIESTFSLDVNSKLDWEAMGKVLARGHSRVPVYSGNPKNIIGLLLVKSLLTVRPETETPVSAVSIRRIP

Query:  RVPSDMPLYDILNEFQKGSSHMAAVVKVKGKNKTLLPTLDGEEFEENKVSGTESQLTTPLLHKHDENSDNVVVDIDRTSKNSGISRQSSYRRNDASSING
        RVPSDMPLYDILNEFQKGSSHMAAVVKVKGKNK L PTL GEE EENK+SG +SQLTTPLL KHDENSD+VV+DIDRTSK SG+SRQ SYRRNDASSING
Subjt:  RVPSDMPLYDILNEFQKGSSHMAAVVKVKGKNKTLLPTLDGEEFEENKVSGTESQLTTPLLHKHDENSDNVVVDIDRTSKNSGISRQSSYRRNDASSING

Query:  LNHSSEDIEDGEVIGIITLEDVFEELLQEEIVDETDEYVDVHKRIRVAAAAAASSMARAPSIRRLTAQKVA
        ++H SEDIEDGEVIGIITLEDVFEELLQEEIVDETDEYVDVHKRIRVAAAAAASSMARAPSIRRLTAQK A
Subjt:  LNHSSEDIEDGEVIGIITLEDVFEELLQEEIVDETDEYVDVHKRIRVAAAAAASSMARAPSIRRLTAQKVA

SwissProt top hits (e value, % identity, alignment)
Q4V3C7 DUF21 domain-containing protein At4g14230 | e value: 1.9e-179 | % identity: 75.11
Query:  MHFINAVMLTRML---TRNSGLESDAGEIPFGSLLWFTYAGISCVFVLFAGIMSGLTLGLMSLGLVDLEILQRSGTPEEKKQAAAILPVVQKQHQLLVTL
        MH INAV+  RML   ++++ L+S+A  IPFGSL W TYAGISC  VLFAGIMSGLTLGLMSLGLV+LEILQRSGTP+EKKQ+AAI PVVQKQHQLLVTL
Subjt:  MHFINAVMLTRML---TRNSGLESDAGEIPFGSLLWFTYAGISCVFVLFAGIMSGLTLGLMSLGLVDLEILQRSGTPEEKKQAAAILPVVQKQHQLLVTL

Query:  LLCNAVAMEALPIYLDKLFNQYVAIILSVTFVLAFGEVIPQAICTRYGLAVGANFVCLVRILMVICYPIAYPIGKILDCLLGHNEALFRRAQLKALVSIH
        LL NA+AME LPIYLDK+FN+YVAIILSVTFVL  GEVIPQAICTRYGLAVGAN V LVRILMV+ YPI++PI K+LD +LGHN+ LFRRAQLKALVSIH
Subjt:  LLCNAVAMEALPIYLDKLFNQYVAIILSVTFVLAFGEVIPQAICTRYGLAVGANFVCLVRILMVICYPIAYPIGKILDCLLGHNEALFRRAQLKALVSIH

Query:  SLEAGKGGELTHDETTIISGALDLTEKTAEEAMTPIESTFSLDVNSKLDWEAMGKVLARGHSRVPVYSGNPKNIIGLLLVKSLLTVRPETETPVSAVSIR
           AGKGGELTHDETTIISGALDLTEKTA+EAMTPIESTFSLDVNSKLD EAM K+ ARGHSRVPVYS NPKN+IGLLLVKSLLTVRPET T VSAV IR
Subjt:  SLEAGKGGELTHDETTIISGALDLTEKTAEEAMTPIESTFSLDVNSKLDWEAMGKVLARGHSRVPVYSGNPKNIIGLLLVKSLLTVRPETETPVSAVSIR

Query:  RIPRVPSDMPLYDILNEFQKGSSHMAAVVKVKGKNKTLLPTLDGEEFEENKVSGTESQLTTPLLHKHDENSDNVVVDIDRTSKNSGISRQSSYRRNDASS
        RIPRVP++MPLYDILNEFQKGSSHMAAVVKVKGK+K    TL  E   E+ VS   S+LT PLL K + N D+V+V ID+ +  S IS          + 
Subjt:  RIPRVPSDMPLYDILNEFQKGSSHMAAVVKVKGKNKTLLPTLDGEEFEENKVSGTESQLTTPLLHKHDENSDNVVVDIDRTSKNSGISRQSSYRRNDASS

Query:  INGLNHSSEDIEDGEVIGIITLEDVFEELLQEEIVDETDEYVDVHKRIRVA--AAAAASSMARAPSIRRLTAQK
          G +H+SE+IEDG+VIGIITLEDVFEELLQEEIVDETDEY+DVHKRIRVA  AA A SS+ARAPS RRL   K
Subjt:  INGLNHSSEDIEDGEVIGIITLEDVFEELLQEEIVDETDEYVDVHKRIRVA--AAAAASSMARAPSIRRLTAQK

Q67XQ0 DUF21 domain-containing protein At4g14240 | e value: 6.1e-194 | % identity: 79.49
Query:  MHFINAVMLTRMLT---RNSGLESDAGE-IPFGSLLWFTYAGISCVFVLFAGIMSGLTLGLMSLGLVDLEILQRSGTPEEKKQAAAILPVVQKQHQLLVT
        MH INAV   R+L+   +++G  ++ GE IPFGS  W TYAGISC  VLFAGIMSGLTLGLMSLGLV+LEILQRSGTP EKKQAAAI PVVQKQHQLLVT
Subjt:  MHFINAVMLTRMLT---RNSGLESDAGE-IPFGSLLWFTYAGISCVFVLFAGIMSGLTLGLMSLGLVDLEILQRSGTPEEKKQAAAILPVVQKQHQLLVT

Query:  LLLCNAVAMEALPIYLDKLFNQYVAIILSVTFVLAFGEVIPQAICTRYGLAVGANFVCLVRILMVICYPIAYPIGKILDCLLGHNEALFRRAQLKALVSI
        LLLCNA+AME LPIYLDKLFN+YVAIILSVTFVLAFGEVIPQAICTRYGLAVGANFV LVRILM +CYPIA+PIGKILD +LGHN+ALFRRAQLKALVSI
Subjt:  LLLCNAVAMEALPIYLDKLFNQYVAIILSVTFVLAFGEVIPQAICTRYGLAVGANFVCLVRILMVICYPIAYPIGKILDCLLGHNEALFRRAQLKALVSI

Query:  HSLEAGKGGELTHDETTIISGALDLTEKTAEEAMTPIESTFSLDVNSKLDWEAMGKVLARGHSRVPVYSGNPKNIIGLLLVKSLLTVRPETETPVSAVSI
        HS EAGKGGELTHDETTIISGALDLTEKTA+EAMTPIESTFSLDVNSKLDWEAMGK+LARGHSRVPVYSGNPKN+IGLLLVKSLLTVRPETET VSAV I
Subjt:  HSLEAGKGGELTHDETTIISGALDLTEKTAEEAMTPIESTFSLDVNSKLDWEAMGKVLARGHSRVPVYSGNPKNIIGLLLVKSLLTVRPETETPVSAVSI

Query:  RRIPRVPSDMPLYDILNEFQKGSSHMAAVVKVKGKNKTLLPTLDGEEFEENKVSGTESQLTTPLLHKHDENSDNVVVDIDRTSKNSGISRQSSYRRNDAS
        RRIPRVP+DMPLYDILNEFQKGSSHMAAVVKVKGK+K    TL     EE+     +S LT PLL K + N DNV+V ID+       +   S+ +N+ S
Subjt:  RRIPRVPSDMPLYDILNEFQKGSSHMAAVVKVKGKNKTLLPTLDGEEFEENKVSGTESQLTTPLLHKHDENSDNVVVDIDRTSKNSGISRQSSYRRNDAS

Query:  SINGLNHSSEDIEDGEVIGIITLEDVFEELLQEEIVDETDEYVDVHKRIRVAAAAAASSMARAPSIRRLTAQK
          +G +H+SE IEDGEVIGIITLEDVFEELLQEEIVDETDEYVDVHKRIRVAAAAAASS+ARAPS R+L AQK
Subjt:  SINGLNHSSEDIEDGEVIGIITLEDVFEELLQEEIVDETDEYVDVHKRIRVAAAAAASSMARAPSIRRLTAQK

Q8VZI2 DUF21 domain-containing protein At4g33700 | e value: 5.2e-108 | % identity: 54.72
Query:  WFTYAGISCVFVLFAGIMSGLTLGLMSLGLVDLEILQRSGTPEEKKQAAAILPVVQKQHQLLVTLLLCNAVAMEALPIYLDKLFNQYVAIILSVTFVLAF
        +F +  +    VLFAG+MSGLTLGLMSL LVDLE+L +SGTPE +K AA ILPVV+ QH LLVTLL+CNA AME LPI+LD L   + AI++SVT +L F
Subjt:  WFTYAGISCVFVLFAGIMSGLTLGLMSLGLVDLEILQRSGTPEEKKQAAAILPVVQKQHQLLVTLLLCNAVAMEALPIYLDKLFNQYVAIILSVTFVLAF

Query:  GEVIPQAICTRYGLAVGANFVCLVRILMVICYPIAYPIGKILDCLLGHNE-ALFRRAQLKALVSIHSLEAGKGGELTHDETTIISGALDLTEKTAEEAMT
        GE+IPQ+IC+RYGLA+GA     VR+L+ IC P+A+PI K+LD LLGH   ALFRRA+LK LV  H  EAGKGGELTHDETTII+GAL+L+EK  ++AMT
Subjt:  GEVIPQAICTRYGLAVGANFVCLVRILMVICYPIAYPIGKILDCLLGHNE-ALFRRAQLKALVSIHSLEAGKGGELTHDETTIISGALDLTEKTAEEAMT

Query:  PIESTFSLDVNSKLDWEAMGKVLARGHSRVPVYSGNPKNIIGLLLVKSLLTVRPETETPVSAVSIRRIPRVPSDMPLYDILNEFQKGSSHMAAVVKVKGK
        PI   F +D+N+KLD + M  +L +GHSRVPVY   P NIIGL+LVK+LLT+ P+ E PV  V+IRRIPRVP  +PLYDILNEFQKG SHMA VV+   K
Subjt:  PIESTFSLDVNSKLDWEAMGKVLARGHSRVPVYSGNPKNIIGLLLVKSLLTVRPETETPVSAVSIRRIPRVPSDMPLYDILNEFQKGSSHMAAVVKVKGK

Query:  NKTLLPTLDGEEFEENKVSGTESQLTTP--LLHKHDENSDNVVVDIDRTSKNSGISRQSSYRRNDASSINGLNHS--SEDIEDGEVIGIITLEDVFEELL
            LP+ +G   +E +V        TP   + +   +        +R S   G S+   + +++ + I  LN +   +  E+ E +GIIT+EDV EELL
Subjt:  NKTLLPTLDGEEFEENKVSGTESQLTTP--LLHKHDENSDNVVVDIDRTSKNSGISRQSSYRRNDASSINGLNHS--SEDIEDGEVIGIITLEDVFEELL

Query:  QEEIVDETDEYVD
        QEEI DETD + +
Subjt:  QEEIVDETDEYVD

Q9LTD8 DUF21 domain-containing protein At5g52790 | e value: 5.3e-105 | % identity: 49.77
Query:  AGEIPFGSLLWFTYAGISCVFVLFAGIMSGLTLGLMSLGLVDLEILQRSGTPEEKKQAAAILPVVQKQHQLLVTLLLCNAVAMEALPIYLDKLFNQYVAI
        A ++P    +++ Y  +    V+FAG+MSGLTLGLMSL +V+LE++ ++G P ++K A  ILP+V+ QH LL TLL+ NA+AMEALPI++D L   + AI
Subjt:  AGEIPFGSLLWFTYAGISCVFVLFAGIMSGLTLGLMSLGLVDLEILQRSGTPEEKKQAAAILPVVQKQHQLLVTLLLCNAVAMEALPIYLDKLFNQYVAI

Query:  ILSVTFVLAFGEVIPQAICTRYGLAVGANFVCLVRILMVICYPIAYPIGKILDCLLG-HNEALFRRAQLKALVSIHSLEAGKGGELTHDETTIISGALDL
        ++SVT +LAFGE+IPQA+C+RYGL++GA    LVR+++++ +P++YPI K+LD LLG  +  L  RA+LK+LV +H  EAGKGGELTHDETTIISGALD+
Subjt:  ILSVTFVLAFGEVIPQAICTRYGLAVGANFVCLVRILMVICYPIAYPIGKILDCLLG-HNEALFRRAQLKALVSIHSLEAGKGGELTHDETTIISGALDL

Query:  TEKTAEEAMTPIESTFSLDVNSKLDWEAMGKVLARGHSRVPVYSGNPKNIIGLLLVKSLLTVRPETETPVSAVSIRRIPRVPSDMPLYDILNEFQKGSSH
        ++K+A++AMTP+   FSLD+N KLD + MG + + GHSR+P+YS NP  IIG +LVK+L+ VRPE ET +  + IRR+P+V  ++PLYDILN FQ G SH
Subjt:  TEKTAEEAMTPIESTFSLDVNSKLDWEAMGKVLARGHSRVPVYSGNPKNIIGLLLVKSLLTVRPETETPVSAVSIRRIPRVPSDMPLYDILNEFQKGSSH

Query:  MAAVVKVKGKNKTLLPTLDGEEFEENKVSGTESQLTTPLLHKHDENSDNVVVDIDRTSKNSGISRQSSYRRNDASSINGLNHSSEDIEDGEVIGIITLED
        MAAVV  K    T  P        E  ++G+ ++              NV + I   + +S  S QS  R  D+ S           ED EVIGIITLED
Subjt:  MAAVVKVKGKNKTLLPTLDGEEFEENKVSGTESQLTTPLLHKHDENSDNVVVDIDRTSKNSGISRQSSYRRNDASSINGLNHSSEDIEDGEVIGIITLED

Query:  VFEELLQEEIVDETDEYVDVHKRIRVAAAAAASS
        V EEL+QEEI DETD+YV++HKRI +    + +S
Subjt:  VFEELLQEEIVDETDEYVDVHKRIRVAAAAAASS

Q9ZVS8 Putative DUF21 domain-containing protein At1g03270 | e value: 2.7e-165 | % identity: 70.41
Query:  MHFINAVMLTRMLTRNSGLESDAGEIPFGSLLWFTYAGISCVFVLFAGIMSGLTLGLMSLGLVDLEILQRSGTPEEKKQAAAILPVVQKQHQLLVTLLLC
        M  ++ + L R     +    +A +I FGS  WF   G++C  VLFAGIMSGLTLGLMSLGLV+LEILQ+SG+  EKKQAAAILPVV+KQHQLLVTLLLC
Subjt:  MHFINAVMLTRMLTRNSGLESDAGEIPFGSLLWFTYAGISCVFVLFAGIMSGLTLGLMSLGLVDLEILQRSGTPEEKKQAAAILPVVQKQHQLLVTLLLC

Query:  NAVAMEALPIYLDKLFNQYVAIILSVTFVLAFGEVIPQAICTRYGLAVGANFVCLVRILMVICYPIAYPIGKILDCLLGHNEALFRRAQLKALVSIHSLE
        NA AMEALPI LDK+F+ +VA++LSVTFVLAFGE+IPQAIC+RYGLAVGANF+ LVRILM+ICYPIAYPIGK+LD ++GHN+ LFRRAQLKALVSIHS E
Subjt:  NAVAMEALPIYLDKLFNQYVAIILSVTFVLAFGEVIPQAICTRYGLAVGANFVCLVRILMVICYPIAYPIGKILDCLLGHNEALFRRAQLKALVSIHSLE

Query:  AGKGGELTHDETTIISGALDLTEKTAEEAMTPIESTFSLDVNSKLDWEAMGKVLARGHSRVPVYSGNPKNIIGLLLVKSLLTVRPETETPVSAVSIRRIP
        AGKGGELTH+ET IISGALDL++KTAEEAMTPIESTFSLDVN+KLDWE +GK+L+RGHSR+PVY GNPKNIIGLLLVKSLLTVR ETE PVS+VSIR+IP
Subjt:  AGKGGELTHDETTIISGALDLTEKTAEEAMTPIESTFSLDVNSKLDWEAMGKVLARGHSRVPVYSGNPKNIIGLLLVKSLLTVRPETETPVSAVSIRRIP

Query:  RVPSDMPLYDILNEFQKGSSHMAAVVKVKGKNK--TLLPTLDGEEFEENKVSGTESQLTTPLLHKHDENSDNVVVDIDRTSKNSGISRQSSYRRNDASSI
        RVPSDMPLYDILNEFQKGSSHMAAVVKVK K+K   +    +GE  +EN      S LT PLL KH+  S +VVVDID+  K+  +  +    + + +  
Subjt:  RVPSDMPLYDILNEFQKGSSHMAAVVKVKGKNK--TLLPTLDGEEFEENKVSGTESQLTTPLLHKHDENSDNVVVDIDRTSKNSGISRQSSYRRNDASSI

Query:  NGLNHSSEDIEDGEVIGIITLEDVFEELLQEEIVDETDEYVDVHKRIRVAAAAAA--SSMARA
          L    ED ED EVIGIITLEDVFEELLQ EIVDETD Y+DVHKR+RVAAAAAA  SS+ RA
Subjt:  NGLNHSSEDIEDGEVIGIITLEDVFEELLQEEIVDETDEYVDVHKRIRVAAAAAA--SSMARA

Arabidopsis top hits (e value, % identity, alignment)
AT1G03270.1 CBS domain-containing protein with a domain of unknown function (DUF21) | e value: 1.9e-166 | % identity: 70.41
Query:  MHFINAVMLTRMLTRNSGLESDAGEIPFGSLLWFTYAGISCVFVLFAGIMSGLTLGLMSLGLVDLEILQRSGTPEEKKQAAAILPVVQKQHQLLVTLLLC
        M  ++ + L R     +    +A +I FGS  WF   G++C  VLFAGIMSGLTLGLMSLGLV+LEILQ+SG+  EKKQAAAILPVV+KQHQLLVTLLLC
Subjt:  MHFINAVMLTRMLTRNSGLESDAGEIPFGSLLWFTYAGISCVFVLFAGIMSGLTLGLMSLGLVDLEILQRSGTPEEKKQAAAILPVVQKQHQLLVTLLLC

Query:  NAVAMEALPIYLDKLFNQYVAIILSVTFVLAFGEVIPQAICTRYGLAVGANFVCLVRILMVICYPIAYPIGKILDCLLGHNEALFRRAQLKALVSIHSLE
        NA AMEALPI LDK+F+ +VA++LSVTFVLAFGE+IPQAIC+RYGLAVGANF+ LVRILM+ICYPIAYPIGK+LD ++GHN+ LFRRAQLKALVSIHS E
Subjt:  NAVAMEALPIYLDKLFNQYVAIILSVTFVLAFGEVIPQAICTRYGLAVGANFVCLVRILMVICYPIAYPIGKILDCLLGHNEALFRRAQLKALVSIHSLE

Query:  AGKGGELTHDETTIISGALDLTEKTAEEAMTPIESTFSLDVNSKLDWEAMGKVLARGHSRVPVYSGNPKNIIGLLLVKSLLTVRPETETPVSAVSIRRIP
        AGKGGELTH+ET IISGALDL++KTAEEAMTPIESTFSLDVN+KLDWE +GK+L+RGHSR+PVY GNPKNIIGLLLVKSLLTVR ETE PVS+VSIR+IP
Subjt:  AGKGGELTHDETTIISGALDLTEKTAEEAMTPIESTFSLDVNSKLDWEAMGKVLARGHSRVPVYSGNPKNIIGLLLVKSLLTVRPETETPVSAVSIRRIP

Query:  RVPSDMPLYDILNEFQKGSSHMAAVVKVKGKNK--TLLPTLDGEEFEENKVSGTESQLTTPLLHKHDENSDNVVVDIDRTSKNSGISRQSSYRRNDASSI
        RVPSDMPLYDILNEFQKGSSHMAAVVKVK K+K   +    +GE  +EN      S LT PLL KH+  S +VVVDID+  K+  +  +    + + +  
Subjt:  RVPSDMPLYDILNEFQKGSSHMAAVVKVKGKNK--TLLPTLDGEEFEENKVSGTESQLTTPLLHKHDENSDNVVVDIDRTSKNSGISRQSSYRRNDASSI

Query:  NGLNHSSEDIEDGEVIGIITLEDVFEELLQEEIVDETDEYVDVHKRIRVAAAAAA--SSMARA
          L    ED ED EVIGIITLEDVFEELLQ EIVDETD Y+DVHKR+RVAAAAAA  SS+ RA
Subjt:  NGLNHSSEDIEDGEVIGIITLEDVFEELLQEEIVDETDEYVDVHKRIRVAAAAAA--SSMARA

AT4G14230.1 CBS domain-containing protein with a domain of unknown function (DUF21) | e value: 1.4e-180 | % identity: 75.11
Query:  MHFINAVMLTRML---TRNSGLESDAGEIPFGSLLWFTYAGISCVFVLFAGIMSGLTLGLMSLGLVDLEILQRSGTPEEKKQAAAILPVVQKQHQLLVTL
        MH INAV+  RML   ++++ L+S+A  IPFGSL W TYAGISC  VLFAGIMSGLTLGLMSLGLV+LEILQRSGTP+EKKQ+AAI PVVQKQHQLLVTL
Subjt:  MHFINAVMLTRML---TRNSGLESDAGEIPFGSLLWFTYAGISCVFVLFAGIMSGLTLGLMSLGLVDLEILQRSGTPEEKKQAAAILPVVQKQHQLLVTL

Query:  LLCNAVAMEALPIYLDKLFNQYVAIILSVTFVLAFGEVIPQAICTRYGLAVGANFVCLVRILMVICYPIAYPIGKILDCLLGHNEALFRRAQLKALVSIH
        LL NA+AME LPIYLDK+FN+YVAIILSVTFVL  GEVIPQAICTRYGLAVGAN V LVRILMV+ YPI++PI K+LD +LGHN+ LFRRAQLKALVSIH
Subjt:  LLCNAVAMEALPIYLDKLFNQYVAIILSVTFVLAFGEVIPQAICTRYGLAVGANFVCLVRILMVICYPIAYPIGKILDCLLGHNEALFRRAQLKALVSIH

Query:  SLEAGKGGELTHDETTIISGALDLTEKTAEEAMTPIESTFSLDVNSKLDWEAMGKVLARGHSRVPVYSGNPKNIIGLLLVKSLLTVRPETETPVSAVSIR
           AGKGGELTHDETTIISGALDLTEKTA+EAMTPIESTFSLDVNSKLD EAM K+ ARGHSRVPVYS NPKN+IGLLLVKSLLTVRPET T VSAV IR
Subjt:  SLEAGKGGELTHDETTIISGALDLTEKTAEEAMTPIESTFSLDVNSKLDWEAMGKVLARGHSRVPVYSGNPKNIIGLLLVKSLLTVRPETETPVSAVSIR

Query:  RIPRVPSDMPLYDILNEFQKGSSHMAAVVKVKGKNKTLLPTLDGEEFEENKVSGTESQLTTPLLHKHDENSDNVVVDIDRTSKNSGISRQSSYRRNDASS
        RIPRVP++MPLYDILNEFQKGSSHMAAVVKVKGK+K    TL  E   E+ VS   S+LT PLL K + N D+V+V ID+ +  S IS          + 
Subjt:  RIPRVPSDMPLYDILNEFQKGSSHMAAVVKVKGKNKTLLPTLDGEEFEENKVSGTESQLTTPLLHKHDENSDNVVVDIDRTSKNSGISRQSSYRRNDASS

Query:  INGLNHSSEDIEDGEVIGIITLEDVFEELLQEEIVDETDEYVDVHKRIRVA--AAAAASSMARAPSIRRLTAQK
          G +H+SE+IEDG+VIGIITLEDVFEELLQEEIVDETDEY+DVHKRIRVA  AA A SS+ARAPS RRL   K
Subjt:  INGLNHSSEDIEDGEVIGIITLEDVFEELLQEEIVDETDEYVDVHKRIRVA--AAAAASSMARAPSIRRLTAQK

AT4G14240.1 CBS domain-containing protein with a domain of unknown function (DUF21) | e value: 4.4e-195 | % identity: 79.49
Query:  MHFINAVMLTRMLT---RNSGLESDAGE-IPFGSLLWFTYAGISCVFVLFAGIMSGLTLGLMSLGLVDLEILQRSGTPEEKKQAAAILPVVQKQHQLLVT
        MH INAV   R+L+   +++G  ++ GE IPFGS  W TYAGISC  VLFAGIMSGLTLGLMSLGLV+LEILQRSGTP EKKQAAAI PVVQKQHQLLVT
Subjt:  MHFINAVMLTRMLT---RNSGLESDAGE-IPFGSLLWFTYAGISCVFVLFAGIMSGLTLGLMSLGLVDLEILQRSGTPEEKKQAAAILPVVQKQHQLLVT

Query:  LLLCNAVAMEALPIYLDKLFNQYVAIILSVTFVLAFGEVIPQAICTRYGLAVGANFVCLVRILMVICYPIAYPIGKILDCLLGHNEALFRRAQLKALVSI
        LLLCNA+AME LPIYLDKLFN+YVAIILSVTFVLAFGEVIPQAICTRYGLAVGANFV LVRILM +CYPIA+PIGKILD +LGHN+ALFRRAQLKALVSI
Subjt:  LLLCNAVAMEALPIYLDKLFNQYVAIILSVTFVLAFGEVIPQAICTRYGLAVGANFVCLVRILMVICYPIAYPIGKILDCLLGHNEALFRRAQLKALVSI

Query:  HSLEAGKGGELTHDETTIISGALDLTEKTAEEAMTPIESTFSLDVNSKLDWEAMGKVLARGHSRVPVYSGNPKNIIGLLLVKSLLTVRPETETPVSAVSI
        HS EAGKGGELTHDETTIISGALDLTEKTA+EAMTPIESTFSLDVNSKLDWEAMGK+LARGHSRVPVYSGNPKN+IGLLLVKSLLTVRPETET VSAV I
Subjt:  HSLEAGKGGELTHDETTIISGALDLTEKTAEEAMTPIESTFSLDVNSKLDWEAMGKVLARGHSRVPVYSGNPKNIIGLLLVKSLLTVRPETETPVSAVSI

Query:  RRIPRVPSDMPLYDILNEFQKGSSHMAAVVKVKGKNKTLLPTLDGEEFEENKVSGTESQLTTPLLHKHDENSDNVVVDIDRTSKNSGISRQSSYRRNDAS
        RRIPRVP+DMPLYDILNEFQKGSSHMAAVVKVKGK+K    TL     EE+     +S LT PLL K + N DNV+V ID+       +   S+ +N+ S
Subjt:  RRIPRVPSDMPLYDILNEFQKGSSHMAAVVKVKGKNKTLLPTLDGEEFEENKVSGTESQLTTPLLHKHDENSDNVVVDIDRTSKNSGISRQSSYRRNDAS

Query:  SINGLNHSSEDIEDGEVIGIITLEDVFEELLQEEIVDETDEYVDVHKRIRVAAAAAASSMARAPSIRRLTAQK
          +G +H+SE IEDGEVIGIITLEDVFEELLQEEIVDETDEYVDVHKRIRVAAAAAASS+ARAPS R+L AQK
Subjt:  SINGLNHSSEDIEDGEVIGIITLEDVFEELLQEEIVDETDEYVDVHKRIRVAAAAAASSMARAPSIRRLTAQK

AT4G14240.2 CBS domain-containing protein with a domain of unknown function (DUF21) | e value: 8.8e-188 | % identity: 77.8
Query:  MHFINAVMLTRMLT---RNSGLESDAGE-IPFGSLLWFTYAGISCVFVLFAGIMSGLTLGLMSLGLVDLEILQRSGTPEEKKQAAAILPVVQKQHQLLVT
        MH INAV   R+L+   +++G  ++ GE IPFGS  W TYAGISC  VLFAGIMSGLTLGLMSLGLV+LEILQRS         AAI PVVQKQHQLLVT
Subjt:  MHFINAVMLTRMLT---RNSGLESDAGE-IPFGSLLWFTYAGISCVFVLFAGIMSGLTLGLMSLGLVDLEILQRSGTPEEKKQAAAILPVVQKQHQLLVT

Query:  LLLCNAVAMEALPIYLDKLFNQYVAIILSVTFVLAFGEVIPQAICTRYGLAVGANFVCLVRILMVICYPIAYPIGKILDCLLGHNEALFRRAQLKALVSI
        LLLCNA+AME LPIYLDKLFN+YVAIILSVTFVLAFGEVIPQAICTRYGLAVGANFV LVRILM +CYPIA+PIGKILD +LGHN+ALFRRAQLKALVSI
Subjt:  LLLCNAVAMEALPIYLDKLFNQYVAIILSVTFVLAFGEVIPQAICTRYGLAVGANFVCLVRILMVICYPIAYPIGKILDCLLGHNEALFRRAQLKALVSI

Query:  HSLEAGKGGELTHDETTIISGALDLTEKTAEEAMTPIESTFSLDVNSKLDWEAMGKVLARGHSRVPVYSGNPKNIIGLLLVKSLLTVRPETETPVSAVSI
        HS EAGKGGELTHDETTIISGALDLTEKTA+EAMTPIESTFSLDVNSKLDWEAMGK+LARGHSRVPVYSGNPKN+IGLLLVKSLLTVRPETET VSAV I
Subjt:  HSLEAGKGGELTHDETTIISGALDLTEKTAEEAMTPIESTFSLDVNSKLDWEAMGKVLARGHSRVPVYSGNPKNIIGLLLVKSLLTVRPETETPVSAVSI

Query:  RRIPRVPSDMPLYDILNEFQKGSSHMAAVVKVKGKNKTLLPTLDGEEFEENKVSGTESQLTTPLLHKHDENSDNVVVDIDRTSKNSGISRQSSYRRNDAS
        RRIPRVP+DMPLYDILNEFQKGSSHMAAVVKVKGK+K    TL     EE+     +S LT PLL K + N DNV+V ID+       +   S+ +N+ S
Subjt:  RRIPRVPSDMPLYDILNEFQKGSSHMAAVVKVKGKNKTLLPTLDGEEFEENKVSGTESQLTTPLLHKHDENSDNVVVDIDRTSKNSGISRQSSYRRNDAS

Query:  SINGLNHSSEDIEDGEVIGIITLEDVFEELLQEEIVDETDEYVDVHKRIRVAAAAAASSMARAPSIRRLTAQK
          +G +H+SE IEDGEVIGIITLEDVFEELLQEEIVDETDEYVDVHKRIRVAAAAAASS+ARAPS R+L AQK
Subjt:  SINGLNHSSEDIEDGEVIGIITLEDVFEELLQEEIVDETDEYVDVHKRIRVAAAAAASSMARAPSIRRLTAQK

AT4G33700.1 CBS domain-containing protein with a domain of unknown function (DUF21) | e value: 3.7e-109 | % identity: 54.72
Query:  WFTYAGISCVFVLFAGIMSGLTLGLMSLGLVDLEILQRSGTPEEKKQAAAILPVVQKQHQLLVTLLLCNAVAMEALPIYLDKLFNQYVAIILSVTFVLAF
        +F +  +    VLFAG+MSGLTLGLMSL LVDLE+L +SGTPE +K AA ILPVV+ QH LLVTLL+CNA AME LPI+LD L   + AI++SVT +L F
Subjt:  WFTYAGISCVFVLFAGIMSGLTLGLMSLGLVDLEILQRSGTPEEKKQAAAILPVVQKQHQLLVTLLLCNAVAMEALPIYLDKLFNQYVAIILSVTFVLAF

Query:  GEVIPQAICTRYGLAVGANFVCLVRILMVICYPIAYPIGKILDCLLGHNE-ALFRRAQLKALVSIHSLEAGKGGELTHDETTIISGALDLTEKTAEEAMT
        GE+IPQ+IC+RYGLA+GA     VR+L+ IC P+A+PI K+LD LLGH   ALFRRA+LK LV  H  EAGKGGELTHDETTII+GAL+L+EK  ++AMT
Subjt:  GEVIPQAICTRYGLAVGANFVCLVRILMVICYPIAYPIGKILDCLLGHNE-ALFRRAQLKALVSIHSLEAGKGGELTHDETTIISGALDLTEKTAEEAMT

Query:  PIESTFSLDVNSKLDWEAMGKVLARGHSRVPVYSGNPKNIIGLLLVKSLLTVRPETETPVSAVSIRRIPRVPSDMPLYDILNEFQKGSSHMAAVVKVKGK
        PI   F +D+N+KLD + M  +L +GHSRVPVY   P NIIGL+LVK+LLT+ P+ E PV  V+IRRIPRVP  +PLYDILNEFQKG SHMA VV+   K
Subjt:  PIESTFSLDVNSKLDWEAMGKVLARGHSRVPVYSGNPKNIIGLLLVKSLLTVRPETETPVSAVSIRRIPRVPSDMPLYDILNEFQKGSSHMAAVVKVKGK

Query:  NKTLLPTLDGEEFEENKVSGTESQLTTP--LLHKHDENSDNVVVDIDRTSKNSGISRQSSYRRNDASSINGLNHS--SEDIEDGEVIGIITLEDVFEELL
            LP+ +G   +E +V        TP   + +   +        +R S   G S+   + +++ + I  LN +   +  E+ E +GIIT+EDV EELL
Subjt:  NKTLLPTLDGEEFEENKVSGTESQLTTP--LLHKHDENSDNVVVDIDRTSKNSGISRQSSYRRNDASSINGLNHS--SEDIEDGEVIGIITLEDVFEELL

Query:  QEEIVDETDEYVD
        QEEI DETD + +
Subjt:  QEEIVDETDEYVD


Sequences
CDS sequence
ATGCATTTTATAAATGCTGTGATGTTGACTCGAATGCTGACTCGGAATTCGGGGCTGGAATCGGATGCCGGTGAAATTCCGTTTGGATCGTTGTTGTGGTTCACCTACGC
CGGGATCTCCTGCGTTTTTGTCCTCTTCGCCGGAATAATGTCCGGTTTGACCCTTGGTCTCATGTCTCTCGGTCTTGTCGACCTTGAAATTCTTCAACGGAGCGGCACCC
CTGAGGAGAAGAAACAAGCAGCGGCTATACTCCCAGTGGTCCAAAAGCAGCACCAGCTCCTTGTTACGTTGCTCTTATGCAATGCCGTTGCCATGGAGGCACTTCCAATA
TACTTGGACAAACTTTTTAATCAGTATGTTGCTATAATTCTCTCTGTGACATTTGTATTGGCATTTGGTGAGGTTATACCGCAAGCAATTTGCACCCGGTATGGACTGGC
TGTAGGTGCCAATTTTGTGTGTCTAGTACGAATTTTAATGGTTATTTGCTATCCAATTGCTTATCCCATCGGGAAGATTCTTGACTGTTTGCTGGGACATAATGAGGCAT
TGTTTAGGCGTGCTCAGTTAAAAGCTCTTGTCTCCATCCACAGTCTGGAGGCTGGAAAGGGTGGCGAACTCACCCATGATGAAACAACAATAATTAGTGGAGCTCTAGAT
TTAACTGAAAAGACAGCTGAGGAGGCTATGACACCTATTGAATCAACTTTTTCCTTGGATGTAAATTCAAAATTGGACTGGGAAGCAATGGGAAAAGTTCTTGCTCGTGG
TCATAGTCGAGTTCCTGTCTATTCTGGGAATCCAAAGAATATTATAGGGCTTCTACTGGTGAAAAGTCTTTTAACTGTACGACCTGAAACAGAGACCCCAGTCAGTGCCG
TTTCCATTCGAAGAATTCCTAGGGTTCCTTCAGACATGCCTCTTTATGACATATTGAATGAATTTCAGAAAGGTAGTAGTCACATGGCTGCCGTGGTGAAAGTAAAAGGG
AAAAACAAGACTCTTCTGCCTACATTAGATGGAGAAGAATTCGAGGAAAACAAAGTCTCAGGCACGGAATCCCAACTGACAACTCCTTTACTACATAAGCATGACGAGAA
TTCAGATAATGTAGTCGTTGATATCGATAGGACGTCCAAGAATTCTGGTATAAGCAGGCAATCTTCTTACCGACGCAATGATGCTTCTTCGATAAACGGGTTGAACCATT
CTTCAGAGGACATAGAAGATGGTGAAGTTATTGGTATCATCACCCTTGAAGATGTATTTGAAGAACTTTTGCAGGAGGAGATTGTTGATGAAACAGATGAATACGTTGAT
GTGCACAAAAGAATTCGTGTTGCTGCAGCTGCGGCTGCTTCCTCTATGGCAAGAGCTCCATCAATTAGGAGATTAACTGCGCAGAAGGTAGCAGTGTTT
mRNA sequence
ATGCATTTTATAAATGCTGTGATGTTGACTCGAATGCTGACTCGGAATTCGGGGCTGGAATCGGATGCCGGTGAAATTCCGTTTGGATCGTTGTTGTGGTTCACCTACGC
CGGGATCTCCTGCGTTTTTGTCCTCTTCGCCGGAATAATGTCCGGTTTGACCCTTGGTCTCATGTCTCTCGGTCTTGTCGACCTTGAAATTCTTCAACGGAGCGGCACCC
CTGAGGAGAAGAAACAAGCAGCGGCTATACTCCCAGTGGTCCAAAAGCAGCACCAGCTCCTTGTTACGTTGCTCTTATGCAATGCCGTTGCCATGGAGGCACTTCCAATA
TACTTGGACAAACTTTTTAATCAGTATGTTGCTATAATTCTCTCTGTGACATTTGTATTGGCATTTGGTGAGGTTATACCGCAAGCAATTTGCACCCGGTATGGACTGGC
TGTAGGTGCCAATTTTGTGTGTCTAGTACGAATTTTAATGGTTATTTGCTATCCAATTGCTTATCCCATCGGGAAGATTCTTGACTGTTTGCTGGGACATAATGAGGCAT
TGTTTAGGCGTGCTCAGTTAAAAGCTCTTGTCTCCATCCACAGTCTGGAGGCTGGAAAGGGTGGCGAACTCACCCATGATGAAACAACAATAATTAGTGGAGCTCTAGAT
TTAACTGAAAAGACAGCTGAGGAGGCTATGACACCTATTGAATCAACTTTTTCCTTGGATGTAAATTCAAAATTGGACTGGGAAGCAATGGGAAAAGTTCTTGCTCGTGG
TCATAGTCGAGTTCCTGTCTATTCTGGGAATCCAAAGAATATTATAGGGCTTCTACTGGTGAAAAGTCTTTTAACTGTACGACCTGAAACAGAGACCCCAGTCAGTGCCG
TTTCCATTCGAAGAATTCCTAGGGTTCCTTCAGACATGCCTCTTTATGACATATTGAATGAATTTCAGAAAGGTAGTAGTCACATGGCTGCCGTGGTGAAAGTAAAAGGG
AAAAACAAGACTCTTCTGCCTACATTAGATGGAGAAGAATTCGAGGAAAACAAAGTCTCAGGCACGGAATCCCAACTGACAACTCCTTTACTACATAAGCATGACGAGAA
TTCAGATAATGTAGTCGTTGATATCGATAGGACGTCCAAGAATTCTGGTATAAGCAGGCAATCTTCTTACCGACGCAATGATGCTTCTTCGATAAACGGGTTGAACCATT
CTTCAGAGGACATAGAAGATGGTGAAGTTATTGGTATCATCACCCTTGAAGATGTATTTGAAGAACTTTTGCAGGAGGAGATTGTTGATGAAACAGATGAATACGTTGAT
GTGCACAAAAGAATTCGTGTTGCTGCAGCTGCGGCTGCTTCCTCTATGGCAAGAGCTCCATCAATTAGGAGATTAACTGCGCAGAAGGTAGCAGTGTTT
Protein sequence
MHFINAVMLTRMLTRNSGLESDAGEIPFGSLLWFTYAGISCVFVLFAGIMSGLTLGLMSLGLVDLEILQRSGTPEEKKQAAAILPVVQKQHQLLVTLLLCNAVAMEALPI
YLDKLFNQYVAIILSVTFVLAFGEVIPQAICTRYGLAVGANFVCLVRILMVICYPIAYPIGKILDCLLGHNEALFRRAQLKALVSIHSLEAGKGGELTHDETTIISGALD
LTEKTAEEAMTPIESTFSLDVNSKLDWEAMGKVLARGHSRVPVYSGNPKNIIGLLLVKSLLTVRPETETPVSAVSIRRIPRVPSDMPLYDILNEFQKGSSHMAAVVKVKG
KNKTLLPTLDGEEFEENKVSGTESQLTTPLLHKHDENSDNVVVDIDRTSKNSGISRQSSYRRNDASSINGLNHSSEDIEDGEVIGIITLEDVFEELLQEEIVDETDEYVD
VHKRIRVAAAAAASSMARAPSIRRLTAQKVAVF
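
The CDS and protein records above can be cross-checked by translating the nucleotide sequence. The following is a minimal sketch, assuming the standard nuclear genetic code and a CDS that starts in frame 0. The function name `translate` and the constants are illustrative and are not part of CuGenDB. The short inputs are the first 21 and the last 12 nucleotides of the CDS listed above.

```python
from itertools import product

# Standard genetic code in TCAG order (assumption: nuclear code applies).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a CDS string (frame 0) into a one-letter protein string.

    Whitespace and line breaks are ignored, a trailing partial codon is
    dropped, and codons with ambiguous bases are reported as 'X'.
    """
    seq = "".join(cds.split()).upper()
    usable = len(seq) - len(seq) % 3
    return "".join(
        CODON_TABLE.get(seq[i:i + 3], "X") for i in range(0, usable, 3)
    )


# First seven codons of the MS001693 CDS.
print(translate("ATGCATTTTATAAATGCTGTG"))  # MHFINAV
# Last four codons of the MS001693 CDS (no stop codon is listed).
print(translate("GTAGCAGTGTTT"))           # VAVF
```

Passing the full CDS text (line breaks included) to `translate` and comparing the result with the protein record is one way to confirm that the two listings agree.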