; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr023273 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr023273
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Descriptionintracellular protein transport protein USO1-related
Genome locationtig00000892:1717465..1729130
RNA-Seq ExpressionSgr023273
SyntenySgr023273
Gene Ontology termsNA
InterPro domainsIPR043424 - Protein BRANCHLESS TRICHOME-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7035923.1 hypothetical protein SDJN02_02723 [Cucurbita argyrosperma subsp. argyrosperma]7.9e-21670.27Show/hide
Query:  MSWPEQQTEKRCKIRKRGC--SSSSSSSTLVRKYRFKKAILVAKAKRVGGRSSTPVPTWKMSTKSHSSKISTADFPNRSPSCSLDGGG-KGKEASVSARK
        MSWPEQ+TE+ CKIRKR C  SSSSSSSTLV KYRFK                   PTWKMSTKSHSS        NRSPSCS+ GGG KGKEASVS   
Subjt:  MSWPEQQTEKRCKIRKRGC--SSSSSSSTLVRKYRFKKAILVAKAKRVGGRSSTPVPTWKMSTKSHSSKISTADFPNRSPSCSLDGGG-KGKEASVSARK

Query:  LAATLWEIDEAAAAAPANNSQKLKNNSDIVEDKKELIKTRELV-----SCLSDPDPTFNNIISEKVEGVRIHRRRRSASSLRIGEVGGSNFHGNDCLMEI
                  + + +  N+SQKLKNN DI+EDK+EL+KT++ V     SCLSDPDP FN+  S+KVEG R+HRRR SASS+R+G  G +NFHGN CL+EI
Subjt:  LAATLWEIDEAAAAAPANNSQKLKNNSDIVEDKKELIKTRELV-----SCLSDPDPTFNNIISEKVEGVRIHRRRRSASSLRIGEVGGSNFHGNDCLMEI

Query:  ENPS-QGKTTRRKTKFMLKTRLKEVSNCLTTSKELLRVLNHVWAHEDHDQQRPSSISSLITALKAELDRAKTRVDHLIKEQSFHGDEIELLMKRFAEEKA
        ENPS QG+T RRKTKFMLKTRLKEV NCLTTSKEL+RVLNHV AHED+DQ RPSSIS LITALK+E++RAK RVDHLIK+QS HGDEIE++MKRF EEK 
Subjt:  ENPS-QGKTTRRKTKFMLKTRLKEVSNCLTTSKELLRVLNHVWAHEDHDQQRPSSISSLITALKAELDRAKTRVDHLIKEQSFHGDEIELLMKRFAEEKA

Query:  AWKYKERARIRSTITSMADEVEVEKKLRRQAERLNKTIAKELADAKVSLSKAMKDLQREKRAKEIFEQICDELAKGIGEDRAQFEELKKESAKVREEVEK
        AWK +ERAR+RS+I SMADE+E+EKKLRRQAERLNKTIAKELA+AK+SLSKAMKDLQRE+RAKEIFEQICDELAKGIGEDRAQFEE KKESAKVREE+E+
Subjt:  AWKYKERARIRSTITSMADEVEVEKKLRRQAERLNKTIAKELADAKVSLSKAMKDLQREKRAKEIFEQICDELAKGIGEDRAQFEELKKESAKVREEVEK

Query:  EREMLQLADVLREERVQMKLSEAKYHFEEKNAAVERLKDELEAYLITQLGNENGEDSCNKFEKIKELEAYLKKINFGSCQD-MERKEPNDEED--SEDDS
        EREMLQLADVLREERVQMKLSEAKY FEEKNAAVERLKDELEA+LITQ  +EN E+  +   KIKELEAYLKKINFGS Q+ +E  E  +E++   EDDS
Subjt:  EREMLQLADVLREERVQMKLSEAKYHFEEKNAAVERLKDELEAYLITQLGNENGEDSCNKFEKIKELEAYLKKINFGSCQD-MERKEPNDEED--SEDDS

Query:  DLHSIELNMDNNNKSYRWSFVHGSQTNGSKRNSFEKES-NGRKSLSEKIQWGSICLNRKASNSIKNGEFVGRKSHENSDRFDWERFSELLITQSQTQTQK
        DLHSIELNMDNNNKSYRWSFVHG    GSKRNSFEK+  NGRKS+SEKIQWGSICLNRKASN  KNGEFVGRKSHE+S+R +WERF+E+           
Subjt:  DLHSIELNMDNNNKSYRWSFVHGSQTNGSKRNSFEKES-NGRKSLSEKIQWGSICLNRKASNSIKNGEFVGRKSHENSDRFDWERFSELLITQSQTQTQK

Query:  QDHHHNDEGDETQPKQKSVKCLRDILFPG
              D G   +   KS KCLRDILFPG
Subjt:  QDHHHNDEGDETQPKQKSVKCLRDILFPG

XP_022957750.1 uncharacterized protein LOC111459204 isoform X1 [Cucurbita moschata]2.3e-21570.27Show/hide
Query:  MSWPEQQTEKRCKIRKRGCS--SSSSSSTLVRKYRFKKAILVAKAKRVGGRSSTPVPTWKMSTKSHSSKISTADFPNRSPSCSLDGGG-KGKEASVSARK
        MSWPEQ+TE+ CKIRKR CS  SSSSSSTLV KYRFK                   PTWKMSTKSHSS        NRSPSCS+ GGG KGKEASVS   
Subjt:  MSWPEQQTEKRCKIRKRGCS--SSSSSSTLVRKYRFKKAILVAKAKRVGGRSSTPVPTWKMSTKSHSSKISTADFPNRSPSCSLDGGG-KGKEASVSARK

Query:  LAATLWEIDEAAAAAPANNSQKLKNNSDIVEDKKELIKTRELV-----SCLSDPDPTFNNIISEKVEGVRIHRRRRSASSLRIGEVGGSNFHGNDCLMEI
                  + + +  N+SQKLKNN DI+EDK+EL+KT++ V     SCLSDPDP FN+  S+KVEG R+HRRR SASS+R+G  G +NFHG+ CL+EI
Subjt:  LAATLWEIDEAAAAAPANNSQKLKNNSDIVEDKKELIKTRELV-----SCLSDPDPTFNNIISEKVEGVRIHRRRRSASSLRIGEVGGSNFHGNDCLMEI

Query:  ENPS-QGKTTRRKTKFMLKTRLKEVSNCLTTSKELLRVLNHVWAHEDHDQQRPSSISSLITALKAELDRAKTRVDHLIKEQSFHGDEIELLMKRFAEEKA
        ENPS QGKT RRKTKFMLKTRLKEV NCLTTSKEL+RVLNHV AHED+DQ RPSSIS LITALK+E++RAK RVDHLIK+QS HGDEIE++MKRF EEK 
Subjt:  ENPS-QGKTTRRKTKFMLKTRLKEVSNCLTTSKELLRVLNHVWAHEDHDQQRPSSISSLITALKAELDRAKTRVDHLIKEQSFHGDEIELLMKRFAEEKA

Query:  AWKYKERARIRSTITSMADEVEVEKKLRRQAERLNKTIAKELADAKVSLSKAMKDLQREKRAKEIFEQICDELAKGIGEDRAQFEELKKESAKVREEVEK
        AWK +ERAR+RS+I SMADE+E+EKKLR+QAERLNKTIAKELA+AK+SLSKAMKDLQRE+RAKEIFEQICDELAKGIGEDRAQFEE KKESAKVREE+E+
Subjt:  AWKYKERARIRSTITSMADEVEVEKKLRRQAERLNKTIAKELADAKVSLSKAMKDLQREKRAKEIFEQICDELAKGIGEDRAQFEELKKESAKVREEVEK

Query:  EREMLQLADVLREERVQMKLSEAKYHFEEKNAAVERLKDELEAYLITQLGNENGEDSCNKFEKIKELEAYLKKINFGSCQD-MERKEPNDEED--SEDDS
        EREMLQLADVLREERVQMKLSEAKY FEEKNAAVERLKDELEA+LITQ  +EN E+  +   KIKELEAYLKKINFGS Q+ +E  E  +E++   EDDS
Subjt:  EREMLQLADVLREERVQMKLSEAKYHFEEKNAAVERLKDELEAYLITQLGNENGEDSCNKFEKIKELEAYLKKINFGSCQD-MERKEPNDEED--SEDDS

Query:  DLHSIELNMDNNNKSYRWSFVHGSQTNGSKRNSFEK-ESNGRKSLSEKIQWGSICLNRKASNSIKNGEFVGRKSHENSDRFDWERFSELLITQSQTQTQK
        DLHSIELNMDNNNKSYRWSFVHG    GSKRNSFEK E NGRKS+SEKIQWGSICLNRKASN  KNGEFVGRKSHE+S+R +WERF+E+           
Subjt:  DLHSIELNMDNNNKSYRWSFVHGSQTNGSKRNSFEK-ESNGRKSLSEKIQWGSICLNRKASNSIKNGEFVGRKSHENSDRFDWERFSELLITQSQTQTQK

Query:  QDHHHNDEGDETQPKQKSVKCLRDILFPG
              D G   +   KS KCLRDILFPG
Subjt:  QDHHHNDEGDETQPKQKSVKCLRDILFPG

XP_022995046.1 uncharacterized protein LOC111490718 isoform X1 [Cucurbita maxima]4.6e-21670.36Show/hide
Query:  MSWPEQQTEKRCKIRKRGC----SSSSSSSTLVRKYRFKKAILVAKAKRVGGRSSTPVPTWKMSTKSHSSKISTADFPNRSPSCSLDGGG-KGKEASVSA
        MSWPEQ+TE+ CKIRKR C    SSSSSSSTLV KYRFK                   PTWKMSTKSHSS        NRSPSCS+ GGG KGKEASVS 
Subjt:  MSWPEQQTEKRCKIRKRGC----SSSSSSSTLVRKYRFKKAILVAKAKRVGGRSSTPVPTWKMSTKSHSSKISTADFPNRSPSCSLDGGG-KGKEASVSA

Query:  RKLAATLWEIDEAAAAAPANNSQKLKNNSDIVEDKKELIKTRELV-----SCLSDPDPTFNNIISEKVEGVRIHRRRRSASSLRIGEVGGSNFHGNDCLM
                    + + +  N+SQKLKNN DI+EDK+EL+KT++ V     SCLSDPDP FN+  S+KVEG R+HRRR SASSLRIG  G +NFHGN CL+
Subjt:  RKLAATLWEIDEAAAAAPANNSQKLKNNSDIVEDKKELIKTRELV-----SCLSDPDPTFNNIISEKVEGVRIHRRRRSASSLRIGEVGGSNFHGNDCLM

Query:  EIENPS-QGKTTRRKTKFMLKTRLKEVSNCLTTSKELLRVLNHVWAHEDHDQQRPSSISSLITALKAELDRAKTRVDHLIKEQSFHGDEIELLMKRFAEE
        EIENPS QG+T RRKTKFMLKTRLKEVSNCLTTSKEL+RVLNHV AHED+DQ RPSSIS LITALK+E++RAK RVDHLIK+QSFHGDEIE++MKRF EE
Subjt:  EIENPS-QGKTTRRKTKFMLKTRLKEVSNCLTTSKELLRVLNHVWAHEDHDQQRPSSISSLITALKAELDRAKTRVDHLIKEQSFHGDEIELLMKRFAEE

Query:  KAAWKYKERARIRSTITSMADEVEVEKKLRRQAERLNKTIAKELADAKVSLSKAMKDLQREKRAKEIFEQICDELAKGIGEDRAQFEELKKESAKVREEV
        K AWK +ERAR+RS+I SMADE+E+EKKLR+QAERLNKTIAKELA+AK+SLSKAMKDLQRE+RAKEIFEQICDELAKGIGEDRAQFEE KKESAKVREE+
Subjt:  KAAWKYKERARIRSTITSMADEVEVEKKLRRQAERLNKTIAKELADAKVSLSKAMKDLQREKRAKEIFEQICDELAKGIGEDRAQFEELKKESAKVREEV

Query:  EKEREMLQLADVLREERVQMKLSEAKYHFEEKNAAVERLKDELEAYLITQLGNENGEDSCNKFEKIKELEAYLKKINFGSCQ---DMERKEPNDEEDSED
        E+EREMLQLADVLREERVQMKLSEAKY FEEKNAAVERLKDELEA+LITQ  +EN E+  +   KIKELEAYLKKINFGS Q   D + K    E   ED
Subjt:  EKEREMLQLADVLREERVQMKLSEAKYHFEEKNAAVERLKDELEAYLITQLGNENGEDSCNKFEKIKELEAYLKKINFGSCQ---DMERKEPNDEEDSED

Query:  DSDLHSIELNMDNNNKSYRWSFVHGSQTNGSKRNSFEKES-NGRKSLSEKIQWGSICLNRKASNSIKNGEFVGRKSHENSDRFDWERFSELLITQSQTQT
        DSDLHSIELNMDNNNKSYRWSFVHG    GSKRNSFEK+  NGRKS+SEKIQWGSICLNRKASN  KNG+FVGRKSHE+S+R +WERF+E+         
Subjt:  DSDLHSIELNMDNNNKSYRWSFVHGSQTNGSKRNSFEKES-NGRKSLSEKIQWGSICLNRKASNSIKNGEFVGRKSHENSDRFDWERFSELLITQSQTQT

Query:  QKQDHHHNDEGDETQPKQKSVKCLRDILFPG
                D G   +   KS KCLRDILFPG
Subjt:  QKQDHHHNDEGDETQPKQKSVKCLRDILFPG

XP_023534122.1 uncharacterized protein LOC111795778 isoform X1 [Cucurbita pepo subsp. pepo]1.7e-21570.11Show/hide
Query:  MSWPEQQTEKRCKIRKRGC--SSSSSSSTLVRKYRFKKAILVAKAKRVGGRSSTPVPTWKMSTKSHSSKISTADFPNRSPSCSLDGGG-KGKEASVSARK
        MSWPEQ+TE+ CKIRKR C  SSSSSSSTLV KYRFK                   PTWKMSTKSHSS        NRSPSCS+ GGG KGKEASVS   
Subjt:  MSWPEQQTEKRCKIRKRGC--SSSSSSSTLVRKYRFKKAILVAKAKRVGGRSSTPVPTWKMSTKSHSSKISTADFPNRSPSCSLDGGG-KGKEASVSARK

Query:  LAATLWEIDEAAAAAPANNSQKLKNNSDIVEDKKELIKTRELV-----SCLSDPDPTFNNIISEKVEGVRIHRRRRSASSLRIGEVGGSNFHGNDCLMEI
                  + + +  N+SQKLKNN DI+EDK+EL+KT++ V     SCLSDPDP FN+  S+KVEG R+HRRR SASS+R+G  G +NFHGN CL+EI
Subjt:  LAATLWEIDEAAAAAPANNSQKLKNNSDIVEDKKELIKTRELV-----SCLSDPDPTFNNIISEKVEGVRIHRRRRSASSLRIGEVGGSNFHGNDCLMEI

Query:  ENPS-QGKTTRRKTKFMLKTRLKEVSNCLTTSKELLRVLNHVWAHEDHDQQRPSSISSLITALKAELDRAKTRVDHLIKEQSFHGDEIELLMKRFAEEKA
        ENPS QG+T RRKTKFMLKTRLKEV NCLTTSKEL+RVLNHV AHED+DQ RPSSIS LITALK E++RAK RVDHLIK+QSFHGDEIE++MKRF EEK 
Subjt:  ENPS-QGKTTRRKTKFMLKTRLKEVSNCLTTSKELLRVLNHVWAHEDHDQQRPSSISSLITALKAELDRAKTRVDHLIKEQSFHGDEIELLMKRFAEEKA

Query:  AWKYKERARIRSTITSMADEVEVEKKLRRQAERLNKTIAKELADAKVSLSKAMKDLQREKRAKEIFEQICDELAKGIGEDRAQFEELKKESAKVREEVEK
        AWK +ERAR+RS+I SMADE+E+EKKLRRQAERLNKTIAKELA+AK+SLSKAMKDLQRE+RAKEIFEQICDELAKGIGEDRAQFEE KKESAKVREE+E+
Subjt:  AWKYKERARIRSTITSMADEVEVEKKLRRQAERLNKTIAKELADAKVSLSKAMKDLQREKRAKEIFEQICDELAKGIGEDRAQFEELKKESAKVREEVEK

Query:  EREMLQLADVLREERVQMKLSEAKYHFEEKNAAVERLKDELEAYLITQLGNENGEDSCNKFEKIKELEAYLKKINFGSCQDM---ERKEPNDEEDSEDDS
        EREMLQLADVLREERVQMKLSEAKY FEEKNAAVERLKDELEA+LITQ  +EN E+  +   KIKELEAYLKKINFGS Q+    + K    E   EDDS
Subjt:  EREMLQLADVLREERVQMKLSEAKYHFEEKNAAVERLKDELEAYLITQLGNENGEDSCNKFEKIKELEAYLKKINFGSCQDM---ERKEPNDEEDSEDDS

Query:  DLHSIELNMDNNNKSYRWSFVHGSQTNGSKRNSFEKES-NGRKSLSEKIQWGSICLNRKASNSIKNGEFVGRKSHENSDRFDWERFSELLITQSQTQTQK
        DLHSIELNMDNNNKSYRWSFVHG    GSKRNSFEK+  NGRKS+SEKIQWGSICLNRKASN  KNGEFVGRKSHE+++R +WERF+E+           
Subjt:  DLHSIELNMDNNNKSYRWSFVHGSQTNGSKRNSFEKES-NGRKSLSEKIQWGSICLNRKASNSIKNGEFVGRKSHENSDRFDWERFSELLITQSQTQTQK

Query:  QDHHHNDEGDETQPKQKSVKCLRDILFPG
           +  D G   +   KS KCLRDILFPG
Subjt:  QDHHHNDEGDETQPKQKSVKCLRDILFPG

XP_038900292.1 uncharacterized protein At5g41620 isoform X1 [Benincasa hispida]2.0e-21167.21Show/hide
Query:  SWPEQQTE-KRCKIRKRGCSSSSSSSTLVRKYRFKKAILVAKAKRVGGRSSTPVPTWKMSTKSHSSKISTADFPNRSPSCSLDGGGKGKEASVSARKLAA
        SW EQ+TE KRCKIRKR C SS SSSTLVRKYRFKK                  PTWKMSTKSHSSK+ST D PNRSPSCSLDGGGKGKE SVS      
Subjt:  SWPEQQTE-KRCKIRKRGCSSSSSSSTLVRKYRFKKAILVAKAKRVGGRSSTPVPTWKMSTKSHSSKISTADFPNRSPSCSLDGGGKGKEASVSARKLAA

Query:  TLWEIDEAAAAAPANNSQKLKNNSDIVEDKKELIKTRELV-----SCLSDPDPTFNNIISEKVEGVRIHRRRRSASSLRIG---EVGGSNFHGNDCLMEI
               +A  +  NNSQKLKNNSD+VE+KKEL+KTRE+V     SCLSDPD +  N  +EK E  R+HRRR SASSLRIG    VGGSNFHGNDCLMEI
Subjt:  TLWEIDEAAAAAPANNSQKLKNNSDIVEDKKELIKTRELV-----SCLSDPDPTFNNIISEKVEGVRIHRRRRSASSLRIG---EVGGSNFHGNDCLMEI

Query:  ENPSQGKTTRRKTKFMLKTRLKEVSNCLTTSKELLRVLNHVWAHEDHDQQRPSSISSLITALKAELDRAKTRVDHLIKEQSFHGDEIELLMKRFAEEKAA
        EN + GKTTRRKTK  +KTRLKEVSNCLTTSKELLRVL+HV  HE+H    PSS SSLITALK+ELDRAKTRVDHLIK+Q+FHGDEIE+L KRFAEEKAA
Subjt:  ENPSQGKTTRRKTKFMLKTRLKEVSNCLTTSKELLRVLNHVWAHEDHDQQRPSSISSLITALKAELDRAKTRVDHLIKEQSFHGDEIELLMKRFAEEKAA

Query:  WKYKERARIRSTITSMADEVEVEKKLRRQAERLNKTIAKELADAKVSLSKAMKDLQREKRAKEIFEQICDELAKGIGEDRAQFEELKKESAKVREEVEKE
        WKY+ERAR  S I+SMA+EVEVEKKLRRQAERLNKTIAKELA+AKVS+SKAMK+++REKRAKEI EQIC+ELAKGIGEDRA+FEELKKESAKVREEVEKE
Subjt:  WKYKERARIRSTITSMADEVEVEKKLRRQAERLNKTIAKELADAKVSLSKAMKDLQREKRAKEIFEQICDELAKGIGEDRAQFEELKKESAKVREEVEKE

Query:  REMLQLADVLREERVQMKLSEAKYHFEEKNAAVERLKDELEAYLITQLGN------ENGEDSCNKFEKIKELEAYLKKINFGSCQDME---RKEPNDEE-
        REML LADVLREERVQMKLSEAKY FEEKNAAVERLK +L+AYL+TQ GN      EN E SCN+FEKIKELEAYLKKINFGSCQD E   RKE  +E+ 
Subjt:  REMLQLADVLREERVQMKLSEAKYHFEEKNAAVERLKDELEAYLITQLGN------ENGEDSCNKFEKIKELEAYLKKINFGSCQDME---RKEPNDEE-

Query:  -DSEDDSDLHSIELNMDNNNKSYRWSFVHGSQTNGSKRNSFEKESNGRKSLSEKIQWGSICLNRKASNSIKNGEFVGRKSHENSDRFDWERFSELLITQS
         D E++SDLHSIELNMDNNNKSYRWSFVHG + + ++  S    +NGRKS+SEKIQWGSICLN   +N+   G         NS+ FDW+ FSEL     
Subjt:  -DSEDDSDLHSIELNMDNNNKSYRWSFVHGSQTNGSKRNSFEKESNGRKSLSEKIQWGSICLNRKASNSIKNGEFVGRKSHENSDRFDWERFSELLITQS

Query:  QTQTQKQDHHHNDEGDETQPKQKSVKCLRDILFPGAAGCEETAEKNQVGKTDNGIAMMAVQTQSDEAAGSV
         TQ Q +D    +  D+ Q   KSVKCLRDILFP            ++ +  NGIA M      DEA+  V
Subjt:  QTQTQKQDHHHNDEGDETQPKQKSVKCLRDILFPGAAGCEETAEKNQVGKTDNGIAMMAVQTQSDEAAGSV

TrEMBL top hitse value%identityAlignment
A0A0A0KEV9 Uncharacterized protein4.3e-17559.88Show/hide
Query:  SWPEQQTE-KRCKIRKRGCSSSSSSSTLVRKYRFKKAILVAKAKRVGGRSSTPVPTWKMST--KSHSSKISTA-DFPNRSPSCSLDGGGKGKEASVSARK
        SW EQ+TE K+CKIRKR C SS SSST VRKYRFKK                  PTWKMST  KSHSSK+ST  D  NRSPSCS++   KGKE       
Subjt:  SWPEQQTE-KRCKIRKRGCSSSSSSSTLVRKYRFKKAILVAKAKRVGGRSSTPVPTWKMST--KSHSSKISTA-DFPNRSPSCSLDGGGKGKEASVSARK

Query:  LAATLWEIDEAAAAAPANNSQKLKNNSDIVEDKKELIKTRELVSCLSDPDPTFNNIISEKVEGV----RIHRRRRSASS---LRIGE---VGGSNFHGND
                +E       + S+ LK NS++VEDK   + +    + LSDPD +  N  + + + +    R+HRRRRSA++   LRIG    VGGSNFHGND
Subjt:  LAATLWEIDEAAAAAPANNSQKLKNNSDIVEDKKELIKTRELVSCLSDPDPTFNNIISEKVEGV----RIHRRRRSASS---LRIGE---VGGSNFHGND

Query:  CL-MEIENPSQGKTTRRKTKFMLKTRLKEVSNCLTTSKELLRVLNHVWAHEDHDQQRPSSISSLITALKAELDRAKTRVDHLIKEQSFHGDEIELLMKRF
        CL MEIEN +  KTTRRKTK  +KTRLKEVSNCLTTSKELLRVL+H+  HEDH    PSS SSLI+ALK+ELDRAKTRVDHLIK+Q+F+ DEIE+L +R 
Subjt:  CL-MEIENPSQGKTTRRKTKFMLKTRLKEVSNCLTTSKELLRVLNHVWAHEDHDQQRPSSISSLITALKAELDRAKTRVDHLIKEQSFHGDEIELLMKRF

Query:  AEEKAAWKYKERARIRSTITSMADEVEVEKKLRRQAERLNKTIAKELADAKVSLSKAMKDLQREKRAKEIFEQICDELAKGIGEDRAQFEELKKESAKVR
        AEEKAAWKY+ERAR  S I+SMA+E+E+EKKLRRQAERLNK+IAKELA+AKVS+SKAMK+++REKRAKEI EQIC+ELAKGIGEDRA+FEELKKESAKVR
Subjt:  AEEKAAWKYKERARIRSTITSMADEVEVEKKLRRQAERLNKTIAKELADAKVSLSKAMKDLQREKRAKEIFEQICDELAKGIGEDRAQFEELKKESAKVR

Query:  EEVEKEREMLQLADVLREERVQMKLSEAKYHFEEKNAAVERLKDELEAYLI----TQLGNENGEDSCNKFEKIKELEAYLKKINFGSCQDME---RKEPN
        EEVEKEREML LADVLREERVQMKLSEAKY FEEKNAAVERLK +L+ Y +     Q   EN E SCN+FEKIKELEAYLKKINFGSCQD E   +KE N
Subjt:  EEVEKEREMLQLADVLREERVQMKLSEAKYHFEEKNAAVERLKDELEAYLI----TQLGNENGEDSCNKFEKIKELEAYLKKINFGSCQDME---RKEPN

Query:  --------DEEDSEDDSDLHSIELNMDNNNKSYRWSFVHGSQTNGSKRNSFEKESNGRKSLSEKIQWGSICLNRKASNSIKNGEFVGRKSHENSDRFDWE
                +EE+ E++SD+HSIELNMDNNNKSYRWSFV  +  N  + N      NGRKS+SEKIQWGSICLN   +N+            +NS+ FDW+
Subjt:  --------DEEDSEDDSDLHSIELNMDNNNKSYRWSFVHGSQTNGSKRNSFEKESNGRKSLSEKIQWGSICLNRKASNSIKNGEFVGRKSHENSDRFDWE

Query:  RFSELLITQSQTQTQKQDHHHNDEGDETQPKQKSVKCLRDILFPGAAGCEETAEKNQVGKTDNGIAMM
         FSE L T+   +        +D GD  Q   KSVKCLRDILFP     E     N V K D+  + M
Subjt:  RFSELLITQSQTQTQKQDHHHNDEGDETQPKQKSVKCLRDILFPGAAGCEETAEKNQVGKTDNGIAMM

A0A6J1H001 uncharacterized protein LOC111459204 isoform X27.7e-20171.7Show/hide
Query:  MSTKSHSSKISTADFPNRSPSCSLDGGG-KGKEASVSARKLAATLWEIDEAAAAAPANNSQKLKNNSDIVEDKKELIKTRELV-----SCLSDPDPTFNN
        MSTKSHSS        NRSPSCS+ GGG KGKEASVS             + + +  N+SQKLKNN DI+EDK+EL+KT++ V     SCLSDPDP FN+
Subjt:  MSTKSHSSKISTADFPNRSPSCSLDGGG-KGKEASVSARKLAATLWEIDEAAAAAPANNSQKLKNNSDIVEDKKELIKTRELV-----SCLSDPDPTFNN

Query:  IISEKVEGVRIHRRRRSASSLRIGEVGGSNFHGNDCLMEIENPS-QGKTTRRKTKFMLKTRLKEVSNCLTTSKELLRVLNHVWAHEDHDQQRPSSISSLI
          S+KVEG R+HRRR SASS+R+G  G +NFHG+ CL+EIENPS QGKT RRKTKFMLKTRLKEV NCLTTSKEL+RVLNHV AHED+DQ RPSSIS LI
Subjt:  IISEKVEGVRIHRRRRSASSLRIGEVGGSNFHGNDCLMEIENPS-QGKTTRRKTKFMLKTRLKEVSNCLTTSKELLRVLNHVWAHEDHDQQRPSSISSLI

Query:  TALKAELDRAKTRVDHLIKEQSFHGDEIELLMKRFAEEKAAWKYKERARIRSTITSMADEVEVEKKLRRQAERLNKTIAKELADAKVSLSKAMKDLQREK
        TALK+E++RAK RVDHLIK+QS HGDEIE++MKRF EEK AWK +ERAR+RS+I SMADE+E+EKKLR+QAERLNKTIAKELA+AK+SLSKAMKDLQRE+
Subjt:  TALKAELDRAKTRVDHLIKEQSFHGDEIELLMKRFAEEKAAWKYKERARIRSTITSMADEVEVEKKLRRQAERLNKTIAKELADAKVSLSKAMKDLQREK

Query:  RAKEIFEQICDELAKGIGEDRAQFEELKKESAKVREEVEKEREMLQLADVLREERVQMKLSEAKYHFEEKNAAVERLKDELEAYLITQLGNENGEDSCNK
        RAKEIFEQICDELAKGIGEDRAQFEE KKESAKVREE+E+EREMLQLADVLREERVQMKLSEAKY FEEKNAAVERLKDELEA+LITQ  +EN E+  + 
Subjt:  RAKEIFEQICDELAKGIGEDRAQFEELKKESAKVREEVEKEREMLQLADVLREERVQMKLSEAKYHFEEKNAAVERLKDELEAYLITQLGNENGEDSCNK

Query:  FEKIKELEAYLKKINFGSCQD-MERKEPNDEED--SEDDSDLHSIELNMDNNNKSYRWSFVHGSQTNGSKRNSFEK-ESNGRKSLSEKIQWGSICLNRKA
          KIKELEAYLKKINFGS Q+ +E  E  +E++   EDDSDLHSIELNMDNNNKSYRWSFVHG    GSKRNSFEK E NGRKS+SEKIQWGSICLNRKA
Subjt:  FEKIKELEAYLKKINFGSCQD-MERKEPNDEED--SEDDSDLHSIELNMDNNNKSYRWSFVHGSQTNGSKRNSFEK-ESNGRKSLSEKIQWGSICLNRKA

Query:  SNSIKNGEFVGRKSHENSDRFDWERFSELLITQSQTQTQKQDHHHNDEGDETQPKQKSVKCLRDILFPG
        SN  KNGEFVGRKSHE+S+R +WERF+E+                 D G   +   KS KCLRDILFPG
Subjt:  SNSIKNGEFVGRKSHENSDRFDWERFSELLITQSQTQTQKQDHHHNDEGDETQPKQKSVKCLRDILFPG

A0A6J1H037 uncharacterized protein LOC111459204 isoform X11.1e-21570.27Show/hide
Query:  MSWPEQQTEKRCKIRKRGCS--SSSSSSTLVRKYRFKKAILVAKAKRVGGRSSTPVPTWKMSTKSHSSKISTADFPNRSPSCSLDGGG-KGKEASVSARK
        MSWPEQ+TE+ CKIRKR CS  SSSSSSTLV KYRFK                   PTWKMSTKSHSS        NRSPSCS+ GGG KGKEASVS   
Subjt:  MSWPEQQTEKRCKIRKRGCS--SSSSSSTLVRKYRFKKAILVAKAKRVGGRSSTPVPTWKMSTKSHSSKISTADFPNRSPSCSLDGGG-KGKEASVSARK

Query:  LAATLWEIDEAAAAAPANNSQKLKNNSDIVEDKKELIKTRELV-----SCLSDPDPTFNNIISEKVEGVRIHRRRRSASSLRIGEVGGSNFHGNDCLMEI
                  + + +  N+SQKLKNN DI+EDK+EL+KT++ V     SCLSDPDP FN+  S+KVEG R+HRRR SASS+R+G  G +NFHG+ CL+EI
Subjt:  LAATLWEIDEAAAAAPANNSQKLKNNSDIVEDKKELIKTRELV-----SCLSDPDPTFNNIISEKVEGVRIHRRRRSASSLRIGEVGGSNFHGNDCLMEI

Query:  ENPS-QGKTTRRKTKFMLKTRLKEVSNCLTTSKELLRVLNHVWAHEDHDQQRPSSISSLITALKAELDRAKTRVDHLIKEQSFHGDEIELLMKRFAEEKA
        ENPS QGKT RRKTKFMLKTRLKEV NCLTTSKEL+RVLNHV AHED+DQ RPSSIS LITALK+E++RAK RVDHLIK+QS HGDEIE++MKRF EEK 
Subjt:  ENPS-QGKTTRRKTKFMLKTRLKEVSNCLTTSKELLRVLNHVWAHEDHDQQRPSSISSLITALKAELDRAKTRVDHLIKEQSFHGDEIELLMKRFAEEKA

Query:  AWKYKERARIRSTITSMADEVEVEKKLRRQAERLNKTIAKELADAKVSLSKAMKDLQREKRAKEIFEQICDELAKGIGEDRAQFEELKKESAKVREEVEK
        AWK +ERAR+RS+I SMADE+E+EKKLR+QAERLNKTIAKELA+AK+SLSKAMKDLQRE+RAKEIFEQICDELAKGIGEDRAQFEE KKESAKVREE+E+
Subjt:  AWKYKERARIRSTITSMADEVEVEKKLRRQAERLNKTIAKELADAKVSLSKAMKDLQREKRAKEIFEQICDELAKGIGEDRAQFEELKKESAKVREEVEK

Query:  EREMLQLADVLREERVQMKLSEAKYHFEEKNAAVERLKDELEAYLITQLGNENGEDSCNKFEKIKELEAYLKKINFGSCQD-MERKEPNDEED--SEDDS
        EREMLQLADVLREERVQMKLSEAKY FEEKNAAVERLKDELEA+LITQ  +EN E+  +   KIKELEAYLKKINFGS Q+ +E  E  +E++   EDDS
Subjt:  EREMLQLADVLREERVQMKLSEAKYHFEEKNAAVERLKDELEAYLITQLGNENGEDSCNKFEKIKELEAYLKKINFGSCQD-MERKEPNDEED--SEDDS

Query:  DLHSIELNMDNNNKSYRWSFVHGSQTNGSKRNSFEK-ESNGRKSLSEKIQWGSICLNRKASNSIKNGEFVGRKSHENSDRFDWERFSELLITQSQTQTQK
        DLHSIELNMDNNNKSYRWSFVHG    GSKRNSFEK E NGRKS+SEKIQWGSICLNRKASN  KNGEFVGRKSHE+S+R +WERF+E+           
Subjt:  DLHSIELNMDNNNKSYRWSFVHGSQTNGSKRNSFEK-ESNGRKSLSEKIQWGSICLNRKASNSIKNGEFVGRKSHENSDRFDWERFSELLITQSQTQTQK

Query:  QDHHHNDEGDETQPKQKSVKCLRDILFPG
              D G   +   KS KCLRDILFPG
Subjt:  QDHHHNDEGDETQPKQKSVKCLRDILFPG

A0A6J1K0X3 uncharacterized protein LOC111490718 isoform X12.2e-21670.36Show/hide
Query:  MSWPEQQTEKRCKIRKRGC----SSSSSSSTLVRKYRFKKAILVAKAKRVGGRSSTPVPTWKMSTKSHSSKISTADFPNRSPSCSLDGGG-KGKEASVSA
        MSWPEQ+TE+ CKIRKR C    SSSSSSSTLV KYRFK                   PTWKMSTKSHSS        NRSPSCS+ GGG KGKEASVS 
Subjt:  MSWPEQQTEKRCKIRKRGC----SSSSSSSTLVRKYRFKKAILVAKAKRVGGRSSTPVPTWKMSTKSHSSKISTADFPNRSPSCSLDGGG-KGKEASVSA

Query:  RKLAATLWEIDEAAAAAPANNSQKLKNNSDIVEDKKELIKTRELV-----SCLSDPDPTFNNIISEKVEGVRIHRRRRSASSLRIGEVGGSNFHGNDCLM
                    + + +  N+SQKLKNN DI+EDK+EL+KT++ V     SCLSDPDP FN+  S+KVEG R+HRRR SASSLRIG  G +NFHGN CL+
Subjt:  RKLAATLWEIDEAAAAAPANNSQKLKNNSDIVEDKKELIKTRELV-----SCLSDPDPTFNNIISEKVEGVRIHRRRRSASSLRIGEVGGSNFHGNDCLM

Query:  EIENPS-QGKTTRRKTKFMLKTRLKEVSNCLTTSKELLRVLNHVWAHEDHDQQRPSSISSLITALKAELDRAKTRVDHLIKEQSFHGDEIELLMKRFAEE
        EIENPS QG+T RRKTKFMLKTRLKEVSNCLTTSKEL+RVLNHV AHED+DQ RPSSIS LITALK+E++RAK RVDHLIK+QSFHGDEIE++MKRF EE
Subjt:  EIENPS-QGKTTRRKTKFMLKTRLKEVSNCLTTSKELLRVLNHVWAHEDHDQQRPSSISSLITALKAELDRAKTRVDHLIKEQSFHGDEIELLMKRFAEE

Query:  KAAWKYKERARIRSTITSMADEVEVEKKLRRQAERLNKTIAKELADAKVSLSKAMKDLQREKRAKEIFEQICDELAKGIGEDRAQFEELKKESAKVREEV
        K AWK +ERAR+RS+I SMADE+E+EKKLR+QAERLNKTIAKELA+AK+SLSKAMKDLQRE+RAKEIFEQICDELAKGIGEDRAQFEE KKESAKVREE+
Subjt:  KAAWKYKERARIRSTITSMADEVEVEKKLRRQAERLNKTIAKELADAKVSLSKAMKDLQREKRAKEIFEQICDELAKGIGEDRAQFEELKKESAKVREEV

Query:  EKEREMLQLADVLREERVQMKLSEAKYHFEEKNAAVERLKDELEAYLITQLGNENGEDSCNKFEKIKELEAYLKKINFGSCQ---DMERKEPNDEEDSED
        E+EREMLQLADVLREERVQMKLSEAKY FEEKNAAVERLKDELEA+LITQ  +EN E+  +   KIKELEAYLKKINFGS Q   D + K    E   ED
Subjt:  EKEREMLQLADVLREERVQMKLSEAKYHFEEKNAAVERLKDELEAYLITQLGNENGEDSCNKFEKIKELEAYLKKINFGSCQ---DMERKEPNDEEDSED

Query:  DSDLHSIELNMDNNNKSYRWSFVHGSQTNGSKRNSFEKES-NGRKSLSEKIQWGSICLNRKASNSIKNGEFVGRKSHENSDRFDWERFSELLITQSQTQT
        DSDLHSIELNMDNNNKSYRWSFVHG    GSKRNSFEK+  NGRKS+SEKIQWGSICLNRKASN  KNG+FVGRKSHE+S+R +WERF+E+         
Subjt:  DSDLHSIELNMDNNNKSYRWSFVHGSQTNGSKRNSFEKES-NGRKSLSEKIQWGSICLNRKASNSIKNGEFVGRKSHENSDRFDWERFSELLITQSQTQT

Query:  QKQDHHHNDEGDETQPKQKSVKCLRDILFPG
                D G   +   KS KCLRDILFPG
Subjt:  QKQDHHHNDEGDETQPKQKSVKCLRDILFPG

A0A6J1K315 uncharacterized protein LOC111490718 isoform X29.1e-20272.06Show/hide
Query:  MSTKSHSSKISTADFPNRSPSCSLDGGG-KGKEASVSARKLAATLWEIDEAAAAAPANNSQKLKNNSDIVEDKKELIKTRELV-----SCLSDPDPTFNN
        MSTKSHSS        NRSPSCS+ GGG KGKEASVS             + + +  N+SQKLKNN DI+EDK+EL+KT++ V     SCLSDPDP FN+
Subjt:  MSTKSHSSKISTADFPNRSPSCSLDGGG-KGKEASVSARKLAATLWEIDEAAAAAPANNSQKLKNNSDIVEDKKELIKTRELV-----SCLSDPDPTFNN

Query:  IISEKVEGVRIHRRRRSASSLRIGEVGGSNFHGNDCLMEIENPS-QGKTTRRKTKFMLKTRLKEVSNCLTTSKELLRVLNHVWAHEDHDQQRPSSISSLI
          S+KVEG R+HRRR SASSLRIG  G +NFHGN CL+EIENPS QG+T RRKTKFMLKTRLKEVSNCLTTSKEL+RVLNHV AHED+DQ RPSSIS LI
Subjt:  IISEKVEGVRIHRRRRSASSLRIGEVGGSNFHGNDCLMEIENPS-QGKTTRRKTKFMLKTRLKEVSNCLTTSKELLRVLNHVWAHEDHDQQRPSSISSLI

Query:  TALKAELDRAKTRVDHLIKEQSFHGDEIELLMKRFAEEKAAWKYKERARIRSTITSMADEVEVEKKLRRQAERLNKTIAKELADAKVSLSKAMKDLQREK
        TALK+E++RAK RVDHLIK+QSFHGDEIE++MKRF EEK AWK +ERAR+RS+I SMADE+E+EKKLR+QAERLNKTIAKELA+AK+SLSKAMKDLQRE+
Subjt:  TALKAELDRAKTRVDHLIKEQSFHGDEIELLMKRFAEEKAAWKYKERARIRSTITSMADEVEVEKKLRRQAERLNKTIAKELADAKVSLSKAMKDLQREK

Query:  RAKEIFEQICDELAKGIGEDRAQFEELKKESAKVREEVEKEREMLQLADVLREERVQMKLSEAKYHFEEKNAAVERLKDELEAYLITQLGNENGEDSCNK
        RAKEIFEQICDELAKGIGEDRAQFEE KKESAKVREE+E+EREMLQLADVLREERVQMKLSEAKY FEEKNAAVERLKDELEA+LITQ  +EN E+  + 
Subjt:  RAKEIFEQICDELAKGIGEDRAQFEELKKESAKVREEVEKEREMLQLADVLREERVQMKLSEAKYHFEEKNAAVERLKDELEAYLITQLGNENGEDSCNK

Query:  FEKIKELEAYLKKINFGSCQ---DMERKEPNDEEDSEDDSDLHSIELNMDNNNKSYRWSFVHGSQTNGSKRNSFEKES-NGRKSLSEKIQWGSICLNRKA
          KIKELEAYLKKINFGS Q   D + K    E   EDDSDLHSIELNMDNNNKSYRWSFVHG    GSKRNSFEK+  NGRKS+SEKIQWGSICLNRKA
Subjt:  FEKIKELEAYLKKINFGSCQ---DMERKEPNDEEDSEDDSDLHSIELNMDNNNKSYRWSFVHGSQTNGSKRNSFEKES-NGRKSLSEKIQWGSICLNRKA

Query:  SNSIKNGEFVGRKSHENSDRFDWERFSELLITQSQTQTQKQDHHHNDEGDETQPKQKSVKCLRDILFPG
        SN  KNG+FVGRKSHE+S+R +WERF+E+                 D G   +   KS KCLRDILFPG
Subjt:  SNSIKNGEFVGRKSHENSDRFDWERFSELLITQSQTQTQKQDHHHNDEGDETQPKQKSVKCLRDILFPG

SwissProt top hitse value%identityAlignment
F4I878 Protein BRANCHLESS TRICHOME4.8e-0631.29Show/hide
Query:  ERARIRSTITSMADEVEVEKKLRRQAERLNKTIAKELADAKVSLSKAMKDLQREKRAKEIFEQICDELAKGIGEDRAQFEELKKESAKVREEVEKEREML
        E  + +  I  +  E++ E+K RR+AE + K +A              KD++ E+ A+E  E     L K +  +       K E  +++ ++E+ER+M 
Subjt:  ERARIRSTITSMADEVEVEKKLRRQAERLNKTIAKELADAKVSLSKAMKDLQREKRAKEIFEQICDELAKGIGEDRAQFEELKKESAKVREEVEKEREML

Query:  QLADVLREERVQMKLSEAKYHFEEKNAAVERLKDELEAYLITQLGNENGEDSCNKFEKIKELE
        +LA+VLREERVQMKL +A+   EEK + +E               N  GE   N+  K K LE
Subjt:  QLADVLREERVQMKLSEAKYHFEEKNAAVERLKDELEAYLITQLGNENGEDSCNKFEKIKELE

Q66GQ2 Uncharacterized protein At5g416202.4e-1829.22Show/hide
Query:  LTTSKELLRVLNHVWAHEDHDQQRPSSISSLITALKAELDRAKTRVDHLIKEQSFHGDEIELLMKRFAEEKAAWKYKERARIRSTITSMADEVEVEKKLR
        L TS ELL+VLN +W+ E   +Q  S+I SLI ALK E+  ++ R+  L++ Q     E++ ++K+ AEEK   K KE  R+ S + S+   +E E+KLR
Subjt:  LTTSKELLRVLNHVWAHEDHDQQRPSSISSLITALKAELDRAKTRVDHLIKEQSFHGDEIELLMKRFAEEKAAWKYKERARIRSTITSMADEVEVEKKLR

Query:  RQAERLNKTIAKELADAKVSLSKAMKDLQREKRAKEIFEQICDELAKGIGEDRAQFEELKKESAKV--REEVEKEREMLQLADVLREERVQMKLSEAKYH
        +++E L++ +A+EL++ K SLS  +K+L+R  ++ ++ E +CDE AKGI     +   LKK++           ++ +L +A+   +ER+QM+L      
Subjt:  RQAERLNKTIAKELADAKVSLSKAMKDLQREKRAKEIFEQICDELAKGIGEDRAQFEELKKESAKV--REEVEKEREMLQLADVLREERVQMKLSEAKYH

Query:  FEEKNAAVERLKDELEAYLITQLGNENGEDSCNKFEKI--KELEAYLKKINFGSCQD---------MERKEPNDE-EDSEDDSDLHSIELNMDNNNKSYR
          +  + +++L+ E+E +L  +  NE   +  N  E +    L A  + ++   C++          E K+P +   D     + H+ + ++D   KS  
Subjt:  FEEKNAAVERLKDELEAYLITQLGNENGEDSCNKFEKI--KELEAYLKKINFGSCQD---------MERKEPNDE-EDSEDDSDLHSIELNMDNNNKSYR

Query:  WSFVHGSQTNGSKRNSFEKESNGRKSLSEKIQ
               Q N   + ++   SNG+K  +  I+
Subjt:  WSFVHGSQTNGSKRNSFEKESNGRKSLSEKIQ

Arabidopsis top hitse value%identityAlignment
AT1G50660.1 unknown protein4.1e-3727.92Show/hide
Query:  STLVRKYRFKKAILVAKAKR--VGGRSS---TPVPTWKMS--TKSHSSKISTADFPNRSPSCSLDGGGKGKE-----ASVSARKLAATLWEIDEAAAAAP
        +T V K R +   L  + +    GGR S   TP+  WK+    K  S  +   D+ + +   +     + K+       VS RKLAA LW +    A++ 
Subjt:  STLVRKYRFKKAILVAKAKR--VGGRSS---TPVPTWKMS--TKSHSSKISTADFPNRSPSCSLDGGGKGKE-----ASVSARKLAATLWEIDEAAAAAP

Query:  ANNSQKLKNNSDIVEDKKELIKTRELVSCLSDPDPTFNNIISEKVEGVRIHRRRRSASSLRIGEVGGSNFHGNDCLMEIENPSQGKTTRRKTKFMLKTRL
            +            KE +  +     +  P   +    S+K  G + ++ R++ S++       +  +G  C +E   P         TK+      
Subjt:  ANNSQKLKNNSDIVEDKKELIKTRELVSCLSDPDPTFNNIISEKVEGVRIHRRRRSASSLRIGEVGGSNFHGNDCLMEIENPSQGKTTRRKTKFMLKTRL

Query:  KEVSNCLTTSKELLRVLNHVWAHEDHDQQRPSSISSLITALKAELDRAKTRVDHLIKEQSFHGDEIELLMKRFAEEKAAWKYKERARIRSTITSMADEVE
             CL T +E+ ++ +++   +  DQQ   +  SL+++L+AEL+ A  R++ L  E+  H  ++E  +++ +EE+AAW+ +E  ++R+ I  M  ++ 
Subjt:  KEVSNCLTTSKELLRVLNHVWAHEDHDQQRPSSISSLITALKAELDRAKTRVDHLIKEQSFHGDEIELLMKRFAEEKAAWKYKERARIRSTITSMADEVE

Query:  VEKKLRRQAERLNKTIAKELADAKVSLSKAMKDLQREKRAKEIFEQICDELAKGIGEDRAQFEELKKESAKVREEVEKEREMLQLADVLREERVQMKLSE
         EKK R++ E +N  +  ELAD+K+++ + M+D ++E++A+E+ E++CDELAK IGED+A+ E LK+ES  +REEV+ ER MLQ+A+V REERVQMKL +
Subjt:  VEKKLRRQAERLNKTIAKELADAKVSLSKAMKDLQREKRAKEIFEQICDELAKGIGEDRAQFEELKKESAKVREEVEKEREMLQLADVLREERVQMKLSE

Query:  AKYHFEEKNAAVERLKDELEAYL-----ITQLGNENGEDSCNK------FEKIKE----------LEAYLKKINFGSCQDMERKEPNDEEDSEDDSDLHS
        AK   EE+ + + +L  +LE++L     +T +      +   +       ++IKE          + A  +++N G   D E ++         DS +H+
Subjt:  AKYHFEEKNAAVERLKDELEAYL-----ITQLGNENGEDSCNK------FEKIKE----------LEAYLKKINFGSCQDMERKEPNDEEDSEDDSDLHS

Query:  IELNMDNNNKSYRWSFVHGSQTNGSKRNSFEKESNGRKSLSEKIQWGSICLNRKASNSIKNGEFVGRKSHENS
        + L+ +  NK  R S  +  Q NG      E++ +G +++S   + GS      +  S+ N     R S+ +S
Subjt:  IELNMDNNNKSYRWSFVHGSQTNGSKRNSFEKESNGRKSLSEKIQWGSICLNRKASNSIKNGEFVGRKSHENS

AT3G11590.1 unknown protein2.8e-9444.94Show/hide
Query:  KIRKRGCSS--SSSSSTLVRKYRFKKAILVAKAKRVGGRSSTPVPTWKMSTKSHSSKISTADFPNRSPS--CSLDGGGKGKEASVSARKLAATLWEIDEA
        KIRKRGCSS  SS+SS L   YRFK+AI+V K     G S+TPVPTW++  +S S + S A     SPS  C    G     A VSARKLAATLWE++E 
Subjt:  KIRKRGCSS--SSSSSTLVRKYRFKKAILVAKAKRVGGRSSTPVPTWKMSTKSHSSKISTADFPNRSPS--CSLDGGGKGKEASVSARKLAATLWEIDEA

Query:  AA------AAPANNSQKLKNNSDIVEDKKELIKTRELVSCLSDPDPTFNNIISEKVEGVRIHRRRRSASS----LRIGE--VGGSNFHGNDCLMEIENPS
         +      AAP     + K     +   +  + +  L   LSDP    ++ +SE++E      R+R ASS    LR+G+  VG  +   +   M+IE  S
Subjt:  AA------AAPANNSQKLKNNSDIVEDKKELIKTRELVSCLSDPDPTFNNIISEKVEGVRIHRRRRSASS----LRIGE--VGGSNFHGNDCLMEIENPS

Query:  QGKTTRRKTKFMLKTRLKEVSNCLTTSKELLRVLNHVWAHEDHDQQRPSSISSLITALKAELDRAKTRVDHLIKEQSFHGDEIELLMKRFAEEKAAWKYK
        + +T    T   +KTRLK+ SN LTTSKELL+++N +W  +D    RPSS  SL++AL +EL+RA+ +V+ LI E     ++I  LMKRFAEEKA WK  
Subjt:  QGKTTRRKTKFMLKTRLKEVSNCLTTSKELLRVLNHVWAHEDHDQQRPSSISSLITALKAELDRAKTRVDHLIKEQSFHGDEIELLMKRFAEEKAAWKYK

Query:  ERARIRSTITSMADEVEVEKKLRRQAERLNKTIAKELADAKVSLSKAMKDLQREKRAKEIFEQICDELAKGIGEDRAQFEELKKESAKVREEVEKEREML
        E+  + + I S+A E+EVE+KLRR+ E LNK + KELA+ K +L KA+K+++ EKRA+ + E++CDELA+ I ED+A+ EELK+ES KV+EEVEKEREML
Subjt:  ERARIRSTITSMADEVEVEKKLRRQAERLNKTIAKELADAKVSLSKAMKDLQREKRAKEIFEQICDELAKGIGEDRAQFEELKKESAKVREEVEKEREML

Query:  QLADVLREERVQMKLSEAKYHFEEKNAAVERLKDELEAYLITQLGNENGEDSCNKFEKIKELEAYLK-KINFGSCQDMERKEPNDEEDSEDDSDLHSIEL
        QLAD LREERVQMKLSEAK+  EEKNAAV++L+++L+ YL  +   E   +        +E   YL   I+FGS    + +  N  E+   +SDLHSIEL
Subjt:  QLADVLREERVQMKLSEAKYHFEEKNAAVERLKDELEAYLITQLGNENGEDSCNKFEKIKELEAYLK-KINFGSCQDMERKEPNDEEDSEDDSDLHSIEL

Query:  NMDNNNKSYRWSFVHGSQTNGSKRNSFEKESNGRKSLSEKIQW
        N+D  NKSY+W +    + N  ++++  K  + ++S+S+ + W
Subjt:  NMDNNNKSYRWSFVHGSQTNGSKRNSFEKESNGRKSLSEKIQW

AT3G20350.1 unknown protein8.8e-3230.46Show/hide
Query:  CLTTSKELLRVLNHV-WAHEDHDQQRPSSISSLITALKAELDRAKTRVDHLIKEQSFHGDEIELLMKRFAEEKAAWKYKERARIRSTITSMADEVEVEKK
        CL T  ++ ++  +V W     + Q+ + + SL ++++ +L  A+  +  L  E+     ++E  +K+ +EE+AAW+ +E  ++R+ I  M  ++  EKK
Subjt:  CLTTSKELLRVLNHV-WAHEDHDQQRPSSISSLITALKAELDRAKTRVDHLIKEQSFHGDEIELLMKRFAEEKAAWKYKERARIRSTITSMADEVEVEKK

Query:  LRRQAERLNKTIAKELADAKVSLSKAMKDLQREKRAKEIFEQICDELAKGIGEDRAQFEELKKESAKVREEVEKEREMLQLADVLREERVQMKLSEAKYH
         R++ E +N  +  ELAD+K+++ + M D Q+E++A+E+ E++CDELAK I ED+A+ E LK ES  +REEV+ ER MLQ+A+V REERVQMKL +AK  
Subjt:  LRRQAERLNKTIAKELADAKVSLSKAMKDLQREKRAKEIFEQICDELAKGIGEDRAQFEELKKESAKVREEVEKEREMLQLADVLREERVQMKLSEAKYH

Query:  FEEKNAAVERLKDELEAYL------------ITQLGNENGEDSCNKFEKIKE----------LEAYLKKINFGSCQDMERKEPNDEEDSEDDSDLHSIEL
         EEK + + +L  ++EA+L            + +L  E    S +  ++IKE          +    +++N G  QD E ++          S  H++  
Subjt:  FEEKNAAVERLKDELEAYL------------ITQLGNENGEDSCNKFEKIKE----------LEAYLKKINFGSCQDMERKEPNDEEDSEDDSDLHSIEL

Query:  NMDNNNKSYRWSFVHGSQTNGSKRNSFEKESNGRKSLSEKIQWGSICLNRKASNSIKNGEFVGRKSHENSD
        +++  NK         S     +   FE++ +G +++S   + GS   +     SI N   +    H NS+
Subjt:  NMDNNNKSYRWSFVHGSQTNGSKRNSFEKESNGRKSLSEKIQWGSICLNRKASNSIKNGEFVGRKSHENSD

AT5G22310.1 unknown protein1.6e-4937.5Show/hide
Query:  QQTEKRCKIRKRGCSSSSSSSTLVRKYRFKKAILVAK-AKRVGGRSSTPVPTWKMSTKSHSSKISTADFPNRSPSCSLDGGGKGKEASVSARKLAATLWE
        +Q +K CKIRKRG  SSSSSS+L R+ RFK+AI   K A +  G S TPV     S  +  + +  +  P   P        + +++ VSARKLAATLWE
Subjt:  QQTEKRCKIRKRGCSSSSSSSTLVRKYRFKKAILVAK-AKRVGGRSSTPVPTWKMSTKSHSSKISTADFPNRSPSCSLDGGGKGKEASVSARKLAATLWE

Query:  IDEAAAAAPANNSQKLKNNSDIVEDKK----ELIKTRELVSCLSDP---DPTFNNIISEKVEGVRIHRRRRSASSLRIGEVGGSNFHGNDCLMEIENPSQ
        I++   A P  NS K     D +  KK       K+ E  S    P   DP  + + SE+++      RRRS +  ++                  NP +
Subjt:  IDEAAAAAPANNSQKLKNNSDIVEDKK----ELIKTRELVSCLSDP---DPTFNNIISEKVEGVRIHRRRRSASSLRIGEVGGSNFHGNDCLMEIENPSQ

Query:  GKTTRRKTKFMLKTRLKEVSNCLTTSKELLRVLNHVWAHEDHDQQRPSSISSLITALKAELDRAKTRVDHLIKEQSFHGDEIELLMKRFAEEKAAWKYKE
         K     +   +KTR K VS+ LTTSKEL++VL  +   E  D  + +S + LI+AL  ELDRA++ + HL+ E     DE E       EEK       
Subjt:  GKTTRRKTKFMLKTRLKEVSNCLTTSKELLRVLNHVWAHEDHDQQRPSSISSLITALKAELDRAKTRVDHLIKEQSFHGDEIELLMKRFAEEKAAWKYKE

Query:  RARIRSTITSMADEVEVEKKLRRQAERLNKTIAKELADAKVSLSKAMKDLQREKRAKEIFEQICDELAKGIGEDRAQFEELKKESAKVREEVEKEREMLQ
            R  I S+ +E  VE+KLRR+ E++N+ + +EL +AK +  K  ++++REKRAK++ E++CDEL KGIG+D              ++E+EKEREM+ 
Subjt:  RARIRSTITSMADEVEVEKKLRRQAERLNKTIAKELADAKVSLSKAMKDLQREKRAKEIFEQICDELAKGIGEDRAQFEELKKESAKVREEVEKEREMLQ

Query:  LADVLREERVQMKLSEAKYHFEEKNAAVERLKDELEAYLITQLGNENGEDSCNKFEKIKELEAYLKKINFGSCQDMERKEPNDEEDSEDDSDLHSIELNM
        +ADVLREERVQMKL+EAK+ FE+K AAVERLK EL       L  E G+ S        E+   L+ I+                D +++SDL SIELNM
Subjt:  LADVLREERVQMKLSEAKYHFEEKNAAVERLKDELEAYLITQLGNENGEDSCNKFEKIKELEAYLKKINFGSCQDMERKEPNDEEDSEDDSDLHSIELNM

Query:  DNNNKSYRWSFV
        ++ +K   W +V
Subjt:  DNNNKSYRWSFV

AT5G41620.1 FUNCTIONS IN: molecular_function unknown1.7e-1929.22Show/hide
Query:  LTTSKELLRVLNHVWAHEDHDQQRPSSISSLITALKAELDRAKTRVDHLIKEQSFHGDEIELLMKRFAEEKAAWKYKERARIRSTITSMADEVEVEKKLR
        L TS ELL+VLN +W+ E   +Q  S+I SLI ALK E+  ++ R+  L++ Q     E++ ++K+ AEEK   K KE  R+ S + S+   +E E+KLR
Subjt:  LTTSKELLRVLNHVWAHEDHDQQRPSSISSLITALKAELDRAKTRVDHLIKEQSFHGDEIELLMKRFAEEKAAWKYKERARIRSTITSMADEVEVEKKLR

Query:  RQAERLNKTIAKELADAKVSLSKAMKDLQREKRAKEIFEQICDELAKGIGEDRAQFEELKKESAKV--REEVEKEREMLQLADVLREERVQMKLSEAKYH
        +++E L++ +A+EL++ K SLS  +K+L+R  ++ ++ E +CDE AKGI     +   LKK++           ++ +L +A+   +ER+QM+L      
Subjt:  RQAERLNKTIAKELADAKVSLSKAMKDLQREKRAKEIFEQICDELAKGIGEDRAQFEELKKESAKV--REEVEKEREMLQLADVLREERVQMKLSEAKYH

Query:  FEEKNAAVERLKDELEAYLITQLGNENGEDSCNKFEKI--KELEAYLKKINFGSCQD---------MERKEPNDE-EDSEDDSDLHSIELNMDNNNKSYR
          +  + +++L+ E+E +L  +  NE   +  N  E +    L A  + ++   C++          E K+P +   D     + H+ + ++D   KS  
Subjt:  FEEKNAAVERLKDELEAYLITQLGNENGEDSCNKFEKI--KELEAYLKKINFGSCQD---------MERKEPNDE-EDSEDDSDLHSIELNMDNNNKSYR

Query:  WSFVHGSQTNGSKRNSFEKESNGRKSLSEKIQ
               Q N   + ++   SNG+K  +  I+
Subjt:  WSFVHGSQTNGSKRNSFEKESNGRKSLSEKIQ


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCGTGGCCAGAGCAGCAAACAGAGAAGAGATGTAAGATTAGAAAGCGAGGGTGTTCGTCGTCGTCGTCTTCTTCGACTTTGGTTCGTAAGTACAGATTCAAGAAGGC
AATTCTGGTGGCGAAGGCGAAGCGAGTTGGTGGCCGTTCGAGCACGCCGGTCCCGACGTGGAAAATGAGCACAAAATCTCATTCTTCCAAGATATCCACCGCCGACTTCC
CCAACCGGTCGCCGTCATGCTCCCTCGACGGCGGCGGTAAAGGGAAGGAAGCGTCTGTTTCAGCCCGCAAACTGGCGGCCACCCTGTGGGAGATCGACGAAGCAGCAGCA
GCAGCTCCGGCCAACAATTCCCAGAAGTTGAAGAACAATTCAGATATCGTTGAAGATAAGAAAGAACTGATCAAAACCCGAGAGCTGGTTTCGTGTTTATCAGATCCGGA
TCCGACTTTCAACAACATAATTTCTGAGAAAGTTGAAGGTGTCAGAATACACAGAAGAAGAAGGTCGGCATCTTCGTTGAGAATTGGGGAGGTGGGGGGTTCGAATTTTC
ATGGCAATGACTGTTTAATGGAGATTGAAAATCCTAGTCAGGGAAAGACGACACGTCGTAAGACCAAATTTATGCTAAAAACACGTTTGAAGGAAGTGAGCAATTGCCTG
ACGACATCGAAGGAGCTTCTCAGAGTTTTAAACCATGTCTGGGCTCATGAAGATCATGATCAGCAGCGGCCATCGTCAATTTCATCTCTGATCACAGCTCTAAAAGCGGA
GCTTGATCGAGCCAAGACCCGAGTTGACCATTTAATCAAAGAGCAGAGCTTCCATGGCGACGAGATTGAGCTTCTAATGAAGCGTTTTGCAGAGGAAAAGGCTGCGTGGA
AGTACAAGGAGAGAGCCAGGATTCGGAGCACCATTACATCAATGGCGGACGAAGTTGAGGTCGAGAAGAAGCTCAGAAGACAAGCGGAGAGACTGAACAAGACGATTGCC
AAAGAACTTGCCGACGCTAAAGTTTCGCTTTCAAAAGCAATGAAAGATCTGCAGAGGGAAAAGAGAGCGAAGGAGATTTTCGAGCAGATTTGCGACGAGTTGGCAAAAGG
GATTGGAGAAGACAGAGCCCAGTTTGAGGAGCTGAAGAAGGAGTCCGCTAAAGTGAGAGAAGAAGTGGAAAAGGAGAGGGAAATGCTTCAACTTGCTGATGTTTTACGAG
AGGAAAGAGTTCAGATGAAGCTCTCGGAGGCCAAGTATCATTTCGAAGAGAAAAACGCAGCGGTGGAGAGGCTAAAAGATGAGCTCGAAGCGTATTTGATAACCCAACTG
GGGAACGAAAATGGAGAAGATTCTTGTAATAAATTTGAGAAAATCAAGGAGCTTGAGGCTTACTTGAAGAAGATAAACTTCGGGTCATGTCAAGATATGGAAAGAAAAGA
GCCGAATGACGAAGAAGATTCAGAAGATGACAGTGATCTGCACTCCATTGAACTGAACATGGACAACAACAACAAGAGTTACAGATGGAGTTTCGTCCATGGCTCGCAGA
CGAATGGGTCGAAAAGGAACTCATTCGAGAAGGAAAGCAACGGAAGAAAGTCACTTTCGGAGAAAATCCAATGGGGAAGCATTTGTTTGAATAGGAAAGCATCGAACAGT
ATCAAAAATGGAGAATTTGTTGGTCGTAAGAGCCATGAAAACTCAGACAGATTTGACTGGGAGAGATTTTCAGAGCTTCTTATTACACAGTCTCAGACACAGACACAGAA
ACAAGATCATCATCACAACGACGAGGGTGATGAAACCCAGCCGAAGCAAAAATCAGTGAAGTGCCTTAGAGACATTCTATTTCCAGGAGCAGCAGGATGTGAAGAAACAG
CAGAAAAAAATCAAGTTGGGAAGACAGATAATGGCATTGCCATGATGGCTGTCCAAACTCAAAGCGATGAAGCTGCTGGTTCAGTGGTCCCTTTCCCTGAAGTTGGATTA
AGTCCACGTGCTATCTACGTGATGGTGAGCACCCCACGGCCAATCCGCCGCCGTTCGCCACGTGTCGGTCGTCGGTGGGCCGGAACTGAGGTGATCTCCGGCGGCCGGAG
AGCCACCACCGACGTGATACTTGGAGTATCCGTACGGAAGTTGTACAACTTCTTGCCGCTGTTGATGTATGAAGTTGAATTAAAACTGCAGGATTCAATGGTGGTTTGCC
GTCTTCGGAAGAACAACGATTTTCGGCGGAACAACTCGGACAAGGCAACTTCAAGTAAACGGCAGGTATCCGGCGACGATGGGGACGCCGGAGTTGGGAGCAGCCGAGGA
GAGAAGGGGGGCGGTGAAAGTGACCGATTCTCGAAGAACTGCAGCAGCAGTTTCGATTCTCATTCGCTGGAGCAATTTGATTCTGCGATTGAATCCAATCAGAAACAGAC
CTCCGATCCCGCTCTTCAGGAATCTCTCAGTCCTGATCCTGAAAAGGGTTATAGTGGAGACGAAGAGTTGTATGCAGATCTATTGAAAGATGACATAATTACATTGGATG
ATAGTTTACTATGCGAAACTCATAATCCAACCCCCATTATGAACTTCTCGGAATTGGAGAAATACTCCCCCTCGCCTACACAAACCGCCGTGCCGGAGGCGGAGGCGCTG
CCATTGCTGCCGTACGGTACAAGTCGGCCGCGGTTGTCCCGCTCAGGCGTACGTGTGAATGAAGGTGCAGCGGAAGCGCGTCCGAATTCCGAGCAGTATCTATTAATTTG
TCGGCGACCTTCTTCATATGGTGCCGAAGCGCTTTCGGAATTTTTCAGTATCCAACGCCAAGTTTCAATAACCAACCACGAATTCTCTCCACCAATTTTGATCATGAACT
TGGGGCTTTCTCGTAGGTGTCGATATTTCGAGTGGAAGCAACAACTGGTACCTGAGGTTGATACCGCGGCAAACTCCCAGAAGCAGCAAGAAATACCAAAGAGGGAAGAA
GCAGTTAAGGAGAAGGAATTCAAAAGCGTGTCAGTTGATATGTCAGTCAGAACTCATGAACTAATGAAAGTTCAAAGGGTAGCATCATACCAATCCCAAATTCTATTGCG
TGAGAATCACTTCCCGCCCGCAGCCATTTTGGATGGAGGGTATAACATCAAAGGGGATTATGGGGTGCTTGGAGGTGCTGCTGCTGCCCATTTCTTTTACCCAAGGAGAA
AAGGACAGTTTCTTATGGCCTTGTCAATTAGACAAGTCAAGCCCTGCGCTTCAGTTTCATTTTTGTCTGAACTTTTTTATTCTACATCATCTTATTATCACAAGAAAGGA
ATATCCAGAGACTTAATGAAGTTGTGCGAAGCCTACGGGAACAGTTGCAGCAGTGCAGAGGCAGTAGCACCACAAACGGGACAACTCTGCCGAGTGGACATATTCTTGAG
CTCGAACGACCTCATGTTTTGGAGGACTGATGCGGCCCCCAACCACTAA
mRNA sequenceShow/hide mRNA sequence
ATGTCGTGGCCAGAGCAGCAAACAGAGAAGAGATGTAAGATTAGAAAGCGAGGGTGTTCGTCGTCGTCGTCTTCTTCGACTTTGGTTCGTAAGTACAGATTCAAGAAGGC
AATTCTGGTGGCGAAGGCGAAGCGAGTTGGTGGCCGTTCGAGCACGCCGGTCCCGACGTGGAAAATGAGCACAAAATCTCATTCTTCCAAGATATCCACCGCCGACTTCC
CCAACCGGTCGCCGTCATGCTCCCTCGACGGCGGCGGTAAAGGGAAGGAAGCGTCTGTTTCAGCCCGCAAACTGGCGGCCACCCTGTGGGAGATCGACGAAGCAGCAGCA
GCAGCTCCGGCCAACAATTCCCAGAAGTTGAAGAACAATTCAGATATCGTTGAAGATAAGAAAGAACTGATCAAAACCCGAGAGCTGGTTTCGTGTTTATCAGATCCGGA
TCCGACTTTCAACAACATAATTTCTGAGAAAGTTGAAGGTGTCAGAATACACAGAAGAAGAAGGTCGGCATCTTCGTTGAGAATTGGGGAGGTGGGGGGTTCGAATTTTC
ATGGCAATGACTGTTTAATGGAGATTGAAAATCCTAGTCAGGGAAAGACGACACGTCGTAAGACCAAATTTATGCTAAAAACACGTTTGAAGGAAGTGAGCAATTGCCTG
ACGACATCGAAGGAGCTTCTCAGAGTTTTAAACCATGTCTGGGCTCATGAAGATCATGATCAGCAGCGGCCATCGTCAATTTCATCTCTGATCACAGCTCTAAAAGCGGA
GCTTGATCGAGCCAAGACCCGAGTTGACCATTTAATCAAAGAGCAGAGCTTCCATGGCGACGAGATTGAGCTTCTAATGAAGCGTTTTGCAGAGGAAAAGGCTGCGTGGA
AGTACAAGGAGAGAGCCAGGATTCGGAGCACCATTACATCAATGGCGGACGAAGTTGAGGTCGAGAAGAAGCTCAGAAGACAAGCGGAGAGACTGAACAAGACGATTGCC
AAAGAACTTGCCGACGCTAAAGTTTCGCTTTCAAAAGCAATGAAAGATCTGCAGAGGGAAAAGAGAGCGAAGGAGATTTTCGAGCAGATTTGCGACGAGTTGGCAAAAGG
GATTGGAGAAGACAGAGCCCAGTTTGAGGAGCTGAAGAAGGAGTCCGCTAAAGTGAGAGAAGAAGTGGAAAAGGAGAGGGAAATGCTTCAACTTGCTGATGTTTTACGAG
AGGAAAGAGTTCAGATGAAGCTCTCGGAGGCCAAGTATCATTTCGAAGAGAAAAACGCAGCGGTGGAGAGGCTAAAAGATGAGCTCGAAGCGTATTTGATAACCCAACTG
GGGAACGAAAATGGAGAAGATTCTTGTAATAAATTTGAGAAAATCAAGGAGCTTGAGGCTTACTTGAAGAAGATAAACTTCGGGTCATGTCAAGATATGGAAAGAAAAGA
GCCGAATGACGAAGAAGATTCAGAAGATGACAGTGATCTGCACTCCATTGAACTGAACATGGACAACAACAACAAGAGTTACAGATGGAGTTTCGTCCATGGCTCGCAGA
CGAATGGGTCGAAAAGGAACTCATTCGAGAAGGAAAGCAACGGAAGAAAGTCACTTTCGGAGAAAATCCAATGGGGAAGCATTTGTTTGAATAGGAAAGCATCGAACAGT
ATCAAAAATGGAGAATTTGTTGGTCGTAAGAGCCATGAAAACTCAGACAGATTTGACTGGGAGAGATTTTCAGAGCTTCTTATTACACAGTCTCAGACACAGACACAGAA
ACAAGATCATCATCACAACGACGAGGGTGATGAAACCCAGCCGAAGCAAAAATCAGTGAAGTGCCTTAGAGACATTCTATTTCCAGGAGCAGCAGGATGTGAAGAAACAG
CAGAAAAAAATCAAGTTGGGAAGACAGATAATGGCATTGCCATGATGGCTGTCCAAACTCAAAGCGATGAAGCTGCTGGTTCAGTGGTCCCTTTCCCTGAAGTTGGATTA
AGTCCACGTGCTATCTACGTGATGGTGAGCACCCCACGGCCAATCCGCCGCCGTTCGCCACGTGTCGGTCGTCGGTGGGCCGGAACTGAGGTGATCTCCGGCGGCCGGAG
AGCCACCACCGACGTGATACTTGGAGTATCCGTACGGAAGTTGTACAACTTCTTGCCGCTGTTGATGTATGAAGTTGAATTAAAACTGCAGGATTCAATGGTGGTTTGCC
GTCTTCGGAAGAACAACGATTTTCGGCGGAACAACTCGGACAAGGCAACTTCAAGTAAACGGCAGGTATCCGGCGACGATGGGGACGCCGGAGTTGGGAGCAGCCGAGGA
GAGAAGGGGGGCGGTGAAAGTGACCGATTCTCGAAGAACTGCAGCAGCAGTTTCGATTCTCATTCGCTGGAGCAATTTGATTCTGCGATTGAATCCAATCAGAAACAGAC
CTCCGATCCCGCTCTTCAGGAATCTCTCAGTCCTGATCCTGAAAAGGGTTATAGTGGAGACGAAGAGTTGTATGCAGATCTATTGAAAGATGACATAATTACATTGGATG
ATAGTTTACTATGCGAAACTCATAATCCAACCCCCATTATGAACTTCTCGGAATTGGAGAAATACTCCCCCTCGCCTACACAAACCGCCGTGCCGGAGGCGGAGGCGCTG
CCATTGCTGCCGTACGGTACAAGTCGGCCGCGGTTGTCCCGCTCAGGCGTACGTGTGAATGAAGGTGCAGCGGAAGCGCGTCCGAATTCCGAGCAGTATCTATTAATTTG
TCGGCGACCTTCTTCATATGGTGCCGAAGCGCTTTCGGAATTTTTCAGTATCCAACGCCAAGTTTCAATAACCAACCACGAATTCTCTCCACCAATTTTGATCATGAACT
TGGGGCTTTCTCGTAGGTGTCGATATTTCGAGTGGAAGCAACAACTGGTACCTGAGGTTGATACCGCGGCAAACTCCCAGAAGCAGCAAGAAATACCAAAGAGGGAAGAA
GCAGTTAAGGAGAAGGAATTCAAAAGCGTGTCAGTTGATATGTCAGTCAGAACTCATGAACTAATGAAAGTTCAAAGGGTAGCATCATACCAATCCCAAATTCTATTGCG
TGAGAATCACTTCCCGCCCGCAGCCATTTTGGATGGAGGGTATAACATCAAAGGGGATTATGGGGTGCTTGGAGGTGCTGCTGCTGCCCATTTCTTTTACCCAAGGAGAA
AAGGACAGTTTCTTATGGCCTTGTCAATTAGACAAGTCAAGCCCTGCGCTTCAGTTTCATTTTTGTCTGAACTTTTTTATTCTACATCATCTTATTATCACAAGAAAGGA
ATATCCAGAGACTTAATGAAGTTGTGCGAAGCCTACGGGAACAGTTGCAGCAGTGCAGAGGCAGTAGCACCACAAACGGGACAACTCTGCCGAGTGGACATATTCTTGAG
CTCGAACGACCTCATGTTTTGGAGGACTGATGCGGCCCCCAACCACTAA
Protein sequenceShow/hide protein sequence
MSWPEQQTEKRCKIRKRGCSSSSSSSTLVRKYRFKKAILVAKAKRVGGRSSTPVPTWKMSTKSHSSKISTADFPNRSPSCSLDGGGKGKEASVSARKLAATLWEIDEAAA
AAPANNSQKLKNNSDIVEDKKELIKTRELVSCLSDPDPTFNNIISEKVEGVRIHRRRRSASSLRIGEVGGSNFHGNDCLMEIENPSQGKTTRRKTKFMLKTRLKEVSNCL
TTSKELLRVLNHVWAHEDHDQQRPSSISSLITALKAELDRAKTRVDHLIKEQSFHGDEIELLMKRFAEEKAAWKYKERARIRSTITSMADEVEVEKKLRRQAERLNKTIA
KELADAKVSLSKAMKDLQREKRAKEIFEQICDELAKGIGEDRAQFEELKKESAKVREEVEKEREMLQLADVLREERVQMKLSEAKYHFEEKNAAVERLKDELEAYLITQL
GNENGEDSCNKFEKIKELEAYLKKINFGSCQDMERKEPNDEEDSEDDSDLHSIELNMDNNNKSYRWSFVHGSQTNGSKRNSFEKESNGRKSLSEKIQWGSICLNRKASNS
IKNGEFVGRKSHENSDRFDWERFSELLITQSQTQTQKQDHHHNDEGDETQPKQKSVKCLRDILFPGAAGCEETAEKNQVGKTDNGIAMMAVQTQSDEAAGSVVPFPEVGL
SPRAIYVMVSTPRPIRRRSPRVGRRWAGTEVISGGRRATTDVILGVSVRKLYNFLPLLMYEVELKLQDSMVVCRLRKNNDFRRNNSDKATSSKRQVSGDDGDAGVGSSRG
EKGGGESDRFSKNCSSSFDSHSLEQFDSAIESNQKQTSDPALQESLSPDPEKGYSGDEELYADLLKDDIITLDDSLLCETHNPTPIMNFSELEKYSPSPTQTAVPEAEAL
PLLPYGTSRPRLSRSGVRVNEGAAEARPNSEQYLLICRRPSSYGAEALSEFFSIQRQVSITNHEFSPPILIMNLGLSRRCRYFEWKQQLVPEVDTAANSQKQQEIPKREE
AVKEKEFKSVSVDMSVRTHELMKVQRVASYQSQILLRENHFPPAAILDGGYNIKGDYGVLGGAAAAHFFYPRRKGQFLMALSIRQVKPCASVSFLSELFYSTSSYYHKKG
ISRDLMKLCEAYGNSCSSAEAVAPQTGQLCRVDIFLSSNDLMFWRTDAAPNH