; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10011899 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10011899
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionQWRF motif-containing protein 7
Genome locationChr01:14927421..14935626
RNA-Seq ExpressionHG10011899
SyntenyHG10011899
Gene Ontology termsNA
InterPro domainsIPR007573 - QWRF family


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008465854.1 PREDICTED: QWRF motif-containing protein 7 [Cucumis melo]7.1e-14680.94Show/hide
Query:  MENTRIRRPKSPALPPPPSPGSLFRSKSRSSAAITLPENNSCAANTSQRLTIHRSKSVTKSRTKTKDEENLNPLNCKTKAGFTKFLKSSPATSPSAWALS
        MENTRIRRPKSPALPPPPSPG    SKSRSSAAITLPENNSCAANTSQR TIHRSKSVTKSRTK KDEENLNPLNCKTKAGFTKFLKSSPATSPSAWALS
Subjt:  MENTRIRRPKSPALPPPPSPGSLFRSKSRSSAAITLPENNSCAANTSQRLTIHRSKSVTKSRTKTKDEENLNPLNCKTKAGFTKFLKSSPATSPSAWALS

Query:  PGRSLGSPVVLLPLTTVEPAVVDGRKEKLGGQRGGGVSGVLRFFRPKKAAVTVEAEELHRFRILQNRLLQWKYVNVRAETSMANVKTLVQDRIFSVWLHN
        PGRSLGS  VL PLTTVE AV DGR+ KLG QR   VSGVLRFF+PKKAA  +EAEELHRFRILQNRLLQWKYVNVR ETSMANVKT VQDRIFSVWLHN
Subjt:  PGRSLGSPVVLLPLTTVEPAVVDGRKEKLGGQRGGGVSGVLRFFRPKKAAVTVEAEELHRFRILQNRLLQWKYVNVRAETSMANVKTLVQDRIFSVWLHN

Query:  LRMRNRILEKRIEVEKLRKEIKLYIIIFPQVLLLKQWAKLDKRNQESVGNLASLLSTLSLRIPLIHGAK------------------------------Q
        LRMRN+ILEKRIEVEKLRKEIKLY IIFPQV LLKQWAKLDKRNQESVG+LASLLST SL++PL+HGAK                              Q
Subjt:  LRMRNRILEKRIEVEKLRKEIKLYIIIFPQVLLLKQWAKLDKRNQESVGNLASLLSTLSLRIPLIHGAK------------------------------Q

Query:  LEKTLYMLTERLSIFKEQEDCLEKLEEAVCSVTTLLAKENSIRIQLIQATNSTKKD---PTC
        LEKTLY+LTERLSIFKEQE+CLEKLEEAVCSV TLLAKENSIRIQ+IQATNST KD   PTC
Subjt:  LEKTLYMLTERLSIFKEQEDCLEKLEEAVCSVTTLLAKENSIRIQLIQATNSTKKD---PTC

XP_022952094.1 QWRF motif-containing protein 7 [Cucurbita moschata]2.1e-13776.67Show/hide
Query:  MENTRIRRPKSPALPPPPSPGSLFRSKSRSSAAITLPENNSCAANTSQRLTIHRSKSVTKSRTKTKDEENLNPLNCKTKAGFTKFLKSSPATSPSAWALS
        M+NTR RRP SPAL PPPSP +L RSKSRSSA+IT PENNSCAANTSQR T HRSKSVTKSR   KDEENLNPLNCKTK GF+KFLKSSPAT+PSAWALS
Subjt:  MENTRIRRPKSPALPPPPSPGSLFRSKSRSSAAITLPENNSCAANTSQRLTIHRSKSVTKSRTKTKDEENLNPLNCKTKAGFTKFLKSSPATSPSAWALS

Query:  PGRSLGSPVVLLPLTTVEPAVVDGRKEKLGGQRGGGVSGVLRFFRPKKAAVTVEAEELHRFRILQNRLLQWKYVNVRAETSMANVKTLVQDRIFSVWLHN
        PGRSLGSP +L P  TVEPAVVDG + KLGG+RGG V+GVLRFFR KKA  TVEA ELHRFRILQNRLLQW+YVNVRAE SMANVKT+ QDRIFS WL N
Subjt:  PGRSLGSPVVLLPLTTVEPAVVDGRKEKLGGQRGGGVSGVLRFFRPKKAAVTVEAEELHRFRILQNRLLQWKYVNVRAETSMANVKTLVQDRIFSVWLHN

Query:  LRMRNRILEKRIEVEKLRKEIKLYIIIFPQVLLLKQWAKLDKRNQESVGNLASLLSTLSLRIPLIHGAK------------------------------Q
        LRMRN ILEKRIEVEKLRKEIKLY IIFPQV LLKQWAKLDKRNQESVG LAS+LS LSL++PL+HGAK                              +
Subjt:  LRMRNRILEKRIEVEKLRKEIKLYIIIFPQVLLLKQWAKLDKRNQESVGNLASLLSTLSLRIPLIHGAK------------------------------Q

Query:  LEKTLYMLTERLSIFKEQEDCLEKLEEAVCSVTTLLAKENSIRIQLIQATNSTKK-DPTC
        LEKTLYMLTERLS+FKEQE+CLEKLE+AVCSVTTLLA+ENSIRIQLIQATNST+K DPTC
Subjt:  LEKTLYMLTERLSIFKEQEDCLEKLEEAVCSVTTLLAKENSIRIQLIQATNSTKK-DPTC

XP_023511750.1 QWRF motif-containing protein 7 [Cucurbita pepo subsp. pepo]7.9e-13776.39Show/hide
Query:  MENTRIRRPKSPALPPPPSPGSLFRSKSRSSAAITLPENNSCAANTSQRLTIHRSKSVTKSRTKTKDEENLNPLNCKTKAGFTKFLKSSPATSPSAWALS
        M+NTR RRP SPAL PPPSP +LFRSKSRSSA+IT PENNSCAANTSQR T HRSKSVTKSR   KDEENLNPLNCKTK GF+KFLKSSPA +PSAWALS
Subjt:  MENTRIRRPKSPALPPPPSPGSLFRSKSRSSAAITLPENNSCAANTSQRLTIHRSKSVTKSRTKTKDEENLNPLNCKTKAGFTKFLKSSPATSPSAWALS

Query:  PGRSLGSPVVLLPLTTVEPAVVDGRKEKLGGQRGGGVSGVLRFFRPKKAAVTVEAEELHRFRILQNRLLQWKYVNVRAETSMANVKTLVQDRIFSVWLHN
        PGRSLGSP +L P  TVEPA VDG + KLGG+RGG V+GVLRFFR KKA  TVEA ELHRFRILQNRLLQW+YVNVRAE SMANVKT+ QDRIFS WL N
Subjt:  PGRSLGSPVVLLPLTTVEPAVVDGRKEKLGGQRGGGVSGVLRFFRPKKAAVTVEAEELHRFRILQNRLLQWKYVNVRAETSMANVKTLVQDRIFSVWLHN

Query:  LRMRNRILEKRIEVEKLRKEIKLYIIIFPQVLLLKQWAKLDKRNQESVGNLASLLSTLSLRIPLIHGAK------------------------------Q
        LRMRN ILEKRIEVEKLRKEIKLY IIFPQV LLKQWAKLDKRNQESVG LAS+LS LSL++PL+HGAK                              +
Subjt:  LRMRNRILEKRIEVEKLRKEIKLYIIIFPQVLLLKQWAKLDKRNQESVGNLASLLSTLSLRIPLIHGAK------------------------------Q

Query:  LEKTLYMLTERLSIFKEQEDCLEKLEEAVCSVTTLLAKENSIRIQLIQATNSTKK-DPTC
        LEKTLYMLTERLS+FKEQE+CLEKLE+AVCSVTTLLA+ENSIRIQLIQATNST+K DPTC
Subjt:  LEKTLYMLTERLSIFKEQEDCLEKLEEAVCSVTTLLAKENSIRIQLIQATNSTKK-DPTC

XP_031738514.1 QWRF motif-containing protein 7 [Cucumis sativus]7.9e-14580.17Show/hide
Query:  MENTRIRRPKSPALPPPPSPGSLFRSKSRSSAAITLPENNSCAANTSQRLTIHRSKSVTKSRTKT-KDEENLNPLNCKTKAGFTKFLKSSPATSPSAWAL
        MENTRIRRPK+PALPPPPSPG    SKSRSS AITLP+NNSCAANTSQR TIHRSKSVTKSR K  KDEENLNPLNCKTKAGFTKFLKSSPATSPSAWAL
Subjt:  MENTRIRRPKSPALPPPPSPGSLFRSKSRSSAAITLPENNSCAANTSQRLTIHRSKSVTKSRTKT-KDEENLNPLNCKTKAGFTKFLKSSPATSPSAWAL

Query:  SPGRSLGSPVVLLPLTTVEPAVVDGRKEKLGGQRGGGVSGVLRFFRPKKAAVTVEAEELHRFRILQNRLLQWKYVNVRAETSMANVKTLVQDRIFSVWLH
        SPGRSLGSP+VL PLT VE A  DGR+ KLG QRGG VSGVLRFF+PKKAA  +EAEELHRFRILQNRLLQWKY NVRAETSMANVKTLVQDRIFSVWLH
Subjt:  SPGRSLGSPVVLLPLTTVEPAVVDGRKEKLGGQRGGGVSGVLRFFRPKKAAVTVEAEELHRFRILQNRLLQWKYVNVRAETSMANVKTLVQDRIFSVWLH

Query:  NLRMRNRILEKRIEVEKLRKEIKLYIIIFPQVLLLKQWAKLDKRNQESVGNLASLLSTLSLRIPLIHGAK------------------------------
        NLRMRNRILEKRIEVEKLRKEIKLY IIFPQV LLKQWAKLDKRNQESVG+LAS+LST SL++PL+HGAK                              
Subjt:  NLRMRNRILEKRIEVEKLRKEIKLYIIIFPQVLLLKQWAKLDKRNQESVGNLASLLSTLSLRIPLIHGAK------------------------------

Query:  QLEKTLYMLTERLSIFKEQEDCLEKLEEAVCSVTTLLAKENSIRIQLIQATNSTKKD---PTC
        QLEKTLY+LTERLSIFKEQE+CLEKLEEAVCSV TLLAKENSIRIQ+IQATNST KD   PTC
Subjt:  QLEKTLYMLTERLSIFKEQEDCLEKLEEAVCSVTTLLAKENSIRIQLIQATNSTKKD---PTC

XP_038888647.1 QWRF motif-containing protein 7 [Benincasa hispida]4.8e-15083.01Show/hide
Query:  MENTRIRRPKSPALPPPPSPGSLFRSKSRSSAAITLPENNSCAANTSQRLTIHRSKSVTKSRTKTKDEENLNPLNCKTKAGFTKFLKSSPATSPSAWALS
        M+NTRIRRPKSPALPPPPSP SL RSKSRSSAAITLPENNSCAANTSQR  IHRSKSVTKSRTK KDEENLNPLNCKTKAGFTK LKSSPATSPSAWALS
Subjt:  MENTRIRRPKSPALPPPPSPGSLFRSKSRSSAAITLPENNSCAANTSQRLTIHRSKSVTKSRTKTKDEENLNPLNCKTKAGFTKFLKSSPATSPSAWALS

Query:  PGRSLGSPVVLLPLTTVEPAVVDGRKEKLGGQRGGGVSGVLRFFRPKK-AAVTVEAEELHRFRILQNRLLQWKYVNVRAETSMANVKTLVQDRIFSVWLH
        PGRSLGSP VL PLT VEPAVVDGR+ KLGGQRGG VSGVLRFFRPKK AA+T EAEELHRFRILQNRLLQWKYVNVRAE SMANVKTLVQD+IFSVWLH
Subjt:  PGRSLGSPVVLLPLTTVEPAVVDGRKEKLGGQRGGGVSGVLRFFRPKK-AAVTVEAEELHRFRILQNRLLQWKYVNVRAETSMANVKTLVQDRIFSVWLH

Query:  NLRMRNRILEKRIEVEKLRKEIKLYIIIFPQVLLLKQWAKLDKRNQESVGNLASLLSTLSLRIPLIHGAK------------------------------
        NLRMRNRILEKRIEVEKLRKEIKLY IIFPQV LLKQWAKLDKRNQES+G+LAS+LSTLSLR+PL+HGAK                              
Subjt:  NLRMRNRILEKRIEVEKLRKEIKLYIIIFPQVLLLKQWAKLDKRNQESVGNLASLLSTLSLRIPLIHGAK------------------------------

Query:  QLEKTLYMLTERLSIFKEQEDCLEKLEEAVCSVTTLLAKENSIRIQLIQATNSTKKDPT
        QLEK LYMLTERLSIFKEQED LE+LEEAVCSVTTL+A ENSIRIQLIQATNS+KKDPT
Subjt:  QLEKTLYMLTERLSIFKEQEDCLEKLEEAVCSVTTLLAKENSIRIQLIQATNSTKKDPT

TrEMBL top hitse value%identityAlignment
A0A0A0LEM6 Uncharacterized protein5.5e-12078.83Show/hide
Query:  MENTRIRRPKSPALPPPPSPGSLFRSKSRSSAAITLPENNSCAANTSQRLTIHRSKSVTKSRTKT-KDEENLNPLNCKTKAGFTKFLKSSPATSPSAWAL
        MENTRIRRPK+PALPPPPSPG    SKSRSS AITLP+NNSCAANTSQR TIHRSKSVTKSR K  KDEENLNPLNCKTKAGFTKFLKSSPATSPSAWAL
Subjt:  MENTRIRRPKSPALPPPPSPGSLFRSKSRSSAAITLPENNSCAANTSQRLTIHRSKSVTKSRTKT-KDEENLNPLNCKTKAGFTKFLKSSPATSPSAWAL

Query:  SPGRSLGSPVVLLPLTTVEPAVVDGRKEKLGGQRGGGVSGVLRFFRPKKAAVTVEAEELHRFRILQNRLLQWKYVNVRAETSMANVKTLVQDRIFSVWLH
        SPGRSLGSP+VL PLT VE A  DGR+ KLG QRGG VSGVLRFF+PKKAA  +EAEELHRFRILQNRLLQWKY NVRAETSMANVKTLVQDRIFSVWLH
Subjt:  SPGRSLGSPVVLLPLTTVEPAVVDGRKEKLGGQRGGGVSGVLRFFRPKKAAVTVEAEELHRFRILQNRLLQWKYVNVRAETSMANVKTLVQDRIFSVWLH

Query:  NLRMRNRILEKRIEVEKLRKEIKLYIIIFPQVLLLKQWAKLDKRNQESVGNLASLLSTLSLRIPLIHGAKQLEKTLYM-LTERLSIFKEQEDCLEKLEEA
        NLRMRNRILEKRIEVEKLRKEIKLY IIFPQV LLKQWAKLDKRNQESVG+LAS+LST SL++PL+HGAK   K     L+  + +  + E  + K    
Subjt:  NLRMRNRILEKRIEVEKLRKEIKLYIIIFPQVLLLKQWAKLDKRNQESVGNLASLLSTLSLRIPLIHGAKQLEKTLYM-LTERLSIFKEQEDCLEKLEEA

Query:  VCSVTTL
        V  +++L
Subjt:  VCSVTTL

A0A1S3CPT6 QWRF motif-containing protein 73.4e-14680.94Show/hide
Query:  MENTRIRRPKSPALPPPPSPGSLFRSKSRSSAAITLPENNSCAANTSQRLTIHRSKSVTKSRTKTKDEENLNPLNCKTKAGFTKFLKSSPATSPSAWALS
        MENTRIRRPKSPALPPPPSPG    SKSRSSAAITLPENNSCAANTSQR TIHRSKSVTKSRTK KDEENLNPLNCKTKAGFTKFLKSSPATSPSAWALS
Subjt:  MENTRIRRPKSPALPPPPSPGSLFRSKSRSSAAITLPENNSCAANTSQRLTIHRSKSVTKSRTKTKDEENLNPLNCKTKAGFTKFLKSSPATSPSAWALS

Query:  PGRSLGSPVVLLPLTTVEPAVVDGRKEKLGGQRGGGVSGVLRFFRPKKAAVTVEAEELHRFRILQNRLLQWKYVNVRAETSMANVKTLVQDRIFSVWLHN
        PGRSLGS  VL PLTTVE AV DGR+ KLG QR   VSGVLRFF+PKKAA  +EAEELHRFRILQNRLLQWKYVNVR ETSMANVKT VQDRIFSVWLHN
Subjt:  PGRSLGSPVVLLPLTTVEPAVVDGRKEKLGGQRGGGVSGVLRFFRPKKAAVTVEAEELHRFRILQNRLLQWKYVNVRAETSMANVKTLVQDRIFSVWLHN

Query:  LRMRNRILEKRIEVEKLRKEIKLYIIIFPQVLLLKQWAKLDKRNQESVGNLASLLSTLSLRIPLIHGAK------------------------------Q
        LRMRN+ILEKRIEVEKLRKEIKLY IIFPQV LLKQWAKLDKRNQESVG+LASLLST SL++PL+HGAK                              Q
Subjt:  LRMRNRILEKRIEVEKLRKEIKLYIIIFPQVLLLKQWAKLDKRNQESVGNLASLLSTLSLRIPLIHGAK------------------------------Q

Query:  LEKTLYMLTERLSIFKEQEDCLEKLEEAVCSVTTLLAKENSIRIQLIQATNSTKKD---PTC
        LEKTLY+LTERLSIFKEQE+CLEKLEEAVCSV TLLAKENSIRIQ+IQATNST KD   PTC
Subjt:  LEKTLYMLTERLSIFKEQEDCLEKLEEAVCSVTTLLAKENSIRIQLIQATNSTKKD---PTC

A0A5A7SQ12 QWRF motif-containing protein 73.9e-8187.89Show/hide
Query:  MENTRIRRPKSPALPPPPSPGSLFRSKSRSSAAITLPENNSCAANTSQRLTIHRSKSVTKSRTKTKDEENLNPLNCKTKAGFTKFLKSSPATSPSAWALS
        MENTRIRRPKSPALPPPPSPG    SKSRSSAAITLPENNSCAANTSQR TIHRSKSVTKSRTK KDEENLNPLNCKTKAGFTKFLKSSPATSPSAWALS
Subjt:  MENTRIRRPKSPALPPPPSPGSLFRSKSRSSAAITLPENNSCAANTSQRLTIHRSKSVTKSRTKTKDEENLNPLNCKTKAGFTKFLKSSPATSPSAWALS

Query:  PGRSLGSPVVLLPLTTVEPAVVDGRKEKLGGQRGGGVSGVLRFFRPKKAAVTVEAEELHRFRILQNRLLQWKYVNVRAETSMANVKTLVQ
        PGRSLGS  VL PLTTVE AV DGR+ KLG QR   VSGVLRFF+PKKAA  +EAEELHRFRILQNRLLQWKYVNVR ETSMANVKT VQ
Subjt:  PGRSLGSPVVLLPLTTVEPAVVDGRKEKLGGQRGGGVSGVLRFFRPKKAAVTVEAEELHRFRILQNRLLQWKYVNVRAETSMANVKTLVQ

A0A6J1GKM1 QWRF motif-containing protein 71.0e-13776.67Show/hide
Query:  MENTRIRRPKSPALPPPPSPGSLFRSKSRSSAAITLPENNSCAANTSQRLTIHRSKSVTKSRTKTKDEENLNPLNCKTKAGFTKFLKSSPATSPSAWALS
        M+NTR RRP SPAL PPPSP +L RSKSRSSA+IT PENNSCAANTSQR T HRSKSVTKSR   KDEENLNPLNCKTK GF+KFLKSSPAT+PSAWALS
Subjt:  MENTRIRRPKSPALPPPPSPGSLFRSKSRSSAAITLPENNSCAANTSQRLTIHRSKSVTKSRTKTKDEENLNPLNCKTKAGFTKFLKSSPATSPSAWALS

Query:  PGRSLGSPVVLLPLTTVEPAVVDGRKEKLGGQRGGGVSGVLRFFRPKKAAVTVEAEELHRFRILQNRLLQWKYVNVRAETSMANVKTLVQDRIFSVWLHN
        PGRSLGSP +L P  TVEPAVVDG + KLGG+RGG V+GVLRFFR KKA  TVEA ELHRFRILQNRLLQW+YVNVRAE SMANVKT+ QDRIFS WL N
Subjt:  PGRSLGSPVVLLPLTTVEPAVVDGRKEKLGGQRGGGVSGVLRFFRPKKAAVTVEAEELHRFRILQNRLLQWKYVNVRAETSMANVKTLVQDRIFSVWLHN

Query:  LRMRNRILEKRIEVEKLRKEIKLYIIIFPQVLLLKQWAKLDKRNQESVGNLASLLSTLSLRIPLIHGAK------------------------------Q
        LRMRN ILEKRIEVEKLRKEIKLY IIFPQV LLKQWAKLDKRNQESVG LAS+LS LSL++PL+HGAK                              +
Subjt:  LRMRNRILEKRIEVEKLRKEIKLYIIIFPQVLLLKQWAKLDKRNQESVGNLASLLSTLSLRIPLIHGAK------------------------------Q

Query:  LEKTLYMLTERLSIFKEQEDCLEKLEEAVCSVTTLLAKENSIRIQLIQATNSTKK-DPTC
        LEKTLYMLTERLS+FKEQE+CLEKLE+AVCSVTTLLA+ENSIRIQLIQATNST+K DPTC
Subjt:  LEKTLYMLTERLSIFKEQEDCLEKLEEAVCSVTTLLAKENSIRIQLIQATNSTKK-DPTC

A0A6J1I736 QWRF motif-containing protein 78.8e-13475.83Show/hide
Query:  MENTRIRRPKSPALPPPPSPGSLFRSKSRSSAAITLPENNSCAANTSQRLTIHRSKSVTKSRTKTKDEENLNPLNCKTKAGFTKFLKSSPATSPSAWALS
        M+NTR RRP SPAL PPPSP +L RSKSRSSA+IT PENNSCAANTSQR T HRSKSVTKSR   KDEENLNP+NCKTK GF KFLKSSPA SPSAWALS
Subjt:  MENTRIRRPKSPALPPPPSPGSLFRSKSRSSAAITLPENNSCAANTSQRLTIHRSKSVTKSRTKTKDEENLNPLNCKTKAGFTKFLKSSPATSPSAWALS

Query:  PGRSLGSPVVLLPLTTVEPAVVDGRKEKLGGQRGGGVSGVLRFFRPKKAAVTVEAEELHRFRILQNRLLQWKYVNVRAETSMANVKTLVQDRIFSVWLHN
        PGRSLGSP +L P  TVEPAVV G + KLGG+RG  V+GVLRFFR KKA  TVEA ELHRFRILQNRLLQW+YVNV+AE SMANVKT+ QDRIFSV L N
Subjt:  PGRSLGSPVVLLPLTTVEPAVVDGRKEKLGGQRGGGVSGVLRFFRPKKAAVTVEAEELHRFRILQNRLLQWKYVNVRAETSMANVKTLVQDRIFSVWLHN

Query:  LRMRNRILEKRIEVEKLRKEIKLYIIIFPQVLLLKQWAKLDKRNQESVGNLASLLSTLSLRIPLIHGAK------------------------------Q
        +RMRN ILEKRIEVEKLRKEIKLY II PQV LLKQWAKLDKRNQESVG LAS+LS LSLR+PL+HGAK                              +
Subjt:  LRMRNRILEKRIEVEKLRKEIKLYIIIFPQVLLLKQWAKLDKRNQESVGNLASLLSTLSLRIPLIHGAK------------------------------Q

Query:  LEKTLYMLTERLSIFKEQEDCLEKLEEAVCSVTTLLAKENSIRIQLIQATNSTKK-DPTC
        LEKTLYMLTERLSIFKEQEDCLEKLE+AVCSVTTLLA+ENSIRIQLIQATNST+K DPTC
Subjt:  LEKTLYMLTERLSIFKEQEDCLEKLEEAVCSVTTLLAKENSIRIQLIQATNSTKK-DPTC

SwissProt top hitse value%identityAlignment
F4INP9 QWRF motif-containing protein 41.7e-0922.85Show/hide
Query:  PALPPPPSPGSLFRSKSRSSAAITLPENNSCAANTSQRLTIHRSKSVTKSRTKTKDEENLNPLNCKTKAGFTKFLKSSPATSPSAWALSPGRSLGSPVVL
        P L P  +PGS   S SRSS + +   N               S+ ++ SR        ++P+   +  G    ++SS   +P +  +SP R        
Subjt:  PALPPPPSPGSLFRSKSRSSAAITLPENNSCAANTSQRLTIHRSKSVTKSRTKTKDEENLNPLNCKTKAGFTKFLKSSPATSPSAWALSPGRSLGSPVVL

Query:  LPLTTVEPAVVDGRKEKLGGQRGGGVSGVLRFFRP-KKAAVTVEAEELHRFRILQNRLLQWKYVNVRAETSMANVKTLVQDRIFSVWLHNLRMRNRILEK
                        +   Q     + VL F    KK       E++H+ R+L NR  QW++ N RAE        + ++ +++VW     +R+ +  +
Subjt:  LPLTTVEPAVVDGRKEKLGGQRGGGVSGVLRFFRP-KKAAVTVEAEELHRFRILQNRLLQWKYVNVRAETSMANVKTLVQDRIFSVWLHNLRMRNRILEK

Query:  RIEVEKLRKEIKLYIIIFPQVLLLKQWAKLDKRNQESVGNLASLLSTLSLRIPLIHGAK-----------------------------QLEKTLYMLTER
        RI +++L+ EIKL  I+  Q++ L+ WA +++ +  S+      L   +LR+PL  G K                             Q+E+   ++++ 
Subjt:  RIEVEKLRKEIKLYIIIFPQVLLLKQWAKLDKRNQESVGNLASLLSTLSLRIPLIHGAK-----------------------------QLEKTLYMLTER

Query:  LSIFKEQEDCLEKLEEAVCSVTTLLAKENSIRIQLIQ
          I K +   L+K E  + S   +  +E S++  LIQ
Subjt:  LSIFKEQEDCLEKLEEAVCSVTTLLAKENSIRIQLIQ

Q1PE51 QWRF motif-containing protein 72.4e-4332.99Show/hide
Query:  PPSPGSLFRSKSRS-SAAITLP-------------ENNSCAANTSQRLTIHRSKSVTKS-----RTKTKDEENLNPL-----------NCKTKAGFTKFL
        PPSP +   ++SR+ S++I+LP              ++S  +N+S+R+ I RS+S T+S      + +K  EN+ P            N +++  F ++L
Subjt:  PPSPGSLFRSKSRS-SAAITLP-------------ENNSCAANTSQRLTIHRSKSVTKS-----RTKTKDEENLNPL-----------NCKTKAGFTKFL

Query:  ----KSSP------------ATSPSAWALSPGR--------SLGSPVVLLPLTTVEPAVVDGRKEKLGGQRGGGVSGVLRFFRPKKAAVTVEAEELHRFR
            + SP            A+SPSAWALSPGR        S  +P   + +T  E  V    K K+    GG V+GVL++F  +K    V+ E+ HRFR
Subjt:  ----KSSP------------ATSPSAWALSPGR--------SLGSPVVLLPLTTVEPAVVDGRKEKLGGQRGGGVSGVLRFFRPKKAAVTVEAEELHRFR

Query:  ILQNRLLQWKYVNVRAETSMANVKTLVQDRIFSVWLHNLRMRNRILEKRIEVEKLRKEIKLYIIIFPQVLLLKQWAKLDKRNQESVGNLASLLSTLSLRI
        I QNRLLQW++VN R E +MAN+K  V+D++F VWL   +MRN ++E  IE+++LR++IK+  ++  Q+ LL +W+K+D +N E++  L   L  LS+R+
Subjt:  ILQNRLLQWKYVNVRAETSMANVKTLVQDRIFSVWLHNLRMRNRILEKRIEVEKLRKEIKLYIIIFPQVLLLKQWAKLDKRNQESVGNLASLLSTLSLRI

Query:  PLIHGA-----------------------------KQLEKTLYMLTERLSIFKEQEDCLEKLEEAVCSVTTLLAKENSIRIQLIQATNSTK
        PL+HGA                              ++E  LY LTE + +F ++    E+++E++ S+    AKE+S+R+ ++Q T   +
Subjt:  PLIHGA-----------------------------KQLEKTLYMLTERLSIFKEQEDCLEKLEEAVCSVTTLLAKENSIRIQLIQATNSTK

Q8S8I1 QWRF motif-containing protein 34.2e-1628.15Show/hide
Query:  LTIHRSKSVTKSRTKTKDEENLNPLNCKTKAGFTKFLK-----SSPATSPSAWALSPGRSLGSPVVLLPLTTVEPAVVDGRKEKLGGQRGGGVSGVLR--
        LT   SK   K++ +++D+         +  G    L+     S   +S S WALSPGRSL +  V +P + ++P             RG GV  ++   
Subjt:  LTIHRSKSVTKSRTKTKDEENLNPLNCKTKAGFTKFLK-----SSPATSPSAWALSPGRSLGSPVVLLPLTTVEPAVVDGRKEKLGGQRGGGVSGVLR--

Query:  --FFRPKKAA---------VTVEAEELHRFRILQNRLLQWKYVNVRAETSMANVKTLVQDRIFSVWLHNLRMRNRILEKRIEVEKLRKEIKLYIIIFPQV
          FFR K  +          T + E  H+ +++ NRLLQW++VN RA     NV +  ++++   W   +++ N +L++RI+++K   E+KL  +   QV
Subjt:  --FFRPKKAA---------VTVEAEELHRFRILQNRLLQWKYVNVRAETSMANVKTLVQDRIFSVWLHNLRMRNRILEKRIEVEKLRKEIKLYIIIFPQV

Query:  LLLKQWAKLDKRNQESVGNLASLLSTLSLRIPLIHGAK
          L+ W  ++ ++  S+  +   L ++  R+PL  GAK
Subjt:  LLLKQWAKLDKRNQESVGNLASLLSTLSLRIPLIHGAK

Q94AI1 QWRF motif-containing protein 29.1e-1123.81Show/hide
Query:  SKSRSSAAITLPENNSCAANTSQRLTIHRSKSVTKSRTKTKDEENLNPLNCKTKAGFTKFLKSS--PATSPSAWALSPGRS----LGSPVVLLPLTTVEP
        SKS+S     LP N   +A   Q  T  R + +    +       L   +  +K G +K   S   P +SP   A SP R       SP  L   TT  P
Subjt:  SKSRSSAAITLPENNSCAANTSQRLTIHRSKSVTKSRTKTKDEENLNPLNCKTKAGFTKFLKSS--PATSPSAWALSPGRS----LGSPVVLLPLTTVEP

Query:  A--------VVDGRKEKLGGQRGGGVSGVLRFFRPKKAAVTVE--AEELHRFRILQNRLLQWKYVNVRAETSMANVKTLVQDRIFSVWLHNLRMRNRILE
        A          +G  +++          +L F    +     E    + H  R+L NR LQW++VN RA++++   +   +  +++ W+    +R+ +  
Subjt:  A--------VVDGRKEKLGGQRGGGVSGVLRFFRPKKAAVTVE--AEELHRFRILQNRLLQWKYVNVRAETSMANVKTLVQDRIFSVWLHNLRMRNRILE

Query:  KRIEVEKLRKEIKLYIIIFPQVLLLKQWAKLDKRNQESVGNLASLLSTLSLRIPLIHGAKQLEKTLYMLTERLSIFKEQEDCLEKLEEAVCSVTTLLAKE
        KRI++  LR+++KL  I+  Q+  L++W+ LD+ +  S+      L   +LR+P++       KT+  + +         D ++ +  ++ S+T+ + + 
Subjt:  KRIEVEKLRKEIKLYIIIFPQVLLLKQWAKLDKRNQESVGNLASLLSTLSLRIPLIHGAKQLEKTLYMLTERLSIFKEQEDCLEKLEEAVCSVTTLLAKE

Query:  NSIRIQLIQATNSTK
        NS+ ++ +  T   K
Subjt:  NSIRIQLIQATNSTK

Q9SUH5 AUGMIN subunit 81.2e-1023.92Show/hide
Query:  LPPPPSPGSLFRSKSR----SSAAITLPENNSCAANTSQRLTIHRSKSVTKSRTKTKDEENLNPLNCKTKAGFTKFLKSSPATSPSAWALSPGRSLGSPV
        L P P+PGS   S SR    SS++I+   + S   + S+ L+  R  S T+  + ++    L+P      + F +     P+T PS   +SP R      
Subjt:  LPPPPSPGSLFRSKSR----SSAAITLPENNSCAANTSQRLTIHRSKSVTKSRTKTKDEENLNPLNCKTKAGFTKFLKSSPATSPSAWALSPGRSLGSPV

Query:  VLLPLTTVEPAVVDGRKEKLGGQRGGGVSGVLRFFRP-KKAAVTVEAEELHRFRILQNRLLQWKYVNVRAETSMANVKTLVQDRIFSVWLHNLRMRNRIL
                        ++     +    + VL F    KK       E++H+ R+L NR LQW++   RAE+ M   +   ++ +F+VW     +++ + 
Subjt:  VLLPLTTVEPAVVDGRKEKLGGQRGGGVSGVLRFFRP-KKAAVTVEAEELHRFRILQNRLLQWKYVNVRAETSMANVKTLVQDRIFSVWLHNLRMRNRIL

Query:  EKRIEVEKLRKEIKLYIIIFPQVLLLKQWAKLDKRNQESVGNLASLLSTLSLRIPLIHGAK-----------------------------QLEKTLYMLT
         +RI +++L+ EIKL  ++  Q++ L+ WA L++ +  S+    S L   +LR+P   G K                             ++E+   M+T
Subjt:  EKRIEVEKLRKEIKLYIIIFPQVLLLKQWAKLDKRNQESVGNLASLLSTLSLRIPLIHGAK-----------------------------QLEKTLYMLT

Query:  ERLSIFKEQEDCLEKLEEAVCSVTTLLAKENSIRIQLIQATNSTKKD
        E   +  ++     K E+ + S   +  +E S+R  LIQ      +D
Subjt:  ERLSIFKEQEDCLEKLEEAVCSVTTLLAKENSIRIQLIQATNSTKKD

Arabidopsis top hitse value%identityAlignment
AT2G20815.1 Family of unknown function (DUF566)3.0e-1728.15Show/hide
Query:  LTIHRSKSVTKSRTKTKDEENLNPLNCKTKAGFTKFLK-----SSPATSPSAWALSPGRSLGSPVVLLPLTTVEPAVVDGRKEKLGGQRGGGVSGVLR--
        LT   SK   K++ +++D+         +  G    L+     S   +S S WALSPGRSL +  V +P + ++P             RG GV  ++   
Subjt:  LTIHRSKSVTKSRTKTKDEENLNPLNCKTKAGFTKFLK-----SSPATSPSAWALSPGRSLGSPVVLLPLTTVEPAVVDGRKEKLGGQRGGGVSGVLR--

Query:  --FFRPKKAA---------VTVEAEELHRFRILQNRLLQWKYVNVRAETSMANVKTLVQDRIFSVWLHNLRMRNRILEKRIEVEKLRKEIKLYIIIFPQV
          FFR K  +          T + E  H+ +++ NRLLQW++VN RA     NV +  ++++   W   +++ N +L++RI+++K   E+KL  +   QV
Subjt:  --FFRPKKAA---------VTVEAEELHRFRILQNRLLQWKYVNVRAETSMANVKTLVQDRIFSVWLHNLRMRNRILEKRIEVEKLRKEIKLYIIIFPQV

Query:  LLLKQWAKLDKRNQESVGNLASLLSTLSLRIPLIHGAK
          L+ W  ++ ++  S+  +   L ++  R+PL  GAK
Subjt:  LLLKQWAKLDKRNQESVGNLASLLSTLSLRIPLIHGAK

AT2G20815.2 Family of unknown function (DUF566)3.0e-1728.15Show/hide
Query:  LTIHRSKSVTKSRTKTKDEENLNPLNCKTKAGFTKFLK-----SSPATSPSAWALSPGRSLGSPVVLLPLTTVEPAVVDGRKEKLGGQRGGGVSGVLR--
        LT   SK   K++ +++D+         +  G    L+     S   +S S WALSPGRSL +  V +P + ++P             RG GV  ++   
Subjt:  LTIHRSKSVTKSRTKTKDEENLNPLNCKTKAGFTKFLK-----SSPATSPSAWALSPGRSLGSPVVLLPLTTVEPAVVDGRKEKLGGQRGGGVSGVLR--

Query:  --FFRPKKAA---------VTVEAEELHRFRILQNRLLQWKYVNVRAETSMANVKTLVQDRIFSVWLHNLRMRNRILEKRIEVEKLRKEIKLYIIIFPQV
          FFR K  +          T + E  H+ +++ NRLLQW++VN RA     NV +  ++++   W   +++ N +L++RI+++K   E+KL  +   QV
Subjt:  --FFRPKKAA---------VTVEAEELHRFRILQNRLLQWKYVNVRAETSMANVKTLVQDRIFSVWLHNLRMRNRILEKRIEVEKLRKEIKLYIIIFPQV

Query:  LLLKQWAKLDKRNQESVGNLASLLSTLSLRIPLIHGAK
          L+ W  ++ ++  S+  +   L ++  R+PL  GAK
Subjt:  LLLKQWAKLDKRNQESVGNLASLLSTLSLRIPLIHGAK

AT4G25190.1 Family of unknown function (DUF566)1.7e-4432.99Show/hide
Query:  PPSPGSLFRSKSRS-SAAITLP-------------ENNSCAANTSQRLTIHRSKSVTKS-----RTKTKDEENLNPL-----------NCKTKAGFTKFL
        PPSP +   ++SR+ S++I+LP              ++S  +N+S+R+ I RS+S T+S      + +K  EN+ P            N +++  F ++L
Subjt:  PPSPGSLFRSKSRS-SAAITLP-------------ENNSCAANTSQRLTIHRSKSVTKS-----RTKTKDEENLNPL-----------NCKTKAGFTKFL

Query:  ----KSSP------------ATSPSAWALSPGR--------SLGSPVVLLPLTTVEPAVVDGRKEKLGGQRGGGVSGVLRFFRPKKAAVTVEAEELHRFR
            + SP            A+SPSAWALSPGR        S  +P   + +T  E  V    K K+    GG V+GVL++F  +K    V+ E+ HRFR
Subjt:  ----KSSP------------ATSPSAWALSPGR--------SLGSPVVLLPLTTVEPAVVDGRKEKLGGQRGGGVSGVLRFFRPKKAAVTVEAEELHRFR

Query:  ILQNRLLQWKYVNVRAETSMANVKTLVQDRIFSVWLHNLRMRNRILEKRIEVEKLRKEIKLYIIIFPQVLLLKQWAKLDKRNQESVGNLASLLSTLSLRI
        I QNRLLQW++VN R E +MAN+K  V+D++F VWL   +MRN ++E  IE+++LR++IK+  ++  Q+ LL +W+K+D +N E++  L   L  LS+R+
Subjt:  ILQNRLLQWKYVNVRAETSMANVKTLVQDRIFSVWLHNLRMRNRILEKRIEVEKLRKEIKLYIIIFPQVLLLKQWAKLDKRNQESVGNLASLLSTLSLRI

Query:  PLIHGA-----------------------------KQLEKTLYMLTERLSIFKEQEDCLEKLEEAVCSVTTLLAKENSIRIQLIQATNSTK
        PL+HGA                              ++E  LY LTE + +F ++    E+++E++ S+    AKE+S+R+ ++Q T   +
Subjt:  PLIHGA-----------------------------KQLEKTLYMLTERLSIFKEQEDCLEKLEEAVCSVTTLLAKENSIRIQLIQATNSTK

AT4G25190.2 Family of unknown function (DUF566)3.4e-4533.16Show/hide
Query:  PPSPGSLFRSKSRS-SAAITLP-------------ENNSCAANTSQRLTIHRSKSVTKS-----RTKTKDEENLNPL-----------NCKTKAGFTKFL
        PPSP +   ++SR+ S++I+LP              ++S  +N+S+R+ I RS+S T+S      + +K  EN+ P            N +++  F ++L
Subjt:  PPSPGSLFRSKSRS-SAAITLP-------------ENNSCAANTSQRLTIHRSKSVTKS-----RTKTKDEENLNPL-----------NCKTKAGFTKFL

Query:  ----KSSP------------ATSPSAWALSPGR--------SLGSPVVLLPLTTVEPAVVDGRKEKLGGQRGGGVSGVLRFFRPKKAAVTVEAEELHRFR
            + SP            A+SPSAWALSPGR        S  +P   + +T  E  V    K K+    GG V+GVL++F  +K    V+ E+ HRFR
Subjt:  ----KSSP------------ATSPSAWALSPGR--------SLGSPVVLLPLTTVEPAVVDGRKEKLGGQRGGGVSGVLRFFRPKKAAVTVEAEELHRFR

Query:  ILQNRLLQWKYVNVRAETSMANVKTLVQDRIFSVWLHNLRMRNRILEKRIEVEKLRKEIKLYIIIFPQVLLLKQWAKLDKRNQESVGNLASLLSTLSLRI
        I QNRLLQW++VN R E +MAN+K  V+D++F VWL   +MRN ++E  IE+++LR++IK+  ++  Q+ LL +W+K+D +N E++  L   L  LS+R+
Subjt:  ILQNRLLQWKYVNVRAETSMANVKTLVQDRIFSVWLHNLRMRNRILEKRIEVEKLRKEIKLYIIIFPQVLLLKQWAKLDKRNQESVGNLASLLSTLSLRI

Query:  PLIHGA------------------------------KQLEKTLYMLTERLSIFKEQEDCLEKLEEAVCSVTTLLAKENSIRIQLIQATNSTK
        PL+HGA                              +Q+E  LY LTE + +F ++    E+++E++ S+    AKE+S+R+ ++Q T   +
Subjt:  PLIHGA------------------------------KQLEKTLYMLTERLSIFKEQEDCLEKLEEAVCSVTTLLAKENSIRIQLIQATNSTK

AT4G30710.1 Family of unknown function (DUF566)8.4e-1223.92Show/hide
Query:  LPPPPSPGSLFRSKSR----SSAAITLPENNSCAANTSQRLTIHRSKSVTKSRTKTKDEENLNPLNCKTKAGFTKFLKSSPATSPSAWALSPGRSLGSPV
        L P P+PGS   S SR    SS++I+   + S   + S+ L+  R  S T+  + ++    L+P      + F +     P+T PS   +SP R      
Subjt:  LPPPPSPGSLFRSKSR----SSAAITLPENNSCAANTSQRLTIHRSKSVTKSRTKTKDEENLNPLNCKTKAGFTKFLKSSPATSPSAWALSPGRSLGSPV

Query:  VLLPLTTVEPAVVDGRKEKLGGQRGGGVSGVLRFFRP-KKAAVTVEAEELHRFRILQNRLLQWKYVNVRAETSMANVKTLVQDRIFSVWLHNLRMRNRIL
                        ++     +    + VL F    KK       E++H+ R+L NR LQW++   RAE+ M   +   ++ +F+VW     +++ + 
Subjt:  VLLPLTTVEPAVVDGRKEKLGGQRGGGVSGVLRFFRP-KKAAVTVEAEELHRFRILQNRLLQWKYVNVRAETSMANVKTLVQDRIFSVWLHNLRMRNRIL

Query:  EKRIEVEKLRKEIKLYIIIFPQVLLLKQWAKLDKRNQESVGNLASLLSTLSLRIPLIHGAK-----------------------------QLEKTLYMLT
         +RI +++L+ EIKL  ++  Q++ L+ WA L++ +  S+    S L   +LR+P   G K                             ++E+   M+T
Subjt:  EKRIEVEKLRKEIKLYIIIFPQVLLLKQWAKLDKRNQESVGNLASLLSTLSLRIPLIHGAK-----------------------------QLEKTLYMLT

Query:  ERLSIFKEQEDCLEKLEEAVCSVTTLLAKENSIRIQLIQATNSTKKD
        E   +  ++     K E+ + S   +  +E S+R  LIQ      +D
Subjt:  ERLSIFKEQEDCLEKLEEAVCSVTTLLAKENSIRIQLIQATNSTKKD


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAGAACACACGCATACGCCGTCCAAAATCCCCTGCCCTTCCGCCGCCACCATCCCCCGGCAGCCTCTTCCGAAGCAAAAGCCGCTCCTCCGCCGCCATAACC
TTACCCGAGAACAACTCATGCGCCGCAAACACAAGCCAACGCTTAACCATCCATCGATCAAAATCGGTGACAAAGTCAAGAACTAAGACCAAGGATGAAGAGAAT
TTAAACCCCTTGAATTGTAAAACCAAGGCAGGGTTTACCAAGTTCTTGAAGTCGTCCCCGGCAACTTCCCCGTCTGCGTGGGCACTGTCACCGGGCCGGTCCTTG
GGCTCTCCCGTTGTGCTGTTGCCGTTGACGACGGTTGAGCCTGCGGTGGTGGATGGACGGAAAGAGAAACTAGGTGGTCAACGAGGTGGTGGTGTGAGCGGAGTT
TTGAGGTTTTTTAGACCGAAGAAGGCGGCAGTGACGGTGGAAGCGGAGGAGCTTCATCGGTTTAGGATTTTGCAGAATAGGTTGTTGCAATGGAAGTATGTTAAC
GTTAGAGCTGAGACTTCCATGGCTAACGTTAAAACACTTGTTCAGGACAGAATATTCAGTGTGTGGCTTCACAACTTGAGAATGAGAAATCGCATCTTAGAAAAA
CGAATTGAAGTTGAAAAATTGAGAAAGGAGATCAAATTATACATAATAATCTTCCCTCAAGTACTACTCCTCAAGCAATGGGCTAAGTTAGACAAAAGAAACCAA
GAATCAGTTGGCAACTTAGCCTCTCTTCTTTCAACACTCTCACTCAGAATCCCTCTAATCCATGGAGCCAAGCAACTGGAGAAAACGTTGTACATGCTAACGGAG
CGGCTAAGCATATTTAAAGAACAGGAAGACTGCTTGGAGAAATTGGAGGAGGCAGTATGTTCAGTCACCACTTTACTGGCGAAGGAGAATAGTATTAGAATACAA
CTCATACAAGCAACAAATTCTACCAAGAAGGATCCTACTTGTTAA
mRNA sequenceShow/hide mRNA sequence
ATGGAGAACACACGCATACGCCGTCCAAAATCCCCTGCCCTTCCGCCGCCACCATCCCCCGGCAGCCTCTTCCGAAGCAAAAGCCGCTCCTCCGCCGCCATAACC
TTACCCGAGAACAACTCATGCGCCGCAAACACAAGCCAACGCTTAACCATCCATCGATCAAAATCGGTGACAAAGTCAAGAACTAAGACCAAGGATGAAGAGAAT
TTAAACCCCTTGAATTGTAAAACCAAGGCAGGGTTTACCAAGTTCTTGAAGTCGTCCCCGGCAACTTCCCCGTCTGCGTGGGCACTGTCACCGGGCCGGTCCTTG
GGCTCTCCCGTTGTGCTGTTGCCGTTGACGACGGTTGAGCCTGCGGTGGTGGATGGACGGAAAGAGAAACTAGGTGGTCAACGAGGTGGTGGTGTGAGCGGAGTT
TTGAGGTTTTTTAGACCGAAGAAGGCGGCAGTGACGGTGGAAGCGGAGGAGCTTCATCGGTTTAGGATTTTGCAGAATAGGTTGTTGCAATGGAAGTATGTTAAC
GTTAGAGCTGAGACTTCCATGGCTAACGTTAAAACACTTGTTCAGGACAGAATATTCAGTGTGTGGCTTCACAACTTGAGAATGAGAAATCGCATCTTAGAAAAA
CGAATTGAAGTTGAAAAATTGAGAAAGGAGATCAAATTATACATAATAATCTTCCCTCAAGTACTACTCCTCAAGCAATGGGCTAAGTTAGACAAAAGAAACCAA
GAATCAGTTGGCAACTTAGCCTCTCTTCTTTCAACACTCTCACTCAGAATCCCTCTAATCCATGGAGCCAAGCAACTGGAGAAAACGTTGTACATGCTAACGGAG
CGGCTAAGCATATTTAAAGAACAGGAAGACTGCTTGGAGAAATTGGAGGAGGCAGTATGTTCAGTCACCACTTTACTGGCGAAGGAGAATAGTATTAGAATACAA
CTCATACAAGCAACAAATTCTACCAAGAAGGATCCTACTTGTTAA
Protein sequenceShow/hide protein sequence
MENTRIRRPKSPALPPPPSPGSLFRSKSRSSAAITLPENNSCAANTSQRLTIHRSKSVTKSRTKTKDEENLNPLNCKTKAGFTKFLKSSPATSPSAWALSPGRSL
GSPVVLLPLTTVEPAVVDGRKEKLGGQRGGGVSGVLRFFRPKKAAVTVEAEELHRFRILQNRLLQWKYVNVRAETSMANVKTLVQDRIFSVWLHNLRMRNRILEK
RIEVEKLRKEIKLYIIIFPQVLLLKQWAKLDKRNQESVGNLASLLSTLSLRIPLIHGAKQLEKTLYMLTERLSIFKEQEDCLEKLEEAVCSVTTLLAKENSIRIQ
LIQATNSTKKDPTC