; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmoCh16G006070 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene IDCmoCh16G006070
OrganismCucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
Descriptionhigh mobility group B protein 6-like
Genome locationCmo_Chr16:2949630..2968808
RNA-Seq ExpressionCmoCh16G006070
SyntenyCmoCh16G006070
Gene Ontology termsGO:0006810 - transport (biological process)
GO:0009987 - cellular process (biological process)
GO:1900150 - regulation of defense response to fungus (biological process)
GO:0016020 - membrane (cellular component)
GO:0003779 - actin binding (molecular function)
InterPro domainsIPR009071 - High mobility group box domain
IPR011684 - Protein Networked (NET), actin-binding (NAB) domain
IPR013057 - Amino acid transporter, transmembrane domain
IPR021480 - Probable zinc-ribbon domain, plant
IPR036910 - High mobility group box domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7015153.1 Extra-large guanine nucleotide-binding protein 1, partial [Cucurbita argyrosperma subsp. argyrosperma]0.0e+0098.24Show/hide
Query:  EWDCPSMEPSVKERVLRSRRTVFSSSSIRTNDREDIDDYERRIGQDTKGVWSMQSLGDKEVGLVEETQRIENWIRRNNIEHDMDIYVNPMGAARTRAAFE
        +WDCPSMEPSVKERVLRSRRTVFSSSSIRTNDREDIDDYERRIGQDTKGVWSMQSLGDKEVGLVEETQRIENWIRRNNIEHDMDIYVNPMGAARTRAAFE
Subjt:  EWDCPSMEPSVKERVLRSRRTVFSSSSIRTNDREDIDDYERRIGQDTKGVWSMQSLGDKEVGLVEETQRIENWIRRNNIEHDMDIYVNPMGAARTRAAFE

Query:  HQRIERDAFTGYSGNSLAIADRIGVPNFHYPSDRPSSSNVDRLYGHPESNQDYERPLDGLDPNRAELLRRLDELKDQIIKSCDVGDRPKVVERAAVDPYY
        HQRIERDAFTGYSGNSLAIADRIGVPNFHYPSDRPSSSNVDRLYGHPESNQDYERPLDGLDPNRAELLRRLDELKDQIIKSCDVGDRPKVVERA+VDPYY
Subjt:  HQRIERDAFTGYSGNSLAIADRIGVPNFHYPSDRPSSSNVDRLYGHPESNQDYERPLDGLDPNRAELLRRLDELKDQIIKSCDVGDRPKVVERAAVDPYY

Query:  GRATYNVPMQSSTRSPPHIYEPHYVDRGNGTFPAMGQHQRNGEDLLHPPRHVVKDIPLYEDRFQEQMKRKTNYPPRLPHEHYPESFMDLRAPNSPISNAS
        GRATYNVPMQSSTRSPPHIYEPHYVDRGNGTFPA GQHQRNGED LHPPRHVVKDIPLYEDRFQEQMKRKTNYPPRLPHEHYPESFMDLRAPNSPISNAS
Subjt:  GRATYNVPMQSSTRSPPHIYEPHYVDRGNGTFPAMGQHQRNGEDLLHPPRHVVKDIPLYEDRFQEQMKRKTNYPPRLPHEHYPESFMDLRAPNSPISNAS

Query:  NPKESIKSSTYRNENPVTVGLTASNLQRAGRFPSQDALPHSRQLSELDSEIDGFSPVRPRTSVVLRRNGKSRDAIAGGAPFIVCNSCLELLKLPRKLYKL
        NPKESIKSSTYRNENPVTVGLTASNLQRAGRFPSQDALPHSRQLSELDSEIDGFSPVRPRTSVVLRRNGKSRDAIAGGAPFIVCNSCLELLKLPRKLYKL
Subjt:  NPKESIKSSTYRNENPVTVGLTASNLQRAGRFPSQDALPHSRQLSELDSEIDGFSPVRPRTSVVLRRNGKSRDAIAGGAPFIVCNSCLELLKLPRKLYKL

Query:  EMDWQKLQCGACSVVIIVKVENRRLVVSVPAEAKPKEVSPDEGSPKRVVNATSSLESSDNSSHKSISTDHNKPSDSRDSNLGESKTQELTSSLVPSTEKE
        EMDWQKLQCGACSVVIIVKVENRRLVVSVPAEAKPKEVSPD+GSPKRVVNATSSLESSDNSSHKSI TDHNKPSDSRDSNLGESKTQELTSSL+ STEKE
Subjt:  EMDWQKLQCGACSVVIIVKVENRRLVVSVPAEAKPKEVSPDEGSPKRVVNATSSLESSDNSSHKSISTDHNKPSDSRDSNLGESKTQELTSSLVPSTEKE

Query:  TLPTKDAPSLTNSDNPSYDEPSKYREESENNQDTVINDVTEPSELDVSFEDYSNIHISQDSMEISKEEEEENQSKIKSNEESETFFVDLSKNNLRDFSRS
        TLPTKDAPSLTNSDNPSYDEPSKYREESENNQDTVINDVTEPSELDVSFEDYSNIHISQDSMEISKEEEEENQSKIKSNEESETFFVDLSKNNLRDFSRS
Subjt:  TLPTKDAPSLTNSDNPSYDEPSKYREESENNQDTVINDVTEPSELDVSFEDYSNIHISQDSMEISKEEEEENQSKIKSNEESETFFVDLSKNNLRDFSRS

Query:  SEITDNGRPTVSVNGQPLPAHVVKKAEKLAGPILPGDYWYDYQAGFWGVMGHPCLGIIPPFIDEFTYPMSRNCAGGNTGVFVNGRELHKRDLELLSSRGL
        SEITDNGRPTVSVNGQPLPAHVVKKAEKLAGPILPGDYWYDYQAGFWGVMGHPCLGIIPPFIDEFTYPMSRNCAGGNTGVFVNGRELHKRDLELLSSRGL
Subjt:  SEITDNGRPTVSVNGQPLPAHVVKKAEKLAGPILPGDYWYDYQAGFWGVMGHPCLGIIPPFIDEFTYPMSRNCAGGNTGVFVNGRELHKRDLELLSSRGL

Query:  PTTTNKLYRIDISGRVVDDDSGKELYNLGKLAPTVFTIR
        PTTTNKLYRIDISGRVVDDDSGKELYNLGKLAPT+  ++
Subjt:  PTTTNKLYRIDISGRVVDDDSGKELYNLGKLAPTVFTIR

KAG7015154.1 hypothetical protein SDJN02_22787, partial [Cucurbita argyrosperma subsp. argyrosperma]0.0e+0097.32Show/hide
Query:  STCRIQEYHFWLIRHEEDGGKSKVNFQEMTKHRFRNSLKSLFGSYLDPETNERLRGNKSVIEDKVNKIRQLIKGEDLGVEDHDQSETRKKQSIDELFDDF
        S C   EYHFWLIRHEEDGGKSKVNFQEMTKHRFRNSLKSLFGSYLDPETNERLRGNKSVIEDKVNKIRQLIKGEDLGVEDHDQSETRKKQSIDELFDDF
Subjt:  STCRIQEYHFWLIRHEEDGGKSKVNFQEMTKHRFRNSLKSLFGSYLDPETNERLRGNKSVIEDKVNKIRQLIKGEDLGVEDHDQSETRKKQSIDELFDDF

Query:  LNVYQALYEQYDSLTGELRRKFQKRREKESSSSSSSDSDSDDSSKKKVSKDDRGLEREFQEVGEIKQELDAALSEVADLKRILATTIKEHESLNTEHLTA
        LNVYQALYEQYDSLTGELRRKFQKRREKESSSSSSSDSDSDDSSKKKVSKDDRGLEREFQEVGEIKQELDAALSEVADLKRILATT KEHESLN+EHLTA
Subjt:  LNVYQALYEQYDSLTGELRRKFQKRREKESSSSSSSDSDSDDSSKKKVSKDDRGLEREFQEVGEIKQELDAALSEVADLKRILATTIKEHESLNTEHLTA

Query:  LSKIQEADGIIRDLKVEAEIWDSQKSKFQLEIEELNLALSNAGRNESELNERLKGMETEMNNYIEEKETARRKIEEGEKTIDELKALADQLKEKLSATME
        LSK+QEADGIIRDLKVEAE WDSQKSKFQLEIEELNLALSNAGRNESELNERLKGMETEMNNYIEEKETARRKIEEG KTIDELKALADQLKEKLSATME
Subjt:  LSKIQEADGIIRDLKVEAEIWDSQKSKFQLEIEELNLALSNAGRNESELNERLKGMETEMNNYIEEKETARRKIEEGEKTIDELKALADQLKEKLSATME

Query:  EKEALNSQHLKTLSRVHEADMITRDLKVESETWGGEKSKFLLEIEELNQKLGAAGKLEAQLNERLKDIGIENEYLIKEKESAQRTIEEMSQRLSNAVKIE
        EKEALNSQHLKTLSRVHEADMITRDLKVESETWGGEKSKFLLEIEELNQKLGAAGKLEAQLNERLK+IGIENEYLIKEKESAQRTIEEMSQRLSNAVKIE
Subjt:  EKEALNSQHLKTLSRVHEADMITRDLKVESETWGGEKSKFLLEIEELNQKLGAAGKLEAQLNERLKDIGIENEYLIKEKESAQRTIEEMSQRLSNAVKIE

Query:  AELNGRLKDIETEKDGLIKEKEIAWKEIEQGKRVIEELNAMVDQLNSQLTITVEEKKSLNLQLEKEKVELLRSIADHQRNLKEHEDAYKKLNDEFQECKL
        AELNGRLKD+ETEKDGLIKEKEIAWKEIEQGKRV EELNAMVDQLNSQLTITVEEKKSLNLQLEKEKVELLRS+ADHQRNLKEHEDAYKKLNDEFQECKL
Subjt:  AELNGRLKDIETEKDGLIKEKEIAWKEIEQGKRVIEELNAMVDQLNSQLTITVEEKKSLNLQLEKEKVELLRSIADHQRNLKEHEDAYKKLNDEFQECKL

Query:  KLDNAEMMMAEMSREFLNDIRSKEQVKDDLELMVEDLKRELEVKSDEINSLVENARTIEVKLRLSNQKLRVTEQLLTEKEEIFRKAELKYQKERKLLEER
        KLDNAEM MAEMSREFLNDIRSKEQVKDDLELMVEDLKRELEVKSDEINSLVENARTIEVKLRLSNQKLRVTEQLLTEKEEIFRKAELKYQKERKLLEER
Subjt:  KLDNAEMMMAEMSREFLNDIRSKEQVKDDLELMVEDLKRELEVKSDEINSLVENARTIEVKLRLSNQKLRVTEQLLTEKEEIFRKAELKYQKERKLLEER

Query:  IHGLSATVVTNKVTYQKTISTVSENINSNLSQLECVIRKFILEYAKYEKCVMETSRDLRLTKSWVSNAIEETEGLKKEVADLGKQLEDKKERESILVQQV
        IHGLSATVV NKVTYQK ISTVSE+INSNLSQLECVIRKFILEYAKYEKCVMETSRDLRLTKSWVSNAIEET+GLKKEVADLGKQLEDKKERE+ILV+QV
Subjt:  IHGLSATVVTNKVTYQKTISTVSENINSNLSQLECVIRKFILEYAKYEKCVMETSRDLRLTKSWVSNAIEETEGLKKEVADLGKQLEDKKERESILVQQV

Query:  EKLEIKANKEGSEKDGLVEAIHELEKRQTELEKLMEEKNEGMVGLKEEKKEAIRQLCMLIEYHRDRYDFLKDEVLKLNVKGGQSI
        EKLEIKANKEGSEKDGLVEAIHELEKRQTELEKLMEEKNEGMVGLKEEKKEAIRQLCMLIEYHRDRYDFLKDEVLKLNVKGGQS+
Subjt:  EKLEIKANKEGSEKDGLVEAIHELEKRQTELEKLMEEKNEGMVGLKEEKKEAIRQLCMLIEYHRDRYDFLKDEVLKLNVKGGQSI

XP_022931304.1 uncharacterized protein LOC111437527 [Cucurbita moschata]0.0e+0099.32Show/hide
Query:  MEPSVKERVLRSRRTVFSSSSIRTNDREDIDDYERRIGQDTKGVWSMQSLGDKEVGLVEETQRIENWIRRNNIEHDMDIYVNPMGAARTRAAFEHQRIER
        MEPSVKERVLRSRRTVFSSSSIRTNDREDIDDYERRIGQDTKGVWSMQSLGDKEVGLVEETQRIENWIRRNNIEHDMDIYVNPMGAARTRAAFEHQRIER
Subjt:  MEPSVKERVLRSRRTVFSSSSIRTNDREDIDDYERRIGQDTKGVWSMQSLGDKEVGLVEETQRIENWIRRNNIEHDMDIYVNPMGAARTRAAFEHQRIER

Query:  DAFTGYSGNSLAIADRIGVPNFHYPSDRPSSSNVDRLYGHPESNQDYERPLDGLDPNRAELLRRLDELKDQIIKSCDVGDRPKVVERAAVDPYYGRATYN
        DAFTGYSGNSLAIADRIGVPNFHYPSDRPSSSNVDRLYGHPESNQDYERPLDGLDPNRAELLRRLDELKDQIIKSCDVGDRPKVVERAAVDPYYGRATYN
Subjt:  DAFTGYSGNSLAIADRIGVPNFHYPSDRPSSSNVDRLYGHPESNQDYERPLDGLDPNRAELLRRLDELKDQIIKSCDVGDRPKVVERAAVDPYYGRATYN

Query:  VPMQSSTRSPPHIYEPHYVDRGNGTFPAMGQHQRNGEDLLHPPRHVVKDIPLYEDRFQEQMKRKTNYPPRLPHEHYPESFMDLRAPNSPISNASNPKESI
        VPMQSSTRSPPHIYEPHYVDRGNGTFPAMGQHQRNGEDLLHPPRHVVKDIPLYEDRFQEQMKRKTNYPPRLPHEHYPESFMDLRAPNSPISNASNPKESI
Subjt:  VPMQSSTRSPPHIYEPHYVDRGNGTFPAMGQHQRNGEDLLHPPRHVVKDIPLYEDRFQEQMKRKTNYPPRLPHEHYPESFMDLRAPNSPISNASNPKESI

Query:  KSSTYRNENPVTVGLTASNLQRAGRFPSQDALPHSRQLSELDSEIDGFSPVRPRTSVVLRRNGKSRDAIAGGAPFIVCNSCLELLKLPRKLYKLEMDWQK
        KSSTYRNENPVTVGLTASNLQRAGRFPSQDALPHSRQLSELDSEIDGFSPVRPRTSVVLRRNGKSRDAIAGGAPFIVCNSCLELLKLPRKLYKLEMDWQK
Subjt:  KSSTYRNENPVTVGLTASNLQRAGRFPSQDALPHSRQLSELDSEIDGFSPVRPRTSVVLRRNGKSRDAIAGGAPFIVCNSCLELLKLPRKLYKLEMDWQK

Query:  LQCGACSVVIIVKVENRRLVVSVPAEAKPKEVSPDEGSPKRVVNATSSLESSDNSSHKSISTDHNKPSDSRDSNLGESKTQELTSSLVPSTEKETLPTKD
        LQCGACSVVIIVKVENRRLVVSVPAEAKPKEVSPDEGSPKRVVNATSSLESSDNSSHKSISTDHNKPSDSRDSNLGESKTQELTSSLVPSTEKETLPTKD
Subjt:  LQCGACSVVIIVKVENRRLVVSVPAEAKPKEVSPDEGSPKRVVNATSSLESSDNSSHKSISTDHNKPSDSRDSNLGESKTQELTSSLVPSTEKETLPTKD

Query:  APSLTNSDNPSYDEPSKYREESENNQDTVINDVTEPSELDVSFEDYSNIHISQDSMEISKEEEEENQSKIKSNEESETFFVDLSKNNLRDFSRSSEITDN
        APSLTNSDNPSYDEPSKYREESENNQDTVINDVTEPSELDVSFEDYSNIHISQDSMEISKEEEEENQSKIKSNEESETFFVDLSKNNLRDFSRSSEITDN
Subjt:  APSLTNSDNPSYDEPSKYREESENNQDTVINDVTEPSELDVSFEDYSNIHISQDSMEISKEEEEENQSKIKSNEESETFFVDLSKNNLRDFSRSSEITDN

Query:  GRPTVSVNGQPLPAHVVKKAEKLAGPILPGDYWYDYQAGFWGVMGHPCLGIIPPFIDEFTYPMSRNCAGGNTGVFVNGRELHKRDLELLSSRGLPTTTNK
        GRPTVSVNGQPLPAHVVKKAEKLAGPILPGDYWYDYQAGFWGVMGHPCLGIIPPFIDEFTYPMSRNCAGGNTGVFVNGRELHKRDLELLSSRGLPTTTNK
Subjt:  GRPTVSVNGQPLPAHVVKKAEKLAGPILPGDYWYDYQAGFWGVMGHPCLGIIPPFIDEFTYPMSRNCAGGNTGVFVNGRELHKRDLELLSSRGLPTTTNK

Query:  LYRIDISGRVVDDDSGKELYNLGKLAPTVFTIR
        LYRIDISGRVVDDDSGKELYNLGKLAPT+  ++
Subjt:  LYRIDISGRVVDDDSGKELYNLGKLAPTVFTIR

XP_022985374.1 uncharacterized protein LOC111483403 [Cucurbita maxima]0.0e+0095.23Show/hide
Query:  MEPSVKERVLRSRRTVFSSSSIRTNDREDIDDYERRIGQDTKGVWSMQSLGDKEVGLVEETQRIENWIRRNNIEHDMDIYVNPMGAARTRAAFEHQRIER
        MEPSVKERVLRSRRTVFSSSSIRTNDREDIDDYERRIG+DTKGVWSMQSLGDKEVGLVEETQRIENWIRRNNIEHDMDIYVNP+GAARTRA FEHQR+ER
Subjt:  MEPSVKERVLRSRRTVFSSSSIRTNDREDIDDYERRIGQDTKGVWSMQSLGDKEVGLVEETQRIENWIRRNNIEHDMDIYVNPMGAARTRAAFEHQRIER

Query:  DAFTGYSGNSLAIADRIGVPNFHYPSDRPSSSNVDRLYGHPESNQDYERPLDGLDPNRAELLRRLDELKDQIIKSCDVGDRPKVVERAAVDPYYGRATYN
        DAFTGYSGNS+AIADRIGVPNFHYPSDRPSSSNVDR YGHPESNQDYERPLDGLDPNRAELLRRLDELKDQIIKSCDVGDRPKVV+RAAVDPYYGRATYN
Subjt:  DAFTGYSGNSLAIADRIGVPNFHYPSDRPSSSNVDRLYGHPESNQDYERPLDGLDPNRAELLRRLDELKDQIIKSCDVGDRPKVVERAAVDPYYGRATYN

Query:  VPMQSSTRSPPHIYEPHYVDRGNGTFPAMGQHQRNGEDLLHPPRHVVKDIPLYEDRFQEQMKRKTNYPPRLPHEHYPESFMDLRAPNSPISNASNPKESI
        VPMQSSTRSPPHIYEPHYVDRGNGTFPAMGQHQRNGED LHPPRHVVKDIPLYEDRFQEQMKRKTNYPPRLPHEHYPESFMDLR+PNSPISNASNPKE+I
Subjt:  VPMQSSTRSPPHIYEPHYVDRGNGTFPAMGQHQRNGEDLLHPPRHVVKDIPLYEDRFQEQMKRKTNYPPRLPHEHYPESFMDLRAPNSPISNASNPKESI

Query:  KSSTYRNENPVTVGLTASNLQRAGRFPSQDALPHSRQLSELDSEIDGFSPVRPRTSVVLRRNGKSRDAIAGGAPFIVCNSCLELLKLPRKLYKLEMDWQK
        KSSTYRNENPV VGLTASN QRAGRFPSQDALP S QLSELDSEIDGF PVRP+T  V RRNGKSRDAIAGGAPFIVCNSCLELLKLPRKLYKLEMDWQK
Subjt:  KSSTYRNENPVTVGLTASNLQRAGRFPSQDALPHSRQLSELDSEIDGFSPVRPRTSVVLRRNGKSRDAIAGGAPFIVCNSCLELLKLPRKLYKLEMDWQK

Query:  LQCGACSVVIIVKVENRRLVVSVPAEAKPKEVSPDEGSPKRVVNATSSLESSDNSSHKSISTDHNKPSDSRDSNLGESKTQELTSSLVPSTEKETLPTKD
        LQCGACSVVIIVKVENRRLVVSVPAEAKPKEVSPD+GSPKRVVNATSSLESSDNSSHKSI +DHNKPS+ RDSNLGESK QELTSSLV S+EKETLPTKD
Subjt:  LQCGACSVVIIVKVENRRLVVSVPAEAKPKEVSPDEGSPKRVVNATSSLESSDNSSHKSISTDHNKPSDSRDSNLGESKTQELTSSLVPSTEKETLPTKD

Query:  APSLTNSDNPSYDEPSKYREESENNQDTVINDVTEPSELDVSFEDYSNIHISQDSMEISKEEEEENQSKIKSNEESETFFVDLSKNNLRDFSRSSEITDN
        +PSLTNSDNPSYDEPSKYREESENNQDTVINDVT PSELDVSFEDYSNIHISQDSMEISKEEEEENQSKIKSNEESETFFVDLSKNNLRDFSRSSEITDN
Subjt:  APSLTNSDNPSYDEPSKYREESENNQDTVINDVTEPSELDVSFEDYSNIHISQDSMEISKEEEEENQSKIKSNEESETFFVDLSKNNLRDFSRSSEITDN

Query:  GRPTVSVNGQPLPAHVVKKAEKLAGPILPGDYWYDYQAGFWGVMGHPCLGIIPPFIDEFTYPMSRNCAGGNTGVFVNGRELHKRDLELLSSRGLPTTTNK
        GRPTVSVNGQPLPAHVVKKAEKLAGPILPGDYWYDY AGFWGVMGHPCLGIIPPFIDEFTYPMSRNCAGGNTGVFVNGRELHKRDLELLSSRGLPTTTNK
Subjt:  GRPTVSVNGQPLPAHVVKKAEKLAGPILPGDYWYDYQAGFWGVMGHPCLGIIPPFIDEFTYPMSRNCAGGNTGVFVNGRELHKRDLELLSSRGLPTTTNK

Query:  LYRIDISGRVVDDDSGKELYNLGKLAPTVFTIR
        LYRIDISGRVVDDDSGKELYNLGKLAPT+  ++
Subjt:  LYRIDISGRVVDDDSGKELYNLGKLAPTVFTIR

XP_023552647.1 uncharacterized protein LOC111810233 isoform X1 [Cucurbita pepo subsp. pepo]0.0e+0095.41Show/hide
Query:  MEPSVKERVLRSRRTVFSSSSIRTNDREDIDDYERRIGQDTKGVWSMQSLGDKEVGLVEETQRIENWIRRNNIEHDMDIYVNPMGAARTRAAFEHQRIER
        MEPSVKERVLRSRRTVFSSSSIRTNDREDIDDYERRIG+DTKGVWSMQS GDKEVGLVEETQRIENWIRRNNIEHDMDIYVNP+GAARTRA FEHQR+ER
Subjt:  MEPSVKERVLRSRRTVFSSSSIRTNDREDIDDYERRIGQDTKGVWSMQSLGDKEVGLVEETQRIENWIRRNNIEHDMDIYVNPMGAARTRAAFEHQRIER

Query:  DAFTGYSGNSLAIADRIGVPNFHYPSDRPSSSNVDRLYGHPESNQDYERPLDGLDPNRAELLRRLDELKDQIIKSCDVGDRPKVVERAAVDPYYGRATYN
        DAFTGYSGNS+AIADRIGVPNFHYPSDRPSSSNVDRLYGHPESNQDYERPLDGLDPNRAELLRRLDELKDQIIKSCDVGDRPKVVERAAVDPYYGRATYN
Subjt:  DAFTGYSGNSLAIADRIGVPNFHYPSDRPSSSNVDRLYGHPESNQDYERPLDGLDPNRAELLRRLDELKDQIIKSCDVGDRPKVVERAAVDPYYGRATYN

Query:  VPMQSSTRSPPHIYEPHYVDRGNGTFPAMGQHQRNGEDLLHPPRHVVKDIPLYEDRFQEQMKRKTNYPPRLPHEHYPESFMDLRAPNSPISNASNPKESI
        VPMQSSTRSPPHIYEPHYVDRGNGTFPA GQHQRNGE+ LHPPRHVVKDIPLYEDRFQEQMKRKTNYPPRLPHEHYPESFMDLRAPNSPISNASNPKESI
Subjt:  VPMQSSTRSPPHIYEPHYVDRGNGTFPAMGQHQRNGEDLLHPPRHVVKDIPLYEDRFQEQMKRKTNYPPRLPHEHYPESFMDLRAPNSPISNASNPKESI

Query:  KSSTYRNENPVTVGLTASNLQRAGRFPSQDALPHSRQLSELDSEIDGFSPVRPRTSVVLRRNGKSRDAIAGGAPFIVCNSCLELLKLPRKLYKLEMDWQK
        KSSTY NENPVTVGLTASNLQRAGRFPSQDALPHSRQLSELDSEIDGFSPVRPRTSVVLRRNGKSRDAIAGGAPFIVCNSCLELLKLPRKLYKLEMDWQK
Subjt:  KSSTYRNENPVTVGLTASNLQRAGRFPSQDALPHSRQLSELDSEIDGFSPVRPRTSVVLRRNGKSRDAIAGGAPFIVCNSCLELLKLPRKLYKLEMDWQK

Query:  LQCGACSVVIIVKVENRRLVVSVPAEAKPKEVSPDEGSPKRVVNATSSLESSDNSSHKSISTDHNKPSDSRDSNLGESKTQELTSSLVPSTEKET-----
        LQCGACSVVIIVKVENRRLVVSVPAE KPKEVSPD+GSPKRVVNATSSLESSDNSSHKSI TDHNKPSD RDSNLGE+ TQELTSSL+ S EKET     
Subjt:  LQCGACSVVIIVKVENRRLVVSVPAEAKPKEVSPDEGSPKRVVNATSSLESSDNSSHKSISTDHNKPSDSRDSNLGESKTQELTSSLVPSTEKET-----

Query:  --LPTKDAPSLTNSDNPSYDEPSKYREESENNQDTVINDVTEPSELDVSFEDYSNIHISQDSMEISKEEEEENQSKIKSNEESETFFVDLSKNNLRDFSR
          LPTKDAPSLTNSDNPSYDEPSKYREE ENNQDT INDVT PSELDVSFEDYSNIHISQDSMEISKEEEEENQSKIKSNEESETFFVDLSKNNLRDFSR
Subjt:  --LPTKDAPSLTNSDNPSYDEPSKYREESENNQDTVINDVTEPSELDVSFEDYSNIHISQDSMEISKEEEEENQSKIKSNEESETFFVDLSKNNLRDFSR

Query:  SSEITDNGRPTVSVNGQPLPAHVVKKAEKLAGPILPGDYWYDYQAGFWGVMGHPCLGIIPPFIDEFTYPMSRNCAGGNTGVFVNGRELHKRDLELLSSRG
        SSEITDNGRPTVSVNGQPLPAHVVKKAEKLAGPILPGDYWYDYQAGFWGVMGHPCLGIIPPFIDEFTYPMSRNCAGGNTGVFVNGRELHKRDLELLSSRG
Subjt:  SSEITDNGRPTVSVNGQPLPAHVVKKAEKLAGPILPGDYWYDYQAGFWGVMGHPCLGIIPPFIDEFTYPMSRNCAGGNTGVFVNGRELHKRDLELLSSRG

Query:  LPTTTNKLYRIDISGRVVDDDSGKELYNLGKLAPTVFTIR
        LPTTTNKLYRIDISGRVVDDDSGKELYNLGKLAPT+  ++
Subjt:  LPTTTNKLYRIDISGRVVDDDSGKELYNLGKLAPTVFTIR

TrEMBL top hitse value%identityAlignment
A0A5D3CYK7 DUF3133 domain-containing protein0.0e+0067.38Show/hide
Query:  IRVLRCPRCENLL----------------------LVLF----------------------------VEWDCPSMEPSVKERVLRSRRTVFSSSSIRTND
        +RV+RCPRC+NLL                       VLF                             EWD PSMEP+V ERV+RSRRTVFS+S IRT+D
Subjt:  IRVLRCPRCENLL----------------------LVLF----------------------------VEWDCPSMEPSVKERVLRSRRTVFSSSSIRTND

Query:  REDIDDYERRIGQDTKGVWSMQSLGDKEVGLVEE------TQRIENWIRRNNIEHDMDI----------YVNPMGAARTRAAFEHQRIERDAFTGYSGNS
        REDIDDYER+IG++TKGVW+MQ LGDKEVG VEE       QRIENW+RR NIE DM+I          Y NP+G ARTRA FEH R+ERDAFTGYSGNS
Subjt:  REDIDDYERRIGQDTKGVWSMQSLGDKEVGLVEE------TQRIENWIRRNNIEHDMDI----------YVNPMGAARTRAAFEHQRIERDAFTGYSGNS

Query:  LAIADRIGVPNFHYPSDRPSSSNVDRLYGHPESNQDYERPLDGLDPNRAELLRRLDELKDQIIKSCDVGDRPKVV-ERAAVDPYYGRATYNVPMQSSTRS
        +A A   G PNF Y SDRPSSSN+D  YGHPE N++YE P++GLDPNRAELLR+LDELK QIIKSCDV DRP+VV +RA VDPYYGRA YNV M+SST+S
Subjt:  LAIADRIGVPNFHYPSDRPSSSNVDRLYGHPESNQDYERPLDGLDPNRAELLRRLDELKDQIIKSCDVGDRPKVV-ERAAVDPYYGRATYNVPMQSSTRS

Query:  PPHIYEPHYVDRGNGTFPAMGQHQRNGEDLLHPPRHVVKDIPLYEDRFQEQMKRKTNYPP---RLPHEHYPESFMD------------------------
        P HI  P Y  RG+GTFPA G HQRNGED LHPPRHVVKD+PLYED+FQEQM RKTN+ P     P +HYPES MD                        
Subjt:  PPHIYEPHYVDRGNGTFPAMGQHQRNGEDLLHPPRHVVKDIPLYEDRFQEQMKRKTNYPP---RLPHEHYPESFMD------------------------

Query:  -----------LRAPNSPISNASNPKESIKSSTYRNENPVTVGLTASNLQRAGRFPSQDALPHSRQLSELDSEIDGFSPVRPRTSVVLRRNGKSRDAIAG
                   L+APNSP  N SNPKE  KSSTY NENPVTVGL ASNL RAGRFPSQD LPHSRQ SELDSEIDGF  V+PRT+ VL+RNGKSRDAIAG
Subjt:  -----------LRAPNSPISNASNPKESIKSSTYRNENPVTVGLTASNLQRAGRFPSQDALPHSRQLSELDSEIDGFSPVRPRTSVVLRRNGKSRDAIAG

Query:  GAPFIVCNSCLELLKLPRKLYKLEMDWQKLQCGACSVVIIVKVENRRLVVSVPAEAKPKEVSPDEGSPKRVVNATSSLESSDNSSHKSISTDHNKPSDSR
        GAPFIVC+SCLELLKLPRKLYKLE+DWQKLQCGACSVVIIVKV+NR+LV+SVPAE KP EVSP++GSP+ VV+AT S+ESSDNSSHK I TDHNKPSD +
Subjt:  GAPFIVCNSCLELLKLPRKLYKLEMDWQKLQCGACSVVIIVKVENRRLVVSVPAEAKPKEVSPDEGSPKRVVNATSSLESSDNSSHKSISTDHNKPSDSR

Query:  DSNLGESKTQELTSSLVPSTEKET---------------LPTKDAPS----LTNSDNPSYDEPSKYREESENNQDTVINDVTEPSELDVSFEDYSNIHIS
        DS+   +KTQE+TSS + S EKE+               LP KD PS    + NSDNPS+D+PS++RE SEN Q  +++DVTEPSELDVSF+DYSNIH+S
Subjt:  DSNLGESKTQELTSSLVPSTEKET---------------LPTKDAPS----LTNSDNPSYDEPSKYREESENNQDTVINDVTEPSELDVSFEDYSNIHIS

Query:  QDSMEISKEEEEE-----NQSKIKSNEESETFFVDLSKNNLRDFSRSSEITDNGRPTVSVNGQPLPAHVVKKAEKLAGPILPGDYWYDYQAGFWGVMGHP
         D++EI+KEEEEE     +Q+K+KSN+ESETFFV LS+NNLRDFSRSSEITDNGRPTVSVNGQPLPAH+VKKAEK AGPILPGDYWYDYQAGFWGVMGHP
Subjt:  QDSMEISKEEEEE-----NQSKIKSNEESETFFVDLSKNNLRDFSRSSEITDNGRPTVSVNGQPLPAHVVKKAEKLAGPILPGDYWYDYQAGFWGVMGHP

Query:  CLGIIPPFIDEFTYPMSRNCAGGNTGVFVNGRELHKRDLELLSSRGLPTTTNKLYRIDISGRVVDDDSGKELYNLGKLAPTVFTIR
        CLGIIPPFIDEFTYPMSRNCA GNTG+FVNGRELHKRDLELLSSRGLPTTTNKLYRIDISGRV+D+DSGKELYNLGKLAPT+  ++
Subjt:  CLGIIPPFIDEFTYPMSRNCAGGNTGVFVNGRELHKRDLELLSSRGLPTTTNKLYRIDISGRVVDDDSGKELYNLGKLAPTVFTIR

A0A6J1ETW9 uncharacterized protein LOC1114375270.0e+0099.32Show/hide
Query:  MEPSVKERVLRSRRTVFSSSSIRTNDREDIDDYERRIGQDTKGVWSMQSLGDKEVGLVEETQRIENWIRRNNIEHDMDIYVNPMGAARTRAAFEHQRIER
        MEPSVKERVLRSRRTVFSSSSIRTNDREDIDDYERRIGQDTKGVWSMQSLGDKEVGLVEETQRIENWIRRNNIEHDMDIYVNPMGAARTRAAFEHQRIER
Subjt:  MEPSVKERVLRSRRTVFSSSSIRTNDREDIDDYERRIGQDTKGVWSMQSLGDKEVGLVEETQRIENWIRRNNIEHDMDIYVNPMGAARTRAAFEHQRIER

Query:  DAFTGYSGNSLAIADRIGVPNFHYPSDRPSSSNVDRLYGHPESNQDYERPLDGLDPNRAELLRRLDELKDQIIKSCDVGDRPKVVERAAVDPYYGRATYN
        DAFTGYSGNSLAIADRIGVPNFHYPSDRPSSSNVDRLYGHPESNQDYERPLDGLDPNRAELLRRLDELKDQIIKSCDVGDRPKVVERAAVDPYYGRATYN
Subjt:  DAFTGYSGNSLAIADRIGVPNFHYPSDRPSSSNVDRLYGHPESNQDYERPLDGLDPNRAELLRRLDELKDQIIKSCDVGDRPKVVERAAVDPYYGRATYN

Query:  VPMQSSTRSPPHIYEPHYVDRGNGTFPAMGQHQRNGEDLLHPPRHVVKDIPLYEDRFQEQMKRKTNYPPRLPHEHYPESFMDLRAPNSPISNASNPKESI
        VPMQSSTRSPPHIYEPHYVDRGNGTFPAMGQHQRNGEDLLHPPRHVVKDIPLYEDRFQEQMKRKTNYPPRLPHEHYPESFMDLRAPNSPISNASNPKESI
Subjt:  VPMQSSTRSPPHIYEPHYVDRGNGTFPAMGQHQRNGEDLLHPPRHVVKDIPLYEDRFQEQMKRKTNYPPRLPHEHYPESFMDLRAPNSPISNASNPKESI

Query:  KSSTYRNENPVTVGLTASNLQRAGRFPSQDALPHSRQLSELDSEIDGFSPVRPRTSVVLRRNGKSRDAIAGGAPFIVCNSCLELLKLPRKLYKLEMDWQK
        KSSTYRNENPVTVGLTASNLQRAGRFPSQDALPHSRQLSELDSEIDGFSPVRPRTSVVLRRNGKSRDAIAGGAPFIVCNSCLELLKLPRKLYKLEMDWQK
Subjt:  KSSTYRNENPVTVGLTASNLQRAGRFPSQDALPHSRQLSELDSEIDGFSPVRPRTSVVLRRNGKSRDAIAGGAPFIVCNSCLELLKLPRKLYKLEMDWQK

Query:  LQCGACSVVIIVKVENRRLVVSVPAEAKPKEVSPDEGSPKRVVNATSSLESSDNSSHKSISTDHNKPSDSRDSNLGESKTQELTSSLVPSTEKETLPTKD
        LQCGACSVVIIVKVENRRLVVSVPAEAKPKEVSPDEGSPKRVVNATSSLESSDNSSHKSISTDHNKPSDSRDSNLGESKTQELTSSLVPSTEKETLPTKD
Subjt:  LQCGACSVVIIVKVENRRLVVSVPAEAKPKEVSPDEGSPKRVVNATSSLESSDNSSHKSISTDHNKPSDSRDSNLGESKTQELTSSLVPSTEKETLPTKD

Query:  APSLTNSDNPSYDEPSKYREESENNQDTVINDVTEPSELDVSFEDYSNIHISQDSMEISKEEEEENQSKIKSNEESETFFVDLSKNNLRDFSRSSEITDN
        APSLTNSDNPSYDEPSKYREESENNQDTVINDVTEPSELDVSFEDYSNIHISQDSMEISKEEEEENQSKIKSNEESETFFVDLSKNNLRDFSRSSEITDN
Subjt:  APSLTNSDNPSYDEPSKYREESENNQDTVINDVTEPSELDVSFEDYSNIHISQDSMEISKEEEEENQSKIKSNEESETFFVDLSKNNLRDFSRSSEITDN

Query:  GRPTVSVNGQPLPAHVVKKAEKLAGPILPGDYWYDYQAGFWGVMGHPCLGIIPPFIDEFTYPMSRNCAGGNTGVFVNGRELHKRDLELLSSRGLPTTTNK
        GRPTVSVNGQPLPAHVVKKAEKLAGPILPGDYWYDYQAGFWGVMGHPCLGIIPPFIDEFTYPMSRNCAGGNTGVFVNGRELHKRDLELLSSRGLPTTTNK
Subjt:  GRPTVSVNGQPLPAHVVKKAEKLAGPILPGDYWYDYQAGFWGVMGHPCLGIIPPFIDEFTYPMSRNCAGGNTGVFVNGRELHKRDLELLSSRGLPTTTNK

Query:  LYRIDISGRVVDDDSGKELYNLGKLAPTVFTIR
        LYRIDISGRVVDDDSGKELYNLGKLAPT+  ++
Subjt:  LYRIDISGRVVDDDSGKELYNLGKLAPTVFTIR

A0A6J1EZ25 COP1-interactive protein 1-like isoform X10.0e+0099.87Show/hide
Query:  MTKHRFRNSLKSLFGSYLDPETNERLRGNKSVIEDKVNKIRQLIKGEDLGVEDHDQSETRKKQSIDELFDDFLNVYQALYEQYDSLTGELRRKFQKRREK
        MTKHRFRNSLKSLFGSYLDPETNERLRGNKSVIEDKVNKIRQLIKGEDLGVEDHDQSETRKKQSIDELFDDFLNVYQALYEQYDSLTGELRRKFQKRREK
Subjt:  MTKHRFRNSLKSLFGSYLDPETNERLRGNKSVIEDKVNKIRQLIKGEDLGVEDHDQSETRKKQSIDELFDDFLNVYQALYEQYDSLTGELRRKFQKRREK

Query:  ESSSSSSSDSDSDDSSKKKVSKDDRGLEREFQEVGEIKQELDAALSEVADLKRILATTIKEHESLNTEHLTALSKIQEADGIIRDLKVEAEIWDSQKSKF
        ESSSSSSSDSDSDDSSKKKVSKDDRGLEREFQEVGEIKQELDAALSEVADLKRILATTIKEHESLNTEHLTALSKIQEADGIIRDLKVEAEIWDSQKSKF
Subjt:  ESSSSSSSDSDSDDSSKKKVSKDDRGLEREFQEVGEIKQELDAALSEVADLKRILATTIKEHESLNTEHLTALSKIQEADGIIRDLKVEAEIWDSQKSKF

Query:  QLEIEELNLALSNAGRNESELNERLKGMETEMNNYIEEKETARRKIEEGEKTIDELKALADQLKEKLSATMEEKEALNSQHLKTLSRVHEADMITRDLKV
        QLEIEELNLALSNAGRNESELNERLKGMETEMNNYIEEKETARRKIEEGEKTIDELKALADQLKEKLSATMEEKEALNSQHLKTLSRVHEADMITRDLKV
Subjt:  QLEIEELNLALSNAGRNESELNERLKGMETEMNNYIEEKETARRKIEEGEKTIDELKALADQLKEKLSATMEEKEALNSQHLKTLSRVHEADMITRDLKV

Query:  ESETWGGEKSKFLLEIEELNQKLGAAGKLEAQLNERLKDIGIENEYLIKEKESAQRTIEEMSQRLSNAVKIEAELNGRLKDIETEKDGLIKEKEIAWKEI
        ESETWGGEKSKFLLEIEELNQKLGAAGKLEAQLNERLKDIGIENEYLIKEKESAQRTIEEMSQRLSNAVKIEAELNGRLKDIETEKDGLIKEKEIAWKEI
Subjt:  ESETWGGEKSKFLLEIEELNQKLGAAGKLEAQLNERLKDIGIENEYLIKEKESAQRTIEEMSQRLSNAVKIEAELNGRLKDIETEKDGLIKEKEIAWKEI

Query:  EQGKRVIEELNAMVDQLNSQLTITVEEKKSLNLQLEKEKVELLRSIADHQRNLKEHEDAYKKLNDEFQECKLKLDNAEMMMAEMSREFLNDIRSKEQVKD
        EQGKRVIEELNAMVDQLNSQLTITVEEKKSLNLQLEKEKVELLRSIADHQRNLKEHEDAYKKLNDEFQECKLKLDNAEMMMAEMSREFLNDIRSKEQVKD
Subjt:  EQGKRVIEELNAMVDQLNSQLTITVEEKKSLNLQLEKEKVELLRSIADHQRNLKEHEDAYKKLNDEFQECKLKLDNAEMMMAEMSREFLNDIRSKEQVKD

Query:  DLELMVEDLKRELEVKSDEINSLVENARTIEVKLRLSNQKLRVTEQLLTEKEEIFRKAELKYQKERKLLEERIHGLSATVVTNKVTYQKTISTVSENINS
        DLELMVEDLKRELEVKSDEINSLVENARTIEVKLRLSNQKLRVTEQLLTEKEEIFRKAELKYQKERKLLEERIHGLSATVVTNKVTYQKTISTVSENINS
Subjt:  DLELMVEDLKRELEVKSDEINSLVENARTIEVKLRLSNQKLRVTEQLLTEKEEIFRKAELKYQKERKLLEERIHGLSATVVTNKVTYQKTISTVSENINS

Query:  NLSQLECVIRKFILEYAKYEKCVMETSRDLRLTKSWVSNAIEETEGLKKEVADLGKQLEDKKERESILVQQVEKLEIKANKEGSEKDGLVEAIHELEKRQ
        NLSQLECVIRKFILEYAKYEKCVMETSRDLRLTKSWVSNAIEETEGLKKEVADLGKQLEDKKERESILVQQVEKLEIKANKEGSEKDGLVEAIHELEKRQ
Subjt:  NLSQLECVIRKFILEYAKYEKCVMETSRDLRLTKSWVSNAIEETEGLKKEVADLGKQLEDKKERESILVQQVEKLEIKANKEGSEKDGLVEAIHELEKRQ

Query:  TELEKLMEEKNEGMVGLKEEKKEAIRQLCMLIEYHRDRYDFLKDEVLKLNVKGGQSI
        TELEKLMEEKNEGMVGLKEEKKEAIRQLCMLIEYHRDRYDFLKDEVLKLNVKGGQS+
Subjt:  TELEKLMEEKNEGMVGLKEEKKEAIRQLCMLIEYHRDRYDFLKDEVLKLNVKGGQSI

A0A6J1J4Q6 uncharacterized protein LOC1114834030.0e+0095.23Show/hide
Query:  MEPSVKERVLRSRRTVFSSSSIRTNDREDIDDYERRIGQDTKGVWSMQSLGDKEVGLVEETQRIENWIRRNNIEHDMDIYVNPMGAARTRAAFEHQRIER
        MEPSVKERVLRSRRTVFSSSSIRTNDREDIDDYERRIG+DTKGVWSMQSLGDKEVGLVEETQRIENWIRRNNIEHDMDIYVNP+GAARTRA FEHQR+ER
Subjt:  MEPSVKERVLRSRRTVFSSSSIRTNDREDIDDYERRIGQDTKGVWSMQSLGDKEVGLVEETQRIENWIRRNNIEHDMDIYVNPMGAARTRAAFEHQRIER

Query:  DAFTGYSGNSLAIADRIGVPNFHYPSDRPSSSNVDRLYGHPESNQDYERPLDGLDPNRAELLRRLDELKDQIIKSCDVGDRPKVVERAAVDPYYGRATYN
        DAFTGYSGNS+AIADRIGVPNFHYPSDRPSSSNVDR YGHPESNQDYERPLDGLDPNRAELLRRLDELKDQIIKSCDVGDRPKVV+RAAVDPYYGRATYN
Subjt:  DAFTGYSGNSLAIADRIGVPNFHYPSDRPSSSNVDRLYGHPESNQDYERPLDGLDPNRAELLRRLDELKDQIIKSCDVGDRPKVVERAAVDPYYGRATYN

Query:  VPMQSSTRSPPHIYEPHYVDRGNGTFPAMGQHQRNGEDLLHPPRHVVKDIPLYEDRFQEQMKRKTNYPPRLPHEHYPESFMDLRAPNSPISNASNPKESI
        VPMQSSTRSPPHIYEPHYVDRGNGTFPAMGQHQRNGED LHPPRHVVKDIPLYEDRFQEQMKRKTNYPPRLPHEHYPESFMDLR+PNSPISNASNPKE+I
Subjt:  VPMQSSTRSPPHIYEPHYVDRGNGTFPAMGQHQRNGEDLLHPPRHVVKDIPLYEDRFQEQMKRKTNYPPRLPHEHYPESFMDLRAPNSPISNASNPKESI

Query:  KSSTYRNENPVTVGLTASNLQRAGRFPSQDALPHSRQLSELDSEIDGFSPVRPRTSVVLRRNGKSRDAIAGGAPFIVCNSCLELLKLPRKLYKLEMDWQK
        KSSTYRNENPV VGLTASN QRAGRFPSQDALP S QLSELDSEIDGF PVRP+T  V RRNGKSRDAIAGGAPFIVCNSCLELLKLPRKLYKLEMDWQK
Subjt:  KSSTYRNENPVTVGLTASNLQRAGRFPSQDALPHSRQLSELDSEIDGFSPVRPRTSVVLRRNGKSRDAIAGGAPFIVCNSCLELLKLPRKLYKLEMDWQK

Query:  LQCGACSVVIIVKVENRRLVVSVPAEAKPKEVSPDEGSPKRVVNATSSLESSDNSSHKSISTDHNKPSDSRDSNLGESKTQELTSSLVPSTEKETLPTKD
        LQCGACSVVIIVKVENRRLVVSVPAEAKPKEVSPD+GSPKRVVNATSSLESSDNSSHKSI +DHNKPS+ RDSNLGESK QELTSSLV S+EKETLPTKD
Subjt:  LQCGACSVVIIVKVENRRLVVSVPAEAKPKEVSPDEGSPKRVVNATSSLESSDNSSHKSISTDHNKPSDSRDSNLGESKTQELTSSLVPSTEKETLPTKD

Query:  APSLTNSDNPSYDEPSKYREESENNQDTVINDVTEPSELDVSFEDYSNIHISQDSMEISKEEEEENQSKIKSNEESETFFVDLSKNNLRDFSRSSEITDN
        +PSLTNSDNPSYDEPSKYREESENNQDTVINDVT PSELDVSFEDYSNIHISQDSMEISKEEEEENQSKIKSNEESETFFVDLSKNNLRDFSRSSEITDN
Subjt:  APSLTNSDNPSYDEPSKYREESENNQDTVINDVTEPSELDVSFEDYSNIHISQDSMEISKEEEEENQSKIKSNEESETFFVDLSKNNLRDFSRSSEITDN

Query:  GRPTVSVNGQPLPAHVVKKAEKLAGPILPGDYWYDYQAGFWGVMGHPCLGIIPPFIDEFTYPMSRNCAGGNTGVFVNGRELHKRDLELLSSRGLPTTTNK
        GRPTVSVNGQPLPAHVVKKAEKLAGPILPGDYWYDY AGFWGVMGHPCLGIIPPFIDEFTYPMSRNCAGGNTGVFVNGRELHKRDLELLSSRGLPTTTNK
Subjt:  GRPTVSVNGQPLPAHVVKKAEKLAGPILPGDYWYDYQAGFWGVMGHPCLGIIPPFIDEFTYPMSRNCAGGNTGVFVNGRELHKRDLELLSSRGLPTTTNK

Query:  LYRIDISGRVVDDDSGKELYNLGKLAPTVFTIR
        LYRIDISGRVVDDDSGKELYNLGKLAPT+  ++
Subjt:  LYRIDISGRVVDDDSGKELYNLGKLAPTVFTIR

A0A6J1JD34 COP1-interactive protein 1 isoform X10.0e+0096.17Show/hide
Query:  MTKHRFRNSLKSLFGSYLDPETNERLRGNKSVIEDKVNKIRQLIKGEDLGVEDHDQSETRKKQSIDELFDDFLNVYQALYEQYDSLTGELRRKFQKRREK
        MTKHRFRNSLKSLFGSYLDPETNERL+GNKSVIEDKVNKIRQLIKGEDLGVEDHDQSE RKKQSIDEL DDFLNVYQALYEQYDSLTGELRRKFQKRREK
Subjt:  MTKHRFRNSLKSLFGSYLDPETNERLRGNKSVIEDKVNKIRQLIKGEDLGVEDHDQSETRKKQSIDELFDDFLNVYQALYEQYDSLTGELRRKFQKRREK

Query:  ESSSSSSSDSDSDDSSKKKVSKDDRGLEREFQEVGEIKQELDAALSEVADLKRILATTIKEHESLNTEHLTALSKIQEADGIIRDLKVEAEIWDSQKSKF
        ESSSSSSSDSDSDDSSKKKVSKDDRGLEREFQEVGEIKQELDAALSEVADLKRILATT KEHESLN+EHLTALSK+QEAD IIRD KVEAE WDSQKSKF
Subjt:  ESSSSSSSDSDSDDSSKKKVSKDDRGLEREFQEVGEIKQELDAALSEVADLKRILATTIKEHESLNTEHLTALSKIQEADGIIRDLKVEAEIWDSQKSKF

Query:  QLEIEELNLALSNAGRNESELNERLKGMETEMNNYIEEKETARRKIEEGEKTIDELKALADQLKEKLSATMEEKEALNSQHLKTLSRVHEADMITRDLKV
        QLEIEELNLALSNAGRNESELNERLKGM TEMNNYIEEKETARRKIEEGEKTIDE KALADQLKEKLSATMEEKEALNSQHLKTLSRVHEA MITRDLKV
Subjt:  QLEIEELNLALSNAGRNESELNERLKGMETEMNNYIEEKETARRKIEEGEKTIDELKALADQLKEKLSATMEEKEALNSQHLKTLSRVHEADMITRDLKV

Query:  ESETWGGEKSKFLLEIEELNQKLGAAGKLEAQLNERLKDIGIENEYLIKEKESAQRTIEEMSQRLSNAVKIEAELNGRLKDIETEKDGLIKEKEIAWKEI
        ESETWGGEKSKFLLEIEELNQKLGAAGKLEAQLNERLKDIGIENEYLI+EKESAQRTIEEMSQRLSNAVKIEAEL+GRLKDIETEKDGLIKEKEIAWKEI
Subjt:  ESETWGGEKSKFLLEIEELNQKLGAAGKLEAQLNERLKDIGIENEYLIKEKESAQRTIEEMSQRLSNAVKIEAELNGRLKDIETEKDGLIKEKEIAWKEI

Query:  EQGKRVIEELNAMVDQLNSQLTITVEEKKSLNLQLEKEKVELLRSIADHQRNLKEHEDAYKKLNDEFQECKLKLDNAEMMMAEMSREFLNDIRSKEQVKD
        EQG RV EELNAMVDQLNSQLTITVEEKKSL+LQLEKEKVELLRSIADHQRNLKEHEDAYKKLNDEFQECKLKLDNAEM  AEMSREF NDIRSKEQVK+
Subjt:  EQGKRVIEELNAMVDQLNSQLTITVEEKKSLNLQLEKEKVELLRSIADHQRNLKEHEDAYKKLNDEFQECKLKLDNAEMMMAEMSREFLNDIRSKEQVKD

Query:  DLELMVEDLKRELEVKSDEINSLVENARTIEVKLRLSNQKLRVTEQLLTEKEEIFRKAELKYQKERKLLEERIHGLSATVVTNKVTYQKTISTVSENINS
        DLE+MVEDLKRELEVKSDEINSLVENARTIEVKLRLSNQKLRVTEQLLTEKEEIFRKAELKYQKERKLLEERIHGLSATVV NKVTYQKTISTVSENINS
Subjt:  DLELMVEDLKRELEVKSDEINSLVENARTIEVKLRLSNQKLRVTEQLLTEKEEIFRKAELKYQKERKLLEERIHGLSATVVTNKVTYQKTISTVSENINS

Query:  NLSQLECVIRKFILEYAKYEKCVMETSRDLRLTKSWVSNAIEETEGLKKEVADLGKQLEDKKERESILVQQVEKLEIKANKEGSEKDGLVEAIHELEKRQ
        NLSQLECVIRKFILE+AKYEKCVM+TSRDLRLTKSWVSNAIEETEGLKKEVADLGKQLEDKKERESILV+QVEKLEIKANKEGSEKDGLVEAIHELEKRQ
Subjt:  NLSQLECVIRKFILEYAKYEKCVMETSRDLRLTKSWVSNAIEETEGLKKEVADLGKQLEDKKERESILVQQVEKLEIKANKEGSEKDGLVEAIHELEKRQ

Query:  TELEKLMEEKNEGMVGLKEEKKEAIRQLCMLIEYHRDRYDFLKDEVLKLNVKGGQSI
         ELEK+MEEKNEGMVGLKEEKKEAIRQLCMLIEYHRDRYDFLKDEVLKLNVKGGQS+
Subjt:  TELEKLMEEKNEGMVGLKEEKKEAIRQLCMLIEYHRDRYDFLKDEVLKLNVKGGQSI

SwissProt top hitse value%identityAlignment
F4HW02 GABA transporter 16.0e-5937.16Show/hide
Query:  LDHCEKAGRRHIRFRELAADVLGSGWMFYFVIFIQTAINTGVGIGAILLSGQCLQIIYSNLYPNGSMKLYEFIAIVTGVMIILSQLPTFHSLRHVSLASL
        L+H    G R++RFR++A  +L   W  Y+V  IQ A+  GV I   LL GQCL+ +Y  + PNG MKL+EF+ I   ++++L+Q P+FHSLR+++  SL
Subjt:  LDHCEKAGRRHIRFRELAADVLGSGWMFYFVIFIQTAINTGVGIGAILLSGQCLQIIYSNLYPNGSMKLYEFIAIVTGVMIILSQLPTFHSLRHVSLASL

Query:  LLSLGYAFLIIAA--------------------------------CIIAAS------------------GKMVKGLLMCYCVIFVTFYAIAASGYWVFGN
        LL L Y+    AA                                 IIA +                  GKM+KGL MCY V+ +TF+ +A +GYW FG 
Subjt:  LLSLGYAFLIIAA--------------------------------CIIAAS------------------GKMVKGLLMCYCVIFVTFYAIAASGYWVFGN

Query:  RASSNILLS-LTPDTGPPLAPAWILGLAVIFVLLQLLAIGLVYSQVAYEIMEKQSADTKKGMFSKRNLIPRLILRSIYMIICGFFAAMLPFFGDISAVVG
        +A+  I  + L  +T     P W + L  +F +LQL A+ +VY Q   +I+E   +D  K  FS RN+IPRL++RS+++++    AAMLPFFGD+++++G
Subjt:  RASSNILLS-LTPDTGPPLAPAWILGLAVIFVLLQLLAIGLVYSQVAYEIMEKQSADTKKGMFSKRNLIPRLILRSIYMIICGFFAAMLPFFGDISAVVG

Query:  AICFIPLDFILPMLLYNITHNPPKSSLTYSINLAIIVVFTGVGLLGSFSSIRKLVLDTSKFKLFSN
        A  FIPLDF+LP++ +N T  P K S  + IN  I VVF+ +G++   +++R++++D + +KLF++
Subjt:  AICFIPLDFILPMLLYNITHNPPKSSLTYSINLAIIVVFTGVGLLGSFSSIRKLVLDTSKFKLFSN

F4JZY1 COP1-interactive protein 12.1e-4327.37Show/hide
Query:  MTKHRFRNSLKSLFGSYLDPETNERLRGNKSVIEDKVNKIRQLIKGEDLGVEDHDQSETRKKQSIDELFDDFLNVYQALYEQYDSLTGELRRKFQKRREK
        M KH+FR +LKS F  + D E  E L+G K+ I++KVNKI  +++  D+   + D+S    +Q + +L  +F + YQ+LY QYD LTGE+R+K   +   
Subjt:  MTKHRFRNSLKSLFGSYLDPETNERLRGNKSVIEDKVNKIRQLIKGEDLGVEDHDQSETRKKQSIDELFDDFLNVYQALYEQYDSLTGELRRKFQKRREK

Query:  ESSSSSSSDSDSDDSSKKKVSKDDRG-LEREFQEV-GEIKQELDAALSEVADLKRILATTIKEHESLNTEHLTALSKIQEADGIIRDLKVEAEIWDSQKS
        ESSSSSSSDSDSD SSK+KV ++  G +E++ + V G +KQ+++AA  E+ADLK  L TT++E E++++E   AL K++E++ I   LK+E E  + +KS
Subjt:  ESSSSSSSDSDSDDSSKKKVSKDDRG-LEREFQEV-GEIKQELDAALSEVADLKRILATTIKEHESLNTEHLTALSKIQEADGIIRDLKVEAEIWDSQKS

Query:  KFQLEIEELNLALSNAGRNESELNERLKGMETEMNNYIEEKETARRKIEEGEKTIDELKALADQLKEKLSATMEEKEALNSQHLKTLSRVHEADMITRDL
            +  EL+  L  AG+ E++LN++L+ ++ E +    E++   ++ +E EK  ++ K  +DQLK++ S   ++ EA   +  +  S ++ A+   + L
Subjt:  KFQLEIEELNLALSNAGRNESELNERLKGMETEMNNYIEEKETARRKIEEGEKTIDELKALADQLKEKLSATMEEKEALNSQHLKTLSRVHEADMITRDL

Query:  KVESETWGGEKSKFLLEIEELNQKLGAAGKLEAQLNERLKDIGIENEYLIK-----EKESAQRTIEEMSQRLSNAVKIEAELNGRLKDIETEKDGLIKEK
         ++         +    I+EL  +LG       ++ E+ K+   E+  L++     E+ES+ + ++E+   + ++ K+ A+    L + E EK  L ++ 
Subjt:  KVESETWGGEKSKFLLEIEELNQKLGAAGKLEAQLNERLKDIGIENEYLIK-----EKESAQRTIEEMSQRLSNAVKIEAELNGRLKDIETEKDGLIKEK

Query:  EIAWKEIEQGKRVIEELNAMVDQLNSQLTITVEEKKSLNLQLEKEKVELLRSIADHQRNLKEHEDAYKKLNDEFQECKLKLDNAEMMMAEMSREFLNDIR
             EI++ +  ++EL +   QL    ++   E  SL    E  +        D      E E   +    +  +    L  AE     +S + +  + 
Subjt:  EIAWKEIEQGKRVIEELNAMVDQLNSQLTITVEEKKSLNLQLEKEKVELLRSIADHQRNLKEHEDAYKKLNDEFQECKLKLDNAEMMMAEMSREFLNDIR

Query:  SKEQVKDDL-ELMVE--DLKRELEVKSDEINSLVE----NARTIEVKLRLSNQKLRVTEQLLTEKEEIFRKAELKYQKERKLLEERIHGLSATVVTNKVT
          EQ ++ + ELM E   LK     K  E++SLVE    + R   + ++   +++  +++L+ E  +    AE    +E+K+L ++I  LS  +   + T
Subjt:  SKEQVKDDL-ELMVE--DLKRELEVKSDEINSLVE----NARTIEVKLRLSNQKLRVTEQLLTEKEEIFRKAELKYQKERKLLEERIHGLSATVVTNKVT

Query:  YQKTISTVSENINSNLSQLECVIRKFILEYAKYEKCVMETSRDLRLTKSWVSNAIEETEGLKKEVADLGKQLEDKKERESIL----VQQVEKLEIKANKE
         Q+ +S      +  L +   V  + +       + + ET +  R + + VS    + E  ++ ++DL   L+D +E    +    ++ ++KLE   N  
Subjt:  YQKTISTVSENINSNLSQLECVIRKFILEYAKYEKCVMETSRDLRLTKSWVSNAIEETEGLKKEVADLGKQLEDKKERESIL----VQQVEKLEIKANKE

Query:  GSEKDGLVEAIHELEKRQTELEKLMEEKNEGMVGLKEEKKEAIRQLCML
            D L E     +++++EL  L++  ++ +  +K+    A  +  ML
Subjt:  GSEKDGLVEAIHELEKRQTELEKLMEEKNEGMVGLKEEKKEAIRQLCML

Q8L4X4 Probable GABA transporter 28.2e-12565.95Show/hide
Query:  VLDHCEKAGRRHIRFRELAADVLGSGWMFYFVIFIQTAINTGVGIGAILLSGQCLQIIYSNLYPNGSMKLYEFIAIVTGVMIILSQLPTFHSLRHVSLAS
        VLDHCEK+GRRHIRFRELAADVLGSG MFY VIFIQTAINTG+GIGAILL+GQCL I+YS+L+P G++KLYEFIA+VT VM++LSQLP+FHSLRH++ AS
Subjt:  VLDHCEKAGRRHIRFRELAADVLGSGWMFYFVIFIQTAINTGVGIGAILLSGQCLQIIYSNLYPNGSMKLYEFIAIVTGVMIILSQLPTFHSLRHVSLAS

Query:  LLLSLGYAFLIIAAC--------------------------------IIA------------------ASGKMVKGLLMCYCVIFVTFYAIAASGYWVFG
        LLLSLGY FL++ AC                                IIA                  A+GKM+KGLL+CY VIF TFY+ A SGYWVFG
Subjt:  LLLSLGYAFLIIAAC--------------------------------IIA------------------ASGKMVKGLLMCYCVIFVTFYAIAASGYWVFG

Query:  NRASSNILLSLTPDTGPPLAPAWILGLAVIFVLLQLLAIGLVYSQVAYEIMEKQSADTKKGMFSKRNLIPRLILRSIYMIICGFFAAMLPFFGDISAVVG
        N +SSNIL +L PD GP LAP  ++GLAVIFVLLQL AIGLVYSQVAYEIMEK+SADT KG+FSKRNL+PRLILR++YM  CGF AAMLPFFGDI+AVVG
Subjt:  NRASSNILLSLTPDTGPPLAPAWILGLAVIFVLLQLLAIGLVYSQVAYEIMEKQSADTKKGMFSKRNLIPRLILRSIYMIICGFFAAMLPFFGDISAVVG

Query:  AICFIPLDFILPMLLYNITHNPPKSSLTYSINLAIIVVFTGVGLLGSFSSIRKLVLDTSKFKLFSNDVVD
        A  FIPLDF+LPMLLYN+T+ P + S TY IN+ I+VVFT  GL+G+FSSIRKLVLD +KFKLFS++VVD
Subjt:  AICFIPLDFILPMLLYNITHNPPKSSLTYSINLAIIVVFTGVGLLGSFSSIRKLVLDTSKFKLFSNDVVD

Q9SUP7 High mobility group B protein 61.3e-14161.7Show/hide
Query:  MASTAEIPKTKKPRNSRKALKDKNSTPEEPQSESSMVTKVTQPSEEEILLSQNQSSAKKPKSKAAPKKQPAKQSFDKELQEMQDMLQQLRLDKEKTEELL
        MA+ A+   TKKPRNSRKALK KN   E P S  S+                        K K+A       +SF+++L EMQ ML++++++K+KTEELL
Subjt:  MASTAEIPKTKKPRNSRKALKDKNSTPEEPQSESSMVTKVTQPSEEEILLSQNQSSAKKPKSKAAPKKQPAKQSFDKELQEMQDMLQQLRLDKEKTEELL

Query:  KAKDEMLKQKDEELKTRDMEQEKLQIELKKLQKLKEFKPTMNFPMIQ-ILKDKEQE---KKEKKKCSENKRPSPPYILWCKDQWNEIKKENPEAEFKEIS
        K KDE+L++K+EEL+TRD EQEKL++ELKKLQK+KEFKP M F   Q  L   EQE   KK+KK C E KRPS  Y+LWCKDQW E+KKENPEA+FKE S
Subjt:  KAKDEMLKQKDEELKTRDMEQEKLQIELKKLQKLKEFKPTMNFPMIQ-ILKDKEQE---KKEKKKCSENKRPSPPYILWCKDQWNEIKKENPEAEFKEIS

Query:  NILGAKWKNVTADEKKPYEERYQAEKQAYLQITSKEKRESEAMKLLEEEQKQKTAMELLDQYLQFKGEAEKENKKK-KKEKDPLKPKHPMSAFFLFSNER
        NILGAKWK+++A++KKPYEERYQ EK+AYLQ+ +KEKRE EAMKLLE++QKQ+TAMELLDQYL F  EAE++NKKK KKEKDPLKPKHP+SAF +++NER
Subjt:  NILGAKWKNVTADEKKPYEERYQAEKQAYLQITSKEKRESEAMKLLEEEQKQKTAMELLDQYLQFKGEAEKENKKK-KKEKDPLKPKHPMSAFFLFSNER

Query:  RASLLAENKNVLEVAKITGEEWKNMTEKQRGPYEEMARKNKEKYMQEMEIYKQQKEEEAAILKKEEEEQMKLHKHEALLLLKKKEKTETIIKKTKEERQK
        RA+L  ENK+V+EVAKITGEEWKN+++K++ PYE++A+KNKE Y+Q ME YK+ KEEEA   KKEEEE +KLHK EAL +LKKKEKT+ +IKK K  ++K
Subjt:  RASLLAENKNVLEVAKITGEEWKNMTEKQRGPYEEMARKNKEKYMQEMEIYKQQKEEEAAILKKEEEEQMKLHKHEALLLLKKKEKTETIIKKTKEERQK

Query:  KKKEGKKSVDPNKPKKPASSYILFSKEARKSVMEERPGANNSTVNALISVKWKELSERERKMWNDKAAEAMDAYRKELEEYNK
        K     ++VDPNKPKKPASSY LFSK+ RK + EERPG NN+TV ALIS+KWKELSE E++++N KAA+ M+AY+KE+E YNK
Subjt:  KKKEGKKSVDPNKPKKPASSYILFSKEARKSVMEERPGANNSTVNALISVKWKELSERERKMWNDKAAEAMDAYRKELEEYNK

Q9T012 High mobility group B protein 131.6e-13661.22Show/hide
Query:  KKPRNSRKALKDKNSTPE-EPQSESSMVTKVTQPSEEEILLSQNQSSAKKPKSKAAPKKQPAKQSFDKELQEMQDMLQQLRLDKEKTEELLKAKDEMLKQ
        KK RNSRKALK KN   E  P S+    TK                                  SF+K+L EMQ ML++++++KEKTE+LLK KDE+L++
Subjt:  KKPRNSRKALKDKNSTPE-EPQSESSMVTKVTQPSEEEILLSQNQSSAKKPKSKAAPKKQPAKQSFDKELQEMQDMLQQLRLDKEKTEELLKAKDEMLKQ

Query:  KDEELKTRDMEQEKLQIELKKLQKLKEFKPTMNFPMIQILKDKEQEKKEKKK---CSENKRPSPPYILWCKDQWNEIKKENPEAEFKEISNILGAKWKNV
        K       ++EQEKL+ ELKKLQK+KEFKP M F   Q L   E+EKK KKK   C+E KRPS PYILWCKD WNE+KK+NPEA+FKE SNILGAKWK +
Subjt:  KDEELKTRDMEQEKLQIELKKLQKLKEFKPTMNFPMIQILKDKEQEKKEKKK---CSENKRPSPPYILWCKDQWNEIKKENPEAEFKEISNILGAKWKNV

Query:  TADEKKPYEERYQAEKQAYLQITSKEKRESEAMKLLEEEQKQKTAMELLDQYLQFKGEAEKENKKK-KKEKDPLKPKHPMSAFFLFSNERRASLLAENKN
        +A+EKKPYEE+YQA+K+AYLQ+ +KEKRE EAMKLL++EQKQKTAMELLDQYL F  EAE +NKKK KK KDPLKPK P+SA+ +++NERRA+L  ENK+
Subjt:  TADEKKPYEERYQAEKQAYLQITSKEKRESEAMKLLEEEQKQKTAMELLDQYLQFKGEAEKENKKK-KKEKDPLKPKHPMSAFFLFSNERRASLLAENKN

Query:  VLEVAKITGEEWKNMTEKQRGPYEEMARKNKEKYMQEMEIYKQQKEEEAAILKKEEEEQMKLHKHEALLLLKKKEKTETIIKKTKEERQKKKKEGKKSVD
        V+EVAK+ GEEWKN++E+++ PY++MA+KNKE Y+QEME YK+ KEEEA   KKEEEE MKLHK EAL LLKKKEKT+ IIKKTKE  + KKK   ++VD
Subjt:  VLEVAKITGEEWKNMTEKQRGPYEEMARKNKEKYMQEMEIYKQQKEEEAAILKKEEEEQMKLHKHEALLLLKKKEKTETIIKKTKEERQKKKKEGKKSVD

Query:  PNKPKKPASSYILFSKEARKSVMEERPGANNSTVNALISVKWKELSERERKMWNDKAAEAMDAYRKELEEYNKTVVS
        PNKPKKP SSY LF K+ARKSV+EE PG NNSTV A IS+KW EL E E++++N KAAE M+AY+KE+EEYNKT  S
Subjt:  PNKPKKPASSYILFSKEARKSVMEERPGANNSTVNALISVKWKELSERERKMWNDKAAEAMDAYRKELEEYNKTVVS

Arabidopsis top hitse value%identityAlignment
AT3G61670.1 Protein of unknown function (DUF3133)6.5e-6928.71Show/hide
Query:  IRVLRCPRCENLLLVLFVEWDCPSMEPSVKERVLRSRRTVFSSSSIRTNDREDIDDYERRIGQDTKGVWSMQSLGDKEVGLVEETQRI-------ENWIR
        +R++RCP+CENL   L    D P  +      VLR++     + S+     ED          +   + S ++  D +V  +     +       +   +
Subjt:  IRVLRCPRCENLLLVLFVEWDCPSMEPSVKERVLRSRRTVFSSSSIRTNDREDIDDYERRIGQDTKGVWSMQSLGDKEVGLVEETQRI-------ENWIR

Query:  RNNIEH-DMDIYVNPMGAARTRAAFEHQRIERDAF---TGYSGNSLAIADRIGVPNFHYPSDRPSSSNVDRLYGHPESNQDYERPL-----DGLDPNRAE
         +++E  +  I +      ++++    Q    D F   T    +S ++ +R+      +P D  +SS+ +     P+S  ++++ L     + ++ +RA 
Subjt:  RNNIEH-DMDIYVNPMGAARTRAAFEHQRIERDAF---TGYSGNSLAIADRIGVPNFHYPSDRPSSSNVDRLYGHPESNQDYERPL-----DGLDPNRAE

Query:  LLRRLDELKDQIIKSCDVGDRPKVVERAAVDPYYGRATYNVPMQ------SSTRSPPHIYEPHYVDRGNGTFPAMGQHQRNGEDLLHPPRHVVKDIPLY-
        LLR+L+++K+Q+++SC+V    K  E+A             PM+       +   P + ++P +    N    A   H      L+HP        P++ 
Subjt:  LLRRLDELKDQIIKSCDVGDRPKVVERAAVDPYYGRATYNVPMQ------SSTRSPPHIYEPHYVDRGNGTFPAMGQHQRNGEDLLHPPRHVVKDIPLY-

Query:  ---EDRFQEQMKRKTN--------YPPRLPHEH---------YPESFMDLRA---PNSPISNASNPKESIKSSTYRNENPVTVGLTASNLQRAGRFPSQD
              F  Q     N        YP +  H H         Y   +    A   P++P +    P ES+        NP T G  +  LQ  GR+PS  
Subjt:  ---EDRFQEQMKRKTN--------YPPRLPHEH---------YPESFMDLRA---PNSPISNASNPKESIKSSTYRNENPVTVGLTASNLQRAGRFPSQD

Query:  ALPHSRQLSELDSEIDGFSPVRPRTSVVLRRNGKSRDAIAGGAPFIVCNSCLELLKLPRKLYKLEMDWQKLQCGACSVVIIVKVENRRLVVSVPAEAKPK
        +          D+++D  S +RP   VVL    +    +AGGAPFI C +C ELL+LP+K        QK++CGACS +I + V N + V+S        
Subjt:  ALPHSRQLSELDSEIDGFSPVRPRTSVVLRRNGKSRDAIAGGAPFIVCNSCLELLKLPRKLYKLEMDWQKLQCGACSVVIIVKVENRRLVVSVPAEAKPK

Query:  EVSPDEGSPKRVVNATSSLESSDNSSHKSISTDHNKPSD--------SRDSNLGESKTQELTSSLVPSTEKETLPTKDA-PSLTNSDNPSYDEPSKYREE
          S  +G  +   + TS  +  D   +   S D ++P D        S+D     S +  L+   + S      P  +A  +  +  + ++D        
Subjt:  EVSPDEGSPKRVVNATSSLESSDNSSHKSISTDHNKPSD--------SRDSNLGESKTQELTSSLVPSTEKETLPTKDA-PSLTNSDNPSYDEPSKYREE

Query:  SENNQDTV------------INDVTEPSELDVSFEDYS--NIHISQDSMEISKEEEEENQSKIKSNEESETFFVDLSKNNLRDFSRSSEITDNGRPTVSV
        S +  D V            + +V+  SE++V+F DYS  N  +S+D  + +K                ++ F  + K + +D ++S +  +  +  VS+
Subjt:  SENNQDTV------------INDVTEPSELDVSFEDYS--NIHISQDSMEISKEEEEENQSKIKSNEESETFFVDLSKNNLRDFSRSSEITDNGRPTVSV

Query:  NGQPLPAHVVKKAEKLAGPILPGDYWYDYQAGFWGVMGHPCLGIIPPFIDEFTYPMSRNCAGGNTGVFVNGRELHKRDLELLSSRGLPTTTNKLYRIDIS
        NG PL   +++KAEK AG I PG+YWYDY+AGFWGVMG P LGI+PPFI+E  YPM  NC+GG TGVFVNGRELH++DL+LL+ RGLP   ++ Y +DI+
Subjt:  NGQPLPAHVVKKAEKLAGPILPGDYWYDYQAGFWGVMGHPCLGIIPPFIDEFTYPMSRNCAGGNTGVFVNGRELHKRDLELLSSRGLPTTTNKLYRIDIS

Query:  GRVVDDDSGKELYNLGKLAPTV
        GRV+D+D+G+EL  LGKLAPT+
Subjt:  GRVVDDDSGKELYNLGKLAPTV

AT4G11080.1 HMG (high mobility group) box protein1.1e-13761.22Show/hide
Query:  KKPRNSRKALKDKNSTPE-EPQSESSMVTKVTQPSEEEILLSQNQSSAKKPKSKAAPKKQPAKQSFDKELQEMQDMLQQLRLDKEKTEELLKAKDEMLKQ
        KK RNSRKALK KN   E  P S+    TK                                  SF+K+L EMQ ML++++++KEKTE+LLK KDE+L++
Subjt:  KKPRNSRKALKDKNSTPE-EPQSESSMVTKVTQPSEEEILLSQNQSSAKKPKSKAAPKKQPAKQSFDKELQEMQDMLQQLRLDKEKTEELLKAKDEMLKQ

Query:  KDEELKTRDMEQEKLQIELKKLQKLKEFKPTMNFPMIQILKDKEQEKKEKKK---CSENKRPSPPYILWCKDQWNEIKKENPEAEFKEISNILGAKWKNV
        K       ++EQEKL+ ELKKLQK+KEFKP M F   Q L   E+EKK KKK   C+E KRPS PYILWCKD WNE+KK+NPEA+FKE SNILGAKWK +
Subjt:  KDEELKTRDMEQEKLQIELKKLQKLKEFKPTMNFPMIQILKDKEQEKKEKKK---CSENKRPSPPYILWCKDQWNEIKKENPEAEFKEISNILGAKWKNV

Query:  TADEKKPYEERYQAEKQAYLQITSKEKRESEAMKLLEEEQKQKTAMELLDQYLQFKGEAEKENKKK-KKEKDPLKPKHPMSAFFLFSNERRASLLAENKN
        +A+EKKPYEE+YQA+K+AYLQ+ +KEKRE EAMKLL++EQKQKTAMELLDQYL F  EAE +NKKK KK KDPLKPK P+SA+ +++NERRA+L  ENK+
Subjt:  TADEKKPYEERYQAEKQAYLQITSKEKRESEAMKLLEEEQKQKTAMELLDQYLQFKGEAEKENKKK-KKEKDPLKPKHPMSAFFLFSNERRASLLAENKN

Query:  VLEVAKITGEEWKNMTEKQRGPYEEMARKNKEKYMQEMEIYKQQKEEEAAILKKEEEEQMKLHKHEALLLLKKKEKTETIIKKTKEERQKKKKEGKKSVD
        V+EVAK+ GEEWKN++E+++ PY++MA+KNKE Y+QEME YK+ KEEEA   KKEEEE MKLHK EAL LLKKKEKT+ IIKKTKE  + KKK   ++VD
Subjt:  VLEVAKITGEEWKNMTEKQRGPYEEMARKNKEKYMQEMEIYKQQKEEEAAILKKEEEEQMKLHKHEALLLLKKKEKTETIIKKTKEERQKKKKEGKKSVD

Query:  PNKPKKPASSYILFSKEARKSVMEERPGANNSTVNALISVKWKELSERERKMWNDKAAEAMDAYRKELEEYNKTVVS
        PNKPKKP SSY LF K+ARKSV+EE PG NNSTV A IS+KW EL E E++++N KAAE M+AY+KE+EEYNKT  S
Subjt:  PNKPKKPASSYILFSKEARKSVMEERPGANNSTVNALISVKWKELSERERKMWNDKAAEAMDAYRKELEEYNKTVVS

AT4G23800.1 HMG (high mobility group) box protein8.9e-14361.7Show/hide
Query:  MASTAEIPKTKKPRNSRKALKDKNSTPEEPQSESSMVTKVTQPSEEEILLSQNQSSAKKPKSKAAPKKQPAKQSFDKELQEMQDMLQQLRLDKEKTEELL
        MA+ A+   TKKPRNSRKALK KN   E P S  S+                        K K+A       +SF+++L EMQ ML++++++K+KTEELL
Subjt:  MASTAEIPKTKKPRNSRKALKDKNSTPEEPQSESSMVTKVTQPSEEEILLSQNQSSAKKPKSKAAPKKQPAKQSFDKELQEMQDMLQQLRLDKEKTEELL

Query:  KAKDEMLKQKDEELKTRDMEQEKLQIELKKLQKLKEFKPTMNFPMIQ-ILKDKEQE---KKEKKKCSENKRPSPPYILWCKDQWNEIKKENPEAEFKEIS
        K KDE+L++K+EEL+TRD EQEKL++ELKKLQK+KEFKP M F   Q  L   EQE   KK+KK C E KRPS  Y+LWCKDQW E+KKENPEA+FKE S
Subjt:  KAKDEMLKQKDEELKTRDMEQEKLQIELKKLQKLKEFKPTMNFPMIQ-ILKDKEQE---KKEKKKCSENKRPSPPYILWCKDQWNEIKKENPEAEFKEIS

Query:  NILGAKWKNVTADEKKPYEERYQAEKQAYLQITSKEKRESEAMKLLEEEQKQKTAMELLDQYLQFKGEAEKENKKK-KKEKDPLKPKHPMSAFFLFSNER
        NILGAKWK+++A++KKPYEERYQ EK+AYLQ+ +KEKRE EAMKLLE++QKQ+TAMELLDQYL F  EAE++NKKK KKEKDPLKPKHP+SAF +++NER
Subjt:  NILGAKWKNVTADEKKPYEERYQAEKQAYLQITSKEKRESEAMKLLEEEQKQKTAMELLDQYLQFKGEAEKENKKK-KKEKDPLKPKHPMSAFFLFSNER

Query:  RASLLAENKNVLEVAKITGEEWKNMTEKQRGPYEEMARKNKEKYMQEMEIYKQQKEEEAAILKKEEEEQMKLHKHEALLLLKKKEKTETIIKKTKEERQK
        RA+L  ENK+V+EVAKITGEEWKN+++K++ PYE++A+KNKE Y+Q ME YK+ KEEEA   KKEEEE +KLHK EAL +LKKKEKT+ +IKK K  ++K
Subjt:  RASLLAENKNVLEVAKITGEEWKNMTEKQRGPYEEMARKNKEKYMQEMEIYKQQKEEEAAILKKEEEEQMKLHKHEALLLLKKKEKTETIIKKTKEERQK

Query:  KKKEGKKSVDPNKPKKPASSYILFSKEARKSVMEERPGANNSTVNALISVKWKELSERERKMWNDKAAEAMDAYRKELEEYNK
        K     ++VDPNKPKKPASSY LFSK+ RK + EERPG NN+TV ALIS+KWKELSE E++++N KAA+ M+AY+KE+E YNK
Subjt:  KKKEGKKSVDPNKPKKPASSYILFSKEARKSVMEERPGANNSTVNALISVKWKELSERERKMWNDKAAEAMDAYRKELEEYNK

AT4G23800.2 HMG (high mobility group) box protein2.0e-14261.49Show/hide
Query:  MASTAEIPKTKKPRNSRKALKDKNSTPEEPQSESSMVTKVTQPSEEEILLSQNQSSAKKPKSKAAPKKQPAKQSFDKELQEMQDMLQQLRLDKEKTEELL
        MA+ A+   TKKPRNSRKALK KN   E P S  S+                        K K+A       +SF+++L EMQ ML++++++K+KTEELL
Subjt:  MASTAEIPKTKKPRNSRKALKDKNSTPEEPQSESSMVTKVTQPSEEEILLSQNQSSAKKPKSKAAPKKQPAKQSFDKELQEMQDMLQQLRLDKEKTEELL

Query:  KAKDEMLKQKDEELKTRDMEQEKLQIELKKLQKLKEFKPTMNFPMIQ-ILKDKEQE---KKEKKKCSENKRPSPPYILWCKDQWNEIKKENPEAEFKEIS
        K KDE+L++K+EEL+TRD EQEKL++ELKKLQK+KEFKP M F   Q  L   EQE   KK+KK C E KRPS  Y+LWCKDQW E+KKENPEA+FKE S
Subjt:  KAKDEMLKQKDEELKTRDMEQEKLQIELKKLQKLKEFKPTMNFPMIQ-ILKDKEQE---KKEKKKCSENKRPSPPYILWCKDQWNEIKKENPEAEFKEIS

Query:  NILGAKWKNVTADEKKPYEERYQAEKQAYLQITSKEKRESEAMKLLEEEQKQKTAMELLDQYLQFKGEAEKENKKK-KKEKDPLKPKHPMSAFFLFSNER
        NILGAKWK+++A++KKPYEERYQ EK+AYLQ+ +KEKRE EAMKLLE++QKQ+TAMELLDQYL F  EAE++NKKK KKEKDPLKPKHP+SAF +++NER
Subjt:  NILGAKWKNVTADEKKPYEERYQAEKQAYLQITSKEKRESEAMKLLEEEQKQKTAMELLDQYLQFKGEAEKENKKK-KKEKDPLKPKHPMSAFFLFSNER

Query:  RASLLAENKNVLEVAKITGEEWKNMTEKQRGPYEEMARKNKEKYMQEMEIYKQQKEEEAAILKKEEEEQMKLHKHEALLLLKKKEKTETIIKKTKEERQK
        RA+L  ENK+V+EVAKITGEEWKN+++K++ PYE++A+KNKE Y+Q ME YK+ KEEEA   KKEEEE +KLHK EAL +LKKKEKT+ +IKK K E   
Subjt:  RASLLAENKNVLEVAKITGEEWKNMTEKQRGPYEEMARKNKEKYMQEMEIYKQQKEEEAAILKKEEEEQMKLHKHEALLLLKKKEKTETIIKKTKEERQK

Query:  KKKEGKKSVDPNKPKKPASSYILFSKEARKSVMEERPGANNSTVNALISVKWKELSERERKMWNDKAAEAMDAYRKELEEYNK
               +VDPNKPKKPASSY LFSK+ RK + EERPG NN+TV ALIS+KWKELSE E++++N KAA+ M+AY+KE+E YNK
Subjt:  KKKEGKKSVDPNKPKKPASSYILFSKEARKSVMEERPGANNSTVNALISVKWKELSERERKMWNDKAAEAMDAYRKELEEYNK

AT5G41800.1 Transmembrane amino acid transporter family protein5.8e-12665.95Show/hide
Query:  VLDHCEKAGRRHIRFRELAADVLGSGWMFYFVIFIQTAINTGVGIGAILLSGQCLQIIYSNLYPNGSMKLYEFIAIVTGVMIILSQLPTFHSLRHVSLAS
        VLDHCEK+GRRHIRFRELAADVLGSG MFY VIFIQTAINTG+GIGAILL+GQCL I+YS+L+P G++KLYEFIA+VT VM++LSQLP+FHSLRH++ AS
Subjt:  VLDHCEKAGRRHIRFRELAADVLGSGWMFYFVIFIQTAINTGVGIGAILLSGQCLQIIYSNLYPNGSMKLYEFIAIVTGVMIILSQLPTFHSLRHVSLAS

Query:  LLLSLGYAFLIIAAC--------------------------------IIA------------------ASGKMVKGLLMCYCVIFVTFYAIAASGYWVFG
        LLLSLGY FL++ AC                                IIA                  A+GKM+KGLL+CY VIF TFY+ A SGYWVFG
Subjt:  LLLSLGYAFLIIAAC--------------------------------IIA------------------ASGKMVKGLLMCYCVIFVTFYAIAASGYWVFG

Query:  NRASSNILLSLTPDTGPPLAPAWILGLAVIFVLLQLLAIGLVYSQVAYEIMEKQSADTKKGMFSKRNLIPRLILRSIYMIICGFFAAMLPFFGDISAVVG
        N +SSNIL +L PD GP LAP  ++GLAVIFVLLQL AIGLVYSQVAYEIMEK+SADT KG+FSKRNL+PRLILR++YM  CGF AAMLPFFGDI+AVVG
Subjt:  NRASSNILLSLTPDTGPPLAPAWILGLAVIFVLLQLLAIGLVYSQVAYEIMEKQSADTKKGMFSKRNLIPRLILRSIYMIICGFFAAMLPFFGDISAVVG

Query:  AICFIPLDFILPMLLYNITHNPPKSSLTYSINLAIIVVFTGVGLLGSFSSIRKLVLDTSKFKLFSNDVVD
        A  FIPLDF+LPMLLYN+T+ P + S TY IN+ I+VVFT  GL+G+FSSIRKLVLD +KFKLFS++VVD
Subjt:  AICFIPLDFILPMLLYNITHNPPKSSLTYSINLAIIVVFTGVGLLGSFSSIRKLVLDTSKFKLFSNDVVD


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCTACGTGCCGGATCCAGGAATATCACTTTTGGTTGATCCGCCATGAGGAAGACGGAGGAAAGAGTAAAGTGAATTTTCAGGAGATGACAAAGCACCGGTTCAGGAA
TTCTTTGAAGTCTCTATTTGGGAGTTACCTTGATCCAGAAACAAATGAACGGCTCAGAGGAAATAAATCAGTTATCGAGGACAAGGTGAATAAAATTAGACAACTCATAA
AAGGTGAAGATCTAGGAGTAGAAGACCACGACCAATCAGAAACTCGCAAGAAACAGTCCATTGATGAATTATTTGATGATTTCCTTAACGTTTATCAGGCCCTCTATGAA
CAATACGACAGCCTAACCGGAGAGTTGAGAAGAAAATTTCAAAAAAGAAGAGAAAAGGAAAGCTCTTCATCTAGTTCATCCGACTCGGATTCAGATGATTCTTCAAAGAA
AAAGGTCAGCAAAGATGATCGAGGGTTGGAGAGAGAATTCCAAGAAGTTGGTGAAATCAAGCAGGAACTTGACGCAGCACTTTCAGAAGTAGCCGATTTGAAAAGGATAT
TGGCAACTACAATTAAAGAACATGAATCCCTAAATACAGAACATCTGACAGCTTTAAGTAAGATACAAGAAGCAGATGGAATTATTAGAGATTTGAAGGTTGAAGCTGAA
ATTTGGGACTCTCAGAAGTCTAAATTTCAGCTTGAAATCGAAGAACTGAACCTGGCGTTAAGTAATGCTGGTAGGAATGAATCTGAGTTGAATGAGAGATTAAAAGGTAT
GGAAACAGAGATGAATAACTACATTGAAGAAAAGGAGACTGCAAGGAGGAAGATTGAAGAGGGGGAAAAAACTATAGATGAATTGAAGGCTTTGGCTGATCAGTTGAAGG
AGAAGTTGTCAGCCACAATGGAAGAAAAGGAAGCTCTGAACTCACAGCACTTGAAAACTTTAAGTAGGGTACATGAAGCAGATATGATCACAAGAGATTTGAAGGTTGAA
TCAGAAACCTGGGGTGGTGAAAAATCTAAATTTCTTCTCGAGATTGAAGAGCTGAATCAGAAACTGGGTGCTGCCGGAAAATTGGAAGCGCAATTGAATGAAAGGTTGAA
GGATATTGGAATTGAAAACGAATATTTGATCAAGGAAAAGGAATCTGCGCAGAGGACGATTGAAGAGATGAGTCAGAGGCTGAGCAATGCTGTTAAGATAGAAGCAGAAC
TCAATGGAAGATTGAAAGATATTGAAACTGAGAAAGATGGGTTGATCAAGGAAAAGGAGATTGCATGGAAGGAGATTGAACAAGGTAAACGAGTTATAGAAGAATTAAAC
GCCATGGTTGATCAACTGAACAGCCAATTGACAATTACAGTAGAAGAAAAGAAATCTCTCAATTTACAACTTGAAAAGGAGAAAGTTGAGTTGTTGAGATCAATTGCTGA
TCATCAAAGAAATCTGAAGGAACACGAGGATGCATACAAGAAGCTAAATGATGAGTTTCAAGAATGTAAGCTAAAGCTTGATAATGCAGAAATGATGATGGCAGAAATGA
GTCGAGAGTTTCTTAATGACATTAGATCAAAGGAGCAAGTGAAAGATGACCTGGAGCTAATGGTTGAGGATCTTAAAAGAGAGCTGGAAGTAAAAAGTGATGAGATAAAT
AGCCTAGTTGAAAACGCTCGCACGATCGAAGTCAAGCTTCGGTTATCAAACCAGAAGCTTCGTGTTACAGAACAATTATTAACTGAAAAGGAAGAGATTTTTCGGAAAGC
TGAATTGAAATATCAAAAGGAACGGAAATTGCTCGAGGAAAGAATTCATGGACTATCTGCAACAGTTGTTACTAACAAAGTAACGTATCAAAAGACGATATCAACTGTTT
CAGAAAACATTAACAGTAACCTCTCCCAACTGGAATGTGTCATCAGGAAATTCATATTGGAGTATGCAAAGTATGAGAAGTGTGTTATGGAGACATCCCGCGATCTACGG
CTTACAAAGAGTTGGGTTTCAAATGCTATTGAAGAAACAGAAGGCCTAAAAAAAGAGGTGGCAGACCTTGGGAAACAACTTGAAGATAAGAAAGAGAGGGAATCGATATT
AGTACAACAGGTTGAGAAGTTGGAGATTAAGGCCAACAAGGAAGGATCTGAGAAGGATGGATTGGTTGAAGCAATCCACGAACTTGAAAAGAGACAGACAGAATTGGAGA
AGTTGATGGAAGAGAAGAATGAAGGTATGGTGGGTCTGAAAGAGGAGAAGAAAGAAGCGATAAGGCAACTTTGCATGTTGATAGAGTATCATCGGGACCGCTATGATTTT
CTCAAGGATGAGGTCTTGAAGTTGAATGTTAAAGGAGGCCAGAGTATAATTCGAGTTCTTCGTTGTCCCAGATGCGAGAATCTCTTGCTTGTACTGTTCGTAGAGTGGGA
CTGTCCAAGCATGGAACCATCTGTAAAAGAGAGGGTCTTAAGGTCTAGAAGGACTGTCTTTAGCAGTAGTTCAATTAGAACAAATGATAGAGAGGATATAGATGATTATG
AGAGGAGAATTGGGCAGGACACCAAGGGGGTTTGGTCTATGCAGAGTCTTGGTGATAAAGAAGTTGGTTTAGTTGAAGAAACACAACGTATCGAGAATTGGATTCGACGA
AATAATATCGAACATGACATGGATATTTATGTTAATCCTATGGGAGCAGCGAGAACCCGAGCGGCTTTTGAGCATCAAAGAATTGAAAGAGATGCATTCACAGGGTATTC
TGGAAACTCCTTGGCTATTGCTGATAGAATAGGTGTTCCGAATTTCCATTATCCTAGTGATCGACCTTCGAGTTCTAATGTGGATCGGCTCTATGGTCATCCGGAGTCTA
ATCAGGACTATGAACGTCCTTTAGACGGTTTAGACCCGAATCGAGCTGAACTACTTAGAAGGTTGGATGAGTTGAAAGATCAAATTATCAAGTCTTGTGATGTGGGAGAT
AGACCAAAAGTTGTTGAGAGAGCTGCAGTCGATCCGTACTATGGTCGAGCTACTTATAATGTCCCGATGCAATCTTCGACAAGAAGCCCGCCGCATATCTATGAGCCTCA
TTACGTGGATCGGGGCAATGGAACCTTTCCAGCAATGGGTCAACATCAGAGAAATGGTGAGGATTTGTTGCATCCTCCAAGGCATGTTGTGAAAGATATACCATTGTATG
AGGATCGGTTTCAGGAGCAAATGAAAAGAAAGACAAACTATCCTCCACGGCTTCCTCACGAACACTATCCAGAAAGCTTCATGGATTTGAGAGCGCCAAACTCTCCTATA
AGCAATGCAAGCAATCCAAAAGAGTCAATCAAATCTAGCACGTATCGTAACGAGAATCCTGTAACAGTTGGACTTACGGCTTCCAATCTACAACGTGCTGGTCGATTTCC
ATCTCAAGATGCACTTCCACACTCGAGACAACTTAGTGAGCTTGATTCAGAGATTGATGGTTTCAGTCCGGTTCGACCAAGAACATCTGTAGTTTTGCGAAGAAATGGAA
AATCTCGAGATGCCATCGCTGGTGGTGCTCCATTCATTGTGTGCAATAGTTGCTTGGAATTGCTAAAACTTCCCAGAAAACTTTACAAGTTGGAGATGGATTGGCAGAAA
CTACAGTGTGGTGCTTGTTCGGTTGTCATTATCGTAAAAGTCGAAAACAGAAGGCTTGTTGTTAGCGTTCCAGCTGAAGCCAAGCCCAAAGAAGTTTCTCCTGACGAGGG
TTCCCCCAAAAGAGTTGTCAATGCTACCAGCTCTTTAGAAAGCTCTGATAATTCTAGTCACAAATCGATCAGTACCGACCACAACAAGCCCTCGGACAGTCGGGATTCAA
ATCTTGGTGAATCTAAAACGCAGGAGCTAACTTCGTCTCTCGTTCCTTCCACGGAAAAAGAGACCCTGCCTACAAAAGATGCACCATCTTTAACAAATTCTGATAACCCT
TCTTATGATGAACCTAGCAAATACAGAGAAGAAAGCGAAAATAATCAGGATACTGTGATAAACGACGTCACCGAACCAAGTGAGTTGGACGTCTCGTTCGAAGACTATTC
GAACATTCATATTTCTCAAGATTCTATGGAAATAAGCAAAGAAGAAGAAGAAGAAAATCAAAGCAAGATCAAAAGCAATGAAGAATCTGAAACCTTTTTTGTGGATCTCA
GCAAGAACAACTTAAGAGATTTTTCAAGATCAAGTGAAATCACTGATAATGGAAGGCCTACTGTTTCTGTTAATGGCCAGCCTCTACCAGCTCATGTTGTCAAAAAGGCT
GAAAAGCTTGCTGGGCCCATTCTCCCAGGAGATTATTGGTATGATTATCAAGCTGGATTCTGGGGTGTAATGGGACATCCATGTCTTGGCATTATTCCTCCATTTATCGA
CGAGTTCACGTATCCAATGTCAAGGAACTGTGCTGGTGGAAACACTGGAGTTTTTGTCAATGGGAGAGAGCTTCATAAAAGGGATTTAGAGTTGCTGTCTAGCAGAGGGT
TGCCTACTACTACGAATAAATTATATAGAATCGACATTTCGGGAAGAGTCGTAGATGACGATTCTGGAAAAGAGTTATATAACCTGGGAAAACTCGCCCCGACAGTTTTT
ACAATACGCCTGAAGCCCTCCCCTGCTCTCTGCTCTGCTCCAGTTCCCATATCTCCGATGGCTTCCACTGCTGAAATTCCGAAGACAAAGAAACCCAGGAACAGCCGGAA
GGCTCTGAAGGACAAAAACTCTACACCGGAGGAGCCACAATCTGAATCCTCCATGGTTACGAAAGTGACACAGCCATCGGAAGAGGAGATCCTCCTATCTCAGAATCAAT
CTTCGGCTAAGAAACCGAAATCCAAAGCTGCGCCGAAGAAGCAGCCGGCGAAGCAGTCCTTCGACAAAGAGTTGCAGGAAATGCAGGACATGCTTCAACAGTTGAGGCTC
GATAAGGAGAAGACTGAGGAGCTTTTGAAAGCCAAAGATGAGATGCTTAAGCAGAAGGATGAAGAACTTAAGACGAGAGATATGGAGCAGGAAAAGCTCCAGATCGAATT
GAAGAAGTTGCAGAAGTTGAAGGAGTTCAAACCTACTATGAACTTCCCTATGATTCAAATTTTGAAAGATAAGGAACAAGAGAAGAAAGAGAAGAAGAAGTGCTCGGAAA
ACAAGAGGCCATCTCCACCTTACATCTTATGGTGCAAAGATCAATGGAACGAGATCAAGAAGGAGAATCCAGAGGCAGAGTTCAAGGAGATCTCAAACATTTTGGGGGCA
AAGTGGAAGAATGTCACTGCAGATGAGAAGAAGCCATATGAGGAGAGGTATCAGGCTGAAAAACAAGCCTATTTGCAAATCACTTCTAAAGAGAAGCGTGAGAGTGAGGC
GATGAAGCTGTTAGAAGAGGAGCAGAAGCAGAAGACAGCCATGGAGCTGCTTGATCAATACCTTCAATTCAAAGGGGAGGCTGAGAAAGAGAACAAGAAGAAGAAGAAAG
AGAAGGATCCATTGAAGCCCAAGCATCCCATGTCTGCATTCTTTCTCTTTTCAAATGAGAGGCGTGCATCCCTTCTTGCCGAAAACAAGAATGTTCTAGAGGTAGCGAAG
ATAACAGGTGAGGAGTGGAAGAACATGACAGAGAAACAGAGAGGTCCTTACGAAGAGATGGCGAGGAAGAACAAGGAAAAATACATGCAGGAAATGGAAATATACAAGCA
GCAAAAGGAGGAGGAAGCAGCAATCCTCAAGAAGGAAGAGGAAGAACAAATGAAGCTTCATAAACATGAAGCTCTGCTGTTGCTAAAGAAGAAAGAGAAAACCGAGACTA
TTATAAAGAAAACAAAAGAGGAGCGGCAGAAGAAGAAGAAGGAAGGGAAGAAGAGTGTTGATCCTAACAAGCCTAAGAAGCCTGCATCCTCTTACATCCTGTTCAGCAAA
GAAGCTAGGAAAAGTGTAATGGAGGAGAGGCCAGGAGCCAACAATTCCACAGTGAATGCACTGATTTCAGTGAAATGGAAGGAACTAAGTGAAAGGGAGAGGAAGATGTG
GAATGATAAAGCTGCAGAAGCCATGGATGCTTACAGAAAGGAATTGGAGGAATACAACAAAACTGTAGTTTCGCCTCCCTCTCCGTACCGGCCTTGGATTCCACCACCGC
CCGCCGACAGCGACAACTACGATGGGGATTCAGCTTCCGGTCACCGAAGACCGAATCTCCGACGCCGGAGCATCATTCGTCCTCCAATCCAAAGGGGAATGGTGGCATGC
GGGATTCCATTTGACGACGGCGATTGTCGGACCGACGATTCTAACGCTGCCGTTCTCGATCACTGCGAGAAGGCCGGTCGCCGTCACATCCGATTCCGGGAACTCGCCGC
CGACGTATTAGGATCTGGATGGATGTTTTACTTCGTAATATTCATCCAAACCGCGATCAATACTGGAGTTGGAATTGGAGCAATCTTGCTCTCCGGGCAGTGCCTTCAGA
TAATATATTCAAACCTTTACCCAAATGGATCCATGAAACTGTACGAGTTCATAGCAATAGTAACAGGAGTGATGATCATTCTGTCTCAGCTTCCAACCTTCCACTCTCTT
AGACATGTCAGTCTGGCTTCTCTGCTTCTCAGCTTGGGCTACGCATTTCTTATTATTGCTGCTTGTATCATTGCAGCGAGTGGGAAGATGGTGAAAGGGCTTTTGATGTG
TTACTGTGTGATATTTGTAACTTTCTACGCCATTGCAGCATCTGGATATTGGGTGTTTGGAAACAGGGCAAGCTCCAATATTCTGCTGAGCCTGACGCCGGACACTGGAC
CTCCATTGGCTCCCGCTTGGATTCTTGGGCTCGCTGTCATCTTTGTTCTTCTTCAACTCCTCGCCATTGGACTGGTGTATTCACAAGTGGCATACGAAATAATGGAGAAG
CAATCAGCTGACACGAAGAAAGGAATGTTTTCCAAAAGGAACCTTATTCCAAGGCTCATTCTTCGCTCAATATACATGATCATCTGTGGCTTTTTTGCTGCTATGCTTCC
ATTCTTTGGTGACATTAGTGCCGTGGTGGGTGCTATTTGCTTCATTCCTCTTGATTTCATTCTACCAATGCTTCTCTATAACATCACCCACAATCCTCCCAAATCCTCCC
TCACCTATTCTATCAACCTCGCCATTATTGTCGTATTCACCGGGGTTGGACTCTTGGGTTCGTTCTCTTCTATACGAAAGCTCGTTCTTGATACTTCCAAGTTCAAGCTC
TTCAGTAATGATGTTGTCGATTGA
mRNA sequenceShow/hide mRNA sequence
CCCGTCTTCCTCTGATTTTCAGGTGATTTTCTTCTTTTTTCTTAATTCTTCTTTCAATTTGATCGTTTCGTTTCTACTGCAGTTCATCTTCTGATTTCGTTGAAATTTTG
CCTTATAACTTGATGTTGGAGTTCGATTCGTTGTGTAATTTTCTGGTGTTTGTTCTCTTTACATCGATTGTCTCATTTCTTGTTTCTGGAATTTTCGGATTGATCGGATC
CTTGTGTTTGGATCGAGGTTGATGATTGCTTGAGTTCGAAATGTTTTTGGATGATTTCTTTTCTCTTTCGATGTTTATTTGGTTTGATTGGATCTTCTGTTGAGTTTAAA
TTGATAGTTTCGGATAGCGGATGGACTTTTTTTTAATTGAGTTGAAGCGATTACTGCTCCTATTTTAAAGCGAATCTGATGGTTTTAGGATTTCTGTATGAATCTATTTC
GCTGTATACTCGATCACTTCAAAACTTTTCCTGCTGAAACAGAAAAAGTTCTCCTCACAACTATTTTGGTTCCGCTAATTTGTTAGAGTTGATTGGAATTTTGTGCTTAT
TTTCCTCTGTAACCGATTCTTGTTTCTCGATGTCTACGTGCCGGATCCAGGAATATCACTTTTGGTTGATCCGCCATGAGGAAGACGGAGGAAAGAGTAAAGTGAATTTT
CAGGAGATGACAAAGCACCGGTTCAGGAATTCTTTGAAGTCTCTATTTGGGAGTTACCTTGATCCAGAAACAAATGAACGGCTCAGAGGAAATAAATCAGTTATCGAGGA
CAAGGTGAATAAAATTAGACAACTCATAAAAGGTGAAGATCTAGGAGTAGAAGACCACGACCAATCAGAAACTCGCAAGAAACAGTCCATTGATGAATTATTTGATGATT
TCCTTAACGTTTATCAGGCCCTCTATGAACAATACGACAGCCTAACCGGAGAGTTGAGAAGAAAATTTCAAAAAAGAAGAGAAAAGGAAAGCTCTTCATCTAGTTCATCC
GACTCGGATTCAGATGATTCTTCAAAGAAAAAGGTCAGCAAAGATGATCGAGGGTTGGAGAGAGAATTCCAAGAAGTTGGTGAAATCAAGCAGGAACTTGACGCAGCACT
TTCAGAAGTAGCCGATTTGAAAAGGATATTGGCAACTACAATTAAAGAACATGAATCCCTAAATACAGAACATCTGACAGCTTTAAGTAAGATACAAGAAGCAGATGGAA
TTATTAGAGATTTGAAGGTTGAAGCTGAAATTTGGGACTCTCAGAAGTCTAAATTTCAGCTTGAAATCGAAGAACTGAACCTGGCGTTAAGTAATGCTGGTAGGAATGAA
TCTGAGTTGAATGAGAGATTAAAAGGTATGGAAACAGAGATGAATAACTACATTGAAGAAAAGGAGACTGCAAGGAGGAAGATTGAAGAGGGGGAAAAAACTATAGATGA
ATTGAAGGCTTTGGCTGATCAGTTGAAGGAGAAGTTGTCAGCCACAATGGAAGAAAAGGAAGCTCTGAACTCACAGCACTTGAAAACTTTAAGTAGGGTACATGAAGCAG
ATATGATCACAAGAGATTTGAAGGTTGAATCAGAAACCTGGGGTGGTGAAAAATCTAAATTTCTTCTCGAGATTGAAGAGCTGAATCAGAAACTGGGTGCTGCCGGAAAA
TTGGAAGCGCAATTGAATGAAAGGTTGAAGGATATTGGAATTGAAAACGAATATTTGATCAAGGAAAAGGAATCTGCGCAGAGGACGATTGAAGAGATGAGTCAGAGGCT
GAGCAATGCTGTTAAGATAGAAGCAGAACTCAATGGAAGATTGAAAGATATTGAAACTGAGAAAGATGGGTTGATCAAGGAAAAGGAGATTGCATGGAAGGAGATTGAAC
AAGGTAAACGAGTTATAGAAGAATTAAACGCCATGGTTGATCAACTGAACAGCCAATTGACAATTACAGTAGAAGAAAAGAAATCTCTCAATTTACAACTTGAAAAGGAG
AAAGTTGAGTTGTTGAGATCAATTGCTGATCATCAAAGAAATCTGAAGGAACACGAGGATGCATACAAGAAGCTAAATGATGAGTTTCAAGAATGTAAGCTAAAGCTTGA
TAATGCAGAAATGATGATGGCAGAAATGAGTCGAGAGTTTCTTAATGACATTAGATCAAAGGAGCAAGTGAAAGATGACCTGGAGCTAATGGTTGAGGATCTTAAAAGAG
AGCTGGAAGTAAAAAGTGATGAGATAAATAGCCTAGTTGAAAACGCTCGCACGATCGAAGTCAAGCTTCGGTTATCAAACCAGAAGCTTCGTGTTACAGAACAATTATTA
ACTGAAAAGGAAGAGATTTTTCGGAAAGCTGAATTGAAATATCAAAAGGAACGGAAATTGCTCGAGGAAAGAATTCATGGACTATCTGCAACAGTTGTTACTAACAAAGT
AACGTATCAAAAGACGATATCAACTGTTTCAGAAAACATTAACAGTAACCTCTCCCAACTGGAATGTGTCATCAGGAAATTCATATTGGAGTATGCAAAGTATGAGAAGT
GTGTTATGGAGACATCCCGCGATCTACGGCTTACAAAGAGTTGGGTTTCAAATGCTATTGAAGAAACAGAAGGCCTAAAAAAAGAGGTGGCAGACCTTGGGAAACAACTT
GAAGATAAGAAAGAGAGGGAATCGATATTAGTACAACAGGTTGAGAAGTTGGAGATTAAGGCCAACAAGGAAGGATCTGAGAAGGATGGATTGGTTGAAGCAATCCACGA
ACTTGAAAAGAGACAGACAGAATTGGAGAAGTTGATGGAAGAGAAGAATGAAGGTATGGTGGGTCTGAAAGAGGAGAAGAAAGAAGCGATAAGGCAACTTTGCATGTTGA
TAGAGTATCATCGGGACCGCTATGATTTTCTCAAGGATGAGGTCTTGAAGTTGAATGTTAAAGGAGGCCAGAGTATAATTCGAGTTCTTCGTTGTCCCAGATGCGAGAAT
CTCTTGCTTGTACTGTTCGTAGAGTGGGACTGTCCAAGCATGGAACCATCTGTAAAAGAGAGGGTCTTAAGGTCTAGAAGGACTGTCTTTAGCAGTAGTTCAATTAGAAC
AAATGATAGAGAGGATATAGATGATTATGAGAGGAGAATTGGGCAGGACACCAAGGGGGTTTGGTCTATGCAGAGTCTTGGTGATAAAGAAGTTGGTTTAGTTGAAGAAA
CACAACGTATCGAGAATTGGATTCGACGAAATAATATCGAACATGACATGGATATTTATGTTAATCCTATGGGAGCAGCGAGAACCCGAGCGGCTTTTGAGCATCAAAGA
ATTGAAAGAGATGCATTCACAGGGTATTCTGGAAACTCCTTGGCTATTGCTGATAGAATAGGTGTTCCGAATTTCCATTATCCTAGTGATCGACCTTCGAGTTCTAATGT
GGATCGGCTCTATGGTCATCCGGAGTCTAATCAGGACTATGAACGTCCTTTAGACGGTTTAGACCCGAATCGAGCTGAACTACTTAGAAGGTTGGATGAGTTGAAAGATC
AAATTATCAAGTCTTGTGATGTGGGAGATAGACCAAAAGTTGTTGAGAGAGCTGCAGTCGATCCGTACTATGGTCGAGCTACTTATAATGTCCCGATGCAATCTTCGACA
AGAAGCCCGCCGCATATCTATGAGCCTCATTACGTGGATCGGGGCAATGGAACCTTTCCAGCAATGGGTCAACATCAGAGAAATGGTGAGGATTTGTTGCATCCTCCAAG
GCATGTTGTGAAAGATATACCATTGTATGAGGATCGGTTTCAGGAGCAAATGAAAAGAAAGACAAACTATCCTCCACGGCTTCCTCACGAACACTATCCAGAAAGCTTCA
TGGATTTGAGAGCGCCAAACTCTCCTATAAGCAATGCAAGCAATCCAAAAGAGTCAATCAAATCTAGCACGTATCGTAACGAGAATCCTGTAACAGTTGGACTTACGGCT
TCCAATCTACAACGTGCTGGTCGATTTCCATCTCAAGATGCACTTCCACACTCGAGACAACTTAGTGAGCTTGATTCAGAGATTGATGGTTTCAGTCCGGTTCGACCAAG
AACATCTGTAGTTTTGCGAAGAAATGGAAAATCTCGAGATGCCATCGCTGGTGGTGCTCCATTCATTGTGTGCAATAGTTGCTTGGAATTGCTAAAACTTCCCAGAAAAC
TTTACAAGTTGGAGATGGATTGGCAGAAACTACAGTGTGGTGCTTGTTCGGTTGTCATTATCGTAAAAGTCGAAAACAGAAGGCTTGTTGTTAGCGTTCCAGCTGAAGCC
AAGCCCAAAGAAGTTTCTCCTGACGAGGGTTCCCCCAAAAGAGTTGTCAATGCTACCAGCTCTTTAGAAAGCTCTGATAATTCTAGTCACAAATCGATCAGTACCGACCA
CAACAAGCCCTCGGACAGTCGGGATTCAAATCTTGGTGAATCTAAAACGCAGGAGCTAACTTCGTCTCTCGTTCCTTCCACGGAAAAAGAGACCCTGCCTACAAAAGATG
CACCATCTTTAACAAATTCTGATAACCCTTCTTATGATGAACCTAGCAAATACAGAGAAGAAAGCGAAAATAATCAGGATACTGTGATAAACGACGTCACCGAACCAAGT
GAGTTGGACGTCTCGTTCGAAGACTATTCGAACATTCATATTTCTCAAGATTCTATGGAAATAAGCAAAGAAGAAGAAGAAGAAAATCAAAGCAAGATCAAAAGCAATGA
AGAATCTGAAACCTTTTTTGTGGATCTCAGCAAGAACAACTTAAGAGATTTTTCAAGATCAAGTGAAATCACTGATAATGGAAGGCCTACTGTTTCTGTTAATGGCCAGC
CTCTACCAGCTCATGTTGTCAAAAAGGCTGAAAAGCTTGCTGGGCCCATTCTCCCAGGAGATTATTGGTATGATTATCAAGCTGGATTCTGGGGTGTAATGGGACATCCA
TGTCTTGGCATTATTCCTCCATTTATCGACGAGTTCACGTATCCAATGTCAAGGAACTGTGCTGGTGGAAACACTGGAGTTTTTGTCAATGGGAGAGAGCTTCATAAAAG
GGATTTAGAGTTGCTGTCTAGCAGAGGGTTGCCTACTACTACGAATAAATTATATAGAATCGACATTTCGGGAAGAGTCGTAGATGACGATTCTGGAAAAGAGTTATATA
ACCTGGGAAAACTCGCCCCGACAGTTTTTACAATACGCCTGAAGCCCTCCCCTGCTCTCTGCTCTGCTCCAGTTCCCATATCTCCGATGGCTTCCACTGCTGAAATTCCG
AAGACAAAGAAACCCAGGAACAGCCGGAAGGCTCTGAAGGACAAAAACTCTACACCGGAGGAGCCACAATCTGAATCCTCCATGGTTACGAAAGTGACACAGCCATCGGA
AGAGGAGATCCTCCTATCTCAGAATCAATCTTCGGCTAAGAAACCGAAATCCAAAGCTGCGCCGAAGAAGCAGCCGGCGAAGCAGTCCTTCGACAAAGAGTTGCAGGAAA
TGCAGGACATGCTTCAACAGTTGAGGCTCGATAAGGAGAAGACTGAGGAGCTTTTGAAAGCCAAAGATGAGATGCTTAAGCAGAAGGATGAAGAACTTAAGACGAGAGAT
ATGGAGCAGGAAAAGCTCCAGATCGAATTGAAGAAGTTGCAGAAGTTGAAGGAGTTCAAACCTACTATGAACTTCCCTATGATTCAAATTTTGAAAGATAAGGAACAAGA
GAAGAAAGAGAAGAAGAAGTGCTCGGAAAACAAGAGGCCATCTCCACCTTACATCTTATGGTGCAAAGATCAATGGAACGAGATCAAGAAGGAGAATCCAGAGGCAGAGT
TCAAGGAGATCTCAAACATTTTGGGGGCAAAGTGGAAGAATGTCACTGCAGATGAGAAGAAGCCATATGAGGAGAGGTATCAGGCTGAAAAACAAGCCTATTTGCAAATC
ACTTCTAAAGAGAAGCGTGAGAGTGAGGCGATGAAGCTGTTAGAAGAGGAGCAGAAGCAGAAGACAGCCATGGAGCTGCTTGATCAATACCTTCAATTCAAAGGGGAGGC
TGAGAAAGAGAACAAGAAGAAGAAGAAAGAGAAGGATCCATTGAAGCCCAAGCATCCCATGTCTGCATTCTTTCTCTTTTCAAATGAGAGGCGTGCATCCCTTCTTGCCG
AAAACAAGAATGTTCTAGAGGTAGCGAAGATAACAGGTGAGGAGTGGAAGAACATGACAGAGAAACAGAGAGGTCCTTACGAAGAGATGGCGAGGAAGAACAAGGAAAAA
TACATGCAGGAAATGGAAATATACAAGCAGCAAAAGGAGGAGGAAGCAGCAATCCTCAAGAAGGAAGAGGAAGAACAAATGAAGCTTCATAAACATGAAGCTCTGCTGTT
GCTAAAGAAGAAAGAGAAAACCGAGACTATTATAAAGAAAACAAAAGAGGAGCGGCAGAAGAAGAAGAAGGAAGGGAAGAAGAGTGTTGATCCTAACAAGCCTAAGAAGC
CTGCATCCTCTTACATCCTGTTCAGCAAAGAAGCTAGGAAAAGTGTAATGGAGGAGAGGCCAGGAGCCAACAATTCCACAGTGAATGCACTGATTTCAGTGAAATGGAAG
GAACTAAGTGAAAGGGAGAGGAAGATGTGGAATGATAAAGCTGCAGAAGCCATGGATGCTTACAGAAAGGAATTGGAGGAATACAACAAAACTGTAGTTTCGCCTCCCTC
TCCGTACCGGCCTTGGATTCCACCACCGCCCGCCGACAGCGACAACTACGATGGGGATTCAGCTTCCGGTCACCGAAGACCGAATCTCCGACGCCGGAGCATCATTCGTC
CTCCAATCCAAAGGGGAATGGTGGCATGCGGGATTCCATTTGACGACGGCGATTGTCGGACCGACGATTCTAACGCTGCCGTTCTCGATCACTGCGAGAAGGCCGGTCGC
CGTCACATCCGATTCCGGGAACTCGCCGCCGACGTATTAGGATCTGGATGGATGTTTTACTTCGTAATATTCATCCAAACCGCGATCAATACTGGAGTTGGAATTGGAGC
AATCTTGCTCTCCGGGCAGTGCCTTCAGATAATATATTCAAACCTTTACCCAAATGGATCCATGAAACTGTACGAGTTCATAGCAATAGTAACAGGAGTGATGATCATTC
TGTCTCAGCTTCCAACCTTCCACTCTCTTAGACATGTCAGTCTGGCTTCTCTGCTTCTCAGCTTGGGCTACGCATTTCTTATTATTGCTGCTTGTATCATTGCAGCGAGT
GGGAAGATGGTGAAAGGGCTTTTGATGTGTTACTGTGTGATATTTGTAACTTTCTACGCCATTGCAGCATCTGGATATTGGGTGTTTGGAAACAGGGCAAGCTCCAATAT
TCTGCTGAGCCTGACGCCGGACACTGGACCTCCATTGGCTCCCGCTTGGATTCTTGGGCTCGCTGTCATCTTTGTTCTTCTTCAACTCCTCGCCATTGGACTGGTGTATT
CACAAGTGGCATACGAAATAATGGAGAAGCAATCAGCTGACACGAAGAAAGGAATGTTTTCCAAAAGGAACCTTATTCCAAGGCTCATTCTTCGCTCAATATACATGATC
ATCTGTGGCTTTTTTGCTGCTATGCTTCCATTCTTTGGTGACATTAGTGCCGTGGTGGGTGCTATTTGCTTCATTCCTCTTGATTTCATTCTACCAATGCTTCTCTATAA
CATCACCCACAATCCTCCCAAATCCTCCCTCACCTATTCTATCAACCTCGCCATTATTGTCGTATTCACCGGGGTTGGACTCTTGGGTTCGTTCTCTTCTATACGAAAGC
TCGTTCTTGATACTTCCAAGTTCAAGCTCTTCAGTAATGATGTTGTCGATTGATCTAAAAAATCCCTCTAAATTTTGTATCCAATTTAAAAATACTCTTAAAATTTCTTA
AAGGTAAGAATGTCAAAAAAGAATGAAAAATATTTGACGGCATACTTACCTACACGTACTTACACATGAGAAACGGGAAAAAAATGACAAAAAAATCGCGGAAAATGGA
Protein sequenceShow/hide protein sequence
MSTCRIQEYHFWLIRHEEDGGKSKVNFQEMTKHRFRNSLKSLFGSYLDPETNERLRGNKSVIEDKVNKIRQLIKGEDLGVEDHDQSETRKKQSIDELFDDFLNVYQALYE
QYDSLTGELRRKFQKRREKESSSSSSSDSDSDDSSKKKVSKDDRGLEREFQEVGEIKQELDAALSEVADLKRILATTIKEHESLNTEHLTALSKIQEADGIIRDLKVEAE
IWDSQKSKFQLEIEELNLALSNAGRNESELNERLKGMETEMNNYIEEKETARRKIEEGEKTIDELKALADQLKEKLSATMEEKEALNSQHLKTLSRVHEADMITRDLKVE
SETWGGEKSKFLLEIEELNQKLGAAGKLEAQLNERLKDIGIENEYLIKEKESAQRTIEEMSQRLSNAVKIEAELNGRLKDIETEKDGLIKEKEIAWKEIEQGKRVIEELN
AMVDQLNSQLTITVEEKKSLNLQLEKEKVELLRSIADHQRNLKEHEDAYKKLNDEFQECKLKLDNAEMMMAEMSREFLNDIRSKEQVKDDLELMVEDLKRELEVKSDEIN
SLVENARTIEVKLRLSNQKLRVTEQLLTEKEEIFRKAELKYQKERKLLEERIHGLSATVVTNKVTYQKTISTVSENINSNLSQLECVIRKFILEYAKYEKCVMETSRDLR
LTKSWVSNAIEETEGLKKEVADLGKQLEDKKERESILVQQVEKLEIKANKEGSEKDGLVEAIHELEKRQTELEKLMEEKNEGMVGLKEEKKEAIRQLCMLIEYHRDRYDF
LKDEVLKLNVKGGQSIIRVLRCPRCENLLLVLFVEWDCPSMEPSVKERVLRSRRTVFSSSSIRTNDREDIDDYERRIGQDTKGVWSMQSLGDKEVGLVEETQRIENWIRR
NNIEHDMDIYVNPMGAARTRAAFEHQRIERDAFTGYSGNSLAIADRIGVPNFHYPSDRPSSSNVDRLYGHPESNQDYERPLDGLDPNRAELLRRLDELKDQIIKSCDVGD
RPKVVERAAVDPYYGRATYNVPMQSSTRSPPHIYEPHYVDRGNGTFPAMGQHQRNGEDLLHPPRHVVKDIPLYEDRFQEQMKRKTNYPPRLPHEHYPESFMDLRAPNSPI
SNASNPKESIKSSTYRNENPVTVGLTASNLQRAGRFPSQDALPHSRQLSELDSEIDGFSPVRPRTSVVLRRNGKSRDAIAGGAPFIVCNSCLELLKLPRKLYKLEMDWQK
LQCGACSVVIIVKVENRRLVVSVPAEAKPKEVSPDEGSPKRVVNATSSLESSDNSSHKSISTDHNKPSDSRDSNLGESKTQELTSSLVPSTEKETLPTKDAPSLTNSDNP
SYDEPSKYREESENNQDTVINDVTEPSELDVSFEDYSNIHISQDSMEISKEEEEENQSKIKSNEESETFFVDLSKNNLRDFSRSSEITDNGRPTVSVNGQPLPAHVVKKA
EKLAGPILPGDYWYDYQAGFWGVMGHPCLGIIPPFIDEFTYPMSRNCAGGNTGVFVNGRELHKRDLELLSSRGLPTTTNKLYRIDISGRVVDDDSGKELYNLGKLAPTVF
TIRLKPSPALCSAPVPISPMASTAEIPKTKKPRNSRKALKDKNSTPEEPQSESSMVTKVTQPSEEEILLSQNQSSAKKPKSKAAPKKQPAKQSFDKELQEMQDMLQQLRL
DKEKTEELLKAKDEMLKQKDEELKTRDMEQEKLQIELKKLQKLKEFKPTMNFPMIQILKDKEQEKKEKKKCSENKRPSPPYILWCKDQWNEIKKENPEAEFKEISNILGA
KWKNVTADEKKPYEERYQAEKQAYLQITSKEKRESEAMKLLEEEQKQKTAMELLDQYLQFKGEAEKENKKKKKEKDPLKPKHPMSAFFLFSNERRASLLAENKNVLEVAK
ITGEEWKNMTEKQRGPYEEMARKNKEKYMQEMEIYKQQKEEEAAILKKEEEEQMKLHKHEALLLLKKKEKTETIIKKTKEERQKKKKEGKKSVDPNKPKKPASSYILFSK
EARKSVMEERPGANNSTVNALISVKWKELSERERKMWNDKAAEAMDAYRKELEEYNKTVVSPPSPYRPWIPPPPADSDNYDGDSASGHRRPNLRRRSIIRPPIQRGMVAC
GIPFDDGDCRTDDSNAAVLDHCEKAGRRHIRFRELAADVLGSGWMFYFVIFIQTAINTGVGIGAILLSGQCLQIIYSNLYPNGSMKLYEFIAIVTGVMIILSQLPTFHSL
RHVSLASLLLSLGYAFLIIAACIIAASGKMVKGLLMCYCVIFVTFYAIAASGYWVFGNRASSNILLSLTPDTGPPLAPAWILGLAVIFVLLQLLAIGLVYSQVAYEIMEK
QSADTKKGMFSKRNLIPRLILRSIYMIICGFFAAMLPFFGDISAVVGAICFIPLDFILPMLLYNITHNPPKSSLTYSINLAIIVVFTGVGLLGSFSSIRKLVLDTSKFKL
FSNDVVD