CuGenDBv2

Sgr022656 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene ID: Sgr022656
Organism: Siraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Description: Dihydrolipoamide acetyltransferase component of pyruvate dehydrogenase complex
Genome location: tig00000289:1995521..2026055
RNA-Seq Expression: Sgr022656
Synteny: Sgr022656
Gene Ontology terms:
GO:0000462 - maturation of SSU-rRNA from tricistronic rRNA transcript (SSU-rRNA, 5.8S rRNA, LSU-rRNA) (biological process)
GO:0005730 - nucleolus (cellular component)
GO:0005737 - cytoplasm (cellular component)
GO:0032040 - small-subunit processome (cellular component)
GO:0016746 - transferase activity, transferring acyl groups (molecular function)
GO:0005515 - protein binding (molecular function)
InterPro domains:
IPR003016 - 2-oxo acid dehydrogenase, lipoyl-binding site
IPR040315 - WD repeat-containing protein WDR46/Utp7
IPR036625 - E3-binding domain superfamily
IPR036322 - WD40-repeat-containing domain superfamily
IPR023213 - Chloramphenicol acetyltransferase-like domain superfamily
IPR015943 - WD40/YVTN repeat-like-containing domain superfamily
IPR012952 - BING4, C-terminal domain
IPR011053 - Single hybrid motif
IPR004167 - Peripheral subunit-binding domain
IPR001680 - WD40 repeat
IPR001078 - 2-oxoacid dehydrogenase acyltransferase, catalytic domain
IPR000089 - Biotin/lipoyl attachment
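
The genome location value in this record, tig00000289:1995521..2026055, follows a contig:start..end pattern. The sketch below shows one way to split such a string and compute the length of the locus. The helper names (`Location`, `parse_location`) are hypothetical, and the code assumes the coordinates are 1-based and inclusive, which this page does not state.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    contig: str
    start: int
    end: int

    @property
    def span(self) -> int:
        # Assumption: coordinates are 1-based and inclusive, so both ends count.
        return self.end - self.start + 1


# contig name, a colon, then start..end as unsigned integers
_LOCATION_RE = re.compile(r"^(?P<contig>[^:]+):(?P<start>\d+)\.\.(?P<end>\d+)$")


def parse_location(text: str) -> Location:
    """Parse a 'contig:start..end' string into its three parts."""
    match = _LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"not a contig:start..end location: {text!r}")
    start, end = int(match["start"]), int(match["end"])
    if start > end:
        raise ValueError(f"start is greater than end in {text!r}")
    return Location(match["contig"], start, end)


loc = parse_location("tig00000289:1995521..2026055")
print(loc.contig, loc.start, loc.end, loc.span)
```

Under the inclusive-coordinate assumption, the locus spans 30,535 bp on contig tig00000289.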


Homology
GenBank top hits (e-value, % identity, alignment)
XP_004138833.1 probable U3 small nucleolar RNA-associated protein 7 [Cucumis sativus] (e-value 2.0e-276, identity 90.58%)
Query:  MEEELGSAVAERILPPTEQEVSNEIDVRIKKYLRGEGANLEVLKDKKLKGQLAVREDLYGKSAKAAAKVEKWLMPSEGGYLEAEGLEKTWRIKQETISHE
        ME+ELG+ V ERILPPTEQEVSNEIDV++KKY+RGEGANLEVLKDKKLKGQL+  EDLYGKSAKAAA+VEKWLMPSEGGYLE EGLEKTWRIKQETISHE
Subjt:  MEEELGSAVAERILPPTEQEVSNEIDVRIKKYLRGEGANLEVLKDKKLKGQLAVREDLYGKSAKAAAKVEKWLMPSEGGYLEAEGLEKTWRIKQETISHE

Query:  VDILSRRNQHDIILPALGPYSLDYTSNGRYMTIAGRKGHLAIVDMKNLNLIKEFQVKETVRDVVFLHNELFFAAAQKKYPYIYNRDGTELHCLKEHGSVL
        VDILSRRNQHDIILPALGPYS+DYTSNGRYM IAGRKGHLA+VDMK+LNLIKEFQVKETVRDVVFLHNELFFAAAQKKYPYIYNR+GTELHCLKEHGSV 
Subjt:  VDILSRRNQHDIILPALGPYSLDYTSNGRYMTIAGRKGHLAIVDMKNLNLIKEFQVKETVRDVVFLHNELFFAAAQKKYPYIYNRDGTELHCLKEHGSVL

Query:  RLQFLKNHFLLASINKFGQLHYQDVTTGSMVSVFRTGLGRTDVMQVNPFNGVIATGHSGGSVVMWKPTSSAPLVKMLCHQGPVSALAFHPNGHLMATSGS
        RLQFLKNHFLL SINKFGQLHYQDVTTGSMV  FRTGLGRTDVMQVNPFNGVIATGHSGGSV MWKPTSSAPLVKMLCH GPVSALAFHPNGHLMATSG+
Subjt:  RLQFLKNHFLLASINKFGQLHYQDVTTGSMVSVFRTGLGRTDVMQVNPFNGVIATGHSGGSVVMWKPTSSAPLVKMLCHQGPVSALAFHPNGHLMATSGS

Query:  ERKIKLWDLRKFEVLQTLPGHAKTLDFSQKGLLAYGTGSFVQVLGDFSGVQNYSRYMAHSMVKGYQIRKILFRPYEDVLGIGHSMGWSSILIPGSGEPNF
        ERKIKLWDLRKFEVLQTLPGHAKTLDFSQKGLLAYGTGSFVQ+LGDFSG QNY+RYMAHSM KGYQI KILFRPYEDVLGIGHSMGWSSILIPGSGEPNF
Subjt:  ERKIKLWDLRKFEVLQTLPGHAKTLDFSQKGLLAYGTGSFVQVLGDFSGVQNYSRYMAHSMVKGYQIRKILFRPYEDVLGIGHSMGWSSILIPGSGEPNF

Query:  DTWVANPFETSKQRREKEVQSLLDKLPPETISLNPSKIGAVMSVKKKEKKTQKEREAEQESAVEAAKGITQKKKTKGRNRPSKREKKKRETVEKVKRPFL
        DTWVANPFETSKQRREKEV+SLLDKLPPETISLNP+KIG +M+VKKKEKKT+KER+AE+E+AV+AAKGIT KKKTKGRN+P+KREKKK E +EK KRPFL
Subjt:  DTWVANPFETSKQRREKEVQSLLDKLPPETISLNPSKIGAVMSVKKKEKKTQKEREAEQESAVEAAKGITQKKKTKGRNRPSKREKKKRETVEKVKRPFL

Query:  HEQINEEEVSRKRPRLSEEVELPKSLQRFAR
        HEQI EEE+SRK+ RLSEEVELPKSLQRFAR
Subjt:  HEQINEEEVSRKRPRLSEEVELPKSLQRFAR
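
In BLAST-style pairwise output, the unlabeled middle row of each block repeats the residue letter where the two sequences are identical, shows `+` for a conservative substitution, and is blank otherwise. As a rough illustration of how a percent-identity figure relates to these rows, the sketch below counts those symbols for one block. The names `BlockStats`, `block_stats`, and `percent_identity` are hypothetical. The identity values listed on this page presumably cover whole alignments, so a single block will not reproduce them, and the sketch does not claim to be how the site computes them.

```python
from typing import Iterable, NamedTuple, Tuple


class BlockStats(NamedTuple):
    length: int     # aligned columns in the block, gap columns included
    identical: int  # columns where the middle row repeats the residue letter
    positive: int   # identical columns plus '+' (conservative) columns


def block_stats(query: str, midline: str) -> BlockStats:
    """Count identities and positives in one alignment block."""
    # Trailing spaces of the middle row are often lost in copied text,
    # so pad it back to the width of the query row.
    midline = midline.ljust(len(query))
    if len(midline) != len(query):
        raise ValueError("middle row is longer than the query row")
    identical = sum(ch.isalpha() for ch in midline)
    plus = midline.count("+")
    return BlockStats(len(query), identical, identical + plus)


def percent_identity(blocks: Iterable[Tuple[str, str]]) -> float:
    """Percent identity over several (query, middle row) blocks."""
    stats = [block_stats(q, m) for q, m in blocks]
    total = sum(s.length for s in stats)
    if total == 0:
        raise ValueError("no aligned columns")
    return 100.0 * sum(s.identical for s in stats) / total


# Final block of the alignment shown above.
last_block = ("HEQINEEEVSRKRPRLSEEVELPKSLQRFAR",
              "HEQI EEE+SRK+ RLSEEVELPKSLQRFAR")
stats = block_stats(*last_block)
print(stats, round(percent_identity([last_block]), 2))
```

For this final block alone, 27 of 31 columns are identical and 29 are positives. Passing every block of a hit to `percent_identity` would give an estimate for the full displayed alignment.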

XP_008456565.1 PREDICTED: probable U3 small nucleolar RNA-associated protein 7 [Cucumis melo] (e-value 8.9e-277, identity 90.94%)
Query:  MEEELGSAVAERILPPTEQEVSNEIDVRIKKYLRGEGANLEVLKDKKLKGQLAVREDLYGKSAKAAAKVEKWLMPSEGGYLEAEGLEKTWRIKQETISHE
        ME+ELG+ VAERILPPTEQE+SNEIDV++KKY+RGEGANLEVLKDKKLKGQL+V EDLYGKSAKAAAKVEKWLMPSEGGYLE EGLEKTWRIKQETISHE
Subjt:  MEEELGSAVAERILPPTEQEVSNEIDVRIKKYLRGEGANLEVLKDKKLKGQLAVREDLYGKSAKAAAKVEKWLMPSEGGYLEAEGLEKTWRIKQETISHE

Query:  VDILSRRNQHDIILPALGPYSLDYTSNGRYMTIAGRKGHLAIVDMKNLNLIKEFQVKETVRDVVFLHNELFFAAAQKKYPYIYNRDGTELHCLKEHGSVL
        VDILSRRNQHDIILPALGPYS+DYTSNGRYM IAGRKGHLA+VDMK+LNLIKEFQVKETVRDVVFLHNELFFAAAQKKYPYIYNR+GTELHCLKEHGSVL
Subjt:  VDILSRRNQHDIILPALGPYSLDYTSNGRYMTIAGRKGHLAIVDMKNLNLIKEFQVKETVRDVVFLHNELFFAAAQKKYPYIYNRDGTELHCLKEHGSVL

Query:  RLQFLKNHFLLASINKFGQLHYQDVTTGSMVSVFRTGLGRTDVMQVNPFNGVIATGHSGGSVVMWKPTSSAPLVKMLCHQGPVSALAFHPNGHLMATSGS
        RLQFLKNHFLL SINKFGQLHYQDVTTGSMV  FRTGLGRTDVMQVNPFNGVIATGHSGGSV MWKPTSSAPLVKMLCH GPVSALAFHPNGHLMATSG+
Subjt:  RLQFLKNHFLLASINKFGQLHYQDVTTGSMVSVFRTGLGRTDVMQVNPFNGVIATGHSGGSVVMWKPTSSAPLVKMLCHQGPVSALAFHPNGHLMATSGS

Query:  ERKIKLWDLRKFEVLQTLPGHAKTLDFSQKGLLAYGTGSFVQVLGDFSGVQNYSRYMAHSMVKGYQIRKILFRPYEDVLGIGHSMGWSSILIPGSGEPNF
        ERKIKLWDLRKFEVLQTLPGHAKTLDFSQKGLLAYGTGSFVQVLGD SG Q+Y+RYMAHSM KGYQI K+LFRPYEDVLGIGHSMGWSSILIPGSGEPNF
Subjt:  ERKIKLWDLRKFEVLQTLPGHAKTLDFSQKGLLAYGTGSFVQVLGDFSGVQNYSRYMAHSMVKGYQIRKILFRPYEDVLGIGHSMGWSSILIPGSGEPNF

Query:  DTWVANPFETSKQRREKEVQSLLDKLPPETISLNPSKIGAVMSVKKKEKKTQKEREAEQESAVEAAKGITQKKKTKGRNRPSKREKKKRETVEKVKRPFL
        DTWVANPFETSKQRREKEV+SLLDKLPPETISLNPSKIG +++VKKKEKKT+KER+AE+E+AV+AAKGIT KKKTKGRN+P+KREKKK E +EK KRPFL
Subjt:  DTWVANPFETSKQRREKEVQSLLDKLPPETISLNPSKIGAVMSVKKKEKKTQKEREAEQESAVEAAKGITQKKKTKGRNRPSKREKKKRETVEKVKRPFL

Query:  HEQINEEEVSRKRPRLSEEVELPKSLQRFA
        HEQI EEE+SRKR RLSEEVELPKSLQRFA
Subjt:  HEQINEEEVSRKRPRLSEEVELPKSLQRFA

XP_022134200.1 probable U3 small nucleolar RNA-associated protein 7 [Momordica charantia] (e-value 1.0e-280, identity 93.06%)
Query:  MEEELGSAVAERILPPTEQEVSNEIDVRIKKYLRGEGANLEVLKDKKLKGQLAVREDLYGKSAKAAAKVEKWLMPSEGGYLEAEGLEKTWRIKQETISHE
        MEEE G+AVAER+LPPTEQEVS+E+DVRIKKYLRGEGANLEVLKDKKLKGQL+VREDLYGKSAKAAAKVEKWLMPSEGGYLEAEGLEKTWRIKQETISHE
Subjt:  MEEELGSAVAERILPPTEQEVSNEIDVRIKKYLRGEGANLEVLKDKKLKGQLAVREDLYGKSAKAAAKVEKWLMPSEGGYLEAEGLEKTWRIKQETISHE

Query:  VDILSRRNQHDIILPALGPYSLDYTSNGRYMTIAGRKGHLAIVDMKNLNLIKEFQVKETVRDVVFLHNELFFAAAQKKYPYIYNRDGTELHCLKEHGSVL
        VDILSRRNQHDIILPALGPYSLDYTSNGRYMTIAGRKGHLAIVDMKNLNLIKEFQVKETVRDVVFLHNELFFAAAQKKYPYIYNRDGTELHCLKEHGSVL
Subjt:  VDILSRRNQHDIILPALGPYSLDYTSNGRYMTIAGRKGHLAIVDMKNLNLIKEFQVKETVRDVVFLHNELFFAAAQKKYPYIYNRDGTELHCLKEHGSVL

Query:  RLQFLKNHFLLASINKFGQLHYQDVTTGSMVSVFRTGLGRTDVMQVNPFNGVIATGHSGGSVVMWKPTSSAPLVKMLCHQGPVSALAFHPNGHLMATSGS
        RLQFLKNHFLLAS NKFGQLHYQDVTTGSMV VFRTGLGRTDVMQVNPFNGVIATGHSGG+V MWKPTSSAPLVKMLCH+ PVSALAFHPNGHLMATSGS
Subjt:  RLQFLKNHFLLASINKFGQLHYQDVTTGSMVSVFRTGLGRTDVMQVNPFNGVIATGHSGGSVVMWKPTSSAPLVKMLCHQGPVSALAFHPNGHLMATSGS

Query:  ERKIKLWDLRKFEVLQTLPGHAKTLDFSQKGLLAYGTGSFVQVLGDFSGVQNYSRYMAHSMVKGYQIRKILFRPYEDVLGIGHSMGWSSILIPGSGEPNF
        ERKIKLWDLRKFEVLQTLPGHAKTLDFSQKGLLAYGTGS VQ+LGD SG QNYS+YMAHSMVKGYQI K+LFRPYEDVLGIGHSMGWSSILIPGSGEPNF
Subjt:  ERKIKLWDLRKFEVLQTLPGHAKTLDFSQKGLLAYGTGSFVQVLGDFSGVQNYSRYMAHSMVKGYQIRKILFRPYEDVLGIGHSMGWSSILIPGSGEPNF

Query:  DTWVANPFETSKQRREKEVQSLLDKLPPETISLNPSKIGAVMSVKKKEKKTQKEREAEQESAVEAAKGITQKKKTKGRNRPSKREKKKRETVEKVKRPFL
        DTWVANPFETSKQRREKEV+SLLDKLPPETISLNPSKIG VMSVKKKEKK++KEREAE+ESAV+ AKGIT KKKTKGRN+P+KREKKKRE +EK KRPFL
Subjt:  DTWVANPFETSKQRREKEVQSLLDKLPPETISLNPSKIGAVMSVKKKEKKTQKEREAEQESAVEAAKGITQKKKTKGRNRPSKREKKKRETVEKVKRPFL

Query:  HEQI--NEEEVSRKRPRLSEEVELPKSLQRFAR
        HEQI   EEE+SRKRP+LSEEVELPKSLQRFAR
Subjt:  HEQI--NEEEVSRKRPRLSEEVELPKSLQRFAR

XP_038884737.1 dihydrolipoyllysine-residue acetyltransferase component 1 of pyruvate dehydrogenase complex, mitochondrial isoform X1 [Benincasa hispida] (e-value 5.7e-276, identity 90.99%)
Query:  DSSHAVLEMPALSPTMNQGNIAKWRKKEGDKIAVGDVLCEIETDKATLEFESLEEGFLAKILVPEGSKDVPVGQPIAITVEDPDDIHRVLANGVSGASDI
        DSSHAVLEMPALSPTMNQGNIAKWRKKEGDKIAVGDVLCEIETDKATLEFESLEEGFLAKILVPEGSKDVPVGQPIAITVEDP+DI+RVLAN VSGA+DI
Subjt:  DSSHAVLEMPALSPTMNQGNIAKWRKKEGDKIAVGDVLCEIETDKATLEFESLEEGFLAKILVPEGSKDVPVGQPIAITVEDPDDIHRVLANGVSGASDI

Query:  KDDAIGDQKDKNEDRAQASSVEINTSKLPPHFILEMPALSPTMNQGNIANWRKKEGDKIEVGDVICEIETDKATLEFESLEEGYLAKILAPEGSKDVAVG
        KDDA G+Q+DKNEDRAQASSVEIN+SKLPPH +LEMPALSPTMNQGNIA WRKKEGDKIEVGDVICEIETDKATLEFESLEEGYLAKILAPEGSKDVAVG
Subjt:  KDDAIGDQKDKNEDRAQASSVEINTSKLPPHFILEMPALSPTMNQGNIANWRKKEGDKIEVGDVICEIETDKATLEFESLEEGYLAKILAPEGSKDVAVG

Query:  KPIAITVEDPADIESVKSAVSSSLGTKEDKPAQSTIRNDAGTSK-GSVARISPAAKLLIAEHGLDASSLKASGSHGTLLKGDVLAAIKSGKGLSEVSLSK
        KPIAITVEDP DIESVKSAVSSS G KEDKPA +T++ND GTSK G+VARISPAAKLLIAEHGLDASSLKASGSHGTLLKGDVLAAIKSGKGLS+VSLSK
Subjt:  KPIAITVEDPADIESVKSAVSSSLGTKEDKPAQSTIRNDAGTSK-GSVARISPAAKLLIAEHGLDASSLKASGSHGTLLKGDVLAAIKSGKGLSEVSLSK

Query:  EKRSPEVHAQASSTASSESKSPIKQSDSFEDLPNSQIRKVIAKRLLESKQNTPHLYLSTDVILDPLLSLRKDLKEKHDVKVSVNDIVIKAVAVALRNVPG
        +KRSPEV AQASST SSESK  IKQSDSFEDLPNSQIRKVIAKRLLESKQNTPHLYLSTDVILDPLLSLRKDLKEKHDVKVSVNDIVIKAVAVALRNV G
Subjt:  EKRSPEVHAQASSTASSESKSPIKQSDSFEDLPNSQIRKVIAKRLLESKQNTPHLYLSTDVILDPLLSLRKDLKEKHDVKVSVNDIVIKAVAVALRNVPG

Query:  ANAYWDNEKGEVVFCDSIDISIAVATEKGLMTPIVRNADQKTVSMISSEVKELAEKARAGKLKPDEFQGGTFSISNLGMFPVDHFCAIINPPQAGILAVG
        ANAYWD+EKGEVVFCDSIDISIAVATEKGLMTPIVRNAD KT+S ISSEVKELAEKARAGKLKPDEFQGGTFSISNLGMFPVD+FCAIINPPQAGILAVG
Subjt:  ANAYWDNEKGEVVFCDSIDISIAVATEKGLMTPIVRNADQKTVSMISSEVKELAEKARAGKLKPDEFQGGTFSISNLGMFPVDHFCAIINPPQAGILAVG

Query:  RGNKVVEPVIGTDGIERPVVLNKMNLTLSADHRVFDGKVGDSAIPTSPEPFQTLQ
        RGNKVVEP+IG DG+ERPVV+NKMNLTLSADHRVFDGKVG   +      F ++Q
Subjt:  RGNKVVEPVIGTDGIERPVVLNKMNLTLSADHRVFDGKVGDSAIPTSPEPFQTLQ

XP_038886607.1 probable U3 small nucleolar RNA-associated protein 7 [Benincasa hispida] (e-value 2.5e-279, identity 92.28%)
Query:  MEEELGSAVAERILPPTEQEVSNEIDVRIKKYLRGEGANLEVLKDKKLKGQLAVREDLYGKSAKAAAKVEKWLMPSEGGYLEAEGLEKTWRIKQETISHE
        ME+ELG+AVAERILPPTEQEVSNEIDV+++KYLRGEGANLEVLKDKKLKGQL+V EDLYGKSAKAAAKVEKWLMPSEGGYLEAEGLEKTWRIKQETISHE
Subjt:  MEEELGSAVAERILPPTEQEVSNEIDVRIKKYLRGEGANLEVLKDKKLKGQLAVREDLYGKSAKAAAKVEKWLMPSEGGYLEAEGLEKTWRIKQETISHE

Query:  VDILSRRNQHDIILPALGPYSLDYTSNGRYMTIAGRKGHLAIVDMKNLNLIKEFQVKETVRDVVFLHNELFFAAAQKKYPYIYNRDGTELHCLKEHGSVL
        VDILS+RNQHDIILPALGPYSLDYTSNGRYM IAGRKGHLA+VDMK+LNLIKEFQVKETVRDVVFLHNELFFAAAQKKYPYIYNR GTELHCLKEHGSVL
Subjt:  VDILSRRNQHDIILPALGPYSLDYTSNGRYMTIAGRKGHLAIVDMKNLNLIKEFQVKETVRDVVFLHNELFFAAAQKKYPYIYNRDGTELHCLKEHGSVL

Query:  RLQFLKNHFLLASINKFGQLHYQDVTTGSMVSVFRTGLGRTDVMQVNPFNGVIATGHSGGSVVMWKPTSSAPLVKMLCHQGPVSALAFHPNGHLMATSGS
        RLQFLKNHFLLASINKFGQLHYQDVTTGSMV  FRTGLGRTDVMQVNPFNGVIATGHSGGSV MWKPTSSAPLVKMLCHQGPVSALAFHPNG+LMATSGS
Subjt:  RLQFLKNHFLLASINKFGQLHYQDVTTGSMVSVFRTGLGRTDVMQVNPFNGVIATGHSGGSVVMWKPTSSAPLVKMLCHQGPVSALAFHPNGHLMATSGS

Query:  ERKIKLWDLRKFEVLQTLPGHAKTLDFSQKGLLAYGTGSFVQVLGDFSGVQNYSRYMAHSMVKGYQIRKILFRPYEDVLGIGHSMGWSSILIPGSGEPNF
        ERKIKLWDLRKFEVLQTLPGHAKTLDFSQKGLLAYGTGSFVQ+LGD SG QNY+RYMAHSMVKGYQI KILFRPYEDVLGIGHSMGWSSILIPGSGEPNF
Subjt:  ERKIKLWDLRKFEVLQTLPGHAKTLDFSQKGLLAYGTGSFVQVLGDFSGVQNYSRYMAHSMVKGYQIRKILFRPYEDVLGIGHSMGWSSILIPGSGEPNF

Query:  DTWVANPFETSKQRREKEVQSLLDKLPPETISLNPSKIGAVMSVKKKEKKTQKEREAEQESAVEAAKGITQKKKTKGRNRPSKREKKKRETVEKVKRPFL
        DTWVANPFETSKQRREKEV+SLLDKLPPETISLNPSKIG VM+VKKKEKKT+ ER+AE+E+A++AAKGIT KKKTKGRN+P+KREKKK E +EK KRPFL
Subjt:  DTWVANPFETSKQRREKEVQSLLDKLPPETISLNPSKIGAVMSVKKKEKKTQKEREAEQESAVEAAKGITQKKKTKGRNRPSKREKKKRETVEKVKRPFL

Query:  HEQINEEEVSRKRPRLSEEVELPKSLQRFAR
        HEQI EEE+SRKR RLSEEVELPKSLQRFAR
Subjt:  HEQINEEEVSRKRPRLSEEVELPKSLQRFAR

TrEMBL top hits (e-value, % identity, alignment)
A0A0A0LQI6 WD_REPEATS_REGION domain-containing protein (e-value 9.6e-277, identity 90.58%)
Query:  MEEELGSAVAERILPPTEQEVSNEIDVRIKKYLRGEGANLEVLKDKKLKGQLAVREDLYGKSAKAAAKVEKWLMPSEGGYLEAEGLEKTWRIKQETISHE
        ME+ELG+ V ERILPPTEQEVSNEIDV++KKY+RGEGANLEVLKDKKLKGQL+  EDLYGKSAKAAA+VEKWLMPSEGGYLE EGLEKTWRIKQETISHE
Subjt:  MEEELGSAVAERILPPTEQEVSNEIDVRIKKYLRGEGANLEVLKDKKLKGQLAVREDLYGKSAKAAAKVEKWLMPSEGGYLEAEGLEKTWRIKQETISHE

Query:  VDILSRRNQHDIILPALGPYSLDYTSNGRYMTIAGRKGHLAIVDMKNLNLIKEFQVKETVRDVVFLHNELFFAAAQKKYPYIYNRDGTELHCLKEHGSVL
        VDILSRRNQHDIILPALGPYS+DYTSNGRYM IAGRKGHLA+VDMK+LNLIKEFQVKETVRDVVFLHNELFFAAAQKKYPYIYNR+GTELHCLKEHGSV 
Subjt:  VDILSRRNQHDIILPALGPYSLDYTSNGRYMTIAGRKGHLAIVDMKNLNLIKEFQVKETVRDVVFLHNELFFAAAQKKYPYIYNRDGTELHCLKEHGSVL

Query:  RLQFLKNHFLLASINKFGQLHYQDVTTGSMVSVFRTGLGRTDVMQVNPFNGVIATGHSGGSVVMWKPTSSAPLVKMLCHQGPVSALAFHPNGHLMATSGS
        RLQFLKNHFLL SINKFGQLHYQDVTTGSMV  FRTGLGRTDVMQVNPFNGVIATGHSGGSV MWKPTSSAPLVKMLCH GPVSALAFHPNGHLMATSG+
Subjt:  RLQFLKNHFLLASINKFGQLHYQDVTTGSMVSVFRTGLGRTDVMQVNPFNGVIATGHSGGSVVMWKPTSSAPLVKMLCHQGPVSALAFHPNGHLMATSGS

Query:  ERKIKLWDLRKFEVLQTLPGHAKTLDFSQKGLLAYGTGSFVQVLGDFSGVQNYSRYMAHSMVKGYQIRKILFRPYEDVLGIGHSMGWSSILIPGSGEPNF
        ERKIKLWDLRKFEVLQTLPGHAKTLDFSQKGLLAYGTGSFVQ+LGDFSG QNY+RYMAHSM KGYQI KILFRPYEDVLGIGHSMGWSSILIPGSGEPNF
Subjt:  ERKIKLWDLRKFEVLQTLPGHAKTLDFSQKGLLAYGTGSFVQVLGDFSGVQNYSRYMAHSMVKGYQIRKILFRPYEDVLGIGHSMGWSSILIPGSGEPNF

Query:  DTWVANPFETSKQRREKEVQSLLDKLPPETISLNPSKIGAVMSVKKKEKKTQKEREAEQESAVEAAKGITQKKKTKGRNRPSKREKKKRETVEKVKRPFL
        DTWVANPFETSKQRREKEV+SLLDKLPPETISLNP+KIG +M+VKKKEKKT+KER+AE+E+AV+AAKGIT KKKTKGRN+P+KREKKK E +EK KRPFL
Subjt:  DTWVANPFETSKQRREKEVQSLLDKLPPETISLNPSKIGAVMSVKKKEKKTQKEREAEQESAVEAAKGITQKKKTKGRNRPSKREKKKRETVEKVKRPFL

Query:  HEQINEEEVSRKRPRLSEEVELPKSLQRFAR
        HEQI EEE+SRK+ RLSEEVELPKSLQRFAR
Subjt:  HEQINEEEVSRKRPRLSEEVELPKSLQRFAR

A0A1S3C483 probable U3 small nucleolar RNA-associated protein 7 (e-value 4.3e-277, identity 90.94%)
Query:  MEEELGSAVAERILPPTEQEVSNEIDVRIKKYLRGEGANLEVLKDKKLKGQLAVREDLYGKSAKAAAKVEKWLMPSEGGYLEAEGLEKTWRIKQETISHE
        ME+ELG+ VAERILPPTEQE+SNEIDV++KKY+RGEGANLEVLKDKKLKGQL+V EDLYGKSAKAAAKVEKWLMPSEGGYLE EGLEKTWRIKQETISHE
Subjt:  MEEELGSAVAERILPPTEQEVSNEIDVRIKKYLRGEGANLEVLKDKKLKGQLAVREDLYGKSAKAAAKVEKWLMPSEGGYLEAEGLEKTWRIKQETISHE

Query:  VDILSRRNQHDIILPALGPYSLDYTSNGRYMTIAGRKGHLAIVDMKNLNLIKEFQVKETVRDVVFLHNELFFAAAQKKYPYIYNRDGTELHCLKEHGSVL
        VDILSRRNQHDIILPALGPYS+DYTSNGRYM IAGRKGHLA+VDMK+LNLIKEFQVKETVRDVVFLHNELFFAAAQKKYPYIYNR+GTELHCLKEHGSVL
Subjt:  VDILSRRNQHDIILPALGPYSLDYTSNGRYMTIAGRKGHLAIVDMKNLNLIKEFQVKETVRDVVFLHNELFFAAAQKKYPYIYNRDGTELHCLKEHGSVL

Query:  RLQFLKNHFLLASINKFGQLHYQDVTTGSMVSVFRTGLGRTDVMQVNPFNGVIATGHSGGSVVMWKPTSSAPLVKMLCHQGPVSALAFHPNGHLMATSGS
        RLQFLKNHFLL SINKFGQLHYQDVTTGSMV  FRTGLGRTDVMQVNPFNGVIATGHSGGSV MWKPTSSAPLVKMLCH GPVSALAFHPNGHLMATSG+
Subjt:  RLQFLKNHFLLASINKFGQLHYQDVTTGSMVSVFRTGLGRTDVMQVNPFNGVIATGHSGGSVVMWKPTSSAPLVKMLCHQGPVSALAFHPNGHLMATSGS

Query:  ERKIKLWDLRKFEVLQTLPGHAKTLDFSQKGLLAYGTGSFVQVLGDFSGVQNYSRYMAHSMVKGYQIRKILFRPYEDVLGIGHSMGWSSILIPGSGEPNF
        ERKIKLWDLRKFEVLQTLPGHAKTLDFSQKGLLAYGTGSFVQVLGD SG Q+Y+RYMAHSM KGYQI K+LFRPYEDVLGIGHSMGWSSILIPGSGEPNF
Subjt:  ERKIKLWDLRKFEVLQTLPGHAKTLDFSQKGLLAYGTGSFVQVLGDFSGVQNYSRYMAHSMVKGYQIRKILFRPYEDVLGIGHSMGWSSILIPGSGEPNF

Query:  DTWVANPFETSKQRREKEVQSLLDKLPPETISLNPSKIGAVMSVKKKEKKTQKEREAEQESAVEAAKGITQKKKTKGRNRPSKREKKKRETVEKVKRPFL
        DTWVANPFETSKQRREKEV+SLLDKLPPETISLNPSKIG +++VKKKEKKT+KER+AE+E+AV+AAKGIT KKKTKGRN+P+KREKKK E +EK KRPFL
Subjt:  DTWVANPFETSKQRREKEVQSLLDKLPPETISLNPSKIGAVMSVKKKEKKTQKEREAEQESAVEAAKGITQKKKTKGRNRPSKREKKKRETVEKVKRPFL

Query:  HEQINEEEVSRKRPRLSEEVELPKSLQRFA
        HEQI EEE+SRKR RLSEEVELPKSLQRFA
Subjt:  HEQINEEEVSRKRPRLSEEVELPKSLQRFA

A0A5D3BDN6 Putative U3 small nucleolar RNA-associated protein 7 (e-value 4.3e-277, identity 90.94%)
Query:  MEEELGSAVAERILPPTEQEVSNEIDVRIKKYLRGEGANLEVLKDKKLKGQLAVREDLYGKSAKAAAKVEKWLMPSEGGYLEAEGLEKTWRIKQETISHE
        ME+ELG+ VAERILPPTEQE+SNEIDV++KKY+RGEGANLEVLKDKKLKGQL+V EDLYGKSAKAAAKVEKWLMPSEGGYLE EGLEKTWRIKQETISHE
Subjt:  MEEELGSAVAERILPPTEQEVSNEIDVRIKKYLRGEGANLEVLKDKKLKGQLAVREDLYGKSAKAAAKVEKWLMPSEGGYLEAEGLEKTWRIKQETISHE

Query:  VDILSRRNQHDIILPALGPYSLDYTSNGRYMTIAGRKGHLAIVDMKNLNLIKEFQVKETVRDVVFLHNELFFAAAQKKYPYIYNRDGTELHCLKEHGSVL
        VDILSRRNQHDIILPALGPYS+DYTSNGRYM IAGRKGHLA+VDMK+LNLIKEFQVKETVRDVVFLHNELFFAAAQKKYPYIYNR+GTELHCLKEHGSVL
Subjt:  VDILSRRNQHDIILPALGPYSLDYTSNGRYMTIAGRKGHLAIVDMKNLNLIKEFQVKETVRDVVFLHNELFFAAAQKKYPYIYNRDGTELHCLKEHGSVL

Query:  RLQFLKNHFLLASINKFGQLHYQDVTTGSMVSVFRTGLGRTDVMQVNPFNGVIATGHSGGSVVMWKPTSSAPLVKMLCHQGPVSALAFHPNGHLMATSGS
        RLQFLKNHFLL SINKFGQLHYQDVTTGSMV  FRTGLGRTDVMQVNPFNGVIATGHSGGSV MWKPTSSAPLVKMLCH GPVSALAFHPNGHLMATSG+
Subjt:  RLQFLKNHFLLASINKFGQLHYQDVTTGSMVSVFRTGLGRTDVMQVNPFNGVIATGHSGGSVVMWKPTSSAPLVKMLCHQGPVSALAFHPNGHLMATSGS

Query:  ERKIKLWDLRKFEVLQTLPGHAKTLDFSQKGLLAYGTGSFVQVLGDFSGVQNYSRYMAHSMVKGYQIRKILFRPYEDVLGIGHSMGWSSILIPGSGEPNF
        ERKIKLWDLRKFEVLQTLPGHAKTLDFSQKGLLAYGTGSFVQVLGD SG Q+Y+RYMAHSM KGYQI K+LFRPYEDVLGIGHSMGWSSILIPGSGEPNF
Subjt:  ERKIKLWDLRKFEVLQTLPGHAKTLDFSQKGLLAYGTGSFVQVLGDFSGVQNYSRYMAHSMVKGYQIRKILFRPYEDVLGIGHSMGWSSILIPGSGEPNF

Query:  DTWVANPFETSKQRREKEVQSLLDKLPPETISLNPSKIGAVMSVKKKEKKTQKEREAEQESAVEAAKGITQKKKTKGRNRPSKREKKKRETVEKVKRPFL
        DTWVANPFETSKQRREKEV+SLLDKLPPETISLNPSKIG +++VKKKEKKT+KER+AE+E+AV+AAKGIT KKKTKGRN+P+KREKKK E +EK KRPFL
Subjt:  DTWVANPFETSKQRREKEVQSLLDKLPPETISLNPSKIGAVMSVKKKEKKTQKEREAEQESAVEAAKGITQKKKTKGRNRPSKREKKKRETVEKVKRPFL

Query:  HEQINEEEVSRKRPRLSEEVELPKSLQRFA
        HEQI EEE+SRKR RLSEEVELPKSLQRFA
Subjt:  HEQINEEEVSRKRPRLSEEVELPKSLQRFA

A0A6J1BX94 probable U3 small nucleolar RNA-associated protein 7 (e-value 4.9e-281, identity 93.06%)
Query:  MEEELGSAVAERILPPTEQEVSNEIDVRIKKYLRGEGANLEVLKDKKLKGQLAVREDLYGKSAKAAAKVEKWLMPSEGGYLEAEGLEKTWRIKQETISHE
        MEEE G+AVAER+LPPTEQEVS+E+DVRIKKYLRGEGANLEVLKDKKLKGQL+VREDLYGKSAKAAAKVEKWLMPSEGGYLEAEGLEKTWRIKQETISHE
Subjt:  MEEELGSAVAERILPPTEQEVSNEIDVRIKKYLRGEGANLEVLKDKKLKGQLAVREDLYGKSAKAAAKVEKWLMPSEGGYLEAEGLEKTWRIKQETISHE

Query:  VDILSRRNQHDIILPALGPYSLDYTSNGRYMTIAGRKGHLAIVDMKNLNLIKEFQVKETVRDVVFLHNELFFAAAQKKYPYIYNRDGTELHCLKEHGSVL
        VDILSRRNQHDIILPALGPYSLDYTSNGRYMTIAGRKGHLAIVDMKNLNLIKEFQVKETVRDVVFLHNELFFAAAQKKYPYIYNRDGTELHCLKEHGSVL
Subjt:  VDILSRRNQHDIILPALGPYSLDYTSNGRYMTIAGRKGHLAIVDMKNLNLIKEFQVKETVRDVVFLHNELFFAAAQKKYPYIYNRDGTELHCLKEHGSVL

Query:  RLQFLKNHFLLASINKFGQLHYQDVTTGSMVSVFRTGLGRTDVMQVNPFNGVIATGHSGGSVVMWKPTSSAPLVKMLCHQGPVSALAFHPNGHLMATSGS
        RLQFLKNHFLLAS NKFGQLHYQDVTTGSMV VFRTGLGRTDVMQVNPFNGVIATGHSGG+V MWKPTSSAPLVKMLCH+ PVSALAFHPNGHLMATSGS
Subjt:  RLQFLKNHFLLASINKFGQLHYQDVTTGSMVSVFRTGLGRTDVMQVNPFNGVIATGHSGGSVVMWKPTSSAPLVKMLCHQGPVSALAFHPNGHLMATSGS

Query:  ERKIKLWDLRKFEVLQTLPGHAKTLDFSQKGLLAYGTGSFVQVLGDFSGVQNYSRYMAHSMVKGYQIRKILFRPYEDVLGIGHSMGWSSILIPGSGEPNF
        ERKIKLWDLRKFEVLQTLPGHAKTLDFSQKGLLAYGTGS VQ+LGD SG QNYS+YMAHSMVKGYQI K+LFRPYEDVLGIGHSMGWSSILIPGSGEPNF
Subjt:  ERKIKLWDLRKFEVLQTLPGHAKTLDFSQKGLLAYGTGSFVQVLGDFSGVQNYSRYMAHSMVKGYQIRKILFRPYEDVLGIGHSMGWSSILIPGSGEPNF

Query:  DTWVANPFETSKQRREKEVQSLLDKLPPETISLNPSKIGAVMSVKKKEKKTQKEREAEQESAVEAAKGITQKKKTKGRNRPSKREKKKRETVEKVKRPFL
        DTWVANPFETSKQRREKEV+SLLDKLPPETISLNPSKIG VMSVKKKEKK++KEREAE+ESAV+ AKGIT KKKTKGRN+P+KREKKKRE +EK KRPFL
Subjt:  DTWVANPFETSKQRREKEVQSLLDKLPPETISLNPSKIGAVMSVKKKEKKTQKEREAEQESAVEAAKGITQKKKTKGRNRPSKREKKKRETVEKVKRPFL

Query:  HEQI--NEEEVSRKRPRLSEEVELPKSLQRFAR
        HEQI   EEE+SRKRP+LSEEVELPKSLQRFAR
Subjt:  HEQI--NEEEVSRKRPRLSEEVELPKSLQRFAR

A0A6J1FNM3 probable U3 small nucleolar RNA-associated protein 7 (e-value 8.9e-275, identity 90.98%)
Query:  MEEELGSAVAERILPPTEQEVSNEIDVRIKKYLRGEGANLEVLKDKKLKGQLAVREDLYGKSAKAAAKVEKWLMPSEGGYLEAEGLEKTWRIKQETISHE
        MEEELG  VAERILPPTEQEV NE DV+IKKYLRGEGANLEVLKDKKLKGQL+V EDLYGKSAKAAAKVEKWLMPSEGGYLEAEGLEKTWRI+QETISHE
Subjt:  MEEELGSAVAERILPPTEQEVSNEIDVRIKKYLRGEGANLEVLKDKKLKGQLAVREDLYGKSAKAAAKVEKWLMPSEGGYLEAEGLEKTWRIKQETISHE

Query:  VDILSRRNQHDIILPALGPYSLDYTSNGRYMTIAGRKGHLAIVDMKNLNLIKEFQVKETVRDVVFLHNELFFAAAQKKYPYIYNRDGTELHCLKEHGSVL
        VDILSRRNQHDIILPALGPYSLDYT NGRYM IAGRKGHLA+VDMK+LNLIKEFQVKETVRDVVFLHNELFFAAAQKKYPYIYNRDGTELHCLKEHGSVL
Subjt:  VDILSRRNQHDIILPALGPYSLDYTSNGRYMTIAGRKGHLAIVDMKNLNLIKEFQVKETVRDVVFLHNELFFAAAQKKYPYIYNRDGTELHCLKEHGSVL

Query:  RLQFLKNHFLLASINKFGQLHYQDVTTGSMVSVFRTGLGRTDVMQVNPFNGVIATGHSGGSVVMWKPTSSAPLVKMLCHQGPVSALAFHPNGHLMATSGS
        RLQFLKNHFLLASINKFGQLHYQDVTTGSM  VFRTGLGRTDVMQVNPFNGVIATGHSGGSVVMWKPTSS+PLVKMLCHQGPVSALAFHPNGHLMATSG 
Subjt:  RLQFLKNHFLLASINKFGQLHYQDVTTGSMVSVFRTGLGRTDVMQVNPFNGVIATGHSGGSVVMWKPTSSAPLVKMLCHQGPVSALAFHPNGHLMATSGS

Query:  ERKIKLWDLRKFEVLQTLPGHAKTLDFSQKGLLAYGTGSFVQVLGDFSGVQNYSRYMAHSMVKGYQIRKILFRPYEDVLGIGHSMGWSSILIPGSGEPNF
        ERKIKLWDLRKFEVLQTLPGHAKTLDFSQKGLLA GTGS+VQ+LGD +G QNY+RYM+HSMVKGYQI KILFRPYEDVLGIGHSMGWSSIL+PGSGEPNF
Subjt:  ERKIKLWDLRKFEVLQTLPGHAKTLDFSQKGLLAYGTGSFVQVLGDFSGVQNYSRYMAHSMVKGYQIRKILFRPYEDVLGIGHSMGWSSILIPGSGEPNF

Query:  DTWVANPFETSKQRREKEVQSLLDKLPPETISLNPSKIGAVMSVKKKEKKTQKEREAEQESAVEAAKGITQKKKTKGRNRPSKREKKKRETVEKVKRPFL
        DTWVANPFETSKQRREKEV+SLLDKLPPETI+LNPSKIG VM+VKKKEKKT+KEREAE+ESA++AAK IT KKKTKGRN+P+KREKKKRE ++K K+PFL
Subjt:  DTWVANPFETSKQRREKEVQSLLDKLPPETISLNPSKIGAVMSVKKKEKKTQKEREAEQESAVEAAKGITQKKKTKGRNRPSKREKKKRETVEKVKRPFL

Query:  HEQI-NEEEVSRKRPRLSEEVELPKSLQRFAR
         EQI  EEE+SRKRPR SEEVELPKSLQRFAR
Subjt:  HEQI-NEEEVSRKRPRLSEEVELPKSLQRFAR

SwissProt top hits (e-value, % identity, alignment)
P08461 Dihydrolipoyllysine-residue acetyltransferase component of pyruvate dehydrogenase complex, mitochondrial (e-value 1.8e-102, identity 41.57%)
Query:  SLCRFNASGIRLSPAFVLFSELASVLSLRLPLPPPDTFLDSSHAVLEMPALSPTMNQGNIAKWRKKEGDKIAVGDVLCEIETDKATLEFESLEEGFLAKI
        SLC ++     +    +L   L S       LPP        H  + +P+LSPTM  G IA+W KKEG+KI+ GD++ E+ETDKAT+ FESLEE ++AKI
Subjt:  SLCRFNASGIRLSPAFVLFSELASVLSLRLPLPPPDTFLDSSHAVLEMPALSPTMNQGNIAKWRKKEGDKIAVGDVLCEIETDKATLEFESLEEGFLAKI

Query:  LVPEGSKDVPVGQPIAITVEDPDDIHRVLANGVSGASDIKDDAIGDQKDKNEDRAQASSVEINTSKLPPHFILEMPALSPTMNQGNIANWRKKEGDKIEV
        LVPEG++DVPVG  I ITVE P DI     N    ++     A             A S     S  P H  + +PALSPTM  G +  W KK G+K+  
Subjt:  LVPEGSKDVPVGQPIAITVEDPDDIHRVLANGVSGASDIKDDAIGDQKDKNEDRAQASSVEINTSKLPPHFILEMPALSPTMNQGNIANWRKKEGDKIEV

Query:  GDVICEIETDKATLEFESLEEGYLAKILAPEGSKDVAVGKPIAITVEDPADIES---VKSAVSSSLGTKEDKPA-----------QSTIRNDAGTSKGSV
        GD++ EIETDKAT+ FE  EEGYLAKIL PEG++DV +G P+ I VE   DI +    +    +SL  +   P            Q      +    G  
Subjt:  GDVICEIETDKATLEFESLEEGYLAKILAPEGSKDVAVGKPIAITVEDPADIES---VKSAVSSSLGTKEDKPA-----------QSTIRNDAGTSKGSV

Query:  AR--ISPAAKLLIAEHGLDASSLKASGSHGTLLKGDVLAAIKSGKGLSEVSLSKEKRSPEVHAQASSTASSESKSPIKQSDSFEDLPNSQIRKVIAKRLL
         R  +SP AK L AE G+D + +K +G  G ++K D+ + + +            K +P     A++ A    +     +  F D+P S IR+VIA+RL+
Subjt:  AR--ISPAAKLLIAEHGLDASSLKASGSHGTLLKGDVLAAIKSGKGLSEVSLSKEKRSPEVHAQASSTASSESKSPIKQSDSFEDLPNSQIRKVIAKRLL

Query:  ESKQNTPHLYLSTDVILDPLLSLRKDLKE--KHDVKVSVNDIVIKAVAVALRNVPGANAYWDN---EKGEVVFCDSIDISIAVATEKGLMTPIVRNADQK
        +SKQ  PH YLS DV +  +L +RK+L +  +   K+SVND +IKA A+A   VP AN+ W +    +  VV     D+S+AV+T  GL+TPIV NA  K
Subjt:  ESKQNTPHLYLSTDVILDPLLSLRKDLKE--KHDVKVSVNDIVIKAVAVALRNVPGANAYWDN---EKGEVVFCDSIDISIAVATEKGLMTPIVRNADQK

Query:  TVSMISSEVKELAEKARAGKLKPDEFQGGTFSISNLGMFPVDHFCAIINPPQAGILAVGRGNKVVEPVIGTDGIERPVVLNKMNLTLSADHRVFDGKVG
         +  I+S+V  LA KAR GKL+P EFQGGTF+ISNLGMF + +F AIINPPQA ILA+G     + P     G +   V + M++TLS DHRV DG VG
Subjt:  TVSMISSEVKELAEKARAGKLKPDEFQGGTFSISNLGMFPVDHFCAIINPPQAGILAVGRGNKVVEPVIGTDGIERPVVLNKMNLTLSADHRVFDGKVG

P10515 Dihydrolipoyllysine-residue acetyltransferase component of pyruvate dehydrogenase complex, mitochondrial (e-value 5.1e-102, identity 42.81%)
Query:  HAVLEMPALSPTMNQGNIAKWRKKEGDKIAVGDVLCEIETDKATLEFESLEEGFLAKILVPEGSKDVPVGQPIAITVEDPDDIHRVLANGVSGASDIKDD
        H  + +P+LSPTM  G IA+W KKEGDKI  GD++ E+ETDKAT+ FESLEE ++AKILV EG++DVP+G  I ITV  P+DI       +  ++     
Subjt:  HAVLEMPALSPTMNQGNIAKWRKKEGDKIAVGDVLCEIETDKATLEFESLEEGFLAKILVPEGSKDVPVGQPIAITVEDPDDIHRVLANGVSGASDIKDD

Query:  AIGDQKDKNEDRAQASSVEINTSKLPPHFILEMPALSPTMNQGNIANWRKKEGDKIEVGDVICEIETDKATLEFESLEEGYLAKILAPEGSKDVAVGKPI
        A               S +   S  PPH  + +PALSPTM  G +  W KK G+K+  GD++ EIETDKAT+ FE  EEGYLAKIL PEG++DV +G P+
Subjt:  AIGDQKDKNEDRAQASSVEINTSKLPPHFILEMPALSPTMNQGNIANWRKKEGDKIEVGDVICEIETDKATLEFESLEEGYLAKILAPEGSKDVAVGKPI

Query:  AITVEDPADIESVK----------------------SAVSSSLGTKEDKPAQSTIRNDAGTSKGSVARISPAAKLLIAEHGLDASSLKASGSHGTLLKGD
         I VE  ADI +                        +AV  +       P+       AG  KG V  +SP AK L  E G+D + +K +G  G + K D
Subjt:  AITVEDPADIESVK----------------------SAVSSSLGTKEDKPAQSTIRNDAGTSKGSVARISPAAKLLIAEHGLDASSLKASGSHGTLLKGD

Query:  VLAAIKSGKGLSEVSLSKEKRSPEVHAQASSTASSESKSPIKQSDSFEDLPNSQIRKVIAKRLLESKQNTPHLYLSTDVILDPLLSLRKDLKE--KHDVK
        + + + S            K +P   A    T    +  P   +  F D+P S IR+VIA+RL++SKQ  PH YLS DV +  +L +RK+L +  +   K
Subjt:  VLAAIKSGKGLSEVSLSKEKRSPEVHAQASSTASSESKSPIKQSDSFEDLPNSQIRKVIAKRLLESKQNTPHLYLSTDVILDPLLSLRKDLKE--KHDVK

Query:  VSVNDIVIKAVAVALRNVPGANAYWDN---EKGEVVFCDSIDISIAVATEKGLMTPIVRNADQKTVSMISSEVKELAEKARAGKLKPDEFQGGTFSISNL
        +SVND +IKA A+A   VP AN+ W +    +  VV     D+S+AV+T  GL+TPIV NA  K V  I+++V  LA KAR GKL+P EFQGGTF+ISNL
Subjt:  VSVNDIVIKAVAVALRNVPGANAYWDN---EKGEVVFCDSIDISIAVATEKGLMTPIVRNADQKTVSMISSEVKELAEKARAGKLKPDEFQGGTFSISNL

Query:  GMFPVDHFCAIINPPQAGILAVGRGNKVVEPVIGTDGIERPVVLNKMNLTLSADHRVFDGKVG
        GMF + +F AIINPPQA ILA+G     + P     G +   V + M++TLS DHRV DG VG
Subjt:  GMFPVDHFCAIINPPQAGILAVGRGNKVVEPVIGTDGIERPVVLNKMNLTLSADHRVFDGKVG

P36413 Dihydrolipoyllysine-residue acetyltransferase component of pyruvate dehydrogenase complex, mitochondrial (e-value 1.5e-101, identity 40.58%)
Query:  LEMPALSPTMNQGNIAKWRKKEGDKIAVGDVLCEIETDKATLEFESLE-EGFLAKILVPEGSKDVPVGQPIAITVEDPDDIHRVLANGVSGASDIKDDAI
        + MPALSP+M +GNI +W+KKEGD+I  GDV+ E+ETDKAT++F+  +  G+LAKIL+PEG+K + + +PIAI V   +DI   + N    +S      +
Subjt:  LEMPALSPTMNQGNIAKWRKKEGDKIAVGDVLCEIETDKATLEFESLE-EGFLAKILVPEGSKDVPVGQPIAITVEDPDDIHRVLANGVSGASDIKDDAI

Query:  GDQKDKNEDRAQASSVEINTSKLPPHFILEMPALSPTMNQGNIANWRKKEGDKIEVGDVICEIETDKATLEFESLE-EGYLAKILAPEGSKDVAVGKPIA
         ++  K +  A   S    T   P H ++ MPALSP+M  G IA+W KKEGD+I+ GD I E+ETDKAT++F+  +  GYLAKIL P G+  + + +P+ 
Subjt:  GDQKDKNEDRAQASSVEINTSKLPPHFILEMPALSPTMNQGNIANWRKKEGDKIEVGDVICEIETDKATLEFESLE-EGYLAKILAPEGSKDVAVGKPIA

Query:  ITVEDP------ADIESVKSAVSSSLGTKEDKPAQSTIRNDAGTSKGSVAR-----------ISPAAKLLIAEHGLDASSLKASGSHGTLLKGDVLAAIK
        I V++       AD    + + SSS  ++E  P+ S+  +   T   S ++            +PAA+   +  G D S++  +G +  +LK DVL  + 
Subjt:  ITVEDP------ADIESVKSAVSSSLGTKEDKPAQSTIRNDAGTSKGSVAR-----------ISPAAKLLIAEHGLDASSLKASGSHGTLLKGDVLAAIK

Query:  SGKGLSEVSLSKEKRSPEVHAQASSTASSESKSPIKQSDSFEDLPNSQIRKVIAKRLLESKQNTPHLYLSTDVILDPLLSLRKDLKEKHDVKVSVNDIVI
                    +K+      Q  +T +++  +    S  F D+P+S IRKV A RL ESKQ  PH YL+ +  +D LL LR +L   + VK+SVND ++
Subjt:  SGKGLSEVSLSKEKRSPEVHAQASSTASSESKSPIKQSDSFEDLPNSQIRKVIAKRLLESKQNTPHLYLSTDVILDPLLSLRKDLKEKHDVKVSVNDIVI

Query:  KAVAVALRNVPGANAYWDNEKGEVVFCDSIDISIAVATEKGLMTPIVRNADQKTVSMISSEVKELAEKARAGKLKPDEFQGGTFSISNLGMFPVDHFCAI
        KA A ALR+ P  N+ W ++   +    +IDI++AV T +GL TPIVR  D K ++ IS+ VK+LAEKA+ GKL P EF+ GTF+ISNLGM  +  F A+
Subjt:  KAVAVALRNVPGANAYWDNEKGEVVFCDSIDISIAVATEKGLMTPIVRNADQKTVSMISSEVKELAEKARAGKLKPDEFQGGTFSISNLGMFPVDHFCAI

Query:  INPPQAGILAVGRGNKVVEPVIGTDGIERPVVLNKMNLTLSADHRVFDGKVG
        INPPQA ILAVG     V  V+             +++TLS DHRV DG VG
Subjt:  INPPQAGILAVGRGNKVVEPVIGTDGIERPVVLNKMNLTLSADHRVFDGKVG

Q0WQF7 Dihydrolipoyllysine-residue acetyltransferase component 1 of pyruvate dehydrogenase complex, mitochondrial (e-value 3.8e-198, identity 69.13%)
Query:  SHAVLEMPALSPTMNQGNIAKWRKKEGDKIAVGDVLCEIETDKATLEFESLEEGFLAKILVPEGSKDVPVGQPIAITVEDPDDIHRVLANGVSGASDIKD
        S  VL MPALSPTM+ GN+ KW KKEGDK+ VGDVLCEIETDKAT+EFES EEGFLAKILV EGSKD+PV +PIAI VE+ DDI  V A  + G  D K+
Subjt:  SHAVLEMPALSPTMNQGNIAKWRKKEGDKIAVGDVLCEIETDKATLEFESLEEGFLAKILVPEGSKDVPVGQPIAITVEDPDDIHRVLANGVSGASDIKD

Query:  DAIGDQKDK-NEDRAQASSVEINTSKLPPHFILEMPALSPTMNQGNIANWRKKEGDKIEVGDVICEIETDKATLEFESLEEGYLAKILAPEGSKDVAVGK
        +    Q  K +E   Q SS++ + S LPPH +LEMPALSPTMNQGNIA W KKEGDKIEVGDVI EIETDKATLEFESLEEGYLAKIL PEGSKDVAVGK
Subjt:  DAIGDQKDK-NEDRAQASSVEINTSKLPPHFILEMPALSPTMNQGNIANWRKKEGDKIEVGDVICEIETDKATLEFESLEEGYLAKILAPEGSKDVAVGK

Query:  PIAITVEDPADIESVKSAVSSSLGTKEDKPAQSTIRNDAGTSKGSVARISPAAKLLIAEHGLDASSLKASGSHGTLLKGDVLAAIKSGKGLSEVSLSKEK
        PIA+ VED   IE++KS+ + S      K    ++ +     K    +ISPAAKLLI EHGL+ASS++ASG +GTLLK DV+AAI SGK  S+ S S +K
Subjt:  PIAITVEDPADIESVKSAVSSSLGTKEDKPAQSTIRNDAGTSKGSVARISPAAKLLIAEHGLDASSLKASGSHGTLLKGDVLAAIKSGKGLSEVSLSKEK

Query:  RSPEVHAQASSTASSESKSPIKQSD-SFEDLPNSQIRKVIAKRLLESKQNTPHLYLSTDVILDPLLSLRKDLKEKHDVKVSVNDIVIKAVAVALRNVPGA
        + P    +  S +SS SK  + QSD ++ED PNSQIRK+IAKRLLESKQ  PHLYL +DV+LDPLL+ RK+L+E H VKVSVNDIVIKAVAVALRNV  A
Subjt:  RSPEVHAQASSTASSESKSPIKQSD-SFEDLPNSQIRKVIAKRLLESKQNTPHLYLSTDVILDPLLSLRKDLKEKHDVKVSVNDIVIKAVAVALRNVPGA

Query:  NAYWDNEKGEVVFCDSIDISIAVATEKGLMTPIVRNADQKTVSMISSEVKELAEKARAGKLKPDEFQGGTFSISNLGMFPVDHFCAIINPPQAGILAVGR
        NA+WD EKG++V CDS+DISIAVATEKGLMTPI++NADQK++S IS EVKELA+KAR+GKL P EFQGGTFSISNLGM+PVD+FCAIINPPQAGILAVGR
Subjt:  NAYWDNEKGEVVFCDSIDISIAVATEKGLMTPIVRNADQKTVSMISSEVKELAEKARAGKLKPDEFQGGTFSISNLGMFPVDHFCAIINPPQAGILAVGR

Query:  GNKVVEPVIGTDGIERPVVLNKMNLTLSADHRVFDGKVGDS
        GNKVVEPVIG DGIE+P V+ KMN+TLSADHR+FDG+VG S
Subjt:  GNKVVEPVIGTDGIERPVVLNKMNLTLSADHRVFDGKVGDS

Q8BMF4 Dihydrolipoyllysine-residue acetyltransferase component of pyruvate dehydrogenase complex, mitochondrial (e-value 1.4e-104, identity 42.4%)
Query:  SLCRFNASGIRLSPAFVLFSELASVLSLRLPLPPPDTFLDSSHAVLEMPALSPTMNQGNIAKWRKKEGDKIAVGDVLCEIETDKATLEFESLEEGFLAKI
        SLC + +SG    P   L  +L    S R    PP       H  + +P+LSPTM  G IA+W KKEG+KI+ GD++ E+ETDKAT+ FESLEE ++AKI
Subjt:  SLCRFNASGIRLSPAFVLFSELASVLSLRLPLPPPDTFLDSSHAVLEMPALSPTMNQGNIAKWRKKEGDKIAVGDVLCEIETDKATLEFESLEEGFLAKI

Query:  LVPEGSKDVPVGQPIAITVEDPDDIHRVLANGVSGASDIKDDAIGDQKDKNEDRAQASSVEINTSKLPPHFILEMPALSPTMNQGNIANWRKKEGDKIEV
        LVPEG++DVPVG  I ITVE P DI       +  A+     A             A S     S  P H  + +PALSPTM  G +  W KK G+K+  
Subjt:  LVPEGSKDVPVGQPIAITVEDPDDIHRVLANGVSGASDIKDDAIGDQKDKNEDRAQASSVEINTSKLPPHFILEMPALSPTMNQGNIANWRKKEGDKIEV

Query:  GDVICEIETDKATLEFESLEEGYLAKILAPEGSKDVAVGKPIAITVEDPADIES---VKSAVSSSLGTKEDKPA-----------QSTIRNDAGTSKGSV
        GD++ EIETDKAT+ FE  EEGYLAKIL PEG++DV +G P+ I VE   DI +    +    +SL  +   PA           Q      +    G  
Subjt:  GDVICEIETDKATLEFESLEEGYLAKILAPEGSKDVAVGKPIAITVEDPADIES---VKSAVSSSLGTKEDKPA-----------QSTIRNDAGTSKGSV

Query:  AR--ISPAAKLLIAEHGLDASSLKASGSHGTLLKGDVLAAIKSGKGLSEVSLSKEKRSPEVHAQASSTASSESKSPIKQSDSFEDLPNSQIRKVIAKRLL
         R  +SP AK L AE G+D + +K +G  G ++K D+ + + S            K +P   A  +      + +P   +  F D+P S IR+VIA+RL+
Subjt:  AR--ISPAAKLLIAEHGLDASSLKASGSHGTLLKGDVLAAIKSGKGLSEVSLSKEKRSPEVHAQASSTASSESKSPIKQSDSFEDLPNSQIRKVIAKRLL

Query:  ESKQNTPHLYLSTDVILDPLLSLRKDLKE--KHDVKVSVNDIVIKAVAVALRNVPGANAYWDN---EKGEVVFCDSIDISIAVATEKGLMTPIVRNADQK
        +SKQ  PH YLS DV +  +L +RK+L +  +   K+SVND +IKA A+A   VP AN+ W +    +  VV     D+S+AV+T  GL+TPIV NA  K
Subjt:  ESKQNTPHLYLSTDVILDPLLSLRKDLKE--KHDVKVSVNDIVIKAVAVALRNVPGANAYWDN---EKGEVVFCDSIDISIAVATEKGLMTPIVRNADQK

Query:  TVSMISSEVKELAEKARAGKLKPDEFQGGTFSISNLGMFPVDHFCAIINPPQAGILAVGRGNKVVEPVIGTDGIERPVVLNKMNLTLSADHRVFDGKVG
         +  I+S+V  LA KAR GKL+P EFQGGTF+ISNLGMF + +F AIINPPQA ILA+G     + P     G +   V + M++TLS DHRV DG VG
Subjt:  TVSMISSEVKELAEKARAGKLKPDEFQGGTFSISNLGMFPVDHFCAIINPPQAGILAVGRGNKVVEPVIGTDGIERPVVLNKMNLTLSADHRVFDGKVG

Arabidopsis top hits (e-value, % identity, alignment)
AT3G10530.1 Transducin/WD40 repeat-like superfamily protein (e-value 1.5e-213, identity 71.56%)
Query:  ERILPPTEQEVSNEIDVRIKKYLRGEGANLEVLKDKKLKGQLAVREDLYGKSAKAAAKVEKWLMPSEGGYLEAEGLEKTWRIKQETISHEVDILSRRNQH
        E++LPP EQE   E++ ++KKYLRGEGANLE LKDKKLK QLA RE LYGKSAKAAAK+EKWL+P+E GYLE EGLEKTWR+KQ  I++EVDILS RNQ+
Subjt:  ERILPPTEQEVSNEIDVRIKKYLRGEGANLEVLKDKKLKGQLAVREDLYGKSAKAAAKVEKWLMPSEGGYLEAEGLEKTWRIKQETISHEVDILSRRNQH

Query:  DIILPALGPYSLDYTSNGRYMTIAGRKGHLAIVDMKNLNLIKEFQVKETVRDVVFLHNELFFAAAQKKYPYIYNRDGTELHCLKEHGSVLRLQFLKNHFL
        DI+LP  GPY LD+T++GR+M   GRKGHLA++DM N++LIKE QV+ETVRDV FLHN+ FFAAAQKKY YIY RDGTELHCLKE G V RL+FLKNHFL
Subjt:  DIILPALGPYSLDYTSNGRYMTIAGRKGHLAIVDMKNLNLIKEFQVKETVRDVVFLHNELFFAAAQKKYPYIYNRDGTELHCLKEHGSVLRLQFLKNHFL

Query:  LASINKFGQLHYQDVTTGSMVSVFRTGLGRTDVMQVNPFNGVIATGHSGGSVVMWKPTSSAPLVKMLCHQGPVSALAFHPNGHLMATSGSERKIKLWDLR
        LAS+N  GQLHYQDVT G MV+  RTG GRTDVM+VNP+N V+  GHSGG+V MWKPTS APLV+M CH GPVS++AFHPNGHLMATSG ERKIK+WDLR
Subjt:  LASINKFGQLHYQDVTTGSMVSVFRTGLGRTDVMQVNPFNGVIATGHSGGSVVMWKPTSSAPLVKMLCHQGPVSALAFHPNGHLMATSGSERKIKLWDLR

Query:  KFEVLQTLPG-HAKTLDFSQKGLLAYGTGSFVQVLGDFSG--VQNYSRYMAHSMVKGYQIRKILFRPYEDVLGIGHSMGWSSILIPGSGEPNFDTWVANP
        KFE +QT+   HAKTL FSQKGLLA GTGSFVQ+LGD SG    NY+RYM HSMVKGYQI K++FRPYEDV+GIGHSMGWSSILIPGSGEPNFD+WVANP
Subjt:  KFEVLQTLPG-HAKTLDFSQKGLLAYGTGSFVQVLGDFSG--VQNYSRYMAHSMVKGYQIRKILFRPYEDVLGIGHSMGWSSILIPGSGEPNFDTWVANP

Query:  FETSKQRREKEVQSLLDKLPPETISLNPSKIGAVMSVKKKEKKTQKEREAEQESAVEAAKGITQKKKTKGRNRPSKREKKKRETVEKVKRPFLHEQINEE
        FETSKQRREKEV SLLDKLPPETI L+PSKIGA+   ++KEK ++ E EAE+E A+EAAK    K KTKGRN+PSKR KKK+E VE  KR F  EQ +  
Subjt:  FETSKQRREKEVQSLLDKLPPETISLNPSKIGAVMSVKKKEKKTQKEREAEQESAVEAAKGITQKKKTKGRNRPSKREKKKRETVEKVKRPFLHEQINEE

Query:  EVSRKRPRLSEEVELPKSLQRFAR
         + ++R       ELP SL+RFAR
Subjt:  EVSRKRPRLSEEVELPKSLQRFAR

AT3G13930.1 Dihydrolipoamide acetyltransferase, long form protein; e value: 7.2e-83; % identity: 43.86
Query:  NTSKLPPHFILEMPALSPTMNQGNIANWRKKEGDKIEVGDVICEIETDKATLEFESLEEGYLAKILAPEGSKDVAVGKPIAITVEDPADIESVKSAVSSS
        ++S LPPH  + MP+LSPTM +GNIA W KKEGDK+  G+V+CE+ETDKAT+E E +EEG+LAKI+  EG+K++ VG+ IAITVED  DI+  K    SS
Subjt:  NTSKLPPHFILEMPALSPTMNQGNIANWRKKEGDKIEVGDVICEIETDKATLEFESLEEGYLAKILAPEGSKDVAVGKPIAITVEDPADIESVKSAVSSS

Query:  ---LGTKEDKPAQSTIRN----------DAGTSKGSVA----RI--SPAAKLLIAEHGLDASSLKASGSHGTLLKGDVLAAIKSGKGLSEVSLSKEKRSP
               E KPA S  +           +A  SK S A    RI  SP A+ L  ++ +  SS+K +G  G ++K DV   + SG        SKE    
Subjt:  ---LGTKEDKPAQSTIRN----------DAGTSKGSVA----RI--SPAAKLLIAEHGLDASSLKASGSHGTLLKGDVLAAIKSGKGLSEVSLSKEKRSP

Query:  EVHAQASSTASSESKSPIKQSDSFEDLPNSQIRKVIAKRLLESKQNTPHLYLSTDVILDPLLSLRKDLKEKHDV----KVSVNDIVIKAVAVALRNVPGA
             A  +   +SK P   +  + D+P++QIRKV A RL  SKQ  PH YL+ D  +D ++ LR  L    +     ++SVND+VIKA A+ALR VP  
Subjt:  EVHAQASSTASSESKSPIKQSDSFEDLPNSQIRKVIAKRLLESKQNTPHLYLSTDVILDPLLSLRKDLKEKHDV----KVSVNDIVIKAVAVALRNVPGA

Query:  NAYWDNEKGEVVFCDSIDISIAVATEKGLMTPIVRNADQKTVSMISSEVKELAEKARAGKLKPDEFQGGTFSISNL-GMFPVDHFCAIINPPQAGILAVG
        N+ W +E   +    +++I++AV TE GL  P+V++AD+K +S I  EV+ LA+KA+   LKP++++GGTF++SNL G F +  FCA+INPPQA ILA+G
Subjt:  NAYWDNEKGEVVFCDSIDISIAVATEKGLMTPIVRNADQKTVSMISSEVKELAEKARAGKLKPDEFQGGTFSISNL-GMFPVDHFCAIINPPQAGILAVG

Query:  RGNKVVEPVIGTDGIERPVVLNKMNLTLSADHRVFDGKVG
           K V P  G D      V + M++TLS DHRV DG +G
Subjt:  RGNKVVEPVIGTDGIERPVVLNKMNLTLSADHRVFDGKVG

AT3G52200.1 Dihydrolipoamide acetyltransferase, long form protein; e value: 2.7e-199; % identity: 69.13
Query:  SHAVLEMPALSPTMNQGNIAKWRKKEGDKIAVGDVLCEIETDKATLEFESLEEGFLAKILVPEGSKDVPVGQPIAITVEDPDDIHRVLANGVSGASDIKD
        S  VL MPALSPTM+ GN+ KW KKEGDK+ VGDVLCEIETDKAT+EFES EEGFLAKILV EGSKD+PV +PIAI VE+ DDI  V A  + G  D K+
Subjt:  SHAVLEMPALSPTMNQGNIAKWRKKEGDKIAVGDVLCEIETDKATLEFESLEEGFLAKILVPEGSKDVPVGQPIAITVEDPDDIHRVLANGVSGASDIKD

Query:  DAIGDQKDK-NEDRAQASSVEINTSKLPPHFILEMPALSPTMNQGNIANWRKKEGDKIEVGDVICEIETDKATLEFESLEEGYLAKILAPEGSKDVAVGK
        +    Q  K +E   Q SS++ + S LPPH +LEMPALSPTMNQGNIA W KKEGDKIEVGDVI EIETDKATLEFESLEEGYLAKIL PEGSKDVAVGK
Subjt:  DAIGDQKDK-NEDRAQASSVEINTSKLPPHFILEMPALSPTMNQGNIANWRKKEGDKIEVGDVICEIETDKATLEFESLEEGYLAKILAPEGSKDVAVGK

Query:  PIAITVEDPADIESVKSAVSSSLGTKEDKPAQSTIRNDAGTSKGSVARISPAAKLLIAEHGLDASSLKASGSHGTLLKGDVLAAIKSGKGLSEVSLSKEK
        PIA+ VED   IE++KS+ + S      K    ++ +     K    +ISPAAKLLI EHGL+ASS++ASG +GTLLK DV+AAI SGK  S+ S S +K
Subjt:  PIAITVEDPADIESVKSAVSSSLGTKEDKPAQSTIRNDAGTSKGSVARISPAAKLLIAEHGLDASSLKASGSHGTLLKGDVLAAIKSGKGLSEVSLSKEK

Query:  RSPEVHAQASSTASSESKSPIKQSD-SFEDLPNSQIRKVIAKRLLESKQNTPHLYLSTDVILDPLLSLRKDLKEKHDVKVSVNDIVIKAVAVALRNVPGA
        + P    +  S +SS SK  + QSD ++ED PNSQIRK+IAKRLLESKQ  PHLYL +DV+LDPLL+ RK+L+E H VKVSVNDIVIKAVAVALRNV  A
Subjt:  RSPEVHAQASSTASSESKSPIKQSD-SFEDLPNSQIRKVIAKRLLESKQNTPHLYLSTDVILDPLLSLRKDLKEKHDVKVSVNDIVIKAVAVALRNVPGA

Query:  NAYWDNEKGEVVFCDSIDISIAVATEKGLMTPIVRNADQKTVSMISSEVKELAEKARAGKLKPDEFQGGTFSISNLGMFPVDHFCAIINPPQAGILAVGR
        NA+WD EKG++V CDS+DISIAVATEKGLMTPI++NADQK++S IS EVKELA+KAR+GKL P EFQGGTFSISNLGM+PVD+FCAIINPPQAGILAVGR
Subjt:  NAYWDNEKGEVVFCDSIDISIAVATEKGLMTPIVRNADQKTVSMISSEVKELAEKARAGKLKPDEFQGGTFSISNLGMFPVDHFCAIINPPQAGILAVGR

Query:  GNKVVEPVIGTDGIERPVVLNKMNLTLSADHRVFDGKVGDS
        GNKVVEPVIG DGIE+P V+ KMN+TLSADHR+FDG+VG S
Subjt:  GNKVVEPVIGTDGIERPVVLNKMNLTLSADHRVFDGKVGDS

AT3G52200.2 Dihydrolipoamide acetyltransferase, long form protein; e value: 2.7e-199; % identity: 69.13
Query:  SHAVLEMPALSPTMNQGNIAKWRKKEGDKIAVGDVLCEIETDKATLEFESLEEGFLAKILVPEGSKDVPVGQPIAITVEDPDDIHRVLANGVSGASDIKD
        S  VL MPALSPTM+ GN+ KW KKEGDK+ VGDVLCEIETDKAT+EFES EEGFLAKILV EGSKD+PV +PIAI VE+ DDI  V A  + G  D K+
Subjt:  SHAVLEMPALSPTMNQGNIAKWRKKEGDKIAVGDVLCEIETDKATLEFESLEEGFLAKILVPEGSKDVPVGQPIAITVEDPDDIHRVLANGVSGASDIKD

Query:  DAIGDQKDK-NEDRAQASSVEINTSKLPPHFILEMPALSPTMNQGNIANWRKKEGDKIEVGDVICEIETDKATLEFESLEEGYLAKILAPEGSKDVAVGK
        +    Q  K +E   Q SS++ + S LPPH +LEMPALSPTMNQGNIA W KKEGDKIEVGDVI EIETDKATLEFESLEEGYLAKIL PEGSKDVAVGK
Subjt:  DAIGDQKDK-NEDRAQASSVEINTSKLPPHFILEMPALSPTMNQGNIANWRKKEGDKIEVGDVICEIETDKATLEFESLEEGYLAKILAPEGSKDVAVGK

Query:  PIAITVEDPADIESVKSAVSSSLGTKEDKPAQSTIRNDAGTSKGSVARISPAAKLLIAEHGLDASSLKASGSHGTLLKGDVLAAIKSGKGLSEVSLSKEK
        PIA+ VED   IE++KS+ + S      K    ++ +     K    +ISPAAKLLI EHGL+ASS++ASG +GTLLK DV+AAI SGK  S+ S S +K
Subjt:  PIAITVEDPADIESVKSAVSSSLGTKEDKPAQSTIRNDAGTSKGSVARISPAAKLLIAEHGLDASSLKASGSHGTLLKGDVLAAIKSGKGLSEVSLSKEK

Query:  RSPEVHAQASSTASSESKSPIKQSD-SFEDLPNSQIRKVIAKRLLESKQNTPHLYLSTDVILDPLLSLRKDLKEKHDVKVSVNDIVIKAVAVALRNVPGA
        + P    +  S +SS SK  + QSD ++ED PNSQIRK+IAKRLLESKQ  PHLYL +DV+LDPLL+ RK+L+E H VKVSVNDIVIKAVAVALRNV  A
Subjt:  RSPEVHAQASSTASSESKSPIKQSD-SFEDLPNSQIRKVIAKRLLESKQNTPHLYLSTDVILDPLLSLRKDLKEKHDVKVSVNDIVIKAVAVALRNVPGA

Query:  NAYWDNEKGEVVFCDSIDISIAVATEKGLMTPIVRNADQKTVSMISSEVKELAEKARAGKLKPDEFQGGTFSISNLGMFPVDHFCAIINPPQAGILAVGR
        NA+WD EKG++V CDS+DISIAVATEKGLMTPI++NADQK++S IS EVKELA+KAR+GKL P EFQGGTFSISNLGM+PVD+FCAIINPPQAGILAVGR
Subjt:  NAYWDNEKGEVVFCDSIDISIAVATEKGLMTPIVRNADQKTVSMISSEVKELAEKARAGKLKPDEFQGGTFSISNLGMFPVDHFCAIINPPQAGILAVGR

Query:  GNKVVEPVIGTDGIERPVVLNKMNLTLSADHRVFDGKVGDS
        GNKVVEPVIG DGIE+P V+ KMN+TLSADHR+FDG+VG S
Subjt:  GNKVVEPVIGTDGIERPVVLNKMNLTLSADHRVFDGKVGDS

AT5G04490.1 vitamin E pathway gene 5; e value: 4.3e-88; % identity: 67.8
Query:  LPSPRAASVDLLHDAGATAVVLAGAHSLVLGFDNLARRNLIQQNLSRKLVHILSGLLFMISWPIFSTSTEARYFASLVPTLNCLRLVVHGLSLTKDEGLL
        L S   A+  LLHD GAT  VL GA++LVL F++L +RN+IQQ+LSRKLVHILSGLLF+++WPIFS STEARYFA+ VP +N LRLV++GLS++ +  L+
Subjt:  LPSPRAASVDLLHDAGATAVVLAGAHSLVLGFDNLARRNLIQQNLSRKLVHILSGLLFMISWPIFSTSTEARYFASLVPTLNCLRLVVHGLSLTKDEGLL

Query:  NSLTREGKPEELLRGPLYYVLMLILSAVVFWRESPVGLISLAMMCGGDGIADIMGRRFGSKRLPYNQGKSWIGSTSMFIFGFGVSIGMLYYYSALGYLDL
         S+TREG+ EELL+GPL+YVL L+ SAV FWRESP+G+ISLAMMCGGDGIADIMGR+FGS ++PYN  KSW GS SMFIFGF +SI +LYYYS+LGYL +
Subjt:  NSLTREGKPEELLRGPLYYVLMLILSAVVFWRESPVGLISLAMMCGGDGIADIMGRRFGSKRLPYNQGKSWIGSTSMFIFGFGVSIGMLYYYSALGYLDL

Query:  DWGRTIQKVALVSLVATVVESLPIANVVDDNISVPL
        +W  T+Q+VA+VS+VATVVESLPI + +DDNISVPL
Subjt:  DWGRTIQKVALVSLVATVVESLPIANVVDDNISVPL


Sequences
CDS sequence
ATGGAGGAGGAACTAGGAAGTGCGGTTGCTGAAAGGATTCTGCCTCCGACTGAACAGGAGGTCTCTAATGAGATAGATGTGAGAATAAAAAAGTATCTAAGAGGAGAAGG
TGCTAATCTGGAGGTTCTGAAAGATAAAAAGTTGAAGGGTCAACTTGCTGTTAGAGAAGATTTATATGGGAAATCTGCTAAAGCTGCTGCCAAGGTCGAGAAGTGGCTTA
TGCCAAGTGAAGGAGGATATTTGGAGGCTGAAGGATTGGAGAAGACATGGAGAATCAAACAGGAAACGATTTCCCATGAAGTAGATATCTTGAGTAGAAGGAATCAACAT
GATATTATTTTACCAGCTCTTGGACCATATTCTCTTGACTATACTTCAAATGGTAGATATATGACCATTGCTGGACGTAAGGGCCACCTGGCAATTGTAGACATGAAGAA
CCTTAACTTAATTAAAGAGTTTCAGGTTAAGGAAACTGTGCGTGATGTGGTCTTCTTACACAATGAGTTGTTCTTTGCTGCTGCACAAAAAAAGTATCCATATATTTATA
ACCGGGATGGCACAGAGCTTCACTGTCTTAAGGAGCATGGATCAGTCTTGAGGCTTCAATTTCTGAAAAATCACTTCCTTTTGGCGTCCATAAACAAGTTTGGACAGCTT
CATTATCAAGATGTAACAACAGGTAGCATGGTCAGTGTCTTCCGCACTGGTTTAGGCCGCACTGATGTGATGCAGGTAAATCCATTCAACGGGGTTATAGCAACTGGTCA
TTCAGGTGGTTCAGTTGTTATGTGGAAACCTACAAGCTCTGCTCCTCTTGTAAAGATGCTTTGTCATCAAGGGCCTGTTTCAGCATTGGCGTTCCACCCGAATGGCCATC
TCATGGCTACCTCTGGTTCCGAGAGGAAAATTAAGCTCTGGGACTTGAGGAAATTTGAGGTGCTTCAGACTCTGCCAGGCCATGCTAAGACCCTAGATTTCAGCCAGAAA
GGGCTACTTGCATATGGAACTGGATCATTTGTACAGGTTCTGGGCGATTTTTCTGGAGTTCAGAATTACAGTAGGTATATGGCTCACTCAATGGTGAAAGGTTACCAAAT
AAGAAAGATACTGTTTCGACCTTACGAAGATGTTTTAGGCATAGGTCACTCAATGGGTTGGTCTAGCATCCTTATCCCAGGATCTGGTGAACCCAACTTTGATACTTGGG
TAGCAAATCCATTTGAGACTTCAAAACAGCGGAGAGAAAAGGAGGTCCAGTCTCTTCTCGATAAGCTTCCTCCCGAGACAATTTCGCTGAACCCTTCAAAAATTGGTGCG
GTTATGTCAGTAAAGAAGAAAGAGAAGAAGACGCAGAAGGAAAGAGAAGCTGAGCAGGAGTCTGCAGTTGAAGCTGCAAAGGGCATTACCCAGAAGAAGAAAACCAAGGG
AAGAAATAGGCCAAGCAAGAGGGAAAAGAAGAAGCGTGAAACTGTTGAGAAGGTCAAAAGGCCTTTCCTTCATGAACAAATAAACGAAGAAGAAGTGTCTAGAAAGAGAC
CGAGGTTGAGCGAGGAAGTCGAACTTCCGAAGTCTTTGCAGCGGTTTGCTCGTATGTACCAACTGGTGTTTGCTTGCTCGTCTTCATACACTCAACACACACATGCACAT
TGGTTTAGTGGTGCATTAATAGGAGTGTCTGCCACAACCATTCAACGTTACGCCGTCGCAGTCTCCGCCGCTGTCATTATTATAACGGATGGGCTAAATGCTGCTAACAT
ACGCCCTCTCTGGCTGCTTGCGGTCGTAATTGCTACCATGAGCCTCTTGGCGTTCACCCACTCTCCACCCCGCCTCCGGCTCCGTAACGTGCGCCACCGCCGCGACCTGT
TTACGAGTGCCGCCGTCGTTTCGCTTTCCGGCGAACTGAGGCAACCTCGGATTCTTACTTTACGAATTGTCCATCGTGCCCCGCAGGCTCCCTTCCGGACTTCTTCCTCC
TCTAACCGCTTACCATCTCCGCGAGCAGCTTCTGTAGATCTCTTGCATGACGCCGGTGCGACGGCTGTTGTTCTCGCCGGCGCCCACTCGCTCGTTCTCGGATTCGATAA
TCTTGCCCGGCGGAATCTGATCCAACAGAATTTGAGCAGAAAACTAGTCCACATATTGTCTGGATTGCTTTTCATGATTTCCTGGCCTATATTCAGCACCTCAACAGAGG
CTCGCTACTTTGCTTCTCTAGTTCCCACACTGAATTGCTTGAGGCTTGTTGTCCATGGCCTCTCATTAACCAAAGATGAAGGACTTTTAAACTCTCTTACTCGGGAAGGA
AAACCAGAGGAATTGCTGAGAGGCCCTCTATATTATGTCCTAATGCTGATTTTATCAGCTGTTGTCTTCTGGCGTGAATCTCCAGTTGGTTTGATCTCATTAGCAATGAT
GTGTGGCGGGGATGGTATTGCTGATATAATGGGAAGAAGATTTGGGTCCAAAAGGCTTCCTTATAATCAGGGGAAGAGTTGGATTGGAAGCACATCCATGTTCATCTTTG
GATTTGGGGTCTCCATTGGGATGTTGTACTATTACTCAGCTCTTGGATATTTGGATTTGGATTGGGGAAGGACAATTCAAAAAGTTGCTTTAGTTTCTCTGGTGGCAACT
GTGGTGGAGTCTCTTCCAATTGCAAATGTTGTAGATGACAATATATCAGTACCACTGAATCAAACTCAATTGCAAGTTCTATTCACAACCGGCCGTCCGGTGACTGTGCT
CCGACCTCCGGCATTGCGGTCTCGCAATTTTTGTATTTCCAAGTCCAATACGCGATTTCTGCATCTGAATCTGTTCTCTGAAGGAGGATCGCTATGTCGCTTCAACGCCT
CCGGGATCCGGTTGTCGCCCGCGTTCGTTCTCTTCTCCGAGCTCGCCTCCGTGCTTTCTCTTCGTCTTCCCCTACCTCCTCCAGATACATTTCTCGATTCTTCACATGCA
GTCCTTGAAATGCCAGCATTATCACCTACGATGAATCAAGGTAATATTGCCAAATGGAGGAAAAAAGAAGGGGATAAGATAGCTGTTGGCGATGTCCTGTGTGAAATTGA
AACCGACAAAGCTACTCTTGAGTTTGAAAGTCTCGAGGAGGGGTTTTTGGCTAAAATATTGGTTCCTGAAGGTTCAAAAGATGTCCCAGTTGGACAACCCATTGCAATTA
CGGTTGAGGATCCAGATGATATCCATAGGGTCCTTGCCAATGGTGTATCAGGTGCATCTGATATTAAAGATGATGCGATAGGGGATCAAAAGGACAAAAATGAGGACAGG
GCACAAGCAAGTTCGGTAGAGATAAACACGTCAAAACTTCCTCCTCATTTTATACTCGAAATGCCAGCTTTATCTCCAACTATGAACCAAGGTAATATTGCTAATTGGAG
AAAGAAAGAGGGAGACAAGATTGAGGTGGGTGATGTAATATGCGAGATAGAGACAGACAAGGCTACCCTTGAATTTGAAAGTCTTGAAGAAGGGTACCTGGCTAAGATAC
TAGCACCTGAAGGCTCGAAGGATGTGGCAGTTGGAAAACCGATTGCAATTACAGTTGAAGATCCTGCAGATATTGAATCTGTAAAAAGTGCTGTTAGTAGTAGTTTGGGG
ACTAAAGAGGACAAACCTGCCCAAAGTACAATTAGAAATGATGCTGGAACATCAAAAGGTAGTGTTGCAAGAATAAGTCCTGCTGCAAAGTTGCTAATTGCGGAACATGG
TTTGGATGCATCATCATTGAAGGCATCAGGTTCTCATGGCACACTGTTGAAAGGGGATGTTCTGGCTGCCATTAAATCAGGGAAAGGCTTGTCAGAAGTTTCTTTGTCCA
AAGAGAAAAGATCACCTGAGGTCCATGCACAGGCTTCTTCCACGGCTTCATCAGAATCAAAATCTCCAATAAAGCAATCTGATTCTTTTGAAGATTTACCAAATAGTCAA
ATTCGCAAGGTGATCGCTAAAAGATTATTGGAATCAAAACAAAATACACCACATCTATATTTGTCCACAGATGTCATATTGGATCCTCTTCTTTCACTTAGAAAAGATCT
AAAAGAGAAGCATGATGTTAAAGTTTCCGTGAACGATATTGTCATCAAAGCCGTAGCAGTTGCTCTAAGAAACGTGCCTGGAGCCAATGCTTATTGGGACAATGAGAAAG
GGGAAGTTGTTTTTTGCGACTCTATTGACATATCAATTGCAGTTGCTACTGAGAAGGGCTTAATGACTCCAATTGTGAGGAATGCTGATCAGAAGACTGTATCAATGATT
TCTTCGGAGGTCAAGGAGTTAGCTGAGAAGGCCCGTGCAGGGAAGCTCAAACCAGATGAATTCCAAGGAGGCACTTTCAGCATTTCAAATCTAGGAATGTTCCCAGTGGA
TCACTTTTGTGCTATAATAAACCCTCCGCAGGCTGGAATTCTTGCTGTAGGTAGAGGTAACAAAGTTGTTGAGCCAGTTATTGGAACTGATGGAATTGAGAGACCTGTGG
TCCTCAACAAAATGAATTTGACATTATCTGCTGATCATCGTGTTTTTGATGGAAAAGTTGGAGACTCGGCGATACCAACTTCACCTGAACCATTTCAAACGCTTCAACAC
GAAGAAGCTTGTAACCCTACACCATCGTCATCAACAAACAAGTATATACACAAAGAGAAGAGGAAGAAGACGGGTTGTTACATCGGAAAGCCTGGCGGGGCGGTCGGGGA
CGGCGAGATCGACGGAGGGATCGTAAGGGCGAGAGATGGCGCCGTGAAGCCAACGAGAAGCGACGGCGTCGCCGAGTTCGGCCTTCTCGAAGGGGTCGGAAGTATTGAGG
ACTCGCAGCGCCGCTTCCACCAAGGTCTCGTCTTCCATCTTCCAGCTTATGCTCCCGCCTCTTCTTCCACGTTAGCTCCTGGGCCAAGGGGTGGGAATGAAATTGGGCCG
GGCCGCACTTGTAAAGGCCGTTCAAGTTGTTGGATCCTATTTCAGTCGGCCCTTGGAGGCATCTGA
mRNA sequence
ATGGAGGAGGAACTAGGAAGTGCGGTTGCTGAAAGGATTCTGCCTCCGACTGAACAGGAGGTCTCTAATGAGATAGATGTGAGAATAAAAAAGTATCTAAGAGGAGAAGG
TGCTAATCTGGAGGTTCTGAAAGATAAAAAGTTGAAGGGTCAACTTGCTGTTAGAGAAGATTTATATGGGAAATCTGCTAAAGCTGCTGCCAAGGTCGAGAAGTGGCTTA
TGCCAAGTGAAGGAGGATATTTGGAGGCTGAAGGATTGGAGAAGACATGGAGAATCAAACAGGAAACGATTTCCCATGAAGTAGATATCTTGAGTAGAAGGAATCAACAT
GATATTATTTTACCAGCTCTTGGACCATATTCTCTTGACTATACTTCAAATGGTAGATATATGACCATTGCTGGACGTAAGGGCCACCTGGCAATTGTAGACATGAAGAA
CCTTAACTTAATTAAAGAGTTTCAGGTTAAGGAAACTGTGCGTGATGTGGTCTTCTTACACAATGAGTTGTTCTTTGCTGCTGCACAAAAAAAGTATCCATATATTTATA
ACCGGGATGGCACAGAGCTTCACTGTCTTAAGGAGCATGGATCAGTCTTGAGGCTTCAATTTCTGAAAAATCACTTCCTTTTGGCGTCCATAAACAAGTTTGGACAGCTT
CATTATCAAGATGTAACAACAGGTAGCATGGTCAGTGTCTTCCGCACTGGTTTAGGCCGCACTGATGTGATGCAGGTAAATCCATTCAACGGGGTTATAGCAACTGGTCA
TTCAGGTGGTTCAGTTGTTATGTGGAAACCTACAAGCTCTGCTCCTCTTGTAAAGATGCTTTGTCATCAAGGGCCTGTTTCAGCATTGGCGTTCCACCCGAATGGCCATC
TCATGGCTACCTCTGGTTCCGAGAGGAAAATTAAGCTCTGGGACTTGAGGAAATTTGAGGTGCTTCAGACTCTGCCAGGCCATGCTAAGACCCTAGATTTCAGCCAGAAA
GGGCTACTTGCATATGGAACTGGATCATTTGTACAGGTTCTGGGCGATTTTTCTGGAGTTCAGAATTACAGTAGGTATATGGCTCACTCAATGGTGAAAGGTTACCAAAT
AAGAAAGATACTGTTTCGACCTTACGAAGATGTTTTAGGCATAGGTCACTCAATGGGTTGGTCTAGCATCCTTATCCCAGGATCTGGTGAACCCAACTTTGATACTTGGG
TAGCAAATCCATTTGAGACTTCAAAACAGCGGAGAGAAAAGGAGGTCCAGTCTCTTCTCGATAAGCTTCCTCCCGAGACAATTTCGCTGAACCCTTCAAAAATTGGTGCG
GTTATGTCAGTAAAGAAGAAAGAGAAGAAGACGCAGAAGGAAAGAGAAGCTGAGCAGGAGTCTGCAGTTGAAGCTGCAAAGGGCATTACCCAGAAGAAGAAAACCAAGGG
AAGAAATAGGCCAAGCAAGAGGGAAAAGAAGAAGCGTGAAACTGTTGAGAAGGTCAAAAGGCCTTTCCTTCATGAACAAATAAACGAAGAAGAAGTGTCTAGAAAGAGAC
CGAGGTTGAGCGAGGAAGTCGAACTTCCGAAGTCTTTGCAGCGGTTTGCTCGTATGTACCAACTGGTGTTTGCTTGCTCGTCTTCATACACTCAACACACACATGCACAT
TGGTTTAGTGGTGCATTAATAGGAGTGTCTGCCACAACCATTCAACGTTACGCCGTCGCAGTCTCCGCCGCTGTCATTATTATAACGGATGGGCTAAATGCTGCTAACAT
ACGCCCTCTCTGGCTGCTTGCGGTCGTAATTGCTACCATGAGCCTCTTGGCGTTCACCCACTCTCCACCCCGCCTCCGGCTCCGTAACGTGCGCCACCGCCGCGACCTGT
TTACGAGTGCCGCCGTCGTTTCGCTTTCCGGCGAACTGAGGCAACCTCGGATTCTTACTTTACGAATTGTCCATCGTGCCCCGCAGGCTCCCTTCCGGACTTCTTCCTCC
TCTAACCGCTTACCATCTCCGCGAGCAGCTTCTGTAGATCTCTTGCATGACGCCGGTGCGACGGCTGTTGTTCTCGCCGGCGCCCACTCGCTCGTTCTCGGATTCGATAA
TCTTGCCCGGCGGAATCTGATCCAACAGAATTTGAGCAGAAAACTAGTCCACATATTGTCTGGATTGCTTTTCATGATTTCCTGGCCTATATTCAGCACCTCAACAGAGG
CTCGCTACTTTGCTTCTCTAGTTCCCACACTGAATTGCTTGAGGCTTGTTGTCCATGGCCTCTCATTAACCAAAGATGAAGGACTTTTAAACTCTCTTACTCGGGAAGGA
AAACCAGAGGAATTGCTGAGAGGCCCTCTATATTATGTCCTAATGCTGATTTTATCAGCTGTTGTCTTCTGGCGTGAATCTCCAGTTGGTTTGATCTCATTAGCAATGAT
GTGTGGCGGGGATGGTATTGCTGATATAATGGGAAGAAGATTTGGGTCCAAAAGGCTTCCTTATAATCAGGGGAAGAGTTGGATTGGAAGCACATCCATGTTCATCTTTG
GATTTGGGGTCTCCATTGGGATGTTGTACTATTACTCAGCTCTTGGATATTTGGATTTGGATTGGGGAAGGACAATTCAAAAAGTTGCTTTAGTTTCTCTGGTGGCAACT
GTGGTGGAGTCTCTTCCAATTGCAAATGTTGTAGATGACAATATATCAGTACCACTGAATCAAACTCAATTGCAAGTTCTATTCACAACCGGCCGTCCGGTGACTGTGCT
CCGACCTCCGGCATTGCGGTCTCGCAATTTTTGTATTTCCAAGTCCAATACGCGATTTCTGCATCTGAATCTGTTCTCTGAAGGAGGATCGCTATGTCGCTTCAACGCCT
CCGGGATCCGGTTGTCGCCCGCGTTCGTTCTCTTCTCCGAGCTCGCCTCCGTGCTTTCTCTTCGTCTTCCCCTACCTCCTCCAGATACATTTCTCGATTCTTCACATGCA
GTCCTTGAAATGCCAGCATTATCACCTACGATGAATCAAGGTAATATTGCCAAATGGAGGAAAAAAGAAGGGGATAAGATAGCTGTTGGCGATGTCCTGTGTGAAATTGA
AACCGACAAAGCTACTCTTGAGTTTGAAAGTCTCGAGGAGGGGTTTTTGGCTAAAATATTGGTTCCTGAAGGTTCAAAAGATGTCCCAGTTGGACAACCCATTGCAATTA
CGGTTGAGGATCCAGATGATATCCATAGGGTCCTTGCCAATGGTGTATCAGGTGCATCTGATATTAAAGATGATGCGATAGGGGATCAAAAGGACAAAAATGAGGACAGG
GCACAAGCAAGTTCGGTAGAGATAAACACGTCAAAACTTCCTCCTCATTTTATACTCGAAATGCCAGCTTTATCTCCAACTATGAACCAAGGTAATATTGCTAATTGGAG
AAAGAAAGAGGGAGACAAGATTGAGGTGGGTGATGTAATATGCGAGATAGAGACAGACAAGGCTACCCTTGAATTTGAAAGTCTTGAAGAAGGGTACCTGGCTAAGATAC
TAGCACCTGAAGGCTCGAAGGATGTGGCAGTTGGAAAACCGATTGCAATTACAGTTGAAGATCCTGCAGATATTGAATCTGTAAAAAGTGCTGTTAGTAGTAGTTTGGGG
ACTAAAGAGGACAAACCTGCCCAAAGTACAATTAGAAATGATGCTGGAACATCAAAAGGTAGTGTTGCAAGAATAAGTCCTGCTGCAAAGTTGCTAATTGCGGAACATGG
TTTGGATGCATCATCATTGAAGGCATCAGGTTCTCATGGCACACTGTTGAAAGGGGATGTTCTGGCTGCCATTAAATCAGGGAAAGGCTTGTCAGAAGTTTCTTTGTCCA
AAGAGAAAAGATCACCTGAGGTCCATGCACAGGCTTCTTCCACGGCTTCATCAGAATCAAAATCTCCAATAAAGCAATCTGATTCTTTTGAAGATTTACCAAATAGTCAA
ATTCGCAAGGTGATCGCTAAAAGATTATTGGAATCAAAACAAAATACACCACATCTATATTTGTCCACAGATGTCATATTGGATCCTCTTCTTTCACTTAGAAAAGATCT
AAAAGAGAAGCATGATGTTAAAGTTTCCGTGAACGATATTGTCATCAAAGCCGTAGCAGTTGCTCTAAGAAACGTGCCTGGAGCCAATGCTTATTGGGACAATGAGAAAG
GGGAAGTTGTTTTTTGCGACTCTATTGACATATCAATTGCAGTTGCTACTGAGAAGGGCTTAATGACTCCAATTGTGAGGAATGCTGATCAGAAGACTGTATCAATGATT
TCTTCGGAGGTCAAGGAGTTAGCTGAGAAGGCCCGTGCAGGGAAGCTCAAACCAGATGAATTCCAAGGAGGCACTTTCAGCATTTCAAATCTAGGAATGTTCCCAGTGGA
TCACTTTTGTGCTATAATAAACCCTCCGCAGGCTGGAATTCTTGCTGTAGGTAGAGGTAACAAAGTTGTTGAGCCAGTTATTGGAACTGATGGAATTGAGAGACCTGTGG
TCCTCAACAAAATGAATTTGACATTATCTGCTGATCATCGTGTTTTTGATGGAAAAGTTGGAGACTCGGCGATACCAACTTCACCTGAACCATTTCAAACGCTTCAACAC
GAAGAAGCTTGTAACCCTACACCATCGTCATCAACAAACAAGTATATACACAAAGAGAAGAGGAAGAAGACGGGTTGTTACATCGGAAAGCCTGGCGGGGCGGTCGGGGA
CGGCGAGATCGACGGAGGGATCGTAAGGGCGAGAGATGGCGCCGTGAAGCCAACGAGAAGCGACGGCGTCGCCGAGTTCGGCCTTCTCGAAGGGGTCGGAAGTATTGAGG
ACTCGCAGCGCCGCTTCCACCAAGGTCTCGTCTTCCATCTTCCAGCTTATGCTCCCGCCTCTTCTTCCACGTTAGCTCCTGGGCCAAGGGGTGGGAATGAAATTGGGCCG
GGCCGCACTTGTAAAGGCCGTTCAAGTTGTTGGATCCTATTTCAGTCGGCCCTTGGAGGCATCTGA
Protein sequence
MEEELGSAVAERILPPTEQEVSNEIDVRIKKYLRGEGANLEVLKDKKLKGQLAVREDLYGKSAKAAAKVEKWLMPSEGGYLEAEGLEKTWRIKQETISHEVDILSRRNQH
DIILPALGPYSLDYTSNGRYMTIAGRKGHLAIVDMKNLNLIKEFQVKETVRDVVFLHNELFFAAAQKKYPYIYNRDGTELHCLKEHGSVLRLQFLKNHFLLASINKFGQL
HYQDVTTGSMVSVFRTGLGRTDVMQVNPFNGVIATGHSGGSVVMWKPTSSAPLVKMLCHQGPVSALAFHPNGHLMATSGSERKIKLWDLRKFEVLQTLPGHAKTLDFSQK
GLLAYGTGSFVQVLGDFSGVQNYSRYMAHSMVKGYQIRKILFRPYEDVLGIGHSMGWSSILIPGSGEPNFDTWVANPFETSKQRREKEVQSLLDKLPPETISLNPSKIGA
VMSVKKKEKKTQKEREAEQESAVEAAKGITQKKKTKGRNRPSKREKKKRETVEKVKRPFLHEQINEEEVSRKRPRLSEEVELPKSLQRFARMYQLVFACSSSYTQHTHAH
WFSGALIGVSATTIQRYAVAVSAAVIIITDGLNAANIRPLWLLAVVIATMSLLAFTHSPPRLRLRNVRHRRDLFTSAAVVSLSGELRQPRILTLRIVHRAPQAPFRTSSS
SNRLPSPRAASVDLLHDAGATAVVLAGAHSLVLGFDNLARRNLIQQNLSRKLVHILSGLLFMISWPIFSTSTEARYFASLVPTLNCLRLVVHGLSLTKDEGLLNSLTREG
KPEELLRGPLYYVLMLILSAVVFWRESPVGLISLAMMCGGDGIADIMGRRFGSKRLPYNQGKSWIGSTSMFIFGFGVSIGMLYYYSALGYLDLDWGRTIQKVALVSLVAT
VVESLPIANVVDDNISVPLNQTQLQVLFTTGRPVTVLRPPALRSRNFCISKSNTRFLHLNLFSEGGSLCRFNASGIRLSPAFVLFSELASVLSLRLPLPPPDTFLDSSHA
VLEMPALSPTMNQGNIAKWRKKEGDKIAVGDVLCEIETDKATLEFESLEEGFLAKILVPEGSKDVPVGQPIAITVEDPDDIHRVLANGVSGASDIKDDAIGDQKDKNEDR
AQASSVEINTSKLPPHFILEMPALSPTMNQGNIANWRKKEGDKIEVGDVICEIETDKATLEFESLEEGYLAKILAPEGSKDVAVGKPIAITVEDPADIESVKSAVSSSLG
TKEDKPAQSTIRNDAGTSKGSVARISPAAKLLIAEHGLDASSLKASGSHGTLLKGDVLAAIKSGKGLSEVSLSKEKRSPEVHAQASSTASSESKSPIKQSDSFEDLPNSQ
IRKVIAKRLLESKQNTPHLYLSTDVILDPLLSLRKDLKEKHDVKVSVNDIVIKAVAVALRNVPGANAYWDNEKGEVVFCDSIDISIAVATEKGLMTPIVRNADQKTVSMI
SSEVKELAEKARAGKLKPDEFQGGTFSISNLGMFPVDHFCAIINPPQAGILAVGRGNKVVEPVIGTDGIERPVVLNKMNLTLSADHRVFDGKVGDSAIPTSPEPFQTLQH
EEACNPTPSSSTNKYIHKEKRKKTGCYIGKPGGAVGDGEIDGGIVRARDGAVKPTRSDGVAEFGLLEGVGSIEDSQRRFHQGLVFHLPAYAPASSSTLAPGPRGGNEIGP
GRTCKGRSSCWILFQSALGGI