; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr021082 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr021082
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionElongator complex protein 5
Genome locationtig00153640:361213..373860
RNA-Seq ExpressionSgr021082
SyntenySgr021082
Gene Ontology termsGO:0002098 - tRNA wobble uridine modification (biological process)
GO:0005634 - nucleus (cellular component)
GO:0005829 - cytosol (cellular component)
GO:0033588 - Elongator holoenzyme complex (cellular component)
GO:0000049 - tRNA binding (molecular function)
InterPro domainsIPR019519 - Elongator complex protein 5
IPR022800 - Spt4/RpoE2 zinc finger
IPR029040 - RNA polymerase subunit RPABC4/transcription elongation factor Spt4
IPR038510 - Spt4 superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6607058.1 Elongator complex protein 5, partial [Cucurbita argyrosperma subsp. sororia]3.8e-14874.03Show/hide
Query:  SVTSALRDGVLEGELAPALTIKDSISSPFGFHAFAHVLAQLSINILAGKSQSRGLVLLAFSRSPAYYVDLLKKRGLTVGSSDKWIQILDCYTDPLGWKER
        S+  ALRDG LEGELAPALT+KDSI+SPFGFHAFAHVL QLS NILAGKSQSRGLVLLAFSRSPAYY++LLKK G+ VGSS+KWIQILDCYTDPLGWKER
Subjt:  SVTSALRDGVLEGELAPALTIKDSISSPFGFHAFAHVLAQLSINILAGKSQSRGLVLLAFSRSPAYYVDLLKKRGLTVGSSDKWIQILDCYTDPLGWKER

Query:  YMEGEKLSNVFQEVSTLSNLCTNVRDMDKLFSSIIALGKGFVGQGAVRFCVAIDSVCN--RYVKTFIYISVAGFLSNLRSN-------------------
        +  GEK SNV QEVS LS+LCTNVRDMD LFSSII LGKGFVG+G VRFCVAIDSV +  R+  T    +VAG LSNLRSN                   
Subjt:  YMEGEKLSNVFQEVSTLSNLCTNVRDMDKLFSSIIALGKGFVGQGAVRFCVAIDSVCN--RYVKTFIYISVAGFLSNLRSN-------------------

Query:  ----------VATVEPLTPSPHVRRGNLDNSYLEHNSTKGRFHVRFKRRNGRVRVIVNSLLYAELWCEGFNVEQSGIKFTSILSEDAVINQGLLPKVQFN
                  VATVEPLTPSP+VR  NL+  YLEH+STKGRFHVR KRRNGRVRVI          CE F VEQSGIKFTSI SEDAV+NQGL+PKVQFN
Subjt:  ----------VATVEPLTPSPHVRRGNLDNSYLEHNSTKGRFHVRFKRRNGRVRVIVNSLLYAELWCEGFNVEQSGIKFTSILSEDAVINQGLLPKVQFN

Query:  LQLSEKERIDRARVVLPFEHQGNGKPIQIYDGRRSLTESKDYEKPLLTNEKGKDEGSGKGEIVYFRDSDDEIPDSDEDPDDDLDI
        LQLSEKE+IDRARVVLPFEHQGNGKPIQIYDGRRSLTESKD +KPL T+EK KDEG+GKGEI+YFRDS+DE+PDSDEDPDDDLDI
Subjt:  LQLSEKERIDRARVVLPFEHQGNGKPIQIYDGRRSLTESKDYEKPLLTNEKGKDEGSGKGEIVYFRDSDDEIPDSDEDPDDDLDI

KAG7036759.1 Elongator complex protein 5 [Cucurbita argyrosperma subsp. argyrosperma]1.4e-14773.77Show/hide
Query:  SVTSALRDGVLEGELAPALTIKDSISSPFGFHAFAHVLAQLSINILAGKSQSRGLVLLAFSRSPAYYVDLLKKRGLTVGSSDKWIQILDCYTDPLGWKER
        S+  ALRDG LEGELAPALT+KDSI+SPFGFHAFAHV+ QLS NILAGKSQSRGLVLLAFSRSPAYY+ LLKK G+ VGSS+KWIQILDCYTDPLGWKER
Subjt:  SVTSALRDGVLEGELAPALTIKDSISSPFGFHAFAHVLAQLSINILAGKSQSRGLVLLAFSRSPAYYVDLLKKRGLTVGSSDKWIQILDCYTDPLGWKER

Query:  YMEGEKLSNVFQEVSTLSNLCTNVRDMDKLFSSIIALGKGFVGQGAVRFCVAIDSVCN--RYVKTFIYISVAGFLSNLRSN-------------------
        +  GEK SNV QEVS LS+LCTNVRDMD LFSSII LGKGFVG+G VRFCVAIDSV +  R+  T    +VAG LSNLRSN                   
Subjt:  YMEGEKLSNVFQEVSTLSNLCTNVRDMDKLFSSIIALGKGFVGQGAVRFCVAIDSVCN--RYVKTFIYISVAGFLSNLRSN-------------------

Query:  ----------VATVEPLTPSPHVRRGNLDNSYLEHNSTKGRFHVRFKRRNGRVRVIVNSLLYAELWCEGFNVEQSGIKFTSILSEDAVINQGLLPKVQFN
                  VATVEPLTPSP+VR  NL+  YLEH+STKGRFHVR KRRNGRVRVI          CE F VEQSGIKFTSI SEDAV+NQGL+PKVQFN
Subjt:  ----------VATVEPLTPSPHVRRGNLDNSYLEHNSTKGRFHVRFKRRNGRVRVIVNSLLYAELWCEGFNVEQSGIKFTSILSEDAVINQGLLPKVQFN

Query:  LQLSEKERIDRARVVLPFEHQGNGKPIQIYDGRRSLTESKDYEKPLLTNEKGKDEGSGKGEIVYFRDSDDEIPDSDEDPDDDLDI
        LQLSEKE+IDRARVVLPFEHQGNGKPIQIYDGRRSLTESKD +KPL T+EK KDEG+GKGEI+YFRDS+DE+PDSDEDPDDDLDI
Subjt:  LQLSEKERIDRARVVLPFEHQGNGKPIQIYDGRRSLTESKDYEKPLLTNEKGKDEGSGKGEIVYFRDSDDEIPDSDEDPDDDLDI

XP_022143441.1 elongator complex protein 5 isoform X1 [Momordica charantia]1.1e-15075.97Show/hide
Query:  SVTSALRDGVLEGELAPALTIKDSISSPFGFHAFAHVLAQLSINILAGKSQSRGLVLLAFSRSPAYYVDLLKKRGLTVGSSDKWIQILDCYTDPLGWKER
        S+  ALRDG LEGELAPALTIKDS+ SPFGFHAFAH+LAQLSINILAGKSQSRGLVLLAFSRSPAYYVDLLKKRG+ VGSSDKWIQILDCYTDPLGWKER
Subjt:  SVTSALRDGVLEGELAPALTIKDSISSPFGFHAFAHVLAQLSINILAGKSQSRGLVLLAFSRSPAYYVDLLKKRGLTVGSSDKWIQILDCYTDPLGWKER

Query:  YMEGEKLSNVFQEVSTLSNLCTNVRDMDKLFSSIIALGK-GFVGQGAVRFCVAIDSVCN--RYVKTFIYISVAGFLSNLRSN------------------
         MEGEK+SNV QEVS+LSNLCTNVRDMDKLFSSI+ALGK GFVG+G VRFCVAIDSV +  R+  T    +VAG LS+LRSN                  
Subjt:  YMEGEKLSNVFQEVSTLSNLCTNVRDMDKLFSSIIALGK-GFVGQGAVRFCVAIDSVCN--RYVKTFIYISVAGFLSNLRSN------------------

Query:  -----------VATVEPLTPSPHVRRGNLDNSYLEHNSTKGRFHVRFKRRNGRVRVIVNSLLYAELWCEGFNVEQSG-IKFTSILSEDAVINQGLLPKVQ
                   VA+VEP TPSP+V RGNLDNSY EHNST GRFHVRFKRRNGRVR+I          CE F VE SG IKFTSILSEDA+INQGL+PKV 
Subjt:  -----------VATVEPLTPSPHVRRGNLDNSYLEHNSTKGRFHVRFKRRNGRVRVIVNSLLYAELWCEGFNVEQSG-IKFTSILSEDAVINQGLLPKVQ

Query:  FNLQLSEKERIDRARVVLPFEHQGNGKPIQIYDGRRSLTESKDYEKPLLTNEKGKDEGSGKGEIVYFRDSDDEIPDSDEDPDDDLDI
        FNLQLSEKE +DRA+VVLPFEHQGNGKPIQIYDGRRSLTESK+ +KPLLT+EKGKDEGSGKGEIVYFRDSDDE PDSDEDPD+DLDI
Subjt:  FNLQLSEKERIDRARVVLPFEHQGNGKPIQIYDGRRSLTESKDYEKPLLTNEKGKDEGSGKGEIVYFRDSDDEIPDSDEDPDDDLDI

XP_022143442.1 elongator complex protein 5 isoform X2 [Momordica charantia]4.3e-15276.17Show/hide
Query:  SVTSALRDGVLEGELAPALTIKDSISSPFGFHAFAHVLAQLSINILAGKSQSRGLVLLAFSRSPAYYVDLLKKRGLTVGSSDKWIQILDCYTDPLGWKER
        S+  ALRDG LEGELAPALTIKDS+ SPFGFHAFAH+LAQLSINILAGKSQSRGLVLLAFSRSPAYYVDLLKKRG+ VGSSDKWIQILDCYTDPLGWKER
Subjt:  SVTSALRDGVLEGELAPALTIKDSISSPFGFHAFAHVLAQLSINILAGKSQSRGLVLLAFSRSPAYYVDLLKKRGLTVGSSDKWIQILDCYTDPLGWKER

Query:  YMEGEKLSNVFQEVSTLSNLCTNVRDMDKLFSSIIALGKGFVGQGAVRFCVAIDSVCN--RYVKTFIYISVAGFLSNLRSN-------------------
         MEGEK+SNV QEVS+LSNLCTNVRDMDKLFSSI+ALGKGFVG+G VRFCVAIDSV +  R+  T    +VAG LS+LRSN                   
Subjt:  YMEGEKLSNVFQEVSTLSNLCTNVRDMDKLFSSIIALGKGFVGQGAVRFCVAIDSVCN--RYVKTFIYISVAGFLSNLRSN-------------------

Query:  ----------VATVEPLTPSPHVRRGNLDNSYLEHNSTKGRFHVRFKRRNGRVRVIVNSLLYAELWCEGFNVEQSG-IKFTSILSEDAVINQGLLPKVQF
                  VA+VEP TPSP+V RGNLDNSY EHNST GRFHVRFKRRNGRVR+I          CE F VE SG IKFTSILSEDA+INQGL+PKV F
Subjt:  ----------VATVEPLTPSPHVRRGNLDNSYLEHNSTKGRFHVRFKRRNGRVRVIVNSLLYAELWCEGFNVEQSG-IKFTSILSEDAVINQGLLPKVQF

Query:  NLQLSEKERIDRARVVLPFEHQGNGKPIQIYDGRRSLTESKDYEKPLLTNEKGKDEGSGKGEIVYFRDSDDEIPDSDEDPDDDLDI
        NLQLSEKE +DRA+VVLPFEHQGNGKPIQIYDGRRSLTESK+ +KPLLT+EKGKDEGSGKGEIVYFRDSDDE PDSDEDPD+DLDI
Subjt:  NLQLSEKERIDRARVVLPFEHQGNGKPIQIYDGRRSLTESKDYEKPLLTNEKGKDEGSGKGEIVYFRDSDDEIPDSDEDPDDDLDI

XP_022948561.1 elongator complex protein 5 isoform X2 [Cucurbita moschata]4.2e-14773.51Show/hide
Query:  SVTSALRDGVLEGELAPALTIKDSISSPFGFHAFAHVLAQLSINILAGKSQSRGLVLLAFSRSPAYYVDLLKKRGLTVGSSDKWIQILDCYTDPLGWKER
        S+  ALRDG LEGELAPALT+KDSI+SPFGFHAFAHVL QLS NILAGKSQSRGLVLLAFSRSPAYY++LLKK G+ VGSS+KWIQILDCYTDPLGWKER
Subjt:  SVTSALRDGVLEGELAPALTIKDSISSPFGFHAFAHVLAQLSINILAGKSQSRGLVLLAFSRSPAYYVDLLKKRGLTVGSSDKWIQILDCYTDPLGWKER

Query:  YMEGEKLSNVFQEVSTLSNLCTNVRDMDKLFSSIIALGKGFVGQGAVRFCVAIDSVCN--RYVKTFIYISVAGFLSNLRSN-------------------
        +  GEK+SNV QEVS LS+LCTNVRDMD LFSSII LGKGFVG+G VRFCVA+DSV +  R+  T    +VAG LSNLRSN                   
Subjt:  YMEGEKLSNVFQEVSTLSNLCTNVRDMDKLFSSIIALGKGFVGQGAVRFCVAIDSVCN--RYVKTFIYISVAGFLSNLRSN-------------------

Query:  ----------VATVEPLTPSPHVRRGNLDNSYLEHNSTKGRFHVRFKRRNGRVRVIVNSLLYAELWCEGFNVEQSGIKFTSILSEDAVINQGLLPKVQFN
                  VATVEPLTPSP+VR  +L+  YLEH+STKGRFHVR KRRNGRVRVI          CE F VEQSGIKFTSI SEDAV+NQGL+PKVQFN
Subjt:  ----------VATVEPLTPSPHVRRGNLDNSYLEHNSTKGRFHVRFKRRNGRVRVIVNSLLYAELWCEGFNVEQSGIKFTSILSEDAVINQGLLPKVQFN

Query:  LQLSEKERIDRARVVLPFEHQGNGKPIQIYDGRRSLTESKDYEKPLLTNEKGKDEGSGKGEIVYFRDSDDEIPDSDEDPDDDLDI
        LQLSEKE+IDRARVVLPFEHQGNGKPIQIYDGRRSLTESKD +KPL TNEK KDE  GKGEI+YFRDS+DE+PDSDEDPDDDLDI
Subjt:  LQLSEKERIDRARVVLPFEHQGNGKPIQIYDGRRSLTESKDYEKPLLTNEKGKDEGSGKGEIVYFRDSDDEIPDSDEDPDDDLDI

TrEMBL top hitse value%identityAlignment
A0A0A0L7D4 Elongator complex protein 52.9e-14673.63Show/hide
Query:  SVTSALRDGVLEGELAPALTIKDSISSPFGFHAFAHVLAQLSINILAGKSQSRGLVLLAFSRSPAYYVDLLKKRGLTVGSSDKWIQILDCYTDPLGWKER
        S+  ALRDG L+GELAPALTIKD+I+SPFGFHAF+HVL QLS NILAGKSQSRGLVLL+FSRSPAYYV LLKKRGL VGSS KWIQILDCYTDPLGWK+R
Subjt:  SVTSALRDGVLEGELAPALTIKDSISSPFGFHAFAHVLAQLSINILAGKSQSRGLVLLAFSRSPAYYVDLLKKRGLTVGSSDKWIQILDCYTDPLGWKER

Query:  YMEGEKLSNVFQEVSTLSNLCTNVRDMDKLFSSIIALGKGFVGQGAVRFCVAIDSVCNRYVKTFIYISVAGFLSNLRSN---------------------
        +MEGE +SNV QEVS LS+LCTNV DMDKLFSSIIALGKGFVG+G VRFCVA+DSV N    + I  ++AG LS+LRSN                     
Subjt:  YMEGEKLSNVFQEVSTLSNLCTNVRDMDKLFSSIIALGKGFVGQGAVRFCVAIDSVCNRYVKTFIYISVAGFLSNLRSN---------------------

Query:  --------VATVEPLTPSPHVRRGNLDNSYLEHNSTKGRFHVRFKRRNGRVRVIVNSLLYAELWCEGFNVEQSGIKFTSILSEDAVINQGLLPKVQFNLQ
                VAT+E LTPSP+V R N+DNSYLEH STKGRFHVR KRRNGRVRVI           E FNVEQSGIKFTSI SEDAVINQ L+PKVQFNLQ
Subjt:  --------VATVEPLTPSPHVRRGNLDNSYLEHNSTKGRFHVRFKRRNGRVRVIVNSLLYAELWCEGFNVEQSGIKFTSILSEDAVINQGLLPKVQFNLQ

Query:  LSEKERIDRARVVLPFEHQGNGKPIQIYDGRRSLTESKDYEKPLLTNEKGKDEGSGKGEIVYFRDSDDEIPDSDEDPDDDLDI
        LSEKER DRARVVLPFEHQG GKPIQIYDGRRS +ESKD   PL+TNEKG D+GSGKGEIVYFRDSDDE+PDSDEDPDDDLDI
Subjt:  LSEKERIDRARVVLPFEHQGNGKPIQIYDGRRSLTESKDYEKPLLTNEKGKDEGSGKGEIVYFRDSDDEIPDSDEDPDDDLDI

A0A6J1CQ94 Elongator complex protein 55.1e-15175.97Show/hide
Query:  SVTSALRDGVLEGELAPALTIKDSISSPFGFHAFAHVLAQLSINILAGKSQSRGLVLLAFSRSPAYYVDLLKKRGLTVGSSDKWIQILDCYTDPLGWKER
        S+  ALRDG LEGELAPALTIKDS+ SPFGFHAFAH+LAQLSINILAGKSQSRGLVLLAFSRSPAYYVDLLKKRG+ VGSSDKWIQILDCYTDPLGWKER
Subjt:  SVTSALRDGVLEGELAPALTIKDSISSPFGFHAFAHVLAQLSINILAGKSQSRGLVLLAFSRSPAYYVDLLKKRGLTVGSSDKWIQILDCYTDPLGWKER

Query:  YMEGEKLSNVFQEVSTLSNLCTNVRDMDKLFSSIIALGK-GFVGQGAVRFCVAIDSVCN--RYVKTFIYISVAGFLSNLRSN------------------
         MEGEK+SNV QEVS+LSNLCTNVRDMDKLFSSI+ALGK GFVG+G VRFCVAIDSV +  R+  T    +VAG LS+LRSN                  
Subjt:  YMEGEKLSNVFQEVSTLSNLCTNVRDMDKLFSSIIALGK-GFVGQGAVRFCVAIDSVCN--RYVKTFIYISVAGFLSNLRSN------------------

Query:  -----------VATVEPLTPSPHVRRGNLDNSYLEHNSTKGRFHVRFKRRNGRVRVIVNSLLYAELWCEGFNVEQSG-IKFTSILSEDAVINQGLLPKVQ
                   VA+VEP TPSP+V RGNLDNSY EHNST GRFHVRFKRRNGRVR+I          CE F VE SG IKFTSILSEDA+INQGL+PKV 
Subjt:  -----------VATVEPLTPSPHVRRGNLDNSYLEHNSTKGRFHVRFKRRNGRVRVIVNSLLYAELWCEGFNVEQSG-IKFTSILSEDAVINQGLLPKVQ

Query:  FNLQLSEKERIDRARVVLPFEHQGNGKPIQIYDGRRSLTESKDYEKPLLTNEKGKDEGSGKGEIVYFRDSDDEIPDSDEDPDDDLDI
        FNLQLSEKE +DRA+VVLPFEHQGNGKPIQIYDGRRSLTESK+ +KPLLT+EKGKDEGSGKGEIVYFRDSDDE PDSDEDPD+DLDI
Subjt:  FNLQLSEKERIDRARVVLPFEHQGNGKPIQIYDGRRSLTESKDYEKPLLTNEKGKDEGSGKGEIVYFRDSDDEIPDSDEDPDDDLDI

A0A6J1CQU5 Elongator complex protein 52.1e-15276.17Show/hide
Query:  SVTSALRDGVLEGELAPALTIKDSISSPFGFHAFAHVLAQLSINILAGKSQSRGLVLLAFSRSPAYYVDLLKKRGLTVGSSDKWIQILDCYTDPLGWKER
        S+  ALRDG LEGELAPALTIKDS+ SPFGFHAFAH+LAQLSINILAGKSQSRGLVLLAFSRSPAYYVDLLKKRG+ VGSSDKWIQILDCYTDPLGWKER
Subjt:  SVTSALRDGVLEGELAPALTIKDSISSPFGFHAFAHVLAQLSINILAGKSQSRGLVLLAFSRSPAYYVDLLKKRGLTVGSSDKWIQILDCYTDPLGWKER

Query:  YMEGEKLSNVFQEVSTLSNLCTNVRDMDKLFSSIIALGKGFVGQGAVRFCVAIDSVCN--RYVKTFIYISVAGFLSNLRSN-------------------
         MEGEK+SNV QEVS+LSNLCTNVRDMDKLFSSI+ALGKGFVG+G VRFCVAIDSV +  R+  T    +VAG LS+LRSN                   
Subjt:  YMEGEKLSNVFQEVSTLSNLCTNVRDMDKLFSSIIALGKGFVGQGAVRFCVAIDSVCN--RYVKTFIYISVAGFLSNLRSN-------------------

Query:  ----------VATVEPLTPSPHVRRGNLDNSYLEHNSTKGRFHVRFKRRNGRVRVIVNSLLYAELWCEGFNVEQSG-IKFTSILSEDAVINQGLLPKVQF
                  VA+VEP TPSP+V RGNLDNSY EHNST GRFHVRFKRRNGRVR+I          CE F VE SG IKFTSILSEDA+INQGL+PKV F
Subjt:  ----------VATVEPLTPSPHVRRGNLDNSYLEHNSTKGRFHVRFKRRNGRVRVIVNSLLYAELWCEGFNVEQSG-IKFTSILSEDAVINQGLLPKVQF

Query:  NLQLSEKERIDRARVVLPFEHQGNGKPIQIYDGRRSLTESKDYEKPLLTNEKGKDEGSGKGEIVYFRDSDDEIPDSDEDPDDDLDI
        NLQLSEKE +DRA+VVLPFEHQGNGKPIQIYDGRRSLTESK+ +KPLLT+EKGKDEGSGKGEIVYFRDSDDE PDSDEDPD+DLDI
Subjt:  NLQLSEKERIDRARVVLPFEHQGNGKPIQIYDGRRSLTESKDYEKPLLTNEKGKDEGSGKGEIVYFRDSDDEIPDSDEDPDDDLDI

A0A6J1G9J7 Elongator complex protein 52.0e-14773.51Show/hide
Query:  SVTSALRDGVLEGELAPALTIKDSISSPFGFHAFAHVLAQLSINILAGKSQSRGLVLLAFSRSPAYYVDLLKKRGLTVGSSDKWIQILDCYTDPLGWKER
        S+  ALRDG LEGELAPALT+KDSI+SPFGFHAFAHVL QLS NILAGKSQSRGLVLLAFSRSPAYY++LLKK G+ VGSS+KWIQILDCYTDPLGWKER
Subjt:  SVTSALRDGVLEGELAPALTIKDSISSPFGFHAFAHVLAQLSINILAGKSQSRGLVLLAFSRSPAYYVDLLKKRGLTVGSSDKWIQILDCYTDPLGWKER

Query:  YMEGEKLSNVFQEVSTLSNLCTNVRDMDKLFSSIIALGKGFVGQGAVRFCVAIDSVCN--RYVKTFIYISVAGFLSNLRSN-------------------
        +  GEK+SNV QEVS LS+LCTNVRDMD LFSSII LGKGFVG+G VRFCVA+DSV +  R+  T    +VAG LSNLRSN                   
Subjt:  YMEGEKLSNVFQEVSTLSNLCTNVRDMDKLFSSIIALGKGFVGQGAVRFCVAIDSVCN--RYVKTFIYISVAGFLSNLRSN-------------------

Query:  ----------VATVEPLTPSPHVRRGNLDNSYLEHNSTKGRFHVRFKRRNGRVRVIVNSLLYAELWCEGFNVEQSGIKFTSILSEDAVINQGLLPKVQFN
                  VATVEPLTPSP+VR  +L+  YLEH+STKGRFHVR KRRNGRVRVI          CE F VEQSGIKFTSI SEDAV+NQGL+PKVQFN
Subjt:  ----------VATVEPLTPSPHVRRGNLDNSYLEHNSTKGRFHVRFKRRNGRVRVIVNSLLYAELWCEGFNVEQSGIKFTSILSEDAVINQGLLPKVQFN

Query:  LQLSEKERIDRARVVLPFEHQGNGKPIQIYDGRRSLTESKDYEKPLLTNEKGKDEGSGKGEIVYFRDSDDEIPDSDEDPDDDLDI
        LQLSEKE+IDRARVVLPFEHQGNGKPIQIYDGRRSLTESKD +KPL TNEK KDE  GKGEI+YFRDS+DE+PDSDEDPDDDLDI
Subjt:  LQLSEKERIDRARVVLPFEHQGNGKPIQIYDGRRSLTESKDYEKPLLTNEKGKDEGSGKGEIVYFRDSDDEIPDSDEDPDDDLDI

A0A6J1GA78 Elongator complex protein 55.0e-14673.32Show/hide
Query:  SVTSALRDGVLEGELAPALTIKDSISSPFGFHAFAHVLAQLSINILAGKSQSRGLVLLAFSRSPAYYVDLLKKRGLTVGSSDKWIQILDCYTDPLGWKER
        S+  ALRDG LEGELAPALT+KDSI+SPFGFHAFAHVL QLS NILAGKSQSRGLVLLAFSRSPAYY++LLKK G+ VGSS+KWIQILDCYTDPLGWKER
Subjt:  SVTSALRDGVLEGELAPALTIKDSISSPFGFHAFAHVLAQLSINILAGKSQSRGLVLLAFSRSPAYYVDLLKKRGLTVGSSDKWIQILDCYTDPLGWKER

Query:  YMEGEKLSNVFQEVSTLSNLCTNVRDMDKLFSSIIALGK-GFVGQGAVRFCVAIDSVCN--RYVKTFIYISVAGFLSNLRSN------------------
        +  GEK+SNV QEVS LS+LCTNVRDMD LFSSII LGK GFVG+G VRFCVA+DSV +  R+  T    +VAG LSNLRSN                  
Subjt:  YMEGEKLSNVFQEVSTLSNLCTNVRDMDKLFSSIIALGK-GFVGQGAVRFCVAIDSVCN--RYVKTFIYISVAGFLSNLRSN------------------

Query:  -----------VATVEPLTPSPHVRRGNLDNSYLEHNSTKGRFHVRFKRRNGRVRVIVNSLLYAELWCEGFNVEQSGIKFTSILSEDAVINQGLLPKVQF
                   VATVEPLTPSP+VR  +L+  YLEH+STKGRFHVR KRRNGRVRVI          CE F VEQSGIKFTSI SEDAV+NQGL+PKVQF
Subjt:  -----------VATVEPLTPSPHVRRGNLDNSYLEHNSTKGRFHVRFKRRNGRVRVIVNSLLYAELWCEGFNVEQSGIKFTSILSEDAVINQGLLPKVQF

Query:  NLQLSEKERIDRARVVLPFEHQGNGKPIQIYDGRRSLTESKDYEKPLLTNEKGKDEGSGKGEIVYFRDSDDEIPDSDEDPDDDLDI
        NLQLSEKE+IDRARVVLPFEHQGNGKPIQIYDGRRSLTESKD +KPL TNEK KDE  GKGEI+YFRDS+DE+PDSDEDPDDDLDI
Subjt:  NLQLSEKERIDRARVVLPFEHQGNGKPIQIYDGRRSLTESKDYEKPLLTNEKGKDEGSGKGEIVYFRDSDDEIPDSDEDPDDDLDI

SwissProt top hitse value%identityAlignment
F4IQJ2 Elongator complex protein 52.9e-8751.03Show/hide
Query:  SVTSALRDGVLEGELAPALTIKDSISSPFGFHAFAHVLAQLSINILAGKSQSRGLVLLAFSRSPAYYVDLLKKRGLTVGSSDKWIQILDCYTDPLGWKER
        S+   LRDG  EGELAPALTI+++++SPFG     ++L  LS +ILAGKS S+GLVL+ FSRSP++Y+ LLK++G+ V SS KWI+ILDCYTDPLGW   
Subjt:  SVTSALRDGVLEGELAPALTIKDSISSPFGFHAFAHVLAQLSINILAGKSQSRGLVLLAFSRSPAYYVDLLKKRGLTVGSSDKWIQILDCYTDPLGWKER

Query:  YMEGEKLSNVFQEVSTLSNLCTNVRDMDKLFSSIIALGKGFVGQGAVRFCVAIDSVCNRYVKTFIYISVAGFLSNLRSNV--------------------
            ++ S  F E S+L  L   V D+ KLFSSII  G+  VG G  RFCVAIDSV N  ++      V+G L++LRS+                     
Subjt:  YMEGEKLSNVFQEVSTLSNLCTNVRDMDKLFSSIIALGKGFVGQGAVRFCVAIDSVCNRYVKTFIYISVAGFLSNLRSNV--------------------

Query:  ---------ATVEPLTPSPHVRRGNLDNSYLEHNS-TKGRFHVRFKRRNGRVRVIVNSLLYAELWCEGFNVEQSGIKFTSILSEDAVI--NQGLLPKVQF
                 A +EPL PS   +R  L+N +  H    KGRFHVRFK R GRVRV+           E ++V+QSGI F+ I S D VI   + LLPKVQF
Subjt:  ---------ATVEPLTPSPHVRRGNLDNSYLEHNS-TKGRFHVRFKRRNGRVRVIVNSLLYAELWCEGFNVEQSGIKFTSILSEDAVI--NQGLLPKVQF

Query:  NLQLSEKERIDRARVVLPFEHQGNGKPIQIYDGRRSLTESKDYEKPLLTNEKGKD-EGSGK-GEIVYFRDSDDEIPDSDEDPDDDLDI
        NLQLSEKER+++ +VVLPFEHQ +GK  +IYDGRRSL + K    PL + E   D   SGK GEI+YFRDSDDE PDSDEDPDDDLDI
Subjt:  NLQLSEKERIDRARVVLPFEHQGNGKPIQIYDGRRSLTESKDYEKPLLTNEKGKD-EGSGK-GEIVYFRDSDDEIPDSDEDPDDDLDI

Q5RFH5 Transcription elongation factor SPT42.2e-1844.09Show/hide
Query:  IPTSFGHELRACLRCRLVKTYDQFRESGCENC-PFFKMDEDHERVVECTTPNFNGIISVMDPARSWAARWLRIGRFVPGCYTLAVSEALPEDL
        +P    H LRACL C LVKT DQF   GC+NC  + +M  + E V +CT+ +F+GII++M P  SW ++W R+  F PG Y ++V+  LP+ +
Subjt:  IPTSFGHELRACLRCRLVKTYDQFRESGCENC-PFFKMDEDHERVVECTTPNFNGIISVMDPARSWAARWLRIGRFVPGCYTLAVSEALPEDL

Q6DGQ0 Transcription elongation factor SPT44.5e-1948.89Show/hide
Query:  IPTSFGHELRACLRCRLVKTYDQFRESGCENC-PFFKMDEDHERVVECTTPNFNGIISVMDPARSWAARWLRIGRFVPGCYTLAVSEALP
        +P    H LRACL C LVKT DQF   GC+NC  + +M  + E V ECT+ +F+G+I++M P  SW A+W RIG F PG Y + V+  LP
Subjt:  IPTSFGHELRACLRCRLVKTYDQFRESGCENC-PFFKMDEDHERVVECTTPNFNGIISVMDPARSWAARWLRIGRFVPGCYTLAVSEALP

Q8LCQ3 Transcription elongation factor SPT4 homolog 18.4e-5086Show/hide
Query:  MGSAPAQIPTSFGHELRACLRCRLVKTYDQFRESGCENCPFFKMDEDHERVVECTTPNFNGIISVMDPARSWAARWLRIGRFVPGCYTLAVSEALPEDLQ
        MG APAQIPTSFGHELRACLRCRLVKTYDQFR+SGCENCPFFK+++DHER+V+ TTPNFNGIIS+MDP RSWAARWLRIG+F PGCYTLAVSEALPE++Q
Subjt:  MGSAPAQIPTSFGHELRACLRCRLVKTYDQFRESGCENCPFFKMDEDHERVVECTTPNFNGIISVMDPARSWAARWLRIGRFVPGCYTLAVSEALPEDLQ

Q94C60 Transcription elongation factor SPT4 homolog 22.6e-5187.25Show/hide
Query:  MGSAPAQIPTSFGHELRACLRCRLVKTYDQFRESGCENCPFFKMDEDHERVVECTTPNFNGIISVMDPARSWAARWLRIGRFVPGCYTLAVSEALPEDLQ
        MGSAPAQIPTSFGHELRACLRCRLVKTYDQFR++GCENCPFFKM+EDHER+VE TTPNFNGIISVMDP+RSWAARWLRIG+F PGCYTLAVSE LPE++Q
Subjt:  MGSAPAQIPTSFGHELRACLRCRLVKTYDQFRESGCENCPFFKMDEDHERVVECTTPNFNGIISVMDPARSWAARWLRIGRFVPGCYTLAVSEALPEDLQ

Query:  NV
        ++
Subjt:  NV

Arabidopsis top hitse value%identityAlignment
AT2G18410.1 FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: cellular_component unknown; EXPRESSED IN: 22 plant structures; EXPRESSED DURING: 13 growth stages; CONTAINS InterPro DOMAIN/s: Histone acetylation protein 2 (InterPro:IPR019519); Has 35333 Blast hits to 34131 proteins in 2444 species: Archae - 798; Bacteria - 22429; Metazoa - 974; Fungi - 991; Plants - 531; Viruses - 0; Other Eukaryotes - 9610 (source: NCBI BLink).2.1e-8851.03Show/hide
Query:  SVTSALRDGVLEGELAPALTIKDSISSPFGFHAFAHVLAQLSINILAGKSQSRGLVLLAFSRSPAYYVDLLKKRGLTVGSSDKWIQILDCYTDPLGWKER
        S+   LRDG  EGELAPALTI+++++SPFG     ++L  LS +ILAGKS S+GLVL+ FSRSP++Y+ LLK++G+ V SS KWI+ILDCYTDPLGW   
Subjt:  SVTSALRDGVLEGELAPALTIKDSISSPFGFHAFAHVLAQLSINILAGKSQSRGLVLLAFSRSPAYYVDLLKKRGLTVGSSDKWIQILDCYTDPLGWKER

Query:  YMEGEKLSNVFQEVSTLSNLCTNVRDMDKLFSSIIALGKGFVGQGAVRFCVAIDSVCNRYVKTFIYISVAGFLSNLRSNV--------------------
            ++ S  F E S+L  L   V D+ KLFSSII  G+  VG G  RFCVAIDSV N  ++      V+G L++LRS+                     
Subjt:  YMEGEKLSNVFQEVSTLSNLCTNVRDMDKLFSSIIALGKGFVGQGAVRFCVAIDSVCNRYVKTFIYISVAGFLSNLRSNV--------------------

Query:  ---------ATVEPLTPSPHVRRGNLDNSYLEHNS-TKGRFHVRFKRRNGRVRVIVNSLLYAELWCEGFNVEQSGIKFTSILSEDAVI--NQGLLPKVQF
                 A +EPL PS   +R  L+N +  H    KGRFHVRFK R GRVRV+           E ++V+QSGI F+ I S D VI   + LLPKVQF
Subjt:  ---------ATVEPLTPSPHVRRGNLDNSYLEHNS-TKGRFHVRFKRRNGRVRVIVNSLLYAELWCEGFNVEQSGIKFTSILSEDAVI--NQGLLPKVQF

Query:  NLQLSEKERIDRARVVLPFEHQGNGKPIQIYDGRRSLTESKDYEKPLLTNEKGKD-EGSGK-GEIVYFRDSDDEIPDSDEDPDDDLDI
        NLQLSEKER+++ +VVLPFEHQ +GK  +IYDGRRSL + K    PL + E   D   SGK GEI+YFRDSDDE PDSDEDPDDDLDI
Subjt:  NLQLSEKERIDRARVVLPFEHQGNGKPIQIYDGRRSLTESKDYEKPLLTNEKGKD-EGSGK-GEIVYFRDSDDEIPDSDEDPDDDLDI

AT2G18410.2 unknown protein4.3e-5745.33Show/hide
Query:  SVTSALRDGVLEGELAPALTIKDSISSPFGFHAFAHVLAQLSINILAGKSQSRGLVLLAFSRSPAYYVDLLKKRGLTVGSSDKWIQILDCYTDPLGWKER
        S+   LRDG  EGELAPALTI+++++SPFG     ++L  LS +ILAGKS S+GLVL+ FSRSP++Y+ LLK++G+ V SS KWI+ILDCYTDPLGW   
Subjt:  SVTSALRDGVLEGELAPALTIKDSISSPFGFHAFAHVLAQLSINILAGKSQSRGLVLLAFSRSPAYYVDLLKKRGLTVGSSDKWIQILDCYTDPLGWKER

Query:  YMEGEKLSNVFQEVSTLSNLCTNVRDMDKLFSSIIALGKGFVGQGAVRFCVAIDSVCNRYVKTFIYISVAGFLSNLRSNV--------------------
            ++ S  F E S+L  L   V D+ KLFSSII  G+  VG G  RFCVAIDSV N  ++      V+G L++LRS+                     
Subjt:  YMEGEKLSNVFQEVSTLSNLCTNVRDMDKLFSSIIALGKGFVGQGAVRFCVAIDSVCNRYVKTFIYISVAGFLSNLRSNV--------------------

Query:  ---------ATVEPLTPSPHVRRGNLDNSYLEHNS-TKGRFHVRFKRRNGRVRVIVNSLLYAELWCEGFNVEQSGIKFTSILSEDAVI--NQGLLPKVQF
                 A +EPL PS   +R  L+N +  H    KGRFHVRFK R GRVRV+           E ++V+QSGI F+ I S D VI   + LLPK  +
Subjt:  ---------ATVEPLTPSPHVRRGNLDNSYLEHNS-TKGRFHVRFKRRNGRVRVIVNSLLYAELWCEGFNVEQSGIKFTSILSEDAVI--NQGLLPKVQF

AT5G08565.1 Transcription initiation Spt4-like protein6.0e-5186Show/hide
Query:  MGSAPAQIPTSFGHELRACLRCRLVKTYDQFRESGCENCPFFKMDEDHERVVECTTPNFNGIISVMDPARSWAARWLRIGRFVPGCYTLAVSEALPEDLQ
        MG APAQIPTSFGHELRACLRCRLVKTYDQFR+SGCENCPFFK+++DHER+V+ TTPNFNGIIS+MDP RSWAARWLRIG+F PGCYTLAVSEALPE++Q
Subjt:  MGSAPAQIPTSFGHELRACLRCRLVKTYDQFRESGCENCPFFKMDEDHERVVECTTPNFNGIISVMDPARSWAARWLRIGRFVPGCYTLAVSEALPEDLQ

AT5G63670.1 SPT4 homolog 21.9e-5287.25Show/hide
Query:  MGSAPAQIPTSFGHELRACLRCRLVKTYDQFRESGCENCPFFKMDEDHERVVECTTPNFNGIISVMDPARSWAARWLRIGRFVPGCYTLAVSEALPEDLQ
        MGSAPAQIPTSFGHELRACLRCRLVKTYDQFR++GCENCPFFKM+EDHER+VE TTPNFNGIISVMDP+RSWAARWLRIG+F PGCYTLAVSE LPE++Q
Subjt:  MGSAPAQIPTSFGHELRACLRCRLVKTYDQFRESGCENCPFFKMDEDHERVVECTTPNFNGIISVMDPARSWAARWLRIGRFVPGCYTLAVSEALPEDLQ

Query:  NV
        ++
Subjt:  NV

AT5G63670.2 SPT4 homolog 24.3e-4192.41Show/hide
Query:  MGSAPAQIPTSFGHELRACLRCRLVKTYDQFRESGCENCPFFKMDEDHERVVECTTPNFNGIISVMDPARSWAARWLRI
        MGSAPAQIPTSFGHELRACLRCRLVKTYDQFR++GCENCPFFKM+EDHER+VE TTPNFNGIISVMDP+RSWAARWLRI
Subjt:  MGSAPAQIPTSFGHELRACLRCRLVKTYDQFRESGCENCPFFKMDEDHERVVECTTPNFNGIISVMDPARSWAARWLRI


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGAAGTGCACCTGCTCAAATTCCTACCAGTTTTGGGCATGAGCTCAGAGCTTGTCTTCGTTGTCGTCTCGTCAAAACTTACGATCAGTTTCGAGAATCGGGCTGTGA
GAATTGTCCCTTCTTCAAGATGGACGAAGATCATGAGCGTGTTGTAGAGTGTACTACTCCTAATTTTAACGGGATAATTTCTGTCATGGATCCTGCCAGAAGTTGGGCTG
CTAGATGGCTGCGAATTGGAAGATTTGTCCCAGGTTGTTACACTCTGGCAGTCTCAGAGGCACTCCCAGAGGATTTGCAGAATGTGCATCCTCATATGGTTCTCTCCAAG
CTGGTGCTCTTGTGCAAAGCAGATAGTGAGACTCAATCCCATTCTTTTCCACTCGCTCTTTTCCTGCGGGATCGAACAAGGATGGCCTTCTTCCTTTGTATGGTTCCATC
AAAATATGGTTCCATCGAAATCCGACAGAAACTACCCTTCTTAACTCGAATGATTTCTGCTCGAGAACAAGCAAGGATCTTCCTTGCTCGTTCTCTCTCGTTTTCTCCTG
AATGTGCATTACTTGCACTCCGATTGTTGTTGAATTGGTTTGGAGAACTCTTCACCTTCCCATCTCTTAGGATTGCTGTTTCAATTTATTTTGGACGTGGTCTTGGAACG
TTTGATTCTGTCAGGGATTTATGGTTTAGCTGGCGTCGCGCCGCTCCCCGTCCCTCTCATCATAGACAGACAGACCTCGACGGCCAAATCTCACAGCTCCGGTTCCGTCG
ATCGATCTCCGGCAGAGACCCAAGCGTGACTTCAGCACTTAGAGACGGTGTGTTAGAGGGAGAGCTCGCGCCTGCTCTCACTATAAAGGACTCCATAAGTTCGCCATTCG
GTTTTCATGCCTTCGCCCACGTTCTGGCACAGCTCTCCATCAATATTTTGGCGGGTAAATCGCAGTCTCGAGGCCTCGTTCTACTCGCGTTCTCTCGAAGTCCGGCGTAT
TATGTCGACCTGTTAAAGAAGAGAGGACTTACTGTTGGGTCGTCTGATAAATGGATTCAAATTTTGGACTGTTATACAGATCCTCTTGGTTGGAAGGAACGGTATATGGA
GGGTGAAAAATTATCAAATGTTTTCCAAGAAGTTTCAACTTTATCTAATCTTTGTACAAATGTGAGGGATATGGATAAACTATTCTCTTCAATTATAGCACTTGGAAAAG
GATTTGTTGGACAAGGAGCTGTACGCTTTTGTGTCGCCATAGACTCTGTATGTAACCGATATGTTAAGACATTCATCTACATCAGCGTTGCAGGTTTTTTAAGCAACCTC
CGGAGCAATGTGGCCACAGTTGAGCCATTAACACCATCCCCACATGTGCGCAGAGGCAATCTGGACAACTCCTATCTTGAACACAACTCTACAAAAGGGAGGTTTCATGT
GCGGTTTAAACGCAGAAATGGACGCGTGCGAGTCATTGTAAATTCTCTTCTCTATGCAGAACTTTGGTGTGAAGGTTTCAATGTTGAGCAGTCAGGCATCAAATTTACGT
CCATTTTATCTGAAGATGCAGTCATCAATCAAGGGCTATTACCGAAGGTGCAGTTCAATCTACAGTTGTCAGAGAAAGAGCGAATTGACAGGGCTAGAGTTGTTCTTCCT
TTTGAACATCAAGGAAATGGTAAACCGATACAAATATACGATGGCCGAAGATCCCTTACCGAAAGCAAAGATTACGAGAAACCTCTCTTGACCAATGAGAAAGGCAAGGA
TGAAGGATCTGGAAAGGGTGAGATTGTCTATTTCCGTGATTCAGATGATGAGATCCCAGATTCTGATGAGGACCCAGATGATGATTTAGACATATAA
mRNA sequenceShow/hide mRNA sequence
ATGGGAAGTGCACCTGCTCAAATTCCTACCAGTTTTGGGCATGAGCTCAGAGCTTGTCTTCGTTGTCGTCTCGTCAAAACTTACGATCAGTTTCGAGAATCGGGCTGTGA
GAATTGTCCCTTCTTCAAGATGGACGAAGATCATGAGCGTGTTGTAGAGTGTACTACTCCTAATTTTAACGGGATAATTTCTGTCATGGATCCTGCCAGAAGTTGGGCTG
CTAGATGGCTGCGAATTGGAAGATTTGTCCCAGGTTGTTACACTCTGGCAGTCTCAGAGGCACTCCCAGAGGATTTGCAGAATGTGCATCCTCATATGGTTCTCTCCAAG
CTGGTGCTCTTGTGCAAAGCAGATAGTGAGACTCAATCCCATTCTTTTCCACTCGCTCTTTTCCTGCGGGATCGAACAAGGATGGCCTTCTTCCTTTGTATGGTTCCATC
AAAATATGGTTCCATCGAAATCCGACAGAAACTACCCTTCTTAACTCGAATGATTTCTGCTCGAGAACAAGCAAGGATCTTCCTTGCTCGTTCTCTCTCGTTTTCTCCTG
AATGTGCATTACTTGCACTCCGATTGTTGTTGAATTGGTTTGGAGAACTCTTCACCTTCCCATCTCTTAGGATTGCTGTTTCAATTTATTTTGGACGTGGTCTTGGAACG
TTTGATTCTGTCAGGGATTTATGGTTTAGCTGGCGTCGCGCCGCTCCCCGTCCCTCTCATCATAGACAGACAGACCTCGACGGCCAAATCTCACAGCTCCGGTTCCGTCG
ATCGATCTCCGGCAGAGACCCAAGCGTGACTTCAGCACTTAGAGACGGTGTGTTAGAGGGAGAGCTCGCGCCTGCTCTCACTATAAAGGACTCCATAAGTTCGCCATTCG
GTTTTCATGCCTTCGCCCACGTTCTGGCACAGCTCTCCATCAATATTTTGGCGGGTAAATCGCAGTCTCGAGGCCTCGTTCTACTCGCGTTCTCTCGAAGTCCGGCGTAT
TATGTCGACCTGTTAAAGAAGAGAGGACTTACTGTTGGGTCGTCTGATAAATGGATTCAAATTTTGGACTGTTATACAGATCCTCTTGGTTGGAAGGAACGGTATATGGA
GGGTGAAAAATTATCAAATGTTTTCCAAGAAGTTTCAACTTTATCTAATCTTTGTACAAATGTGAGGGATATGGATAAACTATTCTCTTCAATTATAGCACTTGGAAAAG
GATTTGTTGGACAAGGAGCTGTACGCTTTTGTGTCGCCATAGACTCTGTATGTAACCGATATGTTAAGACATTCATCTACATCAGCGTTGCAGGTTTTTTAAGCAACCTC
CGGAGCAATGTGGCCACAGTTGAGCCATTAACACCATCCCCACATGTGCGCAGAGGCAATCTGGACAACTCCTATCTTGAACACAACTCTACAAAAGGGAGGTTTCATGT
GCGGTTTAAACGCAGAAATGGACGCGTGCGAGTCATTGTAAATTCTCTTCTCTATGCAGAACTTTGGTGTGAAGGTTTCAATGTTGAGCAGTCAGGCATCAAATTTACGT
CCATTTTATCTGAAGATGCAGTCATCAATCAAGGGCTATTACCGAAGGTGCAGTTCAATCTACAGTTGTCAGAGAAAGAGCGAATTGACAGGGCTAGAGTTGTTCTTCCT
TTTGAACATCAAGGAAATGGTAAACCGATACAAATATACGATGGCCGAAGATCCCTTACCGAAAGCAAAGATTACGAGAAACCTCTCTTGACCAATGAGAAAGGCAAGGA
TGAAGGATCTGGAAAGGGTGAGATTGTCTATTTCCGTGATTCAGATGATGAGATCCCAGATTCTGATGAGGACCCAGATGATGATTTAGACATATAA
Protein sequenceShow/hide protein sequence
MGSAPAQIPTSFGHELRACLRCRLVKTYDQFRESGCENCPFFKMDEDHERVVECTTPNFNGIISVMDPARSWAARWLRIGRFVPGCYTLAVSEALPEDLQNVHPHMVLSK
LVLLCKADSETQSHSFPLALFLRDRTRMAFFLCMVPSKYGSIEIRQKLPFLTRMISAREQARIFLARSLSFSPECALLALRLLLNWFGELFTFPSLRIAVSIYFGRGLGT
FDSVRDLWFSWRRAAPRPSHHRQTDLDGQISQLRFRRSISGRDPSVTSALRDGVLEGELAPALTIKDSISSPFGFHAFAHVLAQLSINILAGKSQSRGLVLLAFSRSPAY
YVDLLKKRGLTVGSSDKWIQILDCYTDPLGWKERYMEGEKLSNVFQEVSTLSNLCTNVRDMDKLFSSIIALGKGFVGQGAVRFCVAIDSVCNRYVKTFIYISVAGFLSNL
RSNVATVEPLTPSPHVRRGNLDNSYLEHNSTKGRFHVRFKRRNGRVRVIVNSLLYAELWCEGFNVEQSGIKFTSILSEDAVINQGLLPKVQFNLQLSEKERIDRARVVLP
FEHQGNGKPIQIYDGRRSLTESKDYEKPLLTNEKGKDEGSGKGEIVYFRDSDDEIPDSDEDPDDDLDI