; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0011726 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0011726
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRetrovirus-related Pol polyprotein from transposon TNT 1-94
Genome locationchr1:31722410..31731568
RNA-Seq ExpressionLag0011726
SyntenyLag0011726
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR001584 - Integrase, catalytic core
IPR001878 - Zinc finger, CCHC-type
IPR012337 - Ribonuclease H-like superfamily
IPR036237 - Xylose isomerase-like superfamily
IPR036397 - Ribonuclease H superfamily
IPR036875 - Zinc finger, CCHC-type superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0046182.1 zf-CCHC domain-containing protein/UBN2 domain-containing protein [Cucumis melo var. makuwa]1.3e-8141.37Show/hide
Query:  WEPKVTAIQEARNLKSLSIDEL-----------------------------MEVDSQDEDDLDEDDVANLSRKYKNFIKRKKQFKKHLSNQKELNGEKSK
        W+ KVTAIQEA++L  L ++EL                             + ++ +DEDDLDEDD+   SRKYKNFIKRKK FKK+LS QK   GEKSK
Subjt:  WEPKVTAIQEARNLKSLSIDEL-----------------------------MEVDSQDEDDLDEDDVANLSRKYKNFIKRKKQFKKHLSNQKELNGEKSK

Query:  KDEVICYGCKKQGHIRTDCPLLKSSKKSKKKAMKATWDDSDESESGSDNEEVANFCFMAHSDIEDEQNDEVTLEPLSYNELFEVFENMQNDLEKLSSKYV
        KDEVICY CKK  HIRTDCP LKSSKKSK+KAMKATWDDS ESE  S+ EE AN   M  SD EDE +DEVTLEP S  ELFE FEN+QNDLEKLSSKYV
Subjt:  KDEVICYGCKKQGHIRTDCPLLKSSKKSKKKAMKATWDDSDESESGSDNEEVANFCFMAHSDIEDEQNDEVTLEPLSYNELFEVFENMQNDLEKLSSKYV

Query:  VLKKKFNALSSKNKSLLEEIACLKENEHDVMHIDEMNISCDKHCVTPRISRRQGSVLQYQASSSATWTFVGESTTSFSDFQHLYRATISSLMPVQEDRQS
        VLKKK+N LSS+NKSLL++IAC KEN +    I+E+N+S DKH                                                         
Subjt:  VLKKKFNALSSKNKSLLEEIACLKENEHDVMHIDEMNISCDKHCVTPRISRRQGSVLQYQASSSATWTFVGESTTSFSDFQHLYRATISSLMPVQEDRQS

Query:  SNVSLSFPAKSVSFGCVSSRRWCCSVLGSIFRRSRLKSPASGTLSCALSVQDGVELMIRHVIRAPDGTGMFSCVSCSTVFYIHISGVLCWDRCSSARGCQ
                                                                                                            
Subjt:  SNVSLSFPAKSVSFGCVSSRRWCCSVLGSIFRRSRLKSPASGTLSCALSVQDGVELMIRHVIRAPDGTGMFSCVSCSTVFYIHISGVLCWDRCSSARGCQ

Query:  IASSLLALTDRPTQRSSWHRLGIGKHPRQNVSCTQLWPQVGSLHIAFDCDEKNALLDKVKFLEHDSCEKDNLIKLLKEKEFSVVQELDKAKESIKKLTIC
                              IG                       DC+EK+ALLDKV+FLEHDSCEKDNLIK+LKE E +V+Q+LDKAKE+IKKLTI 
Subjt:  IASSLLALTDRPTQRSSWHRLGIGKHPRQNVSCTQLWPQVGSLHIAFDCDEKNALLDKVKFLEHDSCEKDNLIKLLKEKEFSVVQELDKAKESIKKLTIC

Query:  AQRLDKIIEVGKSFDDKRGLGYIDECSTPSSSKTIFVKASHNMPKVVSKHVKSNFVPICHYCGVVKLK
        AQRLDKIIEVGKS+ DKR LGYIDE ST  SSKT FVKAS  +PK        N V +C Y  +V LK
Subjt:  AQRLDKIIEVGKSFDDKRGLGYIDECSTPSSSKTIFVKASHNMPKVVSKHVKSNFVPICHYCGVVKLK

TYK02592.1 zf-CCHC domain-containing protein/DUF4219 domain-containing protein/UBN2 domain-containing protein [Cucumis melo var. makuwa]6.4e-5267.2Show/hide
Query:  WEPKVTAIQEARNLKSLSIDELMEVDSQDEDDLDEDDVANLSRKYKNFIKRKKQFKKHLSNQKELNGEKSKKDEVICYGCKKQGHIRTDCPLLKSSKKSK
        WE K     E++  KS++++ +  ++ +DEDDLDEDD+A  SRKY+NFIKRKK FKKHLS QKE  GEK+KKDEVI Y CKK G+IRTDCP LKSSKKSK
Subjt:  WEPKVTAIQEARNLKSLSIDELMEVDSQDEDDLDEDDVANLSRKYKNFIKRKKQFKKHLSNQKELNGEKSKKDEVICYGCKKQGHIRTDCPLLKSSKKSK

Query:  KKAMKATWDDSDESESGSDNEEVANFCFMAHSDIEDEQNDEVTLEPLSYNELFEVFENMQNDLEKLSSKYVVLKKKFNALSSKNKS
        KKA+KATWDDS +SE  S+ EE+AN   MAHSD EDE +DEVTLE  S +ELFE FE+MQNDLEKLSSK VVLKKK+N L+S+NKS
Subjt:  KKAMKATWDDSDESESGSDNEEVANFCFMAHSDIEDEQNDEVTLEPLSYNELFEVFENMQNDLEKLSSKYVVLKKKFNALSSKNKS

XP_022156978.1 uncharacterized protein LOC111023806 [Momordica charantia]5.8e-6935.46Show/hide
Query:  SIILRWEPKVTAIQEARNLKSLSIDELMEVDSQDEDDLDEDDVANLSRKYKNFIKRKKQFKKHLSNQKELNGEKSKKDEVICYGCKKQGHIRTDCPLLKS
        S+  +WEPKVT IQEA++LK+LS+DEL+      E  + +    N+  + K   K K    K ++ + +  GE +  ++ + Y  +K     TDCPLLKS
Subjt:  SIILRWEPKVTAIQEARNLKSLSIDELMEVDSQDEDDLDEDDVANLSRKYKNFIKRKKQFKKHLSNQKELNGEKSKKDEVICYGCKKQGHIRTDCPLLKS

Query:  SKKSKKKAMKATWDDSDESESGSDNEEVANFCFMAHSDIEDEQNDEVTLEPLSYNELFEVFENMQNDLEKLSSKYVVLKKKFNALSSKNKSLLEEIACLK
        SKKSKKKAMKATWDDSDES S S+NEEVANFCFMAHSD EDEQ+DEV L+PLSY+ELFE FENMQN+LEKL SKYV+LK K N  +S+NKSL ++IACLK
Subjt:  SKKSKKKAMKATWDDSDESESGSDNEEVANFCFMAHSDIEDEQNDEVTLEPLSYNELFEVFENMQNDLEKLSSKYVVLKKKFNALSSKNKSLLEEIACLK

Query:  ENEHDVMHIDEMNISCDKHCVTPRISRRQGSVLQYQASSSATWTFVGESTTSFSDFQHLYRATISSLMPVQEDRQSSNVSLSFPAKSVSFGCVSSRRWCC
        +NEHDV                                                                                              
Subjt:  ENEHDVMHIDEMNISCDKHCVTPRISRRQGSVLQYQASSSATWTFVGESTTSFSDFQHLYRATISSLMPVQEDRQSSNVSLSFPAKSVSFGCVSSRRWCC

Query:  SVLGSIFRRSRLKSPASGTLSCALSVQDGVELMIRHVIRAPDGTGMFSCVSCSTVFYIHISGVLCWDRCSSARGCQIASSLLALTDRPTQRSSWHRLGIG
                                                                                                            
Subjt:  SVLGSIFRRSRLKSPASGTLSCALSVQDGVELMIRHVIRAPDGTGMFSCVSCSTVFYIHISGVLCWDRCSSARGCQIASSLLALTDRPTQRSSWHRLGIG

Query:  KHPRQNVSCTQLWPQVGSLHIAFDCDEKNALLDKVKFLEHDSCEKDNLIKLLKEKEFSVVQELDKAKESIKKLTICAQRLDKIIEVGKSFDDKRGLGYID
                                                     DNLIKLLK+ E   + ELDKAK+ IK+LTI AQRLDKIIE GK + DKRGLGYI+
Subjt:  KHPRQNVSCTQLWPQVGSLHIAFDCDEKNALLDKVKFLEHDSCEKDNLIKLLKEKEFSVVQELDKAKESIKKLTICAQRLDKIIEVGKSFDDKRGLGYID

Query:  ECSTPSSSKTIFVKASHNMPKVVSKHVKSNFVPICHYCGVVKLKYAHTTSPRRIFSQRAKFHNAPRNNFSKKGRVQQFAVCLKVSKKSKWYLDSGCSRHM
        EC+TPSSSKTIFVKAS NMPK+V                                        AP+             VCLK SKKSKWYLDS CSRHM
Subjt:  ECSTPSSSKTIFVKASHNMPKVVSKHVKSNFVPICHYCGVVKLKYAHTTSPRRIFSQRAKFHNAPRNNFSKKGRVQQFAVCLKVSKKSKWYLDSGCSRHM

Query:  TGDPSKFVTLSKKDGGLVTFGDNKKG
        TGD SKFVT SK DGG VTFGDNKKG
Subjt:  TGDPSKFVTLSKKDGGLVTFGDNKKG

XP_022158792.1 uncharacterized protein LOC111025259 [Momordica charantia]3.5e-7436Show/hide
Query:  SIILRWEPKVTAIQEARNLKSLSIDELMEVDSQDEDDLDEDDVANLSRKYKNFIKRKKQFKKHLSNQKELNGEKSKKDEVICYGCKKQGHIRTDCPLLKS
        S+  +WEPKVTAIQEA++LK+LS+DEL+      E+ LDEDDVA LSRKYKNFIKRKKQFKK+ SN KE   E SKKDEVICY CKK GHIRTDCP LKS
Subjt:  SIILRWEPKVTAIQEARNLKSLSIDELMEVDSQDEDDLDEDDVANLSRKYKNFIKRKKQFKKHLSNQKELNGEKSKKDEVICYGCKKQGHIRTDCPLLKS

Query:  SKKSKKKAMKATWDDSDESESGSDNEEVANFCFMAHSDIEDEQNDEVTLEPLSYNELFEVFENMQNDLEKLSSKYVVLKKKFNALSSKNKSLLEEIACLK
        SKKSKKKAMKATWDDSDES + S+NEEVANFCFMAHSD EDE++DE+TL+PLSY+ELFE FENMQNDLEKL                             
Subjt:  SKKSKKKAMKATWDDSDESESGSDNEEVANFCFMAHSDIEDEQNDEVTLEPLSYNELFEVFENMQNDLEKLSSKYVVLKKKFNALSSKNKSLLEEIACLK

Query:  ENEHDVMHIDEMNISCDKHCVTPRISRRQGSVLQYQASSSATWTFVGESTTSFSDFQHLYRATISSLMPVQEDRQSSNVSLSFPAKSVSFGCVSSRRWCC
                                                                                                            
Subjt:  ENEHDVMHIDEMNISCDKHCVTPRISRRQGSVLQYQASSSATWTFVGESTTSFSDFQHLYRATISSLMPVQEDRQSSNVSLSFPAKSVSFGCVSSRRWCC

Query:  SVLGSIFRRSRLKSPASGTLSCALSVQDGVELMIRHVIRAPDGTGMFSCVSCSTVFYIHISGVLCWDRCSSARGCQIASSLLALTDRPTQRSSWHRLGIG
                                                                                                            
Subjt:  SVLGSIFRRSRLKSPASGTLSCALSVQDGVELMIRHVIRAPDGTGMFSCVSCSTVFYIHISGVLCWDRCSSARGCQIASSLLALTDRPTQRSSWHRLGIG

Query:  KHPRQNVSCTQLWPQVGSLHIAFDCDEKNALLDKVKFLEHDSCEKDNLIKLLKEKEFSVVQELDKAKESIKKLTICAQRLDKIIEVGKSFDDKRGLGYID
                                                                     ELDKAK+SIKKLTI AQRLDKIIE+GK + DKRGLGYID
Subjt:  KHPRQNVSCTQLWPQVGSLHIAFDCDEKNALLDKVKFLEHDSCEKDNLIKLLKEKEFSVVQELDKAKESIKKLTICAQRLDKIIEVGKSFDDKRGLGYID

Query:  ECSTPSSSKTIFVKASHNMPKVVSKHVKSNFVPICHYCGVVKLKYAHTTSPRRIFSQRAKFHNAPRNNFSKKGRVQQFAVCLKVSKKSKWYLDSGCSRHM
        ECSTPSSSK IFVKAS NMPK+V+  V                                                    VCLK SKK KWYLDSGCSR+M
Subjt:  ECSTPSSSKTIFVKASHNMPKVVSKHVKSNFVPICHYCGVVKLKYAHTTSPRRIFSQRAKFHNAPRNNFSKKGRVQQFAVCLKVSKKSKWYLDSGCSRHM

Query:  TGDPSKFVTLSKKDGGLVTFGDNKK
        TGD SKFVT SKKDGG VTFGD+KK
Subjt:  TGDPSKFVTLSKKDGGLVTFGDNKK

XP_031741720.1 uncharacterized protein LOC116403915 [Cucumis sativus]7.5e-10945.07Show/hide
Query:  WEPKVTAIQEARNLKSLSIDEL-------------------------------MEVDSQDEDDLDEDDVANLSRKYKNFIKRKKQFKKHLSNQKELNGEK
        WE KVTAIQEA++L  L ++EL                               +EVD +DED LDEDD+A  SRKYKNFIKRKK FKKHLS QKE  GEK
Subjt:  WEPKVTAIQEARNLKSLSIDEL-------------------------------MEVDSQDEDDLDEDDVANLSRKYKNFIKRKKQFKKHLSNQKELNGEK

Query:  SKKDEVICYGCKKQGHIRTDCPLLKSSKKSKKKAMKATWDDSDESESGSDNEEVANFCFMAHSDIEDEQNDEVTLEPLSYNELFEVFENMQNDLEKLSSK
        SKKDEVICY CK+ GHIRTDCPLLKSSKKSKKKAMKATWDDS ESE  S+ EE+AN   MAHSD +DE +D+VTLEPLS +ELFE FE+MQNDLEKLSSK
Subjt:  SKKDEVICYGCKKQGHIRTDCPLLKSSKKSKKKAMKATWDDSDESESGSDNEEVANFCFMAHSDIEDEQNDEVTLEPLSYNELFEVFENMQNDLEKLSSK

Query:  YVVLKKKFNALSSKNKSLLEEIACLKENEHDVMHIDEMNISCDKHCVTPRISRRQGSVLQYQASSSATWTFVGESTTSFSDFQHLYRATISSLMPVQEDR
        YVVLKKK+N L S+NKSLL+ IAC KENE +   I+E+N+S DKH                                                       
Subjt:  YVVLKKKFNALSSKNKSLLEEIACLKENEHDVMHIDEMNISCDKHCVTPRISRRQGSVLQYQASSSATWTFVGESTTSFSDFQHLYRATISSLMPVQEDR

Query:  QSSNVSLSFPAKSVSFGCVSSRRWCCSVLGSIFRRSRLKSPASGTLSCALSVQDGVELMIRHVIRAPDGTGMFSCVSCSTVFYIHISGVLCWDRCSSARG
                                                                                                            
Subjt:  QSSNVSLSFPAKSVSFGCVSSRRWCCSVLGSIFRRSRLKSPASGTLSCALSVQDGVELMIRHVIRAPDGTGMFSCVSCSTVFYIHISGVLCWDRCSSARG

Query:  CQIASSLLALTDRPTQRSSWHRLGIGKHPRQNVSCTQLWPQVGSLHIAFDCDEKNALLDKVKFLEHDSCEKDNLIKLLKEKEFSVVQELDKAKESIKKLT
                                                          C EK+ALLDKV+FLEHDSCEKDNLIK+LKE E SV+QELDKAKE+IKKLT
Subjt:  CQIASSLLALTDRPTQRSSWHRLGIGKHPRQNVSCTQLWPQVGSLHIAFDCDEKNALLDKVKFLEHDSCEKDNLIKLLKEKEFSVVQELDKAKESIKKLT

Query:  ICAQRLDKIIEVGKSFDDKRGLGYIDECSTPSSSKTIFVKASHNMPKV-----VSKHVKSNFVPICHYCGV--------VKLKYAHTTSPRRIFSQRAKF
        I AQRLDKIIEVGKS+ DKRGLGYIDE STPSSSKT FVKAS  +PK      VS HVKS+FVPICH CGV         KLKYA  T  RR FSQRAKF
Subjt:  ICAQRLDKIIEVGKSFDDKRGLGYIDECSTPSSSKTIFVKASHNMPKV-----VSKHVKSNFVPICHYCGV--------VKLKYAHTTSPRRIFSQRAKF

Query:  HNAPRNNFSKKGRVQQFAV
        + APR NFS K RV +F +
Subjt:  HNAPRNNFSKKGRVQQFAV

TrEMBL top hitse value%identityAlignment
A0A5A7SRY3 Zf-CCHC domain-containing protein/DUF4219 domain-containing protein/UBN2 domain-containing protein7.7e-5168.93Show/hide
Query:  EARNLKSLSIDELMEVDSQDEDDLDEDDVANLSRKYKNFIKRKKQFKKHLSNQKELNGEKSKKDEVICYGCKKQGHIRTDCPLLKSSKKSKKKAMKATWD
        E++  KS++++ +  ++ +DEDDLDEDD+A  SRKY+NFIKRKK FKKHLS QKE  GEK+KKDEVI Y CKK G+IRTDCP LKSSKKSKKKA+KATWD
Subjt:  EARNLKSLSIDELMEVDSQDEDDLDEDDVANLSRKYKNFIKRKKQFKKHLSNQKELNGEKSKKDEVICYGCKKQGHIRTDCPLLKSSKKSKKKAMKATWD

Query:  DSDESESGSDNEEVANFCFMAHSDIEDEQNDEVTLEPLSYNELFEVFENMQNDLEKLSSKYVVLKKKFNALSSKNKS
        DS +SE  S+ EE+AN   MAHSD EDE +DEVTLE  S +ELFE FE+MQNDLEKLSSK VVLKKK+N L+S+NKS
Subjt:  DSDESESGSDNEEVANFCFMAHSDIEDEQNDEVTLEPLSYNELFEVFENMQNDLEKLSSKYVVLKKKFNALSSKNKS

A0A5A7TRZ7 Zf-CCHC domain-containing protein/UBN2 domain-containing protein6.4e-8241.37Show/hide
Query:  WEPKVTAIQEARNLKSLSIDEL-----------------------------MEVDSQDEDDLDEDDVANLSRKYKNFIKRKKQFKKHLSNQKELNGEKSK
        W+ KVTAIQEA++L  L ++EL                             + ++ +DEDDLDEDD+   SRKYKNFIKRKK FKK+LS QK   GEKSK
Subjt:  WEPKVTAIQEARNLKSLSIDEL-----------------------------MEVDSQDEDDLDEDDVANLSRKYKNFIKRKKQFKKHLSNQKELNGEKSK

Query:  KDEVICYGCKKQGHIRTDCPLLKSSKKSKKKAMKATWDDSDESESGSDNEEVANFCFMAHSDIEDEQNDEVTLEPLSYNELFEVFENMQNDLEKLSSKYV
        KDEVICY CKK  HIRTDCP LKSSKKSK+KAMKATWDDS ESE  S+ EE AN   M  SD EDE +DEVTLEP S  ELFE FEN+QNDLEKLSSKYV
Subjt:  KDEVICYGCKKQGHIRTDCPLLKSSKKSKKKAMKATWDDSDESESGSDNEEVANFCFMAHSDIEDEQNDEVTLEPLSYNELFEVFENMQNDLEKLSSKYV

Query:  VLKKKFNALSSKNKSLLEEIACLKENEHDVMHIDEMNISCDKHCVTPRISRRQGSVLQYQASSSATWTFVGESTTSFSDFQHLYRATISSLMPVQEDRQS
        VLKKK+N LSS+NKSLL++IAC KEN +    I+E+N+S DKH                                                         
Subjt:  VLKKKFNALSSKNKSLLEEIACLKENEHDVMHIDEMNISCDKHCVTPRISRRQGSVLQYQASSSATWTFVGESTTSFSDFQHLYRATISSLMPVQEDRQS

Query:  SNVSLSFPAKSVSFGCVSSRRWCCSVLGSIFRRSRLKSPASGTLSCALSVQDGVELMIRHVIRAPDGTGMFSCVSCSTVFYIHISGVLCWDRCSSARGCQ
                                                                                                            
Subjt:  SNVSLSFPAKSVSFGCVSSRRWCCSVLGSIFRRSRLKSPASGTLSCALSVQDGVELMIRHVIRAPDGTGMFSCVSCSTVFYIHISGVLCWDRCSSARGCQ

Query:  IASSLLALTDRPTQRSSWHRLGIGKHPRQNVSCTQLWPQVGSLHIAFDCDEKNALLDKVKFLEHDSCEKDNLIKLLKEKEFSVVQELDKAKESIKKLTIC
                              IG                       DC+EK+ALLDKV+FLEHDSCEKDNLIK+LKE E +V+Q+LDKAKE+IKKLTI 
Subjt:  IASSLLALTDRPTQRSSWHRLGIGKHPRQNVSCTQLWPQVGSLHIAFDCDEKNALLDKVKFLEHDSCEKDNLIKLLKEKEFSVVQELDKAKESIKKLTIC

Query:  AQRLDKIIEVGKSFDDKRGLGYIDECSTPSSSKTIFVKASHNMPKVVSKHVKSNFVPICHYCGVVKLK
        AQRLDKIIEVGKS+ DKR LGYIDE ST  SSKT FVKAS  +PK        N V +C Y  +V LK
Subjt:  AQRLDKIIEVGKSFDDKRGLGYIDECSTPSSSKTIFVKASHNMPKVVSKHVKSNFVPICHYCGVVKLK

A0A5D3BUV2 Zf-CCHC domain-containing protein/DUF4219 domain-containing protein/UBN2 domain-containing protein3.1e-5267.2Show/hide
Query:  WEPKVTAIQEARNLKSLSIDELMEVDSQDEDDLDEDDVANLSRKYKNFIKRKKQFKKHLSNQKELNGEKSKKDEVICYGCKKQGHIRTDCPLLKSSKKSK
        WE K     E++  KS++++ +  ++ +DEDDLDEDD+A  SRKY+NFIKRKK FKKHLS QKE  GEK+KKDEVI Y CKK G+IRTDCP LKSSKKSK
Subjt:  WEPKVTAIQEARNLKSLSIDELMEVDSQDEDDLDEDDVANLSRKYKNFIKRKKQFKKHLSNQKELNGEKSKKDEVICYGCKKQGHIRTDCPLLKSSKKSK

Query:  KKAMKATWDDSDESESGSDNEEVANFCFMAHSDIEDEQNDEVTLEPLSYNELFEVFENMQNDLEKLSSKYVVLKKKFNALSSKNKS
        KKA+KATWDDS +SE  S+ EE+AN   MAHSD EDE +DEVTLE  S +ELFE FE+MQNDLEKLSSK VVLKKK+N L+S+NKS
Subjt:  KKAMKATWDDSDESESGSDNEEVANFCFMAHSDIEDEQNDEVTLEPLSYNELFEVFENMQNDLEKLSSKYVVLKKKFNALSSKNKS

A0A6J1DS74 uncharacterized protein LOC1110238062.8e-6935.46Show/hide
Query:  SIILRWEPKVTAIQEARNLKSLSIDELMEVDSQDEDDLDEDDVANLSRKYKNFIKRKKQFKKHLSNQKELNGEKSKKDEVICYGCKKQGHIRTDCPLLKS
        S+  +WEPKVT IQEA++LK+LS+DEL+      E  + +    N+  + K   K K    K ++ + +  GE +  ++ + Y  +K     TDCPLLKS
Subjt:  SIILRWEPKVTAIQEARNLKSLSIDELMEVDSQDEDDLDEDDVANLSRKYKNFIKRKKQFKKHLSNQKELNGEKSKKDEVICYGCKKQGHIRTDCPLLKS

Query:  SKKSKKKAMKATWDDSDESESGSDNEEVANFCFMAHSDIEDEQNDEVTLEPLSYNELFEVFENMQNDLEKLSSKYVVLKKKFNALSSKNKSLLEEIACLK
        SKKSKKKAMKATWDDSDES S S+NEEVANFCFMAHSD EDEQ+DEV L+PLSY+ELFE FENMQN+LEKL SKYV+LK K N  +S+NKSL ++IACLK
Subjt:  SKKSKKKAMKATWDDSDESESGSDNEEVANFCFMAHSDIEDEQNDEVTLEPLSYNELFEVFENMQNDLEKLSSKYVVLKKKFNALSSKNKSLLEEIACLK

Query:  ENEHDVMHIDEMNISCDKHCVTPRISRRQGSVLQYQASSSATWTFVGESTTSFSDFQHLYRATISSLMPVQEDRQSSNVSLSFPAKSVSFGCVSSRRWCC
        +NEHDV                                                                                              
Subjt:  ENEHDVMHIDEMNISCDKHCVTPRISRRQGSVLQYQASSSATWTFVGESTTSFSDFQHLYRATISSLMPVQEDRQSSNVSLSFPAKSVSFGCVSSRRWCC

Query:  SVLGSIFRRSRLKSPASGTLSCALSVQDGVELMIRHVIRAPDGTGMFSCVSCSTVFYIHISGVLCWDRCSSARGCQIASSLLALTDRPTQRSSWHRLGIG
                                                                                                            
Subjt:  SVLGSIFRRSRLKSPASGTLSCALSVQDGVELMIRHVIRAPDGTGMFSCVSCSTVFYIHISGVLCWDRCSSARGCQIASSLLALTDRPTQRSSWHRLGIG

Query:  KHPRQNVSCTQLWPQVGSLHIAFDCDEKNALLDKVKFLEHDSCEKDNLIKLLKEKEFSVVQELDKAKESIKKLTICAQRLDKIIEVGKSFDDKRGLGYID
                                                     DNLIKLLK+ E   + ELDKAK+ IK+LTI AQRLDKIIE GK + DKRGLGYI+
Subjt:  KHPRQNVSCTQLWPQVGSLHIAFDCDEKNALLDKVKFLEHDSCEKDNLIKLLKEKEFSVVQELDKAKESIKKLTICAQRLDKIIEVGKSFDDKRGLGYID

Query:  ECSTPSSSKTIFVKASHNMPKVVSKHVKSNFVPICHYCGVVKLKYAHTTSPRRIFSQRAKFHNAPRNNFSKKGRVQQFAVCLKVSKKSKWYLDSGCSRHM
        EC+TPSSSKTIFVKAS NMPK+V                                        AP+             VCLK SKKSKWYLDS CSRHM
Subjt:  ECSTPSSSKTIFVKASHNMPKVVSKHVKSNFVPICHYCGVVKLKYAHTTSPRRIFSQRAKFHNAPRNNFSKKGRVQQFAVCLKVSKKSKWYLDSGCSRHM

Query:  TGDPSKFVTLSKKDGGLVTFGDNKKG
        TGD SKFVT SK DGG VTFGDNKKG
Subjt:  TGDPSKFVTLSKKDGGLVTFGDNKKG

A0A6J1DY46 uncharacterized protein LOC1110252591.7e-7436Show/hide
Query:  SIILRWEPKVTAIQEARNLKSLSIDELMEVDSQDEDDLDEDDVANLSRKYKNFIKRKKQFKKHLSNQKELNGEKSKKDEVICYGCKKQGHIRTDCPLLKS
        S+  +WEPKVTAIQEA++LK+LS+DEL+      E+ LDEDDVA LSRKYKNFIKRKKQFKK+ SN KE   E SKKDEVICY CKK GHIRTDCP LKS
Subjt:  SIILRWEPKVTAIQEARNLKSLSIDELMEVDSQDEDDLDEDDVANLSRKYKNFIKRKKQFKKHLSNQKELNGEKSKKDEVICYGCKKQGHIRTDCPLLKS

Query:  SKKSKKKAMKATWDDSDESESGSDNEEVANFCFMAHSDIEDEQNDEVTLEPLSYNELFEVFENMQNDLEKLSSKYVVLKKKFNALSSKNKSLLEEIACLK
        SKKSKKKAMKATWDDSDES + S+NEEVANFCFMAHSD EDE++DE+TL+PLSY+ELFE FENMQNDLEKL                             
Subjt:  SKKSKKKAMKATWDDSDESESGSDNEEVANFCFMAHSDIEDEQNDEVTLEPLSYNELFEVFENMQNDLEKLSSKYVVLKKKFNALSSKNKSLLEEIACLK

Query:  ENEHDVMHIDEMNISCDKHCVTPRISRRQGSVLQYQASSSATWTFVGESTTSFSDFQHLYRATISSLMPVQEDRQSSNVSLSFPAKSVSFGCVSSRRWCC
                                                                                                            
Subjt:  ENEHDVMHIDEMNISCDKHCVTPRISRRQGSVLQYQASSSATWTFVGESTTSFSDFQHLYRATISSLMPVQEDRQSSNVSLSFPAKSVSFGCVSSRRWCC

Query:  SVLGSIFRRSRLKSPASGTLSCALSVQDGVELMIRHVIRAPDGTGMFSCVSCSTVFYIHISGVLCWDRCSSARGCQIASSLLALTDRPTQRSSWHRLGIG
                                                                                                            
Subjt:  SVLGSIFRRSRLKSPASGTLSCALSVQDGVELMIRHVIRAPDGTGMFSCVSCSTVFYIHISGVLCWDRCSSARGCQIASSLLALTDRPTQRSSWHRLGIG

Query:  KHPRQNVSCTQLWPQVGSLHIAFDCDEKNALLDKVKFLEHDSCEKDNLIKLLKEKEFSVVQELDKAKESIKKLTICAQRLDKIIEVGKSFDDKRGLGYID
                                                                     ELDKAK+SIKKLTI AQRLDKIIE+GK + DKRGLGYID
Subjt:  KHPRQNVSCTQLWPQVGSLHIAFDCDEKNALLDKVKFLEHDSCEKDNLIKLLKEKEFSVVQELDKAKESIKKLTICAQRLDKIIEVGKSFDDKRGLGYID

Query:  ECSTPSSSKTIFVKASHNMPKVVSKHVKSNFVPICHYCGVVKLKYAHTTSPRRIFSQRAKFHNAPRNNFSKKGRVQQFAVCLKVSKKSKWYLDSGCSRHM
        ECSTPSSSK IFVKAS NMPK+V+  V                                                    VCLK SKK KWYLDSGCSR+M
Subjt:  ECSTPSSSKTIFVKASHNMPKVVSKHVKSNFVPICHYCGVVKLKYAHTTSPRRIFSQRAKFHNAPRNNFSKKGRVQQFAVCLKVSKKSKWYLDSGCSRHM

Query:  TGDPSKFVTLSKKDGGLVTFGDNKK
        TGD SKFVT SKKDGG VTFGD+KK
Subjt:  TGDPSKFVTLSKKDGGLVTFGDNKK

SwissProt top hitse value%identityAlignment
P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-945.0e-1543.43Show/hide
Query:  YAFVKVDDFSRFSWVLMLQHKDDALKSFSSFAKRVQNEKGILISKIRSDHGEEFDSNAFETFCEENGFSHNFSAPRTPQQNGVVERKNRTLQEFARSML
        Y    +DD SR  WV +L+ KD   + F  F   V+ E G  + ++RSD+G E+ S  FE +C  +G  H  + P TPQ NGV ER NRT+ E  RSML
Subjt:  YAFVKVDDFSRFSWVLMLQHKDDALKSFSSFAKRVQNEKGILISKIRSDHGEEFDSNAFETFCEENGFSHNFSAPRTPQQNGVVERKNRTLQEFARSML

Q40082 Xylose isomerase1.0e-2365.17Show/hide
Query:  SCHHELETARLNGLLGNIDANIGDPQIG------------------------GLAPGGFNFDAKLRRESTDVEDLFIAHISGMDTLARG
        SCHHELETAR+N +LGNIDAN GDPQ+G                        GLAPGGFNF AKLRRESTDVEDLFIAHISGMDT+ARG
Subjt:  SCHHELETARLNGLLGNIDANIGDPQIG------------------------GLAPGGFNFDAKLRRESTDVEDLFIAHISGMDTLARG

Q4UTU6 Xylose isomerase 12.5e-1450Show/hide
Query:  SCHHELETARLNGLLGNIDANIGDPQ------------------------IGGLAPGGFNFDAKLRRESTDVEDLFIAHISGMDTLARGL
        S  H+L+ A   GLLG+IDAN G+PQ                         GGLAPGG NFDAK+RRES+D +DLF+AHI GMD  ARGL
Subjt:  SCHHELETARLNGLLGNIDANIGDPQ------------------------IGGLAPGGFNFDAKLRRESTDVEDLFIAHISGMDTLARGL

Q8P9T9 Xylose isomerase 12.5e-1450Show/hide
Query:  SCHHELETARLNGLLGNIDANIGDPQ------------------------IGGLAPGGFNFDAKLRRESTDVEDLFIAHISGMDTLARGL
        S  H+L+ A   GLLG+IDAN G+PQ                         GGLAPGG NFDAK+RRES+D +DLF+AHI GMD  ARGL
Subjt:  SCHHELETARLNGLLGNIDANIGDPQ------------------------IGGLAPGGFNFDAKLRRESTDVEDLFIAHISGMDTLARGL

Q9FKK7 Xylose isomerase2.6e-2465.56Show/hide
Query:  SCHHELETARLNGLLGNIDANIGDPQI------------------------GGLAPGGFNFDAKLRRESTDVEDLFIAHISGMDTLARGL
        +CHHELETAR+NGLLGNIDAN GD Q                         GG+APGGFNFDAKLRRESTDVEDLFIAHISGMDT+ARGL
Subjt:  SCHHELETARLNGLLGNIDANIGDPQI------------------------GGLAPGGFNFDAKLRRESTDVEDLFIAHISGMDTLARGL

Arabidopsis top hitse value%identityAlignment
AT5G57655.2 xylose isomerase family protein1.9e-2565.56Show/hide
Query:  SCHHELETARLNGLLGNIDANIGDPQI------------------------GGLAPGGFNFDAKLRRESTDVEDLFIAHISGMDTLARGL
        +CHHELETAR+NGLLGNIDAN GD Q                         GG+APGGFNFDAKLRRESTDVEDLFIAHISGMDT+ARGL
Subjt:  SCHHELETARLNGLLGNIDANIGDPQI------------------------GGLAPGGFNFDAKLRRESTDVEDLFIAHISGMDTLARGL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAATCAAAACCTCGCAAGTCTCCCCTTCAGACGCAAACTTTTTGTCGATTGGTTTTCTTCTACGTGGGTTTTTGTCGATTGGCTGCCCTTCAGACACAAGCTTCTAT
TATCTTAAGGTGGGAGCCTAAAGTCACCGCCATTCAAGAGGCAAGGAATCTCAAATCTCTCTCCATTGATGAACTCATGGAAGTTGACTCCCAAGATGAAGATGACCTTG
ATGAAGATGATGTTGCAAATCTTTCACGTAAGTATAAGAACTTCATCAAGAGAAAGAAGCAATTCAAGAAGCATCTCTCCAACCAAAAAGAGTTAAATGGTGAAAAGAGC
AAAAAGGATGAGGTAATATGCTATGGATGCAAGAAGCAGGGTCATATTAGAACCGATTGTCCTCTTCTCAAATCATCCAAGAAATCCAAGAAGAAAGCAATGAAGGCTAC
TTGGGATGATAGCGATGAAAGTGAAAGTGGGAGTGATAATGAAGAAGTGGCCAACTTTTGCTTCATGGCTCATAGTGACATAGAGGATGAACAAAATGATGAGGTTACTC
TAGAACCTCTTTCTTATAATGAATTGTTTGAAGTTTTTGAAAATATGCAAAATGATTTAGAAAAACTTAGTTCTAAGTATGTTGTGCTTAAAAAGAAATTCAATGCTTTA
TCTAGTAAAAACAAGTCTCTACTTGAAGAAATTGCTTGCTTGAAAGAGAATGAGCATGATGTTATGCATATTGATGAAATGAATATCTCTTGTGACAAGCATTGTGTAAC
GCCAAGAATTTCTCGACGGCAAGGCAGTGTGCTACAGTATCAGGCTTCTTCTTCGGCGACCTGGACGTTTGTGGGTGAATCAACGACGTCTTTCAGCGATTTTCAGCACC
TCTATCGCGCGACAATAAGTTCTCTGATGCCAGTGCAAGAAGATCGACAGTCCAGCAACGTTTCCCTCTCTTTTCCGGCGAAATCAGTGAGTTTTGGTTGCGTGAGTAGC
AGGCGGTGGTGTTGCAGCGTGTTGGGGTCCATTTTCCGGCGATCTCGACTCAAGAGCCCAGCTTCAGGCACATTATCGTGCGCTCTCTCAGTTCAGGACGGTGTTGAGCT
TATGATCAGGCACGTTATTCGTGCGCCAGATGGCACAGGCATGTTTTCATGCGTCAGTTGCTCCACCGTATTTTATATTCATATATCAGGGGTGTTGTGTTGGGATAGGT
GTAGTTCTGCTAGGGGTTGCCAAATAGCCTCTAGTTTACTTGCCTTAACCGACCGACCCACACAACGCTCTAGTTGGCACAGGTTGGGGATAGGTAAACACCCTAGGCAG
AACGTCTCATGTACTCAGTTATGGCCCCAGGTCGGGTCGTTACACATTGCTTTTGATTGTGATGAGAAAAATGCTTTGCTTGACAAAGTTAAATTTCTTGAGCATGATAG
TTGTGAAAAAGATAATTTGATTAAATTACTCAAAGAAAAGGAATTTAGTGTTGTGCAAGAACTTGATAAGGCTAAAGAATCTATTAAAAAGTTGACAATATGTGCTCAAA
GATTAGACAAGATAATTGAAGTAGGCAAGTCTTTTGATGATAAAAGAGGTTTAGGCTATATTGATGAATGTTCTACTCCTTCAAGTTCTAAAACTATCTTTGTTAAAGCA
TCTCATAATATGCCTAAGGTTGTGTCTAAGCATGTTAAATCTAACTTTGTGCCTATATGTCATTATTGTGGTGTTGTTAAATTGAAATATGCTCATACTACTTCTCCAAG
AAGAATTTTTTCTCAAAGGGCAAAGTTTCACAATGCTCCAAGGAATAATTTCTCCAAGAAAGGTAGAGTGCAACAATTTGCTGTTTGTTTAAAAGTCTCCAAGAAAAGCA
AGTGGTACTTGGATAGTGGTTGCTCGAGGCACATGACGGGAGACCCATCCAAGTTTGTCACTCTCTCCAAAAAGGATGGAGGTCTTGTAACTTTTGGTGACAACAAGAAA
GGCCTCTACAACTATTACACATGGACTTATTTGGCCCTTCTAGAATTGCTAGTTTTTTGGAGGGACTATTATGCCTTTGTGAAAGTTGATGATTTTTCAAGATTTAGTTG
GGTTTTGATGCTGCAACATAAGGATGATGCTTTGAAAAGTTTTTCTAGTTTTGCAAAAAGAGTTCAAAATGAAAAGGGTATTTTGATTTCTAAAATTAGGAGTGATCATG
GAGAAGAATTTGATAGCAATGCTTTTGAAACTTTTTGTGAAGAAAATGGTTTTTCTCATAATTTCTCCGCTCCAAGAACTCCTCAACAAAATGGTGTAGTTGAAAGGAAA
AATCGTACTTTGCAAGAATTTGCTCGATCAATGTTGCATGACTTAAATCAGAACACAACTCGAAACCCACACAAGTCGAATCGCCGGCGTTACAGACGGATCGATGCCTT
TGGAAGGAGCAACAACGAAATCGAAGGGAAGATTTCGATGATTTCTACTGGAAACCCACATAGCTCGAAACCCATCGAAAAACACAAATCGAAACCCACACCGCCGACCA
CACAGCTCGAAACCCACACTTGCTGGCCACAACTGAGATTGAGACCCACGCCAGTGTTTTTCAGCTCTTCCGCCGCCGTCAATCGCCTTCACGGACGGATAATGGCAATC
CTCATGTATGTGCATTTCAGTTGCCACCATGAACTTGAAACCGCAAGGCTCAATGGGCTTCTTGGGAACATCGATGCTAACATTGGCGATCCTCAAATTGGAGGTTTGGC
ACCTGGTGGATTCAACTTTGATGCAAAACTGCGGAGAGAGAGCACAGATGTTGAAGATTTGTTTATAGCTCACATTAGTGGAATGGATACGCTTGCCCGTGGACTTTACC
GGCGTGGATTGAGACCCACATTTGTTTCTGCACCTGGTGGATTGAGACTCACGCCGTGTTCTTCTCATCCACGCAGGCGTCCAGATCTGTGCTCACAGGCCGATAAATCC
GTGCTCACGCGAATAAATCGTCCCAGTCGGGAGTTTCCGACGGCGGAGAAGTGGGTAAACGAAGGAGGAGCCCTCCTCCTTCATTTCCGGCCGGAAACGAGGAGTCCCTC
CTTCTAA
mRNA sequenceShow/hide mRNA sequence
ATGGAATCAAAACCTCGCAAGTCTCCCCTTCAGACGCAAACTTTTTGTCGATTGGTTTTCTTCTACGTGGGTTTTTGTCGATTGGCTGCCCTTCAGACACAAGCTTCTAT
TATCTTAAGGTGGGAGCCTAAAGTCACCGCCATTCAAGAGGCAAGGAATCTCAAATCTCTCTCCATTGATGAACTCATGGAAGTTGACTCCCAAGATGAAGATGACCTTG
ATGAAGATGATGTTGCAAATCTTTCACGTAAGTATAAGAACTTCATCAAGAGAAAGAAGCAATTCAAGAAGCATCTCTCCAACCAAAAAGAGTTAAATGGTGAAAAGAGC
AAAAAGGATGAGGTAATATGCTATGGATGCAAGAAGCAGGGTCATATTAGAACCGATTGTCCTCTTCTCAAATCATCCAAGAAATCCAAGAAGAAAGCAATGAAGGCTAC
TTGGGATGATAGCGATGAAAGTGAAAGTGGGAGTGATAATGAAGAAGTGGCCAACTTTTGCTTCATGGCTCATAGTGACATAGAGGATGAACAAAATGATGAGGTTACTC
TAGAACCTCTTTCTTATAATGAATTGTTTGAAGTTTTTGAAAATATGCAAAATGATTTAGAAAAACTTAGTTCTAAGTATGTTGTGCTTAAAAAGAAATTCAATGCTTTA
TCTAGTAAAAACAAGTCTCTACTTGAAGAAATTGCTTGCTTGAAAGAGAATGAGCATGATGTTATGCATATTGATGAAATGAATATCTCTTGTGACAAGCATTGTGTAAC
GCCAAGAATTTCTCGACGGCAAGGCAGTGTGCTACAGTATCAGGCTTCTTCTTCGGCGACCTGGACGTTTGTGGGTGAATCAACGACGTCTTTCAGCGATTTTCAGCACC
TCTATCGCGCGACAATAAGTTCTCTGATGCCAGTGCAAGAAGATCGACAGTCCAGCAACGTTTCCCTCTCTTTTCCGGCGAAATCAGTGAGTTTTGGTTGCGTGAGTAGC
AGGCGGTGGTGTTGCAGCGTGTTGGGGTCCATTTTCCGGCGATCTCGACTCAAGAGCCCAGCTTCAGGCACATTATCGTGCGCTCTCTCAGTTCAGGACGGTGTTGAGCT
TATGATCAGGCACGTTATTCGTGCGCCAGATGGCACAGGCATGTTTTCATGCGTCAGTTGCTCCACCGTATTTTATATTCATATATCAGGGGTGTTGTGTTGGGATAGGT
GTAGTTCTGCTAGGGGTTGCCAAATAGCCTCTAGTTTACTTGCCTTAACCGACCGACCCACACAACGCTCTAGTTGGCACAGGTTGGGGATAGGTAAACACCCTAGGCAG
AACGTCTCATGTACTCAGTTATGGCCCCAGGTCGGGTCGTTACACATTGCTTTTGATTGTGATGAGAAAAATGCTTTGCTTGACAAAGTTAAATTTCTTGAGCATGATAG
TTGTGAAAAAGATAATTTGATTAAATTACTCAAAGAAAAGGAATTTAGTGTTGTGCAAGAACTTGATAAGGCTAAAGAATCTATTAAAAAGTTGACAATATGTGCTCAAA
GATTAGACAAGATAATTGAAGTAGGCAAGTCTTTTGATGATAAAAGAGGTTTAGGCTATATTGATGAATGTTCTACTCCTTCAAGTTCTAAAACTATCTTTGTTAAAGCA
TCTCATAATATGCCTAAGGTTGTGTCTAAGCATGTTAAATCTAACTTTGTGCCTATATGTCATTATTGTGGTGTTGTTAAATTGAAATATGCTCATACTACTTCTCCAAG
AAGAATTTTTTCTCAAAGGGCAAAGTTTCACAATGCTCCAAGGAATAATTTCTCCAAGAAAGGTAGAGTGCAACAATTTGCTGTTTGTTTAAAAGTCTCCAAGAAAAGCA
AGTGGTACTTGGATAGTGGTTGCTCGAGGCACATGACGGGAGACCCATCCAAGTTTGTCACTCTCTCCAAAAAGGATGGAGGTCTTGTAACTTTTGGTGACAACAAGAAA
GGCCTCTACAACTATTACACATGGACTTATTTGGCCCTTCTAGAATTGCTAGTTTTTTGGAGGGACTATTATGCCTTTGTGAAAGTTGATGATTTTTCAAGATTTAGTTG
GGTTTTGATGCTGCAACATAAGGATGATGCTTTGAAAAGTTTTTCTAGTTTTGCAAAAAGAGTTCAAAATGAAAAGGGTATTTTGATTTCTAAAATTAGGAGTGATCATG
GAGAAGAATTTGATAGCAATGCTTTTGAAACTTTTTGTGAAGAAAATGGTTTTTCTCATAATTTCTCCGCTCCAAGAACTCCTCAACAAAATGGTGTAGTTGAAAGGAAA
AATCGTACTTTGCAAGAATTTGCTCGATCAATGTTGCATGACTTAAATCAGAACACAACTCGAAACCCACACAAGTCGAATCGCCGGCGTTACAGACGGATCGATGCCTT
TGGAAGGAGCAACAACGAAATCGAAGGGAAGATTTCGATGATTTCTACTGGAAACCCACATAGCTCGAAACCCATCGAAAAACACAAATCGAAACCCACACCGCCGACCA
CACAGCTCGAAACCCACACTTGCTGGCCACAACTGAGATTGAGACCCACGCCAGTGTTTTTCAGCTCTTCCGCCGCCGTCAATCGCCTTCACGGACGGATAATGGCAATC
CTCATGTATGTGCATTTCAGTTGCCACCATGAACTTGAAACCGCAAGGCTCAATGGGCTTCTTGGGAACATCGATGCTAACATTGGCGATCCTCAAATTGGAGGTTTGGC
ACCTGGTGGATTCAACTTTGATGCAAAACTGCGGAGAGAGAGCACAGATGTTGAAGATTTGTTTATAGCTCACATTAGTGGAATGGATACGCTTGCCCGTGGACTTTACC
GGCGTGGATTGAGACCCACATTTGTTTCTGCACCTGGTGGATTGAGACTCACGCCGTGTTCTTCTCATCCACGCAGGCGTCCAGATCTGTGCTCACAGGCCGATAAATCC
GTGCTCACGCGAATAAATCGTCCCAGTCGGGAGTTTCCGACGGCGGAGAAGTGGGTAAACGAAGGAGGAGCCCTCCTCCTTCATTTCCGGCCGGAAACGAGGAGTCCCTC
CTTCTAA
Protein sequenceShow/hide protein sequence
MESKPRKSPLQTQTFCRLVFFYVGFCRLAALQTQASIILRWEPKVTAIQEARNLKSLSIDELMEVDSQDEDDLDEDDVANLSRKYKNFIKRKKQFKKHLSNQKELNGEKS
KKDEVICYGCKKQGHIRTDCPLLKSSKKSKKKAMKATWDDSDESESGSDNEEVANFCFMAHSDIEDEQNDEVTLEPLSYNELFEVFENMQNDLEKLSSKYVVLKKKFNAL
SSKNKSLLEEIACLKENEHDVMHIDEMNISCDKHCVTPRISRRQGSVLQYQASSSATWTFVGESTTSFSDFQHLYRATISSLMPVQEDRQSSNVSLSFPAKSVSFGCVSS
RRWCCSVLGSIFRRSRLKSPASGTLSCALSVQDGVELMIRHVIRAPDGTGMFSCVSCSTVFYIHISGVLCWDRCSSARGCQIASSLLALTDRPTQRSSWHRLGIGKHPRQ
NVSCTQLWPQVGSLHIAFDCDEKNALLDKVKFLEHDSCEKDNLIKLLKEKEFSVVQELDKAKESIKKLTICAQRLDKIIEVGKSFDDKRGLGYIDECSTPSSSKTIFVKA
SHNMPKVVSKHVKSNFVPICHYCGVVKLKYAHTTSPRRIFSQRAKFHNAPRNNFSKKGRVQQFAVCLKVSKKSKWYLDSGCSRHMTGDPSKFVTLSKKDGGLVTFGDNKK
GLYNYYTWTYLALLELLVFWRDYYAFVKVDDFSRFSWVLMLQHKDDALKSFSSFAKRVQNEKGILISKIRSDHGEEFDSNAFETFCEENGFSHNFSAPRTPQQNGVVERK
NRTLQEFARSMLHDLNQNTTRNPHKSNRRRYRRIDAFGRSNNEIEGKISMISTGNPHSSKPIEKHKSKPTPPTTQLETHTCWPQLRLRPTPVFFSSSAAVNRLHGRIMAI
LMYVHFSCHHELETARLNGLLGNIDANIGDPQIGGLAPGGFNFDAKLRRESTDVEDLFIAHISGMDTLARGLYRRGLRPTFVSAPGGLRLTPCSSHPRRRPDLCSQADKS
VLTRINRPSREFPTAEKWVNEGGALLLHFRPETRSPSF