CuGenDBv2

Lsi05G018290 (gene) of Bottle gourd (USVL1VR-Ls) v1 genome

Gene ID: Lsi05G018290
Organism: Lagenaria siceraria USVL1VR-Ls (Bottle gourd (USVL1VR-Ls) v1)
Description: protein disulfide isomerase pTAC5, chloroplastic
Genome location: chr05:25559297..25571269
RNA-Seq Expression: Lsi05G018290
Synteny: Lsi05G018290
Gene Ontology terms:
GO:0006468 - protein phosphorylation (biological process)
GO:0009658 - chloroplast organization (biological process)
GO:0005886 - plasma membrane (cellular component)
GO:0009507 - chloroplast (cellular component)
GO:0016021 - integral component of membrane (cellular component)
GO:0003756 - protein disulfide isomerase activity (molecular function)
GO:0004672 - protein kinase activity (molecular function)
GO:0005524 - ATP binding (molecular function)
InterPro domains:
IPR002477 - Peptidoglycan binding-like
IPR036365 - PGBD-like superfamily
IPR036366 - PGBD superfamily
IPR036410 - Heat shock protein DnaJ, cysteine-rich domain superfamily
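
The genome location field in this record has the form chromosome:start..end. As a minimal sketch, the string could be split into its parts as shown here. The names GenomeLocation and parse_location are hypothetical helpers introduced for illustration, and the length calculation assumes 1-based, inclusive coordinates, which the page does not state.

```python
import re
from typing import NamedTuple


class GenomeLocation(NamedTuple):
    """Hypothetical container for a 'chrom:start..end' location."""
    chrom: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumption: coordinates are 1-based and inclusive.
        return self.end - self.start + 1


def parse_location(text: str) -> GenomeLocation:
    """Parse a location string such as 'chr05:25559297..25571269'."""
    match = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognised location: {text!r}")
    chrom, start, end = match.groups()
    return GenomeLocation(chrom, int(start), int(end))


loc = parse_location("chr05:25559297..25571269")
print(loc.chrom, loc.start, loc.end, loc.length)
# chr05 25559297 25571269 11973
```

Under the inclusive-coordinate assumption, the gene region spans 11973 bp.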


Homology
GenBank top hits (e value, % identity, alignment)
KAA0049159.1 golgin subfamily A member 6-like protein 6 isoform X2 [Cucumis melo var. makuwa] (e value: 7.4e-184; identity: 85.22%)
Query:  LQFLRCISMSSSITLPLNPSLHLTPRLFSLSVFSSPKLSSSRSLPNSFICCSLNPASNDREELRWLREEQRWFREEERWIREEQRWARERQSLLQEIAEL
        L+FL CISMSSSITLPLNPSL LTPR  SLSVFSS KLSSSRSL +S IC SLNPASNDREELRW+REEQRWFREEERWIREEQRWARERQ LLQEIAEL
Subjt:  LQFLRCISMSSSITLPLNPSLHLTPRLFSLSVFSSPKLSSSRSLPNSFICCSLNPASNDREELRWLREEQRWFREEERWIREEQRWARERQSLLQEIAEL

Query:  KLQIQALERRNSVQGGTVSVSETIANIAGLLQVLKEKNLIAESGPTVSRILLDESTREEDVEVEKKSIVEEVVTFPEESKAEKEVKKERKSLRIGSEGAE
        KLQIQALERRNSVQGGT+SVSETIANIAGLLQVLKEKNLIAESGPTVSRILLDES+REEDVE+EKK+IVEEVV F EESKAEKEVKKERKSLR+GSEGAE
Subjt:  KLQIQALERRNSVQGGTVSVSETIANIAGLLQVLKEKNLIAESGPTVSRILLDESTREEDVEVEKKSIVEEVVTFPEESKAEKEVKKERKSLRIGSEGAE

Query:  VLAMQEALLMLGFYSGEEDMEFSSFSSGTERAVKTWQAASGFREDGIMTADLLEILYKEKITESVGSDAKTDEKGNIPTDQK------------------
        VLAMQEALL LGFY GEEDMEFSSFSSGTERAVKTWQ+ASGFREDGIMT +LLEIL+KEK+TESVGSDAKTDEKGNIPTDQ+                  
Subjt:  VLAMQEALLMLGFYSGEEDMEFSSFSSGTERAVKTWQAASGFREDGIMTADLLEILYKEKITESVGSDAKTDEKGNIPTDQK------------------

Query:  KTVVKEGSVSFDVSQQRVFLIGENRWEDPTRLHSSNGKASDGKTKDISTKCLTCRGEGRLLCSECDGSGEPNIEPQFLEWVDEGTKCPYCEGVGYTICDA
        KT+VKEGS   D+S+QRVFLIGENRWEDPTRLHSSNGKASDGKTK IST CLTCRGEGRLLC+ECDG+GEPNIEPQFLEWVDEG KCPYCEGVGY  CD 
Subjt:  KTVVKEGSVSFDVSQQRVFLIGENRWEDPTRLHSSNGKASDGKTKDISTKCLTCRGEGRLLCSECDGSGEPNIEPQFLEWVDEGTKCPYCEGVGYTICDA

Query:  CEGRTV
        CEG+TV
Subjt:  CEGRTV

XP_008438372.1 PREDICTED: uncharacterized protein LOC103483490 [Cucumis melo] (e value: 4.0e-182; identity: 85.68%)
Query:  MSSSITLPLNPSLHLTPRLFSLSVFSSPKLSSSRSLPNSFICCSLNPASNDREELRWLREEQRWFREEERWIREEQRWARERQSLLQEIAELKLQIQALE
        MSSSITLPLNPSL LTPR  SLSVFSSPK SSSRSL NS IC SLNPASNDREELRW+REEQRWFREEERWIREEQRWARERQ LLQEIAELKLQIQALE
Subjt:  MSSSITLPLNPSLHLTPRLFSLSVFSSPKLSSSRSLPNSFICCSLNPASNDREELRWLREEQRWFREEERWIREEQRWARERQSLLQEIAELKLQIQALE

Query:  RRNSVQGGTVSVSETIANIAGLLQVLKEKNLIAESGPTVSRILLDESTREEDVEVEKKSIVEEVVTFPEESKAEKEVKKERKSLRIGSEGAEVLAMQEAL
        RRNSVQGGT+SVSETIANIAGLLQVLKEKNLIAESGPTVSRILLDES+REEDVE+EKK+IVEEVV F EESKAEKEVKKERKSLR+GSEGAEVLAMQEAL
Subjt:  RRNSVQGGTVSVSETIANIAGLLQVLKEKNLIAESGPTVSRILLDESTREEDVEVEKKSIVEEVVTFPEESKAEKEVKKERKSLRIGSEGAEVLAMQEAL

Query:  LMLGFYSGEEDMEFSSFSSGTERAVKTWQAASGFREDGIMTADLLEILYKEKITESVGSDAKTDEKGNIPTDQK------------------KTVVKEGS
        L LGFY GEEDMEFSSFSSGTERAVKTWQ+ASGFREDGIMT +LLEIL+KEK+TESVGSDAKTDEKGNIPTDQ+                  KT+VKEGS
Subjt:  LMLGFYSGEEDMEFSSFSSGTERAVKTWQAASGFREDGIMTADLLEILYKEKITESVGSDAKTDEKGNIPTDQK------------------KTVVKEGS

Query:  VSFDVSQQRVFLIGENRWEDPTRLHSSNGKASDGKTKDISTKCLTCRGEGRLLCSECDGSGEPNIEPQFLEWVDEGTKCPYCEGVGYTICDACEGRTV
           D+S+QRVFLIGENRWEDPTRLHSSNGKASDGKTK IST CLTCRGEGRLLC+ECDG+GEPNIEPQFLEWVDEG KCPYCEGVGY  CD CEG+TV
Subjt:  VSFDVSQQRVFLIGENRWEDPTRLHSSNGKASDGKTKDISTKCLTCRGEGRLLCSECDGSGEPNIEPQFLEWVDEGTKCPYCEGVGYTICDACEGRTV

XP_011650882.1 protein disulfide isomerase pTAC5, chloroplastic [Cucumis sativus] (e value: 5.1e-185; identity: 86.72%)
Query:  MSSSITLPLNPSLHLTPRLFSLSVFSSPKLSSSRSLPNSFICCSLNPASNDREELRWLREEQRWFREEERWIREEQRWARERQSLLQEIAELKLQIQALE
        MSSSITLPLNPSL LTPR FSLS+FSSPKLSSSRSL NS IC SLNP+SNDREELRW+REEQRWFREEERWIREEQRWARERQSLLQEIAELKLQIQALE
Subjt:  MSSSITLPLNPSLHLTPRLFSLSVFSSPKLSSSRSLPNSFICCSLNPASNDREELRWLREEQRWFREEERWIREEQRWARERQSLLQEIAELKLQIQALE

Query:  RRNSVQGGTVSVSETIANIAGLLQVLKEKNLIAESGPTVSRILLDESTREEDVEVEKKSIVEEVVTFPEESKAEKEVKKERKSLRIGSEGAEVLAMQEAL
        RRNSVQGGT+SVSETIANIAGLLQVLKEKNLIAESGPTVSRILLDES+REEDVE+EKK+IVEEVV F EESKAEKEVKKERKSLR GSEGAEVLAMQEAL
Subjt:  RRNSVQGGTVSVSETIANIAGLLQVLKEKNLIAESGPTVSRILLDESTREEDVEVEKKSIVEEVVTFPEESKAEKEVKKERKSLRIGSEGAEVLAMQEAL

Query:  LMLGFYSGEEDMEFSSFSSGTERAVKTWQAASGFREDGIMTADLLEILYKEKITESVGSDAKTDEKGNIPTDQK------------------KTVVKEGS
        + LGFYSGEEDMEFSSFSSGTERAVKTWQAASGFREDGIMT +LL+IL+KE++TESVGSDAKTDEKGNIPTDQ+                  KT+VKEGS
Subjt:  LMLGFYSGEEDMEFSSFSSGTERAVKTWQAASGFREDGIMTADLLEILYKEKITESVGSDAKTDEKGNIPTDQK------------------KTVVKEGS

Query:  VSFDVSQQRVFLIGENRWEDPTRLHSSNGKASDGKTKDISTKCLTCRGEGRLLCSECDGSGEPNIEPQFLEWVDEGTKCPYCEGVGYTICDACEGRTVT
         SFDVSQQRVFLIGENRWEDPTRLHSSNGKASDGKTK IST CLTCRGEGRLLC+ECDG+GEPNIEPQFLEWV EGTKCPYCEGVGY  CD CEG+TVT
Subjt:  VSFDVSQQRVFLIGENRWEDPTRLHSSNGKASDGKTKDISTKCLTCRGEGRLLCSECDGSGEPNIEPQFLEWVDEGTKCPYCEGVGYTICDACEGRTVT

XP_038876037.1 protein disulfide isomerase pTAC5, chloroplastic isoform X1 [Benincasa hispida] (e value: 1.7e-188; identity: 88.06%)
Query:  MSSSITLPLNPSLHLTPRLF---SLSVFSSPKLSSSRSLPNSFICCSLNPASNDREELRWLREEQRWFREEERWIREEQRWARERQSLLQEIAELKLQIQ
        MSSSITLPLNPSLHLTPR F   SLS+FSSP LSSSRSLPNSF+C SLNPASNDREELRWLREEQRWFREEERWIREEQRWARERQSLLQEI ELKLQIQ
Subjt:  MSSSITLPLNPSLHLTPRLF---SLSVFSSPKLSSSRSLPNSFICCSLNPASNDREELRWLREEQRWFREEERWIREEQRWARERQSLLQEIAELKLQIQ

Query:  ALERRNSVQGGTVSVSETIANIAGLLQVLKEKNLIAESGPTVSRILLDESTREEDVEVEKKSIVEEVVTFPEESKAEKEVKKERKSLRIGSEGAEVLAMQ
        ALERRNSVQGGTVSVS+TIANIAGLLQVLKEKNLIAESGPT  RILLDESTREEDVE+EKK+IVEEVV F EESKAEKEVKKERKSLRIGSEGAEVLAMQ
Subjt:  ALERRNSVQGGTVSVSETIANIAGLLQVLKEKNLIAESGPTVSRILLDESTREEDVEVEKKSIVEEVVTFPEESKAEKEVKKERKSLRIGSEGAEVLAMQ

Query:  EALLMLGFYSGEEDMEFSSFSSGTERAVKTWQAASGFREDGIMTADLLEILYKEKITESVGSDAKTDEKGNIPTDQK------------------KTVVK
        EALL LGFYSGEEDMEFSSFSSGTERAVKTWQAASGFREDGIMT DLLEILYKEK+TES GSDAKTDEKG IPTDQ+                  KTV+K
Subjt:  EALLMLGFYSGEEDMEFSSFSSGTERAVKTWQAASGFREDGIMTADLLEILYKEKITESVGSDAKTDEKGNIPTDQK------------------KTVVK

Query:  EGSVSFDVSQQRVFLIGENRWEDPTRLHSSNGKASDGKTKDISTKCLTCRGEGRLLCSECDGSGEPNIEPQFLEWVDEGTKCPYCEGVGYTICDACEGRT
        EGS SFDVSQQRVFLIGENRWEDP RLHSSNGKASDGKTKDISTKCLTCRGEGRLLCSECDGSGEPNIEPQFLEWVDEG KCPYCEG GY ICD CEG+T
Subjt:  EGSVSFDVSQQRVFLIGENRWEDPTRLHSSNGKASDGKTKDISTKCLTCRGEGRLLCSECDGSGEPNIEPQFLEWVDEGTKCPYCEGVGYTICDACEGRT

Query:  VT
        VT
Subjt:  VT

XP_038880778.1 uncharacterized protein LOC120072460 [Benincasa hispida] (e value: 7.4e-184; identity: 91.67%)
Query:  MEFCNSFIFFTLLIILPLARCEDTGSVLFVDSSSHQYLRSHSPDDGFEVSSMSLPEVGAAVSVLLGFAPPSTLSASGSSKLNGILMPNPLDRPRSVFMLE
        MEFCNSFIFFTLLIILPLARCEDTGSVLFVDSSSHQYLRSHSPDDGFE SSMSLPEVGAAVSVLLGFAPPSTLSASGSSKLNGILMPNPLDRPRSVFMLE
Subjt:  MEFCNSFIFFTLLIILPLARCEDTGSVLFVDSSSHQYLRSHSPDDGFEVSSMSLPEVGAAVSVLLGFAPPSTLSASGSSKLNGILMPNPLDRPRSVFMLE

Query:  IKGEYGLCNPEILSLEGGMSSNVLTSKVHVGSESADIQLPGEDEVSVVPLNEPLPDYTDEDVREFASFIGGSYIADASKTLSGEFTVRLTDDVKINLHMS
        IKGEY   +PEILSLE GMSSNVLTSKV+VGSESADIQLPGEDEVSVVPLNEPL DYTD+DV EFASFIGGSY+ADAS+TL+GEFTVRLTDD  IN H+S
Subjt:  IKGEYGLCNPEILSLEGGMSSNVLTSKVHVGSESADIQLPGEDEVSVVPLNEPLPDYTDEDVREFASFIGGSYIADASKTLSGEFTVRLTDDVKINLHMS

Query:  KTGDREFIGSLLCLFHNIKRAIHIHEDLSQNVQSPSELITGSFNSIKAFQDESDSEGDADHRSRLFVVALSKIFHLLQKAYDGQIVGVVFFSGSSSPKAE
        KTGDREFIGSLLCLFHNIKRAIHIHEDLSQNVQSPSELITGSFNSIKAFQDESDSEGDAD+RSRLF+VALSKIFHLLQKAYDG+IVGVVFFSGSSS KAE
Subjt:  KTGDREFIGSLLCLFHNIKRAIHIHEDLSQNVQSPSELITGSFNSIKAFQDESDSEGDADHRSRLFVVALSKIFHLLQKAYDGQIVGVVFFSGSSSPKAE

Query:  KGLNVMFNPRLTPRWLVEEVKVNTTIQEVILVRTTLAWLTGIILLIATLMGVNFAPVLHFAPSSLNILRFSP
        KGLNVMFN  LTPRWLVE+VKVNTTI EVILVRTTLAW+TGIILLIATLMGVNFAP+LH+APSSL++L FSP
Subjt:  KGLNVMFNPRLTPRWLVEEVKVNTTIQEVILVRTTLAWLTGIILLIATLMGVNFAPVLHFAPSSLNILRFSP

TrEMBL top hits (e value, % identity, alignment)
A0A0A0L7Z4 PG_binding_1 domain-containing protein (e value: 2.5e-185; identity: 86.72%)
Query:  MSSSITLPLNPSLHLTPRLFSLSVFSSPKLSSSRSLPNSFICCSLNPASNDREELRWLREEQRWFREEERWIREEQRWARERQSLLQEIAELKLQIQALE
        MSSSITLPLNPSL LTPR FSLS+FSSPKLSSSRSL NS IC SLNP+SNDREELRW+REEQRWFREEERWIREEQRWARERQSLLQEIAELKLQIQALE
Subjt:  MSSSITLPLNPSLHLTPRLFSLSVFSSPKLSSSRSLPNSFICCSLNPASNDREELRWLREEQRWFREEERWIREEQRWARERQSLLQEIAELKLQIQALE

Query:  RRNSVQGGTVSVSETIANIAGLLQVLKEKNLIAESGPTVSRILLDESTREEDVEVEKKSIVEEVVTFPEESKAEKEVKKERKSLRIGSEGAEVLAMQEAL
        RRNSVQGGT+SVSETIANIAGLLQVLKEKNLIAESGPTVSRILLDES+REEDVE+EKK+IVEEVV F EESKAEKEVKKERKSLR GSEGAEVLAMQEAL
Subjt:  RRNSVQGGTVSVSETIANIAGLLQVLKEKNLIAESGPTVSRILLDESTREEDVEVEKKSIVEEVVTFPEESKAEKEVKKERKSLRIGSEGAEVLAMQEAL

Query:  LMLGFYSGEEDMEFSSFSSGTERAVKTWQAASGFREDGIMTADLLEILYKEKITESVGSDAKTDEKGNIPTDQK------------------KTVVKEGS
        + LGFYSGEEDMEFSSFSSGTERAVKTWQAASGFREDGIMT +LL+IL+KE++TESVGSDAKTDEKGNIPTDQ+                  KT+VKEGS
Subjt:  LMLGFYSGEEDMEFSSFSSGTERAVKTWQAASGFREDGIMTADLLEILYKEKITESVGSDAKTDEKGNIPTDQK------------------KTVVKEGS

Query:  VSFDVSQQRVFLIGENRWEDPTRLHSSNGKASDGKTKDISTKCLTCRGEGRLLCSECDGSGEPNIEPQFLEWVDEGTKCPYCEGVGYTICDACEGRTVT
         SFDVSQQRVFLIGENRWEDPTRLHSSNGKASDGKTK IST CLTCRGEGRLLC+ECDG+GEPNIEPQFLEWV EGTKCPYCEGVGY  CD CEG+TVT
Subjt:  VSFDVSQQRVFLIGENRWEDPTRLHSSNGKASDGKTKDISTKCLTCRGEGRLLCSECDGSGEPNIEPQFLEWVDEGTKCPYCEGVGYTICDACEGRTVT

A0A1S3AWV2 uncharacterized protein LOC103483490 (e value: 2.0e-182; identity: 85.68%)
Query:  MSSSITLPLNPSLHLTPRLFSLSVFSSPKLSSSRSLPNSFICCSLNPASNDREELRWLREEQRWFREEERWIREEQRWARERQSLLQEIAELKLQIQALE
        MSSSITLPLNPSL LTPR  SLSVFSSPK SSSRSL NS IC SLNPASNDREELRW+REEQRWFREEERWIREEQRWARERQ LLQEIAELKLQIQALE
Subjt:  MSSSITLPLNPSLHLTPRLFSLSVFSSPKLSSSRSLPNSFICCSLNPASNDREELRWLREEQRWFREEERWIREEQRWARERQSLLQEIAELKLQIQALE

Query:  RRNSVQGGTVSVSETIANIAGLLQVLKEKNLIAESGPTVSRILLDESTREEDVEVEKKSIVEEVVTFPEESKAEKEVKKERKSLRIGSEGAEVLAMQEAL
        RRNSVQGGT+SVSETIANIAGLLQVLKEKNLIAESGPTVSRILLDES+REEDVE+EKK+IVEEVV F EESKAEKEVKKERKSLR+GSEGAEVLAMQEAL
Subjt:  RRNSVQGGTVSVSETIANIAGLLQVLKEKNLIAESGPTVSRILLDESTREEDVEVEKKSIVEEVVTFPEESKAEKEVKKERKSLRIGSEGAEVLAMQEAL

Query:  LMLGFYSGEEDMEFSSFSSGTERAVKTWQAASGFREDGIMTADLLEILYKEKITESVGSDAKTDEKGNIPTDQK------------------KTVVKEGS
        L LGFY GEEDMEFSSFSSGTERAVKTWQ+ASGFREDGIMT +LLEIL+KEK+TESVGSDAKTDEKGNIPTDQ+                  KT+VKEGS
Subjt:  LMLGFYSGEEDMEFSSFSSGTERAVKTWQAASGFREDGIMTADLLEILYKEKITESVGSDAKTDEKGNIPTDQK------------------KTVVKEGS

Query:  VSFDVSQQRVFLIGENRWEDPTRLHSSNGKASDGKTKDISTKCLTCRGEGRLLCSECDGSGEPNIEPQFLEWVDEGTKCPYCEGVGYTICDACEGRTV
           D+S+QRVFLIGENRWEDPTRLHSSNGKASDGKTK IST CLTCRGEGRLLC+ECDG+GEPNIEPQFLEWVDEG KCPYCEGVGY  CD CEG+TV
Subjt:  VSFDVSQQRVFLIGENRWEDPTRLHSSNGKASDGKTKDISTKCLTCRGEGRLLCSECDGSGEPNIEPQFLEWVDEGTKCPYCEGVGYTICDACEGRTV

A0A5D3CZL8 Golgin subfamily A member 6-like protein 6 isoform X2 (e value: 3.6e-184; identity: 85.22%)
Query:  LQFLRCISMSSSITLPLNPSLHLTPRLFSLSVFSSPKLSSSRSLPNSFICCSLNPASNDREELRWLREEQRWFREEERWIREEQRWARERQSLLQEIAEL
        L+FL CISMSSSITLPLNPSL LTPR  SLSVFSS KLSSSRSL +S IC SLNPASNDREELRW+REEQRWFREEERWIREEQRWARERQ LLQEIAEL
Subjt:  LQFLRCISMSSSITLPLNPSLHLTPRLFSLSVFSSPKLSSSRSLPNSFICCSLNPASNDREELRWLREEQRWFREEERWIREEQRWARERQSLLQEIAEL

Query:  KLQIQALERRNSVQGGTVSVSETIANIAGLLQVLKEKNLIAESGPTVSRILLDESTREEDVEVEKKSIVEEVVTFPEESKAEKEVKKERKSLRIGSEGAE
        KLQIQALERRNSVQGGT+SVSETIANIAGLLQVLKEKNLIAESGPTVSRILLDES+REEDVE+EKK+IVEEVV F EESKAEKEVKKERKSLR+GSEGAE
Subjt:  KLQIQALERRNSVQGGTVSVSETIANIAGLLQVLKEKNLIAESGPTVSRILLDESTREEDVEVEKKSIVEEVVTFPEESKAEKEVKKERKSLRIGSEGAE

Query:  VLAMQEALLMLGFYSGEEDMEFSSFSSGTERAVKTWQAASGFREDGIMTADLLEILYKEKITESVGSDAKTDEKGNIPTDQK------------------
        VLAMQEALL LGFY GEEDMEFSSFSSGTERAVKTWQ+ASGFREDGIMT +LLEIL+KEK+TESVGSDAKTDEKGNIPTDQ+                  
Subjt:  VLAMQEALLMLGFYSGEEDMEFSSFSSGTERAVKTWQAASGFREDGIMTADLLEILYKEKITESVGSDAKTDEKGNIPTDQK------------------

Query:  KTVVKEGSVSFDVSQQRVFLIGENRWEDPTRLHSSNGKASDGKTKDISTKCLTCRGEGRLLCSECDGSGEPNIEPQFLEWVDEGTKCPYCEGVGYTICDA
        KT+VKEGS   D+S+QRVFLIGENRWEDPTRLHSSNGKASDGKTK IST CLTCRGEGRLLC+ECDG+GEPNIEPQFLEWVDEG KCPYCEGVGY  CD 
Subjt:  KTVVKEGSVSFDVSQQRVFLIGENRWEDPTRLHSSNGKASDGKTKDISTKCLTCRGEGRLLCSECDGSGEPNIEPQFLEWVDEGTKCPYCEGVGYTICDA

Query:  CEGRTV
        CEG+TV
Subjt:  CEGRTV

A0A6J1F8Z4 protein disulfide isomerase pTAC5, chloroplastic (e value: 9.7e-174; identity: 81.84%)
Query:  MSSSITLP-LNPSLHLTPRLF---SLSVFSSPKLSSSRSLPNSFICCSLNPASNDREELRWLREEQRWFREEERWIREEQRWARERQSLLQEIAELKLQI
        MSSSITLP LNPSLHLTP  F   SLS+FSSPKL SSRS  NSFIC SLNPASN+REE+RW+REEQRW REEERWIREEQRWARER+SLLQEIAELKLQI
Subjt:  MSSSITLP-LNPSLHLTPRLF---SLSVFSSPKLSSSRSLPNSFICCSLNPASNDREELRWLREEQRWFREEERWIREEQRWARERQSLLQEIAELKLQI

Query:  QALERRNSVQGGTVSVSETIANIAGLLQVLKEKNLIAESGPTVSRILLDESTREEDVEVEKKSIV----------EEVVTFPEESKAEKEVKKERKSLRI
        QALER++S+QGGTVS SE+IANIAGLLQVLKEKNLIAESG +VSRILLDESTREEDVE+EKK+IV          EEVV F +E KAEKEVK ERKSLRI
Subjt:  QALERRNSVQGGTVSVSETIANIAGLLQVLKEKNLIAESGPTVSRILLDESTREEDVEVEKKSIV----------EEVVTFPEESKAEKEVKKERKSLRI

Query:  GSEGAEVLAMQEALLMLGFYSGEEDMEFSSFSSGTERAVKTWQAASGFREDGIMTADLLEILYKEKITESVGSDAKTDEKGNIPTDQK------------
        GSEGAEVLAMQEALL LGFYSGEEDMEFSSFSSGTERAVKTWQA SGFREDGIMTA+LLEILY EKITESVGS+AKTDEKGNIPTDQK            
Subjt:  GSEGAEVLAMQEALLMLGFYSGEEDMEFSSFSSGTERAVKTWQAASGFREDGIMTADLLEILYKEKITESVGSDAKTDEKGNIPTDQK------------

Query:  -------KTVVKEGSVSFDVSQQRVFLIGENRWEDPTRLHSSNGKASDGKTKDISTKCLTCRGEGRLLCSECDGSGEPNIEPQFLEWVDEGTKCPYCEGV
               +TVVKEGS SFDVSQQRVFL+GENRWEDP RL SSNGKASDGKTKD S KCLTCRGEGRLLCSECDGSGEPN+EPQFLEWVDEG KCPYCEG+
Subjt:  -------KTVVKEGSVSFDVSQQRVFLIGENRWEDPTRLHSSNGKASDGKTKDISTKCLTCRGEGRLLCSECDGSGEPNIEPQFLEWVDEGTKCPYCEGV

Query:  GYTICDACEGRTV
        GYTICDACEG+TV
Subjt:  GYTICDACEGRTV

A0A6J1ICM1 protein disulfide isomerase pTAC5, chloroplastic (e value: 3.7e-173; identity: 81.8%)
Query:  MSSSITLP-LNPSLHLTPRLF---SLSVFSSPKLSSSRSLPNSFICCSLNPASNDREELRWLREEQRWFREEERWIREEQRWARERQSLLQEIAELKLQI
        MSSSITLP LNPSLHLTP  F   SLS+FSSPKL SSRS  NSFIC SLNPASN+REE+RW+REEQRW REEERWIREEQRW RER+SLLQEIAELKLQI
Subjt:  MSSSITLP-LNPSLHLTPRLF---SLSVFSSPKLSSSRSLPNSFICCSLNPASNDREELRWLREEQRWFREEERWIREEQRWARERQSLLQEIAELKLQI

Query:  QALERRNSVQGGTVSVSETIANIAGLLQVLKEKNLIAESGPTVSRILLDESTREEDVEVEKKSIVEE----------VVTFPEESKAEKEVKKERKSLRI
        QALER++S+QGGTVS SETIANIAGLLQVLKEKNLIAESG +VSRILLDESTREEDVE+EKK+IVEE          VV + +E KAEKEVK ERKSLRI
Subjt:  QALERRNSVQGGTVSVSETIANIAGLLQVLKEKNLIAESGPTVSRILLDESTREEDVEVEKKSIVEE----------VVTFPEESKAEKEVKKERKSLRI

Query:  GSEGAEVLAMQEALLMLGFYSGEEDMEFSSFSSGTERAVKTWQAASGFREDGIMTADLLEILYKEKITESVGSDAKTDEKGNIPTDQK------------
        GSEGAEVLAMQEALL LGFYSGEEDMEFSSFSSGTERAVKTWQA SGFREDGIMTA+LLEILY EK+TESVGSDAKTDEKGNIPTDQK            
Subjt:  GSEGAEVLAMQEALLMLGFYSGEEDMEFSSFSSGTERAVKTWQAASGFREDGIMTADLLEILYKEKITESVGSDAKTDEKGNIPTDQK------------

Query:  -------KTVVKEGSVSFDVSQQRVFLIGENRWEDPTRLHSSNGKASDGKTKDISTKCLTCRGEGRLLCSECDGSGEPNIEPQFLEWVDEGTKCPYCEGV
               KTVVKEGS SFDVSQQRVFL+GENRWEDP RL SSNGKASDGKTK  STKCLTCRGEGRLLCSECDGSGEPN+EPQFLEWVDEG KCPYCEG+
Subjt:  -------KTVVKEGSVSFDVSQQRVFLIGENRWEDPTRLHSSNGKASDGKTKDISTKCLTCRGEGRLLCSECDGSGEPNIEPQFLEWVDEGTKCPYCEGV

Query:  GYTICDACEGRT
        GYTICDACEG+T
Subjt:  GYTICDACEGRT

SwissProt top hits (e value, % identity, alignment)
A1A6M1 Protein disulfide isomerase pTAC5, chloroplastic (e value: 1.8e-100; identity: 54.45%)
Query:  SMSSSITLPLNPSLHLTPRLFSLSVFSSPKLSSSRSLPNSFICCSL-NPASNDREELRWLREEQRWFREEERWIREEQRWARERQSLLQEIAELKLQIQA
        S S  ++LP  P   LT    SL    SP      S+P+S +C S  NP   DREE+RWLREEQRW REE+RWIREEQRW RER+SLLQEI++L+L+IQ+
Subjt:  SMSSSITLPLNPSLHLTPRLFSLSVFSSPKLSSSRSLPNSFICCSL-NPASNDREELRWLREEQRWFREEERWIREEQRWARERQSLLQEIAELKLQIQA

Query:  LERRNSVQGGTVSVSETIANIAGLLQVLKEKNLIAESGPTVSRILLDESTREEDVEVEKKSIVEEVVTFPEESKAEKEVKK-ERKSLRIGSEGAEVLAMQ
        LE RNS  G   S+ +TI+NIA LLQVLKEKN I+ESG + + ++L ESTRE+ VE E +   + V+   E+ +  + VKK +R+ L++GSEG +V A+Q
Subjt:  LERRNSVQGGTVSVSETIANIAGLLQVLKEKNLIAESGPTVSRILLDESTREEDVEVEKKSIVEEVVTFPEESKAEKEVKK-ERKSLRIGSEGAEVLAMQ

Query:  EALLMLGFYSGEEDMEFSSFSSGTERAVKTWQAASGFREDGIMTADLLEILYKEKITESVGSDAKT---DEKGN---------IPTDQKKTVVKEGSVSF
        EALL LGFYSGEEDMEFSSFSSGT  AVKTWQA+ G REDG+MTA+LL+ L+ ++  E+   +A T   +E GN         +P  ++  V  +     
Subjt:  EALLMLGFYSGEEDMEFSSFSSGTERAVKTWQAASGFREDGIMTADLLEILYKEKITESVGSDAKT---DEKGN---------IPTDQKKTVVKEGSVSF

Query:  DVSQQRVFLIGENRWEDPTRLHSSNGKASDGKTKDISTKCLTCRGEGRLLCSECDGSGEPNIEPQFLEWVDEGTKCPYCEGVGYTICDACEGR
        DV+Q RVFL+GENRWEDP+RL   N      ++ +  T+C+TCRGEGRL+C ECDG+GEPNIEPQF+EWV E TKCPYCEG+GYT+CD C+G+
Subjt:  DVSQQRVFLIGENRWEDPTRLHSSNGKASDGKTKDISTKCLTCRGEGRLLCSECDGSGEPNIEPQFLEWVDEGTKCPYCEGVGYTICDACEGR

Arabidopsis top hits (e value, % identity, alignment)
AT3G24160.1 putative type 1 membrane protein (e value: 5.4e-60; identity: 41.71%)
Query:  FIFFTLLIILPLARCEDTGSVLFVDSSSHQYLRSHSPDDGFEVSSMSLPEVGAAVSVLLGFAPPSTLSASGSSKLNGILMPNPLDRPRSVFMLEIKGEYG
        ++F   L++    R E +GSV F+D S++QYLR   P    E   MS  E+ AAVS LLGFAP +TL+A GSSKLN IL PNP +RPR+ F+LEI G   
Subjt:  FIFFTLLIILPLARCEDTGSVLFVDSSSHQYLRSHSPDDGFEVSSMSLPEVGAAVSVLLGFAPPSTLSASGSSKLNGILMPNPLDRPRSVFMLEIKGEYG

Query:  LCNPEILSLEGGMSSNVLTSKVHVGSESADIQLPGEDEVSVVPLNEPLPDYTDEDVREFASFIGGSYIADASKTLSGEFTVRLTDDVKINLHMSKTGDRE
        +      S       N + S +   S  AD +LP ++EV VV +NEP  D TD+D+ +FAS++GGSY+A A  + SG  ++ L     +  ++ K  +R+
Subjt:  LCNPEILSLEGGMSSNVLTSKVHVGSESADIQLPGEDEVSVVPLNEPLPDYTDEDVREFASFIGGSYIADASKTLSGEFTVRLTDDVKINLHMSKTGDRE

Query:  FIGSLLCLFHNIKRAIHIHEDLSQNVQSPSELITGSFNSIKAFQDESDSEGDADHRSRLFVVALSKIFHLLQKAYDGQIVGVVFFSGSSSPKAEKGLNVM
        F  +LL L+ NI++A+ +++DLS  +   +EL  G F  I A   E   +G A     + +  LSK+F+LL+ ++ GQIVGV+      + ++E  LN  
Subjt:  FIGSLLCLFHNIKRAIHIHEDLSQNVQSPSELITGSFNSIKAFQDESDSEGDADHRSRLFVVALSKIFHLLQKAYDGQIVGVVFFSGSSSPKAEKGLNVM

Query:  FNPRLTPRWLVEEVKVNTT--IQEVILVRTTLAWLTGIILLIATLMGVNF
         + R + R +VE   + +   I EVILVR TLAWLTGIILLIAT++GV F
Subjt:  FNPRLTPRWLVEEVKVNTT--IQEVILVRTTLAWLTGIILLIATLMGVNF

AT3G47650.1 DnaJ/Hsp40 cysteine-rich domain superfamily protein (e value: 5.5e-04; identity: 29.17%)
Query:  FDVSQQRVFLIGENRWEDPTRLHSSNGKASDGK---TKDISTKCLTCRGEGRLLCSECDGSGEPNIEPQFLEWVDEGTKCPYCEGVGYTICDACEG
        F  +     L+ +      +R  S   KA++     TK  S  C  C GEG + CS+C G G  N+   F      G  C  C G    +C  C G
Subjt:  FDVSQQRVFLIGENRWEDPTRLHSSNGKASDGK---TKDISTKCLTCRGEGRLLCSECDGSGEPNIEPQFLEWVDEGTKCPYCEGVGYTICDACEG

AT4G13670.1 plastid transcriptionally active 5 (e value: 1.3e-101; identity: 54.45%)
Query:  SMSSSITLPLNPSLHLTPRLFSLSVFSSPKLSSSRSLPNSFICCSL-NPASNDREELRWLREEQRWFREEERWIREEQRWARERQSLLQEIAELKLQIQA
        S S  ++LP  P   LT    SL    SP      S+P+S +C S  NP   DREE+RWLREEQRW REE+RWIREEQRW RER+SLLQEI++L+L+IQ+
Subjt:  SMSSSITLPLNPSLHLTPRLFSLSVFSSPKLSSSRSLPNSFICCSL-NPASNDREELRWLREEQRWFREEERWIREEQRWARERQSLLQEIAELKLQIQA

Query:  LERRNSVQGGTVSVSETIANIAGLLQVLKEKNLIAESGPTVSRILLDESTREEDVEVEKKSIVEEVVTFPEESKAEKEVKK-ERKSLRIGSEGAEVLAMQ
        LE RNS  G   S+ +TI+NIA LLQVLKEKN I+ESG + + ++L ESTRE+ VE E +   + V+   E+ +  + VKK +R+ L++GSEG +V A+Q
Subjt:  LERRNSVQGGTVSVSETIANIAGLLQVLKEKNLIAESGPTVSRILLDESTREEDVEVEKKSIVEEVVTFPEESKAEKEVKK-ERKSLRIGSEGAEVLAMQ

Query:  EALLMLGFYSGEEDMEFSSFSSGTERAVKTWQAASGFREDGIMTADLLEILYKEKITESVGSDAKT---DEKGN---------IPTDQKKTVVKEGSVSF
        EALL LGFYSGEEDMEFSSFSSGT  AVKTWQA+ G REDG+MTA+LL+ L+ ++  E+   +A T   +E GN         +P  ++  V  +     
Subjt:  EALLMLGFYSGEEDMEFSSFSSGTERAVKTWQAASGFREDGIMTADLLEILYKEKITESVGSDAKT---DEKGN---------IPTDQKKTVVKEGSVSF

Query:  DVSQQRVFLIGENRWEDPTRLHSSNGKASDGKTKDISTKCLTCRGEGRLLCSECDGSGEPNIEPQFLEWVDEGTKCPYCEGVGYTICDACEGR
        DV+Q RVFL+GENRWEDP+RL   N      ++ +  T+C+TCRGEGRL+C ECDG+GEPNIEPQF+EWV E TKCPYCEG+GYT+CD C+G+
Subjt:  DVSQQRVFLIGENRWEDPTRLHSSNGKASDGKTKDISTKCLTCRGEGRLLCSECDGSGEPNIEPQFLEWVDEGTKCPYCEGVGYTICDACEGR
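
In the alignment blocks above, the unlabelled middle line follows the usual BLAST convention: a letter marks an identical residue, "+" marks a conservative substitution, and a space marks a mismatch or gap. The sketch below counts these for a single alignment segment. The function name segment_stats is a hypothetical helper, and the example uses only the short final segment of the KAA0049159.1 alignment, so its ratio is not expected to match the whole-alignment % identity reported for that hit.

```python
def segment_stats(query: str, midline: str) -> tuple[int, int, int]:
    """Return (identities, positives, aligned_length) for one segment.

    `query` is the residue string from a 'Query:' line and `midline`
    is the unlabelled line beneath it, with the leading indent removed.
    """
    # Trailing spaces are often stripped from the midline, so pad it
    # back to the width of the query segment before counting.
    midline = midline.ljust(len(query))[:len(query)]
    identities = sum(ch.isalpha() for ch in midline)
    positives = identities + midline.count("+")
    return identities, positives, len(query)


# Final segment of the first GenBank hit: Query 'CEGRTV', midline 'CEG+TV'.
print(segment_stats("CEGRTV", "CEG+TV"))
# (5, 6, 6)
```

Summing the three counts over every segment of a hit and dividing identities by aligned length would give a percent identity for the whole alignment. This assumes gap columns count towards the aligned length.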


Sequences
CDS sequence
ATGTCTCAACTAAAAGGTGAACGGAAGTTTGCTTGGAAAAAATGGTCATCGGAATATTGCCTTTTATCTCCTCCGTTTGTGGTCAATACGGCTTCACAGATTCTCTCCAG
GATGGAATTCTGCAATTCCTTCATCTTCTTCACTCTGCTAATCATCCTCCCTCTCGCTAGGTGCGAGGATACTGGCTCGGTGCTCTTCGTTGATAGCTCGTCGCATCAAT
ATCTTCGATCTCATTCGCCCGATGATGGCTTTGAGGTTAGTTCAATGTCACTACCAGAAGTTGGTGCTGCTGTGTCAGTCTTGCTCGGTTTTGCGCCACCTTCAACGCTT
TCAGCTTCTGGATCATCTAAGCTGAATGGGATTTTGATGCCAAATCCGCTTGATAGGCCTCGTTCAGTTTTTATGCTTGAAATTAAAGGAGAATATGGTCTCTGCAACCC
TGAAATTTTAAGCCTGGAAGGTGGCATGTCCAGCAATGTTCTTACGAGCAAGGTTCATGTAGGTTCTGAGAGTGCTGATATCCAACTTCCTGGTGAGGATGAAGTGTCTG
TTGTTCCTTTGAATGAACCATTGCCAGATTATACAGATGAAGATGTTAGAGAATTTGCATCTTTCATCGGTGGATCATATATTGCTGATGCATCAAAAACTTTAAGTGGA
GAGTTTACTGTGCGCTTGACTGATGATGTTAAGATCAATCTTCATATGTCTAAGACGGGGGACAGAGAATTTATAGGTAGTCTTTTGTGTCTCTTTCACAATATTAAGAG
GGCTATTCACATTCATGAGGATTTGTCACAAAATGTGCAAAGTCCATCTGAGCTCATTACTGGCTCTTTCAATAGCATCAAGGCATTCCAAGATGAAAGTGATTCTGAAG
GAGATGCTGATCATAGATCGAGACTATTTGTGGTTGCTTTGTCCAAGATATTCCACTTGCTCCAAAAAGCATATGATGGTCAAATTGTTGGAGTTGTTTTCTTTTCTGGA
TCATCATCACCAAAGGCAGAAAAGGGATTAAATGTGATGTTTAACCCTCGGTTGACTCCACGTTGGTTGGTGGAAGAAGTCAAAGTTAACACAACTATTCAAGAAGTAAT
ATTGGTTAGGACAACCCTTGCCTGGCTCACAGGAATCATCCTTTTGATTGCTACTCTTATGGGGGTAAATTTTGCACCTGTACTCCATTTTGCTCCATCAAGTTTAAATA
TTCTGAGATTTTCCCCATCGGAAAAATATCTTCTCAAAGTCTCTACGTTCTTCCATTTCTCCTCTTTCTTCCACACTCTCCTTCAATTTCTCCGCTGCATTTCAATGTCT
TCCTCCATTACTCTTCCTCTCAATCCTTCTCTTCATCTCACTCCTCGCCTCTTCTCTCTCTCAGTCTTCTCATCTCCCAAGCTCTCATCATCTCGTTCTCTTCCCAATTC
ATTCATCTGTTGCTCTCTGAATCCCGCTAGCAATGACCGCGAGGAGCTCCGATGGCTCCGCGAGGAGCAGCGGTGGTTTCGCGAAGAGGAACGGTGGATCCGCGAAGAGC
AGCGTTGGGCCAGAGAACGCCAGTCACTTCTGCAGGAAATTGCTGAACTTAAGCTGCAAATTCAAGCTTTAGAACGCCGAAATTCCGTTCAAGGAGGGACGGTTTCTGTA
TCAGAGACTATTGCGAATATCGCCGGTCTGTTGCAGGTCTTGAAGGAGAAGAATCTGATTGCCGAGAGCGGGCCGACGGTGAGTCGCATTTTGTTGGATGAGAGTACTCG
TGAAGAGGACGTGGAGGTAGAGAAGAAGTCAATTGTTGAAGAAGTCGTTACGTTTCCCGAAGAAAGTAAGGCGGAGAAAGAGGTGAAGAAGGAGAGAAAGTCTTTGAGAA
TTGGTTCGGAAGGAGCGGAAGTCCTAGCGATGCAGGAAGCATTGCTGATGCTAGGATTTTACTCTGGTGAAGAAGACATGGAATTTTCAAGTTTCTCCAGTGGAACTGAG
CGTGCAGTTAAGACTTGGCAGGCCGCCTCGGGTTTCCGTGAAGATGGTATAATGACTGCAGACCTTCTTGAGATTTTATACAAGGAGAAAATAACTGAGAGTGTTGGATC
AGATGCCAAAACAGATGAAAAAGGGAATATTCCAACAGATCAGAAAAAAACAGTAGTAAAAGAAGGCAGTGTAAGTTTTGATGTTTCCCAACAGCGAGTTTTTCTCATTG
GAGAGAACCGATGGGAAGACCCCACACGACTTCATAGTTCAAATGGAAAAGCTTCTGATGGCAAAACCAAAGACATATCTACCAAGTGTCTGACTTGTCGTGGGGAGGGT
CGTCTGTTATGCTCAGAATGCGACGGAAGTGGTGAGCCAAATATTGAACCACAGTTTCTGGAATGGGTGGATGAAGGAACAAAATGTCCATATTGCGAAGGCGTTGGTTA
TACAATATGCGACGCATGCGAAGGGAGGACAGTGACGTAG
mRNA sequence
ATGTCTCAACTAAAAGGTGAACGGAAGTTTGCTTGGAAAAAATGGTCATCGGAATATTGCCTTTTATCTCCTCCGTTTGTGGTCAATACGGCTTCACAGATTCTCTCCAG
GATGGAATTCTGCAATTCCTTCATCTTCTTCACTCTGCTAATCATCCTCCCTCTCGCTAGGTGCGAGGATACTGGCTCGGTGCTCTTCGTTGATAGCTCGTCGCATCAAT
ATCTTCGATCTCATTCGCCCGATGATGGCTTTGAGGTTAGTTCAATGTCACTACCAGAAGTTGGTGCTGCTGTGTCAGTCTTGCTCGGTTTTGCGCCACCTTCAACGCTT
TCAGCTTCTGGATCATCTAAGCTGAATGGGATTTTGATGCCAAATCCGCTTGATAGGCCTCGTTCAGTTTTTATGCTTGAAATTAAAGGAGAATATGGTCTCTGCAACCC
TGAAATTTTAAGCCTGGAAGGTGGCATGTCCAGCAATGTTCTTACGAGCAAGGTTCATGTAGGTTCTGAGAGTGCTGATATCCAACTTCCTGGTGAGGATGAAGTGTCTG
TTGTTCCTTTGAATGAACCATTGCCAGATTATACAGATGAAGATGTTAGAGAATTTGCATCTTTCATCGGTGGATCATATATTGCTGATGCATCAAAAACTTTAAGTGGA
GAGTTTACTGTGCGCTTGACTGATGATGTTAAGATCAATCTTCATATGTCTAAGACGGGGGACAGAGAATTTATAGGTAGTCTTTTGTGTCTCTTTCACAATATTAAGAG
GGCTATTCACATTCATGAGGATTTGTCACAAAATGTGCAAAGTCCATCTGAGCTCATTACTGGCTCTTTCAATAGCATCAAGGCATTCCAAGATGAAAGTGATTCTGAAG
GAGATGCTGATCATAGATCGAGACTATTTGTGGTTGCTTTGTCCAAGATATTCCACTTGCTCCAAAAAGCATATGATGGTCAAATTGTTGGAGTTGTTTTCTTTTCTGGA
TCATCATCACCAAAGGCAGAAAAGGGATTAAATGTGATGTTTAACCCTCGGTTGACTCCACGTTGGTTGGTGGAAGAAGTCAAAGTTAACACAACTATTCAAGAAGTAAT
ATTGGTTAGGACAACCCTTGCCTGGCTCACAGGAATCATCCTTTTGATTGCTACTCTTATGGGGGTAAATTTTGCACCTGTACTCCATTTTGCTCCATCAAGTTTAAATA
TTCTGAGATTTTCCCCATCGGAAAAATATCTTCTCAAAGTCTCTACGTTCTTCCATTTCTCCTCTTTCTTCCACACTCTCCTTCAATTTCTCCGCTGCATTTCAATGTCT
TCCTCCATTACTCTTCCTCTCAATCCTTCTCTTCATCTCACTCCTCGCCTCTTCTCTCTCTCAGTCTTCTCATCTCCCAAGCTCTCATCATCTCGTTCTCTTCCCAATTC
ATTCATCTGTTGCTCTCTGAATCCCGCTAGCAATGACCGCGAGGAGCTCCGATGGCTCCGCGAGGAGCAGCGGTGGTTTCGCGAAGAGGAACGGTGGATCCGCGAAGAGC
AGCGTTGGGCCAGAGAACGCCAGTCACTTCTGCAGGAAATTGCTGAACTTAAGCTGCAAATTCAAGCTTTAGAACGCCGAAATTCCGTTCAAGGAGGGACGGTTTCTGTA
TCAGAGACTATTGCGAATATCGCCGGTCTGTTGCAGGTCTTGAAGGAGAAGAATCTGATTGCCGAGAGCGGGCCGACGGTGAGTCGCATTTTGTTGGATGAGAGTACTCG
TGAAGAGGACGTGGAGGTAGAGAAGAAGTCAATTGTTGAAGAAGTCGTTACGTTTCCCGAAGAAAGTAAGGCGGAGAAAGAGGTGAAGAAGGAGAGAAAGTCTTTGAGAA
TTGGTTCGGAAGGAGCGGAAGTCCTAGCGATGCAGGAAGCATTGCTGATGCTAGGATTTTACTCTGGTGAAGAAGACATGGAATTTTCAAGTTTCTCCAGTGGAACTGAG
CGTGCAGTTAAGACTTGGCAGGCCGCCTCGGGTTTCCGTGAAGATGGTATAATGACTGCAGACCTTCTTGAGATTTTATACAAGGAGAAAATAACTGAGAGTGTTGGATC
AGATGCCAAAACAGATGAAAAAGGGAATATTCCAACAGATCAGAAAAAAACAGTAGTAAAAGAAGGCAGTGTAAGTTTTGATGTTTCCCAACAGCGAGTTTTTCTCATTG
GAGAGAACCGATGGGAAGACCCCACACGACTTCATAGTTCAAATGGAAAAGCTTCTGATGGCAAAACCAAAGACATATCTACCAAGTGTCTGACTTGTCGTGGGGAGGGT
CGTCTGTTATGCTCAGAATGCGACGGAAGTGGTGAGCCAAATATTGAACCACAGTTTCTGGAATGGGTGGATGAAGGAACAAAATGTCCATATTGCGAAGGCGTTGGTTA
TACAATATGCGACGCATGCGAAGGGAGGACAGTGACGTAGGTATGACTACTCACCCGTTGTAAATAATGGCTGTTAATTATTCACTTATTAAGCTCCTTTTCAATTTTGT
TATATCTCTTTCGAATCTATATTATATATAGACAGTATAAATAAGACGATGTATAATCTTTTTAACTCTCCACCATAGTTGTACTGAAAAAAAAACACACTCCACTATAG
CATTATGTTCTGGTAGATTTATAGGTTCAAAAGTACCAATGATAGGCTACAAAAATCAGGGTGAGAATGGACACTATTATCAATTCTCCTTGGGAAACTATTGAGATTAC
TCTTTACTTATGTTGGGGC
Protein sequence
MSQLKGERKFAWKKWSSEYCLLSPPFVVNTASQILSRMEFCNSFIFFTLLIILPLARCEDTGSVLFVDSSSHQYLRSHSPDDGFEVSSMSLPEVGAAVSVLLGFAPPSTL
SASGSSKLNGILMPNPLDRPRSVFMLEIKGEYGLCNPEILSLEGGMSSNVLTSKVHVGSESADIQLPGEDEVSVVPLNEPLPDYTDEDVREFASFIGGSYIADASKTLSG
EFTVRLTDDVKINLHMSKTGDREFIGSLLCLFHNIKRAIHIHEDLSQNVQSPSELITGSFNSIKAFQDESDSEGDADHRSRLFVVALSKIFHLLQKAYDGQIVGVVFFSG
SSSPKAEKGLNVMFNPRLTPRWLVEEVKVNTTIQEVILVRTTLAWLTGIILLIATLMGVNFAPVLHFAPSSLNILRFSPSEKYLLKVSTFFHFSSFFHTLLQFLRCISMS
SSITLPLNPSLHLTPRLFSLSVFSSPKLSSSRSLPNSFICCSLNPASNDREELRWLREEQRWFREEERWIREEQRWARERQSLLQEIAELKLQIQALERRNSVQGGTVSV
SETIANIAGLLQVLKEKNLIAESGPTVSRILLDESTREEDVEVEKKSIVEEVVTFPEESKAEKEVKKERKSLRIGSEGAEVLAMQEALLMLGFYSGEEDMEFSSFSSGTE
RAVKTWQAASGFREDGIMTADLLEILYKEKITESVGSDAKTDEKGNIPTDQKKTVVKEGSVSFDVSQQRVFLIGENRWEDPTRLHSSNGKASDGKTKDISTKCLTCRGEG
RLLCSECDGSGEPNIEPQFLEWVDEGTKCPYCEGVGYTICDACEGRTVT
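
The protein sequence above should correspond to a translation of the CDS sequence. As a minimal sketch, assuming the standard genetic code applies (the page does not say which table was used), a plain-Python translation could look like this. The names translate and CODON_TABLE are hypothetical, and only the first and last few codons of the CDS are used as input.

```python
from itertools import product

# Standard genetic code, with codons ordered T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str, to_stop: bool = True) -> str:
    """Translate a DNA coding sequence, one codon at a time."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    protein = []
    # Ignore any trailing bases that do not fill a complete codon.
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # 'X' for unknown codons
        if aa == "*" and to_stop:
            break
        protein.append(aa)
    return "".join(protein)


print(translate("ATGTCTCAACTAAAAGGTGAACGG"))  # first 24 bases of the CDS
# MSQLKGER
print(translate("GGGAGGACAGTGACGTAG"))        # last 18 bases of the CDS
# GRTVT
```

Both results agree with the start (MSQLKGER) and end (GRTVT) of the protein sequence listed above.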