CuGenDBv2

Sed0015043 (gene) of Chayote v1 genome

Gene ID: Sed0015043
Organism: Sechium edule (Chayote v1)
Description: Ankyrin repeat-containing protein
Genome location: LG01:10993171..10997459
RNA-Seq Expression: Sed0015043
Synteny: Sed0015043
Gene Ontology terms:
  GO:0007131 - reciprocal meiotic recombination (biological process)
  GO:0051177 - meiotic sister chromatid cohesion (biological process)
  GO:0016021 - integral component of membrane (cellular component)
  GO:0005515 - protein binding (molecular function)
InterPro domains:
  IPR002110 - Ankyrin repeat
  IPR036770 - Ankyrin repeat-containing domain superfamily
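
The genome location above follows a `scaffold:start..end` pattern. As a minimal sketch (the function name `parse_location` is hypothetical, and treating the coordinates as 1-based and inclusive is an assumption, not something this record states), the string can be split and the locus span computed like this:

```python
import re

def parse_location(loc):
    """Split a 'scaffold:start..end' string into (scaffold, start, end)."""
    m = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", loc)
    if m is None:
        raise ValueError(f"unrecognized location: {loc!r}")
    return m.group(1), int(m.group(2)), int(m.group(3))

scaffold, start, end = parse_location("LG01:10993171..10997459")
# Assumption: 1-based inclusive coordinates, so the span is end - start + 1.
span = end - start + 1
print(scaffold, start, end, span)
```

Under that assumption the locus spans 4,289 bp on LG01.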


Homology
GenBank top hits (e-value, % identity, alignment)
KAA0055387.1 ankyrin repeat-containing protein [Cucumis melo var. makuwa] (e-value: 5.4e-171; identity: 73.67%)
Query:  PVSFFSDPLEEVVVEPNLASPPPPTAALAQSEPIS-GRFSDCTPPQ-TRTMQHVPPPSDQEQDFD-CNQVRVEAQRRLLLYKSALKGEWKRVESLVEKFP
        P++F      EVVVE NL SPP  TA  A+SEP S    S+CT P+ +RT+      SD E D D   + RVE  RRLLLYKSALKG+WKR E ++  +P
Subjt:  PVSFFSDPLEEVVVEPNLASPPPPTAALAQSEPIS-GRFSDCTPPQ-TRTMQHVPPPSDQEQDFD-CNQVRVEAQRRLLLYKSALKGEWKRVESLVEKFP

Query:  HFVRCAITRNKETVLHVAAGAKQSVFVEELVCRMTPDDMTLQDKYGNTALCFAATSRIVKIAKLMVEKNSRLPLIRTFREVTPLLIAISYKCRPMISYLL
        H+VRCAITRNKETVLHVAAGAKQSVFVEELV RMT +DM L+DKYGNTALCFAATSRIVKIAKLMVEKN  LPLIRTFRE TPLLIA+SYK R MISYLL
Subjt:  HFVRCAITRNKETVLHVAAGAKQSVFVEELVCRMTPDDMTLQDKYGNTALCFAATSRIVKIAKLMVEKNSRLPLIRTFREVTPLLIAISYKCRPMISYLL

Query:  SVTDLSQLSSEERIELLIATIHSEFYDLSLWILQLYPELAIMKDIKNNNESALHVLARKPSAMDSTKQLKHWEKCINS--------------CVKEI---
        SVTDLSQL+++ERIELLIATIHS+F+DLSLWIL+LYPELA+MKD KNNNE+ALHVLARKPSAMDSTKQL++W+  INS               + EI   
Subjt:  SVTDLSQLSSEERIELLIATIHSEFYDLSLWILQLYPELAIMKDIKNNNESALHVLARKPSAMDSTKQLKHWEKCINS--------------CVKEI---

Query:  ------SNKEVIKTLARELVEVLWRYVVYELPQQTMLEFIKHPTSLLNDAAGEGNVEFLIVLIREFPDIVW--EDDDDDSKSIFHVAVENRLENVFNLIN
              SNK+V KTLA +LVE LWRYVVYELPQ+ MLEFI+HPTSLLNDAAG GNVEFLIVLIRE+PDI+W  +DD+DDSKSIFHVAVENRLENVFNLIN
Subjt:  ------SNKEVIKTLARELVEVLWRYVVYELPQQTMLEFIKHPTSLLNDAAGEGNVEFLIVLIREFPDIVW--EDDDDDSKSIFHVAVENRLENVFNLIN

Query:  EIGKLNEFSTKYRTFKGKYNILHLAGNLAAPNHLNKVSGAALQMQREMLWFK
        EIGKLNEFSTKYRTFKGKY+ILHLAGNLAAPNHLN+VSGAALQMQREMLWFK
Subjt:  EIGKLNEFSTKYRTFKGKYNILHLAGNLAAPNHLNKVSGAALQMQREMLWFK

KGN64473.1 hypothetical protein Csa_013828 [Cucumis sativus] (e-value: 1.1e-128; identity: 67.3%)
Query:  DQEQDFDCNQVRVEAQRRLLLYKSALKGEWKRVESLVEKFPHFVRCAITRNKETVLHVAAGAKQSVFVEELVCRMTPDDMTLQDKYGNTALCFAATSRIV
        D ++  D N  ++ A  R+ LY++ALKGEW+ VE L+++ P+ VR AITRN+ET+LH+AAGAKQ  FV +L+ RM+ DDM LQ+++GNTALCFAA S +V
Subjt:  DQEQDFDCNQVRVEAQRRLLLYKSALKGEWKRVESLVEKFPHFVRCAITRNKETVLHVAAGAKQSVFVEELVCRMTPDDMTLQDKYGNTALCFAATSRIV

Query:  KIAKLMVEKNSRLPLIRTFRE-VTPLLIAISYKCRPMISYLLSVTDLSQLSSEERIELLIATIHSEFYDLSLWILQLYPELAIMKDIKNNNESALHVLAR
        +IA+LMVEKN  LPLIR F   VTPL IA+SYKC  M+SYLLSVTDL+QL  +E+IELLIATI S+FYD+SLWILQ YP LAIM+D   N E+ALHV+AR
Subjt:  KIAKLMVEKNSRLPLIRTFRE-VTPLLIAISYKCRPMISYLLSVTDLSQLSSEERIELLIATIHSEFYDLSLWILQLYPELAIMKDIKNNNESALHVLAR

Query:  KPSAMDSTKQLKHWEKCINSCVKEISNKEVIKTLARELVEVLWRYVVYELPQQTMLEFIKHPTSLLNDAAGEGNVEFLIVLIREFPDIVWEDDDDDSKSI
        KPSAMD TKQL  W   +NS    I  K V KTLA ELV +L   V+  LP++ ML+FIKHPT LLNDAA  GNVEFLIVLIR++PDI+WE D DD KSI
Subjt:  KPSAMDSTKQLKHWEKCINSCVKEISNKEVIKTLARELVEVLWRYVVYELPQQTMLEFIKHPTSLLNDAAGEGNVEFLIVLIREFPDIVWEDDDDDSKSI

Query:  FHVAVENRLENVFNLINEIGKLNEFSTKYRTFKGK-YNILHLAGNLAAPNHLNKVSGAALQMQREMLWFK
        FHVA+ENRLENVFNLINEIG+LNEF+ KYRTFKG+ YNILHLAG+LA PNHLN+VSGAALQMQREMLWFK
Subjt:  FHVAVENRLENVFNLINEIGKLNEFSTKYRTFKGK-YNILHLAGNLAAPNHLNKVSGAALQMQREMLWFK

XP_004145199.1 uncharacterized protein LOC101215460 [Cucumis sativus] (e-value: 1.5e-165; identity: 74.55%)
Query:  EVVVEPNLASPPPPTAALAQSEPIS-GRFSDCTPPQ-TRTMQHVPPPSDQEQDFD-CNQVRVEAQRRLLLYKSALKGEWKRVESLVEKFPHFVRCAITRN
        EVVVE N  SPP  T   A+SEP S    S+CT  + +RT+      SD E D D   + R E  RRLLLYKSALKG+WKR E ++  +PH+VRCAITRN
Subjt:  EVVVEPNLASPPPPTAALAQSEPIS-GRFSDCTPPQ-TRTMQHVPPPSDQEQDFD-CNQVRVEAQRRLLLYKSALKGEWKRVESLVEKFPHFVRCAITRN

Query:  KETVLHVAAGAKQSVFVEELVCRMTPDDMTLQDKYGNTALCFAATSRIVKIAKLMVEKNSRLPLIRTFREVTPLLIAISYKCRPMISYLLSVTDLSQLSS
        KETVLHVAAGAKQSVFVEELV RMT  DM L+DKYGNTALCFAATSRIVKIAKLMVEKN  LPLIRTFRE TPLLIA+SYK R MISYLLSVTDLSQL++
Subjt:  KETVLHVAAGAKQSVFVEELVCRMTPDDMTLQDKYGNTALCFAATSRIVKIAKLMVEKNSRLPLIRTFREVTPLLIAISYKCRPMISYLLSVTDLSQLSS

Query:  EERIELLIATIHSEFYDLSLWILQLYPELAIMKDIKNNNESALHVLARKPSAMDSTKQLKHWEKCINS-------------CVKEI---------SNKEV
        +ERIELLIATIHS+F DLSLWIL+LYPELA+MKD KNNNE+ALHVLARKPSAMDSTKQL++ +  INS              + EI         SNK+V
Subjt:  EERIELLIATIHSEFYDLSLWILQLYPELAIMKDIKNNNESALHVLARKPSAMDSTKQLKHWEKCINS-------------CVKEI---------SNKEV

Query:  IKTLARELVEVLWRYVVYELPQQTMLEFIKHPTSLLNDAAGEGNVEFLIVLIREFPDIVWEDDD-DDSKSIFHVAVENRLENVFNLINEIGKLNEFSTKY
         KTLA +LVE LWRYVVYELPQ+ MLEFIKHPTSLLNDAAG GNVEFLIVLI EFPDI+W DDD DDSKSIFHVAVENRLENVFNLINEIGKLNEFSTKY
Subjt:  IKTLARELVEVLWRYVVYELPQQTMLEFIKHPTSLLNDAAGEGNVEFLIVLIREFPDIVWEDDD-DDSKSIFHVAVENRLENVFNLINEIGKLNEFSTKY

Query:  RTFKGKYNILHLAGNLAAPNHLNKVSGAALQMQREMLWFK
        RTFKGKY+ILHLAGNLAAPNHLN+VSGAALQMQREMLWFK
Subjt:  RTFKGKYNILHLAGNLAAPNHLNKVSGAALQMQREMLWFK

XP_008440631.1 PREDICTED: uncharacterized protein LOC103484989 isoform X1 [Cucumis melo] (e-value: 7.8e-170; identity: 74.89%)
Query:  EVVVEPNLASPPPPTAALAQSEPIS-GRFSDCTPPQ-TRTMQHVPPPSDQEQDFD-CNQVRVEAQRRLLLYKSALKGEWKRVESLVEKFPHFVRCAITRN
        EVVVE NL SPP  TA  A+SEP S    S+CT P+ +RT+      SD E D D   + RVE  RRLLLYKSALKG+WKR E ++  +PH+VRCAITRN
Subjt:  EVVVEPNLASPPPPTAALAQSEPIS-GRFSDCTPPQ-TRTMQHVPPPSDQEQDFD-CNQVRVEAQRRLLLYKSALKGEWKRVESLVEKFPHFVRCAITRN

Query:  KETVLHVAAGAKQSVFVEELVCRMTPDDMTLQDKYGNTALCFAATSRIVKIAKLMVEKNSRLPLIRTFREVTPLLIAISYKCRPMISYLLSVTDLSQLSS
        KETVLHVAAGAKQSVFVEELV RMT +DM L+DKYGNTALCFAATSRIVKIAKLMVEKN  LPLIRTFRE TPLLIA+SYK R MISYLLSVTDLSQL++
Subjt:  KETVLHVAAGAKQSVFVEELVCRMTPDDMTLQDKYGNTALCFAATSRIVKIAKLMVEKNSRLPLIRTFREVTPLLIAISYKCRPMISYLLSVTDLSQLSS

Query:  EERIELLIATIHSEFYDLSLWILQLYPELAIMKDIKNNNESALHVLARKPSAMDSTKQLKHWEKCINS--------------CVKEI---------SNKE
        +ERIELLIATIHS+F+DLSLWIL+LYPELA+MKD KNNNE+ALHVLARKPSAMDSTKQL++W+  INS               + EI         SNK+
Subjt:  EERIELLIATIHSEFYDLSLWILQLYPELAIMKDIKNNNESALHVLARKPSAMDSTKQLKHWEKCINS--------------CVKEI---------SNKE

Query:  VIKTLARELVEVLWRYVVYELPQQTMLEFIKHPTSLLNDAAGEGNVEFLIVLIREFPDIVW--EDDDDDSKSIFHVAVENRLENVFNLINEIGKLNEFST
        V KTLA +LVE LWRYVVYELPQ+ MLEFI+HPTSLLNDAAG GNVEFLIVLIRE+PDI+W  +DD+DDSKSIFHVAVENRLENVFNLINEIGKLNEFST
Subjt:  VIKTLARELVEVLWRYVVYELPQQTMLEFIKHPTSLLNDAAGEGNVEFLIVLIREFPDIVW--EDDDDDSKSIFHVAVENRLENVFNLINEIGKLNEFST

Query:  KYRTFKGKYNILHLAGNLAAPNHLNKVSGAALQMQREMLWFK
        KYRTFKGKY+ILHLAGNLAAPNHLN+VSGAALQMQREMLWFK
Subjt:  KYRTFKGKYNILHLAGNLAAPNHLNKVSGAALQMQREMLWFK

XP_008440640.1 PREDICTED: uncharacterized protein LOC103484989 isoform X2 [Cucumis melo] (e-value: 4.1e-155; identity: 78.53%)
Query:  LKGEWKRVESLVEKFPHFVRCAITRNKETVLHVAAGAKQSVFVEELVCRMTPDDMTLQDKYGNTALCFAATSRIVKIAKLMVEKNSRLPLIRTFREVTPL
        ++G+WKR E ++  +PH+VRCAITRNKETVLHVAAGAKQSVFVEELV RMT +DM L+DKYGNTALCFAATSRIVKIAKLMVEKN  LPLIRTFRE TPL
Subjt:  LKGEWKRVESLVEKFPHFVRCAITRNKETVLHVAAGAKQSVFVEELVCRMTPDDMTLQDKYGNTALCFAATSRIVKIAKLMVEKNSRLPLIRTFREVTPL

Query:  LIAISYKCRPMISYLLSVTDLSQLSSEERIELLIATIHSEFYDLSLWILQLYPELAIMKDIKNNNESALHVLARKPSAMDSTKQLKHWEKCINS------
        LIA+SYK R MISYLLSVTDLSQL+++ERIELLIATIHS+F+DLSLWIL+LYPELA+MKD KNNNE+ALHVLARKPSAMDSTKQL++W+  INS      
Subjt:  LIAISYKCRPMISYLLSVTDLSQLSSEERIELLIATIHSEFYDLSLWILQLYPELAIMKDIKNNNESALHVLARKPSAMDSTKQLKHWEKCINS------

Query:  --------CVKEI---------SNKEVIKTLARELVEVLWRYVVYELPQQTMLEFIKHPTSLLNDAAGEGNVEFLIVLIREFPDIVW--EDDDDDSKSIF
                 + EI         SNK+V KTLA +LVE LWRYVVYELPQ+ MLEFI+HPTSLLNDAAG GNVEFLIVLIRE+PDI+W  +DD+DDSKSIF
Subjt:  --------CVKEI---------SNKEVIKTLARELVEVLWRYVVYELPQQTMLEFIKHPTSLLNDAAGEGNVEFLIVLIREFPDIVW--EDDDDDSKSIF

Query:  HVAVENRLENVFNLINEIGKLNEFSTKYRTFKGKYNILHLAGNLAAPNHLNKVSGAALQMQREMLWFK
        HVAVENRLENVFNLINEIGKLNEFSTKYRTFKGKY+ILHLAGNLAAPNHLN+VSGAALQMQREMLWFK
Subjt:  HVAVENRLENVFNLINEIGKLNEFSTKYRTFKGKYNILHLAGNLAAPNHLNKVSGAALQMQREMLWFK
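
In the alignment blocks above, the unlabeled middle line marks each column: a letter for an identical residue, `+` for a conservative substitution, and a space for a mismatch or gap. A rough sketch of tallying one block from that midline follows; the helper name `midline_stats` is hypothetical, and the figure it gives is for a single block, not the whole-alignment %identity reported per hit.

```python
def midline_stats(midline):
    """Return (identities, positives, columns) for a BLAST-style midline."""
    # Letters on the midline mark identical residues.
    identities = sum(1 for c in midline if c.isalpha())
    # Positives are identities plus conservative substitutions ('+').
    positives = identities + midline.count("+")
    return identities, positives, len(midline)

# Midline of the final block of the KAA0055387.1 alignment.
midline = "EIGKLNEFSTKYRTFKGKY+ILHLAGNLAAPNHLN+VSGAALQMQREMLWFK"
ident, pos, cols = midline_stats(midline)
print(f"{ident}/{cols} identical ({100 * ident / cols:.2f}%), {pos}/{cols} positive")
```

For this block the tally is 50 identical residues in 52 columns. The midline should be trimmed to the same columns as the Query line before counting, since trailing spaces would otherwise inflate the column total.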

TrEMBL top hits (e-value, % identity, alignment)
A0A0A0LRT6 ANK_REP_REGION domain-containing protein (e-value: 5.5e-129; identity: 67.3%)
Query:  DQEQDFDCNQVRVEAQRRLLLYKSALKGEWKRVESLVEKFPHFVRCAITRNKETVLHVAAGAKQSVFVEELVCRMTPDDMTLQDKYGNTALCFAATSRIV
        D ++  D N  ++ A  R+ LY++ALKGEW+ VE L+++ P+ VR AITRN+ET+LH+AAGAKQ  FV +L+ RM+ DDM LQ+++GNTALCFAA S +V
Subjt:  DQEQDFDCNQVRVEAQRRLLLYKSALKGEWKRVESLVEKFPHFVRCAITRNKETVLHVAAGAKQSVFVEELVCRMTPDDMTLQDKYGNTALCFAATSRIV

Query:  KIAKLMVEKNSRLPLIRTFRE-VTPLLIAISYKCRPMISYLLSVTDLSQLSSEERIELLIATIHSEFYDLSLWILQLYPELAIMKDIKNNNESALHVLAR
        +IA+LMVEKN  LPLIR F   VTPL IA+SYKC  M+SYLLSVTDL+QL  +E+IELLIATI S+FYD+SLWILQ YP LAIM+D   N E+ALHV+AR
Subjt:  KIAKLMVEKNSRLPLIRTFRE-VTPLLIAISYKCRPMISYLLSVTDLSQLSSEERIELLIATIHSEFYDLSLWILQLYPELAIMKDIKNNNESALHVLAR

Query:  KPSAMDSTKQLKHWEKCINSCVKEISNKEVIKTLARELVEVLWRYVVYELPQQTMLEFIKHPTSLLNDAAGEGNVEFLIVLIREFPDIVWEDDDDDSKSI
        KPSAMD TKQL  W   +NS    I  K V KTLA ELV +L   V+  LP++ ML+FIKHPT LLNDAA  GNVEFLIVLIR++PDI+WE D DD KSI
Subjt:  KPSAMDSTKQLKHWEKCINSCVKEISNKEVIKTLARELVEVLWRYVVYELPQQTMLEFIKHPTSLLNDAAGEGNVEFLIVLIREFPDIVWEDDDDDSKSI

Query:  FHVAVENRLENVFNLINEIGKLNEFSTKYRTFKGK-YNILHLAGNLAAPNHLNKVSGAALQMQREMLWFK
        FHVA+ENRLENVFNLINEIG+LNEF+ KYRTFKG+ YNILHLAG+LA PNHLN+VSGAALQMQREMLWFK
Subjt:  FHVAVENRLENVFNLINEIGKLNEFSTKYRTFKGK-YNILHLAGNLAAPNHLNKVSGAALQMQREMLWFK

A0A1S3B146 uncharacterized protein LOC103484989 isoform X1 (e-value: 3.8e-170; identity: 74.89%)
Query:  EVVVEPNLASPPPPTAALAQSEPIS-GRFSDCTPPQ-TRTMQHVPPPSDQEQDFD-CNQVRVEAQRRLLLYKSALKGEWKRVESLVEKFPHFVRCAITRN
        EVVVE NL SPP  TA  A+SEP S    S+CT P+ +RT+      SD E D D   + RVE  RRLLLYKSALKG+WKR E ++  +PH+VRCAITRN
Subjt:  EVVVEPNLASPPPPTAALAQSEPIS-GRFSDCTPPQ-TRTMQHVPPPSDQEQDFD-CNQVRVEAQRRLLLYKSALKGEWKRVESLVEKFPHFVRCAITRN

Query:  KETVLHVAAGAKQSVFVEELVCRMTPDDMTLQDKYGNTALCFAATSRIVKIAKLMVEKNSRLPLIRTFREVTPLLIAISYKCRPMISYLLSVTDLSQLSS
        KETVLHVAAGAKQSVFVEELV RMT +DM L+DKYGNTALCFAATSRIVKIAKLMVEKN  LPLIRTFRE TPLLIA+SYK R MISYLLSVTDLSQL++
Subjt:  KETVLHVAAGAKQSVFVEELVCRMTPDDMTLQDKYGNTALCFAATSRIVKIAKLMVEKNSRLPLIRTFREVTPLLIAISYKCRPMISYLLSVTDLSQLSS

Query:  EERIELLIATIHSEFYDLSLWILQLYPELAIMKDIKNNNESALHVLARKPSAMDSTKQLKHWEKCINS--------------CVKEI---------SNKE
        +ERIELLIATIHS+F+DLSLWIL+LYPELA+MKD KNNNE+ALHVLARKPSAMDSTKQL++W+  INS               + EI         SNK+
Subjt:  EERIELLIATIHSEFYDLSLWILQLYPELAIMKDIKNNNESALHVLARKPSAMDSTKQLKHWEKCINS--------------CVKEI---------SNKE

Query:  VIKTLARELVEVLWRYVVYELPQQTMLEFIKHPTSLLNDAAGEGNVEFLIVLIREFPDIVW--EDDDDDSKSIFHVAVENRLENVFNLINEIGKLNEFST
        V KTLA +LVE LWRYVVYELPQ+ MLEFI+HPTSLLNDAAG GNVEFLIVLIRE+PDI+W  +DD+DDSKSIFHVAVENRLENVFNLINEIGKLNEFST
Subjt:  VIKTLARELVEVLWRYVVYELPQQTMLEFIKHPTSLLNDAAGEGNVEFLIVLIREFPDIVW--EDDDDDSKSIFHVAVENRLENVFNLINEIGKLNEFST

Query:  KYRTFKGKYNILHLAGNLAAPNHLNKVSGAALQMQREMLWFK
        KYRTFKGKY+ILHLAGNLAAPNHLN+VSGAALQMQREMLWFK
Subjt:  KYRTFKGKYNILHLAGNLAAPNHLNKVSGAALQMQREMLWFK

A0A1S3B1K1 uncharacterized protein LOC103484989 isoform X2 (e-value: 2.0e-155; identity: 78.53%)
Query:  LKGEWKRVESLVEKFPHFVRCAITRNKETVLHVAAGAKQSVFVEELVCRMTPDDMTLQDKYGNTALCFAATSRIVKIAKLMVEKNSRLPLIRTFREVTPL
        ++G+WKR E ++  +PH+VRCAITRNKETVLHVAAGAKQSVFVEELV RMT +DM L+DKYGNTALCFAATSRIVKIAKLMVEKN  LPLIRTFRE TPL
Subjt:  LKGEWKRVESLVEKFPHFVRCAITRNKETVLHVAAGAKQSVFVEELVCRMTPDDMTLQDKYGNTALCFAATSRIVKIAKLMVEKNSRLPLIRTFREVTPL

Query:  LIAISYKCRPMISYLLSVTDLSQLSSEERIELLIATIHSEFYDLSLWILQLYPELAIMKDIKNNNESALHVLARKPSAMDSTKQLKHWEKCINS------
        LIA+SYK R MISYLLSVTDLSQL+++ERIELLIATIHS+F+DLSLWIL+LYPELA+MKD KNNNE+ALHVLARKPSAMDSTKQL++W+  INS      
Subjt:  LIAISYKCRPMISYLLSVTDLSQLSSEERIELLIATIHSEFYDLSLWILQLYPELAIMKDIKNNNESALHVLARKPSAMDSTKQLKHWEKCINS------

Query:  --------CVKEI---------SNKEVIKTLARELVEVLWRYVVYELPQQTMLEFIKHPTSLLNDAAGEGNVEFLIVLIREFPDIVW--EDDDDDSKSIF
                 + EI         SNK+V KTLA +LVE LWRYVVYELPQ+ MLEFI+HPTSLLNDAAG GNVEFLIVLIRE+PDI+W  +DD+DDSKSIF
Subjt:  --------CVKEI---------SNKEVIKTLARELVEVLWRYVVYELPQQTMLEFIKHPTSLLNDAAGEGNVEFLIVLIREFPDIVW--EDDDDDSKSIF

Query:  HVAVENRLENVFNLINEIGKLNEFSTKYRTFKGKYNILHLAGNLAAPNHLNKVSGAALQMQREMLWFK
        HVAVENRLENVFNLINEIGKLNEFSTKYRTFKGKY+ILHLAGNLAAPNHLN+VSGAALQMQREMLWFK
Subjt:  HVAVENRLENVFNLINEIGKLNEFSTKYRTFKGKYNILHLAGNLAAPNHLNKVSGAALQMQREMLWFK

A0A5A7ULD5 Ankyrin repeat-containing protein (e-value: 2.6e-171; identity: 73.67%)
Query:  PVSFFSDPLEEVVVEPNLASPPPPTAALAQSEPIS-GRFSDCTPPQ-TRTMQHVPPPSDQEQDFD-CNQVRVEAQRRLLLYKSALKGEWKRVESLVEKFP
        P++F      EVVVE NL SPP  TA  A+SEP S    S+CT P+ +RT+      SD E D D   + RVE  RRLLLYKSALKG+WKR E ++  +P
Subjt:  PVSFFSDPLEEVVVEPNLASPPPPTAALAQSEPIS-GRFSDCTPPQ-TRTMQHVPPPSDQEQDFD-CNQVRVEAQRRLLLYKSALKGEWKRVESLVEKFP

Query:  HFVRCAITRNKETVLHVAAGAKQSVFVEELVCRMTPDDMTLQDKYGNTALCFAATSRIVKIAKLMVEKNSRLPLIRTFREVTPLLIAISYKCRPMISYLL
        H+VRCAITRNKETVLHVAAGAKQSVFVEELV RMT +DM L+DKYGNTALCFAATSRIVKIAKLMVEKN  LPLIRTFRE TPLLIA+SYK R MISYLL
Subjt:  HFVRCAITRNKETVLHVAAGAKQSVFVEELVCRMTPDDMTLQDKYGNTALCFAATSRIVKIAKLMVEKNSRLPLIRTFREVTPLLIAISYKCRPMISYLL

Query:  SVTDLSQLSSEERIELLIATIHSEFYDLSLWILQLYPELAIMKDIKNNNESALHVLARKPSAMDSTKQLKHWEKCINS--------------CVKEI---
        SVTDLSQL+++ERIELLIATIHS+F+DLSLWIL+LYPELA+MKD KNNNE+ALHVLARKPSAMDSTKQL++W+  INS               + EI   
Subjt:  SVTDLSQLSSEERIELLIATIHSEFYDLSLWILQLYPELAIMKDIKNNNESALHVLARKPSAMDSTKQLKHWEKCINS--------------CVKEI---

Query:  ------SNKEVIKTLARELVEVLWRYVVYELPQQTMLEFIKHPTSLLNDAAGEGNVEFLIVLIREFPDIVW--EDDDDDSKSIFHVAVENRLENVFNLIN
              SNK+V KTLA +LVE LWRYVVYELPQ+ MLEFI+HPTSLLNDAAG GNVEFLIVLIRE+PDI+W  +DD+DDSKSIFHVAVENRLENVFNLIN
Subjt:  ------SNKEVIKTLARELVEVLWRYVVYELPQQTMLEFIKHPTSLLNDAAGEGNVEFLIVLIREFPDIVW--EDDDDDSKSIFHVAVENRLENVFNLIN

Query:  EIGKLNEFSTKYRTFKGKYNILHLAGNLAAPNHLNKVSGAALQMQREMLWFK
        EIGKLNEFSTKYRTFKGKY+ILHLAGNLAAPNHLN+VSGAALQMQREMLWFK
Subjt:  EIGKLNEFSTKYRTFKGKYNILHLAGNLAAPNHLNKVSGAALQMQREMLWFK

A0A5D3BJN3 Ankyrin repeat-containing protein (e-value: 3.8e-170; identity: 74.89%)
Query:  EVVVEPNLASPPPPTAALAQSEPIS-GRFSDCTPPQ-TRTMQHVPPPSDQEQDFD-CNQVRVEAQRRLLLYKSALKGEWKRVESLVEKFPHFVRCAITRN
        EVVVE NL SPP  TA  A+SEP S    S+CT P+ +RT+      SD E D D   + RVE  RRLLLYKSALKG+WKR E ++  +PH+VRCAITRN
Subjt:  EVVVEPNLASPPPPTAALAQSEPIS-GRFSDCTPPQ-TRTMQHVPPPSDQEQDFD-CNQVRVEAQRRLLLYKSALKGEWKRVESLVEKFPHFVRCAITRN

Query:  KETVLHVAAGAKQSVFVEELVCRMTPDDMTLQDKYGNTALCFAATSRIVKIAKLMVEKNSRLPLIRTFREVTPLLIAISYKCRPMISYLLSVTDLSQLSS
        KETVLHVAAGAKQSVFVEELV RMT +DM L+DKYGNTALCFAATSRIVKIAKLMVEKN  LPLIRTFRE TPLLIA+SYK R MISYLLSVTDLSQL++
Subjt:  KETVLHVAAGAKQSVFVEELVCRMTPDDMTLQDKYGNTALCFAATSRIVKIAKLMVEKNSRLPLIRTFREVTPLLIAISYKCRPMISYLLSVTDLSQLSS

Query:  EERIELLIATIHSEFYDLSLWILQLYPELAIMKDIKNNNESALHVLARKPSAMDSTKQLKHWEKCINS--------------CVKEI---------SNKE
        +ERIELLIATIHS+F+DLSLWIL+LYPELA+MKD KNNNE+ALHVLARKPSAMDSTKQL++W+  INS               + EI         SNK+
Subjt:  EERIELLIATIHSEFYDLSLWILQLYPELAIMKDIKNNNESALHVLARKPSAMDSTKQLKHWEKCINS--------------CVKEI---------SNKE

Query:  VIKTLARELVEVLWRYVVYELPQQTMLEFIKHPTSLLNDAAGEGNVEFLIVLIREFPDIVW--EDDDDDSKSIFHVAVENRLENVFNLINEIGKLNEFST
        V KTLA +LVE LWRYVVYELPQ+ MLEFI+HPTSLLNDAAG GNVEFLIVLIRE+PDI+W  +DD+DDSKSIFHVAVENRLENVFNLINEIGKLNEFST
Subjt:  VIKTLARELVEVLWRYVVYELPQQTMLEFIKHPTSLLNDAAGEGNVEFLIVLIREFPDIVW--EDDDDDSKSIFHVAVENRLENVFNLINEIGKLNEFST

Query:  KYRTFKGKYNILHLAGNLAAPNHLNKVSGAALQMQREMLWFK
        KYRTFKGKY+ILHLAGNLAAPNHLN+VSGAALQMQREMLWFK
Subjt:  KYRTFKGKYNILHLAGNLAAPNHLNKVSGAALQMQREMLWFK

SwissProt top hits (e-value, % identity, alignment)
Q1RI31 Putative ankyrin repeat protein RBE_0902 (e-value: 6.9e-04; identity: 34.09%)
Query:  SALKGEWKRVESLVEKFPHFVRCAITRNKETVLHVAAGAKQSVFVEELVCRMTPDDMTLQDKYGNTALCFAATSRIVKIAKLMVEKNS
        +A  G  K  E+L+ K        +T N +TVL +AA        E L+ +MT   +   +K GNTAL  AA+S + KI + ++ K S
Subjt:  SALKGEWKRVESLVEKFPHFVRCAITRNKETVLHVAAGAKQSVFVEELVCRMTPDDMTLQDKYGNTALCFAATSRIVKIAKLMVEKNS

Arabidopsis top hits (e-value, % identity, alignment)
AT3G18670.1 Ankyrin repeat family protein (e-value: 2.7e-35; identity: 29.9%)
Query:  DQEQDFDCNQVRVEAQRRLLLYKSALKGEWKRVESLVEKFPHFVRCAITRNKETVLHVAAGAKQSVFVEELVCRMTPDDMTLQDKYGN--TALCFAATSR
        D+  D   +++R E    L+L+K+   GE +  +  +++ P  +   +T N +T +H A  +     VEE++ R+   +  L+ K  N  TAL +AAT  
Subjt:  DQEQDFDCNQVRVEAQRRLLLYKSALKGEWKRVESLVEKFPHFVRCAITRNKETVLHVAAGAKQSVFVEELVCRMTPDDMTLQDKYGN--TALCFAATSR

Query:  IVKIAKLMVEKNSRLPLIRTFREVTPLLIAISYKCRPMISYLLSVTDLSQL-----SSEER----IELLIATIHSEFYDLSLWILQLYPELAIMKDIKNN
        IV+IA+ +V K   L  +R  +E  P+++A  Y  + ++ YL S T LS L     S E +      L+   I    Y ++L ++Q YP+LA  +D  ++
Subjt:  IVKIAKLMVEKNSRLPLIRTFREVTPLLIAISYKCRPMISYLLSVTDLSQL-----SSEER----IELLIATIHSEFYDLSLWILQLYPELAIMKDIKNN

Query:  NESALHVLARKPSAMDSTKQLKHWEKCINSCVKEISNKEVIKTLARELVEVLWRYV-VYELPQQTMLEFIKHPTSLLNDAAGEGNVEFLIVLIREFPDIV
        N++A+  LA+ P A  S  ++          ++ +   ++    A+E+++ + + +  ++  QQ            L  A   G VE++  ++R +PDIV
Subjt:  NESALHVLARKPSAMDSTKQLKHWEKCINSCVKEISNKEVIKTLARELVEVLWRYV-VYELPQQTMLEFIKHPTSLLNDAAGEGNVEFLIVLIREFPDIV

Query:  WEDDDDDSKSIFHVAVENRLENVFNLINEIG-KLNEFSTKYRTFKGKYNILHLAGNLAAPNHLNKVSGAALQMQREMLWFKVLSYGTSPLFHFNFVYQTN
        W   +    +IF  AV  R E +F+LI  IG K N  +T +  F    N+LH A   A  + LN + GAALQMQRE+ WFK +     P  H   V   N
Subjt:  WEDDDDDSKSIFHVAVENRLENVFNLINEIG-KLNEFSTKYRTFKGKYNILHLAGNLAAPNHLNKVSGAALQMQREMLWFKVLSYGTSPLFHFNFVYQTN

Query:  YNQESTKK
          Q+ T K
Subjt:  YNQESTKK

AT3G54070.1 Ankyrin repeat family protein (e-value: 1.1e-44; identity: 34.07%)
Query:  RLLLYKSALKGEWKRVESLVEKFPHFVRCAITRNKETVLHVAAGAKQSVFVEELVCRMTPDDMTLQDKYGNTALCFAATSRIVKIAKLMVEKNSRLPLIR
        R L+YK+ L G+WK   +L+ +    V   IT N E  LH+A  AK   FV  L+  M P D++L++K GNT L FAA    ++ A++++     LP I 
Subjt:  RLLLYKSALKGEWKRVESLVEKFPHFVRCAITRNKETVLHVAAGAKQSVFVEELVCRMTPDDMTLQDKYGNTALCFAATSRIVKIAKLMVEKNSRLPLIR

Query:  TFREVTPLLIAISYKCRPMISYLLSVTDLSQLSSEERIEL----LIATIHSEFYDLSLWILQ---LY-PELAIMKDIKNNNESALHVLARKPSAMDSTKQ
          + +TP+ IA  Y    M+ YL S T +  L+ ++ + L    + A I+  F D+ LW+L+   LY  ELA+      N+  ALH+LARK SA+    Q
Subjt:  TFREVTPLLIAISYKCRPMISYLLSVTDLSQLSSEERIEL----LIATIHSEFYDLSLWILQ---LY-PELAIMKDIKNNNESALHVLARKPSAMDSTKQ

Query:  LKHWEKCINSCVKEISNKEVIKTLARELVEVLWRYVVYELPQQTMLEFIKHPTSLLNDAAGEGNVEFLIVLIREFPDIVWEDDDDDSKSIFHVAVENRLE
        L  +++  +S                      W                     LL DAA  GNVE L++LIR   D++W   D++++++FHVA   R E
Subjt:  LKHWEKCINSCVKEISNKEVIKTLARELVEVLWRYVVYELPQQTMLEFIKHPTSLLNDAAGEGNVEFLIVLIREFPDIVWEDDDDDSKSIFHVAVENRLE

Query:  NVFNLINEIGKLNEFSTKYRTFKGKYNILHLAGNLAAPNHLNKVSGAALQMQREMLWFKVL
        N+F+LI E+G + +    Y+  + K  +LHL   L   N     SGAAL MQ+E+LWFK +
Subjt:  NVFNLINEIGKLNEFSTKYRTFKGKYNILHLAGNLAAPNHLNKVSGAALQMQREMLWFKVL

AT5G04700.1 Ankyrin repeat family protein (e-value: 3.2e-20; identity: 27.2%)
Query:  ETVLHVAAGAKQSVFVEELVCRMTPDDM---TLQDKYGNTALCFAATSRIVKIAKLMVEKNSRLPLIRTFREVTPLLIAISYKCRPMISYLLSVTDLSQL
        ET L  A    +   V+EL+ RMTP+ M     Q+   +T L   A S  ++IA+ +V KN +L  I       P+++A+      M  YL + T +  L
Subjt:  ETVLHVAAGAKQSVFVEELVCRMTPDDM---TLQDKYGNTALCFAATSRIVKIAKLMVEKNSRLPLIRTFREVTPLLIAISYKCRPMISYLLSVTDLSQL

Query:  SSEERIELLIATIHSEFY---DLSLWILQLYPELAIMKDIKNNNESALHVLARKPSAMDSTKQLKHWEKCINSCVK---------EISNKEVIKTLAREL
          ++     +  +++ FY   D++L +  +   LA+ K  +  +   + VLA KP        L    + I S ++           SN++   TL R+L
Subjt:  SSEERIELLIATIHSEFY---DLSLWILQLYPELAIMKDIKNNNESALHVLARKPSAMDSTKQLKHWEKCINSCVK---------EISNKEVIKTLAREL

Query:  VEVLWRYV----VYEL--------------PQQTMLEFIKHPTSLLND----AAGEGNVEFLIVLIREFPDIVWEDDDDDSKSIFHVAVENRLENVFNLI
        ++ L ++     VY L               ++T+   +K  +  +++    A   GNV+FL+ +IR   +++W      S ++F +AVE R E VF+L+
Subjt:  VEVLWRYV----VYEL--------------PQQTMLEFIKHPTSLLND----AAGEGNVEFLIVLIREFPDIVWEDDDDDSKSIFHVAVENRLENVFNLI

Query:  NEIGKLNEFSTKYRTFKGKYNILHLAGNLAAPNHLNKVSGAALQMQREMLWFK
          +          +   G   +LHLAG  + P+ L+ V GA LQ+QRE+ WFK
Subjt:  NEIGKLNEFSTKYRTFKGKYNILHLAGNLAAPNHLNKVSGAALQMQREMLWFK

AT5G35810.1 Ankyrin repeat family protein (e-value: 1.4e-28; identity: 43.97%)
Query:  IKTLARELVEVLWRYVVYELPQQTMLEFIKHPTSLLNDAAGEGNVEFLIVLIREFPDIVWEDDDDDSKSIFHVAVENRLENVFNLINEIGKLNEFSTKYR
        ++TLA  +VE LW +V+ +LP + + +F+     LL DAA  GN+E L++LIR +PD++W   D  ++S+FH+A  NR E +FN I E+G + +    Y+
Subjt:  IKTLARELVEVLWRYVVYELPQQTMLEFIKHPTSLLNDAAGEGNVEFLIVLIREFPDIVWEDDDDDSKSIFHVAVENRLENVFNLINEIGKLNEFSTKYR

Query:  TFKGKYNILHLAGNLAAPNHLNKVSGAALQMQREMLWFKVL
          +   N+LHL   L  PN L  VSGAALQMQRE+LW+K +
Subjt:  TFKGKYNILHLAGNLAAPNHLNKVSGAALQMQREMLWFKVL

AT5G35830.1 Ankyrin repeat family protein (e-value: 9.5e-25; identity: 35.39%)
Query:  QHVPPPSDQEQDFDCNQVRVEAQRRLLLYKSALKGEWKRVESLVEKFPHFVRCAITRNKETVLHVAAGAKQSVFVEELVCRMTPDDMTLQDKYGNTALCF
        +H  P  + +++F C+  ++ A + + LY++ALKG+WK    ++ +  + +   IT   ETVLH+A  AK   FV  L+  +  +D+ L++  GNTALCF
Subjt:  QHVPPPSDQEQDFDCNQVRVEAQRRLLLYKSALKGEWKRVESLVEKFPHFVRCAITRNKETVLHVAAGAKQSVFVEELVCRMTPDDMTLQDKYGNTALCF

Query:  AATSRIVKIAKLMVEKNSRLPLIRTFREVTPLLIAISYKCRPMISYLLSVTDLSQLSSEERIELLIATIHSEFYDLSL
        AA S +V+IAK+++EKN  LP+IR   + TP+ +A  +    M+ YL   T   + + EE + L  A I ++ Y  S+
Subjt:  AATSRIVKIAKLMVEKNSRLPLIRTFREVTPLLIAISYKCRPMISYLLSVTDLSQLSSEERIELLIATIHSEFYDLSL


Sequences
CDS sequence
ATGGCCCAATCGCCTGTTTCTTTTTTCAGCGACCCATTGGAAGAGGTCGTTGTTGAACCCAACCTCGCTTCTCCGCCGCCGCCTACGGCTGCCCTCGCCCAATCCGAGCC
CATTTCCGGCCGTTTCTCCGATTGCACGCCACCCCAGACTCGGACAATGCAGCATGTTCCACCACCCTCGGATCAGGAACAGGATTTCGACTGCAACCAAGTTCGAGTTG
AGGCTCAGAGACGACTTCTTTTGTATAAATCTGCACTAAAAGGTGAGTGGAAAAGAGTTGAATCACTGGTTGAGAAGTTCCCACATTTTGTTCGTTGTGCAATAACAAGA
AACAAAGAGACTGTTCTTCATGTTGCTGCCGGAGCCAAGCAATCTGTGTTCGTGGAGGAGCTCGTCTGTAGAATGACTCCGGATGACATGACTTTGCAAGACAAATATGG
AAACACTGCCCTTTGCTTTGCTGCTACATCAAGAATTGTAAAAATTGCTAAACTCATGGTGGAAAAGAACAGTCGTCTTCCTCTGATTCGGACTTTTCGGGAAGTTACTC
CACTGCTCATTGCAATATCTTATAAATGTAGACCTATGATTTCCTACCTTTTGTCTGTCACTGATCTCAGCCAGCTATCCTCTGAAGAACGGATCGAGCTTCTTATTGCC
ACTATTCATAGCGAGTTTTATGATTTAAGCTTGTGGATTTTGCAGTTATATCCTGAGTTAGCGATTATGAAAGATATAAAGAACAATAATGAAAGTGCATTACATGTTTT
GGCTAGAAAACCTTCTGCAATGGATAGCACAAAGCAGCTAAAGCATTGGGAAAAGTGCATCAACTCTTGCGTAAAAGAGATCTCCAATAAAGAAGTGATCAAAACATTAG
CTCGTGAATTAGTTGAAGTCCTATGGAGATATGTTGTATATGAGCTTCCACAACAGACGATGCTGGAGTTTATTAAACATCCCACGAGTTTATTGAATGATGCTGCCGGC
GAAGGCAATGTTGAGTTTTTGATTGTGCTCATTCGTGAATTCCCAGATATAGTATGGGAAGACGATGACGACGATAGTAAGAGTATATTTCACGTAGCTGTTGAAAATCG
TCTTGAGAATGTGTTTAACCTAATAAATGAGATTGGTAAGCTTAATGAGTTCTCCACAAAATATAGAACTTTCAAAGGAAAGTACAACATATTGCATTTGGCTGGAAATC
TAGCAGCTCCAAACCATCTCAATAAAGTTTCAGGAGCTGCGCTTCAAATGCAACGTGAAATGCTTTGGTTTAAGGTACTTTCTTATGGAACATCCCCACTTTTTCACTTT
AATTTTGTTTATCAAACGAATTACAACCAGGAATCAACAAAGAAAACAAGAAAAATCAAATACACAAATTTACGTGGTTTACTATTAGTGTGTTAG
mRNA sequence
CCAAAATTGACCAATTCCTCAAATACAGCGGCTACGGACTGCACGCTGGTTCTTGCTGCGCCTCACTCATAACGCAAAATCCACACAAAAATGATGATGCCGCTCTAAAT
TCCATGGCCCAATCGCCTGTTTCTTTTTTCAGCGACCCATTGGAAGAGGTCGTTGTTGAACCCAACCTCGCTTCTCCGCCGCCGCCTACGGCTGCCCTCGCCCAATCCGA
GCCCATTTCCGGCCGTTTCTCCGATTGCACGCCACCCCAGACTCGGACAATGCAGCATGTTCCACCACCCTCGGATCAGGAACAGGATTTCGACTGCAACCAAGTTCGAG
TTGAGGCTCAGAGACGACTTCTTTTGTATAAATCTGCACTAAAAGGTGAGTGGAAAAGAGTTGAATCACTGGTTGAGAAGTTCCCACATTTTGTTCGTTGTGCAATAACA
AGAAACAAAGAGACTGTTCTTCATGTTGCTGCCGGAGCCAAGCAATCTGTGTTCGTGGAGGAGCTCGTCTGTAGAATGACTCCGGATGACATGACTTTGCAAGACAAATA
TGGAAACACTGCCCTTTGCTTTGCTGCTACATCAAGAATTGTAAAAATTGCTAAACTCATGGTGGAAAAGAACAGTCGTCTTCCTCTGATTCGGACTTTTCGGGAAGTTA
CTCCACTGCTCATTGCAATATCTTATAAATGTAGACCTATGATTTCCTACCTTTTGTCTGTCACTGATCTCAGCCAGCTATCCTCTGAAGAACGGATCGAGCTTCTTATT
GCCACTATTCATAGCGAGTTTTATGATTTAAGCTTGTGGATTTTGCAGTTATATCCTGAGTTAGCGATTATGAAAGATATAAAGAACAATAATGAAAGTGCATTACATGT
TTTGGCTAGAAAACCTTCTGCAATGGATAGCACAAAGCAGCTAAAGCATTGGGAAAAGTGCATCAACTCTTGCGTAAAAGAGATCTCCAATAAAGAAGTGATCAAAACAT
TAGCTCGTGAATTAGTTGAAGTCCTATGGAGATATGTTGTATATGAGCTTCCACAACAGACGATGCTGGAGTTTATTAAACATCCCACGAGTTTATTGAATGATGCTGCC
GGCGAAGGCAATGTTGAGTTTTTGATTGTGCTCATTCGTGAATTCCCAGATATAGTATGGGAAGACGATGACGACGATAGTAAGAGTATATTTCACGTAGCTGTTGAAAA
TCGTCTTGAGAATGTGTTTAACCTAATAAATGAGATTGGTAAGCTTAATGAGTTCTCCACAAAATATAGAACTTTCAAAGGAAAGTACAACATATTGCATTTGGCTGGAA
ATCTAGCAGCTCCAAACCATCTCAATAAAGTTTCAGGAGCTGCGCTTCAAATGCAACGTGAAATGCTTTGGTTTAAGGTACTTTCTTATGGAACATCCCCACTTTTTCAC
TTTAATTTTGTTTATCAAACGAATTACAACCAGGAATCAACAAAGAAAACAAGAAAAATCAAATACACAAATTTACGTGGTTTACTATTAGTGTGTTAGTTATGTCTATG
AGTATGGACTGATGGAGAAAGTCTAGGTACATCTAGTAATCTTTATGTTGCATCTTAGTACATCTGTTTGGCACATCTGGTAACTTTTGTGTTGCATTTGGTATATCTGT
GAGGGCATCTGGTAATCTTTATTTGTATCTAGAACCTTTTTTTTTTATATTTGTGAGTGTTCGGGTTAACTTATGTGCACTTAAACTATTTTTACGGGACATAAATGTCA
AGAAAACCAGTAAGAAATTAATTCCTAGCTAGGTGACCATCATGGATTGAGTCCACCACCTCTGTGTTCTTCAAGACCCTCTTGACCACTCGATCACTTCATATTTTTCA
TCTAATACATTGGTTAAGCACATCTGGTAACCTTCATGTTGCATTTGGTACATCTGTTTTATCAATGAAATAATTCAAGTTAATATGGAGATTATTGAAATGTCTTGAAT
TAATTATTTAGTACTTCAAAACCTAGGACAGGTTTCTCTTTAATAAACGATTTCTCCCCCTATCCGTTGACGTAACTGACACATCATTAGTTAGGAAACGATGTTGATTT
TTTTGCTCTTTTATTCATTGCATCTATTATAAGATATTTCTTATTATCATGACATCTCAATTTGGATCAAGGAAGTGGAGAAGATAGTTCTACCTTCCCAACTAGATGCG
AAATCCAATGATCCAGATCCAAGTGTACCGAAGTTGACACCGCGCCAATTATTCACCGAAAAGCACAAAGGGCTTCGCAAAGAGGGTGAGGAATGGATGAAAAACACAGC
AAACTCTTGCATGCTGGTAGCAACTTTGATCTCCACGGTGGTTTTTGCTGCAGCCTTCACAGTTCCTGGGGGCAATGATGATAAAACGGGCATTCCTATTTTTCAAAACA
AGTTTTTGTTTGCAGTGTTTGTGATATCAGATGCAATCGCTCTGTTTTCATCCTCAACTGCTATTCTAATGTTCTTGTCGATCTTAACTTCGCGCTACGCAGAAGAAGAT
TTCTTGCACTCGCTGCCCTCGAGATTGCTCTTTGGACTCGCGTCACTGTTCATCTCCATTGTGTTCATGGTTGCAGCTTTTAGTGCCACCTTCTTTATGATTTACCAGAA
TGCTGATATCTCCATTCCAACCATGGTTACTGCAATGGCGGTTATTCCCGTTAGTTGTTTTTGTGTTCTTCAGTTTAAATTGTGGATTGATATTTTTCACAACACTTACT
CTTCTACATTTCTTTTTAAGCCTAATCCACGTAAATTGTTCTAACCTTTTGCATGTCACATTATTGTTTTCCCTGTCTTTTATTTTTCCGTGTCAGGTTAATAAAACTTT
GGGGATAAAATGTAACTCCTTCACTCAACAAATTCTTCTTCCA
Protein sequence
MAQSPVSFFSDPLEEVVVEPNLASPPPPTAALAQSEPISGRFSDCTPPQTRTMQHVPPPSDQEQDFDCNQVRVEAQRRLLLYKSALKGEWKRVESLVEKFPHFVRCAITR
NKETVLHVAAGAKQSVFVEELVCRMTPDDMTLQDKYGNTALCFAATSRIVKIAKLMVEKNSRLPLIRTFREVTPLLIAISYKCRPMISYLLSVTDLSQLSSEERIELLIA
TIHSEFYDLSLWILQLYPELAIMKDIKNNNESALHVLARKPSAMDSTKQLKHWEKCINSCVKEISNKEVIKTLARELVEVLWRYVVYELPQQTMLEFIKHPTSLLNDAAG
EGNVEFLIVLIREFPDIVWEDDDDDSKSIFHVAVENRLENVFNLINEIGKLNEFSTKYRTFKGKYNILHLAGNLAAPNHLNKVSGAALQMQREMLWFKVLSYGTSPLFHF
NFVYQTNYNQESTKKTRKIKYTNLRGLLLVC
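
The protein sequence above should correspond to a translation of the CDS. As a small sketch for spot-checking that relationship, the code below translates a nucleotide string with the standard genetic code; the names `CODON_TABLE` and `translate` are illustrative, and using the standard nuclear code for this gene is an assumption. Only the first six codons and the last two codons of the CDS are used, to keep the example short.

```python
# Standard genetic code (NCBI translation table 1), bases ordered T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

def translate(cds):
    """Translate a coding sequence in frame 1; unknown codons become 'X'."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE.get(cds[i:i + 3], "X") for i in range(0, usable, 3))

print(translate("ATGGCCCAATCGCCTGTT"))  # first six codons of the CDS
print(translate("TGTTAG"))              # last two codons; '*' marks the stop
```

The first call yields `MAQSPV`, matching the start of the protein sequence, and the second yields `C*`, matching its final residue followed by the stop codon. Pasting the full CDS into `translate` would allow the whole protein to be compared the same way.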