; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr002815 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr002815
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionAPO protein 2, chloroplastic
Genome locationtig00001784:336..5229
RNA-Seq ExpressionSgr002815
SyntenySgr002815
Gene Ontology termsGO:0016020 - membrane (cellular component)
GO:0003723 - RNA binding (molecular function)
InterPro domainsIPR023342 - APO domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6589769.1 APO protein 2, chloroplastic, partial [Cucurbita argyrosperma subsp. sororia]3.5e-17487.14Show/hide
Query:  QKEKKPFPIPIVELRRAARERMKKSKGQPRRPVPPPKNGLLVKSMMPIAYNVFNARITLINNLKTLLKVVPVHACGFCNEIHVGPVGHPFKSCRGQNANF
        +KEKKPFP+PIVELRRAARERMKKS+GQPR+PVPPPKNGL VKSM+PIAYNVFNARITLINNLK LLKVVPVHACGFCNEIHVGPVGHPFKSCRG NANF
Subjt:  QKEKKPFPIPIVELRRAARERMKKSKGQPRRPVPPPKNGLLVKSMMPIAYNVFNARITLINNLKTLLKVVPVHACGFCNEIHVGPVGHPFKSCRGQNANF

Query:  RKGLHEWTKATLEDIFLPVEAYHLYDRLGKRISHHERYSIPRIPALVELCIQAGVDLPEYPAKRRRKPIIRISKSEFIDADESELPDPEPEVPLKPLLTE
        RKGLHEWTKA LEDIFLPVEAYHLYDRLGKRISH ERYSIPRIPA+VELCIQAGVDLPEYPAKRRRKPIIRISKSEFIDADESELPDPEPE PLKPLLTE
Subjt:  RKGLHEWTKATLEDIFLPVEAYHLYDRLGKRISHHERYSIPRIPALVELCIQAGVDLPEYPAKRRRKPIIRISKSEFIDADESELPDPEPEVPLKPLLTE

Query:  IPDSDVVAPSDEEDMAWLADQTLQAWEQMRQGAKRLMKMYP---------------WAQGTELWSFQAPTTNGQHGWQRAVLDDLIPPRYVWHVPDVNGP
        IPDSDVVAPSD+ED+AWLADQTLQAWEQMRQGAKRLMKMYP                 +     +F+    NGQHGWQRAVLDDLIPPRYVWHVPDVNGP
Subjt:  IPDSDVVAPSDEEDMAWLADQTLQAWEQMRQGAKRLMKMYP---------------WAQGTELWSFQAPTTNGQHGWQRAVLDDLIPPRYVWHVPDVNGP

Query:  PLQRELRNFYGQAPAIVEICIQAGAAIPDQYKSTMRMDVGIPLDIKEAEM
        PLQRELRNFYGQAPA+VEICIQAGAAIPD+YKSTMRMDVGIPLDIKEAEM
Subjt:  PLQRELRNFYGQAPAIVEICIQAGAAIPDQYKSTMRMDVGIPLDIKEAEM

KAG7023442.1 APO protein 2, chloroplastic [Cucurbita argyrosperma subsp. argyrosperma]3.5e-17487.14Show/hide
Query:  QKEKKPFPIPIVELRRAARERMKKSKGQPRRPVPPPKNGLLVKSMMPIAYNVFNARITLINNLKTLLKVVPVHACGFCNEIHVGPVGHPFKSCRGQNANF
        +KEKKPFP+PIVELRRAARERMKKS+GQPR+PVPPPKNGL VKSM+PIAYNVFNARITLINNLK LLKVVPVHACGFCNEIHVGPVGHPFKSCRG NANF
Subjt:  QKEKKPFPIPIVELRRAARERMKKSKGQPRRPVPPPKNGLLVKSMMPIAYNVFNARITLINNLKTLLKVVPVHACGFCNEIHVGPVGHPFKSCRGQNANF

Query:  RKGLHEWTKATLEDIFLPVEAYHLYDRLGKRISHHERYSIPRIPALVELCIQAGVDLPEYPAKRRRKPIIRISKSEFIDADESELPDPEPEVPLKPLLTE
        RKGLHEWTKA LEDIFLPVEAYHLYDRLGKRISH ERYSIPRIPA+VELCIQAGVDLPEYPAKRRRKPIIRISKSEFIDADESELPDPEPE PLKPLLTE
Subjt:  RKGLHEWTKATLEDIFLPVEAYHLYDRLGKRISHHERYSIPRIPALVELCIQAGVDLPEYPAKRRRKPIIRISKSEFIDADESELPDPEPEVPLKPLLTE

Query:  IPDSDVVAPSDEEDMAWLADQTLQAWEQMRQGAKRLMKMYP---------------WAQGTELWSFQAPTTNGQHGWQRAVLDDLIPPRYVWHVPDVNGP
        IPDSDVVAPSD+ED+AWLADQTLQAWEQMRQGAKRLMKMYP                 +     +F+    NGQHGWQRAVLDDLIPPRYVWHVPDVNGP
Subjt:  IPDSDVVAPSDEEDMAWLADQTLQAWEQMRQGAKRLMKMYP---------------WAQGTELWSFQAPTTNGQHGWQRAVLDDLIPPRYVWHVPDVNGP

Query:  PLQRELRNFYGQAPAIVEICIQAGAAIPDQYKSTMRMDVGIPLDIKEAEM
        PLQRELRNFYGQAPA+VEICIQAGAAIPD+YKSTMRMDVGIPLDIKEAEM
Subjt:  PLQRELRNFYGQAPAIVEICIQAGAAIPDQYKSTMRMDVGIPLDIKEAEM

XP_022921740.1 APO protein 2, chloroplastic [Cucurbita moschata]4.6e-17486.86Show/hide
Query:  QKEKKPFPIPIVELRRAARERMKKSKGQPRRPVPPPKNGLLVKSMMPIAYNVFNARITLINNLKTLLKVVPVHACGFCNEIHVGPVGHPFKSCRGQNANF
        +KEKKPFP+PIVELRRAARERMKKS+GQPR+PVPPPKNGL VKSM+PIAYNVFNARITLINNLK LLKVVPVHACGFCNEIHVGPVGHPFKSCRG NANF
Subjt:  QKEKKPFPIPIVELRRAARERMKKSKGQPRRPVPPPKNGLLVKSMMPIAYNVFNARITLINNLKTLLKVVPVHACGFCNEIHVGPVGHPFKSCRGQNANF

Query:  RKGLHEWTKATLEDIFLPVEAYHLYDRLGKRISHHERYSIPRIPALVELCIQAGVDLPEYPAKRRRKPIIRISKSEFIDADESELPDPEPEVPLKPLLTE
        RKGLHEWTKA LEDIFLPVEAYHLYDRLGKRISH ERYSIPRIPA+VELCIQAGVDLPEYPAKRRRKPIIRISKSEFIDADESELPDPEPE PLKPLLTE
Subjt:  RKGLHEWTKATLEDIFLPVEAYHLYDRLGKRISHHERYSIPRIPALVELCIQAGVDLPEYPAKRRRKPIIRISKSEFIDADESELPDPEPEVPLKPLLTE

Query:  IPDSDVVAPSDEEDMAWLADQTLQAWEQMRQGAKRLMKMYP---------------WAQGTELWSFQAPTTNGQHGWQRAVLDDLIPPRYVWHVPDVNGP
        IPDSDVVAPSD+ED+AWLADQTLQAWEQMRQGAKRLMKMYP                 +     +F+    NGQHGWQRAVLDDLIPPRYVWH+PDVNGP
Subjt:  IPDSDVVAPSDEEDMAWLADQTLQAWEQMRQGAKRLMKMYP---------------WAQGTELWSFQAPTTNGQHGWQRAVLDDLIPPRYVWHVPDVNGP

Query:  PLQRELRNFYGQAPAIVEICIQAGAAIPDQYKSTMRMDVGIPLDIKEAEM
        PLQRELRNFYGQAPA+VEICIQAGAAIPD+YKSTMRMDVGIPLDIKEAEM
Subjt:  PLQRELRNFYGQAPAIVEICIQAGAAIPDQYKSTMRMDVGIPLDIKEAEM

XP_023516533.1 APO protein 2, chloroplastic [Cucurbita pepo subsp. pepo]1.7e-17386.86Show/hide
Query:  QKEKKPFPIPIVELRRAARERMKKSKGQPRRPVPPPKNGLLVKSMMPIAYNVFNARITLINNLKTLLKVVPVHACGFCNEIHVGPVGHPFKSCRGQNANF
        +KEKKPFP+PIVELRRAARERMKKS+GQPR+PVPPPKNGL VKSM+PIAYNVFNARITLINNLK LLKVVPVHACGFCNEIHVGPVGHPFKSCRG NANF
Subjt:  QKEKKPFPIPIVELRRAARERMKKSKGQPRRPVPPPKNGLLVKSMMPIAYNVFNARITLINNLKTLLKVVPVHACGFCNEIHVGPVGHPFKSCRGQNANF

Query:  RKGLHEWTKATLEDIFLPVEAYHLYDRLGKRISHHERYSIPRIPALVELCIQAGVDLPEYPAKRRRKPIIRISKSEFIDADESELPDPEPEVPLKPLLTE
        RKGLHEWTKA LEDIFLPVEAYHLYDRLGKRISH ERYSIPRIPA+VELCIQAGVDLPEYPAKRRRKPIIRISKSEFIDADESELPDPEPE PLKPLLTE
Subjt:  RKGLHEWTKATLEDIFLPVEAYHLYDRLGKRISHHERYSIPRIPALVELCIQAGVDLPEYPAKRRRKPIIRISKSEFIDADESELPDPEPEVPLKPLLTE

Query:  IPDSDVVAPSDEEDMAWLADQTLQAWEQMRQGAKRLMKMYP---------------WAQGTELWSFQAPTTNGQHGWQRAVLDDLIPPRYVWHVPDVNGP
        IPDSDVVAPSD+ED+AWLADQTLQAWEQMRQGAKRLMKMYP                 +     +F+    NGQHGWQRAVLDDLIPPRYVWHVPDVNGP
Subjt:  IPDSDVVAPSDEEDMAWLADQTLQAWEQMRQGAKRLMKMYP---------------WAQGTELWSFQAPTTNGQHGWQRAVLDDLIPPRYVWHVPDVNGP

Query:  PLQRELRNFYGQAPAIVEICIQAGAAIPDQYKSTMRMDVGIPLDIKEAEM
        PLQRELRNFYGQAPA+VEICIQAGAAIPD+YKSTMRMDVGIP DIKEAEM
Subjt:  PLQRELRNFYGQAPAIVEICIQAGAAIPDQYKSTMRMDVGIPLDIKEAEM

XP_038880638.1 APO protein 2, chloroplastic [Benincasa hispida]1.9e-17588Show/hide
Query:  QKEKKPFPIPIVELRRAARERMKKSKGQPRRPVPPPKNGLLVKSMMPIAYNVFNARITLINNLKTLLKVVPVHACGFCNEIHVGPVGHPFKSCRGQNANF
        +KEKKPFP+PIVELRRAARERMKKSKGQPRRPVPPPKNGLLVKSM+PIAYNVFNARITLINNLK LL VVPVHACGFCNEIHVGPVGHPFKSCRGQNANF
Subjt:  QKEKKPFPIPIVELRRAARERMKKSKGQPRRPVPPPKNGLLVKSMMPIAYNVFNARITLINNLKTLLKVVPVHACGFCNEIHVGPVGHPFKSCRGQNANF

Query:  RKGLHEWTKATLEDIFLPVEAYHLYDRLGKRISHHERYSIPRIPALVELCIQAGVDLPEYPAKRRRKPIIRISKSEFIDADESELPDPEPEVPLKPLLTE
        RKGLHEWTKATLEDIFLPVEAYHLYDRLG+RISH ERYSIPRIPA+VELCIQAGVDLPEYPAKRRRKPIIRISKSEFIDADESELPDPEPEVPLKPLLTE
Subjt:  RKGLHEWTKATLEDIFLPVEAYHLYDRLGKRISHHERYSIPRIPALVELCIQAGVDLPEYPAKRRRKPIIRISKSEFIDADESELPDPEPEVPLKPLLTE

Query:  IPDSDVVAPSDEEDMAWLADQTLQAWEQMRQGAKRLMKMYP---------------WAQGTELWSFQAPTTNGQHGWQRAVLDDLIPPRYVWHVPDVNGP
        IPDSDVVAPSD+ED+AWLADQT+QAWE+MRQGAKRLMKMYP                 +     +F+    NGQHGWQRAVLDDLIPPRYVWHVPDVNG 
Subjt:  IPDSDVVAPSDEEDMAWLADQTLQAWEQMRQGAKRLMKMYP---------------WAQGTELWSFQAPTTNGQHGWQRAVLDDLIPPRYVWHVPDVNGP

Query:  PLQRELRNFYGQAPAIVEICIQAGAAIPDQYKSTMRMDVGIPLDIKEAEM
        PLQRELRNFYGQAPAIVEICIQAGAAIPDQYKSTMRMDVGIPLDIKEAEM
Subjt:  PLQRELRNFYGQAPAIVEICIQAGAAIPDQYKSTMRMDVGIPLDIKEAEM

TrEMBL top hitse value%identityAlignment
A0A1S3B8J0 APO protein 2, chloroplastic4.2e-17386Show/hide
Query:  QKEKKPFPIPIVELRRAARERMKKSKGQPRRPVPPPKNGLLVKSMMPIAYNVFNARITLINNLKTLLKVVPVHACGFCNEIHVGPVGHPFKSCRGQNANF
        +KEKKPFP+PIVELRRAARERMK SKGQPRRPVPPPKNGLLVKSM+PIAYNVFNARITL+NNLK LLKVVPVHACGFCNEIHVGPVGHPFKSCRGQ+ANF
Subjt:  QKEKKPFPIPIVELRRAARERMKKSKGQPRRPVPPPKNGLLVKSMMPIAYNVFNARITLINNLKTLLKVVPVHACGFCNEIHVGPVGHPFKSCRGQNANF

Query:  RKGLHEWTKATLEDIFLPVEAYHLYDRLGKRISHHERYSIPRIPALVELCIQAGVDLPEYPAKRRRKPIIRISKSEFIDADESELPDPEPEVPLKPLLTE
        RKGLHEWTKATLEDIFLPVEAYHLYDRLG+RISH ERYSIPRIPA+VELCIQAGVDLP+YPAKRRRKPI+RISKSE+IDADESELPDPEPEVPLKPLLTE
Subjt:  RKGLHEWTKATLEDIFLPVEAYHLYDRLGKRISHHERYSIPRIPALVELCIQAGVDLPEYPAKRRRKPIIRISKSEFIDADESELPDPEPEVPLKPLLTE

Query:  IPDSDVVAPSDEEDMAWLADQTLQAWEQMRQGAKRLMKMYP---------------WAQGTELWSFQAPTTNGQHGWQRAVLDDLIPPRYVWHVPDVNGP
        I DSD VAPSD ED+AWLADQTLQAWEQMR+GAKRLMKMYP                 +     +F+    NGQHGWQRAVLDDLIPPRYVWHVPD+NGP
Subjt:  IPDSDVVAPSDEEDMAWLADQTLQAWEQMRQGAKRLMKMYP---------------WAQGTELWSFQAPTTNGQHGWQRAVLDDLIPPRYVWHVPDVNGP

Query:  PLQRELRNFYGQAPAIVEICIQAGAAIPDQYKSTMRMDVGIPLDIKEAEM
        PLQRELRNFYGQAPAIVE+CIQAGAAIPDQYKSTMRMDVGIPLDIKEAEM
Subjt:  PLQRELRNFYGQAPAIVEICIQAGAAIPDQYKSTMRMDVGIPLDIKEAEM

A0A5A7SV25 APO protein 24.2e-17386Show/hide
Query:  QKEKKPFPIPIVELRRAARERMKKSKGQPRRPVPPPKNGLLVKSMMPIAYNVFNARITLINNLKTLLKVVPVHACGFCNEIHVGPVGHPFKSCRGQNANF
        +KEKKPFP+PIVELRRAARERMK SKGQPRRPVPPPKNGLLVKSM+PIAYNVFNARITL+NNLK LLKVVPVHACGFCNEIHVGPVGHPFKSCRGQ+ANF
Subjt:  QKEKKPFPIPIVELRRAARERMKKSKGQPRRPVPPPKNGLLVKSMMPIAYNVFNARITLINNLKTLLKVVPVHACGFCNEIHVGPVGHPFKSCRGQNANF

Query:  RKGLHEWTKATLEDIFLPVEAYHLYDRLGKRISHHERYSIPRIPALVELCIQAGVDLPEYPAKRRRKPIIRISKSEFIDADESELPDPEPEVPLKPLLTE
        RKGLHEWTKATLEDIFLPVEAYHLYDRLG+RISH ERYSIPRIPA+VELCIQAGVDLP+YPAKRRRKPI+RISKSE+IDADESELPDPEPEVPLKPLLTE
Subjt:  RKGLHEWTKATLEDIFLPVEAYHLYDRLGKRISHHERYSIPRIPALVELCIQAGVDLPEYPAKRRRKPIIRISKSEFIDADESELPDPEPEVPLKPLLTE

Query:  IPDSDVVAPSDEEDMAWLADQTLQAWEQMRQGAKRLMKMYP---------------WAQGTELWSFQAPTTNGQHGWQRAVLDDLIPPRYVWHVPDVNGP
        I DSD VAPSD ED+AWLADQTLQAWEQMR+GAKRLMKMYP                 +     +F+    NGQHGWQRAVLDDLIPPRYVWHVPD+NGP
Subjt:  IPDSDVVAPSDEEDMAWLADQTLQAWEQMRQGAKRLMKMYP---------------WAQGTELWSFQAPTTNGQHGWQRAVLDDLIPPRYVWHVPDVNGP

Query:  PLQRELRNFYGQAPAIVEICIQAGAAIPDQYKSTMRMDVGIPLDIKEAEM
        PLQRELRNFYGQAPAIVE+CIQAGAAIPDQYKSTMRMDVGIPLDIKEAEM
Subjt:  PLQRELRNFYGQAPAIVEICIQAGAAIPDQYKSTMRMDVGIPLDIKEAEM

A0A6J1E6N3 APO protein 2, chloroplastic2.2e-17486.86Show/hide
Query:  QKEKKPFPIPIVELRRAARERMKKSKGQPRRPVPPPKNGLLVKSMMPIAYNVFNARITLINNLKTLLKVVPVHACGFCNEIHVGPVGHPFKSCRGQNANF
        +KEKKPFP+PIVELRRAARERMKKS+GQPR+PVPPPKNGL VKSM+PIAYNVFNARITLINNLK LLKVVPVHACGFCNEIHVGPVGHPFKSCRG NANF
Subjt:  QKEKKPFPIPIVELRRAARERMKKSKGQPRRPVPPPKNGLLVKSMMPIAYNVFNARITLINNLKTLLKVVPVHACGFCNEIHVGPVGHPFKSCRGQNANF

Query:  RKGLHEWTKATLEDIFLPVEAYHLYDRLGKRISHHERYSIPRIPALVELCIQAGVDLPEYPAKRRRKPIIRISKSEFIDADESELPDPEPEVPLKPLLTE
        RKGLHEWTKA LEDIFLPVEAYHLYDRLGKRISH ERYSIPRIPA+VELCIQAGVDLPEYPAKRRRKPIIRISKSEFIDADESELPDPEPE PLKPLLTE
Subjt:  RKGLHEWTKATLEDIFLPVEAYHLYDRLGKRISHHERYSIPRIPALVELCIQAGVDLPEYPAKRRRKPIIRISKSEFIDADESELPDPEPEVPLKPLLTE

Query:  IPDSDVVAPSDEEDMAWLADQTLQAWEQMRQGAKRLMKMYP---------------WAQGTELWSFQAPTTNGQHGWQRAVLDDLIPPRYVWHVPDVNGP
        IPDSDVVAPSD+ED+AWLADQTLQAWEQMRQGAKRLMKMYP                 +     +F+    NGQHGWQRAVLDDLIPPRYVWH+PDVNGP
Subjt:  IPDSDVVAPSDEEDMAWLADQTLQAWEQMRQGAKRLMKMYP---------------WAQGTELWSFQAPTTNGQHGWQRAVLDDLIPPRYVWHVPDVNGP

Query:  PLQRELRNFYGQAPAIVEICIQAGAAIPDQYKSTMRMDVGIPLDIKEAEM
        PLQRELRNFYGQAPA+VEICIQAGAAIPD+YKSTMRMDVGIPLDIKEAEM
Subjt:  PLQRELRNFYGQAPAIVEICIQAGAAIPDQYKSTMRMDVGIPLDIKEAEM

A0A6J1JHK9 APO protein 2, chloroplastic3.2e-17386.57Show/hide
Query:  QKEKKPFPIPIVELRRAARERMKKSKGQPRRPVPPPKNGLLVKSMMPIAYNVFNARITLINNLKTLLKVVPVHACGFCNEIHVGPVGHPFKSCRGQNANF
        +KEKKPFP+PIVELRRAARERMKKS+GQPR+PVPPPKNGL VKSM+PIAYNVFNARITLINNLK LLKVVPVHACGFCNEIHVGPVGHPFKSCRG NANF
Subjt:  QKEKKPFPIPIVELRRAARERMKKSKGQPRRPVPPPKNGLLVKSMMPIAYNVFNARITLINNLKTLLKVVPVHACGFCNEIHVGPVGHPFKSCRGQNANF

Query:  RKGLHEWTKATLEDIFLPVEAYHLYDRLGKRISHHERYSIPRIPALVELCIQAGVDLPEYPAKRRRKPIIRISKSEFIDADESELPDPEPEVPLKPLLTE
        RKGLHEWT A LEDIFLPVEAYHLYDRLGKRISH ERYSIPRIPA+VELCIQAGV+LPEYPAKRRRKPIIRISKSEFIDADESELPDPEPE PLKPLLTE
Subjt:  RKGLHEWTKATLEDIFLPVEAYHLYDRLGKRISHHERYSIPRIPALVELCIQAGVDLPEYPAKRRRKPIIRISKSEFIDADESELPDPEPEVPLKPLLTE

Query:  IPDSDVVAPSDEEDMAWLADQTLQAWEQMRQGAKRLMKMYP---------------WAQGTELWSFQAPTTNGQHGWQRAVLDDLIPPRYVWHVPDVNGP
        IPDSDVVAPSD+ED+AWLADQTLQAWEQMRQGAKRLMKMYP                 +     +F+    NGQHGWQRAVLDDLIPPRYVWHVPDVNGP
Subjt:  IPDSDVVAPSDEEDMAWLADQTLQAWEQMRQGAKRLMKMYP---------------WAQGTELWSFQAPTTNGQHGWQRAVLDDLIPPRYVWHVPDVNGP

Query:  PLQRELRNFYGQAPAIVEICIQAGAAIPDQYKSTMRMDVGIPLDIKEAEM
        PLQRELRNFYGQAPA+VEICIQAGAAIPD+YKSTMRMDVGIPLDIKEAEM
Subjt:  PLQRELRNFYGQAPAIVEICIQAGAAIPDQYKSTMRMDVGIPLDIKEAEM

A0A803QAZ8 Uncharacterized protein1.4e-17668.24Show/hide
Query:  QKEKKPFPIPIVELRRAARERMKKSKGQPRRPVPPPKNGLLVKSMMPIAYNVFNARITLINNLKTLLKVVPVHACGFCNEIHVGPVGHPFKSCRGQNANF
        +KEKKPFP+PIVELRRAARER+KK KGQPR+PVPPPKNG+LVKS +PIAY+V+NARITLINNLK LLK V VHAC +CNEIHVGP GHPF+SCRGQN + 
Subjt:  QKEKKPFPIPIVELRRAARERMKKSKGQPRRPVPPPKNGLLVKSMMPIAYNVFNARITLINNLKTLLKVVPVHACGFCNEIHVGPVGHPFKSCRGQNANF

Query:  RKGLHEWTKATLEDIFLPVEAYHLYDRLGKRISHHERYSIPRIPALVELCIQAGVDLPEYPAKRRRKPIIRISKSEFIDADESELPDPEPEVPLKPLLTE
        R+GLH+WT AT EDIFLPV+AYHLYDRLGKRI H++R++IPRIPALVELCIQAGVD+P++P KRRRKPIIRI+KSEF+DADESELPDP+P+ P  P+LTE
Subjt:  RKGLHEWTKATLEDIFLPVEAYHLYDRLGKRISHHERYSIPRIPALVELCIQAGVDLPEYPAKRRRKPIIRISKSEFIDADESELPDPEPEVPLKPLLTE

Query:  IPDSDVVAPSDEEDMAWLADQTLQAWEQMRQGAKRLMKMYP---------------WAQGTELWSFQAPTTNGQHGWQRAVLDDLIPPRYVWHVPDVNGP
        IPD D+V PS+EE+   LA++TLQAWE+MR+GA++LMK+YP                 +     + +    NGQHGWQ AV+DDLIPPRYVWHVPDVNGP
Subjt:  IPDSDVVAPSDEEDMAWLADQTLQAWEQMRQGAKRLMKMYP---------------WAQGTELWSFQAPTTNGQHGWQRAVLDDLIPPRYVWHVPDVNGP

Query:  PLQRELRNFYGQAPAIVEICIQAGAAIPDQYKSTMRMDVGIPLDIKEAEMSSETQTSAMAGSQQVKAAGTADQMKGPSPAHHSTEVLHQRKKLPVCPMRM
        PLQRELR+FYGQ PA+VE+C+QAGAA+P+Q++ TMR+DVGIP D KEAEM S+     MAG++ VK        K  +   +STE LHQR KLP CPMRM
Subjt:  PLQRELRNFYGQAPAIVEICIQAGAAIPDQYKSTMRMDVGIPLDIKEAEMSSETQTSAMAGSQQVKAAGTADQMKGPSPAHHSTEVLHQRKKLPVCPMRM

Query:  AVGGFVIAATLGYFVLYSKKKPEASAIDVAKVTAGMSTPENTHP
        A+GGF IA TLGYF LYSKKKPEA+A+DVAKV  G S+PENT P
Subjt:  AVGGFVIAATLGYFVLYSKKKPEASAIDVAKVTAGMSTPENTHP

SwissProt top hitse value%identityAlignment
Q8W4A5 APO protein 2, chloroplastic1.2e-14066Show/hide
Query:  QKEKKPFPIPIVELRRAARERMKKSKGQPRRPVPPPKNGLLVKSMMPIAYNVFNARITLINNLKTLLKVVPVHACGFCNEIHVGPVGHPFKSCRGQNANF
        ++EKKPFP+PIV+LRRAARER+K +K +P+RP+PPPKNG++VKS++P+AY V+NARI LINNL  L+KVV V+ACG+CNEIHVGP GHPFKSC+G N + 
Subjt:  QKEKKPFPIPIVELRRAARERMKKSKGQPRRPVPPPKNGLLVKSMMPIAYNVFNARITLINNLKTLLKVVPVHACGFCNEIHVGPVGHPFKSCRGQNANF

Query:  RKGLHEWTKATLEDIFLPVEAYHLYDRLGKRISHHERYSIPRIPALVELCIQAGVDLPEYPAKRRRKPIIRISKSEFIDADESELPDPEPEVPLKPLLTE
        RKGLHEWT + +ED+ +P+EAYHL+DRLGKRI H ER+SIPR+PA+VELCIQ GV++PE+PAKRRRKPIIRI KSEF+DADE+ELPDPEP+ P  PLLTE
Subjt:  RKGLHEWTKATLEDIFLPVEAYHLYDRLGKRISHHERYSIPRIPALVELCIQAGVDLPEYPAKRRRKPIIRISKSEFIDADESELPDPEPEVPLKPLLTE

Query:  IPDSDVVAPSDEEDMAWLADQTLQAWEQMRQGAKRLMKMY---------------PWAQGTELWSFQAPTTNGQHGWQRAVLDDLIPPRYVWHVPDVNGP
        +P S++  PS EE+   LA++TLQAWE+MR GAK+LM+MY                  +     +F+    NGQHGWQ AVLDDLIPPRYVWHVPDVNGP
Subjt:  IPDSDVVAPSDEEDMAWLADQTLQAWEQMRQGAKRLMKMY---------------PWAQGTELWSFQAPTTNGQHGWQRAVLDDLIPPRYVWHVPDVNGP

Query:  PLQRELRNFYGQAPAIVEICIQAGAAIPDQYKSTMRMDVGIPLDIKEAEM
        P+QRELR+FYGQAPA+VEIC QAGA +P+ Y++TMR++VGIP  +KEAEM
Subjt:  PLQRELRNFYGQAPAIVEICIQAGAAIPDQYKSTMRMDVGIPLDIKEAEM

Q9FH50 APO protein 3, mitochondrial1.7e-6237.54Show/hide
Query:  PTLFQKEKKPFPIPIVELRRAARERMKKSKGQPRRPV-PPPKNGLLVKSMMPIAYNVFNARITLINNLKTLLKVVPVHACGFCNEIHVGPVGHPFKSCRG
        P   + E+KP+P P+ EL R A+E  +  K QP R +  PP NGLLV  ++ +A+ V   R  L++ L  ++  VPVH C  C E+H+G  GH  ++C G
Subjt:  PTLFQKEKKPFPIPIVELRRAARERMKKSKGQPRRPV-PPPKNGLLVKSMMPIAYNVFNARITLINNLKTLLKVVPVHACGFCNEIHVGPVGHPFKSCRG

Query:  QNANFRKGLHEWTKATLEDIFLPVEAYHLYDRLGK-RISHHERYSIPRIPALVELCIQAGVDLPEYPAKRRRKPIIRISKSEFIDADESELPDPEPEVPL
          +  R   H W +  + D+ L  + +HLYDR  K R+ H ER+++P+I A++ELCIQAGVDL ++P+KRR KP+  I   E    D  ++ D   E+ +
Subjt:  QNANFRKGLHEWTKATLEDIFLPVEAYHLYDRLGK-RISHHERYSIPRIPALVELCIQAGVDLPEYPAKRRRKPIIRISKSEFIDADESELPDPEPEVPL

Query:  KPLLTEIPDSDVVAPSDEEDMAWLADQTLQAWEQMRQGAKRLMKMY-PWA-----------QGTELWSFQA---PTTNGQHGWQRAVLDDLIPPRYVWHV
            T I + D     +++ +  L+ +T+++W +M  G ++LM+ Y  W            +G ++   +A      +G H WQ A +DD++ P YVWHV
Subjt:  KPLLTEIPDSDVVAPSDEEDMAWLADQTLQAWEQMRQGAKRLMKMY-PWA-----------QGTELWSFQA---PTTNGQHGWQRAVLDDLIPPRYVWHV

Query:  PD-VNGPPLQRELRNFYGQAPAIVEICIQAGAAIPDQYKSTMRMDVGIP
         D  +G  L   L+ FYG+APA++E+C+Q GA +PDQY S MR+DV  P
Subjt:  PD-VNGPPLQRELRNFYGQAPAIVEICIQAGAAIPDQYKSTMRMDVGIP

Q9LSZ0 APO protein 4, mitochondrial2.6e-3932.2Show/hide
Query:  VKSMMPIAYNVFNARITLINNLKTLLKVVPVHACGFCNEIHVGPVGHPFKSCRGQNANFRKGLHEWTKATLEDIFLPVEAYHLYDRLGKRISHHERYSIP
        VK ++P+A  +  AR  LI+N+  LLKV PV  C FC+E+ VG  GH  ++CR         LHEW   ++ DI +PVE+YHL++     I H ER+   
Subjt:  VKSMMPIAYNVFNARITLINNLKTLLKVVPVHACGFCNEIHVGPVGHPFKSCRGQNANFRKGLHEWTKATLEDIFLPVEAYHLYDRLGKRISHHERYSIP

Query:  RIPALVELCIQAGVDLPEYPAKRRRKPIIRISKSEFIDADESELPDPEPEVPLKPLLTEIPDSDVVAPSDEEDMAWLADQTLQAWEQMRQGAKRLMKMYP
        R+PA++ELC QAG   PE         I++ S                 E+   P ++E    + +      D+ ++    L AWE++R G K+L+ +YP
Subjt:  RIPALVELCIQAGVDLPEYPAKRRRKPIIRISKSEFIDADESELPDPEPEVPLKPLLTEIPDSDVVAPSDEEDMAWLADQTLQAWEQMRQGAKRLMKMYP

Query:  ---------------WAQGTELWSFQAPTTNGQHGWQRAVLDDLIPPRYVWHVPDVNGPPLQRELRNFYGQAPAIVEICIQAGAAIPDQYKSTMR
                         +      F+  +  G H W++A ++DL+P + VWH    +   L  E R++YG APAIV +C   GA +P +Y   M+
Subjt:  ---------------WAQGTELWSFQAPTTNGQHGWQRAVLDDLIPPRYVWHVPDVNGPPLQRELRNFYGQAPAIVEICIQAGAAIPDQYKSTMR

Q9XIR4 APO protein 1, chloroplastic7.8e-7641.27Show/hide
Query:  ISPTLFQKEKKPFPIPIVELRRAARERMKKSKGQPRRPVPPPKNGLLVKSMMPIAYNVFNARITLINNLKTLLKVVPVHACGFCNEIHVGPVGHPFKSCR
        + P L + +KKP+PIP  +++  AR+  K ++    + + PPKNGLLV +++P+A  V +    LI  L  LL VVPV AC  C  +HV  VGH  + C 
Subjt:  ISPTLFQKEKKPFPIPIVELRRAARERMKKSKGQPRRPVPPPKNGLLVKSMMPIAYNVFNARITLINNLKTLLKVVPVHACGFCNEIHVGPVGHPFKSCR

Query:  GQNANFRKGLHEWTKATLEDIFLPVEAYHLYDRLGKRISHHERYSIPRIPALVELCIQAGVDLPEYPAKRRRKPIIRISKSEFIDAD--ESELPDPEPEV
        G   + R+G H W K T+ D+ +PVE+YH+YD  G+RI H  R+   RIPALVELCIQAGV++PEYP +RR +P IR+     ID      E   P+   
Subjt:  GQNANFRKGLHEWTKATLEDIFLPVEAYHLYDRLGKRISHHERYSIPRIPALVELCIQAGVDLPEYPAKRRRKPIIRISKSEFIDAD--ESELPDPEPEV

Query:  PLKPLLTEIPDSDV---VAPSDEEDMAWLADQTLQAWEQMRQGAKRLMKMY--------------PWAQGTEL-WSFQAPTTNGQHGWQRAVLDDLIPPR
         L   L E+    V     P   ED+  +A +T+ A+E++R G  +LM+ +              PW    +L   F+    +G+HGWQ A++D++ PP 
Subjt:  PLKPLLTEIPDSDV---VAPSDEEDMAWLADQTLQAWEQMRQGAKRLMKMY--------------PWAQGTEL-WSFQAPTTNGQHGWQRAVLDDLIPPR

Query:  YVWHVPDVNGPPLQRELRNFYGQAPAIVEICIQAGAAIPDQYKSTMRMDVGIPLDIKEAEM
        YVWHV D+ G PL   LR FYG+APA+VEIC+ +GA +P +YK+ MR+D+ +P D +EA+M
Subjt:  YVWHVPDVNGPPLQRELRNFYGQAPAIVEICIQAGAAIPDQYKSTMRMDVGIPLDIKEAEM

Arabidopsis top hitse value%identityAlignment
AT1G64810.1 Arabidopsis thaliana protein of unknown function (DUF794)5.6e-7741.27Show/hide
Query:  ISPTLFQKEKKPFPIPIVELRRAARERMKKSKGQPRRPVPPPKNGLLVKSMMPIAYNVFNARITLINNLKTLLKVVPVHACGFCNEIHVGPVGHPFKSCR
        + P L + +KKP+PIP  +++  AR+  K ++    + + PPKNGLLV +++P+A  V +    LI  L  LL VVPV AC  C  +HV  VGH  + C 
Subjt:  ISPTLFQKEKKPFPIPIVELRRAARERMKKSKGQPRRPVPPPKNGLLVKSMMPIAYNVFNARITLINNLKTLLKVVPVHACGFCNEIHVGPVGHPFKSCR

Query:  GQNANFRKGLHEWTKATLEDIFLPVEAYHLYDRLGKRISHHERYSIPRIPALVELCIQAGVDLPEYPAKRRRKPIIRISKSEFIDAD--ESELPDPEPEV
        G   + R+G H W K T+ D+ +PVE+YH+YD  G+RI H  R+   RIPALVELCIQAGV++PEYP +RR +P IR+     ID      E   P+   
Subjt:  GQNANFRKGLHEWTKATLEDIFLPVEAYHLYDRLGKRISHHERYSIPRIPALVELCIQAGVDLPEYPAKRRRKPIIRISKSEFIDAD--ESELPDPEPEV

Query:  PLKPLLTEIPDSDV---VAPSDEEDMAWLADQTLQAWEQMRQGAKRLMKMY--------------PWAQGTEL-WSFQAPTTNGQHGWQRAVLDDLIPPR
         L   L E+    V     P   ED+  +A +T+ A+E++R G  +LM+ +              PW    +L   F+    +G+HGWQ A++D++ PP 
Subjt:  PLKPLLTEIPDSDV---VAPSDEEDMAWLADQTLQAWEQMRQGAKRLMKMY--------------PWAQGTEL-WSFQAPTTNGQHGWQRAVLDDLIPPR

Query:  YVWHVPDVNGPPLQRELRNFYGQAPAIVEICIQAGAAIPDQYKSTMRMDVGIPLDIKEAEM
        YVWHV D+ G PL   LR FYG+APA+VEIC+ +GA +P +YK+ MR+D+ +P D +EA+M
Subjt:  YVWHVPDVNGPPLQRELRNFYGQAPAIVEICIQAGAAIPDQYKSTMRMDVGIPLDIKEAEM

AT1G64810.2 Arabidopsis thaliana protein of unknown function (DUF794)5.6e-7741.27Show/hide
Query:  ISPTLFQKEKKPFPIPIVELRRAARERMKKSKGQPRRPVPPPKNGLLVKSMMPIAYNVFNARITLINNLKTLLKVVPVHACGFCNEIHVGPVGHPFKSCR
        + P L + +KKP+PIP  +++  AR+  K ++    + + PPKNGLLV +++P+A  V +    LI  L  LL VVPV AC  C  +HV  VGH  + C 
Subjt:  ISPTLFQKEKKPFPIPIVELRRAARERMKKSKGQPRRPVPPPKNGLLVKSMMPIAYNVFNARITLINNLKTLLKVVPVHACGFCNEIHVGPVGHPFKSCR

Query:  GQNANFRKGLHEWTKATLEDIFLPVEAYHLYDRLGKRISHHERYSIPRIPALVELCIQAGVDLPEYPAKRRRKPIIRISKSEFIDAD--ESELPDPEPEV
        G   + R+G H W K T+ D+ +PVE+YH+YD  G+RI H  R+   RIPALVELCIQAGV++PEYP +RR +P IR+     ID      E   P+   
Subjt:  GQNANFRKGLHEWTKATLEDIFLPVEAYHLYDRLGKRISHHERYSIPRIPALVELCIQAGVDLPEYPAKRRRKPIIRISKSEFIDAD--ESELPDPEPEV

Query:  PLKPLLTEIPDSDV---VAPSDEEDMAWLADQTLQAWEQMRQGAKRLMKMY--------------PWAQGTEL-WSFQAPTTNGQHGWQRAVLDDLIPPR
         L   L E+    V     P   ED+  +A +T+ A+E++R G  +LM+ +              PW    +L   F+    +G+HGWQ A++D++ PP 
Subjt:  PLKPLLTEIPDSDV---VAPSDEEDMAWLADQTLQAWEQMRQGAKRLMKMY--------------PWAQGTEL-WSFQAPTTNGQHGWQRAVLDDLIPPR

Query:  YVWHVPDVNGPPLQRELRNFYGQAPAIVEICIQAGAAIPDQYKSTMRMDVGIPLDIKEAEM
        YVWHV D+ G PL   LR FYG+APA+VEIC+ +GA +P +YK+ MR+D+ +P D +EA+M
Subjt:  YVWHVPDVNGPPLQRELRNFYGQAPAIVEICIQAGAAIPDQYKSTMRMDVGIPLDIKEAEM

AT5G57930.1 Arabidopsis thaliana protein of unknown function (DUF794)8.4e-14266Show/hide
Query:  QKEKKPFPIPIVELRRAARERMKKSKGQPRRPVPPPKNGLLVKSMMPIAYNVFNARITLINNLKTLLKVVPVHACGFCNEIHVGPVGHPFKSCRGQNANF
        ++EKKPFP+PIV+LRRAARER+K +K +P+RP+PPPKNG++VKS++P+AY V+NARI LINNL  L+KVV V+ACG+CNEIHVGP GHPFKSC+G N + 
Subjt:  QKEKKPFPIPIVELRRAARERMKKSKGQPRRPVPPPKNGLLVKSMMPIAYNVFNARITLINNLKTLLKVVPVHACGFCNEIHVGPVGHPFKSCRGQNANF

Query:  RKGLHEWTKATLEDIFLPVEAYHLYDRLGKRISHHERYSIPRIPALVELCIQAGVDLPEYPAKRRRKPIIRISKSEFIDADESELPDPEPEVPLKPLLTE
        RKGLHEWT + +ED+ +P+EAYHL+DRLGKRI H ER+SIPR+PA+VELCIQ GV++PE+PAKRRRKPIIRI KSEF+DADE+ELPDPEP+ P  PLLTE
Subjt:  RKGLHEWTKATLEDIFLPVEAYHLYDRLGKRISHHERYSIPRIPALVELCIQAGVDLPEYPAKRRRKPIIRISKSEFIDADESELPDPEPEVPLKPLLTE

Query:  IPDSDVVAPSDEEDMAWLADQTLQAWEQMRQGAKRLMKMY---------------PWAQGTELWSFQAPTTNGQHGWQRAVLDDLIPPRYVWHVPDVNGP
        +P S++  PS EE+   LA++TLQAWE+MR GAK+LM+MY                  +     +F+    NGQHGWQ AVLDDLIPPRYVWHVPDVNGP
Subjt:  IPDSDVVAPSDEEDMAWLADQTLQAWEQMRQGAKRLMKMY---------------PWAQGTELWSFQAPTTNGQHGWQRAVLDDLIPPRYVWHVPDVNGP

Query:  PLQRELRNFYGQAPAIVEICIQAGAAIPDQYKSTMRMDVGIPLDIKEAEM
        P+QRELR+FYGQAPA+VEIC QAGA +P+ Y++TMR++VGIP  +KEAEM
Subjt:  PLQRELRNFYGQAPAIVEICIQAGAAIPDQYKSTMRMDVGIPLDIKEAEM

AT5G57930.2 Arabidopsis thaliana protein of unknown function (DUF794)8.4e-14266Show/hide
Query:  QKEKKPFPIPIVELRRAARERMKKSKGQPRRPVPPPKNGLLVKSMMPIAYNVFNARITLINNLKTLLKVVPVHACGFCNEIHVGPVGHPFKSCRGQNANF
        ++EKKPFP+PIV+LRRAARER+K +K +P+RP+PPPKNG++VKS++P+AY V+NARI LINNL  L+KVV V+ACG+CNEIHVGP GHPFKSC+G N + 
Subjt:  QKEKKPFPIPIVELRRAARERMKKSKGQPRRPVPPPKNGLLVKSMMPIAYNVFNARITLINNLKTLLKVVPVHACGFCNEIHVGPVGHPFKSCRGQNANF

Query:  RKGLHEWTKATLEDIFLPVEAYHLYDRLGKRISHHERYSIPRIPALVELCIQAGVDLPEYPAKRRRKPIIRISKSEFIDADESELPDPEPEVPLKPLLTE
        RKGLHEWT + +ED+ +P+EAYHL+DRLGKRI H ER+SIPR+PA+VELCIQ GV++PE+PAKRRRKPIIRI KSEF+DADE+ELPDPEP+ P  PLLTE
Subjt:  RKGLHEWTKATLEDIFLPVEAYHLYDRLGKRISHHERYSIPRIPALVELCIQAGVDLPEYPAKRRRKPIIRISKSEFIDADESELPDPEPEVPLKPLLTE

Query:  IPDSDVVAPSDEEDMAWLADQTLQAWEQMRQGAKRLMKMY---------------PWAQGTELWSFQAPTTNGQHGWQRAVLDDLIPPRYVWHVPDVNGP
        +P S++  PS EE+   LA++TLQAWE+MR GAK+LM+MY                  +     +F+    NGQHGWQ AVLDDLIPPRYVWHVPDVNGP
Subjt:  IPDSDVVAPSDEEDMAWLADQTLQAWEQMRQGAKRLMKMY---------------PWAQGTELWSFQAPTTNGQHGWQRAVLDDLIPPRYVWHVPDVNGP

Query:  PLQRELRNFYGQAPAIVEICIQAGAAIPDQYKSTMRMDVGIPLDIKEAEM
        P+QRELR+FYGQAPA+VEIC QAGA +P+ Y++TMR++VGIP  +KEAEM
Subjt:  PLQRELRNFYGQAPAIVEICIQAGAAIPDQYKSTMRMDVGIPLDIKEAEM

AT5G61930.1 Arabidopsis thaliana protein of unknown function (DUF794)1.2e-6337.54Show/hide
Query:  PTLFQKEKKPFPIPIVELRRAARERMKKSKGQPRRPV-PPPKNGLLVKSMMPIAYNVFNARITLINNLKTLLKVVPVHACGFCNEIHVGPVGHPFKSCRG
        P   + E+KP+P P+ EL R A+E  +  K QP R +  PP NGLLV  ++ +A+ V   R  L++ L  ++  VPVH C  C E+H+G  GH  ++C G
Subjt:  PTLFQKEKKPFPIPIVELRRAARERMKKSKGQPRRPV-PPPKNGLLVKSMMPIAYNVFNARITLINNLKTLLKVVPVHACGFCNEIHVGPVGHPFKSCRG

Query:  QNANFRKGLHEWTKATLEDIFLPVEAYHLYDRLGK-RISHHERYSIPRIPALVELCIQAGVDLPEYPAKRRRKPIIRISKSEFIDADESELPDPEPEVPL
          +  R   H W +  + D+ L  + +HLYDR  K R+ H ER+++P+I A++ELCIQAGVDL ++P+KRR KP+  I   E    D  ++ D   E+ +
Subjt:  QNANFRKGLHEWTKATLEDIFLPVEAYHLYDRLGK-RISHHERYSIPRIPALVELCIQAGVDLPEYPAKRRRKPIIRISKSEFIDADESELPDPEPEVPL

Query:  KPLLTEIPDSDVVAPSDEEDMAWLADQTLQAWEQMRQGAKRLMKMY-PWA-----------QGTELWSFQA---PTTNGQHGWQRAVLDDLIPPRYVWHV
            T I + D     +++ +  L+ +T+++W +M  G ++LM+ Y  W            +G ++   +A      +G H WQ A +DD++ P YVWHV
Subjt:  KPLLTEIPDSDVVAPSDEEDMAWLADQTLQAWEQMRQGAKRLMKMY-PWA-----------QGTELWSFQA---PTTNGQHGWQRAVLDDLIPPRYVWHV

Query:  PD-VNGPPLQRELRNFYGQAPAIVEICIQAGAAIPDQYKSTMRMDVGIP
         D  +G  L   L+ FYG+APA++E+C+Q GA +PDQY S MR+DV  P
Subjt:  PD-VNGPPLQRELRNFYGQAPAIVEICIQAGAAIPDQYKSTMRMDVGIP


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCCAATTTCCCCCACACTATTCCAAAAGGAGAAGAAGCCATTTCCAATTCCCATTGTGGAATTGAGACGAGCGGCCAGGGAGAGGATGAAAAAGAGTAAAGGCCAGCC
GAGAAGACCAGTACCACCTCCAAAGAATGGGTTGCTAGTGAAGAGCATGATGCCAATTGCCTACAATGTGTTCAATGCAAGAATTACATTGATCAACAATCTCAAGACGC
TCTTAAAGGTGGTACCTGTGCATGCTTGCGGGTTTTGCAATGAAATCCATGTGGGACCTGTTGGACACCCATTCAAGTCATGCAGAGGGCAAAATGCCAATTTCCGGAAG
GGGCTTCATGAGTGGACAAAGGCAACTCTTGAAGACATATTCCTGCCAGTAGAAGCATACCACCTCTACGATCGCCTTGGAAAACGTATTTCTCATCACGAAAGATACTC
AATTCCACGAATTCCAGCACTAGTTGAGCTTTGCATTCAAGCGGGTGTTGATCTTCCCGAATATCCTGCAAAGAGGAGAAGGAAACCAATCATCCGAATCTCAAAAAGTG
AATTCATCGATGCAGATGAAAGTGAACTACCAGACCCAGAACCAGAAGTACCTCTGAAACCACTTTTAACAGAAATACCAGATTCTGATGTTGTGGCCCCCAGTGATGAA
GAAGATATGGCTTGGCTTGCTGACCAGACGCTTCAAGCATGGGAGCAAATGAGGCAAGGAGCCAAAAGACTCATGAAGATGTATCCATGGGCACAAGGCACAGAACTGTG
GAGCTTTCAAGCACCAACAACGAACGGGCAACATGGTTGGCAAAGGGCCGTGCTCGATGACTTGATACCACCCAGATACGTGTGGCACGTTCCAGACGTAAATGGACCTC
CATTGCAGAGGGAGCTAAGAAACTTTTATGGGCAGGCGCCTGCAATAGTTGAAATATGCATTCAAGCTGGCGCTGCTATCCCAGATCAGTACAAATCAACCATGCGAATG
GATGTGGGAATTCCCTTGGACATTAAAGAGGCTGAAATGAGTTCGGAGACACAGACATCGGCAATGGCAGGATCACAGCAAGTGAAGGCTGCAGGAACAGCCGACCAAAT
GAAGGGGCCGTCTCCGGCTCACCATTCCACCGAGGTGCTTCACCAGCGAAAGAAACTCCCCGTTTGTCCTATGAGAATGGCGGTCGGCGGTTTCGTCATCGCCGCCACGC
TCGGTTACTTCGTCCTCTACTCCAAGAAGAAGCCCGAGGCTTCCGCCATTGATGTCGCCAAAGTCACCGCCGGCATGTCTACTCCGGAGAACACCCATCCCCGCATACAC
TGGCTTTGTTATCTCGAGCTTCGTGAAGAAATAGAAAGCAGCATAGAGAAAGAGGTTGCAGAGGAACCTGAAGTCAGCGCAAGTGACAATCAGGATGAGGAAGACAATGA
AGAGGAAACCAAATATATAGTAAAATTGGTGCACCATATGGAGGTGAGGATGAAAAATAACTCAATAAAGACTGCCCCAAAAGGGAGTATCCCTCCAATGAGAACGGAGA
AAGTTGGGTTCATGTACCAAGCTTGTTCTGGGATCTGTCTTGGGATCTTGTTGGTCTTCACAGGGTCCTCAATTGCTGGCTTCTTAAACCCAACATAA
mRNA sequenceShow/hide mRNA sequence
ATGCCAATTTCCCCCACACTATTCCAAAAGGAGAAGAAGCCATTTCCAATTCCCATTGTGGAATTGAGACGAGCGGCCAGGGAGAGGATGAAAAAGAGTAAAGGCCAGCC
GAGAAGACCAGTACCACCTCCAAAGAATGGGTTGCTAGTGAAGAGCATGATGCCAATTGCCTACAATGTGTTCAATGCAAGAATTACATTGATCAACAATCTCAAGACGC
TCTTAAAGGTGGTACCTGTGCATGCTTGCGGGTTTTGCAATGAAATCCATGTGGGACCTGTTGGACACCCATTCAAGTCATGCAGAGGGCAAAATGCCAATTTCCGGAAG
GGGCTTCATGAGTGGACAAAGGCAACTCTTGAAGACATATTCCTGCCAGTAGAAGCATACCACCTCTACGATCGCCTTGGAAAACGTATTTCTCATCACGAAAGATACTC
AATTCCACGAATTCCAGCACTAGTTGAGCTTTGCATTCAAGCGGGTGTTGATCTTCCCGAATATCCTGCAAAGAGGAGAAGGAAACCAATCATCCGAATCTCAAAAAGTG
AATTCATCGATGCAGATGAAAGTGAACTACCAGACCCAGAACCAGAAGTACCTCTGAAACCACTTTTAACAGAAATACCAGATTCTGATGTTGTGGCCCCCAGTGATGAA
GAAGATATGGCTTGGCTTGCTGACCAGACGCTTCAAGCATGGGAGCAAATGAGGCAAGGAGCCAAAAGACTCATGAAGATGTATCCATGGGCACAAGGCACAGAACTGTG
GAGCTTTCAAGCACCAACAACGAACGGGCAACATGGTTGGCAAAGGGCCGTGCTCGATGACTTGATACCACCCAGATACGTGTGGCACGTTCCAGACGTAAATGGACCTC
CATTGCAGAGGGAGCTAAGAAACTTTTATGGGCAGGCGCCTGCAATAGTTGAAATATGCATTCAAGCTGGCGCTGCTATCCCAGATCAGTACAAATCAACCATGCGAATG
GATGTGGGAATTCCCTTGGACATTAAAGAGGCTGAAATGAGTTCGGAGACACAGACATCGGCAATGGCAGGATCACAGCAAGTGAAGGCTGCAGGAACAGCCGACCAAAT
GAAGGGGCCGTCTCCGGCTCACCATTCCACCGAGGTGCTTCACCAGCGAAAGAAACTCCCCGTTTGTCCTATGAGAATGGCGGTCGGCGGTTTCGTCATCGCCGCCACGC
TCGGTTACTTCGTCCTCTACTCCAAGAAGAAGCCCGAGGCTTCCGCCATTGATGTCGCCAAAGTCACCGCCGGCATGTCTACTCCGGAGAACACCCATCCCCGCATACAC
TGGCTTTGTTATCTCGAGCTTCGTGAAGAAATAGAAAGCAGCATAGAGAAAGAGGTTGCAGAGGAACCTGAAGTCAGCGCAAGTGACAATCAGGATGAGGAAGACAATGA
AGAGGAAACCAAATATATAGTAAAATTGGTGCACCATATGGAGGTGAGGATGAAAAATAACTCAATAAAGACTGCCCCAAAAGGGAGTATCCCTCCAATGAGAACGGAGA
AAGTTGGGTTCATGTACCAAGCTTGTTCTGGGATCTGTCTTGGGATCTTGTTGGTCTTCACAGGGTCCTCAATTGCTGGCTTCTTAAACCCAACATAA
Protein sequenceShow/hide protein sequence
MPISPTLFQKEKKPFPIPIVELRRAARERMKKSKGQPRRPVPPPKNGLLVKSMMPIAYNVFNARITLINNLKTLLKVVPVHACGFCNEIHVGPVGHPFKSCRGQNANFRK
GLHEWTKATLEDIFLPVEAYHLYDRLGKRISHHERYSIPRIPALVELCIQAGVDLPEYPAKRRRKPIIRISKSEFIDADESELPDPEPEVPLKPLLTEIPDSDVVAPSDE
EDMAWLADQTLQAWEQMRQGAKRLMKMYPWAQGTELWSFQAPTTNGQHGWQRAVLDDLIPPRYVWHVPDVNGPPLQRELRNFYGQAPAIVEICIQAGAAIPDQYKSTMRM
DVGIPLDIKEAEMSSETQTSAMAGSQQVKAAGTADQMKGPSPAHHSTEVLHQRKKLPVCPMRMAVGGFVIAATLGYFVLYSKKKPEASAIDVAKVTAGMSTPENTHPRIH
WLCYLELREEIESSIEKEVAEEPEVSASDNQDEEDNEEETKYIVKLVHHMEVRMKNNSIKTAPKGSIPPMRTEKVGFMYQACSGICLGILLVFTGSSIAGFLNPT