; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmaCh04G019900 (gene) of Cucurbita maxima (Rimu) v1.1 genome

Gene IDCmaCh04G019900
OrganismCucurbita maxima Rimu (Cucurbita maxima (Rimu) v1.1)
Descriptionaspartic proteinase CDR1-like
Genome locationCma_Chr04:11790583..11798286
RNA-Seq ExpressionCmaCh04G019900
SyntenyCmaCh04G019900
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0009772 - photosynthetic electron transport in photosystem II (biological process)
GO:0004190 - aspartic-type endopeptidase activity (molecular function)
GO:0045156 - electron transporter, transferring electrons within the cyclic electron transport pathway of photosynthesis activity (molecular function)
InterPro domainsIPR000484 - Photosynthetic reaction centre, L/M
IPR001969 - Aspartic peptidase, active site
IPR021109 - Aspartic peptidase domain superfamily
IPR032799 - Xylanase inhibitor, C-terminal
IPR032861 - Xylanase inhibitor, N-terminal
IPR033121 - Peptidase family A1 domain
IPR034161 - Pepsin-like domain, plant
IPR036854 - Photosystem II protein D1/D2 superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022975010.1 aspartic proteinase CDR1-like [Cucurbita maxima]1.3e-17996.04Show/hide
Query:  MTAKSQILPETSEFIVKIVVGTPPTEVHAILDTGSDLFWAQFRPCAKCYQQTNPIYDPSKSSTFRTLSCKSPQCHLRGSGAACSGTDTCKYNYGYGSGST
        M AKSQI PETSEFIVKI +GTPPTEVHAILDTGSDLFWAQ RPCAKCYQQTNPIYDPSKSSTFR LSCKSPQCHLRGSGAACSGTDTCKY+YGYGSGST
Subjt:  MTAKSQILPETSEFIVKIVVGTPPTEVHAILDTGSDLFWAQFRPCAKCYQQTNPIYDPSKSSTFRTLSCKSPQCHLRGSGAACSGTDTCKYNYGYGSGST

Query:  QGELASEKMAVTSRSGATTPFPGVVFGCGHNNSGTFNANEMGLIGFGRGAISFVSQIGPSVGGRKFSLCLMPYNTDPRISSSLSIGSGSEVKGPGVITAQ
        QGELASEKM VTSRSGATT FPGVVFGCGHNNSGTFNANEMGLIGFGRGAISFVSQIGPSVGGRKFSLCLMPYNTDPRISSSLSIGSGSEVKGPGVITAQ
Subjt:  QGELASEKMAVTSRSGATTPFPGVVFGCGHNNSGTFNANEMGLIGFGRGAISFVSQIGPSVGGRKFSLCLMPYNTDPRISSSLSIGSGSEVKGPGVITAQ

Query:  LVRTSDQTSYSLTLTGISVRKTLVPYSTSRPPAKGNAVLDTGTPPTLLPKELYGRLAAEVRRHIPSKPIDDDTLCYKDNLGDLVMTLHFDGGVDLRLSTV
        LVRTSDQTSYSLTLTGISVRKTLVPYSTS PPAKGNAVLDTGTPPTLLPKELYGRLAAEVRRHIPSKPIDDDTLCYKDNLGDLVMTLHFDGGVDLRLST+
Subjt:  LVRTSDQTSYSLTLTGISVRKTLVPYSTSRPPAKGNAVLDTGTPPTLLPKELYGRLAAEVRRHIPSKPIDDDTLCYKDNLGDLVMTLHFDGGVDLRLSTV

Query:  QTFNKMPDGSFCFTAMGVDDKDALIGNN
        QTFNKMPDGSFCFTAM VDDKDALIGN+
Subjt:  QTFNKMPDGSFCFTAMGVDDKDALIGNN

XP_022975011.1 aspartic proteinase CDR1-like [Cucurbita maxima]3.3e-18096.65Show/hide
Query:  MTAKSQILPETSEFIVKIVVGTPPTEVHAILDTGSDLFWAQFRPCAKCYQQTNPIYDPSKSSTFRTLSCKSPQCHLRGSGAACSGTDTCKYNYGYGSGST
        M AKSQI PETSEFIVKI +GTPPTEVHAILDTGSDLFWAQ RPCAKCYQQTNPIYDPSKSSTFRTLSCK PQCHLRGSGAACSGTDTCKY+YGYGSGST
Subjt:  MTAKSQILPETSEFIVKIVVGTPPTEVHAILDTGSDLFWAQFRPCAKCYQQTNPIYDPSKSSTFRTLSCKSPQCHLRGSGAACSGTDTCKYNYGYGSGST

Query:  QGELASEKMAVTSRSGATTPFPGVVFGCGHNNSGTFNANEMGLIGFGRGAISFVSQIGPSVGGRKFSLCLMPYNTDPRISSSLSIGSGSEVKGPGVITAQ
        QGELASEKMAVTSRSGATTPFPGVVFGCGHNNSGTFNANEMGLIGFGRGAISFVSQIGPSVGGRKFSLCLMPYNTDPRISSSLSIGSGSEVKGPGVITAQ
Subjt:  QGELASEKMAVTSRSGATTPFPGVVFGCGHNNSGTFNANEMGLIGFGRGAISFVSQIGPSVGGRKFSLCLMPYNTDPRISSSLSIGSGSEVKGPGVITAQ

Query:  LVRTSDQTSYSLTLTGISVRKTLVPYSTSRPPAKGNAVLDTGTPPTLLPKELYGRLAAEVRRHIPSKPIDDDTLCYKDNLGDLVMTLHFDGGVDLRLSTV
        LVRTSDQTSYSLTLTGISVRKTLVPYSTS PPAKGNAVLDTGTPPTLLPKELYGRLAAEVRRHIPSKPIDDDTLCYKDNLGDLVMTLHFD GVDLRLSTV
Subjt:  LVRTSDQTSYSLTLTGISVRKTLVPYSTSRPPAKGNAVLDTGTPPTLLPKELYGRLAAEVRRHIPSKPIDDDTLCYKDNLGDLVMTLHFDGGVDLRLSTV

Query:  QTFNKMPDGSFCFTAMGVDDKDALIGNN
        QTFNKMPDGSFCFTAMGVD KDALIGN+
Subjt:  QTFNKMPDGSFCFTAMGVDDKDALIGNN

XP_022984181.1 aspartic proteinase CDR1-like [Cucurbita maxima]1.1e-18699.7Show/hide
Query:  MTAKSQILPETSEFIVKIVVGTPPTEVHAILDTGSDLFWAQFRPCAKCYQQTNPIYDPSKSSTFRTLSCKSPQCHLRGSGAACSGTDTCKYNYGYGSGST
        MTAKSQILPETSEFIVKIVVGTPPTEVHAILDTGSDLFWAQFRPCAKCYQQTNPIYDPSKSSTFRTLSCKSPQCHLRGSGAACSGTDTCKYNYGYGSGST
Subjt:  MTAKSQILPETSEFIVKIVVGTPPTEVHAILDTGSDLFWAQFRPCAKCYQQTNPIYDPSKSSTFRTLSCKSPQCHLRGSGAACSGTDTCKYNYGYGSGST

Query:  QGELASEKMAVTSRSGATTPFPGVVFGCGHNNSGTFNANEMGLIGFGRGAISFVSQIGPSVGGRKFSLCLMPYNTDPRISSSLSIGSGSEVKGPGVITAQ
        QGELASEKMAVTSRSGATTPFPGVVFGCGHNNSGTFNANEMGLIGFGRGAISFVSQIGPSVGGRKFSLCLMPYNTDPRISSSLSIGSGSEVKGPGVITAQ
Subjt:  QGELASEKMAVTSRSGATTPFPGVVFGCGHNNSGTFNANEMGLIGFGRGAISFVSQIGPSVGGRKFSLCLMPYNTDPRISSSLSIGSGSEVKGPGVITAQ

Query:  LVRTSDQTSYSLTLTGISVRKTLVPYSTSRPPAKGNAVLDTGTPPTLLPKELYGRLAAEVRRHIPSKPIDDDTLCYKDNLGDLVMTLHFDGGVDLRLSTV
        LVRTSDQTSYSLTLTGISVRKTLVPYSTSRPPAKGNAVLDTGTPPTLLPKELYGRLAAEVRRHIPSKPIDDDTLCYKDNLGDLVMTLHFDGGVDLRLSTV
Subjt:  LVRTSDQTSYSLTLTGISVRKTLVPYSTSRPPAKGNAVLDTGTPPTLLPKELYGRLAAEVRRHIPSKPIDDDTLCYKDNLGDLVMTLHFDGGVDLRLSTV

Query:  QTFNKMPDGSFCFTAMGVDDKDALIGNN
        QTFNKMPDGSFCFTAMGVDDKDALIGN+
Subjt:  QTFNKMPDGSFCFTAMGVDDKDALIGNN

XP_022987324.1 aspartic proteinase CDR1-like [Cucurbita maxima]1.6e-18297.56Show/hide
Query:  MTAKSQILPETSEFIVKIVVGTPPTEVHAILDTGSDLFWAQFRPCAKCYQQTNPIYDPSKSSTFRTLSCKSPQCHLRGSGAACSGTDTCKYNYGYGSGST
        M AKSQI PETSEFIVKI +GTPPTEVHAILDTGSDLFWAQ RPCAKCYQQTNPIYDPSKSSTFRTLSCKSPQCHLRGSGAACSGTDTCKY+YGYGSGST
Subjt:  MTAKSQILPETSEFIVKIVVGTPPTEVHAILDTGSDLFWAQFRPCAKCYQQTNPIYDPSKSSTFRTLSCKSPQCHLRGSGAACSGTDTCKYNYGYGSGST

Query:  QGELASEKMAVTSRSGATTPFPGVVFGCGHNNSGTFNANEMGLIGFGRGAISFVSQIGPSVGGRKFSLCLMPYNTDPRISSSLSIGSGSEVKGPGVITAQ
        QGELASEKMAVTSRSGATTPFPGVVFGCGHNNSGTFNANEMGLIGFGRGAISFVSQIGPSVGGRKFSLCLMPYNTDPRISSSLSIGSGSEVKGPGVITAQ
Subjt:  QGELASEKMAVTSRSGATTPFPGVVFGCGHNNSGTFNANEMGLIGFGRGAISFVSQIGPSVGGRKFSLCLMPYNTDPRISSSLSIGSGSEVKGPGVITAQ

Query:  LVRTSDQTSYSLTLTGISVRKTLVPYSTSRPPAKGNAVLDTGTPPTLLPKELYGRLAAEVRRHIPSKPIDDDTLCYKDNLGDLVMTLHFDGGVDLRLSTV
        LVRTSDQTSYSLTLTGISVRKTLVPYSTS PPAKGNAVLDTGTPPTLLPKELYGRLAAEVRRHIPSKPIDDDTLCYKDNLGDLVMTLHFDGGVDLRLSTV
Subjt:  LVRTSDQTSYSLTLTGISVRKTLVPYSTSRPPAKGNAVLDTGTPPTLLPKELYGRLAAEVRRHIPSKPIDDDTLCYKDNLGDLVMTLHFDGGVDLRLSTV

Query:  QTFNKMPDGSFCFTAMGVDDKDALIGNN
        QTFNKMPDGSFCFTAMGVDDKDALIGN+
Subjt:  QTFNKMPDGSFCFTAMGVDDKDALIGNN

XP_023525703.1 aspartic proteinase CDR1-like [Cucurbita pepo subsp. pepo]6.2e-17995.73Show/hide
Query:  MTAKSQILPETSEFIVKIVVGTPPTEVHAILDTGSDLFWAQFRPCAKCYQQTNPIYDPSKSSTFRTLSCKSPQCHLRGSGAACSGTDTCKYNYGYGSGST
        M AKSQI PETSEF+VKI VGTPPTEVHAILDTGSDLFWAQ RPCAKCY+QTNPIYDPSKSSTFRTLSCKSPQCHLRGSGAACSGTDTCKY YGYGSGST
Subjt:  MTAKSQILPETSEFIVKIVVGTPPTEVHAILDTGSDLFWAQFRPCAKCYQQTNPIYDPSKSSTFRTLSCKSPQCHLRGSGAACSGTDTCKYNYGYGSGST

Query:  QGELASEKMAVTSRSGATTPFPGVVFGCGHNNSGTFNANEMGLIGFGRGAISFVSQIGPSVGGRKFSLCLMPYNTDPRISSSLSIGSGSEVKGPGVITAQ
        QGELA+EKMAVTSRSGATTPF GVVFGCGHNNSGTFNANEMGLIGFGRGAISFVSQIGPSVGGRKFSLCLMPYNTDPRISSSLSIGSGSEVKGPGVITAQ
Subjt:  QGELASEKMAVTSRSGATTPFPGVVFGCGHNNSGTFNANEMGLIGFGRGAISFVSQIGPSVGGRKFSLCLMPYNTDPRISSSLSIGSGSEVKGPGVITAQ

Query:  LVRTSDQTSYSLTLTGISVRKTLVPYSTSRPPAKGNAVLDTGTPPTLLPKELYGRLAAEVRRHIPSKPIDDDTLCYKDNLGDLVMTLHFDGGVDLRLSTV
        LVRT+DQTSYSLTLTGISV KTLVPYSTS PPAKGNAVLDTGTPPTLLPKELYGRLAAEVRRHIPSKPIDDDTLCYKDNLGDLVMTLHFDGGVDLRLSTV
Subjt:  LVRTSDQTSYSLTLTGISVRKTLVPYSTSRPPAKGNAVLDTGTPPTLLPKELYGRLAAEVRRHIPSKPIDDDTLCYKDNLGDLVMTLHFDGGVDLRLSTV

Query:  QTFNKMPDGSFCFTAMGVDDKDALIGNN
        QTFNKMPDGSFCFTAMGVDDKDA+IGN+
Subjt:  QTFNKMPDGSFCFTAMGVDDKDALIGNN

TrEMBL top hitse value%identityAlignment
A0A6J1ID07 aspartic proteinase CDR1-like1.6e-18096.65Show/hide
Query:  MTAKSQILPETSEFIVKIVVGTPPTEVHAILDTGSDLFWAQFRPCAKCYQQTNPIYDPSKSSTFRTLSCKSPQCHLRGSGAACSGTDTCKYNYGYGSGST
        M AKSQI PETSEFIVKI +GTPPTEVHAILDTGSDLFWAQ RPCAKCYQQTNPIYDPSKSSTFRTLSCK PQCHLRGSGAACSGTDTCKY+YGYGSGST
Subjt:  MTAKSQILPETSEFIVKIVVGTPPTEVHAILDTGSDLFWAQFRPCAKCYQQTNPIYDPSKSSTFRTLSCKSPQCHLRGSGAACSGTDTCKYNYGYGSGST

Query:  QGELASEKMAVTSRSGATTPFPGVVFGCGHNNSGTFNANEMGLIGFGRGAISFVSQIGPSVGGRKFSLCLMPYNTDPRISSSLSIGSGSEVKGPGVITAQ
        QGELASEKMAVTSRSGATTPFPGVVFGCGHNNSGTFNANEMGLIGFGRGAISFVSQIGPSVGGRKFSLCLMPYNTDPRISSSLSIGSGSEVKGPGVITAQ
Subjt:  QGELASEKMAVTSRSGATTPFPGVVFGCGHNNSGTFNANEMGLIGFGRGAISFVSQIGPSVGGRKFSLCLMPYNTDPRISSSLSIGSGSEVKGPGVITAQ

Query:  LVRTSDQTSYSLTLTGISVRKTLVPYSTSRPPAKGNAVLDTGTPPTLLPKELYGRLAAEVRRHIPSKPIDDDTLCYKDNLGDLVMTLHFDGGVDLRLSTV
        LVRTSDQTSYSLTLTGISVRKTLVPYSTS PPAKGNAVLDTGTPPTLLPKELYGRLAAEVRRHIPSKPIDDDTLCYKDNLGDLVMTLHFD GVDLRLSTV
Subjt:  LVRTSDQTSYSLTLTGISVRKTLVPYSTSRPPAKGNAVLDTGTPPTLLPKELYGRLAAEVRRHIPSKPIDDDTLCYKDNLGDLVMTLHFDGGVDLRLSTV

Query:  QTFNKMPDGSFCFTAMGVDDKDALIGNN
        QTFNKMPDGSFCFTAMGVD KDALIGN+
Subjt:  QTFNKMPDGSFCFTAMGVDDKDALIGNN

A0A6J1IFH1 aspartic proteinase CDR1-like6.1e-18096.04Show/hide
Query:  MTAKSQILPETSEFIVKIVVGTPPTEVHAILDTGSDLFWAQFRPCAKCYQQTNPIYDPSKSSTFRTLSCKSPQCHLRGSGAACSGTDTCKYNYGYGSGST
        M AKSQI PETSEFIVKI +GTPPTEVHAILDTGSDLFWAQ RPCAKCYQQTNPIYDPSKSSTFR LSCKSPQCHLRGSGAACSGTDTCKY+YGYGSGST
Subjt:  MTAKSQILPETSEFIVKIVVGTPPTEVHAILDTGSDLFWAQFRPCAKCYQQTNPIYDPSKSSTFRTLSCKSPQCHLRGSGAACSGTDTCKYNYGYGSGST

Query:  QGELASEKMAVTSRSGATTPFPGVVFGCGHNNSGTFNANEMGLIGFGRGAISFVSQIGPSVGGRKFSLCLMPYNTDPRISSSLSIGSGSEVKGPGVITAQ
        QGELASEKM VTSRSGATT FPGVVFGCGHNNSGTFNANEMGLIGFGRGAISFVSQIGPSVGGRKFSLCLMPYNTDPRISSSLSIGSGSEVKGPGVITAQ
Subjt:  QGELASEKMAVTSRSGATTPFPGVVFGCGHNNSGTFNANEMGLIGFGRGAISFVSQIGPSVGGRKFSLCLMPYNTDPRISSSLSIGSGSEVKGPGVITAQ

Query:  LVRTSDQTSYSLTLTGISVRKTLVPYSTSRPPAKGNAVLDTGTPPTLLPKELYGRLAAEVRRHIPSKPIDDDTLCYKDNLGDLVMTLHFDGGVDLRLSTV
        LVRTSDQTSYSLTLTGISVRKTLVPYSTS PPAKGNAVLDTGTPPTLLPKELYGRLAAEVRRHIPSKPIDDDTLCYKDNLGDLVMTLHFDGGVDLRLST+
Subjt:  LVRTSDQTSYSLTLTGISVRKTLVPYSTSRPPAKGNAVLDTGTPPTLLPKELYGRLAAEVRRHIPSKPIDDDTLCYKDNLGDLVMTLHFDGGVDLRLSTV

Query:  QTFNKMPDGSFCFTAMGVDDKDALIGNN
        QTFNKMPDGSFCFTAM VDDKDALIGN+
Subjt:  QTFNKMPDGSFCFTAMGVDDKDALIGNN

A0A6J1IJB2 aspartic proteinase CDR1-like1.5e-17895.43Show/hide
Query:  MTAKSQILPETSEFIVKIVVGTPPTEVHAILDTGSDLFWAQFRPCAKCYQQTNPIYDPSKSSTFRTLSCKSPQCHLRGSGAACSGTDTCKYNYGYGSGST
        M AKSQI PETSEFIVKI VGTPPTEVHAILDTGSDLFWAQ+ PCAKCYQQTNPIYDPSKSSTFRTLSCKSPQCHLRG GAACS TDTCKY+Y YGS ST
Subjt:  MTAKSQILPETSEFIVKIVVGTPPTEVHAILDTGSDLFWAQFRPCAKCYQQTNPIYDPSKSSTFRTLSCKSPQCHLRGSGAACSGTDTCKYNYGYGSGST

Query:  QGELASEKMAVTSRSGATTPFPGVVFGCGHNNSGTFNANEMGLIGFGRGAISFVSQIGPSVGGRKFSLCLMPYNTDPRISSSLSIGSGSEVKGPGVITAQ
        QGELASEKM VTSRSGATTPFPGVVFGCGHNNSGTFNANEMGLIGFGRGAISFVSQIGPSVGGRKFS+CLMPYNTDPRISSSLSIGSGSEVKGPGVITAQ
Subjt:  QGELASEKMAVTSRSGATTPFPGVVFGCGHNNSGTFNANEMGLIGFGRGAISFVSQIGPSVGGRKFSLCLMPYNTDPRISSSLSIGSGSEVKGPGVITAQ

Query:  LVRTSDQTSYSLTLTGISVRKTLVPYSTSRPPAKGNAVLDTGTPPTLLPKELYGRLAAEVRRHIPSKPIDDDTLCYKDNLGDLVMTLHFDGGVDLRLSTV
        LVRTSDQTSYSLTLTGISVRKTLVPYSTS PP KGNAVLDTGTPPTLLPKELYGRLAAEVRRHIPSKPIDDDTLCYKDNLGDLVMTLHFDGGVDLRLSTV
Subjt:  LVRTSDQTSYSLTLTGISVRKTLVPYSTSRPPAKGNAVLDTGTPPTLLPKELYGRLAAEVRRHIPSKPIDDDTLCYKDNLGDLVMTLHFDGGVDLRLSTV

Query:  QTFNKMPDGSFCFTAMGVDDKDALIGNN
        QTFNKMPDGSFCFTAMGVDDKDALIGN+
Subjt:  QTFNKMPDGSFCFTAMGVDDKDALIGNN

A0A6J1J4I9 aspartic proteinase CDR1-like5.1e-18799.7Show/hide
Query:  MTAKSQILPETSEFIVKIVVGTPPTEVHAILDTGSDLFWAQFRPCAKCYQQTNPIYDPSKSSTFRTLSCKSPQCHLRGSGAACSGTDTCKYNYGYGSGST
        MTAKSQILPETSEFIVKIVVGTPPTEVHAILDTGSDLFWAQFRPCAKCYQQTNPIYDPSKSSTFRTLSCKSPQCHLRGSGAACSGTDTCKYNYGYGSGST
Subjt:  MTAKSQILPETSEFIVKIVVGTPPTEVHAILDTGSDLFWAQFRPCAKCYQQTNPIYDPSKSSTFRTLSCKSPQCHLRGSGAACSGTDTCKYNYGYGSGST

Query:  QGELASEKMAVTSRSGATTPFPGVVFGCGHNNSGTFNANEMGLIGFGRGAISFVSQIGPSVGGRKFSLCLMPYNTDPRISSSLSIGSGSEVKGPGVITAQ
        QGELASEKMAVTSRSGATTPFPGVVFGCGHNNSGTFNANEMGLIGFGRGAISFVSQIGPSVGGRKFSLCLMPYNTDPRISSSLSIGSGSEVKGPGVITAQ
Subjt:  QGELASEKMAVTSRSGATTPFPGVVFGCGHNNSGTFNANEMGLIGFGRGAISFVSQIGPSVGGRKFSLCLMPYNTDPRISSSLSIGSGSEVKGPGVITAQ

Query:  LVRTSDQTSYSLTLTGISVRKTLVPYSTSRPPAKGNAVLDTGTPPTLLPKELYGRLAAEVRRHIPSKPIDDDTLCYKDNLGDLVMTLHFDGGVDLRLSTV
        LVRTSDQTSYSLTLTGISVRKTLVPYSTSRPPAKGNAVLDTGTPPTLLPKELYGRLAAEVRRHIPSKPIDDDTLCYKDNLGDLVMTLHFDGGVDLRLSTV
Subjt:  LVRTSDQTSYSLTLTGISVRKTLVPYSTSRPPAKGNAVLDTGTPPTLLPKELYGRLAAEVRRHIPSKPIDDDTLCYKDNLGDLVMTLHFDGGVDLRLSTV

Query:  QTFNKMPDGSFCFTAMGVDDKDALIGNN
        QTFNKMPDGSFCFTAMGVDDKDALIGN+
Subjt:  QTFNKMPDGSFCFTAMGVDDKDALIGNN

A0A6J1JIJ5 aspartic proteinase CDR1-like7.7e-18397.56Show/hide
Query:  MTAKSQILPETSEFIVKIVVGTPPTEVHAILDTGSDLFWAQFRPCAKCYQQTNPIYDPSKSSTFRTLSCKSPQCHLRGSGAACSGTDTCKYNYGYGSGST
        M AKSQI PETSEFIVKI +GTPPTEVHAILDTGSDLFWAQ RPCAKCYQQTNPIYDPSKSSTFRTLSCKSPQCHLRGSGAACSGTDTCKY+YGYGSGST
Subjt:  MTAKSQILPETSEFIVKIVVGTPPTEVHAILDTGSDLFWAQFRPCAKCYQQTNPIYDPSKSSTFRTLSCKSPQCHLRGSGAACSGTDTCKYNYGYGSGST

Query:  QGELASEKMAVTSRSGATTPFPGVVFGCGHNNSGTFNANEMGLIGFGRGAISFVSQIGPSVGGRKFSLCLMPYNTDPRISSSLSIGSGSEVKGPGVITAQ
        QGELASEKMAVTSRSGATTPFPGVVFGCGHNNSGTFNANEMGLIGFGRGAISFVSQIGPSVGGRKFSLCLMPYNTDPRISSSLSIGSGSEVKGPGVITAQ
Subjt:  QGELASEKMAVTSRSGATTPFPGVVFGCGHNNSGTFNANEMGLIGFGRGAISFVSQIGPSVGGRKFSLCLMPYNTDPRISSSLSIGSGSEVKGPGVITAQ

Query:  LVRTSDQTSYSLTLTGISVRKTLVPYSTSRPPAKGNAVLDTGTPPTLLPKELYGRLAAEVRRHIPSKPIDDDTLCYKDNLGDLVMTLHFDGGVDLRLSTV
        LVRTSDQTSYSLTLTGISVRKTLVPYSTS PPAKGNAVLDTGTPPTLLPKELYGRLAAEVRRHIPSKPIDDDTLCYKDNLGDLVMTLHFDGGVDLRLSTV
Subjt:  LVRTSDQTSYSLTLTGISVRKTLVPYSTSRPPAKGNAVLDTGTPPTLLPKELYGRLAAEVRRHIPSKPIDDDTLCYKDNLGDLVMTLHFDGGVDLRLSTV

Query:  QTFNKMPDGSFCFTAMGVDDKDALIGNN
        QTFNKMPDGSFCFTAMGVDDKDALIGN+
Subjt:  QTFNKMPDGSFCFTAMGVDDKDALIGNN

SwissProt top hitse value%identityAlignment
B2XWK0 Photosystem II D2 protein6.3e-4943.95Show/hide
Query:  IGNNSNDFFFEIMDDSLRRDRFGLVGWSSLLLFPCAYFVVVGGWFT-------------------GCNFLTAAVSTPANNLAHSFFVVTMGPEAQGD---
        +  + ND  F+IMDD LRRDRF  VGWS LLLFPCAYF  VGGWFT                   GCNFLTAAVSTPAN+LAHS  ++  GPEAQGD   
Subjt:  IGNNSNDFFFEIMDDSLRRDRFGLVGWSSLLLFPCAYFVVVGGWFT-------------------GCNFLTAAVSTPANNLAHSFFVVTMGPEAQGD---

Query:  -------------------------RFESLVSITIR-----------------------WSVDFCFSPRFGLIG-FMMVMMLQ-----------------
                                 +FE   S+ +R                           + F+P FG+   F  ++  Q                 
Subjt:  -------------------------RFESLVSITIR-----------------------WSVDFCFSPRFGLIG-FMMVMMLQ-----------------

Query:  ---------IHSVLLTQLKLK----------------EETYSMVTANRFWYQIFGVAFSTKGWLHFFLLFVPVTALWPSALGVVGLALNLRAYDFGSQEI
                 IH   +     +                EETYSMVTANRFW QIFGVAFS K WLHFF+LFVPVT LW SALGVVGLALNLRAYDF SQEI
Subjt:  ---------IHSVLLTQLKLK----------------EETYSMVTANRFWYQIFGVAFSTKGWLHFFLLFVPVTALWPSALGVVGLALNLRAYDFGSQEI

Query:  RAAREDPEFETFYT
        RAA EDPEFETFYT
Subjt:  RAAREDPEFETFYT

Q3EBM5 Probable aspartic protease At2g356155.1e-5135.34Show/hide
Query:  KSQILPETSEFIVKIVVGTPPTEVHAILDTGSDLFWAQFRPCAKCYQQTNPIYDPSKSSTFRTLSCKSPQCH-LRGSGAAC-SGTDTCKYNYGYGSGS-T
        +S ++    EF + I +GTPP +V AI DTGSDL W Q +PC +CY++  PI+D  KSST+++  C S  C  L  +   C    + CKY Y YG  S +
Subjt:  KSQILPETSEFIVKIVVGTPPTEVHAILDTGSDLFWAQFRPCAKCYQQTNPIYDPSKSSTFRTLSCKSPQCH-LRGSGAAC-SGTDTCKYNYGYGSGS-T

Query:  QGELASEKMAVTSRSGATTPFPGVVFGCGHNNSGTFNANEMGLIGFGRGAISFVSQIGPSVGGRKFSLCLMPYNTDPRISSSLSIGS----GSEVKGPGV
        +G++A+E +++ S SG+   FPG VFGCG+NN GTF+    G+IG G G +S +SQ+G S+  +KFS CL   +     +S +++G+     S  K  GV
Subjt:  QGELASEKMAVTSRSGATTPFPGVVFGCGHNNSGTFNANEMGLIGFGRGAISFVSQIGPSVGGRKFSLCLMPYNTDPRISSSLSIGS----GSEVKGPGV

Query:  ITAQLVRTSDQTSYSLTLTGISVRKTLVPYSTSR---------PPAKGNAVLDTGTPPTLLPKELYGRLAAEVRRHIP-SKPIDDD----TLCYKDNLGD
        ++  LV     T Y LTL  ISV K  +PY+ S              GN ++D+GT  TLL    + + ++ V   +  +K + D     + C+K    +
Subjt:  ITAQLVRTSDQTSYSLTLTGISVRKTLVPYSTSR---------PPAKGNAVLDTGTPPTLLPKELYGRLAAEVRRHIP-SKPIDDD----TLCYKDNLGD

Query:  L---VMTLHFDGGVDLRLSTVQTFNKMPDGSFCFTAMGVDDKDALIGN
        +    +T+HF  G D+RLS +  F K+ +   C + +   +  A+ GN
Subjt:  L---VMTLHFDGGVDLRLSTVQTFNKMPDGSFCFTAMGVDDKDALIGN

Q56P05 Photosystem II D2 protein6.3e-4943.95Show/hide
Query:  IGNNSNDFFFEIMDDSLRRDRFGLVGWSSLLLFPCAYFVVVGGWFT-------------------GCNFLTAAVSTPANNLAHSFFVVTMGPEAQGD---
        +  + ND  F+IMDD LRRDRF  VGWS LLLFPCAYF  VGGWFT                   GCNFLTAAVSTPAN+LAHS  ++  GPEAQGD   
Subjt:  IGNNSNDFFFEIMDDSLRRDRFGLVGWSSLLLFPCAYFVVVGGWFT-------------------GCNFLTAAVSTPANNLAHSFFVVTMGPEAQGD---

Query:  -------------------------RFESLVSITIR-----------------------WSVDFCFSPRFGLIG-FMMVMMLQ-----------------
                                 +FE   S+ +R                           + F+P FG+   F  ++  Q                 
Subjt:  -------------------------RFESLVSITIR-----------------------WSVDFCFSPRFGLIG-FMMVMMLQ-----------------

Query:  ---------IHSVLLTQLKLK----------------EETYSMVTANRFWYQIFGVAFSTKGWLHFFLLFVPVTALWPSALGVVGLALNLRAYDFGSQEI
                 IH   +     +                EETYSMVTANRFW QIFGVAFS K WLHFF+LFVPVT LW SALGVVGLALNLRAYDF SQEI
Subjt:  ---------IHSVLLTQLKLK----------------EETYSMVTANRFWYQIFGVAFSTKGWLHFFLLFVPVTALWPSALGVVGLALNLRAYDFGSQEI

Query:  RAAREDPEFETFYT
        RAA EDPEFETFYT
Subjt:  RAAREDPEFETFYT

Q68S11 Photosystem II D2 protein4.8e-4944.59Show/hide
Query:  FEIMDDSLRRDRFGLVGWSSLLLFPCAYFVVVGGWFT-------------------GCNFLTAAVSTPANNLAHSFFVVTMGPEAQGD------------
        F+IMDD LRRDRF  VGWS LLLFPCAYF  +GGWFT                   GCNFLTAAVSTPAN+LAHS  ++  GPEAQGD            
Subjt:  FEIMDDSLRRDRFGLVGWSSLLLFPCAYFVVVGGWFT-------------------GCNFLTAAVSTPANNLAHSFFVVTMGPEAQGD------------

Query:  ----------------RFESLVSITIR-----------------------WSVDFCFSPRFGLIG-FMMVMMLQ--------------------------
                        +FE   S+ +R                           + F+P FG+   F  ++  Q                          
Subjt:  ----------------RFESLVSITIR-----------------------WSVDFCFSPRFGLIG-FMMVMMLQ--------------------------

Query:  IHSVLLTQLKLK----------------EETYSMVTANRFWYQIFGVAFSTKGWLHFFLLFVPVTALWPSALGVVGLALNLRAYDFGSQEIRAAREDPEF
        IH   +  L  +                EETYSMVTANRFW QIFGVAFS K WLHFF+LFVPVT LW SALGVVGLALNLRAYDF SQEIRAA EDPEF
Subjt:  IHSVLLTQLKLK----------------EETYSMVTANRFWYQIFGVAFSTKGWLHFFLLFVPVTALWPSALGVVGLALNLRAYDFGSQEIRAAREDPEF

Query:  ETFYT
        ETFYT
Subjt:  ETFYT

Q6XBF8 Aspartic proteinase CDR11.5e-5538.36Show/hide
Query:  TSEFIVKIVVGTPPTEVHAILDTGSDLFWAQFRPCAKCYQQTNPIYDPSKSSTFRTLSCKSPQCHLRGSGAACSGTD-TCKYNYGYGSGS-TQGELASEK
        + E+++ + +GTPP  + AI DTGSDL W Q  PC  CY Q +P++DP  SST++ +SC S QC    + A+CS  D TC Y+  YG  S T+G +A + 
Subjt:  TSEFIVKIVVGTPPTEVHAILDTGSDLFWAQFRPCAKCYQQTNPIYDPSKSSTFRTLSCKSPQCHLRGSGAACSGTD-TCKYNYGYGSGS-TQGELASEK

Query:  MAVTSRSGATTPFPGVVFGCGHNNSGTFNANEMGLIGFGRGAISFVSQIGPSVGGRKFSLCLMPYNTDPRISSSLSIGSGSEVKGPGVITAQLV-RTSDQ
        + + S          ++ GCGHNN+GTFN    G++G G G +S + Q+G S+ G KFS CL+P  +    +S ++ G+ + V G GV++  L+ + S +
Subjt:  MAVTSRSGATTPFPGVVFGCGHNNSGTFNANEMGLIGFGRGAISFVSQIGPSVGGRKFSLCLMPYNTDPRISSSLSIGSGSEVKGPGVITAQLV-RTSDQ

Query:  TSYSLTLTGISVRKTLVPYSTS-RPPAKGNAVLDTGTPPTLLPKELYGRLAAEVRRHIPSKPIDDD----TLCYKDNLGDL---VMTLHFDGGVDLRLST
        T Y LTL  ISV    + YS S    ++GN ++D+GT  TLLP E Y  L   V   I ++   D     +LCY    GDL   V+T+HFD G D++L +
Subjt:  TSYSLTLTGISVRKTLVPYSTS-RPPAKGNAVLDTGTPPTLLPKELYGRLAAEVRRHIPSKPIDDD----TLCYKDNLGDL---VMTLHFDGGVDLRLST

Query:  VQTFNKMPDGSFCFTAMG
           F ++ +   CF   G
Subjt:  VQTFNKMPDGSFCFTAMG

Arabidopsis top hitse value%identityAlignment
AT1G31450.1 Eukaryotic aspartyl protease family protein3.9e-5436.13Show/hide
Query:  KSQILPETSEFIVKIVVGTPPTEVHAILDTGSDLFWAQFRPCAKCYQQTNPIYDPSKSSTFRTLSCKSPQCH-LRGSGAAC-SGTDTCKYNYGYGSGS-T
        +S ++    E+ + I +GTPP++V AI DTGSDL W Q +PC +CY+Q +P++D  KSST++T SC S  C  L      C    D CKY Y YG  S T
Subjt:  KSQILPETSEFIVKIVVGTPPTEVHAILDTGSDLFWAQFRPCAKCYQQTNPIYDPSKSSTFRTLSCKSPQCH-LRGSGAAC-SGTDTCKYNYGYGSGS-T

Query:  QGELASEKMAVTSRSGATTPFPGVVFGCGHNNSGTFNANEMGLIGFGRGAISFVSQIGPSVGGRKFSLCLMPYNTDPRISSSLSIGSGSEVKGP----GV
        +G++A+E +++ S SG++  FPG VFGCG+NN GTF     G+IG G G +S VSQ+G S+ G+KFS CL         +S +++G+ S    P      
Subjt:  QGELASEKMAVTSRSGATTPFPGVVFGCGHNNSGTFNANEMGLIGFGRGAISFVSQIGPSVGGRKFSLCLMPYNTDPRISSSLSIGSGSEVKGP----GV

Query:  ITAQLVRTSDQTSYSLTLTGISVRKTLVPY-------STSRPPAKGNAVLDTGTPPTLLPKELYGRLAAEVRRHIP-SKPIDDD----TLCYKD---NLG
        +T  L++   +T Y LTL  ++V KT +PY       +       GN ++D+GT  TLL    Y      V   +  +K + D     T C+K     +G
Subjt:  ITAQLVRTSDQTSYSLTLTGISVRKTLVPY-------STSRPPAKGNAVLDTGTPPTLLPKELYGRLAAEVRRHIP-SKPIDDD----TLCYKD---NLG

Query:  DLVMTLHFDGGVDLRLSTVQTFNKMPDGSFCFTAMGVDDKDALIGN
           +T+HF    D++LS +  F K+ + + C + +   +  A+ GN
Subjt:  DLVMTLHFDGGVDLRLSTVQTFNKMPDGSFCFTAMGVDDKDALIGN

AT1G64830.1 Eukaryotic aspartyl protease family protein8.4e-5736.75Show/hide
Query:  KSQILPETSEFIVKIVVGTPPTEVHAILDTGSDLFWAQFRPCAKCYQQTNPIYDPSKSSTFRTLSCKSPQCHLRGSGAACSGTDTCKYNYGYGSGS-TQG
        +S I     E+++ I +GTPP  + AI DTGSDL W Q  PC  CYQQT+P++DP +SST+R +SC S QC      +  +  +TC Y   YG  S T+G
Subjt:  KSQILPETSEFIVKIVVGTPPTEVHAILDTGSDLFWAQFRPCAKCYQQTNPIYDPSKSSTFRTLSCKSPQCHLRGSGAACSGTDTCKYNYGYGSGS-TQG

Query:  ELASEKMAVTSRSGATTPFPGVVFGCGHNNSGTFNANEMGLIGFGRGAISFVSQIGPSVGGRKFSLCLMPYNTDPRISSSLSIGSGSEVKGPGVITAQLV
        ++A + + + S          ++ GCGH N+GTF+    G+IG G G+ S VSQ+  S+ G KFS CL+P+ ++  ++S ++ G+   V G GV++  +V
Subjt:  ELASEKMAVTSRSGATTPFPGVVFGCGHNNSGTFNANEMGLIGFGRGAISFVSQIGPSVGGRKFSLCLMPYNTDPRISSSLSIGSGSEVKGPGVITAQLV

Query:  RTSDQTSYSLTLTGISVRKTLVPY-STSRPPAKGNAVLDTGTPPTLLPKELYGRLAAEVRRHIPSKPIDDD----TLCYKDNLGDLV--MTLHFDGGVDL
        +    T Y L L  ISV    + + ST     +GN V+D+GT  TLLP   Y  L + V   I ++ + D     +LCY+D+    V  +T+HF GG D+
Subjt:  RTSDQTSYSLTLTGISVRKTLVPY-STSRPPAKGNAVLDTGTPPTLLPKELYGRLAAEVRRHIPSKPIDDD----TLCYKDNLGDLV--MTLHFDGGVDL

Query:  RLSTVQTFNKMPDGSFCFTAMGVDDKDALIGN
        +L  + TF  + +   CF A   +++  + GN
Subjt:  RLSTVQTFNKMPDGSFCFTAMGVDDKDALIGN

AT2G35615.1 Eukaryotic aspartyl protease family protein3.7e-5235.34Show/hide
Query:  KSQILPETSEFIVKIVVGTPPTEVHAILDTGSDLFWAQFRPCAKCYQQTNPIYDPSKSSTFRTLSCKSPQCH-LRGSGAAC-SGTDTCKYNYGYGSGS-T
        +S ++    EF + I +GTPP +V AI DTGSDL W Q +PC +CY++  PI+D  KSST+++  C S  C  L  +   C    + CKY Y YG  S +
Subjt:  KSQILPETSEFIVKIVVGTPPTEVHAILDTGSDLFWAQFRPCAKCYQQTNPIYDPSKSSTFRTLSCKSPQCH-LRGSGAAC-SGTDTCKYNYGYGSGS-T

Query:  QGELASEKMAVTSRSGATTPFPGVVFGCGHNNSGTFNANEMGLIGFGRGAISFVSQIGPSVGGRKFSLCLMPYNTDPRISSSLSIGS----GSEVKGPGV
        +G++A+E +++ S SG+   FPG VFGCG+NN GTF+    G+IG G G +S +SQ+G S+  +KFS CL   +     +S +++G+     S  K  GV
Subjt:  QGELASEKMAVTSRSGATTPFPGVVFGCGHNNSGTFNANEMGLIGFGRGAISFVSQIGPSVGGRKFSLCLMPYNTDPRISSSLSIGS----GSEVKGPGV

Query:  ITAQLVRTSDQTSYSLTLTGISVRKTLVPYSTSR---------PPAKGNAVLDTGTPPTLLPKELYGRLAAEVRRHIP-SKPIDDD----TLCYKDNLGD
        ++  LV     T Y LTL  ISV K  +PY+ S              GN ++D+GT  TLL    + + ++ V   +  +K + D     + C+K    +
Subjt:  ITAQLVRTSDQTSYSLTLTGISVRKTLVPYSTSR---------PPAKGNAVLDTGTPPTLLPKELYGRLAAEVRRHIP-SKPIDDD----TLCYKDNLGD

Query:  L---VMTLHFDGGVDLRLSTVQTFNKMPDGSFCFTAMGVDDKDALIGN
        +    +T+HF  G D+RLS +  F K+ +   C + +   +  A+ GN
Subjt:  L---VMTLHFDGGVDLRLSTVQTFNKMPDGSFCFTAMGVDDKDALIGN

AT5G33340.1 Eukaryotic aspartyl protease family protein1.1e-5638.36Show/hide
Query:  TSEFIVKIVVGTPPTEVHAILDTGSDLFWAQFRPCAKCYQQTNPIYDPSKSSTFRTLSCKSPQCHLRGSGAACSGTD-TCKYNYGYGSGS-TQGELASEK
        + E+++ + +GTPP  + AI DTGSDL W Q  PC  CY Q +P++DP  SST++ +SC S QC    + A+CS  D TC Y+  YG  S T+G +A + 
Subjt:  TSEFIVKIVVGTPPTEVHAILDTGSDLFWAQFRPCAKCYQQTNPIYDPSKSSTFRTLSCKSPQCHLRGSGAACSGTD-TCKYNYGYGSGS-TQGELASEK

Query:  MAVTSRSGATTPFPGVVFGCGHNNSGTFNANEMGLIGFGRGAISFVSQIGPSVGGRKFSLCLMPYNTDPRISSSLSIGSGSEVKGPGVITAQLV-RTSDQ
        + + S          ++ GCGHNN+GTFN    G++G G G +S + Q+G S+ G KFS CL+P  +    +S ++ G+ + V G GV++  L+ + S +
Subjt:  MAVTSRSGATTPFPGVVFGCGHNNSGTFNANEMGLIGFGRGAISFVSQIGPSVGGRKFSLCLMPYNTDPRISSSLSIGSGSEVKGPGVITAQLV-RTSDQ

Query:  TSYSLTLTGISVRKTLVPYSTS-RPPAKGNAVLDTGTPPTLLPKELYGRLAAEVRRHIPSKPIDDD----TLCYKDNLGDL---VMTLHFDGGVDLRLST
        T Y LTL  ISV    + YS S    ++GN ++D+GT  TLLP E Y  L   V   I ++   D     +LCY    GDL   V+T+HFD G D++L +
Subjt:  TSYSLTLTGISVRKTLVPYSTS-RPPAKGNAVLDTGTPPTLLPKELYGRLAAEVRRHIPSKPIDDD----TLCYKDNLGDL---VMTLHFDGGVDLRLST

Query:  VQTFNKMPDGSFCFTAMG
           F ++ +   CF   G
Subjt:  VQTFNKMPDGSFCFTAMG

ATCG00270.1 photosystem II reaction center protein D1.3e-4944.26Show/hide
Query:  FEIMDDSLRRDRFGLVGWSSLLLFPCAYFVVVGGWFT-------------------GCNFLTAAVSTPANNLAHSFFVVTMGPEAQGD------------
        F+IMDD LRRDRF  VGWS LLLFPCAYF  +GGWFT                   GCNFLTAAVSTPAN+LAHS  ++  GPEAQGD            
Subjt:  FEIMDDSLRRDRFGLVGWSSLLLFPCAYFVVVGGWFT-------------------GCNFLTAAVSTPANNLAHSFFVVTMGPEAQGD------------

Query:  ----------------RFESLVSITIR-----------------------WSVDFCFSPRFGLIG-FMMVMMLQ--------------------------
                        +FE   S+ +R                           + F+P FG+   F  ++  Q                          
Subjt:  ----------------RFESLVSITIR-----------------------WSVDFCFSPRFGLIG-FMMVMMLQ--------------------------

Query:  IHSVLLTQLKLK----------------EETYSMVTANRFWYQIFGVAFSTKGWLHFFLLFVPVTALWPSALGVVGLALNLRAYDFGSQEIRAAREDPEF
        IH   +     +                EETYSMVTANRFW QIFGVAFS K WLHFF+LFVPVT LW SALGVVGLALNLRAYDF SQEIRAA EDPEF
Subjt:  IHSVLLTQLKLK----------------EETYSMVTANRFWYQIFGVAFSTKGWLHFFLLFVPVTALWPSALGVVGLALNLRAYDFGSQEIRAAREDPEF

Query:  ETFYT
        ETFYT
Subjt:  ETFYT


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGACAGCCAAATCACAAATTTTGCCAGAAACCAGCGAATTTATAGTGAAAATCGTCGTCGGAACGCCACCGACAGAGGTGCATGCAATCCTCGACACTGGCAGTGATTT
ATTTTGGGCTCAGTTTCGTCCATGTGCGAAATGTTACCAGCAAACGAATCCGATTTACGACCCTTCAAAATCGTCAACCTTTCGAACCCTTTCTTGCAAGTCGCCGCAGT
GCCATTTGAGGGGGTCTGGTGCGGCGTGCTCCGGCACCGACACGTGTAAGTACAACTATGGGTATGGAAGCGGATCTACGCAGGGAGAATTGGCGAGTGAAAAAATGGCT
GTAACTTCGAGGTCTGGAGCAACGACGCCGTTTCCTGGGGTGGTGTTTGGTTGCGGACATAATAATAGTGGAACGTTTAATGCCAATGAAATGGGATTGATCGGATTTGG
AAGAGGAGCGATTTCCTTCGTTTCTCAGATAGGTCCATCGGTCGGCGGTAGAAAGTTCTCCCTTTGTCTGATGCCATACAACACCGACCCGAGAATCTCAAGTAGCCTCT
CTATTGGGTCGGGTTCTGAAGTTAAAGGACCCGGAGTCATCACAGCCCAACTCGTTCGAACATCCGACCAGACATCTTACTCTCTCACCCTCACGGGAATCTCCGTCAGA
AAAACCCTCGTTCCGTACAGTACGTCGAGACCTCCGGCCAAGGGGAATGCGGTTCTCGATACCGGCACGCCGCCGACTCTCCTCCCCAAAGAATTGTACGGACGATTGGC
TGCTGAAGTTCGGCGGCATATCCCGTCAAAGCCCATTGACGATGATACTCTTTGCTACAAAGATAATTTGGGGGATTTGGTGATGACTTTGCACTTCGACGGCGGCGTGG
ATCTGCGATTGAGTACGGTTCAGACTTTCAATAAGATGCCGGATGGGTCCTTTTGCTTCACCGCGATGGGCGTTGACGACAAGGACGCACTCATCGGGAACAACTCAAAT
GATTTCTTCTTCGAGATTATGGATGACTCGTTACGGAGGGACCGGTTCGGTCTTGTAGGTTGGTCCAGTCTATTGCTCTTTCCTTGTGCCTACTTTGTTGTCGTAGGAGG
TTGGTTTACAGGATGCAACTTCTTAACCGCCGCTGTTTCGACTCCTGCTAATAATTTAGCACACTCTTTCTTTGTTGTTACTATGGGGCCTGAAGCACAAGGAGATAGAT
TTGAATCCTTGGTGTCCATTACCATTAGGTGGTCTGTGGACTTTTGTTTCTCACCACGGTTCGGACTAATAGGTTTCATGATGGTGATGATGCTTCAAATACATTCCGTG
CTTTTAACCCAACTCAAGCTGAAGGAAGAAACTTATTCAATGGTCACTGCTAACCGCTTTTGGTACCAAATCTTTGGGGTTGCTTTTTCCACTAAAGGTTGGTTACATTT
CTTTCTGTTATTTGTACCAGTAACTGCTTTATGGCCGAGTGCTCTTGGAGTAGTTGGTCTGGCCCTGAACCTACGTGCCTATGACTTCGGTTCTCAGGAAATCCGTGCAG
CAAGGGAAGATCCTGAATTTGAGACTTTCTATACCCTACTATACCCAGAATCTTCTCTTAACCGAAGGTATTCGTGCTTGGATGGCGGCTCAAGATCAGCCTCATGA
mRNA sequenceShow/hide mRNA sequence
ATGACAGCCAAATCACAAATTTTGCCAGAAACCAGCGAATTTATAGTGAAAATCGTCGTCGGAACGCCACCGACAGAGGTGCATGCAATCCTCGACACTGGCAGTGATTT
ATTTTGGGCTCAGTTTCGTCCATGTGCGAAATGTTACCAGCAAACGAATCCGATTTACGACCCTTCAAAATCGTCAACCTTTCGAACCCTTTCTTGCAAGTCGCCGCAGT
GCCATTTGAGGGGGTCTGGTGCGGCGTGCTCCGGCACCGACACGTGTAAGTACAACTATGGGTATGGAAGCGGATCTACGCAGGGAGAATTGGCGAGTGAAAAAATGGCT
GTAACTTCGAGGTCTGGAGCAACGACGCCGTTTCCTGGGGTGGTGTTTGGTTGCGGACATAATAATAGTGGAACGTTTAATGCCAATGAAATGGGATTGATCGGATTTGG
AAGAGGAGCGATTTCCTTCGTTTCTCAGATAGGTCCATCGGTCGGCGGTAGAAAGTTCTCCCTTTGTCTGATGCCATACAACACCGACCCGAGAATCTCAAGTAGCCTCT
CTATTGGGTCGGGTTCTGAAGTTAAAGGACCCGGAGTCATCACAGCCCAACTCGTTCGAACATCCGACCAGACATCTTACTCTCTCACCCTCACGGGAATCTCCGTCAGA
AAAACCCTCGTTCCGTACAGTACGTCGAGACCTCCGGCCAAGGGGAATGCGGTTCTCGATACCGGCACGCCGCCGACTCTCCTCCCCAAAGAATTGTACGGACGATTGGC
TGCTGAAGTTCGGCGGCATATCCCGTCAAAGCCCATTGACGATGATACTCTTTGCTACAAAGATAATTTGGGGGATTTGGTGATGACTTTGCACTTCGACGGCGGCGTGG
ATCTGCGATTGAGTACGGTTCAGACTTTCAATAAGATGCCGGATGGGTCCTTTTGCTTCACCGCGATGGGCGTTGACGACAAGGACGCACTCATCGGGAACAACTCAAAT
GATTTCTTCTTCGAGATTATGGATGACTCGTTACGGAGGGACCGGTTCGGTCTTGTAGGTTGGTCCAGTCTATTGCTCTTTCCTTGTGCCTACTTTGTTGTCGTAGGAGG
TTGGTTTACAGGATGCAACTTCTTAACCGCCGCTGTTTCGACTCCTGCTAATAATTTAGCACACTCTTTCTTTGTTGTTACTATGGGGCCTGAAGCACAAGGAGATAGAT
TTGAATCCTTGGTGTCCATTACCATTAGGTGGTCTGTGGACTTTTGTTTCTCACCACGGTTCGGACTAATAGGTTTCATGATGGTGATGATGCTTCAAATACATTCCGTG
CTTTTAACCCAACTCAAGCTGAAGGAAGAAACTTATTCAATGGTCACTGCTAACCGCTTTTGGTACCAAATCTTTGGGGTTGCTTTTTCCACTAAAGGTTGGTTACATTT
CTTTCTGTTATTTGTACCAGTAACTGCTTTATGGCCGAGTGCTCTTGGAGTAGTTGGTCTGGCCCTGAACCTACGTGCCTATGACTTCGGTTCTCAGGAAATCCGTGCAG
CAAGGGAAGATCCTGAATTTGAGACTTTCTATACCCTACTATACCCAGAATCTTCTCTTAACCGAAGGTATTCGTGCTTGGATGGCGGCTCAAGATCAGCCTCATGA
Protein sequenceShow/hide protein sequence
MTAKSQILPETSEFIVKIVVGTPPTEVHAILDTGSDLFWAQFRPCAKCYQQTNPIYDPSKSSTFRTLSCKSPQCHLRGSGAACSGTDTCKYNYGYGSGSTQGELASEKMA
VTSRSGATTPFPGVVFGCGHNNSGTFNANEMGLIGFGRGAISFVSQIGPSVGGRKFSLCLMPYNTDPRISSSLSIGSGSEVKGPGVITAQLVRTSDQTSYSLTLTGISVR
KTLVPYSTSRPPAKGNAVLDTGTPPTLLPKELYGRLAAEVRRHIPSKPIDDDTLCYKDNLGDLVMTLHFDGGVDLRLSTVQTFNKMPDGSFCFTAMGVDDKDALIGNNSN
DFFFEIMDDSLRRDRFGLVGWSSLLLFPCAYFVVVGGWFTGCNFLTAAVSTPANNLAHSFFVVTMGPEAQGDRFESLVSITIRWSVDFCFSPRFGLIGFMMVMMLQIHSV
LLTQLKLKEETYSMVTANRFWYQIFGVAFSTKGWLHFFLLFVPVTALWPSALGVVGLALNLRAYDFGSQEIRAAREDPEFETFYTLLYPESSLNRRYSCLDGGSRSAS