CuGenDBv2

Sed0008627 (gene) of Chayote v1 genome

Gene ID: Sed0008627
Organism: Sechium edule (Chayote v1)
Description: ATP-dependent caseinolytic protease/crotonase family protein
Genome location: LG04:31452820..31454647
RNA-Seq Expression: Sed0008627
Synteny: Sed0008627
Gene Ontology terms: NA
InterPro domains: NA
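
The genome location field uses a `sequence:start..end` layout. A minimal sketch of splitting it into its parts is given here; the function names `parse_location` and `span_length` are hypothetical, and the span calculation assumes the coordinates are 1-based and inclusive, which this record does not state.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'seqid:start..end' string into (seqid, start, end)."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start: int, end: int) -> int:
    """Length of the interval, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


seqid, start, end = parse_location("LG04:31452820..31454647")
print(seqid, start, end, span_length(start, end))
# prints: LG04 31452820 31454647 1828
```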


Homology
GenBank top hits (e value, %identity, alignment)
XP_004147260.1 uncharacterized protein LOC101212491 isoform X1 [Cucumis sativus] | e value: 5.8e-94 | %identity: 64.57
Query:  SLLSEPPDVGNWFSSYVYESPRLNPSQEFGSCESESTGLDDKIEETLENVRKTKKGGEGGQLKQFVKCNGNFRGNRQDNQTLREQSLVSEKTTKQDPKEK
        SLLSEPPD+GNWFSSYVYESP LNPSQEFG CES+ TGL  +IEETL+N      GGEG QL  F   NG+  GNRQDNQ   +Q+L SE+T++QDPKEK
Subjt:  SLLSEPPDVGNWFSSYVYESPRLNPSQEFGSCESESTGLDDKIEETLENVRKTKKGGEGGQLKQFVKCNGNFRGNRQDNQTLREQSLVSEKTTKQDPKEK

Query:  SQGTNEIRPTKEVPISSLTTEGLQCNFQDRVSQEN-------ERSSNEDNKKPATHTNLIPKIDIALENSQTM--VQTKLNGSPARTHVPSVLTN-RNPI
        + GTN+I PTKEVPIS+LTTE LQC FQ+RV QEN        RSSN+ N KP THTNLI KID  LENS+T   VQ +L GS    H+P V  N R P 
Subjt:  SQGTNEIRPTKEVPISSLTTEGLQCNFQDRVSQEN-------ERSSNEDNKKPATHTNLIPKIDIALENSQTM--VQTKLNGSPARTHVPSVLTN-RNPI

Query:  DLSKTKEDKGKRVLEDGFVTANKREFRGAACKKSVERLENFSNKETGNGVACLSRKPLSDKTNVERSDIVEIIGKWSCPQKSKPNLGPPLKQLRLERWIH
        DLS  KE+K + V   GF+TANKR F  A CKKSVE  EN++NKE G   ACL RK LSDKTN E S+I+E+IGKWSCPQKSKPNLGPPLKQLRLERW+ 
Subjt:  DLSKTKEDKGKRVLEDGFVTANKREFRGAACKKSVERLENFSNKETGNGVACLSRKPLSDKTNVERSDIVEIIGKWSCPQKSKPNLGPPLKQLRLERWIH

Query:  KK
        +K
Subjt:  KK

XP_008448796.1 PREDICTED: uncharacterized protein LOC103490858 isoform X1 [Cucumis melo] | e value: 1.0e-90 | %identity: 63.25
Query:  SLLSEPPDVGNWFSSYVYESPRLNPSQEFGSCESESTGLDDKIEETLENVRKTKKGGEGGQLKQFVKCNGNFRGNRQDNQTLREQSLVSEKTTKQDPKEK
        SLLSEPPD+GNWFSSYVY SP L+PSQEFG CES+ TGL  ++EETL+N      GGEG QL  F   N N  GNRQDNQ   EQ+L SE+T++QDPKEK
Subjt:  SLLSEPPDVGNWFSSYVYESPRLNPSQEFGSCESESTGLDDKIEETLENVRKTKKGGEGGQLKQFVKCNGNFRGNRQDNQTLREQSLVSEKTTKQDPKEK

Query:  SQGTNEIRPTKEVPISSLTTEGLQCNFQDRVSQEN-------ERSSNEDNKKPATHTNLIPKIDIALENSQTM--VQTKLNGSPARTHVPSVLTN-RNPI
        + GTN I PTKEVPISSLTTE LQC  Q+RV QEN        RSSN+ N  P THTNLI KID  LE+S+T   VQ +LNGS    H+P V  N R P 
Subjt:  SQGTNEIRPTKEVPISSLTTEGLQCNFQDRVSQEN-------ERSSNEDNKKPATHTNLIPKIDIALENSQTM--VQTKLNGSPARTHVPSVLTN-RNPI

Query:  DLSKTKEDKGKRVLEDGFVTANKREFRGAACKKSVERLENFSNKETGNGVACLSRKPLSDKTNVERSDIVEIIGKWSCPQKSKPNLGPPLKQLRLERWIH
        DLS  KE+K + V   GF+TANKR F  A  KKSVE  EN++NKE G+  AC  RK LSDKTN E S+I E+IGKWSCPQKSKPNLGPPLKQLRLERW+H
Subjt:  DLSKTKEDKGKRVLEDGFVTANKREFRGAACKKSVERLENFSNKETGNGVACLSRKPLSDKTNVERSDIVEIIGKWSCPQKSKPNLGPPLKQLRLERWIH

Query:  KK
        +K
Subjt:  KK

XP_011650381.1 uncharacterized protein LOC101212491 isoform X2 [Cucumis sativus] | e value: 4.2e-92 | %identity: 64.24
Query:  SLLSEPPDVGNWFSSYVYESPRLNPSQEFGSCESESTGLDDKIEETLENVRKTKKGGEGGQLKQFVKCNGNFRGNRQDNQTLREQSLVSEKTTKQDPKEK
        SLLSEPPD+GNWFSSYVYESP LNPSQEFG CES+ TGL  +IEETL+N      GGEG QL  F   NG+  GNRQDNQ    ++L SE+T++QDPKEK
Subjt:  SLLSEPPDVGNWFSSYVYESPRLNPSQEFGSCESESTGLDDKIEETLENVRKTKKGGEGGQLKQFVKCNGNFRGNRQDNQTLREQSLVSEKTTKQDPKEK

Query:  SQGTNEIRPTKEVPISSLTTEGLQCNFQDRVSQEN-------ERSSNEDNKKPATHTNLIPKIDIALENSQTM--VQTKLNGSPARTHVPSVLTN-RNPI
        + GTN+I PTKEVPIS+LTTE LQC FQ+RV QEN        RSSN+ N KP THTNLI KID  LENS+T   VQ +L GS    H+P V  N R P 
Subjt:  SQGTNEIRPTKEVPISSLTTEGLQCNFQDRVSQEN-------ERSSNEDNKKPATHTNLIPKIDIALENSQTM--VQTKLNGSPARTHVPSVLTN-RNPI

Query:  DLSKTKEDKGKRVLEDGFVTANKREFRGAACKKSVERLENFSNKETGNGVACLSRKPLSDKTNVERSDIVEIIGKWSCPQKSKPNLGPPLKQLRLERWIH
        DLS  KE+K + V   GF+TANKR F  A CKKSVE  EN++NKE G   ACL RK LSDKTN E S+I+E+IGKWSCPQKSKPNLGPPLKQLRLERW+ 
Subjt:  DLSKTKEDKGKRVLEDGFVTANKREFRGAACKKSVERLENFSNKETGNGVACLSRKPLSDKTNVERSDIVEIIGKWSCPQKSKPNLGPPLKQLRLERWIH

Query:  KK
        +K
Subjt:  KK

XP_038905454.1 uncharacterized protein LOC120091478 isoform X1 [Benincasa hispida] | e value: 2.1e-96 | %identity: 64.63
Query:  SLLSEPPDVGNWFSSYVYESPRLNPSQEFGSCESESTGLDDKIEETLENVRKTKKGGEGGQLKQFVKCNGNFRGNRQDNQTLREQSLVSEKTTKQDPKEK
        SLLSEPPDVGNWFSSYVYESP LN SQEFG CES+ TGL  +IEETLE VRKT+ GG G QL  F KCNGN  GNRQDNQ L  ++L SE+T++Q+PKEK
Subjt:  SLLSEPPDVGNWFSSYVYESPRLNPSQEFGSCESESTGLDDKIEETLENVRKTKKGGEGGQLKQFVKCNGNFRGNRQDNQTLREQSLVSEKTTKQDPKEK

Query:  SQGTNEIRPTKEVPISSLTTEGLQCNFQDRVSQEN-------ERSSNEDNKKPATHTNLIPKIDIALENSQ--TMVQTKLNGSPARTHVPSVLTN----R
        + GTNEI PTKEVPISSLTTE  Q   Q +V Q+N         S N+DNKKPATHTNLI KID+ LE+S+  + VQ +LNGS A   VPSV TN    R
Subjt:  SQGTNEIRPTKEVPISSLTTEGLQCNFQDRVSQEN-------ERSSNEDNKKPATHTNLIPKIDIALENSQ--TMVQTKLNGSPARTHVPSVLTN----R

Query:  NPIDLSKTKEDKGKRVLED------GFVTANKREFRGAACKKSVERLENFSNKETGNGVACLSRKPLSDKTNVERSDIVEIIGKWSCPQKSKPNLGPPLK
         PIDLS  KE+K  +  ++      GF+TANKR F  A+CKKSV+  EN+++KE G+  ACL RK LSDKTN E+S+I+EIIGKWSCPQKSKPNLGPPLK
Subjt:  NPIDLSKTKEDKGKRVLED------GFVTANKREFRGAACKKSVERLENFSNKETGNGVACLSRKPLSDKTNVERSDIVEIIGKWSCPQKSKPNLGPPLK

Query:  QLRLERWIHKK
        QLRLERW+HKK
Subjt:  QLRLERWIHKK

XP_038905455.1 uncharacterized protein LOC120091478 isoform X2 [Benincasa hispida] | e value: 6.7e-98 | %identity: 64.95
Query:  SLLSEPPDVGNWFSSYVYESPRLNPSQEFGSCESESTGLDDKIEETLENVRKTKKGGEGGQLKQFVKCNGNFRGNRQDNQTLREQSLVSEKTTKQDPKEK
        SLLSEPPDVGNWFSSYVYESP LN SQEFG CES+ TGL  +IEETLE VRKT+ GG G QL  F KCNGN  GNRQDNQ   EQ+L SE+T++Q+PKEK
Subjt:  SLLSEPPDVGNWFSSYVYESPRLNPSQEFGSCESESTGLDDKIEETLENVRKTKKGGEGGQLKQFVKCNGNFRGNRQDNQTLREQSLVSEKTTKQDPKEK

Query:  SQGTNEIRPTKEVPISSLTTEGLQCNFQDRVSQEN-------ERSSNEDNKKPATHTNLIPKIDIALENSQ--TMVQTKLNGSPARTHVPSVLTN----R
        + GTNEI PTKEVPISSLTTE  Q   Q +V Q+N         S N+DNKKPATHTNLI KID+ LE+S+  + VQ +LNGS A   VPSV TN    R
Subjt:  SQGTNEIRPTKEVPISSLTTEGLQCNFQDRVSQEN-------ERSSNEDNKKPATHTNLIPKIDIALENSQ--TMVQTKLNGSPARTHVPSVLTN----R

Query:  NPIDLSKTKEDKGKRVLED------GFVTANKREFRGAACKKSVERLENFSNKETGNGVACLSRKPLSDKTNVERSDIVEIIGKWSCPQKSKPNLGPPLK
         PIDLS  KE+K  +  ++      GF+TANKR F  A+CKKSV+  EN+++KE G+  ACL RK LSDKTN E+S+I+EIIGKWSCPQKSKPNLGPPLK
Subjt:  NPIDLSKTKEDKGKRVLED------GFVTANKREFRGAACKKSVERLENFSNKETGNGVACLSRKPLSDKTNVERSDIVEIIGKWSCPQKSKPNLGPPLK

Query:  QLRLERWIHKK
        QLRLERW+HKK
Subjt:  QLRLERWIHKK

TrEMBL top hits (e value, %identity, alignment)
A0A0A0L1G5 Uncharacterized protein | e value: 2.8e-94 | %identity: 64.57
Query:  SLLSEPPDVGNWFSSYVYESPRLNPSQEFGSCESESTGLDDKIEETLENVRKTKKGGEGGQLKQFVKCNGNFRGNRQDNQTLREQSLVSEKTTKQDPKEK
        SLLSEPPD+GNWFSSYVYESP LNPSQEFG CES+ TGL  +IEETL+N      GGEG QL  F   NG+  GNRQDNQ   +Q+L SE+T++QDPKEK
Subjt:  SLLSEPPDVGNWFSSYVYESPRLNPSQEFGSCESESTGLDDKIEETLENVRKTKKGGEGGQLKQFVKCNGNFRGNRQDNQTLREQSLVSEKTTKQDPKEK

Query:  SQGTNEIRPTKEVPISSLTTEGLQCNFQDRVSQEN-------ERSSNEDNKKPATHTNLIPKIDIALENSQTM--VQTKLNGSPARTHVPSVLTN-RNPI
        + GTN+I PTKEVPIS+LTTE LQC FQ+RV QEN        RSSN+ N KP THTNLI KID  LENS+T   VQ +L GS    H+P V  N R P 
Subjt:  SQGTNEIRPTKEVPISSLTTEGLQCNFQDRVSQEN-------ERSSNEDNKKPATHTNLIPKIDIALENSQTM--VQTKLNGSPARTHVPSVLTN-RNPI

Query:  DLSKTKEDKGKRVLEDGFVTANKREFRGAACKKSVERLENFSNKETGNGVACLSRKPLSDKTNVERSDIVEIIGKWSCPQKSKPNLGPPLKQLRLERWIH
        DLS  KE+K + V   GF+TANKR F  A CKKSVE  EN++NKE G   ACL RK LSDKTN E S+I+E+IGKWSCPQKSKPNLGPPLKQLRLERW+ 
Subjt:  DLSKTKEDKGKRVLEDGFVTANKREFRGAACKKSVERLENFSNKETGNGVACLSRKPLSDKTNVERSDIVEIIGKWSCPQKSKPNLGPPLKQLRLERWIH

Query:  KK
        +K
Subjt:  KK

A0A1S3BKJ3 uncharacterized protein LOC103490858 isoform X2 | e value: 4.7e-89 | %identity: 62.91
Query:  SLLSEPPDVGNWFSSYVYESPRLNPSQEFGSCESESTGLDDKIEETLENVRKTKKGGEGGQLKQFVKCNGNFRGNRQDNQTLREQSLVSEKTTKQDPKEK
        SLLSEPPD+GNWFSSYVY SP L+PSQEFG CES+ TGL  ++EETL+N      GGEG QL  F   N N  GNRQDNQ   E +L SE+T++QDPKEK
Subjt:  SLLSEPPDVGNWFSSYVYESPRLNPSQEFGSCESESTGLDDKIEETLENVRKTKKGGEGGQLKQFVKCNGNFRGNRQDNQTLREQSLVSEKTTKQDPKEK

Query:  SQGTNEIRPTKEVPISSLTTEGLQCNFQDRVSQEN-------ERSSNEDNKKPATHTNLIPKIDIALENSQTM--VQTKLNGSPARTHVPSVLTN-RNPI
        + GTN I PTKEVPISSLTTE LQC  Q+RV QEN        RSSN+ N  P THTNLI KID  LE+S+T   VQ +LNGS    H+P V  N R P 
Subjt:  SQGTNEIRPTKEVPISSLTTEGLQCNFQDRVSQEN-------ERSSNEDNKKPATHTNLIPKIDIALENSQTM--VQTKLNGSPARTHVPSVLTN-RNPI

Query:  DLSKTKEDKGKRVLEDGFVTANKREFRGAACKKSVERLENFSNKETGNGVACLSRKPLSDKTNVERSDIVEIIGKWSCPQKSKPNLGPPLKQLRLERWIH
        DLS  KE+K + V   GF+TANKR F  A  KKSVE  EN++NKE G+  AC  RK LSDKTN E S+I E+IGKWSCPQKSKPNLGPPLKQLRLERW+H
Subjt:  DLSKTKEDKGKRVLEDGFVTANKREFRGAACKKSVERLENFSNKETGNGVACLSRKPLSDKTNVERSDIVEIIGKWSCPQKSKPNLGPPLKQLRLERWIH

Query:  KK
        +K
Subjt:  KK

A0A1S3BKJ7 uncharacterized protein LOC103490858 isoform X1 | e value: 5.0e-91 | %identity: 63.25
Query:  SLLSEPPDVGNWFSSYVYESPRLNPSQEFGSCESESTGLDDKIEETLENVRKTKKGGEGGQLKQFVKCNGNFRGNRQDNQTLREQSLVSEKTTKQDPKEK
        SLLSEPPD+GNWFSSYVY SP L+PSQEFG CES+ TGL  ++EETL+N      GGEG QL  F   N N  GNRQDNQ   EQ+L SE+T++QDPKEK
Subjt:  SLLSEPPDVGNWFSSYVYESPRLNPSQEFGSCESESTGLDDKIEETLENVRKTKKGGEGGQLKQFVKCNGNFRGNRQDNQTLREQSLVSEKTTKQDPKEK

Query:  SQGTNEIRPTKEVPISSLTTEGLQCNFQDRVSQEN-------ERSSNEDNKKPATHTNLIPKIDIALENSQTM--VQTKLNGSPARTHVPSVLTN-RNPI
        + GTN I PTKEVPISSLTTE LQC  Q+RV QEN        RSSN+ N  P THTNLI KID  LE+S+T   VQ +LNGS    H+P V  N R P 
Subjt:  SQGTNEIRPTKEVPISSLTTEGLQCNFQDRVSQEN-------ERSSNEDNKKPATHTNLIPKIDIALENSQTM--VQTKLNGSPARTHVPSVLTN-RNPI

Query:  DLSKTKEDKGKRVLEDGFVTANKREFRGAACKKSVERLENFSNKETGNGVACLSRKPLSDKTNVERSDIVEIIGKWSCPQKSKPNLGPPLKQLRLERWIH
        DLS  KE+K + V   GF+TANKR F  A  KKSVE  EN++NKE G+  AC  RK LSDKTN E S+I E+IGKWSCPQKSKPNLGPPLKQLRLERW+H
Subjt:  DLSKTKEDKGKRVLEDGFVTANKREFRGAACKKSVERLENFSNKETGNGVACLSRKPLSDKTNVERSDIVEIIGKWSCPQKSKPNLGPPLKQLRLERWIH

Query:  KK
        +K
Subjt:  KK

A0A5A7TCQ7 ATP-dependent caseinolytic protease/crotonase family protein | e value: 4.7e-89 | %identity: 62.91
Query:  SLLSEPPDVGNWFSSYVYESPRLNPSQEFGSCESESTGLDDKIEETLENVRKTKKGGEGGQLKQFVKCNGNFRGNRQDNQTLREQSLVSEKTTKQDPKEK
        SLLSEPPD+GNWFSSYVY SP L+PSQEFG CES+ TGL  ++EETL+N      GGEG QL  F   N N  GNRQDNQ   E +L SE+T++QDPKEK
Subjt:  SLLSEPPDVGNWFSSYVYESPRLNPSQEFGSCESESTGLDDKIEETLENVRKTKKGGEGGQLKQFVKCNGNFRGNRQDNQTLREQSLVSEKTTKQDPKEK

Query:  SQGTNEIRPTKEVPISSLTTEGLQCNFQDRVSQEN-------ERSSNEDNKKPATHTNLIPKIDIALENSQTM--VQTKLNGSPARTHVPSVLTN-RNPI
        + GTN I PTKEVPISSLTTE LQC  Q+RV QEN        RSSN+ N  P THTNLI KID  LE+S+T   VQ +LNGS    H+P V  N R P 
Subjt:  SQGTNEIRPTKEVPISSLTTEGLQCNFQDRVSQEN-------ERSSNEDNKKPATHTNLIPKIDIALENSQTM--VQTKLNGSPARTHVPSVLTN-RNPI

Query:  DLSKTKEDKGKRVLEDGFVTANKREFRGAACKKSVERLENFSNKETGNGVACLSRKPLSDKTNVERSDIVEIIGKWSCPQKSKPNLGPPLKQLRLERWIH
        DLS  KE+K + V   GF+TANKR F  A  KKSVE  EN++NKE G+  AC  RK LSDKTN E S+I E+IGKWSCPQKSKPNLGPPLKQLRLERW+H
Subjt:  DLSKTKEDKGKRVLEDGFVTANKREFRGAACKKSVERLENFSNKETGNGVACLSRKPLSDKTNVERSDIVEIIGKWSCPQKSKPNLGPPLKQLRLERWIH

Query:  KK
        +K
Subjt:  KK

A0A5D3CLN8 ATP-dependent caseinolytic protease/crotonase family protein | e value: 4.7e-89 | %identity: 62.91
Query:  SLLSEPPDVGNWFSSYVYESPRLNPSQEFGSCESESTGLDDKIEETLENVRKTKKGGEGGQLKQFVKCNGNFRGNRQDNQTLREQSLVSEKTTKQDPKEK
        SLLSEPPD+GNWFSSYVY SP L+PSQEFG CES+ TGL  ++EETL+N      GGEG QL  F   N N  GNRQDNQ   E +L SE+T++QDPKEK
Subjt:  SLLSEPPDVGNWFSSYVYESPRLNPSQEFGSCESESTGLDDKIEETLENVRKTKKGGEGGQLKQFVKCNGNFRGNRQDNQTLREQSLVSEKTTKQDPKEK

Query:  SQGTNEIRPTKEVPISSLTTEGLQCNFQDRVSQEN-------ERSSNEDNKKPATHTNLIPKIDIALENSQTM--VQTKLNGSPARTHVPSVLTN-RNPI
        + GTN I PTKEVPISSLTTE LQC  Q+RV QEN        RSSN+ N  P THTNLI KID  LE+S+T   VQ +LNGS    H+P V  N R P 
Subjt:  SQGTNEIRPTKEVPISSLTTEGLQCNFQDRVSQEN-------ERSSNEDNKKPATHTNLIPKIDIALENSQTM--VQTKLNGSPARTHVPSVLTN-RNPI

Query:  DLSKTKEDKGKRVLEDGFVTANKREFRGAACKKSVERLENFSNKETGNGVACLSRKPLSDKTNVERSDIVEIIGKWSCPQKSKPNLGPPLKQLRLERWIH
        DLS  KE+K + V   GF+TANKR F  A  KKSVE  EN++NKE G+  AC  RK LSDKTN E S+I E+IGKWSCPQKSKPNLGPPLKQLRLERW+H
Subjt:  DLSKTKEDKGKRVLEDGFVTANKREFRGAACKKSVERLENFSNKETGNGVACLSRKPLSDKTNVERSDIVEIIGKWSCPQKSKPNLGPPLKQLRLERWIH

Query:  KK
        +K
Subjt:  KK

SwissProt top hits (e value, %identity, alignment)
No hits found
Arabidopsis top hits (e value, %identity, alignment)
AT4G16807.1 unknown protein | e value: 1.6e-17 | %identity: 30.6
Query:  SLLSEPPDVGNWFSSYVYESPRLNPSQEFGSCESESTGLDDKIEETLENV-----RKTKKGGEGGQLKQFVKCNGNFRGNRQDNQTLREQSLVSEKTTKQ
        S+LSEPPD+ NWFSSY Y+SP+L+  QEFG   SE   L  +  +T E +     RK K   E   L    + + N         T +E SL  +     
Subjt:  SLLSEPPDVGNWFSSYVYESPRLNPSQEFGSCESESTGLDDKIEETLENV-----RKTKKGGEGGQLKQFVKCNGNFRGNRQDNQTLREQSLVSEKTTKQ

Query:  DPKEKSQGTNEIRPTKE--VPISSLTTEGLQCNFQD------RVSQENERSSNEDNKKPATHTNLIPKIDIALENSQTMVQTKLNGSPARTHVPSVLTNR
           EK      +  +K+     SS   E L C  Q+      RVS+ N +  +      + H  L P   I ++ S +M   +    P            
Subjt:  DPKEKSQGTNEIRPTKE--VPISSLTTEGLQCNFQD------RVSQENERSSNEDNKKPATHTNLIPKIDIALENSQTMVQTKLNGSPARTHVPSVLTNR

Query:  NPIDLSKTKEDKGKRVLEDGFVTANKREFRGAACKKSVER------LENFSNKETGNGVA-------CLSRKPLSDKTNVERSDIVEIIGKWSCPQKSKP
             S  KE+   +  + GFVT  K  FR A  + S+++      ++  S+KE                R+ L + +N   S   EI GKW CPQK+K 
Subjt:  NPIDLSKTKEDKGKRVLEDGFVTANKREFRGAACKKSVER------LENFSNKETGNGVA-------CLSRKPLSDKTNVERSDIVEIIGKWSCPQKSKP

Query:  NLGPPLKQLRLERWIHK
         + PPLKQLRL+ WIHK
Subjt:  NLGPPLKQLRLERWIHK


Sequences
CDS sequence
ATGTTTGAACGCAAGGATAGTTACATATTTGTGCTTTCCCTTTTATCAGAGCCTCCTGATGTTGGGAACTGGTTTTCAAGTTATGTATATGAATCACCTAGATTAAATCC
AAGCCAGGAATTTGGATCATGTGAAAGCGAAAGCACTGGTTTAGACGACAAAATAGAAGAGACTTTGGAGAATGTCAGAAAAACCAAAAAGGGAGGTGAAGGGGGACAGT
TGAAGCAGTTCGTAAAATGCAATGGCAATTTCAGGGGCAACAGGCAGGATAATCAGACCCTAAGAGAGCAGAGTTTAGTTTCTGAAAAAACTACAAAGCAAGACCCTAAA
GAGAAAAGTCAGGGAACCAACGAAATCAGGCCAACCAAAGAAGTTCCAATTTCAAGCCTAACTACTGAAGGTCTTCAATGTAATTTCCAAGATAGAGTTTCCCAAGAGAA
TGAAAGAAGTTCAAATGAAGATAACAAGAAACCTGCAACTCATACAAACTTGATCCCAAAGATAGATATCGCACTCGAAAATTCCCAAACCATGGTCCAAACGAAACTCA
ATGGCTCACCAGCTAGAACTCATGTACCTTCTGTCTTAACCAATAGAAATCCGATTGATCTAAGCAAAACCAAAGAAGACAAAGGAAAAAGAGTTTTAGAGGATGGTTTC
GTTACCGCAAACAAGCGTGAATTTCGTGGAGCAGCTTGCAAGAAATCAGTAGAAAGGCTAGAAAACTTCAGTAACAAAGAGACAGGAAATGGTGTTGCTTGTTTAAGTAG
AAAGCCACTATCAGACAAAACCAATGTTGAGCGCTCAGATATAGTAGAGATTATTGGGAAATGGAGTTGTCCTCAGAAAAGTAAGCCGAATCTTGGACCGCCATTGAAGC
AGCTTCGACTTGAACGATGGATTCACAAGAAATAA
mRNA sequence
CAAAAAAAGAAAAGAAAAGATGTTTGAACGCAAGGATAGTTACATATTTGTGCTTTCCCTTTTATCAGAGCCTCCTGATGTTGGGAACTGGTTTTCAAGTTATGTATATG
AATCACCTAGATTAAATCCAAGCCAGGAATTTGGATCATGTGAAAGCGAAAGCACTGGTTTAGACGACAAAATAGAAGAGACTTTGGAGAATGTCAGAAAAACCAAAAAG
GGAGGTGAAGGGGGACAGTTGAAGCAGTTCGTAAAATGCAATGGCAATTTCAGGGGCAACAGGCAGGATAATCAGACCCTAAGAGAGCAGAGTTTAGTTTCTGAAAAAAC
TACAAAGCAAGACCCTAAAGAGAAAAGTCAGGGAACCAACGAAATCAGGCCAACCAAAGAAGTTCCAATTTCAAGCCTAACTACTGAAGGTCTTCAATGTAATTTCCAAG
ATAGAGTTTCCCAAGAGAATGAAAGAAGTTCAAATGAAGATAACAAGAAACCTGCAACTCATACAAACTTGATCCCAAAGATAGATATCGCACTCGAAAATTCCCAAACC
ATGGTCCAAACGAAACTCAATGGCTCACCAGCTAGAACTCATGTACCTTCTGTCTTAACCAATAGAAATCCGATTGATCTAAGCAAAACCAAAGAAGACAAAGGAAAAAG
AGTTTTAGAGGATGGTTTCGTTACCGCAAACAAGCGTGAATTTCGTGGAGCAGCTTGCAAGAAATCAGTAGAAAGGCTAGAAAACTTCAGTAACAAAGAGACAGGAAATG
GTGTTGCTTGTTTAAGTAGAAAGCCACTATCAGACAAAACCAATGTTGAGCGCTCAGATATAGTAGAGATTATTGGGAAATGGAGTTGTCCTCAGAAAAGTAAGCCGAAT
CTTGGACCGCCATTGAAGCAGCTTCGACTTGAACGATGGATTCACAAGAAATAAAACATGTCCTTGGCTCAGTGATAAAGAGGGAAGATTTCCATAACCAATATGATTGT
GGTATGCAAATCATGCATGACTCTCATTTTACAAATTTCTATTTCTCCTGAATCAGGTTTGCATTGTTCTATCTTCTCCTATGTATATATTTTTTGTTGTAATCTTTATG
TAGAAAAACAGATACTTCTAAGAAAGGGATCAAAGCTTTTGCAGAGAAGAATTGACATATTACAGATTTGTGGTATTATGAGG
Protein sequence
MFERKDSYIFVLSLLSEPPDVGNWFSSYVYESPRLNPSQEFGSCESESTGLDDKIEETLENVRKTKKGGEGGQLKQFVKCNGNFRGNRQDNQTLREQSLVSEKTTKQDPK
EKSQGTNEIRPTKEVPISSLTTEGLQCNFQDRVSQENERSSNEDNKKPATHTNLIPKIDIALENSQTMVQTKLNGSPARTHVPSVLTNRNPIDLSKTKEDKGKRVLEDGF
VTANKREFRGAACKKSVERLENFSNKETGNGVACLSRKPLSDKTNVERSDIVEIIGKWSCPQKSKPNLGPPLKQLRLERWIHKK
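
The protein sequence above can be checked against the CDS by translating it codon by codon. The sketch below is one way to do that, not the database's own pipeline; it assumes the standard nuclear genetic code, and the `translate` function and its table names are hypothetical. The two calls use the first 24 and the last 15 bases of the CDS as listed in this record.

```python
# Standard genetic code, with codons enumerated in T, C, A, G order
# (ASSUMPTION: the standard nuclear code applies to this gene).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a DNA coding sequence, stopping at the first stop codon.

    Whitespace and line breaks are ignored; codons containing anything
    other than A, C, G, T are reported as 'X'.
    """
    seq = "".join(cds.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE.get(seq[pos:pos + 3], "X")
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


print(translate("ATGTTTGAACGCAAGGATAGTTAC"))  # prints: MFERKDSY
print(translate("ATTCACAAGAAATAA"))           # prints: IHKK
```

Both outputs match the start (`MFERKDSY`) and end (`IHKK`) of the protein sequence listed above. Passing the full CDS, with its line breaks left in, should yield the complete protein for comparison.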