; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Pay0022113 (gene) of Melon (Payzawat) v1 genome

Gene IDPay0022113
OrganismCucumis melo var. inodorus cv. Payzawat (Melon (Payzawat) v1)
DescriptionSuppressor protein SRP40-like isoform X2
Genome locationchr04:32139060..32141442
RNA-Seq ExpressionPay0022113
SyntenyPay0022113
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0044053.1 suppressor protein SRP40-like isoform X2 [Cucumis melo var. makuwa]1.1e-10298.6Show/hide
Query:  MVEGGEVSKTMECLRRRLLAERHASLLAKEEAEVMDKRSSELEKQITKQIQMKARAEKKLQLLKKKLESLNLSSTMVNSEASISSEICNEDEPKTLIEVP
        MVEGGEVSKTMECLRRRLLAERHASLLAKEEAEVMDKRSSELEKQITKQIQMKARAEKKLQLLKKKLESLNLSSTMVNSEASISSEICNEDEPKTLIEVP
Subjt:  MVEGGEVSKTMECLRRRLLAERHASLLAKEEAEVMDKRSSELEKQITKQIQMKARAEKKLQLLKKKLESLNLSSTMVNSEASISSEICNEDEPKTLIEVP

Query:  PLPRNSKGINEISHSEEENPNARGSTASNISASKIYSDEPSKTKTGSCGKEFDSVDDSLAIAAVDSPEKSETGEQLKPVISERIIEVLNDLKRARGRIQS
        PLPRNSKGINEISHSEEEN NARGSTASNISASKIYSDEPSKTKTGSCGKEFDSVDDSLAI AVDSPEKSETGEQLKPVISERIIEVLNDLKRARGRIQS
Subjt:  PLPRNSKGINEISHSEEENPNARGSTASNISASKIYSDEPSKTKTGSCGKEFDSVDDSLAIAAVDSPEKSETGEQLKPVISERIIEVLNDLKRARGRIQS

Query:  SMKICEHMIEVSPV
        SM+ICEHMIEVSPV
Subjt:  SMKICEHMIEVSPV

KAG7027980.1 hypothetical protein SDJN02_09159, partial [Cucurbita argyrosperma subsp. argyrosperma]9.4e-8969.69Show/hide
Query:  NYQTAATFAKYFQHLKKTVFPKSSLPKKKKKIVKRVNLSTFSDYLYCNPDNCANAKKMSPYTKMDIKWSGNKMVEGGEVSKTMECLRRRLLAERHASLLA
        NYQT A+FA+  QHLKKT F KSSL   KK ++KRV+LSTF +YL  NP +C+ AK ++  TKMDIKWSGN M EG EV KTMECLRRRLLAER ASLLA
Subjt:  NYQTAATFAKYFQHLKKTVFPKSSLPKKKKKIVKRVNLSTFSDYLYCNPDNCANAKKMSPYTKMDIKWSGNKMVEGGEVSKTMECLRRRLLAERHASLLA

Query:  KEEAEVMDKRSSELEKQITKQIQMKARAEKKLQLLKKKLESLNLSSTMVNSEASISSEICNEDEPKTLIEVPPLPRNSKGINEISHSEEENPNARGSTAS
        KEEAE+M KRS+ELEKQIT+Q QM+ +AEKKLQLL KKLESLNLS TM+NSE SISSEIC+EDEPKTL     LP NS+ I EISHS+EENPNARGST+S
Subjt:  KEEAEVMDKRSSELEKQITKQIQMKARAEKKLQLLKKKLESLNLSSTMVNSEASISSEICNEDEPKTLIEVPPLPRNSKGINEISHSEEENPNARGSTAS

Query:  NISASKIYSDEPSKTKTGSCGKEFDSVDDSLAIAAVDSPEKSETGEQLKPVISERIIEVLNDLKRARGRIQSSMKICE-HMIEVSPV
        + S S+I S+E SK K  + G  F+SVDDSLA+ AV+SPE++ETGE +K VISER+IEVLNDLK+AR RIQ SMKICE +M++VSPV
Subjt:  NISASKIYSDEPSKTKTGSCGKEFDSVDDSLAIAAVDSPEKSETGEQLKPVISERIIEVLNDLKRARGRIQSSMKICE-HMIEVSPV

XP_008442604.2 PREDICTED: uncharacterized protein LOC103486426 [Cucumis melo]1.5e-15098.31Show/hide
Query:  MEVDELGQNYNYQTAATFAKYFQHLKKTVFPKSSLPKKKKKIVKRVNLSTFSDYLYCNPDNCANAKKMSPYTKMDIKWSGNKMVEGGEVSKTMECLRRRL
        MEVDELGQNYNYQTAATFAKYFQHLKKTVFPKSSLP KKKKIVKRVNLSTFSDYLYCNPDNCA AKKMSPYTKMDIKWSGNKMVEGGEVSKTMECLRRRL
Subjt:  MEVDELGQNYNYQTAATFAKYFQHLKKTVFPKSSLPKKKKKIVKRVNLSTFSDYLYCNPDNCANAKKMSPYTKMDIKWSGNKMVEGGEVSKTMECLRRRL

Query:  LAERHASLLAKEEAEVMDKRSSELEKQITKQIQMKARAEKKLQLLKKKLESLNLSSTMVNSEASISSEICNEDEPKTLIEVPPLPRNSKGINEISHSEEE
        LAERHASLLAKEEAEVMDKRSSELEKQITKQIQMKARAEKKLQLLKKKLESLNLSSTMVNSEASISSEICNEDEPKTLIEVPPLPRNSKGINEISHSEEE
Subjt:  LAERHASLLAKEEAEVMDKRSSELEKQITKQIQMKARAEKKLQLLKKKLESLNLSSTMVNSEASISSEICNEDEPKTLIEVPPLPRNSKGINEISHSEEE

Query:  NPNARGSTASNISASKIYSDEPSKTKTGSCGKEFDSVDDSLAIAAVDSPEKSETGEQLKPVISERIIEVLNDLKRARGRIQSSMKICEHMIEVSPV
        N NARGSTASNISASKIYSDEPSKTKTGSCGKEFDSVDDSLAI AVDSPEKSETGEQLKPVISERIIEVLNDLKRARGRIQSSM+ICEHMIEVSPV
Subjt:  NPNARGSTASNISASKIYSDEPSKTKTGSCGKEFDSVDDSLAIAAVDSPEKSETGEQLKPVISERIIEVLNDLKRARGRIQSSMKICEHMIEVSPV

XP_011651920.1 uncharacterized protein LOC105434957 [Cucumis sativus]2.0e-9988.65Show/hide
Query:  MSPYTKMDIKWSGNKMVEGGEVSKTMECLRRRLLAERHASLLAKEEAEVMDKRSSELEKQITKQIQMKARAEKKLQLLKKKLESLNLSSTMVNSEASISS
        M+P TKMDIKWSGNKMVEGGEVSKTMECLRRRLLAERHASLLAK+EAE+MDKRSSELEKQITKQIQMKARAEKKLQLLKKKLESLNLSSTMVNSEAS+SS
Subjt:  MSPYTKMDIKWSGNKMVEGGEVSKTMECLRRRLLAERHASLLAKEEAEVMDKRSSELEKQITKQIQMKARAEKKLQLLKKKLESLNLSSTMVNSEASISS

Query:  EICNEDEPKTLIEVPPLPRNSKGINEISHSEEENPNARGSTASNISASKIYSDEPSKTKTGSCGKEFDSVDDSLAIAAVDSPEKSETGEQLKPVISERII
        EICNE+EPKT IEV PLP +SKGI+EI HSEEEN NARGST+SNISASKI+SD+PSKTK G+CGKE DSVDDSLAI AVDSP KSET EQLKPVISERII
Subjt:  EICNEDEPKTLIEVPPLPRNSKGINEISHSEEENPNARGSTASNISASKIYSDEPSKTKTGSCGKEFDSVDDSLAIAAVDSPEKSETGEQLKPVISERII

Query:  EVLNDLKRARGRIQSSMKICEHMIEVSPV
        EVLNDLKRAR RIQSSMK+C+HMIEVSP+
Subjt:  EVLNDLKRARGRIQSSMKICEHMIEVSPV

XP_038906280.1 uncharacterized protein LOC120092141 [Benincasa hispida]4.0e-8781.01Show/hide
Query:  NCANAKKMSPYTKMDIKWSGNKMVEGGEVSKTMECLRRRLLAERHASLLAKEEAEVMDKRSSELEKQITKQIQMKARAEKKLQLLKKKLESLNLSSTMVN
        +CA AKKM+P    +IKWSGNKM EGGEVSKTMECLRRRLLAER ASLLAKEEAE+M KRS+ELEKQITKQIQMK +AEKKLQLLKKKL SLNLS+TMVN
Subjt:  NCANAKKMSPYTKMDIKWSGNKMVEGGEVSKTMECLRRRLLAERHASLLAKEEAEVMDKRSSELEKQITKQIQMKARAEKKLQLLKKKLESLNLSSTMVN

Query:  SEASISSEICNEDEPKTLIEVPPLPRNSKGINEISHSEEENPNARGSTASNISASKIYSDEPSKTKTGSCGKEFDSVDDSLAIAAVDSPEKSETGEQLKP
        SEAS+SSEIC+EDEPKTLIEVP L  NSK I EISH EEEN NARGST+SN SAS+I SDEPSKTK G C KEFDSVDDSLAI AV+SP KSETG++LKP
Subjt:  SEASISSEICNEDEPKTLIEVPPLPRNSKGINEISHSEEENPNARGSTASNISASKIYSDEPSKTKTGSCGKEFDSVDDSLAIAAVDSPEKSETGEQLKP

Query:  VISERIIEVLNDLKRARGRIQSSMKICE-HMIEVSPV
        +ISERIIEVLNDLK AR  I+SSMKICE +MIEVSPV
Subjt:  VISERIIEVLNDLKRARGRIQSSMKICE-HMIEVSPV

TrEMBL top hitse value%identityAlignment
A0A0A0LDB5 Uncharacterized protein9.8e-10088.65Show/hide
Query:  MSPYTKMDIKWSGNKMVEGGEVSKTMECLRRRLLAERHASLLAKEEAEVMDKRSSELEKQITKQIQMKARAEKKLQLLKKKLESLNLSSTMVNSEASISS
        M+P TKMDIKWSGNKMVEGGEVSKTMECLRRRLLAERHASLLAK+EAE+MDKRSSELEKQITKQIQMKARAEKKLQLLKKKLESLNLSSTMVNSEAS+SS
Subjt:  MSPYTKMDIKWSGNKMVEGGEVSKTMECLRRRLLAERHASLLAKEEAEVMDKRSSELEKQITKQIQMKARAEKKLQLLKKKLESLNLSSTMVNSEASISS

Query:  EICNEDEPKTLIEVPPLPRNSKGINEISHSEEENPNARGSTASNISASKIYSDEPSKTKTGSCGKEFDSVDDSLAIAAVDSPEKSETGEQLKPVISERII
        EICNE+EPKT IEV PLP +SKGI+EI HSEEEN NARGST+SNISASKI+SD+PSKTK G+CGKE DSVDDSLAI AVDSP KSET EQLKPVISERII
Subjt:  EICNEDEPKTLIEVPPLPRNSKGINEISHSEEENPNARGSTASNISASKIYSDEPSKTKTGSCGKEFDSVDDSLAIAAVDSPEKSETGEQLKPVISERII

Query:  EVLNDLKRARGRIQSSMKICEHMIEVSPV
        EVLNDLKRAR RIQSSMK+C+HMIEVSP+
Subjt:  EVLNDLKRARGRIQSSMKICEHMIEVSPV

A0A1S3B6U8 uncharacterized protein LOC1034864267.1e-15198.31Show/hide
Query:  MEVDELGQNYNYQTAATFAKYFQHLKKTVFPKSSLPKKKKKIVKRVNLSTFSDYLYCNPDNCANAKKMSPYTKMDIKWSGNKMVEGGEVSKTMECLRRRL
        MEVDELGQNYNYQTAATFAKYFQHLKKTVFPKSSLP KKKKIVKRVNLSTFSDYLYCNPDNCA AKKMSPYTKMDIKWSGNKMVEGGEVSKTMECLRRRL
Subjt:  MEVDELGQNYNYQTAATFAKYFQHLKKTVFPKSSLPKKKKKIVKRVNLSTFSDYLYCNPDNCANAKKMSPYTKMDIKWSGNKMVEGGEVSKTMECLRRRL

Query:  LAERHASLLAKEEAEVMDKRSSELEKQITKQIQMKARAEKKLQLLKKKLESLNLSSTMVNSEASISSEICNEDEPKTLIEVPPLPRNSKGINEISHSEEE
        LAERHASLLAKEEAEVMDKRSSELEKQITKQIQMKARAEKKLQLLKKKLESLNLSSTMVNSEASISSEICNEDEPKTLIEVPPLPRNSKGINEISHSEEE
Subjt:  LAERHASLLAKEEAEVMDKRSSELEKQITKQIQMKARAEKKLQLLKKKLESLNLSSTMVNSEASISSEICNEDEPKTLIEVPPLPRNSKGINEISHSEEE

Query:  NPNARGSTASNISASKIYSDEPSKTKTGSCGKEFDSVDDSLAIAAVDSPEKSETGEQLKPVISERIIEVLNDLKRARGRIQSSMKICEHMIEVSPV
        N NARGSTASNISASKIYSDEPSKTKTGSCGKEFDSVDDSLAI AVDSPEKSETGEQLKPVISERIIEVLNDLKRARGRIQSSM+ICEHMIEVSPV
Subjt:  NPNARGSTASNISASKIYSDEPSKTKTGSCGKEFDSVDDSLAIAAVDSPEKSETGEQLKPVISERIIEVLNDLKRARGRIQSSMKICEHMIEVSPV

A0A5A7TKN7 Suppressor protein SRP40-like isoform X25.5e-10398.6Show/hide
Query:  MVEGGEVSKTMECLRRRLLAERHASLLAKEEAEVMDKRSSELEKQITKQIQMKARAEKKLQLLKKKLESLNLSSTMVNSEASISSEICNEDEPKTLIEVP
        MVEGGEVSKTMECLRRRLLAERHASLLAKEEAEVMDKRSSELEKQITKQIQMKARAEKKLQLLKKKLESLNLSSTMVNSEASISSEICNEDEPKTLIEVP
Subjt:  MVEGGEVSKTMECLRRRLLAERHASLLAKEEAEVMDKRSSELEKQITKQIQMKARAEKKLQLLKKKLESLNLSSTMVNSEASISSEICNEDEPKTLIEVP

Query:  PLPRNSKGINEISHSEEENPNARGSTASNISASKIYSDEPSKTKTGSCGKEFDSVDDSLAIAAVDSPEKSETGEQLKPVISERIIEVLNDLKRARGRIQS
        PLPRNSKGINEISHSEEEN NARGSTASNISASKIYSDEPSKTKTGSCGKEFDSVDDSLAI AVDSPEKSETGEQLKPVISERIIEVLNDLKRARGRIQS
Subjt:  PLPRNSKGINEISHSEEENPNARGSTASNISASKIYSDEPSKTKTGSCGKEFDSVDDSLAIAAVDSPEKSETGEQLKPVISERIIEVLNDLKRARGRIQS

Query:  SMKICEHMIEVSPV
        SM+ICEHMIEVSPV
Subjt:  SMKICEHMIEVSPV

A0A6J1CXG5 uncharacterized protein LOC1110150991.9e-6366.52Show/hide
Query:  MSPYTKMDIKWSGNKMVEGGEVSKTMECLRRRLLAERHASLLAKEEAEVMDKRSSELEKQITKQIQMKARAEKKLQLLKKKLESLNLSSTMVNSEASISS
        M+P  K DIKWSGNKM EG EV KTMECLRRRLLAER AS LAKE+AE+M+KRS ELEKQIT+QI M+ +AEKKL+LL+KKLESLNL ST V SE S+SS
Subjt:  MSPYTKMDIKWSGNKMVEGGEVSKTMECLRRRLLAERHASLLAKEEAEVMDKRSSELEKQITKQIQMKARAEKKLQLLKKKLESLNLSSTMVNSEASISS

Query:  EICNEDEPKTLIEVPPLPRNSKGINEISHSEEENPNARGSTASNISASKIYSDEPSKTKTGSCGKEFDSVDDSLAIAAVDSPEKSETGEQLKPVISERII
        EIC+ ++PKTLI    LP N++   EISHSEE NPNARG  A + SAS+I  D+ SK K  + G EF S  DS A+ AV+SPE S+TGE  KP I+E II
Subjt:  EICNEDEPKTLIEVPPLPRNSKGINEISHSEEENPNARGSTASNISASKIYSDEPSKTKTGSCGKEFDSVDDSLAIAAVDSPEKSETGEQLKPVISERII

Query:  EVLNDLKRARGRIQSSMKICE-HMIEVSPV
        EVLNDLK AR RIQSSM+I E +MI+VSPV
Subjt:  EVLNDLKRARGRIQSSMKICE-HMIEVSPV

A0A6J1L305 uncharacterized protein LOC1114986453.0e-7271.74Show/hide
Query:  MSPYTKMDIKWSGNKMVEGGEVSKTMECLRRRLLAERHASLLAKEEAEVMDKRSSELEKQITKQIQMKARAEKKLQLLKKKLESLNLSSTMVNSEASISS
        M+  TK+DIKWSGN M EG EV KTMECLRRRLLAER ASLLAKEEAE+M KRS+ELEK+IT+Q QM+ +AEKKLQLLKKKLESL+LS TM+NSE SISS
Subjt:  MSPYTKMDIKWSGNKMVEGGEVSKTMECLRRRLLAERHASLLAKEEAEVMDKRSSELEKQITKQIQMKARAEKKLQLLKKKLESLNLSSTMVNSEASISS

Query:  EICNEDEPKTLIEVPPLPRNSKGINEISHSEEENPNARGSTASNISASKIYSDEPSKTKTGSCGKEFDSVDDSLAIAAVDSPEKSETGEQLKPVISERII
        EIC+EDEPKTL     LP NS+ I EISHS+EENPNARG+T+S+ S S+I+S+E SK K  + G+EF+SVDDSLA  AV+SPE++ETGE +K VISER+I
Subjt:  EICNEDEPKTLIEVPPLPRNSKGINEISHSEEENPNARGSTASNISASKIYSDEPSKTKTGSCGKEFDSVDDSLAIAAVDSPEKSETGEQLKPVISERII

Query:  EVLNDLKRARGRIQSSMKICE-HMIEVSPV
        EVLNDLK+AR RIQ SMKICE +M++VSPV
Subjt:  EVLNDLKRARGRIQSSMKICE-HMIEVSPV

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G37300.1 unknown protein2.3e-0832.32Show/hide
Query:  EGGEVSKTMECLRRRLLAERHASLLAKEEAEVMDKRSSELEKQITKQIQMKARAEKKLQLLKKKLESL---NLSSTMVNSEASISSEIC-----NEDEPK
        E  E  +T+ECLR RLLAER  S  AKEEAE++ ++  ELE+ + ++I+++ +AEK+L+LL KKLE +     S    +SE S  S +C      E+E  
Subjt:  EGGEVSKTMECLRRRLLAERHASLLAKEEAEVMDKRSSELEKQITKQIQMKARAEKKLQLLKKKLESL---NLSSTMVNSEASISSEIC-----NEDEPK

Query:  TLIEVPPLPRNSKGINEISHSEEENPNARGSTASNISASKIYSDEPSKTKTGSCGKEFDSVDDS
                 +       ++ +EE + +++    S+  AS + S    K ++   G EF    D+
Subjt:  TLIEVPPLPRNSKGINEISHSEEENPNARGSTASNISASKIYSDEPSKTKTGSCGKEFDSVDDS

AT2G37300.2 unknown protein7.9e-0931.67Show/hide
Query:  TKMDIKWSGNKMVEGGEV---SKTMECLRRRLLAERHASLLAKEEAEVMDKRSSELEKQITKQIQMKARAEKKLQLLKKKLESL---NLSSTMVNSEASI
        TK    ++  KM E  E     +T+ECLR RLLAER  S  AKEEAE++ ++  ELE+ + ++I+++ +AEK+L+LL KKLE +     S    +SE S 
Subjt:  TKMDIKWSGNKMVEGGEV---SKTMECLRRRLLAERHASLLAKEEAEVMDKRSSELEKQITKQIQMKARAEKKLQLLKKKLESL---NLSSTMVNSEASI

Query:  SSEIC-----NEDEPKTLIEVPPLPRNSKGINEISHSEEENPNARGSTASNISASKIYSDEPSKTKTGSCGKEFDSVDDS
         S +C      E+E           +       ++ +EE + +++    S+  AS + S    K ++   G EF    D+
Subjt:  SSEIC-----NEDEPKTLIEVPPLPRNSKGINEISHSEEENPNARGSTASNISASKIYSDEPSKTKTGSCGKEFDSVDDS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAGGTGGATGAATTGGGTCAGAATTATAATTACCAAACTGCTGCCACTTTTGCAAAGTATTTCCAGCATCTGAAGAAGACAGTTTTTCCAAAAAGTTCCTTACCAAA
AAAAAAAAAAAAAATTGTTAAAAGAGTTAATCTTTCCACTTTTTCTGACTATTTGTATTGTAATCCAGACAATTGTGCAAACGCCAAGAAAATGTCACCCTATACCAAGA
TGGACATCAAATGGAGTGGCAACAAAATGGTGGAAGGTGGTGAAGTTTCTAAAACAATGGAGTGCCTAAGAAGGAGATTACTAGCAGAGAGGCACGCATCATTACTTGCC
AAAGAGGAGGCAGAGGTCATGGACAAAAGGTCATCAGAATTAGAGAAGCAGATCACAAAGCAGATTCAAATGAAAGCAAGAGCAGAAAAGAAGCTTCAATTACTAAAGAA
AAAGCTCGAATCACTTAACCTTTCTTCCACGATGGTGAACTCAGAGGCTTCAATTTCGTCGGAAATTTGCAACGAAGATGAACCGAAAACCCTAATCGAAGTTCCGCCTC
TCCCAAGAAACTCCAAGGGCATTAATGAAATTTCTCATTCGGAAGAAGAGAATCCAAATGCTCGTGGTTCTACAGCCTCGAACATCTCTGCTTCTAAAATTTACTCCGAC
GAACCTTCAAAAACGAAGACCGGAAGTTGTGGAAAGGAATTTGATTCCGTCGACGATTCACTTGCAATCGCTGCAGTGGACTCGCCGGAAAAATCAGAAACCGGCGAGCA
GTTGAAGCCGGTGATCAGTGAAAGGATCATTGAAGTTCTTAATGATCTGAAGCGCGCCCGTGGGAGAATTCAAAGCTCAATGAAAATTTGTGAACACATGATTGAAGTCA
GCCCAGTTTGA
mRNA sequenceShow/hide mRNA sequence
GGTAGATCTTAAAGTTCAAAGTATTATGATAAACTTTAACTATATTCCAAGTCCTTGTTTGTCTACTATTTTCGTTGACGCCAAGAAATGGATCAACTGAAAAATGGTAA
AATAACTCCAGTTGAGCACTAGAGAATCGGGTTTGGAAAATAGTTTGATAGACTTGTAAGTACCAAGTAGATCTCAGCATACCTAGTTCAACAAGTTAATTTTAATTTTG
TAACTGAAATAGGAACTGCTTATCCAACTTTGAAGCAACCTTTAAAACTCCTTTAACCTTAGGATTCGATTAAAAACCTTTAATATGCTTAAGTTCCTTTATCAACTTCA
CTAATAAGTGTAAAATTTTTGTTGACTTAGAGGCTTAAAATATAAAAAATGGTGCCTGTTATATTATAATATAGATCGAGCATAAAGTTGAGGGATCTCTCAAGCTCCCT
AAATCTGTGGGTGTCGCTAATTGTAACATAATTTTGGATAACCAACCTCTCCAGACCCTTCCGCACACCTTGCAGGCTCAAGGTCTCTCCTAAAGTTAAAGCCCCACAAT
GAGCCAAATGGGCCGCCACCATTCTCGACTTAAGCCTTATCCCATAAGCTAATTATTTGCCAATCTTTAGTTTTTCTTTTGCTTGGATAGGGATTCATATTTCAAAATTC
ATGTCATCCTACGAAATCAAATTCCAAATCATTAAAAAAAAAAAAAAAAAAGTTGAACTCTGCTGACCTAACCTAGCAGACTTAAACTATCACTCTATAATACATCCAAT
CATATTTTATATTTCAAATTTCTCTATTATCCAAGGTCAGGTAAATGAAATGGATGCTAACTGCTTTTCTGGTCCAATTTGAGCTCGAATTCCCTTTCAATCCAAGCTGT
GGCCAAAAACAGGAGCCAACGCCAACGAAGAGTATTTAATACATCATTGCCTTCTGGTATTATTGAAAGAATGAATGTCTATGATTGAAATTCTAGGTTGAGTTGGGTTT
GATATTTAAAGAGCAATAAATAGATGTTAATAAACGAGAAGGTTTAAACATGGAGGTGGATGAATTGGGTCAGAATTATAATTACCAAACTGCTGCCACTTTTGCAAAGT
ATTTCCAGCATCTGAAGAAGACAGTTTTTCCAAAAAGTTCCTTACCAAAAAAAAAAAAAAAAATTGTTAAAAGAGTTAATCTTTCCACTTTTTCTGACTATTTGTATTGT
AATCCAGACAATTGTGCAAACGCCAAGAAAATGTCACCCTATACCAAGATGGACATCAAATGGAGTGGCAACAAAATGGTGGAAGGTGGTGAAGTTTCTAAAACAATGGA
GTGCCTAAGAAGGAGATTACTAGCAGAGAGGCACGCATCATTACTTGCCAAAGAGGAGGCAGAGGTCATGGACAAAAGGTCATCAGAATTAGAGAAGCAGATCACAAAGC
AGATTCAAATGAAAGCAAGAGCAGAAAAGAAGCTTCAATTACTAAAGAAAAAGCTCGAATCACTTAACCTTTCTTCCACGATGGTGAACTCAGAGGCTTCAATTTCGTCG
GAAATTTGCAACGAAGATGAACCGAAAACCCTAATCGAAGTTCCGCCTCTCCCAAGAAACTCCAAGGGCATTAATGAAATTTCTCATTCGGAAGAAGAGAATCCAAATGC
TCGTGGTTCTACAGCCTCGAACATCTCTGCTTCTAAAATTTACTCCGACGAACCTTCAAAAACGAAGACCGGAAGTTGTGGAAAGGAATTTGATTCCGTCGACGATTCAC
TTGCAATCGCTGCAGTGGACTCGCCGGAAAAATCAGAAACCGGCGAGCAGTTGAAGCCGGTGATCAGTGAAAGGATCATTGAAGTTCTTAATGATCTGAAGCGCGCCCGT
GGGAGAATTCAAAGCTCAATGAAAATTTGTGAACACATGATTGAAGTCAGCCCAGTTTGAACTCAAATGTGTGTGAAAGAAAGGGAATGGGCCTTATAGAAAGGCCCATC
AGACAAAGCCCATCAGATCTTGGTTTAAGTACTTTTTTAATACGGTAATTACAATAGGTAGCAATTTTTAGAATAATTATTAAGTATGTATCAATATTTTAAAAAATTTG
CAAATATAGCAAAATCTATCGGTGATAGACTTCTATCATTGATAGACTCTTATGGTTTATC
Protein sequenceShow/hide protein sequence
MEVDELGQNYNYQTAATFAKYFQHLKKTVFPKSSLPKKKKKIVKRVNLSTFSDYLYCNPDNCANAKKMSPYTKMDIKWSGNKMVEGGEVSKTMECLRRRLLAERHASLLA
KEEAEVMDKRSSELEKQITKQIQMKARAEKKLQLLKKKLESLNLSSTMVNSEASISSEICNEDEPKTLIEVPPLPRNSKGINEISHSEEENPNARGSTASNISASKIYSD
EPSKTKTGSCGKEFDSVDDSLAIAAVDSPEKSETGEQLKPVISERIIEVLNDLKRARGRIQSSMKICEHMIEVSPV