; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc03g23690 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc03g23690
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionUlp1-like peptidase
Genome locationchr3:16765671..16769153
RNA-Seq ExpressionMoc03g23690
SyntenyMoc03g23690
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0008234 - cysteine-type peptidase activity (molecular function)
InterPro domainsIPR003653 - Ulp1 protease family, C-terminal catalytic domain
IPR038765 - Papain-like cysteine peptidase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022142676.1 uncharacterized protein LOC111012732 [Momordica charantia]8.1e-81100Show/hide
Query:  MDMDEDPKTGKEPKTGDEPRMDEDPKNSEEPADVTESNVEMDHAPTIVGATQEVPSGHPSPVDVIEDLTLGKCASDGEASKGQMVNVPTPQPAGPPRKQT
        MDMDEDPKTGKEPKTGDEPRMDEDPKNSEEPADVTESNVEMDHAPTIVGATQEVPSGHPSPVDVIEDLTLGKCASDGEASKGQMVNVPTPQPAGPPRKQT
Subjt:  MDMDEDPKTGKEPKTGDEPRMDEDPKNSEEPADVTESNVEMDHAPTIVGATQEVPSGHPSPVDVIEDLTLGKCASDGEASKGQMVNVPTPQPAGPPRKQT

Query:  NRTESRPLPLSHGGTPHLTVVKVEPEFIEGPLGQGLRKRKYPWKLRAIYTPTGQL
        NRTESRPLPLSHGGTPHLTVVKVEPEFIEGPLGQGLRKRKYPWKLRAIYTPTGQL
Subjt:  NRTESRPLPLSHGGTPHLTVVKVEPEFIEGPLGQGLRKRKYPWKLRAIYTPTGQL

XP_022153247.1 uncharacterized protein LOC111020782 [Momordica charantia]1.1e-8293.71Show/hide
Query:  YDWRQERTIFRYVLGRQSDYDTPWSEADIVYTPMNIGGNHWVMIRIDLVEGDLTVWDSLQSITPLDDLEKALKPMCTIIPAILHWSGMLALRPNLPTVPW
        YDWRQERTIFRYVLGRQSDYDTPWSEADIVYTPMNIGGNHWVMI IDLVEGDLTVWDSLQ+ITPL+DLEKALKPMCTIIP ILHWSGMLALRP L TVPW
Subjt:  YDWRQERTIFRYVLGRQSDYDTPWSEADIVYTPMNIGGNHWVMIRIDLVEGDLTVWDSLQSITPLDDLEKALKPMCTIIPAILHWSGMLALRPNLPTVPW

Query:  RVRRRTVPQQAGFTNCSIFCVRFFEYDVTGSKMDTLTQSNISLFRRQYAVQMWARRPFF
        RVRR TVPQQAGFT+C IFCVRFFEYDVTGSKMDTL QSNISLFRRQYAVQMWARRPFF
Subjt:  RVRRRTVPQQAGFTNCSIFCVRFFEYDVTGSKMDTLTQSNISLFRRQYAVQMWARRPFF

XP_022154965.1 uncharacterized protein LOC111022110 [Momordica charantia]2.3e-15284.05Show/hide
Query:  DRWEVFCNYDWSSMIFERTLWSLKNALKDKVEAYKQKVAMDSSHVEMYSLYGFPYAFQVWAYETISTLSTRVALRLNDDAIPRLLRWSCTYSRAFNVLER
        DRWEVFC+YD SSMIFERTLWSLKNALKDKVEAYKQKVA+DSSHVE YSLYGFPYAFQVWAYETISTLSTRVALRLNDDAIPRLLRWSCTYSRAFNVLER
Subjt:  DRWEVFCNYDWSSMIFERTLWSLKNALKDKVEAYKQKVAMDSSHVEMYSLYGFPYAFQVWAYETISTLSTRVALRLNDDAIPRLLRWSCTYSRAFNVLER

Query:  EIFENVKSKVVVRLEATDVERQHMARVMHPP------------------------KSPVTGEIGDPVELDDVAKDASPVDDHVTEDIIETDGGQDQLLPQ
        E+FENVKSKVVVRLEATDVERQHMARVMHPP                        KSPVT E+GD VELDDVAKDASP+ D VTEDII TDGGQDQLLPQ
Subjt:  EIFENVKSKVVVRLEATDVERQHMARVMHPP------------------------KSPVTGEIGDPVELDDVAKDASPVDDHVTEDIIETDGGQDQLLPQ

Query:  KGTEKKKKKSKHKWSREPRRLGDRVTAIETTLTGMTTDIKDIKKFMKRLTK-----------KGGGPDQDGSSGGRDPSGRNEEDMDMDEDPKTGKEPKT
        KGTEKKKKKSKHKWSRE RRLGDRVTAIETTLTGMTTDIKDIKKFMKRLTK           +GG PDQDGSSGGRDPSGRNEEDMDMDEDPKTGKEPKT
Subjt:  KGTEKKKKKSKHKWSREPRRLGDRVTAIETTLTGMTTDIKDIKKFMKRLTK-----------KGGGPDQDGSSGGRDPSGRNEEDMDMDEDPKTGKEPKT

Query:  GDEPRMDEDPKNSEEPADVTESNVEMDHAPTIVGATQEVPSGHPSPVDVIE
        GDEPRMDEDPK  EEP DV ES+VEMDHAPTIVGATQEVPSGH SPVDVIE
Subjt:  GDEPRMDEDPKNSEEPADVTESNVEMDHAPTIVGATQEVPSGHPSPVDVIE

XP_022157020.1 uncharacterized protein LOC111023847 [Momordica charantia]1.1e-9667.11Show/hide
Query:  MDMTLKINQDDWFPAALSNLAHVGKTSSRLKA-----------------------------------REVDEPRDNLISFNLFGNRFSFGKREFDLITGL
        M+MTLKINQDDWFPAALSNLAHVGKTSSRLKA                                   REV+EP+D+LISFNLFGNR SFGKREFDLITGL
Subjt:  MDMTLKINQDDWFPAALSNLAHVGKTSSRLKA-----------------------------------REVDEPRDNLISFNLFGNRFSFGKREFDLITGL

Query:  RHTINRVDEDVRNRRLRILAFHDGQGKEA--------ENGHEPPWDC------------------------------DRWEVFCNYDWSSMIFERTLWSL
        RHT+NRVDEDVRNRRLRIL F D    +         E+  E   D                               DRWEVFCNYDWSSMIFERTLWSL
Subjt:  RHTINRVDEDVRNRRLRILAFHDGQGKEA--------ENGHEPPWDC------------------------------DRWEVFCNYDWSSMIFERTLWSL

Query:  KNALKDKVEAYKQKVAMDSSHVEMYSLYGFPYAFQVWAYETISTLSTRVALRLNDDAIPRLLRWSCTYSRAFNVLEREIFENVKSKVVVRLEATDVER
        KNALKDKVE YKQKVAMDSSHVE YSLY FPYAFQVWAYETISTLSTRVALRLNDDAIPRLLRWSCTYSRAFNVLERE+FENVKSKVVVRLEATDVER
Subjt:  KNALKDKVEAYKQKVAMDSSHVEMYSLYGFPYAFQVWAYETISTLSTRVALRLNDDAIPRLLRWSCTYSRAFNVLEREIFENVKSKVVVRLEATDVER

XP_022158807.1 uncharacterized protein LOC111025273 [Momordica charantia]5.2e-9693.41Show/hide
Query:  NLLRRTDGPYAAMKPGVLPSKCTYDWRQERTIFRYVLGRQSDYDTPWSEADIVYTPMNIGGNHWVMIRIDLVEGDLTVWDSLQSITPLDDLEKALKPMCT
        NLLRRTDGPYAAMKPGVLPSKCTYDWRQERTIFRYVLGRQSDYDT WSEADIVYT MNIGGNHWVMI IDLVEGDLTVWDSLQ+ITPL+DLEKALKPMCT
Subjt:  NLLRRTDGPYAAMKPGVLPSKCTYDWRQERTIFRYVLGRQSDYDTPWSEADIVYTPMNIGGNHWVMIRIDLVEGDLTVWDSLQSITPLDDLEKALKPMCT

Query:  IIPAILHWSGMLALRPNLPTVPWRVRRRTVPQQAGFTNCSIFCVRFFEYDVTGSKMDTLTQSNISLFRRQYAVQMWARRPFF
        IIPAILHWSG+LALRPNLP VPWRVRR TVPQQAGFT+CSIFCVRFFEYDV GSK+DTL QSNISLFRRQYAVQMWARRPFF
Subjt:  IIPAILHWSGMLALRPNLPTVPWRVRRRTVPQQAGFTNCSIFCVRFFEYDVTGSKMDTLTQSNISLFRRQYAVQMWARRPFF

TrEMBL top hitse value%identityAlignment
A0A6J1CMW6 uncharacterized protein LOC1110127323.9e-81100Show/hide
Query:  MDMDEDPKTGKEPKTGDEPRMDEDPKNSEEPADVTESNVEMDHAPTIVGATQEVPSGHPSPVDVIEDLTLGKCASDGEASKGQMVNVPTPQPAGPPRKQT
        MDMDEDPKTGKEPKTGDEPRMDEDPKNSEEPADVTESNVEMDHAPTIVGATQEVPSGHPSPVDVIEDLTLGKCASDGEASKGQMVNVPTPQPAGPPRKQT
Subjt:  MDMDEDPKTGKEPKTGDEPRMDEDPKNSEEPADVTESNVEMDHAPTIVGATQEVPSGHPSPVDVIEDLTLGKCASDGEASKGQMVNVPTPQPAGPPRKQT

Query:  NRTESRPLPLSHGGTPHLTVVKVEPEFIEGPLGQGLRKRKYPWKLRAIYTPTGQL
        NRTESRPLPLSHGGTPHLTVVKVEPEFIEGPLGQGLRKRKYPWKLRAIYTPTGQL
Subjt:  NRTESRPLPLSHGGTPHLTVVKVEPEFIEGPLGQGLRKRKYPWKLRAIYTPTGQL

A0A6J1DID7 uncharacterized protein LOC1110207825.5e-8393.71Show/hide
Query:  YDWRQERTIFRYVLGRQSDYDTPWSEADIVYTPMNIGGNHWVMIRIDLVEGDLTVWDSLQSITPLDDLEKALKPMCTIIPAILHWSGMLALRPNLPTVPW
        YDWRQERTIFRYVLGRQSDYDTPWSEADIVYTPMNIGGNHWVMI IDLVEGDLTVWDSLQ+ITPL+DLEKALKPMCTIIP ILHWSGMLALRP L TVPW
Subjt:  YDWRQERTIFRYVLGRQSDYDTPWSEADIVYTPMNIGGNHWVMIRIDLVEGDLTVWDSLQSITPLDDLEKALKPMCTIIPAILHWSGMLALRPNLPTVPW

Query:  RVRRRTVPQQAGFTNCSIFCVRFFEYDVTGSKMDTLTQSNISLFRRQYAVQMWARRPFF
        RVRR TVPQQAGFT+C IFCVRFFEYDVTGSKMDTL QSNISLFRRQYAVQMWARRPFF
Subjt:  RVRRRTVPQQAGFTNCSIFCVRFFEYDVTGSKMDTLTQSNISLFRRQYAVQMWARRPFF

A0A6J1DL40 uncharacterized protein LOC1110221101.1e-15284.05Show/hide
Query:  DRWEVFCNYDWSSMIFERTLWSLKNALKDKVEAYKQKVAMDSSHVEMYSLYGFPYAFQVWAYETISTLSTRVALRLNDDAIPRLLRWSCTYSRAFNVLER
        DRWEVFC+YD SSMIFERTLWSLKNALKDKVEAYKQKVA+DSSHVE YSLYGFPYAFQVWAYETISTLSTRVALRLNDDAIPRLLRWSCTYSRAFNVLER
Subjt:  DRWEVFCNYDWSSMIFERTLWSLKNALKDKVEAYKQKVAMDSSHVEMYSLYGFPYAFQVWAYETISTLSTRVALRLNDDAIPRLLRWSCTYSRAFNVLER

Query:  EIFENVKSKVVVRLEATDVERQHMARVMHPP------------------------KSPVTGEIGDPVELDDVAKDASPVDDHVTEDIIETDGGQDQLLPQ
        E+FENVKSKVVVRLEATDVERQHMARVMHPP                        KSPVT E+GD VELDDVAKDASP+ D VTEDII TDGGQDQLLPQ
Subjt:  EIFENVKSKVVVRLEATDVERQHMARVMHPP------------------------KSPVTGEIGDPVELDDVAKDASPVDDHVTEDIIETDGGQDQLLPQ

Query:  KGTEKKKKKSKHKWSREPRRLGDRVTAIETTLTGMTTDIKDIKKFMKRLTK-----------KGGGPDQDGSSGGRDPSGRNEEDMDMDEDPKTGKEPKT
        KGTEKKKKKSKHKWSRE RRLGDRVTAIETTLTGMTTDIKDIKKFMKRLTK           +GG PDQDGSSGGRDPSGRNEEDMDMDEDPKTGKEPKT
Subjt:  KGTEKKKKKSKHKWSREPRRLGDRVTAIETTLTGMTTDIKDIKKFMKRLTK-----------KGGGPDQDGSSGGRDPSGRNEEDMDMDEDPKTGKEPKT

Query:  GDEPRMDEDPKNSEEPADVTESNVEMDHAPTIVGATQEVPSGHPSPVDVIE
        GDEPRMDEDPK  EEP DV ES+VEMDHAPTIVGATQEVPSGH SPVDVIE
Subjt:  GDEPRMDEDPKNSEEPADVTESNVEMDHAPTIVGATQEVPSGHPSPVDVIE

A0A6J1DRZ7 uncharacterized protein LOC1110238475.1e-9767.11Show/hide
Query:  MDMTLKINQDDWFPAALSNLAHVGKTSSRLKA-----------------------------------REVDEPRDNLISFNLFGNRFSFGKREFDLITGL
        M+MTLKINQDDWFPAALSNLAHVGKTSSRLKA                                   REV+EP+D+LISFNLFGNR SFGKREFDLITGL
Subjt:  MDMTLKINQDDWFPAALSNLAHVGKTSSRLKA-----------------------------------REVDEPRDNLISFNLFGNRFSFGKREFDLITGL

Query:  RHTINRVDEDVRNRRLRILAFHDGQGKEA--------ENGHEPPWDC------------------------------DRWEVFCNYDWSSMIFERTLWSL
        RHT+NRVDEDVRNRRLRIL F D    +         E+  E   D                               DRWEVFCNYDWSSMIFERTLWSL
Subjt:  RHTINRVDEDVRNRRLRILAFHDGQGKEA--------ENGHEPPWDC------------------------------DRWEVFCNYDWSSMIFERTLWSL

Query:  KNALKDKVEAYKQKVAMDSSHVEMYSLYGFPYAFQVWAYETISTLSTRVALRLNDDAIPRLLRWSCTYSRAFNVLEREIFENVKSKVVVRLEATDVER
        KNALKDKVE YKQKVAMDSSHVE YSLY FPYAFQVWAYETISTLSTRVALRLNDDAIPRLLRWSCTYSRAFNVLERE+FENVKSKVVVRLEATDVER
Subjt:  KNALKDKVEAYKQKVAMDSSHVEMYSLYGFPYAFQVWAYETISTLSTRVALRLNDDAIPRLLRWSCTYSRAFNVLEREIFENVKSKVVVRLEATDVER

A0A6J1DY60 uncharacterized protein LOC1110252732.5e-9693.41Show/hide
Query:  NLLRRTDGPYAAMKPGVLPSKCTYDWRQERTIFRYVLGRQSDYDTPWSEADIVYTPMNIGGNHWVMIRIDLVEGDLTVWDSLQSITPLDDLEKALKPMCT
        NLLRRTDGPYAAMKPGVLPSKCTYDWRQERTIFRYVLGRQSDYDT WSEADIVYT MNIGGNHWVMI IDLVEGDLTVWDSLQ+ITPL+DLEKALKPMCT
Subjt:  NLLRRTDGPYAAMKPGVLPSKCTYDWRQERTIFRYVLGRQSDYDTPWSEADIVYTPMNIGGNHWVMIRIDLVEGDLTVWDSLQSITPLDDLEKALKPMCT

Query:  IIPAILHWSGMLALRPNLPTVPWRVRRRTVPQQAGFTNCSIFCVRFFEYDVTGSKMDTLTQSNISLFRRQYAVQMWARRPFF
        IIPAILHWSG+LALRPNLP VPWRVRR TVPQQAGFT+CSIFCVRFFEYDV GSK+DTL QSNISLFRRQYAVQMWARRPFF
Subjt:  IIPAILHWSGMLALRPNLPTVPWRVRRRTVPQQAGFTNCSIFCVRFFEYDVTGSKMDTLTQSNISLFRRQYAVQMWARRPFF

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT4G08430.1 Ulp1 protease family protein1.2e-0826.36Show/hide
Query:  EADIVYTPMNIGGNHWVMIRIDLVEGDLTVWDSLQSITPLDDLEKALKPMCTIIPAIL-HWSGMLALRPNLPTVPWRVRRRTVPQQAGFTNCSIFCVRFF
        + D +Y  + + GNHWV + IDL +  + V+DS+ S+T   ++      + T+IPA+L  +      R +   + W+ R   +P+     +C+I+ +++ 
Subjt:  EADIVYTPMNIGGNHWVMIRIDLVEGDLTVWDSLQSITPLDDLEKALKPMCTIIPAIL-HWSGMLALRPNLPTVPWRVRRRTVPQQAGFTNCSIFCVRFF

Query:  EYDVTGSKMDTLTQSNISLFRRQYAVQMW
        E    G   D L   N+     + AV+M+
Subjt:  EYDVTGSKMDTLTQSNISLFRRQYAVQMW

AT5G28235.1 Ulp1 protease family protein5.0e-0440Show/hide
Query:  EADIVYTPMNIGGNHWVMIRIDLVEGDLTVWDSLQSITPLDDLEKALKPM--CTIIPAIL
        + D +Y  + + GNHWV + IDL +  + V+DS+ S+T   D E A++ M   T+IPA+L
Subjt:  EADIVYTPMNIGGNHWVMIRIDLVEGDLTVWDSLQSITPLDDLEKALKPM--CTIIPAIL

AT5G45570.1 Ulp1 protease family protein1.0e-0929.77Show/hide
Query:  EADIVYTPMNIGGNHWVMIRIDLVEGDLTVWDSLQSITPLDDLEKALKPM--CTIIPAIL-HWSGMLALRPNLPTVPWRVRRRTVPQQAGFTNCSIFCVR
        + D +Y  + + GNHWV + IDL    + V+DS+ S+T   D E A++ M   T+IPA+L  +      R +   + W+ R   +P+     +C+I+ ++
Subjt:  EADIVYTPMNIGGNHWVMIRIDLVEGDLTVWDSLQSITPLDDLEKALKPM--CTIIPAIL-HWSGMLALRPNLPTVPWRVRRRTVPQQAGFTNCSIFCVR

Query:  FFEYDVTGSKMDTLTQSNISLFRRQYAVQMW
        + E    G   D L   N+   R + AV+M+
Subjt:  FFEYDVTGSKMDTLTQSNISLFRRQYAVQMW


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAGACCGAGATGGTATCCAAAAAGACCATCCTTCTTTCTGGATTCTATCTCGGGCCTCGTCCCGATCCATTTCGAACCGGGATGAGGCCCGAAATGGAATCCAGAAA
GAAGGATGGACTTTCTGGAATCCATCTTGGGACTCATCCCGGAAACATGGATATGACACTTAAGATCAACCAAGACGACTGGTTCCCGGCCGCGCTGTCAAATCTCGCTC
ACGTAGGGAAAACCTCTTCTCGTCTTAAGGCTAGAGAGGTGGATGAACCTAGAGACAACCTCATTAGCTTTAACCTATTCGGGAATAGGTTCTCTTTTGGGAAGCGGGAG
TTCGACCTAATAACCGGTCTTAGACACACCATTAATAGGGTAGATGAGGATGTTCGTAACCGGAGACTTAGAATTCTAGCTTTCCATGATGGGCAAGGAAAGGAAGCAGA
AAATGGACACGAGCCTCCTTGGGATTGTGATCGGTGGGAAGTTTTCTGTAATTATGACTGGAGTTCAATGATTTTTGAAAGGACTCTCTGGAGCTTGAAGAACGCTCTGA
AGGACAAGGTCGAGGCGTACAAACAGAAGGTCGCTATGGACTCGAGCCATGTTGAGATGTATAGCTTGTATGGGTTTCCATACGCTTTTCAGGTTTGGGCATACGAGACA
ATATCAACCTTGTCGACTCGAGTAGCATTGAGGCTGAATGACGATGCTATTCCTCGTCTACTTAGATGGTCCTGCACCTATTCGCGTGCTTTTAATGTTTTGGAGCGAGA
GATCTTCGAGAACGTCAAGTCGAAGGTTGTAGTTCGTTTGGAGGCGACTGATGTCGAACGACAGCATATGGCTCGCGTTATGCATCCACCAAAGTCTCCCGTTACTGGTG
AGATTGGGGATCCAGTTGAGCTCGATGATGTAGCAAAGGATGCTTCCCCAGTGGATGATCATGTAACAGAAGATATTATTGAGACCGATGGAGGACAAGATCAATTGTTG
CCACAGAAAGGGACGGAGAAGAAGAAGAAGAAGTCGAAGCATAAGTGGAGTCGGGAGCCGCGGAGGCTCGGCGACAGAGTGACGGCCATTGAGACAACTCTGACGGGCAT
GACGACTGACATAAAAGACATAAAGAAGTTTATGAAAAGGCTAACAAAGAAGGGCGGTGGGCCGGATCAGGATGGTTCTTCGGGCGGACGTGATCCGAGTGGGCGTAACG
AGGAGGATATGGACATGGATGAGGATCCGAAGACAGGGAAAGAGCCGAAGACAGGGGACGAGCCGAGGATGGACGAGGATCCGAAGAATTCTGAAGAACCCGCCGACGTC
ACCGAGAGTAACGTGGAGATGGATCACGCTCCTACCATTGTTGGAGCTACCCAGGAGGTCCCAAGTGGCCACCCTAGTCCGGTCGACGTAATTGAGGATCTTACTCTAGG
TAAGTGCGCCAGTGACGGGGAGGCAAGTAAGGGGCAGATGGTTAACGTACCGACACCGCAACCCGCAGGGCCACCGAGAAAGCAAACTAATAGGACAGAAAGTCGACCCC
TACCCTTATCACATGGAGGAACTCCACACTTGACCGTTGTTAAGGTTGAACCTGAGTTTATTGAAGGCCCACTTGGCCAGGGTCTTCGGAAGAGGAAATATCCGTGGAAG
TTGCGGGCCATATACACGCCCACCGGCCAACTGAACTTGCTACGACGAACAGACGGACCATATGCAGCTATGAAGCCAGGTGTCCTGCCGTCGAAATGTACGTATGATTG
GAGGCAAGAGCGTACCATCTTTCGGTACGTGCTTGGTCGACAATCGGACTACGACACCCCATGGAGCGAGGCCGATATCGTGTACACGCCGATGAATATTGGCGGCAACC
ACTGGGTCATGATCAGGATTGATCTTGTGGAGGGTGATTTAACCGTATGGGACTCACTGCAGTCGATCACCCCGTTGGATGATCTCGAGAAGGCGCTCAAGCCAATGTGC
ACGATAATCCCGGCGATTCTTCATTGGAGCGGGATGCTCGCACTTCGGCCTAACCTGCCCACGGTGCCGTGGAGGGTCCGAAGACGTACTGTACCTCAGCAAGCCGGGTT
CACAAATTGCAGCATATTTTGTGTTAGATTTTTCGAGTACGATGTAACTGGGTCAAAGATGGACACTTTGACTCAAAGTAACATTTCTTTATTTCGTCGTCAATATGCTG
TACAAATGTGGGCTCGCAGACCCTTTTTTTAG
mRNA sequenceShow/hide mRNA sequence
ATGAAGACCGAGATGGTATCCAAAAAGACCATCCTTCTTTCTGGATTCTATCTCGGGCCTCGTCCCGATCCATTTCGAACCGGGATGAGGCCCGAAATGGAATCCAGAAA
GAAGGATGGACTTTCTGGAATCCATCTTGGGACTCATCCCGGAAACATGGATATGACACTTAAGATCAACCAAGACGACTGGTTCCCGGCCGCGCTGTCAAATCTCGCTC
ACGTAGGGAAAACCTCTTCTCGTCTTAAGGCTAGAGAGGTGGATGAACCTAGAGACAACCTCATTAGCTTTAACCTATTCGGGAATAGGTTCTCTTTTGGGAAGCGGGAG
TTCGACCTAATAACCGGTCTTAGACACACCATTAATAGGGTAGATGAGGATGTTCGTAACCGGAGACTTAGAATTCTAGCTTTCCATGATGGGCAAGGAAAGGAAGCAGA
AAATGGACACGAGCCTCCTTGGGATTGTGATCGGTGGGAAGTTTTCTGTAATTATGACTGGAGTTCAATGATTTTTGAAAGGACTCTCTGGAGCTTGAAGAACGCTCTGA
AGGACAAGGTCGAGGCGTACAAACAGAAGGTCGCTATGGACTCGAGCCATGTTGAGATGTATAGCTTGTATGGGTTTCCATACGCTTTTCAGGTTTGGGCATACGAGACA
ATATCAACCTTGTCGACTCGAGTAGCATTGAGGCTGAATGACGATGCTATTCCTCGTCTACTTAGATGGTCCTGCACCTATTCGCGTGCTTTTAATGTTTTGGAGCGAGA
GATCTTCGAGAACGTCAAGTCGAAGGTTGTAGTTCGTTTGGAGGCGACTGATGTCGAACGACAGCATATGGCTCGCGTTATGCATCCACCAAAGTCTCCCGTTACTGGTG
AGATTGGGGATCCAGTTGAGCTCGATGATGTAGCAAAGGATGCTTCCCCAGTGGATGATCATGTAACAGAAGATATTATTGAGACCGATGGAGGACAAGATCAATTGTTG
CCACAGAAAGGGACGGAGAAGAAGAAGAAGAAGTCGAAGCATAAGTGGAGTCGGGAGCCGCGGAGGCTCGGCGACAGAGTGACGGCCATTGAGACAACTCTGACGGGCAT
GACGACTGACATAAAAGACATAAAGAAGTTTATGAAAAGGCTAACAAAGAAGGGCGGTGGGCCGGATCAGGATGGTTCTTCGGGCGGACGTGATCCGAGTGGGCGTAACG
AGGAGGATATGGACATGGATGAGGATCCGAAGACAGGGAAAGAGCCGAAGACAGGGGACGAGCCGAGGATGGACGAGGATCCGAAGAATTCTGAAGAACCCGCCGACGTC
ACCGAGAGTAACGTGGAGATGGATCACGCTCCTACCATTGTTGGAGCTACCCAGGAGGTCCCAAGTGGCCACCCTAGTCCGGTCGACGTAATTGAGGATCTTACTCTAGG
TAAGTGCGCCAGTGACGGGGAGGCAAGTAAGGGGCAGATGGTTAACGTACCGACACCGCAACCCGCAGGGCCACCGAGAAAGCAAACTAATAGGACAGAAAGTCGACCCC
TACCCTTATCACATGGAGGAACTCCACACTTGACCGTTGTTAAGGTTGAACCTGAGTTTATTGAAGGCCCACTTGGCCAGGGTCTTCGGAAGAGGAAATATCCGTGGAAG
TTGCGGGCCATATACACGCCCACCGGCCAACTGAACTTGCTACGACGAACAGACGGACCATATGCAGCTATGAAGCCAGGTGTCCTGCCGTCGAAATGTACGTATGATTG
GAGGCAAGAGCGTACCATCTTTCGGTACGTGCTTGGTCGACAATCGGACTACGACACCCCATGGAGCGAGGCCGATATCGTGTACACGCCGATGAATATTGGCGGCAACC
ACTGGGTCATGATCAGGATTGATCTTGTGGAGGGTGATTTAACCGTATGGGACTCACTGCAGTCGATCACCCCGTTGGATGATCTCGAGAAGGCGCTCAAGCCAATGTGC
ACGATAATCCCGGCGATTCTTCATTGGAGCGGGATGCTCGCACTTCGGCCTAACCTGCCCACGGTGCCGTGGAGGGTCCGAAGACGTACTGTACCTCAGCAAGCCGGGTT
CACAAATTGCAGCATATTTTGTGTTAGATTTTTCGAGTACGATGTAACTGGGTCAAAGATGGACACTTTGACTCAAAGTAACATTTCTTTATTTCGTCGTCAATATGCTG
TACAAATGTGGGCTCGCAGACCCTTTTTTTAG
Protein sequenceShow/hide protein sequence
MKTEMVSKKTILLSGFYLGPRPDPFRTGMRPEMESRKKDGLSGIHLGTHPGNMDMTLKINQDDWFPAALSNLAHVGKTSSRLKAREVDEPRDNLISFNLFGNRFSFGKRE
FDLITGLRHTINRVDEDVRNRRLRILAFHDGQGKEAENGHEPPWDCDRWEVFCNYDWSSMIFERTLWSLKNALKDKVEAYKQKVAMDSSHVEMYSLYGFPYAFQVWAYET
ISTLSTRVALRLNDDAIPRLLRWSCTYSRAFNVLEREIFENVKSKVVVRLEATDVERQHMARVMHPPKSPVTGEIGDPVELDDVAKDASPVDDHVTEDIIETDGGQDQLL
PQKGTEKKKKKSKHKWSREPRRLGDRVTAIETTLTGMTTDIKDIKKFMKRLTKKGGGPDQDGSSGGRDPSGRNEEDMDMDEDPKTGKEPKTGDEPRMDEDPKNSEEPADV
TESNVEMDHAPTIVGATQEVPSGHPSPVDVIEDLTLGKCASDGEASKGQMVNVPTPQPAGPPRKQTNRTESRPLPLSHGGTPHLTVVKVEPEFIEGPLGQGLRKRKYPWK
LRAIYTPTGQLNLLRRTDGPYAAMKPGVLPSKCTYDWRQERTIFRYVLGRQSDYDTPWSEADIVYTPMNIGGNHWVMIRIDLVEGDLTVWDSLQSITPLDDLEKALKPMC
TIIPAILHWSGMLALRPNLPTVPWRVRRRTVPQQAGFTNCSIFCVRFFEYDVTGSKMDTLTQSNISLFRRQYAVQMWARRPFF