CuGenDBv2

Moc01g34640 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc01g34640
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: Ulp1-like peptidase
Genome location: chr1:24512621..24515819
RNA-Seq Expression: Moc01g34640
Synteny: Moc01g34640
Gene Ontology terms:
    GO:0006508 - proteolysis (biological process)
    GO:0008234 - cysteine-type peptidase activity (molecular function)
InterPro domains:
    IPR003653 - Ulp1 protease family, C-terminal catalytic domain
    IPR015410 - Domain of unknown function DUF1985
    IPR038765 - Papain-like cysteine peptidase superfamily
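
The genome location above can be split into its parts programmatically. The following is a minimal Python sketch, not part of the database record. It assumes the `chrom:start..end` layout shown on this page and assumes the coordinates are 1-based and inclusive (the page does not state the convention). `parse_location` is a hypothetical helper name.

```python
import re

def parse_location(location):
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    chrom, start, end = match.groups()
    return chrom, int(start), int(end)

chrom, start, end = parse_location("chr1:24512621..24515819")
# Span length, assuming both endpoints are included (1-based inclusive).
span = end - start + 1
print(chrom, start, end, span)
```

Under the inclusive-coordinate assumption, the span is simply `end - start + 1`. With half-open coordinates it would be one base shorter.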


Homology
GenBank top hits (hit | e value | % identity; alignment below each hit)
XP_022154561.1 uncharacterized protein LOC111021802 [Momordica charantia] | 4.3e-121 | 63.41
Query:  MFRKTVFGHLLDLDLVFNGPLIHNILLREIEDSTPDTISFNLFGNKVSFGRREFDIISGLKYDRSPVRKDTSPHRLRALYFNDSNDVLLSEFEKIYLAAR
        MFRKT F HLLD+DLVFNG LIHNILLRE+E+STP+TISFNLF  ++SF R +F +ISGLKY R+PVR++T PHRL  LYFND  D++LS+FEK+Y AAR
Subjt:  MFRKTVFGHLLDLDLVFNGPLIHNILLREIEDSTPDTISFNLFGNKVSFGRREFDIISGLKYDRSPVRKDTSPHRLRALYFNDSNDVLLSEFEKIYLAAR

Query:  FEDDFDAIKISIVYLVELVLLGRERTLKYDYTLLGIVDDWETCCNHDWGMLSFDKTIYSLKRGPTKRSKDGRFRKSYSLYGFPWAFQVWAYETISSLSGR
        FEDD+D +K+ IVY+V + LLGRER +K+D+TLLGIVDDWE CCN++W  LSF+KTI SL+RGP K SKDG+ RKSYSLYGFPW FQVWAY+TISSLS R
Subjt:  FEDDFDAIKISIVYLVELVLLGRERTLKYDYTLLGIVDDWETCCNHDWGMLSFDKTIYSLKRGPTKRSKDGRFRKSYSLYGFPWAFQVWAYETISSLSGR

Query:  VANKVNPDAVPQILRWRCGHSTAWHVLDREIFRSSTGRTRSIEATDAETTFLDRTFEPPEPEDEDEVPRENDAVEPSSARVGPEKDDERQGGNVDEGVRE
        VANKV  D VP I +WR  HSTAWHVLDR+IF S+ GRTR+++ TD ET+FL+R+F+PP  +D+D +    D   PS+ R G + DDE +G +V     E
Subjt:  VANKVNPDAVPQILRWRCGHSTAWHVLDREIFRSSTGRTRSIEATDAETTFLDRTFEPPEPEDEDEVPRENDAVEPSSARVGPEKDDERQGGNVDEGVRE

Query:  DVYEEAEEEEENGRGK-KVRISSVRLKKVEKRMKCMDKRMDNGFEGIKAELKSIRKFL
         V ++AE E E  +GK KV IS+ RLK+VEK +K MDKRMD     I+AELKSI+KFL
Subjt:  DVYEEAEEEEENGRGK-KVRISSVRLKKVEKRMKCMDKRMDNGFEGIKAELKSIRKFL

XP_022156465.1 uncharacterized protein LOC111023353 [Momordica charantia] | 2.6e-86 | 59.14
Query:  MFRKTVFGHLLDLDLVFNGPLIHNILLREIEDSTPDTISFNLFGNKVSFGRREFDIISGLKYDRSPVRKDTSPHRLRALYFNDSNDVLLSEFEKIYLAAR
        MFRKT+FGHLLD+DLVFNGPLIHNILLRE+EDSTP+TISFNLFG +VSFGRREFD+ISGL YDRSPVRK T  H+LR LYFND  + +LS+F K+Y+AA 
Subjt:  MFRKTVFGHLLDLDLVFNGPLIHNILLREIEDSTPDTISFNLFGNKVSFGRREFDIISGLKYDRSPVRKDTSPHRLRALYFNDSNDVLLSEFEKIYLAAR

Query:  FEDDFDAIKISIVYLVELVLLGRERTLKYDYTLLGIVDDWETCCNHDWGMLSFDKTIYSLKRGPTKRSKDGRFRKSYSLYGFPWAFQVWAYETISSLSGR
        F+DDFD IK+SI+Y+VELVLLGRE T+K+D  LLG+VDDWE CCNHD   LSFDKTI SL RGPT  +KD   RKSYSLYGFPW FQVW YE        
Subjt:  FEDDFDAIKISIVYLVELVLLGRERTLKYDYTLLGIVDDWETCCNHDWGMLSFDKTIYSLKRGPTKRSKDGRFRKSYSLYGFPWAFQVWAYETISSLSGR

Query:  VANKVNPDAVPQILRWRCGHSTAWHVLDREIFRSSTGRTRSIEATDAETTFLDRTFEPPEPEDEDEVPRENDAVEPSSARVGPEKDDERQGGNV-DEGVR
                                             RTR +EATDAET F+ RTFEPPEPED+D   R+ DA  PS+ R G +  D  +G +     VR
Subjt:  VANKVNPDAVPQILRWRCGHSTAWHVLDREIFRSSTGRTRSIEATDAETTFLDRTFEPPEPEDEDEVPRENDAVEPSSARVGPEKDDERQGGNV-DEGVR

Query:  E
        E
Subjt:  E

XP_022158744.1 uncharacterized protein LOC111025209 [Momordica charantia] | 1.2e-96 | 66.06
Query:  MVPKIPSSSYASANLTCLSHLAKTTAAIKSKLTPPQLRMFRKTVFGHLLDLDLVFNGPLIHNILLREIEDSTPDTISFNLFGNKVSFGRREFDIISGLKY
        M+PKI  ++YASA L CLSH+AKT+  IK+KLTP QL MFRKT+F HLLD+DLVFNGPL+                     G KVSFGRREFDIISGLKY
Subjt:  MVPKIPSSSYASANLTCLSHLAKTTAAIKSKLTPPQLRMFRKTVFGHLLDLDLVFNGPLIHNILLREIEDSTPDTISFNLFGNKVSFGRREFDIISGLKY

Query:  DRSPVRKDTSPHRLRALYFNDSNDVLLSEFEKIYLAARFEDDFDAIKISIVYLVELVLLGRERTLKYDYTLLGIVDDWETCCNHDWGMLSFDKTIYSLKR
         RSPVRK T P R   LYFN+S D+LLSE EK+Y + RFEDD DA+K+ +VY VELVLLGRER+ K+D+ LLGIVDDWE CCNHDW +LSFDKTIYSL+R
Subjt:  DRSPVRKDTSPHRLRALYFNDSNDVLLSEFEKIYLAARFEDDFDAIKISIVYLVELVLLGRERTLKYDYTLLGIVDDWETCCNHDWGMLSFDKTIYSLKR

Query:  GPTKRSKDGRFRKSYSLYGFPWAFQVWAYETISSLSGRVANKVNPDAVPQILRWRCGHSTAWHVLDREIFRSST
        G + +SK+G  RKSYSLYGFPWAFQVWAYE ISSLSG +   V+ D VP+IL+WR  HSTA+H+L REIFRSST
Subjt:  GPTKRSKDGRFRKSYSLYGFPWAFQVWAYETISSLSGRVANKVNPDAVPQILRWRCGHSTAWHVLDREIFRSST

XP_022159061.1 uncharacterized protein LOC111025501 [Momordica charantia] | 1.6e-96 | 97.84
Query:  MVPKIPSSSYASANLTCLSHLAKTTAAIKSKLTPPQLRMFRKTVFGHLLDLDLVFNGPLIHNILLREIEDSTPDTISFNLFGNKVSFGRREFDIISGLKY
        MVPKIPSSSYASANLTCLSHLAKTTAAIKSKLTPPQLRMFRKTVFGHLLDLDLVFNGPLIHNILLREIE STPDTISFNLFGNK SFGR EFDIISGLKY
Subjt:  MVPKIPSSSYASANLTCLSHLAKTTAAIKSKLTPPQLRMFRKTVFGHLLDLDLVFNGPLIHNILLREIEDSTPDTISFNLFGNKVSFGRREFDIISGLKY

Query:  DRSPVRKDTSPHRLRALYFNDSNDVLLSEFEKIYLAARFEDDFDAIKISIVYLVELVLLGRERTLKYDYTLLGIVDDWETCCNHD
        DRSPVRKDTSPHRLRALYFNDSNDVLLSEFEKIYLAARFEDDFDA+KISIVYLVELVLLGRERTLKYDYTLLGIVDDWETCCNHD
Subjt:  DRSPVRKDTSPHRLRALYFNDSNDVLLSEFEKIYLAARFEDDFDAIKISIVYLVELVLLGRERTLKYDYTLLGIVDDWETCCNHD

XP_022159253.1 uncharacterized protein LOC111025666 [Momordica charantia] | 4.7e-104 | 88.74
Query:  MVPKIPSSSYASANLTCLSHLAKTTAAIKSKLTPPQLRMFRKTVFGHLLDLDLVFNGPLIHNILLREIED-STPDTISFNLFGNKVSFGRREFDIISGLK
        MV KIP SS ASANLT LSHLAKTT AIKSKLTPPQL MFRKTVFGHLLDLDLVFN  LIH ILLREIED STP+TISFNLFG+KV F RREFDIISGLK
Subjt:  MVPKIPSSSYASANLTCLSHLAKTTAAIKSKLTPPQLRMFRKTVFGHLLDLDLVFNGPLIHNILLREIED-STPDTISFNLFGNKVSFGRREFDIISGLK

Query:  YDRSPVRKDTSPHRLRALYFNDSNDVLLSEFEKIYLAARFEDDFDAIKISIVYLVELVLLGRERTLKYDYTLLGIVDDWETCCNHDWGMLSFDKTIYSLK
        YDRSPVRKDTSPHRLRALYFNDSND+LLS+FEKIY+  RFEDDFDA KISIVYL+ELVLLGRERTLKYDYTLLGIVDD ETCCNHDWGM+SFDKTIYSLK
Subjt:  YDRSPVRKDTSPHRLRALYFNDSNDVLLSEFEKIYLAARFEDDFDAIKISIVYLVELVLLGRERTLKYDYTLLGIVDDWETCCNHDWGMLSFDKTIYSLK

Query:  RGPTKRSKDGRFRKSYSLYGFP
        RGPTKRSKDG FRK YSLYGFP
Subjt:  RGPTKRSKDGRFRKSYSLYGFP

TrEMBL top hits (hit | e value | % identity; alignment below each hit)
A0A6J1DP34 uncharacterized protein LOC111021802 | 2.1e-121 | 63.41
Query:  MFRKTVFGHLLDLDLVFNGPLIHNILLREIEDSTPDTISFNLFGNKVSFGRREFDIISGLKYDRSPVRKDTSPHRLRALYFNDSNDVLLSEFEKIYLAAR
        MFRKT F HLLD+DLVFNG LIHNILLRE+E+STP+TISFNLF  ++SF R +F +ISGLKY R+PVR++T PHRL  LYFND  D++LS+FEK+Y AAR
Subjt:  MFRKTVFGHLLDLDLVFNGPLIHNILLREIEDSTPDTISFNLFGNKVSFGRREFDIISGLKYDRSPVRKDTSPHRLRALYFNDSNDVLLSEFEKIYLAAR

Query:  FEDDFDAIKISIVYLVELVLLGRERTLKYDYTLLGIVDDWETCCNHDWGMLSFDKTIYSLKRGPTKRSKDGRFRKSYSLYGFPWAFQVWAYETISSLSGR
        FEDD+D +K+ IVY+V + LLGRER +K+D+TLLGIVDDWE CCN++W  LSF+KTI SL+RGP K SKDG+ RKSYSLYGFPW FQVWAY+TISSLS R
Subjt:  FEDDFDAIKISIVYLVELVLLGRERTLKYDYTLLGIVDDWETCCNHDWGMLSFDKTIYSLKRGPTKRSKDGRFRKSYSLYGFPWAFQVWAYETISSLSGR

Query:  VANKVNPDAVPQILRWRCGHSTAWHVLDREIFRSSTGRTRSIEATDAETTFLDRTFEPPEPEDEDEVPRENDAVEPSSARVGPEKDDERQGGNVDEGVRE
        VANKV  D VP I +WR  HSTAWHVLDR+IF S+ GRTR+++ TD ET+FL+R+F+PP  +D+D +    D   PS+ R G + DDE +G +V     E
Subjt:  VANKVNPDAVPQILRWRCGHSTAWHVLDREIFRSSTGRTRSIEATDAETTFLDRTFEPPEPEDEDEVPRENDAVEPSSARVGPEKDDERQGGNVDEGVRE

Query:  DVYEEAEEEEENGRGK-KVRISSVRLKKVEKRMKCMDKRMDNGFEGIKAELKSIRKFL
         V ++AE E E  +GK KV IS+ RLK+VEK +K MDKRMD     I+AELKSI+KFL
Subjt:  DVYEEAEEEEENGRGK-KVRISSVRLKKVEKRMKCMDKRMDNGFEGIKAELKSIRKFL

A0A6J1DQC8 uncharacterized protein LOC111023353 | 1.3e-86 | 59.14
Query:  MFRKTVFGHLLDLDLVFNGPLIHNILLREIEDSTPDTISFNLFGNKVSFGRREFDIISGLKYDRSPVRKDTSPHRLRALYFNDSNDVLLSEFEKIYLAAR
        MFRKT+FGHLLD+DLVFNGPLIHNILLRE+EDSTP+TISFNLFG +VSFGRREFD+ISGL YDRSPVRK T  H+LR LYFND  + +LS+F K+Y+AA 
Subjt:  MFRKTVFGHLLDLDLVFNGPLIHNILLREIEDSTPDTISFNLFGNKVSFGRREFDIISGLKYDRSPVRKDTSPHRLRALYFNDSNDVLLSEFEKIYLAAR

Query:  FEDDFDAIKISIVYLVELVLLGRERTLKYDYTLLGIVDDWETCCNHDWGMLSFDKTIYSLKRGPTKRSKDGRFRKSYSLYGFPWAFQVWAYETISSLSGR
        F+DDFD IK+SI+Y+VELVLLGRE T+K+D  LLG+VDDWE CCNHD   LSFDKTI SL RGPT  +KD   RKSYSLYGFPW FQVW YE        
Subjt:  FEDDFDAIKISIVYLVELVLLGRERTLKYDYTLLGIVDDWETCCNHDWGMLSFDKTIYSLKRGPTKRSKDGRFRKSYSLYGFPWAFQVWAYETISSLSGR

Query:  VANKVNPDAVPQILRWRCGHSTAWHVLDREIFRSSTGRTRSIEATDAETTFLDRTFEPPEPEDEDEVPRENDAVEPSSARVGPEKDDERQGGNV-DEGVR
                                             RTR +EATDAET F+ RTFEPPEPED+D   R+ DA  PS+ R G +  D  +G +     VR
Subjt:  VANKVNPDAVPQILRWRCGHSTAWHVLDREIFRSSTGRTRSIEATDAETTFLDRTFEPPEPEDEDEVPRENDAVEPSSARVGPEKDDERQGGNV-DEGVR

Query:  E
        E
Subjt:  E

A0A6J1DYB1 uncharacterized protein LOC111025666 | 2.3e-104 | 88.74
Query:  MVPKIPSSSYASANLTCLSHLAKTTAAIKSKLTPPQLRMFRKTVFGHLLDLDLVFNGPLIHNILLREIED-STPDTISFNLFGNKVSFGRREFDIISGLK
        MV KIP SS ASANLT LSHLAKTT AIKSKLTPPQL MFRKTVFGHLLDLDLVFN  LIH ILLREIED STP+TISFNLFG+KV F RREFDIISGLK
Subjt:  MVPKIPSSSYASANLTCLSHLAKTTAAIKSKLTPPQLRMFRKTVFGHLLDLDLVFNGPLIHNILLREIED-STPDTISFNLFGNKVSFGRREFDIISGLK

Query:  YDRSPVRKDTSPHRLRALYFNDSNDVLLSEFEKIYLAARFEDDFDAIKISIVYLVELVLLGRERTLKYDYTLLGIVDDWETCCNHDWGMLSFDKTIYSLK
        YDRSPVRKDTSPHRLRALYFNDSND+LLS+FEKIY+  RFEDDFDA KISIVYL+ELVLLGRERTLKYDYTLLGIVDD ETCCNHDWGM+SFDKTIYSLK
Subjt:  YDRSPVRKDTSPHRLRALYFNDSNDVLLSEFEKIYLAARFEDDFDAIKISIVYLVELVLLGRERTLKYDYTLLGIVDDWETCCNHDWGMLSFDKTIYSLK

Query:  RGPTKRSKDGRFRKSYSLYGFP
        RGPTKRSKDG FRK YSLYGFP
Subjt:  RGPTKRSKDGRFRKSYSLYGFP

A0A6J1E0A9 uncharacterized protein LOC111025209 | 6.0e-97 | 66.06
Query:  MVPKIPSSSYASANLTCLSHLAKTTAAIKSKLTPPQLRMFRKTVFGHLLDLDLVFNGPLIHNILLREIEDSTPDTISFNLFGNKVSFGRREFDIISGLKY
        M+PKI  ++YASA L CLSH+AKT+  IK+KLTP QL MFRKT+F HLLD+DLVFNGPL+                     G KVSFGRREFDIISGLKY
Subjt:  MVPKIPSSSYASANLTCLSHLAKTTAAIKSKLTPPQLRMFRKTVFGHLLDLDLVFNGPLIHNILLREIEDSTPDTISFNLFGNKVSFGRREFDIISGLKY

Query:  DRSPVRKDTSPHRLRALYFNDSNDVLLSEFEKIYLAARFEDDFDAIKISIVYLVELVLLGRERTLKYDYTLLGIVDDWETCCNHDWGMLSFDKTIYSLKR
         RSPVRK T P R   LYFN+S D+LLSE EK+Y + RFEDD DA+K+ +VY VELVLLGRER+ K+D+ LLGIVDDWE CCNHDW +LSFDKTIYSL+R
Subjt:  DRSPVRKDTSPHRLRALYFNDSNDVLLSEFEKIYLAARFEDDFDAIKISIVYLVELVLLGRERTLKYDYTLLGIVDDWETCCNHDWGMLSFDKTIYSLKR

Query:  GPTKRSKDGRFRKSYSLYGFPWAFQVWAYETISSLSGRVANKVNPDAVPQILRWRCGHSTAWHVLDREIFRSST
        G + +SK+G  RKSYSLYGFPWAFQVWAYE ISSLSG +   V+ D VP+IL+WR  HSTA+H+L REIFRSST
Subjt:  GPTKRSKDGRFRKSYSLYGFPWAFQVWAYETISSLSGRVANKVNPDAVPQILRWRCGHSTAWHVLDREIFRSST

A0A6J1E2S4 uncharacterized protein LOC111025501 | 7.9e-97 | 97.84
Query:  MVPKIPSSSYASANLTCLSHLAKTTAAIKSKLTPPQLRMFRKTVFGHLLDLDLVFNGPLIHNILLREIEDSTPDTISFNLFGNKVSFGRREFDIISGLKY
        MVPKIPSSSYASANLTCLSHLAKTTAAIKSKLTPPQLRMFRKTVFGHLLDLDLVFNGPLIHNILLREIE STPDTISFNLFGNK SFGR EFDIISGLKY
Subjt:  MVPKIPSSSYASANLTCLSHLAKTTAAIKSKLTPPQLRMFRKTVFGHLLDLDLVFNGPLIHNILLREIEDSTPDTISFNLFGNKVSFGRREFDIISGLKY

Query:  DRSPVRKDTSPHRLRALYFNDSNDVLLSEFEKIYLAARFEDDFDAIKISIVYLVELVLLGRERTLKYDYTLLGIVDDWETCCNHD
        DRSPVRKDTSPHRLRALYFNDSNDVLLSEFEKIYLAARFEDDFDA+KISIVYLVELVLLGRERTLKYDYTLLGIVDDWETCCNHD
Subjt:  DRSPVRKDTSPHRLRALYFNDSNDVLLSEFEKIYLAARFEDDFDAIKISIVYLVELVLLGRERTLKYDYTLLGIVDDWETCCNHD

SwissProt top hits (hit | e value | % identity; alignment below each hit)
No hits found

Arabidopsis top hits (hit | e value | % identity; alignment below each hit)
AT1G31150.1 Domain of unknown function (DUF1985) | 4.4e-07 | 20.93
Query:  RKTVFGHLLDLDLV---FNGPLIHNILLREIEDSTPDTISFNLFGNKVSFGRREFDIISGLKYDRSPVRKDTSPHR------LRALYFNDSNDVLLSEFE
        + + FG L +  +     +G LIH +L R++       + F   G+ + F  REF I++GL+  + P   +   H+      +    F +   V + +  
Subjt:  RKTVFGHLLDLDLV---FNGPLIHNILLREIEDSTPDTISFNLFGNKVSFGRREFDIISGLKYDRSPVRKDTSPHR------LRALYFNDSNDVLLSEFE

Query:  KIYLAARFEDDFDAIKISIVYLVELVLLGRERTLKYDYTLLGIVDDWETCCNHDWGMLSFDKTIYSLKRGPTK-------RSKDGRFRKSYSLYGFPWAF
        ++ L  +    +  + ++++ +V+ V++  +++       + +++D +    + WG  +F  TI   + GP K       + K    +K+ + YGFP A 
Subjt:  KIYLAARFEDDFDAIKISIVYLVELVLLGRERTLKYDYTLLGIVDDWETCCNHDWGMLSFDKTIYSLKRGPTK-------RSKDGRFRKSYSLYGFPWAF

Query:  QVWAYETISSLSGRV
        Q+  +E+I  +  R+
Subjt:  QVWAYETISSLSGRV

AT5G45570.1 Ulp1 protease family protein | 1.3e-11 | 29.37
Query:  VYLSYNIGGNHWIMLHIDLQEGEIIVWDSMRSMTPFPALESELRPMAVVLPALMHR-SGVQVLRPTLPNTPWPIRQVTSAPQQSGSGDCGMFCVKYFEYD
        +Y    + GNHW+ L IDL    + V+DS+ S+T    +  +   +  ++PA++      +  R +     W  +++T  P+    GDC ++ +KY E  
Subjt:  VYLSYNIGGNHWIMLHIDLQEGEIIVWDSMRSMTPFPALESELRPMAVVLPALMHR-SGVQVLRPTLPNTPWPIRQVTSAPQQSGSGDCGMFCVKYFEYD

Query:  VTGSNMTGLTQDNMSFFREKLAIEMW
          G +  GL  +NM   R KLA+EM+
Subjt:  VTGSNMTGLTQDNMSFFREKLAIEMW
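
In the alignment blocks above, the unlabeled middle row follows the usual BLAST pairwise convention: a letter marks an identical residue, `+` marks a similar one, and a space marks a mismatch or gap. As a rough Python sketch (not taken from the database), identities in one segment can be counted from the query row and that middle row. The example uses the last segment of the XP_022159253.1 alignment. `count_identities` is a hypothetical helper, and it assumes the two rows have equal length, so trailing spaces stripped from a middle row would need to be padded back first.

```python
def count_identities(query, midline):
    """Count positions where the midline repeats the query residue."""
    if len(query) != len(midline):
        raise ValueError("query and midline must have the same length")
    # A gap character in the query never counts as an identity.
    return sum(1 for q, m in zip(query, midline) if m == q and q != "-")

query   = "RGPTKRSKDGRFRKSYSLYGFP"
midline = "RGPTKRSKDG FRK YSLYGFP"
identical = count_identities(query, midline)
print(f"{identical}/{len(query)} identical in this segment")
```

This counts a single segment only. The % identity reported for each hit covers the whole alignment, so per-segment values will generally differ from it.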


Sequences
CDS sequence
ATGGAAATGGTACCTAAAATCCCTTCCTCATCCTATGCCTCCGCCAACTTGACATGTCTATCGCACTTAGCGAAGACTACCGCAGCCATTAAAAGTAAATTGACC
CCACCACAACTTAGGATGTTTAGGAAAACTGTATTCGGTCATCTCCTTGACTTGGATCTCGTATTTAACGGGCCATTGATACACAACATCCTACTTAGGGAGATC
GAGGATAGTACACCTGACACCATTAGCTTCAACCTATTTGGTAATAAGGTGTCATTTGGGCGGAGGGAGTTTGACATTATTTCTGGGCTTAAGTATGATAGGAGT
CCAGTTAGAAAAGACACATCTCCCCACAGACTCAGGGCTCTTTACTTTAATGATAGCAACGACGTTCTCTTGAGTGAATTTGAGAAGATTTATTTAGCCGCACGG
TTCGAGGACGACTTCGACGCGATCAAGATATCTATTGTTTACTTAGTAGAGTTAGTTCTGTTGGGGAGAGAGAGGACTCTAAAGTACGACTATACATTGCTGGGA
ATAGTCGATGATTGGGAAACGTGCTGCAACCACGATTGGGGGATGCTGTCCTTTGATAAGACTATATATAGTCTGAAACGCGGCCCGACGAAGAGGTCGAAGGAT
GGTCGGTTCAGGAAATCGTACAGTCTCTACGGTTTCCCTTGGGCGTTCCAGGTGTGGGCCTACGAGACTATATCTTCCTTATCTGGGCGTGTGGCCAATAAGGTA
AATCCGGACGCCGTGCCACAGATTCTTCGGTGGAGGTGCGGCCATTCAACTGCATGGCATGTGCTGGACAGAGAGATTTTCCGATCTAGCACAGGAAGAACTCGA
TCAATAGAAGCAACGGATGCTGAGACGACCTTCTTAGATAGGACGTTCGAACCACCGGAGCCCGAAGATGAGGACGAAGTCCCACGCGAGAATGATGCTGTTGAA
CCATCAAGTGCACGTGTAGGACCAGAGAAGGATGACGAGAGACAAGGAGGGAACGTCGACGAAGGTGTCAGAGAAGACGTCTATGAAGAGGCTGAGGAGGAGGAG
GAGAATGGACGTGGTAAGAAGGTACGCATATCTAGTGTGCGTCTGAAGAAGGTTGAGAAACGGATGAAGTGCATGGACAAGCGCATGGATAACGGGTTCGAAGGT
ATTAAGGCCGAATTAAAATCCATCCGGAAGTTCTTGCGAAGAATCGCTAAGGGTTTACCTGTCGACCCGAATGACATGAGAAGAGGCAGCAGCGGTGATGGTACT
GGGCAAGGAGATGGTCCGAGTGATGGTTCTGGGCCAAGAGATGGTCCGAGTGATGCACCCAGTGGGGGACGTAGTGATCGAGGTGATGGTCCTAAGAATGGCAAT
GGTTCTACACCATCTCCTAGGGACGTGGACGACACAGATGACATGATCATTGATCCCCCCTCATGTGATCGCGCACGAGACAAAGGAACATCAGTCCGGCCAAAC
GACCGGGGACCAGGTTACTCTGCCATGTATCAACAACATTCACCTGCCTTGACGCGCAAGGAAGACATGGGTACAGAGGACGTGTTCAAGGAAGATATCGGTCAA
CATGCGCGTTGTGAGGCTGCCCCTCTCGAGCAAACTCCTGTTCAAAGCAGACAGGTGGACCATATTACGATCGATTCGCATCCGTTAGAGTCATCTATGGACGAC
GAGGACGAATACGCCGAGGACTTCACAGACTCTGATGCGGAAGGGCCGAGAGCGACGTCGCAACTAGACCTGGATGAGGTATGTGTGCTATCGCAGCCCGTTGAA
CGTCAGAACCCTCGGCGGGGGTCTCGGAAGAGGAAGCTCCCATGGAAGCTTCGGGGGTCGTTTAATGTCATGGTGGACAGGAAGAAGGTAATGCGATATGACCCA
CTAGTCCACGTCCCCTCTGAACAAGTTCAGAAGTTTCATGTTTGGCTGGCTAACCCTAAAACTGACCGCGCCACTCGCAAATCATGCTACGGTGATCGAGGAAAG
ACATGGTTCCGTGACCTTATCAACTCGGGCAAGTGGATGACGAGTGAGGTGATCGATTCGTTGTTCATGTTCACGCGGAACAAACTCGAGCAACGGCAGGACTTG
TGTTCTCGAAGATTCACCACCGGTGACATTGTCCTTGCGAACTTTCTTCGACGAACATACGGACTATATCAACGCATGACAGCCCCGAACGCTGTACCTGCGAGA
GTTGCAGCGGAATACGATTGGGCTGGGAAGTATTCTGTTTACCTGTCATACAACATCGGTGGCAACCATTGGATTATGCTACACATCGATCTACAGGAGGGTGAG
ATCATTGTGTGGGATTCGATGAGGTCGATGACACCCTTTCCAGCTCTGGAGTCCGAGTTGAGGCCGATGGCTGTTGTCCTACCGGCGTTGATGCACAGGTCCGGT
GTTCAGGTACTGAGGCCGACACTACCGAATACGCCATGGCCCATTCGTCAAGTAACGTCTGCGCCCCAGCAAAGCGGGTCTGGTGACTGTGGGATGTTTTGCGTT
AAATATTTCGAGTATGATGTAACAGGGTCAAATATGACCGGCTTAACTCAGGACAACATGAGTTTCTTTAGGGAAAAATTGGCCATAGAAATGTGGGCAAACCGA
TCTATATTTTGA
mRNA sequence
ATGGAAATGGTACCTAAAATCCCTTCCTCATCCTATGCCTCCGCCAACTTGACATGTCTATCGCACTTAGCGAAGACTACCGCAGCCATTAAAAGTAAATTGACC
CCACCACAACTTAGGATGTTTAGGAAAACTGTATTCGGTCATCTCCTTGACTTGGATCTCGTATTTAACGGGCCATTGATACACAACATCCTACTTAGGGAGATC
GAGGATAGTACACCTGACACCATTAGCTTCAACCTATTTGGTAATAAGGTGTCATTTGGGCGGAGGGAGTTTGACATTATTTCTGGGCTTAAGTATGATAGGAGT
CCAGTTAGAAAAGACACATCTCCCCACAGACTCAGGGCTCTTTACTTTAATGATAGCAACGACGTTCTCTTGAGTGAATTTGAGAAGATTTATTTAGCCGCACGG
TTCGAGGACGACTTCGACGCGATCAAGATATCTATTGTTTACTTAGTAGAGTTAGTTCTGTTGGGGAGAGAGAGGACTCTAAAGTACGACTATACATTGCTGGGA
ATAGTCGATGATTGGGAAACGTGCTGCAACCACGATTGGGGGATGCTGTCCTTTGATAAGACTATATATAGTCTGAAACGCGGCCCGACGAAGAGGTCGAAGGAT
GGTCGGTTCAGGAAATCGTACAGTCTCTACGGTTTCCCTTGGGCGTTCCAGGTGTGGGCCTACGAGACTATATCTTCCTTATCTGGGCGTGTGGCCAATAAGGTA
AATCCGGACGCCGTGCCACAGATTCTTCGGTGGAGGTGCGGCCATTCAACTGCATGGCATGTGCTGGACAGAGAGATTTTCCGATCTAGCACAGGAAGAACTCGA
TCAATAGAAGCAACGGATGCTGAGACGACCTTCTTAGATAGGACGTTCGAACCACCGGAGCCCGAAGATGAGGACGAAGTCCCACGCGAGAATGATGCTGTTGAA
CCATCAAGTGCACGTGTAGGACCAGAGAAGGATGACGAGAGACAAGGAGGGAACGTCGACGAAGGTGTCAGAGAAGACGTCTATGAAGAGGCTGAGGAGGAGGAG
GAGAATGGACGTGGTAAGAAGGTACGCATATCTAGTGTGCGTCTGAAGAAGGTTGAGAAACGGATGAAGTGCATGGACAAGCGCATGGATAACGGGTTCGAAGGT
ATTAAGGCCGAATTAAAATCCATCCGGAAGTTCTTGCGAAGAATCGCTAAGGGTTTACCTGTCGACCCGAATGACATGAGAAGAGGCAGCAGCGGTGATGGTACT
GGGCAAGGAGATGGTCCGAGTGATGGTTCTGGGCCAAGAGATGGTCCGAGTGATGCACCCAGTGGGGGACGTAGTGATCGAGGTGATGGTCCTAAGAATGGCAAT
GGTTCTACACCATCTCCTAGGGACGTGGACGACACAGATGACATGATCATTGATCCCCCCTCATGTGATCGCGCACGAGACAAAGGAACATCAGTCCGGCCAAAC
GACCGGGGACCAGGTTACTCTGCCATGTATCAACAACATTCACCTGCCTTGACGCGCAAGGAAGACATGGGTACAGAGGACGTGTTCAAGGAAGATATCGGTCAA
CATGCGCGTTGTGAGGCTGCCCCTCTCGAGCAAACTCCTGTTCAAAGCAGACAGGTGGACCATATTACGATCGATTCGCATCCGTTAGAGTCATCTATGGACGAC
GAGGACGAATACGCCGAGGACTTCACAGACTCTGATGCGGAAGGGCCGAGAGCGACGTCGCAACTAGACCTGGATGAGGTATGTGTGCTATCGCAGCCCGTTGAA
CGTCAGAACCCTCGGCGGGGGTCTCGGAAGAGGAAGCTCCCATGGAAGCTTCGGGGGTCGTTTAATGTCATGGTGGACAGGAAGAAGGTAATGCGATATGACCCA
CTAGTCCACGTCCCCTCTGAACAAGTTCAGAAGTTTCATGTTTGGCTGGCTAACCCTAAAACTGACCGCGCCACTCGCAAATCATGCTACGGTGATCGAGGAAAG
ACATGGTTCCGTGACCTTATCAACTCGGGCAAGTGGATGACGAGTGAGGTGATCGATTCGTTGTTCATGTTCACGCGGAACAAACTCGAGCAACGGCAGGACTTG
TGTTCTCGAAGATTCACCACCGGTGACATTGTCCTTGCGAACTTTCTTCGACGAACATACGGACTATATCAACGCATGACAGCCCCGAACGCTGTACCTGCGAGA
GTTGCAGCGGAATACGATTGGGCTGGGAAGTATTCTGTTTACCTGTCATACAACATCGGTGGCAACCATTGGATTATGCTACACATCGATCTACAGGAGGGTGAG
ATCATTGTGTGGGATTCGATGAGGTCGATGACACCCTTTCCAGCTCTGGAGTCCGAGTTGAGGCCGATGGCTGTTGTCCTACCGGCGTTGATGCACAGGTCCGGT
GTTCAGGTACTGAGGCCGACACTACCGAATACGCCATGGCCCATTCGTCAAGTAACGTCTGCGCCCCAGCAAAGCGGGTCTGGTGACTGTGGGATGTTTTGCGTT
AAATATTTCGAGTATGATGTAACAGGGTCAAATATGACCGGCTTAACTCAGGACAACATGAGTTTCTTTAGGGAAAAATTGGCCATAGAAATGTGGGCAAACCGA
TCTATATTTTGA
Protein sequence
MEMVPKIPSSSYASANLTCLSHLAKTTAAIKSKLTPPQLRMFRKTVFGHLLDLDLVFNGPLIHNILLREIEDSTPDTISFNLFGNKVSFGRREFDIISGLKYDRS
PVRKDTSPHRLRALYFNDSNDVLLSEFEKIYLAARFEDDFDAIKISIVYLVELVLLGRERTLKYDYTLLGIVDDWETCCNHDWGMLSFDKTIYSLKRGPTKRSKD
GRFRKSYSLYGFPWAFQVWAYETISSLSGRVANKVNPDAVPQILRWRCGHSTAWHVLDREIFRSSTGRTRSIEATDAETTFLDRTFEPPEPEDEDEVPRENDAVE
PSSARVGPEKDDERQGGNVDEGVREDVYEEAEEEEENGRGKKVRISSVRLKKVEKRMKCMDKRMDNGFEGIKAELKSIRKFLRRIAKGLPVDPNDMRRGSSGDGT
GQGDGPSDGSGPRDGPSDAPSGGRSDRGDGPKNGNGSTPSPRDVDDTDDMIIDPPSCDRARDKGTSVRPNDRGPGYSAMYQQHSPALTRKEDMGTEDVFKEDIGQ
HARCEAAPLEQTPVQSRQVDHITIDSHPLESSMDDEDEYAEDFTDSDAEGPRATSQLDLDEVCVLSQPVERQNPRRGSRKRKLPWKLRGSFNVMVDRKKVMRYDP
LVHVPSEQVQKFHVWLANPKTDRATRKSCYGDRGKTWFRDLINSGKWMTSEVIDSLFMFTRNKLEQRQDLCSRRFTTGDIVLANFLRRTYGLYQRMTAPNAVPAR
VAAEYDWAGKYSVYLSYNIGGNHWIMLHIDLQEGEIIVWDSMRSMTPFPALESELRPMAVVLPALMHRSGVQVLRPTLPNTPWPIRQVTSAPQQSGSGDCGMFCV
KYFEYDVTGSNMTGLTQDNMSFFREKLAIEMWANRSIF
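
The CDS and protein sequences above can be checked against each other by translating the CDS. The following is a minimal Python sketch under the assumption that the standard genetic code applies (the page does not state a translation table). `translate` and `CODON_TABLE` are illustrative names, and only the first and last few codons of the CDS are used here to keep the example short.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - 2, 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)

print(translate("ATGGAAATGGTACCT"))        # first five codons -> MEMVP
print(translate("GCAAACCGATCTATATTTTGA"))  # last seven codons -> ANRSIF
```

To check the whole record, join the CDS lines into one string before calling `translate` and compare the result with the joined protein lines.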