; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc07g11310 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc07g11310
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionUlp1-like peptidase
Genome locationchr7:8746233..8749701
RNA-Seq ExpressionMoc07g11310
SyntenyMoc07g11310
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0008234 - cysteine-type peptidase activity (molecular function)
InterPro domainsIPR003653 - Ulp1 protease family, C-terminal catalytic domain
IPR015410 - Domain of unknown function DUF1985
IPR038765 - Papain-like cysteine peptidase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022146372.1 uncharacterized protein LOC111015600 [Momordica charantia]1.7e-7664.94Show/hide
Query:  RVSFGKREFDLITGLSHRMIRVDNDIPGRRLRARYFKDSVRVKCIELEKIFMEAVFDDDDDAIKVGIVYFVELAMMGKERKQFIDTTLLGVVDRWELFGN
        RVSFGKREFDLITGLSH+M RV+N IPGRRLRARYFKDSVRVKC ELEKIF+E +F DD+D +KVGIVYF+ELAMMGKERKQFIDT  +GVVDRWE F N
Subjt:  RVSFGKREFDLITGLSHRMIRVDNDIPGRRLRARYFKDSVRVKCIELEKIFMEAVFDDDDDAIKVGIVYFVELAMMGKERKQFIDTTLLGVVDRWELFGN

Query:  HDWSSLIFERTLWSLKNALKDKLPAYQQKARNDPTHQETYSLYGFPYAFQVWAYETISTLSLRVATRLSDDSIPRLLRWSCTYSRGFLTLQRDMFDNTMS
         DWSS+IF+RT+WSLKN LKDKL AYQQKA  DPTH ETYSLYGFPY                           R+ R     SR    L  ++FDNT S
Subjt:  HDWSSLIFERTLWSLKNALKDKLPAYQQKARNDPTHQETYSLYGFPYAFQVWAYETISTLSLRVATRLSDDSIPRLLRWSCTYSRGFLTLQRDMFDNTMS

Query:  KVKEYLVSTNAEAEHMVRIMRPPEARAIPAPPAVPDPPAVPDPAVVPAPTA
        KVKE+L++T+AE +HMVR++ PPE R IP PPAVPD   VPD AVVP P A
Subjt:  KVKEYLVSTNAEAEHMVRIMRPPEARAIPAPPAVPDPPAVPDPAVVPAPTA

XP_022153201.1 uncharacterized protein LOC111020757 [Momordica charantia]6.7e-13462.76Show/hide
Query:  GIRVSFGKREFDLITGLSHRMIRVDNDIPGRRLRARYFKDSVRVKCIELEKIFMEAVFDDDDDAIKVGIVYFVELAMMGKERKQFIDTTLLGVVDRWELF
        G RVSFGKREFDLITGLSHRM RVDN IPGRRLRARYFKD VRVKC ELEKIF+E VF DD+D +KV IVYF+ELAMMGKERKQFIDT LLGVVDRWE+F
Subjt:  GIRVSFGKREFDLITGLSHRMIRVDNDIPGRRLRARYFKDSVRVKCIELEKIFMEAVFDDDDDAIKVGIVYFVELAMMGKERKQFIDTTLLGVVDRWELF

Query:  GNHDWSSLIFERTLWSLKNALKDKLPAYQQKARNDPTHQETYSLYGFPYAFQVWAYETISTLSLRVATRLSDDSIPRLLRWSCTYSRGFLTLQRDMFDNT
         N+DWSS+IF+RT+WSLKNALKDKL  YQQKA  DP+H ETYSLYGFPYAFQVWAYETIST        LSDD+IPRLLRWSC YS GF  L  ++FDNT
Subjt:  GNHDWSSLIFERTLWSLKNALKDKLPAYQQKARNDPTHQETYSLYGFPYAFQVWAYETISTLSLRVATRLSDDSIPRLLRWSCTYSRGFLTLQRDMFDNT

Query:  MSKVKEYLVSTNAEAEHMVRIMRPPEARAIPAPPAVPDPPAVPDPAVVPAPTAVRNSPADLERGTEERRVKDKGKNIIEDPIEEAETLDDDALQGLALDD
         SKVKE+L++T+A+ +HMVR++ PPE R IP PPAVPD   VPDP   P   AV + PAD+E G             +EDP+ +A           A+D+
Subjt:  MSKVKEYLVSTNAEAEHMVRIMRPPEARAIPAPPAVPDPPAVPDPAVVPAPTAVRNSPADLERGTEERRVKDKGKNIIEDPIEEAETLDDDALQGLALDD

Query:  AGPSGNDSEALQKRSKRKKLKNKISRRLKRLDDRVGAIEATLTGFEATLTGFGVALKGIQRYLKKMSKGKFPDPTKYFGRGGGPDDDDSSDQRPDEAQ--
        A PS ND E L+KR K+ K K +ISRRLKRLD+ VGAI       E  L  FGVALKGIQ YLKK++KGKFPD +KYFG GGGPDDD  SDQRPDE+   
Subjt:  AGPSGNDSEALQKRSKRKKLKNKISRRLKRLDDRVGAIEATLTGFEATLTGFGVALKGIQRYLKKMSKGKFPDPTKYFGRGGGPDDDDSSDQRPDEAQ--

Query:  ---HESMDEDPKNMDDDPMFMVEDQGTTTERDNAS
            +SMDED ++ +D       D+   TE++  S
Subjt:  ---HESMDEDPKNMDDDPMFMVEDQGTTTERDNAS

XP_022154965.1 uncharacterized protein LOC111022110 [Momordica charantia]1.9e-6744.84Show/hide
Query:  MMGKERKQFIDTTLLGVVDRWELFGNHDWSSLIFERTLWSLKNALKDKLPAYQQKARNDPTHQETYSLYGFPYAFQVWAYETISTLSLRVATRLSDDSIP
        MMGKERKQ +DT+LLG+VDRWE+F ++D SS+IFERTLWSLKNALKDK+ AY+QK   D +H ETYSLYGFPYAFQVWAYETISTLS RVA RL+DD+IP
Subjt:  MMGKERKQFIDTTLLGVVDRWELFGNHDWSSLIFERTLWSLKNALKDKLPAYQQKARNDPTHQETYSLYGFPYAFQVWAYETISTLSLRVATRLSDDSIP

Query:  RLLRWSCTYSRGFLTLQRDMFDNTMSKVKEYLVSTNAEAEHMVRIMRPPEARAIPAPPAVPDPPAVPDPAVVPAPTAVRNSPADLERGTEERRVKDKGKN
        RLLRWSCTYSR F  L+R++F+N  SKV   L +T+ E +HM R+M PP A                 P   PAPT +   P       ++  V  +  +
Subjt:  RLLRWSCTYSRGFLTLQRDMFDNTMSKVKEYLVSTNAEAEHMVRIMRPPEARAIPAPPAVPDPPAVPDPAVVPAPTAVRNSPADLERGTEERRVKDKGKN

Query:  IIE--DPIEEAETLDDDALQGLALDDAGPSGNDSEALQKR---SKRKKLKNKISRRLKRLDDRVGAIEATLTGFEATLTGFGVALKGIQRYLKKMSKGKF
        ++E  D  ++A  L D   +    D  G  G   + L ++    K+KK K+K SR L+RL DRV AIE TLTG           +K I++++K+++K   
Subjt:  IIE--DPIEEAETLDDDALQGLALDDAGPSGNDSEALQKR---SKRKKLKNKISRRLKRLDDRVGAIEATLTGFEATLTGFGVALKGIQRYLKKMSKGKF

Query:  PDPTKYFGRGGGPDDDDSSDQRPDEAQHE---SMDEDPK-----------NMDDDPMFMVED---QGTTTERDNASTAYPDRPVGLFQDATVGMQEP
            KY  RGG PD D SS  R    ++E    MDEDPK            MD+DP    E      +  E D+A T      VG  Q+   G   P
Subjt:  PDPTKYFGRGGGPDDDDSSDQRPDEAQHE---SMDEDPK-----------NMDDDPMFMVED---QGTTTERDNASTAYPDRPVGLFQDATVGMQEP

XP_022155476.1 uncharacterized protein LOC111022607 [Momordica charantia]2.3e-9467.55Show/hide
Query:  SMDEDPKNMDDDPMFMVEDQGTTTERD---NASTAYPDRP--------------VGLFQDATVGMQEPDVASDTRPVSRRVRRPYKDWAPDAVIKVEPYL
        S+DEDPK  D+DPM M ED G  T+ D   N       RP              V + QD TVG QEPD   DT+P  RRVRRPYKDWAPDA++KVEPYL
Subjt:  SMDEDPKNMDDDPMFMVEDQGTTTERD---NASTAYPDRP--------------VGLFQDATVGMQEPDVASDTRPVSRRVRRPYKDWAPDAVIKVEPYL

Query:  DPDEYDLQQAPTGRGLRKRHYSWKLKDIYTPTGQRGIIVDRYDLVCPILPQLDDKFQRWMDDPKTDRRLRSTATGFQKKEWYRDLLDPSVELKDEVLDGL
        D DE DLQ APTGRGLRK HYSWKLK IYTPTG+R I VD YD  CPI PQLD +FQ WMDD   D R RSTA G Q KEWYRDLLDP+V+LKDEV+D L
Subjt:  DPDEYDLQQAPTGRGLRKRHYSWKLKDIYTPTGQRGIIVDRYDLVCPILPQLDDKFQRWMDDPKTDRRLRSTATGFQKKEWYRDLLDPSVELKDEVLDGL

Query:  VLFTAKKLEKCLHLCRKKFAIGDVLFSTLLNRTNDPYAAMKPGVLSTRINYPWREENTIWRYVHG
        VLFTAKKLEKC++LCRKKFAIGDVL STLLNRT+ PYAAMKPGVLSTRI YP  +ENTI+RYV G
Subjt:  VLFTAKKLEKCLHLCRKKFAIGDVLFSTLLNRTNDPYAAMKPGVLSTRINYPWREENTIWRYVHG

XP_022157020.1 uncharacterized protein LOC111023847 [Momordica charantia]2.5e-8069.63Show/hide
Query:  GIRVSFGKREFDLITGLSHRMIRVDNDIPGRRLRARYFKDSVRVKCIELEKIFMEAVFDDDDDAIKVGIVYFVELAMMGKERKQFIDTTLLGVVDRWELF
        G RVSFGKREFDLITGL H M RVD D+  RRLR  YF+D   VKC ELEKIF+E  F++D+DA+K+ IVYF+ELAMMGKERK  +DT+LLG+VDRWE+F
Subjt:  GIRVSFGKREFDLITGLSHRMIRVDNDIPGRRLRARYFKDSVRVKCIELEKIFMEAVFDDDDDAIKVGIVYFVELAMMGKERKQFIDTTLLGVVDRWELF

Query:  GNHDWSSLIFERTLWSLKNALKDKLPAYQQKARNDPTHQETYSLYGFPYAFQVWAYETISTLSLRVATRLSDDSIPRLLRWSCTYSRGFLTLQRDMFDNT
         N+DWSS+IFERTLWSLKNALKDK+  Y+QK   D +H ETYSLY FPYAFQVWAYETISTLS RVA RL+DD+IPRLLRWSCTYSR F  L+R++F+N 
Subjt:  GNHDWSSLIFERTLWSLKNALKDKLPAYQQKARNDPTHQETYSLYGFPYAFQVWAYETISTLSLRVATRLSDDSIPRLLRWSCTYSRGFLTLQRDMFDNT

Query:  MSKVKEYLVSTNAE
         SKV   L +T+ E
Subjt:  MSKVKEYLVSTNAE

TrEMBL top hitse value%identityAlignment
A0A6J1CZE8 uncharacterized protein LOC1110156008.1e-7764.94Show/hide
Query:  RVSFGKREFDLITGLSHRMIRVDNDIPGRRLRARYFKDSVRVKCIELEKIFMEAVFDDDDDAIKVGIVYFVELAMMGKERKQFIDTTLLGVVDRWELFGN
        RVSFGKREFDLITGLSH+M RV+N IPGRRLRARYFKDSVRVKC ELEKIF+E +F DD+D +KVGIVYF+ELAMMGKERKQFIDT  +GVVDRWE F N
Subjt:  RVSFGKREFDLITGLSHRMIRVDNDIPGRRLRARYFKDSVRVKCIELEKIFMEAVFDDDDDAIKVGIVYFVELAMMGKERKQFIDTTLLGVVDRWELFGN

Query:  HDWSSLIFERTLWSLKNALKDKLPAYQQKARNDPTHQETYSLYGFPYAFQVWAYETISTLSLRVATRLSDDSIPRLLRWSCTYSRGFLTLQRDMFDNTMS
         DWSS+IF+RT+WSLKN LKDKL AYQQKA  DPTH ETYSLYGFPY                           R+ R     SR    L  ++FDNT S
Subjt:  HDWSSLIFERTLWSLKNALKDKLPAYQQKARNDPTHQETYSLYGFPYAFQVWAYETISTLSLRVATRLSDDSIPRLLRWSCTYSRGFLTLQRDMFDNTMS

Query:  KVKEYLVSTNAEAEHMVRIMRPPEARAIPAPPAVPDPPAVPDPAVVPAPTA
        KVKE+L++T+AE +HMVR++ PPE R IP PPAVPD   VPD AVVP P A
Subjt:  KVKEYLVSTNAEAEHMVRIMRPPEARAIPAPPAVPDPPAVPDPAVVPAPTA

A0A6J1DJX9 uncharacterized protein LOC1110207573.2e-13462.76Show/hide
Query:  GIRVSFGKREFDLITGLSHRMIRVDNDIPGRRLRARYFKDSVRVKCIELEKIFMEAVFDDDDDAIKVGIVYFVELAMMGKERKQFIDTTLLGVVDRWELF
        G RVSFGKREFDLITGLSHRM RVDN IPGRRLRARYFKD VRVKC ELEKIF+E VF DD+D +KV IVYF+ELAMMGKERKQFIDT LLGVVDRWE+F
Subjt:  GIRVSFGKREFDLITGLSHRMIRVDNDIPGRRLRARYFKDSVRVKCIELEKIFMEAVFDDDDDAIKVGIVYFVELAMMGKERKQFIDTTLLGVVDRWELF

Query:  GNHDWSSLIFERTLWSLKNALKDKLPAYQQKARNDPTHQETYSLYGFPYAFQVWAYETISTLSLRVATRLSDDSIPRLLRWSCTYSRGFLTLQRDMFDNT
         N+DWSS+IF+RT+WSLKNALKDKL  YQQKA  DP+H ETYSLYGFPYAFQVWAYETIST        LSDD+IPRLLRWSC YS GF  L  ++FDNT
Subjt:  GNHDWSSLIFERTLWSLKNALKDKLPAYQQKARNDPTHQETYSLYGFPYAFQVWAYETISTLSLRVATRLSDDSIPRLLRWSCTYSRGFLTLQRDMFDNT

Query:  MSKVKEYLVSTNAEAEHMVRIMRPPEARAIPAPPAVPDPPAVPDPAVVPAPTAVRNSPADLERGTEERRVKDKGKNIIEDPIEEAETLDDDALQGLALDD
         SKVKE+L++T+A+ +HMVR++ PPE R IP PPAVPD   VPDP   P   AV + PAD+E G             +EDP+ +A           A+D+
Subjt:  MSKVKEYLVSTNAEAEHMVRIMRPPEARAIPAPPAVPDPPAVPDPAVVPAPTAVRNSPADLERGTEERRVKDKGKNIIEDPIEEAETLDDDALQGLALDD

Query:  AGPSGNDSEALQKRSKRKKLKNKISRRLKRLDDRVGAIEATLTGFEATLTGFGVALKGIQRYLKKMSKGKFPDPTKYFGRGGGPDDDDSSDQRPDEAQ--
        A PS ND E L+KR K+ K K +ISRRLKRLD+ VGAI       E  L  FGVALKGIQ YLKK++KGKFPD +KYFG GGGPDDD  SDQRPDE+   
Subjt:  AGPSGNDSEALQKRSKRKKLKNKISRRLKRLDDRVGAIEATLTGFEATLTGFGVALKGIQRYLKKMSKGKFPDPTKYFGRGGGPDDDDSSDQRPDEAQ--

Query:  ---HESMDEDPKNMDDDPMFMVEDQGTTTERDNAS
            +SMDED ++ +D       D+   TE++  S
Subjt:  ---HESMDEDPKNMDDDPMFMVEDQGTTTERDNAS

A0A6J1DL40 uncharacterized protein LOC1110221109.0e-6844.84Show/hide
Query:  MMGKERKQFIDTTLLGVVDRWELFGNHDWSSLIFERTLWSLKNALKDKLPAYQQKARNDPTHQETYSLYGFPYAFQVWAYETISTLSLRVATRLSDDSIP
        MMGKERKQ +DT+LLG+VDRWE+F ++D SS+IFERTLWSLKNALKDK+ AY+QK   D +H ETYSLYGFPYAFQVWAYETISTLS RVA RL+DD+IP
Subjt:  MMGKERKQFIDTTLLGVVDRWELFGNHDWSSLIFERTLWSLKNALKDKLPAYQQKARNDPTHQETYSLYGFPYAFQVWAYETISTLSLRVATRLSDDSIP

Query:  RLLRWSCTYSRGFLTLQRDMFDNTMSKVKEYLVSTNAEAEHMVRIMRPPEARAIPAPPAVPDPPAVPDPAVVPAPTAVRNSPADLERGTEERRVKDKGKN
        RLLRWSCTYSR F  L+R++F+N  SKV   L +T+ E +HM R+M PP A                 P   PAPT +   P       ++  V  +  +
Subjt:  RLLRWSCTYSRGFLTLQRDMFDNTMSKVKEYLVSTNAEAEHMVRIMRPPEARAIPAPPAVPDPPAVPDPAVVPAPTAVRNSPADLERGTEERRVKDKGKN

Query:  IIE--DPIEEAETLDDDALQGLALDDAGPSGNDSEALQKR---SKRKKLKNKISRRLKRLDDRVGAIEATLTGFEATLTGFGVALKGIQRYLKKMSKGKF
        ++E  D  ++A  L D   +    D  G  G   + L ++    K+KK K+K SR L+RL DRV AIE TLTG           +K I++++K+++K   
Subjt:  IIE--DPIEEAETLDDDALQGLALDDAGPSGNDSEALQKR---SKRKKLKNKISRRLKRLDDRVGAIEATLTGFEATLTGFGVALKGIQRYLKKMSKGKF

Query:  PDPTKYFGRGGGPDDDDSSDQRPDEAQHE---SMDEDPK-----------NMDDDPMFMVED---QGTTTERDNASTAYPDRPVGLFQDATVGMQEP
            KY  RGG PD D SS  R    ++E    MDEDPK            MD+DP    E      +  E D+A T      VG  Q+   G   P
Subjt:  PDPTKYFGRGGGPDDDDSSDQRPDEAQHE---SMDEDPK-----------NMDDDPMFMVED---QGTTTERDNASTAYPDRPVGLFQDATVGMQEP

A0A6J1DRS0 uncharacterized protein LOC1110226071.1e-9467.55Show/hide
Query:  SMDEDPKNMDDDPMFMVEDQGTTTERD---NASTAYPDRP--------------VGLFQDATVGMQEPDVASDTRPVSRRVRRPYKDWAPDAVIKVEPYL
        S+DEDPK  D+DPM M ED G  T+ D   N       RP              V + QD TVG QEPD   DT+P  RRVRRPYKDWAPDA++KVEPYL
Subjt:  SMDEDPKNMDDDPMFMVEDQGTTTERD---NASTAYPDRP--------------VGLFQDATVGMQEPDVASDTRPVSRRVRRPYKDWAPDAVIKVEPYL

Query:  DPDEYDLQQAPTGRGLRKRHYSWKLKDIYTPTGQRGIIVDRYDLVCPILPQLDDKFQRWMDDPKTDRRLRSTATGFQKKEWYRDLLDPSVELKDEVLDGL
        D DE DLQ APTGRGLRK HYSWKLK IYTPTG+R I VD YD  CPI PQLD +FQ WMDD   D R RSTA G Q KEWYRDLLDP+V+LKDEV+D L
Subjt:  DPDEYDLQQAPTGRGLRKRHYSWKLKDIYTPTGQRGIIVDRYDLVCPILPQLDDKFQRWMDDPKTDRRLRSTATGFQKKEWYRDLLDPSVELKDEVLDGL

Query:  VLFTAKKLEKCLHLCRKKFAIGDVLFSTLLNRTNDPYAAMKPGVLSTRINYPWREENTIWRYVHG
        VLFTAKKLEKC++LCRKKFAIGDVL STLLNRT+ PYAAMKPGVLSTRI YP  +ENTI+RYV G
Subjt:  VLFTAKKLEKCLHLCRKKFAIGDVLFSTLLNRTNDPYAAMKPGVLSTRINYPWREENTIWRYVHG

A0A6J1DRZ7 uncharacterized protein LOC1110238471.2e-8069.63Show/hide
Query:  GIRVSFGKREFDLITGLSHRMIRVDNDIPGRRLRARYFKDSVRVKCIELEKIFMEAVFDDDDDAIKVGIVYFVELAMMGKERKQFIDTTLLGVVDRWELF
        G RVSFGKREFDLITGL H M RVD D+  RRLR  YF+D   VKC ELEKIF+E  F++D+DA+K+ IVYF+ELAMMGKERK  +DT+LLG+VDRWE+F
Subjt:  GIRVSFGKREFDLITGLSHRMIRVDNDIPGRRLRARYFKDSVRVKCIELEKIFMEAVFDDDDDAIKVGIVYFVELAMMGKERKQFIDTTLLGVVDRWELF

Query:  GNHDWSSLIFERTLWSLKNALKDKLPAYQQKARNDPTHQETYSLYGFPYAFQVWAYETISTLSLRVATRLSDDSIPRLLRWSCTYSRGFLTLQRDMFDNT
         N+DWSS+IFERTLWSLKNALKDK+  Y+QK   D +H ETYSLY FPYAFQVWAYETISTLS RVA RL+DD+IPRLLRWSCTYSR F  L+R++F+N 
Subjt:  GNHDWSSLIFERTLWSLKNALKDKLPAYQQKARNDPTHQETYSLYGFPYAFQVWAYETISTLSLRVATRLSDDSIPRLLRWSCTYSRGFLTLQRDMFDNT

Query:  MSKVKEYLVSTNAE
         SKV   L +T+ E
Subjt:  MSKVKEYLVSTNAE

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAAGGCATAAAATCATTTTCTGCCTTTCATCCCGAAGGCGAAATGGTCCGGGATAAGGTCTGGGATGGAAGGCTTTCCATGCCTAATGCATTCTGCCTTCCATCCCA
GACCTCATCCCGGACCGTTTCGTTCCGGGATGAAAGGCAGAGAATATGGTTTCATTTTCTGCCTTTCATCCCGGAGGCGAAATGGTCCGGGATAAGGGTCTCCTTTGGTA
AGCGGGAGTTTGACCTAATCACCGGCCTCAGTCATAGGATGATTAGGGTAGATAACGATATTCCTGGCCGACGACTTCGAGCACGTTACTTTAAGGATAGTGTCAGGGTT
AAGTGTATTGAGTTAGAGAAGATTTTTATGGAGGCAGTTTTCGACGATGATGACGATGCTATCAAGGTTGGCATAGTTTACTTCGTCGAGCTTGCCATGATGGGGAAGGA
GAGGAAGCAGTTTATAGATACGACCCTTTTAGGGGTTGTGGATAGGTGGGAGCTGTTCGGCAATCACGACTGGAGTTCGTTGATTTTCGAAAGAACACTTTGGAGCCTGA
AGAATGCCCTGAAGGATAAACTACCGGCGTACCAACAGAAGGCCAGAAATGACCCCACACACCAAGAGACTTATAGTCTCTACGGGTTTCCGTACGCATTTCAGGTATGG
GCTTACGAGACGATATCGACGTTGAGTCTGCGCGTAGCCACGAGGCTGAGCGACGACTCCATTCCTCGACTCCTTAGGTGGTCGTGCACTTATTCTCGTGGGTTTCTTAC
TCTACAGAGAGACATGTTCGATAACACGATGTCCAAGGTTAAGGAATACTTGGTTTCGACGAATGCTGAGGCAGAACACATGGTCCGTATCATGCGTCCACCGGAAGCCC
GCGCTATACCTGCCCCGCCGGCTGTACCTGACCCGCCTGCAGTACCTGACCCGGCTGTTGTACCTGCCCCGACTGCAGTACGTAACTCGCCTGCAGATTTGGAAAGGGGT
ACTGAGGAAAGAAGGGTGAAGGACAAAGGAAAGAACATCATAGAGGATCCGATAGAAGAGGCCGAGACATTGGACGATGATGCATTACAGGGTCTTGCATTAGACGATGC
TGGACCTAGTGGAAATGATAGCGAAGCCCTACAGAAGAGGTCGAAACGGAAAAAATTAAAAAATAAGATCAGTAGACGGTTGAAGAGGCTCGATGACCGAGTTGGTGCTA
TCGAGGCCACACTGACTGGCTTCGAGGCCACACTGACTGGCTTCGGGGTCGCCCTGAAAGGTATCCAGAGATACCTTAAGAAAATGTCGAAGGGTAAATTCCCTGATCCG
ACCAAATATTTTGGACGTGGGGGTGGGCCCGATGATGATGATTCATCGGATCAAAGGCCTGATGAGGCCCAACACGAGAGTATGGACGAGGATCCGAAGAATATGGACGA
CGATCCGATGTTTATGGTTGAAGACCAGGGTACGACAACGGAGCGGGACAATGCATCGACTGCTTACCCCGATCGTCCTGTCGGTTTGTTTCAGGATGCCACTGTTGGAA
TGCAAGAGCCGGACGTTGCATCAGATACGCGACCCGTCAGCCGACGCGTTAGGCGTCCCTATAAGGACTGGGCACCAGACGCAGTCATTAAGGTTGAACCTTACCTTGAC
CCGGACGAATATGACCTTCAGCAGGCCCCAACTGGGCGTGGGCTACGCAAGAGGCATTACTCGTGGAAGCTTAAGGATATATACACACCAACCGGTCAGCGTGGGATCAT
CGTGGATAGATACGACCTAGTATGTCCCATTCTGCCGCAGTTGGACGATAAGTTCCAGAGATGGATGGATGACCCGAAGACGGACAGGAGATTGCGGTCCACTGCAACTG
GTTTCCAAAAGAAGGAATGGTATCGCGATCTATTGGACCCTAGTGTTGAATTGAAGGACGAAGTACTTGATGGTCTCGTCCTGTTTACAGCGAAAAAGTTGGAGAAGTGT
CTCCATCTATGTCGCAAGAAGTTTGCGATAGGCGACGTACTTTTTTCGACTCTGCTGAATCGGACAAACGATCCATATGCGGCCATGAAGCCTGGTGTATTGTCCACTAG
GATCAACTACCCCTGGCGCGAGGAGAATACAATCTGGCGATATGTCCATGGCGGGAACCATTGGGTGATGCTCGGCATCGACCTTGTACAGGGCGACATAACCGTATGGG
ATTCACTCCAAACGGTCACTCCACTTGATGAACTTGAGAAGGAGTTGAAGCCCATGTGTACAATCCTACCTGCGCTGCTGCATCATGGCGGGATATTTTCAGTTCGCCCC
GACTTGCCAGTGGTGCCGTGGAGGGTGCGTCGGGTTCGCGTACCACAGTAG
mRNA sequenceShow/hide mRNA sequence
ATGAAAGGCATAAAATCATTTTCTGCCTTTCATCCCGAAGGCGAAATGGTCCGGGATAAGGTCTGGGATGGAAGGCTTTCCATGCCTAATGCATTCTGCCTTCCATCCCA
GACCTCATCCCGGACCGTTTCGTTCCGGGATGAAAGGCAGAGAATATGGTTTCATTTTCTGCCTTTCATCCCGGAGGCGAAATGGTCCGGGATAAGGGTCTCCTTTGGTA
AGCGGGAGTTTGACCTAATCACCGGCCTCAGTCATAGGATGATTAGGGTAGATAACGATATTCCTGGCCGACGACTTCGAGCACGTTACTTTAAGGATAGTGTCAGGGTT
AAGTGTATTGAGTTAGAGAAGATTTTTATGGAGGCAGTTTTCGACGATGATGACGATGCTATCAAGGTTGGCATAGTTTACTTCGTCGAGCTTGCCATGATGGGGAAGGA
GAGGAAGCAGTTTATAGATACGACCCTTTTAGGGGTTGTGGATAGGTGGGAGCTGTTCGGCAATCACGACTGGAGTTCGTTGATTTTCGAAAGAACACTTTGGAGCCTGA
AGAATGCCCTGAAGGATAAACTACCGGCGTACCAACAGAAGGCCAGAAATGACCCCACACACCAAGAGACTTATAGTCTCTACGGGTTTCCGTACGCATTTCAGGTATGG
GCTTACGAGACGATATCGACGTTGAGTCTGCGCGTAGCCACGAGGCTGAGCGACGACTCCATTCCTCGACTCCTTAGGTGGTCGTGCACTTATTCTCGTGGGTTTCTTAC
TCTACAGAGAGACATGTTCGATAACACGATGTCCAAGGTTAAGGAATACTTGGTTTCGACGAATGCTGAGGCAGAACACATGGTCCGTATCATGCGTCCACCGGAAGCCC
GCGCTATACCTGCCCCGCCGGCTGTACCTGACCCGCCTGCAGTACCTGACCCGGCTGTTGTACCTGCCCCGACTGCAGTACGTAACTCGCCTGCAGATTTGGAAAGGGGT
ACTGAGGAAAGAAGGGTGAAGGACAAAGGAAAGAACATCATAGAGGATCCGATAGAAGAGGCCGAGACATTGGACGATGATGCATTACAGGGTCTTGCATTAGACGATGC
TGGACCTAGTGGAAATGATAGCGAAGCCCTACAGAAGAGGTCGAAACGGAAAAAATTAAAAAATAAGATCAGTAGACGGTTGAAGAGGCTCGATGACCGAGTTGGTGCTA
TCGAGGCCACACTGACTGGCTTCGAGGCCACACTGACTGGCTTCGGGGTCGCCCTGAAAGGTATCCAGAGATACCTTAAGAAAATGTCGAAGGGTAAATTCCCTGATCCG
ACCAAATATTTTGGACGTGGGGGTGGGCCCGATGATGATGATTCATCGGATCAAAGGCCTGATGAGGCCCAACACGAGAGTATGGACGAGGATCCGAAGAATATGGACGA
CGATCCGATGTTTATGGTTGAAGACCAGGGTACGACAACGGAGCGGGACAATGCATCGACTGCTTACCCCGATCGTCCTGTCGGTTTGTTTCAGGATGCCACTGTTGGAA
TGCAAGAGCCGGACGTTGCATCAGATACGCGACCCGTCAGCCGACGCGTTAGGCGTCCCTATAAGGACTGGGCACCAGACGCAGTCATTAAGGTTGAACCTTACCTTGAC
CCGGACGAATATGACCTTCAGCAGGCCCCAACTGGGCGTGGGCTACGCAAGAGGCATTACTCGTGGAAGCTTAAGGATATATACACACCAACCGGTCAGCGTGGGATCAT
CGTGGATAGATACGACCTAGTATGTCCCATTCTGCCGCAGTTGGACGATAAGTTCCAGAGATGGATGGATGACCCGAAGACGGACAGGAGATTGCGGTCCACTGCAACTG
GTTTCCAAAAGAAGGAATGGTATCGCGATCTATTGGACCCTAGTGTTGAATTGAAGGACGAAGTACTTGATGGTCTCGTCCTGTTTACAGCGAAAAAGTTGGAGAAGTGT
CTCCATCTATGTCGCAAGAAGTTTGCGATAGGCGACGTACTTTTTTCGACTCTGCTGAATCGGACAAACGATCCATATGCGGCCATGAAGCCTGGTGTATTGTCCACTAG
GATCAACTACCCCTGGCGCGAGGAGAATACAATCTGGCGATATGTCCATGGCGGGAACCATTGGGTGATGCTCGGCATCGACCTTGTACAGGGCGACATAACCGTATGGG
ATTCACTCCAAACGGTCACTCCACTTGATGAACTTGAGAAGGAGTTGAAGCCCATGTGTACAATCCTACCTGCGCTGCTGCATCATGGCGGGATATTTTCAGTTCGCCCC
GACTTGCCAGTGGTGCCGTGGAGGGTGCGTCGGGTTCGCGTACCACAGTAG
Protein sequenceShow/hide protein sequence
MKGIKSFSAFHPEGEMVRDKVWDGRLSMPNAFCLPSQTSSRTVSFRDERQRIWFHFLPFIPEAKWSGIRVSFGKREFDLITGLSHRMIRVDNDIPGRRLRARYFKDSVRV
KCIELEKIFMEAVFDDDDDAIKVGIVYFVELAMMGKERKQFIDTTLLGVVDRWELFGNHDWSSLIFERTLWSLKNALKDKLPAYQQKARNDPTHQETYSLYGFPYAFQVW
AYETISTLSLRVATRLSDDSIPRLLRWSCTYSRGFLTLQRDMFDNTMSKVKEYLVSTNAEAEHMVRIMRPPEARAIPAPPAVPDPPAVPDPAVVPAPTAVRNSPADLERG
TEERRVKDKGKNIIEDPIEEAETLDDDALQGLALDDAGPSGNDSEALQKRSKRKKLKNKISRRLKRLDDRVGAIEATLTGFEATLTGFGVALKGIQRYLKKMSKGKFPDP
TKYFGRGGGPDDDDSSDQRPDEAQHESMDEDPKNMDDDPMFMVEDQGTTTERDNASTAYPDRPVGLFQDATVGMQEPDVASDTRPVSRRVRRPYKDWAPDAVIKVEPYLD
PDEYDLQQAPTGRGLRKRHYSWKLKDIYTPTGQRGIIVDRYDLVCPILPQLDDKFQRWMDDPKTDRRLRSTATGFQKKEWYRDLLDPSVELKDEVLDGLVLFTAKKLEKC
LHLCRKKFAIGDVLFSTLLNRTNDPYAAMKPGVLSTRINYPWREENTIWRYVHGGNHWVMLGIDLVQGDITVWDSLQTVTPLDELEKELKPMCTILPALLHHGGIFSVRP
DLPVVPWRVRRVRVPQ