; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc08g17210 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc08g17210
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionUlp1-like peptidase
Genome locationchr8:13120237..13123728
RNA-Seq ExpressionMoc08g17210
SyntenyMoc08g17210
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0008234 - cysteine-type peptidase activity (molecular function)
InterPro domainsIPR003653 - Ulp1 protease family, C-terminal catalytic domain
IPR015410 - Domain of unknown function DUF1985
IPR038765 - Papain-like cysteine peptidase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022138037.1 uncharacterized protein LOC111009294 [Momordica charantia]7.9e-8491.57Show/hide
Query:  MDLRLILNCDDWFPTTLINLAHVDKTTSRLKGRLTLTQLDMFRQTCFSPILDMDVVFNDPLIHHLLLREVEEPRQDIISFDLFGKRVSFGKREFDLIISL
        MDLRLILN DDWF  TL NLAHVDKTTSRLKGRLT TQLDMFRQTCF PILDMDVVFN PLIHHLLLR VEEPRQDIISFDLFGKRVSFGKREFDLI  L
Subjt:  MDLRLILNCDDWFPTTLINLAHVDKTTSRLKGRLTLTQLDMFRQTCFSPILDMDVVFNDPLIHHLLLREVEEPRQDIISFDLFGKRVSFGKREFDLIISL

Query:  CHRMIRVDNDIPGRRLRACYFKDSVRVKCSELEKIFMEAVFDDDEDAVKVGIVYFVELAMMGKERKQLIGTTLLGVVD
         HRMIRVDNDIPGRRLRA YFKDSVRVKCSELEKIFMEAVFDDDEDAVKVGIVYFVEL+MMGKERKQ I TTLLGVVD
Subjt:  CHRMIRVDNDIPGRRLRACYFKDSVRVKCSELEKIFMEAVFDDDEDAVKVGIVYFVELAMMGKERKQLIGTTLLGVVD

XP_022146372.1 uncharacterized protein LOC111015600 [Momordica charantia]2.4e-11773.93Show/hide
Query:  MDLRLILNCDDWFPTTLINLAHVDKTTSRLKGRLTLTQLDMFRQTCFSPILDMDVVFNDPLIHHLLLREVEEPRQDIISFDLFGKRVSFGKREFDLIISL
        MDLRLIL+ +DWFP TL NLAHVDKTT+R+K RLT TQLDMFRQTCF PILDM VVFN PLIHHLLL EVEEPRQD+ISFDLF KRVSFGKREFDLI  L
Subjt:  MDLRLILNCDDWFPTTLINLAHVDKTTSRLKGRLTLTQLDMFRQTCFSPILDMDVVFNDPLIHHLLLREVEEPRQDIISFDLFGKRVSFGKREFDLIISL

Query:  CHRMIRVDNDIPGRRLRACYFKDSVRVKCSELEKIFMEAVFDDDEDAVKVGIVYFVELAMMGKERKQLIGTTLLGVVDRWELFCNHDWSSLIFERTLWSL
         H+M RV+N IPGRRLRA YFKDSVRVKCSELEKIF+E +F DDED VKVGIVYF+ELAMMGKERKQ I T  +GVVDRWE FCN DWSS+IF+RT+WSL
Subjt:  CHRMIRVDNDIPGRRLRACYFKDSVRVKCSELEKIFMEAVFDDDEDAVKVGIVYFVELAMMGKERKQLIGTTLLGVVDRWELFCNHDWSSLIFERTLWSL

Query:  KNALKDKLPAYQQKARNDPTHQETYSLYGFHTHFRYGLRDDIDRDVFDNTMSKVKEYLVSTNAEAEHMVRIMRPPEARTIPAPPAVLDPPAVPDPAVVTA
        KN LKDKL AYQQKA  DPTH ETYSLYGF  + R      +  +VFDNT SKVKE+L++T+AE +HMVR++ PPE R IP PPAV D   VPD AVV  
Subjt:  KNALKDKLPAYQQKARNDPTHQETYSLYGFHTHFRYGLRDDIDRDVFDNTMSKVKEYLVSTNAEAEHMVRIMRPPEARTIPAPPAVLDPPAVPDPAVVTA

Query:  PAA
        P A
Subjt:  PAA

XP_022153201.1 uncharacterized protein LOC111020757 [Momordica charantia]1.2e-13757.11Show/hide
Query:  MDLRLILNCDDWFPTTLINLAHVDKTTSRLKGRLTLTQLDMFRQTCFSPILDMDVVFNDPLIHHLLLREVEEPRQDIISFDLFGKRVSFGKREFDLIISL
        MDLRLI++ +DWFP TL NLAH+DKT++R+K RLT TQLDMFRQTCF PILD+DVVFN PLIHHLLLREVEEPRQD+ISFDLFGKRVSFGKREFDLI  L
Subjt:  MDLRLILNCDDWFPTTLINLAHVDKTTSRLKGRLTLTQLDMFRQTCFSPILDMDVVFNDPLIHHLLLREVEEPRQDIISFDLFGKRVSFGKREFDLIISL

Query:  CHRMIRVDNDIPGRRLRACYFKDSVRVKCSELEKIFMEAVFDDDEDAVKVGIVYFVELAMMGKERKQLIGTTLLGVVDRWELFCNHDWSSLIFERTLWSL
         HRM RVDN IPGRRLRA YFKD VRVKCSELEKIF+E VF DDED VKV IVYF+ELAMMGKERKQ I T LLGVVDRWE+FCN+DWSS+IF+RT+WSL
Subjt:  CHRMIRVDNDIPGRRLRACYFKDSVRVKCSELEKIFMEAVFDDDEDAVKVGIVYFVELAMMGKERKQLIGTTLLGVVDRWELFCNHDWSSLIFERTLWSL

Query:  KNALKDKLPAYQQKARNDPTHQETYSLYGFHTHFRY-------GLRDD------------------IDRDVFDNTMSKVKEYLVSTNAEAEHMVRIMRPP
        KNALKDKL  YQQKA  DP+H ETYSLYGF   F+         L DD                  +  +VFDNT SKVKE+L++T+A+ +HMVR++ PP
Subjt:  KNALKDKLPAYQQKARNDPTHQETYSLYGFHTHFRY-------GLRDD------------------IDRDVFDNTMSKVKEYLVSTNAEAEHMVRIMRPP

Query:  EARTIPAPPAVLDPPAVPDPAVVTAPAAVRNPPADLERGTEERRVKDKGKNIIEDPVEEAETLDDDALQGPALDDVGPSGNGSEALQKSWVR--------
        E R IP PPAV D   VPDP      AAV +PPAD+E G             +EDPV +A           A+D+  PS N  E L+K   +        
Subjt:  EARTIPAPPAVLDPPAVPDPAVVTAPAAVRNPPADLERGTEERRVKDKGKNIIEDPVEEAETLDDDALQGPALDDVGPSGNGSEALQKSWVR--------

Query:  ------DNAHWSV-------------------LFVQGKFSDPTKYFGRGGGPDDDDPSDQRPDEA--PTRGPKSMDEDRRLEEVPKTDEYRTMDDNPKS
              DN   ++                      +GKF D +KYFG GGGPDDD PSDQRPDE+  P  G KSMDED+R +E  +TDE    +  P S
Subjt:  ------DNAHWSV-------------------LFVQGKFSDPTKYFGRGGGPDDDDPSDQRPDEA--PTRGPKSMDEDRRLEEVPKTDEYRTMDDNPKS

XP_022155158.1 uncharacterized protein LOC111022300 [Momordica charantia]5.3e-10490.57Show/hide
Query:  MDLRLILNCDDWFPTTLINLAHVDKTTSRLKGRLTLTQLDMFRQTCFSPILDMDVVFNDPLIHHLLLREVEEPRQDIISFDLFGKRVSFGKREFDLIISL
        MDLRLILN DDWFP TL NLAH DKTTSRLKGRLT TQ+DMFRQTCF PILDMDVVFN PLIHHLLLREVEEPRQDIISFDLFGKRVSFGKREFDLI  L
Subjt:  MDLRLILNCDDWFPTTLINLAHVDKTTSRLKGRLTLTQLDMFRQTCFSPILDMDVVFNDPLIHHLLLREVEEPRQDIISFDLFGKRVSFGKREFDLIISL

Query:  CHRMIRVDNDIPGRRLRACYFKDSVRVKCSELEKIFMEAVFDDDEDAVKVGIVYFVELAMMGKERKQLIGTTLLGVVDRWELFCNHDWSSLIFERTLWSL
         +RMIRVDNDIPGRRLRA YFKDSVRVKCSELEKIFME VF DDEDAVKVGIVYFVELAMMGKERKQ I  TLLGVVDRWELFCNHDWSSLIFERTLWSL
Subjt:  CHRMIRVDNDIPGRRLRACYFKDSVRVKCSELEKIFMEAVFDDDEDAVKVGIVYFVELAMMGKERKQLIGTTLLGVVDRWELFCNHDWSSLIFERTLWSL

Query:  KNALKDKLPAYQ
        KNA+ DKLPAYQ
Subjt:  KNALKDKLPAYQ

XP_022157020.1 uncharacterized protein LOC111023847 [Momordica charantia]7.4e-9059.26Show/hide
Query:  MDLRLILNCDDWFPTTLINLAHVDKTTSRLKGRLTLTQLDMFRQTCFSPILDMDVVFNDPLIHHLLLREVEEPRQDIISFDLFGKRVSFGKREFDLIISL
        M++ L +N DDWFP  L NLAHV KT+SRLK RLT +QLDMF QTCF PIL M+VVFN PL+HHLLLREVEEP+ D+ISF+LFG RVSFGKREFDLI  L
Subjt:  MDLRLILNCDDWFPTTLINLAHVDKTTSRLKGRLTLTQLDMFRQTCFSPILDMDVVFNDPLIHHLLLREVEEPRQDIISFDLFGKRVSFGKREFDLIISL

Query:  CHRMIRVDNDIPGRRLRACYFKDSVRVKCSELEKIFMEAVFDDDEDAVKVGIVYFVELAMMGKERKQLIGTTLLGVVDRWELFCNHDWSSLIFERTLWSL
         H M RVD D+  RRLR  YF+D   VKCSELEKIF+E  F++DEDAVK+ IVYF+ELAMMGKERK  + T+LLG+VDRWE+FCN+DWSS+IFERTLWSL
Subjt:  CHRMIRVDNDIPGRRLRACYFKDSVRVKCSELEKIFMEAVFDDDEDAVKVGIVYFVELAMMGKERKQLIGTTLLGVVDRWELFCNHDWSSLIFERTLWSL

Query:  KNALKDKLPAYQQKARNDPTHQETYSLYGF---------------HTHFRYGLRDD------------------IDRDVFDNTMSKVKEYLVSTNAE
        KNALKDK+  Y+QK   D +H ETYSLY F                T     L DD                  ++R+VF+N  SKV   L +T+ E
Subjt:  KNALKDKLPAYQQKARNDPTHQETYSLYGF---------------HTHFRYGLRDD------------------IDRDVFDNTMSKVKEYLVSTNAE

TrEMBL top hitse value%identityAlignment
A0A6J1C9X4 uncharacterized protein LOC1110092943.8e-8491.57Show/hide
Query:  MDLRLILNCDDWFPTTLINLAHVDKTTSRLKGRLTLTQLDMFRQTCFSPILDMDVVFNDPLIHHLLLREVEEPRQDIISFDLFGKRVSFGKREFDLIISL
        MDLRLILN DDWF  TL NLAHVDKTTSRLKGRLT TQLDMFRQTCF PILDMDVVFN PLIHHLLLR VEEPRQDIISFDLFGKRVSFGKREFDLI  L
Subjt:  MDLRLILNCDDWFPTTLINLAHVDKTTSRLKGRLTLTQLDMFRQTCFSPILDMDVVFNDPLIHHLLLREVEEPRQDIISFDLFGKRVSFGKREFDLIISL

Query:  CHRMIRVDNDIPGRRLRACYFKDSVRVKCSELEKIFMEAVFDDDEDAVKVGIVYFVELAMMGKERKQLIGTTLLGVVD
         HRMIRVDNDIPGRRLRA YFKDSVRVKCSELEKIFMEAVFDDDEDAVKVGIVYFVEL+MMGKERKQ I TTLLGVVD
Subjt:  CHRMIRVDNDIPGRRLRACYFKDSVRVKCSELEKIFMEAVFDDDEDAVKVGIVYFVELAMMGKERKQLIGTTLLGVVD

A0A6J1CZE8 uncharacterized protein LOC1110156001.2e-11773.93Show/hide
Query:  MDLRLILNCDDWFPTTLINLAHVDKTTSRLKGRLTLTQLDMFRQTCFSPILDMDVVFNDPLIHHLLLREVEEPRQDIISFDLFGKRVSFGKREFDLIISL
        MDLRLIL+ +DWFP TL NLAHVDKTT+R+K RLT TQLDMFRQTCF PILDM VVFN PLIHHLLL EVEEPRQD+ISFDLF KRVSFGKREFDLI  L
Subjt:  MDLRLILNCDDWFPTTLINLAHVDKTTSRLKGRLTLTQLDMFRQTCFSPILDMDVVFNDPLIHHLLLREVEEPRQDIISFDLFGKRVSFGKREFDLIISL

Query:  CHRMIRVDNDIPGRRLRACYFKDSVRVKCSELEKIFMEAVFDDDEDAVKVGIVYFVELAMMGKERKQLIGTTLLGVVDRWELFCNHDWSSLIFERTLWSL
         H+M RV+N IPGRRLRA YFKDSVRVKCSELEKIF+E +F DDED VKVGIVYF+ELAMMGKERKQ I T  +GVVDRWE FCN DWSS+IF+RT+WSL
Subjt:  CHRMIRVDNDIPGRRLRACYFKDSVRVKCSELEKIFMEAVFDDDEDAVKVGIVYFVELAMMGKERKQLIGTTLLGVVDRWELFCNHDWSSLIFERTLWSL

Query:  KNALKDKLPAYQQKARNDPTHQETYSLYGFHTHFRYGLRDDIDRDVFDNTMSKVKEYLVSTNAEAEHMVRIMRPPEARTIPAPPAVLDPPAVPDPAVVTA
        KN LKDKL AYQQKA  DPTH ETYSLYGF  + R      +  +VFDNT SKVKE+L++T+AE +HMVR++ PPE R IP PPAV D   VPD AVV  
Subjt:  KNALKDKLPAYQQKARNDPTHQETYSLYGFHTHFRYGLRDDIDRDVFDNTMSKVKEYLVSTNAEAEHMVRIMRPPEARTIPAPPAVLDPPAVPDPAVVTA

Query:  PAA
        P A
Subjt:  PAA

A0A6J1DJX9 uncharacterized protein LOC1110207576.0e-13857.11Show/hide
Query:  MDLRLILNCDDWFPTTLINLAHVDKTTSRLKGRLTLTQLDMFRQTCFSPILDMDVVFNDPLIHHLLLREVEEPRQDIISFDLFGKRVSFGKREFDLIISL
        MDLRLI++ +DWFP TL NLAH+DKT++R+K RLT TQLDMFRQTCF PILD+DVVFN PLIHHLLLREVEEPRQD+ISFDLFGKRVSFGKREFDLI  L
Subjt:  MDLRLILNCDDWFPTTLINLAHVDKTTSRLKGRLTLTQLDMFRQTCFSPILDMDVVFNDPLIHHLLLREVEEPRQDIISFDLFGKRVSFGKREFDLIISL

Query:  CHRMIRVDNDIPGRRLRACYFKDSVRVKCSELEKIFMEAVFDDDEDAVKVGIVYFVELAMMGKERKQLIGTTLLGVVDRWELFCNHDWSSLIFERTLWSL
         HRM RVDN IPGRRLRA YFKD VRVKCSELEKIF+E VF DDED VKV IVYF+ELAMMGKERKQ I T LLGVVDRWE+FCN+DWSS+IF+RT+WSL
Subjt:  CHRMIRVDNDIPGRRLRACYFKDSVRVKCSELEKIFMEAVFDDDEDAVKVGIVYFVELAMMGKERKQLIGTTLLGVVDRWELFCNHDWSSLIFERTLWSL

Query:  KNALKDKLPAYQQKARNDPTHQETYSLYGFHTHFRY-------GLRDD------------------IDRDVFDNTMSKVKEYLVSTNAEAEHMVRIMRPP
        KNALKDKL  YQQKA  DP+H ETYSLYGF   F+         L DD                  +  +VFDNT SKVKE+L++T+A+ +HMVR++ PP
Subjt:  KNALKDKLPAYQQKARNDPTHQETYSLYGFHTHFRY-------GLRDD------------------IDRDVFDNTMSKVKEYLVSTNAEAEHMVRIMRPP

Query:  EARTIPAPPAVLDPPAVPDPAVVTAPAAVRNPPADLERGTEERRVKDKGKNIIEDPVEEAETLDDDALQGPALDDVGPSGNGSEALQKSWVR--------
        E R IP PPAV D   VPDP      AAV +PPAD+E G             +EDPV +A           A+D+  PS N  E L+K   +        
Subjt:  EARTIPAPPAVLDPPAVPDPAVVTAPAAVRNPPADLERGTEERRVKDKGKNIIEDPVEEAETLDDDALQGPALDDVGPSGNGSEALQKSWVR--------

Query:  ------DNAHWSV-------------------LFVQGKFSDPTKYFGRGGGPDDDDPSDQRPDEA--PTRGPKSMDEDRRLEEVPKTDEYRTMDDNPKS
              DN   ++                      +GKF D +KYFG GGGPDDD PSDQRPDE+  P  G KSMDED+R +E  +TDE    +  P S
Subjt:  ------DNAHWSV-------------------LFVQGKFSDPTKYFGRGGGPDDDDPSDQRPDEA--PTRGPKSMDEDRRLEEVPKTDEYRTMDDNPKS

A0A6J1DM82 uncharacterized protein LOC1110223002.6e-10490.57Show/hide
Query:  MDLRLILNCDDWFPTTLINLAHVDKTTSRLKGRLTLTQLDMFRQTCFSPILDMDVVFNDPLIHHLLLREVEEPRQDIISFDLFGKRVSFGKREFDLIISL
        MDLRLILN DDWFP TL NLAH DKTTSRLKGRLT TQ+DMFRQTCF PILDMDVVFN PLIHHLLLREVEEPRQDIISFDLFGKRVSFGKREFDLI  L
Subjt:  MDLRLILNCDDWFPTTLINLAHVDKTTSRLKGRLTLTQLDMFRQTCFSPILDMDVVFNDPLIHHLLLREVEEPRQDIISFDLFGKRVSFGKREFDLIISL

Query:  CHRMIRVDNDIPGRRLRACYFKDSVRVKCSELEKIFMEAVFDDDEDAVKVGIVYFVELAMMGKERKQLIGTTLLGVVDRWELFCNHDWSSLIFERTLWSL
         +RMIRVDNDIPGRRLRA YFKDSVRVKCSELEKIFME VF DDEDAVKVGIVYFVELAMMGKERKQ I  TLLGVVDRWELFCNHDWSSLIFERTLWSL
Subjt:  CHRMIRVDNDIPGRRLRACYFKDSVRVKCSELEKIFMEAVFDDDEDAVKVGIVYFVELAMMGKERKQLIGTTLLGVVDRWELFCNHDWSSLIFERTLWSL

Query:  KNALKDKLPAYQ
        KNA+ DKLPAYQ
Subjt:  KNALKDKLPAYQ

A0A6J1DRZ7 uncharacterized protein LOC1110238473.6e-9059.26Show/hide
Query:  MDLRLILNCDDWFPTTLINLAHVDKTTSRLKGRLTLTQLDMFRQTCFSPILDMDVVFNDPLIHHLLLREVEEPRQDIISFDLFGKRVSFGKREFDLIISL
        M++ L +N DDWFP  L NLAHV KT+SRLK RLT +QLDMF QTCF PIL M+VVFN PL+HHLLLREVEEP+ D+ISF+LFG RVSFGKREFDLI  L
Subjt:  MDLRLILNCDDWFPTTLINLAHVDKTTSRLKGRLTLTQLDMFRQTCFSPILDMDVVFNDPLIHHLLLREVEEPRQDIISFDLFGKRVSFGKREFDLIISL

Query:  CHRMIRVDNDIPGRRLRACYFKDSVRVKCSELEKIFMEAVFDDDEDAVKVGIVYFVELAMMGKERKQLIGTTLLGVVDRWELFCNHDWSSLIFERTLWSL
         H M RVD D+  RRLR  YF+D   VKCSELEKIF+E  F++DEDAVK+ IVYF+ELAMMGKERK  + T+LLG+VDRWE+FCN+DWSS+IFERTLWSL
Subjt:  CHRMIRVDNDIPGRRLRACYFKDSVRVKCSELEKIFMEAVFDDDEDAVKVGIVYFVELAMMGKERKQLIGTTLLGVVDRWELFCNHDWSSLIFERTLWSL

Query:  KNALKDKLPAYQQKARNDPTHQETYSLYGF---------------HTHFRYGLRDD------------------IDRDVFDNTMSKVKEYLVSTNAE
        KNALKDK+  Y+QK   D +H ETYSLY F                T     L DD                  ++R+VF+N  SKV   L +T+ E
Subjt:  KNALKDKLPAYQQKARNDPTHQETYSLYGF---------------HTHFRYGLRDD------------------IDRDVFDNTMSKVKEYLVSTNAE

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAACCATATTCTCTGCCTTTCATCCCGGAACGAAACGGTCATATAATGGATTTGAGACTAATTCTAAACTGTGATGACTGGTTTCCGACCACGTTGATCAACCTTGC
CCATGTAGATAAAACCACTTCTAGGCTGAAGGGTAGGTTAACCCTAACCCAGTTAGACATGTTTAGGCAAACGTGTTTCAGTCCCATTTTGGACATGGACGTAGTTTTTA
ATGATCCATTAATACATCATCTATTGTTGAGAGAGGTTGAAGAGCCTAGGCAGGACATCATTAGTTTCGACCTGTTTGGGAAAAGGGTCTCCTTTGGTAAGCGGGAGTTT
GACCTAATCATCAGCCTCTGTCATAGGATGATTAGGGTAGATAACGATATTCCTGGCCGACGACTTCGAGCATGTTACTTTAAGGATAGTGTCAGGGTTAAGTGTAGTGA
GTTAGAGAAGATTTTTATGGAGGCAGTTTTTGACGATGATGAGGATGCTGTCAAGGTTGGCATAGTTTACTTCGTCGAGCTTGCCATGATGGGGAAGGAGAGGAAGCAGT
TAATAGGTACGACCCTTTTAGGGGTTGTGGATAGGTGGGAGCTGTTCTGCAATCACGACTGGAGTTCGTTGATTTTCGAAAGAACACTTTGGAGCCTGAAGAATGCCCTG
AAGGATAAACTACCAGCGTACCAACAGAAGGCGAGAAATGACCCCACACACCAAGAGACTTATAGTCTCTACGGGTTCCATACGCATTTCAGGTATGGGTTACGAGATGA
TATCGACAGAGACGTGTTCGATAACACGATGTCCAAGGTTAAGGAATACTTGGTTTCGACTAATGCTGAGGCAGAACACATGGTCCGTATCATGCGTCCACCGGAAGCTC
GCACTATACCTGCCCCGCCGGCTGTACTTGACCCGCCTGCAGTACCTGACCCGGCTGTTGTAACTGCCCCGGCTGCAGTACGTAACCCGCCTGCAGATTTGGAAAGGGGT
ACTGAGGAAAGAAGGGTGAAGGACAAAGGAAAGAACATCATAGAGGATCCGGTAGAAGAGGCCGAGACATTGGACGATGATGCATTACAGGGTCCTGCATTAGACGATGT
TGGACCCAGTGGAAATGGCAGCGAAGCCCTACAGAAGAGTTGGGTCCGAGATAACGCTCATTGGTCCGTACTTTTCGTGCAGGGTAAATTCTCTGATCCGACCAAATATT
TTGGACGTGGGGGTGGGCCCGATGATGATGATCCATCAGATCAAAGGCCTGATGAGGCCCCAACACGAGGTCCGAAGAGTATGGACGAGGACCGGAGGCTGGAAGAGGTC
CCTAAGACTGACGAGTATCGGACCATGGACGATAATCCGAAGAGTATGGACGAGGATCCGAAGAATATGGACGAGGATCCGATGTTTATGGTTGAAGACCAGGGTACGAT
AACGGAGCGGGACGATGCATCGGATGCTTACCCTGATCGTCCTGTTGGTTTGTTTCAGGACGCCACTGTTGGAATGCAAGAGCCGGACGTTGCGTCAGATACGCGACCCG
TCAGCCGACGCGTTAGGCGTCCCTATAAGGACTGGGCACCGGACGCAGTCATTAAGGTTGAACCTTACCTTGACCAGGACGAATATGACCTTCAGCAGGCCCCAATTGGG
CATGGGCTACGCAAGAGGCATTACTCGTGGAAGCTGAAGGATATATACACACCAACCGGTCAGCGTGGGATCACCGTGGATAGATACGACCCAGTACTTGATGGTCTCGT
CCTGTTTACAGCGAAAAAGTTGGAGAAGTGTCTCCATCTATGTCGCAAGAAGTTTGCGATAGGCGACGTACTTCTTTCGACTCTGCTGAATCGGACAGACGGTCCACTAG
GATCAACTACCCCTGGCGCGAAGAGAATACAATCTGGCGATATGTCCACGGGCGACATAACCGTATGGGATTCACTCCAAACGGTCACTCCACTGGATGAACTTGAGAAG
GAGTTGAAGCCCATGTGTACAATCCTACCTACGCTGCTGCATCATGGCGGGATATTTTCAGTTCGACCCGACTTGCCAATGGTGTCGTGGAGGGTACGTCGGGTTCGCGT
ACCACAGCAGAGTAGTGCGGCTGATTGCGAGATTTTCTGTGTCGGGTATTTCGAGTACGATGCCACCGGGTCAAATATGGACACTTTAACCAAAGATAATATTGTATATT
TTAGGCGTCAGTACGCTGTACAGATGTGGGCGCGTCGTCCCATTTTTTGA
mRNA sequenceShow/hide mRNA sequence
ATGAAACCATATTCTCTGCCTTTCATCCCGGAACGAAACGGTCATATAATGGATTTGAGACTAATTCTAAACTGTGATGACTGGTTTCCGACCACGTTGATCAACCTTGC
CCATGTAGATAAAACCACTTCTAGGCTGAAGGGTAGGTTAACCCTAACCCAGTTAGACATGTTTAGGCAAACGTGTTTCAGTCCCATTTTGGACATGGACGTAGTTTTTA
ATGATCCATTAATACATCATCTATTGTTGAGAGAGGTTGAAGAGCCTAGGCAGGACATCATTAGTTTCGACCTGTTTGGGAAAAGGGTCTCCTTTGGTAAGCGGGAGTTT
GACCTAATCATCAGCCTCTGTCATAGGATGATTAGGGTAGATAACGATATTCCTGGCCGACGACTTCGAGCATGTTACTTTAAGGATAGTGTCAGGGTTAAGTGTAGTGA
GTTAGAGAAGATTTTTATGGAGGCAGTTTTTGACGATGATGAGGATGCTGTCAAGGTTGGCATAGTTTACTTCGTCGAGCTTGCCATGATGGGGAAGGAGAGGAAGCAGT
TAATAGGTACGACCCTTTTAGGGGTTGTGGATAGGTGGGAGCTGTTCTGCAATCACGACTGGAGTTCGTTGATTTTCGAAAGAACACTTTGGAGCCTGAAGAATGCCCTG
AAGGATAAACTACCAGCGTACCAACAGAAGGCGAGAAATGACCCCACACACCAAGAGACTTATAGTCTCTACGGGTTCCATACGCATTTCAGGTATGGGTTACGAGATGA
TATCGACAGAGACGTGTTCGATAACACGATGTCCAAGGTTAAGGAATACTTGGTTTCGACTAATGCTGAGGCAGAACACATGGTCCGTATCATGCGTCCACCGGAAGCTC
GCACTATACCTGCCCCGCCGGCTGTACTTGACCCGCCTGCAGTACCTGACCCGGCTGTTGTAACTGCCCCGGCTGCAGTACGTAACCCGCCTGCAGATTTGGAAAGGGGT
ACTGAGGAAAGAAGGGTGAAGGACAAAGGAAAGAACATCATAGAGGATCCGGTAGAAGAGGCCGAGACATTGGACGATGATGCATTACAGGGTCCTGCATTAGACGATGT
TGGACCCAGTGGAAATGGCAGCGAAGCCCTACAGAAGAGTTGGGTCCGAGATAACGCTCATTGGTCCGTACTTTTCGTGCAGGGTAAATTCTCTGATCCGACCAAATATT
TTGGACGTGGGGGTGGGCCCGATGATGATGATCCATCAGATCAAAGGCCTGATGAGGCCCCAACACGAGGTCCGAAGAGTATGGACGAGGACCGGAGGCTGGAAGAGGTC
CCTAAGACTGACGAGTATCGGACCATGGACGATAATCCGAAGAGTATGGACGAGGATCCGAAGAATATGGACGAGGATCCGATGTTTATGGTTGAAGACCAGGGTACGAT
AACGGAGCGGGACGATGCATCGGATGCTTACCCTGATCGTCCTGTTGGTTTGTTTCAGGACGCCACTGTTGGAATGCAAGAGCCGGACGTTGCGTCAGATACGCGACCCG
TCAGCCGACGCGTTAGGCGTCCCTATAAGGACTGGGCACCGGACGCAGTCATTAAGGTTGAACCTTACCTTGACCAGGACGAATATGACCTTCAGCAGGCCCCAATTGGG
CATGGGCTACGCAAGAGGCATTACTCGTGGAAGCTGAAGGATATATACACACCAACCGGTCAGCGTGGGATCACCGTGGATAGATACGACCCAGTACTTGATGGTCTCGT
CCTGTTTACAGCGAAAAAGTTGGAGAAGTGTCTCCATCTATGTCGCAAGAAGTTTGCGATAGGCGACGTACTTCTTTCGACTCTGCTGAATCGGACAGACGGTCCACTAG
GATCAACTACCCCTGGCGCGAAGAGAATACAATCTGGCGATATGTCCACGGGCGACATAACCGTATGGGATTCACTCCAAACGGTCACTCCACTGGATGAACTTGAGAAG
GAGTTGAAGCCCATGTGTACAATCCTACCTACGCTGCTGCATCATGGCGGGATATTTTCAGTTCGACCCGACTTGCCAATGGTGTCGTGGAGGGTACGTCGGGTTCGCGT
ACCACAGCAGAGTAGTGCGGCTGATTGCGAGATTTTCTGTGTCGGGTATTTCGAGTACGATGCCACCGGGTCAAATATGGACACTTTAACCAAAGATAATATTGTATATT
TTAGGCGTCAGTACGCTGTACAGATGTGGGCGCGTCGTCCCATTTTTTGA
Protein sequenceShow/hide protein sequence
MKPYSLPFIPERNGHIMDLRLILNCDDWFPTTLINLAHVDKTTSRLKGRLTLTQLDMFRQTCFSPILDMDVVFNDPLIHHLLLREVEEPRQDIISFDLFGKRVSFGKREF
DLIISLCHRMIRVDNDIPGRRLRACYFKDSVRVKCSELEKIFMEAVFDDDEDAVKVGIVYFVELAMMGKERKQLIGTTLLGVVDRWELFCNHDWSSLIFERTLWSLKNAL
KDKLPAYQQKARNDPTHQETYSLYGFHTHFRYGLRDDIDRDVFDNTMSKVKEYLVSTNAEAEHMVRIMRPPEARTIPAPPAVLDPPAVPDPAVVTAPAAVRNPPADLERG
TEERRVKDKGKNIIEDPVEEAETLDDDALQGPALDDVGPSGNGSEALQKSWVRDNAHWSVLFVQGKFSDPTKYFGRGGGPDDDDPSDQRPDEAPTRGPKSMDEDRRLEEV
PKTDEYRTMDDNPKSMDEDPKNMDEDPMFMVEDQGTITERDDASDAYPDRPVGLFQDATVGMQEPDVASDTRPVSRRVRRPYKDWAPDAVIKVEPYLDQDEYDLQQAPIG
HGLRKRHYSWKLKDIYTPTGQRGITVDRYDPVLDGLVLFTAKKLEKCLHLCRKKFAIGDVLLSTLLNRTDGPLGSTTPGAKRIQSGDMSTGDITVWDSLQTVTPLDELEK
ELKPMCTILPTLLHHGGIFSVRPDLPMVSWRVRRVRVPQQSSAADCEIFCVGYFEYDATGSNMDTLTKDNIVYFRRQYAVQMWARRPIF