; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc01g14910 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc01g14910
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionUlp1-like peptidase
Genome locationchr1:9310994..9320651
RNA-Seq ExpressionMoc01g14910
SyntenyMoc01g14910
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0008234 - cysteine-type peptidase activity (molecular function)
InterPro domainsIPR003653 - Ulp1 protease family, C-terminal catalytic domain
IPR015410 - Domain of unknown function DUF1985
IPR038765 - Papain-like cysteine peptidase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022145823.1 uncharacterized protein LOC111015183 [Momordica charantia]6.2e-9695.26Show/hide
Query:  MDHQLRIKDNDRFPDQATSMSHLSNVNRLIKDKLTADQLDMFRRRTIFGRFVDLEMMFCSGVVHHFLSREVARSSDDSMSFLIGGNVLTFSKDQFMLITG
        MDHQLRIK+NDRFP QATSMSHLSNVNRLIKDKLT DQLDMFRRRTIFGRFVDLEMMFCSGVVHHFLSREVA SSDDS+  LIGGNV TFSKDQFMLITG
Subjt:  MDHQLRIKDNDRFPDQATSMSHLSNVNRLIKDKLTADQLDMFRRRTIFGRFVDLEMMFCSGVVHHFLSREVARSSDDSMSFLIGGNVLTFSKDQFMLITG

Query:  LWRLPGKVVQKKIGKNRLRRKYFNDEASMLLEEFVEVYKQTDFEDDEDAVKVTLILYTELVMMGKSKSKSKVDIDLYNQVDDLDYFNHLD
        LWRLPGKVVQKKIGKNRLRRKYFNDEASM+LEEFVEVYKQTDFEDDEDAVKVTLILYTELVMMGKSKSKSKVDIDLYNQVDDLDYFNHLD
Subjt:  LWRLPGKVVQKKIGKNRLRRKYFNDEASMLLEEFVEVYKQTDFEDDEDAVKVTLILYTELVMMGKSKSKSKVDIDLYNQVDDLDYFNHLD

XP_022152154.1 uncharacterized protein LOC111019943 [Momordica charantia]4.1e-9594.27Show/hide
Query:  MDHQLRIKDNDRFPDQATSMSHLSNVNRLIKDKLTADQLDMFRRRTIFGRFVDLEMMFCSGVVHHFLSREVARSSDDSMSFLIGGNVLTFSKDQFMLITG
        MDHQLRIK+ND FP QATSMSHLSNVNRLIKDKLTADQLDMFRR TIFGRFVDLEMMFCSGVVHHFLSREV RSSDDSMSFLIGGN+LTFSKDQFMLITG
Subjt:  MDHQLRIKDNDRFPDQATSMSHLSNVNRLIKDKLTADQLDMFRRRTIFGRFVDLEMMFCSGVVHHFLSREVARSSDDSMSFLIGGNVLTFSKDQFMLITG

Query:  LWRLPGKVVQKKIGKNRLRRKYFNDEASMLLEEFVEVYKQTDFEDDEDAVKVTLILYTELVMMGKSKSKSKVDIDLYNQVDDLDYFNHLDWG
        LWRL GKVVQKKIGKNRLRRKYFN EASMLLEEFVEVYKQTDFEDDEDA KVTLILYTELVMM KSK KSKVDIDLYNQVDDLDYFNHLDWG
Subjt:  LWRLPGKVVQKKIGKNRLRRKYFNDEASMLLEEFVEVYKQTDFEDDEDAVKVTLILYTELVMMGKSKSKSKVDIDLYNQVDDLDYFNHLDWG

XP_022154873.1 uncharacterized protein LOC111022026 [Momordica charantia]3.1e-7982.56Show/hide
Query:  MDHQLRIKDNDRFPDQATSMSHLSNVNRLIKDKLTADQLDMFRRRTIFGRFVDLEMMFCSGVVHHFLSREVARSSDDSMSFLIGGNVLTFSKDQFMLITG
        MDHQLRIK+NDRF  QATSMSHLSNVNRLIKDKLTADQLDMFRRRTIFGRFVDLEMMFCSGVVHHFLSREV  SSDDSMSFLIGGNVLTFSKDQFMLITG
Subjt:  MDHQLRIKDNDRFPDQATSMSHLSNVNRLIKDKLTADQLDMFRRRTIFGRFVDLEMMFCSGVVHHFLSREVARSSDDSMSFLIGGNVLTFSKDQFMLITG

Query:  LWRLPGKVVQKKIGKNRLRRKYFNDEASMLLEEFVEVYKQTDFEDDEDAVKVTLILYTELVMMGKSKSKSKVDIDLYNQVDDLDYFNHLDWGSDV
        LWRLP                           EFVEVYKQTDFEDDEDAVKVTLILYTELVMM KSKSKSKVDIDLYNQVDDL+YFNHLDWGSDV
Subjt:  LWRLPGKVVQKKIGKNRLRRKYFNDEASMLLEEFVEVYKQTDFEDDEDAVKVTLILYTELVMMGKSKSKSKVDIDLYNQVDDLDYFNHLDWGSDV

XP_022155155.1 uncharacterized protein LOC111022298 [Momordica charantia]3.2e-12495.4Show/hide
Query:  MDHQLRIKDNDRFPDQATSMSHLSNVNRLIKDKLTADQLDMFRRRTIFGRFVDLEMMFCSGVVHHFLSREVARSSDDSMSFLIGGNVLTFSKDQFMLITG
        MDHQLRIK+ND FP QAT MSHLSNVNRLIKDKLTADQLDMFRRRTIFGRFVDLEMMFCSGVVHHFLSREVA SSD++MSFLIGGNVLTFSKDQFMLITG
Subjt:  MDHQLRIKDNDRFPDQATSMSHLSNVNRLIKDKLTADQLDMFRRRTIFGRFVDLEMMFCSGVVHHFLSREVARSSDDSMSFLIGGNVLTFSKDQFMLITG

Query:  LWRLPGKVVQKKIGKNRLRRKYFNDEASMLLEEFVEVYKQTDFEDDEDAVKVTLILYTELVMMGKSKSKSKVDIDLYNQVDDLDYFNHLDWGSDVWSRTV
        LWRLPGK+VQKKIGKN LRRKYFNDEASMLLEEFVEVYKQTDFEDDEDAVK+TLILYTELVMMGKSKSKSKVDIDLYNQVDDLDYFNHLDWGSDVWSRTV
Subjt:  LWRLPGKVVQKKIGKNRLRRKYFNDEASMLLEEFVEVYKQTDFEDDEDAVKVTLILYTELVMMGKSKSKSKVDIDLYNQVDDLDYFNHLDWGSDVWSRTV

Query:  NGLKRAMNGKVALYKNKVRTNKKYLVKYSLQGFPLAFQV
        NGLKRAMNGKVALYKNKVRTNKKYLVKYSL GFPLAFQV
Subjt:  NGLKRAMNGKVALYKNKVRTNKKYLVKYSLQGFPLAFQV

XP_022157199.1 uncharacterized protein LOC111023969 [Momordica charantia]1.1e-10555.75Show/hide
Query:  MDHQLRIKDNDRFPDQATSMSHLSNVNRLIKDKLTADQLDMFRRRTIFGRFVDLEMMFCSGVVHHFLSREVARSSDDSMSFLIGGNVLTFSKDQFMLITG
        M+H L++ + DRFP Q TS+SHLS  N++I  KLT  QLDMFR+RTIFGRFVDL+MMFCS +VH+FL REV  +  D M F I G ++TFSK +F+L+TG
Subjt:  MDHQLRIKDNDRFPDQATSMSHLSNVNRLIKDKLTADQLDMFRRRTIFGRFVDLEMMFCSGVVHHFLSREVARSSDDSMSFLIGGNVLTFSKDQFMLITG

Query:  LWRLPGKVVQKKIGKNRLRRKYFNDEASMLLEEFVEVYKQTDFEDDEDAVKVTLILYTELVMMGKSKSKSKVDIDLYNQVDDLDYFNHLDWGSDVWSRTV
        LWR  G+V+QKK+ KNRLRR+YF D   + LEEF + YK+  F +D+DAVKV+LI YTE+VMMGK+K KS VD DLY QV+DLDYFN++DWG+ +W RT+
Subjt:  LWRLPGKVVQKKIGKNRLRRKYFNDEASMLLEEFVEVYKQTDFEDDEDAVKVTLILYTELVMMGKSKSKSKVDIDLYNQVDDLDYFNHLDWGSDVWSRTV

Query:  NGLKRAMNGKVALYKNKVRTNKKYLVKYSLQGFPLAFQVWIYEVVPSLITPGVNRLSETAIPRIIRYSCSRVVDTKDLGEEVLGSAVLVISYPLVETELD
         GL+ AM  KV  YKNKV TNKK+ V+YSL GFP+AFQVW YE++PSL+  GVNRLS+TA+PRI RYSCS+ + +K L  +V  S+ L I++PLVE+E +
Subjt:  NGLKRAMNGKVALYKNKVRTNKKYLVKYSLQGFPLAFQVWIYEVVPSLITPGVNRLSETAIPRIIRYSCSRVVDTKDLGEEVLGSAVLVISYPLVETELD

Query:  KDYQRCPLDEREVVDLTAPGCSTSDSDDGHNPSPITDNLGDEDDLPLD
        + Y+  P D R  ++    G   SD DD        D+ G ++D   D
Subjt:  KDYQRCPLDEREVVDLTAPGCSTSDSDDGHNPSPITDNLGDEDDLPLD

TrEMBL top hitse value%identityAlignment
A0A6J1CX02 uncharacterized protein LOC1110151833.0e-9695.26Show/hide
Query:  MDHQLRIKDNDRFPDQATSMSHLSNVNRLIKDKLTADQLDMFRRRTIFGRFVDLEMMFCSGVVHHFLSREVARSSDDSMSFLIGGNVLTFSKDQFMLITG
        MDHQLRIK+NDRFP QATSMSHLSNVNRLIKDKLT DQLDMFRRRTIFGRFVDLEMMFCSGVVHHFLSREVA SSDDS+  LIGGNV TFSKDQFMLITG
Subjt:  MDHQLRIKDNDRFPDQATSMSHLSNVNRLIKDKLTADQLDMFRRRTIFGRFVDLEMMFCSGVVHHFLSREVARSSDDSMSFLIGGNVLTFSKDQFMLITG

Query:  LWRLPGKVVQKKIGKNRLRRKYFNDEASMLLEEFVEVYKQTDFEDDEDAVKVTLILYTELVMMGKSKSKSKVDIDLYNQVDDLDYFNHLD
        LWRLPGKVVQKKIGKNRLRRKYFNDEASM+LEEFVEVYKQTDFEDDEDAVKVTLILYTELVMMGKSKSKSKVDIDLYNQVDDLDYFNHLD
Subjt:  LWRLPGKVVQKKIGKNRLRRKYFNDEASMLLEEFVEVYKQTDFEDDEDAVKVTLILYTELVMMGKSKSKSKVDIDLYNQVDDLDYFNHLD

A0A6J1DF70 uncharacterized protein LOC1110199432.0e-9594.27Show/hide
Query:  MDHQLRIKDNDRFPDQATSMSHLSNVNRLIKDKLTADQLDMFRRRTIFGRFVDLEMMFCSGVVHHFLSREVARSSDDSMSFLIGGNVLTFSKDQFMLITG
        MDHQLRIK+ND FP QATSMSHLSNVNRLIKDKLTADQLDMFRR TIFGRFVDLEMMFCSGVVHHFLSREV RSSDDSMSFLIGGN+LTFSKDQFMLITG
Subjt:  MDHQLRIKDNDRFPDQATSMSHLSNVNRLIKDKLTADQLDMFRRRTIFGRFVDLEMMFCSGVVHHFLSREVARSSDDSMSFLIGGNVLTFSKDQFMLITG

Query:  LWRLPGKVVQKKIGKNRLRRKYFNDEASMLLEEFVEVYKQTDFEDDEDAVKVTLILYTELVMMGKSKSKSKVDIDLYNQVDDLDYFNHLDWG
        LWRL GKVVQKKIGKNRLRRKYFN EASMLLEEFVEVYKQTDFEDDEDA KVTLILYTELVMM KSK KSKVDIDLYNQVDDLDYFNHLDWG
Subjt:  LWRLPGKVVQKKIGKNRLRRKYFNDEASMLLEEFVEVYKQTDFEDDEDAVKVTLILYTELVMMGKSKSKSKVDIDLYNQVDDLDYFNHLDWG

A0A6J1DLH1 uncharacterized protein LOC1110220261.5e-7982.56Show/hide
Query:  MDHQLRIKDNDRFPDQATSMSHLSNVNRLIKDKLTADQLDMFRRRTIFGRFVDLEMMFCSGVVHHFLSREVARSSDDSMSFLIGGNVLTFSKDQFMLITG
        MDHQLRIK+NDRF  QATSMSHLSNVNRLIKDKLTADQLDMFRRRTIFGRFVDLEMMFCSGVVHHFLSREV  SSDDSMSFLIGGNVLTFSKDQFMLITG
Subjt:  MDHQLRIKDNDRFPDQATSMSHLSNVNRLIKDKLTADQLDMFRRRTIFGRFVDLEMMFCSGVVHHFLSREVARSSDDSMSFLIGGNVLTFSKDQFMLITG

Query:  LWRLPGKVVQKKIGKNRLRRKYFNDEASMLLEEFVEVYKQTDFEDDEDAVKVTLILYTELVMMGKSKSKSKVDIDLYNQVDDLDYFNHLDWGSDV
        LWRLP                           EFVEVYKQTDFEDDEDAVKVTLILYTELVMM KSKSKSKVDIDLYNQVDDL+YFNHLDWGSDV
Subjt:  LWRLPGKVVQKKIGKNRLRRKYFNDEASMLLEEFVEVYKQTDFEDDEDAVKVTLILYTELVMMGKSKSKSKVDIDLYNQVDDLDYFNHLDWGSDV

A0A6J1DLM5 uncharacterized protein LOC1110222981.5e-12495.4Show/hide
Query:  MDHQLRIKDNDRFPDQATSMSHLSNVNRLIKDKLTADQLDMFRRRTIFGRFVDLEMMFCSGVVHHFLSREVARSSDDSMSFLIGGNVLTFSKDQFMLITG
        MDHQLRIK+ND FP QAT MSHLSNVNRLIKDKLTADQLDMFRRRTIFGRFVDLEMMFCSGVVHHFLSREVA SSD++MSFLIGGNVLTFSKDQFMLITG
Subjt:  MDHQLRIKDNDRFPDQATSMSHLSNVNRLIKDKLTADQLDMFRRRTIFGRFVDLEMMFCSGVVHHFLSREVARSSDDSMSFLIGGNVLTFSKDQFMLITG

Query:  LWRLPGKVVQKKIGKNRLRRKYFNDEASMLLEEFVEVYKQTDFEDDEDAVKVTLILYTELVMMGKSKSKSKVDIDLYNQVDDLDYFNHLDWGSDVWSRTV
        LWRLPGK+VQKKIGKN LRRKYFNDEASMLLEEFVEVYKQTDFEDDEDAVK+TLILYTELVMMGKSKSKSKVDIDLYNQVDDLDYFNHLDWGSDVWSRTV
Subjt:  LWRLPGKVVQKKIGKNRLRRKYFNDEASMLLEEFVEVYKQTDFEDDEDAVKVTLILYTELVMMGKSKSKSKVDIDLYNQVDDLDYFNHLDWGSDVWSRTV

Query:  NGLKRAMNGKVALYKNKVRTNKKYLVKYSLQGFPLAFQV
        NGLKRAMNGKVALYKNKVRTNKKYLVKYSL GFPLAFQV
Subjt:  NGLKRAMNGKVALYKNKVRTNKKYLVKYSLQGFPLAFQV

A0A6J1DSS5 uncharacterized protein LOC1110239695.5e-10655.75Show/hide
Query:  MDHQLRIKDNDRFPDQATSMSHLSNVNRLIKDKLTADQLDMFRRRTIFGRFVDLEMMFCSGVVHHFLSREVARSSDDSMSFLIGGNVLTFSKDQFMLITG
        M+H L++ + DRFP Q TS+SHLS  N++I  KLT  QLDMFR+RTIFGRFVDL+MMFCS +VH+FL REV  +  D M F I G ++TFSK +F+L+TG
Subjt:  MDHQLRIKDNDRFPDQATSMSHLSNVNRLIKDKLTADQLDMFRRRTIFGRFVDLEMMFCSGVVHHFLSREVARSSDDSMSFLIGGNVLTFSKDQFMLITG

Query:  LWRLPGKVVQKKIGKNRLRRKYFNDEASMLLEEFVEVYKQTDFEDDEDAVKVTLILYTELVMMGKSKSKSKVDIDLYNQVDDLDYFNHLDWGSDVWSRTV
        LWR  G+V+QKK+ KNRLRR+YF D   + LEEF + YK+  F +D+DAVKV+LI YTE+VMMGK+K KS VD DLY QV+DLDYFN++DWG+ +W RT+
Subjt:  LWRLPGKVVQKKIGKNRLRRKYFNDEASMLLEEFVEVYKQTDFEDDEDAVKVTLILYTELVMMGKSKSKSKVDIDLYNQVDDLDYFNHLDWGSDVWSRTV

Query:  NGLKRAMNGKVALYKNKVRTNKKYLVKYSLQGFPLAFQVWIYEVVPSLITPGVNRLSETAIPRIIRYSCSRVVDTKDLGEEVLGSAVLVISYPLVETELD
         GL+ AM  KV  YKNKV TNKK+ V+YSL GFP+AFQVW YE++PSL+  GVNRLS+TA+PRI RYSCS+ + +K L  +V  S+ L I++PLVE+E +
Subjt:  NGLKRAMNGKVALYKNKVRTNKKYLVKYSLQGFPLAFQVWIYEVVPSLITPGVNRLSETAIPRIIRYSCSRVVDTKDLGEEVLGSAVLVISYPLVETELD

Query:  KDYQRCPLDEREVVDLTAPGCSTSDSDDGHNPSPITDNLGDEDDLPLD
        + Y+  P D R  ++    G   SD DD        D+ G ++D   D
Subjt:  KDYQRCPLDEREVVDLTAPGCSTSDSDDGHNPSPITDNLGDEDDLPLD

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT5G45570.1 Ulp1 protease family protein8.9e-0825Show/hide
Query:  WVDVDVVYSPLCIK-DHWVLVAIDMTQSEIFVYDSLPGHISTSKLLTDMRPLSHTIPSLLYACGLMDTADCKLKRTPWRVYRPTTDTRQKGSIDCGIFAC
        +VDVD +Y+ L +  +HWV + ID+T   + VYDS+P   + +++      +   IP++L +            +  W+      +    G  DC I++ 
Subjt:  WVDVDVVYSPLCIK-DHWVLVAIDMTQSEIFVYDSLPGHISTSKLLTDMRPLSHTIPSLLYACGLMDTADCKLKRTPWRVYRPTTDTRQKGSIDCGIFAC

Query:  KFLEYLVSGNSLETLVQAQVSHIRRQYATQLW
        K++E L  G S + L    +  +R + A +++
Subjt:  KFLEYLVSGNSLETLVQAQVSHIRRQYATQLW


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAAGTCGTCACTGCCAGATTTGTTGTTGCAAGAAATGGATCGTCGAAAGTGTCGTCTGAAGCTTCGCAGGTGCCGTTGGAATGCTGGTTCAACGCCGGATCGTGGGT
GTGCTGCGGGAGCTCACTGAGATCTACTGTGGGTGTGCTATCGGAGTTTGTAGAGGATGTTGTCCAATGCCAAAGTAGGTTTTCAGAAGGTCGGGCGCCTGATCCGCTAC
CTTTGACGCCTCCAGAAATTGCTATTCGCACCACTGTCGGTCGATCACTGCCGGATCTGTCATGCCGGTGTGGTCGTTGGAGTTTGCAGTCCCTGCCGGATCTGCCACGC
CGGGGTGTTCGCCAGAGTCTGCCGCTACCTTCTGTGTGGGTGTCGAAGGGGTGCACAATGGACCATCAGTTGAGGATTAAGGACAATGACCGCTTTCCGGATCAAGCCAC
CAGCATGTCTCACTTGAGCAATGTCAACAGGCTTATCAAGGATAAACTCACAGCGGACCAACTTGATATGTTCCGTAGAAGAACAATATTTGGTCGATTTGTCGACTTGG
AGATGATGTTCTGCAGTGGTGTAGTTCATCACTTTCTGTCAAGGGAGGTTGCTAGGAGCAGTGACGACAGCATGAGTTTCTTAATTGGTGGCAACGTGTTGACATTCTCG
AAGGATCAATTCATGCTTATAACGGGATTGTGGCGGCTGCCCGGAAAGGTGGTCCAGAAAAAGATTGGAAAGAATAGGTTGCGGAGGAAGTACTTCAACGATGAAGCCTC
CATGCTGCTCGAAGAGTTTGTGGAGGTTTACAAACAGACTGATTTCGAGGACGACGAGGACGCCGTTAAAGTGACATTAATTTTGTACACGGAGCTTGTGATGATGGGAA
AGAGCAAGAGCAAGTCGAAGGTTGACATCGACTTGTACAACCAAGTCGATGACTTGGACTACTTCAACCATTTGGACTGGGGTTCTGATGTCTGGAGTAGGACAGTTAAC
GGTTTGAAGCGTGCGATGAATGGAAAAGTTGCGCTATACAAGAACAAAGTAAGAACGAACAAAAAGTATCTAGTAAAGTATAGCCTACAGGGATTTCCGCTTGCGTTTCA
GGTGTGGATATACGAGGTTGTCCCATCTCTCATCACTCCCGGTGTCAATCGTTTGAGCGAGACCGCCATTCCCCGGATAATTCGGTATTCGTGCAGTAGAGTCGTCGATA
CAAAAGATCTGGGGGAGGAGGTCCTTGGTTCAGCGGTGTTGGTCATATCTTATCCACTCGTGGAGACGGAGCTGGATAAGGACTACCAGAGGTGTCCATTGGACGAAAGA
GAGGTGGTTGATTTAACTGCGCCTGGGTGTTCCACCTCCGACAGTGATGATGGACACAATCCTTCCCCCATCACCGACAATCTTGGCGACGAAGACGATCTCCCACTCGA
CGATGCGCATTCGTTGGAAACGAATGTACAGGCAATGTCAGATGAGTCTTCGGATATGCCACGTACAGAGGCCGCATCTGAAGGTGGGCAACGGACACCGGTCGAAGTAC
TTCAACCAAGTACTTCTATTCGATCGAATGTGGGGCAAAGCACGCGGCAATCACCGCGAGCGACATCACGCGATGTTTTCCCTACACAACGGCACGACACCCGTCGATCG
AATGATAGATTCGGGGCTATGGAGAGAAGGCTGGATCTTTTAGCTTCAGACATGGCGGAGGTGAAGACGGATTTGGCGAAGGTCAAGTCCGACTTGAGTGAAATGAAACT
CATGCTTCAACGGTTGTGCCAGATCGATAGGCGAGAGGTGAATATTGGTGTCTCTCCGTCGGATACAGTCCACGTGTCACATCCATTGGTATCCAATGTTATCCCCGAGC
ATGATGGGGATGCTGATGACCACCAACCTGGAGGTTCCGATGCTGGAAAGAAGGACGATGTGGTTCCTGTAGAAGCGTCGTTGCATGAAAAGGCAACGGATGGAGTAGAG
ATGACCATACCCCCATCCGATCTTGGAGATGCAGAACTGACCAACCCCACGTGTATTGTCGATTCGGTGGAGTTGGATGTTGCAGTGGTGACACCCATTGTTTCGACAGA
GATGGTGGAACTCGAAATGACACCGCCAATAGTACAGGATCCACAAGCAGAGACGACGTCCGATCTAACCTTCGAGCCTCCTGCCTCAACCAACATTGATGGTCCGTGTG
GCATGATCCATGGGCCTCGTCAAGCCGAGCATATTGAGTTGGCCCTTACACCAGCGGATACGAGCCCCACTACTCAACCTATTCCCACACTTACACCAGCATATACGACT
CTCATCCCTCAACCTATTCCCACCCTTACACCAGCTGAAAACCCCACCACCCGTCATCCGAGTGATCCCGTGGGTTCTACTAACCTCGCGTTAGACAAAATTTCTGAACC
ATTGGCCATCGTGCACCAGCCAACTAAGGAGAAGAACCCCCCTCATGGCAAAAAAGCCACCACAATCCGATTTACGGCACCGCAAGAAGCCCCACTCTTTGTCAGCGGTT
CTGCTGTTAACGAACCCACTAAGCCGAAGAAAACTGAACAACAAACCGCTCCTAAGCAGTCGGCCTGGAAAATCGAGGTTTCGTATCCCGACGAAACAAGAAGGACCGAG
CGTAAGCGGACGGAAACGAAACCATTCAGTCCGAAGGACACGCGTTGGGAGATGATGCGTTGGGTACGGGACCCTGGGAATGACAAAACAACGCCGCCGTCTACAACTTG
GAATGTGCAGAGCGGATATTCCAGAAGATTCTTCATTAACATCCTCAATCCTAAGGAGAAGGTGGAAGACCCGGAAGCTGCTGTCATTCTATATTTCATTATGAGGAAGC
TTGGTAGTCGGCCGCACCTGTGCGTTCATAAGTTTTCTGTCCTAGACCCACTACAAATGCAAGTTCTTGCCGCTGCAGGTGGTCCCTATGCACGAATCAAGGGGAAGGTC
GTCCAGGACACGACCAATACTTGGGACGAGTATAAGGAGTGCATGGATGTCGTGCTGGGTCAGGTGGAAGATTTCATTCCATCCTGGGTGGACGTCGACGTAGTGTACAG
CCCGCTCTGTATCAAGGATCACTGGGTCCTGGTTGCGATAGATATGACCCAATCCGAGATTTTTGTATACGACTCATTGCCAGGCCACATTTCCACGTCGAAGTTGCTGA
CAGACATGCGGCCGTTGAGTCATACAATCCCATCGCTTTTGTACGCATGTGGGCTGATGGATACGGCCGATTGCAAGCTGAAGAGGACTCCGTGGCGTGTATACCGTCCT
ACGACCGACACGAGGCAGAAAGGTAGTATAGACTGTGGTATTTTTGCATGTAAATTTTTGGAATATCTTGTGTCGGGTAATAGTTTAGAAACTCTTGTTCAGGCTCAAGT
GTCGCACATTAGAAGGCAGTATGCGACACAACTTTGGCATAATGAACCTTACTTTGAATGA
mRNA sequenceShow/hide mRNA sequence
ATGGAAGTCGTCACTGCCAGATTTGTTGTTGCAAGAAATGGATCGTCGAAAGTGTCGTCTGAAGCTTCGCAGGTGCCGTTGGAATGCTGGTTCAACGCCGGATCGTGGGT
GTGCTGCGGGAGCTCACTGAGATCTACTGTGGGTGTGCTATCGGAGTTTGTAGAGGATGTTGTCCAATGCCAAAGTAGGTTTTCAGAAGGTCGGGCGCCTGATCCGCTAC
CTTTGACGCCTCCAGAAATTGCTATTCGCACCACTGTCGGTCGATCACTGCCGGATCTGTCATGCCGGTGTGGTCGTTGGAGTTTGCAGTCCCTGCCGGATCTGCCACGC
CGGGGTGTTCGCCAGAGTCTGCCGCTACCTTCTGTGTGGGTGTCGAAGGGGTGCACAATGGACCATCAGTTGAGGATTAAGGACAATGACCGCTTTCCGGATCAAGCCAC
CAGCATGTCTCACTTGAGCAATGTCAACAGGCTTATCAAGGATAAACTCACAGCGGACCAACTTGATATGTTCCGTAGAAGAACAATATTTGGTCGATTTGTCGACTTGG
AGATGATGTTCTGCAGTGGTGTAGTTCATCACTTTCTGTCAAGGGAGGTTGCTAGGAGCAGTGACGACAGCATGAGTTTCTTAATTGGTGGCAACGTGTTGACATTCTCG
AAGGATCAATTCATGCTTATAACGGGATTGTGGCGGCTGCCCGGAAAGGTGGTCCAGAAAAAGATTGGAAAGAATAGGTTGCGGAGGAAGTACTTCAACGATGAAGCCTC
CATGCTGCTCGAAGAGTTTGTGGAGGTTTACAAACAGACTGATTTCGAGGACGACGAGGACGCCGTTAAAGTGACATTAATTTTGTACACGGAGCTTGTGATGATGGGAA
AGAGCAAGAGCAAGTCGAAGGTTGACATCGACTTGTACAACCAAGTCGATGACTTGGACTACTTCAACCATTTGGACTGGGGTTCTGATGTCTGGAGTAGGACAGTTAAC
GGTTTGAAGCGTGCGATGAATGGAAAAGTTGCGCTATACAAGAACAAAGTAAGAACGAACAAAAAGTATCTAGTAAAGTATAGCCTACAGGGATTTCCGCTTGCGTTTCA
GGTGTGGATATACGAGGTTGTCCCATCTCTCATCACTCCCGGTGTCAATCGTTTGAGCGAGACCGCCATTCCCCGGATAATTCGGTATTCGTGCAGTAGAGTCGTCGATA
CAAAAGATCTGGGGGAGGAGGTCCTTGGTTCAGCGGTGTTGGTCATATCTTATCCACTCGTGGAGACGGAGCTGGATAAGGACTACCAGAGGTGTCCATTGGACGAAAGA
GAGGTGGTTGATTTAACTGCGCCTGGGTGTTCCACCTCCGACAGTGATGATGGACACAATCCTTCCCCCATCACCGACAATCTTGGCGACGAAGACGATCTCCCACTCGA
CGATGCGCATTCGTTGGAAACGAATGTACAGGCAATGTCAGATGAGTCTTCGGATATGCCACGTACAGAGGCCGCATCTGAAGGTGGGCAACGGACACCGGTCGAAGTAC
TTCAACCAAGTACTTCTATTCGATCGAATGTGGGGCAAAGCACGCGGCAATCACCGCGAGCGACATCACGCGATGTTTTCCCTACACAACGGCACGACACCCGTCGATCG
AATGATAGATTCGGGGCTATGGAGAGAAGGCTGGATCTTTTAGCTTCAGACATGGCGGAGGTGAAGACGGATTTGGCGAAGGTCAAGTCCGACTTGAGTGAAATGAAACT
CATGCTTCAACGGTTGTGCCAGATCGATAGGCGAGAGGTGAATATTGGTGTCTCTCCGTCGGATACAGTCCACGTGTCACATCCATTGGTATCCAATGTTATCCCCGAGC
ATGATGGGGATGCTGATGACCACCAACCTGGAGGTTCCGATGCTGGAAAGAAGGACGATGTGGTTCCTGTAGAAGCGTCGTTGCATGAAAAGGCAACGGATGGAGTAGAG
ATGACCATACCCCCATCCGATCTTGGAGATGCAGAACTGACCAACCCCACGTGTATTGTCGATTCGGTGGAGTTGGATGTTGCAGTGGTGACACCCATTGTTTCGACAGA
GATGGTGGAACTCGAAATGACACCGCCAATAGTACAGGATCCACAAGCAGAGACGACGTCCGATCTAACCTTCGAGCCTCCTGCCTCAACCAACATTGATGGTCCGTGTG
GCATGATCCATGGGCCTCGTCAAGCCGAGCATATTGAGTTGGCCCTTACACCAGCGGATACGAGCCCCACTACTCAACCTATTCCCACACTTACACCAGCATATACGACT
CTCATCCCTCAACCTATTCCCACCCTTACACCAGCTGAAAACCCCACCACCCGTCATCCGAGTGATCCCGTGGGTTCTACTAACCTCGCGTTAGACAAAATTTCTGAACC
ATTGGCCATCGTGCACCAGCCAACTAAGGAGAAGAACCCCCCTCATGGCAAAAAAGCCACCACAATCCGATTTACGGCACCGCAAGAAGCCCCACTCTTTGTCAGCGGTT
CTGCTGTTAACGAACCCACTAAGCCGAAGAAAACTGAACAACAAACCGCTCCTAAGCAGTCGGCCTGGAAAATCGAGGTTTCGTATCCCGACGAAACAAGAAGGACCGAG
CGTAAGCGGACGGAAACGAAACCATTCAGTCCGAAGGACACGCGTTGGGAGATGATGCGTTGGGTACGGGACCCTGGGAATGACAAAACAACGCCGCCGTCTACAACTTG
GAATGTGCAGAGCGGATATTCCAGAAGATTCTTCATTAACATCCTCAATCCTAAGGAGAAGGTGGAAGACCCGGAAGCTGCTGTCATTCTATATTTCATTATGAGGAAGC
TTGGTAGTCGGCCGCACCTGTGCGTTCATAAGTTTTCTGTCCTAGACCCACTACAAATGCAAGTTCTTGCCGCTGCAGGTGGTCCCTATGCACGAATCAAGGGGAAGGTC
GTCCAGGACACGACCAATACTTGGGACGAGTATAAGGAGTGCATGGATGTCGTGCTGGGTCAGGTGGAAGATTTCATTCCATCCTGGGTGGACGTCGACGTAGTGTACAG
CCCGCTCTGTATCAAGGATCACTGGGTCCTGGTTGCGATAGATATGACCCAATCCGAGATTTTTGTATACGACTCATTGCCAGGCCACATTTCCACGTCGAAGTTGCTGA
CAGACATGCGGCCGTTGAGTCATACAATCCCATCGCTTTTGTACGCATGTGGGCTGATGGATACGGCCGATTGCAAGCTGAAGAGGACTCCGTGGCGTGTATACCGTCCT
ACGACCGACACGAGGCAGAAAGGTAGTATAGACTGTGGTATTTTTGCATGTAAATTTTTGGAATATCTTGTGTCGGGTAATAGTTTAGAAACTCTTGTTCAGGCTCAAGT
GTCGCACATTAGAAGGCAGTATGCGACACAACTTTGGCATAATGAACCTTACTTTGAATGA
Protein sequenceShow/hide protein sequence
MEVVTARFVVARNGSSKVSSEASQVPLECWFNAGSWVCCGSSLRSTVGVLSEFVEDVVQCQSRFSEGRAPDPLPLTPPEIAIRTTVGRSLPDLSCRCGRWSLQSLPDLPR
RGVRQSLPLPSVWVSKGCTMDHQLRIKDNDRFPDQATSMSHLSNVNRLIKDKLTADQLDMFRRRTIFGRFVDLEMMFCSGVVHHFLSREVARSSDDSMSFLIGGNVLTFS
KDQFMLITGLWRLPGKVVQKKIGKNRLRRKYFNDEASMLLEEFVEVYKQTDFEDDEDAVKVTLILYTELVMMGKSKSKSKVDIDLYNQVDDLDYFNHLDWGSDVWSRTVN
GLKRAMNGKVALYKNKVRTNKKYLVKYSLQGFPLAFQVWIYEVVPSLITPGVNRLSETAIPRIIRYSCSRVVDTKDLGEEVLGSAVLVISYPLVETELDKDYQRCPLDER
EVVDLTAPGCSTSDSDDGHNPSPITDNLGDEDDLPLDDAHSLETNVQAMSDESSDMPRTEAASEGGQRTPVEVLQPSTSIRSNVGQSTRQSPRATSRDVFPTQRHDTRRS
NDRFGAMERRLDLLASDMAEVKTDLAKVKSDLSEMKLMLQRLCQIDRREVNIGVSPSDTVHVSHPLVSNVIPEHDGDADDHQPGGSDAGKKDDVVPVEASLHEKATDGVE
MTIPPSDLGDAELTNPTCIVDSVELDVAVVTPIVSTEMVELEMTPPIVQDPQAETTSDLTFEPPASTNIDGPCGMIHGPRQAEHIELALTPADTSPTTQPIPTLTPAYTT
LIPQPIPTLTPAENPTTRHPSDPVGSTNLALDKISEPLAIVHQPTKEKNPPHGKKATTIRFTAPQEAPLFVSGSAVNEPTKPKKTEQQTAPKQSAWKIEVSYPDETRRTE
RKRTETKPFSPKDTRWEMMRWVRDPGNDKTTPPSTTWNVQSGYSRRFFINILNPKEKVEDPEAAVILYFIMRKLGSRPHLCVHKFSVLDPLQMQVLAAAGGPYARIKGKV
VQDTTNTWDEYKECMDVVLGQVEDFIPSWVDVDVVYSPLCIKDHWVLVAIDMTQSEIFVYDSLPGHISTSKLLTDMRPLSHTIPSLLYACGLMDTADCKLKRTPWRVYRP
TTDTRQKGSIDCGIFACKFLEYLVSGNSLETLVQAQVSHIRRQYATQLWHNEPYFE