; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc01g01130 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc01g01130
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionULP_PROTEASE domain-containing protein
Genome locationchr1:832924..844188
RNA-Seq ExpressionMoc01g01130
SyntenyMoc01g01130
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0040934.1 uncharacterized protein E6C27_scaffold125G001250 [Cucumis melo var. makuwa]4.6e-4232.85Show/hide
Query:  GETSKMRKKRGPTGLHEITQISSEGHRWVIKYNVKGQPIGHNATKLK----------------TWN--------KLHRYIDAFMQR----QSE-------
        G  +K +  RGPTG   IT++S +GH+ V++YN  GQPIG +ATKLK                +WN        K++  I+  +++    Q+E       
Subjt:  GETSKMRKKRGPTGLHEITQISSEGHRWVIKYNVKGQPIGHNATKLK----------------TWN--------KLHRYIDAFMQR----QSE-------

Query:  --------------TNREKRKKHMYNHCMTRKGYANLAEELKIDASHERPSDRCILWKKSQTTQ-----KVLERIDNLMITQDLQEENVNKDEDLLSIAL
                        RE+RK + YNH M+RKGYANL EE+K   S+    DR ++WKK++TT+      +  + DNL++++       N   D+LS A+
Subjt:  --------------TNREKRKKHMYNHCMTRKGYANLAEELKIDASHERPSDRCILWKKSQTTQ-----KVLERIDNLMITQDLQEENVNKDEDLLSIAL

Query:  GSWDRPGLVRAVGRGITKTQYFHNPTKYSAPLVGEKKVDIEEYDRMAALVKELQAKLKKHEKSSPKSKHGMPAKKTPKKSPKLKCTTPSKHAPKKSPRSK
        G  D PG +R VG+ +T ++YFH   +     VG+++   EE  RMAA + EL+ +L KH             K+ P+ + K + T  SK   + + +S 
Subjt:  GSWDRPGLVRAVGRGITKTQYFHNPTKYSAPLVGEKKVDIEEYDRMAALVKELQAKLKKHEKSSPKSKHGMPAKKTPKKSPKLKCTTPSKHAPKKSPRSK

Query:  RTTPSKNDPKKSPQLKRTTPSKNASKKSSQPKRGTLSKHERDENILDKEETL-EFKEGTHCRLALGSIDNVVAAGTIFESGRNDGNVKVSIDVVVDYDSR
         T+   ND     + ++        KK    K G     E ++ +    ETL + K+GT CRLA+G+ DNVV AGTIF+   +  NVKVS+D+V D +  
Subjt:  RTTPSKNDPKKSPQLKRTTPSKNASKKSSQPKRGTLSKHERDENILDKEETL-EFKEGTHCRLALGSIDNVVAAGTIFESGRNDGNVKVSIDVVVDYDSR

Query:  LLIPTQGGNDILSQEIDDILSVITMAVRNIQKKPF---------------ALKRVLCPKQPNAVECGYYVMRLMRDIVFACNTTILE
        + +PT+    +LSQE+   L      V  + KK F                    L  +    VECGYYVM+ MRDI+ + N TI+E
Subjt:  LLIPTQGGNDILSQEIDDILSVITMAVRNIQKKPF---------------ALKRVLCPKQPNAVECGYYVMRLMRDIVFACNTTILE

KAA0041518.1 uncharacterized protein E6C27_scaffold6G001110 [Cucumis melo var. makuwa]3.4e-3727.95Show/hide
Query:  GETSKMRKKRGPTGLHEITQISSEGHRWVIKYNVKGQPIGHNATKLKTW------------------------NKLHRYID-------------------
        G  +K +  RGPT + EIT++S + H+ V++YN  GQPIG +ATKLK++                        +K++  I+                   
Subjt:  GETSKMRKKRGPTGLHEITQISSEGHRWVIKYNVKGQPIGHNATKLKTW------------------------NKLHRYID-------------------

Query:  -------------------------------AFMQRQ-----------------SETNREKRKKHMYNHCMTRKGYANLAEELKIDASHERPSDRCILWK
                                       +F+ R+                 S   RE+RK + YNH M+RKGYANL EE+K  ++ E   +R ++WK
Subjt:  -------------------------------AFMQRQ-----------------SETNREKRKKHMYNHCMTRKGYANLAEELKIDASHERPSDRCILWK

Query:  KSQT----------TQKVLERIDNLMITQDLQEENVNKDEDLLSIALGSWDRPGLVRAVGRGITKTQYFHNPTKYSAPLVGEKKVDIEEYDRMAALVKEL
        K++T          T++V  +IDNL++++ +     N   D+LS A+G  D P  +R VG+ +T ++YFH   +     VG+++  +EE  RM A + EL
Subjt:  KSQT----------TQKVLERIDNLMITQDLQEENVNKDEDLLSIALGSWDRPGLVRAVGRGITKTQYFHNPTKYSAPLVGEKKVDIEEYDRMAALVKEL

Query:  QAKLKKHEKSSPKSKHGMPAKKTPKKSPKLKCTTPSKHAPKKSPRSKRTTPSKNDPKKSPQLKRTTPSKNASKKS--SQPKRGTLSKHERDENILDKEET
        +A+L KH             K+ P+ + K + T  SK   + + +S  T+   ND     + ++          +   Q K G     E ++ +    ET
Subjt:  QAKLKKHEKSSPKSKHGMPAKKTPKKSPKLKCTTPSKHAPKKSPRSKRTTPSKNDPKKSPQLKRTTPSKNASKKS--SQPKRGTLSKHERDENILDKEET

Query:  L-EFKEGTHCRLALGSIDNVVAAGTIFESGRNDGNVKVSIDVVVDYDSRLLIPTQGGNDILSQEI-----------------------------------
        L + K+GT CRLA+G+ DNVV AGTIF+   +  NVKVS+D+V D +  + +PT+ G  +LSQE+                                   
Subjt:  L-EFKEGTHCRLALGSIDNVVAAGTIFESGRNDGNVKVSIDVVVDYDSRLLIPTQGGNDILSQEI-----------------------------------

Query:  --------------------DDILSVITMAVRNIQKKPFALKRVLCPKQPNAVECGYYVMRLMRDIVFACNTTILE
                             D++ ++     + +KKP   + + CPKQ   VECGYYVMR MRDI+ + N TI+E
Subjt:  --------------------DDILSVITMAVRNIQKKPFALKRVLCPKQPNAVECGYYVMRLMRDIVFACNTTILE

TYK24391.1 uncharacterized protein E5676_scaffold205G001770 [Cucumis melo var. makuwa]1.2e-3728.12Show/hide
Query:  GETSKMRKKRGPTGLHEITQISSEGHRWVIKYNVKGQPIGHNATKLKTW------------------------NKLHRYID-------------------
        G  +K +  RGPT + EIT++S + H+ V++YN  GQPIG +ATKLK++                        +K++  I+                   
Subjt:  GETSKMRKKRGPTGLHEITQISSEGHRWVIKYNVKGQPIGHNATKLKTW------------------------NKLHRYID-------------------

Query:  -------------------------------AFMQRQ-----------------SETNREKRKKHMYNHCMTRKGYANLAEELKIDASHERPSDRCILWK
                                       +F+ R+                 S   RE+RK + YNH M+RKGYANL EE+K  ++ E   DR ++WK
Subjt:  -------------------------------AFMQRQ-----------------SETNREKRKKHMYNHCMTRKGYANLAEELKIDASHERPSDRCILWK

Query:  KSQT----------TQKVLERIDNLMITQDLQEENVNKDEDLLSIALGSWDRPGLVRAVGRGITKTQYFHNPTKYSAPLVGEKKVDIEEYDRMAALVKEL
        K++T          T++V  +IDNL++++ +     N   D+LS A+G  D P  +R VG+ +T ++YFH   +     VG+++  +EE  RM A + EL
Subjt:  KSQT----------TQKVLERIDNLMITQDLQEENVNKDEDLLSIALGSWDRPGLVRAVGRGITKTQYFHNPTKYSAPLVGEKKVDIEEYDRMAALVKEL

Query:  QAKLKKHEKSSPKSKHGMPAKKTPKKSPKLKCTTPSKHAPKKSPRSKRTTPSKNDPKKSPQLKRTTPSKNASKKS--SQPKRGTLSKHERDENILDKEET
        +A+L KH             K+ P+ + K + T  SK   + + +S  T+   ND     + ++          +   Q K G     E ++ +    ET
Subjt:  QAKLKKHEKSSPKSKHGMPAKKTPKKSPKLKCTTPSKHAPKKSPRSKRTTPSKNDPKKSPQLKRTTPSKNASKKS--SQPKRGTLSKHERDENILDKEET

Query:  L-EFKEGTHCRLALGSIDNVVAAGTIFESGRNDGNVKVSIDVVVDYDSRLLIPTQGGNDILSQEI-----------------------------------
        L + K+GT CRLA+G+ DNVV AGTIF+   +  NVKVS+D+V D +  + +PT+ G  +LSQE+                                   
Subjt:  L-EFKEGTHCRLALGSIDNVVAAGTIFESGRNDGNVKVSIDVVVDYDSRLLIPTQGGNDILSQEI-----------------------------------

Query:  --------------------DDILSVITMAVRNIQKKPFALKRVLCPKQPNAVECGYYVMRLMRDIVFACNTTILE
                             D++ ++     + +KKP   + + CPKQ   VECGYYVMR MRDI+ + N TI+E
Subjt:  --------------------DDILSVITMAVRNIQKKPFALKRVLCPKQPNAVECGYYVMRLMRDIVFACNTTILE

XP_022156813.1 uncharacterized protein LOC111023653 [Momordica charantia]6.5e-4487.25Show/hide
Query:  EVIRCRCLKCGNRIYKDAATSRNQLYEHGIDQSYRVWFWHGEEHASRTYEDRLNGEFNKNHEDDNDLFDVIDMVQIVYDGISHVPKSFENMFDNAKKPLY
        ++IRC CLKCG RI KDAAT RN LYEHGIDQSYRVWFWHGEEHA RT EDRLN EFNKNHEDDNDLFDVIDMVQ VYD ISHVPKSFENMFDNAKKPLY
Subjt:  EVIRCRCLKCGNRIYKDAATSRNQLYEHGIDQSYRVWFWHGEEHASRTYEDRLNGEFNKNHEDDNDLFDVIDMVQIVYDGISHVPKSFENMFDNAKKPLY

Query:  SG
         G
Subjt:  SG

XP_022156873.1 uncharacterized protein LOC111023710 [Momordica charantia]2.4e-7556.77Show/hide
Query:  MTDVGETSKMRKKRGPTGLHEITQISSEGHRWVIKYNVKGQPIGHNATKLKTW-----------------------------------------------
        MTDVGETSK RKKRGPTGLH+IT+ISSEGHR VIKYNVKGQPIGHNATKLKT+                                               
Subjt:  MTDVGETSKMRKKRGPTGLHEITQISSEGHRWVIKYNVKGQPIGHNATKLKTW-----------------------------------------------

Query:  --------------NKL----------------------HRYIDAFM--------QRQSETNREKRKKHMYNHCMTRKGYANLAEELKIDASHERPSDRC
                      NK                       H + + F+        +RQSETNREKRKKH+YNHCM RKGYANLAEELKIDASHERPSDRC
Subjt:  --------------NKL----------------------HRYIDAFM--------QRQSETNREKRKKHMYNHCMTRKGYANLAEELKIDASHERPSDRC

Query:  ILWKKSQT----------TQKVLERIDNLMITQDLQEENVNKDEDLLSIALGSWDRPGLVRAVGRGITKTQYFHNPTKYSAPLVGEKKVDIEEYDRMAAL
        ILWKK++T          TQKV+ERID+LMITQDLQEENV+K  DLLSIALGS DRPGLV+AVG+GITKTQYF+NPTK+S PLVGEKKVDI+EYDRMAAL
Subjt:  ILWKKSQT----------TQKVLERIDNLMITQDLQEENVNKDEDLLSIALGSWDRPGLVRAVGRGITKTQYFHNPTKYSAPLVGEKKVDIEEYDRMAAL

Query:  VKE
        VK+
Subjt:  VKE

TrEMBL top hitse value%identityAlignment
A0A5A7TBV0 Uncharacterized protein2.2e-4232.85Show/hide
Query:  GETSKMRKKRGPTGLHEITQISSEGHRWVIKYNVKGQPIGHNATKLK----------------TWN--------KLHRYIDAFMQR----QSE-------
        G  +K +  RGPTG   IT++S +GH+ V++YN  GQPIG +ATKLK                +WN        K++  I+  +++    Q+E       
Subjt:  GETSKMRKKRGPTGLHEITQISSEGHRWVIKYNVKGQPIGHNATKLK----------------TWN--------KLHRYIDAFMQR----QSE-------

Query:  --------------TNREKRKKHMYNHCMTRKGYANLAEELKIDASHERPSDRCILWKKSQTTQ-----KVLERIDNLMITQDLQEENVNKDEDLLSIAL
                        RE+RK + YNH M+RKGYANL EE+K   S+    DR ++WKK++TT+      +  + DNL++++       N   D+LS A+
Subjt:  --------------TNREKRKKHMYNHCMTRKGYANLAEELKIDASHERPSDRCILWKKSQTTQ-----KVLERIDNLMITQDLQEENVNKDEDLLSIAL

Query:  GSWDRPGLVRAVGRGITKTQYFHNPTKYSAPLVGEKKVDIEEYDRMAALVKELQAKLKKHEKSSPKSKHGMPAKKTPKKSPKLKCTTPSKHAPKKSPRSK
        G  D PG +R VG+ +T ++YFH   +     VG+++   EE  RMAA + EL+ +L KH             K+ P+ + K + T  SK   + + +S 
Subjt:  GSWDRPGLVRAVGRGITKTQYFHNPTKYSAPLVGEKKVDIEEYDRMAALVKELQAKLKKHEKSSPKSKHGMPAKKTPKKSPKLKCTTPSKHAPKKSPRSK

Query:  RTTPSKNDPKKSPQLKRTTPSKNASKKSSQPKRGTLSKHERDENILDKEETL-EFKEGTHCRLALGSIDNVVAAGTIFESGRNDGNVKVSIDVVVDYDSR
         T+   ND     + ++        KK    K G     E ++ +    ETL + K+GT CRLA+G+ DNVV AGTIF+   +  NVKVS+D+V D +  
Subjt:  RTTPSKNDPKKSPQLKRTTPSKNASKKSSQPKRGTLSKHERDENILDKEETL-EFKEGTHCRLALGSIDNVVAAGTIFESGRNDGNVKVSIDVVVDYDSR

Query:  LLIPTQGGNDILSQEIDDILSVITMAVRNIQKKPF---------------ALKRVLCPKQPNAVECGYYVMRLMRDIVFACNTTILE
        + +PT+    +LSQE+   L      V  + KK F                    L  +    VECGYYVM+ MRDI+ + N TI+E
Subjt:  LLIPTQGGNDILSQEIDDILSVITMAVRNIQKKPF---------------ALKRVLCPKQPNAVECGYYVMRLMRDIVFACNTTILE

A0A5A7TF26 ULP_PROTEASE domain-containing protein1.7e-3727.95Show/hide
Query:  GETSKMRKKRGPTGLHEITQISSEGHRWVIKYNVKGQPIGHNATKLKTW------------------------NKLHRYID-------------------
        G  +K +  RGPT + EIT++S + H+ V++YN  GQPIG +ATKLK++                        +K++  I+                   
Subjt:  GETSKMRKKRGPTGLHEITQISSEGHRWVIKYNVKGQPIGHNATKLKTW------------------------NKLHRYID-------------------

Query:  -------------------------------AFMQRQ-----------------SETNREKRKKHMYNHCMTRKGYANLAEELKIDASHERPSDRCILWK
                                       +F+ R+                 S   RE+RK + YNH M+RKGYANL EE+K  ++ E   +R ++WK
Subjt:  -------------------------------AFMQRQ-----------------SETNREKRKKHMYNHCMTRKGYANLAEELKIDASHERPSDRCILWK

Query:  KSQT----------TQKVLERIDNLMITQDLQEENVNKDEDLLSIALGSWDRPGLVRAVGRGITKTQYFHNPTKYSAPLVGEKKVDIEEYDRMAALVKEL
        K++T          T++V  +IDNL++++ +     N   D+LS A+G  D P  +R VG+ +T ++YFH   +     VG+++  +EE  RM A + EL
Subjt:  KSQT----------TQKVLERIDNLMITQDLQEENVNKDEDLLSIALGSWDRPGLVRAVGRGITKTQYFHNPTKYSAPLVGEKKVDIEEYDRMAALVKEL

Query:  QAKLKKHEKSSPKSKHGMPAKKTPKKSPKLKCTTPSKHAPKKSPRSKRTTPSKNDPKKSPQLKRTTPSKNASKKS--SQPKRGTLSKHERDENILDKEET
        +A+L KH             K+ P+ + K + T  SK   + + +S  T+   ND     + ++          +   Q K G     E ++ +    ET
Subjt:  QAKLKKHEKSSPKSKHGMPAKKTPKKSPKLKCTTPSKHAPKKSPRSKRTTPSKNDPKKSPQLKRTTPSKNASKKS--SQPKRGTLSKHERDENILDKEET

Query:  L-EFKEGTHCRLALGSIDNVVAAGTIFESGRNDGNVKVSIDVVVDYDSRLLIPTQGGNDILSQEI-----------------------------------
        L + K+GT CRLA+G+ DNVV AGTIF+   +  NVKVS+D+V D +  + +PT+ G  +LSQE+                                   
Subjt:  L-EFKEGTHCRLALGSIDNVVAAGTIFESGRNDGNVKVSIDVVVDYDSRLLIPTQGGNDILSQEI-----------------------------------

Query:  --------------------DDILSVITMAVRNIQKKPFALKRVLCPKQPNAVECGYYVMRLMRDIVFACNTTILE
                             D++ ++     + +KKP   + + CPKQ   VECGYYVMR MRDI+ + N TI+E
Subjt:  --------------------DDILSVITMAVRNIQKKPFALKRVLCPKQPNAVECGYYVMRLMRDIVFACNTTILE

A0A5D3DL96 ULP_PROTEASE domain-containing protein5.7e-3828.12Show/hide
Query:  GETSKMRKKRGPTGLHEITQISSEGHRWVIKYNVKGQPIGHNATKLKTW------------------------NKLHRYID-------------------
        G  +K +  RGPT + EIT++S + H+ V++YN  GQPIG +ATKLK++                        +K++  I+                   
Subjt:  GETSKMRKKRGPTGLHEITQISSEGHRWVIKYNVKGQPIGHNATKLKTW------------------------NKLHRYID-------------------

Query:  -------------------------------AFMQRQ-----------------SETNREKRKKHMYNHCMTRKGYANLAEELKIDASHERPSDRCILWK
                                       +F+ R+                 S   RE+RK + YNH M+RKGYANL EE+K  ++ E   DR ++WK
Subjt:  -------------------------------AFMQRQ-----------------SETNREKRKKHMYNHCMTRKGYANLAEELKIDASHERPSDRCILWK

Query:  KSQT----------TQKVLERIDNLMITQDLQEENVNKDEDLLSIALGSWDRPGLVRAVGRGITKTQYFHNPTKYSAPLVGEKKVDIEEYDRMAALVKEL
        K++T          T++V  +IDNL++++ +     N   D+LS A+G  D P  +R VG+ +T ++YFH   +     VG+++  +EE  RM A + EL
Subjt:  KSQT----------TQKVLERIDNLMITQDLQEENVNKDEDLLSIALGSWDRPGLVRAVGRGITKTQYFHNPTKYSAPLVGEKKVDIEEYDRMAALVKEL

Query:  QAKLKKHEKSSPKSKHGMPAKKTPKKSPKLKCTTPSKHAPKKSPRSKRTTPSKNDPKKSPQLKRTTPSKNASKKS--SQPKRGTLSKHERDENILDKEET
        +A+L KH             K+ P+ + K + T  SK   + + +S  T+   ND     + ++          +   Q K G     E ++ +    ET
Subjt:  QAKLKKHEKSSPKSKHGMPAKKTPKKSPKLKCTTPSKHAPKKSPRSKRTTPSKNDPKKSPQLKRTTPSKNASKKS--SQPKRGTLSKHERDENILDKEET

Query:  L-EFKEGTHCRLALGSIDNVVAAGTIFESGRNDGNVKVSIDVVVDYDSRLLIPTQGGNDILSQEI-----------------------------------
        L + K+GT CRLA+G+ DNVV AGTIF+   +  NVKVS+D+V D +  + +PT+ G  +LSQE+                                   
Subjt:  L-EFKEGTHCRLALGSIDNVVAAGTIFESGRNDGNVKVSIDVVVDYDSRLLIPTQGGNDILSQEI-----------------------------------

Query:  --------------------DDILSVITMAVRNIQKKPFALKRVLCPKQPNAVECGYYVMRLMRDIVFACNTTILE
                             D++ ++     + +KKP   + + CPKQ   VECGYYVMR MRDI+ + N TI+E
Subjt:  --------------------DDILSVITMAVRNIQKKPFALKRVLCPKQPNAVECGYYVMRLMRDIVFACNTTILE

A0A6J1DRM7 uncharacterized protein LOC1110236533.1e-4487.25Show/hide
Query:  EVIRCRCLKCGNRIYKDAATSRNQLYEHGIDQSYRVWFWHGEEHASRTYEDRLNGEFNKNHEDDNDLFDVIDMVQIVYDGISHVPKSFENMFDNAKKPLY
        ++IRC CLKCG RI KDAAT RN LYEHGIDQSYRVWFWHGEEHA RT EDRLN EFNKNHEDDNDLFDVIDMVQ VYD ISHVPKSFENMFDNAKKPLY
Subjt:  EVIRCRCLKCGNRIYKDAATSRNQLYEHGIDQSYRVWFWHGEEHASRTYEDRLNGEFNKNHEDDNDLFDVIDMVQIVYDGISHVPKSFENMFDNAKKPLY

Query:  SG
         G
Subjt:  SG

A0A6J1DRS8 uncharacterized protein LOC1110237101.2e-7556.77Show/hide
Query:  MTDVGETSKMRKKRGPTGLHEITQISSEGHRWVIKYNVKGQPIGHNATKLKTW-----------------------------------------------
        MTDVGETSK RKKRGPTGLH+IT+ISSEGHR VIKYNVKGQPIGHNATKLKT+                                               
Subjt:  MTDVGETSKMRKKRGPTGLHEITQISSEGHRWVIKYNVKGQPIGHNATKLKTW-----------------------------------------------

Query:  --------------NKL----------------------HRYIDAFM--------QRQSETNREKRKKHMYNHCMTRKGYANLAEELKIDASHERPSDRC
                      NK                       H + + F+        +RQSETNREKRKKH+YNHCM RKGYANLAEELKIDASHERPSDRC
Subjt:  --------------NKL----------------------HRYIDAFM--------QRQSETNREKRKKHMYNHCMTRKGYANLAEELKIDASHERPSDRC

Query:  ILWKKSQT----------TQKVLERIDNLMITQDLQEENVNKDEDLLSIALGSWDRPGLVRAVGRGITKTQYFHNPTKYSAPLVGEKKVDIEEYDRMAAL
        ILWKK++T          TQKV+ERID+LMITQDLQEENV+K  DLLSIALGS DRPGLV+AVG+GITKTQYF+NPTK+S PLVGEKKVDI+EYDRMAAL
Subjt:  ILWKKSQT----------TQKVLERIDNLMITQDLQEENVNKDEDLLSIALGSWDRPGLVRAVGRGITKTQYFHNPTKYSAPLVGEKKVDIEEYDRMAAL

Query:  VKE
        VK+
Subjt:  VKE

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTGGGTCACCGCCAAACGTCCTAAAACACTGCTGGAACGCACTAAACCTGCCGCTGGTTTTCGTAGCAGCAGGTCGTCGAGGTTCGTCGCGGATCTGATCTGT
TCGTGGTGCACCGCCGCGAGGAACGCTGCAGGAACGGTGCTGGCAGATCTGGGTCGTTCGCCGGAGAACACGTCGCCGGAGGAAATCGCGCCGCCGCTGCTGTGG
ATCGCCGGGACGTCGGGTCGAAGTTGCGCCGCCGCTGGATGGGTCGCCGTCCGTCGCTGCTGCTGTTACGGAGGTGGTCGCTGCCGCTTCCGTTGCCGCCGTGAG
GTTATACGTTGTCGCTGCTTGAAATGTGGAAACCGTATATATAAGGATGCTGCAACCAGTAGAAATCAGTTGTACGAACACGGTATTGATCAGAGCTATAGGGTA
TGGTTCTGGCATGGTGAAGAACATGCATCTAGAACATATGAGGATAGGTTGAATGGTGAGTTCAATAAGAATCACGAAGATGATAATGATTTGTTCGATGTGATT
GACATGGTTCAGATTGTTTATGATGGAATTTCACATGTACCGAAATCGTTTGAAAATATGTTTGATAATGCTAAGAAACCATTATACTCTGGATATGCAGTTGGA
CTTGGCTCGCAAAACAATGTAGAAGAGAGTCTCCGTGATAGGCCTTCATCAGCTGGATCATACGTAAATCCGAGTGTTGAACATCTAAAGCAATCTCATATTTAT
GTACTGGAGAAAATTGAAGAAGTGGAACCATATCAAAGACAACACATGAAACACTTGAAAGAGGAAAATCCAAATAGGTCAAATAATATGAAGTGGCTTCAGAAT
GAACACACCAGAAGCTTCGGCAACTGGATACGTGATAAGTGCAATTTCTACATAACTAAAAAGAATTTGTACCAGAACGACAAAATGACAGATGTAGGCGAAACC
AGCAAGATGAGGAAGAAGCGTGGACCTACGGGGTTGCACGAGATTACACAAATTAGCAGCGAAGGTCACCGGTGGGTTATTAAGTATAACGTGAAAGGGCAACCG
ATTGGACATAATGCTACGAAATTGAAGACTTGGAATAAGTTACACCGATACATTGACGCCTTTATGCAGAGACAAAGTGAGACGAACAGAGAAAAGCGTAAAAAG
CACATGTATAACCATTGCATGACGCGGAAGGGTTATGCAAACCTTGCTGAGGAATTGAAAATTGATGCATCACATGAGAGACCATCAGATCGTTGCATCTTATGG
AAGAAGTCTCAAACGACACAGAAAGTCCTAGAGCGCATAGATAATCTCATGATAACGCAAGACTTGCAAGAAGAGAATGTGAACAAAGATGAGGATTTGTTATCC
ATCGCACTGGGCAGTTGGGATCGGCCCGGACTTGTTAGAGCAGTAGGTCGAGGCATAACAAAAACCCAGTACTTCCATAATCCAACCAAATACTCAGCGCCCCTA
GTAGGAGAGAAAAAGGTGGACATCGAAGAGTATGATCGAATGGCAGCTCTTGTGAAAGAACTTCAAGCGAAACTGAAGAAGCATGAAAAAAGTTCTCCAAAATCA
AAACATGGCATGCCAGCTAAGAAGACTCCAAAAAAATCTCCTAAACTGAAGTGTACCACACCATCTAAACATGCTCCAAAGAAATCTCCTCGGTCGAAGCGTACC
ACACCATCAAAAAATGATCCAAAGAAATCTCCTCAGTTAAAGCGTACCACACCATCTAAAAACGCTTCAAAAAAATCTTCGCAACCAAAGCGTGGTACTTTATCG
AAGCATGAACGAGATGAAAATATATTGGACAAGGAAGAAACTTTGGAGTTCAAGGAGGGAACTCATTGTCGTCTGGCACTTGGGTCCATCGATAATGTTGTCGCT
GCGGGCACTATATTTGAATCTGGGAGGAATGATGGAAACGTGAAAGTGTCCATAGACGTGGTGGTTGATTACGACTCTCGACTTCTAATTCCGACACAAGGAGGA
AATGATATTCTCTCGCAAGAAATAGATGATATTCTTAGTGTCATCACCATGGCTGTAAGAAATATACAGAAAAAACCATTTGCTCTGAAGCGTGTACTGTGCCCA
AAACAACCGAATGCAGTAGAATGTGGATACTATGTCATGCGGTTAATGCGTGATATAGTCTTCGCTTGTAACACAACAATCCTAGAATGCGTGCGTATTGTGATT
GTTAGCTCTCTGGCACTACATAAAGGCTATGAAAGTGTGTTGATTCCAAAGCATGTGGGAGCGAAGGTTGAAGGAAGCAGGGTCGAGCGTTTCTGTAATATGTTA
GCTACATTTTGTAGTTTGACCTTTCACGTTTACACGGTCTTCCAAGAAGAGCCTGGAGAACCATGA
mRNA sequenceShow/hide mRNA sequence
ATGTGGGTCACCGCCAAACGTCCTAAAACACTGCTGGAACGCACTAAACCTGCCGCTGGTTTTCGTAGCAGCAGGTCGTCGAGGTTCGTCGCGGATCTGATCTGT
TCGTGGTGCACCGCCGCGAGGAACGCTGCAGGAACGGTGCTGGCAGATCTGGGTCGTTCGCCGGAGAACACGTCGCCGGAGGAAATCGCGCCGCCGCTGCTGTGG
ATCGCCGGGACGTCGGGTCGAAGTTGCGCCGCCGCTGGATGGGTCGCCGTCCGTCGCTGCTGCTGTTACGGAGGTGGTCGCTGCCGCTTCCGTTGCCGCCGTGAG
GTTATACGTTGTCGCTGCTTGAAATGTGGAAACCGTATATATAAGGATGCTGCAACCAGTAGAAATCAGTTGTACGAACACGGTATTGATCAGAGCTATAGGGTA
TGGTTCTGGCATGGTGAAGAACATGCATCTAGAACATATGAGGATAGGTTGAATGGTGAGTTCAATAAGAATCACGAAGATGATAATGATTTGTTCGATGTGATT
GACATGGTTCAGATTGTTTATGATGGAATTTCACATGTACCGAAATCGTTTGAAAATATGTTTGATAATGCTAAGAAACCATTATACTCTGGATATGCAGTTGGA
CTTGGCTCGCAAAACAATGTAGAAGAGAGTCTCCGTGATAGGCCTTCATCAGCTGGATCATACGTAAATCCGAGTGTTGAACATCTAAAGCAATCTCATATTTAT
GTACTGGAGAAAATTGAAGAAGTGGAACCATATCAAAGACAACACATGAAACACTTGAAAGAGGAAAATCCAAATAGGTCAAATAATATGAAGTGGCTTCAGAAT
GAACACACCAGAAGCTTCGGCAACTGGATACGTGATAAGTGCAATTTCTACATAACTAAAAAGAATTTGTACCAGAACGACAAAATGACAGATGTAGGCGAAACC
AGCAAGATGAGGAAGAAGCGTGGACCTACGGGGTTGCACGAGATTACACAAATTAGCAGCGAAGGTCACCGGTGGGTTATTAAGTATAACGTGAAAGGGCAACCG
ATTGGACATAATGCTACGAAATTGAAGACTTGGAATAAGTTACACCGATACATTGACGCCTTTATGCAGAGACAAAGTGAGACGAACAGAGAAAAGCGTAAAAAG
CACATGTATAACCATTGCATGACGCGGAAGGGTTATGCAAACCTTGCTGAGGAATTGAAAATTGATGCATCACATGAGAGACCATCAGATCGTTGCATCTTATGG
AAGAAGTCTCAAACGACACAGAAAGTCCTAGAGCGCATAGATAATCTCATGATAACGCAAGACTTGCAAGAAGAGAATGTGAACAAAGATGAGGATTTGTTATCC
ATCGCACTGGGCAGTTGGGATCGGCCCGGACTTGTTAGAGCAGTAGGTCGAGGCATAACAAAAACCCAGTACTTCCATAATCCAACCAAATACTCAGCGCCCCTA
GTAGGAGAGAAAAAGGTGGACATCGAAGAGTATGATCGAATGGCAGCTCTTGTGAAAGAACTTCAAGCGAAACTGAAGAAGCATGAAAAAAGTTCTCCAAAATCA
AAACATGGCATGCCAGCTAAGAAGACTCCAAAAAAATCTCCTAAACTGAAGTGTACCACACCATCTAAACATGCTCCAAAGAAATCTCCTCGGTCGAAGCGTACC
ACACCATCAAAAAATGATCCAAAGAAATCTCCTCAGTTAAAGCGTACCACACCATCTAAAAACGCTTCAAAAAAATCTTCGCAACCAAAGCGTGGTACTTTATCG
AAGCATGAACGAGATGAAAATATATTGGACAAGGAAGAAACTTTGGAGTTCAAGGAGGGAACTCATTGTCGTCTGGCACTTGGGTCCATCGATAATGTTGTCGCT
GCGGGCACTATATTTGAATCTGGGAGGAATGATGGAAACGTGAAAGTGTCCATAGACGTGGTGGTTGATTACGACTCTCGACTTCTAATTCCGACACAAGGAGGA
AATGATATTCTCTCGCAAGAAATAGATGATATTCTTAGTGTCATCACCATGGCTGTAAGAAATATACAGAAAAAACCATTTGCTCTGAAGCGTGTACTGTGCCCA
AAACAACCGAATGCAGTAGAATGTGGATACTATGTCATGCGGTTAATGCGTGATATAGTCTTCGCTTGTAACACAACAATCCTAGAATGCGTGCGTATTGTGATT
GTTAGCTCTCTGGCACTACATAAAGGCTATGAAAGTGTGTTGATTCCAAAGCATGTGGGAGCGAAGGTTGAAGGAAGCAGGGTCGAGCGTTTCTGTAATATGTTA
GCTACATTTTGTAGTTTGACCTTTCACGTTTACACGGTCTTCCAAGAAGAGCCTGGAGAACCATGA
Protein sequenceShow/hide protein sequence
MWVTAKRPKTLLERTKPAAGFRSSRSSRFVADLICSWCTAARNAAGTVLADLGRSPENTSPEEIAPPLLWIAGTSGRSCAAAGWVAVRRCCCYGGGRCRFRCRRE
VIRCRCLKCGNRIYKDAATSRNQLYEHGIDQSYRVWFWHGEEHASRTYEDRLNGEFNKNHEDDNDLFDVIDMVQIVYDGISHVPKSFENMFDNAKKPLYSGYAVG
LGSQNNVEESLRDRPSSAGSYVNPSVEHLKQSHIYVLEKIEEVEPYQRQHMKHLKEENPNRSNNMKWLQNEHTRSFGNWIRDKCNFYITKKNLYQNDKMTDVGET
SKMRKKRGPTGLHEITQISSEGHRWVIKYNVKGQPIGHNATKLKTWNKLHRYIDAFMQRQSETNREKRKKHMYNHCMTRKGYANLAEELKIDASHERPSDRCILW
KKSQTTQKVLERIDNLMITQDLQEENVNKDEDLLSIALGSWDRPGLVRAVGRGITKTQYFHNPTKYSAPLVGEKKVDIEEYDRMAALVKELQAKLKKHEKSSPKS
KHGMPAKKTPKKSPKLKCTTPSKHAPKKSPRSKRTTPSKNDPKKSPQLKRTTPSKNASKKSSQPKRGTLSKHERDENILDKEETLEFKEGTHCRLALGSIDNVVA
AGTIFESGRNDGNVKVSIDVVVDYDSRLLIPTQGGNDILSQEIDDILSVITMAVRNIQKKPFALKRVLCPKQPNAVECGYYVMRLMRDIVFACNTTILECVRIVI
VSSLALHKGYESVLIPKHVGAKVEGSRVERFCNMLATFCSLTFHVYTVFQEEPGEP