CuGenDBv2

Moc08g32300 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc08g32300
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: Ulp1-like peptidase
Genome location: chr8:23379980..23383490
RNA-Seq Expression: Moc08g32300
Synteny: Moc08g32300
Gene Ontology terms:
  GO:0006508 - proteolysis (biological process)
  GO:0008234 - cysteine-type peptidase activity (molecular function)
InterPro domains:
  IPR003653 - Ulp1 protease family, C-terminal catalytic domain
  IPR038765 - Papain-like cysteine peptidase superfamily
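The genome location field above uses a `chromosome:start..end` notation. As a minimal sketch of how such a string could be parsed, the snippet below splits it into its parts and computes the span length. The names `Locus` and `parse_locus` are hypothetical helpers introduced for illustration, and the length calculation assumes 1-based, inclusive coordinates, which this record does not state explicitly.

```python
import re
from typing import NamedTuple


class Locus(NamedTuple):
    """A genomic interval (hypothetical helper type, not part of CuGenDB)."""
    chrom: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumes 1-based, inclusive coordinates (an assumption, see lead-in).
        return self.end - self.start + 1


def parse_locus(text: str) -> Locus:
    """Parse a 'chrom:start..end' string into a Locus."""
    match = re.fullmatch(r"([^:\s]+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognised locus string: {text!r}")
    chrom, start, end = match.group(1), int(match.group(2)), int(match.group(3))
    if start > end:
        raise ValueError(f"start is greater than end in {text!r}")
    return Locus(chrom, start, end)


locus = parse_locus("chr8:23379980..23383490")
print(locus.chrom, locus.start, locus.end, locus.length)
```

Under the inclusive-coordinate assumption, the interval listed for Moc08g32300 spans 3511 bp.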


Homology
GenBank top hits
XP_022146372.1 uncharacterized protein LOC111015600 [Momordica charantia] | e-value: 8.1e-114 | identity: 69.25%
Query:  MNLRLIIDRNDWFPATLTDLAHVDKTTTRIKARLTPTQLDMFRQTSFGPILDIDVVFNGPLIHHLLLREVEEPRQDVISFDLFGKRVSFGKREFDLITGI
        M+LRLI+DRNDWFPATLT+LAHVDKTTTRIKARLTPTQLDMFRQT FGPILD+ VVFNGPLIHHLLL EVEEPRQDVISFDLF KRVSFGKREFDLITG+
Subjt:  MNLRLIIDRNDWFPATLTDLAHVDKTTTRIKARLTPTQLDMFRQTSFGPILDIDVVFNGPLIHHLLLREVEEPRQDVISFDLFGKRVSFGKREFDLITGI

Query:  SHRMNRVDNHIPGRRFRARYFKDS-------------------------VGIVYFIELAMMGKERKQFIDTALLGVMDRWEAFCNYDWSSMIFYRTIWSL
        SH+MNRV+NHIPGRR RARYFKDS                         VGIVYFIELAMMGKERKQFIDT  +GV+DRWEAFCN DWSSMIF RTIWSL
Subjt:  SHRMNRVDNHIPGRRFRARYFKDS-------------------------VGIVYFIELAMMGKERKQFIDTALLGVMDRWEAFCNYDWSSMIFYRTIWSL

Query:  KNAL---------KARANPSHVETYSLYGFPYAFQVWAYETISTLSLQVATRLSDDTIPRLLKWSCTYSCGFRVLASEVFDNTRSKVKEHLLATDVEEQH
        KN L         KA A+P+HVETYSLYGFPY                           R+ +         RVLASEVFDNT SKVKEHLLATD EEQH
Subjt:  KNAL---------KARANPSHVETYSLYGFPYAFQVWAYETISTLSLQVATRLSDDTIPRLLKWSCTYSCGFRVLASEVFDNTRSKVKEHLLATDVEEQH

Query:  MVRVILPLEARVIPDLPVVPDRVVVPDRAVVPDLP
        MVRVILP E RVIPD P VPDR VVPDRAVVPD P
Subjt:  MVRVILPLEARVIPDLPVVPDRVVVPDRAVVPDLP

XP_022153201.1 uncharacterized protein LOC111020757 [Momordica charantia] | e-value: 2.8e-215 | identity: 80.52%
Query:  MNLRLIIDRNDWFPATLTDLAHVDKTTTRIKARLTPTQLDMFRQTSFGPILDIDVVFNGPLIHHLLLREVEEPRQDVISFDLFGKRVSFGKREFDLITGI
        M+LRLIIDRNDWFPATLT+LAH+DKT+TRIKARLTPTQLDMFRQT FGPILDIDVVFNGPLIHHLLLREVEEPRQDVISFDLFGKRVSFGKREFDLITG+
Subjt:  MNLRLIIDRNDWFPATLTDLAHVDKTTTRIKARLTPTQLDMFRQTSFGPILDIDVVFNGPLIHHLLLREVEEPRQDVISFDLFGKRVSFGKREFDLITGI

Query:  SHRMNRVDNHIPGRRFRARYFKD-------------------------SVGIVYFIELAMMGKERKQFIDTALLGVMDRWEAFCNYDWSSMIFYRTIWSL
        SHRMNRVDNHIPGRR RARYFKD                          V IVYFIELAMMGKERKQFIDTALLGV+DRWE FCNYDWSSMIF RTIWSL
Subjt:  SHRMNRVDNHIPGRRFRARYFKD-------------------------SVGIVYFIELAMMGKERKQFIDTALLGVMDRWEAFCNYDWSSMIFYRTIWSL

Query:  KNAL---------KARANPSHVETYSLYGFPYAFQVWAYETISTLSLQVATRLSDDTIPRLLKWSCTYSCGFRVLASEVFDNTRSKVKEHLLATDVEEQH
        KNAL         KA A+PSHVETYSLYGFPYAFQVWAYETIST        LSDD IPRLL+WSC YSCGFRVL SEVFDNTRSKVKEHLLATD +EQH
Subjt:  KNAL---------KARANPSHVETYSLYGFPYAFQVWAYETISTLSLQVATRLSDDTIPRLLKWSCTYSCGFRVLASEVFDNTRSKVKEHLLATDVEEQH

Query:  MVRVILPLEARVIPDLPVVPDRVVVPDRAVVPDLPISPDRATVPDPPADVEMGPLEDPVVEAHAVDEAGPSANDGEGLEKRWKKNKFKKRISRQLKRLDN
        MVRVILP E RVIPD P       VPDRAVVPD P SP+RA VPDPPADVEMGPLEDPVV+AHAVDEA PSANDGEGLEKR KKNKFKKRISR+LKRLDN
Subjt:  MVRVILPLEARVIPDLPVVPDRVVVPDRAVVPDLPISPDRATVPDPPADVEMGPLEDPVVEAHAVDEAGPSANDGEGLEKRWKKNKFKKRISRQLKRLDN

Query:  YVGAIEDILGDFGVALKGIQRYLKKLAKGKFLDPSKYFGGGGGPHDDGPSDGRPDESPKQDGDWKSMDEDQRPDEDQRTDEDLATEKEPKSGHGPNSI
         VGAIED LGDFGVALKGIQ YLKKLAKGKF D SKYFGGGGGP DDGPSD RPDESPK DG  KSMDEDQR DEDQRTDEDL TEKEP SGHG +++
Subjt:  YVGAIEDILGDFGVALKGIQRYLKKLAKGKFLDPSKYFGGGGGPHDDGPSDGRPDESPKQDGDWKSMDEDQRPDEDQRTDEDLATEKEPKSGHGPNSI

XP_022155158.1 uncharacterized protein LOC111022300 [Momordica charantia] | e-value: 4.5e-80 | identity: 73.53%
Query:  MNLRLIIDRNDWFPATLTDLAHVDKTTTRIKARLTPTQLDMFRQTSFGPILDIDVVFNGPLIHHLLLREVEEPRQDVISFDLFGKRVSFGKREFDLITGI
        M+LRLI++R+DWFP TLT+LAH DKTT+R+K RLTPTQ+DMFRQT FGPILD+DVVFNGPLIHHLLLREVEEPRQD+ISFDLFGKRVSFGKREFDLITG+
Subjt:  MNLRLIIDRNDWFPATLTDLAHVDKTTTRIKARLTPTQLDMFRQTSFGPILDIDVVFNGPLIHHLLLREVEEPRQDVISFDLFGKRVSFGKREFDLITGI

Query:  SHRMNRVDNHIPGRRFRARYFKDS-------------------------VGIVYFIELAMMGKERKQFIDTALLGVMDRWEAFCNYDWSSMIFYRTIWSL
        S+RM RVDN IPGRR RARYFKDS                         VGIVYF+ELAMMGKERKQFID  LLGV+DRWE FCN+DWSS+IF RT+WSL
Subjt:  SHRMNRVDNHIPGRRFRARYFKDS-------------------------VGIVYFIELAMMGKERKQFIDTALLGVMDRWEAFCNYDWSSMIFYRTIWSL

Query:  KNAL
        KNA+
Subjt:  KNAL

XP_022155476.1 uncharacterized protein LOC111022607 [Momordica charantia] | e-value: 1.2e-101 | identity: 73.06%
Query:  SGHGPNSIDEDPKRREDDPMITDEDDGMITDGDEDPNQDITIGRPPDGSEVDHADDHGPRVAVIQ-----------------------------------
        SGHGPNS+DEDPKRR++DPMI +EDDGMITDGDEDPNQDITIGR PDGSEVDH DDH P+VAVIQ                                   
Subjt:  SGHGPNSIDEDPKRREDDPMITDEDDGMITDGDEDPNQDITIGRPPDGSEVDHADDHGPRVAVIQ-----------------------------------

Query:  -VEPYLDQDETDLQHAPTGWRLRKRHYSWKLKGIYTPTGRRRITVDAYDPACPIPPQLD--------------------AGLQGKEWYRDLLDPTVQLKD
         VEPYLDQDETDLQHAPTG  LRK HYSWKLKGIYTPTGRRRITVDAYDPACPIPPQLD                    AGLQGKEWYRDLLDPTVQLKD
Subjt:  -VEPYLDQDETDLQHAPTGWRLRKRHYSWKLKGIYTPTGRRRITVDAYDPACPIPPQLD--------------------AGLQGKEWYRDLLDPTVQLKD

Query:  EVVDVLVLFTAKKLEKCLHLCRKKFAIGDVLFSTLLNRTDGPYAAMKPGVLSTRIEYPWSQENTIFRYVFG
        EVVD LVLFTAKKLEKC++LCRKKFAIGDVL STLLNRTDGPYAAMKPGVLSTRIEYP SQENTIFRYVFG
Subjt:  EVVDVLVLFTAKKLEKCLHLCRKKFAIGDVLFSTLLNRTDGPYAAMKPGVLSTRIEYPWSQENTIFRYVFG

XP_022157020.1 uncharacterized protein LOC111023847 [Momordica charantia] | e-value: 3.5e-101 | identity: 63.97%
Query:  MNLRLIIDRNDWFPATLTDLAHVDKTTTRIKARLTPTQLDMFRQTSFGPILDIDVVFNGPLIHHLLLREVEEPRQDVISFDLFGKRVSFGKREFDLITGI
        MN+ L I+++DWFPA L++LAHV KT++R+KARLTP+QLDMF QT FGPIL ++VVFNGPL+HHLLLREVEEP+ D+ISF+LFG RVSFGKREFDLITG+
Subjt:  MNLRLIIDRNDWFPATLTDLAHVDKTTTRIKARLTPTQLDMFRQTSFGPILDIDVVFNGPLIHHLLLREVEEPRQDVISFDLFGKRVSFGKREFDLITGI

Query:  SHRMNRVDNHIPGRRFRARYFKD-------------------------SVGIVYFIELAMMGKERKQFIDTALLGVMDRWEAFCNYDWSSMIFYRTIWSL
         H MNRVD  +  RR R  YF+D                          + IVYFIELAMMGKERK  +DT+LLG++DRWE FCNYDWSSMIF RT+WSL
Subjt:  SHRMNRVDNHIPGRRFRARYFKD-------------------------SVGIVYFIELAMMGKERKQFIDTALLGVMDRWEAFCNYDWSSMIFYRTIWSL

Query:  KNALKARA---------NPSHVETYSLYGFPYAFQVWAYETISTLSLQVATRLSDDTIPRLLKWSCTYSCGFRVLASEVFDNTRSKVKEHLLATDVE
        KNALK +          + SHVETYSLY FPYAFQVWAYETISTLS +VA RL+DD IPRLL+WSCTYS  F VL  EVF+N +SKV   L ATDVE
Subjt:  KNALKARA---------NPSHVETYSLYGFPYAFQVWAYETISTLSLQVATRLSDDTIPRLLKWSCTYSCGFRVLASEVFDNTRSKVKEHLLATDVE

TrEMBL top hits
A0A6J1CZE8 uncharacterized protein LOC111015600 | e-value: 3.9e-114 | identity: 69.25%
Query:  MNLRLIIDRNDWFPATLTDLAHVDKTTTRIKARLTPTQLDMFRQTSFGPILDIDVVFNGPLIHHLLLREVEEPRQDVISFDLFGKRVSFGKREFDLITGI
        M+LRLI+DRNDWFPATLT+LAHVDKTTTRIKARLTPTQLDMFRQT FGPILD+ VVFNGPLIHHLLL EVEEPRQDVISFDLF KRVSFGKREFDLITG+
Subjt:  MNLRLIIDRNDWFPATLTDLAHVDKTTTRIKARLTPTQLDMFRQTSFGPILDIDVVFNGPLIHHLLLREVEEPRQDVISFDLFGKRVSFGKREFDLITGI

Query:  SHRMNRVDNHIPGRRFRARYFKDS-------------------------VGIVYFIELAMMGKERKQFIDTALLGVMDRWEAFCNYDWSSMIFYRTIWSL
        SH+MNRV+NHIPGRR RARYFKDS                         VGIVYFIELAMMGKERKQFIDT  +GV+DRWEAFCN DWSSMIF RTIWSL
Subjt:  SHRMNRVDNHIPGRRFRARYFKDS-------------------------VGIVYFIELAMMGKERKQFIDTALLGVMDRWEAFCNYDWSSMIFYRTIWSL

Query:  KNAL---------KARANPSHVETYSLYGFPYAFQVWAYETISTLSLQVATRLSDDTIPRLLKWSCTYSCGFRVLASEVFDNTRSKVKEHLLATDVEEQH
        KN L         KA A+P+HVETYSLYGFPY                           R+ +         RVLASEVFDNT SKVKEHLLATD EEQH
Subjt:  KNAL---------KARANPSHVETYSLYGFPYAFQVWAYETISTLSLQVATRLSDDTIPRLLKWSCTYSCGFRVLASEVFDNTRSKVKEHLLATDVEEQH

Query:  MVRVILPLEARVIPDLPVVPDRVVVPDRAVVPDLP
        MVRVILP E RVIPD P VPDR VVPDRAVVPD P
Subjt:  MVRVILPLEARVIPDLPVVPDRVVVPDRAVVPDLP

A0A6J1DJX9 uncharacterized protein LOC111020757 | e-value: 1.4e-215 | identity: 80.52%
Query:  MNLRLIIDRNDWFPATLTDLAHVDKTTTRIKARLTPTQLDMFRQTSFGPILDIDVVFNGPLIHHLLLREVEEPRQDVISFDLFGKRVSFGKREFDLITGI
        M+LRLIIDRNDWFPATLT+LAH+DKT+TRIKARLTPTQLDMFRQT FGPILDIDVVFNGPLIHHLLLREVEEPRQDVISFDLFGKRVSFGKREFDLITG+
Subjt:  MNLRLIIDRNDWFPATLTDLAHVDKTTTRIKARLTPTQLDMFRQTSFGPILDIDVVFNGPLIHHLLLREVEEPRQDVISFDLFGKRVSFGKREFDLITGI

Query:  SHRMNRVDNHIPGRRFRARYFKD-------------------------SVGIVYFIELAMMGKERKQFIDTALLGVMDRWEAFCNYDWSSMIFYRTIWSL
        SHRMNRVDNHIPGRR RARYFKD                          V IVYFIELAMMGKERKQFIDTALLGV+DRWE FCNYDWSSMIF RTIWSL
Subjt:  SHRMNRVDNHIPGRRFRARYFKD-------------------------SVGIVYFIELAMMGKERKQFIDTALLGVMDRWEAFCNYDWSSMIFYRTIWSL

Query:  KNAL---------KARANPSHVETYSLYGFPYAFQVWAYETISTLSLQVATRLSDDTIPRLLKWSCTYSCGFRVLASEVFDNTRSKVKEHLLATDVEEQH
        KNAL         KA A+PSHVETYSLYGFPYAFQVWAYETIST        LSDD IPRLL+WSC YSCGFRVL SEVFDNTRSKVKEHLLATD +EQH
Subjt:  KNAL---------KARANPSHVETYSLYGFPYAFQVWAYETISTLSLQVATRLSDDTIPRLLKWSCTYSCGFRVLASEVFDNTRSKVKEHLLATDVEEQH

Query:  MVRVILPLEARVIPDLPVVPDRVVVPDRAVVPDLPISPDRATVPDPPADVEMGPLEDPVVEAHAVDEAGPSANDGEGLEKRWKKNKFKKRISRQLKRLDN
        MVRVILP E RVIPD P       VPDRAVVPD P SP+RA VPDPPADVEMGPLEDPVV+AHAVDEA PSANDGEGLEKR KKNKFKKRISR+LKRLDN
Subjt:  MVRVILPLEARVIPDLPVVPDRVVVPDRAVVPDLPISPDRATVPDPPADVEMGPLEDPVVEAHAVDEAGPSANDGEGLEKRWKKNKFKKRISRQLKRLDN

Query:  YVGAIEDILGDFGVALKGIQRYLKKLAKGKFLDPSKYFGGGGGPHDDGPSDGRPDESPKQDGDWKSMDEDQRPDEDQRTDEDLATEKEPKSGHGPNSI
         VGAIED LGDFGVALKGIQ YLKKLAKGKF D SKYFGGGGGP DDGPSD RPDESPK DG  KSMDEDQR DEDQRTDEDL TEKEP SGHG +++
Subjt:  YVGAIEDILGDFGVALKGIQRYLKKLAKGKFLDPSKYFGGGGGPHDDGPSDGRPDESPKQDGDWKSMDEDQRPDEDQRTDEDLATEKEPKSGHGPNSI

A0A6J1DM82 uncharacterized protein LOC111022300 | e-value: 2.2e-80 | identity: 73.53%
Query:  MNLRLIIDRNDWFPATLTDLAHVDKTTTRIKARLTPTQLDMFRQTSFGPILDIDVVFNGPLIHHLLLREVEEPRQDVISFDLFGKRVSFGKREFDLITGI
        M+LRLI++R+DWFP TLT+LAH DKTT+R+K RLTPTQ+DMFRQT FGPILD+DVVFNGPLIHHLLLREVEEPRQD+ISFDLFGKRVSFGKREFDLITG+
Subjt:  MNLRLIIDRNDWFPATLTDLAHVDKTTTRIKARLTPTQLDMFRQTSFGPILDIDVVFNGPLIHHLLLREVEEPRQDVISFDLFGKRVSFGKREFDLITGI

Query:  SHRMNRVDNHIPGRRFRARYFKDS-------------------------VGIVYFIELAMMGKERKQFIDTALLGVMDRWEAFCNYDWSSMIFYRTIWSL
        S+RM RVDN IPGRR RARYFKDS                         VGIVYF+ELAMMGKERKQFID  LLGV+DRWE FCN+DWSS+IF RT+WSL
Subjt:  SHRMNRVDNHIPGRRFRARYFKDS-------------------------VGIVYFIELAMMGKERKQFIDTALLGVMDRWEAFCNYDWSSMIFYRTIWSL

Query:  KNAL
        KNA+
Subjt:  KNAL

A0A6J1DRS0 uncharacterized protein LOC111022607 | e-value: 5.9e-102 | identity: 73.06%
Query:  SGHGPNSIDEDPKRREDDPMITDEDDGMITDGDEDPNQDITIGRPPDGSEVDHADDHGPRVAVIQ-----------------------------------
        SGHGPNS+DEDPKRR++DPMI +EDDGMITDGDEDPNQDITIGR PDGSEVDH DDH P+VAVIQ                                   
Subjt:  SGHGPNSIDEDPKRREDDPMITDEDDGMITDGDEDPNQDITIGRPPDGSEVDHADDHGPRVAVIQ-----------------------------------

Query:  -VEPYLDQDETDLQHAPTGWRLRKRHYSWKLKGIYTPTGRRRITVDAYDPACPIPPQLD--------------------AGLQGKEWYRDLLDPTVQLKD
         VEPYLDQDETDLQHAPTG  LRK HYSWKLKGIYTPTGRRRITVDAYDPACPIPPQLD                    AGLQGKEWYRDLLDPTVQLKD
Subjt:  -VEPYLDQDETDLQHAPTGWRLRKRHYSWKLKGIYTPTGRRRITVDAYDPACPIPPQLD--------------------AGLQGKEWYRDLLDPTVQLKD

Query:  EVVDVLVLFTAKKLEKCLHLCRKKFAIGDVLFSTLLNRTDGPYAAMKPGVLSTRIEYPWSQENTIFRYVFG
        EVVD LVLFTAKKLEKC++LCRKKFAIGDVL STLLNRTDGPYAAMKPGVLSTRIEYP SQENTIFRYVFG
Subjt:  EVVDVLVLFTAKKLEKCLHLCRKKFAIGDVLFSTLLNRTDGPYAAMKPGVLSTRIEYPWSQENTIFRYVFG

A0A6J1DRZ7 uncharacterized protein LOC111023847 | e-value: 1.7e-101 | identity: 63.97%
Query:  MNLRLIIDRNDWFPATLTDLAHVDKTTTRIKARLTPTQLDMFRQTSFGPILDIDVVFNGPLIHHLLLREVEEPRQDVISFDLFGKRVSFGKREFDLITGI
        MN+ L I+++DWFPA L++LAHV KT++R+KARLTP+QLDMF QT FGPIL ++VVFNGPL+HHLLLREVEEP+ D+ISF+LFG RVSFGKREFDLITG+
Subjt:  MNLRLIIDRNDWFPATLTDLAHVDKTTTRIKARLTPTQLDMFRQTSFGPILDIDVVFNGPLIHHLLLREVEEPRQDVISFDLFGKRVSFGKREFDLITGI

Query:  SHRMNRVDNHIPGRRFRARYFKD-------------------------SVGIVYFIELAMMGKERKQFIDTALLGVMDRWEAFCNYDWSSMIFYRTIWSL
         H MNRVD  +  RR R  YF+D                          + IVYFIELAMMGKERK  +DT+LLG++DRWE FCNYDWSSMIF RT+WSL
Subjt:  SHRMNRVDNHIPGRRFRARYFKD-------------------------SVGIVYFIELAMMGKERKQFIDTALLGVMDRWEAFCNYDWSSMIFYRTIWSL

Query:  KNALKARA---------NPSHVETYSLYGFPYAFQVWAYETISTLSLQVATRLSDDTIPRLLKWSCTYSCGFRVLASEVFDNTRSKVKEHLLATDVE
        KNALK +          + SHVETYSLY FPYAFQVWAYETISTLS +VA RL+DD IPRLL+WSCTYS  F VL  EVF+N +SKV   L ATDVE
Subjt:  KNALKARA---------NPSHVETYSLYGFPYAFQVWAYETISTLSLQVATRLSDDTIPRLLKWSCTYSCGFRVLASEVFDNTRSKVKEHLLATDVE

SwissProt top hits
No hits found

Arabidopsis top hits
AT4G08430.1 Ulp1 protease family protein | e-value: 1.2e-06 | identity: 26.61%
Query:  IFRYVF-GGNHWVMLGIDLVEGDLTIWDSLQLATPLNSLEKELKPICTILLAVLHHGGIFAARPDLPVVLWRVRQVHTPQQSNATDCGIFCVRFFEYDVT
        ++ Y+F  GNHWV L IDL +  + ++DS+   T    +  +   + T++ A+L        R      L   R    P+  +A DC I+ +++ E    
Subjt:  IFRYVF-GGNHWVMLGIDLVEGDLTIWDSLQLATPLNSLEKELKPICTILLAVLHHGGIFAARPDLPVVLWRVRQVHTPQQSNATDCGIFCVRFFEYDVT

Query:  GSKLDTLTQDNIVFFRRQYAVQMW
        G   D L  +N+     + AV+M+
Subjt:  GSKLDTLTQDNIVFFRRQYAVQMW

AT5G45570.1 Ulp1 protease family protein | e-value: 5.4e-07 | identity: 26.61%
Query:  IFRYVF-GGNHWVMLGIDLVEGDLTIWDSLQLATPLNSLEKELKPICTILLAVLHHGGIFAARPDLPVVLWRVRQVHTPQQSNATDCGIFCVRFFEYDVT
        ++ Y+F  GNHWV L IDL    + ++DS+   T    +  +   + T++ A+L        R      L   R    P+  +  DC I+ +++ E    
Subjt:  IFRYVF-GGNHWVMLGIDLVEGDLTIWDSLQLATPLNSLEKELKPICTILLAVLHHGGIFAARPDLPVVLWRVRQVHTPQQSNATDCGIFCVRFFEYDVT

Query:  GSKLDTLTQDNIVFFRRQYAVQMW
        G   D L  +N+   R + AV+M+
Subjt:  GSKLDTLTQDNIVFFRRQYAVQMW


Sequences
CDS sequence
ATGTCCCGATCTGGTTCGACCCGGAACAAGTCCGAGATGAGAGGCACATTTGAGTTGTCCCCATTCTGCCTTTCATCTCGGACTTGTCCCGGTCTGATTCGAATGAATTT
GAGACTCATCATAGATCGTAATGACTGGTTTCCGGCCACGTTGACGGACCTTGCCCATGTTGATAAAACCACTACTAGGATTAAGGCCAGGCTAACCCCAACCCAGTTAG
ACATGTTTAGGCAAACGAGTTTCGGTCCTATTTTGGACATTGACGTTGTTTTCAACGGTCCATTGATCCATCACCTGTTGTTGAGAGAGGTTGAAGAGCCTAGACAGGAC
GTCATTAGCTTTGACTTGTTTGGGAAGAGGGTGTCTTTTGGTAAGCGAGAGTTCGACCTAATCACCGGAATCAGTCATAGGATGAATAGGGTAGATAATCATATTCCTGG
ACGAAGATTTAGAGCACGTTACTTTAAAGACAGTGTTGGCATAGTTTACTTCATAGAACTTGCCATGATGGGGAAGGAGAGGAAGCAGTTCATAGATACGGCCCTGTTAG
GTGTTATGGATCGGTGGGAGGCGTTCTGCAACTATGACTGGAGTTCGATGATTTTTTATAGGACGATTTGGAGTCTCAAGAACGCCCTGAAGGCAAGAGCGAACCCTTCA
CACGTTGAGACTTATAGTTTGTACGGGTTTCCGTATGCATTTCAGGTATGGGCATATGAGACGATCTCGACGTTGAGCCTGCAAGTAGCAACGAGGTTGAGTGATGACAC
CATTCCTCGACTTCTCAAGTGGTCGTGCACTTATTCGTGCGGGTTTCGTGTGCTGGCGAGTGAGGTTTTCGATAACACCCGGTCCAAGGTTAAGGAACACTTGTTGGCGA
CGGATGTTGAAGAACAACACATGGTTCGTGTCATTCTTCCACTAGAAGCTCGTGTTATACCTGATCTGCCTGTTGTACCTGATCGGGTTGTTGTACCTGATCGAGCTGTT
GTACCTGATCTGCCTATTTCACCTGATCGGGCTACTGTACCTGATCCGCCCGCAGATGTGGAAATGGGTCCTCTAGAGGATCCGGTAGTAGAGGCACATGCAGTAGACGA
GGCTGGACCTAGTGCAAACGACGGTGAAGGGTTAGAGAAGAGGTGGAAGAAGAATAAATTCAAGAAGAGGATCAGCAGACAGTTGAAGAGACTGGATAACTATGTCGGTG
CTATCGAGGACATACTGGGTGACTTTGGAGTCGCCCTGAAAGGTATTCAGAGATACCTAAAGAAACTGGCGAAGGGTAAATTCCTTGATCCGAGCAAGTATTTTGGAGGT
GGGGGTGGGCCCCATGATGATGGTCCATCAGATGGAAGGCCTGATGAGTCCCCAAAGCAAGATGGAGATTGGAAGAGTATGGACGAGGACCAAAGGCCTGATGAGGACCA
GAGGACGGATGAAGACCTGGCGACTGAAAAGGAACCGAAGTCGGGACATGGTCCGAATAGTATCGACGAGGATCCGAAAAGAAGAGAAGATGATCCAATGATAACGGACG
AGGACGATGGTATGATAACGGATGGGGACGAGGATCCAAATCAGGACATTACGATCGGGAGACCGCCTGATGGCTCAGAAGTGGATCATGCAGATGACCATGGACCTCGG
GTGGCCGTAATTCAGGTTGAACCGTACCTTGACCAGGACGAAACTGACCTTCAGCATGCCCCAACTGGTTGGAGGCTACGCAAGCGCCATTATTCGTGGAAACTGAAGGG
TATATACACACCAACCGGCCGGCGTAGAATCACCGTGGACGCATATGACCCAGCATGTCCCATTCCTCCGCAACTGGACGCTGGCTTACAAGGGAAGGAATGGTATCGTG
ATCTACTAGACCCTACTGTCCAATTGAAGGACGAGGTAGTTGATGTTCTCGTCCTATTTACGGCCAAAAAGTTGGAGAAGTGTCTACATCTCTGTCGCAAAAAGTTTGCA
ATAGGCGACGTGCTATTTTCGACTCTGCTGAACCGAACAGACGGTCCGTATGCGGCCATGAAACCAGGGGTCCTGTCTACGAGAATCGAATACCCCTGGAGCCAAGAGAA
TACCATATTTCGATATGTCTTCGGTGGGAACCACTGGGTGATGCTCGGGATTGATCTTGTGGAAGGTGACTTAACCATATGGGATTCACTCCAATTGGCCACTCCACTAA
ATTCACTCGAGAAGGAGCTGAAGCCCATTTGTACGATCCTACTTGCGGTACTACATCATGGCGGGATATTTGCAGCACGACCGGACTTGCCAGTGGTGCTATGGAGGGTG
CGTCAGGTTCACACACCCCAACAAAGTAACGCCACAGATTGCGGGATTTTCTGTGTACGCTTCTTCGAGTACGATGTTACCGGGTCAAAGCTAGACACTTTGACCCAAGA
TAATATTGTATTTTTTAGGCGTCAGTACGCTGTACAGATGTGGGCGCGCCGTCCCATTTTTTGA
mRNA sequence
ATGTCCCGATCTGGTTCGACCCGGAACAAGTCCGAGATGAGAGGCACATTTGAGTTGTCCCCATTCTGCCTTTCATCTCGGACTTGTCCCGGTCTGATTCGAATGAATTT
GAGACTCATCATAGATCGTAATGACTGGTTTCCGGCCACGTTGACGGACCTTGCCCATGTTGATAAAACCACTACTAGGATTAAGGCCAGGCTAACCCCAACCCAGTTAG
ACATGTTTAGGCAAACGAGTTTCGGTCCTATTTTGGACATTGACGTTGTTTTCAACGGTCCATTGATCCATCACCTGTTGTTGAGAGAGGTTGAAGAGCCTAGACAGGAC
GTCATTAGCTTTGACTTGTTTGGGAAGAGGGTGTCTTTTGGTAAGCGAGAGTTCGACCTAATCACCGGAATCAGTCATAGGATGAATAGGGTAGATAATCATATTCCTGG
ACGAAGATTTAGAGCACGTTACTTTAAAGACAGTGTTGGCATAGTTTACTTCATAGAACTTGCCATGATGGGGAAGGAGAGGAAGCAGTTCATAGATACGGCCCTGTTAG
GTGTTATGGATCGGTGGGAGGCGTTCTGCAACTATGACTGGAGTTCGATGATTTTTTATAGGACGATTTGGAGTCTCAAGAACGCCCTGAAGGCAAGAGCGAACCCTTCA
CACGTTGAGACTTATAGTTTGTACGGGTTTCCGTATGCATTTCAGGTATGGGCATATGAGACGATCTCGACGTTGAGCCTGCAAGTAGCAACGAGGTTGAGTGATGACAC
CATTCCTCGACTTCTCAAGTGGTCGTGCACTTATTCGTGCGGGTTTCGTGTGCTGGCGAGTGAGGTTTTCGATAACACCCGGTCCAAGGTTAAGGAACACTTGTTGGCGA
CGGATGTTGAAGAACAACACATGGTTCGTGTCATTCTTCCACTAGAAGCTCGTGTTATACCTGATCTGCCTGTTGTACCTGATCGGGTTGTTGTACCTGATCGAGCTGTT
GTACCTGATCTGCCTATTTCACCTGATCGGGCTACTGTACCTGATCCGCCCGCAGATGTGGAAATGGGTCCTCTAGAGGATCCGGTAGTAGAGGCACATGCAGTAGACGA
GGCTGGACCTAGTGCAAACGACGGTGAAGGGTTAGAGAAGAGGTGGAAGAAGAATAAATTCAAGAAGAGGATCAGCAGACAGTTGAAGAGACTGGATAACTATGTCGGTG
CTATCGAGGACATACTGGGTGACTTTGGAGTCGCCCTGAAAGGTATTCAGAGATACCTAAAGAAACTGGCGAAGGGTAAATTCCTTGATCCGAGCAAGTATTTTGGAGGT
GGGGGTGGGCCCCATGATGATGGTCCATCAGATGGAAGGCCTGATGAGTCCCCAAAGCAAGATGGAGATTGGAAGAGTATGGACGAGGACCAAAGGCCTGATGAGGACCA
GAGGACGGATGAAGACCTGGCGACTGAAAAGGAACCGAAGTCGGGACATGGTCCGAATAGTATCGACGAGGATCCGAAAAGAAGAGAAGATGATCCAATGATAACGGACG
AGGACGATGGTATGATAACGGATGGGGACGAGGATCCAAATCAGGACATTACGATCGGGAGACCGCCTGATGGCTCAGAAGTGGATCATGCAGATGACCATGGACCTCGG
GTGGCCGTAATTCAGGTTGAACCGTACCTTGACCAGGACGAAACTGACCTTCAGCATGCCCCAACTGGTTGGAGGCTACGCAAGCGCCATTATTCGTGGAAACTGAAGGG
TATATACACACCAACCGGCCGGCGTAGAATCACCGTGGACGCATATGACCCAGCATGTCCCATTCCTCCGCAACTGGACGCTGGCTTACAAGGGAAGGAATGGTATCGTG
ATCTACTAGACCCTACTGTCCAATTGAAGGACGAGGTAGTTGATGTTCTCGTCCTATTTACGGCCAAAAAGTTGGAGAAGTGTCTACATCTCTGTCGCAAAAAGTTTGCA
ATAGGCGACGTGCTATTTTCGACTCTGCTGAACCGAACAGACGGTCCGTATGCGGCCATGAAACCAGGGGTCCTGTCTACGAGAATCGAATACCCCTGGAGCCAAGAGAA
TACCATATTTCGATATGTCTTCGGTGGGAACCACTGGGTGATGCTCGGGATTGATCTTGTGGAAGGTGACTTAACCATATGGGATTCACTCCAATTGGCCACTCCACTAA
ATTCACTCGAGAAGGAGCTGAAGCCCATTTGTACGATCCTACTTGCGGTACTACATCATGGCGGGATATTTGCAGCACGACCGGACTTGCCAGTGGTGCTATGGAGGGTG
CGTCAGGTTCACACACCCCAACAAAGTAACGCCACAGATTGCGGGATTTTCTGTGTACGCTTCTTCGAGTACGATGTTACCGGGTCAAAGCTAGACACTTTGACCCAAGA
TAATATTGTATTTTTTAGGCGTCAGTACGCTGTACAGATGTGGGCGCGCCGTCCCATTTTTTGA
Protein sequence
MSRSGSTRNKSEMRGTFELSPFCLSSRTCPGLIRMNLRLIIDRNDWFPATLTDLAHVDKTTTRIKARLTPTQLDMFRQTSFGPILDIDVVFNGPLIHHLLLREVEEPRQD
VISFDLFGKRVSFGKREFDLITGISHRMNRVDNHIPGRRFRARYFKDSVGIVYFIELAMMGKERKQFIDTALLGVMDRWEAFCNYDWSSMIFYRTIWSLKNALKARANPS
HVETYSLYGFPYAFQVWAYETISTLSLQVATRLSDDTIPRLLKWSCTYSCGFRVLASEVFDNTRSKVKEHLLATDVEEQHMVRVILPLEARVIPDLPVVPDRVVVPDRAV
VPDLPISPDRATVPDPPADVEMGPLEDPVVEAHAVDEAGPSANDGEGLEKRWKKNKFKKRISRQLKRLDNYVGAIEDILGDFGVALKGIQRYLKKLAKGKFLDPSKYFGG
GGGPHDDGPSDGRPDESPKQDGDWKSMDEDQRPDEDQRTDEDLATEKEPKSGHGPNSIDEDPKRREDDPMITDEDDGMITDGDEDPNQDITIGRPPDGSEVDHADDHGPR
VAVIQVEPYLDQDETDLQHAPTGWRLRKRHYSWKLKGIYTPTGRRRITVDAYDPACPIPPQLDAGLQGKEWYRDLLDPTVQLKDEVVDVLVLFTAKKLEKCLHLCRKKFA
IGDVLFSTLLNRTDGPYAAMKPGVLSTRIEYPWSQENTIFRYVFGGNHWVMLGIDLVEGDLTIWDSLQLATPLNSLEKELKPICTILLAVLHHGGIFAARPDLPVVLWRV
RQVHTPQQSNATDCGIFCVRFFEYDVTGSKLDTLTQDNIVFFRRQYAVQMWARRPIF
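
The CDS and protein sequences listed above can be cross-checked by translating the CDS. The following is a minimal sketch, assuming the standard genetic code (NCBI translation table 1) applies to this nuclear gene; `translate` is a hypothetical helper written for illustration, not a CuGenDB tool. Translation stops at the first stop codon, and codons containing ambiguous bases are reported as `X`.

```python
from itertools import product

# Standard genetic code in the conventional T, C, A, G ordering,
# with the third codon position varying fastest.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    # Drop line breaks and other whitespace from the wrapped sequence.
    cds = "".join(cds.split()).upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # X marks an unknown codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First seven codons of the CDS listed in this record.
print(translate("ATGTCCCGATCTGGTTCGACC"))
```

The first seven codons of the CDS translate to `MSRSGST`, which matches the start of the listed protein sequence. Passing the full CDS text (line breaks included) to `translate` and comparing the result with the protein sequence would extend the same check to the whole record.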