; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc09g27320 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc09g27320
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionUlp1-like peptidase
Genome locationchr9:20469160..20475557
RNA-Seq ExpressionMoc09g27320
SyntenyMoc09g27320
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0008234 - cysteine-type peptidase activity (molecular function)
InterPro domainsIPR003653 - Ulp1 protease family, C-terminal catalytic domain
IPR038765 - Papain-like cysteine peptidase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022146372.1 uncharacterized protein LOC111015600 [Momordica charantia]5.2e-4247.92Show/hide
Query:  MDIVFNGPLIHHLLLREVEEPRQDIISFDLFGKRVSFGKREFDLITRLSHRMIRVDNDIPGRRLRARYFKDSVRVKCRDLEKIFMEAVFEDDED------
        M +VFNGPLIHHLLL EVEEPRQD+ISFDLF KRVSFGKREFDLIT LSH+M RV+N IPGRRLRARYFKDSVRVKC +LEKIF+E +F DDED      
Subjt:  MDIVFNGPLIHHLLLREVEEPRQDIISFDLFGKRVSFGKREFDLITRLSHRMIRVDNDIPGRRLRARYFKDSVRVKCRDLEKIFMEAVFEDDED------

Query:  ----------RRTPSRINCRRTNRRQET--NPHTLRLIVST---GFRMGVRDDLDVESTRSHKAKRRRHSFDFLG------GCALILAGFVL---WGSKV
                  +     I+      R E   N     +I        +  ++D L     ++        ++   G        + +LA  V    W SKV
Subjt:  ----------RRTPSRINCRRTNRRQET--NPHTLRLIVST---GFRMGVRDDLDVESTRSHKAKRRRHSFDFLG------GCALILAGFVL---WGSKV

Query:  KEYLVATNADAQHMVRIMRPPKARAIPDLPDVPEPPAVPE
        KE+L+AT+A+ QHMVR++ PP+ R IPD P VP+   VP+
Subjt:  KEYLVATNADAQHMVRIMRPPKARAIPDLPDVPEPPAVPE

XP_022148137.1 uncharacterized protein LOC111016890 [Momordica charantia]2.1e-5181.67Show/hide
Query:  MKSGVLSTRINYPWSEENTIFRYVYGWQSDHNVPWSDVDMVYTPINVGGNYWVMLGIDLVEGDITVWDSLQTATPLDALEKDLKPMCTILPAVLHHGEIF
        MK  VLSTRINYPW EENTI+RYV+G QSDHNVPWSD D+VYTP+NVGGN+WVMLGIDLV+GDITVWDSLQTATPLD LEK+LKPMCTILP +LHHG IF
Subjt:  MKSGVLSTRINYPWSEENTIFRYVYGWQSDHNVPWSDVDMVYTPINVGGNYWVMLGIDLVEGDITVWDSLQTATPLDALEKDLKPMCTILPAVLHHGEIF

Query:  SVRSNLQLVSWRVRRVRIPQ
        SVR +L +V WRVRRVR+PQ
Subjt:  SVRSNLQLVSWRVRRVRIPQ

XP_022153201.1 uncharacterized protein LOC111020757 [Momordica charantia]4.4e-6542.56Show/hide
Query:  MDIVFNGPLIHHLLLREVEEPRQDIISFDLFGKRVSFGKREFDLITRLSHRMIRVDNDIPGRRLRARYFKDSVRVKCRDLEKIFMEAVFEDDED---RRT
        +D+VFNGPLIHHLLLREVEEPRQD+ISFDLFGKRVSFGKREFDLIT LSHRM RVDN IPGRRLRARYFKD VRVKC +LEKIF+E VF DDED    R 
Subjt:  MDIVFNGPLIHHLLLREVEEPRQDIISFDLFGKRVSFGKREFDLITRLSHRMIRVDNDIPGRRLRARYFKDSVRVKCRDLEKIFMEAVFEDDED---RRT

Query:  PSRINCRRTNRRQETNPHTLRLIVST--------------------GFRMGVRDDLDVESTR-----SHKAKRRRHSFDF--------------------
           I      + ++    T  L V                        +  ++D L V   +     SH      + F +                    
Subjt:  PSRINCRRTNRRQETNPHTLRLIVST--------------------GFRMGVRDDLDVESTR-----SHKAKRRRHSFDF--------------------

Query:  -LGGCALILAGF-VLWG-------SKVKEYLVATNADAQHMVRIMRPPKARAIPDLPDVPEPPAVPELPADVERGTQKRRVKEKGKNIVEDPIEEAQALD
         L    +   GF VL         SKVKE+L+AT+A  QHMVR++ PP+ R IPD P VP+   VP+ PA  ER        +     +EDP+ +A A+D
Subjt:  -LGGCALILAGF-VLWG-------SKVKEYLVATNADAQHMVRIMRPPKARAIPDLPDVPEPPAVPELPADVERGTQKRRVKEKGKNIVEDPIEEAQALD

Query:  DAALDDPALYDVGPSENGGQAPQKRLKQNKFKNRISKRLKRLKDCVGAIEATLSGFGVALKVWSLLVHTFR------------------------ARPDE
        +A           PS N G+  +KRLK+NKFK RIS+RLKRL +CVGAIE  L  FGVALK   + +                             RPDE
Subjt:  DAALDDPALYDVGPSENGGQAPQKRLKQNKFKNRISKRLKRLKDCVGAIEATLSGFGVALKVWSLLVHTFR------------------------ARPDE

Query:  ASKLDEGSKSKDEDRKPDDAPKTNDDRTME
        + K D G KS DED++ D+  +T++D   E
Subjt:  ASKLDEGSKSKDEDRKPDDAPKTNDDRTME

XP_022155476.1 uncharacterized protein LOC111022607 [Momordica charantia]2.5e-7656.07Show/hide
Query:  GSKSKDEDRKPDDAPKTNDDRTMEH--GTITDGDN------ALDAYPD-----------RPVGLFQDVTVGRQEPNIASDTRPVSRRVRRPYKDWAPDAV
        G  S DED K  D    ND   ME   G ITDGD        +   PD             V + QD+TVGRQEP+   DT+P  RRVRRPYKDWAPDA+
Subjt:  GSKSKDEDRKPDDAPKTNDDRTMEH--GTITDGDN------ALDAYPD-----------RPVGLFQDVTVGRQEPNIASDTRPVSRRVRRPYKDWAPDAV

Query:  VKVEPYLDQDEYDLQQGPTGHGLRKQHYSWKLKNIHTPTSQRGITVDSYDP--DICFYSKRTFQCLSDLEALKKRIIEVVPFL--RNWTISSRDGWMTRR
        VKVEPYLDQDE DLQ  PTG GLRK HYSWKLK I+TPT +R ITVD+YDP   I       FQ   D   +  R       L  + W     D  +  +
Subjt:  VKVEPYLDQDEYDLQQGPTGHGLRKQHYSWKLKNIHTPTSQRGITVDSYDP--DICFYSKRTFQCLSDLEALKKRIIEVVPFL--RNWTISSRDGWMTRR

Query:  RMRYCGPLQLVLDGLVLFTAKKLEKCLHLCRKKFAIGDVLLSTLLNRTDGPYTAMKSGVLSTRINYPWSEENTIFRYVYGWQSDHNVPWSDVDMVYTPIN
                  V+D LVLFTAKKLEKC++LCRKKFAIGDVLLSTLLNRTDGPY AMK GVLSTRI YP S+ENTIFRYV+G QSD NV W+D D+VYTPIN
Subjt:  RMRYCGPLQLVLDGLVLFTAKKLEKCLHLCRKKFAIGDVLLSTLLNRTDGPYTAMKSGVLSTRINYPWSEENTIFRYVYGWQSDHNVPWSDVDMVYTPIN

Query:  VGGNY
        +GGN+
Subjt:  VGGNY

XP_022158807.1 uncharacterized protein LOC111025273 [Momordica charantia]1.1e-5563.8Show/hide
Query:  LDGLVLFTAKKLEKCLHLCRKKFAIGDVLLSTLLNRTDGPYTAMKSGVLSTRINYPWSEENTIFRYVYGWQSDHNVPWSDVDMVYTPINVGGNYWVMLGI
        +D L++ TA+K+EKC HL R +FAIGDVLLS LL RTDGPY AMK GVL ++  Y W +E TIFRYV G QSD++  WS+ D+VYT +N+GGN+WVM+GI
Subjt:  LDGLVLFTAKKLEKCLHLCRKKFAIGDVLLSTLLNRTDGPYTAMKSGVLSTRINYPWSEENTIFRYVYGWQSDHNVPWSDVDMVYTPINVGGNYWVMLGI

Query:  DLVEGDITVWDSLQTATPLDALEKDLKPMCTILPAVLHHGEIFSVRSNLQLVSWRVRRVRIPQ
        DLVEGD+TVWDSLQ  TPL+ LEK LKPMCTI+PA+LH   I ++R NL +V WRVRR  +PQ
Subjt:  DLVEGDITVWDSLQTATPLDALEKDLKPMCTILPAVLHHGEIFSVRSNLQLVSWRVRRVRIPQ

TrEMBL top hitse value%identityAlignment
A0A6J1CZE8 uncharacterized protein LOC1110156002.5e-4247.92Show/hide
Query:  MDIVFNGPLIHHLLLREVEEPRQDIISFDLFGKRVSFGKREFDLITRLSHRMIRVDNDIPGRRLRARYFKDSVRVKCRDLEKIFMEAVFEDDED------
        M +VFNGPLIHHLLL EVEEPRQD+ISFDLF KRVSFGKREFDLIT LSH+M RV+N IPGRRLRARYFKDSVRVKC +LEKIF+E +F DDED      
Subjt:  MDIVFNGPLIHHLLLREVEEPRQDIISFDLFGKRVSFGKREFDLITRLSHRMIRVDNDIPGRRLRARYFKDSVRVKCRDLEKIFMEAVFEDDED------

Query:  ----------RRTPSRINCRRTNRRQET--NPHTLRLIVST---GFRMGVRDDLDVESTRSHKAKRRRHSFDFLG------GCALILAGFVL---WGSKV
                  +     I+      R E   N     +I        +  ++D L     ++        ++   G        + +LA  V    W SKV
Subjt:  ----------RRTPSRINCRRTNRRQET--NPHTLRLIVST---GFRMGVRDDLDVESTRSHKAKRRRHSFDFLG------GCALILAGFVL---WGSKV

Query:  KEYLVATNADAQHMVRIMRPPKARAIPDLPDVPEPPAVPE
        KE+L+AT+A+ QHMVR++ PP+ R IPD P VP+   VP+
Subjt:  KEYLVATNADAQHMVRIMRPPKARAIPDLPDVPEPPAVPE

A0A6J1D492 uncharacterized protein LOC1110168901.0e-5181.67Show/hide
Query:  MKSGVLSTRINYPWSEENTIFRYVYGWQSDHNVPWSDVDMVYTPINVGGNYWVMLGIDLVEGDITVWDSLQTATPLDALEKDLKPMCTILPAVLHHGEIF
        MK  VLSTRINYPW EENTI+RYV+G QSDHNVPWSD D+VYTP+NVGGN+WVMLGIDLV+GDITVWDSLQTATPLD LEK+LKPMCTILP +LHHG IF
Subjt:  MKSGVLSTRINYPWSEENTIFRYVYGWQSDHNVPWSDVDMVYTPINVGGNYWVMLGIDLVEGDITVWDSLQTATPLDALEKDLKPMCTILPAVLHHGEIF

Query:  SVRSNLQLVSWRVRRVRIPQ
        SVR +L +V WRVRRVR+PQ
Subjt:  SVRSNLQLVSWRVRRVRIPQ

A0A6J1DJX9 uncharacterized protein LOC1110207572.1e-6542.56Show/hide
Query:  MDIVFNGPLIHHLLLREVEEPRQDIISFDLFGKRVSFGKREFDLITRLSHRMIRVDNDIPGRRLRARYFKDSVRVKCRDLEKIFMEAVFEDDED---RRT
        +D+VFNGPLIHHLLLREVEEPRQD+ISFDLFGKRVSFGKREFDLIT LSHRM RVDN IPGRRLRARYFKD VRVKC +LEKIF+E VF DDED    R 
Subjt:  MDIVFNGPLIHHLLLREVEEPRQDIISFDLFGKRVSFGKREFDLITRLSHRMIRVDNDIPGRRLRARYFKDSVRVKCRDLEKIFMEAVFEDDED---RRT

Query:  PSRINCRRTNRRQETNPHTLRLIVST--------------------GFRMGVRDDLDVESTR-----SHKAKRRRHSFDF--------------------
           I      + ++    T  L V                        +  ++D L V   +     SH      + F +                    
Subjt:  PSRINCRRTNRRQETNPHTLRLIVST--------------------GFRMGVRDDLDVESTR-----SHKAKRRRHSFDF--------------------

Query:  -LGGCALILAGF-VLWG-------SKVKEYLVATNADAQHMVRIMRPPKARAIPDLPDVPEPPAVPELPADVERGTQKRRVKEKGKNIVEDPIEEAQALD
         L    +   GF VL         SKVKE+L+AT+A  QHMVR++ PP+ R IPD P VP+   VP+ PA  ER        +     +EDP+ +A A+D
Subjt:  -LGGCALILAGF-VLWG-------SKVKEYLVATNADAQHMVRIMRPPKARAIPDLPDVPEPPAVPELPADVERGTQKRRVKEKGKNIVEDPIEEAQALD

Query:  DAALDDPALYDVGPSENGGQAPQKRLKQNKFKNRISKRLKRLKDCVGAIEATLSGFGVALKVWSLLVHTFR------------------------ARPDE
        +A           PS N G+  +KRLK+NKFK RIS+RLKRL +CVGAIE  L  FGVALK   + +                             RPDE
Subjt:  DAALDDPALYDVGPSENGGQAPQKRLKQNKFKNRISKRLKRLKDCVGAIEATLSGFGVALKVWSLLVHTFR------------------------ARPDE

Query:  ASKLDEGSKSKDEDRKPDDAPKTNDDRTME
        + K D G KS DED++ D+  +T++D   E
Subjt:  ASKLDEGSKSKDEDRKPDDAPKTNDDRTME

A0A6J1DRS0 uncharacterized protein LOC1110226071.2e-7656.07Show/hide
Query:  GSKSKDEDRKPDDAPKTNDDRTMEH--GTITDGDN------ALDAYPD-----------RPVGLFQDVTVGRQEPNIASDTRPVSRRVRRPYKDWAPDAV
        G  S DED K  D    ND   ME   G ITDGD        +   PD             V + QD+TVGRQEP+   DT+P  RRVRRPYKDWAPDA+
Subjt:  GSKSKDEDRKPDDAPKTNDDRTMEH--GTITDGDN------ALDAYPD-----------RPVGLFQDVTVGRQEPNIASDTRPVSRRVRRPYKDWAPDAV

Query:  VKVEPYLDQDEYDLQQGPTGHGLRKQHYSWKLKNIHTPTSQRGITVDSYDP--DICFYSKRTFQCLSDLEALKKRIIEVVPFL--RNWTISSRDGWMTRR
        VKVEPYLDQDE DLQ  PTG GLRK HYSWKLK I+TPT +R ITVD+YDP   I       FQ   D   +  R       L  + W     D  +  +
Subjt:  VKVEPYLDQDEYDLQQGPTGHGLRKQHYSWKLKNIHTPTSQRGITVDSYDP--DICFYSKRTFQCLSDLEALKKRIIEVVPFL--RNWTISSRDGWMTRR

Query:  RMRYCGPLQLVLDGLVLFTAKKLEKCLHLCRKKFAIGDVLLSTLLNRTDGPYTAMKSGVLSTRINYPWSEENTIFRYVYGWQSDHNVPWSDVDMVYTPIN
                  V+D LVLFTAKKLEKC++LCRKKFAIGDVLLSTLLNRTDGPY AMK GVLSTRI YP S+ENTIFRYV+G QSD NV W+D D+VYTPIN
Subjt:  RMRYCGPLQLVLDGLVLFTAKKLEKCLHLCRKKFAIGDVLLSTLLNRTDGPYTAMKSGVLSTRINYPWSEENTIFRYVYGWQSDHNVPWSDVDMVYTPIN

Query:  VGGNY
        +GGN+
Subjt:  VGGNY

A0A6J1DY60 uncharacterized protein LOC1110252735.2e-5663.8Show/hide
Query:  LDGLVLFTAKKLEKCLHLCRKKFAIGDVLLSTLLNRTDGPYTAMKSGVLSTRINYPWSEENTIFRYVYGWQSDHNVPWSDVDMVYTPINVGGNYWVMLGI
        +D L++ TA+K+EKC HL R +FAIGDVLLS LL RTDGPY AMK GVL ++  Y W +E TIFRYV G QSD++  WS+ D+VYT +N+GGN+WVM+GI
Subjt:  LDGLVLFTAKKLEKCLHLCRKKFAIGDVLLSTLLNRTDGPYTAMKSGVLSTRINYPWSEENTIFRYVYGWQSDHNVPWSDVDMVYTPINVGGNYWVMLGI

Query:  DLVEGDITVWDSLQTATPLDALEKDLKPMCTILPAVLHHGEIFSVRSNLQLVSWRVRRVRIPQ
        DLVEGD+TVWDSLQ  TPL+ LEK LKPMCTI+PA+LH   I ++R NL +V WRVRR  +PQ
Subjt:  DLVEGDITVWDSLQTATPLDALEKDLKPMCTILPAVLHHGEIFSVRSNLQLVSWRVRRVRIPQ

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGACATAGTTTTTAACGGTCCATTAATACATCATCTGTTGTTGAGAGAAGTTGAAGAGCCTAGGCAGGACATCATTAGTTTCGACCTGTTTGGGAAAAGGGTGTCCTT
TGGTAAGCGGGAGTTTGACCTAATCACCAGACTCAGTCATAGGATGATTAGGGTAGATAACGATATTCCTGGCCGACGACTTAGAGCACGTTACTTTAAGGACAGTGTCA
GGGTTAAGTGTAGGGATTTAGAGAAGATTTTTATGGAGGCAGTTTTTGAAGACGACGAGGACCGGAGAACGCCCTCAAGGATAAACTGCCGGCGTACCAACAGAAGGCAA
GAGACGAATCCACACACGTTGAGACTTATAGTCTCTACGGGTTTCCGTATGGGTGTACGAGACGATCTTGACGTTGAGTCTACGCGTAGCCACAAGGCTAAGCGACGACG
CCATTCCTTTGACTTCTTAGGTGGTTGTGCACTTATTCTTGCGGGTTTCGTACTCTGGGGATCCAAGGTTAAGGAATACTTGGTTGCGACGAATGCTGATGCACAACACA
TGGTCCGTATCATGCGTCCTCCAAAAGCCCGCGCTATACCTGATCTGCCTGATGTACCTGAGCCGCCGGCTGTACCTGAGCTGCCTGCAGATGTGGAAAGGGGTACTCAG
AAAAGAAGAGTGAAGGAGAAAGGGAAGAATATCGTAGAGGATCCGATAGAAGAGGCCCAGGCATTGGACGATGCTGCATTAGATGATCCTGCATTATACGATGTTGGACC
CAGTGAAAATGGCGGTCAAGCACCACAGAAGAGGTTGAAACAGAATAAATTCAAGAATAGGATTAGTAAACGGTTGAAGAGGCTCAAAGACTGTGTTGGTGCTATCGAGG
CCACACTGAGTGGCTTTGGGGTCGCCCTGAAAGTTTGGTCCTTATTGGTTCATACTTTTCGTGCAAGGCCTGATGAGGCCTCGAAGCTAGATGAAGGTTCGAAGAGTAAG
GACGAGGACCGGAAGCCTGATGATGCCCCAAAGACTAACGATGATCGGACTATGGAGCATGGTACGATAACGGATGGGGACAATGCATTAGATGCTTACCCCGATCGTCC
TGTCGGTTTATTTCAAGATGTCACTGTTGGTAGGCAAGAGCCAAATATTGCATCAGATACGCGACCCGTCAGTCGACGCGTTAGGCGTCCCTATAAGGACTGGGCACCAG
ACGCAGTCGTTAAGGTTGAACCTTACCTTGACCAGGATGAATATGACCTTCAGCAGGGCCCAACTGGGCATGGGCTACGCAAGCAGCATTACTCGTGGAAGCTGAAGAAT
ATACACACACCAACCAGTCAGCGTGGGATCACCGTGGATAGCTACGACCCAGATATCTGCTTCTACTCTAAGAGAACCTTCCAGTGCTTAAGTGATTTAGAGGCCCTCAA
GAAAAGGATTATCGAAGTGGTCCCATTCCTCCGCAATTGGACGATAAGTTCCAGAGATGGATGGATGACCCGAAGACGGATGAGATATTGCGGTCCACTGCAACTGGTAC
TTGATGGTCTCGTCCTGTTTACGGCGAAAAAGTTGGAGAAGTGTCTCCATCTATGTCGTAAGAAGTTTGCGATAGGCGATGTACTTCTTTCGACTCTGCTGAATCGAACA
GATGGTCCATATACGGCCATGAAGTCGGGTGTCTTGTCCACTAGGATCAACTACCCTTGGAGCGAAGAGAATACCATCTTTCGATATGTCTACGGTTGGCAATCGGACCA
CAACGTGCCCTGGAGTGATGTAGACATGGTGTACACCCCCATCAATGTAGGCGGGAACTACTGGGTGATGCTCGGCATCGATCTTGTAGAAGGCGACATAACCGTATGGG
ATTCACTCCAAACGGCCACTCCACTGGATGCACTTGAGAAGGATCTGAAGCCCATGTGTACGATCCTACCCGCGGTGCTGCATCATGGCGAGATATTTTCAGTTCGATCC
AACTTGCAATTGGTGTCGTGGAGGGTGCGTCGGGTTCGCATACCACAGTAG
mRNA sequenceShow/hide mRNA sequence
ATGGACATAGTTTTTAACGGTCCATTAATACATCATCTGTTGTTGAGAGAAGTTGAAGAGCCTAGGCAGGACATCATTAGTTTCGACCTGTTTGGGAAAAGGGTGTCCTT
TGGTAAGCGGGAGTTTGACCTAATCACCAGACTCAGTCATAGGATGATTAGGGTAGATAACGATATTCCTGGCCGACGACTTAGAGCACGTTACTTTAAGGACAGTGTCA
GGGTTAAGTGTAGGGATTTAGAGAAGATTTTTATGGAGGCAGTTTTTGAAGACGACGAGGACCGGAGAACGCCCTCAAGGATAAACTGCCGGCGTACCAACAGAAGGCAA
GAGACGAATCCACACACGTTGAGACTTATAGTCTCTACGGGTTTCCGTATGGGTGTACGAGACGATCTTGACGTTGAGTCTACGCGTAGCCACAAGGCTAAGCGACGACG
CCATTCCTTTGACTTCTTAGGTGGTTGTGCACTTATTCTTGCGGGTTTCGTACTCTGGGGATCCAAGGTTAAGGAATACTTGGTTGCGACGAATGCTGATGCACAACACA
TGGTCCGTATCATGCGTCCTCCAAAAGCCCGCGCTATACCTGATCTGCCTGATGTACCTGAGCCGCCGGCTGTACCTGAGCTGCCTGCAGATGTGGAAAGGGGTACTCAG
AAAAGAAGAGTGAAGGAGAAAGGGAAGAATATCGTAGAGGATCCGATAGAAGAGGCCCAGGCATTGGACGATGCTGCATTAGATGATCCTGCATTATACGATGTTGGACC
CAGTGAAAATGGCGGTCAAGCACCACAGAAGAGGTTGAAACAGAATAAATTCAAGAATAGGATTAGTAAACGGTTGAAGAGGCTCAAAGACTGTGTTGGTGCTATCGAGG
CCACACTGAGTGGCTTTGGGGTCGCCCTGAAAGTTTGGTCCTTATTGGTTCATACTTTTCGTGCAAGGCCTGATGAGGCCTCGAAGCTAGATGAAGGTTCGAAGAGTAAG
GACGAGGACCGGAAGCCTGATGATGCCCCAAAGACTAACGATGATCGGACTATGGAGCATGGTACGATAACGGATGGGGACAATGCATTAGATGCTTACCCCGATCGTCC
TGTCGGTTTATTTCAAGATGTCACTGTTGGTAGGCAAGAGCCAAATATTGCATCAGATACGCGACCCGTCAGTCGACGCGTTAGGCGTCCCTATAAGGACTGGGCACCAG
ACGCAGTCGTTAAGGTTGAACCTTACCTTGACCAGGATGAATATGACCTTCAGCAGGGCCCAACTGGGCATGGGCTACGCAAGCAGCATTACTCGTGGAAGCTGAAGAAT
ATACACACACCAACCAGTCAGCGTGGGATCACCGTGGATAGCTACGACCCAGATATCTGCTTCTACTCTAAGAGAACCTTCCAGTGCTTAAGTGATTTAGAGGCCCTCAA
GAAAAGGATTATCGAAGTGGTCCCATTCCTCCGCAATTGGACGATAAGTTCCAGAGATGGATGGATGACCCGAAGACGGATGAGATATTGCGGTCCACTGCAACTGGTAC
TTGATGGTCTCGTCCTGTTTACGGCGAAAAAGTTGGAGAAGTGTCTCCATCTATGTCGTAAGAAGTTTGCGATAGGCGATGTACTTCTTTCGACTCTGCTGAATCGAACA
GATGGTCCATATACGGCCATGAAGTCGGGTGTCTTGTCCACTAGGATCAACTACCCTTGGAGCGAAGAGAATACCATCTTTCGATATGTCTACGGTTGGCAATCGGACCA
CAACGTGCCCTGGAGTGATGTAGACATGGTGTACACCCCCATCAATGTAGGCGGGAACTACTGGGTGATGCTCGGCATCGATCTTGTAGAAGGCGACATAACCGTATGGG
ATTCACTCCAAACGGCCACTCCACTGGATGCACTTGAGAAGGATCTGAAGCCCATGTGTACGATCCTACCCGCGGTGCTGCATCATGGCGAGATATTTTCAGTTCGATCC
AACTTGCAATTGGTGTCGTGGAGGGTGCGTCGGGTTCGCATACCACAGTAG
Protein sequenceShow/hide protein sequence
MDIVFNGPLIHHLLLREVEEPRQDIISFDLFGKRVSFGKREFDLITRLSHRMIRVDNDIPGRRLRARYFKDSVRVKCRDLEKIFMEAVFEDDEDRRTPSRINCRRTNRRQ
ETNPHTLRLIVSTGFRMGVRDDLDVESTRSHKAKRRRHSFDFLGGCALILAGFVLWGSKVKEYLVATNADAQHMVRIMRPPKARAIPDLPDVPEPPAVPELPADVERGTQ
KRRVKEKGKNIVEDPIEEAQALDDAALDDPALYDVGPSENGGQAPQKRLKQNKFKNRISKRLKRLKDCVGAIEATLSGFGVALKVWSLLVHTFRARPDEASKLDEGSKSK
DEDRKPDDAPKTNDDRTMEHGTITDGDNALDAYPDRPVGLFQDVTVGRQEPNIASDTRPVSRRVRRPYKDWAPDAVVKVEPYLDQDEYDLQQGPTGHGLRKQHYSWKLKN
IHTPTSQRGITVDSYDPDICFYSKRTFQCLSDLEALKKRIIEVVPFLRNWTISSRDGWMTRRRMRYCGPLQLVLDGLVLFTAKKLEKCLHLCRKKFAIGDVLLSTLLNRT
DGPYTAMKSGVLSTRINYPWSEENTIFRYVYGWQSDHNVPWSDVDMVYTPINVGGNYWVMLGIDLVEGDITVWDSLQTATPLDALEKDLKPMCTILPAVLHHGEIFSVRS
NLQLVSWRVRRVRIPQ