CuGenDBv2

Spg023994 (gene) of Sponge gourd (cylindrica) v1 genome

Gene ID: Spg023994
Organism: Luffa cylindrica (Sponge gourd (cylindrica) v1)
Description: Ulp1-like peptidase
Genome location: scaffold13:9077011..9081707
RNA-Seq Expression: Spg023994
Synteny: Spg023994
Gene Ontology terms:
    GO:0006508 - proteolysis (biological process)
    GO:0003676 - nucleic acid binding (molecular function)
    GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
    GO:0008234 - cysteine-type peptidase activity (molecular function)
InterPro domains:
    IPR002156 - Ribonuclease H domain
    IPR003653 - Ulp1 protease family, C-terminal catalytic domain
    IPR038765 - Papain-like cysteine peptidase superfamily
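The genome location field uses a `scaffold:start..end` form. A minimal sketch of how it could be split into its parts is given here. The names `Location` and `parse_location` are hypothetical, and the length calculation assumes the coordinates are 1-based and inclusive, which this record does not state.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """A genomic interval as written in the record (hypothetical helper type)."""
    seqid: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumption: coordinates are 1-based and inclusive.
        return self.end - self.start + 1


def parse_location(text: str) -> Location:
    """Parse a string such as 'scaffold13:9077011..9081707'."""
    match = re.fullmatch(r"\s*([^:\s]+):(\d+)\.\.(\d+)\s*", text)
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    seqid, start, end = match.group(1), int(match.group(2)), int(match.group(3))
    if start > end:
        raise ValueError("start coordinate is greater than end coordinate")
    return Location(seqid, start, end)


loc = parse_location("scaffold13:9077011..9081707")
print(loc.seqid, loc.start, loc.end, loc.length)
```

Under the inclusive-coordinate assumption, the gene spans 4697 bases on scaffold13.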


Homology
GenBank top hits (e value, %identity, alignment)
XP_022148308.1 uncharacterized protein LOC111016993 [Momordica charantia] (e value: 5.2e-28, %identity: 34.12)
Query:  RITAYVVREKTWFRALLTPSKWMTDEAIDSIFMFIQKKLQERPELCRKKFTTSDMCVTVRTIGTTYSYVRFIVFDRVFIFVFFYDQNFLRREDGPYKELS
        R T Y ++ K WFR LL P  W T E +D +FM ++KKL++RP+LC +KFTT D+ +                             N+ RR+D  Y  + 
Subjt:  RITAYVVREKTWFRALLTPSKWMTDEAIDSIFMFIQKKLQERPELCRKKFTTSDMCVTVRTIGTTYSYVRFIVFDRVFIFVFFYDQNFLRREDGPYKELS

Query:  SGIH-PRDLT--YEW-SKLANVLKYEMGELADHNIPWSTVDAVYMSYNIGGLHWVLVCIDLEVGEVVVSDSLVALNKDEVVEQELKVLSQVVSALRWEIG
        S    P  +   Y+W  +  +++ Y  G   D+ + W  VDA+Y+ +NI G HWV+VCIDLE GE+VV DSL ++  D  +E  LKV+  ++  + ++  
Subjt:  SGIH-PRDLT--YEW-SKLANVLKYEMGELADHNIPWSTVDAVYMSYNIGGLHWVLVCIDLEVGEVVVSDSLVALNKDEVVEQELKVLSQVVSALRWEIG

Query:  VMDSRKDLSVE
        VM  + +L ++
Subjt:  VMDSRKDLSVE

XP_022154364.1 uncharacterized protein LOC111021646 [Momordica charantia] (e value: 1.2e-24, %identity: 34.93)
Query:  MFIQKKLQERPELCRKKFTTSDMCVTVRTIGTTYSYVRFIVFDRVFIFVFFYDQNFLRREDGPYKELSSG--IHPRDLT-YEWSKLA-NVLKYEMGELAD
        MF+  KL+ RP LCR+KFTT D+ ++                            NFLR  DG Y  + S   I  R  + Y+W   A ++L Y  G  +D
Subjt:  MFIQKKLQERPELCRKKFTTSDMCVTVRTIGTTYSYVRFIVFDRVFIFVFFYDQNFLRREDGPYKELSSG--IHPRDLT-YEWSKLA-NVLKYEMGELAD

Query:  HNIPWSTVDAVYMSYNIGGLHWVLVCIDLEVGEVVVSDSLVALNKDEVVEQELKVLSQVVSALRWEIGVMDSRKDLSVERWPLRWELSRPQQKRSGECSM
        ++  W  VDAVY+ YNIGG+HW+++CID + GE++V DS + +     +EQELK +  ++  L   +GV   + ++ +  W +R   S PQQ   G+C +
Subjt:  HNIPWSTVDAVYMSYNIGGLHWVLVCIDLEVGEVVVSDSLVALNKDEVVEQELKVLSQVVSALRWEIGVMDSRKDLSVERWPLRWELSRPQQKRSGECSM

Query:  FVCKYFEYD
        F   +FEYD
Subjt:  FVCKYFEYD

XP_022158807.1 uncharacterized protein LOC111025273 [Momordica charantia] (e value: 1.9e-30, %identity: 31.98)
Query:  LDSENLEDNVRITAYVVREKTWFRALLTPSKWMTDEAIDSIFMFIQKKLQERPELCRKKFTTSDMCVTVRTIGTTYSYVRFIVFDRVFIFVFFYDQNFLR
        +D  + +DN R T+  ++ K+WF  LL P   + DE IDS+ M   +K+++   L R +F   D+ ++                            N LR
Subjt:  LDSENLEDNVRITAYVVREKTWFRALLTPSKWMTDEAIDSIFMFIQKKLQERPELCRKKFTTSDMCVTVRTIGTTYSYVRFIVFDRVFIFVFFYDQNFLR

Query:  REDGPYKELSSGIHPRDLTYEWSKLANVLKYEMGELADHNIPWSTVDAVYMSYNIGGLHWVLVCIDLEVGEVVVSDSLVALNKDEVVEQELKVLSQVVSA
        R DGPY  +  G+ P   TY+W +   + +Y +G  +D++  WS  D VY   NIGG HWV++ IDL  G++ V DSL A+   E +E+ LK +  ++ A
Subjt:  REDGPYKELSSGIHPRDLTYEWSKLANVLKYEMGELADHNIPWSTVDAVYMSYNIGGLHWVLVCIDLEVGEVVVSDSLVALNKDEVVEQELKVLSQVVSA

Query:  LRWEIGVMDSRKDLSVERWPLRWELSRPQQKRSGECSMFVCKYFEYD
        +    G++  R +L +  W +R   + PQQ    +CS+F  ++FEYD
Subjt:  LRWEIGVMDSRKDLSVERWPLRWELSRPQQKRSGECSMFVCKYFEYD

XP_038882332.1 uncharacterized protein LOC120073583 [Benincasa hispida] (e value: 4.5e-24, %identity: 41.83)
Query:  DQNFLRREDGPYKELSSGIHPRDLTYEWSKLANVLKYEMGELADHNIPWSTVDAVYMSYNIGGLHWVLVCIDLEVGEVVVSDSLVALNKDEVVEQELKVL
        D+NFLRR+                T +WS    VLKY  G+  D+++PWS VDAVYM +N+ G+HWVLVC D +V E+++ DSL+AL+ +  +E E++++
Subjt:  DQNFLRREDGPYKELSSGIHPRDLTYEWSKLANVLKYEMGELADHNIPWSTVDAVYMSYNIGGLHWVLVCIDLEVGEVVVSDSLVALNKDEVVEQELKVL

Query:  SQVVSALRWEIGVMDSRKDLSVERWPLRWELSRPQQKRSGECSMFVCKYFEYD
         +    L     VM+S  +L ++RW LR +  R QQ  SG+C MF  K+FEYD
Subjt:  SQVVSALRWEIGVMDSRKDLSVERWPLRWELSRPQQKRSGECSMFVCKYFEYD

XP_038885861.1 sentrin-specific protease [Benincasa hispida] (e value: 6.3e-26, %identity: 45.74)
Query:  TYEWSKLANVLKYEMGELADHNIPWSTVDAVYMSYNIGGLHWVLVCIDLEVGEVVVSDSLVALNKDEVVEQELKVLSQVVSALRWEIGVMDSRKDLSVER
        T +WSK  NV+KY  G+  D+++PWS VDA+YM +N+  +HWVLVC+D +V E++V DSL+ L+ +  +E E++ L +    L     VM+S  +L ++R
Subjt:  TYEWSKLANVLKYEMGELADHNIPWSTVDAVYMSYNIGGLHWVLVCIDLEVGEVVVSDSLVALNKDEVVEQELKVLSQVVSALRWEIGVMDSRKDLSVER

Query:  WPLRWELSRPQQKRSGECSMFVCKYFEYD
        W LR +   PQQ +SG+C MF CK+FEYD
Subjt:  WPLRWELSRPQQKRSGECSMFVCKYFEYD

TrEMBL top hits (e value, %identity, alignment)
A0A1S4E5W8 uncharacterized protein LOC107992262 isoform X1 (e value: 3.0e-21, %identity: 31.52)
Query:  KCKTLKYNPLPNI-PHDLDEPFKRWLDSENLEDNVRITAYVVREKTWFRALLTPSKWMTDEAIDSIFMFIQKKLQERPELCRKKFTTSDMCVTV--RTIG
        +C+ + Y+P+  I   DLD   + W+  E   D VR T +  + K +FR L    +W+ DE +D +F+FI+ K++       + FTT +    V  R I 
Subjt:  KCKTLKYNPLPNI-PHDLDEPFKRWLDSENLEDNVRITAYVVREKTWFRALLTPSKWMTDEAIDSIFMFIQKKLQERPELCRKKFTTSDMCVTV--RTIG

Query:  TTYSYVRFIVFDRVFIFVFFYD-QNFLRREDGPYKELSSGIHPRDLTYEWSKLANVLKYEMGELADHNIPWSTVDAVYMSYNIGGLHWVLVCIDLEVGEV
        T   Y+R I   R  I V  +  Q  L  +   YKE        D   E+     ++ Y  G   D   PW++VD VY  +N+   HWV++C+DL   +V
Subjt:  TTYSYVRFIVFDRVFIFVFFYD-QNFLRREDGPYKELSSGIHPRDLTYEWSKLANVLKYEMGELADHNIPWSTVDAVYMSYNIGGLHWVLVCIDLEVGEV

Query:  VVSDSLVALNKDEVVEQELKVLSQVVSALRWEIGVMDSRKDLSVERWPLRWEL--SRPQQKRSGECSMFVCKYFEY
         V DSL +L   E +   L ++ ++V  L   IG    R   S  + PL   +  S P Q+ + +C +F  KYFEY
Subjt:  VVSDSLVALNKDEVVEQELKVLSQVVSALRWEIGVMDSRKDLSVERWPLRWEL--SRPQQKRSGECSMFVCKYFEY

A0A5A7TVI1 Ulp1-like peptidase (e value: 1.2e-19, %identity: 24.07)
Query:  ENLGTENTNEVV---EGVGMSVTNDRVCKRCKLLDGIENGIKELNGWMKGIEEDVKVIKSIEKYLKAIKKFMRRLSKYIDPDDGPNDGGARSESQSKGQD
        +N G++   EVV   +    S       K  K +  +++ +  + G +  I+ D+  +K +   +  I K +    K  + D   ++G      +SK  D
Subjt:  ENLGTENTNEVV---EGVGMSVTNDRVCKRCKLLDGIENGIKELNGWMKGIEEDVKVIKSIEKYLKAIKKFMRRLSKYIDPDDGPNDGGARSESQSKGQD

Query:  DDGGPASGSHAKAIDDTSMADHADPTGQPDAEQHGPVEEVDDPVEGVGKDEDTESGELVVGKDMHVAEGHNLLNKPIERRGTRKRKTAWKLRTP------
                 +AK  +D       +    P  ++H  V++  D      K +      + V  D  V E    L +    R  R+++ +  L TP      
Subjt:  DDGGPASGSHAKAIDDTSMADHADPTGQPDAEQHGPVEEVDDPVEGVGKDEDTESGELVVGKDMHVAEGHNLLNKPIERRGTRKRKTAWKLRTP------

Query:  WKDT-REDRKKCKTLKYNPLPNIPHDLDEPFKRWLDSENLEDNVRITAYVVREKTWFRALLTPSKWMTDEAIDSIFMFIQKKLQERPELCRKKFTTSDMC
        W  T      + + + Y+P+  I     +  + W+  +  +D +R T +  + K +FR L    +W+ DE +D++F+FI  K++       + FTT+D  
Subjt:  WKDT-REDRKKCKTLKYNPLPNIPHDLDEPFKRWLDSENLEDNVRITAYVVREKTWFRALLTPSKWMTDEAIDSIFMFIQKKLQERPELCRKKFTTSDMC

Query:  VTVRTIGTTYSYVRFIVFDRVFIFVFFYDQNFLRREDGPYKELSSGIHPRDLTYEWSKLANVLKYEMGELADHNIPWSTVDAVYMSYNIGGLHWVLVCID
                        +F R+ +  +             YKE      P    ++W +   ++ Y +G   D   PW++VD VY  +N+ G HWVL+C+D
Subjt:  VTVRTIGTTYSYVRFIVFDRVFIFVFFYDQNFLRREDGPYKELSSGIHPRDLTYEWSKLANVLKYEMGELADHNIPWSTVDAVYMSYNIGGLHWVLVCID

Query:  LEVGEVVVSDSLVALNKDEVVEQELKVLSQVVSALRWEIGVMDSRKDLSV--ERWPLRWELSRPQQKRSGECSMFVCKYFEY
        L   +V V DSL +L   E +   L  + Q+V  L +  G  D R   S   E WP+    S P Q+ + +C +F+ KYFEY
Subjt:  LEVGEVVVSDSLVALNKDEVVEQELKVLSQVVSALRWEIGVMDSRKDLSV--ERWPLRWELSRPQQKRSGECSMFVCKYFEY

A0A6J1D3R7 uncharacterized protein LOC111016993 (e value: 2.5e-28, %identity: 34.12)
Query:  RITAYVVREKTWFRALLTPSKWMTDEAIDSIFMFIQKKLQERPELCRKKFTTSDMCVTVRTIGTTYSYVRFIVFDRVFIFVFFYDQNFLRREDGPYKELS
        R T Y ++ K WFR LL P  W T E +D +FM ++KKL++RP+LC +KFTT D+ +                             N+ RR+D  Y  + 
Subjt:  RITAYVVREKTWFRALLTPSKWMTDEAIDSIFMFIQKKLQERPELCRKKFTTSDMCVTVRTIGTTYSYVRFIVFDRVFIFVFFYDQNFLRREDGPYKELS

Query:  SGIH-PRDLT--YEW-SKLANVLKYEMGELADHNIPWSTVDAVYMSYNIGGLHWVLVCIDLEVGEVVVSDSLVALNKDEVVEQELKVLSQVVSALRWEIG
        S    P  +   Y+W  +  +++ Y  G   D+ + W  VDA+Y+ +NI G HWV+VCIDLE GE+VV DSL ++  D  +E  LKV+  ++  + ++  
Subjt:  SGIH-PRDLT--YEW-SKLANVLKYEMGELADHNIPWSTVDAVYMSYNIGGLHWVLVCIDLEVGEVVVSDSLVALNKDEVVEQELKVLSQVVSALRWEIG

Query:  VMDSRKDLSVE
        VM  + +L ++
Subjt:  VMDSRKDLSVE

A0A6J1DLV0 uncharacterized protein LOC111021646 (e value: 5.8e-25, %identity: 34.93)
Query:  MFIQKKLQERPELCRKKFTTSDMCVTVRTIGTTYSYVRFIVFDRVFIFVFFYDQNFLRREDGPYKELSSG--IHPRDLT-YEWSKLA-NVLKYEMGELAD
        MF+  KL+ RP LCR+KFTT D+ ++                            NFLR  DG Y  + S   I  R  + Y+W   A ++L Y  G  +D
Subjt:  MFIQKKLQERPELCRKKFTTSDMCVTVRTIGTTYSYVRFIVFDRVFIFVFFYDQNFLRREDGPYKELSSG--IHPRDLT-YEWSKLA-NVLKYEMGELAD

Query:  HNIPWSTVDAVYMSYNIGGLHWVLVCIDLEVGEVVVSDSLVALNKDEVVEQELKVLSQVVSALRWEIGVMDSRKDLSVERWPLRWELSRPQQKRSGECSM
        ++  W  VDAVY+ YNIGG+HW+++CID + GE++V DS + +     +EQELK +  ++  L   +GV   + ++ +  W +R   S PQQ   G+C +
Subjt:  HNIPWSTVDAVYMSYNIGGLHWVLVCIDLEVGEVVVSDSLVALNKDEVVEQELKVLSQVVSALRWEIGVMDSRKDLSVERWPLRWELSRPQQKRSGECSM

Query:  FVCKYFEYD
        F   +FEYD
Subjt:  FVCKYFEYD

A0A6J1DY60 uncharacterized protein LOC111025273 (e value: 9.2e-31, %identity: 31.98)
Query:  LDSENLEDNVRITAYVVREKTWFRALLTPSKWMTDEAIDSIFMFIQKKLQERPELCRKKFTTSDMCVTVRTIGTTYSYVRFIVFDRVFIFVFFYDQNFLR
        +D  + +DN R T+  ++ K+WF  LL P   + DE IDS+ M   +K+++   L R +F   D+ ++                            N LR
Subjt:  LDSENLEDNVRITAYVVREKTWFRALLTPSKWMTDEAIDSIFMFIQKKLQERPELCRKKFTTSDMCVTVRTIGTTYSYVRFIVFDRVFIFVFFYDQNFLR

Query:  REDGPYKELSSGIHPRDLTYEWSKLANVLKYEMGELADHNIPWSTVDAVYMSYNIGGLHWVLVCIDLEVGEVVVSDSLVALNKDEVVEQELKVLSQVVSA
        R DGPY  +  G+ P   TY+W +   + +Y +G  +D++  WS  D VY   NIGG HWV++ IDL  G++ V DSL A+   E +E+ LK +  ++ A
Subjt:  REDGPYKELSSGIHPRDLTYEWSKLANVLKYEMGELADHNIPWSTVDAVYMSYNIGGLHWVLVCIDLEVGEVVVSDSLVALNKDEVVEQELKVLSQVVSA

Query:  LRWEIGVMDSRKDLSVERWPLRWELSRPQQKRSGECSMFVCKYFEYD
        +    G++  R +L +  W +R   + PQQ    +CS+F  ++FEYD
Subjt:  LRWEIGVMDSRKDLSVERWPLRWELSRPQQKRSGECSMFVCKYFEYD

SwissProt top hits (e value, %identity, alignment)
No hits found

Arabidopsis top hits (e value, %identity, alignment)
No hits found

Sequences
CDS sequence
ATGGCTTCCCATACGCGTTTCAGGCTAAGAGTCACAACTGAACTTATGGCCTCAGAACAGGAGATCCAATTTATGGATCGTGTGATGCAGTCACCTCGTGCACCATCTCC
ACCTCCATCTCCACCTCCACTCTCACCTCCACCTCCACCCCCAACAGCTTTAGAAGATAATCCAATTGAAGATACTATGGTTGAGAATCTCGGGACTGAGAATACGAATG
AAGTGGTAGAGGGTGTTGGGATGTCTGTTACGAATGACAGAGTCTGCAAGAGGTGCAAACTCCTTGATGGTATCGAAAATGGAATCAAGGAGTTGAATGGGTGGATGAAA
GGGATTGAAGAGGATGTGAAGGTGATTAAGTCTATTGAAAAATACCTCAAAGCGATAAAGAAGTTCATGCGTCGATTGTCGAAGTATATAGATCCTGATGACGGTCCGAA
TGATGGTGGTGCTAGATCCGAATCACAATCGAAAGGTCAGGATGATGATGGTGGTCCTGCATCCGGGTCTCACGCAAAAGCAATTGACGACACCTCGATGGCTGACCATG
CGGATCCGACTGGTCAACCCGACGCAGAACAACATGGTCCAGTCGAGGAAGTAGATGACCCGGTAGAGGGTGTGGGGAAGGACGAAGACACTGAAAGTGGTGAACTTGTA
GTGGGAAAGGATATGCATGTTGCCGAAGGTCATAATTTGTTGAACAAGCCCATAGAAAGACGGGGGACTCGTAAGAGGAAGACTGCATGGAAGTTGAGAACTCCATGGAA
AGACACACGGGAAGACCGGAAGAAATGCAAGACCCTGAAGTATAATCCTCTTCCCAATATCCCCCATGATCTAGATGAGCCTTTCAAAAGATGGCTTGATAGTGAGAACC
TTGAAGACAATGTCCGGATAACTGCATATGTTGTTCGAGAAAAGACGTGGTTTCGTGCCCTTCTCACTCCATCGAAATGGATGACTGATGAAGCTATTGACTCGATCTTC
ATGTTCATCCAGAAGAAACTGCAAGAACGACCAGAGTTATGCCGCAAGAAGTTCACCACTTCAGACATGTGTGTAACGGTACGTACCATTGGAACAACATATTCATATGT
TCGTTTCATAGTGTTCGATAGGGTTTTTATTTTTGTATTTTTTTATGATCAAAATTTTTTAAGACGCGAAGATGGTCCGTACAAAGAATTGAGTAGCGGCATACACCCCC
GAGACTTGACGTACGAATGGAGCAAGCTAGCTAACGTCTTGAAGTACGAGATGGGCGAGCTTGCAGACCATAACATCCCATGGAGCACGGTTGATGCAGTGTACATGTCG
TACAACATCGGTGGTCTTCATTGGGTGTTGGTCTGCATTGACTTGGAGGTAGGGGAGGTGGTGGTATCAGATTCCCTGGTGGCGTTGAACAAGGACGAGGTGGTGGAGCA
GGAGTTAAAGGTCCTTAGCCAAGTCGTGTCGGCTCTACGTTGGGAGATAGGGGTCATGGATTCGAGGAAGGATCTCTCCGTTGAAAGATGGCCTCTGCGTTGGGAATTGT
CAAGGCCGCAGCAGAAACGTAGTGGCGAATGCAGCATGTTTGTCTGTAAATACTTTGAATATGATGACTTCCAAACGGTAAATGTTGGGTCTCAGGCTCATAGGACCAGT
GGTAATGTAGTTGCAGCTATGCATGGACATTATCCGCCGTTACTGAACCCTCTCTGTGCAGAGGCTCGTGCGGTTTTGGAAGGTATTAGGCTAGCCAATAGAATGAAGAT
ATCGAATGTAACAATTTTCTTTGATTCTCTTGGCCTTATTTCCATGATTAATGGTTCAATTGAGGATGAGGTTAAGGTTCATGCGTTATTATGGGATACCTGGTCAAACA
AGAGGACATTAAAGAAGTAG
mRNA sequence
ATGGCTTCCCATACGCGTTTCAGGCTAAGAGTCACAACTGAACTTATGGCCTCAGAACAGGAGATCCAATTTATGGATCGTGTGATGCAGTCACCTCGTGCACCATCTCC
ACCTCCATCTCCACCTCCACTCTCACCTCCACCTCCACCCCCAACAGCTTTAGAAGATAATCCAATTGAAGATACTATGGTTGAGAATCTCGGGACTGAGAATACGAATG
AAGTGGTAGAGGGTGTTGGGATGTCTGTTACGAATGACAGAGTCTGCAAGAGGTGCAAACTCCTTGATGGTATCGAAAATGGAATCAAGGAGTTGAATGGGTGGATGAAA
GGGATTGAAGAGGATGTGAAGGTGATTAAGTCTATTGAAAAATACCTCAAAGCGATAAAGAAGTTCATGCGTCGATTGTCGAAGTATATAGATCCTGATGACGGTCCGAA
TGATGGTGGTGCTAGATCCGAATCACAATCGAAAGGTCAGGATGATGATGGTGGTCCTGCATCCGGGTCTCACGCAAAAGCAATTGACGACACCTCGATGGCTGACCATG
CGGATCCGACTGGTCAACCCGACGCAGAACAACATGGTCCAGTCGAGGAAGTAGATGACCCGGTAGAGGGTGTGGGGAAGGACGAAGACACTGAAAGTGGTGAACTTGTA
GTGGGAAAGGATATGCATGTTGCCGAAGGTCATAATTTGTTGAACAAGCCCATAGAAAGACGGGGGACTCGTAAGAGGAAGACTGCATGGAAGTTGAGAACTCCATGGAA
AGACACACGGGAAGACCGGAAGAAATGCAAGACCCTGAAGTATAATCCTCTTCCCAATATCCCCCATGATCTAGATGAGCCTTTCAAAAGATGGCTTGATAGTGAGAACC
TTGAAGACAATGTCCGGATAACTGCATATGTTGTTCGAGAAAAGACGTGGTTTCGTGCCCTTCTCACTCCATCGAAATGGATGACTGATGAAGCTATTGACTCGATCTTC
ATGTTCATCCAGAAGAAACTGCAAGAACGACCAGAGTTATGCCGCAAGAAGTTCACCACTTCAGACATGTGTGTAACGGTACGTACCATTGGAACAACATATTCATATGT
TCGTTTCATAGTGTTCGATAGGGTTTTTATTTTTGTATTTTTTTATGATCAAAATTTTTTAAGACGCGAAGATGGTCCGTACAAAGAATTGAGTAGCGGCATACACCCCC
GAGACTTGACGTACGAATGGAGCAAGCTAGCTAACGTCTTGAAGTACGAGATGGGCGAGCTTGCAGACCATAACATCCCATGGAGCACGGTTGATGCAGTGTACATGTCG
TACAACATCGGTGGTCTTCATTGGGTGTTGGTCTGCATTGACTTGGAGGTAGGGGAGGTGGTGGTATCAGATTCCCTGGTGGCGTTGAACAAGGACGAGGTGGTGGAGCA
GGAGTTAAAGGTCCTTAGCCAAGTCGTGTCGGCTCTACGTTGGGAGATAGGGGTCATGGATTCGAGGAAGGATCTCTCCGTTGAAAGATGGCCTCTGCGTTGGGAATTGT
CAAGGCCGCAGCAGAAACGTAGTGGCGAATGCAGCATGTTTGTCTGTAAATACTTTGAATATGATGACTTCCAAACGGTAAATGTTGGGTCTCAGGCTCATAGGACCAGT
GGTAATGTAGTTGCAGCTATGCATGGACATTATCCGCCGTTACTGAACCCTCTCTGTGCAGAGGCTCGTGCGGTTTTGGAAGGTATTAGGCTAGCCAATAGAATGAAGAT
ATCGAATGTAACAATTTTCTTTGATTCTCTTGGCCTTATTTCCATGATTAATGGTTCAATTGAGGATGAGGTTAAGGTTCATGCGTTATTATGGGATACCTGGTCAAACA
AGAGGACATTAAAGAAGTAG
Protein sequence
MASHTRFRLRVTTELMASEQEIQFMDRVMQSPRAPSPPPSPPPLSPPPPPPTALEDNPIEDTMVENLGTENTNEVVEGVGMSVTNDRVCKRCKLLDGIENGIKELNGWMK
GIEEDVKVIKSIEKYLKAIKKFMRRLSKYIDPDDGPNDGGARSESQSKGQDDDGGPASGSHAKAIDDTSMADHADPTGQPDAEQHGPVEEVDDPVEGVGKDEDTESGELV
VGKDMHVAEGHNLLNKPIERRGTRKRKTAWKLRTPWKDTREDRKKCKTLKYNPLPNIPHDLDEPFKRWLDSENLEDNVRITAYVVREKTWFRALLTPSKWMTDEAIDSIF
MFIQKKLQERPELCRKKFTTSDMCVTVRTIGTTYSYVRFIVFDRVFIFVFFYDQNFLRREDGPYKELSSGIHPRDLTYEWSKLANVLKYEMGELADHNIPWSTVDAVYMS
YNIGGLHWVLVCIDLEVGEVVVSDSLVALNKDEVVEQELKVLSQVVSALRWEIGVMDSRKDLSVERWPLRWELSRPQQKRSGECSMFVCKYFEYDDFQTVNVGSQAHRTS
GNVVAAMHGHYPPLLNPLCAEARAVLEGIRLANRMKISNVTIFFDSLGLISMINGSIEDEVKVHALLWDTWSNKRTLKK
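
The CDS and protein sequences above can be checked against each other by translating the CDS. The following is a minimal sketch, assuming the standard genetic code (NCBI translation table 1) applies and that the CDS starts in frame. The function name `translate` is hypothetical and is not part of any CuGenDB tooling.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame CDS, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # X marks an unknown codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 24 bases of the CDS listed above.
print(translate("ATGGCTTCCCATACGCGTTTCAGG"))  # prints MASHTRFR
```

Passing the full CDS text (line breaks included) to `translate` and comparing the result with the protein sequence is one way to confirm that the two listings are consistent.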