CuGenDBv2

Lag0001133 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0001133
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Ulp1-like peptidase
Genome location: chr4:25032718..25034273
RNA-Seq Expression: Lag0001133
Synteny: Lag0001133
Gene Ontology terms: GO:0006508 - proteolysis (biological process)
    GO:0008234 - cysteine-type peptidase activity (molecular function)
InterPro domains: IPR003653 - Ulp1 protease family, C-terminal catalytic domain
    IPR038765 - Papain-like cysteine peptidase superfamily


Homology
GenBank top hits (columns: hit, e value, % identity, alignment)
XP_022148137.1 uncharacterized protein LOC111016890 [Momordica charantia] | e value: 1.1e-31 | % identity: 47.37
Query:  ITYDWSTANTVIDYVLGRHSDHDTHWSTVDAIYMPLNLGGNHWVMVCVDLLVGKLTVLDSFIALTSDATLKKELSTLATVLSLLLFKCDVMKAKPHLPVH
        I Y W   NT+  YV GR SDH+  WS  D +Y P+N+GGNHWVM+ +DL+ G +TV DS    T    L+KEL  + T+L  LL    +   +P LPV 
Subjt:  ITYDWSTANTVIDYVLGRHSDHDTHWSTVDAIYMPLNLGGNHWVMVCVDLLVGKLTVLDSFIALTSDATLKKELSTLATVLSLLLFKCDVMKAKPHLPVH

Query:  EWEIHRDSSVPQQTNGGDCGMFAVKFFEYDVTGSEINTLNQDRINFCRRQFA
         W + R   VPQQ++  DCG+F V++FEYD TGS ++TL QD I + RRQ+A
Subjt:  EWEIHRDSSVPQQTNGGDCGMFAVKFFEYDVTGSEINTLNQDRINFCRRQFA

XP_022153247.1 uncharacterized protein LOC111020782 [Momordica charantia] | e value: 2.3e-32 | % identity: 47.68
Query:  YDWSTANTVIDYVLGRHSDHDTHWSTVDAIYMPLNLGGNHWVMVCVDLLVGKLTVLDSFIALTSDATLKKELSTLATVLSLLLFKCDVMKAKPHLPVHEW
        YDW    T+  YVLGR SD+DT WS  D +Y P+N+GGNHWVM+ +DL+ G LTV DS  A+T    L+K L  + T++ ++L    ++  +P L    W
Subjt:  YDWSTANTVIDYVLGRHSDHDTHWSTVDAIYMPLNLGGNHWVMVCVDLLVGKLTVLDSFIALTSDATLKKELSTLATVLSLLLFKCDVMKAKPHLPVHEW

Query:  EIHRDSSVPQQTNGGDCGMFAVKFFEYDVTGSEINTLNQDRINFCRRQFAV
         + R  +VPQQ    DCG+F V+FFEYDVTGS+++TL Q  I+  RRQ+AV
Subjt:  EIHRDSSVPQQTNGGDCGMFAVKFFEYDVTGSEINTLNQDRINFCRRQFAV

XP_022154364.1 uncharacterized protein LOC111021646 [Momordica charantia] | e value: 8.3e-43 | % identity: 43.96
Query:  MFVRKKLQQRADLCRWKFVTADIVVTDFLRRSDDIAEELKKVQDPSLIT------YDW-STANTVIDYVLGRHSDHDTHWSTVDAIYMPLNLGGNHWVMV
        MFV  KL+ R +LCR KF T D+++++FLR +D +      +Q P++I       YDW   A +++ Y+ G HSD+DT W  VDA+Y+P N+GG HW+++
Subjt:  MFVRKKLQQRADLCRWKFVTADIVVTDFLRRSDDIAEELKKVQDPSLIT------YDW-STANTVIDYVLGRHSDHDTHWSTVDAIYMPLNLGGNHWVMV

Query:  CVDLLVGKLTVLDSFIALTSDATLKKELSTLATVLSLLLFKCDVMKAKPHLPVHEWEIHRDSSVPQQTNGGDCGMFAVKFFEYDVTGSEINTLNQDRINF
        C+D   G+L V DSF+ +T    L++EL  + T++  L+ +  V   KP++P+  W I R SS PQQ   GDCG+F + FFEYDVT    +TL Q R++F
Subjt:  CVDLLVGKLTVLDSFIALTSDATLKKELSTLATVLSLLLFKCDVMKAKPHLPVHEWEIHRDSSVPQQTNGGDCGMFAVKFFEYDVTGSEINTLNQDRINF

Query:  CRRQFAV
         RRQFAV
Subjt:  CRRQFAV

XP_022158807.1 uncharacterized protein LOC111025273 [Momordica charantia] | e value: 5.9e-41 | % identity: 43.41
Query:  IDSLFMFVRKKLQQRADLCRWKFVTADIVVTDFLRRSDDIAEELKKVQDPSLITYDWSTANTVIDYVLGRHSDHDTHWSTVDAIYMPLNLGGNHWVMVCV
        IDSL M   +K+++   L R +F   D+++++ LRR+D     +K    PS  TYDW    T+  YVLGR SD+DT WS  D +Y  +N+GGNHWVM+ +
Subjt:  IDSLFMFVRKKLQQRADLCRWKFVTADIVVTDFLRRSDDIAEELKKVQDPSLITYDWSTANTVIDYVLGRHSDHDTHWSTVDAIYMPLNLGGNHWVMVCV

Query:  DLLVGKLTVLDSFIALTSDATLKKELSTLATVLSLLLFKCDVMKAKPHLPVHEWEIHRDSSVPQQTNGGDCGMFAVKFFEYDVTGSEINTLNQDRINFCR
        DL+ G LTV DS  A+T    L+K L  + T++  +L    ++  +P+LP+  W + R  +VPQQ    DC +F V+FFEYDV GS+I+TL Q  I+  R
Subjt:  DLLVGKLTVLDSFIALTSDATLKKELSTLATVLSLLLFKCDVMKAKPHLPVHEWEIHRDSSVPQQTNGGDCGMFAVKFFEYDVTGSEINTLNQDRINFCR

Query:  RQFAV
        RQ+AV
Subjt:  RQFAV

XP_038885861.1 sentrin-specific protease [Benincasa hispida] | e value: 3.2e-34 | % identity: 50.66
Query:  TYDWSTANTVIDYVLGRHSDHDTHWSTVDAIYMPLNLGGNHWVMVCVDLLVGKLTVLDSFIALTSDATLKKELSTLATVLSLLLFKCDVMKAKPHLPVHE
        T DWS    V+ YV G+H+D+D  WS VDAIYMP NL   HWV+VCVD  V +L V DS I L  +A L+ E+ +L      LL    VM++  +L +  
Subjt:  TYDWSTANTVIDYVLGRHSDHDTHWSTVDAIYMPLNLGGNHWVMVCVDLLVGKLTVLDSFIALTSDATLKKELSTLATVLSLLLFKCDVMKAKPHLPVHE

Query:  WEIHRDSSVPQQTNGGDCGMFAVKFFEYDVTGSEINTLNQDRINFCRRQFAV
        W + RD+ VPQQ   GDCGMF  KFFEYDVTGS+++TL QDR+ + RRQ+A+
Subjt:  WEIHRDSSVPQQTNGGDCGMFAVKFFEYDVTGSEINTLNQDRINFCRRQFAV

TrEMBL top hits (columns: hit, e value, % identity, alignment)
A0A6J1D492 uncharacterized protein LOC111016890 | e value: 5.4e-32 | % identity: 47.37
Query:  ITYDWSTANTVIDYVLGRHSDHDTHWSTVDAIYMPLNLGGNHWVMVCVDLLVGKLTVLDSFIALTSDATLKKELSTLATVLSLLLFKCDVMKAKPHLPVH
        I Y W   NT+  YV GR SDH+  WS  D +Y P+N+GGNHWVM+ +DL+ G +TV DS    T    L+KEL  + T+L  LL    +   +P LPV 
Subjt:  ITYDWSTANTVIDYVLGRHSDHDTHWSTVDAIYMPLNLGGNHWVMVCVDLLVGKLTVLDSFIALTSDATLKKELSTLATVLSLLLFKCDVMKAKPHLPVH

Query:  EWEIHRDSSVPQQTNGGDCGMFAVKFFEYDVTGSEINTLNQDRINFCRRQFA
         W + R   VPQQ++  DCG+F V++FEYD TGS ++TL QD I + RRQ+A
Subjt:  EWEIHRDSSVPQQTNGGDCGMFAVKFFEYDVTGSEINTLNQDRINFCRRQFA

A0A6J1DID7 uncharacterized protein LOC111020782 | e value: 1.1e-32 | % identity: 47.68
Query:  YDWSTANTVIDYVLGRHSDHDTHWSTVDAIYMPLNLGGNHWVMVCVDLLVGKLTVLDSFIALTSDATLKKELSTLATVLSLLLFKCDVMKAKPHLPVHEW
        YDW    T+  YVLGR SD+DT WS  D +Y P+N+GGNHWVM+ +DL+ G LTV DS  A+T    L+K L  + T++ ++L    ++  +P L    W
Subjt:  YDWSTANTVIDYVLGRHSDHDTHWSTVDAIYMPLNLGGNHWVMVCVDLLVGKLTVLDSFIALTSDATLKKELSTLATVLSLLLFKCDVMKAKPHLPVHEW

Query:  EIHRDSSVPQQTNGGDCGMFAVKFFEYDVTGSEINTLNQDRINFCRRQFAV
         + R  +VPQQ    DCG+F V+FFEYDVTGS+++TL Q  I+  RRQ+AV
Subjt:  EIHRDSSVPQQTNGGDCGMFAVKFFEYDVTGSEINTLNQDRINFCRRQFAV

A0A6J1DLV0 uncharacterized protein LOC111021646 | e value: 4.0e-43 | % identity: 43.96
Query:  MFVRKKLQQRADLCRWKFVTADIVVTDFLRRSDDIAEELKKVQDPSLIT------YDW-STANTVIDYVLGRHSDHDTHWSTVDAIYMPLNLGGNHWVMV
        MFV  KL+ R +LCR KF T D+++++FLR +D +      +Q P++I       YDW   A +++ Y+ G HSD+DT W  VDA+Y+P N+GG HW+++
Subjt:  MFVRKKLQQRADLCRWKFVTADIVVTDFLRRSDDIAEELKKVQDPSLIT------YDW-STANTVIDYVLGRHSDHDTHWSTVDAIYMPLNLGGNHWVMV

Query:  CVDLLVGKLTVLDSFIALTSDATLKKELSTLATVLSLLLFKCDVMKAKPHLPVHEWEIHRDSSVPQQTNGGDCGMFAVKFFEYDVTGSEINTLNQDRINF
        C+D   G+L V DSF+ +T    L++EL  + T++  L+ +  V   KP++P+  W I R SS PQQ   GDCG+F + FFEYDVT    +TL Q R++F
Subjt:  CVDLLVGKLTVLDSFIALTSDATLKKELSTLATVLSLLLFKCDVMKAKPHLPVHEWEIHRDSSVPQQTNGGDCGMFAVKFFEYDVTGSEINTLNQDRINF

Query:  CRRQFAV
         RRQFAV
Subjt:  CRRQFAV

A0A6J1DQZ3 uncharacterized protein LOC111023442 | e value: 1.6e-31 | % identity: 46.04
Query:  YVLGRHSDHDTHWSTVDAIYMPLNLGGNHWVMVCVDLLVGKLTVLDSFIALTSDATLKKELSTLATVLSLLLFKCDVMKAKPHLPVHEWEIHRDSSVPQQ
        Y+   HSD+   W  V+A+Y+P N+ GNHWVM+C+D + G++ V DS  A+TS A+L+++L  + TV+  LL K  V+  +P+LP+  W I R +S P+Q
Subjt:  YVLGRHSDHDTHWSTVDAIYMPLNLGGNHWVMVCVDLLVGKLTVLDSFIALTSDATLKKELSTLATVLSLLLFKCDVMKAKPHLPVHEWEIHRDSSVPQQ

Query:  TNGGDCGMFAVKFFEYDVTGSEINTLNQDRINFCRRQFA
         + GDCG+F VK+FEYDVT + + TL Q+ +++ RRQFA
Subjt:  TNGGDCGMFAVKFFEYDVTGSEINTLNQDRINFCRRQFA

A0A6J1DY60 uncharacterized protein LOC111025273 | e value: 2.9e-41 | % identity: 43.41
Query:  IDSLFMFVRKKLQQRADLCRWKFVTADIVVTDFLRRSDDIAEELKKVQDPSLITYDWSTANTVIDYVLGRHSDHDTHWSTVDAIYMPLNLGGNHWVMVCV
        IDSL M   +K+++   L R +F   D+++++ LRR+D     +K    PS  TYDW    T+  YVLGR SD+DT WS  D +Y  +N+GGNHWVM+ +
Subjt:  IDSLFMFVRKKLQQRADLCRWKFVTADIVVTDFLRRSDDIAEELKKVQDPSLITYDWSTANTVIDYVLGRHSDHDTHWSTVDAIYMPLNLGGNHWVMVCV

Query:  DLLVGKLTVLDSFIALTSDATLKKELSTLATVLSLLLFKCDVMKAKPHLPVHEWEIHRDSSVPQQTNGGDCGMFAVKFFEYDVTGSEINTLNQDRINFCR
        DL+ G LTV DS  A+T    L+K L  + T++  +L    ++  +P+LP+  W + R  +VPQQ    DC +F V+FFEYDV GS+I+TL Q  I+  R
Subjt:  DLLVGKLTVLDSFIALTSDATLKKELSTLATVLSLLLFKCDVMKAKPHLPVHEWEIHRDSSVPQQTNGGDCGMFAVKFFEYDVTGSEINTLNQDRINFCR

Query:  RQFAV
        RQ+AV
Subjt:  RQFAV

SwissProt top hits (columns: hit, e value, % identity, alignment)
P59110 Sentrin-specific protease 1 | e value: 1.1e-05 | % identity: 28.23
Query:  TVDAIYMPLNLGGNHWVMVCVDLLVGKLTVLDSFIALTSDATLKKELSTLATVLSLLLFKCDVMKAKPHLPVHEWEI--HRDSSVPQQTNGGDCGMFAVK
        +VD + +P++L G HW +  VD     +T  DS   + ++A           +L   L +  V K +     + W++   +   +PQQ NG DCGMFA K
Subjt:  TVDAIYMPLNLGGNHWVMVCVDLLVGKLTVLDSFIALTSDATLKKELSTLATVLSLLLFKCDVMKAKPHLPVHEWEI--HRDSSVPQQTNGGDCGMFAVK

Query:  FFEYDVTGSEINTLNQDRINFCRR
        + +       IN   Q    F +R
Subjt:  FFEYDVTGSEINTLNQDRINFCRR

Q5RBB1 Sentrin-specific protease 1 | e value: 1.1e-05 | % identity: 27.42
Query:  TVDAIYMPLNLGGNHWVMVCVDLLVGKLTVLDSFIALTSDATLKKELSTLATVLSLLLFKCDVMKAKPHLPVHEWEI--HRDSSVPQQTNGGDCGMFAVK
        +VD + +P++L G HW +  VD     +T  DS   + ++A           +L   L +  + K +     + W++   +   +PQQ NG DCGMFA K
Subjt:  TVDAIYMPLNLGGNHWVMVCVDLLVGKLTVLDSFIALTSDATLKKELSTLATVLSLLLFKCDVMKAKPHLPVHEWEI--HRDSSVPQQTNGGDCGMFAVK

Query:  FFEYDVTGSEINTLNQDRINFCRR
        + +       IN   Q    F +R
Subjt:  FFEYDVTGSEINTLNQDRINFCRR

Q8GYL3 Ubiquitin-like-specific protease 1A | e value: 2.1e-04 | % identity: 27.54
Query:  HWSTVDAIYMPLNLGGNHWVMVCVDLLVGKLTVLDSFIALTSDATLKKELSTLATVLSLLLFKCDVM-KAKPHLPVHEWEIHRDSSVPQQTNGGDCGMFA
        H    D I++P+++   HW +  +++   K   LDSF          K L  LA       F  +V  K++  L V  W       +P Q NG DCGMF 
Subjt:  HWSTVDAIYMPLNLGGNHWVMVCVDLLVGKLTVLDSFIALTSDATLKKELSTLATVLSLLLFKCDVM-KAKPHLPVHEWEIHRDSSVPQQTNGGDCGMFA

Query:  VKFFEYDVTGSEINTLNQDRINFCRRQFAVSNLGQQAD
        VK+ ++   G ++    Q+++ + R + A   L  +A+
Subjt:  VKFFEYDVTGSEINTLNQDRINFCRRQFAVSNLGQQAD

Q9P0U3 Sentrin-specific protease 1 | e value: 1.1e-05 | % identity: 27.42
Query:  TVDAIYMPLNLGGNHWVMVCVDLLVGKLTVLDSFIALTSDATLKKELSTLATVLSLLLFKCDVMKAKPHLPVHEWEI--HRDSSVPQQTNGGDCGMFAVK
        +VD + +P++L G HW +  VD     +T  DS   + ++A           +L   L +  + K +     + W++   +   +PQQ NG DCGMFA K
Subjt:  TVDAIYMPLNLGGNHWVMVCVDLLVGKLTVLDSFIALTSDATLKKELSTLATVLSLLLFKCDVMKAKPHLPVHEWEI--HRDSSVPQQTNGGDCGMFAVK

Query:  FFEYDVTGSEINTLNQDRINFCRR
        + +       IN   Q    F +R
Subjt:  FFEYDVTGSEINTLNQDRINFCRR

Arabidopsis top hits (columns: hit, e value, % identity, alignment)
AT2G07240.1 cysteine-type peptidases;cysteine-type peptidases | e value: 6.2e-12 | % identity: 27.93
Query:  RVIDSLFMFVRKKLQQRADLCRWKFVTADIVVTDFLRRSDDIAEELKKVQDPSLITYDWSTANTVIDYVLGR-HSDHDTHWSTVDAIYMPLNLGGNHWVM
        +V+D L  F R  L  R D    + +  D++ + F+ +   +  +  K   P     D+   + ++D ++G   S+    ++  D +YMP N    HWV 
Subjt:  RVIDSLFMFVRKKLQQRADLCRWKFVTADIVVTDFLRRSDDIAEELKKVQDPSLITYDWSTANTVIDYVLGR-HSDHDTHWSTVDAIYMPLNLGGNHWVM

Query:  VCVDLLVGKLTVLDSFIALTSDATLKKELSTLATVLSLLLFKCDVMKAKPHLPVHEWEIHRDSSVPQQTNGGDCGMFAV
        +CVDL   K+T+LDS I L  DA L  EL  LA +L  L  +     +   + +  + + R   +PQ ++  D G+ +V
Subjt:  VCVDLLVGKLTVLDSFIALTSDATLKKELSTLATVLSLLLFKCDVMKAKPHLPVHEWEIHRDSSVPQQTNGGDCGMFAV

AT3G06910.1 UB-like protease 1A | e value: 1.5e-05 | % identity: 27.54
Query:  HWSTVDAIYMPLNLGGNHWVMVCVDLLVGKLTVLDSFIALTSDATLKKELSTLATVLSLLLFKCDVM-KAKPHLPVHEWEIHRDSSVPQQTNGGDCGMFA
        H    D I++P+++   HW +  +++   K   LDSF          K L  LA       F  +V  K++  L V  W       +P Q NG DCGMF 
Subjt:  HWSTVDAIYMPLNLGGNHWVMVCVDLLVGKLTVLDSFIALTSDATLKKELSTLATVLSLLLFKCDVM-KAKPHLPVHEWEIHRDSSVPQQTNGGDCGMFA

Query:  VKFFEYDVTGSEINTLNQDRINFCRRQFAVSNLGQQAD
        VK+ ++   G ++    Q+++ + R + A   L  +A+
Subjt:  VKFFEYDVTGSEINTLNQDRINFCRRQFAVSNLGQQAD

AT4G08430.1 Ulp1 protease family protein | e value: 1.0e-06 | % identity: 25.36
Query:  VDAIYMPLNLGGNHWVMVCVDLLVGKLTVLDSFIALTSDATLKKELSTLATVLSLLLFKCDVMKAKPHLPVHEWEIHRDSSVPQQTNGGDCGMFAVKFFE
        VD +Y  L + GNHWV + +DL   ++ V DS  +LT+D  +  +   + T++  +L      K +      + E  R + +P+  +  DC ++++K+ E
Subjt:  VDAIYMPLNLGGNHWVMVCVDLLVGKLTVLDSFIALTSDATLKKELSTLATVLSLLLFKCDVMKAKPHLPVHEWEIHRDSSVPQQTNGGDCGMFAVKFFE

Query:  YDVTGSEINTLNQDRINFCRRQFAV---SNLGQQADIL
            G   + L  + +     + AV     LG+ A  L
Subjt:  YDVTGSEINTLNQDRINFCRRQFAV---SNLGQQADIL

AT4G15880.1 Cysteine proteinases superfamily protein | e value: 2.1e-04 | % identity: 24.24
Query:  DAIYMPLNLGGNHWVMVCVDLLVGKLTVLDSFIALTSDATLKKELSTLATVLSLLLFKCDVMKAKPHLPVHEWEIHRDSSVPQQTNGGDCGMFAVKFFEY
        D I++P++  G HW +  ++    KL  LDS   +            +   L+  +      K+   +  + W++     +PQQ NG DCGMF +K+ ++
Subjt:  DAIYMPLNLGGNHWVMVCVDLLVGKLTVLDSFIALTSDATLKKELSTLATVLSLLLFKCDVMKAKPHLPVHEWEIHRDSSVPQQTNGGDCGMFAVKFFEY

Query:  DVTGSEINTLNQDRINFCRRQFAVSNLGQQAD
           G  +   +Q+ + + R + A   L  +AD
Subjt:  DVTGSEINTLNQDRINFCRRQFAVSNLGQQAD

AT5G45570.1 Ulp1 protease family protein | e value: 2.4e-08 | % identity: 26.09
Query:  VDAIYMPLNLGGNHWVMVCVDLLVGKLTVLDSFIALTSDATLKKELSTLATVLSLLLFKCDVMKAKPHLPVHEWEIHRDSSVPQQTNGGDCGMFAVKFFE
        VD +Y  L + GNHWV + +DL   ++ V DS  +LT+D  +  +   + T++  +L      K +      + E  R + +P+  + GDC ++++K+ E
Subjt:  VDAIYMPLNLGGNHWVMVCVDLLVGKLTVLDSFIALTSDATLKKELSTLATVLSLLLFKCDVMKAKPHLPVHEWEIHRDSSVPQQTNGGDCGMFAVKFFE

Query:  YDVTGSEINTLNQDRINFCRRQFAV---SNLGQQADIL
            G   + L  + +   R + AV     +G+ A  L
Subjt:  YDVTGSEINTLNQDRINFCRRQFAV---SNLGQQADIL


Sequences
CDS sequence
ATGGAGTCTGTGGGTTTGATGGAGTTGGGAGATGATTGGGTATTGTTGAGTTTGGAGATGATTGGGTTTTATGGAGTTTGTGGGTTGTATGGAATTTATGAAAGAGGGAG
AGGGTGTGGTTTGGAGTGGAAGGAACCGTCGGCACCGGAGAAGACGGAAATTCCCGGTCACTTCCCGGTCGGGATCGAAACCGGGAAGGAACCGGTTTTCCCGCAACTGT
CCGATTTTCCCGTAGGTCCCTACGGGAAAACCGGTTCTTCCTTTACGGGAAAACCGGCTTTCCGGGTCATTGACTCACTTTTTATGTTCGTCCGGAAGAAACTGCAACAG
CGGGCAGACTTATGTCGTTGGAAGTTTGTCACTGCAGATATTGTTGTTACCGATTTTCTGAGGCGTAGCGACGACATAGCTGAAGAGTTGAAGAAAGTGCAAGATCCTTC
GTTGATTACATACGACTGGAGTACGGCCAATACTGTGATAGACTACGTTTTGGGTCGACACTCGGACCACGATACACATTGGAGTACAGTTGATGCGATCTACATGCCAT
TGAACCTTGGGGGGAACCATTGGGTTATGGTATGTGTTGATCTCCTAGTGGGCAAGTTGACCGTCCTCGATTCATTCATAGCGTTGACATCGGATGCAACCTTGAAGAAA
GAGTTGAGCACTCTAGCCACAGTATTGTCATTGCTACTGTTCAAGTGCGATGTCATGAAAGCAAAGCCGCATCTCCCAGTTCACGAATGGGAAATACATAGAGATAGTTC
AGTGCCTCAACAAACGAACGGTGGGGATTGTGGTATGTTCGCGGTAAAGTTTTTTGAATATGATGTTACTGGAAGTGAAATAAACACTCTGAATCAAGATAGGATTAATT
TTTGTAGACGTCAATTTGCTGTTTCAAATTTGGGCCAACAGGCCGATATTTTAGTCTGGTTTGTATAG
mRNA sequence
ATGGAGTCTGTGGGTTTGATGGAGTTGGGAGATGATTGGGTATTGTTGAGTTTGGAGATGATTGGGTTTTATGGAGTTTGTGGGTTGTATGGAATTTATGAAAGAGGGAG
AGGGTGTGGTTTGGAGTGGAAGGAACCGTCGGCACCGGAGAAGACGGAAATTCCCGGTCACTTCCCGGTCGGGATCGAAACCGGGAAGGAACCGGTTTTCCCGCAACTGT
CCGATTTTCCCGTAGGTCCCTACGGGAAAACCGGTTCTTCCTTTACGGGAAAACCGGCTTTCCGGGTCATTGACTCACTTTTTATGTTCGTCCGGAAGAAACTGCAACAG
CGGGCAGACTTATGTCGTTGGAAGTTTGTCACTGCAGATATTGTTGTTACCGATTTTCTGAGGCGTAGCGACGACATAGCTGAAGAGTTGAAGAAAGTGCAAGATCCTTC
GTTGATTACATACGACTGGAGTACGGCCAATACTGTGATAGACTACGTTTTGGGTCGACACTCGGACCACGATACACATTGGAGTACAGTTGATGCGATCTACATGCCAT
TGAACCTTGGGGGGAACCATTGGGTTATGGTATGTGTTGATCTCCTAGTGGGCAAGTTGACCGTCCTCGATTCATTCATAGCGTTGACATCGGATGCAACCTTGAAGAAA
GAGTTGAGCACTCTAGCCACAGTATTGTCATTGCTACTGTTCAAGTGCGATGTCATGAAAGCAAAGCCGCATCTCCCAGTTCACGAATGGGAAATACATAGAGATAGTTC
AGTGCCTCAACAAACGAACGGTGGGGATTGTGGTATGTTCGCGGTAAAGTTTTTTGAATATGATGTTACTGGAAGTGAAATAAACACTCTGAATCAAGATAGGATTAATT
TTTGTAGACGTCAATTTGCTGTTTCAAATTTGGGCCAACAGGCCGATATTTTAGTCTGGTTTGTATAG
Protein sequence
MESVGLMELGDDWVLLSLEMIGFYGVCGLYGIYERGRGCGLEWKEPSAPEKTEIPGHFPVGIETGKEPVFPQLSDFPVGPYGKTGSSFTGKPAFRVIDSLFMFVRKKLQQ
RADLCRWKFVTADIVVTDFLRRSDDIAEELKKVQDPSLITYDWSTANTVIDYVLGRHSDHDTHWSTVDAIYMPLNLGGNHWVMVCVDLLVGKLTVLDSFIALTSDATLKK
ELSTLATVLSLLLFKCDVMKAKPHLPVHEWEIHRDSSVPQQTNGGDCGMFAVKFFEYDVTGSEINTLNQDRINFCRRQFAVSNLGQQADILVWFV
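
The CDS and protein sequences listed above are related by translation. As a minimal sketch, assuming the standard genetic code (NCBI translation table 1) applies to this nuclear gene, the following Python shows how a CDS string could be translated and compared with the protein record. The names `CODON_TABLE` and `translate` are illustrative and are not part of CuGenDB.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered with
# bases T, C, A, G and the first base varying slowest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS into protein, stopping at the first stop codon.

    Whitespace and line breaks are ignored, so the wrapped sequence
    lines of the record can be pasted in directly. Codons containing
    unknown characters are reported as 'X'.
    """
    cds = "".join(cds.split()).upper()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First 39 bases of the CDS listed in this record.
    start = "ATGGAGTCTGTGGGTTTGATGGAGTTGGGAGATGATTGG"
    print(translate(start))
```

Pasting the full CDS into `translate` and comparing the result with the protein sequence is a quick consistency check on the record.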