; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0034564 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0034564
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionUlp1-like peptidase
Genome locationchr3:8450595..8454131
RNA-Seq ExpressionLag0034564
SyntenyLag0034564
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0008234 - cysteine-type peptidase activity (molecular function)
InterPro domainsIPR003653 - Ulp1 protease family, C-terminal catalytic domain
IPR015410 - Domain of unknown function DUF1985
IPR038765 - Papain-like cysteine peptidase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
AAS91798.1 Ulp1-like peptidase [Cucumis melo]2.0e-4230.63Show/hide
Query:  SFGKRDFDLITGLRHSFRPMRRD--REGPPNRLLRLYFRENVGMKVEELDKSFPTLQFE-NDEDAVKIAAFYFFELALMGRERKQQVDASTLDLMDDWVA
        +FG+R+F+++TGL   + P + D  + G  +RLL  +F++   + V +L+  F  L++E +D+D VK+A  YF E++L+G++R+ +VD   L + DDW +
Subjt:  SFGKRDFDLITGLRHSFRPMRRD--REGPPNRLLRLYFRENVGMKVEELDKSFPTLQFE-NDEDAVKIAAFYFFELALMGRERKQQVDASTLDLMDDWVA

Query:  FCNEDWSTIVFDKTIKSLKKALKGKVESYKRKGSGPKKQETYSLYGFPFAFQIWAYETVSSLTGRVANRVSETAIPRIRRWSCAHSPSYTIIADEVFGSR
        F N DW  IVF +T+ +LK+AL  +    K+K +  KK   Y++ GFP A Q            R+           + +W                   
Subjt:  FCNEDWSTIVFDKTIKSLKKALKGKVESYKRKGSGPKKQETYSLYGFPFAFQIWAYETVSSLTGRVANRVSETAIPRIRRWSCAHSPSYTIIADEVFGSR

Query:  AVESCFFCRGESIHESSDHENDTFDWSRFKTVTNYVMGEHTDYSVPWSSVDAVYMPFNLGRNHWVLLCADFETGEFVLTDSLTALNSDADIAKQVNTVCT
                    +++    EN  FDW     + +YV+G   D+  PW+SVD VY PFN+  NHWVLLC D  + +  + DSL +L +  ++   +  +  
Subjt:  AVESCFFCRGESIHESSDHENDTFDWSRFKTVTNYVMGEHTDYSVPWSSVDAVYMPFNLGRNHWVLLCADFETGEFVLTDSLTALNSDADIAKQVNTVCT

Query:  IFPRLLLRCDVM--KDKPSLPTHPWRFRRKTQVPQQQDSGDCGVFTVKFLEYDVTRSDLGSLSQEKIEFCRRQFAVQLWANR
        + P+LL        + + S    PW       +P Q+++ DCGVF +K+ EY      L +L QE + + R+Q A Q+W N+
Subjt:  IFPRLLLRCDVM--KDKPSLPTHPWRFRRKTQVPQQQDSGDCGVFTVKFLEYDVTRSDLGSLSQEKIEFCRRQFAVQLWANR

KAA0038137.1 Ulp1-like peptidase [Cucumis melo var. makuwa]5.2e-5130.22Show/hide
Query:  KRDFDLITGLRHSFRPMRRDREGPPN-RLLRLYFRENVGMKVEELDKSFPTLQFE-NDEDAVKIAAFYFFELALMGRERKQQVDASTLDLMDDWVAFCNE
        +R+F++ITGL   + P     +   N RLL  +F++   + V +L+  F  L++E +D+D VK+A  YF E++L+G++R+ +VD     + DDW  F N 
Subjt:  KRDFDLITGLRHSFRPMRRDREGPPN-RLLRLYFRENVGMKVEELDKSFPTLQFE-NDEDAVKIAAFYFFELALMGRERKQQVDASTLDLMDDWVAFCNE

Query:  DWSTIVFDKTIKSLKKALKGKVESYKRKGSGPKKQETYSLYGFPFAFQIWAYETVSSLTGRVANRVSETAIPRIRRWSCAHSPSYTIIAD----------
        DW  IVF++T+ +LK+AL  +    K+K +  KK   Y++ GFP A Q+WAYE++ ++ G   ++V++ AIPR+ RW C  SP    I+           
Subjt:  DWSTIVFDKTIKSLKKALKGKVESYKRKGSGPKKQETYSLYGFPFAFQIWAYETVSSLTGRVANRVSETAIPRIRRWSCAHSPSYTIIAD----------

Query:  ----------------------------EVF---GSRAVESCF-----------------FCRGESI-----------HESSDHENDTFDWSRFKTVTNY
                                    E F    S+ +++ F                 F   ++I           ++    EN  FDW     + +Y
Subjt:  ----------------------------EVF---GSRAVESCF-----------------FCRGESI-----------HESSDHENDTFDWSRFKTVTNY

Query:  VMGEHTDYSVPWSSVDAVYMPFNLGRNHWVLLCADFETGEFVLTDSLTALNSDADIAKQVNTVCTIFPRLLLRCDVM--KDKPSLPTHPWRFRRKTQVPQ
        V+G   D+  PW+SVD VY PFN+  NHWVLLC D  + +  + DSL +L +  ++   +  +  + P+LL        + + S     W       +P 
Subjt:  VMGEHTDYSVPWSSVDAVYMPFNLGRNHWVLLCADFETGEFVLTDSLTALNSDADIAKQVNTVCTIFPRLLLRCDVM--KDKPSLPTHPWRFRRKTQVPQ

Query:  QQDSGDCGVFTVKFLEYDVTRSDLGSLSQEKIEFCRRQFAVQLWANRPFF
        Q+++ DCGVFT+K+ EY      L +L QE + + R+Q A QLW N P +
Subjt:  QQDSGDCGVFTVKFLEYDVTRSDLGSLSQEKIEFCRRQFAVQLWANRPFF

XP_022153201.1 uncharacterized protein LOC111020757 [Momordica charantia]1.3e-4651.01Show/hide
Query:  QKVSFGKRDFDLITGLRHSFRPMRRDREGPPNRLLRLYFRENVGMKVEELDKSFPTLQFENDEDAVKIAAFYFFELALMGRERKQQVDASTLDLMDDWVA
        ++VSFGKR+FDLITGL H  R  R D   P  RL   YF++ V +K  EL+K F    F +DED VK+   YF ELA+MG+ERKQ +D + L ++D W  
Subjt:  QKVSFGKRDFDLITGLRHSFRPMRRDREGPPNRLLRLYFRENVGMKVEELDKSFPTLQFENDEDAVKIAAFYFFELALMGRERKQQVDASTLDLMDDWVA

Query:  FCNEDWSTIVFDKTIKSLKKALKGKVESYKRKGSG-PKKQETYSLYGFPFAFQIWAYETVSSLTGRVANRVSETAIPRIRRWSCAHSPSYTIIADEVF
        FCN DWS+++FD+TI SLK ALK K+  Y++K +  P   ETYSLYGFP+AFQ+WAYET+S+L        S+ AIPR+ RWSC +S  + ++  EVF
Subjt:  FCNEDWSTIVFDKTIKSLKKALKGKVESYKRKGSG-PKKQETYSLYGFPFAFQIWAYETVSSLTGRVANRVSETAIPRIRRWSCAHSPSYTIIADEVF

XP_022157020.1 uncharacterized protein LOC111023847 [Momordica charantia]1.2e-5252.31Show/hide
Query:  REIWQNSTLWLTENYRSQKVSFGKRDFDLITGLRHSFRPMRRDREGPPNRLLR-LYFRENVGMKVEELDKSFPTLQFENDEDAVKIAAFYFFELALMGRE
        RE+ +     ++ N    +VSFGKR+FDLITGLRH+   M R  E   NR LR LYF++   +K  EL+K F    FENDEDAVKIA  YF ELA+MG+E
Subjt:  REIWQNSTLWLTENYRSQKVSFGKRDFDLITGLRHSFRPMRRDREGPPNRLLR-LYFRENVGMKVEELDKSFPTLQFENDEDAVKIAAFYFFELALMGRE

Query:  RKQQVDASTLDLMDDWVAFCNEDWSTIVFDKTIKSLKKALKGKVESYKRK-GSGPKKQETYSLYGFPFAFQIWAYETVSSLTGRVANRVSETAIPRIRRW
        RK ++D S L ++D W  FCN DWS+++F++T+ SLK ALK KVE YK+K        ETYSLY FP+AFQ+WAYET+S+L+ RVA R+++ AIPR+ RW
Subjt:  RKQQVDASTLDLMDDWVAFCNEDWSTIVFDKTIKSLKKALKGKVESYKRK-GSGPKKQETYSLYGFPFAFQIWAYETVSSLTGRVANRVSETAIPRIRRW

Query:  SCAHSPSYTIIADEVF
        SC +S ++ ++  EVF
Subjt:  SCAHSPSYTIIADEVF

XP_038882332.1 uncharacterized protein LOC120073583 [Benincasa hispida]6.4e-4151.83Show/hide
Query:  ENDTFDWSRFKTVTNYVMGEHTDYSVPWSSVDAVYMPFNLGRNHWVLLCADFETGEFVLTDSLTALNSDADIAKQVNTVCTIFPRLLLRCDVMKDKPSLP
        +  T DWS  K V  YV G+HTDY VPWS+VDAVYMPFNL   HWVL+CADF+  E ++ DSL AL+ +AD+  ++  VC  FP LL+   VM +  +L 
Subjt:  ENDTFDWSRFKTVTNYVMGEHTDYSVPWSSVDAVYMPFNLGRNHWVLLCADFETGEFVLTDSLTALNSDADIAKQVNTVCTIFPRLLLRCDVMKDKPSLP

Query:  THPWRFRRKTQVPQQQDSGDCGVFTVKFLEYDVTRSDLGSLSQEKIEFCRRQFAVQLWANRPFF
           W  RR     QQ +SGDCG+FT KF EYDVT S +G+L+Q++ ++ RRQ+A+Q+WANR  F
Subjt:  THPWRFRRKTQVPQQQDSGDCGVFTVKFLEYDVTRSDLGSLSQEKIEFCRRQFAVQLWANRPFF

TrEMBL top hitse value%identityAlignment
A0A5A7T951 Ulp1-like peptidase2.5e-5130.22Show/hide
Query:  KRDFDLITGLRHSFRPMRRDREGPPN-RLLRLYFRENVGMKVEELDKSFPTLQFE-NDEDAVKIAAFYFFELALMGRERKQQVDASTLDLMDDWVAFCNE
        +R+F++ITGL   + P     +   N RLL  +F++   + V +L+  F  L++E +D+D VK+A  YF E++L+G++R+ +VD     + DDW  F N 
Subjt:  KRDFDLITGLRHSFRPMRRDREGPPN-RLLRLYFRENVGMKVEELDKSFPTLQFE-NDEDAVKIAAFYFFELALMGRERKQQVDASTLDLMDDWVAFCNE

Query:  DWSTIVFDKTIKSLKKALKGKVESYKRKGSGPKKQETYSLYGFPFAFQIWAYETVSSLTGRVANRVSETAIPRIRRWSCAHSPSYTIIAD----------
        DW  IVF++T+ +LK+AL  +    K+K +  KK   Y++ GFP A Q+WAYE++ ++ G   ++V++ AIPR+ RW C  SP    I+           
Subjt:  DWSTIVFDKTIKSLKKALKGKVESYKRKGSGPKKQETYSLYGFPFAFQIWAYETVSSLTGRVANRVSETAIPRIRRWSCAHSPSYTIIAD----------

Query:  ----------------------------EVF---GSRAVESCF-----------------FCRGESI-----------HESSDHENDTFDWSRFKTVTNY
                                    E F    S+ +++ F                 F   ++I           ++    EN  FDW     + +Y
Subjt:  ----------------------------EVF---GSRAVESCF-----------------FCRGESI-----------HESSDHENDTFDWSRFKTVTNY

Query:  VMGEHTDYSVPWSSVDAVYMPFNLGRNHWVLLCADFETGEFVLTDSLTALNSDADIAKQVNTVCTIFPRLLLRCDVM--KDKPSLPTHPWRFRRKTQVPQ
        V+G   D+  PW+SVD VY PFN+  NHWVLLC D  + +  + DSL +L +  ++   +  +  + P+LL        + + S     W       +P 
Subjt:  VMGEHTDYSVPWSSVDAVYMPFNLGRNHWVLLCADFETGEFVLTDSLTALNSDADIAKQVNTVCTIFPRLLLRCDVM--KDKPSLPTHPWRFRRKTQVPQ

Query:  QQDSGDCGVFTVKFLEYDVTRSDLGSLSQEKIEFCRRQFAVQLWANRPFF
        Q+++ DCGVFT+K+ EY      L +L QE + + R+Q A QLW N P +
Subjt:  QQDSGDCGVFTVKFLEYDVTRSDLGSLSQEKIEFCRRQFAVQLWANRPFF

A0A6J1DJX9 uncharacterized protein LOC1110207576.4e-4751.01Show/hide
Query:  QKVSFGKRDFDLITGLRHSFRPMRRDREGPPNRLLRLYFRENVGMKVEELDKSFPTLQFENDEDAVKIAAFYFFELALMGRERKQQVDASTLDLMDDWVA
        ++VSFGKR+FDLITGL H  R  R D   P  RL   YF++ V +K  EL+K F    F +DED VK+   YF ELA+MG+ERKQ +D + L ++D W  
Subjt:  QKVSFGKRDFDLITGLRHSFRPMRRDREGPPNRLLRLYFRENVGMKVEELDKSFPTLQFENDEDAVKIAAFYFFELALMGRERKQQVDASTLDLMDDWVA

Query:  FCNEDWSTIVFDKTIKSLKKALKGKVESYKRKGSG-PKKQETYSLYGFPFAFQIWAYETVSSLTGRVANRVSETAIPRIRRWSCAHSPSYTIIADEVF
        FCN DWS+++FD+TI SLK ALK K+  Y++K +  P   ETYSLYGFP+AFQ+WAYET+S+L        S+ AIPR+ RWSC +S  + ++  EVF
Subjt:  FCNEDWSTIVFDKTIKSLKKALKGKVESYKRKGSG-PKKQETYSLYGFPFAFQIWAYETVSSLTGRVANRVSETAIPRIRRWSCAHSPSYTIIADEVF

A0A6J1DRZ7 uncharacterized protein LOC1110238476.0e-5352.31Show/hide
Query:  REIWQNSTLWLTENYRSQKVSFGKRDFDLITGLRHSFRPMRRDREGPPNRLLR-LYFRENVGMKVEELDKSFPTLQFENDEDAVKIAAFYFFELALMGRE
        RE+ +     ++ N    +VSFGKR+FDLITGLRH+   M R  E   NR LR LYF++   +K  EL+K F    FENDEDAVKIA  YF ELA+MG+E
Subjt:  REIWQNSTLWLTENYRSQKVSFGKRDFDLITGLRHSFRPMRRDREGPPNRLLR-LYFRENVGMKVEELDKSFPTLQFENDEDAVKIAAFYFFELALMGRE

Query:  RKQQVDASTLDLMDDWVAFCNEDWSTIVFDKTIKSLKKALKGKVESYKRK-GSGPKKQETYSLYGFPFAFQIWAYETVSSLTGRVANRVSETAIPRIRRW
        RK ++D S L ++D W  FCN DWS+++F++T+ SLK ALK KVE YK+K        ETYSLY FP+AFQ+WAYET+S+L+ RVA R+++ AIPR+ RW
Subjt:  RKQQVDASTLDLMDDWVAFCNEDWSTIVFDKTIKSLKKALKGKVESYKRK-GSGPKKQETYSLYGFPFAFQIWAYETVSSLTGRVANRVSETAIPRIRRW

Query:  SCAHSPSYTIIADEVF
        SC +S ++ ++  EVF
Subjt:  SCAHSPSYTIIADEVF

A0A6J1E0A9 uncharacterized protein LOC1110252092.0e-4045.77Show/hide
Query:  KVSFGKRDFDLITGLRHSFRPMRRDREGPPNRLLRLYFRENVGMKVEELDKSFPTLQFENDEDAVKIAAFYFFELALMGRERKQQVDASTLDLMDDWVAF
        KVSFG+R+FD+I+GL++S  P+R+     P R   LYF  +  + + EL+K + +++FE+D DAVK+   YF EL L+GRER  + D   L ++DDW A 
Subjt:  KVSFGKRDFDLITGLRHSFRPMRRDREGPPNRLLRLYFRENVGMKVEELDKSFPTLQFENDEDAVKIAAFYFFELALMGRERKQQVDASTLDLMDDWVAF

Query:  CNEDWSTIVFDKTIKSLKKALKGKVESYKRKGSGPKKQETYSLYGFPFAFQIWAYETVSSLTGRVANRVSETAIPRIRRWSCAHSPSYTIIADEVFGSRA
        CN DW+ + FDKTI SL++       S K K  G +K  +YSLYGFP+AFQ+WAYE +SSL+G +   VS+  +PRI +W   HS +Y ++A E+F S  
Subjt:  CNEDWSTIVFDKTIKSLKKALKGKVESYKRKGSGPKKQETYSLYGFPFAFQIWAYETVSSLTGRVANRVSETAIPRIRRWSCAHSPSYTIIADEVFGSRA

Query:  V
        V
Subjt:  V

Q5GIS9 Ulp1 peptidase-like9.6e-4330.63Show/hide
Query:  SFGKRDFDLITGLRHSFRPMRRD--REGPPNRLLRLYFRENVGMKVEELDKSFPTLQFE-NDEDAVKIAAFYFFELALMGRERKQQVDASTLDLMDDWVA
        +FG+R+F+++TGL   + P + D  + G  +RLL  +F++   + V +L+  F  L++E +D+D VK+A  YF E++L+G++R+ +VD   L + DDW +
Subjt:  SFGKRDFDLITGLRHSFRPMRRD--REGPPNRLLRLYFRENVGMKVEELDKSFPTLQFE-NDEDAVKIAAFYFFELALMGRERKQQVDASTLDLMDDWVA

Query:  FCNEDWSTIVFDKTIKSLKKALKGKVESYKRKGSGPKKQETYSLYGFPFAFQIWAYETVSSLTGRVANRVSETAIPRIRRWSCAHSPSYTIIADEVFGSR
        F N DW  IVF +T+ +LK+AL  +    K+K +  KK   Y++ GFP A Q            R+           + +W                   
Subjt:  FCNEDWSTIVFDKTIKSLKKALKGKVESYKRKGSGPKKQETYSLYGFPFAFQIWAYETVSSLTGRVANRVSETAIPRIRRWSCAHSPSYTIIADEVFGSR

Query:  AVESCFFCRGESIHESSDHENDTFDWSRFKTVTNYVMGEHTDYSVPWSSVDAVYMPFNLGRNHWVLLCADFETGEFVLTDSLTALNSDADIAKQVNTVCT
                    +++    EN  FDW     + +YV+G   D+  PW+SVD VY PFN+  NHWVLLC D  + +  + DSL +L +  ++   +  +  
Subjt:  AVESCFFCRGESIHESSDHENDTFDWSRFKTVTNYVMGEHTDYSVPWSSVDAVYMPFNLGRNHWVLLCADFETGEFVLTDSLTALNSDADIAKQVNTVCT

Query:  IFPRLLLRCDVM--KDKPSLPTHPWRFRRKTQVPQQQDSGDCGVFTVKFLEYDVTRSDLGSLSQEKIEFCRRQFAVQLWANR
        + P+LL        + + S    PW       +P Q+++ DCGVF +K+ EY      L +L QE + + R+Q A Q+W N+
Subjt:  IFPRLLLRCDVM--KDKPSLPTHPWRFRRKTQVPQQQDSGDCGVFTVKFLEYDVTRSDLGSLSQEKIEFCRRQFAVQLWANR

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G07240.1 cysteine-type peptidases;cysteine-type peptidases7.6e-0830.3Show/hide
Query:  WSSVDAVYMPFNLGRNHWVLLCADFETGEFVLTDSLTALNSDADIAKQVNTVCTIFPRLLLRCDVMKDKPSLPTHPWRFRRKTQVPQQQDSGDCGVFTV
        ++  D VYMPFN  + HWV LC D +  +  + DS   L  DA +  ++  +  + P L  +        SL   P+   R   +PQ     D GV +V
Subjt:  WSSVDAVYMPFNLGRNHWVLLCADFETGEFVLTDSLTALNSDADIAKQVNTVCTIFPRLLLRCDVMKDKPSLPTHPWRFRRKTQVPQQQDSGDCGVFTV

AT5G45570.1 Ulp1 protease family protein8.1e-1027.91Show/hide
Query:  VDAVYMPFNLGRNHWVLLCADFETGEFVLTDSLTALNSDADIAKQVNTVCTIFPRLLLRCDVMKD-KPSLPTHPWRFRRKTQVPQQQDSGDCGVFTVKFL
        VD +Y    +  NHWV L  D       + DS+ +L +D ++A Q   V T+ P +L      K  + S     W  +R T++P+  D GDC ++++K++
Subjt:  VDAVYMPFNLGRNHWVLLCADFETGEFVLTDSLTALNSDADIAKQVNTVCTIFPRLLLRCDVMKD-KPSLPTHPWRFRRKTQVPQQQDSGDCGVFTVKFL

Query:  EYDVTRSDLGSLSQEKIEFCRRQFAVQLW
        E          L  E ++  R + AV+++
Subjt:  EYDVTRSDLGSLSQEKIEFCRRQFAVQLW


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGCCGAGGCCGACCTCGTCCTCCTAGGGTCTTTAAGAATTCGGAGGTGTTTTGGGATGAACCAAGCGGAACCGGGGCGATTTGGGGCATCACGGGTCGAAAGGAGGT
GACCGAGCTCGGCCTAGACCATATGGGTCGGGCCAAGTCTTCCTATGAGTCGGACTTCTGGTCCTACCTTTGTCCGATTGTCCTTGTCAGCTCCTTATTTACTTTCCTTC
TCTCACGACTTCTCCATTTCTTCCTCACATTCGTCTGCTGTTTCCTTGGAAATATTTCGCGGAAATCAAGCTTACTTCTACACGAGTTAAGTTCGTCGGCGGTTTCTGAG
GAGGGTAGAGGGAAGAAGAACAGGATTTTGTGCTGTTTAAGACAACGTTTCTGGAATCGAGATGAATCCCGGCCGGGATTCATCTTCTGTAATTTCTCGGTCAACCGGGA
AATTTGGCAGAACTCGAAGCCTTTCTCGGTTGACCGAGTTACACAGACCAAATCTCGCTCTCGATTTCGACCGAGATTTAGCCGGGAAATTTGGCAGAACTCGACCCTTT
GGTTGACCGAGAATTATAGAAGTCAGAAGGTCTCCTTTGGTAAGAGAGATTTTGACCTCATAACCGGCCTTCGTCATTCATTTAGACCAATGAGGAGAGATAGAGAGGGC
CCTCCCAATAGACTCCTAAGATTATATTTTAGAGAGAATGTAGGCATGAAGGTGGAGGAGTTAGATAAGTCGTTTCCGACTCTTCAGTTTGAGAACGACGAAGATGCAGT
TAAGATCGCAGCGTTTTATTTTTTTGAGTTGGCTTTGATGGGGAGGGAACGCAAACAACAAGTAGATGCTAGCACCTTAGACTTAATGGATGATTGGGTGGCATTCTGCA
ACGAGGATTGGAGTACCATCGTTTTCGACAAGACGATCAAAAGTTTGAAGAAGGCACTAAAGGGGAAGGTTGAGTCGTACAAGCGTAAGGGAAGTGGTCCAAAGAAGCAG
GAGACATACAGTCTATATGGTTTCCCGTTTGCTTTTCAGATATGGGCATACGAGACTGTTTCATCTCTCACTGGACGTGTTGCCAATCGAGTAAGTGAGACAGCCATCCC
ACGCATCCGTCGATGGTCTTGCGCCCACTCCCCATCATACACGATCATTGCTGATGAAGTTTTTGGATCCCGGGCGGTCGAGTCTTGTTTCTTCTGCAGAGGAGAGTCAA
TACATGAATCGAGTGACCATGAGAACGACACGTTCGATTGGAGCAGATTCAAGACGGTCACTAACTACGTAATGGGAGAACACACAGATTACAGCGTTCCTTGGAGTTCC
GTTGATGCTGTCTACATGCCCTTCAACTTAGGTAGAAACCATTGGGTTCTACTGTGCGCTGACTTCGAAACAGGCGAATTTGTGTTGACAGACTCCCTAACGGCACTGAA
TTCAGATGCAGACATAGCCAAGCAGGTGAATACGGTATGCACCATTTTTCCTAGGCTGCTATTAAGGTGCGACGTTATGAAGGACAAGCCGTCTCTTCCAACACATCCAT
GGCGATTCAGAAGGAAGACCCAAGTGCCACAACAACAAGATAGTGGGGATTGTGGGGTCTTCACTGTAAAGTTTTTGGAATATGATGTAACTAGATCAGATTTAGGTAGT
CTTAGTCAGGAGAAAATTGAGTTTTGTAGGCGTCAATTTGCTGTACAACTTTGGGCCAATAGGCCGTTCTTTTAG
mRNA sequenceShow/hide mRNA sequence
ATGGGCCGAGGCCGACCTCGTCCTCCTAGGGTCTTTAAGAATTCGGAGGTGTTTTGGGATGAACCAAGCGGAACCGGGGCGATTTGGGGCATCACGGGTCGAAAGGAGGT
GACCGAGCTCGGCCTAGACCATATGGGTCGGGCCAAGTCTTCCTATGAGTCGGACTTCTGGTCCTACCTTTGTCCGATTGTCCTTGTCAGCTCCTTATTTACTTTCCTTC
TCTCACGACTTCTCCATTTCTTCCTCACATTCGTCTGCTGTTTCCTTGGAAATATTTCGCGGAAATCAAGCTTACTTCTACACGAGTTAAGTTCGTCGGCGGTTTCTGAG
GAGGGTAGAGGGAAGAAGAACAGGATTTTGTGCTGTTTAAGACAACGTTTCTGGAATCGAGATGAATCCCGGCCGGGATTCATCTTCTGTAATTTCTCGGTCAACCGGGA
AATTTGGCAGAACTCGAAGCCTTTCTCGGTTGACCGAGTTACACAGACCAAATCTCGCTCTCGATTTCGACCGAGATTTAGCCGGGAAATTTGGCAGAACTCGACCCTTT
GGTTGACCGAGAATTATAGAAGTCAGAAGGTCTCCTTTGGTAAGAGAGATTTTGACCTCATAACCGGCCTTCGTCATTCATTTAGACCAATGAGGAGAGATAGAGAGGGC
CCTCCCAATAGACTCCTAAGATTATATTTTAGAGAGAATGTAGGCATGAAGGTGGAGGAGTTAGATAAGTCGTTTCCGACTCTTCAGTTTGAGAACGACGAAGATGCAGT
TAAGATCGCAGCGTTTTATTTTTTTGAGTTGGCTTTGATGGGGAGGGAACGCAAACAACAAGTAGATGCTAGCACCTTAGACTTAATGGATGATTGGGTGGCATTCTGCA
ACGAGGATTGGAGTACCATCGTTTTCGACAAGACGATCAAAAGTTTGAAGAAGGCACTAAAGGGGAAGGTTGAGTCGTACAAGCGTAAGGGAAGTGGTCCAAAGAAGCAG
GAGACATACAGTCTATATGGTTTCCCGTTTGCTTTTCAGATATGGGCATACGAGACTGTTTCATCTCTCACTGGACGTGTTGCCAATCGAGTAAGTGAGACAGCCATCCC
ACGCATCCGTCGATGGTCTTGCGCCCACTCCCCATCATACACGATCATTGCTGATGAAGTTTTTGGATCCCGGGCGGTCGAGTCTTGTTTCTTCTGCAGAGGAGAGTCAA
TACATGAATCGAGTGACCATGAGAACGACACGTTCGATTGGAGCAGATTCAAGACGGTCACTAACTACGTAATGGGAGAACACACAGATTACAGCGTTCCTTGGAGTTCC
GTTGATGCTGTCTACATGCCCTTCAACTTAGGTAGAAACCATTGGGTTCTACTGTGCGCTGACTTCGAAACAGGCGAATTTGTGTTGACAGACTCCCTAACGGCACTGAA
TTCAGATGCAGACATAGCCAAGCAGGTGAATACGGTATGCACCATTTTTCCTAGGCTGCTATTAAGGTGCGACGTTATGAAGGACAAGCCGTCTCTTCCAACACATCCAT
GGCGATTCAGAAGGAAGACCCAAGTGCCACAACAACAAGATAGTGGGGATTGTGGGGTCTTCACTGTAAAGTTTTTGGAATATGATGTAACTAGATCAGATTTAGGTAGT
CTTAGTCAGGAGAAAATTGAGTTTTGTAGGCGTCAATTTGCTGTACAACTTTGGGCCAATAGGCCGTTCTTTTAG
Protein sequenceShow/hide protein sequence
MGRGRPRPPRVFKNSEVFWDEPSGTGAIWGITGRKEVTELGLDHMGRAKSSYESDFWSYLCPIVLVSSLFTFLLSRLLHFFLTFVCCFLGNISRKSSLLLHELSSSAVSE
EGRGKKNRILCCLRQRFWNRDESRPGFIFCNFSVNREIWQNSKPFSVDRVTQTKSRSRFRPRFSREIWQNSTLWLTENYRSQKVSFGKRDFDLITGLRHSFRPMRRDREG
PPNRLLRLYFRENVGMKVEELDKSFPTLQFENDEDAVKIAAFYFFELALMGRERKQQVDASTLDLMDDWVAFCNEDWSTIVFDKTIKSLKKALKGKVESYKRKGSGPKKQ
ETYSLYGFPFAFQIWAYETVSSLTGRVANRVSETAIPRIRRWSCAHSPSYTIIADEVFGSRAVESCFFCRGESIHESSDHENDTFDWSRFKTVTNYVMGEHTDYSVPWSS
VDAVYMPFNLGRNHWVLLCADFETGEFVLTDSLTALNSDADIAKQVNTVCTIFPRLLLRCDVMKDKPSLPTHPWRFRRKTQVPQQQDSGDCGVFTVKFLEYDVTRSDLGS
LSQEKIEFCRRQFAVQLWANRPFF