CuGenDBv2

Lag0036213 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0036213
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Ulp1-like peptidase
Genome location: chr3:41678555..41681474
RNA-Seq Expression: Lag0036213
Synteny: Lag0036213
Gene Ontology terms: GO:0006508 - proteolysis (biological process)
                     GO:0008234 - cysteine-type peptidase activity (molecular function)
InterPro domains: IPR003653 - Ulp1 protease family, C-terminal catalytic domain
                  IPR038765 - Papain-like cysteine peptidase superfamily
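
The genome location above uses a `chromosome:start..end` notation. As a minimal sketch, assuming the coordinates are 1-based and inclusive (the page does not state the convention), the span covered by the gene model could be computed as below. The helper name `parse_location` is hypothetical and is not part of any CuGenDB tooling.

```python
import re


def parse_location(location):
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location: {location!r}")
    chrom, start, end = match.groups()
    return chrom, int(start), int(end)


chrom, start, end = parse_location("chr3:41678555..41681474")
# Assumption: 1-based inclusive coordinates, so the span is end - start + 1.
span = end - start + 1
print(chrom, span)  # chr3 2920
```

Under that assumption the gene model spans 2920 bp on chr3, which is longer than the CDS and so would be consistent with the presence of introns or untranslated regions.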


Homology
GenBank top hits (e value, % identity, alignment)
XP_022148308.1 uncharacterized protein LOC111016993 [Momordica charantia] (e value: 2.9e-38; % identity: 41.18)
Query:  GSERKTVYAYRSKQWFQMLLTPSHWMSDEVIDSLFLFIRKKMDTHPDLCHRKFVTADVFVTEFLRRGDVYQELLRSDHGNDT-----FDW-SRFKKVTTY
        G+ RKTVY+ ++K WF+ LL P +W + EV+D LF+ +RKK++  PDLC RKF T D+ +  + RR D     ++SD    +     +DW  R + +  Y
Subjt:  GSERKTVYAYRSKQWFQMLLTPSHWMSDEVIDSLFLFIRKKMDTHPDLCHRKFVTADVFVTEFLRRGDVYQELLRSDHGNDT-----FDW-SRFKKVTTY

Query:  VMGEHTDYGVPWSSVDVVYMPFNLGRNHWVLLCTDFETGEFVLTDSLTVLNSDADITKQMNTVCTIFPRLLLRCDVMKEKPSLPTHP
          G HTDY + W  VD +Y+PFN+   HWV++C D E GE V+ DSL  + +D  +   +  + TI P +L +CDVMK +P+LP  P
Subjt:  VMGEHTDYGVPWSSVDVVYMPFNLGRNHWVLLCTDFETGEFVLTDSLTVLNSDADITKQMNTVCTIFPRLLLRCDVMKEKPSLPTHP

XP_022154364.1 uncharacterized protein LOC111021646 [Momordica charantia] (e value: 1.4e-40; % identity: 38.86)
Query:  LFIRKKMDTHPDLCHRKFVTADVFVTEFLRRGDVYQELLRS-----DHGNDTFDW-SRFKKVTTYVMGEHTDYGVPWSSVDVVYMPFNLGRNHWVLLCTD
        +F+  K+   P+LC RKF T DV ++ FLR  D    +++S           +DW  R   + +Y+ G H+D    W  VD VY+P+N+G  HW+++C D
Subjt:  LFIRKKMDTHPDLCHRKFVTADVFVTEFLRRGDVYQELLRS-----DHGNDTFDW-SRFKKVTTYVMGEHTDYGVPWSSVDVVYMPFNLGRNHWVLLCTD

Query:  FETGEFVLTDSLTVLNSDADITKQMNTVCTIFPRLLLRCDVMKEKPSLPTHPWRFRRKTQVPQQQDSGDCEVFTVKFLEYDVTRSDLGSLSQEKIEFCRR
        F+ GE ++ DS   +     + +++  + TI P L+ R  V   KP++P  PWR RR +  PQQ   GDC +F + F EYDVT     +L+Q ++ F RR
Subjt:  FETGEFVLTDSLTVLNSDADITKQMNTVCTIFPRLLLRCDVMKEKPSLPTHPWRFRRKTQVPQQQDSGDCEVFTVKFLEYDVTRSDLGSLSQEKIEFCRR

Query:  QFAVQLWANRS
        QFAVQLWAN+S
Subjt:  QFAVQLWANRS

XP_022158807.1 uncharacterized protein LOC111025273 [Momordica charantia] (e value: 9.3e-45; % identity: 37.85)
Query:  LEDPSSDGSERKTVYAYRSKQWFQMLLTPSHWMSDEVIDSLFLFIRKKMDTHPDLCHRKFVTADVFVTEFLRRGDVYQELLRSD--HGNDTFDWSRFKKV
        ++DPS+D + R T    + K WF +LL P   + DE IDSL +   +K++    L   +F   DV ++  LRR D     ++        T+DW + + +
Subjt:  LEDPSSDGSERKTVYAYRSKQWFQMLLTPSHWMSDEVIDSLFLFIRKKMDTHPDLCHRKFVTADVFVTEFLRRGDVYQELLRSD--HGNDTFDWSRFKKV

Query:  TTYVMGEHTDYGVPWSSVDVVYMPFNLGRNHWVLLCTDFETGEFVLTDSLTVLNSDADITKQMNTVCTIFPRLLLRCDVMKEKPSLPTHPWRFRRKTQVP
          YV+G  +DY   WS  D+VY   N+G NHWV++  D   G+  + DSL  +    D+ K +  +CTI P +L    ++  +P+LP  PWR RR T VP
Subjt:  TTYVMGEHTDYGVPWSSVDVVYMPFNLGRNHWVLLCTDFETGEFVLTDSLTVLNSDADITKQMNTVCTIFPRLLLRCDVMKEKPSLPTHPWRFRRKTQVP

Query:  QQQDSGDCEVFTVKFLEYDVTRSDLGSLSQEKIEFCRRQFAVQLWANRSFF
        QQ    DC +F V+F EYDV  S + +L Q  I   RRQ+AVQ+WA R FF
Subjt:  QQQDSGDCEVFTVKFLEYDVTRSDLGSLSQEKIEFCRRQFAVQLWANRSFF

XP_038882332.1 uncharacterized protein LOC120073583 [Benincasa hispida] (e value: 9.0e-40; % identity: 50.58)
Query:  QELLRSDHGNDTFDWSRFKKVTTYVMGEHTDYGVPWSSVDVVYMPFNLGRNHWVLLCTDFETGEFVLTDSLTVLNSDADITKQMNTVCTIFPRLLLRCDV
        +  LR D    T DWS  KKV  YV G+HTDY VPWS+VD VYMPFNL   HWVL+C DF+  E ++ DSL  L+ +AD+  +M  VC  FP LL+   V
Subjt:  QELLRSDHGNDTFDWSRFKKVTTYVMGEHTDYGVPWSSVDVVYMPFNLGRNHWVLLCTDFETGEFVLTDSLTVLNSDADITKQMNTVCTIFPRLLLRCDV

Query:  MKEKPSLPTHPWRFRRKTQVPQQQDSGDCEVFTVKFLEYDVTRSDLGSLSQEKIEFCRRQFAVQLWANRSFF
        M E  +L    W  RR     QQ +SGDC +FT KF EYDVT S +G+L+Q++ ++ RRQ+A+Q+WANR+ F
Subjt:  MKEKPSLPTHPWRFRRKTQVPQQQDSGDCEVFTVKFLEYDVTRSDLGSLSQEKIEFCRRQFAVQLWANRSFF

XP_038885861.1 sentrin-specific protease [Benincasa hispida] (e value: 5.9e-39; % identity: 48.84)
Query:  QELLRSDHGNDTFDWSRFKKVTTYVMGEHTDYGVPWSSVDVVYMPFNLGRNHWVLLCTDFETGEFVLTDSLTVLNSDADITKQMNTVCTIFPRLLLRCDV
        +  LR D    T DWS+   V  YV G+HTDY VPWS+VD +YMPFNL R HWVL+C DF+  E ++ DSL VL+ +AD+  +M ++C  F  LL+   V
Subjt:  QELLRSDHGNDTFDWSRFKKVTTYVMGEHTDYGVPWSSVDVVYMPFNLGRNHWVLLCTDFETGEFVLTDSLTVLNSDADITKQMNTVCTIFPRLLLRCDV

Query:  MKEKPSLPTHPWRFRRKTQVPQQQDSGDCEVFTVKFLEYDVTRSDLGSLSQEKIEFCRRQFAVQLWANRSFF
        M E  +L    W  RR   VPQQ  SGDC +FT KF EYDVT S + +L+Q+++++ RRQ+A+Q+ ANR+ F
Subjt:  MKEKPSLPTHPWRFRRKTQVPQQQDSGDCEVFTVKFLEYDVTRSDLGSLSQEKIEFCRRQFAVQLWANRSFF

TrEMBL top hits (e value, % identity, alignment)
A0A5A7SRX1 Ulp1-like peptidase (e value: 2.0e-29; % identity: 28.63)
Query:  KEKEKEVEKRDESRKEKKKKKKTKQTCECSQWMESMDARMSDMETCLKSITKFLCRLSKGKFVDPEKYFG-PKDGPDDDGGQSKGPDDVGGPSKRPD---
        K   + V   D+ +K KK+K K K      + + ++  R++ +E  L SI   +  L KG      K+ G  + G + D   S+G  D    SK  D   
Subjt:  KEKEKEVEKRDESRKEKKKKKKTKQTCECSQWMESMDARMSDMETCLKSITKFLCRLSKGKFVDPEKYFG-PKDGPDDDGGQSKGPDDVGGPSKRPD---

Query:  ----DVGGPSKGPDDVDGPSKGPNDKSGPSKGPDDNEKDGKEKDVDEAYDIEHITEFESQPTTDLESHSITDVESQPTIDPVELIAPKAEEYEI---KFQ
            D GG    P+ +  P +  + +    KG +D  K   +  ++E    +  + + S P T L   S+T   S    +P+ +  P  +  ++   + +
Subjt:  ----DVGGPSKGPDDVDGPSKGPNDKSGPSKGPDDNEKDGKEKDVDEAYDIEHITEFESQPTTDLESHSITDVESQPTIDPVELIAPKAEEYEI---KFQ

Query:  KWLEDPSSDGSERKTVYAYRSKQWFQMLLTPSHWMSDEVIDSLFLFIRKKMDTHPDLCHRKFVTAD-VFVTEFLRRGDVYQELLRSDHGNDTFDWSRFKK
         W+ D  +D   R+T +  +SK +F+ L     W+++E +D+LFLFIR K+        + F TAD +F+   + +  +Y+E ++ +H    FDW    +
Subjt:  KWLEDPSSDGSERKTVYAYRSKQWFQMLLTPSHWMSDEVIDSLFLFIRKKMDTHPDLCHRKFVTAD-VFVTEFLRRGDVYQELLRSDHGNDTFDWSRFKK

Query:  VTTYVMGEHTDYGVPWSSVDVVYMPFNLGRNHWVLLCTDFETGEFVLTDSLTVLNSDADITKQMNTVCTIFPRLLLRCDVMKEKPSLPTH--PWRFRRKT
        +  YV+G   D+  PW+SVD VY PFN+  NHWVLLC D  + +  + DSL  L++  ++T  +  +  + P+LL        +    T+  PW      
Subjt:  VTTYVMGEHTDYGVPWSSVDVVYMPFNLGRNHWVLLCTDFETGEFVLTDSLTVLNSDADITKQMNTVCTIFPRLLLRCDVMKEKPSLPTH--PWRFRRKT

Query:  QVPQQQDSGDCEVFTVKFLEYDVTRSDLGSLSQEKIEFCRRQFAVQLWANRSFF
         +P Q+++ DC VFT+K+ EY      L +L QE + + R+Q A QLW N   +
Subjt:  QVPQQQDSGDCEVFTVKFLEYDVTRSDLGSLSQEKIEFCRRQFAVQLWANRSFF

A0A5A7TPK2 Ulp1-like peptidase (e value: 1.5e-27; % identity: 28.45)
Query:  KEKEKEVEKRDESRKEKKKKKKTKQTCECSQWMESMDARMSDMETCLKSITKFLCRLSKGKFVDPEKYFG-PKDGPDDDGGQSKGPDDVGGPSKRPD---
        K   + V   D+ +K KK+K K K      + + ++  R++ +E  L SI   +  L KG      K+ G  + G + D   S+G  D    SK  D   
Subjt:  KEKEKEVEKRDESRKEKKKKKKTKQTCECSQWMESMDARMSDMETCLKSITKFLCRLSKGKFVDPEKYFG-PKDGPDDDGGQSKGPDDVGGPSKRPD---

Query:  ----DVGGPSKGPDDVDGPSKGPNDKSGPSKGPDDNEKD---GKEKDVDEAYDIEHITEFE---------------------SQPTTDLESHSITDVESQ
            D GG    P+ +  P +  + +       D  EK    G E+ +D   D E +TE E                     S P T L   S+T   S 
Subjt:  ----DVGGPSKGPDDVDGPSKGPNDKSGPSKGPDDNEKD---GKEKDVDEAYDIEHITEFE---------------------SQPTTDLESHSITDVESQ

Query:  PTIDPVELIAPKAEEYEI---KFQKWLEDPSSDGSERKTVYAYRSKQWFQMLLTPSHWMSDEVIDSLFLFIRKKMDTHPDLCHRKFVTAD-VFVTEFLRR
           +P+ +  P  +  ++   + + W+ D  +D   R+T +  +SK +F+ L     W+SDE +D+LFLFIR K+        + F TAD +F+   + +
Subjt:  PTIDPVELIAPKAEEYEI---KFQKWLEDPSSDGSERKTVYAYRSKQWFQMLLTPSHWMSDEVIDSLFLFIRKKMDTHPDLCHRKFVTAD-VFVTEFLRR

Query:  GDVYQELLRSDHGNDTFDWSRFKKVTTYVMGEHTDYGVPWSSVDVVYMPFNLGRNHWVLLCTDFETGEFVLTDSLTVLNSDADITKQMNTVCTIFPRLLL
          +Y+E ++    N  FDW    ++  YV+G   D+  PW+SVD VY PFN+  NHWVLLC D  + +  + DSL  L +  ++T  +  +  + P+LL 
Subjt:  GDVYQELLRSDHGNDTFDWSRFKKVTTYVMGEHTDYGVPWSSVDVVYMPFNLGRNHWVLLCTDFETGEFVLTDSLTVLNSDADITKQMNTVCTIFPRLLL

Query:  RCDVMKEKPSLPTH--PWRFRRKTQVPQQQDSGDCEVFTVKFLEYDVTRSDLGSLSQEKIEFCRRQFAVQLWANRSFF
               +    T+  PW       +P Q+++ DC VF +K+ EY      L +L QE + + R+Q A Q+W N   +
Subjt:  RCDVMKEKPSLPTH--PWRFRRKTQVPQQQDSGDCEVFTVKFLEYDVTRSDLGSLSQEKIEFCRRQFAVQLWANRSFF

A0A6J1D3R7 uncharacterized protein LOC111016993 (e value: 1.4e-38; % identity: 41.18)
Query:  GSERKTVYAYRSKQWFQMLLTPSHWMSDEVIDSLFLFIRKKMDTHPDLCHRKFVTADVFVTEFLRRGDVYQELLRSDHGNDT-----FDW-SRFKKVTTY
        G+ RKTVY+ ++K WF+ LL P +W + EV+D LF+ +RKK++  PDLC RKF T D+ +  + RR D     ++SD    +     +DW  R + +  Y
Subjt:  GSERKTVYAYRSKQWFQMLLTPSHWMSDEVIDSLFLFIRKKMDTHPDLCHRKFVTADVFVTEFLRRGDVYQELLRSDHGNDT-----FDW-SRFKKVTTY

Query:  VMGEHTDYGVPWSSVDVVYMPFNLGRNHWVLLCTDFETGEFVLTDSLTVLNSDADITKQMNTVCTIFPRLLLRCDVMKEKPSLPTHP
          G HTDY + W  VD +Y+PFN+   HWV++C D E GE V+ DSL  + +D  +   +  + TI P +L +CDVMK +P+LP  P
Subjt:  VMGEHTDYGVPWSSVDVVYMPFNLGRNHWVLLCTDFETGEFVLTDSLTVLNSDADITKQMNTVCTIFPRLLLRCDVMKEKPSLPTHP

A0A6J1DLV0 uncharacterized protein LOC111021646 (e value: 6.8e-41; % identity: 38.86)
Query:  LFIRKKMDTHPDLCHRKFVTADVFVTEFLRRGDVYQELLRS-----DHGNDTFDW-SRFKKVTTYVMGEHTDYGVPWSSVDVVYMPFNLGRNHWVLLCTD
        +F+  K+   P+LC RKF T DV ++ FLR  D    +++S           +DW  R   + +Y+ G H+D    W  VD VY+P+N+G  HW+++C D
Subjt:  LFIRKKMDTHPDLCHRKFVTADVFVTEFLRRGDVYQELLRS-----DHGNDTFDW-SRFKKVTTYVMGEHTDYGVPWSSVDVVYMPFNLGRNHWVLLCTD

Query:  FETGEFVLTDSLTVLNSDADITKQMNTVCTIFPRLLLRCDVMKEKPSLPTHPWRFRRKTQVPQQQDSGDCEVFTVKFLEYDVTRSDLGSLSQEKIEFCRR
        F+ GE ++ DS   +     + +++  + TI P L+ R  V   KP++P  PWR RR +  PQQ   GDC +F + F EYDVT     +L+Q ++ F RR
Subjt:  FETGEFVLTDSLTVLNSDADITKQMNTVCTIFPRLLLRCDVMKEKPSLPTHPWRFRRKTQVPQQQDSGDCEVFTVKFLEYDVTRSDLGSLSQEKIEFCRR

Query:  QFAVQLWANRS
        QFAVQLWAN+S
Subjt:  QFAVQLWANRS

A0A6J1DY60 uncharacterized protein LOC111025273 (e value: 4.5e-45; % identity: 37.85)
Query:  LEDPSSDGSERKTVYAYRSKQWFQMLLTPSHWMSDEVIDSLFLFIRKKMDTHPDLCHRKFVTADVFVTEFLRRGDVYQELLRSD--HGNDTFDWSRFKKV
        ++DPS+D + R T    + K WF +LL P   + DE IDSL +   +K++    L   +F   DV ++  LRR D     ++        T+DW + + +
Subjt:  LEDPSSDGSERKTVYAYRSKQWFQMLLTPSHWMSDEVIDSLFLFIRKKMDTHPDLCHRKFVTADVFVTEFLRRGDVYQELLRSD--HGNDTFDWSRFKKV

Query:  TTYVMGEHTDYGVPWSSVDVVYMPFNLGRNHWVLLCTDFETGEFVLTDSLTVLNSDADITKQMNTVCTIFPRLLLRCDVMKEKPSLPTHPWRFRRKTQVP
          YV+G  +DY   WS  D+VY   N+G NHWV++  D   G+  + DSL  +    D+ K +  +CTI P +L    ++  +P+LP  PWR RR T VP
Subjt:  TTYVMGEHTDYGVPWSSVDVVYMPFNLGRNHWVLLCTDFETGEFVLTDSLTVLNSDADITKQMNTVCTIFPRLLLRCDVMKEKPSLPTHPWRFRRKTQVP

Query:  QQQDSGDCEVFTVKFLEYDVTRSDLGSLSQEKIEFCRRQFAVQLWANRSFF
        QQ    DC +F V+F EYDV  S + +L Q  I   RRQ+AVQ+WA R FF
Subjt:  QQQDSGDCEVFTVKFLEYDVTRSDLGSLSQEKIEFCRRQFAVQLWANRSFF

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
AT2G07240.1 cysteine-type peptidases;cysteine-type peptidases (e value: 8.2e-07; % identity: 28.46)
Query:  WSSVDVVYMPFNLGRNHWVLLCTDFETGEFVLTDSLTVLNSDADITKQMNTVCTIFPRLLLRCDVMKEKPSLPTHPWRFRRKTQVPQQQ---DSGDCEVF
        ++  D VYMPFN  + HWV LC D +  +  + DS   L  DA +  ++  +  + P L  +        SL   P+   R   +PQ     DSG   VF
Subjt:  WSSVDVVYMPFNLGRNHWVLLCTDFETGEFVLTDSLTVLNSDADITKQMNTVCTIFPRLLLRCDVMKEKPSLPTHPWRFRRKTQVPQQQ---DSGDCEVF

Query:  TV---------KFLEYDVTRSDL
         +         + +E+DV   D+
Subjt:  TV---------KFLEYDVTRSDL

AT3G06910.1 UB-like protease 1A (e value: 3.5e-05; % identity: 21.27)
Query:  LTPSHWMSDEVIDSLFLFIRKKMDTHPDLCHRKFVTADVFVTEFLRRGDVYQELLRSDHGND---TFDWSRFKKVTTYVMGEHTDYGVPWSSVDVVYMPF
        L P  W++DEVI+   + ++++    P    +KF+    F T F      + +L+ S  G +      W+  K++           G      D +++P 
Subjt:  LTPSHWMSDEVIDSLFLFIRKKMDTHPDLCHRKFVTADVFVTEFLRRGDVYQELLRSDHGND---TFDWSRFKKVTTYVMGEHTDYGVPWSSVDVVYMPF

Query:  NLGRNHWVLLCTDFETGEFVLTDSLTVLNSDADITKQMNTVCTIFPRLLLRCDVMKEKP--SLPTHPWRFRRKTQVPQQQDSGDCEVFTVKFLEYDVTRS
        ++   HW L   + +  +F   DS           K ++ +   F       D +++K    L    WR      +P Q++  DC +F VK++++     
Subjt:  NLGRNHWVLLCTDFETGEFVLTDSLTVLNSDADITKQMNTVCTIFPRLLLRCDVMKEKP--SLPTHPWRFRRKTQVPQQQDSGDCEVFTVKFLEYDVTRS

Query:  DLGSLSQEKIEFCRRQFAVQL
        DL   +QE++ + R + A ++
Subjt:  DLGSLSQEKIEFCRRQFAVQL

AT5G45570.1 Ulp1 protease family protein (e value: 2.3e-09; % identity: 21.33)
Query:  ITDVESQPTIDPVELIAPKAEEYEIKFQKWLE-DPSSDGSERKTVYAYRSKQWFQMLLTPSH--------WMSDEVIDSLFLFIRKKMDTHPDLCHRK--
        ITDV S  ++   E + P +++   + + WLE D   +G       A    +++  ++TP +        W+ D  + +     R++ +  P     +  
Subjt:  ITDVESQPTIDPVELIAPKAEEYEIKFQKWLE-DPSSDGSERKTVYAYRSKQWFQMLLTPSH--------WMSDEVIDSLFLFIRKKMDTHPDLCHRK--

Query:  -FVTADVFVTEFLRRGDVYQELLRSDHGNDTFDWSRFKKVTTYVMGEHTDYGVPWSSVDVVYMPFNLGRNHWVLLCTDFETGEFVLTDSLTVLNSDADIT
         F+  D+ ++  L+    +Q   R     + ++     +V      E       +  VD +Y    +  NHWV L  D       + DS+  L +D ++ 
Subjt:  -FVTADVFVTEFLRRGDVYQELLRSDHGNDTFDWSRFKKVTTYVMGEHTDYGVPWSSVDVVYMPFNLGRNHWVLLCTDFETGEFVLTDSLTVLNSDADIT

Query:  KQMNTVCTIFPRLLLRCDVMKE-KPSLPTHPWRFRRKTQVPQQQDSGDCEVFTVKFLEYDVTRSDLGSLSQEKIEFCRRQFAVQLW
         Q   V T+ P +L      K+ + S     W  +R T++P+  D GDC ++++K++E          L  E ++  R + AV+++
Subjt:  KQMNTVCTIFPRLLLRCDVMKE-KPSLPTHPWRFRRKTQVPQQQDSGDCEVFTVKFLEYDVTRSDLGSLSQEKIEFCRRQFAVQLW


Sequences
CDS sequence
ATGACCCCTTTTCCTATCGTTCAGCAGATTAAAAATAGGGCAACTGCACGCACAACTTCAGAACCACAGCTGCACTCTTCTCACTGCAGATATATTTCTGTGTCCACGGA
TATCGACCGTCAACAGCAAGTTAGCCGTTCACGTGTGTTCGTACCCCAGCTAGGTCAAATTACCGTTTTACCCCTGGGCTACCTCTTGGTCCTTAAGTACCAGTGCTCCT
CTAATGAACAACCTGTTTGTGGTCCAACCAGCAAACAGAAATCCCTCTCGTGCCATAAAGAGGACTACGTTCCCAGCTCCCCATTCGGTCTTGTCATTGAAGCTTCTCCA
GAAATTACAAATAAGAGAGGAAGGGAAAAAGATGACAAAGACAAATGGAAAGAGAAAGAGAAGGAAGTAGAAAAAAGGGATGAGAGCAGGAAGGAGAAGAAGAAGAAGAA
GAAGACAAAGCAGACCTGTGAGTGTAGCCAGTGGATGGAGAGTATGGATGCTCGCATGTCTGATATGGAGACATGCCTCAAGTCCATTACCAAGTTCTTATGTCGTCTCT
CTAAGGGAAAATTCGTGGACCCTGAGAAGTACTTTGGACCGAAAGATGGTCCGGATGATGACGGTGGTCAATCGAAAGGACCAGATGACGTGGGTGGTCCATCGAAAAGA
CCCGATGACGTGGGTGGTCCATCAAAAGGACCCGATGACGTGGATGGTCCATCGAAAGGACCCAATGACAAGAGTGGTCCATCGAAAGGACCCGATGACAATGAGAAGGA
CGGGAAGGAGAAGGACGTTGATGAGGCATACGACATAGAGCATATTACGGAGTTCGAGTCTCAACCAACCACTGACTTAGAGTCTCACTCAATTACTGACGTGGAGTCTC
AACCAACTATAGACCCAGTCGAACTAATTGCACCTAAGGCTGAGGAATACGAGATCAAGTTTCAGAAATGGTTGGAAGACCCATCGTCTGACGGATCGGAGCGTAAGACA
GTATATGCCTATAGAAGTAAACAGTGGTTTCAGATGTTACTCACACCATCTCATTGGATGAGTGATGAGGTGATTGACTCTCTCTTCCTCTTTATTCGGAAGAAGATGGA
TACCCATCCTGACTTATGTCATCGAAAGTTCGTCACAGCAGATGTATTTGTAACAGAATTTTTGAGGCGCGGGGATGTGTACCAAGAACTCCTTCGTAGTGACCATGGGA
ACGACACGTTCGATTGGAGCAGATTCAAGAAGGTCACTACCTACGTAATGGGAGAACACACAGATTACGGCGTTCCTTGGAGTTCTGTTGATGTTGTCTACATGCCCTTC
AACTTAGGTAGAAACCATTGGGTTCTACTGTGCACTGACTTTGAAACGGGCGAATTTGTGTTGACAGACTCCCTAACGGTACTGAATTCAGATGCAGACATAACCAAGCA
GATGAATACGGTATGCACCATTTTTCCTAGGCTGCTACTAAGGTGCGACGTTATGAAGGAGAAGCCGTCTCTTCCAACACATCCATGGCGATTCAGAAGGAAGACCCAAG
TGCCACAACAACAAGATAGTGGGGATTGTGAGGTTTTCACTGTAAAGTTTTTAGAATATGATGTAACTAGATCAGATTTAGGTAGTCTTAGTCAGGAGAAAATTGAGTTT
TGTAGGCGTCAATTTGCTGTACAACTTTGGGCCAATAGGTCGTTCTTTTAG
mRNA sequence
ATGACCCCTTTTCCTATCGTTCAGCAGATTAAAAATAGGGCAACTGCACGCACAACTTCAGAACCACAGCTGCACTCTTCTCACTGCAGATATATTTCTGTGTCCACGGA
TATCGACCGTCAACAGCAAGTTAGCCGTTCACGTGTGTTCGTACCCCAGCTAGGTCAAATTACCGTTTTACCCCTGGGCTACCTCTTGGTCCTTAAGTACCAGTGCTCCT
CTAATGAACAACCTGTTTGTGGTCCAACCAGCAAACAGAAATCCCTCTCGTGCCATAAAGAGGACTACGTTCCCAGCTCCCCATTCGGTCTTGTCATTGAAGCTTCTCCA
GAAATTACAAATAAGAGAGGAAGGGAAAAAGATGACAAAGACAAATGGAAAGAGAAAGAGAAGGAAGTAGAAAAAAGGGATGAGAGCAGGAAGGAGAAGAAGAAGAAGAA
GAAGACAAAGCAGACCTGTGAGTGTAGCCAGTGGATGGAGAGTATGGATGCTCGCATGTCTGATATGGAGACATGCCTCAAGTCCATTACCAAGTTCTTATGTCGTCTCT
CTAAGGGAAAATTCGTGGACCCTGAGAAGTACTTTGGACCGAAAGATGGTCCGGATGATGACGGTGGTCAATCGAAAGGACCAGATGACGTGGGTGGTCCATCGAAAAGA
CCCGATGACGTGGGTGGTCCATCAAAAGGACCCGATGACGTGGATGGTCCATCGAAAGGACCCAATGACAAGAGTGGTCCATCGAAAGGACCCGATGACAATGAGAAGGA
CGGGAAGGAGAAGGACGTTGATGAGGCATACGACATAGAGCATATTACGGAGTTCGAGTCTCAACCAACCACTGACTTAGAGTCTCACTCAATTACTGACGTGGAGTCTC
AACCAACTATAGACCCAGTCGAACTAATTGCACCTAAGGCTGAGGAATACGAGATCAAGTTTCAGAAATGGTTGGAAGACCCATCGTCTGACGGATCGGAGCGTAAGACA
GTATATGCCTATAGAAGTAAACAGTGGTTTCAGATGTTACTCACACCATCTCATTGGATGAGTGATGAGGTGATTGACTCTCTCTTCCTCTTTATTCGGAAGAAGATGGA
TACCCATCCTGACTTATGTCATCGAAAGTTCGTCACAGCAGATGTATTTGTAACAGAATTTTTGAGGCGCGGGGATGTGTACCAAGAACTCCTTCGTAGTGACCATGGGA
ACGACACGTTCGATTGGAGCAGATTCAAGAAGGTCACTACCTACGTAATGGGAGAACACACAGATTACGGCGTTCCTTGGAGTTCTGTTGATGTTGTCTACATGCCCTTC
AACTTAGGTAGAAACCATTGGGTTCTACTGTGCACTGACTTTGAAACGGGCGAATTTGTGTTGACAGACTCCCTAACGGTACTGAATTCAGATGCAGACATAACCAAGCA
GATGAATACGGTATGCACCATTTTTCCTAGGCTGCTACTAAGGTGCGACGTTATGAAGGAGAAGCCGTCTCTTCCAACACATCCATGGCGATTCAGAAGGAAGACCCAAG
TGCCACAACAACAAGATAGTGGGGATTGTGAGGTTTTCACTGTAAAGTTTTTAGAATATGATGTAACTAGATCAGATTTAGGTAGTCTTAGTCAGGAGAAAATTGAGTTT
TGTAGGCGTCAATTTGCTGTACAACTTTGGGCCAATAGGTCGTTCTTTTAG
Protein sequence
MTPFPIVQQIKNRATARTTSEPQLHSSHCRYISVSTDIDRQQQVSRSRVFVPQLGQITVLPLGYLLVLKYQCSSNEQPVCGPTSKQKSLSCHKEDYVPSSPFGLVIEASP
EITNKRGREKDDKDKWKEKEKEVEKRDESRKEKKKKKKTKQTCECSQWMESMDARMSDMETCLKSITKFLCRLSKGKFVDPEKYFGPKDGPDDDGGQSKGPDDVGGPSKR
PDDVGGPSKGPDDVDGPSKGPNDKSGPSKGPDDNEKDGKEKDVDEAYDIEHITEFESQPTTDLESHSITDVESQPTIDPVELIAPKAEEYEIKFQKWLEDPSSDGSERKT
VYAYRSKQWFQMLLTPSHWMSDEVIDSLFLFIRKKMDTHPDLCHRKFVTADVFVTEFLRRGDVYQELLRSDHGNDTFDWSRFKKVTTYVMGEHTDYGVPWSSVDVVYMPF
NLGRNHWVLLCTDFETGEFVLTDSLTVLNSDADITKQMNTVCTIFPRLLLRCDVMKEKPSLPTHPWRFRRKTQVPQQQDSGDCEVFTVKFLEYDVTRSDLGSLSQEKIEF
CRRQFAVQLWANRSFF
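
The CDS and protein sequences above can be cross-checked by translating one into the other. The following is a minimal sketch that assumes the standard nuclear genetic code and an in-frame CDS ending in a stop codon; the function name `translate` and the table construction are illustrative and not taken from the database.

```python
# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDDDEEEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds):
    """Translate an in-frame CDS, dropping a single trailing stop codon."""
    cds = "".join(cds.split()).upper()  # remove line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    protein = "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))
    if protein.endswith("*"):
        protein = protein[:-1]
    return protein


# First six codons of the CDS listed above.
print(translate("ATGACCCCTTTTCCTATC"))  # MTPFPI
```

Passing the full CDS (with its line breaks) to `translate` is expected to reproduce the protein sequence listed above, which begins MTPFPI and ends SFF.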