CuGenDBv2

Lag0034939 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0034939
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Ulp1-like peptidase
Genome location: chr3:12674504..12678067
RNA-Seq Expression: Lag0034939
Synteny: Lag0034939
Gene Ontology terms:
  GO:0006508 - proteolysis (biological process)
  GO:0008234 - cysteine-type peptidase activity (molecular function)
InterPro domains:
  IPR003653 - Ulp1 protease family, C-terminal catalytic domain
  IPR038765 - Papain-like cysteine peptidase superfamily
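The genome location field uses a `seqid:start..end` notation. As a minimal sketch, assuming the coordinates are 1-based and inclusive (an assumption; the record itself does not state the convention), the span of the gene model on the chromosome can be derived as follows. The helper names `parse_location` and `span_length` are hypothetical.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'seqid:start..end' string into its three parts."""
    match = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location string: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start: int, end: int) -> int:
    """Length of the interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


seqid, start, end = parse_location("chr3:12674504..12678067")
print(seqid, start, end, span_length(start, end))
```

Under that assumption the locus spans 3,564 bp on chr3. If the coordinates were 0-based and half-open, the `+ 1` would have to be dropped.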


Homology
GenBank top hits
XP_022153247.1 uncharacterized protein LOC111020782 [Momordica charantia] (e value: 1.1e-33; % identity: 44.38)
Query:  FDWSRFKTVTNYVMGEHTDYNVPWSSLDVVYMPFNLGRNHWVLLCADFETSEFVLIDSQMALNSDADIAKQMNTICTIFPRLLLRCNVMKEKPSLPTHPW
        +DW + +T+  YV+G  +DY+ PWS  D+VY P N+G NHWV++  D    +  + DS  A+    D+ K +  +CTI P +L    ++  +P L T PW
Subjt:  FDWSRFKTVTNYVMGEHTDYNVPWSSLDVVYMPFNLGRNHWVLLCADFETSEFVLIDSQMALNSDADIAKQMNTICTIFPRLLLRCNVMKEKPSLPTHPW

Query:  RFRRKTQVPQQQDSGDCGVFTVKFLEYDVTRSDLGSLSQEKMEFCRRQFAVQLWANRPFF
        R RR T VPQQ    DCG+F V+F EYDVT S + +L Q  +   RRQ+AVQ+WA RPFF
Subjt:  RFRRKTQVPQQQDSGDCGVFTVKFLEYDVTRSDLGSLSQEKMEFCRRQFAVQLWANRPFF
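The % identity reported for a hit can be checked against the alignment rows. The sketch below assumes the usual BLAST convention: a letter on the middle line marks an identical position, `+` marks a conservative substitution, and identity is the number of identical positions divided by the number of alignment columns. The function name `percent_identity` is hypothetical, and the rows are copied from the XP_022153247.1 alignment.

```python
def percent_identity(query_rows, midline_rows):
    """Return (identical positions, alignment columns, percent identity)."""
    # Alignment columns are counted on the query rows (gaps included).
    columns = sum(len(row) for row in query_rows)
    # Only letters on the middle line count as identities; '+' and spaces do not.
    identical = sum(ch.isalpha() for row in midline_rows for ch in row)
    return identical, columns, 100 * identical / columns


QUERY_ROWS = [
    "FDWSRFKTVTNYVMGEHTDYNVPWSSLDVVYMPFNLGRNHWVLLCADFETSEFVLIDSQMALNSDADIAKQMNTICTIFPRLLLRCNVMKEKPSLPTHPW",
    "RFRRKTQVPQQQDSGDCGVFTVKFLEYDVTRSDLGSLSQEKMEFCRRQFAVQLWANRPFF",
]
MIDLINE_ROWS = [
    "+DW + +T+  YV+G  +DY+ PWS  D+VY P N+G NHWV++  D    +  + DS  A+    D+ K +  +CTI P +L    ++  +P L T PW",
    "R RR T VPQQ    DCG+F V+F EYDVT S + +L Q  +   RRQ+AVQ+WA RPFF",
]

identical, columns, pct = percent_identity(QUERY_ROWS, MIDLINE_ROWS)
print(f"{identical}/{columns} identical positions = {pct:.3f}%")
```

For this hit the middle line carries 71 letters over 160 columns, which is consistent with the listed 44.38 % identity.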

XP_022156568.1 uncharacterized protein LOC111023442 [Momordica charantia] (e value: 3.2e-33; % identity: 45.89)
Query:  YVMGEHTDYNVPWSSLDVVYMPFNLGRNHWVLLCADFETSEFVLIDSQMALNSDADIAKQMNTICTIFPRLLLRCNVMKEKPSLPTHPWRFRRKTQVPQQ
        Y+   H+DY + W  ++ VY+PFN+  NHWV++C DF   E V+ DS  A+ S A + +Q+  + T+ P LL +  V+  +P+LP  PWR RR T  P+Q
Subjt:  YVMGEHTDYNVPWSSLDVVYMPFNLGRNHWVLLCADFETSEFVLIDSQMALNSDADIAKQMNTICTIFPRLLLRCNVMKEKPSLPTHPWRFRRKTQVPQQ

Query:  QDSGDCGVFTVKFLEYDVTRSDLGSLSQEKMEFCRRQFAVQLWANR
          SGDCG+F VK+ EYDVT + L +L Q  M + RRQFA QLW+N+
Subjt:  QDSGDCGVFTVKFLEYDVTRSDLGSLSQEKMEFCRRQFAVQLWANR

XP_022158807.1 uncharacterized protein LOC111025273 [Momordica charantia] (e value: 9.9e-35; % identity: 35.86)
Query:  LDDPSSDGSERKTVYAYRSKQWFKTLLTSSHWMSDE----------------EFLRR-----GDV-YEELLHSDHG------------NDTFDWSRFKTV
        +DDPS+D + R T    + K WF  LL     + DE                + L R     GDV    LL    G              T+DW + +T+
Subjt:  LDDPSSDGSERKTVYAYRSKQWFKTLLTSSHWMSDE----------------EFLRR-----GDV-YEELLHSDHG------------NDTFDWSRFKTV

Query:  TNYVMGEHTDYNVPWSSLDVVYMPFNLGRNHWVLLCADFETSEFVLIDSQMALNSDADIAKQMNTICTIFPRLLLRCNVMKEKPSLPTHPWRFRRKTQVP
          YV+G  +DY+  WS  D+VY   N+G NHWV++  D    +  + DS  A+    D+ K +  +CTI P +L    ++  +P+LP  PWR RR T VP
Subjt:  TNYVMGEHTDYNVPWSSLDVVYMPFNLGRNHWVLLCADFETSEFVLIDSQMALNSDADIAKQMNTICTIFPRLLLRCNVMKEKPSLPTHPWRFRRKTQVP

Query:  QQQDSGDCGVFTVKFLEYDVTRSDLGSLSQEKMEFCRRQFAVQLWANRPFF
        QQ    DC +F V+F EYDV  S + +L Q  +   RRQ+AVQ+WA RPFF
Subjt:  QQQDSGDCGVFTVKFLEYDVTRSDLGSLSQEKMEFCRRQFAVQLWANRPFF

XP_038882332.1 uncharacterized protein LOC120073583 [Benincasa hispida] (e value: 2.4e-41; % identity: 48.63)
Query:  DEEFLRRGDVYEELLHSDHGNDTFDWSRFKTVTNYVMGEHTDYNVPWSSLDVVYMPFNLGRNHWVLLCADFETSEFVLIDSQMALNSDADIAKQMNTICT
        DE FLRR               T DWS  K V  YV G+HTDY+VPWS++D VYMPFNL   HWVL+CADF+  E ++ DS +AL+ +AD+  +M  +C 
Subjt:  DEEFLRRGDVYEELLHSDHGNDTFDWSRFKTVTNYVMGEHTDYNVPWSSLDVVYMPFNLGRNHWVLLCADFETSEFVLIDSQMALNSDADIAKQMNTICT

Query:  IFPRLLLRCNVMKEKPSLPTHPWRFRRKTQVPQQQDSGDCGVFTVKFLEYDVTRSDLGSLSQEKMEFCRRQFAVQLWANRPFF
         FP LL+   VM E  +L    W  RR     QQ +SGDCG+FT KF EYDVT S +G+L+Q++ ++ RRQ+A+Q+WANR  F
Subjt:  IFPRLLLRCNVMKEKPSLPTHPWRFRRKTQVPQQQDSGDCGVFTVKFLEYDVTRSDLGSLSQEKMEFCRRQFAVQLWANRPFF

XP_038885861.1 sentrin-specific protease [Benincasa hispida] (e value: 2.7e-40; % identity: 47.54)
Query:  DEEFLRRGDVYEELLHSDHGNDTFDWSRFKTVTNYVMGEHTDYNVPWSSLDVVYMPFNLGRNHWVLLCADFETSEFVLIDSQMALNSDADIAKQMNTICT
        DE F   G   E  L  D    T DWS+   V  YV G+HTDY+VPWS++D +YMPFNL R HWVL+C DF+  E ++ DS + L+ +AD+  +M ++C 
Subjt:  DEEFLRRGDVYEELLHSDHGNDTFDWSRFKTVTNYVMGEHTDYNVPWSSLDVVYMPFNLGRNHWVLLCADFETSEFVLIDSQMALNSDADIAKQMNTICT

Query:  IFPRLLLRCNVMKEKPSLPTHPWRFRRKTQVPQQQDSGDCGVFTVKFLEYDVTRSDLGSLSQEKMEFCRRQFAVQLWANRPFF
         F  LL+   VM E  +L    W  RR   VPQQ  SGDCG+FT KF EYDVT S + +L+Q++M++ RRQ+A+Q+ ANR  F
Subjt:  IFPRLLLRCNVMKEKPSLPTHPWRFRRKTQVPQQQDSGDCGVFTVKFLEYDVTRSDLGSLSQEKMEFCRRQFAVQLWANRPFF

TrEMBL top hits
A0A5A7STU0 Ulp1-like peptidase (e value: 1.4e-31; % identity: 31.37)
Query:  YDPMRAIPEEYETKFQKWLDDPSSDGSERKTVYAYRSKQWFKTLLTSSHWMSDE----------EFLRRGDVYEELLHSD--------HGNDTFDWSRFK
        YDPM  I + +  + Q W+ D  +D   R+T +  +SK +F+ L     W++DE           F     ++  +L S           N  FDW    
Subjt:  YDPMRAIPEEYETKFQKWLDDPSSDGSERKTVYAYRSKQWFKTLLTSSHWMSDE----------EFLRRGDVYEELLHSD--------HGNDTFDWSRFK

Query:  TVTNYVMGEHTDYNVPWSSLDVVYMPFNLGRNHWVLLCADFETSEFVLIDSQMALNSDADIAKQMNTICTIFPRLLLRCNVMKEKPSLPTH--PWRFRRK
         + +YV+G   D+  PW+S+D VY PFN+  NHWVLLC D  + +  + DS  +L +  ++   +  I  + P+LL        +    T+  PW     
Subjt:  TVTNYVMGEHTDYNVPWSSLDVVYMPFNLGRNHWVLLCADFETSEFVLIDSQMALNSDADIAKQMNTICTIFPRLLLRCNVMKEKPSLPTH--PWRFRRK

Query:  TQVPQQQDSGDCGVFTVKFLEYDVTRSDLGSLSQEKMEFCRRQFAVQLWANRPFF
          +P Q+++ DCGVFT+K+ EY      L +L QE M + R+Q A QLW N P +
Subjt:  TQVPQQQDSGDCGVFTVKFLEYDVTRSDLGSLSQEKMEFCRRQFAVQLWANRPFF

A0A6J1DID7 uncharacterized protein LOC111020782 (e value: 5.3e-34; % identity: 44.38)
Query:  FDWSRFKTVTNYVMGEHTDYNVPWSSLDVVYMPFNLGRNHWVLLCADFETSEFVLIDSQMALNSDADIAKQMNTICTIFPRLLLRCNVMKEKPSLPTHPW
        +DW + +T+  YV+G  +DY+ PWS  D+VY P N+G NHWV++  D    +  + DS  A+    D+ K +  +CTI P +L    ++  +P L T PW
Subjt:  FDWSRFKTVTNYVMGEHTDYNVPWSSLDVVYMPFNLGRNHWVLLCADFETSEFVLIDSQMALNSDADIAKQMNTICTIFPRLLLRCNVMKEKPSLPTHPW

Query:  RFRRKTQVPQQQDSGDCGVFTVKFLEYDVTRSDLGSLSQEKMEFCRRQFAVQLWANRPFF
        R RR T VPQQ    DCG+F V+F EYDVT S + +L Q  +   RRQ+AVQ+WA RPFF
Subjt:  RFRRKTQVPQQQDSGDCGVFTVKFLEYDVTRSDLGSLSQEKMEFCRRQFAVQLWANRPFF

A0A6J1DLV0 uncharacterized protein LOC111021646 (e value: 2.6e-33; % identity: 41.77)
Query:  FDW-SRFKTVTNYVMGEHTDYNVPWSSLDVVYMPFNLGRNHWVLLCADFETSEFVLIDSQMALNSDADIAKQMNTICTIFPRLLLRCNVMKEKPSLPTHP
        +DW  R  ++ +Y+ G H+D +  W  +D VY+P+N+G  HW+++C DF+  E ++ DS M +     + +++  + TI P L+ R  V   KP++P  P
Subjt:  FDW-SRFKTVTNYVMGEHTDYNVPWSSLDVVYMPFNLGRNHWVLLCADFETSEFVLIDSQMALNSDADIAKQMNTICTIFPRLLLRCNVMKEKPSLPTHP

Query:  WRFRRKTQVPQQQDSGDCGVFTVKFLEYDVTRSDLGSLSQEKMEFCRRQFAVQLWANR
        WR RR +  PQQ   GDCG+F + F EYDVT     +L+Q +M F RRQFAVQLWAN+
Subjt:  WRFRRKTQVPQQQDSGDCGVFTVKFLEYDVTRSDLGSLSQEKMEFCRRQFAVQLWANR

A0A6J1DQZ3 uncharacterized protein LOC111023442 (e value: 1.5e-33; % identity: 45.89)
Query:  YVMGEHTDYNVPWSSLDVVYMPFNLGRNHWVLLCADFETSEFVLIDSQMALNSDADIAKQMNTICTIFPRLLLRCNVMKEKPSLPTHPWRFRRKTQVPQQ
        Y+   H+DY + W  ++ VY+PFN+  NHWV++C DF   E V+ DS  A+ S A + +Q+  + T+ P LL +  V+  +P+LP  PWR RR T  P+Q
Subjt:  YVMGEHTDYNVPWSSLDVVYMPFNLGRNHWVLLCADFETSEFVLIDSQMALNSDADIAKQMNTICTIFPRLLLRCNVMKEKPSLPTHPWRFRRKTQVPQQ

Query:  QDSGDCGVFTVKFLEYDVTRSDLGSLSQEKMEFCRRQFAVQLWANR
          SGDCG+F VK+ EYDVT + L +L Q  M + RRQFA QLW+N+
Subjt:  QDSGDCGVFTVKFLEYDVTRSDLGSLSQEKMEFCRRQFAVQLWANR

A0A6J1DY60 uncharacterized protein LOC111025273 (e value: 4.8e-35; % identity: 35.86)
Query:  LDDPSSDGSERKTVYAYRSKQWFKTLLTSSHWMSDE----------------EFLRR-----GDV-YEELLHSDHG------------NDTFDWSRFKTV
        +DDPS+D + R T    + K WF  LL     + DE                + L R     GDV    LL    G              T+DW + +T+
Subjt:  LDDPSSDGSERKTVYAYRSKQWFKTLLTSSHWMSDE----------------EFLRR-----GDV-YEELLHSDHG------------NDTFDWSRFKTV

Query:  TNYVMGEHTDYNVPWSSLDVVYMPFNLGRNHWVLLCADFETSEFVLIDSQMALNSDADIAKQMNTICTIFPRLLLRCNVMKEKPSLPTHPWRFRRKTQVP
          YV+G  +DY+  WS  D+VY   N+G NHWV++  D    +  + DS  A+    D+ K +  +CTI P +L    ++  +P+LP  PWR RR T VP
Subjt:  TNYVMGEHTDYNVPWSSLDVVYMPFNLGRNHWVLLCADFETSEFVLIDSQMALNSDADIAKQMNTICTIFPRLLLRCNVMKEKPSLPTHPWRFRRKTQVP

Query:  QQQDSGDCGVFTVKFLEYDVTRSDLGSLSQEKMEFCRRQFAVQLWANRPFF
        QQ    DC +F V+F EYDV  S + +L Q  +   RRQ+AVQ+WA RPFF
Subjt:  QQQDSGDCGVFTVKFLEYDVTRSDLGSLSQEKMEFCRRQFAVQLWANRPFF

SwissProt top hits
No hits found

Arabidopsis top hits
AT2G07240.1 cysteine-type peptidases;cysteine-type peptidases (e value: 1.3e-08; % identity: 30.3)
Query:  WSSLDVVYMPFNLGRNHWVLLCADFETSEFVLIDSQMALNSDADIAKQMNTICTIFPRLLLRCNVMKEKPSLPTHPWRFRRKTQVPQQQDSGDCGVFTV
        ++  D VYMPFN  + HWV LC D +  +  ++DS + L  DA +  ++  +  + P L  + +      SL   P+   R   +PQ     D GV +V
Subjt:  WSSLDVVYMPFNLGRNHWVLLCADFETSEFVLIDSQMALNSDADIAKQMNTICTIFPRLLLRCNVMKEKPSLPTHPWRFRRKTQVPQQQDSGDCGVFTV

AT4G08430.1 Ulp1 protease family protein (e value: 5.5e-07; % identity: 24.81)
Query:  LDVVYMPFNLGRNHWVLLCADFETSEFVLIDSQMALNSDADIAKQMNTICTIFPRLLLRCNVMKE-KPSLPTHPWRFRRKTQVPQQQDSGDCGVFTVKFL
        +D +Y    +  NHWV L  D       + DS  +L +D ++  Q   + T+ P +L      K+ + S     W  +R T++P+  D+ DC ++++K++
Subjt:  LDVVYMPFNLGRNHWVLLCADFETSEFVLIDSQMALNSDADIAKQMNTICTIFPRLLLRCNVMKE-KPSLPTHPWRFRRKTQVPQQQDSGDCGVFTVKFL

Query:  EYDVTRSDLGSLSQEKMEFCRRQFAVQLW
        E          L  E M+    + AV+++
Subjt:  EYDVTRSDLGSLSQEKMEFCRRQFAVQLW

AT4G15880.1 Cysteine proteinases superfamily protein (e value: 4.3e-04; % identity: 24.04)
Query:  LTSSHWMSDEEFLRRGDVYEELLHSD-----------HGNDTFDWSR--------FKTVTNYVMGEHTDYNVPWSSLDVVYMPFNLGRNHWVLLCADFET
        LT S W++DE      +VY ELL              H  +TF + +        FK V  +       Y +     D++++P + G  HW L   +   
Subjt:  LTSSHWMSDEEFLRRGDVYEELLHSD-----------HGNDTFDWSR--------FKTVTNYVMGEHTDYNVPWSSLDVVYMPFNLGRNHWVLLCADFET

Query:  SEFVLIDS-----QMALNSDADIAKQMNTICTIFPRLLLRCNVMKEKPSLPTHPWRFRRKTQVPQQQDSGDCGVFTVKFLEYDVTRSDLGSLSQEKMEFC
        S+ + +DS      M LN+   +AK M                 K    +  + W       +PQQ++  DCG+F +K++++  +R      SQE M + 
Subjt:  SEFVLIDS-----QMALNSDADIAKQMNTICTIFPRLLLRCNVMKEKPSLPTHPWRFRRKTQVPQQQDSGDCGVFTVKFLEYDVTRSDLGSLSQEKMEFC

Query:  RRQFAVQL
        R + A ++
Subjt:  RRQFAVQL

AT5G45570.1 Ulp1 protease family protein (e value: 3.4e-09; % identity: 27.13)
Query:  LDVVYMPFNLGRNHWVLLCADFETSEFVLIDSQMALNSDADIAKQMNTICTIFPRLLLRCNVMKE-KPSLPTHPWRFRRKTQVPQQQDSGDCGVFTVKFL
        +D +Y    +  NHWV L  D       + DS  +L +D ++A Q   + T+ P +L      K+ + S     W  +R T++P+  D GDC ++++K++
Subjt:  LDVVYMPFNLGRNHWVLLCADFETSEFVLIDSQMALNSDADIAKQMNTICTIFPRLLLRCNVMKE-KPSLPTHPWRFRRKTQVPQQQDSGDCGVFTVKFL

Query:  EYDVTRSDLGSLSQEKMEFCRRQFAVQLW
        E          L  E M+  R + AV+++
Subjt:  EYDVTRSDLGSLSQEKMEFCRRQFAVQLW


Sequences
CDS sequence
ATGGTGGAGGAGACGGCGATGGAAGGAGTCGACGTCGCCGTTCGTGCTTCGTGGTGGAAGAGAAGACGTTCATGCTTCATGGTGGAAGAGACGACGACGTGCGTGCTTCG
TGGTGGAAGAGACGACGACGACATTCGTGCTTTGTGGTGGAAGAGACGACGGCAGCGTTTGTGCTTCGTGGGAAAATATGGGCATACGAGACTGTTTCATCTCTTACTGG
ACGTGTTGCCAATCGCCAGAGTTAGGTTGAGTCTTGTTTCTTCTGTAGAGGAGAGTCAATTCATGAATCGAGTGATGAAGCCTCCACGTGCATCGGAGCCAGTGCTTGAG
CCAGTACTAGAGCCAGGGCAAGAACCATATCAAGAACCAGAGAGTCTACCTACTGCATTTGAGGTTATACTCCCTTTTGTAGAGGAGGCTACTGTAGTTGATACTGATAT
GTTGGATGTCATTGAAGCTTCTCCAGAAGTTTCAAATAAGAGAGGAAGGGAGAAAGAGGACAAAGACAAAGGGAAAGAGAAAGAGAAGGAAGTAGAAAAAGAGGATGAGA
GCACGAAGGAGAAGACAAAGAAGAAAAAGACGAAAAAGACGAAAACAAAGCAGACTTGTGAATGTAGCCAATGGCTACAACTTATGGATAGTCGGATGGAGAGAATGGAT
GCTTGCATGTCTGATATGGAGACATGCCTCAAGTCCATTACGAAGTTCTTGCGTCGTCTCTCTAAGGGTAAATTTGTGGACCATGAGAAGTACTTTGGACCGAAAGATGG
TCCGGATGATGAGGGCAGTCCATCGAAAGAACCAGCTGACATGGGTGGTCCATCGAAAGGACCCGATGACGTGGGTGGTCCATCGAAAGGACCCGATGGCAAGGGTGGTC
CATCAAAAGGACCGAATGACAAGGGTGGACCGGATGACAATAAGGAAGAAGGACATGAAGGAGGAGGAAGCGGCAGAGGAGAGGATGGGAAGGAGAATGACGTTGATGAG
GCGCACGACATAGAGCATATTACGGAGTTGGAGTCTCAACCAACCACTGACGTAGAGTCTCACTCCATTATTGATGTGGAGTCTCAACAAACCACTTACTTAGAGTCTCA
CTCCATTATTGACGTGAAGTCTCAACCAAACGGGGAAAGCGGACCCGACAAATTTCTTGGAAGCTTCGATCTCCATGGGCTAACACCAGGCCGAACGAAAAAAGGAGGAA
AAGTTAAGCAATATGATCCCATGCGTGCCATTCCTGAGGAGTACGAGACCAAGTTTCAAAAATGGTTGGATGACCCATCGTCTGATGGATCGGAACGCAAGACAGTGTAT
GCCTACAGAAGCAAACAATGGTTCAAGACGTTACTCACATCGTCTCATTGGATGAGTGATGAGGAATTTTTGAGGCGCGGGGATGTGTACGAAGAACTCCTTCATAGTGA
CCATGGGAACGACACGTTCGATTGGAGTAGATTCAAGACGGTCACTAACTACGTAATGGGAGAACACACAGATTACAACGTTCCTTGGAGTTCCCTTGATGTTGTCTACA
TGCCCTTCAACTTAGGTAGAAACCATTGGGTTCTACTGTGCGCTGACTTTGAAACGAGCGAATTTGTGTTGATAGACTCCCAAATGGCACTGAATTCAGATGCAGACATA
GCAAAGCAGATGAATACGATATGCACCATTTTTCCTAGGCTGCTACTGAGGTGCAACGTTATGAAGGAGAAGCCATCTCTTCCAACACATCCATGGCGATTCAGAAGGAA
GACCCAAGTGCCTCAACAACAAGATAGTGGAGATTGTGGGGTTTTCACTGTAAAGTTTTTGGAATATGATGTAACTAGATCAGATTTAGGTAGTCTTAGTCAAGAGAAAA
TGGAGTTTTGTAGGCGTCAATTTGCTGTACAACTTTGGGCCAATAGGCCGTTCTTTTAG
mRNA sequence
ATGGTGGAGGAGACGGCGATGGAAGGAGTCGACGTCGCCGTTCGTGCTTCGTGGTGGAAGAGAAGACGTTCATGCTTCATGGTGGAAGAGACGACGACGTGCGTGCTTCG
TGGTGGAAGAGACGACGACGACATTCGTGCTTTGTGGTGGAAGAGACGACGGCAGCGTTTGTGCTTCGTGGGAAAATATGGGCATACGAGACTGTTTCATCTCTTACTGG
ACGTGTTGCCAATCGCCAGAGTTAGGTTGAGTCTTGTTTCTTCTGTAGAGGAGAGTCAATTCATGAATCGAGTGATGAAGCCTCCACGTGCATCGGAGCCAGTGCTTGAG
CCAGTACTAGAGCCAGGGCAAGAACCATATCAAGAACCAGAGAGTCTACCTACTGCATTTGAGGTTATACTCCCTTTTGTAGAGGAGGCTACTGTAGTTGATACTGATAT
GTTGGATGTCATTGAAGCTTCTCCAGAAGTTTCAAATAAGAGAGGAAGGGAGAAAGAGGACAAAGACAAAGGGAAAGAGAAAGAGAAGGAAGTAGAAAAAGAGGATGAGA
GCACGAAGGAGAAGACAAAGAAGAAAAAGACGAAAAAGACGAAAACAAAGCAGACTTGTGAATGTAGCCAATGGCTACAACTTATGGATAGTCGGATGGAGAGAATGGAT
GCTTGCATGTCTGATATGGAGACATGCCTCAAGTCCATTACGAAGTTCTTGCGTCGTCTCTCTAAGGGTAAATTTGTGGACCATGAGAAGTACTTTGGACCGAAAGATGG
TCCGGATGATGAGGGCAGTCCATCGAAAGAACCAGCTGACATGGGTGGTCCATCGAAAGGACCCGATGACGTGGGTGGTCCATCGAAAGGACCCGATGGCAAGGGTGGTC
CATCAAAAGGACCGAATGACAAGGGTGGACCGGATGACAATAAGGAAGAAGGACATGAAGGAGGAGGAAGCGGCAGAGGAGAGGATGGGAAGGAGAATGACGTTGATGAG
GCGCACGACATAGAGCATATTACGGAGTTGGAGTCTCAACCAACCACTGACGTAGAGTCTCACTCCATTATTGATGTGGAGTCTCAACAAACCACTTACTTAGAGTCTCA
CTCCATTATTGACGTGAAGTCTCAACCAAACGGGGAAAGCGGACCCGACAAATTTCTTGGAAGCTTCGATCTCCATGGGCTAACACCAGGCCGAACGAAAAAAGGAGGAA
AAGTTAAGCAATATGATCCCATGCGTGCCATTCCTGAGGAGTACGAGACCAAGTTTCAAAAATGGTTGGATGACCCATCGTCTGATGGATCGGAACGCAAGACAGTGTAT
GCCTACAGAAGCAAACAATGGTTCAAGACGTTACTCACATCGTCTCATTGGATGAGTGATGAGGAATTTTTGAGGCGCGGGGATGTGTACGAAGAACTCCTTCATAGTGA
CCATGGGAACGACACGTTCGATTGGAGTAGATTCAAGACGGTCACTAACTACGTAATGGGAGAACACACAGATTACAACGTTCCTTGGAGTTCCCTTGATGTTGTCTACA
TGCCCTTCAACTTAGGTAGAAACCATTGGGTTCTACTGTGCGCTGACTTTGAAACGAGCGAATTTGTGTTGATAGACTCCCAAATGGCACTGAATTCAGATGCAGACATA
GCAAAGCAGATGAATACGATATGCACCATTTTTCCTAGGCTGCTACTGAGGTGCAACGTTATGAAGGAGAAGCCATCTCTTCCAACACATCCATGGCGATTCAGAAGGAA
GACCCAAGTGCCTCAACAACAAGATAGTGGAGATTGTGGGGTTTTCACTGTAAAGTTTTTGGAATATGATGTAACTAGATCAGATTTAGGTAGTCTTAGTCAAGAGAAAA
TGGAGTTTTGTAGGCGTCAATTTGCTGTACAACTTTGGGCCAATAGGCCGTTCTTTTAG
Protein sequence
MVEETAMEGVDVAVRASWWKRRRSCFMVEETTTCVLRGGRDDDDIRALWWKRRRQRLCFVGKYGHTRLFHLLLDVLPIARVRLSLVSSVEESQFMNRVMKPPRASEPVLE
PVLEPGQEPYQEPESLPTAFEVILPFVEEATVVDTDMLDVIEASPEVSNKRGREKEDKDKGKEKEKEVEKEDESTKEKTKKKKTKKTKTKQTCECSQWLQLMDSRMERMD
ACMSDMETCLKSITKFLRRLSKGKFVDHEKYFGPKDGPDDEGSPSKEPADMGGPSKGPDDVGGPSKGPDGKGGPSKGPNDKGGPDDNKEEGHEGGGSGRGEDGKENDVDE
AHDIEHITELESQPTTDVESHSIIDVESQQTTYLESHSIIDVKSQPNGESGPDKFLGSFDLHGLTPGRTKKGGKVKQYDPMRAIPEEYETKFQKWLDDPSSDGSERKTVY
AYRSKQWFKTLLTSSHWMSDEEFLRRGDVYEELLHSDHGNDTFDWSRFKTVTNYVMGEHTDYNVPWSSLDVVYMPFNLGRNHWVLLCADFETSEFVLIDSQMALNSDADI
AKQMNTICTIFPRLLLRCNVMKEKPSLPTHPWRFRRKTQVPQQQDSGDCGVFTVKFLEYDVTRSDLGSLSQEKMEFCRRQFAVQLWANRPFF
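
The protein sequence can be related back to the CDS by translating it codon by codon. The sketch below assumes the standard nuclear genetic code (NCBI translation table 1) and a CDS that starts in frame at its first base. The names `CODON_TABLE` and `translate` are hypothetical helpers, and only the first and last bases of the CDS above are used as input.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order for each position.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str, to_stop: bool = True) -> str:
    """Translate a DNA coding sequence read in frame from its first base."""
    cds = cds.upper()
    protein = []
    # Step through complete codons only; a trailing partial codon is ignored.
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # 'X' for ambiguous codons
        if aa == "*" and to_stop:
            break
        protein.append(aa)
    return "".join(protein)


cds_start = "ATGGTGGAGGAGACGGCGATGGAAGGAGTC"  # first 30 bases of the CDS
cds_end = "TGGGCCAATAGGCCGTTCTTTTAG"          # last 24 bases of the CDS
print(translate(cds_start))
print(translate(cds_end))
```

The two fragments translate to `MVEETAMEGV` and `WANRPFF`, matching the first and last residues of the protein sequence listed above; the final `TAG` codon is the stop.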