; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0018472 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0018472
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionUlp1-like peptidase
Genome locationchr5:27923192..27928623
RNA-Seq ExpressionLag0018472
SyntenyLag0018472
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0008234 - cysteine-type peptidase activity (molecular function)
InterPro domainsIPR003653 - Ulp1 protease family, C-terminal catalytic domain
IPR038765 - Papain-like cysteine peptidase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022154364.1 uncharacterized protein LOC111021646 [Momordica charantia]1.4e-2230.95Show/hide
Query:  LFVRKKMDTRPDLCHRKFVTTDVFVT--IMTFLGVPLMLST-----------------------------------------------LGRNHWVLLCAD
        +FV  K+  RP+LC RKF T DV ++  + +  GV +M+ +                                               +G  HW+++C D
Subjt:  LFVRKKMDTRPDLCHRKFVTTDVFVT--IMTFLGVPLMLST-----------------------------------------------LGRNHWVLLCAD

Query:  FETGEFVLTDSLTTLNSDVDIAKQMNTVCTNFPRLLLRCDVMKEKPSLPTHSWRFRRKTQVPQQQNSGDCGVFIVKFLEYDVTRSDLGSLSQEKMEFCRC
        F+ GE ++ DS   +     + +++  + T  P L+ R  V   KP++P   WR RR +  PQQ   GDCG+F + F EYDVT     +L+Q +M F R 
Subjt:  FETGEFVLTDSLTTLNSDVDIAKQMNTVCTNFPRLLLRCDVMKEKPSLPTHSWRFRRKTQVPQQQNSGDCGVFIVKFLEYDVTRSDLGSLSQEKMEFCRC

Query:  QFVVQLWANR
        QF VQLWAN+
Subjt:  QFVVQLWANR

XP_022156568.1 uncharacterized protein LOC111023442 [Momordica charantia]8.0e-2345.38Show/hide
Query:  NHWVLLCADFETGEFVLTDSLTTLNSDVDIAKQMNTVCTNFPRLLLRCDVMKEKPSLPTHSWRFRRKTQVPQQQNSGDCGVFIVKFLEYDVTRSDLGSLS
        NHWV++C DF  GE V+ DSL  + S   + +Q+  + T  P LL +  V+  +P+LP   WR RR T  P+Q +SGDCG+F VK+ EYDVT + L +L 
Subjt:  NHWVLLCADFETGEFVLTDSLTTLNSDVDIAKQMNTVCTNFPRLLLRCDVMKEKPSLPTHSWRFRRKTQVPQQQNSGDCGVFIVKFLEYDVTRSDLGSLS

Query:  QEKMEFCRCQFVVQLWANR
        Q  M + R QF  QLW+N+
Subjt:  QEKMEFCRCQFVVQLWANR

XP_022158807.1 uncharacterized protein LOC111025273 [Momordica charantia]1.2e-2128.69Show/hide
Query:  LDDPSSDGSEHKTVYAYRSKQCFQTLLTLSHWMSDEVIDSLFLFVRKKMDTRPDLCHRKFVTTDVFV---------------------------------
        +DDPS+D +   T    + K  F  LL     + DE IDSL +   +K++    L   +F   DV +                                 
Subjt:  LDDPSSDGSEHKTVYAYRSKQCFQTLLTLSHWMSDEVIDSLFLFVRKKMDTRPDLCHRKFVTTDVFV---------------------------------

Query:  ------------TIMTFLGVPLMLSTLGRNHWVLLCADFETGEFVLTDSLTTLNSDVDIAKQMNTVCTNFPRLLLRCDVMKEKPSLPTHSWRFRRKTQVP
                    T+ +   +      +G NHWV++  D   G+  + DSL  +    D+ K +  +CT  P +L    ++  +P+LP   WR RR T VP
Subjt:  ------------TIMTFLGVPLMLSTLGRNHWVLLCADFETGEFVLTDSLTTLNSDVDIAKQMNTVCTNFPRLLLRCDVMKEKPSLPTHSWRFRRKTQVP

Query:  QQQNSGDCGVFIVKFLEYDVTRSDLGSLSQEKMEFCRCQFVVQLWANRPFF
        QQ    DC +F V+F EYDV  S + +L Q  +   R Q+ VQ+WA RPFF
Subjt:  QQQNSGDCGVFIVKFLEYDVTRSDLGSLSQEKMEFCRCQFVVQLWANRPFF

XP_038882332.1 uncharacterized protein LOC120073583 [Benincasa hispida]2.3e-2242Show/hide
Query:  HRKFVTTDVFVTIMTFLGVPLMLSTLGRNHWVLLCADFETGEFVLTDSLTTLNSDVDIAKQMNTVCTNFPRLLLRCDVMKEKPSLPTHSWRFRRKTQVPQ
        H K    DV  + +  + +P  LS +   HWVL+CADF+  E ++ DSL  L+ + D+  +M  VC NFP LL+   VM E  +L    W  RR     Q
Subjt:  HRKFVTTDVFVTIMTFLGVPLMLSTLGRNHWVLLCADFETGEFVLTDSLTTLNSDVDIAKQMNTVCTNFPRLLLRCDVMKEKPSLPTHSWRFRRKTQVPQ

Query:  QQNSGDCGVFIVKFLEYDVTRSDLGSLSQEKMEFCRCQFVVQLWANRPFF
        Q  SGDCG+F  KF EYDVT S +G+L+Q++ ++ R Q+ +Q+WANR  F
Subjt:  QQNSGDCGVFIVKFLEYDVTRSDLGSLSQEKMEFCRCQFVVQLWANRPFF

XP_038885861.1 sentrin-specific protease [Benincasa hispida]5.2e-2241.33Show/hide
Query:  HRKFVTTDVFVTIMTFLGVPLMLSTLGRNHWVLLCADFETGEFVLTDSLTTLNSDVDIAKQMNTVCTNFPRLLLRCDVMKEKPSLPTHSWRFRRKTQVPQ
        H K    DV  + +  + +P  LS   R HWVL+C DF+  E ++ DSL  L+ + D+  +M ++C NF  LL+   VM E  +L    W  RR   VPQ
Subjt:  HRKFVTTDVFVTIMTFLGVPLMLSTLGRNHWVLLCADFETGEFVLTDSLTTLNSDVDIAKQMNTVCTNFPRLLLRCDVMKEKPSLPTHSWRFRRKTQVPQ

Query:  QQNSGDCGVFIVKFLEYDVTRSDLGSLSQEKMEFCRCQFVVQLWANRPFF
        Q  SGDCG+F  KF EYDVT S + +L+Q++M++ R Q+ +Q+ ANR  F
Subjt:  QQNSGDCGVFIVKFLEYDVTRSDLGSLSQEKMEFCRCQFVVQLWANRPFF

TrEMBL top hitse value%identityAlignment
A0A438CGG4 Ubiquitin-like-specific protease ESD46.8e-1225.13Show/hide
Query:  RGEDGKEKDVDEAYD-----IKHITELESQPTTDLESHSITDMESQPTTDL-ESHSITDVKSQ-PRTDPVKFIAPKAEPVELGDEEVEKVIGPHKLVKRR
        R    KE D +  +D      + +T+ E +   D+      DME+     L E H   ++K      DPV  IA       +  E    VI PH L    
Subjt:  RGEDGKEKDVDEAYD-----IKHITELESQPTTDLESHSITDMESQPTTDL-ESHSITDVKSQ-PRTDPVKFIAPKAEPVELGDEEVEKVIGPHKLVKRR

Query:  GKRTRQI--SWKLRSPWVDTRPNGKRWK-----VKQYDLMRAIPEEYETKFQKWLDDPSSDGSEHKTVYAYRSKQCFQTLLTLSHWMSDEVIDSLFLFVR
         KR R+I  S  L+ P+ D     K  K     +  +D +  I EE    FQKW+ D    GS     Y +  K+ FQ+L     W++D  ID  F F R
Subjt:  GKRTRQI--SWKLRSPWVDTRPNGKRWK-----VKQYDLMRAIPEEYETKFQKWLDDPSSDGSEHKTVYAYRSKQCFQTLLTLSHWMSDEVIDSLFLFVR

Query:  KKMDTRPDLCHRKFVTTDVF-------------------------VTIMTFLGVPLMLS-------------TLGRNHWVLLCADFETGEFVLTDSLTTL
        K+    P L  +KF T D                           + I    G+  + S              +  +HWVL           + DSL  +
Subjt:  KKMDTRPDLCHRKFVTTDVF-------------------------VTIMTFLGVPLMLS-------------TLGRNHWVLLCADFETGEFVLTDSLTTL

Query:  NSDVDIAKQMNTVCTNFPRLL--LRCDVMKEKPSLPTHSWRFRRKTQVPQQQNSGDCGVFIVKFLEYDVTRSDLGSLSQEKMEFCRCQFVVQLW
        N++  +   +  +    P +L  +          +    W   R   +PQQ+N GDCG+F++K+ EY +    L SL+  +M++ R +   +L+
Subjt:  NSDVDIAKQMNTVCTNFPRLL--LRCDVMKEKPSLPTHSWRFRRKTQVPQQQNSGDCGVFIVKFLEYDVTRSDLGSLSQEKMEFCRCQFVVQLW

A0A5A7U549 Ulp1-like peptidase6.0e-1627.98Show/hide
Query:  YDLMRAIPEEYETKFQKWLDDPSSDGSEHKTVYAYRSKQCFQTLLTLSHWMSDEVIDSLFLFVRKKMDTRPDLCHRKFVTTDVFVTIMTFL---------
        YDLM  I +    + + W+ D  +D    +T +  +SK  F+ L     W++DE +D+LFLF+R K+        + F T D  + ++ ++         
Subjt:  YDLMRAIPEEYETKFQKWLDDPSSDGSEHKTVYAYRSKQCFQTLLTLSHWMSDEVIDSLFLFVRKKMDTRPDLCHRKFVTTDVFVTIMTFL---------

Query:  ---GVPLMLSTLG--RNHWVLLCADFETGEFVLTDSLTTLNSDVDIAKQMNTVCTNFPRLLLRCDVMKEKPSLPTHS--WRFRRKTQVPQQQNSGDCGVF
            V  + ST     NHWVLLC D  + +  + DSL +L +  ++   +  +    P LL        +    T+   W       +P Q+N+ DCGVF
Subjt:  ---GVPLMLSTLG--RNHWVLLCADFETGEFVLTDSLTTLNSDVDIAKQMNTVCTNFPRLLLRCDVMKEKPSLPTHS--WRFRRKTQVPQQQNSGDCGVF

Query:  IVKFLEYDVTRSDLGSLS
         +K+ EY     DL +LS
Subjt:  IVKFLEYDVTRSDLGSLS

A0A6J1CJT2 uncharacterized protein LOC1110120678.1e-2143.7Show/hide
Query:  NHWVLLCADFETGEFVLTDSLTTLNSDVDIAKQMNTVCTNFPRLLLRCDVMKEKPSLPTHSWRFRRKTQVPQQQNSGDCGVFIVKFLEYDVTRSDLGSLS
        N+WV++C DF  GE V+ DSL  +  D  + +Q+  + T  P LL +  V+   P+LP   WR RR T  PQQ  S DC +F VK+ EYDVT + L +L 
Subjt:  NHWVLLCADFETGEFVLTDSLTTLNSDVDIAKQMNTVCTNFPRLLLRCDVMKEKPSLPTHSWRFRRKTQVPQQQNSGDCGVFIVKFLEYDVTRSDLGSLS

Query:  QEKMEFCRCQFVVQLWANR
        Q  M + R QF  QLW+N+
Subjt:  QEKMEFCRCQFVVQLWANR

A0A6J1DLV0 uncharacterized protein LOC1110216466.6e-2330.95Show/hide
Query:  LFVRKKMDTRPDLCHRKFVTTDVFVT--IMTFLGVPLMLST-----------------------------------------------LGRNHWVLLCAD
        +FV  K+  RP+LC RKF T DV ++  + +  GV +M+ +                                               +G  HW+++C D
Subjt:  LFVRKKMDTRPDLCHRKFVTTDVFVT--IMTFLGVPLMLST-----------------------------------------------LGRNHWVLLCAD

Query:  FETGEFVLTDSLTTLNSDVDIAKQMNTVCTNFPRLLLRCDVMKEKPSLPTHSWRFRRKTQVPQQQNSGDCGVFIVKFLEYDVTRSDLGSLSQEKMEFCRC
        F+ GE ++ DS   +     + +++  + T  P L+ R  V   KP++P   WR RR +  PQQ   GDCG+F + F EYDVT     +L+Q +M F R 
Subjt:  FETGEFVLTDSLTTLNSDVDIAKQMNTVCTNFPRLLLRCDVMKEKPSLPTHSWRFRRKTQVPQQQNSGDCGVFIVKFLEYDVTRSDLGSLSQEKMEFCRC

Query:  QFVVQLWANR
        QF VQLWAN+
Subjt:  QFVVQLWANR

A0A6J1DQZ3 uncharacterized protein LOC1110234423.9e-2345.38Show/hide
Query:  NHWVLLCADFETGEFVLTDSLTTLNSDVDIAKQMNTVCTNFPRLLLRCDVMKEKPSLPTHSWRFRRKTQVPQQQNSGDCGVFIVKFLEYDVTRSDLGSLS
        NHWV++C DF  GE V+ DSL  + S   + +Q+  + T  P LL +  V+  +P+LP   WR RR T  P+Q +SGDCG+F VK+ EYDVT + L +L 
Subjt:  NHWVLLCADFETGEFVLTDSLTTLNSDVDIAKQMNTVCTNFPRLLLRCDVMKEKPSLPTHSWRFRRKTQVPQQQNSGDCGVFIVKFLEYDVTRSDLGSLS

Query:  QEKMEFCRCQFVVQLWANR
        Q  M + R QF  QLW+N+
Subjt:  QEKMEFCRCQFVVQLWANR

SwissProt top hitse value%identityAlignment
Q09353 Sentrin-specific protease7.1e-0623.33Show/hide
Query:  RGKRTRQISWKLRSPWVDTRPNGKRWKVKQYDLMRAIPEEYETKFQK-WLDDPSSDGSEHKTVYAYRSKQCFQTLLTLS--HWMSDEVIDSLFLFV--RK
        RG R   +  +L    +  RP  ++ KV   D   A+P+  +   ++ W    S      + V A+  + C + L TLS  HW++DE+I+     +  R 
Subjt:  RGKRTRQISWKLRSPWVDTRPNGKRWKVKQYDLMRAIPEEYETKFQK-WLDDPSSDGSEHKTVYAYRSKQCFQTLLTLS--HWMSDEVIDSLFLFV--RK

Query:  KMDTRPDLCHR-----------------KFVTTDVFVTIMTFLGVPLMLSTLGRNHWVLLCADFETGEFVLTDSLTTLNSDVDIAKQMNTVCTNFPRLLL
          D++    +                  K  T  V +     + VP+ L      HW +   D    +    DSL             NT      R  L
Subjt:  KMDTRPDLCHR-----------------KFVTTDVFVTIMTFLGVPLMLSTLGRNHWVLLCADFETGEFVLTDSLTTLNSDVDIAKQMNTVCTNFPRLLL

Query:  RCDVM-KEKPSLPTHSWRFRRKTQVPQQQNSGDCGVFIVKFLEYDVTRSDLGSLSQEKMEFCRCQFVVQL
          + + K+K ++    W  ++ T +P+QQN  DCGVF  +F E+  +R      +Q+ M + R + V ++
Subjt:  RCDVM-KEKPSLPTHSWRFRRKTQVPQQQNSGDCGVFIVKFLEYDVTRSDLGSLSQEKMEFCRCQFVVQL

Arabidopsis top hitse value%identityAlignment
AT5G45570.1 Ulp1 protease family protein1.7e-0727.35Show/hide
Query:  NHWVLLCADFETGEFVLTDSLTTLNSDVDIAKQMNTVCTNFPRLLLRCDVMKE-KPSLPTHSWRFRRKTQVPQQQNSGDCGVFIVKFLEYDVTRSDLGSL
        NHWV L  D       + DS+ +L +D ++A Q   V T  P +L      K+ + S     W  +R T++P+  + GDC ++ +K++E          L
Subjt:  NHWVLLCADFETGEFVLTDSLTTLNSDVDIAKQMNTVCTNFPRLLLRCDVMKE-KPSLPTHSWRFRRKTQVPQQQNSGDCGVFIVKFLEYDVTRSDLGSL

Query:  SQEKMEFCRCQFVVQLW
          E M+  R +  V+++
Subjt:  SQEKMEFCRCQFVVQLW


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTCTTCTGGGCCGATAGAAGAATCGCCGCAGACATCGCAGATAACTCGGCCATGGATTTCTCCGGTCCATGCTCTCGAACCCGAAACGAATATGGTGAGAATAATCA
AGCCCATCAACGTCTTCTTCCTCTGCTCTTCCATGGACGTGGTTTTGGGGCTCCAATGTCTTCTTCAGCTGTGGTTTCAGCTCTAAATTTTGATCTTCACTGCAGCGAAA
GGGAGAAATGTGTCGGTGTCGACAAAACTATTCTTGCCCTTAAAACTTCCTTTCCTCTCGACCGTTGGCTTCTTCTTCCTCACGCTCGACTATTGTTTCCTTGCAAATCT
TCTGCAGAAATCAAGCTTACAGCAACACAAGCTCTTCATCCTCTCGCTGTCGCCGTTGTTTTTCCTCCTCCCTCTGGCTCCGACACTAACGATGGAGTACACTACATCCT
CCTTAGGGAAGTAGAGGACAGTAGGGCAGATGTGATGAGTTTTAAATTATTGGATCAGAAGGTTTCCTTTGGTAAGAGAGAATTTGACCTCGTAACCAGCCTTCGTCATT
CGTTAAGACCAATGAGGAGAGATAGAGACGGTCCTCCCAATAGACTCCTAAGATTATATTTTAGGGAGAACGTAGGTATGAAGGTGGAGGAGTTAGATAAGTCGTTTCCG
AACCTTCAGTTTGAAAACGACGAAGATGCGGCACTAAAAGGGAAGCTGAGTCGTACAAAAGCAAGGGCGGTGGTCCAAAGAAGCGGAGACATACAGTTTGTACGGTTTCC
CGTTTGCTTTTCAGAGGAGACTACTGTAACCGATACTGAAACGTTGGATGCCATTGAAGCTTCTCCAGAAGTTTCAAATAAAAGAGGAAGGGAAAAAGAGGACAAAGACA
AAGGGAAAGAGAAAGAGAAGGAAGTAGAAAAAGAGGATGAGAGCATGAAGGAGAAGACAAAGAAGAAAAAGACGAAGACAAAGCAGACCTGTGAATGTAGCCATTGGATG
GAGAGTATGGATGCTCGCATGTCTGATATGGAGACATGTCTCAAGTCCATTACCAAGTTCTTACGTTGTCTCTCTAAGGGTAAATTCGTGGACCCTGAGAAGTACTTTGG
ACCGAAAGATGGTCCGGATGATGAAGGTGGTCCATCGAAAGGACTAAATGACGTGAGTGGTCCATCGAAAGGACCCGATGACAAGGGTGGTCCATCGAAAGGACCCGATG
ACAAGGGTGGACCAGATGACAATATGGAAGAAGGACGAGAAGGAGAAGGAAGCGGCAGAGGAGAGGACGGGAAGGAGAAGGACGTCGATGAGGCGTACGACATAAAACAT
ATTACAGAGTTGGAGTCTCAACCAACCACTGACTTAGAGTCTCACTCCATTACTGACATGGAGTCTCAACCAACCACTGACTTAGAGTCTCACTCAATTACTGACGTGAA
GTCTCAACCAAGAACAGACCCAGTCAAATTCATTGCACCTAAGGCTGAGCCTGTTGAGTTAGGTGATGAGGAGGTTGAAAAAGTAATCGGACCGCATAAATTGGTAAAAA
GACGAGGAAAGCGGACCCGACAAATTTCTTGGAAGCTTCGGTCTCCATGGGTTGACACCAGGCCAAATGGCAAAAGGTGGAAAGTTAAGCAATACGATCTCATGCGTGCC
ATTCCTGAGGAGTACGAGACCAAGTTTCAGAAATGGTTGGATGACCCATCGTCTGACGGATCAGAGCACAAGACAGTATATGCCTACAGAAGCAAACAATGCTTTCAGAC
ATTACTCACACTGTCTCATTGGATGAGTGATGAGGTGATTGACTCTCTGTTCCTCTTTGTTCGGAAGAAGATGGATACCCGACCTGACTTATGCCATCGAAAGTTTGTCA
CGACGGATGTATTTGTAACAATTATGACATTCCTTGGAGTTCCGTTGATGCTGTCTACATTAGGTAGAAACCATTGGGTTCTACTGTGCGCTGACTTTGAAACGGGCGAA
TTTGTGTTGACAGACTCCCTAACGACATTAAATTCAGATGTAGACATAGCGAAACAGATGAATACTGTATGCACCAATTTTCCTAGGCTGCTACTGAGGTGCGACGTTAT
GAAGGAGAAGCCGTCTCTTCCAACACATTCATGGCGATTCAGAAGGAAGACCCAAGTGCCACAACAACAAAATAGTGGGGATTGTGGGGTTTTCATTGTAAAGTTTTTGG
AATATGATGTAACTAGATCAGATTTAGGTAGTCTTAGTCAGGAGAAAATGGAGTTTTGTAGGTGTCAATTTGTTGTACAACTTTGGGCCAATAGGCCGTTTTTTTAG
mRNA sequenceShow/hide mRNA sequence
ATGGTCTTCTGGGCCGATAGAAGAATCGCCGCAGACATCGCAGATAACTCGGCCATGGATTTCTCCGGTCCATGCTCTCGAACCCGAAACGAATATGGTGAGAATAATCA
AGCCCATCAACGTCTTCTTCCTCTGCTCTTCCATGGACGTGGTTTTGGGGCTCCAATGTCTTCTTCAGCTGTGGTTTCAGCTCTAAATTTTGATCTTCACTGCAGCGAAA
GGGAGAAATGTGTCGGTGTCGACAAAACTATTCTTGCCCTTAAAACTTCCTTTCCTCTCGACCGTTGGCTTCTTCTTCCTCACGCTCGACTATTGTTTCCTTGCAAATCT
TCTGCAGAAATCAAGCTTACAGCAACACAAGCTCTTCATCCTCTCGCTGTCGCCGTTGTTTTTCCTCCTCCCTCTGGCTCCGACACTAACGATGGAGTACACTACATCCT
CCTTAGGGAAGTAGAGGACAGTAGGGCAGATGTGATGAGTTTTAAATTATTGGATCAGAAGGTTTCCTTTGGTAAGAGAGAATTTGACCTCGTAACCAGCCTTCGTCATT
CGTTAAGACCAATGAGGAGAGATAGAGACGGTCCTCCCAATAGACTCCTAAGATTATATTTTAGGGAGAACGTAGGTATGAAGGTGGAGGAGTTAGATAAGTCGTTTCCG
AACCTTCAGTTTGAAAACGACGAAGATGCGGCACTAAAAGGGAAGCTGAGTCGTACAAAAGCAAGGGCGGTGGTCCAAAGAAGCGGAGACATACAGTTTGTACGGTTTCC
CGTTTGCTTTTCAGAGGAGACTACTGTAACCGATACTGAAACGTTGGATGCCATTGAAGCTTCTCCAGAAGTTTCAAATAAAAGAGGAAGGGAAAAAGAGGACAAAGACA
AAGGGAAAGAGAAAGAGAAGGAAGTAGAAAAAGAGGATGAGAGCATGAAGGAGAAGACAAAGAAGAAAAAGACGAAGACAAAGCAGACCTGTGAATGTAGCCATTGGATG
GAGAGTATGGATGCTCGCATGTCTGATATGGAGACATGTCTCAAGTCCATTACCAAGTTCTTACGTTGTCTCTCTAAGGGTAAATTCGTGGACCCTGAGAAGTACTTTGG
ACCGAAAGATGGTCCGGATGATGAAGGTGGTCCATCGAAAGGACTAAATGACGTGAGTGGTCCATCGAAAGGACCCGATGACAAGGGTGGTCCATCGAAAGGACCCGATG
ACAAGGGTGGACCAGATGACAATATGGAAGAAGGACGAGAAGGAGAAGGAAGCGGCAGAGGAGAGGACGGGAAGGAGAAGGACGTCGATGAGGCGTACGACATAAAACAT
ATTACAGAGTTGGAGTCTCAACCAACCACTGACTTAGAGTCTCACTCCATTACTGACATGGAGTCTCAACCAACCACTGACTTAGAGTCTCACTCAATTACTGACGTGAA
GTCTCAACCAAGAACAGACCCAGTCAAATTCATTGCACCTAAGGCTGAGCCTGTTGAGTTAGGTGATGAGGAGGTTGAAAAAGTAATCGGACCGCATAAATTGGTAAAAA
GACGAGGAAAGCGGACCCGACAAATTTCTTGGAAGCTTCGGTCTCCATGGGTTGACACCAGGCCAAATGGCAAAAGGTGGAAAGTTAAGCAATACGATCTCATGCGTGCC
ATTCCTGAGGAGTACGAGACCAAGTTTCAGAAATGGTTGGATGACCCATCGTCTGACGGATCAGAGCACAAGACAGTATATGCCTACAGAAGCAAACAATGCTTTCAGAC
ATTACTCACACTGTCTCATTGGATGAGTGATGAGGTGATTGACTCTCTGTTCCTCTTTGTTCGGAAGAAGATGGATACCCGACCTGACTTATGCCATCGAAAGTTTGTCA
CGACGGATGTATTTGTAACAATTATGACATTCCTTGGAGTTCCGTTGATGCTGTCTACATTAGGTAGAAACCATTGGGTTCTACTGTGCGCTGACTTTGAAACGGGCGAA
TTTGTGTTGACAGACTCCCTAACGACATTAAATTCAGATGTAGACATAGCGAAACAGATGAATACTGTATGCACCAATTTTCCTAGGCTGCTACTGAGGTGCGACGTTAT
GAAGGAGAAGCCGTCTCTTCCAACACATTCATGGCGATTCAGAAGGAAGACCCAAGTGCCACAACAACAAAATAGTGGGGATTGTGGGGTTTTCATTGTAAAGTTTTTGG
AATATGATGTAACTAGATCAGATTTAGGTAGTCTTAGTCAGGAGAAAATGGAGTTTTGTAGGTGTCAATTTGTTGTACAACTTTGGGCCAATAGGCCGTTTTTTTAG
Protein sequenceShow/hide protein sequence
MVFWADRRIAADIADNSAMDFSGPCSRTRNEYGENNQAHQRLLPLLFHGRGFGAPMSSSAVVSALNFDLHCSEREKCVGVDKTILALKTSFPLDRWLLLPHARLLFPCKS
SAEIKLTATQALHPLAVAVVFPPPSGSDTNDGVHYILLREVEDSRADVMSFKLLDQKVSFGKREFDLVTSLRHSLRPMRRDRDGPPNRLLRLYFRENVGMKVEELDKSFP
NLQFENDEDAALKGKLSRTKARAVVQRSGDIQFVRFPVCFSEETTVTDTETLDAIEASPEVSNKRGREKEDKDKGKEKEKEVEKEDESMKEKTKKKKTKTKQTCECSHWM
ESMDARMSDMETCLKSITKFLRCLSKGKFVDPEKYFGPKDGPDDEGGPSKGLNDVSGPSKGPDDKGGPSKGPDDKGGPDDNMEEGREGEGSGRGEDGKEKDVDEAYDIKH
ITELESQPTTDLESHSITDMESQPTTDLESHSITDVKSQPRTDPVKFIAPKAEPVELGDEEVEKVIGPHKLVKRRGKRTRQISWKLRSPWVDTRPNGKRWKVKQYDLMRA
IPEEYETKFQKWLDDPSSDGSEHKTVYAYRSKQCFQTLLTLSHWMSDEVIDSLFLFVRKKMDTRPDLCHRKFVTTDVFVTIMTFLGVPLMLSTLGRNHWVLLCADFETGE
FVLTDSLTTLNSDVDIAKQMNTVCTNFPRLLLRCDVMKEKPSLPTHSWRFRRKTQVPQQQNSGDCGVFIVKFLEYDVTRSDLGSLSQEKMEFCRCQFVVQLWANRPFF