CuGenDBv2

Tan0008398 (gene) of Snake gourd v1 genome

Gene ID: Tan0008398
Organism: Trichosanthes anguina (Snake gourd v1)
Description: Ulp1-like peptidase
Genome location: LG11:23127091..23129073
RNA-Seq Expression: Tan0008398
Synteny: Tan0008398
Gene Ontology terms:
    GO:0006508 - proteolysis (biological process)
    GO:0008234 - cysteine-type peptidase activity (molecular function)
InterPro domains:
    IPR003653 - Ulp1 protease family, C-terminal catalytic domain
    IPR038765 - Papain-like cysteine peptidase superfamily
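
The genome location above follows a `sequence:start..end` pattern. A minimal Python sketch of splitting such a string and computing the span of the locus follows. The function name `parse_location` is hypothetical, and the span calculation assumes 1-based, inclusive coordinates, which this page does not state explicitly.

```python
def parse_location(location):
    """Split a string such as 'LG11:23127091..23129073' into (sequence, start, end)."""
    seq_id, _, coords = location.partition(":")
    start_text, _, end_text = coords.partition("..")
    return seq_id, int(start_text), int(end_text)


seq_id, start, end = parse_location("LG11:23127091..23129073")
# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
span = end - start + 1
print(seq_id, start, end, span)
```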


Homology
GenBank top hits | e value | % identity
KAA0044249.1 Ulp1-like peptidase [Cucumis melo var. makuwa] | 1.9e-31 | 31.03
Query:  RKGGEEEEEKKKMQVLQVE----VDSDSVQEVVPLGEDLPR-RGKRKRRPTYKLRSPWTNMKPDGKKRSVK-----------VYDPLCPIPERMDTKFQK
        +K G+E+  +KK+ +   E    VD + V E+ P     P  R  R++R +  L +P+T +     KRS+            VYDP+  I +    + + 
Subjt:  RKGGEEEEEKKKMQVLQVE----VDSDSVQEVVPLGEDLPR-RGKRKRRPTYKLRSPWTNMKPDGKKRSVK-----------VYDPLCPIPERMDTKFQK

Query:  WMDDPRTDNEERINVYASRNKMWFQTLLTSSQWTSDEVIDVLFLFIRTKASIHPNLCRQKFVIADIVITDFLRRSDVYDHFNSSDPWKEAFNWNRFNSVR
        W+ D RTD+E R   +  ++K +F+ L    +W SDE +D LFLFIR K         Q F  AD +    L     +  +         F+W+    + 
Subjt:  WMDDPRTDNEERINVYASRNKMWFQTLLTSSQWTSDEVIDVLFLFIRTKASIHPNLCRQKFVIADIVITDFLRRSDVYDHFNSSDPWKEAFNWNRFNSVR

Query:  DYVSGSHSDHTIPWKT-------------HWILICVDLHVGELVIFDSLVTLCTDTELEEELRMISTTLPHLLVMGDVME--VRPTLPKHPWKIRRRTDV
        DYV GS  D   PW +             HW+L+C+DL   ++ ++DSL +L T  E+   L  I   +P LL      +   R +  K PW +     +
Subjt:  DYVSGSHSDHTIPWKT-------------HWILICVDLHVGELVIFDSLVTLCTDTELEEELRMISTTLPHLLVMGDVME--VRPTLPKHPWKIRRRTDV

Query:  PQQVNSGDCGMFCVKFFEYDVTASPLDTLSQACIDFCRRQFAVQLWAN
        P Q N+ DCG+F +K+FEY  T   LDTL Q  + + R+Q A QLW N
Subjt:  PQQVNSGDCGMFCVKFFEYDVTASPLDTLSQACIDFCRRQFAVQLWAN

KAA0052752.1 Ulp1-like peptidase [Cucumis melo var. makuwa] | 4.4e-33 | 30.77
Query:  RKGGEEEEEKKKMQVLQVE----VDSDSVQEVVPLGEDLPR-RGKRKRRPTYKLRSPWTNMKPDGKKRSVK-----------VYDPLCPIPERMDTKFQK
        +K G+E+  +KK+ +   E    VD + V E+ P     P  R  R++R +  L +P+T++     KRSV            VYDP+  I +    + + 
Subjt:  RKGGEEEEEKKKMQVLQVE----VDSDSVQEVVPLGEDLPR-RGKRKRRPTYKLRSPWTNMKPDGKKRSVK-----------VYDPLCPIPERMDTKFQK

Query:  WMDDPRTDNEERINVYASRNKMWFQTLLTSSQWTSDEVIDVLFLFIRTKASIHPNLCRQKFVIADIVITDFLRRSDVYDH---FNSSDPWKEAFNWNRFN
        W+ D RTD+E R   +  ++K +F+ L    +W +DE +D LFLFIR K         Q F  AD +   + RR  ++     +         F+W+   
Subjt:  WMDDPRTDNEERINVYASRNKMWFQTLLTSSQWTSDEVIDVLFLFIRTKASIHPNLCRQKFVIADIVITDFLRRSDVYDH---FNSSDPWKEAFNWNRFN

Query:  SVRDYVSGSHSDHTIPWKT-------------HWILICVDLHVGELVIFDSLVTLCTDTELEEELRMISTTLPHLLVMGDVME--VRPTLPKHPWKIRRR
         + DYV GS  D   PW +             HW+L+C+DL + ++ ++DSL +L T  E+   L  I   +P LL      +   R +  K PW +   
Subjt:  SVRDYVSGSHSDHTIPWKT-------------HWILICVDLHVGELVIFDSLVTLCTDTELEEELRMISTTLPHLLVMGDVME--VRPTLPKHPWKIRRR

Query:  TDVPQQVNSGDCGMFCVKFFEYDVTASPLDTLSQACIDFCRRQFAVQLWAN
          +P Q N+ DCG+F +K+FEY      LDTL Q  + + R+Q A QLW N
Subjt:  TDVPQQVNSGDCGMFCVKFFEYDVTASPLDTLSQACIDFCRRQFAVQLWAN

XP_022154364.1 uncharacterized protein LOC111021646 [Momordica charantia] | 2.3e-37 | 40.48
Query:  LFIRTKASIHPNLCRQKFVIADIVITDFLRRSD-VYDHFNS----SDPWKEAFNW-NRFNSVRDYVSGSHSDHTIPWK-------------THWILICVD
        +F+  K  + PNLCR+KF   D++I++FLR +D VY    S    +      ++W  R  S+  Y+ G+HSD+   W               HWI+IC+D
Subjt:  LFIRTKASIHPNLCRQKFVIADIVITDFLRRSD-VYDHFNS----SDPWKEAFNW-NRFNSVRDYVSGSHSDHTIPWK-------------THWILICVD

Query:  LHVGELVIFDSLVTLCTDTELEEELRMISTTLPHLLVMGDVMEVRPTLPKHPWKIRRRTDVPQQVNSGDCGMFCVKFFEYDVTASPLDTLSQACIDFCRR
           GEL+++DS + +    +LE+EL+ + T +P L+    V   +P +P  PW+IRR +  PQQ   GDCG+FC+ FFEYDVT+   DTL+Q+ + F RR
Subjt:  LHVGELVIFDSLVTLCTDTELEEELRMISTTLPHLLVMGDVMEVRPTLPKHPWKIRRRTDVPQQVNSGDCGMFCVKFFEYDVTASPLDTLSQACIDFCRR

Query:  QFAVQLWANR
        QFAVQLWAN+
Subjt:  QFAVQLWANR

XP_022156568.1 uncharacterized protein LOC111023442 [Momordica charantia] | 3.2e-31 | 45.89
Query:  YVSGSHSDHTIPWK-------------THWILICVDLHVGELVIFDSLVTLCTDTELEEELRMISTTLPHLLVMGDVMEVRPTLPKHPWKIRRRTDVPQQ
        Y+  SHSD+ + W+              HW++IC+D   GE+V++DSL  + +   LEE+L+++ T +P LL    V+ VRP LP  PW+IRR T  P+Q
Subjt:  YVSGSHSDHTIPWK-------------THWILICVDLHVGELVIFDSLVTLCTDTELEEELRMISTTLPHLLVMGDVMEVRPTLPKHPWKIRRRTDVPQQ

Query:  VNSGDCGMFCVKFFEYDVTASPLDTLSQACIDFCRRQFAVQLWANR
         +SGDCG+FCVK+FEYDVT + L+TL Q  + + RRQFA QLW+N+
Subjt:  VNSGDCGMFCVKFFEYDVTASPLDTLSQACIDFCRRQFAVQLWANR

XP_022158807.1 uncharacterized protein LOC111025273 [Momordica charantia] | 1.7e-37 | 37.45
Query:  MDDPRTDNEERINVYASRNKMWFQTLLTSSQWTSDEVIDVLFLFIRTKASIHPNLCRQKFVIADIVITDFLRRSD-VYDHFNSSD-PWKEAFNWNRFNSV
        MDDP TD+  R      + K WF  LL       DE ID L +    K     +L R +F I D+++++ LRR+D  Y        P K  ++W +  ++
Subjt:  MDDPRTDNEERINVYASRNKMWFQTLLTSSQWTSDEVIDVLFLFIRTKASIHPNLCRQKFVIADIVITDFLRRSD-VYDHFNSSD-PWKEAFNWNRFNSV

Query:  RDYVSGSHSDHTIPWK-------------THWILICVDLHVGELVIFDSLVTLCTDTELEEELRMISTTLPHLLVMGDVMEVRPTLPKHPWKIRRRTDVP
          YV G  SD+   W               HW++I +DL  G+L ++DSL  +    +LE+ L+ + T +P +L    ++ +RP LP  PW++RR T VP
Subjt:  RDYVSGSHSDHTIPWK-------------THWILICVDLHVGELVIFDSLVTLCTDTELEEELRMISTTLPHLLVMGDVMEVRPTLPKHPWKIRRRTDVP

Query:  QQVNSGDCGMFCVKFFEYDVTASPLDTLSQACIDFCRRQFAVQLWANRVFF
        QQ    DC +FCV+FFEYDV  S +DTL Q+ I   RRQ+AVQ+WA R FF
Subjt:  QQVNSGDCGMFCVKFFEYDVTASPLDTLSQACIDFCRRQFAVQLWANRVFF

TrEMBL top hits | e value | % identity
A0A5A7TSP7 Ulp1-like peptidase | 9.0e-32 | 31.03
Query:  RKGGEEEEEKKKMQVLQVE----VDSDSVQEVVPLGEDLPR-RGKRKRRPTYKLRSPWTNMKPDGKKRSVK-----------VYDPLCPIPERMDTKFQK
        +K G+E+  +KK+ +   E    VD + V E+ P     P  R  R++R +  L +P+T +     KRS+            VYDP+  I +    + + 
Subjt:  RKGGEEEEEKKKMQVLQVE----VDSDSVQEVVPLGEDLPR-RGKRKRRPTYKLRSPWTNMKPDGKKRSVK-----------VYDPLCPIPERMDTKFQK

Query:  WMDDPRTDNEERINVYASRNKMWFQTLLTSSQWTSDEVIDVLFLFIRTKASIHPNLCRQKFVIADIVITDFLRRSDVYDHFNSSDPWKEAFNWNRFNSVR
        W+ D RTD+E R   +  ++K +F+ L    +W SDE +D LFLFIR K         Q F  AD +    L     +  +         F+W+    + 
Subjt:  WMDDPRTDNEERINVYASRNKMWFQTLLTSSQWTSDEVIDVLFLFIRTKASIHPNLCRQKFVIADIVITDFLRRSDVYDHFNSSDPWKEAFNWNRFNSVR

Query:  DYVSGSHSDHTIPWKT-------------HWILICVDLHVGELVIFDSLVTLCTDTELEEELRMISTTLPHLLVMGDVME--VRPTLPKHPWKIRRRTDV
        DYV GS  D   PW +             HW+L+C+DL   ++ ++DSL +L T  E+   L  I   +P LL      +   R +  K PW +     +
Subjt:  DYVSGSHSDHTIPWKT-------------HWILICVDLHVGELVIFDSLVTLCTDTELEEELRMISTTLPHLLVMGDVME--VRPTLPKHPWKIRRRTDV

Query:  PQQVNSGDCGMFCVKFFEYDVTASPLDTLSQACIDFCRRQFAVQLWAN
        P Q N+ DCG+F +K+FEY  T   LDTL Q  + + R+Q A QLW N
Subjt:  PQQVNSGDCGMFCVKFFEYDVTASPLDTLSQACIDFCRRQFAVQLWAN

A0A5A7TVI1 Ulp1-like peptidase | 1.5e-31 | 30.23
Query:  RKGGEEEEEKKKMQVLQVE----VDSDSVQEVVPLGEDLPR-RGKRKRRPTYKLRSPWTNM-------KPDGKKRSVKVYDPLCPIPERMDTKFQKWMDD
        +K G+E   +KK+ +   E    VD + V E+ P     P  R  R++R +  L +P+T +            +    VYDP+  I +    + + W+ D
Subjt:  RKGGEEEEEKKKMQVLQVE----VDSDSVQEVVPLGEDLPR-RGKRKRRPTYKLRSPWTNM-------KPDGKKRSVKVYDPLCPIPERMDTKFQKWMDD

Query:  PRTDNEERINVYASRNKMWFQTLLTSSQWTSDEVIDVLFLFIRTKASIHPNLCRQKFVIADIVITDFLRRSDVYDHFNSSDPWKEAFNWNRFNSVRDYVS
         RTD+E R   +  ++K +F+ L    +W +DE +D LFLFI  K         Q F  AD +    L     +  +         F+W+    + DYV 
Subjt:  PRTDNEERINVYASRNKMWFQTLLTSSQWTSDEVIDVLFLFIRTKASIHPNLCRQKFVIADIVITDFLRRSDVYDHFNSSDPWKEAFNWNRFNSVRDYVS

Query:  GSHSDHTIPWKT-------------HWILICVDLHVGELVIFDSLVTLCTDTELEEELRMISTTLPHLLVMGDVME--VRPTLPKHPWKIRRRTDVPQQV
        GS  D   PW +             HW+L+C+DL   ++ ++DSL +L T  E+   L  I   +P LL      +   R +  K PW +     +P Q 
Subjt:  GSHSDHTIPWKT-------------HWILICVDLHVGELVIFDSLVTLCTDTELEEELRMISTTLPHLLVMGDVME--VRPTLPKHPWKIRRRTDVPQQV

Query:  NSGDCGMFCVKFFEYDVTASPLDTLSQACIDFCRRQFAVQLWAN
        N+ DCG+F +K+FEY V    LDTL Q  + + R+QFA QLW N
Subjt:  NSGDCGMFCVKFFEYDVTASPLDTLSQACIDFCRRQFAVQLWAN

A0A5A7UAG5 Ulp1-like peptidase | 2.1e-33 | 30.77
Query:  RKGGEEEEEKKKMQVLQVE----VDSDSVQEVVPLGEDLPR-RGKRKRRPTYKLRSPWTNMKPDGKKRSVK-----------VYDPLCPIPERMDTKFQK
        +K G+E+  +KK+ +   E    VD + V E+ P     P  R  R++R +  L +P+T++     KRSV            VYDP+  I +    + + 
Subjt:  RKGGEEEEEKKKMQVLQVE----VDSDSVQEVVPLGEDLPR-RGKRKRRPTYKLRSPWTNMKPDGKKRSVK-----------VYDPLCPIPERMDTKFQK

Query:  WMDDPRTDNEERINVYASRNKMWFQTLLTSSQWTSDEVIDVLFLFIRTKASIHPNLCRQKFVIADIVITDFLRRSDVYDH---FNSSDPWKEAFNWNRFN
        W+ D RTD+E R   +  ++K +F+ L    +W +DE +D LFLFIR K         Q F  AD +   + RR  ++     +         F+W+   
Subjt:  WMDDPRTDNEERINVYASRNKMWFQTLLTSSQWTSDEVIDVLFLFIRTKASIHPNLCRQKFVIADIVITDFLRRSDVYDH---FNSSDPWKEAFNWNRFN

Query:  SVRDYVSGSHSDHTIPWKT-------------HWILICVDLHVGELVIFDSLVTLCTDTELEEELRMISTTLPHLLVMGDVME--VRPTLPKHPWKIRRR
         + DYV GS  D   PW +             HW+L+C+DL + ++ ++DSL +L T  E+   L  I   +P LL      +   R +  K PW +   
Subjt:  SVRDYVSGSHSDHTIPWKT-------------HWILICVDLHVGELVIFDSLVTLCTDTELEEELRMISTTLPHLLVMGDVME--VRPTLPKHPWKIRRR

Query:  TDVPQQVNSGDCGMFCVKFFEYDVTASPLDTLSQACIDFCRRQFAVQLWAN
          +P Q N+ DCG+F +K+FEY      LDTL Q  + + R+Q A QLW N
Subjt:  TDVPQQVNSGDCGMFCVKFFEYDVTASPLDTLSQACIDFCRRQFAVQLWAN

A0A6J1DLV0 uncharacterized protein LOC111021646 | 1.1e-37 | 40.48
Query:  LFIRTKASIHPNLCRQKFVIADIVITDFLRRSD-VYDHFNS----SDPWKEAFNW-NRFNSVRDYVSGSHSDHTIPWK-------------THWILICVD
        +F+  K  + PNLCR+KF   D++I++FLR +D VY    S    +      ++W  R  S+  Y+ G+HSD+   W               HWI+IC+D
Subjt:  LFIRTKASIHPNLCRQKFVIADIVITDFLRRSD-VYDHFNS----SDPWKEAFNW-NRFNSVRDYVSGSHSDHTIPWK-------------THWILICVD

Query:  LHVGELVIFDSLVTLCTDTELEEELRMISTTLPHLLVMGDVMEVRPTLPKHPWKIRRRTDVPQQVNSGDCGMFCVKFFEYDVTASPLDTLSQACIDFCRR
           GEL+++DS + +    +LE+EL+ + T +P L+    V   +P +P  PW+IRR +  PQQ   GDCG+FC+ FFEYDVT+   DTL+Q+ + F RR
Subjt:  LHVGELVIFDSLVTLCTDTELEEELRMISTTLPHLLVMGDVMEVRPTLPKHPWKIRRRTDVPQQVNSGDCGMFCVKFFEYDVTASPLDTLSQACIDFCRR

Query:  QFAVQLWANR
        QFAVQLWAN+
Subjt:  QFAVQLWANR

A0A6J1DY60 uncharacterized protein LOC111025273 | 8.4e-38 | 37.45
Query:  MDDPRTDNEERINVYASRNKMWFQTLLTSSQWTSDEVIDVLFLFIRTKASIHPNLCRQKFVIADIVITDFLRRSD-VYDHFNSSD-PWKEAFNWNRFNSV
        MDDP TD+  R      + K WF  LL       DE ID L +    K     +L R +F I D+++++ LRR+D  Y        P K  ++W +  ++
Subjt:  MDDPRTDNEERINVYASRNKMWFQTLLTSSQWTSDEVIDVLFLFIRTKASIHPNLCRQKFVIADIVITDFLRRSD-VYDHFNSSD-PWKEAFNWNRFNSV

Query:  RDYVSGSHSDHTIPWK-------------THWILICVDLHVGELVIFDSLVTLCTDTELEEELRMISTTLPHLLVMGDVMEVRPTLPKHPWKIRRRTDVP
          YV G  SD+   W               HW++I +DL  G+L ++DSL  +    +LE+ L+ + T +P +L    ++ +RP LP  PW++RR T VP
Subjt:  RDYVSGSHSDHTIPWK-------------THWILICVDLHVGELVIFDSLVTLCTDTELEEELRMISTTLPHLLVMGDVMEVRPTLPKHPWKIRRRTDVP

Query:  QQVNSGDCGMFCVKFFEYDVTASPLDTLSQACIDFCRRQFAVQLWANRVFF
        QQ    DC +FCV+FFEYDV  S +DTL Q+ I   RRQ+AVQ+WA R FF
Subjt:  QQVNSGDCGMFCVKFFEYDVTASPLDTLSQACIDFCRRQFAVQLWANRVFF

SwissProt top hits | e value | % identity
No hits found

Arabidopsis top hits | e value | % identity
AT4G08430.1 Ulp1 protease family protein | 3.8e-06 | 25
Query:  HWILICVDLHVGELVIFDSLVTLCTDTELEEELRMISTTLPHLL-VMGDVMEVRPTLPKHPWKIRRRTDVPQQVNSGDCGMFCVKFFEYDVTASPLDTLS
        HW+ + +DL    + ++DS+ +L TDTE+  +   + T +P +L       + R +  K  WK  R T +P+ +++ DC ++ +K+ E        D L 
Subjt:  HWILICVDLHVGELVIFDSLVTLCTDTELEEELRMISTTLPHLL-VMGDVMEVRPTLPKHPWKIRRRTDVPQQVNSGDCGMFCVKFFEYDVTASPLDTLS

Query:  QACIDFCRRQFAVQLW
           +     + AV+++
Subjt:  QACIDFCRRQFAVQLW

AT5G45570.1 Ulp1 protease family protein | 9.0e-08 | 26.72
Query:  HWILICVDLHVGELVIFDSLVTLCTDTELEEELRMISTTLPHLL-VMGDVMEVRPTLPKHPWKIRRRTDVPQQVNSGDCGMFCVKFFEYDVTASPLDTLS
        HW+ + +DL    + ++DS+ +L TDTE+  +   + T +P +L       + R +  K  WK  R T +P+ ++ GDC ++ +K+ E        D L 
Subjt:  HWILICVDLHVGELVIFDSLVTLCTDTELEEELRMISTTLPHLL-VMGDVMEVRPTLPKHPWKIRRRTDVPQQVNSGDCGMFCVKFFEYDVTASPLDTLS

Query:  QACIDFCRRQFAVQLW
           +   R + AV+++
Subjt:  QACIDFCRRQFAVQLW


Sequences
CDS sequence
ATGGACCGCATAGTTGAATGCCCACGGGTACCACTACTTGCCCCTAGTGCTGAGGGCACTGAAATGCAAGAGGACACTCAATTGGATGAGGTCCCTGAGACGGCGGGTCG
TGAAGCAGATCAGGAACCATGCCTTGTAGCAACCTCACAGGATACGCCTAATATGTGTAAAAGTAAAGAGGTACTTCACGAACAGGGTCAAAATGGATGTAAGAGAAGAA
GGGGAAGAAGAAAAGGAGGAGAAGAGGAAGAAGAAAAGAAGAAGATGCAAGTGCTCCAAGTGGAGGTTGATTCGGACTCAGTTCAGGAGGTCGTACCACTTGGTGAAGAT
TTACCTCGCCGAGGTAAGCGTAAGCGACGTCCAACATATAAGCTACGTTCACCATGGACAAACATGAAGCCAGATGGGAAGAAGAGAAGTGTTAAAGTGTACGACCCACT
ATGCCCTATACCTGAACGTATGGATACTAAATTTCAAAAGTGGATGGACGATCCAAGAACAGACAATGAAGAACGCATCAACGTGTACGCTTCTCGCAACAAAATGTGGT
TCCAAACCCTTTTGACGTCATCCCAGTGGACGAGCGACGAGGTTATTGACGTTTTGTTCCTCTTCATCCGAACGAAAGCGAGCATTCATCCAAACCTATGTCGTCAGAAG
TTCGTCATTGCAGATATAGTCATTACGGACTTTTTGAGACGATCTGATGTCTACGACCACTTTAATAGTAGTGATCCATGGAAGGAAGCGTTCAACTGGAATCGATTCAA
CAGCGTTAGAGACTACGTTAGTGGCAGTCATTCAGACCATACGATTCCATGGAAAACGCATTGGATACTAATATGTGTCGATCTCCATGTCGGTGAGCTCGTCATCTTCG
ATTCTCTTGTCACACTATGTACGGACACAGAGTTAGAGGAGGAGTTAAGGATGATCAGCACCACACTACCACATCTACTAGTTATGGGTGACGTCATGGAAGTACGACCG
ACTCTCCCCAAACACCCTTGGAAGATTCGTAGACGGACTGATGTCCCGCAACAGGTTAATAGTGGAGATTGTGGGATGTTTTGTGTTAAATTTTTTGAATATGATGTAAC
GGCCTCGCCTTTAGATACACTTAGTCAGGCTTGCATAGACTTTTGTAGACGTCAATTCGCTGTACAACTATGGGCTAACCGCGTTTTTTTTAGCAATTAA
mRNA sequence
ATGGACCGCATAGTTGAATGCCCACGGGTACCACTACTTGCCCCTAGTGCTGAGGGCACTGAAATGCAAGAGGACACTCAATTGGATGAGGTCCCTGAGACGGCGGGTCG
TGAAGCAGATCAGGAACCATGCCTTGTAGCAACCTCACAGGATACGCCTAATATGTGTAAAAGTAAAGAGGTACTTCACGAACAGGGTCAAAATGGATGTAAGAGAAGAA
GGGGAAGAAGAAAAGGAGGAGAAGAGGAAGAAGAAAAGAAGAAGATGCAAGTGCTCCAAGTGGAGGTTGATTCGGACTCAGTTCAGGAGGTCGTACCACTTGGTGAAGAT
TTACCTCGCCGAGGTAAGCGTAAGCGACGTCCAACATATAAGCTACGTTCACCATGGACAAACATGAAGCCAGATGGGAAGAAGAGAAGTGTTAAAGTGTACGACCCACT
ATGCCCTATACCTGAACGTATGGATACTAAATTTCAAAAGTGGATGGACGATCCAAGAACAGACAATGAAGAACGCATCAACGTGTACGCTTCTCGCAACAAAATGTGGT
TCCAAACCCTTTTGACGTCATCCCAGTGGACGAGCGACGAGGTTATTGACGTTTTGTTCCTCTTCATCCGAACGAAAGCGAGCATTCATCCAAACCTATGTCGTCAGAAG
TTCGTCATTGCAGATATAGTCATTACGGACTTTTTGAGACGATCTGATGTCTACGACCACTTTAATAGTAGTGATCCATGGAAGGAAGCGTTCAACTGGAATCGATTCAA
CAGCGTTAGAGACTACGTTAGTGGCAGTCATTCAGACCATACGATTCCATGGAAAACGCATTGGATACTAATATGTGTCGATCTCCATGTCGGTGAGCTCGTCATCTTCG
ATTCTCTTGTCACACTATGTACGGACACAGAGTTAGAGGAGGAGTTAAGGATGATCAGCACCACACTACCACATCTACTAGTTATGGGTGACGTCATGGAAGTACGACCG
ACTCTCCCCAAACACCCTTGGAAGATTCGTAGACGGACTGATGTCCCGCAACAGGTTAATAGTGGAGATTGTGGGATGTTTTGTGTTAAATTTTTTGAATATGATGTAAC
GGCCTCGCCTTTAGATACACTTAGTCAGGCTTGCATAGACTTTTGTAGACGTCAATTCGCTGTACAACTATGGGCTAACCGCGTTTTTTTTAGCAATTAA
Protein sequence
MDRIVECPRVPLLAPSAEGTEMQEDTQLDEVPETAGREADQEPCLVATSQDTPNMCKSKEVLHEQGQNGCKRRRGRRKGGEEEEEKKKMQVLQVEVDSDSVQEVVPLGED
LPRRGKRKRRPTYKLRSPWTNMKPDGKKRSVKVYDPLCPIPERMDTKFQKWMDDPRTDNEERINVYASRNKMWFQTLLTSSQWTSDEVIDVLFLFIRTKASIHPNLCRQK
FVIADIVITDFLRRSDVYDHFNSSDPWKEAFNWNRFNSVRDYVSGSHSDHTIPWKTHWILICVDLHVGELVIFDSLVTLCTDTELEEELRMISTTLPHLLVMGDVMEVRP
TLPKHPWKIRRRTDVPQQVNSGDCGMFCVKFFEYDVTASPLDTLSQACIDFCRRQFAVQLWANRVFFSN
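
The protein sequence can be cross-checked against the CDS by translating it codon by codon. A minimal Python sketch follows, assuming the standard nuclear genetic code (the page does not state which table was used). The names `CODON_TABLE` and `translate` are hypothetical, and only the first 30 and last 9 nucleotides of the CDS are used as input here.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    usable = len(cds) - len(cds) % 3  # ignore a trailing partial codon
    for i in range(0, usable, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # 'X' for unrecognised codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


print(translate("ATGGACCGCATAGTTGAATGCCCACGGGTA"))  # start of the CDS
print(translate("AGCAATTAA"))                       # end of the CDS
```

The two fragments translate to `MDRIVECPRV` and `SN`, matching the start and end of the protein sequence listed above.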