; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0004936 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0004936
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionUlp1-like peptidase
Genome locationchr6:8723711..8725861
RNA-Seq ExpressionLag0004936
SyntenyLag0004936
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0008234 - cysteine-type peptidase activity (molecular function)
InterPro domainsIPR003653 - Ulp1 protease family, C-terminal catalytic domain
IPR038765 - Papain-like cysteine peptidase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022154364.1 uncharacterized protein LOC111021646 [Momordica charantia]2.1e-4039.52Show/hide
Query:  LFVRKKMDTRPDLCHRKFVTADVFVTEFSRRGDVYEELLRS-----DHENDTFDW-SRFKTVTNYIMGEHTDYDVHWSSVDAVYMPFNLGRNHWVLLCAD
        +FV  K+  RP+LC RKF T DV ++ F R  D    +++S           +DW  R  ++ +YI G H+D D  W  VDAVY+P+N+G  HW+++C D
Subjt:  LFVRKKMDTRPDLCHRKFVTADVFVTEFSRRGDVYEELLRS-----DHENDTFDW-SRFKTVTNYIMGEHTDYDVHWSSVDAVYMPFNLGRNHWVLLCAD

Query:  FETSEFVLTDSLTTLNSDANIAKQMNTVCTIFPRLLLRCDVMKEKPSLPTHLWRFRRKTQVPQQQDSGDCGVFTVKFLEYDVIRSDLGSLSQDKMKFCRR
        F+  E ++ DS   +     + +++  + TI P L+ R  V   KP++P   WR RR +  PQQ   GDCG+F + F EYDV      +L+Q +M F RR
Subjt:  FETSEFVLTDSLTTLNSDANIAKQMNTVCTIFPRLLLRCDVMKEKPSLPTHLWRFRRKTQVPQQQDSGDCGVFTVKFLEYDVIRSDLGSLSQDKMKFCRR

Query:  QFVVQLWANR
        QF VQLWAN+
Subjt:  QFVVQLWANR

XP_022156568.1 uncharacterized protein LOC111023442 [Momordica charantia]1.2e-3245.89Show/hide
Query:  YIMGEHTDYDVHWSSVDAVYMPFNLGRNHWVLLCADFETSEFVLTDSLTTLNSDANIAKQMNTVCTIFPRLLLRCDVMKEKPSLPTHLWRFRRKTQVPQQ
        YI   H+DY + W  V+AVY+PFN+  NHWV++C DF   E V+ DSL  + S A++ +Q+  + T+ P LL +  V+  +P+LP   WR RR T  P+Q
Subjt:  YIMGEHTDYDVHWSSVDAVYMPFNLGRNHWVLLCADFETSEFVLTDSLTTLNSDANIAKQMNTVCTIFPRLLLRCDVMKEKPSLPTHLWRFRRKTQVPQQ

Query:  QDSGDCGVFTVKFLEYDVIRSDLGSLSQDKMKFCRRQFVVQLWANR
          SGDCG+F VK+ EYDV  + L +L Q+ M + RRQF  QLW+N+
Subjt:  QDSGDCGVFTVKFLEYDVIRSDLGSLSQDKMKFCRRQFVVQLWANR

XP_022158807.1 uncharacterized protein LOC111025273 [Momordica charantia]3.0e-3433.47Show/hide
Query:  LDDPLSDGSERKT-----------------------VIDSLFLFVRKKMDTRPDLCHRKFVTADVFVTEFSRRGDVYEELLRSD--HENDTFDWSRFKTV
        +DDP +D + R T                        IDSL +   +K++    L   +F   DV ++   RR D     ++        T+DW + +T+
Subjt:  LDDPLSDGSERKT-----------------------VIDSLFLFVRKKMDTRPDLCHRKFVTADVFVTEFSRRGDVYEELLRSD--HENDTFDWSRFKTV

Query:  TNYIMGEHTDYDVHWSSVDAVYMPFNLGRNHWVLLCADFETSEFVLTDSLTTLNSDANIAKQMNTVCTIFPRLLLRCDVMKEKPSLPTHLWRFRRKTQVP
          Y++G  +DYD  WS  D VY   N+G NHWV++  D    +  + DSL  +    ++ K +  +CTI P +L    ++  +P+LP   WR RR T VP
Subjt:  TNYIMGEHTDYDVHWSSVDAVYMPFNLGRNHWVLLCADFETSEFVLTDSLTTLNSDANIAKQMNTVCTIFPRLLLRCDVMKEKPSLPTHLWRFRRKTQVP

Query:  QQQDSGDCGVFTVKFLEYDVIRSDLGSLSQDKMKFCRRQFVVQLWANRPFF
        QQ    DC +F V+F EYDVI S + +L Q  +   RRQ+ VQ+WA RPFF
Subjt:  QQQDSGDCGVFTVKFLEYDVIRSDLGSLSQDKMKFCRRQFVVQLWANRPFF

XP_038882332.1 uncharacterized protein LOC120073583 [Benincasa hispida]3.6e-4050.58Show/hide
Query:  EELLRSDHENDTFDWSRFKTVTNYIMGEHTDYDVHWSSVDAVYMPFNLGRNHWVLLCADFETSEFVLTDSLTTLNSDANIAKQMNTVCTIFPRLLLRCDV
        E  LR D    T DWS  K V  Y+ G+HTDYDV WS+VDAVYMPFNL   HWVL+CADF+  E ++ DSL  L+ +A++  +M  VC  FP LL+   V
Subjt:  EELLRSDHENDTFDWSRFKTVTNYIMGEHTDYDVHWSSVDAVYMPFNLGRNHWVLLCADFETSEFVLTDSLTTLNSDANIAKQMNTVCTIFPRLLLRCDV

Query:  MKEKPSLPTHLWRFRRKTQVPQQQDSGDCGVFTVKFLEYDVIRSDLGSLSQDKMKFCRRQFVVQLWANRPFF
        M E  +L    W  RR     QQ +SGDCG+FT KF EYDV  S +G+L+QD+ ++ RRQ+ +Q+WANR  F
Subjt:  MKEKPSLPTHLWRFRRKTQVPQQQDSGDCGVFTVKFLEYDVIRSDLGSLSQDKMKFCRRQFVVQLWANRPFF

XP_038885861.1 sentrin-specific protease [Benincasa hispida]6.2e-4047.31Show/hide
Query:  DVFVTEFSRRGDVYEELLRSDHENDTFDWSRFKTVTNYIMGEHTDYDVHWSSVDAVYMPFNLGRNHWVLLCADFETSEFVLTDSLTTLNSDANIAKQMNT
        DV    F   G   E  LR D    T DWS+   V  Y+ G+HTDYDV WS+VDA+YMPFNL R HWVL+C DF+  E ++ DSL  L+ +A++  +M +
Subjt:  DVFVTEFSRRGDVYEELLRSDHENDTFDWSRFKTVTNYIMGEHTDYDVHWSSVDAVYMPFNLGRNHWVLLCADFETSEFVLTDSLTTLNSDANIAKQMNT

Query:  VCTIFPRLLLRCDVMKEKPSLPTHLWRFRRKTQVPQQQDSGDCGVFTVKFLEYDVIRSDLGSLSQDKMKFCRRQFVVQLWANRPFF
        +C  F  LL+   VM E  +L    W  RR   VPQQ  SGDCG+FT KF EYDV  S + +L+QD+M++ RRQ+ +Q+ ANR  F
Subjt:  VCTIFPRLLLRCDVMKEKPSLPTHLWRFRRKTQVPQQQDSGDCGVFTVKFLEYDVIRSDLGSLSQDKMKFCRRQFVVQLWANRPFF

TrEMBL top hitse value%identityAlignment
A0A5A7TPR0 Ulp1-like peptidase5.3e-2127.02Show/hide
Query:  PTVEEATIADIEMLDVIEASPEVSNKRGREK------EDEDKGKEKEKEVEKEDESRKEKTKRKRR----RSRLVTVASGYNLWIAGWREW--MLAYPEK
        P  E+  IA  E+ +   +S  + +K G  K       DED  K+ +K+        K KTK K+     + R+  V    N   +   E   M++   K
Subjt:  PTVEEATIADIEMLDVIEASPEVSNKRGREK------EDEDKGKEKEKEVEKEDESRKEKTKRKRR----RSRLVTVASGYNLWIAGWREW--MLAYPEK

Query:  YFG-PKDGPDDEGGPSKGPDDVGGPSKGPDDKGGPSKGPDDKGRPD---DNKEEGHEGGGSDRGEDGKEKDVDEAHDIEHITPVELSDEEVKVIE-----
        + G  + G + +   S+G  D    SK  D+    ++  D    P+     KE+ H        ED  EK V    DI    P+++ D+EV++ +     
Subjt:  YFG-PKDGPDDEGGPSKGPDDVGGPSKGPDDKGGPSKGPDDKGRPD---DNKEEGHEGGGSDRGEDGKEKDVDEAHDIEHITPVELSDEEVKVIE-----

Query:  ------------------PHELVKRRGKRTRQISWKL-----RSPWADTRPDGKRRKVKQYDPMRAIPEEYETKFQKWLDDPLSDGSERKTVIDSLFLFV
                          PH    RR + +  +S        RS  + T        V  YDPM  I +    + + W+ D  +D       +D+LFLF+
Subjt:  ------------------PHELVKRRGKRTRQISWKL-----RSPWADTRPDGKRRKVKQYDPMRAIPEEYETKFQKWLDDPLSDGSERKTVIDSLFLFV

Query:  RKKMDTRPDLCHRKFVTAD-VFVTEFSRRGDVYEELLRSDHENDTFDWSRFKTVTNYIMGEHTDYDVHWSSVDAVYMPFNLGRNHWVLLCADFETSEFVL
        R K+        + F TAD +F+     +  +Y+E ++   EN  FDW     + +Y++G   D+   W+SVD VY PFN+  NHWVLLC D  + +  +
Subjt:  RKKMDTRPDLCHRKFVTAD-VFVTEFSRRGDVYEELLRSDHENDTFDWSRFKTVTNYIMGEHTDYDVHWSSVDAVYMPFNLGRNHWVLLCADFETSEFVL

Query:  TDSLTTLNSDANIAKQMNTVCTIFPRLLLRCDVMKEKPSLPTHL--WRFRRKTQVPQQQDSGDCGVFTVKFLEYDVIRSDLGSLSQDKMKFCRRQFVVQL
         DSL +L +   +   +  +  + P+LL        +    T+   W       +P Q+++ DCGVFT+K+ EY V+   L +L Q+ M + R+Q   QL
Subjt:  TDSLTTLNSDANIAKQMNTVCTIFPRLLLRCDVMKEKPSLPTHL--WRFRRKTQVPQQQDSGDCGVFTVKFLEYDVIRSDLGSLSQDKMKFCRRQFVVQL

Query:  WANRPFF
        W N P +
Subjt:  WANRPFF

A0A5A7V1U0 Ulp1-like peptidase4.8e-2227.82Show/hide
Query:  DDEGGPSKGPDDVGGPSKGPDDKGGPSKGPDDKGRPD---DNKEEGHEGGGSDRGEDGKEKDVDEAHDIEHITPVE-LSDEEVKVIEPHELVKR---RGK
        DDEG        V    +  +     ++  D  G P+     KE+ H        ED  +K V    DI    P++ + D++V  IEP  L +R   R  
Subjt:  DDEGGPSKGPDDVGGPSKGPDDKGGPSKGPDDKGRPD---DNKEEGHEGGGSDRGEDGKEKDVDEAHDIEHITPVE-LSDEEVKVIEPHELVKR---RGK

Query:  RTRQISWKLRSPW-------ADTRPDGKRRKVKQYDPMRAIPEEYETKFQKWLDDPLSDGSERKTVIDSLFLFVRKKMDTRPDLCHRKFVTAD-VFVTEF
        R ++ S  L +P+         +     + +   YDP+  I + +  K + W+ D  +D       +D+LFLF+R K+        + F TAD +F+   
Subjt:  RTRQISWKLRSPW-------ADTRPDGKRRKVKQYDPMRAIPEEYETKFQKWLDDPLSDGSERKTVIDSLFLFVRKKMDTRPDLCHRKFVTAD-VFVTEF

Query:  SRRGDVYEELLRSDHENDTFDWSRFKTVTNYIMGEHTDYDVHWSSVDAVYMPFNLGRNHWVLLCADFETSEFVLTDSLTTLNSDANIAKQMNTVCTIFPR
          +  +Y+E ++ +H    FDW     + +Y++    D+   W+SVD VY PFN+  NHWVLLC D  + +  + DSL +L +   +   +  +  + P+
Subjt:  SRRGDVYEELLRSDHENDTFDWSRFKTVTNYIMGEHTDYDVHWSSVDAVYMPFNLGRNHWVLLCADFETSEFVLTDSLTTLNSDANIAKQMNTVCTIFPR

Query:  LLLRCDVMKEKPSLPTHL--WRFRRKTQVPQQQDSGDCGVFTVKFLEYDVIRSDLGSLSQDKMKFCRRQFVVQLWANRPFF
        LL        +    T+   W       +P Q+++ DCGVFT+K+ EY      L +L Q+ M + R+Q   QLW N P +
Subjt:  LLLRCDVMKEKPSLPTHL--WRFRRKTQVPQQQDSGDCGVFTVKFLEYDVIRSDLGSLSQDKMKFCRRQFVVQLWANRPFF

A0A6J1DLV0 uncharacterized protein LOC1110216461.0e-4039.52Show/hide
Query:  LFVRKKMDTRPDLCHRKFVTADVFVTEFSRRGDVYEELLRS-----DHENDTFDW-SRFKTVTNYIMGEHTDYDVHWSSVDAVYMPFNLGRNHWVLLCAD
        +FV  K+  RP+LC RKF T DV ++ F R  D    +++S           +DW  R  ++ +YI G H+D D  W  VDAVY+P+N+G  HW+++C D
Subjt:  LFVRKKMDTRPDLCHRKFVTADVFVTEFSRRGDVYEELLRS-----DHENDTFDW-SRFKTVTNYIMGEHTDYDVHWSSVDAVYMPFNLGRNHWVLLCAD

Query:  FETSEFVLTDSLTTLNSDANIAKQMNTVCTIFPRLLLRCDVMKEKPSLPTHLWRFRRKTQVPQQQDSGDCGVFTVKFLEYDVIRSDLGSLSQDKMKFCRR
        F+  E ++ DS   +     + +++  + TI P L+ R  V   KP++P   WR RR +  PQQ   GDCG+F + F EYDV      +L+Q +M F RR
Subjt:  FETSEFVLTDSLTTLNSDANIAKQMNTVCTIFPRLLLRCDVMKEKPSLPTHLWRFRRKTQVPQQQDSGDCGVFTVKFLEYDVIRSDLGSLSQDKMKFCRR

Query:  QFVVQLWANR
        QF VQLWAN+
Subjt:  QFVVQLWANR

A0A6J1DQZ3 uncharacterized protein LOC1110234426.0e-3345.89Show/hide
Query:  YIMGEHTDYDVHWSSVDAVYMPFNLGRNHWVLLCADFETSEFVLTDSLTTLNSDANIAKQMNTVCTIFPRLLLRCDVMKEKPSLPTHLWRFRRKTQVPQQ
        YI   H+DY + W  V+AVY+PFN+  NHWV++C DF   E V+ DSL  + S A++ +Q+  + T+ P LL +  V+  +P+LP   WR RR T  P+Q
Subjt:  YIMGEHTDYDVHWSSVDAVYMPFNLGRNHWVLLCADFETSEFVLTDSLTTLNSDANIAKQMNTVCTIFPRLLLRCDVMKEKPSLPTHLWRFRRKTQVPQQ

Query:  QDSGDCGVFTVKFLEYDVIRSDLGSLSQDKMKFCRRQFVVQLWANR
          SGDCG+F VK+ EYDV  + L +L Q+ M + RRQF  QLW+N+
Subjt:  QDSGDCGVFTVKFLEYDVIRSDLGSLSQDKMKFCRRQFVVQLWANR

A0A6J1DY60 uncharacterized protein LOC1110252731.4e-3433.47Show/hide
Query:  LDDPLSDGSERKT-----------------------VIDSLFLFVRKKMDTRPDLCHRKFVTADVFVTEFSRRGDVYEELLRSD--HENDTFDWSRFKTV
        +DDP +D + R T                        IDSL +   +K++    L   +F   DV ++   RR D     ++        T+DW + +T+
Subjt:  LDDPLSDGSERKT-----------------------VIDSLFLFVRKKMDTRPDLCHRKFVTADVFVTEFSRRGDVYEELLRSD--HENDTFDWSRFKTV

Query:  TNYIMGEHTDYDVHWSSVDAVYMPFNLGRNHWVLLCADFETSEFVLTDSLTTLNSDANIAKQMNTVCTIFPRLLLRCDVMKEKPSLPTHLWRFRRKTQVP
          Y++G  +DYD  WS  D VY   N+G NHWV++  D    +  + DSL  +    ++ K +  +CTI P +L    ++  +P+LP   WR RR T VP
Subjt:  TNYIMGEHTDYDVHWSSVDAVYMPFNLGRNHWVLLCADFETSEFVLTDSLTTLNSDANIAKQMNTVCTIFPRLLLRCDVMKEKPSLPTHLWRFRRKTQVP

Query:  QQQDSGDCGVFTVKFLEYDVIRSDLGSLSQDKMKFCRRQFVVQLWANRPFF
        QQ    DC +F V+F EYDVI S + +L Q  +   RRQ+ VQ+WA RPFF
Subjt:  QQQDSGDCGVFTVKFLEYDVIRSDLGSLSQDKMKFCRRQFVVQLWANRPFF

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G07240.1 cysteine-type peptidases;cysteine-type peptidases8.7e-0829.75Show/hide
Query:  WSSVDAVYMPFNLGRNHWVLLCADFETSEFVLTDSLTTLNSDANIAKQMNTVCTIFPRLLLRCDVMKEKPSL-PTHLWRFRRKTQVPQQQDSGDCGVFTV
        ++  D VYMPFN  + HWV LC D +  +  + DS   L  DA +  ++  +  + P L  +        SL P  L R     QV    DSG   VF +
Subjt:  WSSVDAVYMPFNLGRNHWVLLCADFETSEFVLTDSLTTLNSDANIAKQMNTVCTIFPRLLLRCDVMKEKPSL-PTHLWRFRRKTQVPQQQDSGDCGVFTV

Query:  ---------KFLEYDVIRSDL
                 + +E+DV+  D+
Subjt:  ---------KFLEYDVIRSDL

AT4G00690.1 UB-like protease 1B4.5e-0422.35Show/hide
Query:  IDSLFLFVRKKMDTRPDLCHRKFVTADVFVTEFSRRGDVYEELLRSDHENDTFDWSRFKTVTNYIMGEHTDYDVHWSSVDAVYMPFNLGRNHWVLLCADF
        + +L+L + K+  TR     +K+     F T F      Y +L+     N       +K V+ +       YD+     D +++P ++   HW L   + 
Subjt:  IDSLFLFVRKKMDTRPDLCHRKFVTADVFVTEFSRRGDVYEELLRSDHENDTFDWSRFKTVTNYIMGEHTDYDVHWSSVDAVYMPFNLGRNHWVLLCADF

Query:  ETSEFVLTDSLTTLNSDANIAKQMNTVCTIFPRLLLRCDVMKEKPSLPTHLWRFRRKTQVPQQQDSGDCGVFTVKFLEY
           +FV  DSL T           +T+     + L+     K + ++    W      + PQQQ+  DCG+F +K++++
Subjt:  ETSEFVLTDSLTTLNSDANIAKQMNTVCTIFPRLLLRCDVMKEKPSLPTHLWRFRRKTQVPQQQDSGDCGVFTVKFLEY

AT5G45570.1 Ulp1 protease family protein3.2e-1026.56Show/hide
Query:  VDAVYMPFNLGRNHWVLLCADFETSEFVLTDSLTTLNSDANIAKQMNTVCTIFPRLLLRCDVMKEKPSLPTHLWRFRRKTQVPQQQDSGDCGVFTVKFLE
        VD +Y    +  NHWV L  D       + DS+ +L +D  +A Q   V T+ P +L      K++    + L  ++R T++P+  D GDC ++++K++E
Subjt:  VDAVYMPFNLGRNHWVLLCADFETSEFVLTDSLTTLNSDANIAKQMNTVCTIFPRLLLRCDVMKEKPSLPTHLWRFRRKTQVPQQQDSGDCGVFTVKFLE

Query:  YDVIRSDLGSLSQDKMKFCRRQFVVQLW
           +      L  + M+  R +  V+++
Subjt:  YDVIRSDLGSLSQDKMKFCRRQFVVQLW


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAATCGAGTGATGCATCCTCCACGTGCACCAGAGCCAGTGCCTGAGCCAGTACTAGAGCCAGTACTAGAGCCAGGGCAAGAACCAGATGAAGAACGAGAGAGTCTACC
TATTGTATCCGAGGCTAGACTCCCTACTGTAGAGGAGGCTACTATAGCCGATATTGAGATGTTGGATGTCATTGAAGCTTCTCCAGAAGTTTCAAATAAGAGAGGAAGGG
AAAAAGAGGACGAAGACAAAGGAAAAGAGAAAGAGAAGGAAGTAGAAAAAGAGGATGAGAGTAGGAAGGAGAAAACAAAAAGAAAAAGAAGAAGAAGCAGACTTGTGACT
GTAGCTAGTGGCTACAACTTATGGATAGCCGGATGGAGAGAATGGATGCTCGCATACCCTGAGAAGTACTTTGGACCGAAAGATGGTCCGGATGATGAGGGTGGTCCATC
AAAAGGACCAGATGACGTGGGTGGTCCATCGAAAGGACCCGATGACAAGGGTGGTCCATCGAAAGGACCGGATGACAAGGGTAGACCGGATGACAATAAGGAAGAAGGAC
ATGAAGGAGGAGGAAGCGACAGAGGAGAGGACGGGAAGGAGAAAGACGTTGATGAGGCGCACGACATAGAGCATATTACGCCTGTTGAGTTAAGTGATGAGGAGGTTAAA
GTAATCGAGCCGCATGAATTGGTGAAAAGACGGGGAAAGCGGACCCGACAAATTTCTTGGAAGCTTCGGTCTCCATGGGCTGACACTAGGCCGGACGGCAAAAGGAGGAA
AGTTAAGCAATACGATCCCATGCGTGCCATTCCTGAGGAGTACGAGACCAAGTTTCAGAAATGGTTGGATGACCCATTGTCTGATGGATCGGAGCGCAAGACAGTGATTG
ACTCTCTCTTCCTCTTTGTTCGAAAGAAGATGGATACCCGTCCTGACTTATGCCATAGAAAGTTCGTCACGGCGGATGTATTTGTAACAGAATTTTCGAGGCGCGGGGAT
GTGTACGAAGAACTCCTTCGTAGTGACCATGAGAACGACACGTTCGATTGGAGTAGATTCAAGACGGTCACTAACTACATAATGGGAGAACACACAGATTACGATGTTCA
TTGGAGTTCCGTTGATGCTGTCTACATGCCCTTCAACTTAGGTAGAAACCATTGGGTTCTACTGTGCGCTGACTTTGAAACGAGCGAATTTGTGTTGACAGACTCCCTAA
CGACATTGAATTCAGATGCAAATATTGCGAAGCAGATGAATACGGTATGCACCATTTTCCCTAGGCTGCTACTTAGGTGTGACGTTATGAAGGAGAAACCGTCTCTTCCA
ACACATCTATGGCGATTCAGAAGGAAGACCCAAGTGCCACAACAACAAGATAGTGGGGATTGTGGGGTTTTCACTGTAAAGTTTTTAGAATATGATGTAATTAGATCAGA
TTTAGGTAGTCTTAGTCAGGATAAAATGAAGTTTTGTAGGCGTCAATTTGTTGTACAACTTTGGGCCAATAGGCCGTTCTTTTAA
mRNA sequenceShow/hide mRNA sequence
ATGAATCGAGTGATGCATCCTCCACGTGCACCAGAGCCAGTGCCTGAGCCAGTACTAGAGCCAGTACTAGAGCCAGGGCAAGAACCAGATGAAGAACGAGAGAGTCTACC
TATTGTATCCGAGGCTAGACTCCCTACTGTAGAGGAGGCTACTATAGCCGATATTGAGATGTTGGATGTCATTGAAGCTTCTCCAGAAGTTTCAAATAAGAGAGGAAGGG
AAAAAGAGGACGAAGACAAAGGAAAAGAGAAAGAGAAGGAAGTAGAAAAAGAGGATGAGAGTAGGAAGGAGAAAACAAAAAGAAAAAGAAGAAGAAGCAGACTTGTGACT
GTAGCTAGTGGCTACAACTTATGGATAGCCGGATGGAGAGAATGGATGCTCGCATACCCTGAGAAGTACTTTGGACCGAAAGATGGTCCGGATGATGAGGGTGGTCCATC
AAAAGGACCAGATGACGTGGGTGGTCCATCGAAAGGACCCGATGACAAGGGTGGTCCATCGAAAGGACCGGATGACAAGGGTAGACCGGATGACAATAAGGAAGAAGGAC
ATGAAGGAGGAGGAAGCGACAGAGGAGAGGACGGGAAGGAGAAAGACGTTGATGAGGCGCACGACATAGAGCATATTACGCCTGTTGAGTTAAGTGATGAGGAGGTTAAA
GTAATCGAGCCGCATGAATTGGTGAAAAGACGGGGAAAGCGGACCCGACAAATTTCTTGGAAGCTTCGGTCTCCATGGGCTGACACTAGGCCGGACGGCAAAAGGAGGAA
AGTTAAGCAATACGATCCCATGCGTGCCATTCCTGAGGAGTACGAGACCAAGTTTCAGAAATGGTTGGATGACCCATTGTCTGATGGATCGGAGCGCAAGACAGTGATTG
ACTCTCTCTTCCTCTTTGTTCGAAAGAAGATGGATACCCGTCCTGACTTATGCCATAGAAAGTTCGTCACGGCGGATGTATTTGTAACAGAATTTTCGAGGCGCGGGGAT
GTGTACGAAGAACTCCTTCGTAGTGACCATGAGAACGACACGTTCGATTGGAGTAGATTCAAGACGGTCACTAACTACATAATGGGAGAACACACAGATTACGATGTTCA
TTGGAGTTCCGTTGATGCTGTCTACATGCCCTTCAACTTAGGTAGAAACCATTGGGTTCTACTGTGCGCTGACTTTGAAACGAGCGAATTTGTGTTGACAGACTCCCTAA
CGACATTGAATTCAGATGCAAATATTGCGAAGCAGATGAATACGGTATGCACCATTTTCCCTAGGCTGCTACTTAGGTGTGACGTTATGAAGGAGAAACCGTCTCTTCCA
ACACATCTATGGCGATTCAGAAGGAAGACCCAAGTGCCACAACAACAAGATAGTGGGGATTGTGGGGTTTTCACTGTAAAGTTTTTAGAATATGATGTAATTAGATCAGA
TTTAGGTAGTCTTAGTCAGGATAAAATGAAGTTTTGTAGGCGTCAATTTGTTGTACAACTTTGGGCCAATAGGCCGTTCTTTTAA
Protein sequenceShow/hide protein sequence
MNRVMHPPRAPEPVPEPVLEPVLEPGQEPDEERESLPIVSEARLPTVEEATIADIEMLDVIEASPEVSNKRGREKEDEDKGKEKEKEVEKEDESRKEKTKRKRRRSRLVT
VASGYNLWIAGWREWMLAYPEKYFGPKDGPDDEGGPSKGPDDVGGPSKGPDDKGGPSKGPDDKGRPDDNKEEGHEGGGSDRGEDGKEKDVDEAHDIEHITPVELSDEEVK
VIEPHELVKRRGKRTRQISWKLRSPWADTRPDGKRRKVKQYDPMRAIPEEYETKFQKWLDDPLSDGSERKTVIDSLFLFVRKKMDTRPDLCHRKFVTADVFVTEFSRRGD
VYEELLRSDHENDTFDWSRFKTVTNYIMGEHTDYDVHWSSVDAVYMPFNLGRNHWVLLCADFETSEFVLTDSLTTLNSDANIAKQMNTVCTIFPRLLLRCDVMKEKPSLP
THLWRFRRKTQVPQQQDSGDCGVFTVKFLEYDVIRSDLGSLSQDKMKFCRRQFVVQLWANRPFF