CuGenDBv2

Tan0005175 (gene) of Snake gourd v1 genome

Gene ID: Tan0005175
Organism: Trichosanthes anguina (Snake gourd v1)
Description: ULP_PROTEASE domain-containing protein
Genome location: LG08:34292763..34297001
RNA-Seq Expression: Tan0005175
Synteny: Tan0005175
Gene Ontology terms:
  GO:0006508 - proteolysis (biological process)
  GO:0008234 - cysteine-type peptidase activity (molecular function)
InterPro domains:
  IPR003653 - Ulp1 protease family, C-terminal catalytic domain
  IPR038765 - Papain-like cysteine peptidase superfamily
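The genome location above follows a `scaffold:start..end` pattern. As a minimal sketch, the string can be split into its parts and a span length derived. The helper name `parse_location` is hypothetical, and the length calculation assumes 1-based inclusive coordinates, which the page does not state.

```python
import re

def parse_location(text):
    """Split a 'scaffold:start..end' string into (scaffold, start, end)."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognized location: {text!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))

scaffold, start, end = parse_location("LG08:34292763..34297001")
# Assumption: coordinates are 1-based and inclusive, so length = end - start + 1.
span_length = end - start + 1
print(scaffold, span_length)
```

Under that assumption the gene spans 4,239 bp on LG08.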


Homology
GenBank top hits | e value | % identity | Alignment
XP_022136076.1 uncharacterized protein LOC111007859 isoform X1 [Momordica charantia] | 1.8e-38 | 34.85
Query:  GVLLGKDNVKVFIDVIQVEKGNLHIPFSMMENMETSFCQAQSCFVAWPRSLVIISKNDK-DSKQNKPTKHSFPSHTTVPSCIKILYRYAE-KHGKDDPVV
        GV LG DNV+V +D++  E   + IP  +   +ET   Q    FVAWPR LVI+S+     S +   T+     HT V   IK+L RY       +D V 
Subjt:  GVLLGKDNVKVFIDVIQVEKGNLHIPFSMMENMETSFCQAQSCFVAWPRSLVIISKNDK-DSKQNKPTKHSFPSHTTVPSCIKILYRYAE-KHGKDDPVV

Query:  VPISNRIFGKGETLYLTPEDIMEFCAMEEISNTCILVYIA----------INMFRVVDSNDIAPCFGTLEDRARKVGHVFSQIKPRQILLISYNPGLLYF
        + +S  IFGK + +YLT  DIM++C M EI  +CIL YIA             F +VD   I+P   + E R R + +    +   Q++LI Y  G    
Subjt:  VPISNRIFGKGETLYLTPEDIMEFCAMEEISNTCILVYIA----------INMFRVVDSNDIAPCFGTLEDRARKVGHVFSQIKPRQILLISYNPGLLYF

Query:  DNNMITYILVDFDKRHCLLCVVDVFANNAYLFDSLHPNMTDDLQNIVNTAMRICLAQSTIRVQRKINTVNWIPVKCPRQHGSVECGYYILNFIREIVHNP
                       H +L ++++  N  Y+ DSL   + +D Q ++NT+++I  A+ +I+ + + NT+ W  +KCP Q GSVECGYY+  +IREIV N 
Subjt:  DNNMITYILVDFDKRHCLLCVVDVFANNAYLFDSLHPNMTDDLQNIVNTAMRICLAQSTIRVQRKINTVNWIPVKCPRQHGSVECGYYILNFIREIVHNP

Query:  DTPLKTL
         T +  +
Subjt:  DTPLKTL

XP_022136077.1 uncharacterized protein LOC111007859 isoform X2 [Momordica charantia] | 1.8e-38 | 34.85
Query:  GVLLGKDNVKVFIDVIQVEKGNLHIPFSMMENMETSFCQAQSCFVAWPRSLVIISKNDK-DSKQNKPTKHSFPSHTTVPSCIKILYRYAE-KHGKDDPVV
        GV LG DNV+V +D++  E   + IP  +   +ET   Q    FVAWPR LVI+S+     S +   T+     HT V   IK+L RY       +D V 
Subjt:  GVLLGKDNVKVFIDVIQVEKGNLHIPFSMMENMETSFCQAQSCFVAWPRSLVIISKNDK-DSKQNKPTKHSFPSHTTVPSCIKILYRYAE-KHGKDDPVV

Query:  VPISNRIFGKGETLYLTPEDIMEFCAMEEISNTCILVYIA----------INMFRVVDSNDIAPCFGTLEDRARKVGHVFSQIKPRQILLISYNPGLLYF
        + +S  IFGK + +YLT  DIM++C M EI  +CIL YIA             F +VD   I+P   + E R R + +    +   Q++LI Y  G    
Subjt:  VPISNRIFGKGETLYLTPEDIMEFCAMEEISNTCILVYIA----------INMFRVVDSNDIAPCFGTLEDRARKVGHVFSQIKPRQILLISYNPGLLYF

Query:  DNNMITYILVDFDKRHCLLCVVDVFANNAYLFDSLHPNMTDDLQNIVNTAMRICLAQSTIRVQRKINTVNWIPVKCPRQHGSVECGYYILNFIREIVHNP
                       H +L ++++  N  Y+ DSL   + +D Q ++NT+++I  A+ +I+ + + NT+ W  +KCP Q GSVECGYY+  +IREIV N 
Subjt:  DNNMITYILVDFDKRHCLLCVVDVFANNAYLFDSLHPNMTDDLQNIVNTAMRICLAQSTIRVQRKINTVNWIPVKCPRQHGSVECGYYILNFIREIVHNP

Query:  DTPLKTL
         T +  +
Subjt:  DTPLKTL

XP_022136080.1 uncharacterized protein LOC111007859 isoform X4 [Momordica charantia] | 3.1e-38 | 35.43
Query:  GVLLGKDNVKVFIDVIQVEKGNLHIPFSMMENMETSFCQAQSCFVAWPRSLVIISKNDK-DSKQNKPTKHSFPSHTTVPSCIKILYRYAE-KHGKDDPVV
        GV LG DNV+V +D++  E   + IP  +   +ET   Q    FVAWPR LVI+S+     S +   T+     HT V   IK+L RY       +D V 
Subjt:  GVLLGKDNVKVFIDVIQVEKGNLHIPFSMMENMETSFCQAQSCFVAWPRSLVIISKNDK-DSKQNKPTKHSFPSHTTVPSCIKILYRYAE-KHGKDDPVV

Query:  VPISNRIFGKGETLYLTPEDIMEFCAMEEISNTCILVYIA----------INMFRVVDSNDIAPCFGTLEDRARKVGHVFSQIKPRQILLISYNPGLLYF
        + +S  IFGK + +YLT  DIM++C M EI  +CIL YIA             F +VD   I+P   + E R R + +    +   Q++LI Y  G    
Subjt:  VPISNRIFGKGETLYLTPEDIMEFCAMEEISNTCILVYIA----------INMFRVVDSNDIAPCFGTLEDRARKVGHVFSQIKPRQILLISYNPGLLYF

Query:  DNNMITYILVDFDKRHCLLCVVDVFANNAYLFDSLHPNMTDDLQNIVNTAMRICLAQSTIRVQRKINTVNWIPVKCPRQHGSVECGYYILNFIREIVHNP
                       H +L ++++  N  Y+ DSL   + +D Q ++NT+++I  A+ +I+ + + NT+ W  +KCP Q GSVECGYY+  +IREIV N 
Subjt:  DNNMITYILVDFDKRHCLLCVVDVFANNAYLFDSLHPNMTDDLQNIVNTAMRICLAQSTIRVQRKINTVNWIPVKCPRQHGSVECGYYILNFIREIVHNP

Query:  DT
         T
Subjt:  DT

XP_038895921.1 uncharacterized protein LOC120084092 isoform X1 [Benincasa hispida] | 8.5e-36 | 32.24
Query:  LGKDNVKVFIDVIQVEKGNLHIPFSMMENMETSFCQAQSCFVAWPRSLVIISKNDK-DSKQNKPTKHSFPSHTTVPSCIKILYRYA-EKHGKDDPVVVPI
        LG DNV+  +D++  E   L IP    ++   +  QA   FVAWPR LVI +K  K  S     +      +T V   IK+L RYA      DD + + +
Subjt:  LGKDNVKVFIDVIQVEKGNLHIPFSMMENMETSFCQAQSCFVAWPRSLVIISKNDK-DSKQNKPTKHSFPSHTTVPSCIKILYRYA-EKHGKDDPVVVPI

Query:  SNRIFGKGETLYLTPEDIMEFCAMEEISNTCILVYIA----------INMFRVVDSNDIAPCFGTLEDRARKVGHVFSQIKPRQILLISYNPGLLYFDNN
        S +I GK +T+YL  +DI+++C M EI  +CIL YIA             F +VD   I+      E R++ + +    +   Q++LI YN G       
Subjt:  SNRIFGKGETLYLTPEDIMEFCAMEEISNTCILVYIA----------INMFRVVDSNDIAPCFGTLEDRARKVGHVFSQIKPRQILLISYNPGLLYFDNN

Query:  MITYILVDFDKRHCLLCVVDVFANNAYLFDSLHPNMTDDLQNIVNTAMRICLAQSTIRVQRKINTVNWIPVKCPRQHGSVECGYYILNFIREIVHNPDTP
                    H +L ++++  N  Y+ DSL   + ++ Q ++NT+++   A+ ++   R    + W P+KCPRQ G++ECGYY+  +IREIV N +T 
Subjt:  MITYILVDFDKRHCLLCVVDVFANNAYLFDSLHPNMTDDLQNIVNTAMRICLAQSTIRVQRKINTVNWIPVKCPRQHGSVECGYYILNFIREIVHNPDTP

Query:  LKTL
        +  L
Subjt:  LKTL

XP_038895930.1 uncharacterized protein LOC120084092 isoform X2 [Benincasa hispida] | 1.1e-35 | 32.24
Query:  LGKDNVKVFIDVIQVEKGNLHIPFSMMENMETSFCQAQSCFVAWPRSLVIISKNDK-DSKQNKPTKHSFPSHTTVPSCIKILYRYA-EKHGKDDPVVVPI
        LG DNV+  +D++  E   L IP    ++   +  QA   FVAWPR LVI +K  K  S     +      +T V   IK+L RYA      DD + + +
Subjt:  LGKDNVKVFIDVIQVEKGNLHIPFSMMENMETSFCQAQSCFVAWPRSLVIISKNDK-DSKQNKPTKHSFPSHTTVPSCIKILYRYA-EKHGKDDPVVVPI

Query:  SNRIFGKGETLYLTPEDIMEFCAMEEISNTCILVYIA----------INMFRVVDSNDIAPCFGTLEDRARKVGHVFSQIKPRQILLISYNPGLLYFDNN
        S +I GK +T+YL  +DI+++C M EI  +CIL YIA             F +VD   I+      E R++ + +    +   Q++LI YN G       
Subjt:  SNRIFGKGETLYLTPEDIMEFCAMEEISNTCILVYIA----------INMFRVVDSNDIAPCFGTLEDRARKVGHVFSQIKPRQILLISYNPGLLYFDNN

Query:  MITYILVDFDKRHCLLCVVDVFANNAYLFDSLHPNMTDDLQNIVNTAMRICLAQSTIRVQRKINTVNWIPVKCPRQHGSVECGYYILNFIREIVHNPDTP
                    H +L ++++  N  Y+ DSL   + ++ Q ++NT+++   A+ ++   R    + W P+KCPRQ G++ECGYY+  +IREIV N +T 
Subjt:  MITYILVDFDKRHCLLCVVDVFANNAYLFDSLHPNMTDDLQNIVNTAMRICLAQSTIRVQRKINTVNWIPVKCPRQHGSVECGYYILNFIREIVHNPDTP

Query:  LKTL
        +  L
Subjt:  LKTL

TrEMBL top hits | e value | % identity | Alignment
A0A1S3BRX5 uncharacterized protein LOC103493028 isoform X1 | 7.7e-35 | 31.27
Query:  GVLLGKDNVKVFIDVIQVEKGNLHIPFSMMENMETSFCQAQSCFVAWPRSLVIISKNDK-DSKQNKPTKHSFPSHTTVPSCIKILYRYA-EKHGKDDPVV
        G+ LG +N++V +D+  VE  ++ +P  +  ++ET   QA   FVAWPR LVI++K  K  S     +      +T V   IK+L RYA +    +D + 
Subjt:  GVLLGKDNVKVFIDVIQVEKGNLHIPFSMMENMETSFCQAQSCFVAWPRSLVIISKNDK-DSKQNKPTKHSFPSHTTVPSCIKILYRYA-EKHGKDDPVV

Query:  VPISNRIFGKGETLYLTPEDIMEFCAMEEISNTCILVYIA----------INMFRVVDSNDIAPCFGTLEDRARKVGHVFSQIKPRQILLISYNPGLLYF
        + +S  IFGK +T+YL  +DI+++C M EI  +CIL YIA             F +VD   I+    + E+R+R + +        Q++LI YN G    
Subjt:  VPISNRIFGKGETLYLTPEDIMEFCAMEEISNTCILVYIA----------INMFRVVDSNDIAPCFGTLEDRARKVGHVFSQIKPRQILLISYNPGLLYF

Query:  DNNMITYILVDFDKRHCLLCVVDVFANNAYLFDSLHPNMTDDLQNIVNTAMRICLAQSTIRVQRKINTVNWIPVKCPRQHGSVECGYYILNFIREIVHNP
                       H +L ++D+  N  Y+ D L   +  + Q ++N +++    + + +  R  + + W P+KCPR  GS+ECGYY+  ++RE+V N 
Subjt:  DNNMITYILVDFDKRHCLLCVVDVFANNAYLFDSLHPNMTDDLQNIVNTAMRICLAQSTIRVQRKINTVNWIPVKCPRQHGSVECGYYILNFIREIVHNP

Query:  DTPLKTL
        +T +  L
Subjt:  DTPLKTL

A0A5D3CYL9 ULP_PROTEASE domain-containing protein | 7.7e-35 | 31.27
Query:  GVLLGKDNVKVFIDVIQVEKGNLHIPFSMMENMETSFCQAQSCFVAWPRSLVIISKNDK-DSKQNKPTKHSFPSHTTVPSCIKILYRYA-EKHGKDDPVV
        G+ LG +N++V +D+  VE  ++ +P  +  ++ET   QA   FVAWPR LVI++K  K  S     +      +T V   IK+L RYA +    +D + 
Subjt:  GVLLGKDNVKVFIDVIQVEKGNLHIPFSMMENMETSFCQAQSCFVAWPRSLVIISKNDK-DSKQNKPTKHSFPSHTTVPSCIKILYRYA-EKHGKDDPVV

Query:  VPISNRIFGKGETLYLTPEDIMEFCAMEEISNTCILVYIA----------INMFRVVDSNDIAPCFGTLEDRARKVGHVFSQIKPRQILLISYNPGLLYF
        + +S  IFGK +T+YL  +DI+++C M EI  +CIL YIA             F +VD   I+    + E+R+R + +        Q++LI YN G    
Subjt:  VPISNRIFGKGETLYLTPEDIMEFCAMEEISNTCILVYIA----------INMFRVVDSNDIAPCFGTLEDRARKVGHVFSQIKPRQILLISYNPGLLYF

Query:  DNNMITYILVDFDKRHCLLCVVDVFANNAYLFDSLHPNMTDDLQNIVNTAMRICLAQSTIRVQRKINTVNWIPVKCPRQHGSVECGYYILNFIREIVHNP
                       H +L ++D+  N  Y+ D L   +  + Q ++N +++    + + +  R  + + W P+KCPR  GS+ECGYY+  ++RE+V N 
Subjt:  DNNMITYILVDFDKRHCLLCVVDVFANNAYLFDSLHPNMTDDLQNIVNTAMRICLAQSTIRVQRKINTVNWIPVKCPRQHGSVECGYYILNFIREIVHNP

Query:  DTPLKTL
        +T +  L
Subjt:  DTPLKTL

A0A6J1C2H7 uncharacterized protein LOC111007859 isoform X1 | 8.8e-39 | 34.85
Query:  GVLLGKDNVKVFIDVIQVEKGNLHIPFSMMENMETSFCQAQSCFVAWPRSLVIISKNDK-DSKQNKPTKHSFPSHTTVPSCIKILYRYAE-KHGKDDPVV
        GV LG DNV+V +D++  E   + IP  +   +ET   Q    FVAWPR LVI+S+     S +   T+     HT V   IK+L RY       +D V 
Subjt:  GVLLGKDNVKVFIDVIQVEKGNLHIPFSMMENMETSFCQAQSCFVAWPRSLVIISKNDK-DSKQNKPTKHSFPSHTTVPSCIKILYRYAE-KHGKDDPVV

Query:  VPISNRIFGKGETLYLTPEDIMEFCAMEEISNTCILVYIA----------INMFRVVDSNDIAPCFGTLEDRARKVGHVFSQIKPRQILLISYNPGLLYF
        + +S  IFGK + +YLT  DIM++C M EI  +CIL YIA             F +VD   I+P   + E R R + +    +   Q++LI Y  G    
Subjt:  VPISNRIFGKGETLYLTPEDIMEFCAMEEISNTCILVYIA----------INMFRVVDSNDIAPCFGTLEDRARKVGHVFSQIKPRQILLISYNPGLLYF

Query:  DNNMITYILVDFDKRHCLLCVVDVFANNAYLFDSLHPNMTDDLQNIVNTAMRICLAQSTIRVQRKINTVNWIPVKCPRQHGSVECGYYILNFIREIVHNP
                       H +L ++++  N  Y+ DSL   + +D Q ++NT+++I  A+ +I+ + + NT+ W  +KCP Q GSVECGYY+  +IREIV N 
Subjt:  DNNMITYILVDFDKRHCLLCVVDVFANNAYLFDSLHPNMTDDLQNIVNTAMRICLAQSTIRVQRKINTVNWIPVKCPRQHGSVECGYYILNFIREIVHNP

Query:  DTPLKTL
         T +  +
Subjt:  DTPLKTL

A0A6J1C2V2 uncharacterized protein LOC111007859 isoform X4 | 1.5e-38 | 35.43
Query:  GVLLGKDNVKVFIDVIQVEKGNLHIPFSMMENMETSFCQAQSCFVAWPRSLVIISKNDK-DSKQNKPTKHSFPSHTTVPSCIKILYRYAE-KHGKDDPVV
        GV LG DNV+V +D++  E   + IP  +   +ET   Q    FVAWPR LVI+S+     S +   T+     HT V   IK+L RY       +D V 
Subjt:  GVLLGKDNVKVFIDVIQVEKGNLHIPFSMMENMETSFCQAQSCFVAWPRSLVIISKNDK-DSKQNKPTKHSFPSHTTVPSCIKILYRYAE-KHGKDDPVV

Query:  VPISNRIFGKGETLYLTPEDIMEFCAMEEISNTCILVYIA----------INMFRVVDSNDIAPCFGTLEDRARKVGHVFSQIKPRQILLISYNPGLLYF
        + +S  IFGK + +YLT  DIM++C M EI  +CIL YIA             F +VD   I+P   + E R R + +    +   Q++LI Y  G    
Subjt:  VPISNRIFGKGETLYLTPEDIMEFCAMEEISNTCILVYIA----------INMFRVVDSNDIAPCFGTLEDRARKVGHVFSQIKPRQILLISYNPGLLYF

Query:  DNNMITYILVDFDKRHCLLCVVDVFANNAYLFDSLHPNMTDDLQNIVNTAMRICLAQSTIRVQRKINTVNWIPVKCPRQHGSVECGYYILNFIREIVHNP
                       H +L ++++  N  Y+ DSL   + +D Q ++NT+++I  A+ +I+ + + NT+ W  +KCP Q GSVECGYY+  +IREIV N 
Subjt:  DNNMITYILVDFDKRHCLLCVVDVFANNAYLFDSLHPNMTDDLQNIVNTAMRICLAQSTIRVQRKINTVNWIPVKCPRQHGSVECGYYILNFIREIVHNP

Query:  DT
         T
Subjt:  DT

A0A6J1C4J7 uncharacterized protein LOC111007859 isoform X2 | 8.8e-39 | 34.85
Query:  GVLLGKDNVKVFIDVIQVEKGNLHIPFSMMENMETSFCQAQSCFVAWPRSLVIISKNDK-DSKQNKPTKHSFPSHTTVPSCIKILYRYAE-KHGKDDPVV
        GV LG DNV+V +D++  E   + IP  +   +ET   Q    FVAWPR LVI+S+     S +   T+     HT V   IK+L RY       +D V 
Subjt:  GVLLGKDNVKVFIDVIQVEKGNLHIPFSMMENMETSFCQAQSCFVAWPRSLVIISKNDK-DSKQNKPTKHSFPSHTTVPSCIKILYRYAE-KHGKDDPVV

Query:  VPISNRIFGKGETLYLTPEDIMEFCAMEEISNTCILVYIA----------INMFRVVDSNDIAPCFGTLEDRARKVGHVFSQIKPRQILLISYNPGLLYF
        + +S  IFGK + +YLT  DIM++C M EI  +CIL YIA             F +VD   I+P   + E R R + +    +   Q++LI Y  G    
Subjt:  VPISNRIFGKGETLYLTPEDIMEFCAMEEISNTCILVYIA----------INMFRVVDSNDIAPCFGTLEDRARKVGHVFSQIKPRQILLISYNPGLLYF

Query:  DNNMITYILVDFDKRHCLLCVVDVFANNAYLFDSLHPNMTDDLQNIVNTAMRICLAQSTIRVQRKINTVNWIPVKCPRQHGSVECGYYILNFIREIVHNP
                       H +L ++++  N  Y+ DSL   + +D Q ++NT+++I  A+ +I+ + + NT+ W  +KCP Q GSVECGYY+  +IREIV N 
Subjt:  DNNMITYILVDFDKRHCLLCVVDVFANNAYLFDSLHPNMTDDLQNIVNTAMRICLAQSTIRVQRKINTVNWIPVKCPRQHGSVECGYYILNFIREIVHNP

Query:  DTPLKTL
         T +  +
Subjt:  DTPLKTL

SwissProt top hits: No hits found
Arabidopsis top hits: No hits found
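The alignment blocks above use a BLAST-style layout in which the middle line marks each column. As a rough sketch, assuming letters on the middle line denote identical residues and `+` denotes a conservative substitution (the usual BLAST convention, not something this page states), per-block counts can be tallied as follows. The function name `midline_stats` is hypothetical, and the sample middle line is the short final block of the first GenBank hit, assuming its spacing survived extraction.

```python
def midline_stats(midline):
    """Tally identities and positives from a BLAST-style middle line."""
    identities = sum(1 for ch in midline if ch.isalpha())
    conservative = midline.count("+")
    length = len(midline)
    return {
        "identities": identities,
        # BLAST-style "positives" count identities plus conservative matches.
        "positives": identities + conservative,
        "length": length,
        "pct_identity": 100.0 * identities / length if length else 0.0,
    }

# Middle line aligned under the query segment "DTPLKTL" (7 columns).
stats = midline_stats(" T +  +")
print(stats)
```

Per-block figures like these will not necessarily reproduce the % identity column, since that value is computed over the whole alignment and the denominator used by the database is not documented here.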

Sequences
CDS sequence
ATGGGGGTGCTTTTAGGGAAAGACAATGTGAAGGTGTTTATTGATGTCATACAAGTTGAAAAAGGTAATCTTCATATTCCTTTTTCGATGATGGAAAATATGGAGACAAG
TTTTTGTCAAGCTCAGAGTTGTTTCGTTGCTTGGCCTCGCAGTCTTGTTATTATTTCTAAAAATGACAAGGATTCTAAACAAAACAAACCTACTAAACATAGTTTTCCTA
GTCATACGACTGTCCCTTCATGCATCAAGATTCTTTATCGATATGCTGAAAAACATGGTAAAGATGATCCAGTGGTTGTGCCCATAAGTAATAGAATATTTGGAAAAGGA
GAAACCCTTTATCTTACGCCAGAAGATATCATGGAATTTTGTGCAATGGAAGAAATATCAAATACCTGCATATTAGTCTACATCGCAATAAACATGTTTAGAGTAGTAGA
TTCGAATGACATTGCACCATGCTTTGGAACTCTGGAAGATCGAGCAAGAAAAGTGGGACATGTTTTTTCCCAAATAAAACCACGACAGATTCTACTAATTTCATACAATC
CTGGATTACTTTATTTTGATAATAATATGATTACATATATATTGGTTGATTTTGATAAACGTCATTGCCTATTGTGTGTGGTGGATGTATTTGCGAATAATGCTTATCTT
TTTGATTCCCTACATCCTAACATGACAGATGACCTCCAAAATATTGTGAATACGGCAATGAGGATTTGTTTGGCCCAAAGTACTATTCGAGTACAAAGAAAAATCAATAC
TGTTAATTGGATACCGGTAAAGTGTCCACGCCAACACGGTAGCGTTGAATGTGGATATTATATTTTGAATTTCATTCGAGAGATTGTTCATAATCCAGATACACCTCTTA
AAACTCTCGTATATGAAGATACAATGTCGGTTTCAGACCGTCGTCGATACCCCTATAGATGTCAATTCGTAGTTTTAAACCATAAGCTGTGCAAGCCAGCTCTCTCACTT
ATGAGCTTGCTTACTGGATCAAGTTGGCAACATACATCGATTCGAGGATCCGCGAAACAACCACTTCCTCGATAA
mRNA sequence
ATGGGGGTGCTTTTAGGGAAAGACAATGTGAAGGTGTTTATTGATGTCATACAAGTTGAAAAAGGTAATCTTCATATTCCTTTTTCGATGATGGAAAATATGGAGACAAG
TTTTTGTCAAGCTCAGAGTTGTTTCGTTGCTTGGCCTCGCAGTCTTGTTATTATTTCTAAAAATGACAAGGATTCTAAACAAAACAAACCTACTAAACATAGTTTTCCTA
GTCATACGACTGTCCCTTCATGCATCAAGATTCTTTATCGATATGCTGAAAAACATGGTAAAGATGATCCAGTGGTTGTGCCCATAAGTAATAGAATATTTGGAAAAGGA
GAAACCCTTTATCTTACGCCAGAAGATATCATGGAATTTTGTGCAATGGAAGAAATATCAAATACCTGCATATTAGTCTACATCGCAATAAACATGTTTAGAGTAGTAGA
TTCGAATGACATTGCACCATGCTTTGGAACTCTGGAAGATCGAGCAAGAAAAGTGGGACATGTTTTTTCCCAAATAAAACCACGACAGATTCTACTAATTTCATACAATC
CTGGATTACTTTATTTTGATAATAATATGATTACATATATATTGGTTGATTTTGATAAACGTCATTGCCTATTGTGTGTGGTGGATGTATTTGCGAATAATGCTTATCTT
TTTGATTCCCTACATCCTAACATGACAGATGACCTCCAAAATATTGTGAATACGGCAATGAGGATTTGTTTGGCCCAAAGTACTATTCGAGTACAAAGAAAAATCAATAC
TGTTAATTGGATACCGGTAAAGTGTCCACGCCAACACGGTAGCGTTGAATGTGGATATTATATTTTGAATTTCATTCGAGAGATTGTTCATAATCCAGATACACCTCTTA
AAACTCTCGTATATGAAGATACAATGTCGGTTTCAGACCGTCGTCGATACCCCTATAGATGTCAATTCGTAGTTTTAAACCATAAGCTGTGCAAGCCAGCTCTCTCACTT
ATGAGCTTGCTTACTGGATCAAGTTGGCAACATACATCGATTCGAGGATCCGCGAAACAACCACTTCCTCGATAA
Protein sequence
MGVLLGKDNVKVFIDVIQVEKGNLHIPFSMMENMETSFCQAQSCFVAWPRSLVIISKNDKDSKQNKPTKHSFPSHTTVPSCIKILYRYAEKHGKDDPVVVPISNRIFGKG
ETLYLTPEDIMEFCAMEEISNTCILVYIAINMFRVVDSNDIAPCFGTLEDRARKVGHVFSQIKPRQILLISYNPGLLYFDNNMITYILVDFDKRHCLLCVVDVFANNAYL
FDSLHPNMTDDLQNIVNTAMRICLAQSTIRVQRKINTVNWIPVKCPRQHGSVECGYYILNFIREIVHNPDTPLKTLVYEDTMSVSDRRRYPYRCQFVVLNHKLCKPALSL
MSLLTGSSWQHTSIRGSAKQPLPR
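The protein sequence can be checked against the CDS by translating it. The sketch below assumes the standard genetic code (NCBI translation table 1), which the page does not specify, and the names `CODON_TABLE` and `translate` are hypothetical. It translates the final line of the CDS as printed above, which reads in frame.

```python
from itertools import product

# Standard genetic code, codons ordered with bases T, C, A, G (assumed table 1).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a coding sequence, stopping at the first stop codon."""
    protein = []
    usable = len(cds) - len(cds) % 3  # ignore a trailing partial codon
    for i in range(0, usable, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# Final line of the CDS sequence, copied from the record above.
cds_tail = ("ATGAGCTTGCTTACTGGATCAAGTTGGCAACATACATCGATTCGAGGATCC"
            "GCGAAACAACCACTTCCTCGATAA")
print(translate(cds_tail))
```

The output matches the last line of the protein sequence, MSLLTGSSWQHTSIRGSAKQPLPR, with the terminal TAA read as the stop codon.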