; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc07g11030 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc07g11030
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionULP_PROTEASE domain-containing protein
Genome locationchr7:8471780..8486046
RNA-Seq ExpressionMoc07g11030
SyntenyMoc07g11030
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0031708.1 hypothetical protein E6C27_scaffold139G004940 [Cucumis melo var. makuwa]1.5e-3936Show/hide
Query:  DKSWMKKNRLSEDYELGVECGTLSKHERDENVLKKENIDEI-LDKE--------------ETL-EFKEGTHCRLALGSIDNVVVAGTIFESGRNDGNVKV
        D+S +K    S+  +   +       E +  VL+   I+++ ++K+              ETL + K+GT CRLA+G+ DNVV AGTIF+   +  NVKV
Subjt:  DKSWMKKNRLSEDYELGVECGTLSKHERDENVLKKENIDEI-LDKE--------------ETL-EFKEGTHCRLALGSIDNVVVAGTIFESGRNDGNVKV

Query:  CIDVMVHDDSRLPIPTHGGNDILSQEIGSHILWPQSLVISDNEKKIKHSMELFRLASGKSQTQPAPVSLTFLTREINYIGRAIQMIISKDVFGHEHKLFI
         +D++   +  +P+PT  G  +LSQE+GS +LWP+ LVI  +EK         RL S    T+ APV+L  L  E++YIG  IQ+ +   VFG+E K  I
Subjt:  CIDVMVHDDSRLPIPTHGGNDILSQEIGSHILWPQSLVISDNEKKIKHSMELFRLASGKSQTQPAPVSLTFLTREINYIGRAIQMIISKDVFGHEHKLFI

Query:  MVEDVQKLFHMEPTTTLCIDAYTTFLHRSLGNENESSPYKFLDAGVTSITNLSKENRMQVLTKRLSELELNQLLMSPYHFGS--LDIDSDVSCKEYDIFL
         +E +Q+   M+P +T CIDA+   L++ +        YKF DAG  S+  +SKE+R Q+L  RL   +  Q+LM PY+ G+    I  D S +    ++
Subjt:  MVEDVQKLFHMEPTTTLCIDAYTTFLHRSLGNENESSPYKFLDAGVTSITNLSKENRMQVLTKRLSELELNQLLMSPYHFGS--LDIDSDVSCKEYDIFL

Query:  DSMRNRLTDDILSVITMAVRNIQKK
        D +RNR+ +D   V+ MA     KK
Subjt:  DSMRNRLTDDILSVITMAVRNIQKK

KAA0035941.1 uncharacterized protein E6C27_scaffold56G001300 [Cucumis melo var. makuwa]2.7e-3834.88Show/hide
Query:  DKSWMKKNRLSEDYELGVECGTLSKHERDENVLKKENIDEI-LDKE--------------ETL-EFKEGTHCRLALGSIDNVVVAGTIFESGRNDGNVKV
        D+S +K    S+  +   +       E +  VL+   I+++ ++K+              ETL + K+GT CRLA+G+ DNVV AGTI +   +  NVKV
Subjt:  DKSWMKKNRLSEDYELGVECGTLSKHERDENVLKKENIDEI-LDKE--------------ETL-EFKEGTHCRLALGSIDNVVVAGTIFESGRNDGNVKV

Query:  CIDVMVHDDSRLPIPTHGGNDILSQEIGSHILWPQSLVISDNEKKIKHSMELFRLASGKSQTQPAPVSLTFLTREINYIGRAIQMIISKDVFGHEHKLFI
         +D++   +  +PIPT  G  +LSQE+GS +LWP+ LVI  +EK         RL S    T+ APV+L  L  E++YIG  IQ+ +   VFG+E K  I
Subjt:  CIDVMVHDDSRLPIPTHGGNDILSQEIGSHILWPQSLVISDNEKKIKHSMELFRLASGKSQTQPAPVSLTFLTREINYIGRAIQMIISKDVFGHEHKLFI

Query:  MVEDVQKLFHMEPTTTLCIDAYTTFLHRSLGNENESSPYKFLDAGVTSITNLSKENRMQVLTKRLSELELNQLLMSPYHFGSLDIDSDVSCKEYDIF-LD
         +E +Q+   M+P +T CIDA+   L++ +        YKF DAG  S+  +SKE+R Q+L  RL   +  Q+L+ PY+ G+      ++      + +D
Subjt:  MVEDVQKLFHMEPTTTLCIDAYTTFLHRSLGNENESSPYKFLDAGVTSITNLSKENRMQVLTKRLSELELNQLLMSPYHFGSLDIDSDVSCKEYDIF-LD

Query:  SMRNRLTDDILSVITMAVRNIQKK
         +RNR+ +D   V+ MA     KK
Subjt:  SMRNRLTDDILSVITMAVRNIQKK

KAA0046954.1 uncharacterized protein E6C27_scaffold230G001320 [Cucumis melo var. makuwa]1.9e-3938.68Show/hide
Query:  RDENVLKKENIDE----ILDKEETL-EFKEGTHCRLALGSIDNVVVAGTIFESGRNDGNVKVCIDVMVHDDSRLPIPTHGGNDILSQEIGSHILWPQSLV
        +D  + K++ + E    +    ETL + K+GT CRLA+G+ DNVV A TIF+   N  NVKV +D++   +  +P+PT  G  +LSQE+GS +LWP+ LV
Subjt:  RDENVLKKENIDE----ILDKEETL-EFKEGTHCRLALGSIDNVVVAGTIFESGRNDGNVKVCIDVMVHDDSRLPIPTHGGNDILSQEIGSHILWPQSLV

Query:  ISDNEKKIKHSMELFRLASGKSQTQPAPVSLTFLTREINYIGRAIQMIISKDVFGHEHKLFIMVEDVQKLFHMEPTTTLCIDAYTTFLHRSLGNENESSP
        I  +EK         RL S    T+ APV+L  L  E++YIG  IQ+ +   VFG+E K  I +E +Q+   M+P +T CIDA+   L++ +        
Subjt:  ISDNEKKIKHSMELFRLASGKSQTQPAPVSLTFLTREINYIGRAIQMIISKDVFGHEHKLFIMVEDVQKLFHMEPTTTLCIDAYTTFLHRSLGNENESSP

Query:  YKFLDAGVTSITNLSKENRMQVLTKRLSELELNQLLMSPYHFGS--LDIDSDVSCKEYDIFLDSMRNRLTDDILSVITMAVRNIQKK
        YKF DAG  S+  +SKE+R Q+L  RL   +  Q+LM PY+ G+    I  D S +    ++D +RNR+ +D   V+ MA     KK
Subjt:  YKFLDAGVTSITNLSKENRMQVLTKRLSELELNQLLMSPYHFGS--LDIDSDVSCKEYDIFLDSMRNRLTDDILSVITMAVRNIQKK

XP_022156814.1 uncharacterized protein LOC111023655 [Momordica charantia]7.2e-6388.89Show/hide
Query:  HGGNDILSQEIGSHILWPQSLVISDNEKKIKHSMELFRLASGKSQTQPAPVSLTFLTREINYIGRAIQMIISKDVFGHEHKLFIMVEDVQKLFHMEPTTT
        HGGNDILSQEIGSHIL PQSLVI DNEK+IKHS ELF LASGKS TQPAPVS T LTREINYIGRAIQMIISKDVFGHE+KLFIMVEDVQKLFHMEPTTT
Subjt:  HGGNDILSQEIGSHILWPQSLVISDNEKKIKHSMELFRLASGKSQTQPAPVSLTFLTREINYIGRAIQMIISKDVFGHEHKLFIMVEDVQKLFHMEPTTT

Query:  LCIDAYTTFLHRSLGNENESSPYKFLDAGVTSITNLSKENRMQV
         CIDAY TFLHRSLGNENE SPYKFLD G  SITNLSKENR+QV
Subjt:  LCIDAYTTFLHRSLGNENESSPYKFLDAGVTSITNLSKENRMQV

XP_022156878.1 uncharacterized protein LOC111023711 [Momordica charantia]1.5e-11377.7Show/hide
Query:  EILDKEETLEFKEGTHCRLALGSIDNVVVAGTIFESGRNDGNVKVCIDVMVHDDSRLPIPTHGGNDILSQEIGSHILWPQSLVISDNEKKIKHSMELFRL
        +ILDKEETLEFKEGTHCRLALGSIDNVV A TIFESGR DGNVKV IDV+V DDSRLPIPTHGGN+ILSQEIGSHILWPQ+LVISDNEK+IKHS ELF  
Subjt:  EILDKEETLEFKEGTHCRLALGSIDNVVVAGTIFESGRNDGNVKVCIDVMVHDDSRLPIPTHGGNDILSQEIGSHILWPQSLVISDNEKKIKHSMELFRL

Query:  ASGKSQTQPAPVSLTFLTREINYIGRAIQMIISKDVFGHEHKLFIMVEDVQKLFHMEPTTTLCIDAYTTFLHRSLGNENESSPYKFLDAGVTSITNLSKE
        ASGKS TQPAPVSLT LTREINYIGRAIQMIISKDVFGHEHKLFIMVEDVQKLFHMEPTTT CIDAYTTFLHRSLGNE ESSPYKFLDAG TSITNLSKE
Subjt:  ASGKSQTQPAPVSLTFLTREINYIGRAIQMIISKDVFGHEHKLFIMVEDVQKLFHMEPTTTLCIDAYTTFLHRSLGNENESSPYKFLDAGVTSITNLSKE

Query:  NRMQVLTKRLSELELNQLLMSPYHFGSLDIDSDVSCKEYDIFLDSMRNRLTDDILSVITMAVRNIQKKPFTLKRVPVRIVIVSSLAL
        NR+QVLTKRLSELELNQLLM PYH G                                  AVRNIQKKPF LKRVPV+ +  S + L
Subjt:  NRMQVLTKRLSELELNQLLMSPYHFGSLDIDSDVSCKEYDIFLDSMRNRLTDDILSVITMAVRNIQKKPFTLKRVPVRIVIVSSLAL

TrEMBL top hitse value%identityAlignment
A0A5A7SM56 ULP_PROTEASE domain-containing protein7.0e-4036Show/hide
Query:  DKSWMKKNRLSEDYELGVECGTLSKHERDENVLKKENIDEI-LDKE--------------ETL-EFKEGTHCRLALGSIDNVVVAGTIFESGRNDGNVKV
        D+S +K    S+  +   +       E +  VL+   I+++ ++K+              ETL + K+GT CRLA+G+ DNVV AGTIF+   +  NVKV
Subjt:  DKSWMKKNRLSEDYELGVECGTLSKHERDENVLKKENIDEI-LDKE--------------ETL-EFKEGTHCRLALGSIDNVVVAGTIFESGRNDGNVKV

Query:  CIDVMVHDDSRLPIPTHGGNDILSQEIGSHILWPQSLVISDNEKKIKHSMELFRLASGKSQTQPAPVSLTFLTREINYIGRAIQMIISKDVFGHEHKLFI
         +D++   +  +P+PT  G  +LSQE+GS +LWP+ LVI  +EK         RL S    T+ APV+L  L  E++YIG  IQ+ +   VFG+E K  I
Subjt:  CIDVMVHDDSRLPIPTHGGNDILSQEIGSHILWPQSLVISDNEKKIKHSMELFRLASGKSQTQPAPVSLTFLTREINYIGRAIQMIISKDVFGHEHKLFI

Query:  MVEDVQKLFHMEPTTTLCIDAYTTFLHRSLGNENESSPYKFLDAGVTSITNLSKENRMQVLTKRLSELELNQLLMSPYHFGS--LDIDSDVSCKEYDIFL
         +E +Q+   M+P +T CIDA+   L++ +        YKF DAG  S+  +SKE+R Q+L  RL   +  Q+LM PY+ G+    I  D S +    ++
Subjt:  MVEDVQKLFHMEPTTTLCIDAYTTFLHRSLGNENESSPYKFLDAGVTSITNLSKENRMQVLTKRLSELELNQLLMSPYHFGS--LDIDSDVSCKEYDIFL

Query:  DSMRNRLTDDILSVITMAVRNIQKK
        D +RNR+ +D   V+ MA     KK
Subjt:  DSMRNRLTDDILSVITMAVRNIQKK

A0A5A7T2U8 ULP_PROTEASE domain-containing protein1.3e-3834.88Show/hide
Query:  DKSWMKKNRLSEDYELGVECGTLSKHERDENVLKKENIDEI-LDKE--------------ETL-EFKEGTHCRLALGSIDNVVVAGTIFESGRNDGNVKV
        D+S +K    S+  +   +       E +  VL+   I+++ ++K+              ETL + K+GT CRLA+G+ DNVV AGTI +   +  NVKV
Subjt:  DKSWMKKNRLSEDYELGVECGTLSKHERDENVLKKENIDEI-LDKE--------------ETL-EFKEGTHCRLALGSIDNVVVAGTIFESGRNDGNVKV

Query:  CIDVMVHDDSRLPIPTHGGNDILSQEIGSHILWPQSLVISDNEKKIKHSMELFRLASGKSQTQPAPVSLTFLTREINYIGRAIQMIISKDVFGHEHKLFI
         +D++   +  +PIPT  G  +LSQE+GS +LWP+ LVI  +EK         RL S    T+ APV+L  L  E++YIG  IQ+ +   VFG+E K  I
Subjt:  CIDVMVHDDSRLPIPTHGGNDILSQEIGSHILWPQSLVISDNEKKIKHSMELFRLASGKSQTQPAPVSLTFLTREINYIGRAIQMIISKDVFGHEHKLFI

Query:  MVEDVQKLFHMEPTTTLCIDAYTTFLHRSLGNENESSPYKFLDAGVTSITNLSKENRMQVLTKRLSELELNQLLMSPYHFGSLDIDSDVSCKEYDIF-LD
         +E +Q+   M+P +T CIDA+   L++ +        YKF DAG  S+  +SKE+R Q+L  RL   +  Q+L+ PY+ G+      ++      + +D
Subjt:  MVEDVQKLFHMEPTTTLCIDAYTTFLHRSLGNENESSPYKFLDAGVTSITNLSKENRMQVLTKRLSELELNQLLMSPYHFGSLDIDSDVSCKEYDIF-LD

Query:  SMRNRLTDDILSVITMAVRNIQKK
         +RNR+ +D   V+ MA     KK
Subjt:  SMRNRLTDDILSVITMAVRNIQKK

A0A5A7TVG6 ULP_PROTEASE domain-containing protein9.2e-4038.68Show/hide
Query:  RDENVLKKENIDE----ILDKEETL-EFKEGTHCRLALGSIDNVVVAGTIFESGRNDGNVKVCIDVMVHDDSRLPIPTHGGNDILSQEIGSHILWPQSLV
        +D  + K++ + E    +    ETL + K+GT CRLA+G+ DNVV A TIF+   N  NVKV +D++   +  +P+PT  G  +LSQE+GS +LWP+ LV
Subjt:  RDENVLKKENIDE----ILDKEETL-EFKEGTHCRLALGSIDNVVVAGTIFESGRNDGNVKVCIDVMVHDDSRLPIPTHGGNDILSQEIGSHILWPQSLV

Query:  ISDNEKKIKHSMELFRLASGKSQTQPAPVSLTFLTREINYIGRAIQMIISKDVFGHEHKLFIMVEDVQKLFHMEPTTTLCIDAYTTFLHRSLGNENESSP
        I  +EK         RL S    T+ APV+L  L  E++YIG  IQ+ +   VFG+E K  I +E +Q+   M+P +T CIDA+   L++ +        
Subjt:  ISDNEKKIKHSMELFRLASGKSQTQPAPVSLTFLTREINYIGRAIQMIISKDVFGHEHKLFIMVEDVQKLFHMEPTTTLCIDAYTTFLHRSLGNENESSP

Query:  YKFLDAGVTSITNLSKENRMQVLTKRLSELELNQLLMSPYHFGS--LDIDSDVSCKEYDIFLDSMRNRLTDDILSVITMAVRNIQKK
        YKF DAG  S+  +SKE+R Q+L  RL   +  Q+LM PY+ G+    I  D S +    ++D +RNR+ +D   V+ MA     KK
Subjt:  YKFLDAGVTSITNLSKENRMQVLTKRLSELELNQLLMSPYHFGS--LDIDSDVSCKEYDIFLDSMRNRLTDDILSVITMAVRNIQKK

A0A6J1DRE3 uncharacterized protein LOC1110236553.5e-6388.89Show/hide
Query:  HGGNDILSQEIGSHILWPQSLVISDNEKKIKHSMELFRLASGKSQTQPAPVSLTFLTREINYIGRAIQMIISKDVFGHEHKLFIMVEDVQKLFHMEPTTT
        HGGNDILSQEIGSHIL PQSLVI DNEK+IKHS ELF LASGKS TQPAPVS T LTREINYIGRAIQMIISKDVFGHE+KLFIMVEDVQKLFHMEPTTT
Subjt:  HGGNDILSQEIGSHILWPQSLVISDNEKKIKHSMELFRLASGKSQTQPAPVSLTFLTREINYIGRAIQMIISKDVFGHEHKLFIMVEDVQKLFHMEPTTT

Query:  LCIDAYTTFLHRSLGNENESSPYKFLDAGVTSITNLSKENRMQV
         CIDAY TFLHRSLGNENE SPYKFLD G  SITNLSKENR+QV
Subjt:  LCIDAYTTFLHRSLGNENESSPYKFLDAGVTSITNLSKENRMQV

A0A6J1DRT3 uncharacterized protein LOC1110237117.4e-11477.7Show/hide
Query:  EILDKEETLEFKEGTHCRLALGSIDNVVVAGTIFESGRNDGNVKVCIDVMVHDDSRLPIPTHGGNDILSQEIGSHILWPQSLVISDNEKKIKHSMELFRL
        +ILDKEETLEFKEGTHCRLALGSIDNVV A TIFESGR DGNVKV IDV+V DDSRLPIPTHGGN+ILSQEIGSHILWPQ+LVISDNEK+IKHS ELF  
Subjt:  EILDKEETLEFKEGTHCRLALGSIDNVVVAGTIFESGRNDGNVKVCIDVMVHDDSRLPIPTHGGNDILSQEIGSHILWPQSLVISDNEKKIKHSMELFRL

Query:  ASGKSQTQPAPVSLTFLTREINYIGRAIQMIISKDVFGHEHKLFIMVEDVQKLFHMEPTTTLCIDAYTTFLHRSLGNENESSPYKFLDAGVTSITNLSKE
        ASGKS TQPAPVSLT LTREINYIGRAIQMIISKDVFGHEHKLFIMVEDVQKLFHMEPTTT CIDAYTTFLHRSLGNE ESSPYKFLDAG TSITNLSKE
Subjt:  ASGKSQTQPAPVSLTFLTREINYIGRAIQMIISKDVFGHEHKLFIMVEDVQKLFHMEPTTTLCIDAYTTFLHRSLGNENESSPYKFLDAGVTSITNLSKE

Query:  NRMQVLTKRLSELELNQLLMSPYHFGSLDIDSDVSCKEYDIFLDSMRNRLTDDILSVITMAVRNIQKKPFTLKRVPVRIVIVSSLAL
        NR+QVLTKRLSELELNQLLM PYH G                                  AVRNIQKKPF LKRVPV+ +  S + L
Subjt:  NRMQVLTKRLSELELNQLLMSPYHFGSLDIDSDVSCKEYDIFLDSMRNRLTDDILSVITMAVRNIQKKPFTLKRVPVRIVIVSSLAL

SwissProt top hitse value%identityAlignment
Q9LNG5 Serine/threonine-protein phosphatase 7 long form homolog4.7e-0951.61Show/hide
Query:  LIGGYLFADKSNTFMHLMFLPLLSDVETVGQYSWDIACLAWWYRELCRASRSDALKIVGPLV
        L+ G+L+ DKS   + L FLPLL D + V + SW  A LA  YRELCRAS+     I GPLV
Subjt:  LIGGYLFADKSNTFMHLMFLPLLSDVETVGQYSWDIACLAWWYRELCRASRSDALKIVGPLV

Arabidopsis top hitse value%identityAlignment
AT1G48120.1 hydrolases;protein serine/threonine phosphatases3.4e-1051.61Show/hide
Query:  LIGGYLFADKSNTFMHLMFLPLLSDVETVGQYSWDIACLAWWYRELCRASRSDALKIVGPLV
        L+ G+L+ DKS   + L FLPLL D + V + SW  A LA  YRELCRAS+     I GPLV
Subjt:  LIGGYLFADKSNTFMHLMFLPLLSDVETVGQYSWDIACLAWWYRELCRASRSDALKIVGPLV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTCGCCCAAAGACCTCGCACATTCACAATGAAATGGATTGGCGGGAGGCGAGAACAAAGCTTCATTCTGGACTTTGCAAGTGTCAAGGACACAATAATCGCACTTGT
CCACCTCGTAATTCAAGCGTCGCTTATTCTAGACACATGGATGGATAAGCTTATAGGAGGGTATCTGTTTGCTGATAAGTCTAACACATTTATGCATCTCATGTTTCTTC
CTCTCCTAAGCGATGTTGAGACAGTTGGACAGTATTCATGGGATATTGCATGTCTTGCATGGTGGTATCGAGAATTGTGTCGAGCTAGTCGATCTGATGCTTTAAAAATA
GTAGGACCACTTGTACCTTTTGCAAGACTCGAGCAAGTTTACAGTCAAAGATGCGAAGGAAGAATAGAAGAAACAACCAGTGGAGTTGACTGCGAGGACAGCAGCAAGAA
TTGGACAATGGACAAATCATGGATGAAAAAAAATAGATTGTCTGAGGATTATGAGTTAGGAGTCGAGTGTGGTACTTTATCGAAGCATGAGCGAGATGAAAACGTTTTAA
AAAAGGAGAATATAGATGAGATATTGGACAAGGAAGAAACTTTGGAGTTCAAGGAGGGAACTCATTGTCGTCTGGCACTTGGGTCCATCGATAATGTTGTCGTTGCAGGC
ACTATATTTGAATCTGGGAGGAATGATGGAAACGTGAAAGTGTGCATAGACGTGATGGTTCATGACGACTCTCGACTTCCAATTCCGACGCATGGAGGAAATGATATTCT
CTCGCAAGAAATAGGTTCACATATATTATGGCCACAAAGTCTAGTCATATCTGATAATGAAAAGAAAATAAAACATTCGATGGAGCTGTTCAGATTGGCAAGTGGAAAAT
CTCAAACACAACCTGCGCCCGTTAGTCTAACATTTTTAACTCGTGAGATAAACTACATTGGAAGGGCAATTCAAATGATTATATCAAAGGATGTGTTCGGTCATGAACAT
AAGTTGTTTATCATGGTGGAAGATGTACAGAAGTTGTTTCATATGGAACCGACAACTACTCTGTGCATTGATGCCTACACGACGTTCTTACATAGATCGTTGGGCAATGA
AAATGAATCAAGCCCGTACAAGTTTCTAGATGCTGGGGTCACTTCCATAACTAATCTGTCTAAAGAAAACCGCATGCAAGTATTGACTAAAAGACTCTCAGAATTGGAGT
TGAACCAACTGCTGATGTCTCCATATCATTTCGGATCATTGGACATTGATAGTGATGTCTCCTGCAAAGAATATGACATTTTTCTTGACTCGATGAGAAATCGCCTGACA
GATGATATTCTTAGTGTCATCACCATGGCTGTAAGAAATATACAGAAAAAACCATTTACTCTAAAGCGTGTACCGGTGCGTATTGTGATTGTTAGCTCTCTGGCACTGCA
TAAAGGCTATGAAAGTGTGTTGATTCCGAAGCATGTGGGAGCAAGGGTTGAAGGAAGCAGGGCCGAGCGTTTCTGTAATATGTTAGCTACATTTTGTAGTTTGACCTTTC
ACGTTTACACGGTCTTCCAAGAAGAGTCTGGAGAACCACGACGGGTAGCTACTTGTAATATCTAA
mRNA sequenceShow/hide mRNA sequence
ATGGTCGCCCAAAGACCTCGCACATTCACAATGAAATGGATTGGCGGGAGGCGAGAACAAAGCTTCATTCTGGACTTTGCAAGTGTCAAGGACACAATAATCGCACTTGT
CCACCTCGTAATTCAAGCGTCGCTTATTCTAGACACATGGATGGATAAGCTTATAGGAGGGTATCTGTTTGCTGATAAGTCTAACACATTTATGCATCTCATGTTTCTTC
CTCTCCTAAGCGATGTTGAGACAGTTGGACAGTATTCATGGGATATTGCATGTCTTGCATGGTGGTATCGAGAATTGTGTCGAGCTAGTCGATCTGATGCTTTAAAAATA
GTAGGACCACTTGTACCTTTTGCAAGACTCGAGCAAGTTTACAGTCAAAGATGCGAAGGAAGAATAGAAGAAACAACCAGTGGAGTTGACTGCGAGGACAGCAGCAAGAA
TTGGACAATGGACAAATCATGGATGAAAAAAAATAGATTGTCTGAGGATTATGAGTTAGGAGTCGAGTGTGGTACTTTATCGAAGCATGAGCGAGATGAAAACGTTTTAA
AAAAGGAGAATATAGATGAGATATTGGACAAGGAAGAAACTTTGGAGTTCAAGGAGGGAACTCATTGTCGTCTGGCACTTGGGTCCATCGATAATGTTGTCGTTGCAGGC
ACTATATTTGAATCTGGGAGGAATGATGGAAACGTGAAAGTGTGCATAGACGTGATGGTTCATGACGACTCTCGACTTCCAATTCCGACGCATGGAGGAAATGATATTCT
CTCGCAAGAAATAGGTTCACATATATTATGGCCACAAAGTCTAGTCATATCTGATAATGAAAAGAAAATAAAACATTCGATGGAGCTGTTCAGATTGGCAAGTGGAAAAT
CTCAAACACAACCTGCGCCCGTTAGTCTAACATTTTTAACTCGTGAGATAAACTACATTGGAAGGGCAATTCAAATGATTATATCAAAGGATGTGTTCGGTCATGAACAT
AAGTTGTTTATCATGGTGGAAGATGTACAGAAGTTGTTTCATATGGAACCGACAACTACTCTGTGCATTGATGCCTACACGACGTTCTTACATAGATCGTTGGGCAATGA
AAATGAATCAAGCCCGTACAAGTTTCTAGATGCTGGGGTCACTTCCATAACTAATCTGTCTAAAGAAAACCGCATGCAAGTATTGACTAAAAGACTCTCAGAATTGGAGT
TGAACCAACTGCTGATGTCTCCATATCATTTCGGATCATTGGACATTGATAGTGATGTCTCCTGCAAAGAATATGACATTTTTCTTGACTCGATGAGAAATCGCCTGACA
GATGATATTCTTAGTGTCATCACCATGGCTGTAAGAAATATACAGAAAAAACCATTTACTCTAAAGCGTGTACCGGTGCGTATTGTGATTGTTAGCTCTCTGGCACTGCA
TAAAGGCTATGAAAGTGTGTTGATTCCGAAGCATGTGGGAGCAAGGGTTGAAGGAAGCAGGGCCGAGCGTTTCTGTAATATGTTAGCTACATTTTGTAGTTTGACCTTTC
ACGTTTACACGGTCTTCCAAGAAGAGTCTGGAGAACCACGACGGGTAGCTACTTGTAATATCTAA
Protein sequenceShow/hide protein sequence
MVAQRPRTFTMKWIGGRREQSFILDFASVKDTIIALVHLVIQASLILDTWMDKLIGGYLFADKSNTFMHLMFLPLLSDVETVGQYSWDIACLAWWYRELCRASRSDALKI
VGPLVPFARLEQVYSQRCEGRIEETTSGVDCEDSSKNWTMDKSWMKKNRLSEDYELGVECGTLSKHERDENVLKKENIDEILDKEETLEFKEGTHCRLALGSIDNVVVAG
TIFESGRNDGNVKVCIDVMVHDDSRLPIPTHGGNDILSQEIGSHILWPQSLVISDNEKKIKHSMELFRLASGKSQTQPAPVSLTFLTREINYIGRAIQMIISKDVFGHEH
KLFIMVEDVQKLFHMEPTTTLCIDAYTTFLHRSLGNENESSPYKFLDAGVTSITNLSKENRMQVLTKRLSELELNQLLMSPYHFGSLDIDSDVSCKEYDIFLDSMRNRLT
DDILSVITMAVRNIQKKPFTLKRVPVRIVIVSSLALHKGYESVLIPKHVGARVEGSRAERFCNMLATFCSLTFHVYTVFQEESGEPRRVATCNI