CuGenDBv2

IVF0010548 (gene) of Melon (IVF77) v1 genome

Gene ID: IVF0010548
Organism: Cucumis melo ssp. agrestis cv. IVF77 (Melon (IVF77) v1)
Description: proline-rich protein 4-like
Genome location: chr01:15331527..15333667
RNA-Seq Expression: IVF0010548
Synteny: IVF0010548
Gene Ontology terms: NA
InterPro domains: IPR011049 - Serralysin-like metalloprotease, C-terminal
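
The genome location above follows a chrom:start..end pattern. As a minimal sketch, the Python below splits such a string and computes the span it covers. The function names parse_location and span_length are hypothetical, and the length calculation assumes the coordinates are 1-based and inclusive, which this page does not state.

```python
import re


def parse_location(location):
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    chrom, start, end = match.groups()
    return chrom, int(start), int(end)


def span_length(start, end):
    # Assumption: coordinates are 1-based and inclusive, so both ends count.
    return end - start + 1


chrom, start, end = parse_location("chr01:15331527..15333667")
print(chrom, start, end, span_length(start, end))
```

The span is the genomic footprint of the gene model, so it can be longer than the mRNA sequence listed further down if the model contains introns.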


Homology
GenBank top hits | e value | % identity | Alignment
KAA0049728.1 proline-rich protein 4-like [Cucumis melo var. makuwa] | 1.56e-106 | 98.75
Query:  FSGLSVSIDCKQKDGTIERKGVAKLDEEGNFKVLLPTEVLNKDENLEGKCFAQLHSASSTPCPSHDGSEMASMIVIKSKDKGKQTFGLPNGIKFKSETCV
         +GLSVSIDCKQKDGTIERKGVAKLDEEGNFKVLLPTEVLNKDENLEGKCFAQLHSASSTPCPSHDGSEMASMIVIKSKDKGKQTFGLPNGIKFKSETCV
Subjt:  FSGLSVSIDCKQKDGTIERKGVAKLDEEGNFKVLLPTEVLNKDENLEGKCFAQLHSASSTPCPSHDGSEMASMIVIKSKDKGKQTFGLPNGIKFKSETCV

Query:  SAFFWHHYYHHPPLPPFTLPVFPPHPPLYTHPLFPPKVSVPPPVLEKPPPVYEKPPPVEE
        SAFFWHHYYHHPPLPPFTLPVFPPHPPLYTHPLFPPKVSVPPPVLEKPPPVYEKPPPVEE
Subjt:  SAFFWHHYYHHPPLPPFTLPVFPPHPPLYTHPLFPPKVSVPPPVLEKPPPVYEKPPPVEE

KAE8645665.1 hypothetical protein Csa_020188 [Cucumis sativus] | 4.05e-133 | 93.33
Query:  MHNPPCFLIFFLLFASTFCHGSDLTTVEVVGVGECADCYKNNIKTNHAFSGLSVSIDCKQKDGTIERKGVAKLDEEGNFKVLLPTEVLNKDENLEGKCFA
        MHNPP FLIFFLLFASTFCHGSDLTTVEVVGVGECADCYKNNIKTNHAFSGLSVSIDCKQKDGT ERKGVAKLDEEGNFKVLLPTEVLNKD NL+GKCFA
Subjt:  MHNPPCFLIFFLLFASTFCHGSDLTTVEVVGVGECADCYKNNIKTNHAFSGLSVSIDCKQKDGTIERKGVAKLDEEGNFKVLLPTEVLNKDENLEGKCFA

Query:  QLHSASSTPCPSHDGSEMASMIVIKSKDKGKQTFGLPNGIKFKSETCVSAFFWHHYYHHPPLPPFTLPVFPPHPPLYTHPLFPPKVSVPP--PVLEKPPP
        QLHSASSTPCPSHDGSEMASMI IKSKDKGKQTFGLPNGIKFKSETCVSAFFWHHY+HHPPLPPF++PVFPPHPP+YTHPLFPPKV VPP  PV EKPPP
Subjt:  QLHSASSTPCPSHDGSEMASMIVIKSKDKGKQTFGLPNGIKFKSETCVSAFFWHHYYHHPPLPPFTLPVFPPHPPLYTHPLFPPKVSVPP--PVLEKPPP

Query:  VYEKPPPVEE
        V EKPPPVEE
Subjt:  VYEKPPPVEE

XP_008447985.1 PREDICTED: proline-rich protein 4-like [Cucumis melo] | 1.66e-147 | 100
Query:  MHNPPCFLIFFLLFASTFCHGSDLTTVEVVGVGECADCYKNNIKTNHAFSGLSVSIDCKQKDGTIERKGVAKLDEEGNFKVLLPTEVLNKDENLEGKCFA
        MHNPPCFLIFFLLFASTFCHGSDLTTVEVVGVGECADCYKNNIKTNHAFSGLSVSIDCKQKDGTIERKGVAKLDEEGNFKVLLPTEVLNKDENLEGKCFA
Subjt:  MHNPPCFLIFFLLFASTFCHGSDLTTVEVVGVGECADCYKNNIKTNHAFSGLSVSIDCKQKDGTIERKGVAKLDEEGNFKVLLPTEVLNKDENLEGKCFA

Query:  QLHSASSTPCPSHDGSEMASMIVIKSKDKGKQTFGLPNGIKFKSETCVSAFFWHHYYHHPPLPPFTLPVFPPHPPLYTHPLFPPKVSVPPPVLEKPPPVY
        QLHSASSTPCPSHDGSEMASMIVIKSKDKGKQTFGLPNGIKFKSETCVSAFFWHHYYHHPPLPPFTLPVFPPHPPLYTHPLFPPKVSVPPPVLEKPPPVY
Subjt:  QLHSASSTPCPSHDGSEMASMIVIKSKDKGKQTFGLPNGIKFKSETCVSAFFWHHYYHHPPLPPFTLPVFPPHPPLYTHPLFPPKVSVPPPVLEKPPPVY

Query:  EKPPPVEE
        EKPPPVEE
Subjt:  EKPPPVEE

XP_031745143.1 proline-rich protein 4 [Cucumis sativus] | 5.11e-136 | 93.33
Query:  MHNPPCFLIFFLLFASTFCHGSDLTTVEVVGVGECADCYKNNIKTNHAFSGLSVSIDCKQKDGTIERKGVAKLDEEGNFKVLLPTEVLNKDENLEGKCFA
        MHNPP FLIFFLLFASTFCHGSDLTTVEVVGVGECADCYKNNIKTNHAFSGLSVSIDCKQKDGT ERKGVAKLDEEGNFKVLLPTEVLNKD NL+GKCFA
Subjt:  MHNPPCFLIFFLLFASTFCHGSDLTTVEVVGVGECADCYKNNIKTNHAFSGLSVSIDCKQKDGTIERKGVAKLDEEGNFKVLLPTEVLNKDENLEGKCFA

Query:  QLHSASSTPCPSHDGSEMASMIVIKSKDKGKQTFGLPNGIKFKSETCVSAFFWHHYYHHPPLPPFTLPVFPPHPPLYTHPLFPPKVSVPP--PVLEKPPP
        QLHSASSTPCPSHDGSEMASMI IKSKDKGKQTFGLPNGIKFKSETCVSAFFWHHY+HHPPLPPF++PVFPPHPP+YTHPLFPPKV VPP  PV EKPPP
Subjt:  QLHSASSTPCPSHDGSEMASMIVIKSKDKGKQTFGLPNGIKFKSETCVSAFFWHHYYHHPPLPPFTLPVFPPHPPLYTHPLFPPKVSVPP--PVLEKPPP

Query:  VYEKPPPVEE
        V EKPPPVEE
Subjt:  VYEKPPPVEE

XP_038888565.1 proline-rich protein 4-like [Benincasa hispida] | 3.19e-114 | 79.04
Query:  MHNPPCF--LIFFLLFASTFCHGSDLTTVEVVGVGECADCYKNNIKTNHAFSGLSVSIDCKQKDGTIERKGVAKLDEEGNFKVLLPTEVLNKDENLEGKC
        MHNP CF  L FFLL AS FCHG+DLTTVEVVGVGECADC+KNNIKTNHAFSGLSVSIDCKQKDG IERKGVAKLDEEG FKVLLPTEVL KD  L+GKC
Subjt:  MHNPPCF--LIFFLLFASTFCHGSDLTTVEVVGVGECADCYKNNIKTNHAFSGLSVSIDCKQKDGTIERKGVAKLDEEGNFKVLLPTEVLNKDENLEGKC

Query:  FAQLHSASSTPCPSHDGSEMA-SMIVIKSKDKGKQTFGLPNGIKFKSETCVSAFFWHHYYHHPPLPPFTLPVFPPHPPLYTHP---------LFPPKV--
        FAQLHSASSTPCPSHDG EMA SMIV KSK +GKQTFGLP+G+KFKSETCVSAFFWHHYYHHPPLPP +LPVFPPHPPLY+HP         LFPPKV  
Subjt:  FAQLHSASSTPCPSHDGSEMA-SMIVIKSKDKGKQTFGLPNGIKFKSETCVSAFFWHHYYHHPPLPPFTLPVFPPHPPLYTHP---------LFPPKV--

Query:  -------SVPPPVLEKPPPVYEKPPPVEE
                +PPPV +KPPP YEKPPPV E
Subjt:  -------SVPPPVLEKPPPVYEKPPPVEE

TrEMBL top hits | e value | % identity | Alignment
A0A0A0K2H7 Uncharacterized protein | 3.4e-108 | 93.33
Query:  MHNPPCFLIFFLLFASTFCHGSDLTTVEVVGVGECADCYKNNIKTNHAFSGLSVSIDCKQKDGTIERKGVAKLDEEGNFKVLLPTEVLNKDENLEGKCFA
        MHNPP FLIFFLLFASTFCHGSDLTTVEVVGVGECADCYKNNIKTNHAFSGLSVSIDCKQKDGT ERKGVAKLDEEGNFKVLLPTEVLNKD NL+GKCFA
Subjt:  MHNPPCFLIFFLLFASTFCHGSDLTTVEVVGVGECADCYKNNIKTNHAFSGLSVSIDCKQKDGTIERKGVAKLDEEGNFKVLLPTEVLNKDENLEGKCFA

Query:  QLHSASSTPCPSHDGSEMASMIVIKSKDKGKQTFGLPNGIKFKSETCVSAFFWHHYYHHPPLPPFTLPVFPPHPPLYTHPLFPPKVSVPP--PVLEKPPP
        QLHSASSTPCPSHDGSEMASMI IKSKDKGKQTFGLPNGIKFKSETCVSAFFWHHY+HHPPLPPF++PVFPPHPP+YTHPLFPPKV VPP  PV EKPPP
Subjt:  QLHSASSTPCPSHDGSEMASMIVIKSKDKGKQTFGLPNGIKFKSETCVSAFFWHHYYHHPPLPPFTLPVFPPHPPLYTHPLFPPKVSVPP--PVLEKPPP

Query:  VYEKPPPVEE
        V EKPPPVEE
Subjt:  VYEKPPPVEE

A0A1S3BIP0 proline-rich protein 4-like | 5.2e-117 | 100
Query:  MHNPPCFLIFFLLFASTFCHGSDLTTVEVVGVGECADCYKNNIKTNHAFSGLSVSIDCKQKDGTIERKGVAKLDEEGNFKVLLPTEVLNKDENLEGKCFA
        MHNPPCFLIFFLLFASTFCHGSDLTTVEVVGVGECADCYKNNIKTNHAFSGLSVSIDCKQKDGTIERKGVAKLDEEGNFKVLLPTEVLNKDENLEGKCFA
Subjt:  MHNPPCFLIFFLLFASTFCHGSDLTTVEVVGVGECADCYKNNIKTNHAFSGLSVSIDCKQKDGTIERKGVAKLDEEGNFKVLLPTEVLNKDENLEGKCFA

Query:  QLHSASSTPCPSHDGSEMASMIVIKSKDKGKQTFGLPNGIKFKSETCVSAFFWHHYYHHPPLPPFTLPVFPPHPPLYTHPLFPPKVSVPPPVLEKPPPVY
        QLHSASSTPCPSHDGSEMASMIVIKSKDKGKQTFGLPNGIKFKSETCVSAFFWHHYYHHPPLPPFTLPVFPPHPPLYTHPLFPPKVSVPPPVLEKPPPVY
Subjt:  QLHSASSTPCPSHDGSEMASMIVIKSKDKGKQTFGLPNGIKFKSETCVSAFFWHHYYHHPPLPPFTLPVFPPHPPLYTHPLFPPKVSVPPPVLEKPPPVY

Query:  EKPPPVEE
        EKPPPVEE
Subjt:  EKPPPVEE

A0A5A7U3B0 Proline-rich protein 4-like | 6.8e-85 | 99.37
Query:  SGLSVSIDCKQKDGTIERKGVAKLDEEGNFKVLLPTEVLNKDENLEGKCFAQLHSASSTPCPSHDGSEMASMIVIKSKDKGKQTFGLPNGIKFKSETCVS
        +GLSVSIDCKQKDGTIERKGVAKLDEEGNFKVLLPTEVLNKDENLEGKCFAQLHSASSTPCPSHDGSEMASMIVIKSKDKGKQTFGLPNGIKFKSETCVS
Subjt:  SGLSVSIDCKQKDGTIERKGVAKLDEEGNFKVLLPTEVLNKDENLEGKCFAQLHSASSTPCPSHDGSEMASMIVIKSKDKGKQTFGLPNGIKFKSETCVS

Query:  AFFWHHYYHHPPLPPFTLPVFPPHPPLYTHPLFPPKVSVPPPVLEKPPPVYEKPPPVEE
        AFFWHHYYHHPPLPPFTLPVFPPHPPLYTHPLFPPKVSVPPPVLEKPPPVYEKPPPVEE
Subjt:  AFFWHHYYHHPPLPPFTLPVFPPHPPLYTHPLFPPKVSVPPPVLEKPPPVYEKPPPVEE

A0A6J1C205 proline-rich protein 4-like | 1.2e-70 | 71.36
Query:  MHNPPC-FLIFFLLFASTFCHGSDLTTVEVVGVGECADCYKNNIKTNHAFSGLSVSIDCKQKDGTIERKGVAKLDEEGNFKVLLPTEVLNKDENLEGKCF
        M N  C FL    LFA+TFCHGSDLTTVEVVG GECADC+KNNIKT+HAFSGL VSIDCKQKDG  +RKGVA+LDEEG FKVLLPTEVL   E  + KCF
Subjt:  MHNPPC-FLIFFLLFASTFCHGSDLTTVEVVGVGECADCYKNNIKTNHAFSGLSVSIDCKQKDGTIERKGVAKLDEEGNFKVLLPTEVLNKDENLEGKCF

Query:  AQLHSASSTPCPSHDGSEMASMIVIKSKDKGKQTFGLPNGIKFKSETCVSAFFWHHYYHHPPLPPFTLPVFPPHPPLYTHP-----LFPPKVSVPPPVLE
        AQLHSASS PCPSH G E +SMIV KSKD+GKQTFGLP G+KFKS TC SAFF   Y+HHPPLPP     FPPHPPL+THP      FPPK + PPP  E
Subjt:  AQLHSASSTPCPSHDGSEMASMIVIKSKDKGKQTFGLPNGIKFKSETCVSAFFWHHYYHHPPLPPFTLPVFPPHPPLYTHP-----LFPPKVSVPPPVLE

Query:  K--PPPVYEKPPP
        +  PPPV EKPPP
Subjt:  K--PPPVYEKPPP

A0A6J1JIY3 proline-rich protein 4-like isoform X3 | 6.2e-70 | 71.09
Query:  FLIFFLLFASTFCHGSDLTTVEVVGVGECADCYKNNIKTNHAFSGLSVSIDCKQKDGTIERKGVAKLDEEGNFKVLLPTEVLNKDENLEGKCFAQLHSAS
        F  FF LFA TFCHGSDLTTVEVVGVGECADC+KNNIKT HAF+GL VSIDCK KDG+ ERKG A+L+EEG FKVLLPTE L KD  L+GKCFA LHSAS
Subjt:  FLIFFLLFASTFCHGSDLTTVEVVGVGECADCYKNNIKTNHAFSGLSVSIDCKQKDGTIERKGVAKLDEEGNFKVLLPTEVLNKDENLEGKCFAQLHSAS

Query:  STPCPSHDGSEMASMIVIKSKDKGKQTFGLPNGIKFKSETCVSAFFWHHYYHHPPLPPFTLPVFPPHPPLYTHP--------LFPPKVSVPPPVLEKPPP
        STPC S DG E +SMIV+KS   GK TFGLP  +KF+S TCVSAFFWH  YHHPPLPP    VFPPHPPL+ HP         FPP     PPV EKPPP
Subjt:  STPCPSHDGSEMASMIVIKSKDKGKQTFGLPNGIKFKSETCVSAFFWHHYYHHPPLPPFTLPVFPPHPPLYTHP--------LFPPKVSVPPPVLEKPPP

Query:  VYEKPPPVEEN
        VYE PPPV EN
Subjt:  VYEKPPPVEEN

SwissProt top hits | e value | % identity | Alignment
Q9SKP9 Proline-rich protein 2 | 7.6e-25 | 38.43
Query:  CFLIFFLLFA---STFCHGSDLTTVEVVGVGECADCYKNNIKTNHAFSGLSVSIDCKQKD--GTIERKGVAKLDEEGNFKVLLPTEVLNKDENLEGKCFA
        C L  F L +   S  C    +  VEV+G  E      + IK  +AFSGL V+I+CK  D  G    +G  ++DE G F + +P +++  D  L+  C+A
Subjt:  CFLIFFLLFA---STFCHGSDLTTVEVVGVGECADCYKNNIKTNHAFSGLSVSIDCKQKD--GTIERKGVAKLDEEGNFKVLLPTEVLNKDENLEGKCFA

Query:  QLHSASSTPCPSHDGSEMASMIVIKSKDKGKQTFGLPNGIKFKSETCVSAFFWH--HYYHHPP--LPPFTLP-VFPPHPPLYTHPLFPPKVSVPP-----
         L SA   PCP+HDG E AS IV  SK       GL   +KF  E C+S FFWH   +   PP  LPP T P +  P PP+Y  P+  PK   PP     
Subjt:  QLHSASSTPCPSHDGSEMASMIVIKSKDKGKQTFGLPNGIKFKSETCVSAFFWH--HYYHHPP--LPPFTLP-VFPPHPPLYTHPLFPPKVSVPP-----

Query:  PVLEKPPPVYEKPPPV
        P+ + P P+Y+ P P+
Subjt:  PVLEKPPPVYEKPPPV

Q9T0I5 Proline-rich protein 4 | 2.5e-28 | 41.52
Query:  PCFLIFFLLFASTFCHGSDLTTVEVVGVGECADCYKNNIKTNHAFSGLSVSIDCKQKDGTIERKGVAKLDEEGNFKVLLPTEVLNKDENLEGKCFAQLHS
        PC L+   +  S     S    VEVVG  E      + IKT HAFSGL V+IDCK   G    KG   +D++G F + +P ++++ +  L+ +C+AQLHS
Subjt:  PCFLIFFLLFASTFCHGSDLTTVEVVGVGECADCYKNNIKTNHAFSGLSVSIDCKQKDGTIERKGVAKLDEEGNFKVLLPTEVLNKDENLEGKCFAQLHS

Query:  ASSTPCPSHDGSEMASMIVIKSKDKGKQTFGLPNGIKFKSETCVSAFFW-------HHYYHHP-PL-PPFTLPVF--PPHPPLYTHPLF----------P
        A+ TPCP+HDG E ++ IV  SK   K   GL   +KF  E CVS FFW          + HP PL PP  LP F   P PP Y+ P+           P
Subjt:  ASSTPCPSHDGSEMASMIVIKSKDKGKQTFGLPNGIKFKSETCVSAFFW-------HHYYHHP-PL-PPFTLPVF--PPHPPLYTHPLF----------P

Query:  PKVSVPPPV-LEKPPPVYEKPPPV
        PK  +PPPV +  PPP  E PPPV
Subjt:  PKVSVPPPV-LEKPPPVYEKPPPV

Arabidopsis top hits | e value | % identity | Alignment
AT2G21140.1 proline-rich protein 2 | 5.4e-26 | 38.43
Query:  CFLIFFLLFA---STFCHGSDLTTVEVVGVGECADCYKNNIKTNHAFSGLSVSIDCKQKD--GTIERKGVAKLDEEGNFKVLLPTEVLNKDENLEGKCFA
        C L  F L +   S  C    +  VEV+G  E      + IK  +AFSGL V+I+CK  D  G    +G  ++DE G F + +P +++  D  L+  C+A
Subjt:  CFLIFFLLFA---STFCHGSDLTTVEVVGVGECADCYKNNIKTNHAFSGLSVSIDCKQKD--GTIERKGVAKLDEEGNFKVLLPTEVLNKDENLEGKCFA

Query:  QLHSASSTPCPSHDGSEMASMIVIKSKDKGKQTFGLPNGIKFKSETCVSAFFWH--HYYHHPP--LPPFTLP-VFPPHPPLYTHPLFPPKVSVPP-----
         L SA   PCP+HDG E AS IV  SK       GL   +KF  E C+S FFWH   +   PP  LPP T P +  P PP+Y  P+  PK   PP     
Subjt:  QLHSASSTPCPSHDGSEMASMIVIKSKDKGKQTFGLPNGIKFKSETCVSAFFWH--HYYHHPP--LPPFTLP-VFPPHPPLYTHPLFPPKVSVPP-----

Query:  PVLEKPPPVYEKPPPV
        P+ + P P+Y+ P P+
Subjt:  PVLEKPPPVYEKPPPV

AT4G38770.1 proline-rich protein 4 | 1.8e-29 | 41.52
Query:  PCFLIFFLLFASTFCHGSDLTTVEVVGVGECADCYKNNIKTNHAFSGLSVSIDCKQKDGTIERKGVAKLDEEGNFKVLLPTEVLNKDENLEGKCFAQLHS
        PC L+   +  S     S    VEVVG  E      + IKT HAFSGL V+IDCK   G    KG   +D++G F + +P ++++ +  L+ +C+AQLHS
Subjt:  PCFLIFFLLFASTFCHGSDLTTVEVVGVGECADCYKNNIKTNHAFSGLSVSIDCKQKDGTIERKGVAKLDEEGNFKVLLPTEVLNKDENLEGKCFAQLHS

Query:  ASSTPCPSHDGSEMASMIVIKSKDKGKQTFGLPNGIKFKSETCVSAFFW-------HHYYHHP-PL-PPFTLPVF--PPHPPLYTHPLF----------P
        A+ TPCP+HDG E ++ IV  SK   K   GL   +KF  E CVS FFW          + HP PL PP  LP F   P PP Y+ P+           P
Subjt:  ASSTPCPSHDGSEMASMIVIKSKDKGKQTFGLPNGIKFKSETCVSAFFW-------HHYYHHP-PL-PPFTLPVF--PPHPPLYTHPLF----------P

Query:  PKVSVPPPV-LEKPPPVYEKPPPV
        PK  +PPPV +  PPP  E PPPV
Subjt:  PKVSVPPPV-LEKPPPVYEKPPPV
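
In the alignments above, the unlabeled middle line follows the usual BLAST-style convention: a letter marks an identical residue, "+" a conservative substitution, and a blank a mismatch or gap. As a rough sketch, the Python below counts identities for one alignment segment. The function name percent_identity is hypothetical, the "Query:  " prefix and matching indentation are assumed to have been stripped already, and the result is a per-segment figure that need not equal the whole-alignment value in the % identity column.

```python
def percent_identity(query, midline):
    """Percent of alignment columns whose midline repeats the query residue."""
    if not query:
        raise ValueError("empty alignment segment")
    # Trailing blanks in the midline are often lost in copied text, so pad it.
    midline = midline.ljust(len(query))
    identical = sum(
        1 for q, m in zip(query, midline)
        if q != "-" and m == q  # gap columns never count as identities
    )
    return 100.0 * identical / len(query)


# Last segment of the KAE8645665.1 alignment shown above.
print(percent_identity("VYEKPPPVEE", "V EKPPPVEE"))
```

For a full hit, the identical columns and segment lengths would be summed across all segments before dividing.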


Sequences
CDS sequence
ATGCATAATCCTCCCTGTTTCTTGATCTTCTTCTTGCTTTTTGCTTCAACTTTCTGTCATGGCAGTGACTTGACGACAGTTGAAGTTGTTGGAGTTGGAGAATGTGCAGA
TTGCTACAAGAATAACATTAAAACCAACCATGCCTTCTCAGGGCTTAGTGTTAGCATTGACTGCAAACAAAAAGATGGAACAATAGAAAGAAAAGGGGTTGCAAAGCTAG
ATGAAGAAGGAAACTTCAAAGTCTTACTTCCAACTGAGGTTTTGAACAAAGATGAAAATTTGGAAGGAAAGTGCTTTGCACAACTTCACAGTGCTTCTTCTACTCCTTGT
CCTTCTCACGATGGCTCAGAAATGGCTTCAATGATTGTAATCAAATCCAAAGATAAAGGAAAACAAACATTTGGTTTGCCTAATGGAATCAAATTCAAAAGTGAAACTTG
TGTTTCTGCCTTCTTTTGGCATCATTATTATCATCACCCTCCTTTGCCTCCTTTCACTTTACCTGTTTTCCCTCCTCATCCTCCTTTGTATACCCATCCCTTGTTCCCTC
CTAAAGTTTCGGTACCTCCCCCCGTCCTCGAAAAGCCTCCTCCGGTCTACGAAAAGCCTCCACCGGTTGAAGAGAATTTACAAGAAAAAACCACCGAAGCCACCGGTTTA
CAAGCCGAAGCCACCGGTCTACAAGCCAAAGCCACCGGTTTACAAGCCGAAGCCACCGGTCTACGAGCCAAAGCCACCGGTTTACAAGCCGAAGCCACCGGTTTACAAGC
CGAAGCCACCGGTGTACGAGCCAAAGCCACCGGTTTACGAGCCGAAGCCACCGGTTTACAAGCCAAAACCACCTGTTTACAAGCCTCCAACACCACAAAAGCCAGAGTCT
CCTCCATATTATAA
mRNA sequence
CATTCCCTTCTTCATTATATAATTTCTTCATGCATAATCCTCCCTGTTTCTTGATCTTCTTCTTGCTTTTTGCTTCAACTTTCTGTCATGGCAGTGACTTGACGACAGTT
GAAGTTGTTGGAGTTGGAGAATGTGCAGATTGCTACAAGAATAACATTAAAACCAACCATGCCTTCTCAGGGCTTAGTGTTAGCATTGACTGCAAACAAAAAGATGGAAC
AATAGAAAGAAAAGGGGTTGCAAAGCTAGATGAAGAAGGAAACTTCAAAGTCTTACTTCCAACTGAGGTTTTGAACAAAGATGAAAATTTGGAAGGAAAGTGCTTTGCAC
AACTTCACAGTGCTTCTTCTACTCCTTGTCCTTCTCACGATGGCTCAGAAATGGCTTCAATGATTGTAATCAAATCCAAAGATAAAGGAAAACAAACATTTGGTTTGCCT
AATGGAATCAAATTCAAAAGTGAAACTTGTGTTTCTGCCTTCTTTTGGCATCATTATTATCATCACCCTCCTTTGCCTCCTTTCACTTTACCTGTTTTCCCTCCTCATCC
TCCTTTGTATACCCATCCCTTGTTCCCTCCTAAAGTTTCGGTACCTCCCCCCGTCCTCGAAAAGCCTCCTCCGGTCTACGAAAAGCCTCCACCGGTTGAAGAGAATTTAC
AAGAAAAAACCACCGAAGCCACCGGTTTACAAGCCGAAGCCACCGGTCTACAAGCCAAAGCCACCGGTTTACAAGCCGAAGCCACCGGTCTACGAGCCAAAGCCACCGGT
TTACAAGCCGAAGCCACCGGTTTACAAGCCGAAGCCACCGGTGTACGAGCCAAAGCCACCGGTTTACGAGCCGAAGCCACCGGTTTACAAGCCAAAACCACCTGTTTACA
AGCCTCCAACACCACAAAAGCCAGAGTCTCCTCCATATTATAAGCATCCATGGTATAAGATTCTCCCTCCAATTTCAAAGCTTCCACCATGTCCACCAGTTCCAAAGGTT
GTTGTCCCTCCAAAGTACTATTCTCACCCAAAGTTCGGCAAGAAATTTCCTCCCCAGTCTCCTTCTGTTCCACACCTTAATTAGAAGAAAACTAAATACATTTAGTTTAT
ATGAAGATGATTGAAGATATGTTATCTATGTTATATATATATATATATATATATTGAATTCTTTGTCATCTGTAAGTATAGGGGAATTTGTTTGATGTTATATTATATTT
GTAGATTAGCTTCAAACTTTGTTGTTGTTGTTGTAATAAAGACTTCAAGTGGAGATCAGAAGAAGATCAAAGAAATTCAGAGGGAGAGATCAGAAATGGTATATTGTGTA
TGTATTGAGTGTAATAATAATTTGTTTTGTATTGTAAAAGTTACAAAAGTGAGATTGAATTTGAAGGGTGTGGATTATGTTTGAGTTTAA
Protein sequence
MHNPPCFLIFFLLFASTFCHGSDLTTVEVVGVGECADCYKNNIKTNHAFSGLSVSIDCKQKDGTIERKGVAKLDEEGNFKVLLPTEVLNKDENLEGKCFAQLHSASSTPC
PSHDGSEMASMIVIKSKDKGKQTFGLPNGIKFKSETCVSAFFWHHYYHHPPLPPFTLPVFPPHPPLYTHPLFPPKVSVPPPVLEKPPPVYEKPPPVEENLQEKTTEATGL
QAEATGLQAKATGLQAEATGLRAKATGLQAEATGLQAEATGVRAKATGLRAEATGLQAKTTCLQASNTTKARVSSIL
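
The protein sequence can be cross-checked against the CDS by translating it codon by codon. The sketch below uses the standard genetic code, which is an assumption since the page does not name a translation table, and the function name translate is hypothetical. It is applied here only to the first 30 and last 15 nucleotides of the CDS listed above.

```python
# Standard genetic code, built in T, C, A, G order for each codon position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: aa
    for (a, b, c), aa in zip(
        ((a, b, c) for a in BASES for b in BASES for c in BASES),
        AMINO_ACIDS,
    )
}


def translate(cds):
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = cds.upper().replace("\n", "")
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        residue = CODON_TABLE.get(cds[i:i + 3], "X")  # X for unknown codons
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


print(translate("ATGCATAATCCTCCCTGTTTCTTGATCTTC"))  # start of the CDS
print(translate("TCCTCCATATTATAA"))                 # end of the CDS
```

Joining the CDS lines into one string and passing it to translate gives the full product for comparison with the protein sequence.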