; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Pay0010598 (gene) of Melon (Payzawat) v1 genome

Gene IDPay0010598
OrganismCucumis melo var. inodorus cv. Payzawat (Melon (Payzawat) v1)
DescriptionKynurenine formamidase
Genome locationchr04:29912759..29914119
RNA-Seq ExpressionPay0010598
SyntenyPay0010598
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004136559.1 uncharacterized protein LOC101219545 [Cucumis sativus]7.7e-13893.75Show/hide
Query:  MNSLFLHSSLVPSLTCNPRSFPLKRLRPIHICAPRGSHHNTRIFAPSISFPALHISNSAARSSTPSNEGVVSVVNFEDLVEKDFSVLDSDDFSSVEEHDR
        MNSLFLHSSLVPS+TC PRSFPL+  RPI ICA RGSHHNTRIFAP ISFPALHISNS A SSTPSNEGVVSVVNFEDLVEKDFS LDSDDFSS+EEH +
Subjt:  MNSLFLHSSLVPSLTCNPRSFPLKRLRPIHICAPRGSHHNTRIFAPSISFPALHISNSAARSSTPSNEGVVSVVNFEDLVEKDFSVLDSDDFSSVEEHDR

Query:  KIRRIISAGEIVESSQVMVSISSEGFVDQLFQLAPSRSLLVVHDSILTLACIKEKYDKVKCWQGEVIYVPEKWGPFDAVFLYYLPAMPFELDAIFGALSE
        KIRRIISAGEIVESSQVMVSISSEGFVDQLFQLAPSRSLLVVHDSILTLACIKEKYDKVKCWQGEVIYVPEKWGPFDAVFLYYLPAMPFELDAIFGALSE
Subjt:  KIRRIISAGEIVESSQVMVSISSEGFVDQLFQLAPSRSLLVVHDSILTLACIKEKYDKVKCWQGEVIYVPEKWGPFDAVFLYYLPAMPFELDAIFGALSE

Query:  RCVAGARLVISHPDGRKALEQERQQFPDVVVSDLPDRMTLQKAAADHSFDLTEFIDEHGFYLAILKFNKDRS
        RCVAGARLVISHP+GRKALEQE+QQFPDVVVSDLPDRMTLQKAAADHSFDLTEFIDEHGFYLAILKFNKD S
Subjt:  RCVAGARLVISHPDGRKALEQERQQFPDVVVSDLPDRMTLQKAAADHSFDLTEFIDEHGFYLAILKFNKDRS

XP_008443029.1 PREDICTED: uncharacterized protein LOC103486747 [Cucumis melo]5.7e-14998.9Show/hide
Query:  MNSLFLHSSLVPSLTCNPRSFPLKRLRPIHICAPRGSHHNTRIFAPSISFPALHISNSAARSSTPSNEGVVSVVNFEDLVEKDFSVLDSDDFSSVEEHDR
        MNSLFLHSSLVPSLTCNPRSFPLKRLRPIH+CAPRGSHHNTRIFAPSISFPALHISNSAARSSTPSNEGVVSVVNFEDLVEKDFS LDSDDFSSVEEHDR
Subjt:  MNSLFLHSSLVPSLTCNPRSFPLKRLRPIHICAPRGSHHNTRIFAPSISFPALHISNSAARSSTPSNEGVVSVVNFEDLVEKDFSVLDSDDFSSVEEHDR

Query:  KIRRIISAGEIVESSQVMVSISSEGFVDQLFQLAPSRSLLVVHDSILTLACIKEKYDKVKCWQGEVIYVPEKWGPFDAVFLYYLPAMPFELDAIFGALSE
        KIRRIISAGEIVESSQVMVSISSEGFVDQLFQLAPSRSLLVVHDSILTLACIKEKYDKVKCWQGE+IYVPEKWGPFDAVFLYYLPAMPFELDAIFGALSE
Subjt:  KIRRIISAGEIVESSQVMVSISSEGFVDQLFQLAPSRSLLVVHDSILTLACIKEKYDKVKCWQGEVIYVPEKWGPFDAVFLYYLPAMPFELDAIFGALSE

Query:  RCVAGARLVISHPDGRKALEQERQQFPDVVVSDLPDRMTLQKAAADHSFDLTEFIDEHGFYLAILKFNKDRS
        RCVAGARLVISHPDGRKALEQERQQFPDVVVSDLPDRMTLQKAAADHSFDLTEFIDEHGFYLAILKFNKDRS
Subjt:  RCVAGARLVISHPDGRKALEQERQQFPDVVVSDLPDRMTLQKAAADHSFDLTEFIDEHGFYLAILKFNKDRS

XP_022151424.1 uncharacterized protein LOC111019361 [Momordica charantia]1.1e-11277.66Show/hide
Query:  MNSLFLHSSLVPSLTCNPRSFPLKRLRPIHICAPRGSHHNTRIFAPSIS-FPALHISNSAARSSTPSNEGVVSVVNFEDLVEKDFSVLDSDDFSSVEEHD
        MNSL LHSSL PSLT  P    LK  RP+ IC   GSH + RIF+ S+S +P+LH+S     +STPSNEGVVSV+NFEDLVEKDFS LDSDDFSS EE+D
Subjt:  MNSLFLHSSLVPSLTCNPRSFPLKRLRPIHICAPRGSHHNTRIFAPSIS-FPALHISNSAARSSTPSNEGVVSVVNFEDLVEKDFSVLDSDDFSSVEEHD

Query:  RKIRRIISAGEIVESSQVMVSISSEGFVDQLFQLAPSRSLLVVHDSILTLACIKEKYDKVKCWQGEVIYVPEKWGPFDAVFLYYLPAMPFELDAIFGALS
        +KIRRIISAGE+ ESSQVMVSI SEGFVDQLF  AP RSLLV+HDSILTLACIKEKYDKVKCWQGEVIYVPEKWGPFD VFLYYLPAMPFELDAIFG LS
Subjt:  RKIRRIISAGEIVESSQVMVSISSEGFVDQLFQLAPSRSLLVVHDSILTLACIKEKYDKVKCWQGEVIYVPEKWGPFDAVFLYYLPAMPFELDAIFGALS

Query:  ERCVAGARLVISHPDGRKALEQERQQFPDVVVSDLPDRMTLQKAAADHSFDLTEFIDEHGFYLAILKFNKDRS
        +RCV GARLVISHP+G+ ALE+E+QQFPDVVVS LPD+MTLQK AADHS DLTEF+D++GFYLA+LKF+KDRS
Subjt:  ERCVAGARLVISHPDGRKALEQERQQFPDVVVSDLPDRMTLQKAAADHSFDLTEFIDEHGFYLAILKFNKDRS

XP_022971263.1 uncharacterized protein LOC111470036 [Cucurbita maxima]4.7e-11177.45Show/hide
Query:  MNSLFLHSSLVPSLTCNPRSFPLKRLRPIHICAPRGSHHNTRIF-----APSISFPALHISNSAARSSTPSNEGVVSVVNFEDLVEKDFSVLDSDDFSSV
        MNSLFLHSSL P L C P   PLK  RPIH+ A  G   +TRIF     +P    P+LHIS S   SSTPSNEGVVSV+NFEDLVEKDFS LDSDDF S+
Subjt:  MNSLFLHSSLVPSLTCNPRSFPLKRLRPIHICAPRGSHHNTRIF-----APSISFPALHISNSAARSSTPSNEGVVSVVNFEDLVEKDFSVLDSDDFSSV

Query:  EEHDRKIRRIISAGEIVESSQVMVSISSEGFVDQLFQLAPSRSLLVVHDSILTLACIKEKYDKVKCWQGEVIYVPEKWGPFDAVFLYYLPAMPFELDAIF
        EEHD KIRRIISAGEI ESSQVMV+ISSEGFVD+L+  AP RSLLVVHD+IL LACIKEKYDKVKCWQGEVIYVPEKWGPFD VFLYYLPAMPFELDAIF
Subjt:  EEHDRKIRRIISAGEIVESSQVMVSISSEGFVDQLFQLAPSRSLLVVHDSILTLACIKEKYDKVKCWQGEVIYVPEKWGPFDAVFLYYLPAMPFELDAIF

Query:  GALSERCVAGARLVISHPDGRKALEQERQQFPDVVVSDLPDRMTLQKAAADHSFDLTEFIDEHGFYLAILKFNKD
        GAL++RC+ G RLVISHP GRKAL+QE+QQF DVVVSDLPDR TLQK AADHSF LTEF+DE GFYLA+LKFNKD
Subjt:  GALSERCVAGARLVISHPDGRKALEQERQQFPDVVVSDLPDRMTLQKAAADHSFDLTEFIDEHGFYLAILKFNKD

XP_038905691.1 uncharacterized protein LOC120091662 [Benincasa hispida]2.1e-11981.16Show/hide
Query:  MNSLFLHSSLVPSLTCNPRSFPLKRLRPIHICAPRGSHHNTRIFAPSI----SFPALHISNSAARSSTPSNEGVVSVVNFEDLVEKDFSVLDSDDFSSVE
        MNSLFLHSSL PS TC P+   LK  R IHIC  RGSHHNTRIFAP +     +PA+HI+ S    STP +EGVVSV+NFEDLVEKDFS LDSD+FSS E
Subjt:  MNSLFLHSSLVPSLTCNPRSFPLKRLRPIHICAPRGSHHNTRIFAPSI----SFPALHISNSAARSSTPSNEGVVSVVNFEDLVEKDFSVLDSDDFSSVE

Query:  EHDRKIRRIISAGEIVESSQVMVSISSEGFVDQLFQLAPSRSLLVVHDSILTLACIKEKYDKVKCWQGEVIYVPEKWGPFDAVFLYYLPAMPFELDAIFG
        EHD+KIRRIISAGEI ESSQVMVSISSEGFVDQLF LAP RSLLVVHDSILTLACIKEKYDKVKCWQGEVIYVPEKWGPFD VFLYYLPAMPFELDAIF 
Subjt:  EHDRKIRRIISAGEIVESSQVMVSISSEGFVDQLFQLAPSRSLLVVHDSILTLACIKEKYDKVKCWQGEVIYVPEKWGPFDAVFLYYLPAMPFELDAIFG

Query:  ALSERCVAGARLVISHPDGRKALEQERQQFPDVVVSDLPDRMTLQKAAADHSFDLTEFIDEHGFYLAILKFNKDRS
        ALS+RCVAGARLVISHP+GRK LEQE+QQFPDVVVSDLP+RM L+ AAADHSF+LTEFIDE+ FYLA+LKF+KDRS
Subjt:  ALSERCVAGARLVISHPDGRKALEQERQQFPDVVVSDLPDRMTLQKAAADHSFDLTEFIDEHGFYLAILKFNKDRS

TrEMBL top hitse value%identityAlignment
A0A0A0LF01 Uncharacterized protein8.6e-12794.38Show/hide
Query:  KRLRPIHICAPRGSHHNTRIFAPSISFPALHISNSAARSSTPSNEGVVSVVNFEDLVEKDFSVLDSDDFSSVEEHDRKIRRIISAGEIVESSQVMVSISS
        K  RPI ICA RGSHHNTRIFAP ISFPALHISNS A SSTPSNEGVVSVVNFEDLVEKDFS LDSDDFSS+EEH +KIRRIISAGEIVESSQVMVSISS
Subjt:  KRLRPIHICAPRGSHHNTRIFAPSISFPALHISNSAARSSTPSNEGVVSVVNFEDLVEKDFSVLDSDDFSSVEEHDRKIRRIISAGEIVESSQVMVSISS

Query:  EGFVDQLFQLAPSRSLLVVHDSILTLACIKEKYDKVKCWQGEVIYVPEKWGPFDAVFLYYLPAMPFELDAIFGALSERCVAGARLVISHPDGRKALEQER
        EGFVDQLFQLAPSRSLLVVHDSILTLACIKEKYDKVKCWQGEVIYVPEKWGPFDAVFLYYLPAMPFELDAIFGALSERCVAGARLVISHP+GRKALEQE+
Subjt:  EGFVDQLFQLAPSRSLLVVHDSILTLACIKEKYDKVKCWQGEVIYVPEKWGPFDAVFLYYLPAMPFELDAIFGALSERCVAGARLVISHPDGRKALEQER

Query:  QQFPDVVVSDLPDRMTLQKAAADHSFDLTEFIDEHGFYLAILKFNKDRS
        QQFPDVVVSDLPDRMTLQKAAADHSFDLTEFIDEHGFYLAILKFNKD S
Subjt:  QQFPDVVVSDLPDRMTLQKAAADHSFDLTEFIDEHGFYLAILKFNKDRS

A0A1S3B7V6 uncharacterized protein LOC1034867472.8e-14998.9Show/hide
Query:  MNSLFLHSSLVPSLTCNPRSFPLKRLRPIHICAPRGSHHNTRIFAPSISFPALHISNSAARSSTPSNEGVVSVVNFEDLVEKDFSVLDSDDFSSVEEHDR
        MNSLFLHSSLVPSLTCNPRSFPLKRLRPIH+CAPRGSHHNTRIFAPSISFPALHISNSAARSSTPSNEGVVSVVNFEDLVEKDFS LDSDDFSSVEEHDR
Subjt:  MNSLFLHSSLVPSLTCNPRSFPLKRLRPIHICAPRGSHHNTRIFAPSISFPALHISNSAARSSTPSNEGVVSVVNFEDLVEKDFSVLDSDDFSSVEEHDR

Query:  KIRRIISAGEIVESSQVMVSISSEGFVDQLFQLAPSRSLLVVHDSILTLACIKEKYDKVKCWQGEVIYVPEKWGPFDAVFLYYLPAMPFELDAIFGALSE
        KIRRIISAGEIVESSQVMVSISSEGFVDQLFQLAPSRSLLVVHDSILTLACIKEKYDKVKCWQGE+IYVPEKWGPFDAVFLYYLPAMPFELDAIFGALSE
Subjt:  KIRRIISAGEIVESSQVMVSISSEGFVDQLFQLAPSRSLLVVHDSILTLACIKEKYDKVKCWQGEVIYVPEKWGPFDAVFLYYLPAMPFELDAIFGALSE

Query:  RCVAGARLVISHPDGRKALEQERQQFPDVVVSDLPDRMTLQKAAADHSFDLTEFIDEHGFYLAILKFNKDRS
        RCVAGARLVISHPDGRKALEQERQQFPDVVVSDLPDRMTLQKAAADHSFDLTEFIDEHGFYLAILKFNKDRS
Subjt:  RCVAGARLVISHPDGRKALEQERQQFPDVVVSDLPDRMTLQKAAADHSFDLTEFIDEHGFYLAILKFNKDRS

A0A5D3DP12 Uncharacterized protein2.8e-14998.9Show/hide
Query:  MNSLFLHSSLVPSLTCNPRSFPLKRLRPIHICAPRGSHHNTRIFAPSISFPALHISNSAARSSTPSNEGVVSVVNFEDLVEKDFSVLDSDDFSSVEEHDR
        MNSLFLHSSLVPSLTCNPRSFPLKRLRPIH+CAPRGSHHNTRIFAPSISFPALHISNSAARSSTPSNEGVVSVVNFEDLVEKDFS LDSDDFSSVEEHDR
Subjt:  MNSLFLHSSLVPSLTCNPRSFPLKRLRPIHICAPRGSHHNTRIFAPSISFPALHISNSAARSSTPSNEGVVSVVNFEDLVEKDFSVLDSDDFSSVEEHDR

Query:  KIRRIISAGEIVESSQVMVSISSEGFVDQLFQLAPSRSLLVVHDSILTLACIKEKYDKVKCWQGEVIYVPEKWGPFDAVFLYYLPAMPFELDAIFGALSE
        KIRRIISAGEIVESSQVMVSISSEGFVDQLFQLAPSRSLLVVHDSILTLACIKEKYDKVKCWQGE+IYVPEKWGPFDAVFLYYLPAMPFELDAIFGALSE
Subjt:  KIRRIISAGEIVESSQVMVSISSEGFVDQLFQLAPSRSLLVVHDSILTLACIKEKYDKVKCWQGEVIYVPEKWGPFDAVFLYYLPAMPFELDAIFGALSE

Query:  RCVAGARLVISHPDGRKALEQERQQFPDVVVSDLPDRMTLQKAAADHSFDLTEFIDEHGFYLAILKFNKDRS
        RCVAGARLVISHPDGRKALEQERQQFPDVVVSDLPDRMTLQKAAADHSFDLTEFIDEHGFYLAILKFNKDRS
Subjt:  RCVAGARLVISHPDGRKALEQERQQFPDVVVSDLPDRMTLQKAAADHSFDLTEFIDEHGFYLAILKFNKDRS

A0A6J1DD17 uncharacterized protein LOC1110193615.4e-11377.66Show/hide
Query:  MNSLFLHSSLVPSLTCNPRSFPLKRLRPIHICAPRGSHHNTRIFAPSIS-FPALHISNSAARSSTPSNEGVVSVVNFEDLVEKDFSVLDSDDFSSVEEHD
        MNSL LHSSL PSLT  P    LK  RP+ IC   GSH + RIF+ S+S +P+LH+S     +STPSNEGVVSV+NFEDLVEKDFS LDSDDFSS EE+D
Subjt:  MNSLFLHSSLVPSLTCNPRSFPLKRLRPIHICAPRGSHHNTRIFAPSIS-FPALHISNSAARSSTPSNEGVVSVVNFEDLVEKDFSVLDSDDFSSVEEHD

Query:  RKIRRIISAGEIVESSQVMVSISSEGFVDQLFQLAPSRSLLVVHDSILTLACIKEKYDKVKCWQGEVIYVPEKWGPFDAVFLYYLPAMPFELDAIFGALS
        +KIRRIISAGE+ ESSQVMVSI SEGFVDQLF  AP RSLLV+HDSILTLACIKEKYDKVKCWQGEVIYVPEKWGPFD VFLYYLPAMPFELDAIFG LS
Subjt:  RKIRRIISAGEIVESSQVMVSISSEGFVDQLFQLAPSRSLLVVHDSILTLACIKEKYDKVKCWQGEVIYVPEKWGPFDAVFLYYLPAMPFELDAIFGALS

Query:  ERCVAGARLVISHPDGRKALEQERQQFPDVVVSDLPDRMTLQKAAADHSFDLTEFIDEHGFYLAILKFNKDRS
        +RCV GARLVISHP+G+ ALE+E+QQFPDVVVS LPD+MTLQK AADHS DLTEF+D++GFYLA+LKF+KDRS
Subjt:  ERCVAGARLVISHPDGRKALEQERQQFPDVVVSDLPDRMTLQKAAADHSFDLTEFIDEHGFYLAILKFNKDRS

A0A6J1I6B9 uncharacterized protein LOC1114700362.3e-11177.45Show/hide
Query:  MNSLFLHSSLVPSLTCNPRSFPLKRLRPIHICAPRGSHHNTRIF-----APSISFPALHISNSAARSSTPSNEGVVSVVNFEDLVEKDFSVLDSDDFSSV
        MNSLFLHSSL P L C P   PLK  RPIH+ A  G   +TRIF     +P    P+LHIS S   SSTPSNEGVVSV+NFEDLVEKDFS LDSDDF S+
Subjt:  MNSLFLHSSLVPSLTCNPRSFPLKRLRPIHICAPRGSHHNTRIF-----APSISFPALHISNSAARSSTPSNEGVVSVVNFEDLVEKDFSVLDSDDFSSV

Query:  EEHDRKIRRIISAGEIVESSQVMVSISSEGFVDQLFQLAPSRSLLVVHDSILTLACIKEKYDKVKCWQGEVIYVPEKWGPFDAVFLYYLPAMPFELDAIF
        EEHD KIRRIISAGEI ESSQVMV+ISSEGFVD+L+  AP RSLLVVHD+IL LACIKEKYDKVKCWQGEVIYVPEKWGPFD VFLYYLPAMPFELDAIF
Subjt:  EEHDRKIRRIISAGEIVESSQVMVSISSEGFVDQLFQLAPSRSLLVVHDSILTLACIKEKYDKVKCWQGEVIYVPEKWGPFDAVFLYYLPAMPFELDAIF

Query:  GALSERCVAGARLVISHPDGRKALEQERQQFPDVVVSDLPDRMTLQKAAADHSFDLTEFIDEHGFYLAILKFNKD
        GAL++RC+ G RLVISHP GRKAL+QE+QQF DVVVSDLPDR TLQK AADHSF LTEF+DE GFYLA+LKFNKD
Subjt:  GALSERCVAGARLVISHPDGRKALEQERQQFPDVVVSDLPDRMTLQKAAADHSFDLTEFIDEHGFYLAILKFNKD

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G41950.1 unknown protein1.2e-7255.47Show/hide
Query:  SSLVPSLTCNPRSFPLKRLRPIHICAPRGSHHNTRIFAPS-ISFPALHISNSAA--RSSTPSNEGVVSVVNFEDLVEKDFSVLDSDDFSSVEEHDRKIRR
        S+L+PSL     S      R + +   + +   + +F+P  +SF   ++S   A   SS+   EG VSVV+F    EKD+S L+S +  S  EH +KI R
Subjt:  SSLVPSLTCNPRSFPLKRLRPIHICAPRGSHHNTRIFAPS-ISFPALHISNSAA--RSSTPSNEGVVSVVNFEDLVEKDFSVLDSDDFSSVEEHDRKIRR

Query:  IISAGEIVESSQVMVSISSEGFVDQLFQLAPSRSLLVVHDSILTLACIKEKYDKVKCWQGEVIYVPEKWGPFDAVFLYYLPAMPFELDAIFGALSERCVA
        II AGE+ ESS+V+VSISSE FVD+L + +PS+ LL+VHDS+ TLACIKEKYDKVKCWQGE+IYVPEKW P DAVFLY+LPA+PF+LD +F  LS+RC +
Subjt:  IISAGEIVESSQVMVSISSEGFVDQLFQLAPSRSLLVVHDSILTLACIKEKYDKVKCWQGEVIYVPEKWGPFDAVFLYYLPAMPFELDAIFGALSERCVA

Query:  GARLVISHPDGRKALEQERQQFPDVVVSDLPDRMTLQKAAADHSFDLTEFIDEHGFYLAILKFNK
        GAR+VISHP GR  LEQ+R++F DVVVSDLPD  TL   A  HSF+LT+F+DE G YLA+LK +K
Subjt:  GARLVISHPDGRKALEQERQQFPDVVVSDLPDRMTLQKAAADHSFDLTEFIDEHGFYLAILKFNK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAATTCTCTGTTCTTACATTCTTCTCTTGTTCCATCACTTACTTGCAATCCTCGGTCGTTTCCTCTCAAGAGGCTGAGGCCTATACATATATGTGCCCCACGTGGTTC
TCATCATAATACTCGGATATTTGCTCCTTCCATATCGTTTCCAGCCTTACATATTAGTAACTCAGCTGCCCGTTCTTCAACTCCTTCTAATGAAGGTGTAGTATCTGTAG
TCAATTTTGAAGATTTAGTTGAAAAGGACTTTTCGGTTCTCGATTCAGATGATTTTAGTTCCGTAGAAGAGCATGATCGAAAAATTAGACGAATTATTTCTGCTGGAGAG
ATTGTAGAAAGTTCTCAGGTTATGGTTTCCATTTCTTCAGAAGGATTTGTTGATCAGTTGTTTCAATTAGCTCCCTCCCGAAGTTTGCTTGTTGTTCATGACTCTATTCT
AACGTTAGCTTGTATTAAAGAAAAATATGACAAAGTTAAGTGTTGGCAAGGAGAAGTTATATATGTACCAGAAAAATGGGGACCTTTCGATGCTGTATTTCTCTATTATC
TTCCGGCCATGCCATTCGAACTCGACGCAATTTTTGGAGCACTTTCAGAACGTTGCGTGGCAGGTGCAAGACTAGTTATTAGCCATCCTGACGGAAGGAAAGCATTAGAG
CAAGAACGACAACAGTTCCCAGATGTCGTAGTTTCGGATTTACCTGATAGGATGACTTTGCAAAAGGCTGCTGCAGATCACTCGTTTGACTTGACTGAATTTATAGATGA
GCATGGCTTTTATCTAGCAATTTTAAAGTTCAATAAGGACAGAAGTTAA
mRNA sequenceShow/hide mRNA sequence
ATGAATTCTCTGTTCTTACATTCTTCTCTTGTTCCATCACTTACTTGCAATCCTCGGTCGTTTCCTCTCAAGAGGCTGAGGCCTATACATATATGTGCCCCACGTGGTTC
TCATCATAATACTCGGATATTTGCTCCTTCCATATCGTTTCCAGCCTTACATATTAGTAACTCAGCTGCCCGTTCTTCAACTCCTTCTAATGAAGGTGTAGTATCTGTAG
TCAATTTTGAAGATTTAGTTGAAAAGGACTTTTCGGTTCTCGATTCAGATGATTTTAGTTCCGTAGAAGAGCATGATCGAAAAATTAGACGAATTATTTCTGCTGGAGAG
ATTGTAGAAAGTTCTCAGGTTATGGTTTCCATTTCTTCAGAAGGATTTGTTGATCAGTTGTTTCAATTAGCTCCCTCCCGAAGTTTGCTTGTTGTTCATGACTCTATTCT
AACGTTAGCTTGTATTAAAGAAAAATATGACAAAGTTAAGTGTTGGCAAGGAGAAGTTATATATGTACCAGAAAAATGGGGACCTTTCGATGCTGTATTTCTCTATTATC
TTCCGGCCATGCCATTCGAACTCGACGCAATTTTTGGAGCACTTTCAGAACGTTGCGTGGCAGGTGCAAGACTAGTTATTAGCCATCCTGACGGAAGGAAAGCATTAGAG
CAAGAACGACAACAGTTCCCAGATGTCGTAGTTTCGGATTTACCTGATAGGATGACTTTGCAAAAGGCTGCTGCAGATCACTCGTTTGACTTGACTGAATTTATAGATGA
GCATGGCTTTTATCTAGCAATTTTAAAGTTCAATAAGGACAGAAGTTAA
Protein sequenceShow/hide protein sequence
MNSLFLHSSLVPSLTCNPRSFPLKRLRPIHICAPRGSHHNTRIFAPSISFPALHISNSAARSSTPSNEGVVSVVNFEDLVEKDFSVLDSDDFSSVEEHDRKIRRIISAGE
IVESSQVMVSISSEGFVDQLFQLAPSRSLLVVHDSILTLACIKEKYDKVKCWQGEVIYVPEKWGPFDAVFLYYLPAMPFELDAIFGALSERCVAGARLVISHPDGRKALE
QERQQFPDVVVSDLPDRMTLQKAAADHSFDLTEFIDEHGFYLAILKFNKDRS