; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10019983 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10019983
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionKynurenine formamidase
Genome locationChr04:27578676..27580039
RNA-Seq ExpressionHG10019983
SyntenyHG10019983
Gene Ontology termsNA
InterPro domainsIPR029063 - S-adenosyl-L-methionine-dependent methyltransferase


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6596247.1 hypothetical protein SDJN03_09427, partial [Cucurbita argyrosperma subsp. sororia]2.6e-12582.05Show/hide
Query:  MNSLFLHSSLGPSLTCKPRPCPLKRPIRICALRGFHHNTRISAPSILFPFKLGSALHISKPITCFSTPSNEGVVSIINFEDLVEKDFSFLDSDDLSSTEE
        MNSLFLHSSLGP L CKP PCPLK+PI + AL GF  +TRI + S LFP KL  +LHIS+ ITC STPSNEGVVS+INFEDLVEKDFSFLDSDD  S EE
Subjt:  MNSLFLHSSLGPSLTCKPRPCPLKRPIRICALRGFHHNTRISAPSILFPFKLGSALHISKPITCFSTPSNEGVVSIINFEDLVEKDFSFLDSDDLSSTEE

Query:  HDQKIRRIISAGEIAESSQVMVSISSEGFVDQLFDFAPCRSLLVVHDSILTLACIKEKYDKVKCWQGEVIYVPEKWGPFDIVFLYYLPAMPFELDTIFGA
        HDQKIRRIISAGEIAESSQVMV+ISSEGFVD+L+D APCRSLLVVHD+IL LACIKEKYDKVKCWQGEVIYVPEKWGPFD+VFLYYLPAMPFELDTIFG 
Subjt:  HDQKIRRIISAGEIAESSQVMVSISSEGFVDQLFDFAPCRSLLVVHDSILTLACIKEKYDKVKCWQGEVIYVPEKWGPFDIVFLYYLPAMPFELDTIFGA

Query:  LSKRCVPGARLVISHPNGRKALEQEQQQFPDVVVSDLPDRTILQKAAAEHSFDLTEFIDEHGFYLAILKFHKD
        L++RC+PG RLVISHP GRKAL+QEQQQF DVVVSDLPDRT LQK AA+HSF LTEF+DE GFYLA+LKF+KD
Subjt:  LSKRCVPGARLVISHPNGRKALEQEQQQFPDVVVSDLPDRTILQKAAAEHSFDLTEFIDEHGFYLAILKFHKD

XP_004136559.1 uncharacterized protein LOC101219545 [Cucumis sativus]7.8e-13086.91Show/hide
Query:  MNSLFLHSSLGPSLTCKPRPCPLKRPIRICALRGFHHNTRISAPSILFPFKLGSALHISKPITCFSTPSNEGVVSIINFEDLVEKDFSFLDSDDLSSTEE
        MNSLFLHSSL PS+TCKPR  PL+RPIRICALRG HHNTRI AP I FP     ALHIS  I C STPSNEGVVS++NFEDLVEKDFSFLDSDD SS EE
Subjt:  MNSLFLHSSLGPSLTCKPRPCPLKRPIRICALRGFHHNTRISAPSILFPFKLGSALHISKPITCFSTPSNEGVVSIINFEDLVEKDFSFLDSDDLSSTEE

Query:  HDQKIRRIISAGEIAESSQVMVSISSEGFVDQLFDFAPCRSLLVVHDSILTLACIKEKYDKVKCWQGEVIYVPEKWGPFDIVFLYYLPAMPFELDTIFGA
        H QKIRRIISAGEI ESSQVMVSISSEGFVDQLF  AP RSLLVVHDSILTLACIKEKYDKVKCWQGEVIYVPEKWGPFD VFLYYLPAMPFELD IFGA
Subjt:  HDQKIRRIISAGEIAESSQVMVSISSEGFVDQLFDFAPCRSLLVVHDSILTLACIKEKYDKVKCWQGEVIYVPEKWGPFDIVFLYYLPAMPFELDTIFGA

Query:  LSKRCVPGARLVISHPNGRKALEQEQQQFPDVVVSDLPDRTILQKAAAEHSFDLTEFIDEHGFYLAILKFHKDRS
        LS+RCV GARLVISHPNGRKALEQEQQQFPDVVVSDLPDR  LQKAAA+HSFDLTEFIDEHGFYLAILKF+KD S
Subjt:  LSKRCVPGARLVISHPNGRKALEQEQQQFPDVVVSDLPDRTILQKAAAEHSFDLTEFIDEHGFYLAILKFHKDRS

XP_008443029.1 PREDICTED: uncharacterized protein LOC103486747 [Cucumis melo]1.2e-12584.48Show/hide
Query:  MNSLFLHSSLGPSLTCKPRPCPLK--RPIRICALRGFHHNTRISAPSILFPFKLGSALHISKPITCFSTPSNEGVVSIINFEDLVEKDFSFLDSDDLSST
        MNSLFLHSSL PSLTC PR  PLK  RPI +CA RG HHNTRI APSI FP     ALHIS      STPSNEGVVS++NFEDLVEKDFSFLDSDD SS 
Subjt:  MNSLFLHSSLGPSLTCKPRPCPLK--RPIRICALRGFHHNTRISAPSILFPFKLGSALHISKPITCFSTPSNEGVVSIINFEDLVEKDFSFLDSDDLSST

Query:  EEHDQKIRRIISAGEIAESSQVMVSISSEGFVDQLFDFAPCRSLLVVHDSILTLACIKEKYDKVKCWQGEVIYVPEKWGPFDIVFLYYLPAMPFELDTIF
        EEHD+KIRRIISAGEI ESSQVMVSISSEGFVDQLF  AP RSLLVVHDSILTLACIKEKYDKVKCWQGE+IYVPEKWGPFD VFLYYLPAMPFELD IF
Subjt:  EEHDQKIRRIISAGEIAESSQVMVSISSEGFVDQLFDFAPCRSLLVVHDSILTLACIKEKYDKVKCWQGEVIYVPEKWGPFDIVFLYYLPAMPFELDTIF

Query:  GALSKRCVPGARLVISHPNGRKALEQEQQQFPDVVVSDLPDRTILQKAAAEHSFDLTEFIDEHGFYLAILKFHKDRS
        GALS+RCV GARLVISHP+GRKALEQE+QQFPDVVVSDLPDR  LQKAAA+HSFDLTEFIDEHGFYLAILKF+KDRS
Subjt:  GALSKRCVPGARLVISHPNGRKALEQEQQQFPDVVVSDLPDRTILQKAAAEHSFDLTEFIDEHGFYLAILKFHKDRS

XP_022151424.1 uncharacterized protein LOC111019361 [Momordica charantia]1.7e-12481.16Show/hide
Query:  MNSLFLHSSLGPSLTCKPRPCPLKRPIRICALRGFHHNTRISAPSI-LFPFKLGSALHISKPITCFSTPSNEGVVSIINFEDLVEKDFSFLDSDDLSSTE
        MNSL LHSSLGPSLT KP PC LKRP+ IC L G H + RI + S+ L+P     +LH+SKP+TC STPSNEGVVS+INFEDLVEKDFSFLDSDD SS+E
Subjt:  MNSLFLHSSLGPSLTCKPRPCPLKRPIRICALRGFHHNTRISAPSI-LFPFKLGSALHISKPITCFSTPSNEGVVSIINFEDLVEKDFSFLDSDDLSSTE

Query:  EHDQKIRRIISAGEIAESSQVMVSISSEGFVDQLFDFAPCRSLLVVHDSILTLACIKEKYDKVKCWQGEVIYVPEKWGPFDIVFLYYLPAMPFELDTIFG
        E+DQKIRRIISAGE+AESSQVMVSI SEGFVDQLFD APCRSLLV+HDSILTLACIKEKYDKVKCWQGEVIYVPEKWGPFD+VFLYYLPAMPFELD IFG
Subjt:  EHDQKIRRIISAGEIAESSQVMVSISSEGFVDQLFDFAPCRSLLVVHDSILTLACIKEKYDKVKCWQGEVIYVPEKWGPFDIVFLYYLPAMPFELDTIFG

Query:  ALSKRCVPGARLVISHPNGRKALEQEQQQFPDVVVSDLPDRTILQKAAAEHSFDLTEFIDEHGFYLAILKFHKDRS
         LSKRCVPGARLVISHPNG+ ALE+EQQQFPDVVVS LPD+  LQK AA+HS DLTEF+D++GFYLA+LKFHKDRS
Subjt:  ALSKRCVPGARLVISHPNGRKALEQEQQQFPDVVVSDLPDRTILQKAAAEHSFDLTEFIDEHGFYLAILKFHKDRS

XP_038905691.1 uncharacterized protein LOC120091662 [Benincasa hispida]2.6e-13386.55Show/hide
Query:  MNSLFLHSSLGPSLTCKPRPCPLKRPIRICALRGFHHNTRISAPSILFPFKLGSALHISKPITCFSTPSNEGVVSIINFEDLVEKDFSFLDSDDLSSTEE
        MNSLFLHSSLGPS TCKP+PC LKR I IC LRG HHNTRI AP +LFPFKL  A+HI+K ITCFSTP +EGVVS+INFEDLVEKDFSFLDSD+ SSTEE
Subjt:  MNSLFLHSSLGPSLTCKPRPCPLKRPIRICALRGFHHNTRISAPSILFPFKLGSALHISKPITCFSTPSNEGVVSIINFEDLVEKDFSFLDSDDLSSTEE

Query:  HDQKIRRIISAGEIAESSQVMVSISSEGFVDQLFDFAPCRSLLVVHDSILTLACIKEKYDKVKCWQGEVIYVPEKWGPFDIVFLYYLPAMPFELDTIFGA
        HDQKIRRIISAGEI ESSQVMVSISSEGFVDQLFD APCRSLLVVHDSILTLACIKEKYDKVKCWQGEVIYVPEKWGPFD+VFLYYLPAMPFELD IF A
Subjt:  HDQKIRRIISAGEIAESSQVMVSISSEGFVDQLFDFAPCRSLLVVHDSILTLACIKEKYDKVKCWQGEVIYVPEKWGPFDIVFLYYLPAMPFELDTIFGA

Query:  LSKRCVPGARLVISHPNGRKALEQEQQQFPDVVVSDLPDRTILQKAAAEHSFDLTEFIDEHGFYLAILKFHKDRS
        LSKRCV GARLVISHPNGRK LEQEQQQFPDVVVSDLP+R  L+ AAA+HSF+LTEFIDE+ FYLA+LKFHKDRS
Subjt:  LSKRCVPGARLVISHPNGRKALEQEQQQFPDVVVSDLPDRTILQKAAAEHSFDLTEFIDEHGFYLAILKFHKDRS

TrEMBL top hitse value%identityAlignment
A0A1S3B7V6 uncharacterized protein LOC1034867475.7e-12684.48Show/hide
Query:  MNSLFLHSSLGPSLTCKPRPCPLK--RPIRICALRGFHHNTRISAPSILFPFKLGSALHISKPITCFSTPSNEGVVSIINFEDLVEKDFSFLDSDDLSST
        MNSLFLHSSL PSLTC PR  PLK  RPI +CA RG HHNTRI APSI FP     ALHIS      STPSNEGVVS++NFEDLVEKDFSFLDSDD SS 
Subjt:  MNSLFLHSSLGPSLTCKPRPCPLK--RPIRICALRGFHHNTRISAPSILFPFKLGSALHISKPITCFSTPSNEGVVSIINFEDLVEKDFSFLDSDDLSST

Query:  EEHDQKIRRIISAGEIAESSQVMVSISSEGFVDQLFDFAPCRSLLVVHDSILTLACIKEKYDKVKCWQGEVIYVPEKWGPFDIVFLYYLPAMPFELDTIF
        EEHD+KIRRIISAGEI ESSQVMVSISSEGFVDQLF  AP RSLLVVHDSILTLACIKEKYDKVKCWQGE+IYVPEKWGPFD VFLYYLPAMPFELD IF
Subjt:  EEHDQKIRRIISAGEIAESSQVMVSISSEGFVDQLFDFAPCRSLLVVHDSILTLACIKEKYDKVKCWQGEVIYVPEKWGPFDIVFLYYLPAMPFELDTIF

Query:  GALSKRCVPGARLVISHPNGRKALEQEQQQFPDVVVSDLPDRTILQKAAAEHSFDLTEFIDEHGFYLAILKFHKDRS
        GALS+RCV GARLVISHP+GRKALEQE+QQFPDVVVSDLPDR  LQKAAA+HSFDLTEFIDEHGFYLAILKF+KDRS
Subjt:  GALSKRCVPGARLVISHPNGRKALEQEQQQFPDVVVSDLPDRTILQKAAAEHSFDLTEFIDEHGFYLAILKFHKDRS

A0A5D3DP12 Uncharacterized protein5.7e-12684.48Show/hide
Query:  MNSLFLHSSLGPSLTCKPRPCPLK--RPIRICALRGFHHNTRISAPSILFPFKLGSALHISKPITCFSTPSNEGVVSIINFEDLVEKDFSFLDSDDLSST
        MNSLFLHSSL PSLTC PR  PLK  RPI +CA RG HHNTRI APSI FP     ALHIS      STPSNEGVVS++NFEDLVEKDFSFLDSDD SS 
Subjt:  MNSLFLHSSLGPSLTCKPRPCPLK--RPIRICALRGFHHNTRISAPSILFPFKLGSALHISKPITCFSTPSNEGVVSIINFEDLVEKDFSFLDSDDLSST

Query:  EEHDQKIRRIISAGEIAESSQVMVSISSEGFVDQLFDFAPCRSLLVVHDSILTLACIKEKYDKVKCWQGEVIYVPEKWGPFDIVFLYYLPAMPFELDTIF
        EEHD+KIRRIISAGEI ESSQVMVSISSEGFVDQLF  AP RSLLVVHDSILTLACIKEKYDKVKCWQGE+IYVPEKWGPFD VFLYYLPAMPFELD IF
Subjt:  EEHDQKIRRIISAGEIAESSQVMVSISSEGFVDQLFDFAPCRSLLVVHDSILTLACIKEKYDKVKCWQGEVIYVPEKWGPFDIVFLYYLPAMPFELDTIF

Query:  GALSKRCVPGARLVISHPNGRKALEQEQQQFPDVVVSDLPDRTILQKAAAEHSFDLTEFIDEHGFYLAILKFHKDRS
        GALS+RCV GARLVISHP+GRKALEQE+QQFPDVVVSDLPDR  LQKAAA+HSFDLTEFIDEHGFYLAILKF+KDRS
Subjt:  GALSKRCVPGARLVISHPNGRKALEQEQQQFPDVVVSDLPDRTILQKAAAEHSFDLTEFIDEHGFYLAILKFHKDRS

A0A6J1DD17 uncharacterized protein LOC1110193618.2e-12581.16Show/hide
Query:  MNSLFLHSSLGPSLTCKPRPCPLKRPIRICALRGFHHNTRISAPSI-LFPFKLGSALHISKPITCFSTPSNEGVVSIINFEDLVEKDFSFLDSDDLSSTE
        MNSL LHSSLGPSLT KP PC LKRP+ IC L G H + RI + S+ L+P     +LH+SKP+TC STPSNEGVVS+INFEDLVEKDFSFLDSDD SS+E
Subjt:  MNSLFLHSSLGPSLTCKPRPCPLKRPIRICALRGFHHNTRISAPSI-LFPFKLGSALHISKPITCFSTPSNEGVVSIINFEDLVEKDFSFLDSDDLSSTE

Query:  EHDQKIRRIISAGEIAESSQVMVSISSEGFVDQLFDFAPCRSLLVVHDSILTLACIKEKYDKVKCWQGEVIYVPEKWGPFDIVFLYYLPAMPFELDTIFG
        E+DQKIRRIISAGE+AESSQVMVSI SEGFVDQLFD APCRSLLV+HDSILTLACIKEKYDKVKCWQGEVIYVPEKWGPFD+VFLYYLPAMPFELD IFG
Subjt:  EHDQKIRRIISAGEIAESSQVMVSISSEGFVDQLFDFAPCRSLLVVHDSILTLACIKEKYDKVKCWQGEVIYVPEKWGPFDIVFLYYLPAMPFELDTIFG

Query:  ALSKRCVPGARLVISHPNGRKALEQEQQQFPDVVVSDLPDRTILQKAAAEHSFDLTEFIDEHGFYLAILKFHKDRS
         LSKRCVPGARLVISHPNG+ ALE+EQQQFPDVVVS LPD+  LQK AA+HS DLTEF+D++GFYLA+LKFHKDRS
Subjt:  ALSKRCVPGARLVISHPNGRKALEQEQQQFPDVVVSDLPDRTILQKAAAEHSFDLTEFIDEHGFYLAILKFHKDRS

A0A6J1FPV5 uncharacterized protein LOC1114458891.8e-12481.32Show/hide
Query:  MNSLFLHSSLGPSLTCKPRPCPLKRPIRICALRGFHHNTRISAPSILFPFKLGSALHISKPITCFSTPSNEGVVSIINFEDLVEKDFSFLDSDDLSSTEE
        MNSLFLHSSLGP L CKP PCPLK PI + AL GF  +TRI + S LFP KL  +LHIS+ ITC STPSNEGVVS+INFEDLVEKDFSFLDSDD  S EE
Subjt:  MNSLFLHSSLGPSLTCKPRPCPLKRPIRICALRGFHHNTRISAPSILFPFKLGSALHISKPITCFSTPSNEGVVSIINFEDLVEKDFSFLDSDDLSSTEE

Query:  HDQKIRRIISAGEIAESSQVMVSISSEGFVDQLFDFAPCRSLLVVHDSILTLACIKEKYDKVKCWQGEVIYVPEKWGPFDIVFLYYLPAMPFELDTIFGA
        HD+KIRRIISAGEIAESSQVMV+ISSEGFVD+L+D APCRSLLVVHD+IL LACIKEKYDKVKCWQGEVIYVPEKWGPFD+VFLYYLPAMPFELD IFGA
Subjt:  HDQKIRRIISAGEIAESSQVMVSISSEGFVDQLFDFAPCRSLLVVHDSILTLACIKEKYDKVKCWQGEVIYVPEKWGPFDIVFLYYLPAMPFELDTIFGA

Query:  LSKRCVPGARLVISHPNGRKALEQEQQQFPDVVVSDLPDRTILQKAAAEHSFDLTEFIDEHGFYLAILKFHKD
        L++RC+PG RLVISHP GRK L+QEQQQF DVVVSDLPDRT LQK AA+HSF LTEF+DE GFYLA+LKF+KD
Subjt:  LSKRCVPGARLVISHPNGRKALEQEQQQFPDVVVSDLPDRTILQKAAAEHSFDLTEFIDEHGFYLAILKFHKD

A0A6J1I6B9 uncharacterized protein LOC1114700362.4e-12481.68Show/hide
Query:  MNSLFLHSSLGPSLTCKPRPCPLKRPIRICALRGFHHNTRISAPSILFPFKLGSALHISKPITCFSTPSNEGVVSIINFEDLVEKDFSFLDSDDLSSTEE
        MNSLFLHSSLGP L CKP PCPLKRPI + AL GF  +TRI + S L P KL  +LHIS+ ITC STPSNEGVVS+INFEDLVEKDFSFLDSDD  S EE
Subjt:  MNSLFLHSSLGPSLTCKPRPCPLKRPIRICALRGFHHNTRISAPSILFPFKLGSALHISKPITCFSTPSNEGVVSIINFEDLVEKDFSFLDSDDLSSTEE

Query:  HDQKIRRIISAGEIAESSQVMVSISSEGFVDQLFDFAPCRSLLVVHDSILTLACIKEKYDKVKCWQGEVIYVPEKWGPFDIVFLYYLPAMPFELDTIFGA
        HD+KIRRIISAGEIAESSQVMV+ISSEGFVD+L+D APCRSLLVVHD+IL LACIKEKYDKVKCWQGEVIYVPEKWGPFD+VFLYYLPAMPFELD IFGA
Subjt:  HDQKIRRIISAGEIAESSQVMVSISSEGFVDQLFDFAPCRSLLVVHDSILTLACIKEKYDKVKCWQGEVIYVPEKWGPFDIVFLYYLPAMPFELDTIFGA

Query:  LSKRCVPGARLVISHPNGRKALEQEQQQFPDVVVSDLPDRTILQKAAAEHSFDLTEFIDEHGFYLAILKFHKD
        L++RC+PG RLVISHP GRKAL+QEQQQF DVVVSDLPDRT LQK AA+HSF LTEF+DE GFYLA+LKF+KD
Subjt:  LSKRCVPGARLVISHPNGRKALEQEQQQFPDVVVSDLPDRTILQKAAAEHSFDLTEFIDEHGFYLAILKFHKD

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G41950.1 unknown protein1.0e-7162.86Show/hide
Query:  CFSTPS-NEGVVSIINFEDLVEKDFSFLDSDDLSSTEEHDQKIRRIISAGEIAESSQVMVSISSEGFVDQLFDFAPCRSLLVVHDSILTLACIKEKYDKV
        C S+ S  EG VS+++F    EKD+SFL+S ++ ST EH QKI RII AGE++ESS+V+VSISSE FVD+L + +P + LL+VHDS+ TLACIKEKYDKV
Subjt:  CFSTPS-NEGVVSIINFEDLVEKDFSFLDSDDLSSTEEHDQKIRRIISAGEIAESSQVMVSISSEGFVDQLFDFAPCRSLLVVHDSILTLACIKEKYDKV

Query:  KCWQGEVIYVPEKWGPFDIVFLYYLPAMPFELDTIFGALSKRCVPGARLVISHPNGRKALEQEQQQFPDVVVSDLPDRTILQKAAAEHSFDLTEFIDEHG
        KCWQGE+IYVPEKW P D VFLY+LPA+PF+LD +F  LS+RC  GAR+VISHP GR  LEQ++++F DVVVSDLPD + L   A +HSF+LT+F+DE G
Subjt:  KCWQGEVIYVPEKWGPFDIVFLYYLPAMPFELDTIFGALSKRCVPGARLVISHPNGRKALEQEQQQFPDVVVSDLPDRTILQKAAAEHSFDLTEFIDEHG

Query:  FYLAILKFHK
         YLA+LK  K
Subjt:  FYLAILKFHK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAATTCTCTGTTTTTACATTCTTCTCTTGGTCCATCACTTACTTGCAAACCACGGCCATGTCCTCTTAAGAGGCCCATACGTATATGTGCTCTACGTGGCTTTCATCA
TAATACTCGGATATCTGCTCCTTCGATATTATTTCCTTTCAAATTGGGTTCAGCCTTGCATATTAGCAAACCAATCACCTGTTTTTCAACTCCTTCAAATGAAGGTGTAG
TATCTATAATCAATTTTGAAGATTTAGTTGAGAAGGACTTTTCGTTTCTCGATTCAGACGATTTAAGTTCCACAGAAGAGCATGATCAAAAGATTAGGCGGATTATTTCT
GCTGGAGAGATTGCAGAAAGTTCTCAGGTTATGGTTTCCATTTCTTCAGAAGGATTTGTTGATCAGTTGTTTGACTTCGCTCCTTGCCGAAGTTTGCTTGTTGTCCATGA
TTCTATTCTAACATTAGCTTGTATTAAAGAAAAGTATGACAAAGTTAAGTGTTGGCAAGGAGAAGTTATATATGTACCAGAAAAATGGGGACCTTTCGACATTGTATTTC
TCTATTATCTTCCAGCTATGCCTTTCGAACTTGACACAATTTTTGGAGCACTCTCAAAACGTTGTGTACCAGGTGCAAGACTAGTTATTAGCCATCCGAACGGAAGGAAA
GCATTAGAGCAAGAACAACAACAGTTCCCAGATGTTGTAGTTTCGGATTTACCTGATAGGACGATTTTGCAGAAAGCTGCTGCAGAGCACTCTTTTGACTTGACTGAATT
TATAGATGAGCATGGCTTTTATCTTGCAATTTTGAAGTTCCACAAGGATAGAAGTTAA
mRNA sequenceShow/hide mRNA sequence
ATGAATTCTCTGTTTTTACATTCTTCTCTTGGTCCATCACTTACTTGCAAACCACGGCCATGTCCTCTTAAGAGGCCCATACGTATATGTGCTCTACGTGGCTTTCATCA
TAATACTCGGATATCTGCTCCTTCGATATTATTTCCTTTCAAATTGGGTTCAGCCTTGCATATTAGCAAACCAATCACCTGTTTTTCAACTCCTTCAAATGAAGGTGTAG
TATCTATAATCAATTTTGAAGATTTAGTTGAGAAGGACTTTTCGTTTCTCGATTCAGACGATTTAAGTTCCACAGAAGAGCATGATCAAAAGATTAGGCGGATTATTTCT
GCTGGAGAGATTGCAGAAAGTTCTCAGGTTATGGTTTCCATTTCTTCAGAAGGATTTGTTGATCAGTTGTTTGACTTCGCTCCTTGCCGAAGTTTGCTTGTTGTCCATGA
TTCTATTCTAACATTAGCTTGTATTAAAGAAAAGTATGACAAAGTTAAGTGTTGGCAAGGAGAAGTTATATATGTACCAGAAAAATGGGGACCTTTCGACATTGTATTTC
TCTATTATCTTCCAGCTATGCCTTTCGAACTTGACACAATTTTTGGAGCACTCTCAAAACGTTGTGTACCAGGTGCAAGACTAGTTATTAGCCATCCGAACGGAAGGAAA
GCATTAGAGCAAGAACAACAACAGTTCCCAGATGTTGTAGTTTCGGATTTACCTGATAGGACGATTTTGCAGAAAGCTGCTGCAGAGCACTCTTTTGACTTGACTGAATT
TATAGATGAGCATGGCTTTTATCTTGCAATTTTGAAGTTCCACAAGGATAGAAGTTAA
Protein sequenceShow/hide protein sequence
MNSLFLHSSLGPSLTCKPRPCPLKRPIRICALRGFHHNTRISAPSILFPFKLGSALHISKPITCFSTPSNEGVVSIINFEDLVEKDFSFLDSDDLSSTEEHDQKIRRIIS
AGEIAESSQVMVSISSEGFVDQLFDFAPCRSLLVVHDSILTLACIKEKYDKVKCWQGEVIYVPEKWGPFDIVFLYYLPAMPFELDTIFGALSKRCVPGARLVISHPNGRK
ALEQEQQQFPDVVVSDLPDRTILQKAAAEHSFDLTEFIDEHGFYLAILKFHKDRS