; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CaUC10G195280 (gene) of Watermelon (USVL246-FR2) v1 genome

Gene IDCaUC10G195280
OrganismCitrullus amarus (Watermelon (USVL246-FR2) v1)
DescriptionKynurenine formamidase
Genome locationCiama_Chr10:30564270..30565678
RNA-Seq ExpressionCaUC10G195280
SyntenyCaUC10G195280
Gene Ontology termsNA
InterPro domainsIPR029063 - S-adenosyl-L-methionine-dependent methyltransferase


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6596247.1 hypothetical protein SDJN03_09427, partial [Cucurbita argyrosperma subsp. sororia]1.5e-12080.59Show/hide
Query:  MNSLFLHSSLSPSLTCKPRPRPLKRPIHICALRGSHHNTRIFAPSILFPFKLCPAFHTSRPIACFSTPSNEGVVSVINFEDLVEKDFSFLDSHDLISTEE
        MNSLFLHSSL P L CKP P PLK+PIH+ AL G   +TRIF+ S LFP KLCP+ H SR I C STPSNEGVVSVINFEDLVEKDFSFLDS D  S EE
Subjt:  MNSLFLHSSLSPSLTCKPRPRPLKRPIHICALRGSHHNTRIFAPSILFPFKLCPAFHTSRPIACFSTPSNEGVVSVINFEDLVEKDFSFLDSHDLISTEE

Query:  HDQKIRRIISAGEIAESSQVMVSISSEGFVDQLFDLAPCTSLLVVHDSILTLACIKEKYDKVKCWQGEVIYVPEKWGPFDVVFLYYLPAMPFELDAIVGA
        HDQKIRRIISAGEIAESSQVMV+ISSEGFVD+L+D APC SLLVVHD+IL LACIKEKYDKVKCWQGEVIYVPEKWGPFDVVFLYYLPAMPFELD I G 
Subjt:  HDQKIRRIISAGEIAESSQVMVSISSEGFVDQLFDLAPCTSLLVVHDSILTLACIKEKYDKVKCWQGEVIYVPEKWGPFDVVFLYYLPAMPFELDAIVGA

Query:  LSKRCVPGGRLVISHPNGRKALEQEQQQFPDVVVSDLPDRMTLQKAAADHSFDLIEFIDVHSFYLAILKFHKD
        L++RC+PGGRLVISHP GRKAL+QEQQQF DVVVSDLPDR TLQK AADHSF L EF+D   FYLA+LKF+KD
Subjt:  LSKRCVPGGRLVISHPNGRKALEQEQQQFPDVVVSDLPDRMTLQKAAADHSFDLIEFIDVHSFYLAILKFHKD

XP_004136559.1 uncharacterized protein LOC101219545 [Cucumis sativus]3.6e-12786.18Show/hide
Query:  MNSLFLHSSLSPSLTCKPRPRPLKRPIHICALRGSHHNTRIFAPSILFPFKLCPAFHTSRPIACFSTPSNEGVVSVINFEDLVEKDFSFLDSHDLISTEE
        MNSLFLHSSL PS+TCKPR  PL+RPI ICALRGSHHNTRIFAP I F     PA H S  IAC STPSNEGVVSV+NFEDLVEKDFSFLDS D  S EE
Subjt:  MNSLFLHSSLSPSLTCKPRPRPLKRPIHICALRGSHHNTRIFAPSILFPFKLCPAFHTSRPIACFSTPSNEGVVSVINFEDLVEKDFSFLDSHDLISTEE

Query:  HDQKIRRIISAGEIAESSQVMVSISSEGFVDQLFDLAPCTSLLVVHDSILTLACIKEKYDKVKCWQGEVIYVPEKWGPFDVVFLYYLPAMPFELDAIVGA
        H QKIRRIISAGEI ESSQVMVSISSEGFVDQLF LAP  SLLVVHDSILTLACIKEKYDKVKCWQGEVIYVPEKWGPFD VFLYYLPAMPFELDAI GA
Subjt:  HDQKIRRIISAGEIAESSQVMVSISSEGFVDQLFDLAPCTSLLVVHDSILTLACIKEKYDKVKCWQGEVIYVPEKWGPFDVVFLYYLPAMPFELDAIVGA

Query:  LSKRCVPGGRLVISHPNGRKALEQEQQQFPDVVVSDLPDRMTLQKAAADHSFDLIEFIDVHSFYLAILKFHKDRS
        LS+RCV G RLVISHPNGRKALEQEQQQFPDVVVSDLPDRMTLQKAAADHSFDL EFID H FYLAILKF+KD S
Subjt:  LSKRCVPGGRLVISHPNGRKALEQEQQQFPDVVVSDLPDRMTLQKAAADHSFDLIEFIDVHSFYLAILKFHKDRS

XP_008443029.1 PREDICTED: uncharacterized protein LOC103486747 [Cucumis melo]1.7e-12484.48Show/hide
Query:  MNSLFLHSSLSPSLTCKPRPRPLK--RPIHICALRGSHHNTRIFAPSILFPFKLCPAFHTSRPIACFSTPSNEGVVSVINFEDLVEKDFSFLDSHDLIST
        MNSLFLHSSL PSLTC PR  PLK  RPIH+CA RGSHHNTRIFAPSI F     PA H S   A  STPSNEGVVSV+NFEDLVEKDFSFLDS D  S 
Subjt:  MNSLFLHSSLSPSLTCKPRPRPLK--RPIHICALRGSHHNTRIFAPSILFPFKLCPAFHTSRPIACFSTPSNEGVVSVINFEDLVEKDFSFLDSHDLIST

Query:  EEHDQKIRRIISAGEIAESSQVMVSISSEGFVDQLFDLAPCTSLLVVHDSILTLACIKEKYDKVKCWQGEVIYVPEKWGPFDVVFLYYLPAMPFELDAIV
        EEHD+KIRRIISAGEI ESSQVMVSISSEGFVDQLF LAP  SLLVVHDSILTLACIKEKYDKVKCWQGE+IYVPEKWGPFD VFLYYLPAMPFELDAI 
Subjt:  EEHDQKIRRIISAGEIAESSQVMVSISSEGFVDQLFDLAPCTSLLVVHDSILTLACIKEKYDKVKCWQGEVIYVPEKWGPFDVVFLYYLPAMPFELDAIV

Query:  GALSKRCVPGGRLVISHPNGRKALEQEQQQFPDVVVSDLPDRMTLQKAAADHSFDLIEFIDVHSFYLAILKFHKDRS
        GALS+RCV G RLVISHP+GRKALEQE+QQFPDVVVSDLPDRMTLQKAAADHSFDL EFID H FYLAILKF+KDRS
Subjt:  GALSKRCVPGGRLVISHPNGRKALEQEQQQFPDVVVSDLPDRMTLQKAAADHSFDLIEFIDVHSFYLAILKFHKDRS

XP_022940190.1 uncharacterized protein LOC111445889 [Cucurbita moschata]1.9e-12080.59Show/hide
Query:  MNSLFLHSSLSPSLTCKPRPRPLKRPIHICALRGSHHNTRIFAPSILFPFKLCPAFHTSRPIACFSTPSNEGVVSVINFEDLVEKDFSFLDSHDLISTEE
        MNSLFLHSSL P L CKP P PLK PIH+ AL G   +TRIF+ S LFP KLCP+ H SR I C STPSNEGVVSVINFEDLVEKDFSFLDS D  S EE
Subjt:  MNSLFLHSSLSPSLTCKPRPRPLKRPIHICALRGSHHNTRIFAPSILFPFKLCPAFHTSRPIACFSTPSNEGVVSVINFEDLVEKDFSFLDSHDLISTEE

Query:  HDQKIRRIISAGEIAESSQVMVSISSEGFVDQLFDLAPCTSLLVVHDSILTLACIKEKYDKVKCWQGEVIYVPEKWGPFDVVFLYYLPAMPFELDAIVGA
        HD+KIRRIISAGEIAESSQVMV+ISSEGFVD+L+D APC SLLVVHD+IL LACIKEKYDKVKCWQGEVIYVPEKWGPFDVVFLYYLPAMPFELDAI GA
Subjt:  HDQKIRRIISAGEIAESSQVMVSISSEGFVDQLFDLAPCTSLLVVHDSILTLACIKEKYDKVKCWQGEVIYVPEKWGPFDVVFLYYLPAMPFELDAIVGA

Query:  LSKRCVPGGRLVISHPNGRKALEQEQQQFPDVVVSDLPDRMTLQKAAADHSFDLIEFIDVHSFYLAILKFHKD
        L++RC+PGGRLVISHP GRK L+QEQQQF DVVVSDLPDR TLQK AADHSF L EF+D   FYLA+LKF+KD
Subjt:  LSKRCVPGGRLVISHPNGRKALEQEQQQFPDVVVSDLPDRMTLQKAAADHSFDLIEFIDVHSFYLAILKFHKD

XP_038905691.1 uncharacterized protein LOC120091662 [Benincasa hispida]2.1e-13085.82Show/hide
Query:  MNSLFLHSSLSPSLTCKPRPRPLKRPIHICALRGSHHNTRIFAPSILFPFKLCPAFHTSRPIACFSTPSNEGVVSVINFEDLVEKDFSFLDSHDLISTEE
        MNSLFLHSSL PS TCKP+P  LKR IHIC LRGSHHNTRIFAP +LFPFKL PA H ++ I CFSTP +EGVVSVINFEDLVEKDFSFLDS +  STEE
Subjt:  MNSLFLHSSLSPSLTCKPRPRPLKRPIHICALRGSHHNTRIFAPSILFPFKLCPAFHTSRPIACFSTPSNEGVVSVINFEDLVEKDFSFLDSHDLISTEE

Query:  HDQKIRRIISAGEIAESSQVMVSISSEGFVDQLFDLAPCTSLLVVHDSILTLACIKEKYDKVKCWQGEVIYVPEKWGPFDVVFLYYLPAMPFELDAIVGA
        HDQKIRRIISAGEI ESSQVMVSISSEGFVDQLFDLAPC SLLVVHDSILTLACIKEKYDKVKCWQGEVIYVPEKWGPFDVVFLYYLPAMPFELDAI  A
Subjt:  HDQKIRRIISAGEIAESSQVMVSISSEGFVDQLFDLAPCTSLLVVHDSILTLACIKEKYDKVKCWQGEVIYVPEKWGPFDVVFLYYLPAMPFELDAIVGA

Query:  LSKRCVPGGRLVISHPNGRKALEQEQQQFPDVVVSDLPDRMTLQKAAADHSFDLIEFIDVHSFYLAILKFHKDRS
        LSKRCV G RLVISHPNGRK LEQEQQQFPDVVVSDLP+RM L+ AAADHSF+L EFID ++FYLA+LKFHKDRS
Subjt:  LSKRCVPGGRLVISHPNGRKALEQEQQQFPDVVVSDLPDRMTLQKAAADHSFDLIEFIDVHSFYLAILKFHKDRS

TrEMBL top hitse value%identityAlignment
A0A1S3B7V6 uncharacterized protein LOC1034867478.2e-12584.48Show/hide
Query:  MNSLFLHSSLSPSLTCKPRPRPLK--RPIHICALRGSHHNTRIFAPSILFPFKLCPAFHTSRPIACFSTPSNEGVVSVINFEDLVEKDFSFLDSHDLIST
        MNSLFLHSSL PSLTC PR  PLK  RPIH+CA RGSHHNTRIFAPSI F     PA H S   A  STPSNEGVVSV+NFEDLVEKDFSFLDS D  S 
Subjt:  MNSLFLHSSLSPSLTCKPRPRPLK--RPIHICALRGSHHNTRIFAPSILFPFKLCPAFHTSRPIACFSTPSNEGVVSVINFEDLVEKDFSFLDSHDLIST

Query:  EEHDQKIRRIISAGEIAESSQVMVSISSEGFVDQLFDLAPCTSLLVVHDSILTLACIKEKYDKVKCWQGEVIYVPEKWGPFDVVFLYYLPAMPFELDAIV
        EEHD+KIRRIISAGEI ESSQVMVSISSEGFVDQLF LAP  SLLVVHDSILTLACIKEKYDKVKCWQGE+IYVPEKWGPFD VFLYYLPAMPFELDAI 
Subjt:  EEHDQKIRRIISAGEIAESSQVMVSISSEGFVDQLFDLAPCTSLLVVHDSILTLACIKEKYDKVKCWQGEVIYVPEKWGPFDVVFLYYLPAMPFELDAIV

Query:  GALSKRCVPGGRLVISHPNGRKALEQEQQQFPDVVVSDLPDRMTLQKAAADHSFDLIEFIDVHSFYLAILKFHKDRS
        GALS+RCV G RLVISHP+GRKALEQE+QQFPDVVVSDLPDRMTLQKAAADHSFDL EFID H FYLAILKF+KDRS
Subjt:  GALSKRCVPGGRLVISHPNGRKALEQEQQQFPDVVVSDLPDRMTLQKAAADHSFDLIEFIDVHSFYLAILKFHKDRS

A0A5D3DP12 Uncharacterized protein8.2e-12584.48Show/hide
Query:  MNSLFLHSSLSPSLTCKPRPRPLK--RPIHICALRGSHHNTRIFAPSILFPFKLCPAFHTSRPIACFSTPSNEGVVSVINFEDLVEKDFSFLDSHDLIST
        MNSLFLHSSL PSLTC PR  PLK  RPIH+CA RGSHHNTRIFAPSI F     PA H S   A  STPSNEGVVSV+NFEDLVEKDFSFLDS D  S 
Subjt:  MNSLFLHSSLSPSLTCKPRPRPLK--RPIHICALRGSHHNTRIFAPSILFPFKLCPAFHTSRPIACFSTPSNEGVVSVINFEDLVEKDFSFLDSHDLIST

Query:  EEHDQKIRRIISAGEIAESSQVMVSISSEGFVDQLFDLAPCTSLLVVHDSILTLACIKEKYDKVKCWQGEVIYVPEKWGPFDVVFLYYLPAMPFELDAIV
        EEHD+KIRRIISAGEI ESSQVMVSISSEGFVDQLF LAP  SLLVVHDSILTLACIKEKYDKVKCWQGE+IYVPEKWGPFD VFLYYLPAMPFELDAI 
Subjt:  EEHDQKIRRIISAGEIAESSQVMVSISSEGFVDQLFDLAPCTSLLVVHDSILTLACIKEKYDKVKCWQGEVIYVPEKWGPFDVVFLYYLPAMPFELDAIV

Query:  GALSKRCVPGGRLVISHPNGRKALEQEQQQFPDVVVSDLPDRMTLQKAAADHSFDLIEFIDVHSFYLAILKFHKDRS
        GALS+RCV G RLVISHP+GRKALEQE+QQFPDVVVSDLPDRMTLQKAAADHSFDL EFID H FYLAILKF+KDRS
Subjt:  GALSKRCVPGGRLVISHPNGRKALEQEQQQFPDVVVSDLPDRMTLQKAAADHSFDLIEFIDVHSFYLAILKFHKDRS

A0A6J1DD17 uncharacterized protein LOC1110193612.7e-12080Show/hide
Query:  MNSLFLHSSLSPSLTCKPRPRPLKRPIHICALRGSHHNTRIFAPSILFPFKLCPAFHTSRPIACFSTPSNEGVVSVINFEDLVEKDFSFLDSHDLISTEE
        MNSL LHSSL PSLT KP P  LKRP+ IC L GSH + RIF+ S+     L P+ H S+P+ C STPSNEGVVSVINFEDLVEKDFSFLDS D  S+EE
Subjt:  MNSLFLHSSLSPSLTCKPRPRPLKRPIHICALRGSHHNTRIFAPSILFPFKLCPAFHTSRPIACFSTPSNEGVVSVINFEDLVEKDFSFLDSHDLISTEE

Query:  HDQKIRRIISAGEIAESSQVMVSISSEGFVDQLFDLAPCTSLLVVHDSILTLACIKEKYDKVKCWQGEVIYVPEKWGPFDVVFLYYLPAMPFELDAIVGA
        +DQKIRRIISAGE+AESSQVMVSI SEGFVDQLFD APC SLLV+HDSILTLACIKEKYDKVKCWQGEVIYVPEKWGPFDVVFLYYLPAMPFELDAI G 
Subjt:  HDQKIRRIISAGEIAESSQVMVSISSEGFVDQLFDLAPCTSLLVVHDSILTLACIKEKYDKVKCWQGEVIYVPEKWGPFDVVFLYYLPAMPFELDAIVGA

Query:  LSKRCVPGGRLVISHPNGRKALEQEQQQFPDVVVSDLPDRMTLQKAAADHSFDLIEFIDVHSFYLAILKFHKDRS
        LSKRCVPG RLVISHPNG+ ALE+EQQQFPDVVVS LPD+MTLQK AADHS DL EF+D + FYLA+LKFHKDRS
Subjt:  LSKRCVPGGRLVISHPNGRKALEQEQQQFPDVVVSDLPDRMTLQKAAADHSFDLIEFIDVHSFYLAILKFHKDRS

A0A6J1FPV5 uncharacterized protein LOC1114458899.3e-12180.59Show/hide
Query:  MNSLFLHSSLSPSLTCKPRPRPLKRPIHICALRGSHHNTRIFAPSILFPFKLCPAFHTSRPIACFSTPSNEGVVSVINFEDLVEKDFSFLDSHDLISTEE
        MNSLFLHSSL P L CKP P PLK PIH+ AL G   +TRIF+ S LFP KLCP+ H SR I C STPSNEGVVSVINFEDLVEKDFSFLDS D  S EE
Subjt:  MNSLFLHSSLSPSLTCKPRPRPLKRPIHICALRGSHHNTRIFAPSILFPFKLCPAFHTSRPIACFSTPSNEGVVSVINFEDLVEKDFSFLDSHDLISTEE

Query:  HDQKIRRIISAGEIAESSQVMVSISSEGFVDQLFDLAPCTSLLVVHDSILTLACIKEKYDKVKCWQGEVIYVPEKWGPFDVVFLYYLPAMPFELDAIVGA
        HD+KIRRIISAGEIAESSQVMV+ISSEGFVD+L+D APC SLLVVHD+IL LACIKEKYDKVKCWQGEVIYVPEKWGPFDVVFLYYLPAMPFELDAI GA
Subjt:  HDQKIRRIISAGEIAESSQVMVSISSEGFVDQLFDLAPCTSLLVVHDSILTLACIKEKYDKVKCWQGEVIYVPEKWGPFDVVFLYYLPAMPFELDAIVGA

Query:  LSKRCVPGGRLVISHPNGRKALEQEQQQFPDVVVSDLPDRMTLQKAAADHSFDLIEFIDVHSFYLAILKFHKD
        L++RC+PGGRLVISHP GRK L+QEQQQF DVVVSDLPDR TLQK AADHSF L EF+D   FYLA+LKF+KD
Subjt:  LSKRCVPGGRLVISHPNGRKALEQEQQQFPDVVVSDLPDRMTLQKAAADHSFDLIEFIDVHSFYLAILKFHKD

A0A6J1I6B9 uncharacterized protein LOC1114700361.2e-12080.95Show/hide
Query:  MNSLFLHSSLSPSLTCKPRPRPLKRPIHICALRGSHHNTRIFAPSILFPFKLCPAFHTSRPIACFSTPSNEGVVSVINFEDLVEKDFSFLDSHDLISTEE
        MNSLFLHSSL P L CKP P PLKRPIH+ AL G   +TRIF+ S L P KLCP+ H SR I C STPSNEGVVSVINFEDLVEKDFSFLDS D  S EE
Subjt:  MNSLFLHSSLSPSLTCKPRPRPLKRPIHICALRGSHHNTRIFAPSILFPFKLCPAFHTSRPIACFSTPSNEGVVSVINFEDLVEKDFSFLDSHDLISTEE

Query:  HDQKIRRIISAGEIAESSQVMVSISSEGFVDQLFDLAPCTSLLVVHDSILTLACIKEKYDKVKCWQGEVIYVPEKWGPFDVVFLYYLPAMPFELDAIVGA
        HD+KIRRIISAGEIAESSQVMV+ISSEGFVD+L+D APC SLLVVHD+IL LACIKEKYDKVKCWQGEVIYVPEKWGPFDVVFLYYLPAMPFELDAI GA
Subjt:  HDQKIRRIISAGEIAESSQVMVSISSEGFVDQLFDLAPCTSLLVVHDSILTLACIKEKYDKVKCWQGEVIYVPEKWGPFDVVFLYYLPAMPFELDAIVGA

Query:  LSKRCVPGGRLVISHPNGRKALEQEQQQFPDVVVSDLPDRMTLQKAAADHSFDLIEFIDVHSFYLAILKFHKD
        L++RC+PGGRLVISHP GRKAL+QEQQQF DVVVSDLPDR TLQK AADHSF L EF+D   FYLA+LKF+KD
Subjt:  LSKRCVPGGRLVISHPNGRKALEQEQQQFPDVVVSDLPDRMTLQKAAADHSFDLIEFIDVHSFYLAILKFHKD

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G41950.1 unknown protein6.9e-6860.09Show/hide
Query:  RPIACFSTPSNEGVVSVINFEDLVEKDFSFLDSHDLISTEEHDQKIRRIISAGEIAESSQVMVSISSEGFVDQLFDLAPCTSLLVVHDSILTLACIKEKY
        R     S+   EG VSV++F    EKD+SFL+S ++ ST EH QKI RII AGE++ESS+V+VSISSE FVD+L + +P   LL+VHDS+ TLACIKEKY
Subjt:  RPIACFSTPSNEGVVSVINFEDLVEKDFSFLDSHDLISTEEHDQKIRRIISAGEIAESSQVMVSISSEGFVDQLFDLAPCTSLLVVHDSILTLACIKEKY

Query:  DKVKCWQGEVIYVPEKWGPFDVVFLYYLPAMPFELDAIVGALSKRCVPGGRLVISHPNGRKALEQEQQQFPDVVVSDLPDRMTLQKAAADHSFDLIEFID
        DKVKCWQGE+IYVPEKW P D VFLY+LPA+PF+LD +   LS+RC  G R+VISHP GR  LEQ++++F DVVVSDLPD  TL   A  HSF+L +F+D
Subjt:  DKVKCWQGEVIYVPEKWGPFDVVFLYYLPAMPFELDAIVGALSKRCVPGGRLVISHPNGRKALEQEQQQFPDVVVSDLPDRMTLQKAAADHSFDLIEFID

Query:  VHSFYLAILKFHK
            YLA+LK  K
Subjt:  VHSFYLAILKFHK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAATTCTCTGTTTTTACATTCTTCTCTTAGTCCATCTCTTACTTGCAAACCTCGGCCGCGTCCTCTTAAGAGGCCTATACATATATGTGCACTACGTGGCTCTCATCA
TAATACTCGGATATTTGCTCCTTCGATATTATTTCCTTTCAAATTGTGTCCAGCCTTTCATACTAGTAGACCAATCGCCTGTTTTTCAACTCCTTCAAATGAAGGTGTAG
TATCTGTAATCAATTTTGAAGATTTAGTTGAGAAGGACTTCTCTTTTCTCGATTCACACGACCTTATTTCCACAGAAGAGCATGATCAAAAGATTAGGCGGATTATTTCT
GCTGGAGAGATTGCAGAAAGTTCTCAGGTTATGGTTTCCATTTCTTCAGAAGGATTTGTTGATCAGTTGTTTGACTTAGCTCCTTGCACAAGCTTGCTTGTTGTCCATGA
TTCTATTTTAACATTAGCTTGTATTAAAGAAAAATATGACAAAGTTAAGTGTTGGCAAGGAGAAGTTATATATGTACCAGAAAAATGGGGACCTTTCGATGTTGTATTTC
TCTATTATCTTCCAGCCATGCCTTTCGAACTTGACGCAATTGTTGGAGCACTTTCAAAACGTTGTGTACCAGGTGGAAGACTAGTTATTAGCCATCCCAACGGAAGGAAA
GCATTAGAGCAAGAACAACAACAGTTCCCAGATGTCGTAGTTTCGGATTTACCTGATAGGATGACTTTGCAGAAAGCTGCTGCAGATCACTCTTTTGACTTGATTGAATT
TATAGATGTGCATAGCTTTTATCTTGCAATTTTGAAGTTCCACAAGGATAGAAGTTAA
mRNA sequenceShow/hide mRNA sequence
ATGAATTCTCTGTTTTTACATTCTTCTCTTAGTCCATCTCTTACTTGCAAACCTCGGCCGCGTCCTCTTAAGAGGCCTATACATATATGTGCACTACGTGGCTCTCATCA
TAATACTCGGATATTTGCTCCTTCGATATTATTTCCTTTCAAATTGTGTCCAGCCTTTCATACTAGTAGACCAATCGCCTGTTTTTCAACTCCTTCAAATGAAGGTGTAG
TATCTGTAATCAATTTTGAAGATTTAGTTGAGAAGGACTTCTCTTTTCTCGATTCACACGACCTTATTTCCACAGAAGAGCATGATCAAAAGATTAGGCGGATTATTTCT
GCTGGAGAGATTGCAGAAAGTTCTCAGGTTATGGTTTCCATTTCTTCAGAAGGATTTGTTGATCAGTTGTTTGACTTAGCTCCTTGCACAAGCTTGCTTGTTGTCCATGA
TTCTATTTTAACATTAGCTTGTATTAAAGAAAAATATGACAAAGTTAAGTGTTGGCAAGGAGAAGTTATATATGTACCAGAAAAATGGGGACCTTTCGATGTTGTATTTC
TCTATTATCTTCCAGCCATGCCTTTCGAACTTGACGCAATTGTTGGAGCACTTTCAAAACGTTGTGTACCAGGTGGAAGACTAGTTATTAGCCATCCCAACGGAAGGAAA
GCATTAGAGCAAGAACAACAACAGTTCCCAGATGTCGTAGTTTCGGATTTACCTGATAGGATGACTTTGCAGAAAGCTGCTGCAGATCACTCTTTTGACTTGATTGAATT
TATAGATGTGCATAGCTTTTATCTTGCAATTTTGAAGTTCCACAAGGATAGAAGTTAAGCCTAAATATTTCATCAAGGGCTAGGAGTTCCATGTCTAAATCCTGCTGCAA
AACTTTTGTGATTTGGATAAACAAAATTTTGCTGTTTTATCATTTATCAATGGTACAGTACATACTATTTTATG
Protein sequenceShow/hide protein sequence
MNSLFLHSSLSPSLTCKPRPRPLKRPIHICALRGSHHNTRIFAPSILFPFKLCPAFHTSRPIACFSTPSNEGVVSVINFEDLVEKDFSFLDSHDLISTEEHDQKIRRIIS
AGEIAESSQVMVSISSEGFVDQLFDLAPCTSLLVVHDSILTLACIKEKYDKVKCWQGEVIYVPEKWGPFDVVFLYYLPAMPFELDAIVGALSKRCVPGGRLVISHPNGRK
ALEQEQQQFPDVVVSDLPDRMTLQKAAADHSFDLIEFIDVHSFYLAILKFHKDRS