; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc10G17080 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc10G17080
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
DescriptionKynurenine formamidase
Genome locationClcChr10:30968431..30969690
RNA-Seq ExpressionClc10G17080
SyntenyClc10G17080
Gene Ontology termsNA
InterPro domainsIPR029063 - S-adenosyl-L-methionine-dependent methyltransferase


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6596247.1 hypothetical protein SDJN03_09427, partial [Cucurbita argyrosperma subsp. sororia]3.3e-12080.59Show/hide
Query:  MNSLFLHSSLSPSLTCKPRPRPLKRPIHICALRGSHHNTRIFAPSILFPFKLCPAVHTSRPIACFSTPSNEGVVSVINFEDLVEKDFSFLDSHDLISTEE
        MNSLFLHSSL P L CKP P PLK+PIH+ AL G   +TRIF+ S LFP KLCP++H SR I C STPSNEGVVSVINFEDLVEKDFSFLDS D  S EE
Subjt:  MNSLFLHSSLSPSLTCKPRPRPLKRPIHICALRGSHHNTRIFAPSILFPFKLCPAVHTSRPIACFSTPSNEGVVSVINFEDLVEKDFSFLDSHDLISTEE

Query:  HDQKIRRIISAGEIAESSQVMVSISSEGFVDQLFDLAPCTSLLVVHDSILTLACIKEKYDKVKCWQGEVIYVPEKWGPFDVVFLYYLPAMPFELDAIVGA
        HDQKIRRIISAGEIAESSQVMV+ISSEGFVD+L+D APC SLLVVHD+IL LACIKEKYDKVKCWQGEVIYVPEKWGPFDVVFLYYLPAMPFELD I G 
Subjt:  HDQKIRRIISAGEIAESSQVMVSISSEGFVDQLFDLAPCTSLLVVHDSILTLACIKEKYDKVKCWQGEVIYVPEKWGPFDVVFLYYLPAMPFELDAIVGA

Query:  LSKRCVSGGRLVISHPNGRKALEQEQQQFPDVVVSDLPDRMTLQKAAADHSFDLIEFIDEYSFYLAILKFHKD
        L++RC+ GGRLVISHP GRKAL+QEQQQF DVVVSDLPDR TLQK AADHSF L EF+DE  FYLA+LKF+KD
Subjt:  LSKRCVSGGRLVISHPNGRKALEQEQQQFPDVVVSDLPDRMTLQKAAADHSFDLIEFIDEYSFYLAILKFHKD

XP_004136559.1 uncharacterized protein LOC101219545 [Cucumis sativus]1.6e-12786.18Show/hide
Query:  MNSLFLHSSLSPSLTCKPRPRPLKRPIHICALRGSHHNTRIFAPSILFPFKLCPAVHTSRPIACFSTPSNEGVVSVINFEDLVEKDFSFLDSHDLISTEE
        MNSLFLHSSL PS+TCKPR  PL+RPI ICALRGSHHNTRIFAP I F     PA+H S  IAC STPSNEGVVSV+NFEDLVEKDFSFLDS D  S EE
Subjt:  MNSLFLHSSLSPSLTCKPRPRPLKRPIHICALRGSHHNTRIFAPSILFPFKLCPAVHTSRPIACFSTPSNEGVVSVINFEDLVEKDFSFLDSHDLISTEE

Query:  HDQKIRRIISAGEIAESSQVMVSISSEGFVDQLFDLAPCTSLLVVHDSILTLACIKEKYDKVKCWQGEVIYVPEKWGPFDVVFLYYLPAMPFELDAIVGA
        H QKIRRIISAGEI ESSQVMVSISSEGFVDQLF LAP  SLLVVHDSILTLACIKEKYDKVKCWQGEVIYVPEKWGPFD VFLYYLPAMPFELDAI GA
Subjt:  HDQKIRRIISAGEIAESSQVMVSISSEGFVDQLFDLAPCTSLLVVHDSILTLACIKEKYDKVKCWQGEVIYVPEKWGPFDVVFLYYLPAMPFELDAIVGA

Query:  LSKRCVSGGRLVISHPNGRKALEQEQQQFPDVVVSDLPDRMTLQKAAADHSFDLIEFIDEYSFYLAILKFHKDRS
        LS+RCV+G RLVISHPNGRKALEQEQQQFPDVVVSDLPDRMTLQKAAADHSFDL EFIDE+ FYLAILKF+KD S
Subjt:  LSKRCVSGGRLVISHPNGRKALEQEQQQFPDVVVSDLPDRMTLQKAAADHSFDLIEFIDEYSFYLAILKFHKDRS

XP_008443029.1 PREDICTED: uncharacterized protein LOC103486747 [Cucumis melo]7.6e-12584.48Show/hide
Query:  MNSLFLHSSLSPSLTCKPRPRPLK--RPIHICALRGSHHNTRIFAPSILFPFKLCPAVHTSRPIACFSTPSNEGVVSVINFEDLVEKDFSFLDSHDLIST
        MNSLFLHSSL PSLTC PR  PLK  RPIH+CA RGSHHNTRIFAPSI F     PA+H S   A  STPSNEGVVSV+NFEDLVEKDFSFLDS D  S 
Subjt:  MNSLFLHSSLSPSLTCKPRPRPLK--RPIHICALRGSHHNTRIFAPSILFPFKLCPAVHTSRPIACFSTPSNEGVVSVINFEDLVEKDFSFLDSHDLIST

Query:  EEHDQKIRRIISAGEIAESSQVMVSISSEGFVDQLFDLAPCTSLLVVHDSILTLACIKEKYDKVKCWQGEVIYVPEKWGPFDVVFLYYLPAMPFELDAIV
        EEHD+KIRRIISAGEI ESSQVMVSISSEGFVDQLF LAP  SLLVVHDSILTLACIKEKYDKVKCWQGE+IYVPEKWGPFD VFLYYLPAMPFELDAI 
Subjt:  EEHDQKIRRIISAGEIAESSQVMVSISSEGFVDQLFDLAPCTSLLVVHDSILTLACIKEKYDKVKCWQGEVIYVPEKWGPFDVVFLYYLPAMPFELDAIV

Query:  GALSKRCVSGGRLVISHPNGRKALEQEQQQFPDVVVSDLPDRMTLQKAAADHSFDLIEFIDEYSFYLAILKFHKDRS
        GALS+RCV+G RLVISHP+GRKALEQE+QQFPDVVVSDLPDRMTLQKAAADHSFDL EFIDE+ FYLAILKF+KDRS
Subjt:  GALSKRCVSGGRLVISHPNGRKALEQEQQQFPDVVVSDLPDRMTLQKAAADHSFDLIEFIDEYSFYLAILKFHKDRS

XP_022940190.1 uncharacterized protein LOC111445889 [Cucurbita moschata]4.3e-12080.59Show/hide
Query:  MNSLFLHSSLSPSLTCKPRPRPLKRPIHICALRGSHHNTRIFAPSILFPFKLCPAVHTSRPIACFSTPSNEGVVSVINFEDLVEKDFSFLDSHDLISTEE
        MNSLFLHSSL P L CKP P PLK PIH+ AL G   +TRIF+ S LFP KLCP++H SR I C STPSNEGVVSVINFEDLVEKDFSFLDS D  S EE
Subjt:  MNSLFLHSSLSPSLTCKPRPRPLKRPIHICALRGSHHNTRIFAPSILFPFKLCPAVHTSRPIACFSTPSNEGVVSVINFEDLVEKDFSFLDSHDLISTEE

Query:  HDQKIRRIISAGEIAESSQVMVSISSEGFVDQLFDLAPCTSLLVVHDSILTLACIKEKYDKVKCWQGEVIYVPEKWGPFDVVFLYYLPAMPFELDAIVGA
        HD+KIRRIISAGEIAESSQVMV+ISSEGFVD+L+D APC SLLVVHD+IL LACIKEKYDKVKCWQGEVIYVPEKWGPFDVVFLYYLPAMPFELDAI GA
Subjt:  HDQKIRRIISAGEIAESSQVMVSISSEGFVDQLFDLAPCTSLLVVHDSILTLACIKEKYDKVKCWQGEVIYVPEKWGPFDVVFLYYLPAMPFELDAIVGA

Query:  LSKRCVSGGRLVISHPNGRKALEQEQQQFPDVVVSDLPDRMTLQKAAADHSFDLIEFIDEYSFYLAILKFHKD
        L++RC+ GGRLVISHP GRK L+QEQQQF DVVVSDLPDR TLQK AADHSF L EF+DE  FYLA+LKF+KD
Subjt:  LSKRCVSGGRLVISHPNGRKALEQEQQQFPDVVVSDLPDRMTLQKAAADHSFDLIEFIDEYSFYLAILKFHKD

XP_038905691.1 uncharacterized protein LOC120091662 [Benincasa hispida]3.7e-13286.55Show/hide
Query:  MNSLFLHSSLSPSLTCKPRPRPLKRPIHICALRGSHHNTRIFAPSILFPFKLCPAVHTSRPIACFSTPSNEGVVSVINFEDLVEKDFSFLDSHDLISTEE
        MNSLFLHSSL PS TCKP+P  LKR IHIC LRGSHHNTRIFAP +LFPFKL PA+H ++ I CFSTP +EGVVSVINFEDLVEKDFSFLDS +  STEE
Subjt:  MNSLFLHSSLSPSLTCKPRPRPLKRPIHICALRGSHHNTRIFAPSILFPFKLCPAVHTSRPIACFSTPSNEGVVSVINFEDLVEKDFSFLDSHDLISTEE

Query:  HDQKIRRIISAGEIAESSQVMVSISSEGFVDQLFDLAPCTSLLVVHDSILTLACIKEKYDKVKCWQGEVIYVPEKWGPFDVVFLYYLPAMPFELDAIVGA
        HDQKIRRIISAGEI ESSQVMVSISSEGFVDQLFDLAPC SLLVVHDSILTLACIKEKYDKVKCWQGEVIYVPEKWGPFDVVFLYYLPAMPFELDAI  A
Subjt:  HDQKIRRIISAGEIAESSQVMVSISSEGFVDQLFDLAPCTSLLVVHDSILTLACIKEKYDKVKCWQGEVIYVPEKWGPFDVVFLYYLPAMPFELDAIVGA

Query:  LSKRCVSGGRLVISHPNGRKALEQEQQQFPDVVVSDLPDRMTLQKAAADHSFDLIEFIDEYSFYLAILKFHKDRS
        LSKRCV+G RLVISHPNGRK LEQEQQQFPDVVVSDLP+RM L+ AAADHSF+L EFIDEY+FYLA+LKFHKDRS
Subjt:  LSKRCVSGGRLVISHPNGRKALEQEQQQFPDVVVSDLPDRMTLQKAAADHSFDLIEFIDEYSFYLAILKFHKDRS

TrEMBL top hitse value%identityAlignment
A0A1S3B7V6 uncharacterized protein LOC1034867473.7e-12584.48Show/hide
Query:  MNSLFLHSSLSPSLTCKPRPRPLK--RPIHICALRGSHHNTRIFAPSILFPFKLCPAVHTSRPIACFSTPSNEGVVSVINFEDLVEKDFSFLDSHDLIST
        MNSLFLHSSL PSLTC PR  PLK  RPIH+CA RGSHHNTRIFAPSI F     PA+H S   A  STPSNEGVVSV+NFEDLVEKDFSFLDS D  S 
Subjt:  MNSLFLHSSLSPSLTCKPRPRPLK--RPIHICALRGSHHNTRIFAPSILFPFKLCPAVHTSRPIACFSTPSNEGVVSVINFEDLVEKDFSFLDSHDLIST

Query:  EEHDQKIRRIISAGEIAESSQVMVSISSEGFVDQLFDLAPCTSLLVVHDSILTLACIKEKYDKVKCWQGEVIYVPEKWGPFDVVFLYYLPAMPFELDAIV
        EEHD+KIRRIISAGEI ESSQVMVSISSEGFVDQLF LAP  SLLVVHDSILTLACIKEKYDKVKCWQGE+IYVPEKWGPFD VFLYYLPAMPFELDAI 
Subjt:  EEHDQKIRRIISAGEIAESSQVMVSISSEGFVDQLFDLAPCTSLLVVHDSILTLACIKEKYDKVKCWQGEVIYVPEKWGPFDVVFLYYLPAMPFELDAIV

Query:  GALSKRCVSGGRLVISHPNGRKALEQEQQQFPDVVVSDLPDRMTLQKAAADHSFDLIEFIDEYSFYLAILKFHKDRS
        GALS+RCV+G RLVISHP+GRKALEQE+QQFPDVVVSDLPDRMTLQKAAADHSFDL EFIDE+ FYLAILKF+KDRS
Subjt:  GALSKRCVSGGRLVISHPNGRKALEQEQQQFPDVVVSDLPDRMTLQKAAADHSFDLIEFIDEYSFYLAILKFHKDRS

A0A5D3DP12 Uncharacterized protein3.7e-12584.48Show/hide
Query:  MNSLFLHSSLSPSLTCKPRPRPLK--RPIHICALRGSHHNTRIFAPSILFPFKLCPAVHTSRPIACFSTPSNEGVVSVINFEDLVEKDFSFLDSHDLIST
        MNSLFLHSSL PSLTC PR  PLK  RPIH+CA RGSHHNTRIFAPSI F     PA+H S   A  STPSNEGVVSV+NFEDLVEKDFSFLDS D  S 
Subjt:  MNSLFLHSSLSPSLTCKPRPRPLK--RPIHICALRGSHHNTRIFAPSILFPFKLCPAVHTSRPIACFSTPSNEGVVSVINFEDLVEKDFSFLDSHDLIST

Query:  EEHDQKIRRIISAGEIAESSQVMVSISSEGFVDQLFDLAPCTSLLVVHDSILTLACIKEKYDKVKCWQGEVIYVPEKWGPFDVVFLYYLPAMPFELDAIV
        EEHD+KIRRIISAGEI ESSQVMVSISSEGFVDQLF LAP  SLLVVHDSILTLACIKEKYDKVKCWQGE+IYVPEKWGPFD VFLYYLPAMPFELDAI 
Subjt:  EEHDQKIRRIISAGEIAESSQVMVSISSEGFVDQLFDLAPCTSLLVVHDSILTLACIKEKYDKVKCWQGEVIYVPEKWGPFDVVFLYYLPAMPFELDAIV

Query:  GALSKRCVSGGRLVISHPNGRKALEQEQQQFPDVVVSDLPDRMTLQKAAADHSFDLIEFIDEYSFYLAILKFHKDRS
        GALS+RCV+G RLVISHP+GRKALEQE+QQFPDVVVSDLPDRMTLQKAAADHSFDL EFIDE+ FYLAILKF+KDRS
Subjt:  GALSKRCVSGGRLVISHPNGRKALEQEQQQFPDVVVSDLPDRMTLQKAAADHSFDLIEFIDEYSFYLAILKFHKDRS

A0A6J1DD17 uncharacterized protein LOC1110193611.8e-11979.64Show/hide
Query:  MNSLFLHSSLSPSLTCKPRPRPLKRPIHICALRGSHHNTRIFAPSILFPFKLCPAVHTSRPIACFSTPSNEGVVSVINFEDLVEKDFSFLDSHDLISTEE
        MNSL LHSSL PSLT KP P  LKRP+ IC L GSH + RIF+ S+     L P++H S+P+ C STPSNEGVVSVINFEDLVEKDFSFLDS D  S+EE
Subjt:  MNSLFLHSSLSPSLTCKPRPRPLKRPIHICALRGSHHNTRIFAPSILFPFKLCPAVHTSRPIACFSTPSNEGVVSVINFEDLVEKDFSFLDSHDLISTEE

Query:  HDQKIRRIISAGEIAESSQVMVSISSEGFVDQLFDLAPCTSLLVVHDSILTLACIKEKYDKVKCWQGEVIYVPEKWGPFDVVFLYYLPAMPFELDAIVGA
        +DQKIRRIISAGE+AESSQVMVSI SEGFVDQLFD APC SLLV+HDSILTLACIKEKYDKVKCWQGEVIYVPEKWGPFDVVFLYYLPAMPFELDAI G 
Subjt:  HDQKIRRIISAGEIAESSQVMVSISSEGFVDQLFDLAPCTSLLVVHDSILTLACIKEKYDKVKCWQGEVIYVPEKWGPFDVVFLYYLPAMPFELDAIVGA

Query:  LSKRCVSGGRLVISHPNGRKALEQEQQQFPDVVVSDLPDRMTLQKAAADHSFDLIEFIDEYSFYLAILKFHKDRS
        LSKRCV G RLVISHPNG+ ALE+EQQQFPDVVVS LPD+MTLQK AADHS DL EF+D+  FYLA+LKFHKDRS
Subjt:  LSKRCVSGGRLVISHPNGRKALEQEQQQFPDVVVSDLPDRMTLQKAAADHSFDLIEFIDEYSFYLAILKFHKDRS

A0A6J1FPV5 uncharacterized protein LOC1114458892.1e-12080.59Show/hide
Query:  MNSLFLHSSLSPSLTCKPRPRPLKRPIHICALRGSHHNTRIFAPSILFPFKLCPAVHTSRPIACFSTPSNEGVVSVINFEDLVEKDFSFLDSHDLISTEE
        MNSLFLHSSL P L CKP P PLK PIH+ AL G   +TRIF+ S LFP KLCP++H SR I C STPSNEGVVSVINFEDLVEKDFSFLDS D  S EE
Subjt:  MNSLFLHSSLSPSLTCKPRPRPLKRPIHICALRGSHHNTRIFAPSILFPFKLCPAVHTSRPIACFSTPSNEGVVSVINFEDLVEKDFSFLDSHDLISTEE

Query:  HDQKIRRIISAGEIAESSQVMVSISSEGFVDQLFDLAPCTSLLVVHDSILTLACIKEKYDKVKCWQGEVIYVPEKWGPFDVVFLYYLPAMPFELDAIVGA
        HD+KIRRIISAGEIAESSQVMV+ISSEGFVD+L+D APC SLLVVHD+IL LACIKEKYDKVKCWQGEVIYVPEKWGPFDVVFLYYLPAMPFELDAI GA
Subjt:  HDQKIRRIISAGEIAESSQVMVSISSEGFVDQLFDLAPCTSLLVVHDSILTLACIKEKYDKVKCWQGEVIYVPEKWGPFDVVFLYYLPAMPFELDAIVGA

Query:  LSKRCVSGGRLVISHPNGRKALEQEQQQFPDVVVSDLPDRMTLQKAAADHSFDLIEFIDEYSFYLAILKFHKD
        L++RC+ GGRLVISHP GRK L+QEQQQF DVVVSDLPDR TLQK AADHSF L EF+DE  FYLA+LKF+KD
Subjt:  LSKRCVSGGRLVISHPNGRKALEQEQQQFPDVVVSDLPDRMTLQKAAADHSFDLIEFIDEYSFYLAILKFHKD

A0A6J1I6B9 uncharacterized protein LOC1114700362.7e-12080.95Show/hide
Query:  MNSLFLHSSLSPSLTCKPRPRPLKRPIHICALRGSHHNTRIFAPSILFPFKLCPAVHTSRPIACFSTPSNEGVVSVINFEDLVEKDFSFLDSHDLISTEE
        MNSLFLHSSL P L CKP P PLKRPIH+ AL G   +TRIF+ S L P KLCP++H SR I C STPSNEGVVSVINFEDLVEKDFSFLDS D  S EE
Subjt:  MNSLFLHSSLSPSLTCKPRPRPLKRPIHICALRGSHHNTRIFAPSILFPFKLCPAVHTSRPIACFSTPSNEGVVSVINFEDLVEKDFSFLDSHDLISTEE

Query:  HDQKIRRIISAGEIAESSQVMVSISSEGFVDQLFDLAPCTSLLVVHDSILTLACIKEKYDKVKCWQGEVIYVPEKWGPFDVVFLYYLPAMPFELDAIVGA
        HD+KIRRIISAGEIAESSQVMV+ISSEGFVD+L+D APC SLLVVHD+IL LACIKEKYDKVKCWQGEVIYVPEKWGPFDVVFLYYLPAMPFELDAI GA
Subjt:  HDQKIRRIISAGEIAESSQVMVSISSEGFVDQLFDLAPCTSLLVVHDSILTLACIKEKYDKVKCWQGEVIYVPEKWGPFDVVFLYYLPAMPFELDAIVGA

Query:  LSKRCVSGGRLVISHPNGRKALEQEQQQFPDVVVSDLPDRMTLQKAAADHSFDLIEFIDEYSFYLAILKFHKD
        L++RC+ GGRLVISHP GRKAL+QEQQQF DVVVSDLPDR TLQK AADHSF L EF+DE  FYLA+LKF+KD
Subjt:  LSKRCVSGGRLVISHPNGRKALEQEQQQFPDVVVSDLPDRMTLQKAAADHSFDLIEFIDEYSFYLAILKFHKD

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G41950.1 unknown protein4.8e-6961.03Show/hide
Query:  RPIACFSTPSNEGVVSVINFEDLVEKDFSFLDSHDLISTEEHDQKIRRIISAGEIAESSQVMVSISSEGFVDQLFDLAPCTSLLVVHDSILTLACIKEKY
        R     S+   EG VSV++F    EKD+SFL+S ++ ST EH QKI RII AGE++ESS+V+VSISSE FVD+L + +P   LL+VHDS+ TLACIKEKY
Subjt:  RPIACFSTPSNEGVVSVINFEDLVEKDFSFLDSHDLISTEEHDQKIRRIISAGEIAESSQVMVSISSEGFVDQLFDLAPCTSLLVVHDSILTLACIKEKY

Query:  DKVKCWQGEVIYVPEKWGPFDVVFLYYLPAMPFELDAIVGALSKRCVSGGRLVISHPNGRKALEQEQQQFPDVVVSDLPDRMTLQKAAADHSFDLIEFID
        DKVKCWQGE+IYVPEKW P D VFLY+LPA+PF+LD +   LS+RC SG R+VISHP GR  LEQ++++F DVVVSDLPD  TL   A  HSF+L +F+D
Subjt:  DKVKCWQGEVIYVPEKWGPFDVVFLYYLPAMPFELDAIVGALSKRCVSGGRLVISHPNGRKALEQEQQQFPDVVVSDLPDRMTLQKAAADHSFDLIEFID

Query:  EYSFYLAILKFHK
        E   YLA+LK  K
Subjt:  EYSFYLAILKFHK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAATTCTCTGTTTTTACACTCTTCTCTTAGTCCATCACTTACTTGCAAACCTCGGCCGCGTCCTCTTAAGAGGCCTATACATATATGTGCACTACGTGGCTCTCATCA
TAATACTCGGATATTTGCTCCTTCGATATTATTTCCTTTCAAATTGTGTCCAGCCGTTCATACTAGTAGACCAATCGCCTGTTTTTCAACTCCTTCAAATGAAGGTGTAG
TATCTGTAATCAATTTTGAAGATTTAGTTGAGAAGGACTTCTCTTTTCTCGATTCACACGACCTTATTTCCACAGAAGAGCATGATCAAAAGATTAGGCGGATTATTTCT
GCTGGAGAGATTGCAGAAAGTTCTCAGGTTATGGTTTCCATTTCTTCAGAAGGATTTGTTGATCAGTTGTTTGACTTAGCTCCTTGCACAAGTTTGCTTGTTGTCCATGA
TTCTATTTTAACATTAGCTTGTATTAAAGAAAAATATGACAAAGTTAAGTGTTGGCAAGGAGAAGTTATATATGTACCAGAAAAATGGGGACCTTTCGATGTTGTATTTC
TCTATTATCTTCCAGCCATGCCTTTCGAACTTGACGCAATTGTTGGAGCACTTTCAAAACGTTGTGTATCAGGTGGAAGACTAGTTATTAGCCATCCCAACGGAAGGAAA
GCATTAGAGCAAGAACAACAACAGTTCCCAGATGTCGTAGTTTCGGATTTACCTGATAGGATGACTTTGCAGAAAGCTGCTGCAGATCACTCTTTTGACTTGATTGAATT
TATAGATGAGTATAGCTTTTATCTTGCAATTTTGAAGTTCCACAAGGATAGAAGTTAA
mRNA sequenceShow/hide mRNA sequence
ATGAATTCTCTGTTTTTACACTCTTCTCTTAGTCCATCACTTACTTGCAAACCTCGGCCGCGTCCTCTTAAGAGGCCTATACATATATGTGCACTACGTGGCTCTCATCA
TAATACTCGGATATTTGCTCCTTCGATATTATTTCCTTTCAAATTGTGTCCAGCCGTTCATACTAGTAGACCAATCGCCTGTTTTTCAACTCCTTCAAATGAAGGTGTAG
TATCTGTAATCAATTTTGAAGATTTAGTTGAGAAGGACTTCTCTTTTCTCGATTCACACGACCTTATTTCCACAGAAGAGCATGATCAAAAGATTAGGCGGATTATTTCT
GCTGGAGAGATTGCAGAAAGTTCTCAGGTTATGGTTTCCATTTCTTCAGAAGGATTTGTTGATCAGTTGTTTGACTTAGCTCCTTGCACAAGTTTGCTTGTTGTCCATGA
TTCTATTTTAACATTAGCTTGTATTAAAGAAAAATATGACAAAGTTAAGTGTTGGCAAGGAGAAGTTATATATGTACCAGAAAAATGGGGACCTTTCGATGTTGTATTTC
TCTATTATCTTCCAGCCATGCCTTTCGAACTTGACGCAATTGTTGGAGCACTTTCAAAACGTTGTGTATCAGGTGGAAGACTAGTTATTAGCCATCCCAACGGAAGGAAA
GCATTAGAGCAAGAACAACAACAGTTCCCAGATGTCGTAGTTTCGGATTTACCTGATAGGATGACTTTGCAGAAAGCTGCTGCAGATCACTCTTTTGACTTGATTGAATT
TATAGATGAGTATAGCTTTTATCTTGCAATTTTGAAGTTCCACAAGGATAGAAGTTAA
Protein sequenceShow/hide protein sequence
MNSLFLHSSLSPSLTCKPRPRPLKRPIHICALRGSHHNTRIFAPSILFPFKLCPAVHTSRPIACFSTPSNEGVVSVINFEDLVEKDFSFLDSHDLISTEEHDQKIRRIIS
AGEIAESSQVMVSISSEGFVDQLFDLAPCTSLLVVHDSILTLACIKEKYDKVKCWQGEVIYVPEKWGPFDVVFLYYLPAMPFELDAIVGALSKRCVSGGRLVISHPNGRK
ALEQEQQQFPDVVVSDLPDRMTLQKAAADHSFDLIEFIDEYSFYLAILKFHKDRS