; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg033044 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg033044
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionKynurenine formamidase
Genome locationscaffold5:6563518..6564929
RNA-Seq ExpressionSpg033044
SyntenySpg033044
Gene Ontology termsNA
InterPro domainsIPR029063 - S-adenosyl-L-methionine-dependent methyltransferase


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6596247.1 hypothetical protein SDJN03_09427, partial [Cucurbita argyrosperma subsp. sororia]2.6e-13387.18Show/hide
Query:  MNSLFLHSSLGPSLTCKPLPCPLKRPIHICALRGSHRNTRILSRSVLFPVKLCPSLHISRSVTCSSTPSNEGVVSVINFEDLVEKDFSFLDSDDFSSIEE
        MNSLFLHSSLGP L CKPLPCPLK+PIH+ AL G   +TRI SRS LFPVKLCPSLHISRS+TCSSTPSNEGVVSVINFEDLVEKDFSFLDSDDF SIEE
Subjt:  MNSLFLHSSLGPSLTCKPLPCPLKRPIHICALRGSHRNTRILSRSVLFPVKLCPSLHISRSVTCSSTPSNEGVVSVINFEDLVEKDFSFLDSDDFSSIEE

Query:  HDQKIRRIISAGEIAESSQVMVSISSEGFVDQVYDSAPCRSLLVVHDSILTLACIKEKYDKVKCWQGEVIYVPEKWGPFDAVFLYYLPAMPFELDAIFGA
        HDQKIRRIISAGEIAESSQVMV+ISSEGFVD++YDSAPCRSLLVVHD+IL LACIKEKYDKVKCWQGEVIYVPEKWGPFD VFLYYLPAMPFELD IFG 
Subjt:  HDQKIRRIISAGEIAESSQVMVSISSEGFVDQVYDSAPCRSLLVVHDSILTLACIKEKYDKVKCWQGEVIYVPEKWGPFDAVFLYYLPAMPFELDAIFGA

Query:  LSKRCVPGARLVISHPNGRKALEQEQQRFPDVVLSDLPDRTTLQKAAADHSLDLTEFVDEDAFYLAVLKFNKD
        L++RC+PG RLVISHP GRKAL+QEQQ+F DVV+SDLPDRTTLQK AADHS  LTEFVDED FYLAVLKFNKD
Subjt:  LSKRCVPGARLVISHPNGRKALEQEQQRFPDVVLSDLPDRTTLQKAAADHSLDLTEFVDEDAFYLAVLKFNKD

XP_022940190.1 uncharacterized protein LOC111445889 [Cucurbita moschata]3.4e-13387.18Show/hide
Query:  MNSLFLHSSLGPSLTCKPLPCPLKRPIHICALRGSHRNTRILSRSVLFPVKLCPSLHISRSVTCSSTPSNEGVVSVINFEDLVEKDFSFLDSDDFSSIEE
        MNSLFLHSSLGP L CKPLPCPLK PIH+ AL G   +TRI SRS LFPVKLCPSLHISRS+TCSSTPSNEGVVSVINFEDLVEKDFSFLDSDDF SIEE
Subjt:  MNSLFLHSSLGPSLTCKPLPCPLKRPIHICALRGSHRNTRILSRSVLFPVKLCPSLHISRSVTCSSTPSNEGVVSVINFEDLVEKDFSFLDSDDFSSIEE

Query:  HDQKIRRIISAGEIAESSQVMVSISSEGFVDQVYDSAPCRSLLVVHDSILTLACIKEKYDKVKCWQGEVIYVPEKWGPFDAVFLYYLPAMPFELDAIFGA
        HD+KIRRIISAGEIAESSQVMV+ISSEGFVD++YDSAPCRSLLVVHD+IL LACIKEKYDKVKCWQGEVIYVPEKWGPFD VFLYYLPAMPFELDAIFGA
Subjt:  HDQKIRRIISAGEIAESSQVMVSISSEGFVDQVYDSAPCRSLLVVHDSILTLACIKEKYDKVKCWQGEVIYVPEKWGPFDAVFLYYLPAMPFELDAIFGA

Query:  LSKRCVPGARLVISHPNGRKALEQEQQRFPDVVLSDLPDRTTLQKAAADHSLDLTEFVDEDAFYLAVLKFNKD
        L++RC+PG RLVISHP GRK L+QEQQ+F DVV+SDLPDRTTLQK AADHS  LTEFVDED FYLAVLKFNKD
Subjt:  LSKRCVPGARLVISHPNGRKALEQEQQRFPDVVLSDLPDRTTLQKAAADHSLDLTEFVDEDAFYLAVLKFNKD

XP_022971263.1 uncharacterized protein LOC111470036 [Cucurbita maxima]4.4e-13387.55Show/hide
Query:  MNSLFLHSSLGPSLTCKPLPCPLKRPIHICALRGSHRNTRILSRSVLFPVKLCPSLHISRSVTCSSTPSNEGVVSVINFEDLVEKDFSFLDSDDFSSIEE
        MNSLFLHSSLGP L CKPLPCPLKRPIH+ AL G   +TRI SRS L PVKLCPSLHISRS+TCSSTPSNEGVVSVINFEDLVEKDFSFLDSDDF SIEE
Subjt:  MNSLFLHSSLGPSLTCKPLPCPLKRPIHICALRGSHRNTRILSRSVLFPVKLCPSLHISRSVTCSSTPSNEGVVSVINFEDLVEKDFSFLDSDDFSSIEE

Query:  HDQKIRRIISAGEIAESSQVMVSISSEGFVDQVYDSAPCRSLLVVHDSILTLACIKEKYDKVKCWQGEVIYVPEKWGPFDAVFLYYLPAMPFELDAIFGA
        HD+KIRRIISAGEIAESSQVMV+ISSEGFVD++YDSAPCRSLLVVHD+IL LACIKEKYDKVKCWQGEVIYVPEKWGPFD VFLYYLPAMPFELDAIFGA
Subjt:  HDQKIRRIISAGEIAESSQVMVSISSEGFVDQVYDSAPCRSLLVVHDSILTLACIKEKYDKVKCWQGEVIYVPEKWGPFDAVFLYYLPAMPFELDAIFGA

Query:  LSKRCVPGARLVISHPNGRKALEQEQQRFPDVVLSDLPDRTTLQKAAADHSLDLTEFVDEDAFYLAVLKFNKD
        L++RC+PG RLVISHP GRKAL+QEQQ+F DVV+SDLPDRTTLQK AADHS  LTEFVDED FYLAVLKFNKD
Subjt:  LSKRCVPGARLVISHPNGRKALEQEQQRFPDVVLSDLPDRTTLQKAAADHSLDLTEFVDEDAFYLAVLKFNKD

XP_023540915.1 uncharacterized protein LOC111801154 [Cucurbita pepo subsp. pepo]2.2e-13286.45Show/hide
Query:  MNSLFLHSSLGPSLTCKPLPCPLKRPIHICALRGSHRNTRILSRSVLFPVKLCPSLHISRSVTCSSTPSNEGVVSVINFEDLVEKDFSFLDSDDFSSIEE
        MNSLFLHSSLGP L CKPLPCPLKRPIH+ AL G   +TR+ SRS LFPVKLCPSLHISRS+TCSSTPSNEGVVSVINFEDLVEKDFSFLDSDDF  IEE
Subjt:  MNSLFLHSSLGPSLTCKPLPCPLKRPIHICALRGSHRNTRILSRSVLFPVKLCPSLHISRSVTCSSTPSNEGVVSVINFEDLVEKDFSFLDSDDFSSIEE

Query:  HDQKIRRIISAGEIAESSQVMVSISSEGFVDQVYDSAPCRSLLVVHDSILTLACIKEKYDKVKCWQGEVIYVPEKWGPFDAVFLYYLPAMPFELDAIFGA
        HD+KIRRIISAGEIAESSQVMV+ISSEGFVD++Y+SAPCRSLLVVHD+IL LACIKEKYDKVKCWQGEVIYVPEKWGPFD VFLYYLPAMPFELDAIFGA
Subjt:  HDQKIRRIISAGEIAESSQVMVSISSEGFVDQVYDSAPCRSLLVVHDSILTLACIKEKYDKVKCWQGEVIYVPEKWGPFDAVFLYYLPAMPFELDAIFGA

Query:  LSKRCVPGARLVISHPNGRKALEQEQQRFPDVVLSDLPDRTTLQKAAADHSLDLTEFVDEDAFYLAVLKFNKD
        L++RC+PG RLVISHP GRK L+QEQQ+F DVV+SDLPDRTTLQK AADHS  LTEFVDED FYLAVLKFNKD
Subjt:  LSKRCVPGARLVISHPNGRKALEQEQQRFPDVVLSDLPDRTTLQKAAADHSLDLTEFVDEDAFYLAVLKFNKD

XP_038905691.1 uncharacterized protein LOC120091662 [Benincasa hispida]6.2e-12784.25Show/hide
Query:  MNSLFLHSSLGPSLTCKPLPCPLKRPIHICALRGSHRNTRILSRSVLFPVKLCPSLHISRSVTCSSTPSNEGVVSVINFEDLVEKDFSFLDSDDFSSIEE
        MNSLFLHSSLGPS TCKP PC LKR IHIC LRGSH NTRI +  VLFP KL P++HI++S+TC STP +EGVVSVINFEDLVEKDFSFLDSD+FSS EE
Subjt:  MNSLFLHSSLGPSLTCKPLPCPLKRPIHICALRGSHRNTRILSRSVLFPVKLCPSLHISRSVTCSSTPSNEGVVSVINFEDLVEKDFSFLDSDDFSSIEE

Query:  HDQKIRRIISAGEIAESSQVMVSISSEGFVDQVYDSAPCRSLLVVHDSILTLACIKEKYDKVKCWQGEVIYVPEKWGPFDAVFLYYLPAMPFELDAIFGA
        HDQKIRRIISAGEI ESSQVMVSISSEGFVDQ++D APCRSLLVVHDSILTLACIKEKYDKVKCWQGEVIYVPEKWGPFD VFLYYLPAMPFELDAIF A
Subjt:  HDQKIRRIISAGEIAESSQVMVSISSEGFVDQVYDSAPCRSLLVVHDSILTLACIKEKYDKVKCWQGEVIYVPEKWGPFDAVFLYYLPAMPFELDAIFGA

Query:  LSKRCVPGARLVISHPNGRKALEQEQQRFPDVVLSDLPDRTTLQKAAADHSLDLTEFVDEDAFYLAVLKFNKD
        LSKRCV GARLVISHPNGRK LEQEQQ+FPDVV+SDLP+R  L+ AAADHS +LTEF+DE  FYLAVLKF+KD
Subjt:  LSKRCVPGARLVISHPNGRKALEQEQQRFPDVVLSDLPDRTTLQKAAADHSLDLTEFVDEDAFYLAVLKFNKD

TrEMBL top hitse value%identityAlignment
A0A1S3B7V6 uncharacterized protein LOC1034867473.4e-12382.91Show/hide
Query:  MNSLFLHSSLGPSLTCKPLPCPLK--RPIHICALRGSHRNTRILSRSVLFPVKLCPSLHISRSVTCSSTPSNEGVVSVINFEDLVEKDFSFLDSDDFSSI
        MNSLFLHSSL PSLTC P   PLK  RPIH+CA RGSH NTRI + S+ F     P+LHIS S   SSTPSNEGVVSV+NFEDLVEKDFSFLDSDDFSS+
Subjt:  MNSLFLHSSLGPSLTCKPLPCPLK--RPIHICALRGSHRNTRILSRSVLFPVKLCPSLHISRSVTCSSTPSNEGVVSVINFEDLVEKDFSFLDSDDFSSI

Query:  EEHDQKIRRIISAGEIAESSQVMVSISSEGFVDQVYDSAPCRSLLVVHDSILTLACIKEKYDKVKCWQGEVIYVPEKWGPFDAVFLYYLPAMPFELDAIF
        EEHD+KIRRIISAGEI ESSQVMVSISSEGFVDQ++  AP RSLLVVHDSILTLACIKEKYDKVKCWQGE+IYVPEKWGPFDAVFLYYLPAMPFELDAIF
Subjt:  EEHDQKIRRIISAGEIAESSQVMVSISSEGFVDQVYDSAPCRSLLVVHDSILTLACIKEKYDKVKCWQGEVIYVPEKWGPFDAVFLYYLPAMPFELDAIF

Query:  GALSKRCVPGARLVISHPNGRKALEQEQQRFPDVVLSDLPDRTTLQKAAADHSLDLTEFVDEDAFYLAVLKFNKD
        GALS+RCV GARLVISHP+GRKALEQE+Q+FPDVV+SDLPDR TLQKAAADHS DLTEF+DE  FYLA+LKFNKD
Subjt:  GALSKRCVPGARLVISHPNGRKALEQEQQRFPDVVLSDLPDRTTLQKAAADHSLDLTEFVDEDAFYLAVLKFNKD

A0A5D3DP12 Uncharacterized protein3.4e-12382.91Show/hide
Query:  MNSLFLHSSLGPSLTCKPLPCPLK--RPIHICALRGSHRNTRILSRSVLFPVKLCPSLHISRSVTCSSTPSNEGVVSVINFEDLVEKDFSFLDSDDFSSI
        MNSLFLHSSL PSLTC P   PLK  RPIH+CA RGSH NTRI + S+ F     P+LHIS S   SSTPSNEGVVSV+NFEDLVEKDFSFLDSDDFSS+
Subjt:  MNSLFLHSSLGPSLTCKPLPCPLK--RPIHICALRGSHRNTRILSRSVLFPVKLCPSLHISRSVTCSSTPSNEGVVSVINFEDLVEKDFSFLDSDDFSSI

Query:  EEHDQKIRRIISAGEIAESSQVMVSISSEGFVDQVYDSAPCRSLLVVHDSILTLACIKEKYDKVKCWQGEVIYVPEKWGPFDAVFLYYLPAMPFELDAIF
        EEHD+KIRRIISAGEI ESSQVMVSISSEGFVDQ++  AP RSLLVVHDSILTLACIKEKYDKVKCWQGE+IYVPEKWGPFDAVFLYYLPAMPFELDAIF
Subjt:  EEHDQKIRRIISAGEIAESSQVMVSISSEGFVDQVYDSAPCRSLLVVHDSILTLACIKEKYDKVKCWQGEVIYVPEKWGPFDAVFLYYLPAMPFELDAIF

Query:  GALSKRCVPGARLVISHPNGRKALEQEQQRFPDVVLSDLPDRTTLQKAAADHSLDLTEFVDEDAFYLAVLKFNKD
        GALS+RCV GARLVISHP+GRKALEQE+Q+FPDVV+SDLPDR TLQKAAADHS DLTEF+DE  FYLA+LKFNKD
Subjt:  GALSKRCVPGARLVISHPNGRKALEQEQQRFPDVVLSDLPDRTTLQKAAADHSLDLTEFVDEDAFYLAVLKFNKD

A0A6J1DD17 uncharacterized protein LOC1110193616.7e-12784.62Show/hide
Query:  MNSLFLHSSLGPSLTCKPLPCPLKRPIHICALRGSHRNTRILSRSVLFPVKLCPSLHISRSVTCSSTPSNEGVVSVINFEDLVEKDFSFLDSDDFSSIEE
        MNSL LHSSLGPSLT KPLPC LKRP+ IC L GSHR+ RI SRS    V L PSLH+S+ VTC+STPSNEGVVSVINFEDLVEKDFSFLDSDDFSS EE
Subjt:  MNSLFLHSSLGPSLTCKPLPCPLKRPIHICALRGSHRNTRILSRSVLFPVKLCPSLHISRSVTCSSTPSNEGVVSVINFEDLVEKDFSFLDSDDFSSIEE

Query:  HDQKIRRIISAGEIAESSQVMVSISSEGFVDQVYDSAPCRSLLVVHDSILTLACIKEKYDKVKCWQGEVIYVPEKWGPFDAVFLYYLPAMPFELDAIFGA
        +DQKIRRIISAGE+AESSQVMVSI SEGFVDQ++DSAPCRSLLV+HDSILTLACIKEKYDKVKCWQGEVIYVPEKWGPFD VFLYYLPAMPFELDAIFG 
Subjt:  HDQKIRRIISAGEIAESSQVMVSISSEGFVDQVYDSAPCRSLLVVHDSILTLACIKEKYDKVKCWQGEVIYVPEKWGPFDAVFLYYLPAMPFELDAIFGA

Query:  LSKRCVPGARLVISHPNGRKALEQEQQRFPDVVLSDLPDRTTLQKAAADHSLDLTEFVDEDAFYLAVLKFNKD
        LSKRCVPGARLVISHPNG+ ALE+EQQ+FPDVV+S LPD+ TLQK AADHSLDLTEFVD++ FYLAVLKF+KD
Subjt:  LSKRCVPGARLVISHPNGRKALEQEQQRFPDVVLSDLPDRTTLQKAAADHSLDLTEFVDEDAFYLAVLKFNKD

A0A6J1FPV5 uncharacterized protein LOC1114458891.6e-13387.18Show/hide
Query:  MNSLFLHSSLGPSLTCKPLPCPLKRPIHICALRGSHRNTRILSRSVLFPVKLCPSLHISRSVTCSSTPSNEGVVSVINFEDLVEKDFSFLDSDDFSSIEE
        MNSLFLHSSLGP L CKPLPCPLK PIH+ AL G   +TRI SRS LFPVKLCPSLHISRS+TCSSTPSNEGVVSVINFEDLVEKDFSFLDSDDF SIEE
Subjt:  MNSLFLHSSLGPSLTCKPLPCPLKRPIHICALRGSHRNTRILSRSVLFPVKLCPSLHISRSVTCSSTPSNEGVVSVINFEDLVEKDFSFLDSDDFSSIEE

Query:  HDQKIRRIISAGEIAESSQVMVSISSEGFVDQVYDSAPCRSLLVVHDSILTLACIKEKYDKVKCWQGEVIYVPEKWGPFDAVFLYYLPAMPFELDAIFGA
        HD+KIRRIISAGEIAESSQVMV+ISSEGFVD++YDSAPCRSLLVVHD+IL LACIKEKYDKVKCWQGEVIYVPEKWGPFD VFLYYLPAMPFELDAIFGA
Subjt:  HDQKIRRIISAGEIAESSQVMVSISSEGFVDQVYDSAPCRSLLVVHDSILTLACIKEKYDKVKCWQGEVIYVPEKWGPFDAVFLYYLPAMPFELDAIFGA

Query:  LSKRCVPGARLVISHPNGRKALEQEQQRFPDVVLSDLPDRTTLQKAAADHSLDLTEFVDEDAFYLAVLKFNKD
        L++RC+PG RLVISHP GRK L+QEQQ+F DVV+SDLPDRTTLQK AADHS  LTEFVDED FYLAVLKFNKD
Subjt:  LSKRCVPGARLVISHPNGRKALEQEQQRFPDVVLSDLPDRTTLQKAAADHSLDLTEFVDEDAFYLAVLKFNKD

A0A6J1I6B9 uncharacterized protein LOC1114700362.1e-13387.55Show/hide
Query:  MNSLFLHSSLGPSLTCKPLPCPLKRPIHICALRGSHRNTRILSRSVLFPVKLCPSLHISRSVTCSSTPSNEGVVSVINFEDLVEKDFSFLDSDDFSSIEE
        MNSLFLHSSLGP L CKPLPCPLKRPIH+ AL G   +TRI SRS L PVKLCPSLHISRS+TCSSTPSNEGVVSVINFEDLVEKDFSFLDSDDF SIEE
Subjt:  MNSLFLHSSLGPSLTCKPLPCPLKRPIHICALRGSHRNTRILSRSVLFPVKLCPSLHISRSVTCSSTPSNEGVVSVINFEDLVEKDFSFLDSDDFSSIEE

Query:  HDQKIRRIISAGEIAESSQVMVSISSEGFVDQVYDSAPCRSLLVVHDSILTLACIKEKYDKVKCWQGEVIYVPEKWGPFDAVFLYYLPAMPFELDAIFGA
        HD+KIRRIISAGEIAESSQVMV+ISSEGFVD++YDSAPCRSLLVVHD+IL LACIKEKYDKVKCWQGEVIYVPEKWGPFD VFLYYLPAMPFELDAIFGA
Subjt:  HDQKIRRIISAGEIAESSQVMVSISSEGFVDQVYDSAPCRSLLVVHDSILTLACIKEKYDKVKCWQGEVIYVPEKWGPFDAVFLYYLPAMPFELDAIFGA

Query:  LSKRCVPGARLVISHPNGRKALEQEQQRFPDVVLSDLPDRTTLQKAAADHSLDLTEFVDEDAFYLAVLKFNKD
        L++RC+PG RLVISHP GRKAL+QEQQ+F DVV+SDLPDRTTLQK AADHS  LTEFVDED FYLAVLKFNKD
Subjt:  LSKRCVPGARLVISHPNGRKALEQEQQRFPDVVLSDLPDRTTLQKAAADHSLDLTEFVDEDAFYLAVLKFNKD

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G41950.1 unknown protein8.8e-7162.44Show/hide
Query:  RSVTCSSTPSNEGVVSVINFEDLVEKDFSFLDSDDFSSIEEHDQKIRRIISAGEIAESSQVMVSISSEGFVDQVYDSAPCRSLLVVHDSILTLACIKEKY
        R+   SS+   EG VSV++F    EKD+SFL+S +  S  EH QKI RII AGE++ESS+V+VSISSE FVD++ +S+P + LL+VHDS+ TLACIKEKY
Subjt:  RSVTCSSTPSNEGVVSVINFEDLVEKDFSFLDSDDFSSIEEHDQKIRRIISAGEIAESSQVMVSISSEGFVDQVYDSAPCRSLLVVHDSILTLACIKEKY

Query:  DKVKCWQGEVIYVPEKWGPFDAVFLYYLPAMPFELDAIFGALSKRCVPGARLVISHPNGRKALEQEQQRFPDVVLSDLPDRTTLQKAAADHSLDLTEFVD
        DKVKCWQGE+IYVPEKW P DAVFLY+LPA+PF+LD +F  LS+RC  GAR+VISHP GR  LEQ+++ F DVV+SDLPD +TL   A  HS +LT+FVD
Subjt:  DKVKCWQGEVIYVPEKWGPFDAVFLYYLPAMPFELDAIFGALSKRCVPGARLVISHPNGRKALEQEQQRFPDVVLSDLPDRTTLQKAAADHSLDLTEFVD

Query:  EDAFYLAVLKFNK
        E   YLAVLK +K
Subjt:  EDAFYLAVLKFNK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAATTCTCTTTTCTTACATTCATCTCTTGGTCCATCACTTACTTGCAAACCTCTGCCATGTCCTCTTAAGAGGCCTATACATATATGTGCACTACGTGGCTCTCATCG
CAATACTCGGATATTGTCTCGTTCAGTATTATTTCCTGTCAAATTGTGTCCATCCTTACACATTAGCAGATCAGTCACCTGTTCTTCAACTCCTTCAAATGAGGGTGTAG
TGTCAGTGATCAATTTTGAAGATTTAGTTGAGAAGGATTTTTCATTTCTCGATTCAGATGATTTTAGTTCTATAGAAGAGCATGATCAAAAGATTAGGCGCATCATTTCT
GCTGGAGAGATTGCAGAAAGTTCTCAGGTTATGGTTTCCATTTCTTCAGAAGGATTTGTTGATCAGGTGTATGACTCAGCTCCTTGTCGAAGTTTGCTTGTTGTCCATGA
TTCTATTCTAACATTAGCTTGTATTAAAGAAAAATATGACAAAGTTAAGTGCTGGCAAGGAGAAGTTATATATGTACCAGAAAAATGGGGACCCTTCGACGCTGTATTTC
TCTATTATCTGCCTGCAATGCCTTTCGAACTTGATGCAATTTTTGGAGCACTCTCAAAACGTTGTGTACCAGGTGCAAGACTAGTTATTAGCCATCCCAACGGAAGGAAA
GCATTAGAACAAGAACAGCAACGGTTCCCCGATGTCGTACTTTCAGATTTACCTGATAGGACGACTTTGCAGAAAGCTGCTGCAGATCACTCTCTTGACCTGACTGAATT
TGTAGATGAGGATGCCTTTTATCTTGCAGTTTTGAAGTTCAACAAGGATAGTACCTAA
mRNA sequenceShow/hide mRNA sequence
ATGAATTCTCTTTTCTTACATTCATCTCTTGGTCCATCACTTACTTGCAAACCTCTGCCATGTCCTCTTAAGAGGCCTATACATATATGTGCACTACGTGGCTCTCATCG
CAATACTCGGATATTGTCTCGTTCAGTATTATTTCCTGTCAAATTGTGTCCATCCTTACACATTAGCAGATCAGTCACCTGTTCTTCAACTCCTTCAAATGAGGGTGTAG
TGTCAGTGATCAATTTTGAAGATTTAGTTGAGAAGGATTTTTCATTTCTCGATTCAGATGATTTTAGTTCTATAGAAGAGCATGATCAAAAGATTAGGCGCATCATTTCT
GCTGGAGAGATTGCAGAAAGTTCTCAGGTTATGGTTTCCATTTCTTCAGAAGGATTTGTTGATCAGGTGTATGACTCAGCTCCTTGTCGAAGTTTGCTTGTTGTCCATGA
TTCTATTCTAACATTAGCTTGTATTAAAGAAAAATATGACAAAGTTAAGTGCTGGCAAGGAGAAGTTATATATGTACCAGAAAAATGGGGACCCTTCGACGCTGTATTTC
TCTATTATCTGCCTGCAATGCCTTTCGAACTTGATGCAATTTTTGGAGCACTCTCAAAACGTTGTGTACCAGGTGCAAGACTAGTTATTAGCCATCCCAACGGAAGGAAA
GCATTAGAACAAGAACAGCAACGGTTCCCCGATGTCGTACTTTCAGATTTACCTGATAGGACGACTTTGCAGAAAGCTGCTGCAGATCACTCTCTTGACCTGACTGAATT
TGTAGATGAGGATGCCTTTTATCTTGCAGTTTTGAAGTTCAACAAGGATAGTACCTAA
Protein sequenceShow/hide protein sequence
MNSLFLHSSLGPSLTCKPLPCPLKRPIHICALRGSHRNTRILSRSVLFPVKLCPSLHISRSVTCSSTPSNEGVVSVINFEDLVEKDFSFLDSDDFSSIEEHDQKIRRIIS
AGEIAESSQVMVSISSEGFVDQVYDSAPCRSLLVVHDSILTLACIKEKYDKVKCWQGEVIYVPEKWGPFDAVFLYYLPAMPFELDAIFGALSKRCVPGARLVISHPNGRK
ALEQEQQRFPDVVLSDLPDRTTLQKAAADHSLDLTEFVDEDAFYLAVLKFNKDST