; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0007988 (gene) of Snake gourd v1 genome

Gene IDTan0007988
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionKynurenine formamidase
Genome locationLG08:63507816..63509200
RNA-Seq ExpressionTan0007988
SyntenyTan0007988
Gene Ontology termsNA
InterPro domainsIPR029063 - S-adenosyl-L-methionine-dependent methyltransferase


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6596247.1 hypothetical protein SDJN03_09427, partial [Cucurbita argyrosperma subsp. sororia]5.9e-13085.35Show/hide
Query:  MSSLFLHSSLGPSLTCKPLPCPLKRHIHILALHSSHRNTRIFSRSVFFPVKLCPSLNTSRSITCSSTPSNEGVVSVINFEDLVEKDFSFLDSEDFSSIEE
        M+SLFLHSSLGP L CKPLPCPLK+ IH+ AL     +TRIFSRS  FPVKLCPSL+ SRSITCSSTPSNEGVVSVINFEDLVEKDFSFLDS+DF SIEE
Subjt:  MSSLFLHSSLGPSLTCKPLPCPLKRHIHILALHSSHRNTRIFSRSVFFPVKLCPSLNTSRSITCSSTPSNEGVVSVINFEDLVEKDFSFLDSEDFSSIEE

Query:  HDQKIKRIISAGEIAESSQVMVSISSEGFVNQLYDSAPCRSLLVVHDSILTLACIKEKYDKVRCWQGEVIYVPEKWGPFDVVFLYYLPAMPFELDAIFGA
        HDQKI+RIISAGEIAESSQVMV+ISSEGFV++LYDSAPCRSLLVVHD+IL LACIKEKYDKV+CWQGEVIYVPEKWGPFDVVFLYYLPAMPFELD IFG 
Subjt:  HDQKIKRIISAGEIAESSQVMVSISSEGFVNQLYDSAPCRSLLVVHDSILTLACIKEKYDKVRCWQGEVIYVPEKWGPFDVVFLYYLPAMPFELDAIFGA

Query:  LSERCVPGARLVISHPSGRKALEQEQQQFPDVVVSDLPDKTSLQKVAADHSFGLTEFVDEDAFYLAILKFNKE
        L++RC+PG RLVISHP GRKAL+QEQQQF DVVVSDLPD+T+LQKVAADHSFGLTEFVDED FYLA+LKFNK+
Subjt:  LSERCVPGARLVISHPSGRKALEQEQQQFPDVVVSDLPDKTSLQKVAADHSFGLTEFVDEDAFYLAILKFNKE

XP_022940190.1 uncharacterized protein LOC111445889 [Cucurbita moschata]4.5e-13085.35Show/hide
Query:  MSSLFLHSSLGPSLTCKPLPCPLKRHIHILALHSSHRNTRIFSRSVFFPVKLCPSLNTSRSITCSSTPSNEGVVSVINFEDLVEKDFSFLDSEDFSSIEE
        M+SLFLHSSLGP L CKPLPCPLK  IH+ AL     +TRIFSRS  FPVKLCPSL+ SRSITCSSTPSNEGVVSVINFEDLVEKDFSFLDS+DF SIEE
Subjt:  MSSLFLHSSLGPSLTCKPLPCPLKRHIHILALHSSHRNTRIFSRSVFFPVKLCPSLNTSRSITCSSTPSNEGVVSVINFEDLVEKDFSFLDSEDFSSIEE

Query:  HDQKIKRIISAGEIAESSQVMVSISSEGFVNQLYDSAPCRSLLVVHDSILTLACIKEKYDKVRCWQGEVIYVPEKWGPFDVVFLYYLPAMPFELDAIFGA
        HD+KI+RIISAGEIAESSQVMV+ISSEGFV++LYDSAPCRSLLVVHD+IL LACIKEKYDKV+CWQGEVIYVPEKWGPFDVVFLYYLPAMPFELDAIFGA
Subjt:  HDQKIKRIISAGEIAESSQVMVSISSEGFVNQLYDSAPCRSLLVVHDSILTLACIKEKYDKVRCWQGEVIYVPEKWGPFDVVFLYYLPAMPFELDAIFGA

Query:  LSERCVPGARLVISHPSGRKALEQEQQQFPDVVVSDLPDKTSLQKVAADHSFGLTEFVDEDAFYLAILKFNKE
        L++RC+PG RLVISHP GRK L+QEQQQF DVVVSDLPD+T+LQKVAADHSFGLTEFVDED FYLA+LKFNK+
Subjt:  LSERCVPGARLVISHPSGRKALEQEQQQFPDVVVSDLPDKTSLQKVAADHSFGLTEFVDEDAFYLAILKFNKE

XP_022971263.1 uncharacterized protein LOC111470036 [Cucurbita maxima]5.9e-13085.71Show/hide
Query:  MSSLFLHSSLGPSLTCKPLPCPLKRHIHILALHSSHRNTRIFSRSVFFPVKLCPSLNTSRSITCSSTPSNEGVVSVINFEDLVEKDFSFLDSEDFSSIEE
        M+SLFLHSSLGP L CKPLPCPLKR IH+ AL     +TRIFSRS   PVKLCPSL+ SRSITCSSTPSNEGVVSVINFEDLVEKDFSFLDS+DF SIEE
Subjt:  MSSLFLHSSLGPSLTCKPLPCPLKRHIHILALHSSHRNTRIFSRSVFFPVKLCPSLNTSRSITCSSTPSNEGVVSVINFEDLVEKDFSFLDSEDFSSIEE

Query:  HDQKIKRIISAGEIAESSQVMVSISSEGFVNQLYDSAPCRSLLVVHDSILTLACIKEKYDKVRCWQGEVIYVPEKWGPFDVVFLYYLPAMPFELDAIFGA
        HD+KI+RIISAGEIAESSQVMV+ISSEGFV++LYDSAPCRSLLVVHD+IL LACIKEKYDKV+CWQGEVIYVPEKWGPFDVVFLYYLPAMPFELDAIFGA
Subjt:  HDQKIKRIISAGEIAESSQVMVSISSEGFVNQLYDSAPCRSLLVVHDSILTLACIKEKYDKVRCWQGEVIYVPEKWGPFDVVFLYYLPAMPFELDAIFGA

Query:  LSERCVPGARLVISHPSGRKALEQEQQQFPDVVVSDLPDKTSLQKVAADHSFGLTEFVDEDAFYLAILKFNKE
        L++RC+PG RLVISHP GRKAL+QEQQQF DVVVSDLPD+T+LQKVAADHSFGLTEFVDED FYLA+LKFNK+
Subjt:  LSERCVPGARLVISHPSGRKALEQEQQQFPDVVVSDLPDKTSLQKVAADHSFGLTEFVDEDAFYLAILKFNKE

XP_023540915.1 uncharacterized protein LOC111801154 [Cucurbita pepo subsp. pepo]2.9e-12984.62Show/hide
Query:  MSSLFLHSSLGPSLTCKPLPCPLKRHIHILALHSSHRNTRIFSRSVFFPVKLCPSLNTSRSITCSSTPSNEGVVSVINFEDLVEKDFSFLDSEDFSSIEE
        M+SLFLHSSLGP L CKPLPCPLKR IH+ AL     +TR+FSRS  FPVKLCPSL+ SRSITCSSTPSNEGVVSVINFEDLVEKDFSFLDS+DF  IEE
Subjt:  MSSLFLHSSLGPSLTCKPLPCPLKRHIHILALHSSHRNTRIFSRSVFFPVKLCPSLNTSRSITCSSTPSNEGVVSVINFEDLVEKDFSFLDSEDFSSIEE

Query:  HDQKIKRIISAGEIAESSQVMVSISSEGFVNQLYDSAPCRSLLVVHDSILTLACIKEKYDKVRCWQGEVIYVPEKWGPFDVVFLYYLPAMPFELDAIFGA
        HD+KI+RIISAGEIAESSQVMV+ISSEGFV++LY+SAPCRSLLVVHD+IL LACIKEKYDKV+CWQGEVIYVPEKWGPFDVVFLYYLPAMPFELDAIFGA
Subjt:  HDQKIKRIISAGEIAESSQVMVSISSEGFVNQLYDSAPCRSLLVVHDSILTLACIKEKYDKVRCWQGEVIYVPEKWGPFDVVFLYYLPAMPFELDAIFGA

Query:  LSERCVPGARLVISHPSGRKALEQEQQQFPDVVVSDLPDKTSLQKVAADHSFGLTEFVDEDAFYLAILKFNKE
        L++RC+PG RLVISHP GRK L+QEQQQF DVVVSDLPD+T+LQKVAADHSFGLTEFVDED FYLA+LKFNK+
Subjt:  LSERCVPGARLVISHPSGRKALEQEQQQFPDVVVSDLPDKTSLQKVAADHSFGLTEFVDEDAFYLAILKFNKE

XP_038905691.1 uncharacterized protein LOC120091662 [Benincasa hispida]6.6e-12180.59Show/hide
Query:  MSSLFLHSSLGPSLTCKPLPCPLKRHIHILALHSSHRNTRIFSRSVFFPVKLCPSLNTSRSITCSSTPSNEGVVSVINFEDLVEKDFSFLDSEDFSSIEE
        M+SLFLHSSLGPS TCKP PC LKR+IHI  L  SH NTRIF+  V FP KL P+++ ++SITC STP +EGVVSVINFEDLVEKDFSFLDS++FSS EE
Subjt:  MSSLFLHSSLGPSLTCKPLPCPLKRHIHILALHSSHRNTRIFSRSVFFPVKLCPSLNTSRSITCSSTPSNEGVVSVINFEDLVEKDFSFLDSEDFSSIEE

Query:  HDQKIKRIISAGEIAESSQVMVSISSEGFVNQLYDSAPCRSLLVVHDSILTLACIKEKYDKVRCWQGEVIYVPEKWGPFDVVFLYYLPAMPFELDAIFGA
        HDQKI+RIISAGEI ESSQVMVSISSEGFV+QL+D APCRSLLVVHDSILTLACIKEKYDKV+CWQGEVIYVPEKWGPFDVVFLYYLPAMPFELDAIF A
Subjt:  HDQKIKRIISAGEIAESSQVMVSISSEGFVNQLYDSAPCRSLLVVHDSILTLACIKEKYDKVRCWQGEVIYVPEKWGPFDVVFLYYLPAMPFELDAIFGA

Query:  LSERCVPGARLVISHPSGRKALEQEQQQFPDVVVSDLPDKTSLQKVAADHSFGLTEFVDEDAFYLAILKFNKE
        LS+RCV GARLVISHP+GRK LEQEQQQFPDVVVSDLP++ +L+  AADHSF LTEF+DE  FYLA+LKF+K+
Subjt:  LSERCVPGARLVISHPSGRKALEQEQQQFPDVVVSDLPDKTSLQKVAADHSFGLTEFVDEDAFYLAILKFNKE

TrEMBL top hitse value%identityAlignment
A0A1S3B7V6 uncharacterized protein LOC1034867472.0e-11579.27Show/hide
Query:  MSSLFLHSSLGPSLTCKPLPCPLK--RHIHILALHSSHRNTRIFSRSVFFPVKLCPSLNTSRSITCSSTPSNEGVVSVINFEDLVEKDFSFLDSEDFSSI
        M+SLFLHSSL PSLTC P   PLK  R IH+ A   SH NTRIF+ S+ F     P+L+ S S   SSTPSNEGVVSV+NFEDLVEKDFSFLDS+DFSS+
Subjt:  MSSLFLHSSLGPSLTCKPLPCPLK--RHIHILALHSSHRNTRIFSRSVFFPVKLCPSLNTSRSITCSSTPSNEGVVSVINFEDLVEKDFSFLDSEDFSSI

Query:  EEHDQKIKRIISAGEIAESSQVMVSISSEGFVNQLYDSAPCRSLLVVHDSILTLACIKEKYDKVRCWQGEVIYVPEKWGPFDVVFLYYLPAMPFELDAIF
        EEHD+KI+RIISAGEI ESSQVMVSISSEGFV+QL+  AP RSLLVVHDSILTLACIKEKYDKV+CWQGE+IYVPEKWGPFD VFLYYLPAMPFELDAIF
Subjt:  EEHDQKIKRIISAGEIAESSQVMVSISSEGFVNQLYDSAPCRSLLVVHDSILTLACIKEKYDKVRCWQGEVIYVPEKWGPFDVVFLYYLPAMPFELDAIF

Query:  GALSERCVPGARLVISHPSGRKALEQEQQQFPDVVVSDLPDKTSLQKVAADHSFGLTEFVDEDAFYLAILKFNKE
        GALSERCV GARLVISHP GRKALEQE+QQFPDVVVSDLPD+ +LQK AADHSF LTEF+DE  FYLAILKFNK+
Subjt:  GALSERCVPGARLVISHPSGRKALEQEQQQFPDVVVSDLPDKTSLQKVAADHSFGLTEFVDEDAFYLAILKFNKE

A0A5D3DP12 Uncharacterized protein2.0e-11579.27Show/hide
Query:  MSSLFLHSSLGPSLTCKPLPCPLK--RHIHILALHSSHRNTRIFSRSVFFPVKLCPSLNTSRSITCSSTPSNEGVVSVINFEDLVEKDFSFLDSEDFSSI
        M+SLFLHSSL PSLTC P   PLK  R IH+ A   SH NTRIF+ S+ F     P+L+ S S   SSTPSNEGVVSV+NFEDLVEKDFSFLDS+DFSS+
Subjt:  MSSLFLHSSLGPSLTCKPLPCPLK--RHIHILALHSSHRNTRIFSRSVFFPVKLCPSLNTSRSITCSSTPSNEGVVSVINFEDLVEKDFSFLDSEDFSSI

Query:  EEHDQKIKRIISAGEIAESSQVMVSISSEGFVNQLYDSAPCRSLLVVHDSILTLACIKEKYDKVRCWQGEVIYVPEKWGPFDVVFLYYLPAMPFELDAIF
        EEHD+KI+RIISAGEI ESSQVMVSISSEGFV+QL+  AP RSLLVVHDSILTLACIKEKYDKV+CWQGE+IYVPEKWGPFD VFLYYLPAMPFELDAIF
Subjt:  EEHDQKIKRIISAGEIAESSQVMVSISSEGFVNQLYDSAPCRSLLVVHDSILTLACIKEKYDKVRCWQGEVIYVPEKWGPFDVVFLYYLPAMPFELDAIF

Query:  GALSERCVPGARLVISHPSGRKALEQEQQQFPDVVVSDLPDKTSLQKVAADHSFGLTEFVDEDAFYLAILKFNKE
        GALSERCV GARLVISHP GRKALEQE+QQFPDVVVSDLPD+ +LQK AADHSF LTEF+DE  FYLAILKFNK+
Subjt:  GALSERCVPGARLVISHPSGRKALEQEQQQFPDVVVSDLPDKTSLQKVAADHSFGLTEFVDEDAFYLAILKFNKE

A0A6J1DD17 uncharacterized protein LOC1110193617.8e-12080.95Show/hide
Query:  MSSLFLHSSLGPSLTCKPLPCPLKRHIHILALHSSHRNTRIFSRSVFFPVKLCPSLNTSRSITCSSTPSNEGVVSVINFEDLVEKDFSFLDSEDFSSIEE
        M+SL LHSSLGPSLT KPLPC LKR + I  L  SHR+ RIFSRS    V L PSL+ S+ +TC+STPSNEGVVSVINFEDLVEKDFSFLDS+DFSS EE
Subjt:  MSSLFLHSSLGPSLTCKPLPCPLKRHIHILALHSSHRNTRIFSRSVFFPVKLCPSLNTSRSITCSSTPSNEGVVSVINFEDLVEKDFSFLDSEDFSSIEE

Query:  HDQKIKRIISAGEIAESSQVMVSISSEGFVNQLYDSAPCRSLLVVHDSILTLACIKEKYDKVRCWQGEVIYVPEKWGPFDVVFLYYLPAMPFELDAIFGA
        +DQKI+RIISAGE+AESSQVMVSI SEGFV+QL+DSAPCRSLLV+HDSILTLACIKEKYDKV+CWQGEVIYVPEKWGPFDVVFLYYLPAMPFELDAIFG 
Subjt:  HDQKIKRIISAGEIAESSQVMVSISSEGFVNQLYDSAPCRSLLVVHDSILTLACIKEKYDKVRCWQGEVIYVPEKWGPFDVVFLYYLPAMPFELDAIFGA

Query:  LSERCVPGARLVISHPSGRKALEQEQQQFPDVVVSDLPDKTSLQKVAADHSFGLTEFVDEDAFYLAILKFNKE
        LS+RCVPGARLVISHP+G+ ALE+EQQQFPDVVVS LPDK +LQKVAADHS  LTEFVD++ FYLA+LKF+K+
Subjt:  LSERCVPGARLVISHPSGRKALEQEQQQFPDVVVSDLPDKTSLQKVAADHSFGLTEFVDEDAFYLAILKFNKE

A0A6J1FPV5 uncharacterized protein LOC1114458892.2e-13085.35Show/hide
Query:  MSSLFLHSSLGPSLTCKPLPCPLKRHIHILALHSSHRNTRIFSRSVFFPVKLCPSLNTSRSITCSSTPSNEGVVSVINFEDLVEKDFSFLDSEDFSSIEE
        M+SLFLHSSLGP L CKPLPCPLK  IH+ AL     +TRIFSRS  FPVKLCPSL+ SRSITCSSTPSNEGVVSVINFEDLVEKDFSFLDS+DF SIEE
Subjt:  MSSLFLHSSLGPSLTCKPLPCPLKRHIHILALHSSHRNTRIFSRSVFFPVKLCPSLNTSRSITCSSTPSNEGVVSVINFEDLVEKDFSFLDSEDFSSIEE

Query:  HDQKIKRIISAGEIAESSQVMVSISSEGFVNQLYDSAPCRSLLVVHDSILTLACIKEKYDKVRCWQGEVIYVPEKWGPFDVVFLYYLPAMPFELDAIFGA
        HD+KI+RIISAGEIAESSQVMV+ISSEGFV++LYDSAPCRSLLVVHD+IL LACIKEKYDKV+CWQGEVIYVPEKWGPFDVVFLYYLPAMPFELDAIFGA
Subjt:  HDQKIKRIISAGEIAESSQVMVSISSEGFVNQLYDSAPCRSLLVVHDSILTLACIKEKYDKVRCWQGEVIYVPEKWGPFDVVFLYYLPAMPFELDAIFGA

Query:  LSERCVPGARLVISHPSGRKALEQEQQQFPDVVVSDLPDKTSLQKVAADHSFGLTEFVDEDAFYLAILKFNKE
        L++RC+PG RLVISHP GRK L+QEQQQF DVVVSDLPD+T+LQKVAADHSFGLTEFVDED FYLA+LKFNK+
Subjt:  LSERCVPGARLVISHPSGRKALEQEQQQFPDVVVSDLPDKTSLQKVAADHSFGLTEFVDEDAFYLAILKFNKE

A0A6J1I6B9 uncharacterized protein LOC1114700362.9e-13085.71Show/hide
Query:  MSSLFLHSSLGPSLTCKPLPCPLKRHIHILALHSSHRNTRIFSRSVFFPVKLCPSLNTSRSITCSSTPSNEGVVSVINFEDLVEKDFSFLDSEDFSSIEE
        M+SLFLHSSLGP L CKPLPCPLKR IH+ AL     +TRIFSRS   PVKLCPSL+ SRSITCSSTPSNEGVVSVINFEDLVEKDFSFLDS+DF SIEE
Subjt:  MSSLFLHSSLGPSLTCKPLPCPLKRHIHILALHSSHRNTRIFSRSVFFPVKLCPSLNTSRSITCSSTPSNEGVVSVINFEDLVEKDFSFLDSEDFSSIEE

Query:  HDQKIKRIISAGEIAESSQVMVSISSEGFVNQLYDSAPCRSLLVVHDSILTLACIKEKYDKVRCWQGEVIYVPEKWGPFDVVFLYYLPAMPFELDAIFGA
        HD+KI+RIISAGEIAESSQVMV+ISSEGFV++LYDSAPCRSLLVVHD+IL LACIKEKYDKV+CWQGEVIYVPEKWGPFDVVFLYYLPAMPFELDAIFGA
Subjt:  HDQKIKRIISAGEIAESSQVMVSISSEGFVNQLYDSAPCRSLLVVHDSILTLACIKEKYDKVRCWQGEVIYVPEKWGPFDVVFLYYLPAMPFELDAIFGA

Query:  LSERCVPGARLVISHPSGRKALEQEQQQFPDVVVSDLPDKTSLQKVAADHSFGLTEFVDEDAFYLAILKFNKE
        L++RC+PG RLVISHP GRKAL+QEQQQF DVVVSDLPD+T+LQKVAADHSFGLTEFVDED FYLA+LKFNK+
Subjt:  LSERCVPGARLVISHPSGRKALEQEQQQFPDVVVSDLPDKTSLQKVAADHSFGLTEFVDEDAFYLAILKFNKE

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G41950.1 unknown protein3.0e-7161.68Show/hide
Query:  RSITCSSTPSNEGVVSVINFEDLVEKDFSFLDSEDFSSIEEHDQKIKRIISAGEIAESSQVMVSISSEGFVNQLYDSAPCRSLLVVHDSILTLACIKEKY
        R+   SS+   EG VSV++F    EKD+SFL+S +  S  EH QKI+RII AGE++ESS+V+VSISSE FV++L +S+P + LL+VHDS+ TLACIKEKY
Subjt:  RSITCSSTPSNEGVVSVINFEDLVEKDFSFLDSEDFSSIEEHDQKIKRIISAGEIAESSQVMVSISSEGFVNQLYDSAPCRSLLVVHDSILTLACIKEKY

Query:  DKVRCWQGEVIYVPEKWGPFDVVFLYYLPAMPFELDAIFGALSERCVPGARLVISHPSGRKALEQEQQQFPDVVVSDLPDKTSLQKVAADHSFGLTEFVD
        DKV+CWQGE+IYVPEKW P D VFLY+LPA+PF+LD +F  LS+RC  GAR+VISHP GR  LEQ++++F DVVVSDLPD+++L  VA  HSF LT+FVD
Subjt:  DKVRCWQGEVIYVPEKWGPFDVVFLYYLPAMPFELDAIFGALSERCVPGARLVISHPSGRKALEQEQQQFPDVVVSDLPDKTSLQKVAADHSFGLTEFVD

Query:  EDAFYLAILKFNKE
        E   YLA+LK +K+
Subjt:  EDAFYLAILKFNKE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGTTCTCTTTTCTTACATTCTTCTCTTGGTCCATCACTTACTTGCAAGCCTCTGCCATGTCCTCTTAAGAGGCATATACATATACTTGCACTACATAGTTCTCATCG
CAATACTCGGATATTTTCTCGTTCGGTATTTTTTCCTGTCAAATTGTGTCCATCCTTAAATACTAGCAGATCAATCACCTGTTCTTCAACTCCTTCAAATGAGGGTGTAG
TATCAGTGATCAATTTTGAAGATTTAGTTGAGAAGGATTTTTCGTTTCTCGATTCAGAGGATTTCAGTTCCATAGAAGAGCATGATCAGAAGATTAAGCGCATCATTTCT
GCTGGGGAGATTGCAGAAAGTTCTCAGGTTATGGTTTCCATTTCTTCAGAAGGATTTGTTAATCAGTTGTATGACTCAGCTCCTTGCCGAAGTTTACTTGTTGTCCATGA
TTCTATTCTAACACTAGCTTGTATTAAAGAAAAATATGACAAAGTTAGGTGCTGGCAAGGAGAAGTTATATATGTACCAGAAAAATGGGGACCTTTCGACGTTGTATTTC
TCTATTATCTGCCAGCAATGCCTTTTGAACTTGACGCAATCTTTGGAGCACTCTCTGAACGTTGTGTGCCAGGTGCAAGACTAGTTATTAGCCATCCCAGCGGAAGGAAA
GCGTTAGAGCAAGAACAGCAACAGTTCCCAGATGTCGTAGTTTCGGATTTACCTGATAAGACTTCTTTGCAGAAAGTTGCTGCAGATCACTCTTTTGGTCTGACTGAATT
TGTAGACGAGGATGCCTTTTACCTTGCAATTTTGAAGTTCAACAAGGAATAG
mRNA sequenceShow/hide mRNA sequence
ATGAGTTCTCTTTTCTTACATTCTTCTCTTGGTCCATCACTTACTTGCAAGCCTCTGCCATGTCCTCTTAAGAGGCATATACATATACTTGCACTACATAGTTCTCATCG
CAATACTCGGATATTTTCTCGTTCGGTATTTTTTCCTGTCAAATTGTGTCCATCCTTAAATACTAGCAGATCAATCACCTGTTCTTCAACTCCTTCAAATGAGGGTGTAG
TATCAGTGATCAATTTTGAAGATTTAGTTGAGAAGGATTTTTCGTTTCTCGATTCAGAGGATTTCAGTTCCATAGAAGAGCATGATCAGAAGATTAAGCGCATCATTTCT
GCTGGGGAGATTGCAGAAAGTTCTCAGGTTATGGTTTCCATTTCTTCAGAAGGATTTGTTAATCAGTTGTATGACTCAGCTCCTTGCCGAAGTTTACTTGTTGTCCATGA
TTCTATTCTAACACTAGCTTGTATTAAAGAAAAATATGACAAAGTTAGGTGCTGGCAAGGAGAAGTTATATATGTACCAGAAAAATGGGGACCTTTCGACGTTGTATTTC
TCTATTATCTGCCAGCAATGCCTTTTGAACTTGACGCAATCTTTGGAGCACTCTCTGAACGTTGTGTGCCAGGTGCAAGACTAGTTATTAGCCATCCCAGCGGAAGGAAA
GCGTTAGAGCAAGAACAGCAACAGTTCCCAGATGTCGTAGTTTCGGATTTACCTGATAAGACTTCTTTGCAGAAAGTTGCTGCAGATCACTCTTTTGGTCTGACTGAATT
TGTAGACGAGGATGCCTTTTACCTTGCAATTTTGAAGTTCAACAAGGAATAG
Protein sequenceShow/hide protein sequence
MSSLFLHSSLGPSLTCKPLPCPLKRHIHILALHSSHRNTRIFSRSVFFPVKLCPSLNTSRSITCSSTPSNEGVVSVINFEDLVEKDFSFLDSEDFSSIEEHDQKIKRIIS
AGEIAESSQVMVSISSEGFVNQLYDSAPCRSLLVVHDSILTLACIKEKYDKVRCWQGEVIYVPEKWGPFDVVFLYYLPAMPFELDAIFGALSERCVPGARLVISHPSGRK
ALEQEQQQFPDVVVSDLPDKTSLQKVAADHSFGLTEFVDEDAFYLAILKFNKE