CuGenDBv2

Tan0004601 (gene) of Snake gourd v1 genome

Gene ID: Tan0004601
Organism: Trichosanthes anguina (Snake gourd v1)
Description: Peroxyureidoacrylate/ureidoacrylate amidohydrolase RutB
Genome location: LG06:2223235..2225873
RNA-Seq Expression: Tan0004601
Synteny: Tan0004601
Gene Ontology terms: GO:0016020 - membrane (cellular component)
InterPro domains: IPR000868 - Isochorismatase-like
                  IPR036380 - Isochorismatase-like superfamily


Homology
GenBank top hits (e value, % identity, alignment)
KAG6593471.1 putative inactive nicotinamidase, partial [Cucurbita argyrosperma subsp. sororia], e value: 4.0e-92, % identity: 85.13
Query:  LEMANQWNRTALLVIDMQRDFFDERSVLVVPGAYAILPSVYDAVEIARKRGMFIVWVVREHDPEGRDVELFRRHHYGSGKQNPVAKGSLGAELVEGLEMK
        +EMAN+W+RTALL+ID QRDFFDERSV  VPGAYAILPSVYDA+E ARKRGMF+VWVVREHDPEGRDVE FRRH YGSGKQNPV+KGS+GAEL+EG E+K
Subjt:  LEMANQWNRTALLVIDMQRDFFDERSVLVVPGAYAILPSVYDAVEIARKRGMFIVWVVREHDPEGRDVELFRRHHYGSGKQNPVAKGSLGAELVEGLEMK

Query:  EGEYKLVKTRFSAFFNTNLHSLLQGAGITDLVIAGVQTPNCIRQTVFDAVSLDYHSITLLYDATVAATPKIHHDNITDMENVGVVVVRVDKWGES
        EGEYKLVKTRFSAFFNTNLHSLLQ  GITDLVI GVQTPNCIRQTVFDA+SLDYHSIT+LYDAT AA+ +IHHDNITDMENVGVVVVRVD+W  S
Subjt:  EGEYKLVKTRFSAFFNTNLHSLLQGAGITDLVIAGVQTPNCIRQTVFDAVSLDYHSITLLYDATVAATPKIHHDNITDMENVGVVVVRVDKWGES

XP_022154038.1 probable inactive nicotinamidase At3g16190 [Momordica charantia], e value: 3.4e-91, % identity: 86.6
Query:  MANQWNRTALLVIDMQRDFFDERSVLVVPGAYAILPSVYDAVEIARKRGMFIVWVVREHDPEGRDVELFRRHHYGSGKQNPVAKGSLGAELVEGLEMKEG
        MANQWN TALLVIDMQRDFFDERSV+ +PGAYAI+PSV DAVEIARKR M IVWVVREHDPEGRDVE FRRH YGSGKQNPVAKGS GAELV+GLE+KEG
Subjt:  MANQWNRTALLVIDMQRDFFDERSVLVVPGAYAILPSVYDAVEIARKRGMFIVWVVREHDPEGRDVELFRRHHYGSGKQNPVAKGSLGAELVEGLEMKEG

Query:  EYKLVKTRFSAFFNTNLHSLLQGAGITDLVIAGVQTPNCIRQTVFDAVSLDYHSITLLYDATVAATPKIHHDNITDMENVGVVVVRVDKWGESD
        EYKLVKTRFSAFFNTNLHSLLQ  GI +LV+AGVQTPNCIRQTVFDAV+LDYHSITLL+DAT AATPK HHDNI DMENVGVV  RVD+WGESD
Subjt:  EYKLVKTRFSAFFNTNLHSLLQGAGITDLVIAGVQTPNCIRQTVFDAVSLDYHSITLLYDATVAATPKIHHDNITDMENVGVVVVRVDKWGESD

XP_022964605.1 probable inactive nicotinamidase At3g16190 [Cucurbita moschata], e value: 4.0e-92, % identity: 85.13
Query:  LEMANQWNRTALLVIDMQRDFFDERSVLVVPGAYAILPSVYDAVEIARKRGMFIVWVVREHDPEGRDVELFRRHHYGSGKQNPVAKGSLGAELVEGLEMK
        +EMAN+W+RTALL+ID QRDFFDERSV  VPGAYAILPSVYDA+E ARKRGMF+VWVVREHDPEGRDVE FRRH YGSGKQNPV+KGS+GAEL+EG E+K
Subjt:  LEMANQWNRTALLVIDMQRDFFDERSVLVVPGAYAILPSVYDAVEIARKRGMFIVWVVREHDPEGRDVELFRRHHYGSGKQNPVAKGSLGAELVEGLEMK

Query:  EGEYKLVKTRFSAFFNTNLHSLLQGAGITDLVIAGVQTPNCIRQTVFDAVSLDYHSITLLYDATVAATPKIHHDNITDMENVGVVVVRVDKWGES
        EGEYKLVKTRFSAFFNTNLHSLLQ  GITDLVI GVQTPNCIRQTVFDA+SLDYHSIT+LYDAT AA+ +IHHDNITDMENVGVVVVRVD+W  S
Subjt:  EGEYKLVKTRFSAFFNTNLHSLLQGAGITDLVIAGVQTPNCIRQTVFDAVSLDYHSITLLYDATVAATPKIHHDNITDMENVGVVVVRVDKWGES

XP_023000390.1 probable inactive nicotinamidase At3g16190 [Cucurbita maxima], e value: 1.2e-91, % identity: 84.62
Query:  LEMANQWNRTALLVIDMQRDFFDERSVLVVPGAYAILPSVYDAVEIARKRGMFIVWVVREHDPEGRDVELFRRHHYGSGKQNPVAKGSLGAELVEGLEMK
        +EMAN+W+ TALL+ID QRDFFDERSV  VPGAYAILPSVYDA+E ARKRGMF+VWVVREHDPEGRDVE FRRH YGSGKQNPV+KGS+GAEL+EG E+K
Subjt:  LEMANQWNRTALLVIDMQRDFFDERSVLVVPGAYAILPSVYDAVEIARKRGMFIVWVVREHDPEGRDVELFRRHHYGSGKQNPVAKGSLGAELVEGLEMK

Query:  EGEYKLVKTRFSAFFNTNLHSLLQGAGITDLVIAGVQTPNCIRQTVFDAVSLDYHSITLLYDATVAATPKIHHDNITDMENVGVVVVRVDKWGES
        EGEYKLVKTRFSAFFNTNLHSLLQ +GITDLVI GVQTPNCIRQTVFDA+SLDYHSIT+LYDAT AA+ +IHHDNITDMENVGVVVVRVD+W  S
Subjt:  EGEYKLVKTRFSAFFNTNLHSLLQGAGITDLVIAGVQTPNCIRQTVFDAVSLDYHSITLLYDATVAATPKIHHDNITDMENVGVVVVRVDKWGES

XP_023515292.1 probable inactive nicotinamidase At3g16190 [Cucurbita pepo subsp. pepo], e value: 1.5e-91, % identity: 84.62
Query:  LEMANQWNRTALLVIDMQRDFFDERSVLVVPGAYAILPSVYDAVEIARKRGMFIVWVVREHDPEGRDVELFRRHHYGSGKQNPVAKGSLGAELVEGLEMK
        +EMAN+W+RTALL+ID QRDFFDERSV  VPGAYAILPSVYDA+E ARKRGMF+VWVVREHDPEGRDVE FRRH YGSGKQNPV+KGS+GAEL+EG E+K
Subjt:  LEMANQWNRTALLVIDMQRDFFDERSVLVVPGAYAILPSVYDAVEIARKRGMFIVWVVREHDPEGRDVELFRRHHYGSGKQNPVAKGSLGAELVEGLEMK

Query:  EGEYKLVKTRFSAFFNTNLHSLLQGAGITDLVIAGVQTPNCIRQTVFDAVSLDYHSITLLYDATVAATPKIHHDNITDMENVGVVVVRVDKWGES
        EGEYKLVKTRFSAFF+TNLHSLLQ  GITDLVI GVQTPNCIRQTVFDA+SLDYHSIT+LYDAT AA+ +IHHDNITDMENVGVVVVRVD+W  S
Subjt:  EGEYKLVKTRFSAFFNTNLHSLLQGAGITDLVIAGVQTPNCIRQTVFDAVSLDYHSITLLYDATVAATPKIHHDNITDMENVGVVVVRVDKWGES

TrEMBL top hits (e value, % identity, alignment)
A0A0A0K7E5 Isochorismatase domain-containing protein, e value: 5.4e-87, % identity: 82.81
Query:  MANQWNRTALLVIDMQRDFFDERSVLVVPGAYAILPSVYDAVEIARKRGMFIVWVVREHDPEGRDVELFRRHHYGSGKQNPVAKGSLGAELVEGLEMKEG
        M +QW RTALLVIDMQ DFFDE S   VPGA  I+PSV +A+EIAR RG+FI+WVVREHD EGRDVE FRRH+YG+GK NP  KGS GAELVEGLE+KEG
Subjt:  MANQWNRTALLVIDMQRDFFDERSVLVVPGAYAILPSVYDAVEIARKRGMFIVWVVREHDPEGRDVELFRRHHYGSGKQNPVAKGSLGAELVEGLEMKEG

Query:  EYKLVKTRFSAFFNTNLHSLLQGAGITDLVIAGVQTPNCIRQTVFDAVSLDYHSITLLYDATVAATPKIHHDNITDMENVGVVVVRVDKWGE
        EYKLVKTRFSAFFNTNLHSLLQGAGITDLV+ GVQTPNCIRQTVFDAV+LDYHSITLLYDAT AATPKIHHDN TDMENVGVVV RVD+WGE
Subjt:  EYKLVKTRFSAFFNTNLHSLLQGAGITDLVIAGVQTPNCIRQTVFDAVSLDYHSITLLYDATVAATPKIHHDNITDMENVGVVVVRVDKWGE

A0A1S3CP26 probable inactive nicotinamidase At3g16190 isoform X1, e value: 3.2e-87, % identity: 82.05
Query:  MANQWNRTALLVIDMQRDFFDERSVLVVPGAYAILPSVYDAVEIARKRGMFIVWVVREHDPEGRDVELFRRHHYGSGKQNPVAKGSLGAELVEGLEMKEG
        MA+QW RTALLVIDMQRDF DE SV  VPGA  I+PSV  AVEIAR RG+FI+WVVREHD EGRDVE FRRH+YG+GK NP+ KGS GAELV+GLE+KEG
Subjt:  MANQWNRTALLVIDMQRDFFDERSVLVVPGAYAILPSVYDAVEIARKRGMFIVWVVREHDPEGRDVELFRRHHYGSGKQNPVAKGSLGAELVEGLEMKEG

Query:  EYKLVKTRFSAFFNTNLHSLLQGAGITDLVIAGVQTPNCIRQTVFDAVSLDYHSITLLYDATVAATPKIHHDNITDMENVGVVVVRVDKWGESDA
        EYKLVKTRFSAFFNTNL SLLQGAGITDLV+ GVQTPNCIRQTVFDAV+LDYHSITLLYDAT AATPK+HHDNITDM NVGV V RVD+WGESD+
Subjt:  EYKLVKTRFSAFFNTNLHSLLQGAGITDLVIAGVQTPNCIRQTVFDAVSLDYHSITLLYDATVAATPKIHHDNITDMENVGVVVVRVDKWGESDA

A0A6J1DJ62 probable inactive nicotinamidase At3g16190, e value: 1.6e-91, % identity: 86.6
Query:  MANQWNRTALLVIDMQRDFFDERSVLVVPGAYAILPSVYDAVEIARKRGMFIVWVVREHDPEGRDVELFRRHHYGSGKQNPVAKGSLGAELVEGLEMKEG
        MANQWN TALLVIDMQRDFFDERSV+ +PGAYAI+PSV DAVEIARKR M IVWVVREHDPEGRDVE FRRH YGSGKQNPVAKGS GAELV+GLE+KEG
Subjt:  MANQWNRTALLVIDMQRDFFDERSVLVVPGAYAILPSVYDAVEIARKRGMFIVWVVREHDPEGRDVELFRRHHYGSGKQNPVAKGSLGAELVEGLEMKEG

Query:  EYKLVKTRFSAFFNTNLHSLLQGAGITDLVIAGVQTPNCIRQTVFDAVSLDYHSITLLYDATVAATPKIHHDNITDMENVGVVVVRVDKWGESD
        EYKLVKTRFSAFFNTNLHSLLQ  GI +LV+AGVQTPNCIRQTVFDAV+LDYHSITLL+DAT AATPK HHDNI DMENVGVV  RVD+WGESD
Subjt:  EYKLVKTRFSAFFNTNLHSLLQGAGITDLVIAGVQTPNCIRQTVFDAVSLDYHSITLLYDATVAATPKIHHDNITDMENVGVVVVRVDKWGESD

A0A6J1HLA4 probable inactive nicotinamidase At3g16190, e value: 1.9e-92, % identity: 85.13
Query:  LEMANQWNRTALLVIDMQRDFFDERSVLVVPGAYAILPSVYDAVEIARKRGMFIVWVVREHDPEGRDVELFRRHHYGSGKQNPVAKGSLGAELVEGLEMK
        +EMAN+W+RTALL+ID QRDFFDERSV  VPGAYAILPSVYDA+E ARKRGMF+VWVVREHDPEGRDVE FRRH YGSGKQNPV+KGS+GAEL+EG E+K
Subjt:  LEMANQWNRTALLVIDMQRDFFDERSVLVVPGAYAILPSVYDAVEIARKRGMFIVWVVREHDPEGRDVELFRRHHYGSGKQNPVAKGSLGAELVEGLEMK

Query:  EGEYKLVKTRFSAFFNTNLHSLLQGAGITDLVIAGVQTPNCIRQTVFDAVSLDYHSITLLYDATVAATPKIHHDNITDMENVGVVVVRVDKWGES
        EGEYKLVKTRFSAFFNTNLHSLLQ  GITDLVI GVQTPNCIRQTVFDA+SLDYHSIT+LYDAT AA+ +IHHDNITDMENVGVVVVRVD+W  S
Subjt:  EGEYKLVKTRFSAFFNTNLHSLLQGAGITDLVIAGVQTPNCIRQTVFDAVSLDYHSITLLYDATVAATPKIHHDNITDMENVGVVVVRVDKWGES

A0A6J1KI74 probable inactive nicotinamidase At3g16190, e value: 5.6e-92, % identity: 84.62
Query:  LEMANQWNRTALLVIDMQRDFFDERSVLVVPGAYAILPSVYDAVEIARKRGMFIVWVVREHDPEGRDVELFRRHHYGSGKQNPVAKGSLGAELVEGLEMK
        +EMAN+W+ TALL+ID QRDFFDERSV  VPGAYAILPSVYDA+E ARKRGMF+VWVVREHDPEGRDVE FRRH YGSGKQNPV+KGS+GAEL+EG E+K
Subjt:  LEMANQWNRTALLVIDMQRDFFDERSVLVVPGAYAILPSVYDAVEIARKRGMFIVWVVREHDPEGRDVELFRRHHYGSGKQNPVAKGSLGAELVEGLEMK

Query:  EGEYKLVKTRFSAFFNTNLHSLLQGAGITDLVIAGVQTPNCIRQTVFDAVSLDYHSITLLYDATVAATPKIHHDNITDMENVGVVVVRVDKWGES
        EGEYKLVKTRFSAFFNTNLHSLLQ +GITDLVI GVQTPNCIRQTVFDA+SLDYHSIT+LYDAT AA+ +IHHDNITDMENVGVVVVRVD+W  S
Subjt:  EGEYKLVKTRFSAFFNTNLHSLLQGAGITDLVIAGVQTPNCIRQTVFDAVSLDYHSITLLYDATVAATPKIHHDNITDMENVGVVVVRVDKWGES

SwissProt top hits (e value, % identity, alignment)
B7MTF4 Ureidoacrylate amidohydrolase RutB, e value: 5.9e-14, % identity: 31.49
Query:  RTALLVIDMQRDFFDERSVLVVPG-----AYAILPSVYDAVEIARKRGMFIVWVVREHDPE--------------GRDVELFRRHHYGSGKQNPVAKGSL
        +TAL+V+DMQ  +      L + G        ++ ++  AV  AR  GM I+W     D +                 ++  R+     GK   +AKGS 
Subjt:  RTALLVIDMQRDFFDERSVLVVPG-----AYAILPSVYDAVEIARKRGMFIVWVVREHDPE--------------GRDVELFRRHHYGSGKQNPVAKGSL

Query:  GAELVEGLEMKEGEYKLVKTRFSAFFNTNLHSLLQGAGITDLVIAGVQTPNCIRQTVFDAVSLDYHSITLLYDATVAATPK
          +LV+ L  + G+  L K R+S FFNT L S+L+  GI  LV  G+ T  C+  T+ D   L+Y  + +L DAT  A P+
Subjt:  GAELVEGLEMKEGEYKLVKTRFSAFFNTNLHSLLQGAGITDLVIAGVQTPNCIRQTVFDAVSLDYHSITLLYDATVAATPK

B7NLB5 Ureidoacrylate amidohydrolase RutB, e value: 4.5e-14, % identity: 31.49
Query:  RTALLVIDMQRDFFDERSVLVVPG-----AYAILPSVYDAVEIARKRGMFIVWVVREHDPE--------------GRDVELFRRHHYGSGKQNPVAKGSL
        +TAL+V+DMQ  +      L + G        ++ ++  AV  AR  GM I+W     D +                 ++  R+     GK   +AKGS 
Subjt:  RTALLVIDMQRDFFDERSVLVVPG-----AYAILPSVYDAVEIARKRGMFIVWVVREHDPE--------------GRDVELFRRHHYGSGKQNPVAKGSL

Query:  GAELVEGLEMKEGEYKLVKTRFSAFFNTNLHSLLQGAGITDLVIAGVQTPNCIRQTVFDAVSLDYHSITLLYDATVAATPK
          +LV+ L  + G+  L K R+S+FFNT L S+L+  GI  LV  G+ T  C+  T+ D   L+Y  + +L DAT  A P+
Subjt:  GAELVEGLEMKEGEYKLVKTRFSAFFNTNLHSLLQGAGITDLVIAGVQTPNCIRQTVFDAVSLDYHSITLLYDATVAATPK

C4ZQD9 Ureidoacrylate amidohydrolase RutB, e value: 5.9e-14, % identity: 31.49
Query:  RTALLVIDMQRDFFDERSVLVVPG-----AYAILPSVYDAVEIARKRGMFIVWVVREHDPE--------------GRDVELFRRHHYGSGKQNPVAKGSL
        ++AL+V+DMQ  +      L + G        ++ ++  AV  AR  GM I+W     D +                 ++  R+     GK   +AKGS 
Subjt:  RTALLVIDMQRDFFDERSVLVVPG-----AYAILPSVYDAVEIARKRGMFIVWVVREHDPE--------------GRDVELFRRHHYGSGKQNPVAKGSL

Query:  GAELVEGLEMKEGEYKLVKTRFSAFFNTNLHSLLQGAGITDLVIAGVQTPNCIRQTVFDAVSLDYHSITLLYDATVAATPK
          +LV+ L  + G+  L K R+S FFNT L S+L+  GI  LV  G+ T  C+  T+ D   L+Y  + +L DAT  A PK
Subjt:  GAELVEGLEMKEGEYKLVKTRFSAFFNTNLHSLLQGAGITDLVIAGVQTPNCIRQTVFDAVSLDYHSITLLYDATVAATPK

C9QZ65 Ureidoacrylate amidohydrolase RutB, e value: 5.9e-14, % identity: 31.49
Query:  RTALLVIDMQRDFFDERSVLVVPG-----AYAILPSVYDAVEIARKRGMFIVWVVREHDPE--------------GRDVELFRRHHYGSGKQNPVAKGSL
        ++AL+V+DMQ  +      L + G        ++ ++  AV  AR  GM I+W     D +                 ++  R+     GK   +AKGS 
Subjt:  RTALLVIDMQRDFFDERSVLVVPG-----AYAILPSVYDAVEIARKRGMFIVWVVREHDPE--------------GRDVELFRRHHYGSGKQNPVAKGSL

Query:  GAELVEGLEMKEGEYKLVKTRFSAFFNTNLHSLLQGAGITDLVIAGVQTPNCIRQTVFDAVSLDYHSITLLYDATVAATPK
          +LV+ L  + G+  L K R+S FFNT L S+L+  GI  LV  G+ T  C+  T+ D   L+Y  + +L DAT  A PK
Subjt:  GAELVEGLEMKEGEYKLVKTRFSAFFNTNLHSLLQGAGITDLVIAGVQTPNCIRQTVFDAVSLDYHSITLLYDATVAATPK

Q93Z51 Probable inactive nicotinamidase At3g16190, e value: 1.9e-65, % identity: 62.69
Query:  MANQWNRTALLVIDMQRDFFDERSVLVVPGAYAILPSVYDAVEIARKRGMFIVWVVREHDPEGRDVELFRRHHYGSGKQNPVAKGSLGAELVEGLEM-KE
        MA +W  TALLVIDMQ DF +E +V  V G  +I+P+V   VE+AR+RG+ ++WVVREHD +GRDVELFRRH+Y S K  PV KG++GAELV+GL + +E
Subjt:  MANQWNRTALLVIDMQRDFFDERSVLVVPGAYAILPSVYDAVEIARKRGMFIVWVVREHDPEGRDVELFRRHHYGSGKQNPVAKGSLGAELVEGLEM-KE

Query:  GEYKLVKTRFSAFFNTNLHSLLQGAGITDLVIAGVQTPNCIRQTVFDAVSLDYHSITLLYDATVAATPKIHHDNITDMENVGVVVVRVDKWGE
         +YK+VKTRFSAFF+TNLHS LQ +G+T LVIAGVQTPNCIRQTVFDAV+LDY ++T++ DAT AATP+IH  NI DM+N+GV    + +W E
Subjt:  GEYKLVKTRFSAFFNTNLHSLLQGAGITDLVIAGVQTPNCIRQTVFDAVSLDYHSITLLYDATVAATPKIHHDNITDMENVGVVVVRVDKWGE

Arabidopsis top hits (e value, % identity, alignment)
AT3G16190.1 Isochorismatase family protein, e value: 1.4e-66, % identity: 62.69
Query:  MANQWNRTALLVIDMQRDFFDERSVLVVPGAYAILPSVYDAVEIARKRGMFIVWVVREHDPEGRDVELFRRHHYGSGKQNPVAKGSLGAELVEGLEM-KE
        MA +W  TALLVIDMQ DF +E +V  V G  +I+P+V   VE+AR+RG+ ++WVVREHD +GRDVELFRRH+Y S K  PV KG++GAELV+GL + +E
Subjt:  MANQWNRTALLVIDMQRDFFDERSVLVVPGAYAILPSVYDAVEIARKRGMFIVWVVREHDPEGRDVELFRRHHYGSGKQNPVAKGSLGAELVEGLEM-KE

Query:  GEYKLVKTRFSAFFNTNLHSLLQGAGITDLVIAGVQTPNCIRQTVFDAVSLDYHSITLLYDATVAATPKIHHDNITDMENVGVVVVRVDKWGE
         +YK+VKTRFSAFF+TNLHS LQ +G+T LVIAGVQTPNCIRQTVFDAV+LDY ++T++ DAT AATP+IH  NI DM+N+GV    + +W E
Subjt:  GEYKLVKTRFSAFFNTNLHSLLQGAGITDLVIAGVQTPNCIRQTVFDAVSLDYHSITLLYDATVAATPKIHHDNITDMENVGVVVVRVDKWGE
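In the alignment blocks above, the middle line of each triplet marks identical residues with the residue letter, conservative substitutions with "+", and mismatches or gaps with a space. As a minimal sketch, assuming the reported % identity is the number of identical columns divided by the alignment length, the figure could be recomputed from a midline as follows. The function name `midline_identity` is hypothetical and not part of CuGenDB, and the midline must already be cut to the alignment columns (without the leading indentation).

```python
def midline_identity(midline: str) -> tuple[float, float]:
    """Return (% identity, % positives) for a BLAST-style midline.

    Letters are identical columns, '+' are conservative substitutions,
    spaces are mismatches or gaps. The midline must cover exactly the
    aligned columns (no leading indentation).
    """
    columns = len(midline)
    if columns == 0:
        raise ValueError("empty midline")
    identities = sum(ch.isalpha() for ch in midline)
    positives = identities + midline.count("+")
    return 100.0 * identities / columns, 100.0 * positives / columns


# Short fragment taken from the start of the first GenBank midline;
# this is only an illustration, not the full alignment.
fragment = "+EMAN+W+RTALL+ID"
identity, positives = midline_identity(fragment)
print(f"{identity:.2f}% identity, {positives:.2f}% positives")
```

For a hit whose alignment is split over several blocks, the midlines would be concatenated before calling the function.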


Sequences
CDS sequence
ATGGAATTTATTCCCAAGCCATCCTTCTATCCTTCTTGTTCATCTTTCAATCTTTCTTGTAATTTGTGCTTTTTGGAAATGGCGAATCAGTGGAATCGCACTGCTCTTCT
TGTGATCGACATGCAGAGAGATTTCTTTGATGAACGATCTGTCTTAGTTGTACCAGGAGCGTACGCCATACTTCCAAGCGTATATGATGCCGTCGAAATTGCAAGGAAGC
GGGGCATGTTCATCGTTTGGGTTGTCCGAGAGCATGATCCTGAAGGACGAGATGTTGAACTCTTTCGACGCCATCACTATGGCAGTGGGAAGCAGAATCCAGTTGCGAAG
GGAAGCTTAGGGGCGGAGTTAGTAGAAGGCCTTGAAATGAAAGAAGGAGAATACAAGTTGGTGAAAACCAGGTTCAGTGCGTTCTTTAATACAAACTTGCATTCCCTTCT
GCAAGGGGCGGGCATTACTGATTTGGTCATTGCTGGTGTCCAGACGCCAAACTGTATCAGGCAGACTGTTTTTGATGCTGTATCTTTGGATTACCATTCTATCACTCTTC
TTTATGATGCAACAGTAGCTGCTACACCCAAAATTCATCATGATAATATCACTGATATGGAGAATGTGGGAGTTGTGGTCGTAAGAGTTGATAAATGGGGCGAGTCTGAT
GCTTGA
mRNA sequence
AAAAAGACGGCACTGATCATCAGTCCACGAAACTGTTGTATTTATCATCTTTCTTTATGGAATTTATTCCCAAGCCATCCTTCTATCCTTCTTGTTCATCTTTCAATCTT
TCTTGTAATTTGTGCTTTTTGGAAATGGCGAATCAGTGGAATCGCACTGCTCTTCTTGTGATCGACATGCAGAGAGATTTCTTTGATGAACGATCTGTCTTAGTTGTACC
AGGAGCGTACGCCATACTTCCAAGCGTATATGATGCCGTCGAAATTGCAAGGAAGCGGGGCATGTTCATCGTTTGGGTTGTCCGAGAGCATGATCCTGAAGGACGAGATG
TTGAACTCTTTCGACGCCATCACTATGGCAGTGGGAAGCAGAATCCAGTTGCGAAGGGAAGCTTAGGGGCGGAGTTAGTAGAAGGCCTTGAAATGAAAGAAGGAGAATAC
AAGTTGGTGAAAACCAGGTTCAGTGCGTTCTTTAATACAAACTTGCATTCCCTTCTGCAAGGGGCGGGCATTACTGATTTGGTCATTGCTGGTGTCCAGACGCCAAACTG
TATCAGGCAGACTGTTTTTGATGCTGTATCTTTGGATTACCATTCTATCACTCTTCTTTATGATGCAACAGTAGCTGCTACACCCAAAATTCATCATGATAATATCACTG
ATATGGAGAATGTGGGAGTTGTGGTCGTAAGAGTTGATAAATGGGGCGAGTCTGATGCTTGATCCCCATCTGCCATTGGAAATGAAGTTACAATTCCTACATGATATCAG
GGTTGCTGGCAATTGATCTACCGGAATAAATCACGGTTCGATCTCAACCCGCTCACGATTAGATGCATCTCGTTTTCGCATATTAGTATGTAAAGAGTCAAAGACGTTCA
TATTGAAATAATCAAAACGCCTCATGGTTCCTCGTTCTCTATGAAGAACTTTGATTCTCTTCCTCTGTATGATAAAAGAAAACAACCAGCTTTAGTTTTGCAGGGAATAG
CTGGATATAAAGGCGTGACATTCTTTTTCCCTGTGGAGAAAGTGTGGCCATGGTATTTGAATAAGTGAAGTGTGTTTTTCATATTTAAGTAATGTAATCCACTACAATCA
AGATCAAAGGATTGAAATTATCAGTATTGATGACTTGATTAGTCAAGTTGAGTTGATTTAAGGTTTTCATTAACCTTTCAACTCTTTCTATTCTCATTCACCCCTTTCTT
GAAAATAAACTTTTTGTTTCTTCTATGAGATCAACAGCATTTGGAGTGGGGCTCGAACCTATGGTTTTGCATTTTATGACTTGTTTGGG
Protein sequence
MEFIPKPSFYPSCSSFNLSCNLCFLEMANQWNRTALLVIDMQRDFFDERSVLVVPGAYAILPSVYDAVEIARKRGMFIVWVVREHDPEGRDVELFRRHHYGSGKQNPVAK
GSLGAELVEGLEMKEGEYKLVKTRFSAFFNTNLHSLLQGAGITDLVIAGVQTPNCIRQTVFDAVSLDYHSITLLYDATVAATPKIHHDNITDMENVGVVVVRVDKWGESD
A
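The CDS and protein sequences above can be cross-checked by translating the CDS with the standard genetic code. The following is a minimal sketch under that assumption (the page does not state which translation table was used); the names `CODON_TABLE` and `translate` are illustrative only.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)


# First 24 and last 6 nucleotides of the CDS listed above.
print(translate("ATGGAATTTATTCCCAAGCCATCC"))  # MEFIPKPS
print(translate("GCTTGA"))                    # A
```

The two fragments translate to the first eight residues (MEFIPKPS) and the final residue (A) of the protein sequence listed above; passing the full CDS, with line breaks removed, should reproduce the whole protein if the standard code applies.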