CuGenDBv2

Tan0000596 (gene) of Snake gourd v1 genome

Gene ID: Tan0000596
Organism: Trichosanthes anguina (Snake gourd v1)
Description: Encodes a protein whose expression is responsive to nematode infection, putative
Genome location: LG04:24908196..24912324
RNA-Seq Expression: Tan0000596
Synteny: Tan0000596
Gene Ontology terms: NA
InterPro domains: IPR025322 - Protein of unknown function DUF4228, plant
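The genome location field above can be split into its parts programmatically. The following is a minimal sketch, assuming the string follows a `seqid:start..end` layout and that coordinates are 1-based and inclusive (an assumption; the record itself does not state the coordinate convention). The function names `parse_location` and `span_length` are hypothetical helpers, not part of any CuGenDB tool.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'seqid:start..end' string into (seqid, start, end)."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    seqid = match.group(1)
    start = int(match.group(2))
    end = int(match.group(3))
    return seqid, start, end


def span_length(start: int, end: int) -> int:
    """Length of the interval, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


seqid, start, end = parse_location("LG04:24908196..24912324")
print(seqid, start, end, span_length(start, end))
```

Under the inclusive-coordinate assumption, the gene spans 4129 bp on LG04.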


Homology
GenBank top hits (columns: hit, E-value, % identity, alignment)
XP_008454771.1 PREDICTED: uncharacterized protein At1g66480 isoform X1 [Cucumis melo]; E-value 2.0e-85; identity 83.72%
Query:  MGNAFGVKKTVKVMKVSGETMKLNPPVQVGDVVKDYPGFVLLESEAVKHYGVRAKPLEPHQKLSTKRLYFLVELPKVSNQQAPRRVRSAINMSAKDRLES
        MGN FGVKKTVKVMK+SGET+KLN PVQVGDVVKDYPGFVLLESEAVKHYGVRAKPLE HQKLSTKRLYFLV+LPK+  +QAPRRVRSAINMSAKDRLES
Subjt:  MGNAFGVKKTVKVMKVSGETMKLNPPVQVGDVVKDYPGFVLLESEAVKHYGVRAKPLEPHQKLSTKRLYFLVELPKVSNQQAPRRVRSAINMSAKDRLES

Query:  LMLARRSASDLTIMKPKSVLAEEDGGEGSRAGAEATRLKVRLPRAEVEKLLKESKDEAEAAEKIMGLYMDKTREIVSESDRKEEIKKDIIKPRE-KRRVS
        LMLARRSASDLTIMKPKSVL EE GGE   +G+ ATR+KVRLP+AEVE+LLKE KDEAEAAE+IMGLY  KTRE V E+D KE+ KKDIIKPRE KRRVS
Subjt:  LMLARRSASDLTIMKPKSVLAEEDGGEGSRAGAEATRLKVRLPRAEVEKLLKESKDEAEAAEKIMGLYMDKTREIVSESDRKEEIKKDIIKPRE-KRRVS

Query:  FMTTMEARSQIAVTS
        FMTTMEA +QIAV S
Subjt:  FMTTMEARSQIAVTS

XP_008454773.1 PREDICTED: uncharacterized protein At1g66480 isoform X2 [Cucumis melo]; E-value 8.3e-87; identity 84.11%
Query:  MGNAFGVKKTVKVMKVSGETMKLNPPVQVGDVVKDYPGFVLLESEAVKHYGVRAKPLEPHQKLSTKRLYFLVELPKVSNQQAPRRVRSAINMSAKDRLES
        MGN FGVKKTVKVMK+SGET+KLN PVQVGDVVKDYPGFVLLESEAVKHYGVRAKPLE HQKLSTKRLYFLV+LPK+  +QAPRRVRSAINMSAKDRLES
Subjt:  MGNAFGVKKTVKVMKVSGETMKLNPPVQVGDVVKDYPGFVLLESEAVKHYGVRAKPLEPHQKLSTKRLYFLVELPKVSNQQAPRRVRSAINMSAKDRLES

Query:  LMLARRSASDLTIMKPKSVLAEEDGGEGSRAGAEATRLKVRLPRAEVEKLLKESKDEAEAAEKIMGLYMDKTREIVSESDRKEEIKKDIIKPREKRRVSF
        LMLARRSASDLTIMKPKSVL EE GGE   +G+ ATR+KVRLP+AEVE+LLKE KDEAEAAE+IMGLY  KTRE V E+D KE+ KKDIIKPREKRRVSF
Subjt:  LMLARRSASDLTIMKPKSVLAEEDGGEGSRAGAEATRLKVRLPRAEVEKLLKESKDEAEAAEKIMGLYMDKTREIVSESDRKEEIKKDIIKPREKRRVSF

Query:  MTTMEARSQIAVTS
        MTTMEA +QIAV S
Subjt:  MTTMEARSQIAVTS

XP_022941735.1 uncharacterized protein At1g66480-like [Cucurbita moschata]; E-value 4.7e-82; identity 81.57%
Query:  MGNAFGVKKTVKVMKVSGETMKLNPPVQVGDVVKDYPGFVLLESEAVKHYGVRAKPLEPHQKLSTKRLYFLVELPKVSNQQAPRRVRSAINMSAKDRLES
        MGNAFG KKTVKVMKV+GETMKLN PVQ GDVVKDYPGFVLL+SEAVKHYGVRAKPLEPHQ LS KRLYFLV+LPK+ NQ  PRR+RSAI+MSAKDRLES
Subjt:  MGNAFGVKKTVKVMKVSGETMKLNPPVQVGDVVKDYPGFVLLESEAVKHYGVRAKPLEPHQKLSTKRLYFLVELPKVSNQQAPRRVRSAINMSAKDRLES

Query:  LMLARRSASDLTIMKPKSVLAEEDGGEGSRAGAEATRLKVRLPRAEVEKLLKESKDEAEAAEKIM---GLYMDKTREIVSESDRKEEIKKDIIKPREKRR
        LMLARRSASDLTIMKPKSVLAEED GEGS +GA  TRLKVRLPRAEVEKLLKESKD+ EAAEKI+   GLYMDKT +  S+S  K+E K+D IKPREKRR
Subjt:  LMLARRSASDLTIMKPKSVLAEEDGGEGSRAGAEATRLKVRLPRAEVEKLLKESKDEAEAAEKIM---GLYMDKTREIVSESDRKEEIKKDIIKPREKRR

Query:  VSFMTTMEARSQIAVTS
        VSFMTTMEA ++IAV S
Subjt:  VSFMTTMEARSQIAVTS

XP_022942789.1 uncharacterized protein At1g66480 isoform X2 [Cucurbita moschata]; E-value 2.8e-82; identity 81.86%
Query:  MGNAFGVKKTVKVMKVSGETMKLNPPVQVGDVVKDYPGFVLLESEAVKHYGVRAKPLEPHQKLSTKRLYFLVELPKVSNQQAPRRVRSAINMSAKDRLES
        MGNAFG+KKTVKVM VSG+T+KL PPVQ  DVVKDYPGFVLLESEAVKHYGVRAKPLE HQKLSTKRLYFLVELPK+  +QAPRRVRSAINMSAKDRLES
Subjt:  MGNAFGVKKTVKVMKVSGETMKLNPPVQVGDVVKDYPGFVLLESEAVKHYGVRAKPLEPHQKLSTKRLYFLVELPKVSNQQAPRRVRSAINMSAKDRLES

Query:  LMLARRSASDLTIMKPKSVLAEEDGGEGSRAGAEATRLKVRLPRAEVEKLLKESKDEAEAAEKIMGLYMDKTREIVSESDRKEEIKKD-IIKPREKRRVS
        LML+RRSASDLTIMKPKSVLAEE GGE  R G+EATR+KVRLP+AEVE++LKESKDEAEAAE+IMGLYM K RE V +++RK E +KD IIKPREKRRVS
Subjt:  LMLARRSASDLTIMKPKSVLAEEDGGEGSRAGAEATRLKVRLPRAEVEKLLKESKDEAEAAEKIMGLYMDKTREIVSESDRKEEIKKD-IIKPREKRRVS

Query:  FMTTMEARSQIAVTS
        FMTT+EA  QIAV +
Subjt:  FMTTMEARSQIAVTS

XP_038891934.1 uncharacterized protein At1g66480 [Benincasa hispida]; E-value 5.4e-86; identity 85.39%
Query:  MGNAFGVKKTVKVMKVSGETMKLNPPVQVGDVVKDYPGFVLLESEAVKHYGVRAKPLEPHQKLSTKRLYFLVELPKVSNQQAPRRVRSAINMSAKDRLES
        MGN FGVKKTVKVM +SGETMKLN PVQ GDVVKDYPGFVLLESEAVKHYGVRAKPLE HQKLSTKRLYFLVELPKVS +QAPRRVRSAINMSAKDRLES
Subjt:  MGNAFGVKKTVKVMKVSGETMKLNPPVQVGDVVKDYPGFVLLESEAVKHYGVRAKPLEPHQKLSTKRLYFLVELPKVSNQQAPRRVRSAINMSAKDRLES

Query:  LMLARRSASDLTIMKPKSVLAEEDGGE----GSRAGAEATRLKVRLPRAEVEKLLKESKDEAEAAEKIMGLYMDKTREIVSESDRKEEIKKDIIKPREKR
        LMLARRSASDLTIMKPKSVLAEE GGE    GS +GA ATR+KVRLP+AEVE+LLKE KDEAEAAE+IMGLY  KTRE V E+D K+E KKDIIKPREKR
Subjt:  LMLARRSASDLTIMKPKSVLAEEDGGE----GSRAGAEATRLKVRLPRAEVEKLLKESKDEAEAAEKIMGLYMDKTREIVSESDRKEEIKKDIIKPREKR

Query:  RVSFMTTMEAR-SQIAVTS
        RVSFMTTMEAR +QIAV S
Subjt:  RVSFMTTMEAR-SQIAVTS

TrEMBL top hits (columns: hit, E-value, % identity, alignment)
A0A1S3BZC7 uncharacterized protein At1g66480 isoform X1; E-value 9.9e-86; identity 83.72%
Query:  MGNAFGVKKTVKVMKVSGETMKLNPPVQVGDVVKDYPGFVLLESEAVKHYGVRAKPLEPHQKLSTKRLYFLVELPKVSNQQAPRRVRSAINMSAKDRLES
        MGN FGVKKTVKVMK+SGET+KLN PVQVGDVVKDYPGFVLLESEAVKHYGVRAKPLE HQKLSTKRLYFLV+LPK+  +QAPRRVRSAINMSAKDRLES
Subjt:  MGNAFGVKKTVKVMKVSGETMKLNPPVQVGDVVKDYPGFVLLESEAVKHYGVRAKPLEPHQKLSTKRLYFLVELPKVSNQQAPRRVRSAINMSAKDRLES

Query:  LMLARRSASDLTIMKPKSVLAEEDGGEGSRAGAEATRLKVRLPRAEVEKLLKESKDEAEAAEKIMGLYMDKTREIVSESDRKEEIKKDIIKPRE-KRRVS
        LMLARRSASDLTIMKPKSVL EE GGE   +G+ ATR+KVRLP+AEVE+LLKE KDEAEAAE+IMGLY  KTRE V E+D KE+ KKDIIKPRE KRRVS
Subjt:  LMLARRSASDLTIMKPKSVLAEEDGGEGSRAGAEATRLKVRLPRAEVEKLLKESKDEAEAAEKIMGLYMDKTREIVSESDRKEEIKKDIIKPRE-KRRVS

Query:  FMTTMEARSQIAVTS
        FMTTMEA +QIAV S
Subjt:  FMTTMEARSQIAVTS

A0A1S3C0K8 uncharacterized protein At1g66480 isoform X2; E-value 4.0e-87; identity 84.11%
Query:  MGNAFGVKKTVKVMKVSGETMKLNPPVQVGDVVKDYPGFVLLESEAVKHYGVRAKPLEPHQKLSTKRLYFLVELPKVSNQQAPRRVRSAINMSAKDRLES
        MGN FGVKKTVKVMK+SGET+KLN PVQVGDVVKDYPGFVLLESEAVKHYGVRAKPLE HQKLSTKRLYFLV+LPK+  +QAPRRVRSAINMSAKDRLES
Subjt:  MGNAFGVKKTVKVMKVSGETMKLNPPVQVGDVVKDYPGFVLLESEAVKHYGVRAKPLEPHQKLSTKRLYFLVELPKVSNQQAPRRVRSAINMSAKDRLES

Query:  LMLARRSASDLTIMKPKSVLAEEDGGEGSRAGAEATRLKVRLPRAEVEKLLKESKDEAEAAEKIMGLYMDKTREIVSESDRKEEIKKDIIKPREKRRVSF
        LMLARRSASDLTIMKPKSVL EE GGE   +G+ ATR+KVRLP+AEVE+LLKE KDEAEAAE+IMGLY  KTRE V E+D KE+ KKDIIKPREKRRVSF
Subjt:  LMLARRSASDLTIMKPKSVLAEEDGGEGSRAGAEATRLKVRLPRAEVEKLLKESKDEAEAAEKIMGLYMDKTREIVSESDRKEEIKKDIIKPREKRRVSF

Query:  MTTMEARSQIAVTS
        MTTMEA +QIAV S
Subjt:  MTTMEARSQIAVTS

A0A6J1FSC9 uncharacterized protein At1g66480 isoform X2; E-value 1.3e-82; identity 81.86%
Query:  MGNAFGVKKTVKVMKVSGETMKLNPPVQVGDVVKDYPGFVLLESEAVKHYGVRAKPLEPHQKLSTKRLYFLVELPKVSNQQAPRRVRSAINMSAKDRLES
        MGNAFG+KKTVKVM VSG+T+KL PPVQ  DVVKDYPGFVLLESEAVKHYGVRAKPLE HQKLSTKRLYFLVELPK+  +QAPRRVRSAINMSAKDRLES
Subjt:  MGNAFGVKKTVKVMKVSGETMKLNPPVQVGDVVKDYPGFVLLESEAVKHYGVRAKPLEPHQKLSTKRLYFLVELPKVSNQQAPRRVRSAINMSAKDRLES

Query:  LMLARRSASDLTIMKPKSVLAEEDGGEGSRAGAEATRLKVRLPRAEVEKLLKESKDEAEAAEKIMGLYMDKTREIVSESDRKEEIKKD-IIKPREKRRVS
        LML+RRSASDLTIMKPKSVLAEE GGE  R G+EATR+KVRLP+AEVE++LKESKDEAEAAE+IMGLYM K RE V +++RK E +KD IIKPREKRRVS
Subjt:  LMLARRSASDLTIMKPKSVLAEEDGGEGSRAGAEATRLKVRLPRAEVEKLLKESKDEAEAAEKIMGLYMDKTREIVSESDRKEEIKKD-IIKPREKRRVS

Query:  FMTTMEARSQIAVTS
        FMTT+EA  QIAV +
Subjt:  FMTTMEARSQIAVTS

A0A6J1FUK8 uncharacterized protein At1g66480-like; E-value 2.3e-82; identity 81.57%
Query:  MGNAFGVKKTVKVMKVSGETMKLNPPVQVGDVVKDYPGFVLLESEAVKHYGVRAKPLEPHQKLSTKRLYFLVELPKVSNQQAPRRVRSAINMSAKDRLES
        MGNAFG KKTVKVMKV+GETMKLN PVQ GDVVKDYPGFVLL+SEAVKHYGVRAKPLEPHQ LS KRLYFLV+LPK+ NQ  PRR+RSAI+MSAKDRLES
Subjt:  MGNAFGVKKTVKVMKVSGETMKLNPPVQVGDVVKDYPGFVLLESEAVKHYGVRAKPLEPHQKLSTKRLYFLVELPKVSNQQAPRRVRSAINMSAKDRLES

Query:  LMLARRSASDLTIMKPKSVLAEEDGGEGSRAGAEATRLKVRLPRAEVEKLLKESKDEAEAAEKIM---GLYMDKTREIVSESDRKEEIKKDIIKPREKRR
        LMLARRSASDLTIMKPKSVLAEED GEGS +GA  TRLKVRLPRAEVEKLLKESKD+ EAAEKI+   GLYMDKT +  S+S  K+E K+D IKPREKRR
Subjt:  LMLARRSASDLTIMKPKSVLAEEDGGEGSRAGAEATRLKVRLPRAEVEKLLKESKDEAEAAEKIM---GLYMDKTREIVSESDRKEEIKKDIIKPREKRR

Query:  VSFMTTMEARSQIAVTS
        VSFMTTMEA ++IAV S
Subjt:  VSFMTTMEARSQIAVTS

A0A6J1FVP5 uncharacterized protein At1g66480 isoform X1; E-value 5.6e-81; identity 80.73%
Query:  MGNAFGVKKTVKVMKVSGETMKLNPPVQVGDVVKDYPGFVLLESEAVKHYGVRAKPLEPHQKLSTKRLYFLVELPKVSNQQAPRRVRSAINMSAKDRLES
        MGNAFG+KKTVKVM VSG+T+KL PPVQ  DVVKDYPGFVLLESEAVKHYGVRAKPLE HQKLSTKRLYFLVELPK+  +QAPRRVRSAINMSAKDRLES
Subjt:  MGNAFGVKKTVKVMKVSGETMKLNPPVQVGDVVKDYPGFVLLESEAVKHYGVRAKPLEPHQKLSTKRLYFLVELPKVSNQQAPRRVRSAINMSAKDRLES

Query:  LMLARRSASDLTIMKPKSVLAEEDGGEGSRAGAEATRLKVRLPRAEVEKLLKESKDEAEAAEKIMGLYMDKTREIVSESDRKEEIKKD-IIKPRE---KR
        LML+RRSASDLTIMKPKSVLAEE GGE  R G+EATR+KVRLP+AEVE++LKESKDEAEAAE+IMGLYM K RE V +++RK E +KD IIKPRE   KR
Subjt:  LMLARRSASDLTIMKPKSVLAEEDGGEGSRAGAEATRLKVRLPRAEVEKLLKESKDEAEAAEKIMGLYMDKTREIVSESDRKEEIKKD-IIKPRE---KR

Query:  RVSFMTTMEARSQIAVTS
        RVSFMTT+EA  QIAV +
Subjt:  RVSFMTTMEARSQIAVTS

SwissProt top hits (columns: hit, E-value, % identity, alignment)
Q6NLC8 Uncharacterized protein At1g66480; E-value 2.8e-37; identity 45.83%
Query:  MGNAFGVK-KTVKVMKVSGETMKLNPPVQVGDVVKDYPGFVLLESEAVKHYGVRAKPLEPHQKLSTKRLYFLVELPKV---------SNQQAPRRVRSAI
        MGN+  VK K  KVMK+ GET ++  PV   +V  DYPG+VLL+S+AVKH+GVR+KPLEP+Q L  K+ YFLVELPK+          N+   RRV S I
Subjt:  MGNAFGVK-KTVKVMKVSGETMKLNPPVQVGDVVKDYPGFVLLESEAVKHYGVRAKPLEPHQKLSTKRLYFLVELPKV---------SNQQAPRRVRSAI

Query:  NMSAKDRLESLMLARRSASDLTIMKPKSVLAEEDGGE--GSRAGAEATRLKVRLPRAEVEKLLKESKDEAEA-AEKIMGLYMDKTREIVSES---DRKEE
        ++ AK+RL+ LML+RR+ SD+TI          DGG+  G   G   T +++RLPR+++ KL++E+ ++A A AEKI+G+YM+++ E+       D + E
Subjt:  NMSAKDRLESLMLARRSASDLTIMKPKSVLAEEDGGE--GSRAGAEATRLKVRLPRAEVEKLLKESKDEAEA-AEKIMGLYMDKTREIVSES---DRKEE

Query:  IKKDIIKPREKRRVSF
        +    IK REK +VSF
Subjt:  IKKDIIKPREKRRVSF

Arabidopsis top hits (columns: hit, E-value, % identity, alignment)
AT1G66480.1 plastid movement impaired 2; E-value 2.0e-38; identity 45.83%
Query:  MGNAFGVK-KTVKVMKVSGETMKLNPPVQVGDVVKDYPGFVLLESEAVKHYGVRAKPLEPHQKLSTKRLYFLVELPKV---------SNQQAPRRVRSAI
        MGN+  VK K  KVMK+ GET ++  PV   +V  DYPG+VLL+S+AVKH+GVR+KPLEP+Q L  K+ YFLVELPK+          N+   RRV S I
Subjt:  MGNAFGVK-KTVKVMKVSGETMKLNPPVQVGDVVKDYPGFVLLESEAVKHYGVRAKPLEPHQKLSTKRLYFLVELPKV---------SNQQAPRRVRSAI

Query:  NMSAKDRLESLMLARRSASDLTIMKPKSVLAEEDGGE--GSRAGAEATRLKVRLPRAEVEKLLKESKDEAEA-AEKIMGLYMDKTREIVSES---DRKEE
        ++ AK+RL+ LML+RR+ SD+TI          DGG+  G   G   T +++RLPR+++ KL++E+ ++A A AEKI+G+YM+++ E+       D + E
Subjt:  NMSAKDRLESLMLARRSASDLTIMKPKSVLAEEDGGE--GSRAGAEATRLKVRLPRAEVEKLLKESKDEAEA-AEKIMGLYMDKTREIVSES---DRKEE

Query:  IKKDIIKPREKRRVSF
        +    IK REK +VSF
Subjt:  IKKDIIKPREKRRVSF

AT1G71015.1 unknown protein; E-value 1.0e-42; identity 52.69%
Query:  MGNAFGVKKTVKVMKVSGETMKLNPPVQVGDVVKDYPGFVLLESEAVKHYGVRAKPLEPHQKLSTKRLYFLVELPKVSNQQAPRRVRSAINMSAKDRLES
        MGN+ G KKT  +M ++GE+ KL  PV+ G VVKD+PG VLLESEAVK  G+RAKPLEPHQ L +KR+YF+VELP+   ++ PRRVRS I MSAK+RLE+
Subjt:  MGNAFGVKKTVKVMKVSGETMKLNPPVQVGDVVKDYPGFVLLESEAVKHYGVRAKPLEPHQKLSTKRLYFLVELPKVSNQQAPRRVRSAINMSAKDRLES

Query:  LMLARRSASDLTIMKPKSVLAEEDGGEGSRAGAEATRLKVRLPRAEVEKLLKESKDEAEAAEKIMGL
        L L+RRS+SDL++MK K+ + +E+         E + +K++LP+ ++EKL KES+  ++ + KI  L
Subjt:  LMLARRSASDLTIMKPKSVLAEEDGGEGSRAGAEATRLKVRLPRAEVEKLLKESKDEAEAAEKIMGL

AT2G01340.1 Encodes a protein whose expression is responsive to nematode infection; E-value 4.2e-44; identity 50.88%
Query:  MGNAFGVKKTVKVMKVSGETMKLNPPVQVGDVVKDYPGFVLLESEAVKHYGVRAKPLEPHQKLSTKRLYFLVELPKVSNQQAPRRVRSAINMSAKDRLES
        MGN+ G KKT KVMK+ GET KL  PV   +V+KD+PG VLL+SE+VKHYG RAKPLE  Q+L  KRLYF+VE  K   +  PRRVRS I++SAK+RLES
Subjt:  MGNAFGVKKTVKVMKVSGETMKLNPPVQVGDVVKDYPGFVLLESEAVKHYGVRAKPLEPHQKLSTKRLYFLVELPKVSNQQAPRRVRSAINMSAKDRLES

Query:  LMLARRSASDLTIMKPKS--VLAEEDGGEGSRAGAEATRLKVRLPRAEVEKLLKESKDEAEAAEKIMGLYMDKTR-EIVSESDRKEEIKKDI--------
        LMLARRS+SDL+I+KP       EE+G           R+KVR+P+AE+EKL+KE   EAEA +KI  L+M K R E   ++ R++E             
Subjt:  LMLARRSASDLTIMKPKS--VLAEEDGGEGSRAGAEATRLKVRLPRAEVEKLLKESKDEAEAAEKIMGLYMDKTR-EIVSESDRKEEIKKDI--------

Query:  ---IKPREKRRVSFMTTMEARSQIAV
           +K R K RVSFM      S+I V
Subjt:  ---IKPREKRRVSFMTTMEARSQIAV

AT5G37840.1 BEST Arabidopsis thaliana protein match is: plastid movement impaired 2 (TAIR:AT1G66480.1); E-value 6.2e-32; identity 46.2%
Query:  MGNAFGVKKT-VKVMKVSGETMKLNPPVQVGDVVKDYPGFVLLESEAVKHYGVRAKPLEPHQKLSTKRLYFLVELPKVS--NQQAPRRVRSA-INMSAKD
        MGN   V++  VKVMK+ G+  +L  PV   D  K+YPGFVLL+SE VK  GVRAKPLEP+Q L     YFLV+LP V   N+   RRV S  I++ AK+
Subjt:  MGNAFGVKKT-VKVMKVSGETMKLNPPVQVGDVVKDYPGFVLLESEAVKHYGVRAKPLEPHQKLSTKRLYFLVELPKVS--NQQAPRRVRSA-INMSAKD

Query:  RLESLMLARRSASDLTIMKPKSVLAEEDGGEGSRAGAEATRLKVRLPRAEVEKLLKESKDEAEAAEKIMGLYMDKTREIVSESD
        RLE LML+RR+ SD+   +   V      G+G   G   TR+++RLPR+++ KL++ES D +E A KI+  YM+ +  I    D
Subjt:  RLESLMLARRSASDLTIMKPKSVLAEEDGGEGSRAGAEATRLKVRLPRAEVEKLLKESKDEAEAAEKIMGLYMDKTREIVSESD


Sequences
CDS sequence
ATGGGAAACGCCTTTGGAGTTAAGAAGACCGTGAAGGTGATGAAGGTCTCCGGCGAGACGATGAAGCTAAACCCCCCGGTTCAAGTCGGGGACGTCGTCAAGGATTATCC
CGGCTTCGTTCTGCTCGAATCCGAGGCCGTAAAGCATTATGGAGTCCGAGCAAAGCCATTAGAGCCCCACCAGAAGCTCAGTACGAAGAGACTCTATTTCCTCGTCGAGC
TGCCTAAGGTTTCAAATCAACAGGCTCCACGACGGGTACGATCTGCGATCAACATGAGTGCCAAGGACAGGCTAGAGAGCTTGATGTTGGCGAGAAGGTCGGCGTCGGAC
CTAACTATCATGAAACCGAAGAGCGTGTTGGCGGAGGAGGACGGTGGAGAGGGATCGAGAGCGGGAGCGGAAGCGACACGGTTGAAGGTACGGCTGCCGAGGGCGGAGGT
GGAGAAGCTGTTGAAGGAGAGCAAAGATGAGGCAGAGGCAGCGGAGAAGATCATGGGGCTGTATATGGATAAAACCAGAGAGATTGTTTCTGAAAGTGATCGGAAGGAAG
AGATCAAAAAGGATATCATCAAGCCACGTGAGAAACGACGCGTAAGTTTCATGACGACAATGGAAGCAAGGTCTCAAATTGCAGTGACATCTTAA
mRNA sequence
ATTCTCTCTTATACGCCCACCCGCCACCTATCTTCTTCTCTCTCCAATCCTCAAAACTCTTTCCCTTTTGGCCGCCATGGGAAACGCCTTTGGAGTTAAGAAGACCGTGA
AGGTGATGAAGGTCTCCGGCGAGACGATGAAGCTAAACCCCCCGGTTCAAGTCGGGGACGTCGTCAAGGATTATCCCGGCTTCGTTCTGCTCGAATCCGAGGCCGTAAAG
CATTATGGAGTCCGAGCAAAGCCATTAGAGCCCCACCAGAAGCTCAGTACGAAGAGACTCTATTTCCTCGTCGAGCTGCCTAAGGTTTCAAATCAACAGGCTCCACGACG
GGTACGATCTGCGATCAACATGAGTGCCAAGGACAGGCTAGAGAGCTTGATGTTGGCGAGAAGGTCGGCGTCGGACCTAACTATCATGAAACCGAAGAGCGTGTTGGCGG
AGGAGGACGGTGGAGAGGGATCGAGAGCGGGAGCGGAAGCGACACGGTTGAAGGTACGGCTGCCGAGGGCGGAGGTGGAGAAGCTGTTGAAGGAGAGCAAAGATGAGGCA
GAGGCAGCGGAGAAGATCATGGGGCTGTATATGGATAAAACCAGAGAGATTGTTTCTGAAAGTGATCGGAAGGAAGAGATCAAAAAGGATATCATCAAGCCACGTGAGAA
ACGACGCGTAAGTTTCATGACGACAATGGAAGCAAGGTCTCAAATTGCAGTGACATCTTAATTAACAAGAACAACAAGATTCATGAGCGAACTAATTGAAGTAATATAGT
GACGATCTATCGAGAGTATTACGTCGTTACGTCCTTTGCTTCAACTTCATTTGAATCTTGAATGAGCTTTTGTAGAATATAGATAGATTTAGCGCTCCTTTATACGCCCA
TCATGTACTTACTAATTAACTTCATCATACTTATATAGCTAGTTCCCTTATGTTATGGACGATAATTTGCTCAATTTATGGTTTAATGTTTAGTGTTTGCAACATTT
Protein sequence
MGNAFGVKKTVKVMKVSGETMKLNPPVQVGDVVKDYPGFVLLESEAVKHYGVRAKPLEPHQKLSTKRLYFLVELPKVSNQQAPRRVRSAINMSAKDRLESLMLARRSASD
LTIMKPKSVLAEEDGGEGSRAGAEATRLKVRLPRAEVEKLLKESKDEAEAAEKIMGLYMDKTREIVSESDRKEEIKKDIIKPREKRRVSFMTTMEARSQIAVTS
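
The CDS and protein sequences listed above should correspond under translation. The following is a minimal sketch of that check, assuming the standard genetic code (NCBI translation table 1) applies to this nuclear gene; the `translate` function and the variable names are illustrative and not part of the database. Only the first 48 bases of the CDS are used here to keep the example short; the full sequence can be pasted in the same way, since whitespace and line breaks are removed before translation.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), with codons enumerated
# in T, C, A, G order for each of the three positions.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    protein = []
    for i in range(0, len(cds), 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        if amino_acid == "*":  # stop codon ends translation
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 48 bases of the CDS sequence shown in this record.
cds_start = "ATGGGAAACGCCTTTGGAGTTAAGAAGACCGTGAAGGTGATGAAGGTC"
print(translate(cds_start))  # prints MGNAFGVKKTVKVMKV
```

The printed peptide matches the first 16 residues of the protein sequence in this record.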