; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0017191 (gene) of Snake gourd v1 genome

Gene IDTan0017191
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionSWIM-type domain-containing protein
Genome locationLG11:8457568..8459925
RNA-Seq ExpressionTan0017191
SyntenyTan0017191
Gene Ontology termsNA
InterPro domainsIPR004332 - Transposase, MuDR, plant


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAF7138490.1 hypothetical protein RHSIM_Rhsim07G0255600 [Rhododendron simsii]4.5e-4534.84Show/hide
Query:  NLVERMGYAANYFKLWGRDPTMDV-GSYKTLESNEDVETLTSLLSDRMPFEIFVEHDVDCGLSDDSSFNLDDFCGFESEDMDD-----YEEEECRPLTFK
        ++VER+GY   Y  LW R P + +    + +E+++D   + +++      EI+VEH +D     D++ N+  F      ++ +      +EE        
Subjt:  NLVERMGYAANYFKLWGRDPTMDV-GSYKTLESNEDVETLTSLLSDRMPFEIFVEHDVDCGLSDDSSFNLDDFCGFESEDMDD-----YEEEECRPLTFK

Query:  CSNQGDKGKQVIEVDIESDIDID--------------------SDELHSLD----------DSSDSDQGCKSKMKFPSFKQNMEENNMTNFEIGMLFNSL
        C++  D+    ++  +  D D D                    ++EL   D          D + +D   +  ++F +FK      N+  F  GMLF+SL
Subjt:  CSNQGDKGKQVIEVDIESDIDID--------------------SDELHSLD----------DSSDSDQGCKSKMKFPSFKQNMEENNMTNFEIGMLFNSL

Query:  SEFKNVVNTYAVKGGWGIRFTKNDKMRVRAKCVEGCKWLAYASKIQGEETFQLKTYVNEHTCSRCFRNPRLTSSWLSKRLENEVKDHPEMKLTSIQKKVQ
         +FK  ++ YAV GG+GI+F KND+ RVRA C + C +    SK+  +ETFQLKT   EH C+R ++N RL+S +L+ +L  +VKD P ++L+ IQ+KV 
Subjt:  SEFKNVVNTYAVKGGWGIRFTKNDKMRVRAKCVEGCKWLAYASKIQGEETFQLKTYVNEHTCSRCFRNPRLTSSWLSKRLENEVKDHPEMKLTSIQKKVQ

Query:  REYISQISRGKAYRAKRLALDKVRGSYAEQYIQLWEYCEELRISNPGSTTILK
         +++ QISR KAYRAKR ALD V GS+ EQY  LW+YC EL+ SNPGST +++
Subjt:  REYISQISRGKAYRAKRLALDKVRGSYAEQYIQLWEYCEELRISNPGSTTILK

XP_022153708.1 uncharacterized protein LOC111021158 [Momordica charantia]7.6e-8551.86Show/hide
Query:  WSMSKRINLVERMGYAANYFKLWGRDPTMDVGSYKTLESNEDVETLTSLLSDRMPFEIFVEHDV--DCGLSDDSSFNLDDFCGFESEDMDDYEEEECRPL
        WS  + I +VE++GY  +Y KLWGRDPT +VG Y+ +ES++DV  L SLLSD M FEI+VEH+   +  L D+ S  L    G ESE  DD +      +
Subjt:  WSMSKRINLVERMGYAANYFKLWGRDPTMDVGSYKTLESNEDVETLTSLLSDRMPFEIFVEHDV--DCGLSDDSSFNLDDFCGFESEDMDDYEEEECRPL

Query:  TFKCSNQGDKGKQVIEVD--IESD---IDIDSDELHSLDDSSDSDQGCKSKMKFPSFKQNMEENNMTNFEIGMLFNSLSEFKNVVNTYAVKGGWGIRFTK
                 K  ++I +D  +E++    DIDS EL SL DSSDS+     ++K+P +++   +     FE+GM FNSL EFKNVV+ YAVKGGW IRF K
Subjt:  TFKCSNQGDKGKQVIEVD--IESD---IDIDSDELHSLDDSSDSDQGCKSKMKFPSFKQNMEENNMTNFEIGMLFNSLSEFKNVVNTYAVKGGWGIRFTK

Query:  NDKMRVRAKCVEGCKWLAYASKIQGEETFQLKTYVNEHTCSRCFRNPRLTSSWLSKRLENEVKDHPEMKLTSIQKKVQREYISQISRGKAYRAKRLALDK
        NDK RVRAKCV+GCKWLAY +K+QGE T+QLKT+V EH+CSR F NP LTS WL +++ N+VK+ P+++L +IQ+KVQR+YISQI++ KA+RAK+LALD 
Subjt:  NDKMRVRAKCVEGCKWLAYASKIQGEETFQLKTYVNEHTCSRCFRNPRLTSSWLSKRLENEVKDHPEMKLTSIQKKVQREYISQISRGKAYRAKRLALDK

Query:  VRGSYAEQYIQLWEYCEELRISNPGSTTILKLKISQEEEENNREPIERI
        V GS+ EQ   LWEYC E+  SNPGS+ +L L + QE+EE + +PI RI
Subjt:  VRGSYAEQYIQLWEYCEELRISNPGSTTILKLKISQEEEENNREPIERI

XP_023896782.1 uncharacterized protein LOC112008678 [Quercus suber]6.5e-4445.76Show/hide
Query:  DYEEEECRPLTFKCSNQGDKGKQVIEVDIESDIDIDSDELHSLDDSSDSDQGCKSKMKFPSFKQNMEENNMTNFEIGMLFNSLSEFKNVVNTYAVKGGWG
        DY  EE   LT + S+ GD+G     VD +SD + D D  H   D+        ++ KFP FKQ     +M  FE  MLF S  +FK+ +  YAV GGWG
Subjt:  DYEEEECRPLTFKCSNQGDKGKQVIEVDIESDIDIDSDELHSLDDSSDSDQGCKSKMKFPSFKQNMEENNMTNFEIGMLFNSLSEFKNVVNTYAVKGGWG

Query:  IRFTKNDKMRVRAKCVEGCKWLAYASKIQGEETFQLKTYVNEHTCSRCFRNPRLTSSWLSKRLENEVKDHPEMKLTSIQKKVQREYISQISRGKAYRAKR
        ++F KNDK+RVRAKC   CK+ AY +K+  E ++QLKT   EHTC+R ++NPR T+ +L+++L  +V+  P +KL  IQ+ V  +Y+  I+ GKA RA+ 
Subjt:  IRFTKNDKMRVRAKCVEGCKWLAYASKIQGEETFQLKTYVNEHTCSRCFRNPRLTSSWLSKRLENEVKDHPEMKLTSIQKKVQREYISQISRGKAYRAKR

Query:  LALDKVRGSYAEQYIQLWEYCEELRISNPGSTTILK
         A + V GSY EQY QLW+YC ELR S+PGST ++K
Subjt:  LALDKVRGSYAEQYIQLWEYCEELRISNPGSTTILK

XP_023914573.1 uncharacterized protein LOC112026126 [Quercus suber]3.6e-4241.26Show/hide
Query:  GFESEDMDDYEEEECRPL------TFKCSNQ--------GDKGKQVIEVDIESD----IDIDSDELHSLDDSSDSD------QGCKSKMKFPSFKQNMEE
        G E E  D  EE E +P+      T    NQ        G  G  V+  D ES+    +D  S      DDSSD D           K K+P F+   + 
Subjt:  GFESEDMDDYEEEECRPL------TFKCSNQ--------GDKGKQVIEVDIESD----IDIDSDELHSLDDSSDSD------QGCKSKMKFPSFKQNMEE

Query:  NNMTNFEIGMLFNSLSEFKNVVNTYAVKGGWGIRFTKNDKMRVRAKCVEGCKWLAYASKIQGEETFQLKTYVNEHTCSRCFRNPRLTSSWLSKRLENEVK
         ++  FE  MLF S  +FK+ +  YAV GGWGI+F KND +RVRA+C  GC ++AY +K+  E++F+LKT   EHTCSR ++NPR T+S++ K+L   V+
Subjt:  NNMTNFEIGMLFNSLSEFKNVVNTYAVKGGWGIRFTKNDKMRVRAKCVEGCKWLAYASKIQGEETFQLKTYVNEHTCSRCFRNPRLTSSWLSKRLENEVK

Query:  DHPEMKLTSIQKKVQREYISQISRGKAYRAKRLALDKVRGSYAEQYIQLWEYCEELRISNPGSTTILKL
          P++KL  IQ  V  +Y+  IS GKA RA+  A D V G++  Q+ QLWEYC+ELR  +PGST ++K+
Subjt:  DHPEMKLTSIQKKVQREYISQISRGKAYRAKRLALDKVRGSYAEQYIQLWEYCEELRISNPGSTTILKL

XP_030924747.1 uncharacterized protein LOC115951735 [Quercus lobata]5.2e-4137.83Show/hide
Query:  ESEDMDDYEEEECRPLTFKCSNQGDKGKQVIEVDIES--------DIDIDSDELHSLDDSSDSDQGCKS----------------KMKFPSFKQNMEENN
        E  D+++  E E +P+     +      Q    ++E+        + D +S++L SLD+SS S +G                     K+P F+   +  N
Subjt:  ESEDMDDYEEEECRPLTFKCSNQGDKGKQVIEVDIES--------DIDIDSDELHSLDDSSDSDQGCKS----------------KMKFPSFKQNMEENN

Query:  MTNFEIGMLFNSLSEFKNVVNTYAVKGGWGIRFTKNDKMRVRAKCVEGCKWLAYASKIQGEETFQLKTYVNEHTCSRCFRNPRLTSSWLSKRLENEVKDH
        +  FE  MLF    +FK+ +  YAV GGWGI+F KND +RVRA+C  GC ++AY +K+  E++F+LKT   EHTC+R ++NPR T+S++ K+L   V+  
Subjt:  MTNFEIGMLFNSLSEFKNVVNTYAVKGGWGIRFTKNDKMRVRAKCVEGCKWLAYASKIQGEETFQLKTYVNEHTCSRCFRNPRLTSSWLSKRLENEVKDH

Query:  PEMKLTSIQKKVQREYISQISRGKAYRAKRLALDKVRGSYAEQYIQLWEYCEELRISNPGSTTILKL
        P +KL  IQ+ V ++Y+  IS GKA RA+  A D V G++  Q+ QLWEYC+ELR  +PGST ++K+
Subjt:  PEMKLTSIQKKVQREYISQISRGKAYRAKRLALDKVRGSYAEQYIQLWEYCEELRISNPGSTTILKL

TrEMBL top hitse value%identityAlignment
A0A2N9ES33 Uncharacterized protein4.4e-5436.09Show/hide
Query:  LVERMGYAANYFKLWGRDPTMDV--GSYKTLESNEDVETLTSLLSDRMPFEIFVEHDVD--------CGLSDDSSFNLDDFCGFESEDMDDYEEEECRPL
        +V  +GYA    +LW R P + +  G  + + S+ D   +T L+      E+FVEH V+          + DD + N+DD    E +++ D +E++   L
Subjt:  LVERMGYAANYFKLWGRDPTMDV--GSYKTLESNEDVETLTSLLSDRMPFEIFVEHDVD--------CGLSDDSSFNLDDFCGFESEDMDDYEEEECRPL

Query:  ---------------------------------TFKCSNQGDKGKQVIEVDIESDI--------DIDSDELHS----LDDSSDSDQGCKSKMK-------
                                              N  D     +E D + ++        D +S++L+S    L +S     G   +++       
Subjt:  ---------------------------------TFKCSNQGDKGKQVIEVDIESDI--------DIDSDELHS----LDDSSDSDQGCKSKMK-------

Query:  -------------FPSFKQNMEENNMTNFEIGMLFNSLSEFKNVVNTYAVKGGWGIRFTKNDKMRVRAKCVEGCKWLAYASKIQGEETFQLKTYVNEHTC
                     FP F+      ++  FE+GMLF S  +FK  +  YAV+GGWGIRF KNDK+RVRA C EGCK++AY +K+  E TFQLKT   EH+C
Subjt:  -------------FPSFKQNMEENNMTNFEIGMLFNSLSEFKNVVNTYAVKGGWGIRFTKNDKMRVRAKCVEGCKWLAYASKIQGEETFQLKTYVNEHTC

Query:  SRCFRNPRLTSSWLSKRLENEVKDHPEMKLTSIQKKVQREYISQISRGKAYRAKRLALDKVRGSYAEQYIQLWEYCEELRISNPGSTTILKLKISQEEE
        SRCF+NPR+T+ +L+K+L   VKD P++KL SIQKKV R+Y++ IS+ KAYRAK  A+D + GS+ EQY  LW+YCEELR SNPGST ++K++   E E
Subjt:  SRCFRNPRLTSSWLSKRLENEVKDHPEMKLTSIQKKVQREYISQISRGKAYRAKRLALDKVRGSYAEQYIQLWEYCEELRISNPGSTTILKLKISQEEE

A0A2N9G7H5 Uncharacterized protein2.4e-5236.93Show/hide
Query:  LVERMGYAANYFKLWGRDPTMDV--GSYKTLESNEDVETLTSLLSDRMPFEIFVEHDVD--------CGLSDDSSFNLDDFCGFESEDMDDYEEEECRPL
        +V  +GYA    +LW + P + +  G    + S+ D   +T L+      ++FVEH V+          + DD + ++DD    + +++ D +E+    L
Subjt:  LVERMGYAANYFKLWGRDPTMDV--GSYKTLESNEDVETLTSLLSDRMPFEIFVEHDVD--------CGLSDDSSFNLDDFCGFESEDMDDYEEEECRPL

Query:  TFKCSNQGDKGKQVIEVDIESDIDIDSDELHSLDDSSDSDQGCKSKMK--------------------------FPSFKQNMEENNMTNFEIGMLFNSLS
                D+   +++ ++      +SD  +S DDS DS +  +   K                          FP F+      ++  FE GMLF S  
Subjt:  TFKCSNQGDKGKQVIEVDIESDIDIDSDELHSLDDSSDSDQGCKSKMK--------------------------FPSFKQNMEENNMTNFEIGMLFNSLS

Query:  EFKNVVNTYAVKGGWGIRFTKNDKMRVRAKCVEGCKWLAYASKIQGEETFQLKTYVNEHTCSRCFRNPRLTSSWLSKRLENEVKDHPEMKLTSIQKKVQR
        ++K  +  YAV+GGWGI+F KNDK+RVRA C EGCK++AY +K+  E TFQLKT   +H+C+RC++NPR+T+ +L+K+L   VKD P++KL SIQKKV +
Subjt:  EFKNVVNTYAVKGGWGIRFTKNDKMRVRAKCVEGCKWLAYASKIQGEETFQLKTYVNEHTCSRCFRNPRLTSSWLSKRLENEVKDHPEMKLTSIQKKVQR

Query:  EYISQISRGKAYRAKRLALDKVRGSYAEQYIQLWEYCEELRISNPGSTTILKLKISQEEEENNREPIERIR
        +Y++ IS+ KAYRAK  A+D + GS+ EQY  LW+YCEELR SNPGST ++K++ S  E E   E + +IR
Subjt:  EYISQISRGKAYRAKRLALDKVRGSYAEQYIQLWEYCEELRISNPGSTTILKLKISQEEEENNREPIERIR

A0A2N9H0G1 Uncharacterized protein3.4e-5435.94Show/hide
Query:  LVERMGYAANYFKLWGRDPTMDV--GSYKTLESNEDVETLTSLLSDRMPFEIFVEHDVD--------CGLSDDSSFNLDDFCGFESEDMDDYEEEECRPL
        +V  +GYA    +LW R P + +  G  + + S+ D   +T L+      E+FVEH V+          + DD + N+DD    E +++ D +E++   L
Subjt:  LVERMGYAANYFKLWGRDPTMDV--GSYKTLESNEDVETLTSLLSDRMPFEIFVEHDVD--------CGLSDDSSFNLDDFCGFESEDMDDYEEEECRPL

Query:  ----------------------------TFKCSNQGDKGKQVIEVDIESDI--------DIDSDELHS----LDDSSDSDQGCKSKMK------------
                                         N  D     +E D + ++        D +S++L+S    L +S     G   +++            
Subjt:  ----------------------------TFKCSNQGDKGKQVIEVDIESDI--------DIDSDELHS----LDDSSDSDQGCKSKMK------------

Query:  ------------FPSFKQNMEENNMTNFEIGMLFNSLSEFKNVVNTYAVKGGWGIRFTKNDKMRVRAKCVEGCKWLAYASKIQGEETFQLKTYVNEHTCS
                    FP F+      ++  FE+GMLF S  +FK  +  YAV+GGWGIRF KNDK+RVRA C EGCK++AY +K+  E TFQLKT   EH+CS
Subjt:  ------------FPSFKQNMEENNMTNFEIGMLFNSLSEFKNVVNTYAVKGGWGIRFTKNDKMRVRAKCVEGCKWLAYASKIQGEETFQLKTYVNEHTCS

Query:  RCFRNPRLTSSWLSKRLENEVKDHPEMKLTSIQKKVQREYISQISRGKAYRAKRLALDKVRGSYAEQYIQLWEYCEELRISNPGSTTILKLKISQEEEEN
        RCF+NPR+T+ +L+K+L   VKD P++KL SIQKKV+++Y++ IS+ KAYRAK  A+D + GS+ EQY  LW+YCEELR SNPGST ++K++ S  E E 
Subjt:  RCFRNPRLTSSWLSKRLENEVKDHPEMKLTSIQKKVQREYISQISRGKAYRAKRLALDKVRGSYAEQYIQLWEYCEELRISNPGSTTILKLKISQEEEEN

Query:  NREPIERIR
          E + +IR
Subjt:  NREPIERIR

A0A2N9HKG1 SWIM-type domain-containing protein1.6e-5639.24Show/hide
Query:  LVERMGYAANYFKLWGRDPTMDV--GSYKTLESNEDVETLTSLLSDRMPFEIFVEHDVD--------CGLSDDSSFNLDDFCGFESEDMDDYEEEE----
        +V  +GYA    +LW R P + +  G  + + S+ D   +T L+      E+FVEH V+          + DD + N+DD    E +++ D +E++    
Subjt:  LVERMGYAANYFKLWGRDPTMDV--GSYKTLESNEDVETLTSLLSDRMPFEIFVEHDVD--------CGLSDDSSFNLDDFCGFESEDMDDYEEEE----

Query:  ------CRPLTFKCSNQGDKGKQVIEVDIESDIDIDSDELHSLDDSSDSDQGCKSKMKFPSFKQNMEENNMTNFEIGMLFNSLSEFKNVVNTYAVKGGWG
                 +  +  +Q          D ++ ++ ++D   S        +    +  FP F+      ++  FE+GMLF S  +FK  +  YAV+GGWG
Subjt:  ------CRPLTFKCSNQGDKGKQVIEVDIESDIDIDSDELHSLDDSSDSDQGCKSKMKFPSFKQNMEENNMTNFEIGMLFNSLSEFKNVVNTYAVKGGWG

Query:  IRFTKNDKMRVRAKCVEGCKWLAYASKIQGEETFQLKTYVNEHTCSRCFRNPRLTSSWLSKRLENEVKDHPEMKLTSIQKKVQREYISQISRGKAYRAKR
        IRF KNDK+RVRA C EGCK++AY +K+  E TFQLKT   EH+CSRCF+NPR+T+ +L+K+L   VKD P++KL SIQKKV ++Y++ IS+ KAYRAK 
Subjt:  IRFTKNDKMRVRAKCVEGCKWLAYASKIQGEETFQLKTYVNEHTCSRCFRNPRLTSSWLSKRLENEVKDHPEMKLTSIQKKVQREYISQISRGKAYRAKR

Query:  LALDKVRGSYAEQYIQLWEYCEELRISNPGSTTILKLKISQEEE
         A+D + GS+ EQY  LW+YCEELR SNPGST ++K++   E E
Subjt:  LALDKVRGSYAEQYIQLWEYCEELRISNPGSTTILKLKISQEEE

A0A6J1DI69 uncharacterized protein LOC1110211583.7e-8551.86Show/hide
Query:  WSMSKRINLVERMGYAANYFKLWGRDPTMDVGSYKTLESNEDVETLTSLLSDRMPFEIFVEHDV--DCGLSDDSSFNLDDFCGFESEDMDDYEEEECRPL
        WS  + I +VE++GY  +Y KLWGRDPT +VG Y+ +ES++DV  L SLLSD M FEI+VEH+   +  L D+ S  L    G ESE  DD +      +
Subjt:  WSMSKRINLVERMGYAANYFKLWGRDPTMDVGSYKTLESNEDVETLTSLLSDRMPFEIFVEHDV--DCGLSDDSSFNLDDFCGFESEDMDDYEEEECRPL

Query:  TFKCSNQGDKGKQVIEVD--IESD---IDIDSDELHSLDDSSDSDQGCKSKMKFPSFKQNMEENNMTNFEIGMLFNSLSEFKNVVNTYAVKGGWGIRFTK
                 K  ++I +D  +E++    DIDS EL SL DSSDS+     ++K+P +++   +     FE+GM FNSL EFKNVV+ YAVKGGW IRF K
Subjt:  TFKCSNQGDKGKQVIEVD--IESD---IDIDSDELHSLDDSSDSDQGCKSKMKFPSFKQNMEENNMTNFEIGMLFNSLSEFKNVVNTYAVKGGWGIRFTK

Query:  NDKMRVRAKCVEGCKWLAYASKIQGEETFQLKTYVNEHTCSRCFRNPRLTSSWLSKRLENEVKDHPEMKLTSIQKKVQREYISQISRGKAYRAKRLALDK
        NDK RVRAKCV+GCKWLAY +K+QGE T+QLKT+V EH+CSR F NP LTS WL +++ N+VK+ P+++L +IQ+KVQR+YISQI++ KA+RAK+LALD 
Subjt:  NDKMRVRAKCVEGCKWLAYASKIQGEETFQLKTYVNEHTCSRCFRNPRLTSSWLSKRLENEVKDHPEMKLTSIQKKVQREYISQISRGKAYRAKRLALDK

Query:  VRGSYAEQYIQLWEYCEELRISNPGSTTILKLKISQEEEENNREPIERI
        V GS+ EQ   LWEYC E+  SNPGS+ +L L + QE+EE + +PI RI
Subjt:  VRGSYAEQYIQLWEYCEELRISNPGSTTILKLKISQEEEENNREPIERI

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGACCCTCGCTCGCCTTCTCTCCGCAATGAAGAATCGTGGTCAATGTCTAAACGTATTAATTTGGTTGAACGCATGGGTTATGCTGCCAATTATTTTAAATTGTGGGG
CAGAGATCCTACAATGGATGTTGGTAGTTATAAAACTTTAGAGAGTAATGAAGATGTAGAGACATTAACTAGTCTTTTAAGTGATAGAATGCCTTTTGAAATATTTGTGG
AGCATGATGTAGATTGTGGTTTGTCTGATGATAGTAGTTTTAATTTAGATGATTTTTGTGGTTTTGAGAGTGAAGATATGGATGATTATGAAGAGGAAGAATGCAGACCC
TTAACTTTCAAATGTAGCAATCAAGGAGATAAAGGAAAACAAGTTATTGAGGTAGATATTGAAAGTGATATAGACATAGACTCTGACGAACTACACTCATTAGATGACTC
GTCTGATTCAGACCAAGGATGCAAGAGTAAGATGAAGTTCCCATCCTTCAAGCAAAACATGGAGGAAAATAACATGACCAACTTTGAAATTGGTATGTTATTTAATTCTT
TGAGTGAGTTTAAGAATGTTGTGAACACTTATGCAGTCAAAGGAGGATGGGGGATTCGTTTTACAAAAAATGATAAGATGAGAGTTCGAGCAAAATGTGTAGAAGGTTGT
AAATGGTTAGCCTATGCATCCAAGATTCAAGGGGAAGAAACCTTTCAATTGAAAACCTATGTGAATGAACATACATGTAGTAGATGCTTTAGAAATCCTCGATTGACATC
TTCTTGGTTGAGCAAACGACTAGAAAATGAAGTTAAGGACCATCCAGAAATGAAGCTAACTTCCATTCAGAAAAAGGTACAACGTGAGTACATTTCACAAATTTCTAGGG
GCAAGGCTTATAGGGCCAAACGTCTAGCATTAGATAAGGTTCGTGGTTCATACGCTGAACAATACATTCAATTATGGGAGTACTGTGAGGAATTACGTATTTCGAATCCT
GGTAGCACGACAATTTTGAAATTGAAAATCTCTCAAGAGGAAGAGGAGAATAATAGAGAACCCATTGAAAGGATTAGGGCTTAA
mRNA sequenceShow/hide mRNA sequence
ATGGACCCTCGCTCGCCTTCTCTCCGCAATGAAGAATCGTGGTCAATGTCTAAACGTATTAATTTGGTTGAACGCATGGGTTATGCTGCCAATTATTTTAAATTGTGGGG
CAGAGATCCTACAATGGATGTTGGTAGTTATAAAACTTTAGAGAGTAATGAAGATGTAGAGACATTAACTAGTCTTTTAAGTGATAGAATGCCTTTTGAAATATTTGTGG
AGCATGATGTAGATTGTGGTTTGTCTGATGATAGTAGTTTTAATTTAGATGATTTTTGTGGTTTTGAGAGTGAAGATATGGATGATTATGAAGAGGAAGAATGCAGACCC
TTAACTTTCAAATGTAGCAATCAAGGAGATAAAGGAAAACAAGTTATTGAGGTAGATATTGAAAGTGATATAGACATAGACTCTGACGAACTACACTCATTAGATGACTC
GTCTGATTCAGACCAAGGATGCAAGAGTAAGATGAAGTTCCCATCCTTCAAGCAAAACATGGAGGAAAATAACATGACCAACTTTGAAATTGGTATGTTATTTAATTCTT
TGAGTGAGTTTAAGAATGTTGTGAACACTTATGCAGTCAAAGGAGGATGGGGGATTCGTTTTACAAAAAATGATAAGATGAGAGTTCGAGCAAAATGTGTAGAAGGTTGT
AAATGGTTAGCCTATGCATCCAAGATTCAAGGGGAAGAAACCTTTCAATTGAAAACCTATGTGAATGAACATACATGTAGTAGATGCTTTAGAAATCCTCGATTGACATC
TTCTTGGTTGAGCAAACGACTAGAAAATGAAGTTAAGGACCATCCAGAAATGAAGCTAACTTCCATTCAGAAAAAGGTACAACGTGAGTACATTTCACAAATTTCTAGGG
GCAAGGCTTATAGGGCCAAACGTCTAGCATTAGATAAGGTTCGTGGTTCATACGCTGAACAATACATTCAATTATGGGAGTACTGTGAGGAATTACGTATTTCGAATCCT
GGTAGCACGACAATTTTGAAATTGAAAATCTCTCAAGAGGAAGAGGAGAATAATAGAGAACCCATTGAAAGGATTAGGGCTTAA
Protein sequenceShow/hide protein sequence
MDPRSPSLRNEESWSMSKRINLVERMGYAANYFKLWGRDPTMDVGSYKTLESNEDVETLTSLLSDRMPFEIFVEHDVDCGLSDDSSFNLDDFCGFESEDMDDYEEEECRP
LTFKCSNQGDKGKQVIEVDIESDIDIDSDELHSLDDSSDSDQGCKSKMKFPSFKQNMEENNMTNFEIGMLFNSLSEFKNVVNTYAVKGGWGIRFTKNDKMRVRAKCVEGC
KWLAYASKIQGEETFQLKTYVNEHTCSRCFRNPRLTSSWLSKRLENEVKDHPEMKLTSIQKKVQREYISQISRGKAYRAKRLALDKVRGSYAEQYIQLWEYCEELRISNP
GSTTILKLKISQEEEENNREPIERIRA