CuGenDBv2

Spg034719 (gene) of Sponge gourd (cylindrica) v1 genome

Gene ID: Spg034719
Organism: Luffa cylindrica (Sponge gourd (cylindrica) v1)
Description: Retrotransposon protein
Genome location: scaffold7:46230097..46232980
RNA-Seq Expression: Spg034719
Synteny: Spg034719
Gene Ontology terms: NA
InterPro domains: IPR024752 - Myb/SANT-like domain
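The genome location field follows a `seqid:start..end` pattern. As a minimal sketch, it can be split into its parts and used to compute the span of the gene model. The `Region` type and `parse_location` helper are hypothetical names introduced here for illustration, and the length calculation assumes 1-based, inclusive coordinates, which this page does not state.

```python
import re
from typing import NamedTuple


class Region(NamedTuple):
    """A genomic interval (hypothetical helper type, not part of CuGenDB)."""
    seqid: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumption: coordinates are 1-based and inclusive on both ends.
        return self.end - self.start + 1


# Matches strings of the form "scaffold7:46230097..46232980".
_LOCATION = re.compile(r"^(?P<seqid>[^:]+):(?P<start>\d+)\.\.(?P<end>\d+)$")


def parse_location(text: str) -> Region:
    """Parse a 'seqid:start..end' location string into a Region."""
    match = _LOCATION.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    return Region(match["seqid"], int(match["start"]), int(match["end"]))


region = parse_location("scaffold7:46230097..46232980")
print(region.seqid, region.start, region.end, region.length)
```

Under that coordinate assumption the gene model spans 2884 bp.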


Homology
GenBank top hits | e value | % identity | Alignment
KAA0033487.1 uncharacterized protein E6C27_scaffold261G00210 [Cucumis melo var. makuwa] | 3.5e-69 | 50
Query:  MLRTTSGLVGTEILDVEEMVAMFLHILAHDVKNRVIRRQFARSGETVSRHFNTTLSTVLRLYEVLLKKPVPITGSCQDGRLTKSSHALLYACRRMTGADK
        MLRT  GL  T+ +DVEEMV +FLHI+AHDVKNRV RR FARSGETVSRHFN  L+ VLRL+E+LLK+P  +T SC        SH      +  +   K
Subjt:  MLRTTSGLVGTEILDVEEMVAMFLHILAHDVKNRVIRRQFARSGETVSRHFNTTLSTVLRLYEVLLKKPVPITGSCQDGRLTKSSHALLYACRRMTGADK

Query:  QPKHIWTRLEEAKLIESLVELVHEGGWRGDNGTFKPDYLARLKRMLKEKLPTSRIESTSTIDLKVRSLKRQYSAINDMLGPGCSGFGWNEEYKCIVAEKE
          KH WT +E+  L+E L++LV +G WR DNGTFKP YL ++++++KEK+  S I+ T  ++  V+ LK+QY+ I +M+GP CSGF WN+E KCI AEK 
Subjt:  QPKHIWTRLEEAKLIESLVELVHEGGWRGDNGTFKPDYLARLKRMLKEKLPTSRIESTSTIDLKVRSLKRQYSAINDMLGPGCSGFGWNEEYKCIVAEKE

Query:  VFDEWVKSHSAAKGLLNKPFPHYEELAFMFGRDRASGSGCDVPAEQAKESPPVDVEGDSNFQGTQDYYVPIP
        V ++WVK H  A+ LLNKPFP++ +L  +FGRDRA+G  C  P E   ++   D E D      +D+ +P P
Subjt:  VFDEWVKSHSAAKGLLNKPFPHYEELAFMFGRDRASGSGCDVPAEQAKESPPVDVEGDSNFQGTQDYYVPIP

KAA0034843.1 retrotransposon protein [Cucumis melo var. makuwa] | 6.3e-71 | 35.89
Query:  KHQIRQLACFRLTHESDLCCQESTRMDRRCFAILCSMLRTTSGLVGTEILDVEEMVAMFLHILAHDVKNRVIRRQFARSGETVSRHFNTTLSTVLRLYEV
        +H+IRQLA FR+ H SDL C++STRMDRRCFAILC +LRT +GL  TE++DVEEMVAMFLHILAHDVKNRVI+R+F RSGET+SRHFN  L  V+RL++ 
Subjt:  KHQIRQLACFRLTHESDLCCQESTRMDRRCFAILCSMLRTTSGLVGTEILDVEEMVAMFLHILAHDVKNRVIRRQFARSGETVSRHFNTTLSTVLRLYEV

Query:  LLKKPVPITGSCQDGR----------------------------------------------------LT------------------------------
        LLKKP P+   C D R                                                    LT                              
Subjt:  LLKKPVPITGSCQDGR----------------------------------------------------LT------------------------------

Query:  -------------------------------------------------------------------KSSHALLYACR----------------------
                                                                           KS H +   C                       
Subjt:  -------------------------------------------------------------------KSSHALLYACR----------------------

Query:  ----RMTGADKQPKHIWTRLEEAKLIESLVELVHEGGWRGDNGTFKPDYLARLKRMLKEKLPTSRIESTSTIDLKVRSLKRQYSAINDMLGPGCSGFGWN
             MT + + PKH WT+ EEA     LVELV+ GGWR DNGTF+P YL +L RM+  K+P   I + STID +++ +KR + A+ +M GP CSGFGWN
Subjt:  ----RMTGADKQPKHIWTRLEEAKLIESLVELVHEGGWRGDNGTFKPDYLARLKRMLKEKLPTSRIESTSTIDLKVRSLKRQYSAINDMLGPGCSGFGWN

Query:  EEYKCIVAEKEVFDEWVKSHSAAKGLLNKPFPHYEELAFMFGRDRASGSGCDVPAEQAKESPP------VDVEGDSNFQGTQDYYVPIPPTQNLDT
        +E KCIVAEKEVFD+W  SH AAKGLLNK F HY+EL+++FG+DRA+G   +  A+    +PP       D   D++F       + + P   ++T
Subjt:  EEYKCIVAEKEVFDEWVKSHSAAKGLLNKPFPHYEELAFMFGRDRASGSGCDVPAEQAKESPP------VDVEGDSNFQGTQDYYVPIPPTQNLDT

TYK05796.1 retrotransposon protein [Cucumis melo var. makuwa] | 2.0e-64 | 48.12
Query:  PSDGLNYIKHQIRQLACFRLTHESDLCCQESTRMDRRCFAILCSMLRTTSGLVGTEILDVEEMVAMFLHILAHDVKNRVIRRQFARSGETVSRHFNTTLS
        PSD L     +IRQLA FR+ HESDL C++STRMDRR FAILC +L+T SGL  TEI+DVEEMVAMFLH+LAHD+KN VI+R+F RS ETVSRHFN  L 
Subjt:  PSDGLNYIKHQIRQLACFRLTHESDLCCQESTRMDRRCFAILCSMLRTTSGLVGTEILDVEEMVAMFLHILAHDVKNRVIRRQFARSGETVSRHFNTTLS

Query:  TVLRLYEVLLKKPVPITGSCQDGRLTKSSHAL-----LYACRRMTGADKQPKHIWTRLEEAKLIESLVEL-------VHEGGWRGDNGTFKPDYLARLKR
        TV+RLYE L+K+PVP+T +C+D R     + L      Y    +   D+       R  + ++  + V L       + EGG     G F          
Subjt:  TVLRLYEVLLKKPVPITGSCQDGRLTKSSHAL-----LYACRRMTGADKQPKHIWTRLEEAKLIESLVEL-------VHEGGWRGDNGTFKPDYLARLKR

Query:  MLKEKLPTSRIESTSTIDLKVRSLKRQYSAINDMLGPGCSGFGWNEEYKCIVAEKEVFDEWVKSHSAAKGLLNKPFPHYEELAFMFGRDRASG
                         ++ V        AI +M GP CSGFGWN+E KCI+ EKE+FD WV+SH A KGLLNKPFP+Y+EL ++FGRDRA+G
Subjt:  MLKEKLPTSRIESTSTIDLKVRSLKRQYSAINDMLGPGCSGFGWNEEYKCIVAEKEVFDEWVKSHSAAKGLLNKPFPHYEELAFMFGRDRASG

TYK07921.1 hypothetical protein E5676_scaffold265G00330 [Cucumis melo var. makuwa] | 8.0e-66 | 45.57
Query:  MDRRCFAILCSMLRTTSGLVGTEILDVEEMVAMFLHILAHDVKNRVIRRQFARSGETVSRHFNTTLSTVLRLYEVLLKKPVPITGSCQ-DGRLTKSSHAL
        MDRRCF ILC+MLRT  GL  T+ +DV+EMV +FLHI+AHDVKNRV RR  ARSGETVSRHFN  L+ VLRL+E+LLK+P P+T SC  DG   K + ++
Subjt:  MDRRCFAILCSMLRTTSGLVGTEILDVEEMVAMFLHILAHDVKNRVIRRQFARSGETVSRHFNTTLSTVLRLYEVLLKKPVPITGSCQ-DGRLTKSSHAL

Query:  LYACR-RMTGAD-------------------------------KQPKHIWTRLEEAKLIESLVELVHEGGWRGDNGTFKPDYLARLKRMLKEKLPTSRIE
            R R    D                               K  KH WT +E+  L+E L++LV EGGWR DNGTFK  YL                 
Subjt:  LYACR-RMTGAD-------------------------------KQPKHIWTRLEEAKLIESLVELVHEGGWRGDNGTFKPDYLARLKRMLKEKLPTSRIE

Query:  STSTIDLKVRSLKRQYSAINDMLGPGCSGFGWNEEYKCIVAEKEVFDEWVKSHSAAKGLLNKPFPHYEELAFMFGRDRASGSGCDVPAEQAKESPPVDVE
                     +QY+AI +M+GP CSGFGWNE  KCI  EK VFD+WVK H  A+GLLNKPFP++ +L  +FGRDRA+G  C  P E + ++   D E
Subjt:  STSTIDLKVRSLKRQYSAINDMLGPGCSGFGWNEEYKCIVAEKEVFDEWVKSHSAAKGLLNKPFPHYEELAFMFGRDRASGSGCDVPAEQAKESPPVDVE

Query:  GDSNFQGTQDYYVPIP
         D      +D+ +P P
Subjt:  GDSNFQGTQDYYVPIP

TYK26842.1 uncharacterized protein E5676_scaffold260G00340 [Cucumis melo var. makuwa] | 3.5e-69 | 50
Query:  MLRTTSGLVGTEILDVEEMVAMFLHILAHDVKNRVIRRQFARSGETVSRHFNTTLSTVLRLYEVLLKKPVPITGSCQDGRLTKSSHALLYACRRMTGADK
        MLRT  GL  T+ +DVEEMV +FLHI+AHDVKNRV RR FARSGETVSRHFN  L+ VLRL+E+LLK+P  +T SC        SH      +  +   K
Subjt:  MLRTTSGLVGTEILDVEEMVAMFLHILAHDVKNRVIRRQFARSGETVSRHFNTTLSTVLRLYEVLLKKPVPITGSCQDGRLTKSSHALLYACRRMTGADK

Query:  QPKHIWTRLEEAKLIESLVELVHEGGWRGDNGTFKPDYLARLKRMLKEKLPTSRIESTSTIDLKVRSLKRQYSAINDMLGPGCSGFGWNEEYKCIVAEKE
          KH WT +E+  L+E L++LV +G WR DNGTFKP YL ++++++KEK+  S I+ T  ++  V+ LK+QY+ I +M+GP CSGF WN+E KCI AEK 
Subjt:  QPKHIWTRLEEAKLIESLVELVHEGGWRGDNGTFKPDYLARLKRMLKEKLPTSRIESTSTIDLKVRSLKRQYSAINDMLGPGCSGFGWNEEYKCIVAEKE

Query:  VFDEWVKSHSAAKGLLNKPFPHYEELAFMFGRDRASGSGCDVPAEQAKESPPVDVEGDSNFQGTQDYYVPIP
        V ++WVK H  A+ LLNKPFP++ +L  +FGRDRA+G  C  P E   ++   D E D      +D+ +P P
Subjt:  VFDEWVKSHSAAKGLLNKPFPHYEELAFMFGRDRASGSGCDVPAEQAKESPPVDVEGDSNFQGTQDYYVPIP

TrEMBL top hits | e value | % identity | Alignment
A0A5A7SW62 Myb_DNA-bind_3 domain-containing protein | 1.7e-69 | 50
Query:  MLRTTSGLVGTEILDVEEMVAMFLHILAHDVKNRVIRRQFARSGETVSRHFNTTLSTVLRLYEVLLKKPVPITGSCQDGRLTKSSHALLYACRRMTGADK
        MLRT  GL  T+ +DVEEMV +FLHI+AHDVKNRV RR FARSGETVSRHFN  L+ VLRL+E+LLK+P  +T SC        SH      +  +   K
Subjt:  MLRTTSGLVGTEILDVEEMVAMFLHILAHDVKNRVIRRQFARSGETVSRHFNTTLSTVLRLYEVLLKKPVPITGSCQDGRLTKSSHALLYACRRMTGADK

Query:  QPKHIWTRLEEAKLIESLVELVHEGGWRGDNGTFKPDYLARLKRMLKEKLPTSRIESTSTIDLKVRSLKRQYSAINDMLGPGCSGFGWNEEYKCIVAEKE
          KH WT +E+  L+E L++LV +G WR DNGTFKP YL ++++++KEK+  S I+ T  ++  V+ LK+QY+ I +M+GP CSGF WN+E KCI AEK 
Subjt:  QPKHIWTRLEEAKLIESLVELVHEGGWRGDNGTFKPDYLARLKRMLKEKLPTSRIESTSTIDLKVRSLKRQYSAINDMLGPGCSGFGWNEEYKCIVAEKE

Query:  VFDEWVKSHSAAKGLLNKPFPHYEELAFMFGRDRASGSGCDVPAEQAKESPPVDVEGDSNFQGTQDYYVPIP
        V ++WVK H  A+ LLNKPFP++ +L  +FGRDRA+G  C  P E   ++   D E D      +D+ +P P
Subjt:  VFDEWVKSHSAAKGLLNKPFPHYEELAFMFGRDRASGSGCDVPAEQAKESPPVDVEGDSNFQGTQDYYVPIP

A0A5A7SWD8 Retrotransposon protein | 3.1e-71 | 35.89
Query:  KHQIRQLACFRLTHESDLCCQESTRMDRRCFAILCSMLRTTSGLVGTEILDVEEMVAMFLHILAHDVKNRVIRRQFARSGETVSRHFNTTLSTVLRLYEV
        +H+IRQLA FR+ H SDL C++STRMDRRCFAILC +LRT +GL  TE++DVEEMVAMFLHILAHDVKNRVI+R+F RSGET+SRHFN  L  V+RL++ 
Subjt:  KHQIRQLACFRLTHESDLCCQESTRMDRRCFAILCSMLRTTSGLVGTEILDVEEMVAMFLHILAHDVKNRVIRRQFARSGETVSRHFNTTLSTVLRLYEV

Query:  LLKKPVPITGSCQDGR----------------------------------------------------LT------------------------------
        LLKKP P+   C D R                                                    LT                              
Subjt:  LLKKPVPITGSCQDGR----------------------------------------------------LT------------------------------

Query:  -------------------------------------------------------------------KSSHALLYACR----------------------
                                                                           KS H +   C                       
Subjt:  -------------------------------------------------------------------KSSHALLYACR----------------------

Query:  ----RMTGADKQPKHIWTRLEEAKLIESLVELVHEGGWRGDNGTFKPDYLARLKRMLKEKLPTSRIESTSTIDLKVRSLKRQYSAINDMLGPGCSGFGWN
             MT + + PKH WT+ EEA     LVELV+ GGWR DNGTF+P YL +L RM+  K+P   I + STID +++ +KR + A+ +M GP CSGFGWN
Subjt:  ----RMTGADKQPKHIWTRLEEAKLIESLVELVHEGGWRGDNGTFKPDYLARLKRMLKEKLPTSRIESTSTIDLKVRSLKRQYSAINDMLGPGCSGFGWN

Query:  EEYKCIVAEKEVFDEWVKSHSAAKGLLNKPFPHYEELAFMFGRDRASGSGCDVPAEQAKESPP------VDVEGDSNFQGTQDYYVPIPPTQNLDT
        +E KCIVAEKEVFD+W  SH AAKGLLNK F HY+EL+++FG+DRA+G   +  A+    +PP       D   D++F       + + P   ++T
Subjt:  EEYKCIVAEKEVFDEWVKSHSAAKGLLNKPFPHYEELAFMFGRDRASGSGCDVPAEQAKESPP------VDVEGDSNFQGTQDYYVPIPPTQNLDT

A0A5D3C620 Retrotransposon protein | 9.5e-65 | 48.12
Query:  PSDGLNYIKHQIRQLACFRLTHESDLCCQESTRMDRRCFAILCSMLRTTSGLVGTEILDVEEMVAMFLHILAHDVKNRVIRRQFARSGETVSRHFNTTLS
        PSD L     +IRQLA FR+ HESDL C++STRMDRR FAILC +L+T SGL  TEI+DVEEMVAMFLH+LAHD+KN VI+R+F RS ETVSRHFN  L 
Subjt:  PSDGLNYIKHQIRQLACFRLTHESDLCCQESTRMDRRCFAILCSMLRTTSGLVGTEILDVEEMVAMFLHILAHDVKNRVIRRQFARSGETVSRHFNTTLS

Query:  TVLRLYEVLLKKPVPITGSCQDGRLTKSSHAL-----LYACRRMTGADKQPKHIWTRLEEAKLIESLVEL-------VHEGGWRGDNGTFKPDYLARLKR
        TV+RLYE L+K+PVP+T +C+D R     + L      Y    +   D+       R  + ++  + V L       + EGG     G F          
Subjt:  TVLRLYEVLLKKPVPITGSCQDGRLTKSSHAL-----LYACRRMTGADKQPKHIWTRLEEAKLIESLVEL-------VHEGGWRGDNGTFKPDYLARLKR

Query:  MLKEKLPTSRIESTSTIDLKVRSLKRQYSAINDMLGPGCSGFGWNEEYKCIVAEKEVFDEWVKSHSAAKGLLNKPFPHYEELAFMFGRDRASG
                         ++ V        AI +M GP CSGFGWN+E KCI+ EKE+FD WV+SH A KGLLNKPFP+Y+EL ++FGRDRA+G
Subjt:  MLKEKLPTSRIESTSTIDLKVRSLKRQYSAINDMLGPGCSGFGWNEEYKCIVAEKEVFDEWVKSHSAAKGLLNKPFPHYEELAFMFGRDRASG

A0A5D3C7T4 Uncharacterized protein | 3.9e-66 | 45.57
Query:  MDRRCFAILCSMLRTTSGLVGTEILDVEEMVAMFLHILAHDVKNRVIRRQFARSGETVSRHFNTTLSTVLRLYEVLLKKPVPITGSCQ-DGRLTKSSHAL
        MDRRCF ILC+MLRT  GL  T+ +DV+EMV +FLHI+AHDVKNRV RR  ARSGETVSRHFN  L+ VLRL+E+LLK+P P+T SC  DG   K + ++
Subjt:  MDRRCFAILCSMLRTTSGLVGTEILDVEEMVAMFLHILAHDVKNRVIRRQFARSGETVSRHFNTTLSTVLRLYEVLLKKPVPITGSCQ-DGRLTKSSHAL

Query:  LYACR-RMTGAD-------------------------------KQPKHIWTRLEEAKLIESLVELVHEGGWRGDNGTFKPDYLARLKRMLKEKLPTSRIE
            R R    D                               K  KH WT +E+  L+E L++LV EGGWR DNGTFK  YL                 
Subjt:  LYACR-RMTGAD-------------------------------KQPKHIWTRLEEAKLIESLVELVHEGGWRGDNGTFKPDYLARLKRMLKEKLPTSRIE

Query:  STSTIDLKVRSLKRQYSAINDMLGPGCSGFGWNEEYKCIVAEKEVFDEWVKSHSAAKGLLNKPFPHYEELAFMFGRDRASGSGCDVPAEQAKESPPVDVE
                     +QY+AI +M+GP CSGFGWNE  KCI  EK VFD+WVK H  A+GLLNKPFP++ +L  +FGRDRA+G  C  P E + ++   D E
Subjt:  STSTIDLKVRSLKRQYSAINDMLGPGCSGFGWNEEYKCIVAEKEVFDEWVKSHSAAKGLLNKPFPHYEELAFMFGRDRASGSGCDVPAEQAKESPPVDVE

Query:  GDSNFQGTQDYYVPIP
         D      +D+ +P P
Subjt:  GDSNFQGTQDYYVPIP

A0A5D3DTL0 Myb_DNA-bind_3 domain-containing protein | 1.7e-69 | 50
Query:  MLRTTSGLVGTEILDVEEMVAMFLHILAHDVKNRVIRRQFARSGETVSRHFNTTLSTVLRLYEVLLKKPVPITGSCQDGRLTKSSHALLYACRRMTGADK
        MLRT  GL  T+ +DVEEMV +FLHI+AHDVKNRV RR FARSGETVSRHFN  L+ VLRL+E+LLK+P  +T SC        SH      +  +   K
Subjt:  MLRTTSGLVGTEILDVEEMVAMFLHILAHDVKNRVIRRQFARSGETVSRHFNTTLSTVLRLYEVLLKKPVPITGSCQDGRLTKSSHALLYACRRMTGADK

Query:  QPKHIWTRLEEAKLIESLVELVHEGGWRGDNGTFKPDYLARLKRMLKEKLPTSRIESTSTIDLKVRSLKRQYSAINDMLGPGCSGFGWNEEYKCIVAEKE
          KH WT +E+  L+E L++LV +G WR DNGTFKP YL ++++++KEK+  S I+ T  ++  V+ LK+QY+ I +M+GP CSGF WN+E KCI AEK 
Subjt:  QPKHIWTRLEEAKLIESLVELVHEGGWRGDNGTFKPDYLARLKRMLKEKLPTSRIESTSTIDLKVRSLKRQYSAINDMLGPGCSGFGWNEEYKCIVAEKE

Query:  VFDEWVKSHSAAKGLLNKPFPHYEELAFMFGRDRASGSGCDVPAEQAKESPPVDVEGDSNFQGTQDYYVPIP
        V ++WVK H  A+ LLNKPFP++ +L  +FGRDRA+G  C  P E   ++   D E D      +D+ +P P
Subjt:  VFDEWVKSHSAAKGLLNKPFPHYEELAFMFGRDRASGSGCDVPAEQAKESPPVDVEGDSNFQGTQDYYVPIP

SwissProt top hits | e value | % identity | Alignment
No hits found
Arabidopsis top hits | e value | % identity | Alignment
AT1G30140.1 unknown protein | 7.3e-09 | 30.77
Query:  GADKQPKHIWTRLEEAKLIESLVELVHEGGWRGDNGTF-KPDYLARLKRMLKEKLPTSRIESTSTIDLKVRSLKRQYSAINDMLGPGCSGFGWNEEYKCI
        G +K P + WT  E     + L+EL+ +  WR  +G   K    ++L   L ++L  ++        LK   LK  Y +  D L    SGFGW+ E K  
Subjt:  GADKQPKHIWTRLEEAKLIESLVELVHEGGWRGDNGTF-KPDYLARLKRMLKEKLPTSRIESTSTIDLKVRSLKRQYSAINDMLGPGCSGFGWNEEYKCI

Query:  VAEKEVFDEWVKSHSAAKGLLNKPFPHYEELAFMFGRDRASGS
         A  EV+ +++K+H   K +  +   H+E+L  +FG   A+GS
Subjt:  VAEKEVFDEWVKSHSAAKGLLNKPFPHYEELAFMFGRDRASGS

AT4G02210.1 unknown protein | 2.1e-08 | 27.96
Query:  SRIESTSTIDL---KVRSLKRQYSAINDMLGPGCSGFGWNEEYKCIVAEKEVFDEWVKSHSAAKGLLNKPFPHYEELAFMFGRDRASGSGCDV
        ++ ES   +D+   + +SL+RQ++AI  +L     GF W+ E + + A+  V+ +++K+H  A+  + +P P+Y++L  + G      + C V
Subjt:  SRIESTSTIDL---KVRSLKRQYSAINDMLGPGCSGFGWNEEYKCIVAEKEVFDEWVKSHSAAKGLLNKPFPHYEELAFMFGRDRASGSGCDV

AT4G02210.2 unknown protein | 2.1e-08 | 27.96
Query:  SRIESTSTIDL---KVRSLKRQYSAINDMLGPGCSGFGWNEEYKCIVAEKEVFDEWVKSHSAAKGLLNKPFPHYEELAFMFGRDRASGSGCDV
        ++ ES   +D+   + +SL+RQ++AI  +L     GF W+ E + + A+  V+ +++K+H  A+  + +P P+Y++L  + G      + C V
Subjt:  SRIESTSTIDL---KVRSLKRQYSAINDMLGPGCSGFGWNEEYKCIVAEKEVFDEWVKSHSAAKGLLNKPFPHYEELAFMFGRDRASGSGCDV

AT5G41980.1 CONTAINS InterPro DOMAIN/s: Putative harbinger transposase-derived nuclease (InterPro:IPR006912) | 1.0e-10 | 39.74
Query:  CQESTRMDRRCFAILCSMLRTTSGLVGTEILDVEEMVAMFLHILAHDVKNRVIRRQFARSGETVSRHFNTTLSTVLRL
        C E+ RMD+  F  LC +L+T   L  T  + +E  +A+FL I+ H+++ R ++  F  SGET+SRHFN  L+ V+ +
Subjt:  CQESTRMDRRCFAILCSMLRTTSGLVGTEILDVEEMVAMFLHILAHDVKNRVIRRQFARSGETVSRHFNTTLSTVLRL


Sequences
CDS sequence
ATGCAAACAAAGAATAGCGAGACCGCATATCCAAAGCCAGCCAGCAGTTGCCGGACGGGACTTCCTTGCCTTCAGCCTAGCCGCCTTGTCTTATCCCGCCCTTTCGGACT
ACCTTTCCTTACTTTAGGGCTTGCTTTCCTTGCCTTGCTTCAGACGGGATGCCAAGCACTAATAAGAGCAACAGCTAATATCGTTCCTCCCTTATCTCTTAAAGCTTCTG
CCTTCAGACTTGCCTTGCTTTCCTGCCTTCGGCCTTCAGACGGATTGAACTACATCAAACATCAGATAAGACAATTAGCCTGCTTCCGGTTGACCCATGAAAGTGACCTA
TGTTGTCAAGAGAGCACGAGGATGGATAGGAGATGTTTCGCCATCCTATGCAGTATGCTTAGGACGACTTCCGGTTTGGTAGGGACTGAAATCTTAGACGTTGAAGAGAT
GGTTGCGATGTTCTTACACATCCTTGCTCACGACGTTAAGAATAGAGTCATAAGAAGACAATTTGCACGGTCAGGAGAGACGGTTTCTCGGCACTTCAACACAACTCTAA
GCACCGTACTACGGTTGTACGAAGTTCTACTTAAGAAACCGGTACCGATCACGGGTTCTTGCCAGGATGGGAGACTTACGAAATCTAGTCACGCACTACTATATGCATGT
AGGAGAATGACAGGAGCAGACAAACAACCAAAACATATATGGACAAGGTTGGAGGAGGCAAAATTGATTGAAAGCCTGGTGGAGCTGGTACATGAAGGTGGGTGGCGAGG
TGATAATGGAACGTTCAAACCCGACTACCTTGCACGACTAAAGCGTATGCTGAAGGAAAAATTACCGACATCCAGGATCGAATCAACATCTACAATTGACTTGAAGGTAC
GGTCGTTGAAAAGGCAGTACAGTGCGATTAACGACATGTTGGGGCCCGGATGCAGCGGATTCGGATGGAATGAAGAGTATAAGTGCATTGTGGCGGAGAAAGAAGTCTTC
GATGAGTGGGTGAAGTCCCACTCAGCTGCGAAGGGTCTATTGAACAAGCCATTTCCTCATTATGAGGAACTCGCCTTCATGTTTGGTCGAGATCGGGCTAGTGGATCAGG
GTGCGATGTACCAGCGGAACAAGCTAAAGAAAGCCCCCCAGTGGACGTGGAGGGGGATTCAAACTTCCAAGGAACACAAGATTATTATGTCCCCATCCCTCCAACACAAA
ATTTGGACACGGACGTGGAGATTGAGGACTTTCTGATAGCCCTAATTTCGACTAGAGACCTTTGGTGGGAGCTGAAGCAAGGCCAAGGAATGGGAATCAAGCAAGAAGAC
GTGGAGATCGTGACGGGGACGTGCCGGTTTCGATGGATTCGAGTTTTCCTTTCTCTTTCCCTTAACTCTTGTAGTTTTTATGGCTTTTCTTACAATTAG
mRNA sequence
ATGCAAACAAAGAATAGCGAGACCGCATATCCAAAGCCAGCCAGCAGTTGCCGGACGGGACTTCCTTGCCTTCAGCCTAGCCGCCTTGTCTTATCCCGCCCTTTCGGACT
ACCTTTCCTTACTTTAGGGCTTGCTTTCCTTGCCTTGCTTCAGACGGGATGCCAAGCACTAATAAGAGCAACAGCTAATATCGTTCCTCCCTTATCTCTTAAAGCTTCTG
CCTTCAGACTTGCCTTGCTTTCCTGCCTTCGGCCTTCAGACGGATTGAACTACATCAAACATCAGATAAGACAATTAGCCTGCTTCCGGTTGACCCATGAAAGTGACCTA
TGTTGTCAAGAGAGCACGAGGATGGATAGGAGATGTTTCGCCATCCTATGCAGTATGCTTAGGACGACTTCCGGTTTGGTAGGGACTGAAATCTTAGACGTTGAAGAGAT
GGTTGCGATGTTCTTACACATCCTTGCTCACGACGTTAAGAATAGAGTCATAAGAAGACAATTTGCACGGTCAGGAGAGACGGTTTCTCGGCACTTCAACACAACTCTAA
GCACCGTACTACGGTTGTACGAAGTTCTACTTAAGAAACCGGTACCGATCACGGGTTCTTGCCAGGATGGGAGACTTACGAAATCTAGTCACGCACTACTATATGCATGT
AGGAGAATGACAGGAGCAGACAAACAACCAAAACATATATGGACAAGGTTGGAGGAGGCAAAATTGATTGAAAGCCTGGTGGAGCTGGTACATGAAGGTGGGTGGCGAGG
TGATAATGGAACGTTCAAACCCGACTACCTTGCACGACTAAAGCGTATGCTGAAGGAAAAATTACCGACATCCAGGATCGAATCAACATCTACAATTGACTTGAAGGTAC
GGTCGTTGAAAAGGCAGTACAGTGCGATTAACGACATGTTGGGGCCCGGATGCAGCGGATTCGGATGGAATGAAGAGTATAAGTGCATTGTGGCGGAGAAAGAAGTCTTC
GATGAGTGGGTGAAGTCCCACTCAGCTGCGAAGGGTCTATTGAACAAGCCATTTCCTCATTATGAGGAACTCGCCTTCATGTTTGGTCGAGATCGGGCTAGTGGATCAGG
GTGCGATGTACCAGCGGAACAAGCTAAAGAAAGCCCCCCAGTGGACGTGGAGGGGGATTCAAACTTCCAAGGAACACAAGATTATTATGTCCCCATCCCTCCAACACAAA
ATTTGGACACGGACGTGGAGATTGAGGACTTTCTGATAGCCCTAATTTCGACTAGAGACCTTTGGTGGGAGCTGAAGCAAGGCCAAGGAATGGGAATCAAGCAAGAAGAC
GTGGAGATCGTGACGGGGACGTGCCGGTTTCGATGGATTCGAGTTTTCCTTTCTCTTTCCCTTAACTCTTGTAGTTTTTATGGCTTTTCTTACAATTAG
Protein sequence
MQTKNSETAYPKPASSCRTGLPCLQPSRLVLSRPFGLPFLTLGLAFLALLQTGCQALIRATANIVPPLSLKASAFRLALLSCLRPSDGLNYIKHQIRQLACFRLTHESDL
CCQESTRMDRRCFAILCSMLRTTSGLVGTEILDVEEMVAMFLHILAHDVKNRVIRRQFARSGETVSRHFNTTLSTVLRLYEVLLKKPVPITGSCQDGRLTKSSHALLYAC
RRMTGADKQPKHIWTRLEEAKLIESLVELVHEGGWRGDNGTFKPDYLARLKRMLKEKLPTSRIESTSTIDLKVRSLKRQYSAINDMLGPGCSGFGWNEEYKCIVAEKEVF
DEWVKSHSAAKGLLNKPFPHYEELAFMFGRDRASGSGCDVPAEQAKESPPVDVEGDSNFQGTQDYYVPIPPTQNLDTDVEIEDFLIALISTRDLWWELKQGQGMGIKQED
VEIVTGTCRFRWIRVFLSLSLNSCSFYGFSYN
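
The CDS and protein sequences listed above can be cross-checked by translating the CDS codon by codon. The following is a minimal sketch that assumes the standard genetic code, which is the usual choice for a plant nuclear gene but is not stated on this page. The `translate` helper and `CODON_TABLE` are hypothetical names introduced here, not part of any CuGenDB tooling.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon.

    Whitespace and line breaks are ignored, so a wrapped sequence can be
    pasted in directly. Codons containing anything other than A, C, G, T
    are rendered as 'X'.
    """
    seq = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First seven codons and last three codons of the CDS listed above.
print(translate("ATGCAAACAAAGAATAGCGAG"))  # MQTKNSE
print(translate("TACAATTAG"))              # YN
```

The two fragments reproduce the start (`MQTKNSE`) and end (`YN`, followed by a stop codon) of the listed protein sequence. Passing the full CDS to `translate` allows the whole protein to be compared in the same way.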