CuGenDBv2

Lag0021564 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0021564
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: M protein, serotype 5-like
Genome location: chr7:9237940..9241098
RNA-Seq Expression: Lag0021564
Synteny: Lag0021564
Gene Ontology terms: NA
InterPro domains: NA


Homology
GenBank top hits (e-value, % identity, alignment)
CAC5386276.1 unnamed protein product [Mytilus coruscus]; e-value 1.2e-04; identity 24.01%
Query:  SRQKTLKRDQQANRPIKKINKSAGRSSKRINKLTSRSNRSSSQQADPRDQQANRPIKKINKSAGRS---SKRSTSQPTDQEDQQVSRLIIQEDQQANRSN
        SR++T   +   N+  +K N +  +S +R N  T++S R ++   +   ++ N    K  +    +   ++R T+  TD+  ++        +   N+S 
Subjt:  SRQKTLKRDQQANRPIKKINKSAGRSSKRINKLTSRSNRSSSQQADPRDQQANRPIKKINKSAGRS---SKRSTSQPTDQEDQQVSRLIIQEDQQANRSN

Query:  RSSSQQADPRDHQINKPTDRSRRKTSQQADHPRRSTSQPTDQEDQQVCRPIIQEDQQANRSIKRINKSANRSSKSINKLTSRSNRSSSQHADPTDHQVNR
        R ++   D    + N  T++SRRKT+   +  RR T+  T++  ++        +   N+S ++ N + N S +  N  T  S R ++   D +  + N 
Subjt:  RSSSQQADPRDHQINKPTDRSRRKTSQQADHPRRSTSQPTDQEDQQVCRPIIQEDQQANRSIKRINKSANRSSKSINKLTSRSNRSSSQHADPTDHQVNR

Query:  LIIQEDQQANRPIKK----INKSAGRSSKKINKPTDRSRGSINHQANRPIKKINKSADRSSKKINKPTGRSSKKINKSADRSSNRSSEKINKSADRSSKK
         I +  Q+ N    K     N +  +S +  N  T++SR   N+  N+  ++ N + + S +K N  T +S +K N     ++N+S  + N +A +S ++
Subjt:  LIIQEDQQANRPIKK----INKSAGRSSKKINKPTDRSRGSINHQANRPIKKINKSADRSSKKINKPTGRSSKKINKSADRSSNRSSEKINKSADRSSKK

Query:  IIKPTGRSSKKINKLADRSSNRSSEKINKSADRSSKKINKPTGRSSKKINKSQQ
            T +S ++ N  A +S  R++   NKS  R++   +K   R++   NKSQ+
Subjt:  IIKPTGRSSKKINKLADRSSNRSSEKINKSADRSSKKINKPTGRSSKKINKSQQ

KAG5700683.1 hypothetical protein BaRGS_029048 [Batillaria attramentaria]; e-value 7.7e-07; identity 28.95%
Query:  RQKTLKRDQQANRPIKKINKSAGRSSKRINKLTSRSNRSSSQQADPR--DQQANRPI-KKINKSAGRSSKRSTSQPTDQEDQQVSRLIIQEDQQANRSNR
        +Q     +++ N  I+KIN++   + ++IN+   + N  S+++   R  DQQ  R   +KIN +     +      T  EDQQ  R   +EDQQ  + NR
Subjt:  RQKTLKRDQQANRPIKKINKSAGRSSKRINKLTSRSNRSSSQQADPR--DQQANRPI-KKINKSAGRSSKRSTSQPTDQEDQQVSRLIIQEDQQANRSNR

Query:  SSSQQ-ADPRDHQINKPTDRSRRKTSQQADHPRRSTSQPTDQEDQQVCRPIIQ--EDQQANRSIKR-INKSANRSSKSINKLTSRSN----------RSS
        +  QQ     + +IN+  D+   +++++ D      ++  DQ++ +     I   EDQQ  RS +R IN++ ++ +  IN+   + N          R  
Subjt:  SSSQQ-ADPRDHQINKPTDRSRRKTSQQADHPRRSTSQPTDQEDQQVCRPIIQ--EDQQANRSIKR-INKSANRSSKSINKLTSRSN----------RSS

Query:  SQHADPTDHQVNRLIIQEDQQANRPIKKINKSAGRSSKKINKPTDRSRG-----SINHQANRPIKKINKSADRSSKKINKPTGRSSKKINKSADRSSNRS
         Q+   T+ ++NR    EDQQ      KIN++  + + KIN+  D+  G      IN       +KIN++ D+ + KIN+   + + KIN++ D+ + RS
Subjt:  SQHADPTDHQVNRLIIQEDQQANRPIKKINKSAGRSSKKINKPTDRSRG-----SINHQANRPIKKINKSADRSSKKINKPTGRSSKKINKSADRSSNRS

Query:  SE------------KINKSAD----RSSKKIIKPTGRSS-KKINKLADRSSNRSSE-KINKSADRSSKKINKPTGRSSKK
        +E            KIN++ D    RS+++  +  GRS+ +KIN+  D+   RS+E KIN++   + +KIN+ TGRS+++
Subjt:  SE------------KINKSAD----RSSKKIIKPTGRSS-KKINKLADRSSNRSSE-KINKSADRSSKKINKPTGRSSKK

PAA71337.1 hypothetical protein BOX15_Mlig005876g1 [Macrostomum lignano]; e-value 9.4e-13; identity 26.58%
Query:  SRQKTLKRDQQA-----NRPIKKINKSAGRSSKRINKLTS------------------RSNRSSSQQADPRDQQANRPIKKINKSAGRS-----------
        SR+   KR +++     +R  +K+N+  GR S+++N+  S                  RS +SS ++   + ++ NR  K+  KS+G+            
Subjt:  SRQKTLKRDQQA-----NRPIKKINKSAGRSSKRINKLTS------------------RSNRSSSQQADPRDQQANRPIKKINKSAGRS-----------

Query:  -SKRSTSQPTDQEDQQVSRLIIQEDQQAN--RSNRSSSQQADPRDHQINKPTDRSRRKTSQQADHPRRSTSQPTDQEDQQVCRPIIQEDQQANRSIKRIN
         S++S+ +   ++ ++V+R   ++ ++ N  RS +SS ++   +  ++N    R R + S      R+S+ +   ++ ++V R   ++ ++ NR   R +
Subjt:  -SKRSTSQPTDQEDQQVSRLIIQEDQQAN--RSNRSSSQQADPRDHQINKPTDRSRRKTSQQADHPRRSTSQPTDQEDQQVCRPIIQEDQQANRSIKRIN

Query:  KSANRSSKSINKLTSRSNRSSSQHADPTDHQVNRLIIQED--QQANRPIKKINKSAGRSSKKINKPTDRSRGSINHQANRPIKKINKSADRSSKKINKPT
            RS KS      RS +SS +       +VNR   ++   ++ +R  +K+N+  GR S+K+N+   RSR S   + +R  +K+N+   R S+K+N+  
Subjt:  KSANRSSKSINKLTSRSNRSSSQHADPTDHQVNRLIIQED--QQANRPIKKINKSAGRSSKKINKPTDRSRGSINHQANRPIKKINKSADRSSKKINKPT

Query:  GRSSKKIN-----KSADRSSNRSSEKINKSADRSSKKIIKPTGRSSKKINKLADRSS-----NRSSEKINKSADRSSKKINKPTGRSSKKINKSQ
         R S+K+N     KS+ +  +R S K+N+   R S+K+ +  GR S+K+N+   R S     +R S K+N+   R S+K+N+  GR S+K+N+ +
Subjt:  GRSSKKIN-----KSADRSSNRSSEKINKSADRSSKKIIKPTGRSSKKINKLADRSS-----NRSSEKINKSADRSSKKINKPTGRSSKKINKSQ

PAA71360.1 hypothetical protein BOX15_Mlig020368g1 [Macrostomum lignano]; e-value 3.9e-11; identity 25.91%
Query:  SRQKTLKRDQQA-----NRPIKKINKSAGRSSKRINKLTSRSNRSSSQQADPRDQQANRPIKKINKSAGRSSKRSTSQPTDQEDQQVSRLIIQEDQQANR
        SR+   KR +++     +R  +K+N+  GR S+++N+   R +    +      ++ +R  +K+N+  GR S R  ++   ++ ++V+R   +  +++ R
Subjt:  SRQKTLKRDQQA-----NRPIKKINKSAGRSSKRINKLTSRSNRSSSQQADPRDQQANRPIKKINKSAGRSSKRSTSQPTDQEDQQVSRLIIQEDQQANR

Query:  SNRSSSQQADPRDHQINKPTDRSRRKTSQQADHPRRSTSQPTDQEDQQVCRPIIQEDQQANRSIKRINKSANRSSKSINKLTSRSNRSSSQHADPTDHQV
         NR  S+++  +         RSR+         R+S  +   ++ +++ R   ++     RS K    +  RS KS  K  SR +R  ++       +V
Subjt:  SNRSSSQQADPRDHQINKPTDRSRRKTSQQADHPRRSTSQPTDQEDQQVCRPIIQEDQQANRSIKRINKSANRSSKSINKLTSRSNRSSSQHADPTDHQV

Query:  NRLIIQEDQQANRPIKKINKSAGRSSKKINKPTDRSRGSINHQANRPIKKINKSADRSSKKINKPTGRSS---KKINKSADRSSNRSSEKINKSADRSSK
        NR   ++     R  K   K   R S+++N+   R    +N +  R  +K+N+   R S+K+N+  GR S   K+  KS+ +  +R S K+N+   R S+
Subjt:  NRLIIQEDQQANRPIKKINKSAGRSSKKINKPTDRSRGSINHQANRPIKKINKSADRSSKKINKPTGRSS---KKINKSADRSSNRSSEKINKSADRSSK

Query:  KIIKPTGRSSKKIN-----KLADRSSNRSSEKINKSADRSSKKINKPTGRSSKKINKSQ
        K+ +  GR S+K+N     K A +  +R S K+N+   R S+K+N+  GR S+K+N+ +
Subjt:  KIIKPTGRSSKKIN-----KLADRSSNRSSEKINKSADRSSKKINKPTGRSSKKINKSQ

QAX24809.1 adhesion protein 1 [Macrostomum lignano]; e-value 3.9e-11; identity 25.91%
Query:  SRQKTLKRDQQA-----NRPIKKINKSAGRSSKRINKLTSRSNRSSSQQADPRDQQANRPIKKINKSAGRSSKRSTSQPTDQEDQQVSRLIIQEDQQANR
        SR+   KR +++     +R  +K+N+  GR S+++N+   R +    +      ++ +R  +K+N+  GR S R  ++   ++ ++V+R   +  +++ R
Subjt:  SRQKTLKRDQQA-----NRPIKKINKSAGRSSKRINKLTSRSNRSSSQQADPRDQQANRPIKKINKSAGRSSKRSTSQPTDQEDQQVSRLIIQEDQQANR

Query:  SNRSSSQQADPRDHQINKPTDRSRRKTSQQADHPRRSTSQPTDQEDQQVCRPIIQEDQQANRSIKRINKSANRSSKSINKLTSRSNRSSSQHADPTDHQV
         NR  S+++  +         RSR+         R+S  +   ++ +++ R   ++     RS K    +  RS KS  K  SR +R  ++       +V
Subjt:  SNRSSSQQADPRDHQINKPTDRSRRKTSQQADHPRRSTSQPTDQEDQQVCRPIIQEDQQANRSIKRINKSANRSSKSINKLTSRSNRSSSQHADPTDHQV

Query:  NRLIIQEDQQANRPIKKINKSAGRSSKKINKPTDRSRGSINHQANRPIKKINKSADRSSKKINKPTGRSS---KKINKSADRSSNRSSEKINKSADRSSK
        NR   ++     R  K   K   R S+++N+   R    +N +  R  +K+N+   R S+K+N+  GR S   K+  KS+ +  +R S K+N+   R S+
Subjt:  NRLIIQEDQQANRPIKKINKSAGRSSKKINKPTDRSRGSINHQANRPIKKINKSADRSSKKINKPTGRSS---KKINKSADRSSNRSSEKINKSADRSSK

Query:  KIIKPTGRSSKKIN-----KLADRSSNRSSEKINKSADRSSKKINKPTGRSSKKINKSQ
        K+ +  GR S+K+N     K A +  +R S K+N+   R S+K+N+  GR S+K+N+ +
Subjt:  KIIKPTGRSSKKIN-----KLADRSSNRSSEKINKSADRSSKKINKPTGRSSKKINKSQ

TrEMBL top hits (e-value, % identity, alignment)
A0A267FC56 Uncharacterized protein; e-value 1.9e-11; identity 25.91%
Query:  SRQKTLKRDQQA-----NRPIKKINKSAGRSSKRINKLTSRSNRSSSQQADPRDQQANRPIKKINKSAGRSSKRSTSQPTDQEDQQVSRLIIQEDQQANR
        SR+   KR +++     +R  +K+N+  GR S+++N+   R +    +      ++ +R  +K+N+  GR S R  ++   ++ ++V+R   +  +++ R
Subjt:  SRQKTLKRDQQA-----NRPIKKINKSAGRSSKRINKLTSRSNRSSSQQADPRDQQANRPIKKINKSAGRSSKRSTSQPTDQEDQQVSRLIIQEDQQANR

Query:  SNRSSSQQADPRDHQINKPTDRSRRKTSQQADHPRRSTSQPTDQEDQQVCRPIIQEDQQANRSIKRINKSANRSSKSINKLTSRSNRSSSQHADPTDHQV
         NR  S+++  +         RSR+         R+S  +   ++ +++ R   ++     RS K    +  RS KS  K  SR +R  ++       +V
Subjt:  SNRSSSQQADPRDHQINKPTDRSRRKTSQQADHPRRSTSQPTDQEDQQVCRPIIQEDQQANRSIKRINKSANRSSKSINKLTSRSNRSSSQHADPTDHQV

Query:  NRLIIQEDQQANRPIKKINKSAGRSSKKINKPTDRSRGSINHQANRPIKKINKSADRSSKKINKPTGRSS---KKINKSADRSSNRSSEKINKSADRSSK
        NR   ++     R  K   K   R S+++N+   R    +N +  R  +K+N+   R S+K+N+  GR S   K+  KS+ +  +R S K+N+   R S+
Subjt:  NRLIIQEDQQANRPIKKINKSAGRSSKKINKPTDRSRGSINHQANRPIKKINKSADRSSKKINKPTGRSS---KKINKSADRSSNRSSEKINKSADRSSK

Query:  KIIKPTGRSSKKIN-----KLADRSSNRSSEKINKSADRSSKKINKPTGRSSKKINKSQ
        K+ +  GR S+K+N     K A +  +R S K+N+   R S+K+N+  GR S+K+N+ +
Subjt:  KIIKPTGRSSKKIN-----KLADRSSNRSSEKINKSADRSSKKINKPTGRSSKKINKSQ

A0A267FDU5 Uncharacterized protein; e-value 4.5e-13; identity 26.58%
Query:  SRQKTLKRDQQA-----NRPIKKINKSAGRSSKRINKLTS------------------RSNRSSSQQADPRDQQANRPIKKINKSAGRS-----------
        SR+   KR +++     +R  +K+N+  GR S+++N+  S                  RS +SS ++   + ++ NR  K+  KS+G+            
Subjt:  SRQKTLKRDQQA-----NRPIKKINKSAGRSSKRINKLTS------------------RSNRSSSQQADPRDQQANRPIKKINKSAGRS-----------

Query:  -SKRSTSQPTDQEDQQVSRLIIQEDQQAN--RSNRSSSQQADPRDHQINKPTDRSRRKTSQQADHPRRSTSQPTDQEDQQVCRPIIQEDQQANRSIKRIN
         S++S+ +   ++ ++V+R   ++ ++ N  RS +SS ++   +  ++N    R R + S      R+S+ +   ++ ++V R   ++ ++ NR   R +
Subjt:  -SKRSTSQPTDQEDQQVSRLIIQEDQQAN--RSNRSSSQQADPRDHQINKPTDRSRRKTSQQADHPRRSTSQPTDQEDQQVCRPIIQEDQQANRSIKRIN

Query:  KSANRSSKSINKLTSRSNRSSSQHADPTDHQVNRLIIQED--QQANRPIKKINKSAGRSSKKINKPTDRSRGSINHQANRPIKKINKSADRSSKKINKPT
            RS KS      RS +SS +       +VNR   ++   ++ +R  +K+N+  GR S+K+N+   RSR S   + +R  +K+N+   R S+K+N+  
Subjt:  KSANRSSKSINKLTSRSNRSSSQHADPTDHQVNRLIIQED--QQANRPIKKINKSAGRSSKKINKPTDRSRGSINHQANRPIKKINKSADRSSKKINKPT

Query:  GRSSKKIN-----KSADRSSNRSSEKINKSADRSSKKIIKPTGRSSKKINKLADRSS-----NRSSEKINKSADRSSKKINKPTGRSSKKINKSQ
         R S+K+N     KS+ +  +R S K+N+   R S+K+ +  GR S+K+N+   R S     +R S K+N+   R S+K+N+  GR S+K+N+ +
Subjt:  GRSSKKIN-----KSADRSSNRSSEKINKSADRSSKKIIKPTGRSSKKINKLADRSS-----NRSSEKINKSADRSSKKINKPTGRSSKKINKSQ

A0A411ACX2 Adhesion protein 1; e-value 1.9e-11; identity 25.91%
Query:  SRQKTLKRDQQA-----NRPIKKINKSAGRSSKRINKLTSRSNRSSSQQADPRDQQANRPIKKINKSAGRSSKRSTSQPTDQEDQQVSRLIIQEDQQANR
        SR+   KR +++     +R  +K+N+  GR S+++N+   R +    +      ++ +R  +K+N+  GR S R  ++   ++ ++V+R   +  +++ R
Subjt:  SRQKTLKRDQQA-----NRPIKKINKSAGRSSKRINKLTSRSNRSSSQQADPRDQQANRPIKKINKSAGRSSKRSTSQPTDQEDQQVSRLIIQEDQQANR

Query:  SNRSSSQQADPRDHQINKPTDRSRRKTSQQADHPRRSTSQPTDQEDQQVCRPIIQEDQQANRSIKRINKSANRSSKSINKLTSRSNRSSSQHADPTDHQV
         NR  S+++  +         RSR+         R+S  +   ++ +++ R   ++     RS K    +  RS KS  K  SR +R  ++       +V
Subjt:  SNRSSSQQADPRDHQINKPTDRSRRKTSQQADHPRRSTSQPTDQEDQQVCRPIIQEDQQANRSIKRINKSANRSSKSINKLTSRSNRSSSQHADPTDHQV

Query:  NRLIIQEDQQANRPIKKINKSAGRSSKKINKPTDRSRGSINHQANRPIKKINKSADRSSKKINKPTGRSS---KKINKSADRSSNRSSEKINKSADRSSK
        NR   ++     R  K   K   R S+++N+   R    +N +  R  +K+N+   R S+K+N+  GR S   K+  KS+ +  +R S K+N+   R S+
Subjt:  NRLIIQEDQQANRPIKKINKSAGRSSKKINKPTDRSRGSINHQANRPIKKINKSADRSSKKINKPTGRSS---KKINKSADRSSNRSSEKINKSADRSSK

Query:  KIIKPTGRSSKKIN-----KLADRSSNRSSEKINKSADRSSKKINKPTGRSSKKINKSQ
        K+ +  GR S+K+N     K A +  +R S K+N+   R S+K+N+  GR S+K+N+ +
Subjt:  KIIKPTGRSSKKIN-----KLADRSSNRSSEKINKSADRSSKKINKPTGRSSKKINKSQ

A0A6J8BRX0 Uncharacterized protein; e-value 5.9e-05; identity 24.01%
Query:  SRQKTLKRDQQANRPIKKINKSAGRSSKRINKLTSRSNRSSSQQADPRDQQANRPIKKINKSAGRS---SKRSTSQPTDQEDQQVSRLIIQEDQQANRSN
        SR++T   +   N+  +K N +  +S +R N  T++S R ++   +   ++ N    K  +    +   ++R T+  TD+  ++        +   N+S 
Subjt:  SRQKTLKRDQQANRPIKKINKSAGRSSKRINKLTSRSNRSSSQQADPRDQQANRPIKKINKSAGRS---SKRSTSQPTDQEDQQVSRLIIQEDQQANRSN

Query:  RSSSQQADPRDHQINKPTDRSRRKTSQQADHPRRSTSQPTDQEDQQVCRPIIQEDQQANRSIKRINKSANRSSKSINKLTSRSNRSSSQHADPTDHQVNR
        R ++   D    + N  T++SRRKT+   +  RR T+  T++  ++        +   N+S ++ N + N S +  N  T  S R ++   D +  + N 
Subjt:  RSSSQQADPRDHQINKPTDRSRRKTSQQADHPRRSTSQPTDQEDQQVCRPIIQEDQQANRSIKRINKSANRSSKSINKLTSRSNRSSSQHADPTDHQVNR

Query:  LIIQEDQQANRPIKK----INKSAGRSSKKINKPTDRSRGSINHQANRPIKKINKSADRSSKKINKPTGRSSKKINKSADRSSNRSSEKINKSADRSSKK
         I +  Q+ N    K     N +  +S +  N  T++SR   N+  N+  ++ N + + S +K N  T +S +K N     ++N+S  + N +A +S ++
Subjt:  LIIQEDQQANRPIKK----INKSAGRSSKKINKPTDRSRGSINHQANRPIKKINKSADRSSKKINKPTGRSSKKINKSADRSSNRSSEKINKSADRSSKK

Query:  IIKPTGRSSKKINKLADRSSNRSSEKINKSADRSSKKINKPTGRSSKKINKSQQ
            T +S ++ N  A +S  R++   NKS  R++   +K   R++   NKSQ+
Subjt:  IIKPTGRSSKKINKLADRSSNRSSEKINKSADRSSKKINKPTGRSSKKINKSQQ

A0A7E6FUY4 M protein, serotype 5-like; e-value 1.3e-04; identity 26.34%
Query:  QSDWSRQKTLKRDQQANRPIKKIN---KSAGRSSKRINKLTSRSNRSS--------SQQADPRDQQAN-RPIKKINKSAGRSSKRSTSQPTDQEDQQVSR
        +SD   +K+++  +++NR  KK N   K + R SK+ ++ +++SNR S        S+  +PR++  N R   + +K + R SK     P  +     +R
Subjt:  QSDWSRQKTLKRDQQANRPIKKIN---KSAGRSSKRINKLTSRSNRSS--------SQQADPRDQQAN-RPIKKINKSAGRSSKRSTSQPTDQEDQQVSR

Query:  LIIQEDQQANRSNRSSSQQADPRDHQINKPTDRSRRKTSQQADHPRRST--SQPTDQEDQQVCRPIIQEDQQANRSIKRINKSANRSSKSINKLTSRSNR
        ++   ++  N  +R  S +   +  + +K ++R  +K+++++    R +  S    ++  +  +   +E +++NR  K+ N+ + +SS+  +K +SR ++
Subjt:  LIIQEDQQANRSNRSSSQQADPRDHQINKPTDRSRRKTSQQADHPRRST--SQPTDQEDQQVCRPIIQEDQQANRSIKRINKSANRSSKSINKLTSRSNR

Query:  SSSQHADPTDHQVNRLIIQEDQQANRPIKKINKSAGRSSKKINKPTDRSRGSINHQANRPIKKINKSADRSSKKINKPTGRSSKKINKSA--DRSSNRSS
         SS+ +  +  + ++   +E +++NR  KK N+ + +SS++  K +  S+ S N +  R  +K  KS+ R SKK N+ + +S ++  KS+   + SNR S
Subjt:  SSSQHADPTDHQVNRLIIQEDQQANRPIKKINKSAGRSSKKINKPTDRSRGSINHQANRPIKKINKSADRSSKKINKPTGRSSKKINKSA--DRSSNRSS

Query:  EKINKSADRSSKKIIKPTGRSSKKINKLADRSSNRSSEKINKSADRSSKKINKPTGRSSKKINKSQQVDPRD
        +K N+ + +S+++  K + R SKK ++   + SNR S+K N+ + +SS++ +K + R SKK N+  +   R+
Subjt:  EKINKSADRSSKKIIKPTGRSSKKINKLADRSSNRSSEKINKSADRSSKKINKPTGRSSKKINKSQQVDPRD

SwissProt top hits (e-value, % identity, alignment)
No hits found
Arabidopsis top hits (e-value, % identity, alignment)
No hits found

Sequences
CDS sequence
ATGACGACCGTGGTGACCACCCCTGCAGGAAACTACAGTCATCAAAGTGACAGGTCTAGACAGGTGGTGAAATCACTGCAAGTGAAGCTGATGACGACCATGGTGACCAC
CCCTGCAGGAAACTACAGTCATCAAAGTGACTGGTCTAGACAGGTGGTGAAATCACTGCAAGTGAAGCTGATGACGACCGTGGTGACCACCCCTACAGGAAACTACAGTC
ATCAAAGTGACTGGTCTAGACAGAAGACTCTAAAAAGAGATCAACAAGCCAACCGACCGATCAAGAAGATCAACAAGTCAGCAGGCCGATCATCCAAGAGGATCAACAAG
CTAACAAGCCGATCCAACAGATCATCAAGCCAACAGGCTGATCCAAGAGATCAACAAGCCAACCGACCGATCAAGAAGATCAACAAGTCAGCAGGCCGATCATCCAAGAG
ATCAACAAGCCAACCGACCGATCAAGAAGATCAACAAGTCAGCAGGCTGATCATCCAAGAAGATCAACAAGCCAACCGATCGAACAGATCATCAAGCCAACAGGCCGATC
CAAGAGATCATCAAATCAACAAGCCAACCGATCGATCAAGAAGAAAAACAAGTCAGCAGGCCGATCATCCAAGAAGATCAACAAGCCAACCGACCGATCAAGAAGATCAA
CAAGTCTGCAGGCCGATCATCCAAGAAGATCAACAAGCCAACCGATCGATCAAGAGGATCAATAAGTCAGCAAACCGATCATCCAAGAGTATCAACAAGCTAACAAGTCG
ATCCAACAGATCATCAAGCCAACATGCCGATCCAACAGATCATCAAGTTAACAGACTGATCATCCAAGAAGATCAACAAGCCAACCGACCGATCAAGAAGATCAACAAGT
CAGCAGGCCGATCATCCAAGAAGATCAACAAGCCAACCGATCGATCAAGAGGATCAATAAATCATCAAGCCAATCGACCGATCAAGAAGATCAACAAGTCAGCAGACCGA
TCATCCAAGAAGATCAACAAGCCAACAGGCCGATCATCCAAGAAGATCAACAAATCAGCAGACCGATCATCAAACCGATCATCCGAGAAGATCAATAAGTCAGCAGACCG
ATCATCCAAGAAGATCATCAAGCCAACAGGCCGATCATCCAAGAAGATCAACAAGTTAGCAGATCGATCATCAAACCGATCATCCGAGAAGATCAACAAGTCAGCAGACC
GATCATCCAAGAAGATCAACAAGCCAACAGGCCGATCATCCAAGAAGATCAACAAGTCACAACAGGTCGATCCAAGAGATCATTAA
mRNA sequence
ATGACGACCGTGGTGACCACCCCTGCAGGAAACTACAGTCATCAAAGTGACAGGTCTAGACAGGTGGTGAAATCACTGCAAGTGAAGCTGATGACGACCATGGTGACCAC
CCCTGCAGGAAACTACAGTCATCAAAGTGACTGGTCTAGACAGGTGGTGAAATCACTGCAAGTGAAGCTGATGACGACCGTGGTGACCACCCCTACAGGAAACTACAGTC
ATCAAAGTGACTGGTCTAGACAGAAGACTCTAAAAAGAGATCAACAAGCCAACCGACCGATCAAGAAGATCAACAAGTCAGCAGGCCGATCATCCAAGAGGATCAACAAG
CTAACAAGCCGATCCAACAGATCATCAAGCCAACAGGCTGATCCAAGAGATCAACAAGCCAACCGACCGATCAAGAAGATCAACAAGTCAGCAGGCCGATCATCCAAGAG
ATCAACAAGCCAACCGACCGATCAAGAAGATCAACAAGTCAGCAGGCTGATCATCCAAGAAGATCAACAAGCCAACCGATCGAACAGATCATCAAGCCAACAGGCCGATC
CAAGAGATCATCAAATCAACAAGCCAACCGATCGATCAAGAAGAAAAACAAGTCAGCAGGCCGATCATCCAAGAAGATCAACAAGCCAACCGACCGATCAAGAAGATCAA
CAAGTCTGCAGGCCGATCATCCAAGAAGATCAACAAGCCAACCGATCGATCAAGAGGATCAATAAGTCAGCAAACCGATCATCCAAGAGTATCAACAAGCTAACAAGTCG
ATCCAACAGATCATCAAGCCAACATGCCGATCCAACAGATCATCAAGTTAACAGACTGATCATCCAAGAAGATCAACAAGCCAACCGACCGATCAAGAAGATCAACAAGT
CAGCAGGCCGATCATCCAAGAAGATCAACAAGCCAACCGATCGATCAAGAGGATCAATAAATCATCAAGCCAATCGACCGATCAAGAAGATCAACAAGTCAGCAGACCGA
TCATCCAAGAAGATCAACAAGCCAACAGGCCGATCATCCAAGAAGATCAACAAATCAGCAGACCGATCATCAAACCGATCATCCGAGAAGATCAATAAGTCAGCAGACCG
ATCATCCAAGAAGATCATCAAGCCAACAGGCCGATCATCCAAGAAGATCAACAAGTTAGCAGATCGATCATCAAACCGATCATCCGAGAAGATCAACAAGTCAGCAGACC
GATCATCCAAGAAGATCAACAAGCCAACAGGCCGATCATCCAAGAAGATCAACAAGTCACAACAGGTCGATCCAAGAGATCATTAA
Protein sequence
MTTVVTTPAGNYSHQSDRSRQVVKSLQVKLMTTMVTTPAGNYSHQSDWSRQVVKSLQVKLMTTVVTTPTGNYSHQSDWSRQKTLKRDQQANRPIKKINKSAGRSSKRINK
LTSRSNRSSSQQADPRDQQANRPIKKINKSAGRSSKRSTSQPTDQEDQQVSRLIIQEDQQANRSNRSSSQQADPRDHQINKPTDRSRRKTSQQADHPRRSTSQPTDQEDQ
QVCRPIIQEDQQANRSIKRINKSANRSSKSINKLTSRSNRSSSQHADPTDHQVNRLIIQEDQQANRPIKKINKSAGRSSKKINKPTDRSRGSINHQANRPIKKINKSADR
SSKKINKPTGRSSKKINKSADRSSNRSSEKINKSADRSSKKIIKPTGRSSKKINKLADRSSNRSSEKINKSADRSSKKINKPTGRSSKKINKSQQVDPRDH