; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0004161 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0004161
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionDUF4283 domain-containing protein
Genome locationchr6:1495461..1497530
RNA-Seq ExpressionLag0004161
SyntenyLag0004161
Gene Ontology termsNA
InterPro domainsIPR025558 - Domain of unknown function DUF4283


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0039967.1 hypothetical protein E6C27_scaffold122G002490 [Cucumis melo var. makuwa]9.3e-4733.74Show/hide
Query:  VEKKMFSIEIDPKNPNLYRL-HEATFDRRFSLSLSLPTLTWIKECLSRLCDLPLNQKFFSEKRIEDIVIWVEKTTTKKGHAAEIAKLGSNGGLNKIIVPV
        +EKK F + +D ++ +   L  E    + FS++++L +L W+K     L D P   +FF EKR E+  +WV+KT  +KG+ AEI ++   G    I+VP 
Subjt:  VEKKMFSIEIDPKNPNLYRL-HEATFDRRFSLSLSLPTLTWIKECLSRLCDLPLNQKFFSEKRIEDIVIWVEKTTTKKGHAAEIAKLGSNGGLNKIIVPV

Query:  GEDRKGWRSLISLINSLYTNHNAVAPHPIYAAKGAASYKEAFKKRQVDPLPSHPANTPALTISPPDNIDSSPPMAALYLSSTVIIQRKFFHDCWHDIMRA
        G ++                  AV        KG++S +E+   R V+   +  +NT                + +     TV++ R+FFHD W  I+  
Subjt:  GEDRKGWRSLISLINSLYTNHNAVAPHPIYAAKGAASYKEAFKKRQVDPLPSHPANTPALTISPPDNIDSSPPMAALYLSSTVIIQRKFFHDCWHDIMRA

Query:  LQQELSAFASISPLQPDKALLACVDEEQARVLANIKGWYRVGKFLVRFMPWSVEAVAHNQKVPSYGGWIKVRNLPLDKWSLDVFKQIGDVCGGYLETTNK
        L ++L       P + DKAL+   +EEQA++L   KGW  VG+F V+F  WS +  A  + +PSYGGWIKVR +PL  W+L+ F QIGD CGG++E   +
Subjt:  LQQELSAFASISPLQPDKALLACVDEEQARVLANIKGWYRVGKFLVRFMPWSVEAVAHNQKVPSYGGWIKVRNLPLDKWSLDVFKQIGDVCGGYLETTNK

Query:  TLSRMDMMDIGLKVKSNHNGFLPAEV
        T    D+ +  +K+K N++GF+PA +
Subjt:  TLSRMDMMDIGLKVKSNHNGFLPAEV

KAA0040039.1 hypothetical protein E6C27_scaffold366G00060 [Cucumis melo var. makuwa]1.1e-5033.23Show/hide
Query:  VEKKMFSIEIDPKNPNLYRL-HEATFDRRFSLSLSLPTLTWIKECLSRLCDLPLNQKFFSEKRIEDIVIWVEKTTTKKGHAAEIAKLGSNGGLNKIIVPV
        +EKK F + +D  + +   L  E    + FS++++L +L W+K     L D P   +FF EKR E+  +WV+KT  +KG+ AEI ++   G    I+VP 
Subjt:  VEKKMFSIEIDPKNPNLYRL-HEATFDRRFSLSLSLPTLTWIKECLSRLCDLPLNQKFFSEKRIEDIVIWVEKTTTKKGHAAEIAKLGSNGGLNKIIVPV

Query:  GEDRKGWRSLISLIN-----SLYTNHNAVAPHPIYAAKGAASYKEAFKKRQVDPLPSHPANTPALTISPPDNIDSSPPMAALY---LSSTVIIQRKFFHD
        G ++ GW   +SL++     S  TN+  +    +     ++   E  K+R         +++   + S  +NI  +      +      T ++ R++FHD
Subjt:  GEDRKGWRSLISLIN-----SLYTNHNAVAPHPIYAAKGAASYKEAFKKRQVDPLPSHPANTPALTISPPDNIDSSPPMAALY---LSSTVIIQRKFFHD

Query:  CWHDIMRALQQELSAFASISPLQPDKALLACVDEEQARVLANIKGWYRVGKFLVRFMPWSVEAVAHNQKVPSYGGWIKVRNLPLDKWSLDVFKQIGDVCG
         W  I+  L ++L       P   DKAL+   +EEQA++L   KGW  VG+F V+F  WS +  A  + +PSYGGWIKVR +PL  W+L+ F QIGD CG
Subjt:  CWHDIMRALQQELSAFASISPLQPDKALLACVDEEQARVLANIKGWYRVGKFLVRFMPWSVEAVAHNQKVPSYGGWIKVRNLPLDKWSLDVFKQIGDVCG

Query:  GYLETTNKTLSRMDMMDIGLKVKSNHNGFLPAEV
        G++E   +T    D+ +  +K+K N+ GF+PA +
Subjt:  GYLETTNKTLSRMDMMDIGLKVKSNHNGFLPAEV

KAA0050054.1 hypothetical protein E6C27_scaffold675G00340 [Cucumis melo var. makuwa]2.8e-5134.04Show/hide
Query:  VEKKMFSIEIDPKNPNLYRL-HEATFDRRFSLSLSLPTLTWIKECLSRLCDLPLNQKFFSEKRIEDIVIWVEKTTTKKGHAAEIAKLGSNGGLNKIIVPV
        +EKK F   +D ++ +L  L  E    + FS++++L +L W+K     L D     +FF EKR ED  +WV+KT  +KG+ AEI ++   G    I+VP 
Subjt:  VEKKMFSIEIDPKNPNLYRL-HEATFDRRFSLSLSLPTLTWIKECLSRLCDLPLNQKFFSEKRIEDIVIWVEKTTTKKGHAAEIAKLGSNGGLNKIIVPV

Query:  GEDRKGWRSLISLI-----NSLYTNHNAVAPHPIYAAKGAASYKEAFKKRQVDPLPSHPANTPALTISPPDNI---DSSPPMAALYLSSTVIIQRKFFHD
        G ++ GW   +SL+     +S  TN+  +    +     ++   +  K+R         +++   T S   NI     S    +     TV++ R+FFHD
Subjt:  GEDRKGWRSLISLI-----NSLYTNHNAVAPHPIYAAKGAASYKEAFKKRQVDPLPSHPANTPALTISPPDNI---DSSPPMAALYLSSTVIIQRKFFHD

Query:  CWHDIMRALQQELSAFASISPLQPDKALLACVDEEQARVLANIKGWYRVGKFLVRFMPWSVEAVAHNQKVPSYGGWIKVRNLPLDKWSLDVFKQIGDVCG
         W  I+  L ++L       P   DKAL+   +EEQA ++   KGW  VG+F V+F  W+ +A A  + +PSYGGWIKVR +PL  W+L+ F QIGD CG
Subjt:  CWHDIMRALQQELSAFASISPLQPDKALLACVDEEQARVLANIKGWYRVGKFLVRFMPWSVEAVAHNQKVPSYGGWIKVRNLPLDKWSLDVFKQIGDVCG

Query:  GYLETTNKTLSRMDMMDIGLKVKSNHNGFLPA
        G++E   +T    D+++  +++K N++GF+PA
Subjt:  GYLETTNKTLSRMDMMDIGLKVKSNHNGFLPA

TYK10355.1 hypothetical protein E5676_scaffold367G00330 [Cucumis melo var. makuwa]1.5e-5234.34Show/hide
Query:  VEKKMFSIEIDPKNPNLYRL-HEATFDRRFSLSLSLPTLTWIKECLSRLCDLPLNQKFFSEKRIEDIVIWVEKTTTKKGHAAEIAKLGSNGGLNKIIVPV
        +EKK F + +D ++ +L  L  E    + FS++++L +L W+K     L D P   +FF EKR ED  +WV+KT  +KG+ AEI ++   G    I+VP 
Subjt:  VEKKMFSIEIDPKNPNLYRL-HEATFDRRFSLSLSLPTLTWIKECLSRLCDLPLNQKFFSEKRIEDIVIWVEKTTTKKGHAAEIAKLGSNGGLNKIIVPV

Query:  GEDRKGWRSLISLI-----NSLYTNHNAVAPHPIYAAKGAASYKEAFKKRQVDPLPSHPANTPALTISPPDNI---DSSPPMAALYLSSTVIIQRKFFHD
        G ++ GW   +SL+     +S  TN+  +    +     ++   +  K+R         +++   T S   NI     S    +     TV++ R+FFHD
Subjt:  GEDRKGWRSLISLI-----NSLYTNHNAVAPHPIYAAKGAASYKEAFKKRQVDPLPSHPANTPALTISPPDNI---DSSPPMAALYLSSTVIIQRKFFHD

Query:  CWHDIMRALQQELSAFASISPLQPDKALLACVDEEQARVLANIKGWYRVGKFLVRFMPWSVEAVAHNQKVPSYGGWIKVRNLPLDKWSLDVFKQIGDVCG
         W  I+  L ++L       P   DKAL+   +EEQA ++   KGW  VG+F V+F  W+ +A A  + +PSYGGWIKVR +PL  W+L+ F QIGD CG
Subjt:  CWHDIMRALQQELSAFASISPLQPDKALLACVDEEQARVLANIKGWYRVGKFLVRFMPWSVEAVAHNQKVPSYGGWIKVRNLPLDKWSLDVFKQIGDVCG

Query:  GYLETTNKTLSRMDMMDIGLKVKSNHNGFLPA
        G++E   +T    D+++  +++K N++GF+PA
Subjt:  GYLETTNKTLSRMDMMDIGLKVKSNHNGFLPA

TYK24535.1 hypothetical protein E5676_scaffold266G00770 [Cucumis melo var. makuwa]1.2e-4633.74Show/hide
Query:  VEKKMFSIEIDPKNPNLYRL-HEATFDRRFSLSLSLPTLTWIKECLSRLCDLPLNQKFFSEKRIEDIVIWVEKTTTKKGHAAEIAKLGSNGGLNKIIVPV
        +EKK F + +D ++ +   L  E    + FS++++L +L W+K     L D P   +FF EKR E+  +WV+KT  +KG+ AEI ++   G    I+VP 
Subjt:  VEKKMFSIEIDPKNPNLYRL-HEATFDRRFSLSLSLPTLTWIKECLSRLCDLPLNQKFFSEKRIEDIVIWVEKTTTKKGHAAEIAKLGSNGGLNKIIVPV

Query:  GEDRKGWRSLISLINSLYTNHNAVAPHPIYAAKGAASYKEAFKKRQVDPLPSHPANTPALTISPPDNIDSSPPMAALYLSSTVIIQRKFFHDCWHDIMRA
        G ++                  AV        KG++S +E+   R V+   +  +NT                + +     TV++ R+FFHD W  I+  
Subjt:  GEDRKGWRSLISLINSLYTNHNAVAPHPIYAAKGAASYKEAFKKRQVDPLPSHPANTPALTISPPDNIDSSPPMAALYLSSTVIIQRKFFHDCWHDIMRA

Query:  LQQELSAFASISPLQPDKALLACVDEEQARVLANIKGWYRVGKFLVRFMPWSVEAVAHNQKVPSYGGWIKVRNLPLDKWSLDVFKQIGDVCGGYLETTNK
        L ++L       P   DKAL+   +EEQA++L   KGW  VG+F V+F  WS +  A  + +PSYGGWIKVR +PL  W+L+ F QIGD CGG++E   +
Subjt:  LQQELSAFASISPLQPDKALLACVDEEQARVLANIKGWYRVGKFLVRFMPWSVEAVAHNQKVPSYGGWIKVRNLPLDKWSLDVFKQIGDVCGGYLETTNK

Query:  TLSRMDMMDIGLKVKSNHNGFLPAEV
        T    D+ +  +K+K N++GF+PA +
Subjt:  TLSRMDMMDIGLKVKSNHNGFLPAEV

TrEMBL top hitse value%identityAlignment
A0A5A7TEK8 DUF4283 domain-containing protein4.5e-4733.74Show/hide
Query:  VEKKMFSIEIDPKNPNLYRL-HEATFDRRFSLSLSLPTLTWIKECLSRLCDLPLNQKFFSEKRIEDIVIWVEKTTTKKGHAAEIAKLGSNGGLNKIIVPV
        +EKK F + +D ++ +   L  E    + FS++++L +L W+K     L D P   +FF EKR E+  +WV+KT  +KG+ AEI ++   G    I+VP 
Subjt:  VEKKMFSIEIDPKNPNLYRL-HEATFDRRFSLSLSLPTLTWIKECLSRLCDLPLNQKFFSEKRIEDIVIWVEKTTTKKGHAAEIAKLGSNGGLNKIIVPV

Query:  GEDRKGWRSLISLINSLYTNHNAVAPHPIYAAKGAASYKEAFKKRQVDPLPSHPANTPALTISPPDNIDSSPPMAALYLSSTVIIQRKFFHDCWHDIMRA
        G ++                  AV        KG++S +E+   R V+   +  +NT                + +     TV++ R+FFHD W  I+  
Subjt:  GEDRKGWRSLISLINSLYTNHNAVAPHPIYAAKGAASYKEAFKKRQVDPLPSHPANTPALTISPPDNIDSSPPMAALYLSSTVIIQRKFFHDCWHDIMRA

Query:  LQQELSAFASISPLQPDKALLACVDEEQARVLANIKGWYRVGKFLVRFMPWSVEAVAHNQKVPSYGGWIKVRNLPLDKWSLDVFKQIGDVCGGYLETTNK
        L ++L       P + DKAL+   +EEQA++L   KGW  VG+F V+F  WS +  A  + +PSYGGWIKVR +PL  W+L+ F QIGD CGG++E   +
Subjt:  LQQELSAFASISPLQPDKALLACVDEEQARVLANIKGWYRVGKFLVRFMPWSVEAVAHNQKVPSYGGWIKVRNLPLDKWSLDVFKQIGDVCGGYLETTNK

Query:  TLSRMDMMDIGLKVKSNHNGFLPAEV
        T    D+ +  +K+K N++GF+PA +
Subjt:  TLSRMDMMDIGLKVKSNHNGFLPAEV

A0A5A7TFK7 DUF4283 domain-containing protein5.1e-5133.23Show/hide
Query:  VEKKMFSIEIDPKNPNLYRL-HEATFDRRFSLSLSLPTLTWIKECLSRLCDLPLNQKFFSEKRIEDIVIWVEKTTTKKGHAAEIAKLGSNGGLNKIIVPV
        +EKK F + +D  + +   L  E    + FS++++L +L W+K     L D P   +FF EKR E+  +WV+KT  +KG+ AEI ++   G    I+VP 
Subjt:  VEKKMFSIEIDPKNPNLYRL-HEATFDRRFSLSLSLPTLTWIKECLSRLCDLPLNQKFFSEKRIEDIVIWVEKTTTKKGHAAEIAKLGSNGGLNKIIVPV

Query:  GEDRKGWRSLISLIN-----SLYTNHNAVAPHPIYAAKGAASYKEAFKKRQVDPLPSHPANTPALTISPPDNIDSSPPMAALY---LSSTVIIQRKFFHD
        G ++ GW   +SL++     S  TN+  +    +     ++   E  K+R         +++   + S  +NI  +      +      T ++ R++FHD
Subjt:  GEDRKGWRSLISLIN-----SLYTNHNAVAPHPIYAAKGAASYKEAFKKRQVDPLPSHPANTPALTISPPDNIDSSPPMAALY---LSSTVIIQRKFFHD

Query:  CWHDIMRALQQELSAFASISPLQPDKALLACVDEEQARVLANIKGWYRVGKFLVRFMPWSVEAVAHNQKVPSYGGWIKVRNLPLDKWSLDVFKQIGDVCG
         W  I+  L ++L       P   DKAL+   +EEQA++L   KGW  VG+F V+F  WS +  A  + +PSYGGWIKVR +PL  W+L+ F QIGD CG
Subjt:  CWHDIMRALQQELSAFASISPLQPDKALLACVDEEQARVLANIKGWYRVGKFLVRFMPWSVEAVAHNQKVPSYGGWIKVRNLPLDKWSLDVFKQIGDVCG

Query:  GYLETTNKTLSRMDMMDIGLKVKSNHNGFLPAEV
        G++E   +T    D+ +  +K+K N+ GF+PA +
Subjt:  GYLETTNKTLSRMDMMDIGLKVKSNHNGFLPAEV

A0A5A7U495 DUF4283 domain-containing protein1.4e-5134.04Show/hide
Query:  VEKKMFSIEIDPKNPNLYRL-HEATFDRRFSLSLSLPTLTWIKECLSRLCDLPLNQKFFSEKRIEDIVIWVEKTTTKKGHAAEIAKLGSNGGLNKIIVPV
        +EKK F   +D ++ +L  L  E    + FS++++L +L W+K     L D     +FF EKR ED  +WV+KT  +KG+ AEI ++   G    I+VP 
Subjt:  VEKKMFSIEIDPKNPNLYRL-HEATFDRRFSLSLSLPTLTWIKECLSRLCDLPLNQKFFSEKRIEDIVIWVEKTTTKKGHAAEIAKLGSNGGLNKIIVPV

Query:  GEDRKGWRSLISLI-----NSLYTNHNAVAPHPIYAAKGAASYKEAFKKRQVDPLPSHPANTPALTISPPDNI---DSSPPMAALYLSSTVIIQRKFFHD
        G ++ GW   +SL+     +S  TN+  +    +     ++   +  K+R         +++   T S   NI     S    +     TV++ R+FFHD
Subjt:  GEDRKGWRSLISLI-----NSLYTNHNAVAPHPIYAAKGAASYKEAFKKRQVDPLPSHPANTPALTISPPDNI---DSSPPMAALYLSSTVIIQRKFFHD

Query:  CWHDIMRALQQELSAFASISPLQPDKALLACVDEEQARVLANIKGWYRVGKFLVRFMPWSVEAVAHNQKVPSYGGWIKVRNLPLDKWSLDVFKQIGDVCG
         W  I+  L ++L       P   DKAL+   +EEQA ++   KGW  VG+F V+F  W+ +A A  + +PSYGGWIKVR +PL  W+L+ F QIGD CG
Subjt:  CWHDIMRALQQELSAFASISPLQPDKALLACVDEEQARVLANIKGWYRVGKFLVRFMPWSVEAVAHNQKVPSYGGWIKVRNLPLDKWSLDVFKQIGDVCG

Query:  GYLETTNKTLSRMDMMDIGLKVKSNHNGFLPA
        G++E   +T    D+++  +++K N++GF+PA
Subjt:  GYLETTNKTLSRMDMMDIGLKVKSNHNGFLPA

A0A5D3CFS8 DUF4283 domain-containing protein7.2e-5334.34Show/hide
Query:  VEKKMFSIEIDPKNPNLYRL-HEATFDRRFSLSLSLPTLTWIKECLSRLCDLPLNQKFFSEKRIEDIVIWVEKTTTKKGHAAEIAKLGSNGGLNKIIVPV
        +EKK F + +D ++ +L  L  E    + FS++++L +L W+K     L D P   +FF EKR ED  +WV+KT  +KG+ AEI ++   G    I+VP 
Subjt:  VEKKMFSIEIDPKNPNLYRL-HEATFDRRFSLSLSLPTLTWIKECLSRLCDLPLNQKFFSEKRIEDIVIWVEKTTTKKGHAAEIAKLGSNGGLNKIIVPV

Query:  GEDRKGWRSLISLI-----NSLYTNHNAVAPHPIYAAKGAASYKEAFKKRQVDPLPSHPANTPALTISPPDNI---DSSPPMAALYLSSTVIIQRKFFHD
        G ++ GW   +SL+     +S  TN+  +    +     ++   +  K+R         +++   T S   NI     S    +     TV++ R+FFHD
Subjt:  GEDRKGWRSLISLI-----NSLYTNHNAVAPHPIYAAKGAASYKEAFKKRQVDPLPSHPANTPALTISPPDNI---DSSPPMAALYLSSTVIIQRKFFHD

Query:  CWHDIMRALQQELSAFASISPLQPDKALLACVDEEQARVLANIKGWYRVGKFLVRFMPWSVEAVAHNQKVPSYGGWIKVRNLPLDKWSLDVFKQIGDVCG
         W  I+  L ++L       P   DKAL+   +EEQA ++   KGW  VG+F V+F  W+ +A A  + +PSYGGWIKVR +PL  W+L+ F QIGD CG
Subjt:  CWHDIMRALQQELSAFASISPLQPDKALLACVDEEQARVLANIKGWYRVGKFLVRFMPWSVEAVAHNQKVPSYGGWIKVRNLPLDKWSLDVFKQIGDVCG

Query:  GYLETTNKTLSRMDMMDIGLKVKSNHNGFLPA
        G++E   +T    D+++  +++K N++GF+PA
Subjt:  GYLETTNKTLSRMDMMDIGLKVKSNHNGFLPA

A0A5D3DLP0 DUF4283 domain-containing protein5.9e-4733.74Show/hide
Query:  VEKKMFSIEIDPKNPNLYRL-HEATFDRRFSLSLSLPTLTWIKECLSRLCDLPLNQKFFSEKRIEDIVIWVEKTTTKKGHAAEIAKLGSNGGLNKIIVPV
        +EKK F + +D ++ +   L  E    + FS++++L +L W+K     L D P   +FF EKR E+  +WV+KT  +KG+ AEI ++   G    I+VP 
Subjt:  VEKKMFSIEIDPKNPNLYRL-HEATFDRRFSLSLSLPTLTWIKECLSRLCDLPLNQKFFSEKRIEDIVIWVEKTTTKKGHAAEIAKLGSNGGLNKIIVPV

Query:  GEDRKGWRSLISLINSLYTNHNAVAPHPIYAAKGAASYKEAFKKRQVDPLPSHPANTPALTISPPDNIDSSPPMAALYLSSTVIIQRKFFHDCWHDIMRA
        G ++                  AV        KG++S +E+   R V+   +  +NT                + +     TV++ R+FFHD W  I+  
Subjt:  GEDRKGWRSLISLINSLYTNHNAVAPHPIYAAKGAASYKEAFKKRQVDPLPSHPANTPALTISPPDNIDSSPPMAALYLSSTVIIQRKFFHDCWHDIMRA

Query:  LQQELSAFASISPLQPDKALLACVDEEQARVLANIKGWYRVGKFLVRFMPWSVEAVAHNQKVPSYGGWIKVRNLPLDKWSLDVFKQIGDVCGGYLETTNK
        L ++L       P   DKAL+   +EEQA++L   KGW  VG+F V+F  WS +  A  + +PSYGGWIKVR +PL  W+L+ F QIGD CGG++E   +
Subjt:  LQQELSAFASISPLQPDKALLACVDEEQARVLANIKGWYRVGKFLVRFMPWSVEAVAHNQKVPSYGGWIKVRNLPLDKWSLDVFKQIGDVCGGYLETTNK

Query:  TLSRMDMMDIGLKVKSNHNGFLPAEV
        T    D+ +  +K+K N++GF+PA +
Subjt:  TLSRMDMMDIGLKVKSNHNGFLPAEV

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGATACCAGCCAACCGCCGCCCAACCCATCAGAGTCGAAAAGAAGATGTTCTCCATTGAAATCGACCCGAAGAACCCCAACCTCTATCGCCTTCATGAAGCCACCTT
CGATAGAAGGTTTTCCTTATCTTTATCCCTACCGACACTCACATGGATTAAAGAGTGCTTATCCCGTCTATGTGATCTCCCCCTGAACCAAAAGTTTTTCAGCGAGAAGA
GAATAGAAGACATTGTCATCTGGGTAGAAAAGACTACGACCAAGAAAGGCCATGCAGCTGAAATAGCAAAACTTGGATCAAATGGCGGGCTAAATAAGATTATTGTTCCT
GTTGGAGAAGACAGAAAGGGTTGGCGCAGCCTTATTAGTCTCATTAACTCCCTCTACACCAACCACAATGCCGTCGCCCCTCATCCCATTTATGCAGCCAAAGGAGCTGC
CTCATATAAAGAGGCTTTTAAGAAGAGACAGGTTGACCCCCTCCCTTCCCATCCTGCCAATACCCCAGCCCTGACTATATCTCCCCCAGACAACATCGACTCTTCCCCAC
CAATGGCAGCCTTATATTTATCTTCGACAGTCATCATACAACGCAAATTCTTCCATGATTGTTGGCACGACATCATGAGAGCCCTTCAACAAGAACTATCGGCTTTTGCC
TCTATCAGCCCCTTACAACCGGACAAAGCTCTGCTAGCGTGTGTAGATGAAGAACAGGCGCGGGTCCTTGCCAATATTAAAGGTTGGTATAGAGTGGGAAAGTTCCTCGT
CAGATTCATGCCATGGAGTGTCGAAGCAGTGGCACACAATCAAAAGGTTCCCTCATATGGAGGCTGGATAAAGGTGCGCAACCTCCCACTTGATAAATGGTCCTTAGATG
TCTTCAAGCAGATTGGAGACGTTTGTGGAGGATACCTCGAAACAACCAACAAAACCTTATCCCGAATGGATATGATGGATATAGGGTTGAAGGTCAAGTCCAACCACAAT
GGTTTCTTACCCGCCGAAGTATACATACCATCTCCTTCCAACAACCCTATAAAGGTGTCTATCGATCCCTTTTTCATGGAAGAATACTCCATTGGCTACATAGCAGGCAT
CCACGGTAAAATCCCGATCGAAACAATGGCTCAGGGAGCCCCACGCGCCGGCAACGCCTCCGATGACATAGAAAAGAGGCCCCCCTTATGTCCACGCGCCGCACAGGAAT
TGGTGAACGAAATAGGGGATTATCCCCAATCAGAGTACAGATTACCTGCTCGAGATGTCACTGTCACAGCCTCCTTCGCAGAAGTATTGCAGACGCCATCAGAGAAGGGC
CCACCTATAGCAGCCACCCACGTGACTCCTCCATACCTTATCAGCATCCCCACACCCCAAACCAAATACGACCGAAGCCCAAGCCCAAACCCAAATCCCCAATCTAATCC
ATGCCCCATTGACCACCAGTCAACAAATCCTACCCCTGCCAATCTGCCCCCAAAAATCCCCGACCAGACCCATCATATTCCAAAGCCCACAATAAGCCCAGCCCCAAAAA
GCCCTAGTCCTGCCAAGATACAGCTTGGGGGTAAAAAGCCGATTACCATCAACAACCAGGAAACTTTCCTCCTGACGAGTACTATGCACTCTACCCTCACTGATTTTCAG
TATACTGAATCCGAAGGAGATTTCTCTTCCCCCTGCTCCCCAAACACGAATGTTTCTCCTCTCATGCCCCATTCCTCAAACAACCAGCATGTGGCATCTCCACCAACTAT
ATCCCACCTTTTCGACCAATCTGCAGCACAACCCCCACCTCTCGAGAATCCCATTCCTTTGCGAATGGAGAAGCCACCAGACACAGTCTCCTCGTTAATCCTATGTAAAG
AGACATCAGAATTGATTGATATTGATGATGAAGAGACGAAAGAGGCTGAGGAAGACCCTAATCCCTCTTATCATCCAGAGACCCCTAAACTTATCTCCCCATTATTTTTC
CATGGCTGGCTGAGCATGGAATGTGCATCATGCCTATGCCTAACAGACAGAAGCTATCCAACACTGCGAAAAAGAAGAAGAAATGGGTGA
mRNA sequenceShow/hide mRNA sequence
ATGAGATACCAGCCAACCGCCGCCCAACCCATCAGAGTCGAAAAGAAGATGTTCTCCATTGAAATCGACCCGAAGAACCCCAACCTCTATCGCCTTCATGAAGCCACCTT
CGATAGAAGGTTTTCCTTATCTTTATCCCTACCGACACTCACATGGATTAAAGAGTGCTTATCCCGTCTATGTGATCTCCCCCTGAACCAAAAGTTTTTCAGCGAGAAGA
GAATAGAAGACATTGTCATCTGGGTAGAAAAGACTACGACCAAGAAAGGCCATGCAGCTGAAATAGCAAAACTTGGATCAAATGGCGGGCTAAATAAGATTATTGTTCCT
GTTGGAGAAGACAGAAAGGGTTGGCGCAGCCTTATTAGTCTCATTAACTCCCTCTACACCAACCACAATGCCGTCGCCCCTCATCCCATTTATGCAGCCAAAGGAGCTGC
CTCATATAAAGAGGCTTTTAAGAAGAGACAGGTTGACCCCCTCCCTTCCCATCCTGCCAATACCCCAGCCCTGACTATATCTCCCCCAGACAACATCGACTCTTCCCCAC
CAATGGCAGCCTTATATTTATCTTCGACAGTCATCATACAACGCAAATTCTTCCATGATTGTTGGCACGACATCATGAGAGCCCTTCAACAAGAACTATCGGCTTTTGCC
TCTATCAGCCCCTTACAACCGGACAAAGCTCTGCTAGCGTGTGTAGATGAAGAACAGGCGCGGGTCCTTGCCAATATTAAAGGTTGGTATAGAGTGGGAAAGTTCCTCGT
CAGATTCATGCCATGGAGTGTCGAAGCAGTGGCACACAATCAAAAGGTTCCCTCATATGGAGGCTGGATAAAGGTGCGCAACCTCCCACTTGATAAATGGTCCTTAGATG
TCTTCAAGCAGATTGGAGACGTTTGTGGAGGATACCTCGAAACAACCAACAAAACCTTATCCCGAATGGATATGATGGATATAGGGTTGAAGGTCAAGTCCAACCACAAT
GGTTTCTTACCCGCCGAAGTATACATACCATCTCCTTCCAACAACCCTATAAAGGTGTCTATCGATCCCTTTTTCATGGAAGAATACTCCATTGGCTACATAGCAGGCAT
CCACGGTAAAATCCCGATCGAAACAATGGCTCAGGGAGCCCCACGCGCCGGCAACGCCTCCGATGACATAGAAAAGAGGCCCCCCTTATGTCCACGCGCCGCACAGGAAT
TGGTGAACGAAATAGGGGATTATCCCCAATCAGAGTACAGATTACCTGCTCGAGATGTCACTGTCACAGCCTCCTTCGCAGAAGTATTGCAGACGCCATCAGAGAAGGGC
CCACCTATAGCAGCCACCCACGTGACTCCTCCATACCTTATCAGCATCCCCACACCCCAAACCAAATACGACCGAAGCCCAAGCCCAAACCCAAATCCCCAATCTAATCC
ATGCCCCATTGACCACCAGTCAACAAATCCTACCCCTGCCAATCTGCCCCCAAAAATCCCCGACCAGACCCATCATATTCCAAAGCCCACAATAAGCCCAGCCCCAAAAA
GCCCTAGTCCTGCCAAGATACAGCTTGGGGGTAAAAAGCCGATTACCATCAACAACCAGGAAACTTTCCTCCTGACGAGTACTATGCACTCTACCCTCACTGATTTTCAG
TATACTGAATCCGAAGGAGATTTCTCTTCCCCCTGCTCCCCAAACACGAATGTTTCTCCTCTCATGCCCCATTCCTCAAACAACCAGCATGTGGCATCTCCACCAACTAT
ATCCCACCTTTTCGACCAATCTGCAGCACAACCCCCACCTCTCGAGAATCCCATTCCTTTGCGAATGGAGAAGCCACCAGACACAGTCTCCTCGTTAATCCTATGTAAAG
AGACATCAGAATTGATTGATATTGATGATGAAGAGACGAAAGAGGCTGAGGAAGACCCTAATCCCTCTTATCATCCAGAGACCCCTAAACTTATCTCCCCATTATTTTTC
CATGGCTGGCTGAGCATGGAATGTGCATCATGCCTATGCCTAACAGACAGAAGCTATCCAACACTGCGAAAAAGAAGAAGAAATGGGTGA
Protein sequenceShow/hide protein sequence
MRYQPTAAQPIRVEKKMFSIEIDPKNPNLYRLHEATFDRRFSLSLSLPTLTWIKECLSRLCDLPLNQKFFSEKRIEDIVIWVEKTTTKKGHAAEIAKLGSNGGLNKIIVP
VGEDRKGWRSLISLINSLYTNHNAVAPHPIYAAKGAASYKEAFKKRQVDPLPSHPANTPALTISPPDNIDSSPPMAALYLSSTVIIQRKFFHDCWHDIMRALQQELSAFA
SISPLQPDKALLACVDEEQARVLANIKGWYRVGKFLVRFMPWSVEAVAHNQKVPSYGGWIKVRNLPLDKWSLDVFKQIGDVCGGYLETTNKTLSRMDMMDIGLKVKSNHN
GFLPAEVYIPSPSNNPIKVSIDPFFMEEYSIGYIAGIHGKIPIETMAQGAPRAGNASDDIEKRPPLCPRAAQELVNEIGDYPQSEYRLPARDVTVTASFAEVLQTPSEKG
PPIAATHVTPPYLISIPTPQTKYDRSPSPNPNPQSNPCPIDHQSTNPTPANLPPKIPDQTHHIPKPTISPAPKSPSPAKIQLGGKKPITINNQETFLLTSTMHSTLTDFQ
YTESEGDFSSPCSPNTNVSPLMPHSSNNQHVASPPTISHLFDQSAAQPPPLENPIPLRMEKPPDTVSSLILCKETSELIDIDDEETKEAEEDPNPSYHPETPKLISPLFF
HGWLSMECASCLCLTDRSYPTLRKRRRNG