; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lsi11G015270 (gene) of Bottle gourd (USVL1VR-Ls) v1 genome

Gene IDLsi11G015270
OrganismLagenaria siceraria USVL1VR-Ls (Bottle gourd (USVL1VR-Ls) v1)
DescriptionSerine/arginine repetitive matrix protein 1-like
Genome locationchr11:23670683..23671808
RNA-Seq ExpressionLsi11G015270
SyntenyLsi11G015270
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0031591.1 serine/arginine repetitive matrix protein 1-like [Cucumis melo var. makuwa]2.1e-14285.19Show/hide
Query:  MPRIKHSIDSQYLLHWYWVSSHFTANSATAPLSPLSVSGGALDRGQPHILDKNHRSAKPPSPGWFDTDVENIPGNGMPEEETAKEVLSETPIAKPCSVQQ
        MPRIKH+IDSQ+LLHWYW SSHFTANSA       SVS  ALDRGQPH LDKN RSAKPPSPGWFDTDVEN+P N  PEEET KEVLSETPIAKPCSV+Q
Subjt:  MPRIKHSIDSQYLLHWYWVSSHFTANSATAPLSPLSVSGGALDRGQPHILDKNHRSAKPPSPGWFDTDVENIPGNGMPEEETAKEVLSETPIAKPCSVQQ

Query:  TAPKNKSPELKVKSSEMDGSLGKGEESIVSVSETSQVTEWCGNMSESVSMATTISEQKEGDEASSKQSREIGRNTRPKIRRKRPYSGDPSYRREQRDKCA
        T+PK K PE KVK+S MDGSL KGEESIVSVSETSQVTEWC NMSESVSMATTISEQ+EGDEASSK SREIGRN +PKIRRKRP SGD SYRREQRDKCA
Subjt:  TAPKNKSPELKVKSSEMDGSLGKGEESIVSVSETSQVTEWCGNMSESVSMATTISEQKEGDEASSKQSREIGRNTRPKIRRKRPYSGDPSYRREQRDKCA

Query:  SKRPAELLPEKKSRVNCRYTHGTTESREARTRKLNGGRHEQQSGVSHGRRSRSPATRTVRETNKTGNMKSSVMKMTGQTGDQQEAVTTEKRDDGKMEKPM
        +KRPAELLPEKKSRVNCRY+HGTTESREARTRKLNG +HEQQSGVSHGRRSRSPA RTV++TNKTGNMKSSVMKMTGQ GDQQE VTTE RD+GK+EKPM
Subjt:  SKRPAELLPEKKSRVNCRYTHGTTESREARTRKLNGGRHEQQSGVSHGRRSRSPATRTVRETNKTGNMKSSVMKMTGQTGDQQEAVTTEKRDDGKMEKPM

Query:  DDAIQPPNESIENPLVSLECFIFL
        D +IQPPNESIENPLVSLECFIFL
Subjt:  DDAIQPPNESIENPLVSLECFIFL

KAE8645764.1 hypothetical protein Csa_020528 [Cucumis sativus]3.0e-14184.57Show/hide
Query:  MPRIKHSIDSQYLLHWYWVSSHFTANSATAPLSPLSVSGGALDRGQPHILDKNHRSAKPPSPGWFDTDVENIPGNGMPEEETAKEVLSETPIAKPCSVQQ
        MPRIKH+IDSQ+LL+WYW SSHFTANSA+  +SP      ALDRGQPH LDKNHRSAKP SPGWFDTDVEN+P N  PEEET KEVLSETPIAKPCSV Q
Subjt:  MPRIKHSIDSQYLLHWYWVSSHFTANSATAPLSPLSVSGGALDRGQPHILDKNHRSAKPPSPGWFDTDVENIPGNGMPEEETAKEVLSETPIAKPCSVQQ

Query:  TAPKNKSPELKVKSSEMDGSLGKGEESIVSVSETSQVTEWCGNMSESVSMATTISEQKEGDEASSKQSREIGRNTRPKIRRKRPYSGDPSYRREQRDKCA
        T+ K K PE +VK+SEMDGSLGKGEESIVSVSETSQVTEWC NMSESVSMATTISEQ+EGDEASSK SR+IGRN +PKIRRKRP SG+ SYRREQRDKC 
Subjt:  TAPKNKSPELKVKSSEMDGSLGKGEESIVSVSETSQVTEWCGNMSESVSMATTISEQKEGDEASSKQSREIGRNTRPKIRRKRPYSGDPSYRREQRDKCA

Query:  SKRPAELLPEKKSRVNCRYTHGTTESREARTRKLNGGRHEQQSGVSHGRRSRSPATRTVRETNKTGNMKSSVMKMTGQTGDQQEAVTTEKRDDGKMEKPM
        +KRPAELLPEKKSRVNCRY+HGTTESREARTRKLNGG+HEQQSGVSHGRRSRSPATRTV+ETNKTGNMKSSVMKMTGQ GDQQE VTTE RD+GK+EKPM
Subjt:  SKRPAELLPEKKSRVNCRYTHGTTESREARTRKLNGGRHEQQSGVSHGRRSRSPATRTVRETNKTGNMKSSVMKMTGQTGDQQEAVTTEKRDDGKMEKPM

Query:  DDAIQPPNESIENPLVSLECFIFL
        D +IQPPNESIENPLVSLECFIFL
Subjt:  DDAIQPPNESIENPLVSLECFIFL

KAG6572349.1 hypothetical protein SDJN03_29077, partial [Cucurbita argyrosperma subsp. sororia]1.4e-13377.23Show/hide
Query:  MPRIKHSIDSQYLLHWYWVSSHFTANSATAPL-------------SPLSV---------SGGALDRGQPHILDKNHRSAKPPSPGWFDTDVENIPGNGMP
        MPRIKH+IDSQYLLHWYW SSHF ANSATAPL             SPL +             L+ GQPHILDK+  SAK PSPG   TDV NIPG G P
Subjt:  MPRIKHSIDSQYLLHWYWVSSHFTANSATAPL-------------SPLSV---------SGGALDRGQPHILDKNHRSAKPPSPGWFDTDVENIPGNGMP

Query:  EEETAKEVLSETPIAKPCSVQQTAPKNKSPELKVKSSEMDGSLGKGEESIVSVSETSQVTEWCGNMSESVSMATTISEQKEGDEASSKQSREIGRNTRPK
        EEET KEVLSETPIAKPC++QQT+PKNKS ELKVKSSEMDGSL K EE  +SVSE SQVTEWC NMSESVSMATTISEQ+EGDEASSKQSRE+GRN +PK
Subjt:  EEETAKEVLSETPIAKPCSVQQTAPKNKSPELKVKSSEMDGSLGKGEESIVSVSETSQVTEWCGNMSESVSMATTISEQKEGDEASSKQSREIGRNTRPK

Query:  IRRKRPYSGDPSYRREQRDKCASKRPAELLPEKKSRVNCRYTHGTTESREARTRKLNGGRHEQQSGVSHGRRSRSPATRTVRETNKTGNMKSSVMKMTGQ
        IRRKRPYSGDPSYRREQRDKCA+KRPAELL EKKSRV CRYTHGTTESREARTRKLNGG+ EQ+SGV+HGRRSRSPATRTVRETNKTGNMKSS +K+TGQ
Subjt:  IRRKRPYSGDPSYRREQRDKCASKRPAELLPEKKSRVNCRYTHGTTESREARTRKLNGGRHEQQSGVSHGRRSRSPATRTVRETNKTGNMKSSVMKMTGQ

Query:  TGDQQEAVTTEKRDDGKMEKPMD-DAIQPPNESIENPLVSLECFIFL
         G+Q EAVTTEKRD+GK++K MD  A QPPNESIENPLVSLECFIFL
Subjt:  TGDQQEAVTTEKRDDGKMEKPMD-DAIQPPNESIENPLVSLECFIFL

XP_008455342.1 PREDICTED: uncharacterized protein LOC103495529 isoform X2 [Cucumis melo]4.3e-14385.49Show/hide
Query:  MPRIKHSIDSQYLLHWYWVSSHFTANSATAPLSPLSVSGGALDRGQPHILDKNHRSAKPPSPGWFDTDVENIPGNGMPEEETAKEVLSETPIAKPCSVQQ
        MPRIKH+IDSQ+LLHWYW SSHFTANSA       SVS  ALDRGQPH LDKNHRSAKPPSPGWFDTDVEN+P N  PEEET KEVLSETPIAKPCSV+Q
Subjt:  MPRIKHSIDSQYLLHWYWVSSHFTANSATAPLSPLSVSGGALDRGQPHILDKNHRSAKPPSPGWFDTDVENIPGNGMPEEETAKEVLSETPIAKPCSVQQ

Query:  TAPKNKSPELKVKSSEMDGSLGKGEESIVSVSETSQVTEWCGNMSESVSMATTISEQKEGDEASSKQSREIGRNTRPKIRRKRPYSGDPSYRREQRDKCA
        T+ K K PE KVK+SEMDGSL KGEESIVSVSETSQVTEWC NMSESVSMATTISEQ+EGDEASSK SREIGRN +PKIRRKRP SGD SYRREQRDKCA
Subjt:  TAPKNKSPELKVKSSEMDGSLGKGEESIVSVSETSQVTEWCGNMSESVSMATTISEQKEGDEASSKQSREIGRNTRPKIRRKRPYSGDPSYRREQRDKCA

Query:  SKRPAELLPEKKSRVNCRYTHGTTESREARTRKLNGGRHEQQSGVSHGRRSRSPATRTVRETNKTGNMKSSVMKMTGQTGDQQEAVTTEKRDDGKMEKPM
        +KRPAELLPEKKSRVNCRY+HGTTESREARTRKLNG +HEQQSGVSHGRRSRSPA RTV++TNKTGNMKSSVMKMTGQ GDQQE VTTE RD+GK+EKPM
Subjt:  SKRPAELLPEKKSRVNCRYTHGTTESREARTRKLNGGRHEQQSGVSHGRRSRSPATRTVRETNKTGNMKSSVMKMTGQTGDQQEAVTTEKRDDGKMEKPM

Query:  DDAIQPPNESIENPLVSLECFIFL
        D +IQPPNESIENPLVSLECFIFL
Subjt:  DDAIQPPNESIENPLVSLECFIFL

XP_038887224.1 uncharacterized protein LOC120077414 [Benincasa hispida]2.0e-13289.79Show/hide
Query:  ALDRGQPHILDKNHRSAKPPSPGWFDTDVENIPGNGMPEEETAKEVLSETPIAKPCSVQQTAPKNKSPELKVKSSEMDGSLGKGEESIVSVSETSQVTEW
        ALDRGQPHILDK+HR AKPPSPGWFDTDV NIP NG PEEET KEVLSETPIAKPC+VQQT PKNKS ELKVK+SEMDGS  K EESIVSVSETSQVTEW
Subjt:  ALDRGQPHILDKNHRSAKPPSPGWFDTDVENIPGNGMPEEETAKEVLSETPIAKPCSVQQTAPKNKSPELKVKSSEMDGSLGKGEESIVSVSETSQVTEW

Query:  CGNMSESVSMATTISEQKEGDEASSKQSREIGRNTRPKIRRKRPYSGDPSYRREQRDKCASKRPAELLPEKKSRVNCRYTHGTTESREARTRKLNGGRHE
        C N+SES+SMATTISEQ+EGDEASSK SREIGRNT+PKIRRKRPYSGD SYRREQRDKCA+KRPAELLPEKKSRVNCRYTHGTTESREARTRKLNGGRHE
Subjt:  CGNMSESVSMATTISEQKEGDEASSKQSREIGRNTRPKIRRKRPYSGDPSYRREQRDKCASKRPAELLPEKKSRVNCRYTHGTTESREARTRKLNGGRHE

Query:  QQSGVSHGRRSRSPATRTVRETNKTGNMKSSVMKMTGQTGDQQEAVTTEKRDDGKMEKPMDDAIQPPNESIENPLVSLECFIFL
        QQSGVSHGRRSRSPATRTVRETNKTGNMKSS MKMTGQ GDQ E +TTEK  +GKMEKPMDDAIQPPNESIENPLVSLECFIFL
Subjt:  QQSGVSHGRRSRSPATRTVRETNKTGNMKSSVMKMTGQTGDQQEAVTTEKRDDGKMEKPMDDAIQPPNESIENPLVSLECFIFL

TrEMBL top hitse value%identityAlignment
A0A0A0K701 Uncharacterized protein2.7e-12787.32Show/hide
Query:  ALDRGQPHILDKNHRSAKPPSPGWFDTDVENIPGNGMPEEETAKEVLSETPIAKPCSVQQTAPKNKSPELKVKSSEMDGSLGKGEESIVSVSETSQVTEW
        ALDRGQPH LDKNHRSAKP SPGWFDTDVEN+P N  PEEET KEVLSETPIAKPCSV QT+ K K PE +VK+SEMDGSLGKGEESIVSVSETSQVTEW
Subjt:  ALDRGQPHILDKNHRSAKPPSPGWFDTDVENIPGNGMPEEETAKEVLSETPIAKPCSVQQTAPKNKSPELKVKSSEMDGSLGKGEESIVSVSETSQVTEW

Query:  CGNMSESVSMATTISEQKEGDEASSKQSREIGRNTRPKIRRKRPYSGDPSYRREQRDKCASKRPAELLPEKKSRVNCRYTHGTTESREARTRKLNGGRHE
        C NMSESVSMATTISEQ+EGDEASSK SR+IGRN +PKIRRKRP SG+ SYRREQRDKC +KRPAELLPEKKSRVNCRY+HGTTESREARTRKLNGG+HE
Subjt:  CGNMSESVSMATTISEQKEGDEASSKQSREIGRNTRPKIRRKRPYSGDPSYRREQRDKCASKRPAELLPEKKSRVNCRYTHGTTESREARTRKLNGGRHE

Query:  QQSGVSHGRRSRSPATRTVRETNKTGNMKSSVMKMTGQTGDQQEAVTTEKRDDGKMEKPMDDAIQPPNESIENPLVSLECFIFL
        QQSGVSHGRRSRSPATRTV+ETNKTGNMKSSVMKMTGQ GDQQE VTTE RD+GK+EKPMD +IQPPNESIENPLVSLECFIFL
Subjt:  QQSGVSHGRRSRSPATRTVRETNKTGNMKSSVMKMTGQTGDQQEAVTTEKRDDGKMEKPMDDAIQPPNESIENPLVSLECFIFL

A0A1S3C0P3 uncharacterized protein LOC103495529 isoform X22.1e-14385.49Show/hide
Query:  MPRIKHSIDSQYLLHWYWVSSHFTANSATAPLSPLSVSGGALDRGQPHILDKNHRSAKPPSPGWFDTDVENIPGNGMPEEETAKEVLSETPIAKPCSVQQ
        MPRIKH+IDSQ+LLHWYW SSHFTANSA       SVS  ALDRGQPH LDKNHRSAKPPSPGWFDTDVEN+P N  PEEET KEVLSETPIAKPCSV+Q
Subjt:  MPRIKHSIDSQYLLHWYWVSSHFTANSATAPLSPLSVSGGALDRGQPHILDKNHRSAKPPSPGWFDTDVENIPGNGMPEEETAKEVLSETPIAKPCSVQQ

Query:  TAPKNKSPELKVKSSEMDGSLGKGEESIVSVSETSQVTEWCGNMSESVSMATTISEQKEGDEASSKQSREIGRNTRPKIRRKRPYSGDPSYRREQRDKCA
        T+ K K PE KVK+SEMDGSL KGEESIVSVSETSQVTEWC NMSESVSMATTISEQ+EGDEASSK SREIGRN +PKIRRKRP SGD SYRREQRDKCA
Subjt:  TAPKNKSPELKVKSSEMDGSLGKGEESIVSVSETSQVTEWCGNMSESVSMATTISEQKEGDEASSKQSREIGRNTRPKIRRKRPYSGDPSYRREQRDKCA

Query:  SKRPAELLPEKKSRVNCRYTHGTTESREARTRKLNGGRHEQQSGVSHGRRSRSPATRTVRETNKTGNMKSSVMKMTGQTGDQQEAVTTEKRDDGKMEKPM
        +KRPAELLPEKKSRVNCRY+HGTTESREARTRKLNG +HEQQSGVSHGRRSRSPA RTV++TNKTGNMKSSVMKMTGQ GDQQE VTTE RD+GK+EKPM
Subjt:  SKRPAELLPEKKSRVNCRYTHGTTESREARTRKLNGGRHEQQSGVSHGRRSRSPATRTVRETNKTGNMKSSVMKMTGQTGDQQEAVTTEKRDDGKMEKPM

Query:  DDAIQPPNESIENPLVSLECFIFL
        D +IQPPNESIENPLVSLECFIFL
Subjt:  DDAIQPPNESIENPLVSLECFIFL

A0A5D3C531 Serine/arginine repetitive matrix protein 1-like1.0e-14285.19Show/hide
Query:  MPRIKHSIDSQYLLHWYWVSSHFTANSATAPLSPLSVSGGALDRGQPHILDKNHRSAKPPSPGWFDTDVENIPGNGMPEEETAKEVLSETPIAKPCSVQQ
        MPRIKH+IDSQ+LLHWYW SSHFTANSA       SVS  ALDRGQPH LDKN RSAKPPSPGWFDTDVEN+P N  PEEET KEVLSETPIAKPCSV+Q
Subjt:  MPRIKHSIDSQYLLHWYWVSSHFTANSATAPLSPLSVSGGALDRGQPHILDKNHRSAKPPSPGWFDTDVENIPGNGMPEEETAKEVLSETPIAKPCSVQQ

Query:  TAPKNKSPELKVKSSEMDGSLGKGEESIVSVSETSQVTEWCGNMSESVSMATTISEQKEGDEASSKQSREIGRNTRPKIRRKRPYSGDPSYRREQRDKCA
        T+PK K PE KVK+S MDGSL KGEESIVSVSETSQVTEWC NMSESVSMATTISEQ+EGDEASSK SREIGRN +PKIRRKRP SGD SYRREQRDKCA
Subjt:  TAPKNKSPELKVKSSEMDGSLGKGEESIVSVSETSQVTEWCGNMSESVSMATTISEQKEGDEASSKQSREIGRNTRPKIRRKRPYSGDPSYRREQRDKCA

Query:  SKRPAELLPEKKSRVNCRYTHGTTESREARTRKLNGGRHEQQSGVSHGRRSRSPATRTVRETNKTGNMKSSVMKMTGQTGDQQEAVTTEKRDDGKMEKPM
        +KRPAELLPEKKSRVNCRY+HGTTESREARTRKLNG +HEQQSGVSHGRRSRSPA RTV++TNKTGNMKSSVMKMTGQ GDQQE VTTE RD+GK+EKPM
Subjt:  SKRPAELLPEKKSRVNCRYTHGTTESREARTRKLNGGRHEQQSGVSHGRRSRSPATRTVRETNKTGNMKSSVMKMTGQTGDQQEAVTTEKRDDGKMEKPM

Query:  DDAIQPPNESIENPLVSLECFIFL
        D +IQPPNESIENPLVSLECFIFL
Subjt:  DDAIQPPNESIENPLVSLECFIFL

A0A6J1GNG7 uncharacterized protein LOC1114555167.9e-11983.1Show/hide
Query:  LDRGQPHILDKNHRSAKPPSPGWFDTDVENIPGNGMPEEETAKEVLSETPIAKPCSVQQTAPKNKSPELKVKSSEMDGSLGKGEESIVSVSETSQVTEWC
        L+ GQPHILDK+  SAK PSPG   TDV NIPG G PEEET KEVLSETPIAKPC++QQT+PKNKS ELKVKSSEMDGSL K EE  +SVSE SQVTEWC
Subjt:  LDRGQPHILDKNHRSAKPPSPGWFDTDVENIPGNGMPEEETAKEVLSETPIAKPCSVQQTAPKNKSPELKVKSSEMDGSLGKGEESIVSVSETSQVTEWC

Query:  GNMSESVSMATTISEQKEGDEASSKQSREIGRNTRPKIRRKRPYSGDPSYRREQRDKCASKRPAELLPEKKSRVNCRYTHGTTESREARTRKLNGGRHEQ
         NMSESVSMATTISEQ+EGDEASSKQSRE+GRN +PKIRRKRPYSGDPSYRREQRDKCA+KRPAELL EKKSRV CRYTHGTTESREARTRKLNGG+ EQ
Subjt:  GNMSESVSMATTISEQKEGDEASSKQSREIGRNTRPKIRRKRPYSGDPSYRREQRDKCASKRPAELLPEKKSRVNCRYTHGTTESREARTRKLNGGRHEQ

Query:  QSGVSHGRRSRSPATRTVRETNKTGNMKSSVMKMTGQTGDQQEAVTTEKRDDGKMEKPMD-DAIQPPNESIENPLVSLECFIFL
        +SGV+HGRRSRSPATRTVRETNKTGNMKSS +K+TGQ G+Q EAVTTEKRD+GK++K MD  A QPPNESIENPLVSLECFIFL
Subjt:  QSGVSHGRRSRSPATRTVRETNKTGNMKSSVMKMTGQTGDQQEAVTTEKRDDGKMEKPMD-DAIQPPNESIENPLVSLECFIFL

A0A6J1HYF2 uncharacterized protein LOC1114680621.3e-11883.1Show/hide
Query:  LDRGQPHILDKNHRSAKPPSPGWFDTDVENIPGNGMPEEETAKEVLSETPIAKPCSVQQTAPKNKSPELKVKSSEMDGSLGKGEESIVSVSETSQVTEWC
        L+ GQPHILDK+  SAK PSPG   TDV NIPG G PEEET KEVLSETPIAKPC++QQT+PKNKS ELKVKSSEMDGSL K EE  +SVSE SQVTEWC
Subjt:  LDRGQPHILDKNHRSAKPPSPGWFDTDVENIPGNGMPEEETAKEVLSETPIAKPCSVQQTAPKNKSPELKVKSSEMDGSLGKGEESIVSVSETSQVTEWC

Query:  GNMSESVSMATTISEQKEGDEASSKQSREIGRNTRPKIRRKRPYSGDPSYRREQRDKCASKRPAELLPEKKSRVNCRYTHGTTESREARTRKLNGGRHEQ
         NMSESVSMATTISEQ+EGDEASSKQSRE+GRN +PKIRRKRPYSGDPSYRR+QRDKCA+KRPAELL EKKSRV CRYTHGTTESREARTRKLNGG+ EQ
Subjt:  GNMSESVSMATTISEQKEGDEASSKQSREIGRNTRPKIRRKRPYSGDPSYRREQRDKCASKRPAELLPEKKSRVNCRYTHGTTESREARTRKLNGGRHEQ

Query:  QSGVSHGRRSRSPATRTVRETNKTGNMKSSVMKMTGQTGDQQEAVTTEKRDDGKMEKPMDD-AIQPPNESIENPLVSLECFIFL
        +SGVSHGRRSRSPAT+TVRETNKTGNMKSS +K TGQ G+Q EAVTTEKRD+GK++K MD  AIQPPNESIENPLVSLECFIFL
Subjt:  QSGVSHGRRSRSPATRTVRETNKTGNMKSSVMKMTGQTGDQQEAVTTEKRDDGKMEKPMDD-AIQPPNESIENPLVSLECFIFL

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCCCCGTATTAAACACTCCATTGACAGTCAGTATCTTCTCCATTGGTACTGGGTGTCGAGCCATTTCACTGCCAACTCTGCCACAGCTCCACTCTCTCCCCTCTCTGT
TTCTGGAGGAGCCTTGGACAGAGGTCAGCCTCATATCCTGGACAAGAATCACCGTTCTGCAAAACCTCCATCACCGGGATGGTTCGACACTGATGTCGAAAACATCCCAG
GCAACGGAATGCCGGAAGAAGAAACCGCCAAAGAAGTCCTCTCAGAGACTCCCATTGCTAAGCCATGTAGCGTACAACAAACAGCCCCTAAGAACAAGTCTCCCGAGCTG
AAAGTAAAATCGAGTGAAATGGATGGTTCGCTTGGCAAGGGGGAAGAAAGTATAGTTTCAGTTTCGGAGACATCTCAGGTTACAGAATGGTGCGGCAATATGAGCGAAAG
TGTGTCAATGGCGACCACAATTTCGGAGCAAAAGGAAGGGGATGAAGCATCAAGTAAACAGAGCAGAGAAATTGGTCGAAATACAAGACCAAAGATTCGCAGAAAGCGTC
CATATTCCGGCGACCCATCGTACCGAAGAGAACAGAGAGACAAATGTGCATCAAAGAGACCTGCTGAACTTTTACCAGAGAAGAAGTCTCGTGTTAATTGCAGGTACACA
CATGGAACGACAGAATCAAGAGAAGCGAGGACCAGGAAGCTGAATGGAGGGCGGCACGAGCAACAATCTGGAGTCAGCCATGGCCGCCGTTCGAGGTCACCAGCTACTCG
AACAGTTAGAGAAACGAATAAGACAGGGAATATGAAAAGCAGTGTCATGAAAATGACCGGACAAACTGGGGACCAGCAAGAGGCAGTGACCACCGAGAAAAGAGATGATG
GAAAGATGGAGAAGCCAATGGATGATGCAATTCAGCCCCCTAATGAATCCATTGAAAACCCACTTGTCTCACTTGAATGTTTCATCTTTCTGTAG
mRNA sequenceShow/hide mRNA sequence
ATGCCCCGTATTAAACACTCCATTGACAGTCAGTATCTTCTCCATTGGTACTGGGTGTCGAGCCATTTCACTGCCAACTCTGCCACAGCTCCACTCTCTCCCCTCTCTGT
TTCTGGAGGAGCCTTGGACAGAGGTCAGCCTCATATCCTGGACAAGAATCACCGTTCTGCAAAACCTCCATCACCGGGATGGTTCGACACTGATGTCGAAAACATCCCAG
GCAACGGAATGCCGGAAGAAGAAACCGCCAAAGAAGTCCTCTCAGAGACTCCCATTGCTAAGCCATGTAGCGTACAACAAACAGCCCCTAAGAACAAGTCTCCCGAGCTG
AAAGTAAAATCGAGTGAAATGGATGGTTCGCTTGGCAAGGGGGAAGAAAGTATAGTTTCAGTTTCGGAGACATCTCAGGTTACAGAATGGTGCGGCAATATGAGCGAAAG
TGTGTCAATGGCGACCACAATTTCGGAGCAAAAGGAAGGGGATGAAGCATCAAGTAAACAGAGCAGAGAAATTGGTCGAAATACAAGACCAAAGATTCGCAGAAAGCGTC
CATATTCCGGCGACCCATCGTACCGAAGAGAACAGAGAGACAAATGTGCATCAAAGAGACCTGCTGAACTTTTACCAGAGAAGAAGTCTCGTGTTAATTGCAGGTACACA
CATGGAACGACAGAATCAAGAGAAGCGAGGACCAGGAAGCTGAATGGAGGGCGGCACGAGCAACAATCTGGAGTCAGCCATGGCCGCCGTTCGAGGTCACCAGCTACTCG
AACAGTTAGAGAAACGAATAAGACAGGGAATATGAAAAGCAGTGTCATGAAAATGACCGGACAAACTGGGGACCAGCAAGAGGCAGTGACCACCGAGAAAAGAGATGATG
GAAAGATGGAGAAGCCAATGGATGATGCAATTCAGCCCCCTAATGAATCCATTGAAAACCCACTTGTCTCACTTGAATGTTTCATCTTTCTGTAG
Protein sequenceShow/hide protein sequence
MPRIKHSIDSQYLLHWYWVSSHFTANSATAPLSPLSVSGGALDRGQPHILDKNHRSAKPPSPGWFDTDVENIPGNGMPEEETAKEVLSETPIAKPCSVQQTAPKNKSPEL
KVKSSEMDGSLGKGEESIVSVSETSQVTEWCGNMSESVSMATTISEQKEGDEASSKQSREIGRNTRPKIRRKRPYSGDPSYRREQRDKCASKRPAELLPEKKSRVNCRYT
HGTTESREARTRKLNGGRHEQQSGVSHGRRSRSPATRTVRETNKTGNMKSSVMKMTGQTGDQQEAVTTEKRDDGKMEKPMDDAIQPPNESIENPLVSLECFIFL