; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CsGy1G003780 (gene) of Cucumber (Gy14) v2.1 genome

Gene IDCsGy1G003780
OrganismCucumis sativus L. var. sativus cv. Gy14 (Cucumber (Gy14) v2.1)
DescriptionVQ motif-containing protein 20
Genome locationGy14Chr1:2380325..2381065
RNA-Seq ExpressionCsGy1G003780
SyntenyCsGy1G003780
Gene Ontology termsGO:0006355 - regulation of transcription, DNA-templated (biological process)
GO:0006952 - defense response (biological process)
GO:0005634 - nucleus (cellular component)
GO:0003700 - DNA-binding transcription factor activity (molecular function)
GO:0043565 - sequence-specific DNA binding (molecular function)
InterPro domainsIPR008889 - VQ
IPR039607 - VQ motif-containing protein 8/17/18/20/21/25
IPR039832 - VQ motif-containing protein 20


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KGN62345.1 hypothetical protein Csa_018596 [Cucumis sativus]6.38e-10977.38Show/hide
Query:  MSPQQQQQQIHGKMDNQITNNGNNKPILCPPPLKINKDSHLIRKTSSSSNTSSPSSSSSSTTSLANGVAAAAAKPPPQRHPVIIYTHSPKIIHTHPRDFM
        MSPQQQQQ IHGK DNQ TNN  NKP+  PP LKINKDSHLIRK+SSSSNTSSPSSSSSSTTSL NGV   AA  PPQRHPVIIYTHSPKIIHTHPRDFM
Subjt:  MSPQQQQQQIHGKMDNQITNNGNNKPILCPPPLKINKDSHLIRKTSSSSNTSSPSSSSSSTTSLANGVAAAAAKPPPQRHPVIIYTHSPKIIHTHPRDFM

Query:  ALVQKLTGMSRSDHDHEASTTKLATTKSAVDENNNNNNNNNKGSKAAVVNDDNESSSVVTTDENCCGGGGS-GLMEVGQVNSCFGPAIFEPPPPPP--QL
        ALVQKLTGMSRSD   EAST   AT KS VDENN       K SK  VVNDDNESSSVVTTDENCCGG GS G +E GQVNSCFGPAIFEPPPPPP  QL
Subjt:  ALVQKLTGMSRSDHDHEASTTKLATTKSAVDENNNNNNNNNKGSKAAVVNDDNESSSVVTTDENCCGGGGS-GLMEVGQVNSCFGPAIFEPPPPPP--QL

Query:  ANSYLTNIPIYTPNSTEFLCTNQPIFNYDDSLLFGGN---RSVNDFCEYSEF
        A+SYL+NIP+Y PNSTEFLC NQPIFNYDDSLLFG N    S N   ++SEF
Subjt:  ANSYLTNIPIYTPNSTEFLCTNQPIFNYDDSLLFGGN---RSVNDFCEYSEF

KGN63830.2 hypothetical protein Csa_013231 [Cucumis sativus]3.67e-139100Show/hide
Query:  MSPQQQQQQIHGKMDNQITNNGNNKPILCPPPLKINKDSHLIRKTSSSSNTSSPSSSSSSTTSLANGVAAAAAKPPPQRHPVIIYTHSPKIIHTHPRDFM
        MSPQQQQQQIHGKMDNQITNNGNNKPILCPPPLKINKDSHLIRKTSSSSNTSSPSSSSSSTTSLANGVAAAAAKPPPQRHPVIIYTHSPKIIHTHPRDFM
Subjt:  MSPQQQQQQIHGKMDNQITNNGNNKPILCPPPLKINKDSHLIRKTSSSSNTSSPSSSSSSTTSLANGVAAAAAKPPPQRHPVIIYTHSPKIIHTHPRDFM

Query:  ALVQKLTGMSRSDHDHEASTTKLATTKSAVDENNNNNNNNNKGSKAAVVNDDNESSSVVTTDENCCGGGGSGLMEVGQVNSCFGPAIFEPPPPPPQLANS
        ALVQKLTGMSRSDHDHEASTTKLATTKSAVDENNNNNNNNNKGSKAAVVNDDNESSSVVTTDENCCGGGGSGLMEVGQVNSCFGPAIFEPPPPPPQLANS
Subjt:  ALVQKLTGMSRSDHDHEASTTKLATTKSAVDENNNNNNNNNKGSKAAVVNDDNESSSVVTTDENCCGGGGSGLMEVGQVNSCFGPAIFEPPPPPPQLANS

Query:  YLTNIPIYTPNSTEF
        YLTNIPIYTPNSTEF
Subjt:  YLTNIPIYTPNSTEF

XP_004138347.3 VQ motif-containing protein 20 [Cucumis sativus]1.04e-169100Show/hide
Query:  MSPQQQQQQIHGKMDNQITNNGNNKPILCPPPLKINKDSHLIRKTSSSSNTSSPSSSSSSTTSLANGVAAAAAKPPPQRHPVIIYTHSPKIIHTHPRDFM
        MSPQQQQQQIHGKMDNQITNNGNNKPILCPPPLKINKDSHLIRKTSSSSNTSSPSSSSSSTTSLANGVAAAAAKPPPQRHPVIIYTHSPKIIHTHPRDFM
Subjt:  MSPQQQQQQIHGKMDNQITNNGNNKPILCPPPLKINKDSHLIRKTSSSSNTSSPSSSSSSTTSLANGVAAAAAKPPPQRHPVIIYTHSPKIIHTHPRDFM

Query:  ALVQKLTGMSRSDHDHEASTTKLATTKSAVDENNNNNNNNNKGSKAAVVNDDNESSSVVTTDENCCGGGGSGLMEVGQVNSCFGPAIFEPPPPPPQLANS
        ALVQKLTGMSRSDHDHEASTTKLATTKSAVDENNNNNNNNNKGSKAAVVNDDNESSSVVTTDENCCGGGGSGLMEVGQVNSCFGPAIFEPPPPPPQLANS
Subjt:  ALVQKLTGMSRSDHDHEASTTKLATTKSAVDENNNNNNNNNKGSKAAVVNDDNESSSVVTTDENCCGGGGSGLMEVGQVNSCFGPAIFEPPPPPPQLANS

Query:  YLTNIPIYTPNSTEFLCTNQPIFNYDDSLLFGGNRSVNDFCEYSEF
        YLTNIPIYTPNSTEFLCTNQPIFNYDDSLLFGGNRSVNDFCEYSEF
Subjt:  YLTNIPIYTPNSTEFLCTNQPIFNYDDSLLFGGNRSVNDFCEYSEF

XP_008453685.1 PREDICTED: VQ motif-containing protein 20 [Cucumis melo]1.19e-12985.88Show/hide
Query:  MSPQQQQQQIHGKMDNQITNNGNNKPILCPPPLKINKDSHLIRKTSSSSN--TSSPSSSSSSTTSLANGVAAAAAKP--PPQRHPVIIYTHSPKIIHTHP
        MSPQQQ   IHGK DNQI NNGN KPILCPPPLKINKDSHLIRKTSSSSN  TSSPSSSSSSTTSL NGVAAAAA    P QRHPVIIYTHSPK+IHTHP
Subjt:  MSPQQQQQQIHGKMDNQITNNGNNKPILCPPPLKINKDSHLIRKTSSSSN--TSSPSSSSSSTTSLANGVAAAAAKP--PPQRHPVIIYTHSPKIIHTHP

Query:  RDFMALVQKLTGMSRSDHDHEASTTKLATTKSAVDENNNNNNNNNKGSKAAVVNDDNESSSVVTTDENCCGGGG-SGLMEVGQVNSCFGPAIFEPPPPPP
        RDFMALVQKLTGMSRSD DHEAST KLATTK  VDENNNNN    KGSKA VV+DDNESSSVVTTDENCCGGGG SG+MEVGQVNSCFGP IFEPPPPPP
Subjt:  RDFMALVQKLTGMSRSDHDHEASTTKLATTKSAVDENNNNNNNNNKGSKAAVVNDDNESSSVVTTDENCCGGGG-SGLMEVGQVNSCFGPAIFEPPPPPP

Query:  -----QLANSYLTNIPIYTPNSTEFLCTNQPIFNYDDSLLFGGNRSVNDFCEYSE
             QLANSYLTNIPIYTPNS EFLC NQPIFNYDDSLLFGGNR VNDFCEYSE
Subjt:  -----QLANSYLTNIPIYTPNSTEFLCTNQPIFNYDDSLLFGGNRSVNDFCEYSE

XP_038876677.1 VQ motif-containing protein 20 [Benincasa hispida]1.29e-11177.39Show/hide
Query:  MSPQQQQQQIHGKMDNQ--ITNNGN---NKPILCPPPLKINKDSHLIRKTSSSSNTSSPSSSSSSTTSLANGVAAAAAKPPPQRHPVIIYTHSPKIIHTH
        MSPQ QQQ +HGK DNQ    NNGN   NKP LCPPPLKINKDSHLIRK+SSSSNTSSPSSSSSSTTSL NGV       PPQRHPVIIYTHSPKIIHTH
Subjt:  MSPQQQQQQIHGKMDNQ--ITNNGN---NKPILCPPPLKINKDSHLIRKTSSSSNTSSPSSSSSSTTSLANGVAAAAAKPPPQRHPVIIYTHSPKIIHTH

Query:  PRDFMALVQKLTGMSRSDHDHEASTTKLATTKSAVDENNNNNNNNNKGSKAAVVNDDNESSSVVTTDENCCGGGGSGLMEVGQVNSCFGPAIFEPPPPPP
        PRDFMALVQKLTGMSRSD   EAST   ATTKS VDENN       KGSK AVVNDDNESSSVVTTDENCCGGG SG++E GQVNSCFG AIFEPPPPPP
Subjt:  PRDFMALVQKLTGMSRSDHDHEASTTKLATTKSAVDENNNNNNNNNKGSKAAVVNDDNESSSVVTTDENCCGGGGSGLMEVGQVNSCFGPAIFEPPPPPP

Query:  --QLANSYLTNIPIYTPNSTEFLCTNQP-IFNYDDSLLFGGNR-------SVNDFCEYSEF
          QLANSYLTNIP+Y PNSTEFLCTNQP  FNYDDSLLFG +         +NDFCEYSEF
Subjt:  --QLANSYLTNIPIYTPNSTEFLCTNQP-IFNYDDSLLFGGNR-------SVNDFCEYSEF

TrEMBL top hitse value%identityAlignment
A0A0A0LKA6 VQ domain-containing protein3.09e-10977.38Show/hide
Query:  MSPQQQQQQIHGKMDNQITNNGNNKPILCPPPLKINKDSHLIRKTSSSSNTSSPSSSSSSTTSLANGVAAAAAKPPPQRHPVIIYTHSPKIIHTHPRDFM
        MSPQQQQQ IHGK DNQ TNN  NKP+  PP LKINKDSHLIRK+SSSSNTSSPSSSSSSTTSL NGV   AA  PPQRHPVIIYTHSPKIIHTHPRDFM
Subjt:  MSPQQQQQQIHGKMDNQITNNGNNKPILCPPPLKINKDSHLIRKTSSSSNTSSPSSSSSSTTSLANGVAAAAAKPPPQRHPVIIYTHSPKIIHTHPRDFM

Query:  ALVQKLTGMSRSDHDHEASTTKLATTKSAVDENNNNNNNNNKGSKAAVVNDDNESSSVVTTDENCCGGGGS-GLMEVGQVNSCFGPAIFEPPPPPP--QL
        ALVQKLTGMSRSD   EAST   AT KS VDENN       K SK  VVNDDNESSSVVTTDENCCGG GS G +E GQVNSCFGPAIFEPPPPPP  QL
Subjt:  ALVQKLTGMSRSDHDHEASTTKLATTKSAVDENNNNNNNNNKGSKAAVVNDDNESSSVVTTDENCCGGGGS-GLMEVGQVNSCFGPAIFEPPPPPP--QL

Query:  ANSYLTNIPIYTPNSTEFLCTNQPIFNYDDSLLFGGN---RSVNDFCEYSEF
        A+SYL+NIP+Y PNSTEFLC NQPIFNYDDSLLFG N    S N   ++SEF
Subjt:  ANSYLTNIPIYTPNSTEFLCTNQPIFNYDDSLLFGGN---RSVNDFCEYSEF

A0A0A0LPK7 VQ domain-containing protein1.14e-13885.37Show/hide
Query:  MSPQQQQQQIHGKMDNQITNNGNNKPILCPPPLKINKDSHLIRKTSSSSNTSSPSSSSSSTTSLANGVAAAAAKPPPQRHPVIIYTHSPKIIHTHPRDFM
        MSPQQQQQQIHGKMDNQITNNGNNKPILCPPPLKINKDSHLIRKTSSSSNTSSPSSSSSSTTSLANGVAAAAAKPPPQRHPVIIYTHSPKIIHTHPRDFM
Subjt:  MSPQQQQQQIHGKMDNQITNNGNNKPILCPPPLKINKDSHLIRKTSSSSNTSSPSSSSSSTTSLANGVAAAAAKPPPQRHPVIIYTHSPKIIHTHPRDFM

Query:  ALVQKLTGMSRSDHDHEASTTKLATTKSAVDENNNNNNNNNKGSKAAVVNDDNESSSVVTTDENCCGGGGSGLMEVGQVNSCFGPAIFEPPPPPPQLANS
        ALVQKLTG+                                   KAAVVNDDNESSSVVTTDENCCGGGGSGLMEVGQVNSCFGPAIFEPPPPPPQLANS
Subjt:  ALVQKLTGMSRSDHDHEASTTKLATTKSAVDENNNNNNNNNKGSKAAVVNDDNESSSVVTTDENCCGGGGSGLMEVGQVNSCFGPAIFEPPPPPPQLANS

Query:  YLTNIPIYTPNSTEFLCTNQPIFNYDDSLLFGGNRSVNDFCEYSEF
        YLTNIPIYTPNSTEFLCTNQPIFNYDDSLLFGGNRSVNDFCEYSEF
Subjt:  YLTNIPIYTPNSTEFLCTNQPIFNYDDSLLFGGNRSVNDFCEYSEF

A0A1S3BCI6 VQ motif-containing protein 20-like8.68e-10776.47Show/hide
Query:  MSPQQQQQQIHGKMDNQITNNGNNKPILCPPPLKINKDSHLIRKTSSSSNTSSPSSSSSSTTSLANGVAAAAAKPPPQRHPVIIYTHSPKIIHTHPRDFM
        MSPQ QQQQIHGK DNQI NN N KP L PP LKINKDSHLIRK+SSSSNTSSPSSSSSSTTSL NGV  A AKPP QRHPVIIYTHSPKIIHTHPRDFM
Subjt:  MSPQQQQQQIHGKMDNQITNNGNNKPILCPPPLKINKDSHLIRKTSSSSNTSSPSSSSSSTTSLANGVAAAAAKPPPQRHPVIIYTHSPKIIHTHPRDFM

Query:  ALVQKLTGMSRSDHDHEASTTKLATTKSAVDENNNNNNNNNKGSKAAVVNDDNESSSVVTTDENCCGGGGS-GLMEVGQVNSCFGPAIFEPPPPPP----
        ALVQKLTGMSRSD   EAST   AT KS VDENN       K SK  VVNDDNESSSV+TTDENCC G GS G +E GQVNSCFGPA FEPPPPPP    
Subjt:  ALVQKLTGMSRSDHDHEASTTKLATTKSAVDENNNNNNNNNKGSKAAVVNDDNESSSVVTTDENCCGGGGS-GLMEVGQVNSCFGPAIFEPPPPPP----

Query:  QLANSYLTNIPIYTPNSTEFLCTNQPIFNYDDSLLFGGNR----SVNDFCEYSEF
        QLANSYLTN+P+Y PNSTEFLC NQPIFNYDDSLLFGG+     S N   ++SEF
Subjt:  QLANSYLTNIPIYTPNSTEFLCTNQPIFNYDDSLLFGGNR----SVNDFCEYSEF

A0A1S3BXM8 VQ motif-containing protein 205.77e-13085.88Show/hide
Query:  MSPQQQQQQIHGKMDNQITNNGNNKPILCPPPLKINKDSHLIRKTSSSSN--TSSPSSSSSSTTSLANGVAAAAAKP--PPQRHPVIIYTHSPKIIHTHP
        MSPQQQ   IHGK DNQI NNGN KPILCPPPLKINKDSHLIRKTSSSSN  TSSPSSSSSSTTSL NGVAAAAA    P QRHPVIIYTHSPK+IHTHP
Subjt:  MSPQQQQQQIHGKMDNQITNNGNNKPILCPPPLKINKDSHLIRKTSSSSN--TSSPSSSSSSTTSLANGVAAAAAKP--PPQRHPVIIYTHSPKIIHTHP

Query:  RDFMALVQKLTGMSRSDHDHEASTTKLATTKSAVDENNNNNNNNNKGSKAAVVNDDNESSSVVTTDENCCGGGG-SGLMEVGQVNSCFGPAIFEPPPPPP
        RDFMALVQKLTGMSRSD DHEAST KLATTK  VDENNNNN    KGSKA VV+DDNESSSVVTTDENCCGGGG SG+MEVGQVNSCFGP IFEPPPPPP
Subjt:  RDFMALVQKLTGMSRSDHDHEASTTKLATTKSAVDENNNNNNNNNKGSKAAVVNDDNESSSVVTTDENCCGGGG-SGLMEVGQVNSCFGPAIFEPPPPPP

Query:  -----QLANSYLTNIPIYTPNSTEFLCTNQPIFNYDDSLLFGGNRSVNDFCEYSE
             QLANSYLTNIPIYTPNS EFLC NQPIFNYDDSLLFGGNR VNDFCEYSE
Subjt:  -----QLANSYLTNIPIYTPNSTEFLCTNQPIFNYDDSLLFGGNRSVNDFCEYSE

A0A5A7V0G0 VQ motif-containing protein 20-like6.12e-10776.47Show/hide
Query:  MSPQQQQQQIHGKMDNQITNNGNNKPILCPPPLKINKDSHLIRKTSSSSNTSSPSSSSSSTTSLANGVAAAAAKPPPQRHPVIIYTHSPKIIHTHPRDFM
        MSPQ QQQQIHGK DNQI NN N KP L PP LKINKDSHLIRK+SSSSNTSSPSSSSSSTTSL NGV  A AKPP QRHPVIIYTHSPKIIHTHPRDFM
Subjt:  MSPQQQQQQIHGKMDNQITNNGNNKPILCPPPLKINKDSHLIRKTSSSSNTSSPSSSSSSTTSLANGVAAAAAKPPPQRHPVIIYTHSPKIIHTHPRDFM

Query:  ALVQKLTGMSRSDHDHEASTTKLATTKSAVDENNNNNNNNNKGSKAAVVNDDNESSSVVTTDENCCGGGGS-GLMEVGQVNSCFGPAIFEPPPPPP----
        ALVQKLTGMSRSD   EAST   AT KS VDENN       K SK  VVNDDNESSSV+TTDENCC G GS G +E GQVNSCFGPA FEPPPPPP    
Subjt:  ALVQKLTGMSRSDHDHEASTTKLATTKSAVDENNNNNNNNNKGSKAAVVNDDNESSSVVTTDENCCGGGGS-GLMEVGQVNSCFGPAIFEPPPPPP----

Query:  QLANSYLTNIPIYTPNSTEFLCTNQPIFNYDDSLLFGGNR----SVNDFCEYSEF
        QLANSYLTN+P+Y PNSTEFLC NQPIFNYDDSLLFGG+     S N   ++SEF
Subjt:  QLANSYLTNIPIYTPNSTEFLCTNQPIFNYDDSLLFGGNR----SVNDFCEYSEF

SwissProt top hitse value%identityAlignment
Q9CA36 VQ motif-containing protein 8, chloroplastic3.3e-0535.85Show/hide
Query:  PPPLKINKDSHLIRKTSSSSNTSSPSSSSSSTTSLANGVAAAAAKPPPQRHPVIIYTHSPKIIHTHPRDFMALVQKLTGMSR--SDHDHEASTTKLATTK
        P  LK+  +SH I+KTSS  +   P   +S                     PVIIY HSPK+IHT   DFMALVQ+LTG+      +  E+S++ +    
Subjt:  PPPLKINKDSHLIRKTSSSSNTSSPSSSSSSTTSLANGVAAAAAKPPPQRHPVIIYTHSPKIIHTHPRDFMALVQKLTGMSR--SDHDHEASTTKLATTK

Query:  SAVDEN
        +  D+N
Subjt:  SAVDEN

Q9LS54 VQ motif-containing protein 202.0e-2638.97Show/hide
Query:  HGKMDNQITNNG---NNKPILCPPPLKINKDSHLIRKTSSSSNTSSPSSSSSSTTSLANGVAAAAAKPPPQRHPVIIYTHSPKIIHTHPRDFMALVQKLT
        H K +    N+G      P   PP LK+NKDSH+I+K        SPSS SS            AAKP   RHPVIIYTH+P+IIHT+P+DFMALVQKLT
Subjt:  HGKMDNQITNNG---NNKPILCPPPLKINKDSHLIRKTSSSSNTSSPSSSSSSTTSLANGVAAAAAKPPPQRHPVIIYTHSPKIIHTHPRDFMALVQKLT

Query:  GMSRSDHD----------HEASTTKLATTKSAVDENNNNN-----NN---------NNKG--SKAAVVNDDNESSSVVTTDENCCGGGGSGLMEVGQVNS
        GM+ SD D              +     ++  VD  NN N     NN         N KG      ++++D+ESSSV+TT+EN        + E GQVNS
Subjt:  GMSRSDHD----------HEASTTKLATTKSAVDENNNNN-----NN---------NNKG--SKAAVVNDDNESSSVVTTDENCCGGGGSGLMEVGQVNS

Query:  CF----------GPAIFEPPPPPPQLAN-------SYLTNIPIYTP--NSTEFLCTNQPIFNYDDSLLFGGN
                     P    PPPPPP + +       +YL   PI+ P   +  FLC NQP  N+DD L F  N
Subjt:  CF----------GPAIFEPPPPPPQLAN-------SYLTNIPIYTP--NSTEFLCTNQPIFNYDDSLLFGGN

Arabidopsis top hitse value%identityAlignment
AT1G68450.1 VQ motif-containing protein2.3e-0635.85Show/hide
Query:  PPPLKINKDSHLIRKTSSSSNTSSPSSSSSSTTSLANGVAAAAAKPPPQRHPVIIYTHSPKIIHTHPRDFMALVQKLTGMSR--SDHDHEASTTKLATTK
        P  LK+  +SH I+KTSS  +   P   +S                     PVIIY HSPK+IHT   DFMALVQ+LTG+      +  E+S++ +    
Subjt:  PPPLKINKDSHLIRKTSSSSNTSSPSSSSSSTTSLANGVAAAAAKPPPQRHPVIIYTHSPKIIHTHPRDFMALVQKLTGMSR--SDHDHEASTTKLATTK

Query:  SAVDEN
        +  D+N
Subjt:  SAVDEN

AT3G18360.1 VQ motif-containing protein1.4e-2738.97Show/hide
Query:  HGKMDNQITNNG---NNKPILCPPPLKINKDSHLIRKTSSSSNTSSPSSSSSSTTSLANGVAAAAAKPPPQRHPVIIYTHSPKIIHTHPRDFMALVQKLT
        H K +    N+G      P   PP LK+NKDSH+I+K        SPSS SS            AAKP   RHPVIIYTH+P+IIHT+P+DFMALVQKLT
Subjt:  HGKMDNQITNNG---NNKPILCPPPLKINKDSHLIRKTSSSSNTSSPSSSSSSTTSLANGVAAAAAKPPPQRHPVIIYTHSPKIIHTHPRDFMALVQKLT

Query:  GMSRSDHD----------HEASTTKLATTKSAVDENNNNN-----NN---------NNKG--SKAAVVNDDNESSSVVTTDENCCGGGGSGLMEVGQVNS
        GM+ SD D              +     ++  VD  NN N     NN         N KG      ++++D+ESSSV+TT+EN        + E GQVNS
Subjt:  GMSRSDHD----------HEASTTKLATTKSAVDENNNNN-----NN---------NNKG--SKAAVVNDDNESSSVVTTDENCCGGGGSGLMEVGQVNS

Query:  CF----------GPAIFEPPPPPPQLAN-------SYLTNIPIYTP--NSTEFLCTNQPIFNYDDSLLFGGN
                     P    PPPPPP + +       +YL   PI+ P   +  FLC NQP  N+DD L F  N
Subjt:  CF----------GPAIFEPPPPPPQLAN-------SYLTNIPIYTP--NSTEFLCTNQPIFNYDDSLLFGGN


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGTCCTCAACAACAGCAACAACAAATCCATGGAAAAATGGATAATCAGATCACCAACAATGGGAATAATAAACCAATATTGTGTCCACCGCCATTGAAGATCAATAA
GGATTCTCATTTGATTCGAAAAACTTCCTCTTCTTCAAATACTTCTTCCCCTTCTTCCTCTTCATCTTCCACCACCTCTCTTGCCAATGGGGTCGCTGCTGCTGCTGCGA
AGCCACCACCTCAACGACATCCGGTGATTATATACACACATTCTCCCAAAATCATTCACACGCATCCTCGAGATTTCATGGCGTTGGTTCAAAAGCTAACCGGAATGTCT
CGATCAGATCATGATCATGAAGCTTCTACCACGAAGCTGGCTACAACGAAATCCGCCGTGGATGAGAATAATAATAATAATAATAATAATAATAAAGGTTCTAAAGCTGC
TGTTGTGAATGATGATAATGAATCGTCGTCCGTTGTGACGACGGATGAAAATTGTTGCGGCGGCGGTGGAAGCGGTTTGATGGAAGTTGGTCAGGTTAATTCGTGTTTTG
GACCGGCGATTTTTGAGCCGCCGCCACCTCCGCCACAACTGGCGAATTCTTACCTGACGAATATACCCATTTATACGCCGAATTCGACGGAGTTCTTGTGCACGAATCAA
CCAATTTTTAACTATGATGATTCTTTGCTTTTTGGTGGCAACAGATCAGTTAATGATTTCTGCGAATACAGTGAATTTTAG
mRNA sequenceShow/hide mRNA sequence
ATGAGTCCTCAACAACAGCAACAACAAATCCATGGAAAAATGGATAATCAGATCACCAACAATGGGAATAATAAACCAATATTGTGTCCACCGCCATTGAAGATCAATAA
GGATTCTCATTTGATTCGAAAAACTTCCTCTTCTTCAAATACTTCTTCCCCTTCTTCCTCTTCATCTTCCACCACCTCTCTTGCCAATGGGGTCGCTGCTGCTGCTGCGA
AGCCACCACCTCAACGACATCCGGTGATTATATACACACATTCTCCCAAAATCATTCACACGCATCCTCGAGATTTCATGGCGTTGGTTCAAAAGCTAACCGGAATGTCT
CGATCAGATCATGATCATGAAGCTTCTACCACGAAGCTGGCTACAACGAAATCCGCCGTGGATGAGAATAATAATAATAATAATAATAATAATAAAGGTTCTAAAGCTGC
TGTTGTGAATGATGATAATGAATCGTCGTCCGTTGTGACGACGGATGAAAATTGTTGCGGCGGCGGTGGAAGCGGTTTGATGGAAGTTGGTCAGGTTAATTCGTGTTTTG
GACCGGCGATTTTTGAGCCGCCGCCACCTCCGCCACAACTGGCGAATTCTTACCTGACGAATATACCCATTTATACGCCGAATTCGACGGAGTTCTTGTGCACGAATCAA
CCAATTTTTAACTATGATGATTCTTTGCTTTTTGGTGGCAACAGATCAGTTAATGATTTCTGCGAATACAGTGAATTTTAG
Protein sequenceShow/hide protein sequence
MSPQQQQQQIHGKMDNQITNNGNNKPILCPPPLKINKDSHLIRKTSSSSNTSSPSSSSSSTTSLANGVAAAAAKPPPQRHPVIIYTHSPKIIHTHPRDFMALVQKLTGMS
RSDHDHEASTTKLATTKSAVDENNNNNNNNNKGSKAAVVNDDNESSSVVTTDENCCGGGGSGLMEVGQVNSCFGPAIFEPPPPPPQLANSYLTNIPIYTPNSTEFLCTNQ
PIFNYDDSLLFGGNRSVNDFCEYSEF