CuGenDBv2

CSPI01G03840 (gene) of Cucumber (PI 183967) v1 genome

Gene ID: CSPI01G03840
Organism: Cucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
Description: VQ motif-containing protein 20
Genome location: Chr1:2396902..2397664
RNA-Seq Expression: CSPI01G03840
Synteny: CSPI01G03840
Gene Ontology terms:
GO:0006355 - regulation of transcription, DNA-templated (biological process)
GO:0006952 - defense response (biological process)
GO:0005634 - nucleus (cellular component)
GO:0003700 - DNA-binding transcription factor activity (molecular function)
GO:0043565 - sequence-specific DNA binding (molecular function)
InterPro domains:
IPR008889 - VQ
IPR039607 - VQ motif-containing protein 8/17/18/20/21/25
IPR039832 - VQ motif-containing protein 20
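
The genome location above is given as a chromosome name followed by a start..end range. A minimal sketch of how such a string could be parsed is shown below. The names `Region` and `parse_location` are hypothetical, and the length calculation assumes the coordinates are 1-based and inclusive, which this record does not state explicitly.

```python
import re
from typing import NamedTuple


class Region(NamedTuple):
    """A genomic interval (hypothetical helper type for this sketch)."""
    chrom: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumption: coordinates are 1-based and inclusive.
        return self.end - self.start + 1


def parse_location(text: str) -> Region:
    """Parse a 'Chrom:start..end' string into a Region."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognized location string: {text!r}")
    return Region(match.group(1), int(match.group(2)), int(match.group(3)))


region = parse_location("Chr1:2396902..2397664")
print(region.chrom, region.start, region.end, region.length)
# prints: Chr1 2396902 2397664 763
```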


Homology
GenBank top hits (e value, % identity, alignment)
KGN62345.1 hypothetical protein Csa_018596 [Cucumis sativus] | e value: 1.2e-84 | % identity: 76.49
Query:  MSPQQQQQQIHGKMDNQITNNGNNKPILCPPPLKINKDSHLIRKTSSSSNTSSPSSSSSSTTSLANGVAAAAAKPPPQRHPVIIYTHSPKIIHTHPRDFM
        MSP QQQQQIHGK DNQ TNN  NKP+  PP LKINKDSHLIRK+SSSSNTSSPSSSSSSTTSL NGV   AA  PPQRHPVIIYTHSPKIIHTHPRDFM
Subjt:  MSPQQQQQQIHGKMDNQITNNGNNKPILCPPPLKINKDSHLIRKTSSSSNTSSPSSSSSSTTSLANGVAAAAAKPPPQRHPVIIYTHSPKIIHTHPRDFM

Query:  ALVQKLTGMSRSDHDHEASTTKLATTKSAVDENNNNNNNNNGSKAAVVNDDNESSSVVTTDENCCGG-GGSGLMEVGQVNSCFGPAIFE--PPPPPPQLA
        ALVQKLTGMSRS  D EAST   AT KS VDENN          + VVNDDNESSSVVTTDENCCGG G SG +E GQVNSCFGPAIFE  PPPPPPQLA
Subjt:  ALVQKLTGMSRSDHDHEASTTKLATTKSAVDENNNNNNNNNGSKAAVVNDDNESSSVVTTDENCCGG-GGSGLMEVGQVNSCFGPAIFE--PPPPPPQLA

Query:  NSYLTNIPIYTPNSTEFLCTNQPIFNYDDSLLFGGN---RSVNDFCEYSEF
        +SYL+NIP+Y PNSTEFLC NQPIFNYDDSLLFG N    S N   ++SEF
Subjt:  NSYLTNIPIYTPNSTEFLCTNQPIFNYDDSLLFGGN---RSVNDFCEYSEF

KGN63830.2 hypothetical protein Csa_013231 [Cucumis sativus] | e value: 4.0e-109 | % identity: 99.53
Query:  MSPQQQQQQIHGKMDNQITNNGNNKPILCPPPLKINKDSHLIRKTSSSSNTSSPSSSSSSTTSLANGVAAAAAKPPPQRHPVIIYTHSPKIIHTHPRDFM
        MSPQQQQQQIHGKMDNQITNNGNNKPILCPPPLKINKDSHLIRKTSSSSNTSSPSSSSSSTTSLANGVAAAAAKPPPQRHPVIIYTHSPKIIHTHPRDFM
Subjt:  MSPQQQQQQIHGKMDNQITNNGNNKPILCPPPLKINKDSHLIRKTSSSSNTSSPSSSSSSTTSLANGVAAAAAKPPPQRHPVIIYTHSPKIIHTHPRDFM

Query:  ALVQKLTGMSRSDHDHEASTTKLATTKSAVDENNNNNNNNN-GSKAAVVNDDNESSSVVTTDENCCGGGGSGLMEVGQVNSCFGPAIFEPPPPPPQLANS
        ALVQKLTGMSRSDHDHEASTTKLATTKSAVDENNNNNNNNN GSKAAVVNDDNESSSVVTTDENCCGGGGSGLMEVGQVNSCFGPAIFEPPPPPPQLANS
Subjt:  ALVQKLTGMSRSDHDHEASTTKLATTKSAVDENNNNNNNNN-GSKAAVVNDDNESSSVVTTDENCCGGGGSGLMEVGQVNSCFGPAIFEPPPPPPQLANS

Query:  YLTNIPIYTPNSTEF
        YLTNIPIYTPNSTEF
Subjt:  YLTNIPIYTPNSTEF

XP_004138347.3 VQ motif-containing protein 20 [Cucumis sativus] | e value: 1.6e-129 | % identity: 99.59
Query:  MSPQQQQQQIHGKMDNQITNNGNNKPILCPPPLKINKDSHLIRKTSSSSNTSSPSSSSSSTTSLANGVAAAAAKPPPQRHPVIIYTHSPKIIHTHPRDFM
        MSPQQQQQQIHGKMDNQITNNGNNKPILCPPPLKINKDSHLIRKTSSSSNTSSPSSSSSSTTSLANGVAAAAAKPPPQRHPVIIYTHSPKIIHTHPRDFM
Subjt:  MSPQQQQQQIHGKMDNQITNNGNNKPILCPPPLKINKDSHLIRKTSSSSNTSSPSSSSSSTTSLANGVAAAAAKPPPQRHPVIIYTHSPKIIHTHPRDFM

Query:  ALVQKLTGMSRSDHDHEASTTKLATTKSAVDENNNNNNNNN-GSKAAVVNDDNESSSVVTTDENCCGGGGSGLMEVGQVNSCFGPAIFEPPPPPPQLANS
        ALVQKLTGMSRSDHDHEASTTKLATTKSAVDENNNNNNNNN GSKAAVVNDDNESSSVVTTDENCCGGGGSGLMEVGQVNSCFGPAIFEPPPPPPQLANS
Subjt:  ALVQKLTGMSRSDHDHEASTTKLATTKSAVDENNNNNNNNN-GSKAAVVNDDNESSSVVTTDENCCGGGGSGLMEVGQVNSCFGPAIFEPPPPPPQLANS

Query:  YLTNIPIYTPNSTEFLCTNQPIFNYDDSLLFGGNRSVNDFCEYSEF
        YLTNIPIYTPNSTEFLCTNQPIFNYDDSLLFGGNRSVNDFCEYSEF
Subjt:  YLTNIPIYTPNSTEFLCTNQPIFNYDDSLLFGGNRSVNDFCEYSEF

XP_008453685.1 PREDICTED: VQ motif-containing protein 20 [Cucumis melo] | e value: 2.6e-100 | % identity: 85.83
Query:  MSPQQQQQQIHGKMDNQITNNGNNKPILCPPPLKINKDSHLIRKTSSSSN--TSSPSSSSSSTTSLANGV--AAAAAKPPPQRHPVIIYTHSPKIIHTHP
        MSP   QQQIHGK DNQI NNG NKPILCPPPLKINKDSHLIRKTSSSSN  TSSPSSSSSSTTSL NGV  AAAAA  P QRHPVIIYTHSPK+IHTHP
Subjt:  MSPQQQQQQIHGKMDNQITNNGNNKPILCPPPLKINKDSHLIRKTSSSSN--TSSPSSSSSSTTSLANGV--AAAAAKPPPQRHPVIIYTHSPKIIHTHP

Query:  RDFMALVQKLTGMSRSDHDHEASTTKLATTKSAVDENNNNNNNNNGSKAAVVNDDNESSSVVTTDENCC-GGGGSGLMEVGQVNSCFGPAIFEPP-----
        RDFMALVQKLTGMSRSD DHEAS TKLATTK  VDE   NNNNN GSK AVV+DDNESSSVVTTDENCC GGGGSG+MEVGQVNSCFGP IFEPP     
Subjt:  RDFMALVQKLTGMSRSDHDHEASTTKLATTKSAVDENNNNNNNNNGSKAAVVNDDNESSSVVTTDENCC-GGGGSGLMEVGQVNSCFGPAIFEPP-----

Query:  PPPPQLANSYLTNIPIYTPNSTEFLCTNQPIFNYDDSLLFGGNRSVNDFCEYSE
        PPPPQLANSYLTNIPIYTPNS EFLC NQPIFNYDDSLLFGGNR VNDFCEYSE
Subjt:  PPPPQLANSYLTNIPIYTPNSTEFLCTNQPIFNYDDSLLFGGNRSVNDFCEYSE

XP_038876677.1 VQ motif-containing protein 20 [Benincasa hispida] | e value: 4.7e-86 | % identity: 77.31
Query:  MSPQQQQQQIHGKMD--NQITNNGN---NKPILCPPPLKINKDSHLIRKTSSSSNTSSPSSSSSSTTSLANGVAAAAAKPPPQRHPVIIYTHSPKIIHTH
        MSP Q QQQ+HGK D  NQ  NNGN   NKP LCPPPLKINKDSHLIRK+SSSSNTSSPSSSSSSTTSL NGV       PPQRHPVIIYTHSPKIIHTH
Subjt:  MSPQQQQQQIHGKMD--NQITNNGN---NKPILCPPPLKINKDSHLIRKTSSSSNTSSPSSSSSSTTSLANGVAAAAAKPPPQRHPVIIYTHSPKIIHTH

Query:  PRDFMALVQKLTGMSRSDHDHEASTTKLATTKSAVDENNNNNNNNNGSKAAVVNDDNESSSVVTTDENCCGGGGSGLMEVGQVNSCFGPAIFE--PPPPP
        PRDFMALVQKLTGMSRS  D EAST   ATTKS VDE      NN GSK AVVNDDNESSSVVTTDENCC GGGSG++E GQVNSCFG AIFE  PPPPP
Subjt:  PRDFMALVQKLTGMSRSDHDHEASTTKLATTKSAVDENNNNNNNNNGSKAAVVNDDNESSSVVTTDENCCGGGGSGLMEVGQVNSCFGPAIFE--PPPPP

Query:  PQLANSYLTNIPIYTPNSTEFLCTNQP-IFNYDDSLLFG-------GNRSVNDFCEYSEF
        PQLANSYLTNIP+Y PNSTEFLCTNQP  FNYDDSLLFG        +  +NDFCEYSEF
Subjt:  PQLANSYLTNIPIYTPNSTEFLCTNQP-IFNYDDSLLFG-------GNRSVNDFCEYSEF

TrEMBL top hits (e value, % identity, alignment)
A0A0A0LKA6 VQ domain-containing protein | e value: 5.7e-85 | % identity: 76.49
Query:  MSPQQQQQQIHGKMDNQITNNGNNKPILCPPPLKINKDSHLIRKTSSSSNTSSPSSSSSSTTSLANGVAAAAAKPPPQRHPVIIYTHSPKIIHTHPRDFM
        MSP QQQQQIHGK DNQ TNN  NKP+  PP LKINKDSHLIRK+SSSSNTSSPSSSSSSTTSL NGV   AA  PPQRHPVIIYTHSPKIIHTHPRDFM
Subjt:  MSPQQQQQQIHGKMDNQITNNGNNKPILCPPPLKINKDSHLIRKTSSSSNTSSPSSSSSSTTSLANGVAAAAAKPPPQRHPVIIYTHSPKIIHTHPRDFM

Query:  ALVQKLTGMSRSDHDHEASTTKLATTKSAVDENNNNNNNNNGSKAAVVNDDNESSSVVTTDENCCGG-GGSGLMEVGQVNSCFGPAIFE--PPPPPPQLA
        ALVQKLTGMSRS  D EAST   AT KS VDENN          + VVNDDNESSSVVTTDENCCGG G SG +E GQVNSCFGPAIFE  PPPPPPQLA
Subjt:  ALVQKLTGMSRSDHDHEASTTKLATTKSAVDENNNNNNNNNGSKAAVVNDDNESSSVVTTDENCCGG-GGSGLMEVGQVNSCFGPAIFE--PPPPPPQLA

Query:  NSYLTNIPIYTPNSTEFLCTNQPIFNYDDSLLFGGN---RSVNDFCEYSEF
        +SYL+NIP+Y PNSTEFLC NQPIFNYDDSLLFG N    S N   ++SEF
Subjt:  NSYLTNIPIYTPNSTEFLCTNQPIFNYDDSLLFGGN---RSVNDFCEYSEF

A0A0A0LPK7 VQ domain-containing protein | e value: 3.1e-107 | % identity: 85.71
Query:  MSPQQQQQQIHGKMDNQITNNGNNKPILCPPPLKINKDSHLIRKTSSSSNTSSPSSSSSSTTSLANGVAAAAAKPPPQRHPVIIYTHSPKIIHTHPRDFM
        MSPQQQQQQIHGKMDNQITNNGNNKPILCPPPLKINKDSHLIRKTSSSSNTSSPSSSSSSTTSLANGVAAAAAKPPPQRHPVIIYTHSPKIIHTHPRDFM
Subjt:  MSPQQQQQQIHGKMDNQITNNGNNKPILCPPPLKINKDSHLIRKTSSSSNTSSPSSSSSSTTSLANGVAAAAAKPPPQRHPVIIYTHSPKIIHTHPRDFM

Query:  ALVQKLTGMSRSDHDHEASTTKLATTKSAVDENNNNNNNNNGSKAAVVNDDNESSSVVTTDENCCGGGGSGLMEVGQVNSCFGPAIFEPPPPPPQLANSY
        ALVQKLTG+                                  KAAVVNDDNESSSVVTTDENCCGGGGSGLMEVGQVNSCFGPAIFEPPPPPPQLANSY
Subjt:  ALVQKLTGMSRSDHDHEASTTKLATTKSAVDENNNNNNNNNGSKAAVVNDDNESSSVVTTDENCCGGGGSGLMEVGQVNSCFGPAIFEPPPPPPQLANSY

Query:  LTNIPIYTPNSTEFLCTNQPIFNYDDSLLFGGNRSVNDFCEYSEF
        LTNIPIYTPNSTEFLCTNQPIFNYDDSLLFGGNRSVNDFCEYSEF
Subjt:  LTNIPIYTPNSTEFLCTNQPIFNYDDSLLFGGNRSVNDFCEYSEF

A0A1S3BCI6 VQ motif-containing protein 20-like | e value: 2.0e-82 | % identity: 75.59
Query:  MSPQQQQQQIHGKMDNQITNNGNNKPILCPPPLKINKDSHLIRKTSSSSNTSSPSSSSSSTTSLANGVAAAAAKPPPQRHPVIIYTHSPKIIHTHPRDFM
        MSPQ QQQQIHGK DNQI NN  NKP L PP LKINKDSHLIRK+SSSSNTSSPSSSSSSTTSL NGV  A AK PPQRHPVIIYTHSPKIIHTHPRDFM
Subjt:  MSPQQQQQQIHGKMDNQITNNGNNKPILCPPPLKINKDSHLIRKTSSSSNTSSPSSSSSSTTSLANGVAAAAAKPPPQRHPVIIYTHSPKIIHTHPRDFM

Query:  ALVQKLTGMSRSDHDHEASTTKLATTKSAVDENNNNNNNNNGSKAAVVNDDNESSSVVTTDENCC-GGGGSGLMEVGQVNSCFGPAIFE----PPPPPPQ
        ALVQKLTGMSRS  D EAST   AT KS VDENN  +         VVNDDNESSSV+TTDENCC G G SG +E GQVNSCFGPA FE    PPPPPPQ
Subjt:  ALVQKLTGMSRSDHDHEASTTKLATTKSAVDENNNNNNNNNGSKAAVVNDDNESSSVVTTDENCC-GGGGSGLMEVGQVNSCFGPAIFE----PPPPPPQ

Query:  LANSYLTNIPIYTPNSTEFLCTNQPIFNYDDSLLFGGNR----SVNDFCEYSEF
        LANSYLTN+P+Y PNSTEFLC NQPIFNYDDSLLFGG+     S N   ++SEF
Subjt:  LANSYLTNIPIYTPNSTEFLCTNQPIFNYDDSLLFGGNR----SVNDFCEYSEF

A0A1S3BXM8 VQ motif-containing protein 20 | e value: 1.3e-100 | % identity: 85.83
Query:  MSPQQQQQQIHGKMDNQITNNGNNKPILCPPPLKINKDSHLIRKTSSSSN--TSSPSSSSSSTTSLANGV--AAAAAKPPPQRHPVIIYTHSPKIIHTHP
        MSP   QQQIHGK DNQI NNG NKPILCPPPLKINKDSHLIRKTSSSSN  TSSPSSSSSSTTSL NGV  AAAAA  P QRHPVIIYTHSPK+IHTHP
Subjt:  MSPQQQQQQIHGKMDNQITNNGNNKPILCPPPLKINKDSHLIRKTSSSSN--TSSPSSSSSSTTSLANGV--AAAAAKPPPQRHPVIIYTHSPKIIHTHP

Query:  RDFMALVQKLTGMSRSDHDHEASTTKLATTKSAVDENNNNNNNNNGSKAAVVNDDNESSSVVTTDENCC-GGGGSGLMEVGQVNSCFGPAIFEPP-----
        RDFMALVQKLTGMSRSD DHEAS TKLATTK  VDE   NNNNN GSK AVV+DDNESSSVVTTDENCC GGGGSG+MEVGQVNSCFGP IFEPP     
Subjt:  RDFMALVQKLTGMSRSDHDHEASTTKLATTKSAVDENNNNNNNNNGSKAAVVNDDNESSSVVTTDENCC-GGGGSGLMEVGQVNSCFGPAIFEPP-----

Query:  PPPPQLANSYLTNIPIYTPNSTEFLCTNQPIFNYDDSLLFGGNRSVNDFCEYSE
        PPPPQLANSYLTNIPIYTPNS EFLC NQPIFNYDDSLLFGGNR VNDFCEYSE
Subjt:  PPPPQLANSYLTNIPIYTPNSTEFLCTNQPIFNYDDSLLFGGNRSVNDFCEYSE

A0A5A7V0G0 VQ motif-containing protein 20-like | e value: 2.4e-83 | % identity: 75.59
Query:  MSPQQQQQQIHGKMDNQITNNGNNKPILCPPPLKINKDSHLIRKTSSSSNTSSPSSSSSSTTSLANGVAAAAAKPPPQRHPVIIYTHSPKIIHTHPRDFM
        MSPQ QQQQIHGK DNQI NN  NKP L PP LKINKDSHLIRK+SSSSNTSSPSSSSSSTTSL NGV  A AK PPQRHPVIIYTHSPKIIHTHPRDFM
Subjt:  MSPQQQQQQIHGKMDNQITNNGNNKPILCPPPLKINKDSHLIRKTSSSSNTSSPSSSSSSTTSLANGVAAAAAKPPPQRHPVIIYTHSPKIIHTHPRDFM

Query:  ALVQKLTGMSRSDHDHEASTTKLATTKSAVDENNNNNNNNNGSKAAVVNDDNESSSVVTTDENCC-GGGGSGLMEVGQVNSCFGPAIFE----PPPPPPQ
        ALVQKLTGMSRS  D EAST   AT KS VDENN          + VVNDDNESSSV+TTDENCC G G SG +E GQVNSCFGPA FE    PPPPPPQ
Subjt:  ALVQKLTGMSRSDHDHEASTTKLATTKSAVDENNNNNNNNNGSKAAVVNDDNESSSVVTTDENCC-GGGGSGLMEVGQVNSCFGPAIFE----PPPPPPQ

Query:  LANSYLTNIPIYTPNSTEFLCTNQPIFNYDDSLLFGGNR----SVNDFCEYSEF
        LANSYLTN+P+Y PNSTEFLC NQPIFNYDDSLLFGG+     S N   ++SEF
Subjt:  LANSYLTNIPIYTPNSTEFLCTNQPIFNYDDSLLFGGNR----SVNDFCEYSEF

SwissProt top hits (e value, % identity, alignment)
Q9LS54 VQ motif-containing protein 20 | e value: 2.6e-26 | % identity: 37.5
Query:  HGKMDNQITNNG---NNKPILCPPPLKINKDSHLIRKTSSSSNTSSPSSSSSSTTSLANGVAAAAAKPPPQRHPVIIYTHSPKIIHTHPRDFMALVQKLT
        H K +    N+G      P   PP LK+NKDSH+I+K        SPSS SS            AAKP   RHPVIIYTH+P+IIHT+P+DFMALVQKLT
Subjt:  HGKMDNQITNNG---NNKPILCPPPLKINKDSHLIRKTSSSSNTSSPSSSSSSTTSLANGVAAAAAKPPPQRHPVIIYTHSPKIIHTHPRDFMALVQKLT

Query:  GMSRSDHD----------HEASTTKLATTKSAVDENNNNNNNNNGS-----------------KAAVVNDDNESSSVVTTDENCCGGGGSGLMEVGQVNS
        GM+ SD D              +     ++  VD  NN N    G+                    ++++D+ESSSV+TT+EN        + E GQVNS
Subjt:  GMSRSDHD----------HEASTTKLATTKSAVDENNNNNNNNNGS-----------------KAAVVNDDNESSSVVTTDENCCGGGGSGLMEVGQVNS

Query:  CF----------GPAIFEPPPPPPQLAN-------SYLTNIPIYTP--NSTEFLCTNQPIFNYDDSLLFGGN
                     P    PPPPPP + +       +YL   PI+ P   +  FLC NQP  N+DD L F  N
Subjt:  CF----------GPAIFEPPPPPPQLAN-------SYLTNIPIYTP--NSTEFLCTNQPIFNYDDSLLFGGN

Arabidopsis top hits (e value, % identity, alignment)
AT1G21320.1 nucleotide binding;nucleic acid binding | e value: 7.5e-05 | % identity: 35.71
Query:  MDNQI-TNNGNNKPILCPPPLKINKDSH-LIRKTSSSSNTSSPSSSSSSTTSLANGVAAAAAKPPPQRHPVIIYTHSPKIIHTHPRDFMALVQKLTGM--
        MDN+   + GN  P   P PLK+  DSH +I+K   +     P    +           + ++PPP   PV IYT +P+IIHTHP +FM LVQ+LTG   
Subjt:  MDNQI-TNNGNNKPILCPPPLKINKDSH-LIRKTSSSSNTSSPSSSSSSTTSLANGVAAAAAKPPPQRHPVIIYTHSPKIIHTHPRDFMALVQKLTGM--

Query:  -SRSDHDHEASTTKLATTKSAVDENN
         S +     +ST++   T + VD ++
Subjt:  -SRSDHDHEASTTKLATTKSAVDENN

AT1G21326.1 VQ motif-containing protein | e value: 2.6e-05 | % identity: 36.7
Query:  PPPLKINKDSH-LIRKTSSSSNTSSPSSSSSSTTSLANGVAAAAAKPPPQRHPVIIYTHSPKIIHTHPRDFMALVQKLTG---MSRSDHDHEASTTKLAT
        P PLK+  DSH +I+K   +     P    +           + ++PPP   PVIIYT SP+IIHTHP +FM LVQ+LTG    S +   + +ST+    
Subjt:  PPPLKINKDSH-LIRKTSSSSNTSSPSSSSSSTTSLANGVAAAAAKPPPQRHPVIIYTHSPKIIHTHPRDFMALVQKLTG---MSRSDHDHEASTTKLAT

Query:  TKSAVDENN
          + VD ++
Subjt:  TKSAVDENN

AT1G68450.1 VQ motif-containing protein | e value: 1.3e-04 | % identity: 35.09
Query:  PPPLKINKDSHLIRKTSSSSNTSSPSSSSSSTTSLANGVAAAAAKPPPQRHPVIIYTHSPKIIHTHPRDFMALVQKLTGMSRSDHDHEASTTKLATTKSA
        P  LK+  +SH I+KTSS  +   P   +S                     PVIIY HSPK+IHT   DFMALVQ+LTG+     D         ++ S 
Subjt:  PPPLKINKDSHLIRKTSSSSNTSSPSSSSSSTTSLANGVAAAAAKPPPQRHPVIIYTHSPKIIHTHPRDFMALVQKLTGMSRSDHDHEASTTKLATTKSA

Query:  VDENNNNNNNNNGS
        V E  N  ++N  +
Subjt:  VDENNNNNNNNNGS

AT3G18360.1 VQ motif-containing protein | e value: 1.8e-27 | % identity: 37.5
Query:  HGKMDNQITNNG---NNKPILCPPPLKINKDSHLIRKTSSSSNTSSPSSSSSSTTSLANGVAAAAAKPPPQRHPVIIYTHSPKIIHTHPRDFMALVQKLT
        H K +    N+G      P   PP LK+NKDSH+I+K        SPSS SS            AAKP   RHPVIIYTH+P+IIHT+P+DFMALVQKLT
Subjt:  HGKMDNQITNNG---NNKPILCPPPLKINKDSHLIRKTSSSSNTSSPSSSSSSTTSLANGVAAAAAKPPPQRHPVIIYTHSPKIIHTHPRDFMALVQKLT

Query:  GMSRSDHD----------HEASTTKLATTKSAVDENNNNNNNNNGS-----------------KAAVVNDDNESSSVVTTDENCCGGGGSGLMEVGQVNS
        GM+ SD D              +     ++  VD  NN N    G+                    ++++D+ESSSV+TT+EN        + E GQVNS
Subjt:  GMSRSDHD----------HEASTTKLATTKSAVDENNNNNNNNNGS-----------------KAAVVNDDNESSSVVTTDENCCGGGGSGLMEVGQVNS

Query:  CF----------GPAIFEPPPPPPQLAN-------SYLTNIPIYTP--NSTEFLCTNQPIFNYDDSLLFGGN
                     P    PPPPPP + +       +YL   PI+ P   +  FLC NQP  N+DD L F  N
Subjt:  CF----------GPAIFEPPPPPPQLAN-------SYLTNIPIYTP--NSTEFLCTNQPIFNYDDSLLFGGN


Sequences
CDS sequence
ATGAGTCCTCAACAACAGCAACAACAAATCCATGGAAAAATGGATAATCAGATCACCAACAATGGGAATAATAAACCAATATTGTGTCCACCGCCATTGAAGATCAATAA
GGATTCTCATTTGATTCGAAAAACTTCCTCTTCTTCAAATACTTCTTCCCCTTCTTCCTCTTCATCTTCCACCACCTCTCTTGCCAATGGGGTCGCTGCTGCTGCTGCGA
AGCCACCACCTCAACGACATCCGGTGATTATATACACACATTCTCCCAAAATCATTCACACGCATCCTCGAGATTTCATGGCGTTGGTTCAAAAGCTAACCGGAATGTCT
CGATCAGATCATGATCATGAAGCTTCTACCACGAAGCTGGCTACAACGAAATCCGCCGTGGATGAGAATAATAATAATAATAATAATAATAATGGTTCTAAAGCTGCTGT
TGTGAATGATGATAATGAATCGTCGTCCGTTGTGACGACGGATGAAAATTGTTGCGGCGGCGGTGGAAGCGGTTTGATGGAAGTTGGTCAGGTTAATTCGTGTTTTGGAC
CGGCGATTTTTGAGCCGCCGCCACCTCCGCCACAACTGGCGAATTCTTACCTGACGAATATACCCATTTATACGCCGAATTCGACGGAGTTCTTGTGCACGAATCAACCA
ATTTTTAACTATGATGATTCTTTGCTTTTTGGTGGCAACAGATCAGTTAATGATTTCTGCGAATACAGTGAATTTTAG
mRNA sequence
GTTTGAAGAAAACCCTAAATCAAGAATGAGTCCTCAACAACAGCAACAACAAATCCATGGAAAAATGGATAATCAGATCACCAACAATGGGAATAATAAACCAATATTGT
GTCCACCGCCATTGAAGATCAATAAGGATTCTCATTTGATTCGAAAAACTTCCTCTTCTTCAAATACTTCTTCCCCTTCTTCCTCTTCATCTTCCACCACCTCTCTTGCC
AATGGGGTCGCTGCTGCTGCTGCGAAGCCACCACCTCAACGACATCCGGTGATTATATACACACATTCTCCCAAAATCATTCACACGCATCCTCGAGATTTCATGGCGTT
GGTTCAAAAGCTAACCGGAATGTCTCGATCAGATCATGATCATGAAGCTTCTACCACGAAGCTGGCTACAACGAAATCCGCCGTGGATGAGAATAATAATAATAATAATA
ATAATAATGGTTCTAAAGCTGCTGTTGTGAATGATGATAATGAATCGTCGTCCGTTGTGACGACGGATGAAAATTGTTGCGGCGGCGGTGGAAGCGGTTTGATGGAAGTT
GGTCAGGTTAATTCGTGTTTTGGACCGGCGATTTTTGAGCCGCCGCCACCTCCGCCACAACTGGCGAATTCTTACCTGACGAATATACCCATTTATACGCCGAATTCGAC
GGAGTTCTTGTGCACGAATCAACCAATTTTTAACTATGATGATTCTTTGCTTTTTGGTGGCAACAGATCAGTTAATGATTTCTGCGAATACAGTGAATTTTAG
Protein sequence
MSPQQQQQQIHGKMDNQITNNGNNKPILCPPPLKINKDSHLIRKTSSSSNTSSPSSSSSSTTSLANGVAAAAAKPPPQRHPVIIYTHSPKIIHTHPRDFMALVQKLTGMS
RSDHDHEASTTKLATTKSAVDENNNNNNNNNGSKAAVVNDDNESSSVVTTDENCCGGGGSGLMEVGQVNSCFGPAIFEPPPPPPQLANSYLTNIPIYTPNSTEFLCTNQP
IFNYDDSLLFGGNRSVNDFCEYSEF
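
The CDS and protein sequences above are related by translation. The sketch below shows one way to check that relationship, assuming the standard genetic code (NCBI translation table 1); the record itself does not say which table was used. The function name `translate` is hypothetical, and only the first 30 nucleotides of the CDS are used to keep the example short.

```python
from itertools import product

# Standard genetic code (assumed: NCBI translation table 1),
# with codons enumerated in T, C, A, G order at each position.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = cds.upper().replace("\n", "")
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)


# First 30 nucleotides of the CDS sequence listed above.
prefix = "ATGAGTCCTCAACAACAGCAACAACAAATC"
print(translate(prefix))
# prints: MSPQQQQQQI
```

The result matches the first ten residues of the protein sequence. Passing the full CDS, with line breaks removed, to the same function would be the natural way to check the whole record.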