CuGenDBv2

Sgr029004 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene ID: Sgr029004
Organism: Siraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Description: 18S pre-ribosomal assembly protein gar2-related, putative isoform 2
Genome location: tig00153210:2466612..2481400
RNA-Seq Expression: Sgr029004
Synteny: Sgr029004
Gene Ontology terms: GO:0009786 - regulation of asymmetric cell division (biological process)
InterPro domains: IPR040378 - Protein BREAKING OF ASYMMETRY IN THE STOMATAL LINEAGE
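
The genome location field follows a `contig:start..end` pattern. As a minimal sketch, the snippet below splits that string into its parts and computes the span length. The function names `parse_location` and `span_length` are hypothetical, and the length calculation assumes 1-based inclusive coordinates, which this page does not state.

```python
import re


def parse_location(location):
    """Split a 'contig:start..end' string into (contig, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start, end):
    """Length of the interval, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


contig, start, end = parse_location("tig00153210:2466612..2481400")
print(contig, start, end, span_length(start, end))
```

Under that assumption the gene locus spans 14,789 bp on contig tig00153210.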


Homology
GenBank top hits | e value | % identity
XP_008446468.1 PREDICTED: uncharacterized protein LOC103489197 isoform X1 [Cucumis melo] | 5.5e-170 | 68.62
Query:  KGEPVVCHSNVSPKFVPKSFECDNDALACGGMRLEDQKELTSPLKGNEQGADQLSHNKIAVDGWVVSKHDCLGLDDFNDYDEVKAFVSPLTNSSKVDLFE
        +GEP+VCHSN SPKFVPKSFECDND L  GGM+LEDQKE TS LKGN        HN  A DGWV  K +CL LDDFNDYD+VKAFVSPL NS KVDL E
Subjt:  KGEPVVCHSNVSPKFVPKSFECDNDALACGGMRLEDQKELTSPLKGNEQGADQLSHNKIAVDGWVVSKHDCLGLDDFNDYDEVKAFVSPLTNSSKVDLFE

Query:  EDSELYMEKSIVECQLPELIVCYKENICNIVKDICIDEGVPSRDKLLGGSSLDEKAECTIFPPEEDWKDELRREPEKKDNFASDDLEHTESFCNKNSPR-
        EDSELYMEKSIVECQLPELIVCYKENICNIVKDICID+G P RDKL  GSSLDE+  C+I PP +DWKDE   E +++D FASDD EH+ESF +K+SP  
Subjt:  EDSELYMEKSIVECQLPELIVCYKENICNIVKDICIDEGVPSRDKLLGGSSLDEKAECTIFPPEEDWKDELRREPEKKDNFASDDLEHTESFCNKNSPR-

Query:  -DFGDLARIPEAEYDVAYFTDNDMPNLPMKELVAESLKPLINNKNDVHPQSEQVSIETASLEVPDLVSVVEESFSNTKEEISASTTTAPAPEDPKNSPSV
         D  DLA  PEAEYDVAYFTDNDM   PM +LV ESLKPL +NK   HPQSEQV IET   EVP L  V +ESF NT+E  S S T+A   EDPKNS S 
Subjt:  -DFGDLARIPEAEYDVAYFTDNDMPNLPMKELVAESLKPLINNKNDVHPQSEQVSIETASLEVPDLVSVVEESFSNTKEEISASTTTAPAPEDPKNSPSV

Query:  NDISYNSKVDNGNITFDFNSLASTVSDGMECHDNGDLNSSAPTISASF---------------------HHTCSNPKCVEYEDLP--------KAEVGIS
        N +SYNSKVD GNITFDFNSLA T SDG+E  DNGDLNSSAP+ SAS                      H+T SNPK VEYEDLP        K EVG  
Subjt:  NDISYNSKVDNGNITFDFNSLASTVSDGMECHDNGDLNSSAPTISASF---------------------HHTCSNPKCVEYEDLP--------KAEVGIS

Query:  DSHLVSSQVQHGIGETSFSSMGPLGGSLMSNSGRIGYSGSISLRSDSSTTSTRSFAFPILQSEWNSSPVRMAKADRKHLRKHRGWKQRLLCCRF
        DSH VSS+VQ G+GETSF S+ PL GSLMSNSGRIGYSGSIS RSDSSTTSTRSFAFPILQ+EWNSSPVRMAK DRKHL+KHRGW+  +LCCRF
Subjt:  DSHLVSSQVQHGIGETSFSSMGPLGGSLMSNSGRIGYSGSISLRSDSSTTSTRSFAFPILQSEWNSSPVRMAKADRKHLRKHRGWKQRLLCCRF

XP_011655720.1 uncharacterized protein LOC101218906 isoform X1 [Cucumis sativus] | 4.7e-169 | 67.81
Query:  KGEPVVCHSNVSPKFVPKSFECDNDALACGGMRLEDQKELTSPLKGNEQGADQLSHNKIAVDGWVVSKHDCLGLDDFNDYDEVKAFVSPLTNSSKVDLFE
        +GEP+VCHSN SPKFVPKSFECDNDAL  GGM+LEDQKE T+ LKGN        HN    DGWV  K +CL LDDFNDYD+VKAFVSPL NS K DL E
Subjt:  KGEPVVCHSNVSPKFVPKSFECDNDALACGGMRLEDQKELTSPLKGNEQGADQLSHNKIAVDGWVVSKHDCLGLDDFNDYDEVKAFVSPLTNSSKVDLFE

Query:  EDSELYMEKSIVECQLPELIVCYKENICNIVKDICIDEGVPSRDKLLGGSSLDEKAECTIFPPEEDWKDELRREPEKKDNFASDDLEHTESFCNKNSPRD
        EDSELYMEKS+VECQLPELIVCYKENICNIVKDICID+G P RDKL  GSSLDEK  C+I PP +DWKDE   E +++D FASDD EH+ESF +K+SP +
Subjt:  EDSELYMEKSIVECQLPELIVCYKENICNIVKDICIDEGVPSRDKLLGGSSLDEKAECTIFPPEEDWKDELRREPEKKDNFASDDLEHTESFCNKNSPRD

Query:  FG--DLARIPEAEYDVAYFTDNDMPNLPMKELVAESLKPLINNKNDVHPQSEQVSIETASLEVPDLVSVVEESFSNTKEEISASTTTAPAPEDPKNSPSV
        +   DLA  PEAEYDVAYFTDNDM   PM +LV ESLKPL NN+   HPQSEQV IET   EVP LV V EESFSNT+E  S S T+A   EDPK+  S 
Subjt:  FG--DLARIPEAEYDVAYFTDNDMPNLPMKELVAESLKPLINNKNDVHPQSEQVSIETASLEVPDLVSVVEESFSNTKEEISASTTTAPAPEDPKNSPSV

Query:  NDISYNSKVDNGNITFDFNSLASTVSDGMECHDNGDLNSSAPTISASF---------------------HHTCSNPKCVEYEDL--------PKAEVGIS
        N +SYNSKVD GNITFDFNSLAST SDG+E  DNGDLNSSAP+ SAS                      H T SNPK VEYEDL        PK EVG  
Subjt:  NDISYNSKVDNGNITFDFNSLASTVSDGMECHDNGDLNSSAPTISASF---------------------HHTCSNPKCVEYEDL--------PKAEVGIS

Query:  DSHLVSSQVQHGIGETSFSSMGPLGGSLMSNSGRIGYSGSISLRSDSSTTSTRSFAFPILQSEWNSSPVRMAKADRKHLRKHRGWKQRLLCCRF
        DSH VSS+V  G+GETSF S+ PL GSL+SNSGRIGYSGSIS RSDSSTTSTRSFAFPILQSEW SSPVRM K DRKHL+KHRGW+  +LCCRF
Subjt:  DSHLVSSQVQHGIGETSFSSMGPLGGSLMSNSGRIGYSGSISLRSDSSTTSTRSFAFPILQSEWNSSPVRMAKADRKHLRKHRGWKQRLLCCRF

XP_022149065.1 uncharacterized protein LOC111017570 [Momordica charantia] | 1.6e-177 | 73.05
Query:  KGEPVVCHSNVSPKFVPKSFECDNDALACGGMRLEDQKELTSPLKGNEQGADQLSHNKIAVDGWVVSKHDCLGLDDFNDYDEVKAFVSPLTNSSKVDLFE
        +GE VVCHSN+S KFVPKS   DNDA   GGM LEDQKELTSP K N+Q ADQ            V KHDCLGLDDFN Y+EV+A VSP TNSSKVDLFE
Subjt:  KGEPVVCHSNVSPKFVPKSFECDNDALACGGMRLEDQKELTSPLKGNEQGADQLSHNKIAVDGWVVSKHDCLGLDDFNDYDEVKAFVSPLTNSSKVDLFE

Query:  EDSELYMEKSIVECQLPELIVCYKENICNIVKDICIDEGVPSRDKLLGGSSLDEKAECTIFPPEEDWKDELRREPEKKDNFASDDLEHTESFCNKNSPR-
        EDSELYMEKSIVECQLPELIVCYKENICNIVKDICIDEGVPSRD LL G+SLDEKA C I P E+DWKDEL  E EK+  F+S   EH ESF NK+SP+ 
Subjt:  EDSELYMEKSIVECQLPELIVCYKENICNIVKDICIDEGVPSRDKLLGGSSLDEKAECTIFPPEEDWKDELRREPEKKDNFASDDLEHTESFCNKNSPR-

Query:  -DFGDLARIPEAEYDVAYFTDNDMPNLPMKELVAESLKPLINNKNDVHPQSEQVSIETASLEVPDLVSVVEESFSNTKEEISASTTTAPAPEDPKNSPSV
         D  DL RIPEAEYDVAYFTDND+PNL MK+LV ESLKPLIN+K++ HPQSEQV IE+ASLEVP  VS VE+S+S T E I+AST       +PKNS SV
Subjt:  -DFGDLARIPEAEYDVAYFTDNDMPNLPMKELVAESLKPLINNKNDVHPQSEQVSIETASLEVPDLVSVVEESFSNTKEEISASTTTAPAPEDPKNSPSV

Query:  NDISYNSKVDNGNITFDFNSLASTVSDGMECHDNGDLNSSAPTISASF---------------------HHTCSNPKCVEYEDLPKAEVGISDSHLVSSQ
        N+ISYNSKVDNGNITFDFNS AS  SDGME HDNG  NSSAPT SAS                      HHT SNPKCVEYEDLPKAEVGIS S  VS+Q
Subjt:  NDISYNSKVDNGNITFDFNSLASTVSDGMECHDNGDLNSSAPTISASF---------------------HHTCSNPKCVEYEDLPKAEVGISDSHLVSSQ

Query:  VQHGIGETSFSSMGPLGGSLMSNSGRIGYSGSISLRSDSSTTSTRSFAFPILQSEWNSSPVRMAKADRKHLRKHRGWKQRLLCCRF
        VQHGIGETSFSSMGPL GSL+SNSGRIGYSGSISLRSDSSTTSTRSFAFPI+QSEWNSSPVRMAKADR   RKHRGWK  LLCCRF
Subjt:  VQHGIGETSFSSMGPLGGSLMSNSGRIGYSGSISLRSDSSTTSTRSFAFPILQSEWNSSPVRMAKADRKHLRKHRGWKQRLLCCRF

XP_038892052.1 uncharacterized protein LOC120081347 isoform X1 [Benincasa hispida] | 9.7e-183 | 70.63
Query:  KGEPVVCHSNVSPKFVPKSFECDNDALACGGMRLEDQKELTSPLKGNEQGADQLSHNKIAVDGWVVSKHDCLGLDDFNDYDEVKAFVSPLTNSSKVDLFE
        +GEP+V HSN SP+FVPKSFECDNDA+  GGM+LEDQKE TS LKGNE      +HN  A DGWV  K +CL LDDFN+YDEVKAFVSPLTNSSKVDLFE
Subjt:  KGEPVVCHSNVSPKFVPKSFECDNDALACGGMRLEDQKELTSPLKGNEQGADQLSHNKIAVDGWVVSKHDCLGLDDFNDYDEVKAFVSPLTNSSKVDLFE

Query:  EDSELYMEKSIVECQLPELIVCYKENICNIVKDICIDEGVPSRDKLLGGSSLDEKAECTIFPPEEDWKDELRREPEKKDNFASDDLEHTESFCNKNSPR-
        EDSELYMEKS VECQLPELIVCYKENICNIVKDICID+GVPSRDKLL GSSLDEK  C+I PP   WKD+L RE EK+D +ASDD EH+ESF NK+SP+ 
Subjt:  EDSELYMEKSIVECQLPELIVCYKENICNIVKDICIDEGVPSRDKLLGGSSLDEKAECTIFPPEEDWKDELRREPEKKDNFASDDLEHTESFCNKNSPR-

Query:  -DFGDLARIPEAEYDVAYFTDNDMPNLPMKELVAESLKPLINNKNDVHPQSEQVSIETASLEVPDLVSVVEESFSNTKEEISASTTTAPAPEDPKNSPSV
         D  DL R PEAEYDVAYFTDNDM   PM + V ESLKPL NNK + HP+SEQV IET SLEVP L  V EESFS+++E IS STT+A APE+ KNS S 
Subjt:  -DFGDLARIPEAEYDVAYFTDNDMPNLPMKELVAESLKPLINNKNDVHPQSEQVSIETASLEVPDLVSVVEESFSNTKEEISASTTTAPAPEDPKNSPSV

Query:  NDISYNSKVDNGNITFDFNSLASTVSDGMECHDNGDLNSSAPTISASF---------------------HHTCSNPKCV--------EYEDLPKAEVGIS
        ND+SYNSKVD GNITFDFNSLAST SDG+E  DN DLN+SAP+ SAS                      H T +NPKCV        EYEDLPK EVG  
Subjt:  NDISYNSKVDNGNITFDFNSLASTVSDGMECHDNGDLNSSAPTISASF---------------------HHTCSNPKCV--------EYEDLPKAEVGIS

Query:  DSHLVSSQVQHG----------IGETSFSSMGPLGGSLMSNSGRIGYSGSISLRSDSSTTSTRSFAFPILQSEWNSSPVRMAKADRKHLRKHRGWKQRLL
        DSH VSSQVQHG          +GETSFSSM PL GSLMSNSG IGYSGSISLRSDSSTTSTRSFAFPILQSEWNSSPVRMAKADR+HLRKHRGW+Q +L
Subjt:  DSHLVSSQVQHG----------IGETSFSSMGPLGGSLMSNSGRIGYSGSISLRSDSSTTSTRSFAFPILQSEWNSSPVRMAKADRKHLRKHRGWKQRLL

Query:  CCRF
        CCRF
Subjt:  CCRF

XP_038892056.1 uncharacterized protein LOC120081347 isoform X2 [Benincasa hispida] | 2.6e-172 | 68.22
Query:  KGEPVVCHSNVSPKFVPKSFECDNDALACGGMRLEDQKELTSPLKGNEQGADQLSHNKIAVDGWVVSKHDCLGLDDFNDYDEVKAFVSPLTNSSKVDLFE
        +GEP+V HSN SP+FVPKSFECDNDA+  GGM+LEDQKE TS LKGNE      +HN  A DGWV  K +CL LDDFN+YDEVKAFVSPLTNSSKVDLFE
Subjt:  KGEPVVCHSNVSPKFVPKSFECDNDALACGGMRLEDQKELTSPLKGNEQGADQLSHNKIAVDGWVVSKHDCLGLDDFNDYDEVKAFVSPLTNSSKVDLFE

Query:  EDSELYMEKSIVECQLPELIVCYKENICNIVKDICIDEGVPSRDKLLGGSSLDEKAECTIFPPEEDWKDELRREPEKKDNFASDDLEHTESFCNKNSPR-
        EDSELYMEKS VECQLPELIVCYKENICNIVKDICID+GVPSRDKLL GSSLDEK  C+I PP   WKD+L RE EK+D +ASDD EH+ESF NK+SP+ 
Subjt:  EDSELYMEKSIVECQLPELIVCYKENICNIVKDICIDEGVPSRDKLLGGSSLDEKAECTIFPPEEDWKDELRREPEKKDNFASDDLEHTESFCNKNSPR-

Query:  -DFGDLARIPEAEYDVAYFTDNDMPNLPMKELVAESLKPLINNKNDVHPQSEQVSIETASLEVPDLVSVVEESFSNTKEEISASTTTAPAPEDPKNSPSV
         D  DL R PEAEYDVAYFTDNDM   PM + V ESLKPL NNK + HP+SEQV IET SLEVP L  V EESFS+++E IS STT+A APE+ KNS S 
Subjt:  -DFGDLARIPEAEYDVAYFTDNDMPNLPMKELVAESLKPLINNKNDVHPQSEQVSIETASLEVPDLVSVVEESFSNTKEEISASTTTAPAPEDPKNSPSV

Query:  NDISYNSKVDNGNITFDFNSLASTVSDGMECHDNGDLNSSAPTISASF---------------------HHTCSNPKCV--------EYEDLPKAEVGIS
        ND+SYNSKVD GNITFDFNSLAST SDG+E  DN DLN+SAP+ SAS                      H T +NPKCV        EYEDLPK EVG  
Subjt:  NDISYNSKVDNGNITFDFNSLASTVSDGMECHDNGDLNSSAPTISASF---------------------HHTCSNPKCV--------EYEDLPKAEVGIS

Query:  DSHLVSSQVQHGIGETSFSSMGPLGGSLMSNSGRIGYSGSISLRSDSSTTSTRSFAFPILQSEWNSSPVRMAKADRKHLRKHRGWKQRLLCCRF
        DSH VSSQVQHG                       GYSGSISLRSDSSTTSTRSFAFPILQSEWNSSPVRMAKADR+HLRKHRGW+Q +LCCRF
Subjt:  DSHLVSSQVQHGIGETSFSSMGPLGGSLMSNSGRIGYSGSISLRSDSSTTSTRSFAFPILQSEWNSSPVRMAKADRKHLRKHRGWKQRLLCCRF

TrEMBL top hits | e value | % identity
A0A1S3BFZ0 uncharacterized protein LOC103489197 isoform X1 | 2.7e-170 | 68.62
Query:  KGEPVVCHSNVSPKFVPKSFECDNDALACGGMRLEDQKELTSPLKGNEQGADQLSHNKIAVDGWVVSKHDCLGLDDFNDYDEVKAFVSPLTNSSKVDLFE
        +GEP+VCHSN SPKFVPKSFECDND L  GGM+LEDQKE TS LKGN        HN  A DGWV  K +CL LDDFNDYD+VKAFVSPL NS KVDL E
Subjt:  KGEPVVCHSNVSPKFVPKSFECDNDALACGGMRLEDQKELTSPLKGNEQGADQLSHNKIAVDGWVVSKHDCLGLDDFNDYDEVKAFVSPLTNSSKVDLFE

Query:  EDSELYMEKSIVECQLPELIVCYKENICNIVKDICIDEGVPSRDKLLGGSSLDEKAECTIFPPEEDWKDELRREPEKKDNFASDDLEHTESFCNKNSPR-
        EDSELYMEKSIVECQLPELIVCYKENICNIVKDICID+G P RDKL  GSSLDE+  C+I PP +DWKDE   E +++D FASDD EH+ESF +K+SP  
Subjt:  EDSELYMEKSIVECQLPELIVCYKENICNIVKDICIDEGVPSRDKLLGGSSLDEKAECTIFPPEEDWKDELRREPEKKDNFASDDLEHTESFCNKNSPR-

Query:  -DFGDLARIPEAEYDVAYFTDNDMPNLPMKELVAESLKPLINNKNDVHPQSEQVSIETASLEVPDLVSVVEESFSNTKEEISASTTTAPAPEDPKNSPSV
         D  DLA  PEAEYDVAYFTDNDM   PM +LV ESLKPL +NK   HPQSEQV IET   EVP L  V +ESF NT+E  S S T+A   EDPKNS S 
Subjt:  -DFGDLARIPEAEYDVAYFTDNDMPNLPMKELVAESLKPLINNKNDVHPQSEQVSIETASLEVPDLVSVVEESFSNTKEEISASTTTAPAPEDPKNSPSV

Query:  NDISYNSKVDNGNITFDFNSLASTVSDGMECHDNGDLNSSAPTISASF---------------------HHTCSNPKCVEYEDLP--------KAEVGIS
        N +SYNSKVD GNITFDFNSLA T SDG+E  DNGDLNSSAP+ SAS                      H+T SNPK VEYEDLP        K EVG  
Subjt:  NDISYNSKVDNGNITFDFNSLASTVSDGMECHDNGDLNSSAPTISASF---------------------HHTCSNPKCVEYEDLP--------KAEVGIS

Query:  DSHLVSSQVQHGIGETSFSSMGPLGGSLMSNSGRIGYSGSISLRSDSSTTSTRSFAFPILQSEWNSSPVRMAKADRKHLRKHRGWKQRLLCCRF
        DSH VSS+VQ G+GETSF S+ PL GSLMSNSGRIGYSGSIS RSDSSTTSTRSFAFPILQ+EWNSSPVRMAK DRKHL+KHRGW+  +LCCRF
Subjt:  DSHLVSSQVQHGIGETSFSSMGPLGGSLMSNSGRIGYSGSISLRSDSSTTSTRSFAFPILQSEWNSSPVRMAKADRKHLRKHRGWKQRLLCCRF

A0A6J1D4Q3 uncharacterized protein LOC111017570 | 7.8e-178 | 73.05
Query:  KGEPVVCHSNVSPKFVPKSFECDNDALACGGMRLEDQKELTSPLKGNEQGADQLSHNKIAVDGWVVSKHDCLGLDDFNDYDEVKAFVSPLTNSSKVDLFE
        +GE VVCHSN+S KFVPKS   DNDA   GGM LEDQKELTSP K N+Q ADQ            V KHDCLGLDDFN Y+EV+A VSP TNSSKVDLFE
Subjt:  KGEPVVCHSNVSPKFVPKSFECDNDALACGGMRLEDQKELTSPLKGNEQGADQLSHNKIAVDGWVVSKHDCLGLDDFNDYDEVKAFVSPLTNSSKVDLFE

Query:  EDSELYMEKSIVECQLPELIVCYKENICNIVKDICIDEGVPSRDKLLGGSSLDEKAECTIFPPEEDWKDELRREPEKKDNFASDDLEHTESFCNKNSPR-
        EDSELYMEKSIVECQLPELIVCYKENICNIVKDICIDEGVPSRD LL G+SLDEKA C I P E+DWKDEL  E EK+  F+S   EH ESF NK+SP+ 
Subjt:  EDSELYMEKSIVECQLPELIVCYKENICNIVKDICIDEGVPSRDKLLGGSSLDEKAECTIFPPEEDWKDELRREPEKKDNFASDDLEHTESFCNKNSPR-

Query:  -DFGDLARIPEAEYDVAYFTDNDMPNLPMKELVAESLKPLINNKNDVHPQSEQVSIETASLEVPDLVSVVEESFSNTKEEISASTTTAPAPEDPKNSPSV
         D  DL RIPEAEYDVAYFTDND+PNL MK+LV ESLKPLIN+K++ HPQSEQV IE+ASLEVP  VS VE+S+S T E I+AST       +PKNS SV
Subjt:  -DFGDLARIPEAEYDVAYFTDNDMPNLPMKELVAESLKPLINNKNDVHPQSEQVSIETASLEVPDLVSVVEESFSNTKEEISASTTTAPAPEDPKNSPSV

Query:  NDISYNSKVDNGNITFDFNSLASTVSDGMECHDNGDLNSSAPTISASF---------------------HHTCSNPKCVEYEDLPKAEVGISDSHLVSSQ
        N+ISYNSKVDNGNITFDFNS AS  SDGME HDNG  NSSAPT SAS                      HHT SNPKCVEYEDLPKAEVGIS S  VS+Q
Subjt:  NDISYNSKVDNGNITFDFNSLASTVSDGMECHDNGDLNSSAPTISASF---------------------HHTCSNPKCVEYEDLPKAEVGISDSHLVSSQ

Query:  VQHGIGETSFSSMGPLGGSLMSNSGRIGYSGSISLRSDSSTTSTRSFAFPILQSEWNSSPVRMAKADRKHLRKHRGWKQRLLCCRF
        VQHGIGETSFSSMGPL GSL+SNSGRIGYSGSISLRSDSSTTSTRSFAFPI+QSEWNSSPVRMAKADR   RKHRGWK  LLCCRF
Subjt:  VQHGIGETSFSSMGPLGGSLMSNSGRIGYSGSISLRSDSSTTSTRSFAFPILQSEWNSSPVRMAKADRKHLRKHRGWKQRLLCCRF

A0A6J1GWB6 uncharacterized protein LOC111458170 isoform X2 | 7.1e-163 | 66.87
Query:  KGEPVVCHSNVSPKFVPKSFECDNDALACGGMRLEDQKELTSPLKGNEQGADQLSHNKIAVDGWVVSKHDCLGLDDFNDYDEVKAFVSPLTNSSKVDLFE
        +GEPVV HS+ SPKF+PKSFECDNDAL  GGM+LED K+ T  LK NE      +H           K+  +GLDD N++DEVKAFV  +TNSSKVDLFE
Subjt:  KGEPVVCHSNVSPKFVPKSFECDNDALACGGMRLEDQKELTSPLKGNEQGADQLSHNKIAVDGWVVSKHDCLGLDDFNDYDEVKAFVSPLTNSSKVDLFE

Query:  EDSELYMEKSIVECQLPELIVCYKENICNIVKDICIDEGVPSRDKLL-GGSSLDEKAECTIFPPEEDWKDELRREPEKKDNFASDDLEHTESFCNKNSPR
        EDSELYMEKSIVECQLPELIVCYKEN CNIVKDICID+GVPSRDKLL G SSLDEKA C I PPEEDWKDEL R  E+ D FASDD EH+ESF  K+SP+
Subjt:  EDSELYMEKSIVECQLPELIVCYKENICNIVKDICIDEGVPSRDKLL-GGSSLDEKAECTIFPPEEDWKDELRREPEKKDNFASDDLEHTESFCNKNSPR

Query:  --DFGDLARIPEAEYDVAYFTDNDMPNLPMKELVAESLKPLINNKNDVHPQSEQVSIETASLEVPDLVSVVEESFSNTKEEISASTTTAPAPEDPKNSPS
          DF DLAR PEAEYDV YFTDND+ NLPM +L  ES+KPL NNKN+ +PQSEQV      LEVP L  V EES+S+T+EEIS   ++  A E+PKNS S
Subjt:  --DFGDLARIPEAEYDVAYFTDNDMPNLPMKELVAESLKPLINNKNDVHPQSEQVSIETASLEVPDLVSVVEESFSNTKEEISASTTTAPAPEDPKNSPS

Query:  VNDISYNSKVDNGNITFDFNSLASTVSDGMECHDNGDLNSSAPTISASF---------------------HHTCSNPK--------CVEYEDLPKAEVGI
          DISYNSK+D GNITFDFNS AST SDG+E  DNG LNSSAP+ SAS                      +   SNPK         VEYEDL KAEVG 
Subjt:  VNDISYNSKVDNGNITFDFNSLASTVSDGMECHDNGDLNSSAPTISASF---------------------HHTCSNPK--------CVEYEDLPKAEVGI

Query:  SDSHLVSSQVQHGIGETSFSSMGPLGGSLMSNSGRIGYSGSISLRSDSSTTSTRSFAFPILQSEWNSSPVRMAKADRKHLRKHRGWKQRLLCCRF
        SDS+ VSSQVQHG+GE S SSM  L GSL+SNSGRIGYSGSIS RSDSSTTST SFAFPILQSEWNSSPVRMAKAD+KHLRK RGW+Q LLCCRF
Subjt:  SDSHLVSSQVQHGIGETSFSSMGPLGGSLMSNSGRIGYSGSISLRSDSSTTSTRSFAFPILQSEWNSSPVRMAKADRKHLRKHRGWKQRLLCCRF

A0A6J1GWT7 uncharacterized protein LOC111458170 isoform X1 | 1.4e-166 | 67.68
Query:  KGEPVVCHSNVSPKFVPKSFECDNDALACGGMRLEDQKELTSPLKGNEQGADQLSHNKIAVDGWVVSKHDCLGLDDFNDYDEVKAFVSPLTNSSKVDLFE
        +GEPVV HS+ SPKF+PKSFECDNDAL  GGM+LED K+ T  LK NE      +H           K+  +GLDD N++DEVKAFV  +TNSSKVDLFE
Subjt:  KGEPVVCHSNVSPKFVPKSFECDNDALACGGMRLEDQKELTSPLKGNEQGADQLSHNKIAVDGWVVSKHDCLGLDDFNDYDEVKAFVSPLTNSSKVDLFE

Query:  EDSELYMEKSIVECQLPELIVCYKENICNIVKDICIDEGVPSRDKLL-GGSSLDEKAECTIFPPEEDWKDELRREPEKKDNFASDDLEHTESFCNKNSPR
        EDSELYMEKSIVECQLPELIVCYKEN CNIVKDICID+GVPSRDKLL G SSLDEKA C I PPEEDWKDEL R  E+ D FASDD EH+ESF  K+SP+
Subjt:  EDSELYMEKSIVECQLPELIVCYKENICNIVKDICIDEGVPSRDKLL-GGSSLDEKAECTIFPPEEDWKDELRREPEKKDNFASDDLEHTESFCNKNSPR

Query:  --DFGDLARIPEAEYDVAYFTDNDMPNLPMKELVAESLKPLINNKNDVHPQSEQVSIETASLEVPDLVSVVEESFSNTKEEISASTTTAPAPEDPKNSPS
          DF DLAR PEAEYDV YFTDND+ NLPM +L  ES+KPL NNKN+ +PQSEQV IET SLEVP L  V EES+S+T+EEIS   ++  A E+PKNS S
Subjt:  --DFGDLARIPEAEYDVAYFTDNDMPNLPMKELVAESLKPLINNKNDVHPQSEQVSIETASLEVPDLVSVVEESFSNTKEEISASTTTAPAPEDPKNSPS

Query:  VNDISYNSKVDNGNITFDFNSLASTVSDGMECHDNGDLNSSAPTISASF---------------------HHTCSNPK--------CVEYEDLPKAEVGI
          DISYNSK+D GNITFDFNS AST SDG+E  DNG LNSSAP+ SAS                      +   SNPK         VEYEDL KAEVG 
Subjt:  VNDISYNSKVDNGNITFDFNSLASTVSDGMECHDNGDLNSSAPTISASF---------------------HHTCSNPK--------CVEYEDLPKAEVGI

Query:  SDSHLVSSQVQHGIGETSFSSMGPLGGSLMSNSGRIGYSGSISLRSDSSTTSTRSFAFPILQSEWNSSPVRMAKADRKHLRKHRGWKQRLLCCRF
        SDS+ VSSQVQHG+GE S SSM  L GSL+SNSGRIGYSGSIS RSDSSTTST SFAFPILQSEWNSSPVRMAKAD+KHLRK RGW+Q LLCCRF
Subjt:  SDSHLVSSQVQHGIGETSFSSMGPLGGSLMSNSGRIGYSGSISLRSDSSTTSTRSFAFPILQSEWNSSPVRMAKADRKHLRKHRGWKQRLLCCRF

A0A6J1HTL9 uncharacterized protein LOC111467709 | 3.2e-163 | 67.56
Query:  KGEPVVCHSNVSPKFVPKSFECDNDALACGGMRLEDQKELTSPLKGNEQGADQLSHNKIAVDGWVVSKHDCLGLDDFNDYDEVKAFVSPLTNSSKVDLFE
        +GE VVCHSN  PKFVPKSFECDNDA+  GGM+L D KEL S  KG E        N IA DGWVV K DCL LDDFNDYD+ K  VS  TNSS+VDLFE
Subjt:  KGEPVVCHSNVSPKFVPKSFECDNDALACGGMRLEDQKELTSPLKGNEQGADQLSHNKIAVDGWVVSKHDCLGLDDFNDYDEVKAFVSPLTNSSKVDLFE

Query:  EDSELYMEKSIVECQLPELIVCYKENICNIVKDICIDEGVPSRDKLLGGSSLDEKAECTIFPPEEDWKDELRREPEKKDNFASDDLEHTESFCNKNSP--
        EDSEL+MEKSIVECQLPELIVCYKENICNIVKDICID+GVPSRDKL       EKA   I PPE+DWKDE  R+  K+D FASDD EH+ESF +K+SP  
Subjt:  EDSELYMEKSIVECQLPELIVCYKENICNIVKDICIDEGVPSRDKLLGGSSLDEKAECTIFPPEEDWKDELRREPEKKDNFASDDLEHTESFCNKNSP--

Query:  RDFGDLARIPEAEYDVAYFTDNDMPNLPMKELVAESLKPLINNKNDVHPQSEQVSIETASLEVPDLVSVVEESFSNTKEEISASTTTAPAPEDPKNSPSV
        RD  D AR P+AEYDVAYFTDND+ NLPM +LVA SLKPLINNKN+ H Q+EQV IET S EV     V EES SN KE +S  TT+AP PEDP+NS S 
Subjt:  RDFGDLARIPEAEYDVAYFTDNDMPNLPMKELVAESLKPLINNKNDVHPQSEQVSIETASLEVPDLVSVVEESFSNTKEEISASTTTAPAPEDPKNSPSV

Query:  NDISYNSKVDNGNITFDFNSLASTVSDGMECHDNGDLNSSAPTISA------------------SFHHTCSNPKCVEYEDLPKAEVGISDSHLVSSQVQH
        NDISYNSK+DNGNITFDFNSL+S  SDG+E  DNGDLNSS P+ SA                    H T  + K VEYED+ KA VG S SH VSS VQ 
Subjt:  NDISYNSKVDNGNITFDFNSLASTVSDGMECHDNGDLNSSAPTISA------------------SFHHTCSNPKCVEYEDLPKAEVGISDSHLVSSQVQH

Query:  G-IGETSFSSMGPLGGSLMSNSGRIGYSGSISLRSDSSTTSTRSFAFPILQSEWNSSPVRMAKADRKHLRKHRGWKQRLLCCRF
        G +GE SFSSM P G S M +S RIGYSGSIS+RSDSSTTSTRSFAFP+LQ EWNSSPVRMAKADRKHLRKHR WK+ +LCCRF
Subjt:  G-IGETSFSSMGPLGGSLMSNSGRIGYSGSISLRSDSSTTSTRSFAFPILQSEWNSSPVRMAKADRKHLRKHRGWKQRLLCCRF

SwissProt top hits | e value | % identity
No hits found

Arabidopsis top hits | e value | % identity
AT1G13650.2 BEST Arabidopsis thaliana protein match is: 18S pre-ribosomal assembly protein gar2-related (TAIR:AT2G03810.4) | 2.2e-07 | 32.89
Query:  VNDISYNSKVDNGNITFDFNSLASTVSDGMECHDNGDLNSSAPTISASFHHTCSNPKCVEYEDLPKAEVGISDSHLVSSQVQHGIGETSFSSMGPLGGSL
        VND S  +  D    +   +  A T+ D +   D+    +   T     +    NP  +E  +  +A+    D++L+   + +G GE SF          
Subjt:  VNDISYNSKVDNGNITFDFNSLASTVSDGMECHDNGDLNSSAPTISASFHHTCSNPKCVEYEDLPKAEVGISDSHLVSSQVQHGIGETSFSSMGPLGGSL

Query:  MSNSGRIGYSGSISLRSDSSTTSTRSFAFPILQSEWNSSPVRMAKADRKHLR
        ++  G +  S ++S+RSD   TS  SFA PILQSEWNSSPVRM KA+   LR
Subjt:  MSNSGRIGYSGSISLRSDSSTTSTRSFAFPILQSEWNSSPVRMAKADRKHLR

AT2G03810.1 18S pre-ribosomal assembly protein gar2-related | 1.3e-36 | 32.8
Query:  EEDSELYMEKSIVECQLPELIVCYKENICNIVKDICIDEGVPSRDKLLGGSSLDEKAECTIFPPEEDWKDELRREPEKKDNFASDDLEHTE--SFCN---
        ++D   YM+K++  C LPE++VCYKEN  +IVKDIC+DEGVP ++K L G     K+  T    E+  K +       +   A D +   +   FCN   
Subjt:  EEDSELYMEKSIVECQLPELIVCYKENICNIVKDICIDEGVPSRDKLLGGSSLDEKAECTIFPPEEDWKDELRREPEKKDNFASDDLEHTE--SFCN---

Query:  ------KNSPRDFGDLARIPEAEYDVAYFTDNDMPNLPMKELVAESLKPLINNKNDVHPQSEQVSIETASLEVPDLVSVVEESFSNTKEEISAST--TTA
              ++S  DF D                 ++   P   L    ++P  N+K++V    +  S E   L + D++S  +E  S  ++ IS+ +    +
Subjt:  ------KNSPRDFGDLARIPEAEYDVAYFTDNDMPNLPMKELVAESLKPLINNKNDVHPQSEQVSIETASLEVPDLVSVVEESFSNTKEEISAST--TTA

Query:  PAPEDPKNSPSVNDISYNSKVDNGNITFDFNSLASTVSDGMECHDNGDLNSSAPTISASFHHTCSNPKCVEYEDLPKAEVGISDSHLVSSQVQHGIGETS
        P+    K   S+   +  ++++                 G E      L+S + T S   + TC+ P+  E E+  +    + +S+          GETS
Subjt:  PAPEDPKNSPSVNDISYNSKVDNGNITFDFNSLASTVSDGMECHDNGDLNSSAPTISASFHHTCSNPKCVEYEDLPKAEVGISDSHLVSSQVQHGIGETS

Query:  FSSMGPLG-GSLMSNSGRIGYSGSISLRSDSSTTSTRSFAFPILQSEWNSSPVRMAKADRKHLRKHRGWKQRLLCCRF
        FS+   +     ++ SG I YSGS+S+RSD+STTS RSFAFPILQSEWNSSPVRMAKAD++  R+  GW+  LLCCRF
Subjt:  FSSMGPLG-GSLMSNSGRIGYSGSISLRSDSSTTSTRSFAFPILQSEWNSSPVRMAKADRKHLRKHRGWKQRLLCCRF

AT2G03810.2 18S pre-ribosomal assembly protein gar2-related | 1.3e-36 | 32.8
Query:  EEDSELYMEKSIVECQLPELIVCYKENICNIVKDICIDEGVPSRDKLLGGSSLDEKAECTIFPPEEDWKDELRREPEKKDNFASDDLEHTE--SFCN---
        ++D   YM+K++  C LPE++VCYKEN  +IVKDIC+DEGVP ++K L G     K+  T    E+  K +       +   A D +   +   FCN   
Subjt:  EEDSELYMEKSIVECQLPELIVCYKENICNIVKDICIDEGVPSRDKLLGGSSLDEKAECTIFPPEEDWKDELRREPEKKDNFASDDLEHTE--SFCN---

Query:  ------KNSPRDFGDLARIPEAEYDVAYFTDNDMPNLPMKELVAESLKPLINNKNDVHPQSEQVSIETASLEVPDLVSVVEESFSNTKEEISAST--TTA
              ++S  DF D                 ++   P   L    ++P  N+K++V    +  S E   L + D++S  +E  S  ++ IS+ +    +
Subjt:  ------KNSPRDFGDLARIPEAEYDVAYFTDNDMPNLPMKELVAESLKPLINNKNDVHPQSEQVSIETASLEVPDLVSVVEESFSNTKEEISAST--TTA

Query:  PAPEDPKNSPSVNDISYNSKVDNGNITFDFNSLASTVSDGMECHDNGDLNSSAPTISASFHHTCSNPKCVEYEDLPKAEVGISDSHLVSSQVQHGIGETS
        P+    K   S+   +  ++++                 G E      L+S + T S   + TC+ P+  E E+  +    + +S+          GETS
Subjt:  PAPEDPKNSPSVNDISYNSKVDNGNITFDFNSLASTVSDGMECHDNGDLNSSAPTISASFHHTCSNPKCVEYEDLPKAEVGISDSHLVSSQVQHGIGETS

Query:  FSSMGPLG-GSLMSNSGRIGYSGSISLRSDSSTTSTRSFAFPILQSEWNSSPVRMAKADRKHLRKHRGWKQRLLCCRF
        FS+   +     ++ SG I YSGS+S+RSD+STTS RSFAFPILQSEWNSSPVRMAKAD++  R+  GW+  LLCCRF
Subjt:  FSSMGPLG-GSLMSNSGRIGYSGSISLRSDSSTTSTRSFAFPILQSEWNSSPVRMAKADRKHLRKHRGWKQRLLCCRF

AT2G03810.3 18S pre-ribosomal assembly protein gar2-related | 1.3e-36 | 32.8
Query:  EEDSELYMEKSIVECQLPELIVCYKENICNIVKDICIDEGVPSRDKLLGGSSLDEKAECTIFPPEEDWKDELRREPEKKDNFASDDLEHTE--SFCN---
        ++D   YM+K++  C LPE++VCYKEN  +IVKDIC+DEGVP ++K L G     K+  T    E+  K +       +   A D +   +   FCN   
Subjt:  EEDSELYMEKSIVECQLPELIVCYKENICNIVKDICIDEGVPSRDKLLGGSSLDEKAECTIFPPEEDWKDELRREPEKKDNFASDDLEHTE--SFCN---

Query:  ------KNSPRDFGDLARIPEAEYDVAYFTDNDMPNLPMKELVAESLKPLINNKNDVHPQSEQVSIETASLEVPDLVSVVEESFSNTKEEISAST--TTA
              ++S  DF D                 ++   P   L    ++P  N+K++V    +  S E   L + D++S  +E  S  ++ IS+ +    +
Subjt:  ------KNSPRDFGDLARIPEAEYDVAYFTDNDMPNLPMKELVAESLKPLINNKNDVHPQSEQVSIETASLEVPDLVSVVEESFSNTKEEISAST--TTA

Query:  PAPEDPKNSPSVNDISYNSKVDNGNITFDFNSLASTVSDGMECHDNGDLNSSAPTISASFHHTCSNPKCVEYEDLPKAEVGISDSHLVSSQVQHGIGETS
        P+    K   S+   +  ++++                 G E      L+S + T S   + TC+ P+  E E+  +    + +S+          GETS
Subjt:  PAPEDPKNSPSVNDISYNSKVDNGNITFDFNSLASTVSDGMECHDNGDLNSSAPTISASFHHTCSNPKCVEYEDLPKAEVGISDSHLVSSQVQHGIGETS

Query:  FSSMGPLG-GSLMSNSGRIGYSGSISLRSDSSTTSTRSFAFPILQSEWNSSPVRMAKADRKHLRKHRGWKQRLLCCRF
        FS+   +     ++ SG I YSGS+S+RSD+STTS RSFAFPILQSEWNSSPVRMAKAD++  R+  GW+  LLCCRF
Subjt:  FSSMGPLG-GSLMSNSGRIGYSGSISLRSDSSTTSTRSFAFPILQSEWNSSPVRMAKADRKHLRKHRGWKQRLLCCRF

AT2G03810.4 18S pre-ribosomal assembly protein gar2-related | 1.3e-36 | 32.8
Query:  EEDSELYMEKSIVECQLPELIVCYKENICNIVKDICIDEGVPSRDKLLGGSSLDEKAECTIFPPEEDWKDELRREPEKKDNFASDDLEHTE--SFCN---
        ++D   YM+K++  C LPE++VCYKEN  +IVKDIC+DEGVP ++K L G     K+  T    E+  K +       +   A D +   +   FCN   
Subjt:  EEDSELYMEKSIVECQLPELIVCYKENICNIVKDICIDEGVPSRDKLLGGSSLDEKAECTIFPPEEDWKDELRREPEKKDNFASDDLEHTE--SFCN---

Query:  ------KNSPRDFGDLARIPEAEYDVAYFTDNDMPNLPMKELVAESLKPLINNKNDVHPQSEQVSIETASLEVPDLVSVVEESFSNTKEEISAST--TTA
              ++S  DF D                 ++   P   L    ++P  N+K++V    +  S E   L + D++S  +E  S  ++ IS+ +    +
Subjt:  ------KNSPRDFGDLARIPEAEYDVAYFTDNDMPNLPMKELVAESLKPLINNKNDVHPQSEQVSIETASLEVPDLVSVVEESFSNTKEEISAST--TTA

Query:  PAPEDPKNSPSVNDISYNSKVDNGNITFDFNSLASTVSDGMECHDNGDLNSSAPTISASFHHTCSNPKCVEYEDLPKAEVGISDSHLVSSQVQHGIGETS
        P+    K   S+   +  ++++                 G E      L+S + T S   + TC+ P+  E E+  +    + +S+          GETS
Subjt:  PAPEDPKNSPSVNDISYNSKVDNGNITFDFNSLASTVSDGMECHDNGDLNSSAPTISASFHHTCSNPKCVEYEDLPKAEVGISDSHLVSSQVQHGIGETS

Query:  FSSMGPLG-GSLMSNSGRIGYSGSISLRSDSSTTSTRSFAFPILQSEWNSSPVRMAKADRKHLRKHRGWKQRLLCCRF
        FS+   +     ++ SG I YSGS+S+RSD+STTS RSFAFPILQSEWNSSPVRMAKAD++  R+  GW+  LLCCRF
Subjt:  FSSMGPLG-GSLMSNSGRIGYSGSISLRSDSSTTSTRSFAFPILQSEWNSSPVRMAKADRKHLRKHRGWKQRLLCCRF
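
In the alignments above, the unlabeled middle line follows the usual BLAST convention: a letter marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. As a rough sketch (the helper name `midline_stats` is hypothetical, and this is not how the database itself computes its figures), identity over a stretch of alignment can be tallied from that line alone. The example uses only the first ten columns of the Cucumis melo alignment, so its result is not comparable to the 68.62 reported for the full hit.

```python
def midline_stats(query, midline):
    """Count identities and positives from a BLAST-style midline.

    `query` is one aligned query row (gaps included) and `midline` is the
    row printed beneath it. Trailing spaces lost in copying are padded back.
    """
    length = len(query)
    if length == 0:
        raise ValueError("empty alignment row")
    midline = midline.ljust(length)[:length]
    identities = sum(ch.isalpha() for ch in midline)   # letters = identical residues
    positives = identities + midline.count("+")        # '+' = conservative substitution
    return {
        "length": length,
        "identities": identities,
        "positives": positives,
        "percent_identity": round(100.0 * identities / length, 2),
    }


# First ten alignment columns of the XP_008446468.1 hit.
print(midline_stats("KGEPVVCHSN", "+GEP+VCHSN"))
```

To reproduce a whole-hit figure, the query and midline rows of every block would need to be concatenated first, with their original spacing intact.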


Sequences
CDS sequence
ATGGTGACCCGGCCGTCGGCCATGTCTTCGAAATCTAAATCCACGGCGGCGGCTCAGGATGGCATATGGGAGAGAACACAGAGAGAGGAAGCTTTTCGATTTGGCTATCG
TGTGAAAGTTGTCAATAAGAGAAAGAAACAAGCTGGTAAATGCCAAGCAACTTGGGGGAAAGAGAAAATAATAGGATCAGAAGCCTGTTCGTCTAACGTGGGATCTCAAG
ATTGTCCGGATGATCAGAGCCAGAGGGGAGACAATGAATCAGAAGGCTGTTCGTCTCCCAAGACATCCATTAACGGAGACGATGAATTGAATGATTCGGGAGGAAAGGTC
CCACGAAAATATAGAAAAAGGGGACCGCGTCTGATCCGAAAGCCAGGGCAGAGCTTTTGGTTAAACCAGAGAGACGAGAGGAGGCAGCCGAAAATCAGATACTGTACAGA
AGATATTCAGAAACCAGAATCGGCAGCAGCGTTAGAGGTAGAGAGAAAAGCCTCCGGGGGTACACTTTCTGACATGATTGTCAGCCCCAGAAAAAGGTGTCTCAACAGTT
CGTTCTTGGTTAATTTAGGGGGATGGTCAAATCAAACGCTCTCCAATCTCGCAGGGACTGGAAGAGCATTCGGGATTGCTGCTCGACTTAAAAAGCTACATTCGAGCAGC
TTCTTCGGGATTGGAATTCGGGTCGTGCAGTGCAGACGGAAAGTTTTTGAATCTGAGGTGCAGTTGCCAGGGGGAAGGGAAGTCGGATGGGAAAACGGAAGATGGGCACA
CCTCTTCGTTCCAGGCTGCTTGGGTTTTCTGTACAGTTGCTTTAACACATTGTGGAATGCATTTCCCTCTGTTCCTATAGGCGCCAAGGGTGAGCCTGTAGTTTGCCATT
CAAATGTTAGCCCCAAGTTTGTTCCCAAGTCTTTTGAATGTGATAATGATGCTCTTGCTTGTGGTGGGATGAGGCTTGAAGATCAGAAGGAACTTACGAGCCCTCTCAAA
GGCAATGAGCAGGGTGCCGATCAGTTATCACACAATAAGATTGCTGTAGATGGTTGGGTTGTATCTAAGCATGATTGTTTAGGTCTTGATGATTTTAATGACTATGATGA
GGTTAAAGCCTTTGTGTCACCGCTCACTAATTCTTCTAAAGTAGACTTGTTTGAGGAAGACTCGGAGTTATACATGGAAAAAAGTATTGTTGAATGTCAACTTCCTGAAC
TGATAGTTTGTTACAAAGAAAATATTTGCAATATTGTTAAGGATATTTGTATTGACGAGGGAGTTCCTTCTCGGGATAAGCTATTGGGTGGTAGTAGTTTGGATGAGAAG
GCTGAGTGCACCATTTTCCCTCCCGAGGAAGATTGGAAGGACGAATTGCGAAGAGAACCGGAGAAAAAGGATAATTTTGCTTCAGATGATTTAGAGCATACGGAATCTTT
TTGCAATAAGAATTCACCCCGTGATTTCGGGGATTTGGCTAGAATACCTGAGGCAGAATATGATGTGGCATATTTCACTGACAATGATATGCCAAATCTTCCAATGAAAG
AGTTGGTTGCAGAGAGCTTAAAGCCATTGATCAACAATAAGAATGACGTTCACCCTCAGTCTGAACAGGTTTCTATTGAAACTGCAAGTTTGGAGGTCCCAGATTTGGTA
TCTGTAGTTGAAGAATCTTTTAGTAACACCAAGGAAGAAATATCGGCATCCACCACTACAGCTCCAGCACCTGAAGATCCAAAAAATAGCCCTTCTGTGAATGATATATC
ATACAATAGTAAAGTGGATAATGGAAACATTACTTTTGATTTCAATTCTTTAGCATCTACAGTTAGTGATGGAATGGAGTGTCATGATAATGGCGACTTAAACTCTTCAG
CTCCGACGATCAGTGCCTCGTTTCATCATACTTGTAGTAACCCTAAATGTGTGGAATATGAAGACTTACCCAAGGCAGAAGTTGGGATATCTGATAGTCATTTGGTTTCA
AGCCAAGTTCAACATGGCATTGGCGAAACAAGTTTCTCTTCCATGGGACCTCTGGGGGGGAGTCTGATGTCTAATTCCGGCCGTATAGGATACTCTGGAAGCATCTCTCT
TCGGTCTGACAGCAGCACGACCAGCACCCGTTCCTTTGCCTTTCCCATATTACAATCCGAGTGGAATAGTAGTCCTGTTAGAATGGCTAAAGCTGATCGAAAGCATCTAC
GGAAGCATAGGGGTTGGAAACAACGCCTTCTGTGCTGTAGATTCTGA
mRNA sequence
ATGGTGACCCGGCCGTCGGCCATGTCTTCGAAATCTAAATCCACGGCGGCGGCTCAGGATGGCATATGGGAGAGAACACAGAGAGAGGAAGCTTTTCGATTTGGCTATCG
TGTGAAAGTTGTCAATAAGAGAAAGAAACAAGCTGGTAAATGCCAAGCAACTTGGGGGAAAGAGAAAATAATAGGATCAGAAGCCTGTTCGTCTAACGTGGGATCTCAAG
ATTGTCCGGATGATCAGAGCCAGAGGGGAGACAATGAATCAGAAGGCTGTTCGTCTCCCAAGACATCCATTAACGGAGACGATGAATTGAATGATTCGGGAGGAAAGGTC
CCACGAAAATATAGAAAAAGGGGACCGCGTCTGATCCGAAAGCCAGGGCAGAGCTTTTGGTTAAACCAGAGAGACGAGAGGAGGCAGCCGAAAATCAGATACTGTACAGA
AGATATTCAGAAACCAGAATCGGCAGCAGCGTTAGAGGTAGAGAGAAAAGCCTCCGGGGGTACACTTTCTGACATGATTGTCAGCCCCAGAAAAAGGTGTCTCAACAGTT
CGTTCTTGGTTAATTTAGGGGGATGGTCAAATCAAACGCTCTCCAATCTCGCAGGGACTGGAAGAGCATTCGGGATTGCTGCTCGACTTAAAAAGCTACATTCGAGCAGC
TTCTTCGGGATTGGAATTCGGGTCGTGCAGTGCAGACGGAAAGTTTTTGAATCTGAGGTGCAGTTGCCAGGGGGAAGGGAAGTCGGATGGGAAAACGGAAGATGGGCACA
CCTCTTCGTTCCAGGCTGCTTGGGTTTTCTGTACAGTTGCTTTAACACATTGTGGAATGCATTTCCCTCTGTTCCTATAGGCGCCAAGGGTGAGCCTGTAGTTTGCCATT
CAAATGTTAGCCCCAAGTTTGTTCCCAAGTCTTTTGAATGTGATAATGATGCTCTTGCTTGTGGTGGGATGAGGCTTGAAGATCAGAAGGAACTTACGAGCCCTCTCAAA
GGCAATGAGCAGGGTGCCGATCAGTTATCACACAATAAGATTGCTGTAGATGGTTGGGTTGTATCTAAGCATGATTGTTTAGGTCTTGATGATTTTAATGACTATGATGA
GGTTAAAGCCTTTGTGTCACCGCTCACTAATTCTTCTAAAGTAGACTTGTTTGAGGAAGACTCGGAGTTATACATGGAAAAAAGTATTGTTGAATGTCAACTTCCTGAAC
TGATAGTTTGTTACAAAGAAAATATTTGCAATATTGTTAAGGATATTTGTATTGACGAGGGAGTTCCTTCTCGGGATAAGCTATTGGGTGGTAGTAGTTTGGATGAGAAG
GCTGAGTGCACCATTTTCCCTCCCGAGGAAGATTGGAAGGACGAATTGCGAAGAGAACCGGAGAAAAAGGATAATTTTGCTTCAGATGATTTAGAGCATACGGAATCTTT
TTGCAATAAGAATTCACCCCGTGATTTCGGGGATTTGGCTAGAATACCTGAGGCAGAATATGATGTGGCATATTTCACTGACAATGATATGCCAAATCTTCCAATGAAAG
AGTTGGTTGCAGAGAGCTTAAAGCCATTGATCAACAATAAGAATGACGTTCACCCTCAGTCTGAACAGGTTTCTATTGAAACTGCAAGTTTGGAGGTCCCAGATTTGGTA
TCTGTAGTTGAAGAATCTTTTAGTAACACCAAGGAAGAAATATCGGCATCCACCACTACAGCTCCAGCACCTGAAGATCCAAAAAATAGCCCTTCTGTGAATGATATATC
ATACAATAGTAAAGTGGATAATGGAAACATTACTTTTGATTTCAATTCTTTAGCATCTACAGTTAGTGATGGAATGGAGTGTCATGATAATGGCGACTTAAACTCTTCAG
CTCCGACGATCAGTGCCTCGTTTCATCATACTTGTAGTAACCCTAAATGTGTGGAATATGAAGACTTACCCAAGGCAGAAGTTGGGATATCTGATAGTCATTTGGTTTCA
AGCCAAGTTCAACATGGCATTGGCGAAACAAGTTTCTCTTCCATGGGACCTCTGGGGGGGAGTCTGATGTCTAATTCCGGCCGTATAGGATACTCTGGAAGCATCTCTCT
TCGGTCTGACAGCAGCACGACCAGCACCCGTTCCTTTGCCTTTCCCATATTACAATCCGAGTGGAATAGTAGTCCTGTTAGAATGGCTAAAGCTGATCGAAAGCATCTAC
GGAAGCATAGGGGTTGGAAACAACGCCTTCTGTGCTGTAGATTCTGA
Protein sequence
MVTRPSAMSSKSKSTAAAQDGIWERTQREEAFRFGYRVKVVNKRKKQAGKCQATWGKEKIIGSEACSSNVGSQDCPDDQSQRGDNESEGCSSPKTSINGDDELNDSGGKV
PRKYRKRGPRLIRKPGQSFWLNQRDERRQPKIRYCTEDIQKPESAAALEVERKASGGTLSDMIVSPRKRCLNSSFLVNLGGWSNQTLSNLAGTGRAFGIAARLKKLHSSS
FFGIGIRVVQCRRKVFESEVQLPGGREVGWENGRWAHLFVPGCLGFLYSCFNTLWNAFPSVPIGAKGEPVVCHSNVSPKFVPKSFECDNDALACGGMRLEDQKELTSPLK
GNEQGADQLSHNKIAVDGWVVSKHDCLGLDDFNDYDEVKAFVSPLTNSSKVDLFEEDSELYMEKSIVECQLPELIVCYKENICNIVKDICIDEGVPSRDKLLGGSSLDEK
AECTIFPPEEDWKDELRREPEKKDNFASDDLEHTESFCNKNSPRDFGDLARIPEAEYDVAYFTDNDMPNLPMKELVAESLKPLINNKNDVHPQSEQVSIETASLEVPDLV
SVVEESFSNTKEEISASTTTAPAPEDPKNSPSVNDISYNSKVDNGNITFDFNSLASTVSDGMECHDNGDLNSSAPTISASFHHTCSNPKCVEYEDLPKAEVGISDSHLVS
SQVQHGIGETSFSSMGPLGGSLMSNSGRIGYSGSISLRSDSSTTSTRSFAFPILQSEWNSSPVRMAKADRKHLRKHRGWKQRLLCCRF
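
The protein sequence should correspond to a codon-by-codon translation of the CDS. The sketch below checks this on short excerpts, assuming the standard nuclear genetic code (NCBI translation table 1), which this page does not state explicitly. The names `CODON_TABLE` and `translate` are illustrative only.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (assumed table 1).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a CDS string; whitespace and line breaks are ignored.

    Stop codons become '*'; codons with unexpected characters become 'X'.
    """
    cds = "".join(cds.split()).upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return "".join(
        CODON_TABLE.get(cds[i:i + 3], "X") for i in range(0, len(cds), 3)
    )


# First 42 nt and last 18 nt of the CDS listed above.
print(translate("ATGGTGACCCGGCCGTCGGCCATGTCTTCGAAATCTAAATCC"))  # MVTRPSAMSSKSKS
print(translate("CTGTGCTGTAGATTCTGA"))                          # LCCRF*
```

Both excerpts agree with the start (`MVTRPSAMSSKSKS`) and end (`LCCRF`) of the listed protein, and the final `TGA` codon is the stop.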