CuGenDBv2

Lag0040200 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0040200
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: 18S pre-ribosomal assembly protein gar2-related, putative isoform 2
Genome location: chr13:2851397..2856055
RNA-Seq Expression: Lag0040200
Synteny: Lag0040200
Gene Ontology terms:
  GO:0009786 - regulation of asymmetric cell division (biological process)
  GO:0005886 - plasma membrane (cellular component)
InterPro domains: IPR040378 - Protein BREAKING OF ASYMMETRY IN THE STOMATAL LINEAGE


Homology
GenBank top hits (e value, % identity, alignment)
KAG7032146.1 hypothetical protein SDJN02_06189, partial [Cucurbita argyrosperma subsp. argyrosperma] | e value: 2.9e-204 | % identity: 77.85
Query:  IKGEPVVCHSNASPKFVPKSFECDNDALDSGGMKLEDQKEITSPLKGNEDADHNNIAADGWVPTKRDCLGLVDFNDYDEVEAFVSPLTNSSKIDLFEEDS
        ++GEPVV HS+ASPKF+PKSFECDNDALDSGGMKLED K+ T  LK NE+A+H N             +GL D N++DEV+AFV  LTNSSK+DLFEEDS
Subjt:  IKGEPVVCHSNASPKFVPKSFECDNDALDSGGMKLEDQKEITSPLKGNEDADHNNIAADGWVPTKRDCLGLVDFNDYDEVEAFVSPLTNSSKIDLFEEDS

Query:  ELYMEKSIVECQLPELIVCYKENICNIVKDICIDEGVPSRDKLLCGS-SLDEKAVCAILPPEKDWKENLASELEKRDMFASDDSEHSESFGNKESPKQCD
        ELYMEKSIVECQLPELIVCYKEN CNIVKDICID+GVPSRDKLLCGS SLDEKAVC ILPPE+DWK+ LA  LE+ DMFASDDSEHSESFG K+SPKQ D
Subjt:  ELYMEKSIVECQLPELIVCYKENICNIVKDICIDEGVPSRDKLLCGS-SLDEKAVCAILPPEKDWKENLASELEKRDMFASDDSEHSESFGNKESPKQCD

Query:  SKDLARTPEAEYDVAYFTDNDILNLPMTDLVAEHLKPLINSMNEPHSQSEQVFIETTSLEVPVLACVAEES-CDPKEAISASTTSAPAPEEPKNSASVND
         +DLARTPEAEYDV YFTDNDILNLPMTDL  E +KPL N+ NEP+ QSEQVFIETTSLEVPVLACVAEES  D +E IS   +S  A EEPKNS S  D
Subjt:  SKDLARTPEAEYDVAYFTDNDILNLPMTDLVAEHLKPLINSMNEPHSQSEQVFIETTSLEVPVLACVAEES-CDPKEAISASTTSAPAPEEPKNSASVND

Query:  VSYDSKVDNGNITFDFNSLAFTASDGLERCDNGDLNSSAPSTSASVDCQDTSSSSNPLASADKCQVQCHDTSSNPRRVEYEDLLRVEYGDVVKAEVGNSD
        +SY+SK+D GNITFDFNS A TASDGLE CDNG LNSSAPSTSASVDC DT SSSN LASADKCQ  C+D SSNP+RVEYEDLLRVEY D+ KAEVGNSD
Subjt:  VSYDSKVDNGNITFDFNSLAFTASDGLERCDNGDLNSSAPSTSASVDCQDTSSSSNPLASADKCQVQCHDTSSNPRRVEYEDLLRVEYGDVVKAEVGNSD

Query:  SHPVSSQVQHGLGETSFSSMVPLGSLMSNSGRIGYSGSVSLRSDSSTTSTRSFAFPILQSEWNSSPVRMAKADRKHLRKHRGWKQGLLCCRF
        S+ VSSQVQHG+GE S SSMV LGSL+SNSGRIGYSGS+S RSDSSTTST SFAFPILQSEWNSSPVRMAKAD+KHLRK RGW+QGLLCCRF
Subjt:  SHPVSSQVQHGLGETSFSSMVPLGSLMSNSGRIGYSGSVSLRSDSSTTSTRSFAFPILQSEWNSSPVRMAKADRKHLRKHRGWKQGLLCCRF

XP_008446468.1 PREDICTED: uncharacterized protein LOC103489197 isoform X1 [Cucumis melo] | e value: 1.4e-206 | % identity: 77.8
Query:  IKGEPVVCHSNASPKFVPKSFECDNDALDSGGMKLEDQKEITSPLKGNEDADHNNIAADGWVPTKRDCLGLVDFNDYDEVEAFVSPLTNSSKIDLFEEDS
        ++GEP+VCHSNASPKFVPKSFECDND L+SGGMKLEDQKE TS LKGN DA HNN AADGWV  KR+CL L DFNDYD+V+AFVSPL NS K+DL EEDS
Subjt:  IKGEPVVCHSNASPKFVPKSFECDNDALDSGGMKLEDQKEITSPLKGNEDADHNNIAADGWVPTKRDCLGLVDFNDYDEVEAFVSPLTNSSKIDLFEEDS

Query:  ELYMEKSIVECQLPELIVCYKENICNIVKDICIDEGVPSRDKLLCGSSLDEKAVCAILPPEKDWKENLASELEKRDMFASDDSEHSESFGNKESPKQCDS
        ELYMEKSIVECQLPELIVCYKENICNIVKDICID+G P RDKL CGSSLDE+ VC+I PP KDWK+    EL++RDMFASDDSEHSESFG+K+SP QCDS
Subjt:  ELYMEKSIVECQLPELIVCYKENICNIVKDICIDEGVPSRDKLLCGSSLDEKAVCAILPPEKDWKENLASELEKRDMFASDDSEHSESFGNKESPKQCDS

Query:  KDLARTPEAEYDVAYFTDNDILNLPMTDLVAEHLKPLINSMNEPHSQSEQVFIETTSLEVPVLACVAEESC-DPKEAISASTTSAPAPEEPKNSASVNDV
        KDLA TPEAEYDVAYFTDND   +PMTDLV E LKPL ++  +PH QSEQV IETT  EVPVLA VA+ES  + +E  S S TSA   E+PKNS S N +
Subjt:  KDLARTPEAEYDVAYFTDNDILNLPMTDLVAEHLKPLINSMNEPHSQSEQVFIETTSLEVPVLACVAEESC-DPKEAISASTTSAPAPEEPKNSASVNDV

Query:  SYDSKVDNGNITFDFNSLAFTASDGLERCDNGDLNSSAPSTSASVDCQDTSSSSNPLASADKCQVQCHDTSSNPRRVEYEDLLRVEYGDVVKAEVGNSDS
        SY+SKVD GNITFDFNSLAFTASDGLERCDNGDLNSSAPSTSASV C++T +SSNPLASADK + QCH+TSSNP+RVEYEDL RVEY D+ K EVGN DS
Subjt:  SYDSKVDNGNITFDFNSLAFTASDGLERCDNGDLNSSAPSTSASVDCQDTSSSSNPLASADKCQVQCHDTSSNPRRVEYEDLLRVEYGDVVKAEVGNSDS

Query:  HPVSSQVQHGLGETSFSSMVPLGSLMSNSGRIGYSGSVSLRSDSSTTSTRSFAFPILQSEWNSSPVRMAKADRKHLRKHRGWKQGLLCCRF
        H VSS+VQ G+GETSF S+ PLGSLMSNSGRIGYSGS+S RSDSSTTSTRSFAFPILQ+EWNSSPVRMAK DRKHL+KHRGW+ G+LCCRF
Subjt:  HPVSSQVQHGLGETSFSSMVPLGSLMSNSGRIGYSGSVSLRSDSSTTSTRSFAFPILQSEWNSSPVRMAKADRKHLRKHRGWKQGLLCCRF

XP_022956433.1 uncharacterized protein LOC111458170 isoform X1 [Cucurbita moschata] | e value: 2.2e-204 | % identity: 77.85
Query:  IKGEPVVCHSNASPKFVPKSFECDNDALDSGGMKLEDQKEITSPLKGNEDADHNNIAADGWVPTKRDCLGLVDFNDYDEVEAFVSPLTNSSKIDLFEEDS
        ++GEPVV HS+ASPKF+PKSFECDNDALDSGGMKLED K+ T  LK NEDA+H N             +GL D N++DEV+AFV  +TNSSK+DLFEEDS
Subjt:  IKGEPVVCHSNASPKFVPKSFECDNDALDSGGMKLEDQKEITSPLKGNEDADHNNIAADGWVPTKRDCLGLVDFNDYDEVEAFVSPLTNSSKIDLFEEDS

Query:  ELYMEKSIVECQLPELIVCYKENICNIVKDICIDEGVPSRDKLLCG-SSLDEKAVCAILPPEKDWKENLASELEKRDMFASDDSEHSESFGNKESPKQCD
        ELYMEKSIVECQLPELIVCYKEN CNIVKDICID+GVPSRDKLLCG SSLDEKAVC ILPPE+DWK+ LA  LE+ DMFASDDSEHSESFG K+SPKQ D
Subjt:  ELYMEKSIVECQLPELIVCYKENICNIVKDICIDEGVPSRDKLLCG-SSLDEKAVCAILPPEKDWKENLASELEKRDMFASDDSEHSESFGNKESPKQCD

Query:  SKDLARTPEAEYDVAYFTDNDILNLPMTDLVAEHLKPLINSMNEPHSQSEQVFIETTSLEVPVLACVAEES-CDPKEAISASTTSAPAPEEPKNSASVND
         +DLARTPEAEYDV YFTDNDILNLPMTDL  E +KPL N+ NEP+ QSEQVFIETTSLEVPVLACVAEES  D +E IS   +S  A EEPKNS S  D
Subjt:  SKDLARTPEAEYDVAYFTDNDILNLPMTDLVAEHLKPLINSMNEPHSQSEQVFIETTSLEVPVLACVAEES-CDPKEAISASTTSAPAPEEPKNSASVND

Query:  VSYDSKVDNGNITFDFNSLAFTASDGLERCDNGDLNSSAPSTSASVDCQDTSSSSNPLASADKCQVQCHDTSSNPRRVEYEDLLRVEYGDVVKAEVGNSD
        +SY+SK+D GNITFDFNS A TASDGLE CDNG LNSSAPSTSASVDC D SSSSN LASADKCQ  C+D SSNP+RVEYEDLLRVEY D+ KAEVGNSD
Subjt:  VSYDSKVDNGNITFDFNSLAFTASDGLERCDNGDLNSSAPSTSASVDCQDTSSSSNPLASADKCQVQCHDTSSNPRRVEYEDLLRVEYGDVVKAEVGNSD

Query:  SHPVSSQVQHGLGETSFSSMVPLGSLMSNSGRIGYSGSVSLRSDSSTTSTRSFAFPILQSEWNSSPVRMAKADRKHLRKHRGWKQGLLCCRF
        S+ VSSQVQHG+GE S SSMV LGSL+SNSGRIGYSGS+S RSDSSTTST SFAFPILQSEWNSSPVRMAKAD+KHLRK RGW+QGLLCCRF
Subjt:  SHPVSSQVQHGLGETSFSSMVPLGSLMSNSGRIGYSGSVSLRSDSSTTSTRSFAFPILQSEWNSSPVRMAKADRKHLRKHRGWKQGLLCCRF

XP_038892052.1 uncharacterized protein LOC120081347 isoform X1 [Benincasa hispida] | e value: 4.4e-221 | % identity: 81.04
Query:  IKGEPVVCHSNASPKFVPKSFECDNDALDSGGMKLEDQKEITSPLKGNEDADHNNIAADGWVPTKRDCLGLVDFNDYDEVEAFVSPLTNSSKIDLFEEDS
        ++GEP+V HSNASP+FVPKSFECDNDA++SGGMKLEDQKE TS LKGNEDA+HNN AADGWV  KR+CL L DFN+YDEV+AFVSPLTNSSK+DLFEEDS
Subjt:  IKGEPVVCHSNASPKFVPKSFECDNDALDSGGMKLEDQKEITSPLKGNEDADHNNIAADGWVPTKRDCLGLVDFNDYDEVEAFVSPLTNSSKIDLFEEDS

Query:  ELYMEKSIVECQLPELIVCYKENICNIVKDICIDEGVPSRDKLLCGSSLDEKAVCAILPPEKDWKENLASELEKRDMFASDDSEHSESFGNKESPKQCDS
        ELYMEKS VECQLPELIVCYKENICNIVKDICID+GVPSRDKLLCGSSLDEK VC+ILPP   WK++L  ELEKRD++ASDDSEHSESFGNK+SPKQ DS
Subjt:  ELYMEKSIVECQLPELIVCYKENICNIVKDICIDEGVPSRDKLLCGSSLDEKAVCAILPPEKDWKENLASELEKRDMFASDDSEHSESFGNKESPKQCDS

Query:  KDLARTPEAEYDVAYFTDNDILNLPMTDLVAEHLKPLINSMNEPHSQSEQVFIETTSLEVPVLACVAEES-CDPKEAISASTTSAPAPEEPKNSASVNDV
         DL RTPEAEYDVAYFTDND   +PMTD V E LKPL N+  EPH +SEQVFIETTSLEVPVLACVAEES  D +E IS STTSA APEE KNS S NDV
Subjt:  KDLARTPEAEYDVAYFTDNDILNLPMTDLVAEHLKPLINSMNEPHSQSEQVFIETTSLEVPVLACVAEES-CDPKEAISASTTSAPAPEEPKNSASVNDV

Query:  SYDSKVDNGNITFDFNSLAFTASDGLERCDNGDLNSSAPSTSASVDCQDTSSSSNPLASADKCQVQCHDTSSNPRRVEYEDLLRVEYGDVVKAEVGNSDS
        SY+SKVD GNITFDFNSLA TASDGLE CDN DLN+SAPSTSASV CQ+T SSSNPLASADKCQ QCHDTS+NP+ VEYEDL R+EY D+ K EVGN DS
Subjt:  SYDSKVDNGNITFDFNSLAFTASDGLERCDNGDLNSSAPSTSASVDCQDTSSSSNPLASADKCQVQCHDTSSNPRRVEYEDLLRVEYGDVVKAEVGNSDS

Query:  HPVSSQVQHG----------LGETSFSSMVPLGSLMSNSGRIGYSGSVSLRSDSSTTSTRSFAFPILQSEWNSSPVRMAKADRKHLRKHRGWKQGLLCCR
        H VSSQVQHG          LGETSFSSMVPLGSLMSNSG IGYSGS+SLRSDSSTTSTRSFAFPILQSEWNSSPVRMAKADR+HLRKHRGW+QG+LCCR
Subjt:  HPVSSQVQHG----------LGETSFSSMVPLGSLMSNSGRIGYSGSVSLRSDSSTTSTRSFAFPILQSEWNSSPVRMAKADRKHLRKHRGWKQGLLCCR

Query:  F
        F
Subjt:  F

XP_038892056.1 uncharacterized protein LOC120081347 isoform X2 [Benincasa hispida] | e value: 2.5e-208 | % identity: 78.41
Query:  IKGEPVVCHSNASPKFVPKSFECDNDALDSGGMKLEDQKEITSPLKGNEDADHNNIAADGWVPTKRDCLGLVDFNDYDEVEAFVSPLTNSSKIDLFEEDS
        ++GEP+V HSNASP+FVPKSFECDNDA++SGGMKLEDQKE TS LKGNEDA+HNN AADGWV  KR+CL L DFN+YDEV+AFVSPLTNSSK+DLFEEDS
Subjt:  IKGEPVVCHSNASPKFVPKSFECDNDALDSGGMKLEDQKEITSPLKGNEDADHNNIAADGWVPTKRDCLGLVDFNDYDEVEAFVSPLTNSSKIDLFEEDS

Query:  ELYMEKSIVECQLPELIVCYKENICNIVKDICIDEGVPSRDKLLCGSSLDEKAVCAILPPEKDWKENLASELEKRDMFASDDSEHSESFGNKESPKQCDS
        ELYMEKS VECQLPELIVCYKENICNIVKDICID+GVPSRDKLLCGSSLDEK VC+ILPP   WK++L  ELEKRD++ASDDSEHSESFGNK+SPKQ DS
Subjt:  ELYMEKSIVECQLPELIVCYKENICNIVKDICIDEGVPSRDKLLCGSSLDEKAVCAILPPEKDWKENLASELEKRDMFASDDSEHSESFGNKESPKQCDS

Query:  KDLARTPEAEYDVAYFTDNDILNLPMTDLVAEHLKPLINSMNEPHSQSEQVFIETTSLEVPVLACVAEES-CDPKEAISASTTSAPAPEEPKNSASVNDV
         DL RTPEAEYDVAYFTDND   +PMTD V E LKPL N+  EPH +SEQVFIETTSLEVPVLACVAEES  D +E IS STTSA APEE KNS S NDV
Subjt:  KDLARTPEAEYDVAYFTDNDILNLPMTDLVAEHLKPLINSMNEPHSQSEQVFIETTSLEVPVLACVAEES-CDPKEAISASTTSAPAPEEPKNSASVNDV

Query:  SYDSKVDNGNITFDFNSLAFTASDGLERCDNGDLNSSAPSTSASVDCQDTSSSSNPLASADKCQVQCHDTSSNPRRVEYEDLLRVEYGDVVKAEVGNSDS
        SY+SKVD GNITFDFNSLA TASDGLE CDN DLN+SAPSTSASV CQ+T SSSNPLASADKCQ QCHDTS+NP+ VEYEDL R+EY D+ K EVGN DS
Subjt:  SYDSKVDNGNITFDFNSLAFTASDGLERCDNGDLNSSAPSTSASVDCQDTSSSSNPLASADKCQVQCHDTSSNPRRVEYEDLLRVEYGDVVKAEVGNSDS

Query:  HPVSSQVQHGLGETSFSSMVPLGSLMSNSGRIGYSGSVSLRSDSSTTSTRSFAFPILQSEWNSSPVRMAKADRKHLRKHRGWKQGLLCCRF
        H VSSQVQHG                      GYSGS+SLRSDSSTTSTRSFAFPILQSEWNSSPVRMAKADR+HLRKHRGW+QG+LCCRF
Subjt:  HPVSSQVQHGLGETSFSSMVPLGSLMSNSGRIGYSGSVSLRSDSSTTSTRSFAFPILQSEWNSSPVRMAKADRKHLRKHRGWKQGLLCCRF

TrEMBL top hits (e value, % identity, alignment)
A0A1S3BFZ0 uncharacterized protein LOC103489197 isoform X1 | e value: 6.7e-207 | % identity: 77.8
Query:  IKGEPVVCHSNASPKFVPKSFECDNDALDSGGMKLEDQKEITSPLKGNEDADHNNIAADGWVPTKRDCLGLVDFNDYDEVEAFVSPLTNSSKIDLFEEDS
        ++GEP+VCHSNASPKFVPKSFECDND L+SGGMKLEDQKE TS LKGN DA HNN AADGWV  KR+CL L DFNDYD+V+AFVSPL NS K+DL EEDS
Subjt:  IKGEPVVCHSNASPKFVPKSFECDNDALDSGGMKLEDQKEITSPLKGNEDADHNNIAADGWVPTKRDCLGLVDFNDYDEVEAFVSPLTNSSKIDLFEEDS

Query:  ELYMEKSIVECQLPELIVCYKENICNIVKDICIDEGVPSRDKLLCGSSLDEKAVCAILPPEKDWKENLASELEKRDMFASDDSEHSESFGNKESPKQCDS
        ELYMEKSIVECQLPELIVCYKENICNIVKDICID+G P RDKL CGSSLDE+ VC+I PP KDWK+    EL++RDMFASDDSEHSESFG+K+SP QCDS
Subjt:  ELYMEKSIVECQLPELIVCYKENICNIVKDICIDEGVPSRDKLLCGSSLDEKAVCAILPPEKDWKENLASELEKRDMFASDDSEHSESFGNKESPKQCDS

Query:  KDLARTPEAEYDVAYFTDNDILNLPMTDLVAEHLKPLINSMNEPHSQSEQVFIETTSLEVPVLACVAEESC-DPKEAISASTTSAPAPEEPKNSASVNDV
        KDLA TPEAEYDVAYFTDND   +PMTDLV E LKPL ++  +PH QSEQV IETT  EVPVLA VA+ES  + +E  S S TSA   E+PKNS S N +
Subjt:  KDLARTPEAEYDVAYFTDNDILNLPMTDLVAEHLKPLINSMNEPHSQSEQVFIETTSLEVPVLACVAEESC-DPKEAISASTTSAPAPEEPKNSASVNDV

Query:  SYDSKVDNGNITFDFNSLAFTASDGLERCDNGDLNSSAPSTSASVDCQDTSSSSNPLASADKCQVQCHDTSSNPRRVEYEDLLRVEYGDVVKAEVGNSDS
        SY+SKVD GNITFDFNSLAFTASDGLERCDNGDLNSSAPSTSASV C++T +SSNPLASADK + QCH+TSSNP+RVEYEDL RVEY D+ K EVGN DS
Subjt:  SYDSKVDNGNITFDFNSLAFTASDGLERCDNGDLNSSAPSTSASVDCQDTSSSSNPLASADKCQVQCHDTSSNPRRVEYEDLLRVEYGDVVKAEVGNSDS

Query:  HPVSSQVQHGLGETSFSSMVPLGSLMSNSGRIGYSGSVSLRSDSSTTSTRSFAFPILQSEWNSSPVRMAKADRKHLRKHRGWKQGLLCCRF
        H VSS+VQ G+GETSF S+ PLGSLMSNSGRIGYSGS+S RSDSSTTSTRSFAFPILQ+EWNSSPVRMAK DRKHL+KHRGW+ G+LCCRF
Subjt:  HPVSSQVQHGLGETSFSSMVPLGSLMSNSGRIGYSGSVSLRSDSSTTSTRSFAFPILQSEWNSSPVRMAKADRKHLRKHRGWKQGLLCCRF

A0A6J1GWB6 uncharacterized protein LOC111458170 isoform X2 | e value: 2.3e-199 | % identity: 76.63
Query:  IKGEPVVCHSNASPKFVPKSFECDNDALDSGGMKLEDQKEITSPLKGNEDADHNNIAADGWVPTKRDCLGLVDFNDYDEVEAFVSPLTNSSKIDLFEEDS
        ++GEPVV HS+ASPKF+PKSFECDNDALDSGGMKLED K+ T  LK NEDA+H N             +GL D N++DEV+AFV  +TNSSK+DLFEEDS
Subjt:  IKGEPVVCHSNASPKFVPKSFECDNDALDSGGMKLEDQKEITSPLKGNEDADHNNIAADGWVPTKRDCLGLVDFNDYDEVEAFVSPLTNSSKIDLFEEDS

Query:  ELYMEKSIVECQLPELIVCYKENICNIVKDICIDEGVPSRDKLLCG-SSLDEKAVCAILPPEKDWKENLASELEKRDMFASDDSEHSESFGNKESPKQCD
        ELYMEKSIVECQLPELIVCYKEN CNIVKDICID+GVPSRDKLLCG SSLDEKAVC ILPPE+DWK+ LA  LE+ DMFASDDSEHSESFG K+SPKQ D
Subjt:  ELYMEKSIVECQLPELIVCYKENICNIVKDICIDEGVPSRDKLLCG-SSLDEKAVCAILPPEKDWKENLASELEKRDMFASDDSEHSESFGNKESPKQCD

Query:  SKDLARTPEAEYDVAYFTDNDILNLPMTDLVAEHLKPLINSMNEPHSQSEQVFIETTSLEVPVLACVAEES-CDPKEAISASTTSAPAPEEPKNSASVND
         +DLARTPEAEYDV YFTDNDILNLPMTDL  E +KPL N+ NEP+ QSEQV      LEVPVLACVAEES  D +E IS   +S  A EEPKNS S  D
Subjt:  SKDLARTPEAEYDVAYFTDNDILNLPMTDLVAEHLKPLINSMNEPHSQSEQVFIETTSLEVPVLACVAEES-CDPKEAISASTTSAPAPEEPKNSASVND

Query:  VSYDSKVDNGNITFDFNSLAFTASDGLERCDNGDLNSSAPSTSASVDCQDTSSSSNPLASADKCQVQCHDTSSNPRRVEYEDLLRVEYGDVVKAEVGNSD
        +SY+SK+D GNITFDFNS A TASDGLE CDNG LNSSAPSTSASVDC D SSSSN LASADKCQ  C+D SSNP+RVEYEDLLRVEY D+ KAEVGNSD
Subjt:  VSYDSKVDNGNITFDFNSLAFTASDGLERCDNGDLNSSAPSTSASVDCQDTSSSSNPLASADKCQVQCHDTSSNPRRVEYEDLLRVEYGDVVKAEVGNSD

Query:  SHPVSSQVQHGLGETSFSSMVPLGSLMSNSGRIGYSGSVSLRSDSSTTSTRSFAFPILQSEWNSSPVRMAKADRKHLRKHRGWKQGLLCCRF
        S+ VSSQVQHG+GE S SSMV LGSL+SNSGRIGYSGS+S RSDSSTTST SFAFPILQSEWNSSPVRMAKAD+KHLRK RGW+QGLLCCRF
Subjt:  SHPVSSQVQHGLGETSFSSMVPLGSLMSNSGRIGYSGSVSLRSDSSTTSTRSFAFPILQSEWNSSPVRMAKADRKHLRKHRGWKQGLLCCRF

A0A6J1GWT7 uncharacterized protein LOC111458170 isoform X1 | e value: 1.1e-204 | % identity: 77.85
Query:  IKGEPVVCHSNASPKFVPKSFECDNDALDSGGMKLEDQKEITSPLKGNEDADHNNIAADGWVPTKRDCLGLVDFNDYDEVEAFVSPLTNSSKIDLFEEDS
        ++GEPVV HS+ASPKF+PKSFECDNDALDSGGMKLED K+ T  LK NEDA+H N             +GL D N++DEV+AFV  +TNSSK+DLFEEDS
Subjt:  IKGEPVVCHSNASPKFVPKSFECDNDALDSGGMKLEDQKEITSPLKGNEDADHNNIAADGWVPTKRDCLGLVDFNDYDEVEAFVSPLTNSSKIDLFEEDS

Query:  ELYMEKSIVECQLPELIVCYKENICNIVKDICIDEGVPSRDKLLCG-SSLDEKAVCAILPPEKDWKENLASELEKRDMFASDDSEHSESFGNKESPKQCD
        ELYMEKSIVECQLPELIVCYKEN CNIVKDICID+GVPSRDKLLCG SSLDEKAVC ILPPE+DWK+ LA  LE+ DMFASDDSEHSESFG K+SPKQ D
Subjt:  ELYMEKSIVECQLPELIVCYKENICNIVKDICIDEGVPSRDKLLCG-SSLDEKAVCAILPPEKDWKENLASELEKRDMFASDDSEHSESFGNKESPKQCD

Query:  SKDLARTPEAEYDVAYFTDNDILNLPMTDLVAEHLKPLINSMNEPHSQSEQVFIETTSLEVPVLACVAEES-CDPKEAISASTTSAPAPEEPKNSASVND
         +DLARTPEAEYDV YFTDNDILNLPMTDL  E +KPL N+ NEP+ QSEQVFIETTSLEVPVLACVAEES  D +E IS   +S  A EEPKNS S  D
Subjt:  SKDLARTPEAEYDVAYFTDNDILNLPMTDLVAEHLKPLINSMNEPHSQSEQVFIETTSLEVPVLACVAEES-CDPKEAISASTTSAPAPEEPKNSASVND

Query:  VSYDSKVDNGNITFDFNSLAFTASDGLERCDNGDLNSSAPSTSASVDCQDTSSSSNPLASADKCQVQCHDTSSNPRRVEYEDLLRVEYGDVVKAEVGNSD
        +SY+SK+D GNITFDFNS A TASDGLE CDNG LNSSAPSTSASVDC D SSSSN LASADKCQ  C+D SSNP+RVEYEDLLRVEY D+ KAEVGNSD
Subjt:  VSYDSKVDNGNITFDFNSLAFTASDGLERCDNGDLNSSAPSTSASVDCQDTSSSSNPLASADKCQVQCHDTSSNPRRVEYEDLLRVEYGDVVKAEVGNSD

Query:  SHPVSSQVQHGLGETSFSSMVPLGSLMSNSGRIGYSGSVSLRSDSSTTSTRSFAFPILQSEWNSSPVRMAKADRKHLRKHRGWKQGLLCCRF
        S+ VSSQVQHG+GE S SSMV LGSL+SNSGRIGYSGS+S RSDSSTTST SFAFPILQSEWNSSPVRMAKAD+KHLRK RGW+QGLLCCRF
Subjt:  SHPVSSQVQHGLGETSFSSMVPLGSLMSNSGRIGYSGSVSLRSDSSTTSTRSFAFPILQSEWNSSPVRMAKADRKHLRKHRGWKQGLLCCRF

A0A6J1GWU3 uncharacterized protein LOC111458170 isoform X3 | e value: 4.1e-196 | % identity: 75.61
Query:  IKGEPVVCHSNASPKFVPKSFECDNDALDSGGMKLEDQKEITSPLKGNEDADHNNIAADGWVPTKRDCLGLVDFNDYDEVEAFVSPLTNSSKIDLFEEDS
        ++GEPVV HS+ASPKF+PKSFECDNDALDSGGMKLED K+ T  LK NEDA+H N             +GL D N++DEV+AFV  +TNSSK+DLFEEDS
Subjt:  IKGEPVVCHSNASPKFVPKSFECDNDALDSGGMKLEDQKEITSPLKGNEDADHNNIAADGWVPTKRDCLGLVDFNDYDEVEAFVSPLTNSSKIDLFEEDS

Query:  ELYMEKSIVECQLPELIVCYKENICNIVKDICIDEGVPSRDKLLCG-SSLDEKAVCAILPPEKDWKENLASELEKRDMFASDDSEHSESFGNKESPKQCD
        ELYMEKSIVECQLPELIVCYKEN CNIVKDICID+GVPSRDKLLCG SSLDEKAVC ILPPE+DWK+ LA  LE+ DMFASDDSEHSESFG K+SPKQ D
Subjt:  ELYMEKSIVECQLPELIVCYKENICNIVKDICIDEGVPSRDKLLCG-SSLDEKAVCAILPPEKDWKENLASELEKRDMFASDDSEHSESFGNKESPKQCD

Query:  SKDLARTPEAEYDVAYFTDNDILNLPMTDLVAEHLKPLINSMNEPHSQSEQVFIETTSLEVPVLACVAEES-CDPKEAISASTTSAPAPEEPKNSASVND
         +DLARTPEAEYDV YFTDNDILNLPMTDL  E +KPL N+ NEP+ QSEQ           VLACVAEES  D +E IS   +S  A EEPKNS S  D
Subjt:  SKDLARTPEAEYDVAYFTDNDILNLPMTDLVAEHLKPLINSMNEPHSQSEQVFIETTSLEVPVLACVAEES-CDPKEAISASTTSAPAPEEPKNSASVND

Query:  VSYDSKVDNGNITFDFNSLAFTASDGLERCDNGDLNSSAPSTSASVDCQDTSSSSNPLASADKCQVQCHDTSSNPRRVEYEDLLRVEYGDVVKAEVGNSD
        +SY+SK+D GNITFDFNS A TASDGLE CDNG LNSSAPSTSASVDC D SSSSN LASADKCQ  C+D SSNP+RVEYEDLLRVEY D+ KAEVGNSD
Subjt:  VSYDSKVDNGNITFDFNSLAFTASDGLERCDNGDLNSSAPSTSASVDCQDTSSSSNPLASADKCQVQCHDTSSNPRRVEYEDLLRVEYGDVVKAEVGNSD

Query:  SHPVSSQVQHGLGETSFSSMVPLGSLMSNSGRIGYSGSVSLRSDSSTTSTRSFAFPILQSEWNSSPVRMAKADRKHLRKHRGWKQGLLCCRF
        S+ VSSQVQHG+GE S SSMV LGSL+SNSGRIGYSGS+S RSDSSTTST SFAFPILQSEWNSSPVRMAKAD+KHLRK RGW+QGLLCCRF
Subjt:  SHPVSSQVQHGLGETSFSSMVPLGSLMSNSGRIGYSGSVSLRSDSSTTSTRSFAFPILQSEWNSSPVRMAKADRKHLRKHRGWKQGLLCCRF

A0A6J1IHW9 uncharacterized protein LOC111477604 isoform X1 | e value: 4.1e-196 | % identity: 75.66
Query:  IKGEPVVCHSNASPKFVPKSFECDNDALDSGGMKLEDQKEITSPLKGNEDADHNNIAADGWVPTKRDCLGLVDFNDYDEVEAFVSPLTNSSKIDLFEEDS
        ++GEPVV HS+ASPKF+PKSFECDNDALDSGGMKLED K+ T  LKGNEDA+  N             +GL D N++DEV+AFV  LTNSSK+DLFEEDS
Subjt:  IKGEPVVCHSNASPKFVPKSFECDNDALDSGGMKLEDQKEITSPLKGNEDADHNNIAADGWVPTKRDCLGLVDFNDYDEVEAFVSPLTNSSKIDLFEEDS

Query:  ELYMEKSIVECQLPELIVCYKENICNIVKDICIDEGVPSRDKLLCG-SSLDEKAVCAILPPEKDWKENLASELEKRDMFASDDSEHSESFGNKESPKQCD
        ELYMEKSIVECQLPELIVCYKEN CNIVKDICID+GVPSRDKLLCG SSLDEKAVC ILPPE+DWK+ LA  LE+ DMFASDDSEHSESFG K+SPKQ D
Subjt:  ELYMEKSIVECQLPELIVCYKENICNIVKDICIDEGVPSRDKLLCG-SSLDEKAVCAILPPEKDWKENLASELEKRDMFASDDSEHSESFGNKESPKQCD

Query:  SKDLARTPEAEYDVAYFTDNDILNLPMTDLVAEHLKPLINSMNEPHSQSEQVFIETTSLEVPVLACVAEES-CDPKEAISAS-TTSAPAPEEPKNSASVN
         ++LARTP+AEYDV Y TDND+LNLP+TDL  E +KPL N+ NEP+ QSEQ           VLACVAEES  D +EAIS     S  A EEPKNS S  
Subjt:  SKDLARTPEAEYDVAYFTDNDILNLPMTDLVAEHLKPLINSMNEPHSQSEQVFIETTSLEVPVLACVAEES-CDPKEAISAS-TTSAPAPEEPKNSASVN

Query:  DVSYDSKVDNGNITFDFNSLAFTASDGLERCDNGDLNSSAPSTSASVDCQDTSSSSNPLASADKCQVQCHDTSSNPRRVEYEDLLRVEYGDVVKAEVGNS
        D+SY+SK+D GNITFDFNS A TASDGLE CDNGDLNSSAPSTSASVDC DT SSSN LASADKCQV C+D SSNP+RVEYEDLLRVEY D+ KAEVGNS
Subjt:  DVSYDSKVDNGNITFDFNSLAFTASDGLERCDNGDLNSSAPSTSASVDCQDTSSSSNPLASADKCQVQCHDTSSNPRRVEYEDLLRVEYGDVVKAEVGNS

Query:  DSHPVSSQVQHGLGETSFSSMVPLGSLMSNSGRIGYSGSVSLRSDSSTTSTRSFAFPILQSEWNSSPVRMAKADRKHLRKHRGWKQGLLCCRF
        DSH VSSQVQHG+GE S SSMV LGSL+SNSGRIGYSGS+S RSDSSTTSTRSFAFPILQSEWNSSPVRMAKAD+KHLRK RGW+QGLLCCRF
Subjt:  DSHPVSSQVQHGLGETSFSSMVPLGSLMSNSGRIGYSGSVSLRSDSSTTSTRSFAFPILQSEWNSSPVRMAKADRKHLRKHRGWKQGLLCCRF

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
AT1G13650.2 BEST Arabidopsis thaliana protein match is: 18S pre-ribosomal assembly protein gar2-related (TAIR:AT2G03810.4) | e value: 4.6e-06 | % identity: 31.13
Query:  DLNSSAPSTSASVDCQDTSSSSNPLASADKCQVQCHDTSSNPRRVEYEDLLRVEYGD----VVKAEVGNS--DSHPVSSQVQHGLGETSFSSMVPLGSLM
        ++N  +      V    TS S   L   D   ++   +  N       +   V +        ++E  NS  D++ +   + +G GE SF         +
Subjt:  DLNSSAPSTSASVDCQDTSSSSNPLASADKCQVQCHDTSSNPRRVEYEDLLRVEYGD----VVKAEVGNS--DSHPVSSQVQHGLGETSFSSMVPLGSLM

Query:  SNSGRIGYSGSVSLRSDSSTTSTRSFAFPILQSEWNSSPVRMAKADRKHLR
        +  G +  S ++S+RSD   TS  SFA PILQSEWNSSPVRM KA+   LR
Subjt:  SNSGRIGYSGSVSLRSDSSTTSTRSFAFPILQSEWNSSPVRMAKADRKHLR

AT2G03810.1 18S pre-ribosomal assembly protein gar2-related | e value: 6.5e-29 | % identity: 31.37
Query:  EEDSELYMEKSIVECQLPELIVCYKENICNIVKDICIDEGVPSRDKLLCGSSLDEKAVCAILPPEKDWKENLASELEKRDMFASDDS--EHSESFGNKES
        ++D   YM+K++  C LPE++VCYKEN  +IVKDIC+DEGVP ++K L G              EKD  ++ ++E    D+  +D +    SE+   ++S
Subjt:  EEDSELYMEKSIVECQLPELIVCYKENICNIVKDICIDEGVPSRDKLLCGSSLDEKAVCAILPPEKDWKENLASELEKRDMFASDDS--EHSESFGNKES

Query:  PKQCDSKDLARTPEAEYDVAYFTDNDILNLPMTD--------LVAEHLKPLINSMNEPHSQSEQVFIETTSLEVPVLACVAEESCDPKEAISASTTSAPA
          + D  +     + + DV   +  D  +   T         +V E +K    S     S SE    E +  EV +      +  D KE ++     +  
Subjt:  PKQCDSKDLARTPEAEYDVAYFTDNDILNLPMTD--------LVAEHLKPLINSMNEPHSQSEQVFIETTSLEVPVLACVAEESCDPKEAISASTTSAPA

Query:  PEEPK-NSASVNDVSYDSKVDNGNITFDFNSLAFTASD-GLERCDNGDLNSSAPSTSASVDCQDTSSSSNPLASADKCQVQCHDTSSNPRRVEYEDLLRV
         E+   N  +++  S++ +  +     +  SL  TA +  LE+ +         S+ ++   Q+ + + N     +K + + H   +      YED    
Subjt:  PEEPK-NSASVNDVSYDSKVDNGNITFDFNSLAFTASD-GLERCDNGDLNSSAPSTSASVDCQDTSSSSNPLASADKCQVQCHDTSSNPRRVEYEDLLRV

Query:  EYGDVVKAEVGNSDSHPVSSQVQHGLGETSFSS--MVPLGSLMSNSGRIGYSGSVSLRSDSSTTSTRSFAFPILQSEWNSSPVRMAKADRKHLRKHRGWK
                     D    S       GETSFS+   V +   ++ SG I YSGS+S+RSD+STTS RSFAFPILQSEWNSSPVRMAKAD++  R+  GW+
Subjt:  EYGDVVKAEVGNSDSHPVSSQVQHGLGETSFSS--MVPLGSLMSNSGRIGYSGSVSLRSDSSTTSTRSFAFPILQSEWNSSPVRMAKADRKHLRKHRGWK

Query:  QGLLCCRF
          LLCCRF
Subjt:  QGLLCCRF

AT2G03810.2 18S pre-ribosomal assembly protein gar2-related | e value: 6.5e-29 | % identity: 31.37
Query:  EEDSELYMEKSIVECQLPELIVCYKENICNIVKDICIDEGVPSRDKLLCGSSLDEKAVCAILPPEKDWKENLASELEKRDMFASDDS--EHSESFGNKES
        ++D   YM+K++  C LPE++VCYKEN  +IVKDIC+DEGVP ++K L G              EKD  ++ ++E    D+  +D +    SE+   ++S
Subjt:  EEDSELYMEKSIVECQLPELIVCYKENICNIVKDICIDEGVPSRDKLLCGSSLDEKAVCAILPPEKDWKENLASELEKRDMFASDDS--EHSESFGNKES

Query:  PKQCDSKDLARTPEAEYDVAYFTDNDILNLPMTD--------LVAEHLKPLINSMNEPHSQSEQVFIETTSLEVPVLACVAEESCDPKEAISASTTSAPA
          + D  +     + + DV   +  D  +   T         +V E +K    S     S SE    E +  EV +      +  D KE ++     +  
Subjt:  PKQCDSKDLARTPEAEYDVAYFTDNDILNLPMTD--------LVAEHLKPLINSMNEPHSQSEQVFIETTSLEVPVLACVAEESCDPKEAISASTTSAPA

Query:  PEEPK-NSASVNDVSYDSKVDNGNITFDFNSLAFTASD-GLERCDNGDLNSSAPSTSASVDCQDTSSSSNPLASADKCQVQCHDTSSNPRRVEYEDLLRV
         E+   N  +++  S++ +  +     +  SL  TA +  LE+ +         S+ ++   Q+ + + N     +K + + H   +      YED    
Subjt:  PEEPK-NSASVNDVSYDSKVDNGNITFDFNSLAFTASD-GLERCDNGDLNSSAPSTSASVDCQDTSSSSNPLASADKCQVQCHDTSSNPRRVEYEDLLRV

Query:  EYGDVVKAEVGNSDSHPVSSQVQHGLGETSFSS--MVPLGSLMSNSGRIGYSGSVSLRSDSSTTSTRSFAFPILQSEWNSSPVRMAKADRKHLRKHRGWK
                     D    S       GETSFS+   V +   ++ SG I YSGS+S+RSD+STTS RSFAFPILQSEWNSSPVRMAKAD++  R+  GW+
Subjt:  EYGDVVKAEVGNSDSHPVSSQVQHGLGETSFSS--MVPLGSLMSNSGRIGYSGSVSLRSDSSTTSTRSFAFPILQSEWNSSPVRMAKADRKHLRKHRGWK

Query:  QGLLCCRF
          LLCCRF
Subjt:  QGLLCCRF

AT2G03810.3 18S pre-ribosomal assembly protein gar2-related | e value: 6.5e-29 | % identity: 31.37
Query:  EEDSELYMEKSIVECQLPELIVCYKENICNIVKDICIDEGVPSRDKLLCGSSLDEKAVCAILPPEKDWKENLASELEKRDMFASDDS--EHSESFGNKES
        ++D   YM+K++  C LPE++VCYKEN  +IVKDIC+DEGVP ++K L G              EKD  ++ ++E    D+  +D +    SE+   ++S
Subjt:  EEDSELYMEKSIVECQLPELIVCYKENICNIVKDICIDEGVPSRDKLLCGSSLDEKAVCAILPPEKDWKENLASELEKRDMFASDDS--EHSESFGNKES

Query:  PKQCDSKDLARTPEAEYDVAYFTDNDILNLPMTD--------LVAEHLKPLINSMNEPHSQSEQVFIETTSLEVPVLACVAEESCDPKEAISASTTSAPA
          + D  +     + + DV   +  D  +   T         +V E +K    S     S SE    E +  EV +      +  D KE ++     +  
Subjt:  PKQCDSKDLARTPEAEYDVAYFTDNDILNLPMTD--------LVAEHLKPLINSMNEPHSQSEQVFIETTSLEVPVLACVAEESCDPKEAISASTTSAPA

Query:  PEEPK-NSASVNDVSYDSKVDNGNITFDFNSLAFTASD-GLERCDNGDLNSSAPSTSASVDCQDTSSSSNPLASADKCQVQCHDTSSNPRRVEYEDLLRV
         E+   N  +++  S++ +  +     +  SL  TA +  LE+ +         S+ ++   Q+ + + N     +K + + H   +      YED    
Subjt:  PEEPK-NSASVNDVSYDSKVDNGNITFDFNSLAFTASD-GLERCDNGDLNSSAPSTSASVDCQDTSSSSNPLASADKCQVQCHDTSSNPRRVEYEDLLRV

Query:  EYGDVVKAEVGNSDSHPVSSQVQHGLGETSFSS--MVPLGSLMSNSGRIGYSGSVSLRSDSSTTSTRSFAFPILQSEWNSSPVRMAKADRKHLRKHRGWK
                     D    S       GETSFS+   V +   ++ SG I YSGS+S+RSD+STTS RSFAFPILQSEWNSSPVRMAKAD++  R+  GW+
Subjt:  EYGDVVKAEVGNSDSHPVSSQVQHGLGETSFSS--MVPLGSLMSNSGRIGYSGSVSLRSDSSTTSTRSFAFPILQSEWNSSPVRMAKADRKHLRKHRGWK

Query:  QGLLCCRF
          LLCCRF
Subjt:  QGLLCCRF

AT2G03810.4 18S pre-ribosomal assembly protein gar2-related | e value: 6.5e-29 | % identity: 31.37
Query:  EEDSELYMEKSIVECQLPELIVCYKENICNIVKDICIDEGVPSRDKLLCGSSLDEKAVCAILPPEKDWKENLASELEKRDMFASDDS--EHSESFGNKES
        ++D   YM+K++  C LPE++VCYKEN  +IVKDIC+DEGVP ++K L G              EKD  ++ ++E    D+  +D +    SE+   ++S
Subjt:  EEDSELYMEKSIVECQLPELIVCYKENICNIVKDICIDEGVPSRDKLLCGSSLDEKAVCAILPPEKDWKENLASELEKRDMFASDDS--EHSESFGNKES

Query:  PKQCDSKDLARTPEAEYDVAYFTDNDILNLPMTD--------LVAEHLKPLINSMNEPHSQSEQVFIETTSLEVPVLACVAEESCDPKEAISASTTSAPA
          + D  +     + + DV   +  D  +   T         +V E +K    S     S SE    E +  EV +      +  D KE ++     +  
Subjt:  PKQCDSKDLARTPEAEYDVAYFTDNDILNLPMTD--------LVAEHLKPLINSMNEPHSQSEQVFIETTSLEVPVLACVAEESCDPKEAISASTTSAPA

Query:  PEEPK-NSASVNDVSYDSKVDNGNITFDFNSLAFTASD-GLERCDNGDLNSSAPSTSASVDCQDTSSSSNPLASADKCQVQCHDTSSNPRRVEYEDLLRV
         E+   N  +++  S++ +  +     +  SL  TA +  LE+ +         S+ ++   Q+ + + N     +K + + H   +      YED    
Subjt:  PEEPK-NSASVNDVSYDSKVDNGNITFDFNSLAFTASD-GLERCDNGDLNSSAPSTSASVDCQDTSSSSNPLASADKCQVQCHDTSSNPRRVEYEDLLRV

Query:  EYGDVVKAEVGNSDSHPVSSQVQHGLGETSFSS--MVPLGSLMSNSGRIGYSGSVSLRSDSSTTSTRSFAFPILQSEWNSSPVRMAKADRKHLRKHRGWK
                     D    S       GETSFS+   V +   ++ SG I YSGS+S+RSD+STTS RSFAFPILQSEWNSSPVRMAKAD++  R+  GW+
Subjt:  EYGDVVKAEVGNSDSHPVSSQVQHGLGETSFSS--MVPLGSLMSNSGRIGYSGSVSLRSDSSTTSTRSFAFPILQSEWNSSPVRMAKADRKHLRKHRGWK

Query:  QGLLCCRF
          LLCCRF
Subjt:  QGLLCCRF


Sequences
CDS sequence
ATGAAAAAGGGGACCGATTCTGATCCGAAAACCAGGCAGAGCAGAGCTTGTGGAAAAGATGAAGCCATTAACAAAAACAAATTAAAATTCTCCAGCGGTTGCTTTTGGGG
GGGTGGAAATGGAAACATGTCCGGTTCTGACTTTGAAACAGTTGAAAAAGCCCTCTGGGGTACACTTTCTGACATGATTGTCAGCCCCAGAAAAGGCGCACCTCTTCGTT
TCAGGCTGCTTGAGGGGCTCTTCTGTACAGTCTCCTTAACACATTGTGGAGTGCATTTTCCTCTGTTCCTACAGGCCTCAAGGACACTGGATTTGCTTGAAATTTTTTCT
TACAGAGCAGTCCATGTGTACAGATTATTGTTTACAGTATTTAATAGCTGCCCCTTGGAAATTAAGTTATTGACTCGATCTCCTCTTCTGATCAAGGGTGAGCCCGTAGT
TTGCCATTCAAATGCTAGCCCCAAGTTTGTTCCCAAGTCTTTTGAATGTGATAATGATGCTCTTGATTCTGGTGGGATGAAGCTTGAAGATCAGAAAGAAATCACAAGCC
CTCTCAAAGGTAATGAGGATGCCGATCACAATAATATTGCTGCAGATGGTTGGGTTCCGACTAAGCGTGACTGTTTAGGTCTTGTTGATTTTAATGACTATGACGAGGTT
GAAGCCTTTGTGTCACCGCTCACTAATTCTTCTAAAATAGACTTGTTTGAGGAAGATTCGGAATTATACATGGAAAAGAGTATTGTTGAATGCCAACTTCCTGAACTGAT
AGTTTGTTACAAAGAAAATATTTGCAATATTGTGAAGGATATTTGTATTGACGAAGGAGTACCTTCTCGGGATAAGCTCTTGTGTGGTAGTAGTTTGGATGAGAAGGCTG
TCTGTGCCATTCTCCCTCCTGAGAAAGATTGGAAGGAAAACTTGGCAAGTGAACTGGAGAAGAGAGATATGTTTGCTTCAGACGATTCGGAGCATTCGGAATCTTTTGGC
AATAAGGAGTCACCCAAACAATGCGATTCCAAGGATTTGGCTAGAACACCTGAGGCAGAATATGATGTGGCATATTTTACTGATAATGATATATTAAATCTTCCAATGAC
AGACTTGGTTGCAGAGCACTTAAAGCCATTGATCAACAGTATGAATGAGCCTCACTCTCAGTCTGAACAGGTGTTTATTGAAACTACGAGTTTGGAAGTCCCTGTTTTGG
CATGTGTAGCTGAAGAATCTTGTGACCCCAAAGAAGCAATATCAGCGTCCACTACTTCAGCTCCAGCACCTGAAGAGCCCAAAAATAGCGCTTCTGTGAATGATGTATCA
TACGATAGTAAAGTGGACAATGGAAACATTACTTTTGATTTTAATTCTTTAGCATTCACAGCTAGTGATGGACTGGAGCGTTGTGATAATGGTGACTTAAACTCTTCAGC
TCCTTCGACCAGTGCCTCAGTGGACTGCCAAGACACTAGCAGCAGCTCCAACCCGTTAGCTTCCGCTGATAAATGTCAAGTGCAGTGTCATGATACTAGCAGTAACCCCA
GACGTGTGGAATATGAAGACTTACTACGTGTGGAATACGGAGACGTAGTGAAGGCAGAAGTTGGGAATTCTGATAGTCATCCAGTTTCAAGCCAAGTTCAACATGGCTTA
GGTGAAACGAGTTTCTCTTCTATGGTACCTTTGGGGAGTCTGATGTCTAATTCAGGTCGTATAGGTTACTCTGGCAGTGTCTCTCTTCGGTCTGATAGCAGCACAACCAG
CACCCGTTCCTTTGCCTTTCCCATATTACAATCCGAGTGGAATAGTAGTCCTGTTAGAATGGCTAAAGCTGATCGAAAGCATCTACGGAAGCATAGGGGTTGGAAACAAG
GCCTTCTGTGCTGTAGATTCTGA
mRNA sequence
ATGAAAAAGGGGACCGATTCTGATCCGAAAACCAGGCAGAGCAGAGCTTGTGGAAAAGATGAAGCCATTAACAAAAACAAATTAAAATTCTCCAGCGGTTGCTTTTGGGG
GGGTGGAAATGGAAACATGTCCGGTTCTGACTTTGAAACAGTTGAAAAAGCCCTCTGGGGTACACTTTCTGACATGATTGTCAGCCCCAGAAAAGGCGCACCTCTTCGTT
TCAGGCTGCTTGAGGGGCTCTTCTGTACAGTCTCCTTAACACATTGTGGAGTGCATTTTCCTCTGTTCCTACAGGCCTCAAGGACACTGGATTTGCTTGAAATTTTTTCT
TACAGAGCAGTCCATGTGTACAGATTATTGTTTACAGTATTTAATAGCTGCCCCTTGGAAATTAAGTTATTGACTCGATCTCCTCTTCTGATCAAGGGTGAGCCCGTAGT
TTGCCATTCAAATGCTAGCCCCAAGTTTGTTCCCAAGTCTTTTGAATGTGATAATGATGCTCTTGATTCTGGTGGGATGAAGCTTGAAGATCAGAAAGAAATCACAAGCC
CTCTCAAAGGTAATGAGGATGCCGATCACAATAATATTGCTGCAGATGGTTGGGTTCCGACTAAGCGTGACTGTTTAGGTCTTGTTGATTTTAATGACTATGACGAGGTT
GAAGCCTTTGTGTCACCGCTCACTAATTCTTCTAAAATAGACTTGTTTGAGGAAGATTCGGAATTATACATGGAAAAGAGTATTGTTGAATGCCAACTTCCTGAACTGAT
AGTTTGTTACAAAGAAAATATTTGCAATATTGTGAAGGATATTTGTATTGACGAAGGAGTACCTTCTCGGGATAAGCTCTTGTGTGGTAGTAGTTTGGATGAGAAGGCTG
TCTGTGCCATTCTCCCTCCTGAGAAAGATTGGAAGGAAAACTTGGCAAGTGAACTGGAGAAGAGAGATATGTTTGCTTCAGACGATTCGGAGCATTCGGAATCTTTTGGC
AATAAGGAGTCACCCAAACAATGCGATTCCAAGGATTTGGCTAGAACACCTGAGGCAGAATATGATGTGGCATATTTTACTGATAATGATATATTAAATCTTCCAATGAC
AGACTTGGTTGCAGAGCACTTAAAGCCATTGATCAACAGTATGAATGAGCCTCACTCTCAGTCTGAACAGGTGTTTATTGAAACTACGAGTTTGGAAGTCCCTGTTTTGG
CATGTGTAGCTGAAGAATCTTGTGACCCCAAAGAAGCAATATCAGCGTCCACTACTTCAGCTCCAGCACCTGAAGAGCCCAAAAATAGCGCTTCTGTGAATGATGTATCA
TACGATAGTAAAGTGGACAATGGAAACATTACTTTTGATTTTAATTCTTTAGCATTCACAGCTAGTGATGGACTGGAGCGTTGTGATAATGGTGACTTAAACTCTTCAGC
TCCTTCGACCAGTGCCTCAGTGGACTGCCAAGACACTAGCAGCAGCTCCAACCCGTTAGCTTCCGCTGATAAATGTCAAGTGCAGTGTCATGATACTAGCAGTAACCCCA
GACGTGTGGAATATGAAGACTTACTACGTGTGGAATACGGAGACGTAGTGAAGGCAGAAGTTGGGAATTCTGATAGTCATCCAGTTTCAAGCCAAGTTCAACATGGCTTA
GGTGAAACGAGTTTCTCTTCTATGGTACCTTTGGGGAGTCTGATGTCTAATTCAGGTCGTATAGGTTACTCTGGCAGTGTCTCTCTTCGGTCTGATAGCAGCACAACCAG
CACCCGTTCCTTTGCCTTTCCCATATTACAATCCGAGTGGAATAGTAGTCCTGTTAGAATGGCTAAAGCTGATCGAAAGCATCTACGGAAGCATAGGGGTTGGAAACAAG
GCCTTCTGTGCTGTAGATTCTGA
Protein sequence
MKKGTDSDPKTRQSRACGKDEAINKNKLKFSSGCFWGGGNGNMSGSDFETVEKALWGTLSDMIVSPRKGAPLRFRLLEGLFCTVSLTHCGVHFPLFLQASRTLDLLEIFS
YRAVHVYRLLFTVFNSCPLEIKLLTRSPLLIKGEPVVCHSNASPKFVPKSFECDNDALDSGGMKLEDQKEITSPLKGNEDADHNNIAADGWVPTKRDCLGLVDFNDYDEV
EAFVSPLTNSSKIDLFEEDSELYMEKSIVECQLPELIVCYKENICNIVKDICIDEGVPSRDKLLCGSSLDEKAVCAILPPEKDWKENLASELEKRDMFASDDSEHSESFG
NKESPKQCDSKDLARTPEAEYDVAYFTDNDILNLPMTDLVAEHLKPLINSMNEPHSQSEQVFIETTSLEVPVLACVAEESCDPKEAISASTTSAPAPEEPKNSASVNDVS
YDSKVDNGNITFDFNSLAFTASDGLERCDNGDLNSSAPSTSASVDCQDTSSSSNPLASADKCQVQCHDTSSNPRRVEYEDLLRVEYGDVVKAEVGNSDSHPVSSQVQHGL
GETSFSSMVPLGSLMSNSGRIGYSGSVSLRSDSSTTSTRSFAFPILQSEWNSSPVRMAKADRKHLRKHRGWKQGLLCCRF