; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmoCh07G011530 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene IDCmoCh07G011530
OrganismCucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
DescriptionZinc finger family protein, putative isoform 1
Genome locationCmo_Chr07:6194540..6198640
RNA-Seq ExpressionCmoCh07G011530
SyntenyCmoCh07G011530
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0016021 - integral component of membrane (cellular component)
GO:0004175 - endopeptidase activity (molecular function)
GO:0008236 - serine-type peptidase activity (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6595439.1 hypothetical protein SDJN03_11992, partial [Cucurbita argyrosperma subsp. sororia]3.7e-27999.4Show/hide
Query:  MGKNDGEHPPPSAVGSAPSQGRCCSGCVSIRRLIGFRCIFILLLSVALFVSAVFWLPPFLHYSDQKDLGLNPSYRGHDIVATFVVERPVSLLQDNIERLR
        MGKNDGEHPPPSAVGSAPSQGRCCSGCVSIRRLIGFRCIFILLLSVALFVSAVFWLPPFLHYSDQKDLGLNPSYRGHDIVATFVVERPVSLLQDNIERLR
Subjt:  MGKNDGEHPPPSAVGSAPSQGRCCSGCVSIRRLIGFRCIFILLLSVALFVSAVFWLPPFLHYSDQKDLGLNPSYRGHDIVATFVVERPVSLLQDNIERLR

Query:  TDIFEEFPIPSIKVDILSLNSLSGSNRTKVVFGIDPDTDDPEIPSTYLSLIRSTCASVVTNQSFLRITKSMFGEAFSFEVLKFPGGITIIPPQSAFLLQK
        TDIFEEFPIPSIKVDILSLNSLSGSNRTKVVFGIDPDTDDPEIPSTYLSLIRSTCASVVTNQSFLRITKSMFGEAFSFEVLKFPGGITIIPPQSAFLLQK
Subjt:  TDIFEEFPIPSIKVDILSLNSLSGSNRTKVVFGIDPDTDDPEIPSTYLSLIRSTCASVVTNQSFLRITKSMFGEAFSFEVLKFPGGITIIPPQSAFLLQK

Query:  VQILFNFTLNFSIHQIQVHFSELTSQLDAGLRLAPYEILYIKLWNAEGSTVTAPTIVQSSVLLEVGNTPSMQRLKQLAQTISVSNSSNLGLNNTEFGKVK
        VQILFNFTLNFSIHQIQVHFSELTSQLDAGLRLAPYEILYIKLWNAEGSTVTAPTIVQSSVLLEVGNTPSMQRLKQLAQTISVSNSSNLGLNNTEFGKVK
Subjt:  VQILFNFTLNFSIHQIQVHFSELTSQLDAGLRLAPYEILYIKLWNAEGSTVTAPTIVQSSVLLEVGNTPSMQRLKQLAQTISVSNSSNLGLNNTEFGKVK

Query:  QVRLSSILKHSLNGMDGKGPIRSPSPAPTPQPHNFHHPPSHHHHHHHAPLTPVISPAPAPETGAPEYGLSAPKSAASPKRSYEAKPPGCQYKRKSGRKEG
        QVRLSSILKHSLNGMDGKGPIRSPSPAPTPQPHNFHHPPSHHHHHHHAPLTPVISPAPAPETGAPEYGLSAPKSAASPKRSYEAKPPGCQYKRKSGRKEG
Subjt:  QVRLSSILKHSLNGMDGKGPIRSPSPAPTPQPHNFHHPPSHHHHHHHAPLTPVISPAPAPETGAPEYGLSAPKSAASPKRSYEAKPPGCQYKRKSGRKEG

Query:  KQPHLSPLASPSISPVHSAASPSQQHHVSPTQASTPLPSVIYAHVQPPSKSDSNHPEKSTTSPSIVPSPSPSPSPSPSSAHHWCMITRWGFTLSLIVAFY
        KQPHLSPLASPSISPVHSAASPSQQHH+SPTQASTPLPSVIYAHVQPPSKSDSNHPEKSTTSPSIV  PSPSPSPSPSSAHHWCMITRWGFTLSLIVAFY
Subjt:  KQPHLSPLASPSISPVHSAASPSQQHHVSPTQASTPLPSVIYAHVQPPSKSDSNHPEKSTTSPSIVPSPSPSPSPSPSSAHHWCMITRWGFTLSLIVAFY

Query:  M
        M
Subjt:  M

XP_022925200.1 uncharacterized protein LOC111432513 isoform X1 [Cucurbita moschata]1.0e-281100Show/hide
Query:  MGKNDGEHPPPSAVGSAPSQGRCCSGCVSIRRLIGFRCIFILLLSVALFVSAVFWLPPFLHYSDQKDLGLNPSYRGHDIVATFVVERPVSLLQDNIERLR
        MGKNDGEHPPPSAVGSAPSQGRCCSGCVSIRRLIGFRCIFILLLSVALFVSAVFWLPPFLHYSDQKDLGLNPSYRGHDIVATFVVERPVSLLQDNIERLR
Subjt:  MGKNDGEHPPPSAVGSAPSQGRCCSGCVSIRRLIGFRCIFILLLSVALFVSAVFWLPPFLHYSDQKDLGLNPSYRGHDIVATFVVERPVSLLQDNIERLR

Query:  TDIFEEFPIPSIKVDILSLNSLSGSNRTKVVFGIDPDTDDPEIPSTYLSLIRSTCASVVTNQSFLRITKSMFGEAFSFEVLKFPGGITIIPPQSAFLLQK
        TDIFEEFPIPSIKVDILSLNSLSGSNRTKVVFGIDPDTDDPEIPSTYLSLIRSTCASVVTNQSFLRITKSMFGEAFSFEVLKFPGGITIIPPQSAFLLQK
Subjt:  TDIFEEFPIPSIKVDILSLNSLSGSNRTKVVFGIDPDTDDPEIPSTYLSLIRSTCASVVTNQSFLRITKSMFGEAFSFEVLKFPGGITIIPPQSAFLLQK

Query:  VQILFNFTLNFSIHQIQVHFSELTSQLDAGLRLAPYEILYIKLWNAEGSTVTAPTIVQSSVLLEVGNTPSMQRLKQLAQTISVSNSSNLGLNNTEFGKVK
        VQILFNFTLNFSIHQIQVHFSELTSQLDAGLRLAPYEILYIKLWNAEGSTVTAPTIVQSSVLLEVGNTPSMQRLKQLAQTISVSNSSNLGLNNTEFGKVK
Subjt:  VQILFNFTLNFSIHQIQVHFSELTSQLDAGLRLAPYEILYIKLWNAEGSTVTAPTIVQSSVLLEVGNTPSMQRLKQLAQTISVSNSSNLGLNNTEFGKVK

Query:  QVRLSSILKHSLNGMDGKGPIRSPSPAPTPQPHNFHHPPSHHHHHHHAPLTPVISPAPAPETGAPEYGLSAPKSAASPKRSYEAKPPGCQYKRKSGRKEG
        QVRLSSILKHSLNGMDGKGPIRSPSPAPTPQPHNFHHPPSHHHHHHHAPLTPVISPAPAPETGAPEYGLSAPKSAASPKRSYEAKPPGCQYKRKSGRKEG
Subjt:  QVRLSSILKHSLNGMDGKGPIRSPSPAPTPQPHNFHHPPSHHHHHHHAPLTPVISPAPAPETGAPEYGLSAPKSAASPKRSYEAKPPGCQYKRKSGRKEG

Query:  KQPHLSPLASPSISPVHSAASPSQQHHVSPTQASTPLPSVIYAHVQPPSKSDSNHPEKSTTSPSIVPSPSPSPSPSPSSAHHWCMITRWGFTLSLIVAFY
        KQPHLSPLASPSISPVHSAASPSQQHHVSPTQASTPLPSVIYAHVQPPSKSDSNHPEKSTTSPSIVPSPSPSPSPSPSSAHHWCMITRWGFTLSLIVAFY
Subjt:  KQPHLSPLASPSISPVHSAASPSQQHHVSPTQASTPLPSVIYAHVQPPSKSDSNHPEKSTTSPSIVPSPSPSPSPSPSSAHHWCMITRWGFTLSLIVAFY

Query:  M
        M
Subjt:  M

XP_022925201.1 uncharacterized protein LOC111432513 isoform X2 [Cucurbita moschata]2.2e-27999.8Show/hide
Query:  MGKNDGEHPPPSAVGSAPSQGRCCSGCVSIRRLIGFRCIFILLLSVALFVSAVFWLPPFLHYSDQKDLGLNPSYRGHDIVATFVVERPVSLLQDNIERLR
        MGKNDGEHPPPSAVGSAPSQGRCCSGCVSIRRLIGFRCIFILLLSVALFVSAVFWLPPFLHYSDQKDLGLNPSYRG DIVATFVVERPVSLLQDNIERLR
Subjt:  MGKNDGEHPPPSAVGSAPSQGRCCSGCVSIRRLIGFRCIFILLLSVALFVSAVFWLPPFLHYSDQKDLGLNPSYRGHDIVATFVVERPVSLLQDNIERLR

Query:  TDIFEEFPIPSIKVDILSLNSLSGSNRTKVVFGIDPDTDDPEIPSTYLSLIRSTCASVVTNQSFLRITKSMFGEAFSFEVLKFPGGITIIPPQSAFLLQK
        TDIFEEFPIPSIKVDILSLNSLSGSNRTKVVFGIDPDTDDPEIPSTYLSLIRSTCASVVTNQSFLRITKSMFGEAFSFEVLKFPGGITIIPPQSAFLLQK
Subjt:  TDIFEEFPIPSIKVDILSLNSLSGSNRTKVVFGIDPDTDDPEIPSTYLSLIRSTCASVVTNQSFLRITKSMFGEAFSFEVLKFPGGITIIPPQSAFLLQK

Query:  VQILFNFTLNFSIHQIQVHFSELTSQLDAGLRLAPYEILYIKLWNAEGSTVTAPTIVQSSVLLEVGNTPSMQRLKQLAQTISVSNSSNLGLNNTEFGKVK
        VQILFNFTLNFSIHQIQVHFSELTSQLDAGLRLAPYEILYIKLWNAEGSTVTAPTIVQSSVLLEVGNTPSMQRLKQLAQTISVSNSSNLGLNNTEFGKVK
Subjt:  VQILFNFTLNFSIHQIQVHFSELTSQLDAGLRLAPYEILYIKLWNAEGSTVTAPTIVQSSVLLEVGNTPSMQRLKQLAQTISVSNSSNLGLNNTEFGKVK

Query:  QVRLSSILKHSLNGMDGKGPIRSPSPAPTPQPHNFHHPPSHHHHHHHAPLTPVISPAPAPETGAPEYGLSAPKSAASPKRSYEAKPPGCQYKRKSGRKEG
        QVRLSSILKHSLNGMDGKGPIRSPSPAPTPQPHNFHHPPSHHHHHHHAPLTPVISPAPAPETGAPEYGLSAPKSAASPKRSYEAKPPGCQYKRKSGRKEG
Subjt:  QVRLSSILKHSLNGMDGKGPIRSPSPAPTPQPHNFHHPPSHHHHHHHAPLTPVISPAPAPETGAPEYGLSAPKSAASPKRSYEAKPPGCQYKRKSGRKEG

Query:  KQPHLSPLASPSISPVHSAASPSQQHHVSPTQASTPLPSVIYAHVQPPSKSDSNHPEKSTTSPSIVPSPSPSPSPSPSSAHHWCMITRWGFTLSLIVAFY
        KQPHLSPLASPSISPVHSAASPSQQHHVSPTQASTPLPSVIYAHVQPPSKSDSNHPEKSTTSPSIVPSPSPSPSPSPSSAHHWCMITRWGFTLSLIVAFY
Subjt:  KQPHLSPLASPSISPVHSAASPSQQHHVSPTQASTPLPSVIYAHVQPPSKSDSNHPEKSTTSPSIVPSPSPSPSPSPSSAHHWCMITRWGFTLSLIVAFY

Query:  M
        M
Subjt:  M

XP_022925202.1 uncharacterized protein LOC111432513 isoform X3 [Cucurbita moschata]5.4e-27899.2Show/hide
Query:  MGKNDGEHPPPSAVGSAPSQGRCCSGCVSIRRLIGFRCIFILLLSVALFVSAVFWLPPFLHYSDQKDLGLNPSYRGHDIVATFVVERPVSLLQDNIERLR
        MGKNDGEHPPPSAVGSAPSQGRCCSGCVSIRRLIGFRCIFILLLSVALFVSAVFWLPPFLHYSDQKDLGLNPSYRGHDIVATFVVERPVSLLQDNIERLR
Subjt:  MGKNDGEHPPPSAVGSAPSQGRCCSGCVSIRRLIGFRCIFILLLSVALFVSAVFWLPPFLHYSDQKDLGLNPSYRGHDIVATFVVERPVSLLQDNIERLR

Query:  TDIFEEFPIPSIKVDILSLNSLSGSNRTKVVFGIDPDTDDPEIPSTYLSLIRSTCASVVTNQSFLRITKSMFGEAFSFEVLKFPGGITIIPPQSAFLLQK
        TDIFEEFPIPSIKVDILSLNSLSGSNRTKVVFGIDPDTDDPEIPSTYLSLIRSTCASVVTNQSFLRITKSMFGEAFSFEVLKFPGGITIIPPQSAFLLQK
Subjt:  TDIFEEFPIPSIKVDILSLNSLSGSNRTKVVFGIDPDTDDPEIPSTYLSLIRSTCASVVTNQSFLRITKSMFGEAFSFEVLKFPGGITIIPPQSAFLLQK

Query:  VQILFNFTLNFSIHQIQVHFSELTSQLDAGLRLAPYEILYIKLWNAEGSTVTAPTIVQSSVLLEVGNTPSMQRLKQLAQTISVSNSSNLGLNNTEFGKVK
        VQILFNFTLNFSIHQIQVHFSELTSQLDAGLRLAPYEILYIKLWNAEGSTVTAPTIVQSSVLLEVGNTPSMQRLKQLAQTISVSNSSNLGLNNTEFGKVK
Subjt:  VQILFNFTLNFSIHQIQVHFSELTSQLDAGLRLAPYEILYIKLWNAEGSTVTAPTIVQSSVLLEVGNTPSMQRLKQLAQTISVSNSSNLGLNNTEFGKVK

Query:  QVRLSSILKHSLNGMDGKGPIRSPSPAPTPQPHNFHHPPSHHHHHHHAPLTPVISPAPAPETGAPEYGLSAPKSAASPKRSYEAKPPGCQYKRKSGRKEG
        QVRLSSILKHSLNGMDGKGPIRSPSPAPTPQPHNFHHPPSHHHHHHHAPLTPVISPAPAPETGAPEYGLSAPKSAASPKRSYEAKPPGCQYKRKSGRKEG
Subjt:  QVRLSSILKHSLNGMDGKGPIRSPSPAPTPQPHNFHHPPSHHHHHHHAPLTPVISPAPAPETGAPEYGLSAPKSAASPKRSYEAKPPGCQYKRKSGRKEG

Query:  KQPHLSPLASPSISPVHSAASPSQQHHVSPTQASTPLPSVIYAHVQPPSKSDSNHPEKSTTSPSIVPSPSPSPSPSPSSAHHWCMITRWGFTLSLIVAFY
        KQPHLSPLASPSISPVHSAASPSQQHHVSPTQASTPLPSVIYAHVQPPSKSDSNHPEKSTTSPSIV    PSPSPSPSSAHHWCMITRWGFTLSLIVAFY
Subjt:  KQPHLSPLASPSISPVHSAASPSQQHHVSPTQASTPLPSVIYAHVQPPSKSDSNHPEKSTTSPSIVPSPSPSPSPSPSSAHHWCMITRWGFTLSLIVAFY

Query:  M
        M
Subjt:  M

XP_022925203.1 uncharacterized protein LOC111432513 isoform X4 [Cucurbita moschata]1.0e-27698.8Show/hide
Query:  MGKNDGEHPPPSAVGSAPSQGRCCSGCVSIRRLIGFRCIFILLLSVALFVSAVFWLPPFLHYSDQKDLGLNPSYRGHDIVATFVVERPVSLLQDNIERLR
        MGKNDGEHPPPSAVGSAPSQGRCCSGCVSIRRLIGFRCIFILLLSVALFVSAVFWLPPFLHYSDQKDLGLNPSYRGHDIVATFVVERPVSLLQDNIERLR
Subjt:  MGKNDGEHPPPSAVGSAPSQGRCCSGCVSIRRLIGFRCIFILLLSVALFVSAVFWLPPFLHYSDQKDLGLNPSYRGHDIVATFVVERPVSLLQDNIERLR

Query:  TDIFEEFPIPSIKVDILSLNSLSGSNRTKVVFGIDPDTDDPEIPSTYLSLIRSTCASVVTNQSFLRITKSMFGEAFSFEVLKFPGGITIIPPQSAFLLQK
        TDIFEEFPIPSIKVDILSLNSLSGSNRTKVVFGIDPDTDDPEIPSTYLSLIRSTCASVVTNQSFLRITKSMFGEAFSFEVLKFPGGITIIPPQSAFLLQK
Subjt:  TDIFEEFPIPSIKVDILSLNSLSGSNRTKVVFGIDPDTDDPEIPSTYLSLIRSTCASVVTNQSFLRITKSMFGEAFSFEVLKFPGGITIIPPQSAFLLQK

Query:  VQILFNFTLNFSIHQIQVHFSELTSQLDAGLRLAPYEILYIKLWNAEGSTVTAPTIVQSSVLLEVGNTPSMQRLKQLAQTISVSNSSNLGLNNTEFGKVK
        VQILFNFTLNFSIHQIQVHFSELTSQLDAGLRLAPYEILYIKLWNAEGSTVTAPTIVQSSVLLEVGNTPSMQRLKQLAQTISVSNSSNLGLNNTEFGKVK
Subjt:  VQILFNFTLNFSIHQIQVHFSELTSQLDAGLRLAPYEILYIKLWNAEGSTVTAPTIVQSSVLLEVGNTPSMQRLKQLAQTISVSNSSNLGLNNTEFGKVK

Query:  QVRLSSILKHSLNGMDGKGPIRSPSPAPTPQPHNFHHPPSHHHHHHHAPLTPVISPAPAPETGAPEYGLSAPKSAASPKRSYEAKPPGCQYKRKSGRKEG
        QVRLSSILKHSLNGMDGKGPIRSPSPAPTPQPHNFHHPPSHHHHHHHAPLTPVISPAPAPETGAPEYGLSAPKSAASPKRSYEAKPPGCQYKRKSGRKEG
Subjt:  QVRLSSILKHSLNGMDGKGPIRSPSPAPTPQPHNFHHPPSHHHHHHHAPLTPVISPAPAPETGAPEYGLSAPKSAASPKRSYEAKPPGCQYKRKSGRKEG

Query:  KQPHLSPLASPSISPVHSAASPSQQHHVSPTQASTPLPSVIYAHVQPPSKSDSNHPEKSTTSPSIVPSPSPSPSPSPSSAHHWCMITRWGFTLSLIVAFY
        KQPHLSPLASPSISPVHSAASPSQQHHVSPTQASTPLPSVIYAHVQPPSKSDSNHPEKSTTSPSIV      PSPSPSSAHHWCMITRWGFTLSLIVAFY
Subjt:  KQPHLSPLASPSISPVHSAASPSQQHHVSPTQASTPLPSVIYAHVQPPSKSDSNHPEKSTTSPSIVPSPSPSPSPSPSSAHHWCMITRWGFTLSLIVAFY

Query:  M
        M
Subjt:  M

TrEMBL top hitse value%identityAlignment
A0A6J1EB56 uncharacterized protein LOC111432513 isoform X32.6e-27899.2Show/hide
Query:  MGKNDGEHPPPSAVGSAPSQGRCCSGCVSIRRLIGFRCIFILLLSVALFVSAVFWLPPFLHYSDQKDLGLNPSYRGHDIVATFVVERPVSLLQDNIERLR
        MGKNDGEHPPPSAVGSAPSQGRCCSGCVSIRRLIGFRCIFILLLSVALFVSAVFWLPPFLHYSDQKDLGLNPSYRGHDIVATFVVERPVSLLQDNIERLR
Subjt:  MGKNDGEHPPPSAVGSAPSQGRCCSGCVSIRRLIGFRCIFILLLSVALFVSAVFWLPPFLHYSDQKDLGLNPSYRGHDIVATFVVERPVSLLQDNIERLR

Query:  TDIFEEFPIPSIKVDILSLNSLSGSNRTKVVFGIDPDTDDPEIPSTYLSLIRSTCASVVTNQSFLRITKSMFGEAFSFEVLKFPGGITIIPPQSAFLLQK
        TDIFEEFPIPSIKVDILSLNSLSGSNRTKVVFGIDPDTDDPEIPSTYLSLIRSTCASVVTNQSFLRITKSMFGEAFSFEVLKFPGGITIIPPQSAFLLQK
Subjt:  TDIFEEFPIPSIKVDILSLNSLSGSNRTKVVFGIDPDTDDPEIPSTYLSLIRSTCASVVTNQSFLRITKSMFGEAFSFEVLKFPGGITIIPPQSAFLLQK

Query:  VQILFNFTLNFSIHQIQVHFSELTSQLDAGLRLAPYEILYIKLWNAEGSTVTAPTIVQSSVLLEVGNTPSMQRLKQLAQTISVSNSSNLGLNNTEFGKVK
        VQILFNFTLNFSIHQIQVHFSELTSQLDAGLRLAPYEILYIKLWNAEGSTVTAPTIVQSSVLLEVGNTPSMQRLKQLAQTISVSNSSNLGLNNTEFGKVK
Subjt:  VQILFNFTLNFSIHQIQVHFSELTSQLDAGLRLAPYEILYIKLWNAEGSTVTAPTIVQSSVLLEVGNTPSMQRLKQLAQTISVSNSSNLGLNNTEFGKVK

Query:  QVRLSSILKHSLNGMDGKGPIRSPSPAPTPQPHNFHHPPSHHHHHHHAPLTPVISPAPAPETGAPEYGLSAPKSAASPKRSYEAKPPGCQYKRKSGRKEG
        QVRLSSILKHSLNGMDGKGPIRSPSPAPTPQPHNFHHPPSHHHHHHHAPLTPVISPAPAPETGAPEYGLSAPKSAASPKRSYEAKPPGCQYKRKSGRKEG
Subjt:  QVRLSSILKHSLNGMDGKGPIRSPSPAPTPQPHNFHHPPSHHHHHHHAPLTPVISPAPAPETGAPEYGLSAPKSAASPKRSYEAKPPGCQYKRKSGRKEG

Query:  KQPHLSPLASPSISPVHSAASPSQQHHVSPTQASTPLPSVIYAHVQPPSKSDSNHPEKSTTSPSIVPSPSPSPSPSPSSAHHWCMITRWGFTLSLIVAFY
        KQPHLSPLASPSISPVHSAASPSQQHHVSPTQASTPLPSVIYAHVQPPSKSDSNHPEKSTTSPSIV    PSPSPSPSSAHHWCMITRWGFTLSLIVAFY
Subjt:  KQPHLSPLASPSISPVHSAASPSQQHHVSPTQASTPLPSVIYAHVQPPSKSDSNHPEKSTTSPSIVPSPSPSPSPSPSSAHHWCMITRWGFTLSLIVAFY

Query:  M
        M
Subjt:  M

A0A6J1EBJ8 uncharacterized protein LOC111432513 isoform X21.1e-27999.8Show/hide
Query:  MGKNDGEHPPPSAVGSAPSQGRCCSGCVSIRRLIGFRCIFILLLSVALFVSAVFWLPPFLHYSDQKDLGLNPSYRGHDIVATFVVERPVSLLQDNIERLR
        MGKNDGEHPPPSAVGSAPSQGRCCSGCVSIRRLIGFRCIFILLLSVALFVSAVFWLPPFLHYSDQKDLGLNPSYRG DIVATFVVERPVSLLQDNIERLR
Subjt:  MGKNDGEHPPPSAVGSAPSQGRCCSGCVSIRRLIGFRCIFILLLSVALFVSAVFWLPPFLHYSDQKDLGLNPSYRGHDIVATFVVERPVSLLQDNIERLR

Query:  TDIFEEFPIPSIKVDILSLNSLSGSNRTKVVFGIDPDTDDPEIPSTYLSLIRSTCASVVTNQSFLRITKSMFGEAFSFEVLKFPGGITIIPPQSAFLLQK
        TDIFEEFPIPSIKVDILSLNSLSGSNRTKVVFGIDPDTDDPEIPSTYLSLIRSTCASVVTNQSFLRITKSMFGEAFSFEVLKFPGGITIIPPQSAFLLQK
Subjt:  TDIFEEFPIPSIKVDILSLNSLSGSNRTKVVFGIDPDTDDPEIPSTYLSLIRSTCASVVTNQSFLRITKSMFGEAFSFEVLKFPGGITIIPPQSAFLLQK

Query:  VQILFNFTLNFSIHQIQVHFSELTSQLDAGLRLAPYEILYIKLWNAEGSTVTAPTIVQSSVLLEVGNTPSMQRLKQLAQTISVSNSSNLGLNNTEFGKVK
        VQILFNFTLNFSIHQIQVHFSELTSQLDAGLRLAPYEILYIKLWNAEGSTVTAPTIVQSSVLLEVGNTPSMQRLKQLAQTISVSNSSNLGLNNTEFGKVK
Subjt:  VQILFNFTLNFSIHQIQVHFSELTSQLDAGLRLAPYEILYIKLWNAEGSTVTAPTIVQSSVLLEVGNTPSMQRLKQLAQTISVSNSSNLGLNNTEFGKVK

Query:  QVRLSSILKHSLNGMDGKGPIRSPSPAPTPQPHNFHHPPSHHHHHHHAPLTPVISPAPAPETGAPEYGLSAPKSAASPKRSYEAKPPGCQYKRKSGRKEG
        QVRLSSILKHSLNGMDGKGPIRSPSPAPTPQPHNFHHPPSHHHHHHHAPLTPVISPAPAPETGAPEYGLSAPKSAASPKRSYEAKPPGCQYKRKSGRKEG
Subjt:  QVRLSSILKHSLNGMDGKGPIRSPSPAPTPQPHNFHHPPSHHHHHHHAPLTPVISPAPAPETGAPEYGLSAPKSAASPKRSYEAKPPGCQYKRKSGRKEG

Query:  KQPHLSPLASPSISPVHSAASPSQQHHVSPTQASTPLPSVIYAHVQPPSKSDSNHPEKSTTSPSIVPSPSPSPSPSPSSAHHWCMITRWGFTLSLIVAFY
        KQPHLSPLASPSISPVHSAASPSQQHHVSPTQASTPLPSVIYAHVQPPSKSDSNHPEKSTTSPSIVPSPSPSPSPSPSSAHHWCMITRWGFTLSLIVAFY
Subjt:  KQPHLSPLASPSISPVHSAASPSQQHHVSPTQASTPLPSVIYAHVQPPSKSDSNHPEKSTTSPSIVPSPSPSPSPSPSSAHHWCMITRWGFTLSLIVAFY

Query:  M
        M
Subjt:  M

A0A6J1EEJ8 uncharacterized protein LOC111432513 isoform X44.9e-27798.8Show/hide
Query:  MGKNDGEHPPPSAVGSAPSQGRCCSGCVSIRRLIGFRCIFILLLSVALFVSAVFWLPPFLHYSDQKDLGLNPSYRGHDIVATFVVERPVSLLQDNIERLR
        MGKNDGEHPPPSAVGSAPSQGRCCSGCVSIRRLIGFRCIFILLLSVALFVSAVFWLPPFLHYSDQKDLGLNPSYRGHDIVATFVVERPVSLLQDNIERLR
Subjt:  MGKNDGEHPPPSAVGSAPSQGRCCSGCVSIRRLIGFRCIFILLLSVALFVSAVFWLPPFLHYSDQKDLGLNPSYRGHDIVATFVVERPVSLLQDNIERLR

Query:  TDIFEEFPIPSIKVDILSLNSLSGSNRTKVVFGIDPDTDDPEIPSTYLSLIRSTCASVVTNQSFLRITKSMFGEAFSFEVLKFPGGITIIPPQSAFLLQK
        TDIFEEFPIPSIKVDILSLNSLSGSNRTKVVFGIDPDTDDPEIPSTYLSLIRSTCASVVTNQSFLRITKSMFGEAFSFEVLKFPGGITIIPPQSAFLLQK
Subjt:  TDIFEEFPIPSIKVDILSLNSLSGSNRTKVVFGIDPDTDDPEIPSTYLSLIRSTCASVVTNQSFLRITKSMFGEAFSFEVLKFPGGITIIPPQSAFLLQK

Query:  VQILFNFTLNFSIHQIQVHFSELTSQLDAGLRLAPYEILYIKLWNAEGSTVTAPTIVQSSVLLEVGNTPSMQRLKQLAQTISVSNSSNLGLNNTEFGKVK
        VQILFNFTLNFSIHQIQVHFSELTSQLDAGLRLAPYEILYIKLWNAEGSTVTAPTIVQSSVLLEVGNTPSMQRLKQLAQTISVSNSSNLGLNNTEFGKVK
Subjt:  VQILFNFTLNFSIHQIQVHFSELTSQLDAGLRLAPYEILYIKLWNAEGSTVTAPTIVQSSVLLEVGNTPSMQRLKQLAQTISVSNSSNLGLNNTEFGKVK

Query:  QVRLSSILKHSLNGMDGKGPIRSPSPAPTPQPHNFHHPPSHHHHHHHAPLTPVISPAPAPETGAPEYGLSAPKSAASPKRSYEAKPPGCQYKRKSGRKEG
        QVRLSSILKHSLNGMDGKGPIRSPSPAPTPQPHNFHHPPSHHHHHHHAPLTPVISPAPAPETGAPEYGLSAPKSAASPKRSYEAKPPGCQYKRKSGRKEG
Subjt:  QVRLSSILKHSLNGMDGKGPIRSPSPAPTPQPHNFHHPPSHHHHHHHAPLTPVISPAPAPETGAPEYGLSAPKSAASPKRSYEAKPPGCQYKRKSGRKEG

Query:  KQPHLSPLASPSISPVHSAASPSQQHHVSPTQASTPLPSVIYAHVQPPSKSDSNHPEKSTTSPSIVPSPSPSPSPSPSSAHHWCMITRWGFTLSLIVAFY
        KQPHLSPLASPSISPVHSAASPSQQHHVSPTQASTPLPSVIYAHVQPPSKSDSNHPEKSTTSPSIV      PSPSPSSAHHWCMITRWGFTLSLIVAFY
Subjt:  KQPHLSPLASPSISPVHSAASPSQQHHVSPTQASTPLPSVIYAHVQPPSKSDSNHPEKSTTSPSIVPSPSPSPSPSPSSAHHWCMITRWGFTLSLIVAFY

Query:  M
        M
Subjt:  M

A0A6J1EH92 uncharacterized protein LOC111432513 isoform X15.0e-282100Show/hide
Query:  MGKNDGEHPPPSAVGSAPSQGRCCSGCVSIRRLIGFRCIFILLLSVALFVSAVFWLPPFLHYSDQKDLGLNPSYRGHDIVATFVVERPVSLLQDNIERLR
        MGKNDGEHPPPSAVGSAPSQGRCCSGCVSIRRLIGFRCIFILLLSVALFVSAVFWLPPFLHYSDQKDLGLNPSYRGHDIVATFVVERPVSLLQDNIERLR
Subjt:  MGKNDGEHPPPSAVGSAPSQGRCCSGCVSIRRLIGFRCIFILLLSVALFVSAVFWLPPFLHYSDQKDLGLNPSYRGHDIVATFVVERPVSLLQDNIERLR

Query:  TDIFEEFPIPSIKVDILSLNSLSGSNRTKVVFGIDPDTDDPEIPSTYLSLIRSTCASVVTNQSFLRITKSMFGEAFSFEVLKFPGGITIIPPQSAFLLQK
        TDIFEEFPIPSIKVDILSLNSLSGSNRTKVVFGIDPDTDDPEIPSTYLSLIRSTCASVVTNQSFLRITKSMFGEAFSFEVLKFPGGITIIPPQSAFLLQK
Subjt:  TDIFEEFPIPSIKVDILSLNSLSGSNRTKVVFGIDPDTDDPEIPSTYLSLIRSTCASVVTNQSFLRITKSMFGEAFSFEVLKFPGGITIIPPQSAFLLQK

Query:  VQILFNFTLNFSIHQIQVHFSELTSQLDAGLRLAPYEILYIKLWNAEGSTVTAPTIVQSSVLLEVGNTPSMQRLKQLAQTISVSNSSNLGLNNTEFGKVK
        VQILFNFTLNFSIHQIQVHFSELTSQLDAGLRLAPYEILYIKLWNAEGSTVTAPTIVQSSVLLEVGNTPSMQRLKQLAQTISVSNSSNLGLNNTEFGKVK
Subjt:  VQILFNFTLNFSIHQIQVHFSELTSQLDAGLRLAPYEILYIKLWNAEGSTVTAPTIVQSSVLLEVGNTPSMQRLKQLAQTISVSNSSNLGLNNTEFGKVK

Query:  QVRLSSILKHSLNGMDGKGPIRSPSPAPTPQPHNFHHPPSHHHHHHHAPLTPVISPAPAPETGAPEYGLSAPKSAASPKRSYEAKPPGCQYKRKSGRKEG
        QVRLSSILKHSLNGMDGKGPIRSPSPAPTPQPHNFHHPPSHHHHHHHAPLTPVISPAPAPETGAPEYGLSAPKSAASPKRSYEAKPPGCQYKRKSGRKEG
Subjt:  QVRLSSILKHSLNGMDGKGPIRSPSPAPTPQPHNFHHPPSHHHHHHHAPLTPVISPAPAPETGAPEYGLSAPKSAASPKRSYEAKPPGCQYKRKSGRKEG

Query:  KQPHLSPLASPSISPVHSAASPSQQHHVSPTQASTPLPSVIYAHVQPPSKSDSNHPEKSTTSPSIVPSPSPSPSPSPSSAHHWCMITRWGFTLSLIVAFY
        KQPHLSPLASPSISPVHSAASPSQQHHVSPTQASTPLPSVIYAHVQPPSKSDSNHPEKSTTSPSIVPSPSPSPSPSPSSAHHWCMITRWGFTLSLIVAFY
Subjt:  KQPHLSPLASPSISPVHSAASPSQQHHVSPTQASTPLPSVIYAHVQPPSKSDSNHPEKSTTSPSIVPSPSPSPSPSPSSAHHWCMITRWGFTLSLIVAFY

Query:  M
        M
Subjt:  M

A0A6J1HPX6 uncharacterized protein LOC111466276 isoform X11.2e-27096.61Show/hide
Query:  MGKNDGEHPPPSAVGSAPSQGRCCSGCVSIRRLIGFRCIFILLLSVALFVSAVFWLPPFLHYSDQKDLGLNPSYRGHDIVATFVVERPVSLLQDNIERLR
        MGKNDGEHPPPSAVGSA SQGRCCSGCVSIRRLIGF+CIFILLLSVALFVSAVFWLPPF HY+DQKDLGLNPSYRGHDIVATFVVERPVSLL+DNIERLR
Subjt:  MGKNDGEHPPPSAVGSAPSQGRCCSGCVSIRRLIGFRCIFILLLSVALFVSAVFWLPPFLHYSDQKDLGLNPSYRGHDIVATFVVERPVSLLQDNIERLR

Query:  TDIFEEFPIPSIKVDILSLNSLSGSNRTKVVFGIDPDTDDPEIPSTYLSLIRSTCASVVTNQSFLRITKSMFGEAFSFEVLKFPGGITIIPPQSAFLLQK
        TDIFEEFPIPSIKVDILSLNSLSGSNRTKVVFGIDPDTDDPEIPSTYLSLIRSTCAS+VTNQSFLRITKSMFGEAFSFEVLKFPGGITIIPPQSAFLLQK
Subjt:  TDIFEEFPIPSIKVDILSLNSLSGSNRTKVVFGIDPDTDDPEIPSTYLSLIRSTCASVVTNQSFLRITKSMFGEAFSFEVLKFPGGITIIPPQSAFLLQK

Query:  VQILFNFTLNFSIHQIQVHFSELTSQLDAGLRLAPYEILYIKLWNAEGSTVTAPTIVQSSVLLEVGNTPSMQRLKQLAQTISVSNSSNLGLNNTEFGKVK
        VQILFNFTLNFSIHQIQVHFSELTSQLDAGLRLAPYEILYIKLWNAEGSTVTAPTIVQSSVLLEVGNTPSMQRLKQLAQTISVSNSSNLGLNNTEFGKVK
Subjt:  VQILFNFTLNFSIHQIQVHFSELTSQLDAGLRLAPYEILYIKLWNAEGSTVTAPTIVQSSVLLEVGNTPSMQRLKQLAQTISVSNSSNLGLNNTEFGKVK

Query:  QVRLSSILKHSLNGMDGKGPIRSPSPAPTPQPHNFHHPPSHHHHHHHAPLTPVISPAPAPETGAPEYGLSAPKSAASPKRSYEAKPPGCQYKRKSGRKEG
        QVRLSSILKHSLNGMDGKGPIRSPSPAPTPQ HNFHHPPSHHHHHHH+PLTPVISPAPAPETGAPEYGL APKSAASPKRSYEAKPPGCQYKRKSGRKEG
Subjt:  QVRLSSILKHSLNGMDGKGPIRSPSPAPTPQPHNFHHPPSHHHHHHHAPLTPVISPAPAPETGAPEYGLSAPKSAASPKRSYEAKPPGCQYKRKSGRKEG

Query:  KQPHLSPLASPSISPVHSAASPSQQHHVSPTQASTPLPSVIYAHVQPPSKSDSNHPEKSTTSPSIVPSPSPSPSPSPSSAHHWCMITRWGFTLSLIVAFY
        KQP+LSPLASPSISPVHSAASPSQQHHVSPTQASTPLPSVIYAHVQPPSKS+SNHPEKSTTSPSIV    PSPSPS SSAHHWCMITRW FTLSLIVAFY
Subjt:  KQPHLSPLASPSISPVHSAASPSQQHHVSPTQASTPLPSVIYAHVQPPSKSDSNHPEKSTTSPSIVPSPSPSPSPSPSSAHHWCMITRWGFTLSLIVAFY

Query:  M
        M
Subjt:  M

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G10790.1 BEST Arabidopsis thaliana protein match is: hydroxyproline-rich glycoprotein family protein (TAIR:AT3G56590.2)3.8e-4036.76Show/hide
Query:  SAPSQGRCCSGCVSIRRLIGFRCIFILLLSVALFVSAVFWLPPFLHYSDQKDLG---LNPSYRGHDIVATFVVERPVSLLQDNIERLRTDIFEEFPIP-S
        S  S GR CS   S  RL+G RC+ +L+LS A+ +SA+FWL P    S+ K  G   LN S     + A+F +++PVS +  +  ++  DI     +  +
Subjt:  SAPSQGRCCSGCVSIRRLIGFRCIFILLLSVALFVSAVFWLPPFLHYSDQKDLG---LNPSYRGHDIVATFVVERPVSLLQDNIERLRTDIFEEFPIP-S

Query:  IKVDILSLNSLSGSNRTKVVFGIDPDTDDPEIPSTYLSLIRSTCASVVTNQSFLRITKSMFGEAFSFEVLKFPGGITIIPPQSAFLLQKVQILFNFTLNF
         KV +LSLN    SN T V F + P   D EI    LSL+RS+   +   +S L++T S FG+  SF+VLKFPGGIT+ P + A +     +LF+ T+  
Subjt:  IKVDILSLNSLSGSNRTKVVFGIDPDTDDPEIPSTYLSLIRSTCASVVTNQSFLRITKSMFGEAFSFEVLKFPGGITIIPPQSAFLLQKVQILFNFTLNF

Query:  SIHQIQVHFSELTSQLDAGLRLAPYEILYIKLWNAEGSTVTAPTIVQSSVLLEVGNTPSMQRLKQLAQTISVSNSSNLGLNNTEFGKVKQVRLSSILKHS
        SI  +Q     L    +  L L PYE ++ +L N +GST++ P   Q  V   +      QRL    Q I  S + NLGL+   FG+VK +  S+ L   
Subjt:  SIHQIQVHFSELTSQLDAGLRLAPYEILYIKLWNAEGSTVTAPTIVQSSVLLEVGNTPSMQRLKQLAQTISVSNSSNLGLNNTEFGKVKQVRLSSILKHS

Query:  LNGMDGKGPIRSPSPAPTPQP
            DGK P      AP P P
Subjt:  LNGMDGKGPIRSPSPAPTPQP

AT3G10810.1 zinc finger (C3HC4-type RING finger) family protein3.2e-9546.41Show/hide
Query:  AVGSAPSQGRCCSGCVSIRRLIGFRCIFILLLSVALFVSAVFWLPPFLHYSDQKDLGLNPSYRGHDIVATFVVERPVSLLQDNIERLRTDIFEEFPIPSI
        A G +  +   C  C  I   +GF+C+F+LLLSVALF+SA+F L PF    D++D  L+P +RGH IVA+F + R  S L +N  +L+ DIF+E    SI
Subjt:  AVGSAPSQGRCCSGCVSIRRLIGFRCIFILLLSVALFVSAVFWLPPFLHYSDQKDLGLNPSYRGHDIVATFVVERPVSLLQDNIERLRTDIFEEFPIPSI

Query:  KVDILSLNSLSGSNRTKVVFGIDPDTDDPEIPSTYLSLIRSTCASVVTNQSFLRITKSMFGEAFSFEVLKFPGGITIIPPQSAFLLQKVQILFNFTLNFS
        KV IL++      N TKVVFGIDPDT   EI    LS I+    SV+ NQS L++TKS+FGE F FEVLKFPGGIT+IPPQSAF LQK +I+FNFTLN+S
Subjt:  KVDILSLNSLSGSNRTKVVFGIDPDTDDPEIPSTYLSLIRSTCASVVTNQSFLRITKSMFGEAFSFEVLKFPGGITIIPPQSAFLLQKVQILFNFTLNFS

Query:  IHQIQVHFSELTSQLDAGLRLAPYEILYIKLWNAEGSTVTAPTIVQSSVLLEVGNTPSMQRLKQLAQTISVSNSSNLGLNNTEFGKVKQVRLSSILKHSL
        IHQIQ++F+ L SQL  GL LAPYE LY+ L N+EGSTV+ PT V SSVLL VG + S  RLKQL  TI+ S S NLGLNNT FGKVKQVRLSS L +S 
Subjt:  IHQIQVHFSELTSQLDAGLRLAPYEILYIKLWNAEGSTVTAPTIVQSSVLLEVGNTPSMQRLKQLAQTISVSNSSNLGLNNTEFGKVKQVRLSSILKHSL

Query:  NGMDGKGPIRSPSPAPTPQP----------HNFHHPPSHHHHHHHAPLTPVISPAPAPETGAPEYGLSAPKSAASPKRSYEAKPPGCQYKRKSGRKEGKQ
                 +SPSP+P+P            H+ HH   +HHHHHH  L+P ++P  +P        +++P    S KR+  A PP     R   +++  Q
Subjt:  NGMDGKGPIRSPSPAPTPQP----------HNFHHPPSHHHHHHHAPLTPVISPAPAPETGAPEYGLSAPKSAASPKRSYEAKPPGCQYKRKSGRKEGKQ

Query:  PHLSPLASPSI-SPVHSAASP----SQQHHVSPTQASTPLPSVIYAHVQPPSKSDSNHPEKSTTSPSIVPSPSPSPSPSPSSAHHWCMITRWGFTLSLIV
           +P  +PS  +P H   SP    + + H+ P   S PLP V++AH   P  ++   P  +      V  P P    S SSA        W   L LIV
Subjt:  PHLSPLASPSI-SPVHSAASP----SQQHHVSPTQASTPLPSVIYAHVQPPSKSDSNHPEKSTTSPSIVPSPSPSPSPSPSSAHHWCMITRWGFTLSLIV

Query:  AF
        A+
Subjt:  AF

AT3G56590.1 hydroxyproline-rich glycoprotein family protein1.2e-9746.65Show/hide
Query:  MGKNDGEH---PPPSAVGSAPSQG----RCCSGCVSIRRLIGFRCIFILLLSVALFVSAVFWLPPFLHYSDQKDLGLNPSYRGHDIVATFVVERPVSLLQ
        MGKN  E    P      SA + G      C  C  I      RC+ IL  S A+F+SA+FWLPPFL ++D  DL L+P ++ H IVA+F V +P+S ++
Subjt:  MGKNDGEH---PPPSAVGSAPSQG----RCCSGCVSIRRLIGFRCIFILLLSVALFVSAVFWLPPFLHYSDQKDLGLNPSYRGHDIVATFVVERPVSLLQ

Query:  DNIERLRTDIFEEFPIPSIKVDILSLNSLSGSNRTKVVFGIDPDTDDPEIPSTYLSLIRSTCASVVTNQSFLRITKSMFGEAFSFEVLKFPGGITIIPPQ
        DN+ +L  DI +E   P  KV +L+L  L   NRT V+F IDP+ ++ +IP+   SLI++   ++V  Q   R+T+S+FGE F FEVLKFPGGIT+IPPQ
Subjt:  DNIERLRTDIFEEFPIPSIKVDILSLNSLSGSNRTKVVFGIDPDTDDPEIPSTYLSLIRSTCASVVTNQSFLRITKSMFGEAFSFEVLKFPGGITIIPPQ

Query:  SAFLLQKVQILFNFTLNFSIHQIQVHFSELTSQLDAGLRLAPYEILYIKLWNAEGSTVTAPTIVQSSVLLEVGNTPSMQRLKQLAQTISVSNSSNLGLNN
          F LQK Q+LFNFTLNFSI+QIQ +F EL SQL  G+ LA YE LYI L N+ GSTV  PTIV SSVLL  G   S  RLKQLAQTI+ S+S NLGLN+
Subjt:  SAFLLQKVQILFNFTLNFSIHQIQVHFSELTSQLDAGLRLAPYEILYIKLWNAEGSTVTAPTIVQSSVLLEVGNTPSMQRLKQLAQTISVSNSSNLGLNN

Query:  TEFGKVKQVRLSSILKHSLNGMDGKGPIRSPSPAPTPQPHNFHHPPSHHHHHHHAPLTPVISPAPAPETGAPEYGLSAPKSAASPKRSYEAKPPGCQYKR
        T FGKVKQVRLSSIL HS           +PSP+P P+ H + H   HHHHHHH         AP P    P  G  AP SA +       + P C Y++
Subjt:  TEFGKVKQVRLSSILKHSLNGMDGKGPIRSPSPAPTPQPHNFHHPPSHHHHHHHAPLTPVISPAPAPETGAPEYGLSAPKSAASPKRSYEAKPPGCQYKR

Query:  KSGRKEGKQPHLSPLASPSISPVHSAASP------SQQHHVSPTQASTPLPSVIYAHVQPPSKSDSNHPEKSTTSPSIVPSPSPSPSPSPSSA
        +  R +G        A P+ +P  S   P        +HH  P   S+PLP V++AH+ PPSKS    PE   T      SPSP+P+P  SS+
Subjt:  KSGRKEGKQPHLSPLASPSISPVHSAASP------SQQHHVSPTQASTPLPSVIYAHVQPPSKSDSNHPEKSTTSPSIVPSPSPSPSPSPSSA

AT3G56590.2 hydroxyproline-rich glycoprotein family protein2.4e-9846.86Show/hide
Query:  MGKNDGEH---PPPSAVGSAPSQG----RCCSGCVSIRRLIGFRCIFILLLSVALFVSAVFWLPPFLHYSDQKDLGLNPSYRGHDIVATFVVERPVSLLQ
        MGKN  E    P      SA + G      C  C  I      RC+ IL  S A+F+SA+FWLPPFL ++D  DL L+P ++ H IVA+F V +P+S ++
Subjt:  MGKNDGEH---PPPSAVGSAPSQG----RCCSGCVSIRRLIGFRCIFILLLSVALFVSAVFWLPPFLHYSDQKDLGLNPSYRGHDIVATFVVERPVSLLQ

Query:  DNIERLRTDIFEEFPIPSIKVDILSLNSLSGSNRTKVVFGIDPDTDDPEIPSTYLSLIRSTCASVVTNQSFLRITKSMFGEAFSFEVLKFPGGITIIPPQ
        DN+ +L  DI +E   P  KV +L+L  L   NRT V+F IDP+ ++ +IP+   SLI++   ++V  Q   R+T+S+FGE F FEVLKFPGGIT+IPPQ
Subjt:  DNIERLRTDIFEEFPIPSIKVDILSLNSLSGSNRTKVVFGIDPDTDDPEIPSTYLSLIRSTCASVVTNQSFLRITKSMFGEAFSFEVLKFPGGITIIPPQ

Query:  SAFLLQKVQILFNFTLNFSIHQIQVHFSELTSQLDAGLRLAPYEILYIKLWNAEGSTVTAPTIVQSSVLLEVGNTPSMQRLKQLAQTISVSNSSNLGLNN
          F LQK Q+LFNFTLNFSI+QIQ +F EL SQL  G+ LA YE LYI L N+ GSTV  PTIV SSVLL  G   S  RLKQLAQTI+ S+S NLGLN+
Subjt:  SAFLLQKVQILFNFTLNFSIHQIQVHFSELTSQLDAGLRLAPYEILYIKLWNAEGSTVTAPTIVQSSVLLEVGNTPSMQRLKQLAQTISVSNSSNLGLNN

Query:  TEFGKVKQVRLSSILKHSLNGMDGKGPIRSPSPAPTPQPHNFHHPPSHHHHHHHAPLTPVISPAPAPETGAPEYGLSAPKSAASPKRSYEAKPPGCQYKR
        T FGKVKQVRLSSIL HS           +PSP+P P+ H + H   HHHHHHH         AP P    P  G  AP SA +       + P C Y++
Subjt:  TEFGKVKQVRLSSILKHSLNGMDGKGPIRSPSPAPTPQPHNFHHPPSHHHHHHHAPLTPVISPAPAPETGAPEYGLSAPKSAASPKRSYEAKPPGCQYKR

Query:  KSGRKEGKQPHLSPLASPSISPVHSAASP------SQQHHVSPTQASTPLPSVIYAHVQPPSKSDSNHPEKSTTSPSIVPSPSPSPSPSPSSA
        +  R +G        A P+ +P  S   P        +HH  P   S+PLP V++AH+ PPSKS    PE   T        SPSP+P+PSSA
Subjt:  KSGRKEGKQPHLSPLASPSISPVHSAASP------SQQHHVSPTQASTPLPSVIYAHVQPPSKSDSNHPEKSTTSPSIVPSPSPSPSPSPSSA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGGAAAAACGACGGAGAACACCCACCGCCCTCCGCCGTCGGTTCCGCTCCGTCCCAAGGCCGATGCTGTTCTGGGTGTGTTTCAATTCGAAGGCTCATTGGCTTCAG
ATGCATCTTCATTCTGCTTTTGTCCGTTGCCTTGTTCGTTTCTGCTGTTTTTTGGCTGCCCCCTTTTCTCCATTATTCAGATCAGAAGGATCTGGGTCTTAATCCCTCGT
ATCGAGGTCATGATATAGTAGCAACGTTCGTTGTTGAGAGACCAGTTTCTTTGCTGCAAGACAATATCGAGCGACTCCGGACCGACATTTTTGAAGAGTTCCCTATACCT
TCTATCAAAGTGGATATACTATCTTTAAACTCGTTATCAGGATCAAACCGTACAAAAGTTGTATTCGGCATTGATCCAGATACTGATGATCCCGAAATCCCGTCAACTTA
TCTGAGTTTAATCAGGTCGACCTGTGCAAGTGTAGTAACAAATCAGTCGTTCCTCCGCATTACGAAATCCATGTTTGGGGAGGCGTTTTCGTTTGAAGTACTGAAATTCC
CCGGAGGAATAACGATAATCCCGCCTCAGAGTGCATTTCTTTTGCAGAAAGTGCAAATTCTTTTCAACTTTACATTGAACTTCTCTATTCATCAGATTCAAGTACATTTC
AGTGAACTGACAAGCCAATTGGATGCGGGATTACGACTAGCTCCCTACGAGATTTTGTATATTAAACTGTGGAATGCGGAAGGGTCGACTGTGACTGCCCCGACGATTGT
CCAGTCATCTGTTCTTCTAGAAGTTGGAAATACGCCATCGATGCAACGGTTGAAGCAGCTAGCTCAGACCATCTCTGTTTCTAATTCTAGCAACCTCGGCCTGAATAATA
CCGAGTTTGGAAAAGTGAAGCAAGTTCGCCTTTCCTCGATTCTTAAACACTCGCTCAATGGTATGGATGGGAAAGGCCCCATAAGGTCACCTTCTCCAGCTCCTACACCA
CAGCCCCATAACTTCCATCATCCACCATCTCACCACCATCACCACCATCATGCTCCTCTAACTCCTGTAATTTCACCTGCCCCTGCGCCTGAGACCGGTGCACCGGAATA
TGGGTTGTCTGCCCCTAAAAGTGCAGCATCGCCTAAGCGAAGTTACGAGGCAAAGCCTCCTGGTTGTCAATATAAGAGGAAATCTGGTAGGAAAGAGGGAAAGCAACCTC
ATTTATCCCCGCTTGCGTCGCCCAGCATATCTCCTGTTCATTCTGCTGCATCGCCATCACAACAACATCACGTCTCTCCAACTCAGGCATCGACTCCATTGCCGAGTGTC
ATTTACGCTCATGTTCAACCACCATCGAAAAGTGACTCCAACCACCCCGAAAAATCCACGACGAGTCCATCCATTGTACCATCTCCATCTCCATCTCCATCTCCATCTCC
ATCTAGCGCACATCATTGGTGTATGATTACTCGGTGGGGATTCACACTGTCTCTAATTGTCGCATTCTACATGTAA
mRNA sequenceShow/hide mRNA sequence
CATAAAACTAATATATTTTTAAATCTTCAACTTGAATTTCAATGACCCACATGAGGAAACCAGTTAGAAACCCCCAAGTAAAGTAGAAATTTCAATCCCCAAGCAATTCT
GTGGCGATTCCTCTCCTTTTTTTTGACATTAACACCTTTCATATTCATCGCAATCTTATTATTACACACACCCACATTCAAATTTCACTTCCCCACTTCTTGTTCTCACT
CACACCCAATAATGGGCGCACCCAACTCATCCCCAGCGGAGCAGAGCCGGAATGGCCCTTGACCCACTTCCCCACCATTCTTCCCGATGGGGAAAAACGACGGAGAACAC
CCACCGCCCTCCGCCGTCGGTTCCGCTCCGTCCCAAGGCCGATGCTGTTCTGGGTGTGTTTCAATTCGAAGGCTCATTGGCTTCAGATGCATCTTCATTCTGCTTTTGTC
CGTTGCCTTGTTCGTTTCTGCTGTTTTTTGGCTGCCCCCTTTTCTCCATTATTCAGATCAGAAGGATCTGGGTCTTAATCCCTCGTATCGAGGTCATGATATAGTAGCAA
CGTTCGTTGTTGAGAGACCAGTTTCTTTGCTGCAAGACAATATCGAGCGACTCCGGACCGACATTTTTGAAGAGTTCCCTATACCTTCTATCAAAGTGGATATACTATCT
TTAAACTCGTTATCAGGATCAAACCGTACAAAAGTTGTATTCGGCATTGATCCAGATACTGATGATCCCGAAATCCCGTCAACTTATCTGAGTTTAATCAGGTCGACCTG
TGCAAGTGTAGTAACAAATCAGTCGTTCCTCCGCATTACGAAATCCATGTTTGGGGAGGCGTTTTCGTTTGAAGTACTGAAATTCCCCGGAGGAATAACGATAATCCCGC
CTCAGAGTGCATTTCTTTTGCAGAAAGTGCAAATTCTTTTCAACTTTACATTGAACTTCTCTATTCATCAGATTCAAGTACATTTCAGTGAACTGACAAGCCAATTGGAT
GCGGGATTACGACTAGCTCCCTACGAGATTTTGTATATTAAACTGTGGAATGCGGAAGGGTCGACTGTGACTGCCCCGACGATTGTCCAGTCATCTGTTCTTCTAGAAGT
TGGAAATACGCCATCGATGCAACGGTTGAAGCAGCTAGCTCAGACCATCTCTGTTTCTAATTCTAGCAACCTCGGCCTGAATAATACCGAGTTTGGAAAAGTGAAGCAAG
TTCGCCTTTCCTCGATTCTTAAACACTCGCTCAATGGTATGGATGGGAAAGGCCCCATAAGGTCACCTTCTCCAGCTCCTACACCACAGCCCCATAACTTCCATCATCCA
CCATCTCACCACCATCACCACCATCATGCTCCTCTAACTCCTGTAATTTCACCTGCCCCTGCGCCTGAGACCGGTGCACCGGAATATGGGTTGTCTGCCCCTAAAAGTGC
AGCATCGCCTAAGCGAAGTTACGAGGCAAAGCCTCCTGGTTGTCAATATAAGAGGAAATCTGGTAGGAAAGAGGGAAAGCAACCTCATTTATCCCCGCTTGCGTCGCCCA
GCATATCTCCTGTTCATTCTGCTGCATCGCCATCACAACAACATCACGTCTCTCCAACTCAGGCATCGACTCCATTGCCGAGTGTCATTTACGCTCATGTTCAACCACCA
TCGAAAAGTGACTCCAACCACCCCGAAAAATCCACGACGAGTCCATCCATTGTACCATCTCCATCTCCATCTCCATCTCCATCTCCATCTAGCGCACATCATTGGTGTAT
GATTACTCGGTGGGGATTCACACTGTCTCTAATTGTCGCATTCTACATGTAACATTAAGAAAGAAGACTAGCGGTTTTGTGATGAGCACGTGTCGATGAGGTCGAGAGAT
GCTTAGAAGTGTGAGGAAAGGAAAGCAAAGGAAAAGAGGCATTGGGTGTTGAATGAAAGTGTGTAAATATGATGTCTGATTAAGAAAGTTGTTGCAGGCAGATGCAGTTT
CAGGTCAAAGCCCACAGAGGTGGCAGGCCTTCAGAAACTTGCATATTTTCCCACTGTTTTGTGTATTATCTTATCATCTTCTTCTCCATAAAATGCAAAGAGAAAAAGAA
AAAGAAGAACCAACATAGCAAATGCTTAACTTTTTTCTTTCACATTCATTGTTTTTTCTCTCAACCCTTTTGTTTGTATGTATATATAAACAAACACATAAAACCGTCTA
TGCCTTTTGTGTGGGTCGGAACTATTTTGAAGGCACCCTTCTCCTTCGAGATAATTTGTGGCATGTCAAGTCCTGGTAAGGTTTTTTATAAACTTATGATCTTGACGACT
CCTTCCCTAGAGCC
Protein sequenceShow/hide protein sequence
MGKNDGEHPPPSAVGSAPSQGRCCSGCVSIRRLIGFRCIFILLLSVALFVSAVFWLPPFLHYSDQKDLGLNPSYRGHDIVATFVVERPVSLLQDNIERLRTDIFEEFPIP
SIKVDILSLNSLSGSNRTKVVFGIDPDTDDPEIPSTYLSLIRSTCASVVTNQSFLRITKSMFGEAFSFEVLKFPGGITIIPPQSAFLLQKVQILFNFTLNFSIHQIQVHF
SELTSQLDAGLRLAPYEILYIKLWNAEGSTVTAPTIVQSSVLLEVGNTPSMQRLKQLAQTISVSNSSNLGLNNTEFGKVKQVRLSSILKHSLNGMDGKGPIRSPSPAPTP
QPHNFHHPPSHHHHHHHAPLTPVISPAPAPETGAPEYGLSAPKSAASPKRSYEAKPPGCQYKRKSGRKEGKQPHLSPLASPSISPVHSAASPSQQHHVSPTQASTPLPSV
IYAHVQPPSKSDSNHPEKSTTSPSIVPSPSPSPSPSPSSAHHWCMITRWGFTLSLIVAFYM