; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Carg21284 (gene) of Silver-seed gourd (SMH-JMG-627) v2 genome

Gene IDCarg21284
OrganismCucurbita argyrosperma subsp. argyrosperma cv. SMH-JMG-627 (Silver-seed gourd (SMH-JMG-627) v2)
Descriptionprotein ZGRF1 isoform X4
Genome locationCarg_Chr15:9126160..9132311
RNA-Seq ExpressionCarg21284
SyntenyCarg21284
Gene Ontology termsGO:0006302 - double-strand break repair (biological process)
GO:0035861 - site of double-strand break (cellular component)
InterPro domainsIPR018838 - Domain of unknown function DUF2439


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6579564.1 hypothetical protein SDJN03_24012, partial [Cucurbita argyrosperma subsp. sororia]5.4e-29087.71Show/hide
Query:  MAEMNRWKVTYTKHLKQRRKVYHDGFLDVHRSSNKTRLYDECEKLLECRILKQEEVVCSGETLIFNSYLVDIDTPLGDHKPESGLNFQAGHDKIPEKSGV
        MAEMNRWKVTYTKHLKQRRKVYHDGFLDVHRSSNKTRLYDECEKLLECRILKQEEVVCSGETLIFNSYLVDIDTPLGDHKPESGLNFQAGHDKIPEKSGV
Subjt:  MAEMNRWKVTYTKHLKQRRKVYHDGFLDVHRSSNKTRLYDECEKLLECRILKQEEVVCSGETLIFNSYLVDIDTPLGDHKPESGLNFQAGHDKIPEKSGV

Query:  LRGKNFRNNSVCFENKASAEKNKTRPTLSPSCKIIREFKKSRLKCYGSPQSSPDTRQTEETGWFMTSWVVKSSLSFLRGEGGNCAMMLEDEKLKADERAM
        LRGKNFRNNSVCFENKASAEKNKTRPTLSPSCKIIREFKKSRLKCYGSPQSSPDTRQTEET W                                     
Subjt:  LRGKNFRNNSVCFENKASAEKNKTRPTLSPSCKIIREFKKSRLKCYGSPQSSPDTRQTEETGWFMTSWVVKSSLSFLRGEGGNCAMMLEDEKLKADERAM

Query:  ELVHKIEEPCKVIMWIKKVLYTSNITQKAKKYHDGFLKLLICGSLGRQ-----------------KDETVKSGESIAFDAHLVDIGECEREHKPPKIPLS
                         +VLYTSNITQKAKKYHDGFLKLLICGSLGRQ                 KDETVKSGESIAFDAHLVDIGECEREHKPPKIPLS
Subjt:  ELVHKIEEPCKVIMWIKKVLYTSNITQKAKKYHDGFLKLLICGSLGRQ-----------------KDETVKSGESIAFDAHLVDIGECEREHKPPKIPLS

Query:  QGSSFGDRGTRVLHEPKKCFSENEISTGKEWHVLYTSQITQKSKKYHNGIIKISSSGSHHMQVTLLNEDRTILSSKHLSLSKKLGMGEILELPKYLVEIG
        QGSSFGDRGTRVLHEPKKCFSENEISTGKEWHVLYTSQITQKSKKYHNGIIKISSSGSHHMQVTLLNEDRTILSSKHLSLSKKLGMGEILELPKYLVEIG
Subjt:  QGSSFGDRGTRVLHEPKKCFSENEISTGKEWHVLYTSQITQKSKKYHNGIIKISSSGSHHMQVTLLNEDRTILSSKHLSLSKKLGMGEILELPKYLVEIG

Query:  EACGNVKVEIANRDFDIRKDTSFCISGEDEKGSDRATMKKSLRDAHEILSILQRPKARVSLSSGHSDKNICVSVSSSKVPEPSLAAEALDLPMDDRSHKK
        EACGNVKVEIANRDFDIRKDTSFCISGEDEKGSDRATMKKSLRDAHEILSILQRPKARVSLSSGHSDKNICVSVSSSKVPEPSLAAEALDLPMDDRSHKK
Subjt:  EACGNVKVEIANRDFDIRKDTSFCISGEDEKGSDRATMKKSLRDAHEILSILQRPKARVSLSSGHSDKNICVSVSSSKVPEPSLAAEALDLPMDDRSHKK

Query:  PSENLDTRDSTKNAENNQSIALTPSTLTEELEIGHSNQTEHVEAESSSLRDTISWTQGTSQFAACELVNDEGKMCEEVTYERETGTCPSFDLGI
        PSENLDTRDSTKNAENNQSIALTPSTLTEELEIGHSNQTEHVEAESSSLRDTISWTQGTSQFAACELVNDEGKMCEEVTYERETGTCPSFDLGI
Subjt:  PSENLDTRDSTKNAENNQSIALTPSTLTEELEIGHSNQTEHVEAESSSLRDTISWTQGTSQFAACELVNDEGKMCEEVTYERETGTCPSFDLGI

KAG7017022.1 hypothetical protein SDJN02_22133 [Cucurbita argyrosperma subsp. argyrosperma]0.0e+00100Show/hide
Query:  MAEMNRWKVTYTKHLKQRRKVYHDGFLDVHRSSNKTRLYDECEKLLECRILKQEEVVCSGETLIFNSYLVDIDTPLGDHKPESGLNFQAGHDKIPEKSGV
        MAEMNRWKVTYTKHLKQRRKVYHDGFLDVHRSSNKTRLYDECEKLLECRILKQEEVVCSGETLIFNSYLVDIDTPLGDHKPESGLNFQAGHDKIPEKSGV
Subjt:  MAEMNRWKVTYTKHLKQRRKVYHDGFLDVHRSSNKTRLYDECEKLLECRILKQEEVVCSGETLIFNSYLVDIDTPLGDHKPESGLNFQAGHDKIPEKSGV

Query:  LRGKNFRNNSVCFENKASAEKNKTRPTLSPSCKIIREFKKSRLKCYGSPQSSPDTRQTEETGWFMTSWVVKSSLSFLRGEGGNCAMMLEDEKLKADERAM
        LRGKNFRNNSVCFENKASAEKNKTRPTLSPSCKIIREFKKSRLKCYGSPQSSPDTRQTEETGWFMTSWVVKSSLSFLRGEGGNCAMMLEDEKLKADERAM
Subjt:  LRGKNFRNNSVCFENKASAEKNKTRPTLSPSCKIIREFKKSRLKCYGSPQSSPDTRQTEETGWFMTSWVVKSSLSFLRGEGGNCAMMLEDEKLKADERAM

Query:  ELVHKIEEPCKVIMWIKKVLYTSNITQKAKKYHDGFLKLLICGSLGRQKDETVKSGESIAFDAHLVDIGECEREHKPPKIPLSQGSSFGDRGTRVLHEPK
        ELVHKIEEPCKVIMWIKKVLYTSNITQKAKKYHDGFLKLLICGSLGRQKDETVKSGESIAFDAHLVDIGECEREHKPPKIPLSQGSSFGDRGTRVLHEPK
Subjt:  ELVHKIEEPCKVIMWIKKVLYTSNITQKAKKYHDGFLKLLICGSLGRQKDETVKSGESIAFDAHLVDIGECEREHKPPKIPLSQGSSFGDRGTRVLHEPK

Query:  KCFSENEISTGKEWHVLYTSQITQKSKKYHNGIIKISSSGSHHMQVTLLNEDRTILSSKHLSLSKKLGMGEILELPKYLVEIGEACGNVKVEIANRDFDI
        KCFSENEISTGKEWHVLYTSQITQKSKKYHNGIIKISSSGSHHMQVTLLNEDRTILSSKHLSLSKKLGMGEILELPKYLVEIGEACGNVKVEIANRDFDI
Subjt:  KCFSENEISTGKEWHVLYTSQITQKSKKYHNGIIKISSSGSHHMQVTLLNEDRTILSSKHLSLSKKLGMGEILELPKYLVEIGEACGNVKVEIANRDFDI

Query:  RKDTSFCISGEDEKGSDRATMKKSLRDAHEILSILQRPKARVSLSSGHSDKNICVSVSSSKVPEPSLAAEALDLPMDDRSHKKPSENLDTRDSTKNAENN
        RKDTSFCISGEDEKGSDRATMKKSLRDAHEILSILQRPKARVSLSSGHSDKNICVSVSSSKVPEPSLAAEALDLPMDDRSHKKPSENLDTRDSTKNAENN
Subjt:  RKDTSFCISGEDEKGSDRATMKKSLRDAHEILSILQRPKARVSLSSGHSDKNICVSVSSSKVPEPSLAAEALDLPMDDRSHKKPSENLDTRDSTKNAENN

Query:  QSIALTPSTLTEELEIGHSNQTEHVEAESSSLRDTISWTQGTSQFAACELVNDEGKMCEEVTYERETGTCPSFDLGI
        QSIALTPSTLTEELEIGHSNQTEHVEAESSSLRDTISWTQGTSQFAACELVNDEGKMCEEVTYERETGTCPSFDLGI
Subjt:  QSIALTPSTLTEELEIGHSNQTEHVEAESSSLRDTISWTQGTSQFAACELVNDEGKMCEEVTYERETGTCPSFDLGI

XP_022928768.1 uncharacterized protein LOC111435594 isoform X1 [Cucurbita moschata]5.2e-27786.03Show/hide
Query:  MAEMNRWKVTYTKHLKQRRKVYHDGFLDVHRSSNKTRLYDECEKLLECRILKQEEVVCSGETLIFNSYLVDIDTPLGDHKPESGLNFQAGHDKIPEKSGV
        MAEMNRWKVTYTKHLKQRRKVYHDGFLDVHRSSNKT LYDECEKLLECRILKQEEVVCSGETLIFNSYLVDIDTPLGDHKPESGLNFQAGHDKIPEKSGV
Subjt:  MAEMNRWKVTYTKHLKQRRKVYHDGFLDVHRSSNKTRLYDECEKLLECRILKQEEVVCSGETLIFNSYLVDIDTPLGDHKPESGLNFQAGHDKIPEKSGV

Query:  LRGKNFRNNSVCFENKASAEKNKTRPTLSPSCKIIREFKKSRLKCYGSPQSSPDTRQTEETGWFMTSWVVKSSLSFLRGEGGNCAMMLEDEKLKADERAM
        LRGKNFRNNSVCFENKASAEKNKTRPTLSPSCKIIREFKKSRLKCYGSPQSSPDTRQTEET W                                     
Subjt:  LRGKNFRNNSVCFENKASAEKNKTRPTLSPSCKIIREFKKSRLKCYGSPQSSPDTRQTEETGWFMTSWVVKSSLSFLRGEGGNCAMMLEDEKLKADERAM

Query:  ELVHKIEEPCKVIMWIKKVLYTSNITQKAKKYHDGFLKLLICGSLGRQ-----------------KDETVKSGESIAFDAHLVDIGECEREHKPPKIPLS
                         +VL+TSNITQKAKKYHDGFLKLLICGSLGRQ                 KDETVKSGESIAFDAHLVDIGECEREHKPPKIPLS
Subjt:  ELVHKIEEPCKVIMWIKKVLYTSNITQKAKKYHDGFLKLLICGSLGRQ-----------------KDETVKSGESIAFDAHLVDIGECEREHKPPKIPLS

Query:  QGSSFGDRGTRVLHEPKKCFSENEISTGKEWHVLYTSQITQKSKKYHNGIIKISSSGSHHMQVTLLNEDRTILSSKHLSLSKKLGMGEILELPKYLVEIG
        QGSSFGDRGTRVL+EPKKCFSENEISTGKEWHVLYTSQITQKSKKYHNGIIKISSSGSHHMQVTLLNEDRTILSSKHLSLSKKLGMGEILELPKYLVEIG
Subjt:  QGSSFGDRGTRVLHEPKKCFSENEISTGKEWHVLYTSQITQKSKKYHNGIIKISSSGSHHMQVTLLNEDRTILSSKHLSLSKKLGMGEILELPKYLVEIG

Query:  EACGNVKVEIANRDFDIRKDTSFCISGEDEKGSDRATMKKSLRDAHEILSILQRPKARVSLSSGHSDKNICVSVSSSKVPEPSLAAEALDLPMDDRSHKK
        EACGNVKVEIANRDFDIRKDTSFCISGEDEKGSDRATMKKSLRDAHEILSILQRPKARVSLSSGHSDKNICVSV SSKVPEPSLAAEALDLPMDDRSHKK
Subjt:  EACGNVKVEIANRDFDIRKDTSFCISGEDEKGSDRATMKKSLRDAHEILSILQRPKARVSLSSGHSDKNICVSVSSSKVPEPSLAAEALDLPMDDRSHKK

Query:  PSENLDTRDSTKNAENNQSIALTPSTLTEELEIGHSN---QTEHVEAESSSLRDTISWTQGTSQFAACELVNDEGKMCEEVTYERET
        PSENLDTRDSTKNAENNQSIALTPSTLTEELEIGHSN   QTEHVEAESSSLRDTIS TQGTSQFAACELVNDEGKMCEE+TYERET
Subjt:  PSENLDTRDSTKNAENNQSIALTPSTLTEELEIGHSN---QTEHVEAESSSLRDTISWTQGTSQFAACELVNDEGKMCEEVTYERET

XP_022928770.1 uncharacterized protein LOC111435594 isoform X2 [Cucurbita moschata]5.8e-28486.26Show/hide
Query:  MAEMNRWKVTYTKHLKQRRKVYHDGFLDVHRSSNKTRLYDECEKLLECRILKQEEVVCSGETLIFNSYLVDIDTPLGDHKPESGLNFQAGHDKIPEKSGV
        MAEMNRWKVTYTKHLKQRRKVYHDGFLDVHRSSNKT LYDECEKLLECRILKQEEVVCSGETLIFNSYLVDIDTPLGDHKPESGLNFQAGHDKIPEKSGV
Subjt:  MAEMNRWKVTYTKHLKQRRKVYHDGFLDVHRSSNKTRLYDECEKLLECRILKQEEVVCSGETLIFNSYLVDIDTPLGDHKPESGLNFQAGHDKIPEKSGV

Query:  LRGKNFRNNSVCFENKASAEKNKTRPTLSPSCKIIREFKKSRLKCYGSPQSSPDTRQTEETGWFMTSWVVKSSLSFLRGEGGNCAMMLEDEKLKADERAM
        LRGKNFRNNSVCFENKASAEKNKTRPTLSPSCKIIREFKKSRLKCYGSPQSSPDTRQTEET W                                     
Subjt:  LRGKNFRNNSVCFENKASAEKNKTRPTLSPSCKIIREFKKSRLKCYGSPQSSPDTRQTEETGWFMTSWVVKSSLSFLRGEGGNCAMMLEDEKLKADERAM

Query:  ELVHKIEEPCKVIMWIKKVLYTSNITQKAKKYHDGFLKLLICGSLGRQ-----------------KDETVKSGESIAFDAHLVDIGECEREHKPPKIPLS
                         +VL+TSNITQKAKKYHDGFLKLLICGSLGRQ                 KDETVKSGESIAFDAHLVDIGECEREHKPPKIPLS
Subjt:  ELVHKIEEPCKVIMWIKKVLYTSNITQKAKKYHDGFLKLLICGSLGRQ-----------------KDETVKSGESIAFDAHLVDIGECEREHKPPKIPLS

Query:  QGSSFGDRGTRVLHEPKKCFSENEISTGKEWHVLYTSQITQKSKKYHNGIIKISSSGSHHMQVTLLNEDRTILSSKHLSLSKKLGMGEILELPKYLVEIG
        QGSSFGDRGTRVL+EPKKCFSENEISTGKEWHVLYTSQITQKSKKYHNGIIKISSSGSHHMQVTLLNEDRTILSSKHLSLSKKLGMGEILELPKYLVEIG
Subjt:  QGSSFGDRGTRVLHEPKKCFSENEISTGKEWHVLYTSQITQKSKKYHNGIIKISSSGSHHMQVTLLNEDRTILSSKHLSLSKKLGMGEILELPKYLVEIG

Query:  EACGNVKVEIANRDFDIRKDTSFCISGEDEKGSDRATMKKSLRDAHEILSILQRPKARVSLSSGHSDKNICVSVSSSKVPEPSLAAEALDLPMDDRSHKK
        EACGNVKVEIANRDFDIRKDTSFCISGEDEKGSDRATMKKSLRDAHEILSILQRPKARVSLSSGHSDKNICVSV SSKVPEPSLAAEALDLPMDDRSHKK
Subjt:  EACGNVKVEIANRDFDIRKDTSFCISGEDEKGSDRATMKKSLRDAHEILSILQRPKARVSLSSGHSDKNICVSVSSSKVPEPSLAAEALDLPMDDRSHKK

Query:  PSENLDTRDSTKNAENNQSIALTPSTLTEELEIGHSN---QTEHVEAESSSLRDTISWTQGTSQFAACELVNDEGKMCEEVTYERETGTCPSFDLGI
        PSENLDTRDSTKNAENNQSIALTPSTLTEELEIGHSN   QTEHVEAESSSLRDTIS TQGTSQFAACELVNDEGKMCEE+TYERETGTCPSFDLGI
Subjt:  PSENLDTRDSTKNAENNQSIALTPSTLTEELEIGHSN---QTEHVEAESSSLRDTISWTQGTSQFAACELVNDEGKMCEEVTYERETGTCPSFDLGI

XP_022928771.1 uncharacterized protein LOC111435594 isoform X3 [Cucurbita moschata]1.2e-27886.47Show/hide
Query:  MAEMNRWKVTYTKHLKQRRKVYHDGFLDVHRSSNKTRLYDECEKLLECRILKQEEVVCSGETLIFNSYLVDIDTPLGDHKPESGLNFQAGHDKIPEKSGV
        MAEMNRWKVTYTKHLKQRRKVYHDGFLDVHRSSNKT LYDECEKLLECRILKQEEVVCSGETLIFNSYLVDIDTPLGDHKPESGLNFQAGHDKIPEKSGV
Subjt:  MAEMNRWKVTYTKHLKQRRKVYHDGFLDVHRSSNKTRLYDECEKLLECRILKQEEVVCSGETLIFNSYLVDIDTPLGDHKPESGLNFQAGHDKIPEKSGV

Query:  LRGKNFRNNSVCFENKASAEKNKTRPTLSPSCKIIREFKKSRLKCYGSPQSSPDTRQTEETGWFMTSWVVKSSLSFLRGEGGNCAMMLEDEKLKADERAM
        LRGKNFRNNSVCFENKASAEKNKTRPTLSPSCKIIREFKKSRLKCYGSPQSSPDTRQTEET W                                     
Subjt:  LRGKNFRNNSVCFENKASAEKNKTRPTLSPSCKIIREFKKSRLKCYGSPQSSPDTRQTEETGWFMTSWVVKSSLSFLRGEGGNCAMMLEDEKLKADERAM

Query:  ELVHKIEEPCKVIMWIKKVLYTSNITQKAKKYHDGFLKLLICGSLGRQ-----------------KDETVKSGESIAFDAHLVDIGECEREHKPPKIPLS
                         +VL+TSNITQKAKKYHDGFLKLLICGSLGRQ                 KDETVKSGESIAFDAHLVDIGECEREHKPPKIPLS
Subjt:  ELVHKIEEPCKVIMWIKKVLYTSNITQKAKKYHDGFLKLLICGSLGRQ-----------------KDETVKSGESIAFDAHLVDIGECEREHKPPKIPLS

Query:  QGSSFGDRGTRVLHEPKKCFSENEISTGKEWHVLYTSQITQKSKKYHNGIIKISSSGSHHMQVTLLNEDRTILSSKHLSLSKKLGMGEILELPKYLVEIG
        QGSSFGDRGTRVL+EPKKCFSENEISTGKEWHVLYTSQITQKSKKYHNGIIKISSSGSHHMQVTLLNEDRTILSSKHLSLSKKLGMGEILELPKYLVEIG
Subjt:  QGSSFGDRGTRVLHEPKKCFSENEISTGKEWHVLYTSQITQKSKKYHNGIIKISSSGSHHMQVTLLNEDRTILSSKHLSLSKKLGMGEILELPKYLVEIG

Query:  EACGNVKVEIANRDFDIRKDTSFCISGEDEKGSDRATMKKSLRDAHEILSILQRPKARVSLSSGHSDKNICVSVSSSKVPEPSLAAEALDLPMDDRSHKK
        EACGNVKVEIANRDFDIRKDTSFCISGEDEKGSDRATMKKSLRDAHEILSILQRPKARVSLSSGHSDKNICVSV SSKVPEPSLAAEALDLPMDDRSHKK
Subjt:  EACGNVKVEIANRDFDIRKDTSFCISGEDEKGSDRATMKKSLRDAHEILSILQRPKARVSLSSGHSDKNICVSVSSSKVPEPSLAAEALDLPMDDRSHKK

Query:  PSENLDTRDSTKNAENNQSIALTPSTLTEELEIGHSNQTEHVEAESSSLRDTISWTQGTSQFAACELVNDEGKMCEEVTYERET
        PSENLDTRDSTKNAENNQSIALTPSTLTEELEIGHSNQTEHVEAESSSLRDTIS TQGTSQFAACELVNDEGKMCEE+TYERET
Subjt:  PSENLDTRDSTKNAENNQSIALTPSTLTEELEIGHSNQTEHVEAESSSLRDTISWTQGTSQFAACELVNDEGKMCEEVTYERET

TrEMBL top hitse value%identityAlignment
A0A6J1EL85 uncharacterized protein LOC111435594 isoform X36.0e-27986.47Show/hide
Query:  MAEMNRWKVTYTKHLKQRRKVYHDGFLDVHRSSNKTRLYDECEKLLECRILKQEEVVCSGETLIFNSYLVDIDTPLGDHKPESGLNFQAGHDKIPEKSGV
        MAEMNRWKVTYTKHLKQRRKVYHDGFLDVHRSSNKT LYDECEKLLECRILKQEEVVCSGETLIFNSYLVDIDTPLGDHKPESGLNFQAGHDKIPEKSGV
Subjt:  MAEMNRWKVTYTKHLKQRRKVYHDGFLDVHRSSNKTRLYDECEKLLECRILKQEEVVCSGETLIFNSYLVDIDTPLGDHKPESGLNFQAGHDKIPEKSGV

Query:  LRGKNFRNNSVCFENKASAEKNKTRPTLSPSCKIIREFKKSRLKCYGSPQSSPDTRQTEETGWFMTSWVVKSSLSFLRGEGGNCAMMLEDEKLKADERAM
        LRGKNFRNNSVCFENKASAEKNKTRPTLSPSCKIIREFKKSRLKCYGSPQSSPDTRQTEET W                                     
Subjt:  LRGKNFRNNSVCFENKASAEKNKTRPTLSPSCKIIREFKKSRLKCYGSPQSSPDTRQTEETGWFMTSWVVKSSLSFLRGEGGNCAMMLEDEKLKADERAM

Query:  ELVHKIEEPCKVIMWIKKVLYTSNITQKAKKYHDGFLKLLICGSLGRQ-----------------KDETVKSGESIAFDAHLVDIGECEREHKPPKIPLS
                         +VL+TSNITQKAKKYHDGFLKLLICGSLGRQ                 KDETVKSGESIAFDAHLVDIGECEREHKPPKIPLS
Subjt:  ELVHKIEEPCKVIMWIKKVLYTSNITQKAKKYHDGFLKLLICGSLGRQ-----------------KDETVKSGESIAFDAHLVDIGECEREHKPPKIPLS

Query:  QGSSFGDRGTRVLHEPKKCFSENEISTGKEWHVLYTSQITQKSKKYHNGIIKISSSGSHHMQVTLLNEDRTILSSKHLSLSKKLGMGEILELPKYLVEIG
        QGSSFGDRGTRVL+EPKKCFSENEISTGKEWHVLYTSQITQKSKKYHNGIIKISSSGSHHMQVTLLNEDRTILSSKHLSLSKKLGMGEILELPKYLVEIG
Subjt:  QGSSFGDRGTRVLHEPKKCFSENEISTGKEWHVLYTSQITQKSKKYHNGIIKISSSGSHHMQVTLLNEDRTILSSKHLSLSKKLGMGEILELPKYLVEIG

Query:  EACGNVKVEIANRDFDIRKDTSFCISGEDEKGSDRATMKKSLRDAHEILSILQRPKARVSLSSGHSDKNICVSVSSSKVPEPSLAAEALDLPMDDRSHKK
        EACGNVKVEIANRDFDIRKDTSFCISGEDEKGSDRATMKKSLRDAHEILSILQRPKARVSLSSGHSDKNICVSV SSKVPEPSLAAEALDLPMDDRSHKK
Subjt:  EACGNVKVEIANRDFDIRKDTSFCISGEDEKGSDRATMKKSLRDAHEILSILQRPKARVSLSSGHSDKNICVSVSSSKVPEPSLAAEALDLPMDDRSHKK

Query:  PSENLDTRDSTKNAENNQSIALTPSTLTEELEIGHSNQTEHVEAESSSLRDTISWTQGTSQFAACELVNDEGKMCEEVTYERET
        PSENLDTRDSTKNAENNQSIALTPSTLTEELEIGHSNQTEHVEAESSSLRDTIS TQGTSQFAACELVNDEGKMCEE+TYERET
Subjt:  PSENLDTRDSTKNAENNQSIALTPSTLTEELEIGHSNQTEHVEAESSSLRDTISWTQGTSQFAACELVNDEGKMCEEVTYERET

A0A6J1EQ22 uncharacterized protein LOC111435594 isoform X12.5e-27786.03Show/hide
Query:  MAEMNRWKVTYTKHLKQRRKVYHDGFLDVHRSSNKTRLYDECEKLLECRILKQEEVVCSGETLIFNSYLVDIDTPLGDHKPESGLNFQAGHDKIPEKSGV
        MAEMNRWKVTYTKHLKQRRKVYHDGFLDVHRSSNKT LYDECEKLLECRILKQEEVVCSGETLIFNSYLVDIDTPLGDHKPESGLNFQAGHDKIPEKSGV
Subjt:  MAEMNRWKVTYTKHLKQRRKVYHDGFLDVHRSSNKTRLYDECEKLLECRILKQEEVVCSGETLIFNSYLVDIDTPLGDHKPESGLNFQAGHDKIPEKSGV

Query:  LRGKNFRNNSVCFENKASAEKNKTRPTLSPSCKIIREFKKSRLKCYGSPQSSPDTRQTEETGWFMTSWVVKSSLSFLRGEGGNCAMMLEDEKLKADERAM
        LRGKNFRNNSVCFENKASAEKNKTRPTLSPSCKIIREFKKSRLKCYGSPQSSPDTRQTEET W                                     
Subjt:  LRGKNFRNNSVCFENKASAEKNKTRPTLSPSCKIIREFKKSRLKCYGSPQSSPDTRQTEETGWFMTSWVVKSSLSFLRGEGGNCAMMLEDEKLKADERAM

Query:  ELVHKIEEPCKVIMWIKKVLYTSNITQKAKKYHDGFLKLLICGSLGRQ-----------------KDETVKSGESIAFDAHLVDIGECEREHKPPKIPLS
                         +VL+TSNITQKAKKYHDGFLKLLICGSLGRQ                 KDETVKSGESIAFDAHLVDIGECEREHKPPKIPLS
Subjt:  ELVHKIEEPCKVIMWIKKVLYTSNITQKAKKYHDGFLKLLICGSLGRQ-----------------KDETVKSGESIAFDAHLVDIGECEREHKPPKIPLS

Query:  QGSSFGDRGTRVLHEPKKCFSENEISTGKEWHVLYTSQITQKSKKYHNGIIKISSSGSHHMQVTLLNEDRTILSSKHLSLSKKLGMGEILELPKYLVEIG
        QGSSFGDRGTRVL+EPKKCFSENEISTGKEWHVLYTSQITQKSKKYHNGIIKISSSGSHHMQVTLLNEDRTILSSKHLSLSKKLGMGEILELPKYLVEIG
Subjt:  QGSSFGDRGTRVLHEPKKCFSENEISTGKEWHVLYTSQITQKSKKYHNGIIKISSSGSHHMQVTLLNEDRTILSSKHLSLSKKLGMGEILELPKYLVEIG

Query:  EACGNVKVEIANRDFDIRKDTSFCISGEDEKGSDRATMKKSLRDAHEILSILQRPKARVSLSSGHSDKNICVSVSSSKVPEPSLAAEALDLPMDDRSHKK
        EACGNVKVEIANRDFDIRKDTSFCISGEDEKGSDRATMKKSLRDAHEILSILQRPKARVSLSSGHSDKNICVSV SSKVPEPSLAAEALDLPMDDRSHKK
Subjt:  EACGNVKVEIANRDFDIRKDTSFCISGEDEKGSDRATMKKSLRDAHEILSILQRPKARVSLSSGHSDKNICVSVSSSKVPEPSLAAEALDLPMDDRSHKK

Query:  PSENLDTRDSTKNAENNQSIALTPSTLTEELEIGHSN---QTEHVEAESSSLRDTISWTQGTSQFAACELVNDEGKMCEEVTYERET
        PSENLDTRDSTKNAENNQSIALTPSTLTEELEIGHSN   QTEHVEAESSSLRDTIS TQGTSQFAACELVNDEGKMCEE+TYERET
Subjt:  PSENLDTRDSTKNAENNQSIALTPSTLTEELEIGHSN---QTEHVEAESSSLRDTISWTQGTSQFAACELVNDEGKMCEEVTYERET

A0A6J1ESH9 uncharacterized protein LOC111435594 isoform X22.8e-28486.26Show/hide
Query:  MAEMNRWKVTYTKHLKQRRKVYHDGFLDVHRSSNKTRLYDECEKLLECRILKQEEVVCSGETLIFNSYLVDIDTPLGDHKPESGLNFQAGHDKIPEKSGV
        MAEMNRWKVTYTKHLKQRRKVYHDGFLDVHRSSNKT LYDECEKLLECRILKQEEVVCSGETLIFNSYLVDIDTPLGDHKPESGLNFQAGHDKIPEKSGV
Subjt:  MAEMNRWKVTYTKHLKQRRKVYHDGFLDVHRSSNKTRLYDECEKLLECRILKQEEVVCSGETLIFNSYLVDIDTPLGDHKPESGLNFQAGHDKIPEKSGV

Query:  LRGKNFRNNSVCFENKASAEKNKTRPTLSPSCKIIREFKKSRLKCYGSPQSSPDTRQTEETGWFMTSWVVKSSLSFLRGEGGNCAMMLEDEKLKADERAM
        LRGKNFRNNSVCFENKASAEKNKTRPTLSPSCKIIREFKKSRLKCYGSPQSSPDTRQTEET W                                     
Subjt:  LRGKNFRNNSVCFENKASAEKNKTRPTLSPSCKIIREFKKSRLKCYGSPQSSPDTRQTEETGWFMTSWVVKSSLSFLRGEGGNCAMMLEDEKLKADERAM

Query:  ELVHKIEEPCKVIMWIKKVLYTSNITQKAKKYHDGFLKLLICGSLGRQ-----------------KDETVKSGESIAFDAHLVDIGECEREHKPPKIPLS
                         +VL+TSNITQKAKKYHDGFLKLLICGSLGRQ                 KDETVKSGESIAFDAHLVDIGECEREHKPPKIPLS
Subjt:  ELVHKIEEPCKVIMWIKKVLYTSNITQKAKKYHDGFLKLLICGSLGRQ-----------------KDETVKSGESIAFDAHLVDIGECEREHKPPKIPLS

Query:  QGSSFGDRGTRVLHEPKKCFSENEISTGKEWHVLYTSQITQKSKKYHNGIIKISSSGSHHMQVTLLNEDRTILSSKHLSLSKKLGMGEILELPKYLVEIG
        QGSSFGDRGTRVL+EPKKCFSENEISTGKEWHVLYTSQITQKSKKYHNGIIKISSSGSHHMQVTLLNEDRTILSSKHLSLSKKLGMGEILELPKYLVEIG
Subjt:  QGSSFGDRGTRVLHEPKKCFSENEISTGKEWHVLYTSQITQKSKKYHNGIIKISSSGSHHMQVTLLNEDRTILSSKHLSLSKKLGMGEILELPKYLVEIG

Query:  EACGNVKVEIANRDFDIRKDTSFCISGEDEKGSDRATMKKSLRDAHEILSILQRPKARVSLSSGHSDKNICVSVSSSKVPEPSLAAEALDLPMDDRSHKK
        EACGNVKVEIANRDFDIRKDTSFCISGEDEKGSDRATMKKSLRDAHEILSILQRPKARVSLSSGHSDKNICVSV SSKVPEPSLAAEALDLPMDDRSHKK
Subjt:  EACGNVKVEIANRDFDIRKDTSFCISGEDEKGSDRATMKKSLRDAHEILSILQRPKARVSLSSGHSDKNICVSVSSSKVPEPSLAAEALDLPMDDRSHKK

Query:  PSENLDTRDSTKNAENNQSIALTPSTLTEELEIGHSN---QTEHVEAESSSLRDTISWTQGTSQFAACELVNDEGKMCEEVTYERETGTCPSFDLGI
        PSENLDTRDSTKNAENNQSIALTPSTLTEELEIGHSN   QTEHVEAESSSLRDTIS TQGTSQFAACELVNDEGKMCEE+TYERETGTCPSFDLGI
Subjt:  PSENLDTRDSTKNAENNQSIALTPSTLTEELEIGHSN---QTEHVEAESSSLRDTISWTQGTSQFAACELVNDEGKMCEEVTYERETGTCPSFDLGI

A0A6J1HXZ5 protein ZGRF1 isoform X48.4e-27383.16Show/hide
Query:  MAEMNRWKVTYTKHLKQRRKVYHDGFLDVHRSSNKTRLYDECEKLLECRILKQEEVVCSGETLIFNSYLVDIDTPLGDHKPESGLNFQAGHDKIPEKSGV
        MAEMNRWKVTYTKHLKQRRKVYHDGFLDVHRSSNKT LYDECEKLLECRILKQEEVVCSGETLIFNSYLV+IDTPLGD+KPESGLNFQAGHD+I EKSGV
Subjt:  MAEMNRWKVTYTKHLKQRRKVYHDGFLDVHRSSNKTRLYDECEKLLECRILKQEEVVCSGETLIFNSYLVDIDTPLGDHKPESGLNFQAGHDKIPEKSGV

Query:  LRGKNFRNNSVCFENKASAEKNKTRPTLSPSCKIIREFKKSRLKCYGSPQSSPDTRQTEETGWFMTSWVVKSSLSFLRGEGGNCAMMLEDEKLKADERAM
        LRGKNFRNNSVCFENKASAEKNKTRPTLSPSCKIIREFKKSRLKCYGSPQ+SPDTRQTEET W                                     
Subjt:  LRGKNFRNNSVCFENKASAEKNKTRPTLSPSCKIIREFKKSRLKCYGSPQSSPDTRQTEETGWFMTSWVVKSSLSFLRGEGGNCAMMLEDEKLKADERAM

Query:  ELVHKIEEPCKVIMWIKKVLYTSNITQKAKKYHDGFLKLLICGSLGRQ-----------------KDETVKSGESIAFDAHLVDIGECEREHKPPKIPLS
                         +VLYTSNITQKAKKYHDGFLKLLICGSLGRQ                 KDE VKSGESIAFDAHLVDIGECEREHKPPKIP+S
Subjt:  ELVHKIEEPCKVIMWIKKVLYTSNITQKAKKYHDGFLKLLICGSLGRQ-----------------KDETVKSGESIAFDAHLVDIGECEREHKPPKIPLS

Query:  QGSSFGDRGTRVLHEPKKCFSENEISTGKEWHVLYTSQITQKSKKYHNGIIKISSSGSHHMQVTLLNEDRTILSSKHLSLSKKLGMGEILELPKYLVEIG
        QGSSFGDRGTRVLHEPKKCFSENEISTGKEWHVLYTSQITQKSKKYHNGIIKISSSGSHHMQVTLLNEDR ILSSKH+SLSKKLGMGEILELPKYLVEIG
Subjt:  QGSSFGDRGTRVLHEPKKCFSENEISTGKEWHVLYTSQITQKSKKYHNGIIKISSSGSHHMQVTLLNEDRTILSSKHLSLSKKLGMGEILELPKYLVEIG

Query:  EACGNVKVEIANRDFDIRKDTSFCISGEDEKGSDRATMKKSLRDAHEILSILQRPKARVSLSSGHSDKNICVSVSSSKVPEPSLAAEALDLPMDDRSHKK
        EAC NVKVE+ANRDFDIRKD SFCISGEDEKGS RATMKKSLRDAHEILSILQRPKARVSLSSG SDKNI VSVSSSKVPEPSLA EALDLPMD+RSH+K
Subjt:  EACGNVKVEIANRDFDIRKDTSFCISGEDEKGSDRATMKKSLRDAHEILSILQRPKARVSLSSGHSDKNICVSVSSSKVPEPSLAAEALDLPMDDRSHKK

Query:  PSENLDTRDSTKNAENNQSIALTPSTLTEELEIGHSNQTEHVEAESSSLRDTISWTQGTSQFAACELVNDEGKMCEEVTYERETGTCPSFDLGI
        PSENLDTR+STKNAE+NQS ALT STLT ELEIGHSNQTE+VEAESSSLRDTISWTQGTSQFAAC+LVNDEGKMCEE+TYERETGTCPSFDLGI
Subjt:  PSENLDTRDSTKNAENNQSIALTPSTLTEELEIGHSNQTEHVEAESSSLRDTISWTQGTSQFAACELVNDEGKMCEEVTYERETGTCPSFDLGI

A0A6J1I1P0 protein ZGRF1 isoform X23.5e-27182.75Show/hide
Query:  MAEMNRWKVTYTKHLKQRRKVYHDGFLDVHRSSNKTRLYDECEKLLECRILKQEEVVCSGETLIFNSYLVDIDTPLGDHKPESGLNFQAGHDKIPEKSGV
        MAEMNRWKVTYTKHLKQRRKVYHDGFLDVHRSSNKT LYDECEKLLECRILKQEEVVCSGETLIFNSYLV+IDTPLGD+KPESGLNFQAGHD+I EKSGV
Subjt:  MAEMNRWKVTYTKHLKQRRKVYHDGFLDVHRSSNKTRLYDECEKLLECRILKQEEVVCSGETLIFNSYLVDIDTPLGDHKPESGLNFQAGHDKIPEKSGV

Query:  LRGKNFRNNSVCFENKASAEKNKTRPTLSPSCKIIREFKKSRLKCYGSPQSSPDTRQTEETGWFMTSWVVKSSLSFLRGEGGNCAMMLEDEKLKADERAM
        LRGKNFRNNSVCFENKASAEKNKTRPTLSPSCKIIREFKKSRLKCYGSPQ+SPDTRQTEET W                                     
Subjt:  LRGKNFRNNSVCFENKASAEKNKTRPTLSPSCKIIREFKKSRLKCYGSPQSSPDTRQTEETGWFMTSWVVKSSLSFLRGEGGNCAMMLEDEKLKADERAM

Query:  ELVHKIEEPCKVIMWIKKVLYTSNITQKAKKYHDGFLKLLICGSLGRQ-----------------KDETVKSGESIAFDAHLVDIGECEREHKPPKIPLS
                         +VLYTSNITQKAKKYHDGFLKLLICGSLGRQ                 KDE VKSGESIAFDAHLVDIGECEREHKPPKIP+S
Subjt:  ELVHKIEEPCKVIMWIKKVLYTSNITQKAKKYHDGFLKLLICGSLGRQ-----------------KDETVKSGESIAFDAHLVDIGECEREHKPPKIPLS

Query:  QGSSFGDRGTRVLHEPKKCFSENEISTGKEWHVLYTSQITQKSKKYHNGIIKISSSGSHHMQVTLLNEDRTILSSKHLSLSKKLGMGEILELPKYLVEIG
        QGSSFGDRGTRVLHEPKKCFSENEISTGKEWHVLYTSQITQKSKKYHNGIIKISSSGSHHMQVTLLNEDR ILSSKH+SLSKKLGMGEILELPKYLVEIG
Subjt:  QGSSFGDRGTRVLHEPKKCFSENEISTGKEWHVLYTSQITQKSKKYHNGIIKISSSGSHHMQVTLLNEDRTILSSKHLSLSKKLGMGEILELPKYLVEIG

Query:  EACGNVKVEIANRDFDIRKDTSFCISGEDEKGSDRATMKKSLRDAHEILSILQRPKARVSLSSGHSDKNICVSVSSSKVPEPSLAAEALDLPMDDRSHKK
        EAC NVKVE+ANRDFDIRKD SFCISGEDEKGS RATMKKSLRDAHEILSILQRPKARVSLSSG SDKNI VSVSSSKVPEPSLA EALDLPMD+RSH+K
Subjt:  EACGNVKVEIANRDFDIRKDTSFCISGEDEKGSDRATMKKSLRDAHEILSILQRPKARVSLSSGHSDKNICVSVSSSKVPEPSLAAEALDLPMDDRSHKK

Query:  PSENLDTRDSTKNAENNQSIALTPSTLTEELEIGHSN---QTEHVEAESSSLRDTISWTQGTSQFAACELVNDEGKMCEEVTYERETGTCPSFDLGI
        PSENLDTR+STKNAE+NQS ALT STLT ELEIGHSN   QTE+VEAESSSLRDTISWTQGTSQFAAC+LVNDEGKMCEE+TYERETGTCPSFDLGI
Subjt:  PSENLDTRDSTKNAENNQSIALTPSTLTEELEIGHSN---QTEHVEAESSSLRDTISWTQGTSQFAACELVNDEGKMCEEVTYERETGTCPSFDLGI

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT4G10890.1 unknown protein3.3e-2744.79Show/hide
Query:  MAEMNRWKVTYTKHLKQRRKVYHDGFLDVHRSSNKTRLYDECEKLLECRILKQEEVVCSGETLIFNSYLVDIDTPLGDHKPESGLNFQAGHDKIPEKSGV
        MAE  RW   YTKHLKQ+RKVYHDGFLD+H +  K  LYDE + LLE R LK  EVV +GETL F +YLVDI  P    K  S    +    K   K   
Subjt:  MAEMNRWKVTYTKHLKQRRKVYHDGFLDVHRSSNKTRLYDECEKLLECRILKQEEVVCSGETLIFNSYLVDIDTPLGDHKPESGLNFQAGHDKIPEKSGV

Query:  LRGKNFRNNSV-CFENKASAEKNKTRPTLSPSCKIIREFKKSRLKCYGSPQSSPDTRQTEETG
        +   NF+ +S+ C E K       +  +LSPS  +IR FKK  L  YG+      T+ T   G
Subjt:  LRGKNFRNNSV-CFENKASAEKNKTRPTLSPSCKIIREFKKSRLKCYGSPQSSPDTRQTEETG


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCAGAAATGAATCGATGGAAGGTGACCTATACTAAGCACCTCAAGCAGAGGCGCAAAGTTTATCACGATGGCTTCTTAGACGTCCACCGTTCCAGCAATAAGACCAG
GCTCTATGATGAGTGCGAGAAGCTCCTCGAATGCAGGATCTTAAAGCAAGAAGAAGTAGTTTGCTCTGGCGAAACGCTTATATTCAACAGTTACCTTGTTGACATTGACA
CTCCTCTGGGAGATCATAAGCCGGAGTCTGGTCTGAATTTCCAAGCAGGTCATGATAAGATACCTGAAAAATCGGGTGTGTTGCGAGGGAAGAACTTCAGAAATAACTCT
GTTTGCTTTGAAAATAAAGCTAGTGCAGAGAAGAATAAAACTCGGCCCACTCTCAGCCCTTCATGTAAAATAATCAGAGAATTTAAGAAGAGCAGATTGAAGTGCTATGG
ATCGCCACAAAGTAGTCCAGACACTAGACAAACAGAGGAGACAGGTTGGTTCATGACTTCCTGGGTTGTTAAATCTTCATTATCTTTTCTTCGTGGGGAGGGGGGCAATT
GTGCAATGATGCTGGAAGATGAAAAACTCAAAGCTGACGAGCGAGCAATGGAGCTAGTACACAAGATCGAGGAGCCATGTAAAGTGATCATGTGGATTAAAAAGGTCCTT
TACACCTCAAATATTACTCAGAAAGCTAAGAAGTATCATGATGGTTTCTTGAAACTTCTTATTTGTGGATCCCTCGGGAGGCAGAAAGATGAAACAGTAAAATCTGGAGA
ATCAATAGCCTTTGATGCTCATTTAGTAGATATTGGAGAATGTGAAAGGGAACATAAGCCTCCAAAAATTCCTTTAAGTCAAGGTAGCAGTTTCGGAGACAGGGGAACCA
GGGTACTGCATGAACCGAAAAAATGTTTCAGTGAAAATGAAATATCAACTGGAAAAGAATGGCATGTTTTGTACACAAGTCAGATAACTCAGAAGTCCAAGAAATATCAC
AATGGGATCATCAAAATTTCCTCCTCTGGTTCTCACCATATGCAGGTTACTTTACTGAATGAAGACAGAACTATATTAAGCAGTAAACACCTCAGTTTATCTAAAAAATT
GGGGATGGGGGAGATACTTGAGCTTCCAAAGTACTTGGTGGAGATTGGTGAGGCATGTGGAAATGTTAAAGTAGAGATCGCTAACAGAGATTTTGATATAAGAAAAGACA
CCAGCTTTTGCATTTCTGGTGAAGATGAAAAGGGATCGGACAGAGCGACTATGAAAAAGTCTTTACGGGATGCCCATGAAATATTGTCCATTCTTCAAAGGCCAAAGGCT
AGAGTGAGCCTTTCTTCTGGTCATTCCGATAAAAATATTTGCGTATCAGTTTCGTCATCTAAGGTTCCTGAACCTTCCCTTGCGGCGGAGGCATTGGATCTTCCCATGGA
TGACCGATCTCATAAAAAACCAAGCGAAAATCTGGACACGAGGGATTCAACTAAGAATGCAGAAAACAACCAATCCATTGCTCTAACTCCATCAACATTGACAGAAGAAC
TCGAAATTGGACACTCCAACCAGACAGAGCATGTGGAGGCCGAAAGTAGTTCTCTCAGAGATACAATTTCTTGGACGCAAGGTACAAGTCAATTTGCTGCTTGTGAACTT
GTTAATGATGAAGGCAAAATGTGCGAGGAGGTCACATATGAAAGAGAAACAGGTACATGCCCAAGTTTTGATCTTGGAATTTGA
mRNA sequenceShow/hide mRNA sequence
AATCGCTATTGAACGAGATCAGAAAATAGGCCGTAATTTCTCGATATGAAATCATAGAGCGTGGCTCAGTAGTGTCATTTTTTGTCCGATGGAGAGTTTCCCGCCGGAAG
AAATTCCTCGGCGAAATTCCACTCCGTTCGTTATAATCGCAAGATATGATCTCGGTCGGCGAGTTTGATTTTGATTTGAGAAACTGAGGGAGAAGAGGCTGAGGCAATGG
CAGAAATGAATCGATGGAAGGTGACCTATACTAAGCACCTCAAGCAGAGGCGCAAAGTTTATCACGATGGCTTCTTAGACGTCCACCGTTCCAGCAATAAGACCAGGCTC
TATGATGAGTGCGAGAAGCTCCTCGAATGCAGGATCTTAAAGCAAGAAGAAGTAGTTTGCTCTGGCGAAACGCTTATATTCAACAGTTACCTTGTTGACATTGACACTCC
TCTGGGAGATCATAAGCCGGAGTCTGGTCTGAATTTCCAAGCAGGTCATGATAAGATACCTGAAAAATCGGGTGTGTTGCGAGGGAAGAACTTCAGAAATAACTCTGTTT
GCTTTGAAAATAAAGCTAGTGCAGAGAAGAATAAAACTCGGCCCACTCTCAGCCCTTCATGTAAAATAATCAGAGAATTTAAGAAGAGCAGATTGAAGTGCTATGGATCG
CCACAAAGTAGTCCAGACACTAGACAAACAGAGGAGACAGGTTGGTTCATGACTTCCTGGGTTGTTAAATCTTCATTATCTTTTCTTCGTGGGGAGGGGGGCAATTGTGC
AATGATGCTGGAAGATGAAAAACTCAAAGCTGACGAGCGAGCAATGGAGCTAGTACACAAGATCGAGGAGCCATGTAAAGTGATCATGTGGATTAAAAAGGTCCTTTACA
CCTCAAATATTACTCAGAAAGCTAAGAAGTATCATGATGGTTTCTTGAAACTTCTTATTTGTGGATCCCTCGGGAGGCAGAAAGATGAAACAGTAAAATCTGGAGAATCA
ATAGCCTTTGATGCTCATTTAGTAGATATTGGAGAATGTGAAAGGGAACATAAGCCTCCAAAAATTCCTTTAAGTCAAGGTAGCAGTTTCGGAGACAGGGGAACCAGGGT
ACTGCATGAACCGAAAAAATGTTTCAGTGAAAATGAAATATCAACTGGAAAAGAATGGCATGTTTTGTACACAAGTCAGATAACTCAGAAGTCCAAGAAATATCACAATG
GGATCATCAAAATTTCCTCCTCTGGTTCTCACCATATGCAGGTTACTTTACTGAATGAAGACAGAACTATATTAAGCAGTAAACACCTCAGTTTATCTAAAAAATTGGGG
ATGGGGGAGATACTTGAGCTTCCAAAGTACTTGGTGGAGATTGGTGAGGCATGTGGAAATGTTAAAGTAGAGATCGCTAACAGAGATTTTGATATAAGAAAAGACACCAG
CTTTTGCATTTCTGGTGAAGATGAAAAGGGATCGGACAGAGCGACTATGAAAAAGTCTTTACGGGATGCCCATGAAATATTGTCCATTCTTCAAAGGCCAAAGGCTAGAG
TGAGCCTTTCTTCTGGTCATTCCGATAAAAATATTTGCGTATCAGTTTCGTCATCTAAGGTTCCTGAACCTTCCCTTGCGGCGGAGGCATTGGATCTTCCCATGGATGAC
CGATCTCATAAAAAACCAAGCGAAAATCTGGACACGAGGGATTCAACTAAGAATGCAGAAAACAACCAATCCATTGCTCTAACTCCATCAACATTGACAGAAGAACTCGA
AATTGGACACTCCAACCAGACAGAGCATGTGGAGGCCGAAAGTAGTTCTCTCAGAGATACAATTTCTTGGACGCAAGGTACAAGTCAATTTGCTGCTTGTGAACTTGTTA
ATGATGAAGGCAAAATGTGCGAGGAGGTCACATATGAAAGAGAAACAGGTACATGCCCAAGTTTTGATCTTGGAATTTGAGAAACTACATTTAATTTGTTTTCGCTATAT
TTTGAAAATTTTCGTTTGGGTATTTTAGAGCACCGCACCCTCTGTCTATCAAGTTTCAATTAGTTTGAGCATGCAAATGCATTATTAGGCCTGTAGTTGATGAGTCCAGA
AGTTGTCTCCTTAGGAGTAGCAGAGTTCTTTGTCTTCTGCTCGAGTATCATTTTCCAACATGCGCCTATATTTTCACTATCCAAAGATTGATCTCTCAAGTTTGTTTATG
AACCCATTTGGATTAGGCTGAAAACAAGAGTTCAAACATATTATTGCGCTAGATACTTACTTCCTATATTTTATACATTCTAAATATGGTCAGACTTTACATTAGTTGCT
TTTTTAAATTATAAATACGGAGATACCAAATTGAAAATTTAGAGTTTACCTTCATTTATGAAAATTTTAAAGTTTAGAAATTTAAACACAAGATAGATA
Protein sequenceShow/hide protein sequence
MAEMNRWKVTYTKHLKQRRKVYHDGFLDVHRSSNKTRLYDECEKLLECRILKQEEVVCSGETLIFNSYLVDIDTPLGDHKPESGLNFQAGHDKIPEKSGVLRGKNFRNNS
VCFENKASAEKNKTRPTLSPSCKIIREFKKSRLKCYGSPQSSPDTRQTEETGWFMTSWVVKSSLSFLRGEGGNCAMMLEDEKLKADERAMELVHKIEEPCKVIMWIKKVL
YTSNITQKAKKYHDGFLKLLICGSLGRQKDETVKSGESIAFDAHLVDIGECEREHKPPKIPLSQGSSFGDRGTRVLHEPKKCFSENEISTGKEWHVLYTSQITQKSKKYH
NGIIKISSSGSHHMQVTLLNEDRTILSSKHLSLSKKLGMGEILELPKYLVEIGEACGNVKVEIANRDFDIRKDTSFCISGEDEKGSDRATMKKSLRDAHEILSILQRPKA
RVSLSSGHSDKNICVSVSSSKVPEPSLAAEALDLPMDDRSHKKPSENLDTRDSTKNAENNQSIALTPSTLTEELEIGHSNQTEHVEAESSSLRDTISWTQGTSQFAACEL
VNDEGKMCEEVTYERETGTCPSFDLGI