CuGenDBv2

CSPI05G27200 (gene) of Cucumber (PI 183967) v1 genome

Gene ID: CSPI05G27200
Organism: Cucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
Description: WPP domain-associated protein isoform X2
Genome location: Chr5:25789395..25791143
RNA-Seq Expression: CSPI05G27200
Synteny: CSPI05G27200
Gene Ontology terms: NA
InterPro domains: IPR037490 - WPP domain-associated protein
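
The span length of the locus follows from the genome location coordinates. The sketch below assumes the coordinates are 1-based and inclusive; the page does not state the convention, so this is an assumption. `parse_location` and `span_length` are hypothetical helper names, not part of any CuGenDB tool.

```python
import re

def parse_location(location):
    """Split a 'Chr:start..end' string into (chromosome, start, end)."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    chrom, start, end = match.groups()
    return chrom, int(start), int(end)

def span_length(start, end):
    """Length of an interval, assuming 1-based inclusive coordinates."""
    return end - start + 1

chrom, start, end = parse_location("Chr5:25789395..25791143")
length = span_length(start, end)
print(chrom, length)  # Chr5 1749
```

Under that assumption the locus spans 1,749 bp on Chr5.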


Homology
GenBank top hits | e value | % identity
KAA0034726.1 WPP domain-associated protein isoform X2 [Cucumis melo var. makuwa] | 1.3e-296 | 93.12
Query:  MDGIFGMIDGSFKLSIVDSTMMLIVHRAMDKAHQRVKSREGVIERLHEISKFYELSVMQLDGCIKFVQEETDTHNPETSHEEVLAGLAEIRNRLQRRLYE
        MDGIFGMIDG FKLSIVDSTMMLIVHRAMDKAHQRVKSREGVIERLHEISKFYELSVMQLDGCIKFVQEETDTHNPETSHEEVLAGLAEIRNRLQRRLYE
Subjt:  MDGIFGMIDGSFKLSIVDSTMMLIVHRAMDKAHQRVKSREGVIERLHEISKFYELSVMQLDGCIKFVQEETDTHNPETSHEEVLAGLAEIRNRLQRRLYE

Query:  SELAILQKDRELADRSESEVKLRQALEITERELVSSQEDLELERSRSAGSSNLSPHEGEDDENRDGELGEVKEKLELNDDYEHKVKTKRNRCINDVIRVE
        SELAILQKDRELADRSESEVKLRQALEITERELVSSQEDLELERSRSAGSSNLSPHEGEDDENRDGE GEVKEK E  DDYE KVKTKRNRCINDVIRVE
Subjt:  SELAILQKDRELADRSESEVKLRQALEITERELVSSQEDLELERSRSAGSSNLSPHEGEDDENRDGELGEVKEKLELNDDYEHKVKTKRNRCINDVIRVE

Query:  EMGSDIDILKETLDIAFGKMHSAILISEIGAIEQQVKSSIENDMISILLKGFVNDCQEDIEAEVTRKERQVSANKWWSDLMNEVIGLFEDLKPVLGQNEM
        EMGSDIDILKETLDIAFGKMHSAILISE+GAIEQQVKSSIEND+ISILLKGFV DCQED+EAEVTRKE+QVSANK WSDLMNEVIGLFEDLKPV+GQNEM
Subjt:  EMGSDIDILKETLDIAFGKMHSAILISEIGAIEQQVKSSIENDMISILLKGFVNDCQEDIEAEVTRKERQVSANKWWSDLMNEVIGLFEDLKPVLGQNEM

Query:  RSRECNILNFESIIKKKSKEAEPDQWNPEKLHDKTSLSLRREESTESFKRRFQEILEKLENSMILNATVNKIIDQNEDFNEEDIPPEKGEQIFVENHRQK
        +SRECNIL+FESIIKKKS EAE DQ N E LHDKTSLSLRREES ES KRRFQEILE+LENSMILNATVNK I+QNEDF+EEDIP EKGEQIFVENH+QK
Subjt:  RSRECNILNFESIIKKKSKEAEPDQWNPEKLHDKTSLSLRREESTESFKRRFQEILEKLENSMILNATVNKIIDQNEDFNEEDIPPEKGEQIFVENHRQK

Query:  SDVDTLADVWGKMHQLQDEEKSGIQNQICALRQEREEREFQNIMKEETYIALFQGLREKFCDDLSTWELEILISEGICRDLIRNMFNQLDETMKSNHIEA
        SDVDTLADVWGKMHQLQDEE SGIQNQICALRQERE+REFQNIMKEETYI L QGLREKFCDDLS+WELEILIS+GI RDLIR+MFNQLDETMKSNH EA
Subjt:  SDVDTLADVWGKMHQLQDEEKSGIQNQICALRQEREEREFQNIMKEETYIALFQGLREKFCDDLSTWELEILISEGICRDLIRNMFNQLDETMKSNHIEA

Query:  KIKDDIYHVVFKETMEDYCSINDLGLHRLQECKIKKSSILELHNMELNKSDSKSLKLMELPHITYEFELMANRKLEAIMLR
        KIKDDIYHVVFKETMEDYCSIND GL RLQECKIKKSSILELHNMELNKSDSKSLKLMELPHITY+FELMANRKLEAIMLR
Subjt:  KIKDDIYHVVFKETMEDYCSINDLGLHRLQECKIKKSSILELHNMELNKSDSKSLKLMELPHITYEFELMANRKLEAIMLR

XP_008447019.1 PREDICTED: uncharacterized protein LOC103489567 [Cucumis melo] | 1.3e-296 | 93.12
Query:  MDGIFGMIDGSFKLSIVDSTMMLIVHRAMDKAHQRVKSREGVIERLHEISKFYELSVMQLDGCIKFVQEETDTHNPETSHEEVLAGLAEIRNRLQRRLYE
        MDGIFGMIDG FKLSIVDSTMMLIVHRAMDKAHQRVKSREGVIERLHEISKFYELSVMQLDGCIKFVQEETDTHNPETSHEEVLAGLAEIRNRLQRRLYE
Subjt:  MDGIFGMIDGSFKLSIVDSTMMLIVHRAMDKAHQRVKSREGVIERLHEISKFYELSVMQLDGCIKFVQEETDTHNPETSHEEVLAGLAEIRNRLQRRLYE

Query:  SELAILQKDRELADRSESEVKLRQALEITERELVSSQEDLELERSRSAGSSNLSPHEGEDDENRDGELGEVKEKLELNDDYEHKVKTKRNRCINDVIRVE
        SELAILQKDRELADRSESEVKLRQALEITERELVSSQEDLELERSRSAGSSNLSPHEGEDDENRDGE GEVKEK E  DDYE KVKTKRNRCINDVIRVE
Subjt:  SELAILQKDRELADRSESEVKLRQALEITERELVSSQEDLELERSRSAGSSNLSPHEGEDDENRDGELGEVKEKLELNDDYEHKVKTKRNRCINDVIRVE

Query:  EMGSDIDILKETLDIAFGKMHSAILISEIGAIEQQVKSSIENDMISILLKGFVNDCQEDIEAEVTRKERQVSANKWWSDLMNEVIGLFEDLKPVLGQNEM
        EMGSDIDILKETLDIAFGKMHSAILISE+GAIEQQVKSSIEND+ISILLKGFV DCQED+EAEVTRKE+QVSANK WSDLMNEVIGLFEDLKPV+GQNEM
Subjt:  EMGSDIDILKETLDIAFGKMHSAILISEIGAIEQQVKSSIENDMISILLKGFVNDCQEDIEAEVTRKERQVSANKWWSDLMNEVIGLFEDLKPVLGQNEM

Query:  RSRECNILNFESIIKKKSKEAEPDQWNPEKLHDKTSLSLRREESTESFKRRFQEILEKLENSMILNATVNKIIDQNEDFNEEDIPPEKGEQIFVENHRQK
        +SRECNIL+FESIIKKKS EAE DQ N E LHDKTSLSLRREES ES KRRFQEILE+LENSMILNATVNK I+QNEDF+EEDIP EKGEQIFVENH+QK
Subjt:  RSRECNILNFESIIKKKSKEAEPDQWNPEKLHDKTSLSLRREESTESFKRRFQEILEKLENSMILNATVNKIIDQNEDFNEEDIPPEKGEQIFVENHRQK

Query:  SDVDTLADVWGKMHQLQDEEKSGIQNQICALRQEREEREFQNIMKEETYIALFQGLREKFCDDLSTWELEILISEGICRDLIRNMFNQLDETMKSNHIEA
        SDVDTLADVWGKMHQLQDEE SGIQNQICALRQERE+REFQNIMKEETYI L QGLREKFCDDLS+WELEILIS+GI RDLIR+MFNQLDETMKSNH EA
Subjt:  SDVDTLADVWGKMHQLQDEEKSGIQNQICALRQEREEREFQNIMKEETYIALFQGLREKFCDDLSTWELEILISEGICRDLIRNMFNQLDETMKSNHIEA

Query:  KIKDDIYHVVFKETMEDYCSINDLGLHRLQECKIKKSSILELHNMELNKSDSKSLKLMELPHITYEFELMANRKLEAIMLR
        KIKDDIYHVVFKETMEDYCSIND GL RLQECKIKKSSILELHNMELNKSDSKSLKLMELPHITY+FELMANRKLEAIMLR
Subjt:  KIKDDIYHVVFKETMEDYCSINDLGLHRLQECKIKKSSILELHNMELNKSDSKSLKLMELPHITYEFELMANRKLEAIMLR

XP_022985013.1 uncharacterized protein LOC111483104 [Cucurbita maxima] | 9.7e-05 | 63.04
Query:  KSSILELHNMELNKSDSKSLKLMELPHITYEFELMANRKLEAIMLR
        K  + ELHNM+L KSDSK LKL+E PHI Y+FELMAN+KL  + +R
Subjt:  KSSILELHNMELNKSDSKSLKLMELPHITYEFELMANRKLEAIMLR
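
The % identity value of a hit can be recomputed from its alignment block. The sketch below assumes the usual BLAST pairwise convention for the middle line: a letter marks an identical residue, `+` a conservative substitution, and a space a mismatch or gap. `percent_identity` is a hypothetical helper name, and the example uses the single-block alignment directly above.

```python
def percent_identity(query_segment, midline):
    """Percent identity of one BLAST-style alignment block.

    Letters in the midline mark identical residues; '+' and spaces do not count.
    The query segment length (gap columns included) is used as the alignment
    length, because trailing spaces of the midline are often lost in plain text.
    """
    identical = sum(1 for ch in midline if ch.isalpha())
    return round(100.0 * identical / len(query_segment), 2)

query = "KSSILELHNMELNKSDSKSLKLMELPHITYEFELMANRKLEAIMLR"
midline = "K  + ELHNM+L KSDSK LKL+E PHI Y+FELMAN+KL  + +R"
print(percent_identity(query, midline))  # 63.04
```

Here 29 of the 46 aligned columns are identical, which reproduces the reported 63.04. For a hit whose alignment is wrapped over several blocks, the identical columns and the block lengths would need to be summed across blocks before dividing.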

XP_031741790.1 uncharacterized protein LOC101222640 [Cucumis sativus] | 0.0e+00 | 99.31
Query:  MDGIFGMIDGSFKLSIVDSTMMLIVHRAMDKAHQRVKSREGVIERLHEISKFYELSVMQLDGCIKFVQEETDTHNPETSHEEVLAGLAEIRNRLQRRLYE
        MDGIFGMIDGSFKLSIVDSTMMLIVHRAMDKAHQRVKSREGVIERLHEISKFYELSVMQLDGCIKFVQEETDTHNPETSHEEVLAGLAEIRNRLQRRLYE
Subjt:  MDGIFGMIDGSFKLSIVDSTMMLIVHRAMDKAHQRVKSREGVIERLHEISKFYELSVMQLDGCIKFVQEETDTHNPETSHEEVLAGLAEIRNRLQRRLYE

Query:  SELAILQKDRELADRSESEVKLRQALEITERELVSSQEDLELERSRSAGSSNLSPHEGEDDENRDGELGEVKEKLELNDDYEHKVKTKRNRCINDVIRVE
        SELAILQKDRELADRSESEVKLRQALEITERELVSSQEDLELERSRSAGSSNLSPHEGEDDENRDGELGEVKEK+ELNDDYEHKVKTKRNRCINDVIRVE
Subjt:  SELAILQKDRELADRSESEVKLRQALEITERELVSSQEDLELERSRSAGSSNLSPHEGEDDENRDGELGEVKEKLELNDDYEHKVKTKRNRCINDVIRVE

Query:  EMGSDIDILKETLDIAFGKMHSAILISEIGAIEQQVKSSIENDMISILLKGFVNDCQEDIEAEVTRKERQVSANKWWSDLMNEVIGLFEDLKPVLGQNEM
        EMGSDIDILKETLDIAFGKMHSAILISEIGAIEQQVKSSIEND+ISILLKGFVNDCQEDIEAEVTRKERQVSANKWWSDLMNEVIGLFEDLKPVLGQNEM
Subjt:  EMGSDIDILKETLDIAFGKMHSAILISEIGAIEQQVKSSIENDMISILLKGFVNDCQEDIEAEVTRKERQVSANKWWSDLMNEVIGLFEDLKPVLGQNEM

Query:  RSRECNILNFESIIKKKSKEAEPDQWNPEKLHDKTSLSLRREESTESFKRRFQEILEKLENSMILNATVNKIIDQNEDFNEEDIPPEKGEQIFVENHRQK
        RSRECNILNFESIIKKKSKEAEPDQWNPEKLHDKTSLSLRREESTESFKRRFQEILEKLENSMILNATVNKIIDQNEDFNEEDIPPEKGEQIFVENHRQK
Subjt:  RSRECNILNFESIIKKKSKEAEPDQWNPEKLHDKTSLSLRREESTESFKRRFQEILEKLENSMILNATVNKIIDQNEDFNEEDIPPEKGEQIFVENHRQK

Query:  SDVDTLADVWGKMHQLQDEEKSGIQNQICALRQEREEREFQNIMKEETYIALFQGLREKFCDDLSTWELEILISEGICRDLIRNMFNQLDETMKSNHIEA
        SDVDTLADVWGKMHQLQDEE SGIQNQICALRQEREEREFQNIMKEETYIALFQGLREKFCDDLSTWELEILIS+GICRDLIRNMFNQLDETMKSNHIEA
Subjt:  SDVDTLADVWGKMHQLQDEEKSGIQNQICALRQEREEREFQNIMKEETYIALFQGLREKFCDDLSTWELEILISEGICRDLIRNMFNQLDETMKSNHIEA

Query:  KIKDDIYHVVFKETMEDYCSINDLGLHRLQECKIKKSSILELHNMELNKSDSKSLKLMELPHITYEFELMANRKLEAIMLRY
        KIKDDIYHVVFKETMEDYCSINDLGLHRLQECKIKKSSILELHNMELNKSDSKSLKLMELPHITYEFELMANRKLEAIMLRY
Subjt:  KIKDDIYHVVFKETMEDYCSINDLGLHRLQECKIKKSSILELHNMELNKSDSKSLKLMELPHITYEFELMANRKLEAIMLRY

XP_038891653.1 uncharacterized protein LOC120081046 [Benincasa hispida] | 4.5e-212 | 72.06
Query:  MDGIFGMIDGSFKLSIVDSTMMLIVHRAMDKAHQRVKSREGVIERLHEISKFYELSVMQLDGCIKFVQEETDTHNPETSHEEVLAGLAEIRNRLQRRLYE
        MDGIFG+ID  FK+SIVDSTMM IVHRAMDKAH+RVKSREGVIERLHEISKFYELSVMQLDGCIKFVQEETDT NPE+SHEEVLAGLAEIRNRLQRRLYE
Subjt:  MDGIFGMIDGSFKLSIVDSTMMLIVHRAMDKAHQRVKSREGVIERLHEISKFYELSVMQLDGCIKFVQEETDTHNPETSHEEVLAGLAEIRNRLQRRLYE

Query:  SELAILQKDRELADRSESEVKLRQALEITERELVSSQEDLELERSRSAGSSNLSPHEGEDDENRDGELGE-----------VKEKLELNDDYEHKVKTKR
        SELAILQKDRELADR ESEVKLRQALE TERELVSSQEDLELERSRSAGSSNLSPHEGEDDE+RDGE GE           +KEKLE  DD E KVK +R
Subjt:  SELAILQKDRELADRSESEVKLRQALEITERELVSSQEDLELERSRSAGSSNLSPHEGEDDENRDGELGE-----------VKEKLELNDDYEHKVKTKR

Query:  NRCINDVIRVEEMGSDIDILKETLDIAFGKMHSAILISEIGAIEQQVKSSIENDMISILLKGFVNDCQEDIEAEVTRKERQVSA--NKWWSDLMNEVIGL
        N CINDV RVEEMGSDIDILKETLDIAFGKM SAI ISE+G IEQQVKSSIEND+ISI LKGF  DCQED+EAE TRKE++VS   N  WSDLMNEV GL
Subjt:  NRCINDVIRVEEMGSDIDILKETLDIAFGKMHSAILISEIGAIEQQVKSSIENDMISILLKGFVNDCQEDIEAEVTRKERQVSA--NKWWSDLMNEVIGL

Query:  FEDLKPVLGQNEMRSRE---CNILNF------------------------------------ESIIKKKSKEAEPDQWNPEKLHDKTSLSLRREESTESF
         EDLKP++GQNEM+ ++   CNIL+F                                    ESIIKK+S+EA+  Q  PE L +KTSLS RREES E  
Subjt:  FEDLKPVLGQNEMRSRE---CNILNF------------------------------------ESIIKKKSKEAEPDQWNPEKLHDKTSLSLRREESTESF

Query:  KRRFQEILEKLENSMILNATVNKIIDQNEDFNEEDIPPEKGEQIFVENHRQKSDVDTLADVWGKMHQLQDEEKSGIQNQICALRQEREEREFQNIMKEET
        K RFQE+   LEN MI  A VNKI+ QN +FNEEDIP EK EQ+F ENHRQKSDVD+LADVWGKMHQLQDEE  GIQNQIC LRQERE+ EFQNIM EE 
Subjt:  KRRFQEILEKLENSMILNATVNKIIDQNEDFNEEDIPPEKGEQIFVENHRQKSDVDTLADVWGKMHQLQDEEKSGIQNQICALRQEREEREFQNIMKEET

Query:  YIALFQGLREKFCDDLSTWELEILISEGICRDLIRNMFNQLDETMKSNHIEAKIKDDIYHVVFKETMEDYCSINDLGLHRLQECKIK
        YI LFQGLREKFC+DL+  E EILI++GICRD+IRN FNQLD+TM+S  IE +IKDD+YHVVFKE M+DY    D  L RL+ECKI+
Subjt:  YIALFQGLREKFCDDLSTWELEILISEGICRDLIRNMFNQLDETMKSNHIEAKIKDDIYHVVFKETMEDYCSINDLGLHRLQECKIK

XP_038891653.1 uncharacterized protein LOC120081046 [Benincasa hispida] | 4.7e-07 | 71.43
Query:  KKSSILELHNM-ELNKSDSKSLKLMELPHITYEFELMANRKLEAIMLRY
        +K S+LELH+M +LNKSDS  LKL ELPHI Y+FEL+ NRKLE+IMLRY
Subjt:  KKSSILELHNM-ELNKSDSKSLKLMELPHITYEFELMANRKLEAIMLRY

XP_038891653.1 uncharacterized protein LOC120081046 [Benincasa hispida] | 4.9e-190 | 68.61
Query:  MDGIFGMIDGSFKLSIVDSTMMLIVHRAMDKAHQRVKSREGVIERLHEISKFYELSVMQLDGCIKFVQEETDTHNPETSHEEVLAGLAEIRNRLQRRLYE
        MDGIFG+ID +FK+SIVDSTMM IVHRAMDKAH+RVKS EGVIERLHEISKFYELSVMQLDGCIKFVQEETD+HNPE+SHEEVLAGLAEIRNRLQRRLYE
Subjt:  MDGIFGMIDGSFKLSIVDSTMMLIVHRAMDKAHQRVKSREGVIERLHEISKFYELSVMQLDGCIKFVQEETDTHNPETSHEEVLAGLAEIRNRLQRRLYE

Query:  SELAILQKDRELADRSESEVKLRQALEITERELVSSQEDLELERSRSAGSSNLSPHEGEDDENRDGELGEVK-----------EKLELNDDYEHKVKTKR
        SELAILQKDRELADR  SE KLRQALE TE+ELVSSQEDLE  RSRSAGSSNLSPHEGEDD NRDGE  E+K           EKLE  DDYE KVK +R
Subjt:  SELAILQKDRELADRSESEVKLRQALEITERELVSSQEDLELERSRSAGSSNLSPHEGEDDENRDGELGEVK-----------EKLELNDDYEHKVKTKR

Query:  NRCINDVIRVEEMGSDIDILKETLDIAFGKMHSAILISEIGAIEQQVKSSIENDMISILLKGFVNDCQEDIEAEVTRKERQVSA--NKWWSDLMNEVIGL
        N CINDV +VEEMGSDIDILKETLDIAFGKM SAI  S++G IEQQVKSSIEND+IS+ L GFV DCQED+EAE  +KE QVS   N+ WS LMNE IGL
Subjt:  NRCINDVIRVEEMGSDIDILKETLDIAFGKMHSAILISEIGAIEQQVKSSIENDMISILLKGFVNDCQEDIEAEVTRKERQVSA--NKWWSDLMNEVIGL

Query:  FEDLKPVLGQNEMRSRECNILNFESIIKKKSKE----AEPDQWNPEKLHDKTSL-------------SLRREESTESFKRRFQEILEKLENSMILNATVN
         E+LKP++ QNE++ ++    +F+  +  +  E     + ++   +  HD   +              + REES ES K RFQE+LEKLEN  ILNA +N
Subjt:  FEDLKPVLGQNEMRSRECNILNFESIIKKKSKE----AEPDQWNPEKLHDKTSL-------------SLRREESTESFKRRFQEILEKLENSMILNATVN

Query:  KIIDQNEDFNEEDIPPEKGEQIFVENHRQKSDVDTLADVWGKMHQLQDEEKSGIQNQICALRQEREEREFQNIMKEETYIALFQGLREKFCDDLSTWELE
        KI+ QN DF+EEDIPPE G+QIF ENHRQKSDV TLAD+WGKMHQL++EE  GIQNQIC    +RE+ +FQNIM EE Y  LF+GLREKFC+DLS WELE
Subjt:  KIIDQNEDFNEEDIPPEKGEQIFVENHRQKSDVDTLADVWGKMHQLQDEEKSGIQNQICALRQEREEREFQNIMKEETYIALFQGLREKFCDDLSTWELE

Query:  ILISEGICRDLIRNMFNQLDETMKSNHIEAKIKDDIYHVVFKETMEDY
        ILIS+GICR  IR+MF+QLDETM+S  IEA+IKDDIYH+ F E M+ Y
Subjt:  ILISEGICRDLIRNMFNQLDETMKSNHIEAKIKDDIYHVVFKETMEDY

TrEMBL top hits | e value | % identity
A0A0A0KWT8 Uncharacterized protein | 0.0e+00 | 99.31
Query:  MDGIFGMIDGSFKLSIVDSTMMLIVHRAMDKAHQRVKSREGVIERLHEISKFYELSVMQLDGCIKFVQEETDTHNPETSHEEVLAGLAEIRNRLQRRLYE
        MDGIFGMIDGSFKLSIVDSTMMLIVHRAMDKAHQRVKSREGVIERLHEISKFYELSVMQLDGCIKFVQEETDTHNPETSHEEVLAGLAEIRNRLQRRLYE
Subjt:  MDGIFGMIDGSFKLSIVDSTMMLIVHRAMDKAHQRVKSREGVIERLHEISKFYELSVMQLDGCIKFVQEETDTHNPETSHEEVLAGLAEIRNRLQRRLYE

Query:  SELAILQKDRELADRSESEVKLRQALEITERELVSSQEDLELERSRSAGSSNLSPHEGEDDENRDGELGEVKEKLELNDDYEHKVKTKRNRCINDVIRVE
        SELAILQKDRELADRSESEVKLRQALEITERELVSSQEDLELERSRSAGSSNLSPHEGEDDENRDGELGEVKEK+ELNDDYEHKVKTKRNRCINDVIRVE
Subjt:  SELAILQKDRELADRSESEVKLRQALEITERELVSSQEDLELERSRSAGSSNLSPHEGEDDENRDGELGEVKEKLELNDDYEHKVKTKRNRCINDVIRVE

Query:  EMGSDIDILKETLDIAFGKMHSAILISEIGAIEQQVKSSIENDMISILLKGFVNDCQEDIEAEVTRKERQVSANKWWSDLMNEVIGLFEDLKPVLGQNEM
        EMGSDIDILKETLDIAFGKMHSAILISEIGAIEQQVKSSIEND+ISILLKGFVNDCQEDIEAEVTRKERQVSANKWWSDLMNEVIGLFEDLKPVLGQNEM
Subjt:  EMGSDIDILKETLDIAFGKMHSAILISEIGAIEQQVKSSIENDMISILLKGFVNDCQEDIEAEVTRKERQVSANKWWSDLMNEVIGLFEDLKPVLGQNEM

Query:  RSRECNILNFESIIKKKSKEAEPDQWNPEKLHDKTSLSLRREESTESFKRRFQEILEKLENSMILNATVNKIIDQNEDFNEEDIPPEKGEQIFVENHRQK
        RSRECNILNFESIIKKKSKEAEPDQWNPEKLHDKTSLSLRREESTESFKRRFQEILEKLENSMILNATVNKIIDQNEDFNEEDIPPEKGEQIFVENHRQK
Subjt:  RSRECNILNFESIIKKKSKEAEPDQWNPEKLHDKTSLSLRREESTESFKRRFQEILEKLENSMILNATVNKIIDQNEDFNEEDIPPEKGEQIFVENHRQK

Query:  SDVDTLADVWGKMHQLQDEEKSGIQNQICALRQEREEREFQNIMKEETYIALFQGLREKFCDDLSTWELEILISEGICRDLIRNMFNQLDETMKSNHIEA
        SDVDTLADVWGKMHQLQDEE SGIQNQICALRQEREEREFQNIMKEETYIALFQGLREKFCDDLSTWELEILIS+GICRDLIRNMFNQLDETMKSNHIEA
Subjt:  SDVDTLADVWGKMHQLQDEEKSGIQNQICALRQEREEREFQNIMKEETYIALFQGLREKFCDDLSTWELEILISEGICRDLIRNMFNQLDETMKSNHIEA

Query:  KIKDDIYHVVFKETMEDYCSINDLGLHRLQECKIKKSSILELHNMELNKSDSKSLKLMELPHITYEFELMANRKLEAIMLRY
        KIKDDIYHVVFKETMEDYCSINDLGLHRLQECKIKKSSILELHNMELNKSDSKSLKLMELPHITYEFELMANRKLEAIMLRY
Subjt:  KIKDDIYHVVFKETMEDYCSINDLGLHRLQECKIKKSSILELHNMELNKSDSKSLKLMELPHITYEFELMANRKLEAIMLRY

A0A1S3BGE6 uncharacterized protein LOC103489567 | 6.5e-297 | 93.12
Query:  MDGIFGMIDGSFKLSIVDSTMMLIVHRAMDKAHQRVKSREGVIERLHEISKFYELSVMQLDGCIKFVQEETDTHNPETSHEEVLAGLAEIRNRLQRRLYE
        MDGIFGMIDG FKLSIVDSTMMLIVHRAMDKAHQRVKSREGVIERLHEISKFYELSVMQLDGCIKFVQEETDTHNPETSHEEVLAGLAEIRNRLQRRLYE
Subjt:  MDGIFGMIDGSFKLSIVDSTMMLIVHRAMDKAHQRVKSREGVIERLHEISKFYELSVMQLDGCIKFVQEETDTHNPETSHEEVLAGLAEIRNRLQRRLYE

Query:  SELAILQKDRELADRSESEVKLRQALEITERELVSSQEDLELERSRSAGSSNLSPHEGEDDENRDGELGEVKEKLELNDDYEHKVKTKRNRCINDVIRVE
        SELAILQKDRELADRSESEVKLRQALEITERELVSSQEDLELERSRSAGSSNLSPHEGEDDENRDGE GEVKEK E  DDYE KVKTKRNRCINDVIRVE
Subjt:  SELAILQKDRELADRSESEVKLRQALEITERELVSSQEDLELERSRSAGSSNLSPHEGEDDENRDGELGEVKEKLELNDDYEHKVKTKRNRCINDVIRVE

Query:  EMGSDIDILKETLDIAFGKMHSAILISEIGAIEQQVKSSIENDMISILLKGFVNDCQEDIEAEVTRKERQVSANKWWSDLMNEVIGLFEDLKPVLGQNEM
        EMGSDIDILKETLDIAFGKMHSAILISE+GAIEQQVKSSIEND+ISILLKGFV DCQED+EAEVTRKE+QVSANK WSDLMNEVIGLFEDLKPV+GQNEM
Subjt:  EMGSDIDILKETLDIAFGKMHSAILISEIGAIEQQVKSSIENDMISILLKGFVNDCQEDIEAEVTRKERQVSANKWWSDLMNEVIGLFEDLKPVLGQNEM

Query:  RSRECNILNFESIIKKKSKEAEPDQWNPEKLHDKTSLSLRREESTESFKRRFQEILEKLENSMILNATVNKIIDQNEDFNEEDIPPEKGEQIFVENHRQK
        +SRECNIL+FESIIKKKS EAE DQ N E LHDKTSLSLRREES ES KRRFQEILE+LENSMILNATVNK I+QNEDF+EEDIP EKGEQIFVENH+QK
Subjt:  RSRECNILNFESIIKKKSKEAEPDQWNPEKLHDKTSLSLRREESTESFKRRFQEILEKLENSMILNATVNKIIDQNEDFNEEDIPPEKGEQIFVENHRQK

Query:  SDVDTLADVWGKMHQLQDEEKSGIQNQICALRQEREEREFQNIMKEETYIALFQGLREKFCDDLSTWELEILISEGICRDLIRNMFNQLDETMKSNHIEA
        SDVDTLADVWGKMHQLQDEE SGIQNQICALRQERE+REFQNIMKEETYI L QGLREKFCDDLS+WELEILIS+GI RDLIR+MFNQLDETMKSNH EA
Subjt:  SDVDTLADVWGKMHQLQDEEKSGIQNQICALRQEREEREFQNIMKEETYIALFQGLREKFCDDLSTWELEILISEGICRDLIRNMFNQLDETMKSNHIEA

Query:  KIKDDIYHVVFKETMEDYCSINDLGLHRLQECKIKKSSILELHNMELNKSDSKSLKLMELPHITYEFELMANRKLEAIMLR
        KIKDDIYHVVFKETMEDYCSIND GL RLQECKIKKSSILELHNMELNKSDSKSLKLMELPHITY+FELMANRKLEAIMLR
Subjt:  KIKDDIYHVVFKETMEDYCSINDLGLHRLQECKIKKSSILELHNMELNKSDSKSLKLMELPHITYEFELMANRKLEAIMLR

A0A5D3CG51 WPP domain-associated protein isoform X2 | 6.5e-297 | 93.12
Query:  MDGIFGMIDGSFKLSIVDSTMMLIVHRAMDKAHQRVKSREGVIERLHEISKFYELSVMQLDGCIKFVQEETDTHNPETSHEEVLAGLAEIRNRLQRRLYE
        MDGIFGMIDG FKLSIVDSTMMLIVHRAMDKAHQRVKSREGVIERLHEISKFYELSVMQLDGCIKFVQEETDTHNPETSHEEVLAGLAEIRNRLQRRLYE
Subjt:  MDGIFGMIDGSFKLSIVDSTMMLIVHRAMDKAHQRVKSREGVIERLHEISKFYELSVMQLDGCIKFVQEETDTHNPETSHEEVLAGLAEIRNRLQRRLYE

Query:  SELAILQKDRELADRSESEVKLRQALEITERELVSSQEDLELERSRSAGSSNLSPHEGEDDENRDGELGEVKEKLELNDDYEHKVKTKRNRCINDVIRVE
        SELAILQKDRELADRSESEVKLRQALEITERELVSSQEDLELERSRSAGSSNLSPHEGEDDENRDGE GEVKEK E  DDYE KVKTKRNRCINDVIRVE
Subjt:  SELAILQKDRELADRSESEVKLRQALEITERELVSSQEDLELERSRSAGSSNLSPHEGEDDENRDGELGEVKEKLELNDDYEHKVKTKRNRCINDVIRVE

Query:  EMGSDIDILKETLDIAFGKMHSAILISEIGAIEQQVKSSIENDMISILLKGFVNDCQEDIEAEVTRKERQVSANKWWSDLMNEVIGLFEDLKPVLGQNEM
        EMGSDIDILKETLDIAFGKMHSAILISE+GAIEQQVKSSIEND+ISILLKGFV DCQED+EAEVTRKE+QVSANK WSDLMNEVIGLFEDLKPV+GQNEM
Subjt:  EMGSDIDILKETLDIAFGKMHSAILISEIGAIEQQVKSSIENDMISILLKGFVNDCQEDIEAEVTRKERQVSANKWWSDLMNEVIGLFEDLKPVLGQNEM

Query:  RSRECNILNFESIIKKKSKEAEPDQWNPEKLHDKTSLSLRREESTESFKRRFQEILEKLENSMILNATVNKIIDQNEDFNEEDIPPEKGEQIFVENHRQK
        +SRECNIL+FESIIKKKS EAE DQ N E LHDKTSLSLRREES ES KRRFQEILE+LENSMILNATVNK I+QNEDF+EEDIP EKGEQIFVENH+QK
Subjt:  RSRECNILNFESIIKKKSKEAEPDQWNPEKLHDKTSLSLRREESTESFKRRFQEILEKLENSMILNATVNKIIDQNEDFNEEDIPPEKGEQIFVENHRQK

Query:  SDVDTLADVWGKMHQLQDEEKSGIQNQICALRQEREEREFQNIMKEETYIALFQGLREKFCDDLSTWELEILISEGICRDLIRNMFNQLDETMKSNHIEA
        SDVDTLADVWGKMHQLQDEE SGIQNQICALRQERE+REFQNIMKEETYI L QGLREKFCDDLS+WELEILIS+GI RDLIR+MFNQLDETMKSNH EA
Subjt:  SDVDTLADVWGKMHQLQDEEKSGIQNQICALRQEREEREFQNIMKEETYIALFQGLREKFCDDLSTWELEILISEGICRDLIRNMFNQLDETMKSNHIEA

Query:  KIKDDIYHVVFKETMEDYCSINDLGLHRLQECKIKKSSILELHNMELNKSDSKSLKLMELPHITYEFELMANRKLEAIMLR
        KIKDDIYHVVFKETMEDYCSIND GL RLQECKIKKSSILELHNMELNKSDSKSLKLMELPHITY+FELMANRKLEAIMLR
Subjt:  KIKDDIYHVVFKETMEDYCSINDLGLHRLQECKIKKSSILELHNMELNKSDSKSLKLMELPHITYEFELMANRKLEAIMLR

A0A6J1GZ55 uncharacterized protein LOC111458475 | 2.1e-05 | 65.22
Query:  KSSILELHNMELNKSDSKSLKLMELPHITYEFELMANRKLEAIMLR
        K  + ELHNM+L+KSDSK LKL+E PHI Y+FELMAN+KL  + LR
Subjt:  KSSILELHNMELNKSDSKSLKLMELPHITYEFELMANRKLEAIMLR

A0A6J1JCB6 uncharacterized protein LOC111483104 | 2.4e-190 | 68.61
Query:  MDGIFGMIDGSFKLSIVDSTMMLIVHRAMDKAHQRVKSREGVIERLHEISKFYELSVMQLDGCIKFVQEETDTHNPETSHEEVLAGLAEIRNRLQRRLYE
        MDGIFG+ID +FK+SIVDSTMM IVHRAMDKAH+RVKS EGVIERLHEISKFYELSVMQLDGCIKFVQEETD+HNPE+SHEEVLAGLAEIRNRLQRRLYE
Subjt:  MDGIFGMIDGSFKLSIVDSTMMLIVHRAMDKAHQRVKSREGVIERLHEISKFYELSVMQLDGCIKFVQEETDTHNPETSHEEVLAGLAEIRNRLQRRLYE

Query:  SELAILQKDRELADRSESEVKLRQALEITERELVSSQEDLELERSRSAGSSNLSPHEGEDDENRDGELGEVK-----------EKLELNDDYEHKVKTKR
        SELAILQKDRELADR  SE KLRQALE TE+ELVSSQEDLE  RSRSAGSSNLSPHEGEDD NRDGE  E+K           EKLE  DDYE KVK +R
Subjt:  SELAILQKDRELADRSESEVKLRQALEITERELVSSQEDLELERSRSAGSSNLSPHEGEDDENRDGELGEVK-----------EKLELNDDYEHKVKTKR

Query:  NRCINDVIRVEEMGSDIDILKETLDIAFGKMHSAILISEIGAIEQQVKSSIENDMISILLKGFVNDCQEDIEAEVTRKERQVSA--NKWWSDLMNEVIGL
        N CINDV +VEEMGSDIDILKETLDIAFGKM SAI  S++G IEQQVKSSIEND+IS+ L GFV DCQED+EAE  +KE QVS   N+ WS LMNE IGL
Subjt:  NRCINDVIRVEEMGSDIDILKETLDIAFGKMHSAILISEIGAIEQQVKSSIENDMISILLKGFVNDCQEDIEAEVTRKERQVSA--NKWWSDLMNEVIGL

Query:  FEDLKPVLGQNEMRSRECNILNFESIIKKKSKE----AEPDQWNPEKLHDKTSL-------------SLRREESTESFKRRFQEILEKLENSMILNATVN
         E+LKP++ QNE++ ++    +F+  +  +  E     + ++   +  HD   +              + REES ES K RFQE+LEKLEN  ILNA +N
Subjt:  FEDLKPVLGQNEMRSRECNILNFESIIKKKSKE----AEPDQWNPEKLHDKTSL-------------SLRREESTESFKRRFQEILEKLENSMILNATVN

Query:  KIIDQNEDFNEEDIPPEKGEQIFVENHRQKSDVDTLADVWGKMHQLQDEEKSGIQNQICALRQEREEREFQNIMKEETYIALFQGLREKFCDDLSTWELE
        KI+ QN DF+EEDIPPE G+QIF ENHRQKSDV TLAD+WGKMHQL++EE  GIQNQIC    +RE+ +FQNIM EE Y  LF+GLREKFC+DLS WELE
Subjt:  KIIDQNEDFNEEDIPPEKGEQIFVENHRQKSDVDTLADVWGKMHQLQDEEKSGIQNQICALRQEREEREFQNIMKEETYIALFQGLREKFCDDLSTWELE

Query:  ILISEGICRDLIRNMFNQLDETMKSNHIEAKIKDDIYHVVFKETMEDY
        ILIS+GICR  IR+MF+QLDETM+S  IEA+IKDDIYH+ F E M+ Y
Subjt:  ILISEGICRDLIRNMFNQLDETMKSNHIEAKIKDDIYHVVFKETMEDY

A0A6J1JCB6 uncharacterized protein LOC111483104 | 4.7e-05 | 63.04
Query:  KSSILELHNMELNKSDSKSLKLMELPHITYEFELMANRKLEAIMLR
        K  + ELHNM+L KSDSK LKL+E PHI Y+FELMAN+KL  + +R
Subjt:  KSSILELHNMELNKSDSKSLKLMELPHITYEFELMANRKLEAIMLR

A0A6J1JCB6 uncharacterized protein LOC111483104 | 3.2e-187 | 68.2
Query:  MDGIFGMIDGSFKLSIVDSTMMLIVHRAMDKAHQRVKSREGVIERLHEISKFYELSVMQLDGCIKFVQEETDTHNPETSHEEVLAGLAEIRNRLQRRLYE
        MDGIFG+ID +FK+SIVDSTMM IVHRAMDKAH+RVKS EGVIERLHEISKFYELSVMQLDGCI FVQEETD+HNPE+SHEEVLAGLAEIRNRLQRRLYE
Subjt:  MDGIFGMIDGSFKLSIVDSTMMLIVHRAMDKAHQRVKSREGVIERLHEISKFYELSVMQLDGCIKFVQEETDTHNPETSHEEVLAGLAEIRNRLQRRLYE

Query:  SELAILQKDRELADRSESEVKLRQALEITERELVSSQEDLELERSRSAGSSNLSPHEGEDDENRDGELGEVK-----------EKLELNDDYEHKVKTKR
        SELAILQKDRELADR  SE KLRQALE TE+ELVSSQEDLE  RSRSAGSSNLSPHEGEDD NRDGE  E+K           EKLE  DDY  KVK +R
Subjt:  SELAILQKDRELADRSESEVKLRQALEITERELVSSQEDLELERSRSAGSSNLSPHEGEDDENRDGELGEVK-----------EKLELNDDYEHKVKTKR

Query:  NRCINDVIRVEEMGSDIDILKETLDIAFGKMHSAILISEIGAIEQQVKSSIENDMISILLKGFVNDCQEDIEAEVTRKERQVSA--NKWWSDLMNEVIGL
        N CIND ++VEEMGSDIDILKETLDIAFGKM SAI  S++G IEQQVKSSIEND+IS+ L GFV DCQED+EAE  RKE QVS   N+ WS LMNE IGL
Subjt:  NRCINDVIRVEEMGSDIDILKETLDIAFGKMHSAILISEIGAIEQQVKSSIENDMISILLKGFVNDCQEDIEAEVTRKERQVSA--NKWWSDLMNEVIGL

Query:  FEDLKPVLGQNEMRSRECNILNFESIIKKKSKEAEPDQWNPEKLHDKTSL-------------SLRREESTESFKRRFQEILEKLENSMILNATVNKIID
         E LKP++ QNE++ ++ ++   +    +     + ++   E  HD   +              + REES ES K RF+E+LEKLEN  ILNA +NKI+ 
Subjt:  FEDLKPVLGQNEMRSRECNILNFESIIKKKSKEAEPDQWNPEKLHDKTSL-------------SLRREESTESFKRRFQEILEKLENSMILNATVNKIID

Query:  QNEDFNEEDIPPEKGEQIFVENHRQKSDVDTLADVWGKMHQLQDEEKSGIQNQICALRQEREEREFQNIMKEETYIALFQGLREKFCDDLSTWELEILIS
        QN DF+EEDIPPE GEQI  ENHRQKSDV TLAD+WGKMH+L++EE  GIQNQIC L  +RE+ +FQNI+ EE Y  LF+GLREKFC+DLS WELE LIS
Subjt:  QNEDFNEEDIPPEKGEQIFVENHRQKSDVDTLADVWGKMHQLQDEEKSGIQNQICALRQEREEREFQNIMKEETYIALFQGLREKFCDDLSTWELEILIS

Query:  EGICRDLIRNMFNQLDETMKSNHIEAKIKDDIYHVVFKETMEDY
        +GICR  IR+MFNQLDETM+S  IEA+IKDDIYH+ F E M+ Y
Subjt:  EGICRDLIRNMFNQLDETMKSNHIEAKIKDDIYHVVFKETMEDY

SwissProt top hits | e value | % identity
No hits found

Arabidopsis top hits | e value | % identity
AT5G14990.1 BEST Arabidopsis thaliana protein match is: myosin heavy chain-related (TAIR:AT2G34730.1) | 5.3e-49 | 32.35
Query:  MDGIFGMIDGSFKLSIVDSTMMLIVHRAMDKAHQRVKSREGVIERLHEISKFYELSVMQLDGCIKFVQEETDTHNPETSHEEVLAGLAEIRNRLQRRLYE
        M  I   ++G  K S+ DSTMML+V +AMDKAH+++K++ G++ RL+ IS FYEL+V+QL+ C+ FV +ETD    E++HEEV+  L EI++RL  RL E
Subjt:  MDGIFGMIDGSFKLSIVDSTMMLIVHRAMDKAHQRVKSREGVIERLHEISKFYELSVMQLDGCIKFVQEETDTHNPETSHEEVLAGLAEIRNRLQRRLYE

Query:  SELAILQKDRELADRSESEVKLRQALEITERELVSSQEDLELERSRSAGSSNLSPHE-GEDDENRDGELGEVKEKLELNDDYEHKVKTKRNRCINDVIRV
        +E+AIL+KDR+L + SE++  LR  LE  E ELV  Q DLE +R  S     +   E  E   + D ++  +++KLE   D E + +T+           
Subjt:  SELAILQKDRELADRSESEVKLRQALEITERELVSSQEDLELERSRSAGSSNLSPHE-GEDDENRDGELGEVKEKLELNDDYEHKVKTKRNRCINDVIRV

Query:  EEMGSDIDILKETLDIAFGKMHSAILISEIGAIEQQVKSSIENDMISILLKGFVNDCQEDIEA--------EVTRKERQVSANKWWSDLMNE--------
        +    DID+LK T+D+AF KMH AI +SE+G IEQ  + SIE D +++L+KGF+N  +E +E         E   K+R  S  +    L ++        
Subjt:  EEMGSDIDILKETLDIAFGKMHSAILISEIGAIEQQVKSSIENDMISILLKGFVNDCQEDIEA--------EVTRKERQVSANKWWSDLMNE--------

Query:  ------------VIGLFEDLKPVLGQN----EMRSRECNILNF---------ESIIKKKSKEAEPDQWNPEKLHDKTSLSLRREEST--ESFKRRFQEIL
                     I     +   +G +    E R  E +  NF         ESII++KS+E  P +            S++R++S    S KR   +I+
Subjt:  ------------VIGLFEDLKPVLGQN----EMRSRECNILNF---------ESIIKKKSKEAEPDQWNPEKLHDKTSLSLRREEST--ESFKRRFQEIL

Query:  EKLENSMILNATVNKIIDQNEDFNEEDIPPEKGEQIFVENHRQKSDVDTLADVWGKMHQLQDEEKSGIQNQICALRQEREEREFQNIMKEETYIALFQGL
          L++ M LN  + + +  ++D +              E+H +    D L DVW KM   Q        N I    +E+E+ E + ++ E+TY+ L +GL
Subjt:  EKLENSMILNATVNKIIDQNEDFNEEDIPPEKGEQIFVENHRQKSDVDTLADVWGKMHQLQDEEKSGIQNQICALRQEREEREFQNIMKEETYIALFQGL

Query:  R-------EKFCDDLSTWELEILISEGICRDLIRNMFNQLD
        +        K  ++    + E + SE  C D + N+  + D
Subjt:  R-------EKFCDDLSTWELEILISEGICRDLIRNMFNQLD

AT5G14990.2 unknown protein | 5.3e-49 | 32.35
Query:  MDGIFGMIDGSFKLSIVDSTMMLIVHRAMDKAHQRVKSREGVIERLHEISKFYELSVMQLDGCIKFVQEETDTHNPETSHEEVLAGLAEIRNRLQRRLYE
        M  I   ++G  K S+ DSTMML+V +AMDKAH+++K++ G++ RL+ IS FYEL+V+QL+ C+ FV +ETD    E++HEEV+  L EI++RL  RL E
Subjt:  MDGIFGMIDGSFKLSIVDSTMMLIVHRAMDKAHQRVKSREGVIERLHEISKFYELSVMQLDGCIKFVQEETDTHNPETSHEEVLAGLAEIRNRLQRRLYE

Query:  SELAILQKDRELADRSESEVKLRQALEITERELVSSQEDLELERSRSAGSSNLSPHE-GEDDENRDGELGEVKEKLELNDDYEHKVKTKRNRCINDVIRV
        +E+AIL+KDR+L + SE++  LR  LE  E ELV  Q DLE +R  S     +   E  E   + D ++  +++KLE   D E + +T+           
Subjt:  SELAILQKDRELADRSESEVKLRQALEITERELVSSQEDLELERSRSAGSSNLSPHE-GEDDENRDGELGEVKEKLELNDDYEHKVKTKRNRCINDVIRV

Query:  EEMGSDIDILKETLDIAFGKMHSAILISEIGAIEQQVKSSIENDMISILLKGFVNDCQEDIEA--------EVTRKERQVSANKWWSDLMNE--------
        +    DID+LK T+D+AF KMH AI +SE+G IEQ  + SIE D +++L+KGF+N  +E +E         E   K+R  S  +    L ++        
Subjt:  EEMGSDIDILKETLDIAFGKMHSAILISEIGAIEQQVKSSIENDMISILLKGFVNDCQEDIEA--------EVTRKERQVSANKWWSDLMNE--------

Query:  ------------VIGLFEDLKPVLGQN----EMRSRECNILNF---------ESIIKKKSKEAEPDQWNPEKLHDKTSLSLRREEST--ESFKRRFQEIL
                     I     +   +G +    E R  E +  NF         ESII++KS+E  P +            S++R++S    S KR   +I+
Subjt:  ------------VIGLFEDLKPVLGQN----EMRSRECNILNF---------ESIIKKKSKEAEPDQWNPEKLHDKTSLSLRREEST--ESFKRRFQEIL

Query:  EKLENSMILNATVNKIIDQNEDFNEEDIPPEKGEQIFVENHRQKSDVDTLADVWGKMHQLQDEEKSGIQNQICALRQEREEREFQNIMKEETYIALFQGL
          L++ M LN  + + +  ++D +              E+H +    D L DVW KM   Q        N I    +E+E+ E + ++ E+TY+ L +GL
Subjt:  EKLENSMILNATVNKIIDQNEDFNEEDIPPEKGEQIFVENHRQKSDVDTLADVWGKMHQLQDEEKSGIQNQICALRQEREEREFQNIMKEETYIALFQGL

Query:  R-------EKFCDDLSTWELEILISEGICRDLIRNMFNQLD
        +        K  ++    + E + SE  C D + N+  + D
Subjt:  R-------EKFCDDLSTWELEILISEGICRDLIRNMFNQLD


Sequences
CDS sequence
ATGGATGGAATTTTTGGAATGATCGATGGCAGTTTCAAACTGTCAATAGTAGATTCAACCATGATGTTGATTGTGCATCGTGCAATGGATAAAGCTCACCAAAGAGTCAA
ATCTAGAGAAGGAGTTATAGAGAGATTACATGAGATATCAAAATTCTACGAGTTATCCGTAATGCAATTGGATGGTTGTATCAAATTTGTTCAAGAAGAAACTGACACTC
ACAATCCCGAAACCAGCCATGAAGAAGTTCTTGCTGGCTTGGCCGAAATACGAAACCGCCTTCAACGACGCCTCTACGAATCGGAACTTGCCATCCTACAGAAAGATAGA
GAGTTGGCAGATCGATCCGAGAGCGAGGTGAAGTTAAGGCAGGCTTTGGAAATTACAGAGAGGGAATTGGTTTCTTCACAGGAAGATCTTGAACTTGAAAGATCAAGAAG
TGCTGGAAGTTCCAACCTTAGCCCACATGAAGGTGAGGATGATGAGAATAGAGATGGGGAATTGGGTGAAGTGAAAGAAAAACTCGAGTTAAATGATGATTATGAGCATA
AGGTGAAGACCAAACGAAATCGTTGTATCAACGACGTAATAAGAGTTGAAGAGATGGGATCTGACATCGATATTTTGAAGGAAACTCTTGATATTGCGTTTGGAAAGATG
CATAGTGCCATTTTGATTTCTGAAATAGGAGCAATAGAGCAGCAAGTAAAGTCAAGTATTGAGAACGACATGATATCAATCCTGCTTAAGGGATTTGTGAATGATTGTCA
AGAGGATATAGAAGCAGAAGTGACAAGGAAAGAGAGGCAAGTTTCGGCAAACAAATGGTGGTCGGATTTAATGAATGAAGTTATAGGCTTGTTTGAGGATCTCAAACCTG
TTCTTGGCCAAAATGAAATGCGGTCCCGAGAGTGCAACATTTTGAATTTTGAGTCAATTATTAAGAAAAAGAGTAAAGAAGCAGAACCGGATCAATGGAATCCAGAAAAG
CTTCATGATAAAACTTCATTATCATTAAGGAGAGAAGAAAGTACTGAAAGCTTCAAAAGAAGGTTCCAAGAAATACTAGAGAAACTAGAGAATTCGATGATTTTGAATGC
TACGGTTAACAAAATTATAGACCAAAATGAGGATTTTAATGAAGAAGACATACCTCCAGAGAAAGGGGAGCAAATATTTGTAGAAAATCATAGACAGAAATCTGATGTGG
ATACTTTGGCAGATGTCTGGGGGAAGATGCATCAACTGCAGGATGAAGAAAAAAGCGGAATACAAAATCAAATTTGCGCGCTAAGGCAAGAAAGAGAGGAAAGAGAATTT
CAAAACATAATGAAGGAAGAAACTTACATCGCTTTATTTCAAGGGTTGAGAGAAAAGTTTTGTGATGATTTAAGTACCTGGGAATTGGAGATCCTGATTTCAGAAGGAAT
ATGCAGAGATCTCATCAGGAATATGTTCAATCAGTTGGATGAAACCATGAAAAGTAACCATATTGAAGCCAAAATTAAAGATGATATATATCATGTTGTCTTCAAGGAGA
CAATGGAAGATTATTGCTCTATAAATGACTTAGGATTACACAGATTGCAGGAATGCAAAATAAAGAAGTCATCCATCTTAGAACTTCACAATATGGAGTTAAACAAGTCA
GATTCTAAGTCCCTAAAACTTATGGAGCTTCCACATATAACATATGAATTTGAGCTGATGGCAAATAGAAAACTGGAAGCAATAATGCTTAGGTACTAA
mRNA sequence
ATGGATGGAATTTTTGGAATGATCGATGGCAGTTTCAAACTGTCAATAGTAGATTCAACCATGATGTTGATTGTGCATCGTGCAATGGATAAAGCTCACCAAAGAGTCAA
ATCTAGAGAAGGAGTTATAGAGAGATTACATGAGATATCAAAATTCTACGAGTTATCCGTAATGCAATTGGATGGTTGTATCAAATTTGTTCAAGAAGAAACTGACACTC
ACAATCCCGAAACCAGCCATGAAGAAGTTCTTGCTGGCTTGGCCGAAATACGAAACCGCCTTCAACGACGCCTCTACGAATCGGAACTTGCCATCCTACAGAAAGATAGA
GAGTTGGCAGATCGATCCGAGAGCGAGGTGAAGTTAAGGCAGGCTTTGGAAATTACAGAGAGGGAATTGGTTTCTTCACAGGAAGATCTTGAACTTGAAAGATCAAGAAG
TGCTGGAAGTTCCAACCTTAGCCCACATGAAGGTGAGGATGATGAGAATAGAGATGGGGAATTGGGTGAAGTGAAAGAAAAACTCGAGTTAAATGATGATTATGAGCATA
AGGTGAAGACCAAACGAAATCGTTGTATCAACGACGTAATAAGAGTTGAAGAGATGGGATCTGACATCGATATTTTGAAGGAAACTCTTGATATTGCGTTTGGAAAGATG
CATAGTGCCATTTTGATTTCTGAAATAGGAGCAATAGAGCAGCAAGTAAAGTCAAGTATTGAGAACGACATGATATCAATCCTGCTTAAGGGATTTGTGAATGATTGTCA
AGAGGATATAGAAGCAGAAGTGACAAGGAAAGAGAGGCAAGTTTCGGCAAACAAATGGTGGTCGGATTTAATGAATGAAGTTATAGGCTTGTTTGAGGATCTCAAACCTG
TTCTTGGCCAAAATGAAATGCGGTCCCGAGAGTGCAACATTTTGAATTTTGAGTCAATTATTAAGAAAAAGAGTAAAGAAGCAGAACCGGATCAATGGAATCCAGAAAAG
CTTCATGATAAAACTTCATTATCATTAAGGAGAGAAGAAAGTACTGAAAGCTTCAAAAGAAGGTTCCAAGAAATACTAGAGAAACTAGAGAATTCGATGATTTTGAATGC
TACGGTTAACAAAATTATAGACCAAAATGAGGATTTTAATGAAGAAGACATACCTCCAGAGAAAGGGGAGCAAATATTTGTAGAAAATCATAGACAGAAATCTGATGTGG
ATACTTTGGCAGATGTCTGGGGGAAGATGCATCAACTGCAGGATGAAGAAAAAAGCGGAATACAAAATCAAATTTGCGCGCTAAGGCAAGAAAGAGAGGAAAGAGAATTT
CAAAACATAATGAAGGAAGAAACTTACATCGCTTTATTTCAAGGGTTGAGAGAAAAGTTTTGTGATGATTTAAGTACCTGGGAATTGGAGATCCTGATTTCAGAAGGAAT
ATGCAGAGATCTCATCAGGAATATGTTCAATCAGTTGGATGAAACCATGAAAAGTAACCATATTGAAGCCAAAATTAAAGATGATATATATCATGTTGTCTTCAAGGAGA
CAATGGAAGATTATTGCTCTATAAATGACTTAGGATTACACAGATTGCAGGAATGCAAAATAAAGAAGTCATCCATCTTAGAACTTCACAATATGGAGTTAAACAAGTCA
GATTCTAAGTCCCTAAAACTTATGGAGCTTCCACATATAACATATGAATTTGAGCTGATGGCAAATAGAAAACTGGAAGCAATAATGCTTAGGTACTAA
Protein sequence
MDGIFGMIDGSFKLSIVDSTMMLIVHRAMDKAHQRVKSREGVIERLHEISKFYELSVMQLDGCIKFVQEETDTHNPETSHEEVLAGLAEIRNRLQRRLYESELAILQKDR
ELADRSESEVKLRQALEITERELVSSQEDLELERSRSAGSSNLSPHEGEDDENRDGELGEVKEKLELNDDYEHKVKTKRNRCINDVIRVEEMGSDIDILKETLDIAFGKM
HSAILISEIGAIEQQVKSSIENDMISILLKGFVNDCQEDIEAEVTRKERQVSANKWWSDLMNEVIGLFEDLKPVLGQNEMRSRECNILNFESIIKKKSKEAEPDQWNPEK
LHDKTSLSLRREESTESFKRRFQEILEKLENSMILNATVNKIIDQNEDFNEEDIPPEKGEQIFVENHRQKSDVDTLADVWGKMHQLQDEEKSGIQNQICALRQEREEREF
QNIMKEETYIALFQGLREKFCDDLSTWELEILISEGICRDLIRNMFNQLDETMKSNHIEAKIKDDIYHVVFKETMEDYCSINDLGLHRLQECKIKKSSILELHNMELNKS
DSKSLKLMELPHITYEFELMANRKLEAIMLRY
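
The protein sequence can be cross-checked against the CDS by translating it in frame. The sketch below assumes the standard genetic code (NCBI translation table 1) and a CDS that starts in frame 1; `translate` is a hypothetical helper, not a CuGenDB function. The two calls use the first 21 and the last 9 bases of the CDS above.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code (NCBI translation table 1); codons ordered TTT, TTC, TTA, TTG, TCT, ...
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(codon): aa
               for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)}

def translate(cds):
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks from a wrapped sequence
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # 'X' for codons with ambiguous bases
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

print(translate("ATGGATGGAATTTTTGGAATG"))  # MDGIFGM
print(translate("AGGTACTAA"))              # RY
```

Both results agree with the start (MDGIFGM) and the end (RY) of the protein sequence listed above. Passing the full CDS, with or without its line breaks, allows the same comparison over the whole length.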