CuGenDBv2

CmoCh04G007460 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene ID: CmoCh04G007460
Organism: Cucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
Description: high mobility group B protein 6-like
Genome location: Cmo_Chr04:3715111..3722157
RNA-Seq Expression: CmoCh04G007460
Synteny: CmoCh04G007460
Gene Ontology terms:
  GO:1900150 - regulation of defense response to fungus (biological process)
  GO:0005634 - nucleus (cellular component)
  GO:0003677 - DNA binding (molecular function)
InterPro domains:
  IPR009071 - High mobility group box domain
  IPR021480 - Probable zinc-ribbon domain, plant
  IPR036910 - High mobility group box domain superfamily
  IPR044601 - 3xHMG-box protein HMGB6/HMGB13
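
The genome location in this record has the form seqid:start..end. A minimal sketch for splitting that string and computing the span length follows. It assumes the coordinates are 1-based and inclusive, which is a common convention but is not stated in the record. The function names `parse_location` and `span_length` are hypothetical helpers for illustration and are not part of any CuGenDB tool.

```python
import re


def parse_location(loc):
    """Split a 'seqid:start..end' string into (seqid, start, end).

    Assumption: coordinates are 1-based and inclusive.
    """
    m = re.fullmatch(r"(?P<seqid>[^:]+):(?P<start>\d+)\.\.(?P<end>\d+)", loc)
    if m is None:
        raise ValueError(f"unrecognised location string: {loc!r}")
    start, end = int(m["start"]), int(m["end"])
    if start > end:
        raise ValueError("start must not exceed end")
    return m["seqid"], start, end


def span_length(start, end):
    """Number of bases covered by an inclusive 1-based interval."""
    return end - start + 1


if __name__ == "__main__":
    seqid, start, end = parse_location("Cmo_Chr04:3715111..3722157")
    print(seqid, start, end, span_length(start, end))
```

Under the inclusive-coordinate assumption, the locus spans 7047 bases on Cmo_Chr04.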


Homology
GenBank top hits (description; e value; % identity; alignment follows each hit)
KAG6600530.1 Protein ENHANCED DISEASE RESISTANCE 4, partial [Cucurbita argyrosperma subsp. sororia]; e value 0.0e+00; identity 95.41%
Query:  MSGDQKVRVVRCPKCENLLPEPSEPPVYQCGGCGAVLRAKSKVPLNEKNDCMSSENYESLSEQGSSLGAASDTEWGSPSSKRTVFSNSPIRTNDREDIND
        MSGDQKVRVVRCPKCENLLPEPSEPPVYQCGGCGAVLRAKSKVPLNEKND MSSENYESLSEQGSSLGAASDTEWGSPSSKRTVFSNSPIRTNDREDI D
Subjt:  MSGDQKVRVVRCPKCENLLPEPSEPPVYQCGGCGAVLRAKSKVPLNEKNDCMSSENYESLSEQGSSLGAASDTEWGSPSSKRTVFSNSPIRTNDREDIND

Query:  YEMKVGKETNGVWPIQRFGDQYIKNWVGRCNLEQDVNVYDLDYPSTAPYRTRIGAARSRASFEHRKVERDAYTRYSRNSMAVADRPSSSNFEGLNPNPAE
        YEMKVGKE NGVWPIQRFGDQYIKNWVGRCNLEQDVNVYDLDYPSTAPYRTRIGAARSRASFEHRKVERD+YTRYSRNSMAVADRPSSSNFEGLNPNPAE
Subjt:  YEMKVGKETNGVWPIQRFGDQYIKNWVGRCNLEQDVNVYDLDYPSTAPYRTRIGAARSRASFEHRKVERDAYTRYSRNSMAVADRPSSSNFEGLNPNPAE

Query:  LLRRLDELKDQIIMSCDVRAPANQYYGRPTYNVPMQPSTKSQQLSHGSHYQRNSEEFLHPKEPIKMSAYYNENAIPIGLEASDLRRAGRFPHSRQSSEFS
        LLRRLDELKDQII SCDVRAPANQYYGRPTY+VP+QPSTKS QLSHGSHYQRNSEEFLHPKEPIKMS YYNENAIPIGLEASDLRRAGRFPHSRQSSEFS
Subjt:  LLRRLDELKDQIIMSCDVRAPANQYYGRPTYNVPMQPSTKSQQLSHGSHYQRNSEEFLHPKEPIKMSAYYNENAIPIGLEASDLRRAGRFPHSRQSSEFS

Query:  SGTDGYGLVQPRKAPLLQRNGNSCDAIAGGAPFIVCVSCLELLKLPRKLYKLQMDWQKLQCGACSVVIVVRVENGRLVVSVPSESKLKEVSLDDGSPKRA
        S TDGYGLVQPRKAPLLQ NGNSCDAIAGGAPFIVCVSCLELLKLPRKLYKLQMDWQKLQCGACSVV+VVRVENGRLVVSVPSESKLKE S DDGSPKRA
Subjt:  SGTDGYGLVQPRKAPLLQRNGNSCDAIAGGAPFIVCVSCLELLKLPRKLYKLQMDWQKLQCGACSVVIVVRVENGRLVVSVPSESKLKEVSLDDGSPKRA

Query:  VNATNSLESSDDSGHKFISTDHNKQEQTSLKTTPAIKCEPSLLNDSADLPSKDVSKENSDSTSYQKASNYREGGDENKQNTVIDGNAEPIELDVSFEDYS
         NATN   SSDDS HK ISTDHNKQEQTSLKTTPA KCEPSLLNDSADLPSKDVSKENSDSTSYQ+A  YREGGDENKQNTVID NAEPIELDVSFEDYS
Subjt:  VNATNSLESSDDSGHKFISTDHNKQEQTSLKTTPAIKCEPSLLNDSADLPSKDVSKENSDSTSYQKASNYREGGDENKQNTVIDGNAEPIELDVSFEDYS

Query:  NIHVSQDFVETSKEEVEDQSKIKNSQESDTFFVGLSRYDYQAGFWGVMGHPCLGIIPPFIDEFTYPLSRNCAAGNTEIFVNGRELHKRDLELLSSRGLPT
        NIHVSQDFVETSKEEVEDQSKIKNS+ES+TFFVGLSRYDYQAGFWGVMGHPCLGIIPPFIDEFTYPLSRNCAAGNTEIFVNGRELHKRDLELLSSRGLPT
Subjt:  NIHVSQDFVETSKEEVEDQSKIKNSQESDTFFVGLSRYDYQAGFWGVMGHPCLGIIPPFIDEFTYPLSRNCAAGNTEIFVNGRELHKRDLELLSSRGLPT

Query:  TPSKLYRIDISGRVVDEDTGKVLHNLGKLAPT
        TP+K YRIDISGRVVDEDTGKVLHNLGKLAPT
Subjt:  TPSKLYRIDISGRVVDEDTGKVLHNLGKLAPT

XP_022942883.1 uncharacterized protein LOC111447780 isoform X1 [Cucurbita moschata]; e value 0.0e+00; identity 92.94%
Query:  MSGDQKVRVVRCPKCENLLPEPSEPPVYQCGGCGAVLRAKSKVPLNEKNDCMSSENYESLSEQGSSLGAASDTEWGSPSSKRTVFSNSPIRTNDREDIND
        MSGDQKVRVVRCPKCENLLPEPSEPPVYQCGGCGAVLRAKSKVPLNEKNDCMSSENYESLSEQGSSLGAASDTEWGSPSSKRTVFSNSPIRTNDREDIND
Subjt:  MSGDQKVRVVRCPKCENLLPEPSEPPVYQCGGCGAVLRAKSKVPLNEKNDCMSSENYESLSEQGSSLGAASDTEWGSPSSKRTVFSNSPIRTNDREDIND

Query:  YEMKVGKETNGVWPIQRFGDQYIKNWVGRCNLEQDVNVYDLDYPSTAPYRTRIGAARSRASFEHRKVERDAYTRYSRNSMAVADRPSSSNFEGLNPNPAE
        YEMKVGKETNGVWPIQRFGDQYIKNWVGRCNLEQDVNVYDLDYPSTAPYRTRIGAARSRASFEHRKVERDAYTRYSRNSMAVADRPSSSNFEGLNPNPAE
Subjt:  YEMKVGKETNGVWPIQRFGDQYIKNWVGRCNLEQDVNVYDLDYPSTAPYRTRIGAARSRASFEHRKVERDAYTRYSRNSMAVADRPSSSNFEGLNPNPAE

Query:  LLRRLDELKDQIIMSCDVRAPANQYYGRPTYNVPMQPSTKSQQLSHGSHYQRNSEEFLHPKEPIKMSAYYNENAIPIGLEASDLRRAGRFPHSRQSSEFS
        LLRRLDELKDQIIMSCDVRAPANQYYGRPTYNVPMQPSTKSQQLSHGSHYQRNSEEFLHPKEPIKMSAYYNENAIPIGLEASDLRRAGRFPHSRQSSEFS
Subjt:  LLRRLDELKDQIIMSCDVRAPANQYYGRPTYNVPMQPSTKSQQLSHGSHYQRNSEEFLHPKEPIKMSAYYNENAIPIGLEASDLRRAGRFPHSRQSSEFS

Query:  SGTDGYGLVQPRKAPLLQRNGNSCDAIAGGAPFIVCVSCLELLKLPRKLYKLQMDWQKLQCGACSVVIVVRVENGRLVVSVPSESKLKEVSLDDGSPKRA
        SGTDGYGLVQPRKAPLLQRNGNSCDAIAGGAPFIVCVSCLELLKLPRKLYKLQMDWQKLQCGACSVVIVVRVENGRLVVSVPSESKLKEVSLDDGSPKRA
Subjt:  SGTDGYGLVQPRKAPLLQRNGNSCDAIAGGAPFIVCVSCLELLKLPRKLYKLQMDWQKLQCGACSVVIVVRVENGRLVVSVPSESKLKEVSLDDGSPKRA

Query:  VNATNSLESSDDSGHKFISTDHNKQEQTSLKTTPAIKCEPSLLNDSADLPSKDVSKENSDSTSYQKASNYREGGDENKQNTVIDGNAEPIELDVSFEDYS
        VNATNSLESSDDSGHKFISTDHNKQEQTSLKTTPAIKCEPSLLNDSADLPSKDVSKENSDSTSYQKASNYREGGDENKQNTVIDGNAEPIELDVSFEDYS
Subjt:  VNATNSLESSDDSGHKFISTDHNKQEQTSLKTTPAIKCEPSLLNDSADLPSKDVSKENSDSTSYQKASNYREGGDENKQNTVIDGNAEPIELDVSFEDYS

Query:  NIHVSQDFVETSKEEVEDQSKIKNSQESDTFFVGLSR------------------------------------------------YDYQAGFWGVMGHPC
        NIHVSQDFVETSKEEVEDQSKIKNSQESDTFFVGLSR                                                YDYQAGFWGVMGHPC
Subjt:  NIHVSQDFVETSKEEVEDQSKIKNSQESDTFFVGLSR------------------------------------------------YDYQAGFWGVMGHPC

Query:  LGIIPPFIDEFTYPLSRNCAAGNTEIFVNGRELHKRDLELLSSRGLPTTPSKLYRIDISGRVVDEDTGKVLHNLGKLAPT
        LGIIPPFIDEFTYPLSRNCAAGNTEIFVNGRELHKRDLELLSSRGLPTTPSKLYRIDISGRVVDEDTGKVLHNLGKLAPT
Subjt:  LGIIPPFIDEFTYPLSRNCAAGNTEIFVNGRELHKRDLELLSSRGLPTTPSKLYRIDISGRVVDEDTGKVLHNLGKLAPT

XP_022942884.1 uncharacterized protein LOC111447780 isoform X2 [Cucurbita moschata]; e value 0.0e+00; identity 92.94%
Query:  MSGDQKVRVVRCPKCENLLPEPSEPPVYQCGGCGAVLRAKSKVPLNEKNDCMSSENYESLSEQGSSLGAASDTEWGSPSSKRTVFSNSPIRTNDREDIND
        MSGDQKVRVVRCPKCENLLPEPSEPPVYQCGGCGAVLRAKSKVPLNEKNDCMSSENYESLSEQGSSLGAASDTEWGSPSSKRTVFSNSPIRTNDREDIND
Subjt:  MSGDQKVRVVRCPKCENLLPEPSEPPVYQCGGCGAVLRAKSKVPLNEKNDCMSSENYESLSEQGSSLGAASDTEWGSPSSKRTVFSNSPIRTNDREDIND

Query:  YEMKVGKETNGVWPIQRFGDQYIKNWVGRCNLEQDVNVYDLDYPSTAPYRTRIGAARSRASFEHRKVERDAYTRYSRNSMAVADRPSSSNFEGLNPNPAE
        YEMKVGKETNGVWPIQRFGDQYIKNWVGRCNLEQDVNVYDLDYPSTAPYRTRIGAARSRASFEHRKVERDAYTRYSRNSMAVADRPSSSNFEGLNPNPAE
Subjt:  YEMKVGKETNGVWPIQRFGDQYIKNWVGRCNLEQDVNVYDLDYPSTAPYRTRIGAARSRASFEHRKVERDAYTRYSRNSMAVADRPSSSNFEGLNPNPAE

Query:  LLRRLDELKDQIIMSCDVRAPANQYYGRPTYNVPMQPSTKSQQLSHGSHYQRNSEEFLHPKEPIKMSAYYNENAIPIGLEASDLRRAGRFPHSRQSSEFS
        LLRRLDELKDQIIMSCDVRAPANQYYGRPTYNVPMQPSTKSQQLSHGSHYQRNSEEFLHPKEPIKMSAYYNENAIPIGLEASDLRRAGRFPHSRQSSEFS
Subjt:  LLRRLDELKDQIIMSCDVRAPANQYYGRPTYNVPMQPSTKSQQLSHGSHYQRNSEEFLHPKEPIKMSAYYNENAIPIGLEASDLRRAGRFPHSRQSSEFS

Query:  SGTDGYGLVQPRKAPLLQRNGNSCDAIAGGAPFIVCVSCLELLKLPRKLYKLQMDWQKLQCGACSVVIVVRVENGRLVVSVPSESKLKEVSLDDGSPKRA
        SGTDGYGLVQPRKAPLLQRNGNSCDAIAGGAPFIVCVSCLELLKLPRKLYKLQMDWQKLQCGACSVVIVVRVENGRLVVSVPSESKLKEVSLDDGSPKRA
Subjt:  SGTDGYGLVQPRKAPLLQRNGNSCDAIAGGAPFIVCVSCLELLKLPRKLYKLQMDWQKLQCGACSVVIVVRVENGRLVVSVPSESKLKEVSLDDGSPKRA

Query:  VNATNSLESSDDSGHKFISTDHNKQEQTSLKTTPAIKCEPSLLNDSADLPSKDVSKENSDSTSYQKASNYREGGDENKQNTVIDGNAEPIELDVSFEDYS
        VNATNSLESSDDSGHKFISTDHNKQEQTSLKTTPAIKCEPSLLNDSADLPSKDVSKENSDSTSYQKASNYREGGDENKQNTVIDGNAEPIELDVSFEDYS
Subjt:  VNATNSLESSDDSGHKFISTDHNKQEQTSLKTTPAIKCEPSLLNDSADLPSKDVSKENSDSTSYQKASNYREGGDENKQNTVIDGNAEPIELDVSFEDYS

Query:  NIHVSQDFVETSKEEVEDQSKIKNSQESDTFFVGLSR------------------------------------------------YDYQAGFWGVMGHPC
        NIHVSQDFVETSKEEVEDQSKIKNSQESDTFFVGLSR                                                YDYQAGFWGVMGHPC
Subjt:  NIHVSQDFVETSKEEVEDQSKIKNSQESDTFFVGLSR------------------------------------------------YDYQAGFWGVMGHPC

Query:  LGIIPPFIDEFTYPLSRNCAAGNTEIFVNGRELHKRDLELLSSRGLPTTPSKLYRIDISGRVVDEDTGKVLHNLGKLAPT
        LGIIPPFIDEFTYPLSRNCAAGNTEIFVNGRELHKRDLELLSSRGLPTTPSKLYRIDISGRVVDEDTGKVLHNLGKLAPT
Subjt:  LGIIPPFIDEFTYPLSRNCAAGNTEIFVNGRELHKRDLELLSSRGLPTTPSKLYRIDISGRVVDEDTGKVLHNLGKLAPT

XP_023521498.1 uncharacterized protein LOC111785289 isoform X1 [Cucurbita pepo subsp. pepo]; e value 0.0e+00; identity 89.15%
Query:  MSGDQKVRVVRCPKCENLLPEPSEPPVYQCGGCGAVLRAKSKVPLNEKNDCMSSENYESLSEQGSSLGAASDTEWGSPSSKRTVFSNSPIRTNDREDIND
        MS +QKVRVVRCPKCENLLPEPSE PVYQCGGCGAVLRAKSKVPLNEKND MSSENYESLSEQGSSLGAASDTEWGSPSSKRTVFSNSPIRTNDR+DIND
Subjt:  MSGDQKVRVVRCPKCENLLPEPSEPPVYQCGGCGAVLRAKSKVPLNEKNDCMSSENYESLSEQGSSLGAASDTEWGSPSSKRTVFSNSPIRTNDREDIND

Query:  YEMKVGKETNGVWPIQRFGDQYIKNWVGRCNLEQDVNVYDLDYPSTAPYRTRIGAARSRASFEHRKVERDAYTRYSRNSMAVADRPSSSNFEGLNPNPAE
        YEMKVGKETNGVWPIQRFGDQYIKNWVGRCNLEQDV+VYDLDYPSTAPY TRIGAARSRASFEHRKVERDAYTRYSRNSMAVADRPSSSNFEGLNPNPAE
Subjt:  YEMKVGKETNGVWPIQRFGDQYIKNWVGRCNLEQDVNVYDLDYPSTAPYRTRIGAARSRASFEHRKVERDAYTRYSRNSMAVADRPSSSNFEGLNPNPAE

Query:  LLRRLDELKDQIIMSCDVRAPANQYYGRPTYNVPMQPSTKSQQLSHGSHYQRNSEEFLHPKEPIKMSAYYNENAIPIGLEASDLRRAGRFPHSRQSSEFS
        LLRRLDELKDQIIMSCDVRAPANQYYGRPTYNVPMQPSTKSQQLSHGSHYQRNSEEFLHPKEPIKMSAYYNENAIPIGLEASDLRRAGRFPHSRQSSEFS
Subjt:  LLRRLDELKDQIIMSCDVRAPANQYYGRPTYNVPMQPSTKSQQLSHGSHYQRNSEEFLHPKEPIKMSAYYNENAIPIGLEASDLRRAGRFPHSRQSSEFS

Query:  SGTDGYGLVQPRKAPLLQRNGNSCDAIAGGAPFIVCVSCLELLKLPRKLYKLQMDWQKLQCGACSVVIVVRVENGRLVVSVPSESKLKEVSLDDGSPKRA
        S TDGYGLVQPRKAPLLQRNGNSCDAIAGGAPFIVCVSCLELLKLPRKLYKLQMDWQKLQCGACSVV+VVRVEN RLVVSVPSESKLKEVS DDGSPKRA
Subjt:  SGTDGYGLVQPRKAPLLQRNGNSCDAIAGGAPFIVCVSCLELLKLPRKLYKLQMDWQKLQCGACSVVIVVRVENGRLVVSVPSESKLKEVSLDDGSPKRA

Query:  VNATNSLESSDDSGHKFISTDHNKQEQTSLKTTPAIKCEPSLLNDSADLPSKDVSKENSDSTSYQKASNYREGGDENKQNTVIDGNAEPIELDVSFEDYS
         NATNSLE+S DS HK ISTDHNK EQTSLKTTPAIKCEPSLLNDSADLPSKDVSKENSDSTSYQ+AS YREGGD +KQNTVID NAEPIELDVSFEDYS
Subjt:  VNATNSLESSDDSGHKFISTDHNKQEQTSLKTTPAIKCEPSLLNDSADLPSKDVSKENSDSTSYQKASNYREGGDENKQNTVIDGNAEPIELDVSFEDYS

Query:  NIHVSQDFVETSKEEVEDQSKIKNSQESDTFFVGLSR------------------------------------------------YDYQAGFWGVMGHPC
        NIHVSQDFVETSKEEVEDQSKIKNSQES+TFFVGLSR                                                YDYQAGFWGVMGHPC
Subjt:  NIHVSQDFVETSKEEVEDQSKIKNSQESDTFFVGLSR------------------------------------------------YDYQAGFWGVMGHPC

Query:  LGIIPPFIDEFTYPLSRNCAAGNTEIFVNGRELHKRDLELLSSRGLPTTPSKLYRIDISGRVVDEDTGKVLHNLGKLAPTSA
        LGIIPPFIDEFTYPLSRNCAAGNTEIFVNGRELHKRDLELLSSRGLPTTP+K YRIDISGRVVDEDTGKVLHNLGKLAPT A
Subjt:  LGIIPPFIDEFTYPLSRNCAAGNTEIFVNGRELHKRDLELLSSRGLPTTPSKLYRIDISGRVVDEDTGKVLHNLGKLAPTSA

XP_023521564.1 uncharacterized protein LOC111785289 isoform X2 [Cucurbita pepo subsp. pepo]; e value 0.0e+00; identity 89.26%
Query:  MSGDQKVRVVRCPKCENLLPEPSEPPVYQCGGCGAVLRAKSKVPLNEKNDCMSSENYESLSEQGSSLGAASDTEWGSPSSKRTVFSNSPIRTNDREDIND
        MS +QKVRVVRCPKCENLLPEPSE PVYQCGGCGAVLRAKSKVPLNEKND MSSENYESLSEQGSSLGAASDTEWGSPSSKRTVFSNSPIRTNDR+DIND
Subjt:  MSGDQKVRVVRCPKCENLLPEPSEPPVYQCGGCGAVLRAKSKVPLNEKNDCMSSENYESLSEQGSSLGAASDTEWGSPSSKRTVFSNSPIRTNDREDIND

Query:  YEMKVGKETNGVWPIQRFGDQYIKNWVGRCNLEQDVNVYDLDYPSTAPYRTRIGAARSRASFEHRKVERDAYTRYSRNSMAVADRPSSSNFEGLNPNPAE
        YEMKVGKETNGVWPIQRFGDQYIKNWVGRCNLEQDV+VYDLDYPSTAPY TRIGAARSRASFEHRKVERDAYTRYSRNSMAVADRPSSSNFEGLNPNPAE
Subjt:  YEMKVGKETNGVWPIQRFGDQYIKNWVGRCNLEQDVNVYDLDYPSTAPYRTRIGAARSRASFEHRKVERDAYTRYSRNSMAVADRPSSSNFEGLNPNPAE

Query:  LLRRLDELKDQIIMSCDVRAPANQYYGRPTYNVPMQPSTKSQQLSHGSHYQRNSEEFLHPKEPIKMSAYYNENAIPIGLEASDLRRAGRFPHSRQSSEFS
        LLRRLDELKDQIIMSCDVRAPANQYYGRPTYNVPMQPSTKSQQLSHGSHYQRNSEEFLHPKEPIKMSAYYNENAIPIGLEASDLRRAGRFPHSRQSSEFS
Subjt:  LLRRLDELKDQIIMSCDVRAPANQYYGRPTYNVPMQPSTKSQQLSHGSHYQRNSEEFLHPKEPIKMSAYYNENAIPIGLEASDLRRAGRFPHSRQSSEFS

Query:  SGTDGYGLVQPRKAPLLQRNGNSCDAIAGGAPFIVCVSCLELLKLPRKLYKLQMDWQKLQCGACSVVIVVRVENGRLVVSVPSESKLKEVSLDDGSPKRA
        S TDGYGLVQPRKAPLLQRNGNSCDAIAGGAPFIVCVSCLELLKLPRKLYKLQMDWQKLQCGACSVV+VVRVEN RLVVSVPSESKLKEVS DDGSPKRA
Subjt:  SGTDGYGLVQPRKAPLLQRNGNSCDAIAGGAPFIVCVSCLELLKLPRKLYKLQMDWQKLQCGACSVVIVVRVENGRLVVSVPSESKLKEVSLDDGSPKRA

Query:  VNATNSLESSDDSGHKFISTDHNKQEQTSLKTTPAIKCEPSLLNDSADLPSKDVSKENSDSTSYQKASNYREGGDENKQNTVIDGNAEPIELDVSFEDYS
         NATNSLE+S DS HK ISTDHNK EQTSLKTTPAIKCEPSLLNDSADLPSKDVSKENSDSTSYQ+AS YREGGD +KQNTVID NAEPIELDVSFEDYS
Subjt:  VNATNSLESSDDSGHKFISTDHNKQEQTSLKTTPAIKCEPSLLNDSADLPSKDVSKENSDSTSYQKASNYREGGDENKQNTVIDGNAEPIELDVSFEDYS

Query:  NIHVSQDFVETSKEEVEDQSKIKNSQESDTFFVGLSR------------------------------------------------YDYQAGFWGVMGHPC
        NIHVSQDFVETSKEEVEDQSKIKNSQES+TFFVGLSR                                                YDYQAGFWGVMGHPC
Subjt:  NIHVSQDFVETSKEEVEDQSKIKNSQESDTFFVGLSR------------------------------------------------YDYQAGFWGVMGHPC

Query:  LGIIPPFIDEFTYPLSRNCAAGNTEIFVNGRELHKRDLELLSSRGLPTTPSKLYRIDISGRVVDEDTGKVLHNLGKLAPT
        LGIIPPFIDEFTYPLSRNCAAGNTEIFVNGRELHKRDLELLSSRGLPTTP+K YRIDISGRVVDEDTGKVLHNLGKLAPT
Subjt:  LGIIPPFIDEFTYPLSRNCAAGNTEIFVNGRELHKRDLELLSSRGLPTTPSKLYRIDISGRVVDEDTGKVLHNLGKLAPT

TrEMBL top hits (description; e value; % identity; alignment follows each hit)
A0A6J1FSM2 uncharacterized protein LOC111447780 isoform X2; e value 0.0e+00; identity 92.94%
Query:  MSGDQKVRVVRCPKCENLLPEPSEPPVYQCGGCGAVLRAKSKVPLNEKNDCMSSENYESLSEQGSSLGAASDTEWGSPSSKRTVFSNSPIRTNDREDIND
        MSGDQKVRVVRCPKCENLLPEPSEPPVYQCGGCGAVLRAKSKVPLNEKNDCMSSENYESLSEQGSSLGAASDTEWGSPSSKRTVFSNSPIRTNDREDIND
Subjt:  MSGDQKVRVVRCPKCENLLPEPSEPPVYQCGGCGAVLRAKSKVPLNEKNDCMSSENYESLSEQGSSLGAASDTEWGSPSSKRTVFSNSPIRTNDREDIND

Query:  YEMKVGKETNGVWPIQRFGDQYIKNWVGRCNLEQDVNVYDLDYPSTAPYRTRIGAARSRASFEHRKVERDAYTRYSRNSMAVADRPSSSNFEGLNPNPAE
        YEMKVGKETNGVWPIQRFGDQYIKNWVGRCNLEQDVNVYDLDYPSTAPYRTRIGAARSRASFEHRKVERDAYTRYSRNSMAVADRPSSSNFEGLNPNPAE
Subjt:  YEMKVGKETNGVWPIQRFGDQYIKNWVGRCNLEQDVNVYDLDYPSTAPYRTRIGAARSRASFEHRKVERDAYTRYSRNSMAVADRPSSSNFEGLNPNPAE

Query:  LLRRLDELKDQIIMSCDVRAPANQYYGRPTYNVPMQPSTKSQQLSHGSHYQRNSEEFLHPKEPIKMSAYYNENAIPIGLEASDLRRAGRFPHSRQSSEFS
        LLRRLDELKDQIIMSCDVRAPANQYYGRPTYNVPMQPSTKSQQLSHGSHYQRNSEEFLHPKEPIKMSAYYNENAIPIGLEASDLRRAGRFPHSRQSSEFS
Subjt:  LLRRLDELKDQIIMSCDVRAPANQYYGRPTYNVPMQPSTKSQQLSHGSHYQRNSEEFLHPKEPIKMSAYYNENAIPIGLEASDLRRAGRFPHSRQSSEFS

Query:  SGTDGYGLVQPRKAPLLQRNGNSCDAIAGGAPFIVCVSCLELLKLPRKLYKLQMDWQKLQCGACSVVIVVRVENGRLVVSVPSESKLKEVSLDDGSPKRA
        SGTDGYGLVQPRKAPLLQRNGNSCDAIAGGAPFIVCVSCLELLKLPRKLYKLQMDWQKLQCGACSVVIVVRVENGRLVVSVPSESKLKEVSLDDGSPKRA
Subjt:  SGTDGYGLVQPRKAPLLQRNGNSCDAIAGGAPFIVCVSCLELLKLPRKLYKLQMDWQKLQCGACSVVIVVRVENGRLVVSVPSESKLKEVSLDDGSPKRA

Query:  VNATNSLESSDDSGHKFISTDHNKQEQTSLKTTPAIKCEPSLLNDSADLPSKDVSKENSDSTSYQKASNYREGGDENKQNTVIDGNAEPIELDVSFEDYS
        VNATNSLESSDDSGHKFISTDHNKQEQTSLKTTPAIKCEPSLLNDSADLPSKDVSKENSDSTSYQKASNYREGGDENKQNTVIDGNAEPIELDVSFEDYS
Subjt:  VNATNSLESSDDSGHKFISTDHNKQEQTSLKTTPAIKCEPSLLNDSADLPSKDVSKENSDSTSYQKASNYREGGDENKQNTVIDGNAEPIELDVSFEDYS

Query:  NIHVSQDFVETSKEEVEDQSKIKNSQESDTFFVGLSR------------------------------------------------YDYQAGFWGVMGHPC
        NIHVSQDFVETSKEEVEDQSKIKNSQESDTFFVGLSR                                                YDYQAGFWGVMGHPC
Subjt:  NIHVSQDFVETSKEEVEDQSKIKNSQESDTFFVGLSR------------------------------------------------YDYQAGFWGVMGHPC

Query:  LGIIPPFIDEFTYPLSRNCAAGNTEIFVNGRELHKRDLELLSSRGLPTTPSKLYRIDISGRVVDEDTGKVLHNLGKLAPT
        LGIIPPFIDEFTYPLSRNCAAGNTEIFVNGRELHKRDLELLSSRGLPTTPSKLYRIDISGRVVDEDTGKVLHNLGKLAPT
Subjt:  LGIIPPFIDEFTYPLSRNCAAGNTEIFVNGRELHKRDLELLSSRGLPTTPSKLYRIDISGRVVDEDTGKVLHNLGKLAPT

A0A6J1FVW4 uncharacterized protein LOC111447780 isoform X1; e value 0.0e+00; identity 92.94%
Query:  MSGDQKVRVVRCPKCENLLPEPSEPPVYQCGGCGAVLRAKSKVPLNEKNDCMSSENYESLSEQGSSLGAASDTEWGSPSSKRTVFSNSPIRTNDREDIND
        MSGDQKVRVVRCPKCENLLPEPSEPPVYQCGGCGAVLRAKSKVPLNEKNDCMSSENYESLSEQGSSLGAASDTEWGSPSSKRTVFSNSPIRTNDREDIND
Subjt:  MSGDQKVRVVRCPKCENLLPEPSEPPVYQCGGCGAVLRAKSKVPLNEKNDCMSSENYESLSEQGSSLGAASDTEWGSPSSKRTVFSNSPIRTNDREDIND

Query:  YEMKVGKETNGVWPIQRFGDQYIKNWVGRCNLEQDVNVYDLDYPSTAPYRTRIGAARSRASFEHRKVERDAYTRYSRNSMAVADRPSSSNFEGLNPNPAE
        YEMKVGKETNGVWPIQRFGDQYIKNWVGRCNLEQDVNVYDLDYPSTAPYRTRIGAARSRASFEHRKVERDAYTRYSRNSMAVADRPSSSNFEGLNPNPAE
Subjt:  YEMKVGKETNGVWPIQRFGDQYIKNWVGRCNLEQDVNVYDLDYPSTAPYRTRIGAARSRASFEHRKVERDAYTRYSRNSMAVADRPSSSNFEGLNPNPAE

Query:  LLRRLDELKDQIIMSCDVRAPANQYYGRPTYNVPMQPSTKSQQLSHGSHYQRNSEEFLHPKEPIKMSAYYNENAIPIGLEASDLRRAGRFPHSRQSSEFS
        LLRRLDELKDQIIMSCDVRAPANQYYGRPTYNVPMQPSTKSQQLSHGSHYQRNSEEFLHPKEPIKMSAYYNENAIPIGLEASDLRRAGRFPHSRQSSEFS
Subjt:  LLRRLDELKDQIIMSCDVRAPANQYYGRPTYNVPMQPSTKSQQLSHGSHYQRNSEEFLHPKEPIKMSAYYNENAIPIGLEASDLRRAGRFPHSRQSSEFS

Query:  SGTDGYGLVQPRKAPLLQRNGNSCDAIAGGAPFIVCVSCLELLKLPRKLYKLQMDWQKLQCGACSVVIVVRVENGRLVVSVPSESKLKEVSLDDGSPKRA
        SGTDGYGLVQPRKAPLLQRNGNSCDAIAGGAPFIVCVSCLELLKLPRKLYKLQMDWQKLQCGACSVVIVVRVENGRLVVSVPSESKLKEVSLDDGSPKRA
Subjt:  SGTDGYGLVQPRKAPLLQRNGNSCDAIAGGAPFIVCVSCLELLKLPRKLYKLQMDWQKLQCGACSVVIVVRVENGRLVVSVPSESKLKEVSLDDGSPKRA

Query:  VNATNSLESSDDSGHKFISTDHNKQEQTSLKTTPAIKCEPSLLNDSADLPSKDVSKENSDSTSYQKASNYREGGDENKQNTVIDGNAEPIELDVSFEDYS
        VNATNSLESSDDSGHKFISTDHNKQEQTSLKTTPAIKCEPSLLNDSADLPSKDVSKENSDSTSYQKASNYREGGDENKQNTVIDGNAEPIELDVSFEDYS
Subjt:  VNATNSLESSDDSGHKFISTDHNKQEQTSLKTTPAIKCEPSLLNDSADLPSKDVSKENSDSTSYQKASNYREGGDENKQNTVIDGNAEPIELDVSFEDYS

Query:  NIHVSQDFVETSKEEVEDQSKIKNSQESDTFFVGLSR------------------------------------------------YDYQAGFWGVMGHPC
        NIHVSQDFVETSKEEVEDQSKIKNSQESDTFFVGLSR                                                YDYQAGFWGVMGHPC
Subjt:  NIHVSQDFVETSKEEVEDQSKIKNSQESDTFFVGLSR------------------------------------------------YDYQAGFWGVMGHPC

Query:  LGIIPPFIDEFTYPLSRNCAAGNTEIFVNGRELHKRDLELLSSRGLPTTPSKLYRIDISGRVVDEDTGKVLHNLGKLAPT
        LGIIPPFIDEFTYPLSRNCAAGNTEIFVNGRELHKRDLELLSSRGLPTTPSKLYRIDISGRVVDEDTGKVLHNLGKLAPT
Subjt:  LGIIPPFIDEFTYPLSRNCAAGNTEIFVNGRELHKRDLELLSSRGLPTTPSKLYRIDISGRVVDEDTGKVLHNLGKLAPT

A0A6J1FX31 uncharacterized protein LOC111447780 isoform X3; e value 0.0e+00; identity 92.37%
Query:  MSSENYESLSEQGSSLGAASDTEWGSPSSKRTVFSNSPIRTNDREDINDYEMKVGKETNGVWPIQRFGDQYIKNWVGRCNLEQDVNVYDLDYPSTAPYRT
        MSSENYESLSEQGSSLGAASDTEWGSPSSKRTVFSNSPIRTNDREDINDYEMKVGKETNGVWPIQRFGDQYIKNWVGRCNLEQDVNVYDLDYPSTAPYRT
Subjt:  MSSENYESLSEQGSSLGAASDTEWGSPSSKRTVFSNSPIRTNDREDINDYEMKVGKETNGVWPIQRFGDQYIKNWVGRCNLEQDVNVYDLDYPSTAPYRT

Query:  RIGAARSRASFEHRKVERDAYTRYSRNSMAVADRPSSSNFEGLNPNPAELLRRLDELKDQIIMSCDVRAPANQYYGRPTYNVPMQPSTKSQQLSHGSHYQ
        RIGAARSRASFEHRKVERDAYTRYSRNSMAVADRPSSSNFEGLNPNPAELLRRLDELKDQIIMSCDVRAPANQYYGRPTYNVPMQPSTKSQQLSHGSHYQ
Subjt:  RIGAARSRASFEHRKVERDAYTRYSRNSMAVADRPSSSNFEGLNPNPAELLRRLDELKDQIIMSCDVRAPANQYYGRPTYNVPMQPSTKSQQLSHGSHYQ

Query:  RNSEEFLHPKEPIKMSAYYNENAIPIGLEASDLRRAGRFPHSRQSSEFSSGTDGYGLVQPRKAPLLQRNGNSCDAIAGGAPFIVCVSCLELLKLPRKLYK
        RNSEEFLHPKEPIKMSAYYNENAIPIGLEASDLRRAGRFPHSRQSSEFSSGTDGYGLVQPRKAPLLQRNGNSCDAIAGGAPFIVCVSCLELLKLPRKLYK
Subjt:  RNSEEFLHPKEPIKMSAYYNENAIPIGLEASDLRRAGRFPHSRQSSEFSSGTDGYGLVQPRKAPLLQRNGNSCDAIAGGAPFIVCVSCLELLKLPRKLYK

Query:  LQMDWQKLQCGACSVVIVVRVENGRLVVSVPSESKLKEVSLDDGSPKRAVNATNSLESSDDSGHKFISTDHNKQEQTSLKTTPAIKCEPSLLNDSADLPS
        LQMDWQKLQCGACSVVIVVRVENGRLVVSVPSESKLKEVSLDDGSPKRAVNATNSLESSDDSGHKFISTDHNKQEQTSLKTTPAIKCEPSLLNDSADLPS
Subjt:  LQMDWQKLQCGACSVVIVVRVENGRLVVSVPSESKLKEVSLDDGSPKRAVNATNSLESSDDSGHKFISTDHNKQEQTSLKTTPAIKCEPSLLNDSADLPS

Query:  KDVSKENSDSTSYQKASNYREGGDENKQNTVIDGNAEPIELDVSFEDYSNIHVSQDFVETSKEEVEDQSKIKNSQESDTFFVGLSR--------------
        KDVSKENSDSTSYQKASNYREGGDENKQNTVIDGNAEPIELDVSFEDYSNIHVSQDFVETSKEEVEDQSKIKNSQESDTFFVGLSR              
Subjt:  KDVSKENSDSTSYQKASNYREGGDENKQNTVIDGNAEPIELDVSFEDYSNIHVSQDFVETSKEEVEDQSKIKNSQESDTFFVGLSR--------------

Query:  ----------------------------------YDYQAGFWGVMGHPCLGIIPPFIDEFTYPLSRNCAAGNTEIFVNGRELHKRDLELLSSRGLPTTPS
                                          YDYQAGFWGVMGHPCLGIIPPFIDEFTYPLSRNCAAGNTEIFVNGRELHKRDLELLSSRGLPTTPS
Subjt:  ----------------------------------YDYQAGFWGVMGHPCLGIIPPFIDEFTYPLSRNCAAGNTEIFVNGRELHKRDLELLSSRGLPTTPS

Query:  KLYRIDISGRVVDEDTGKVLHNLGKLAPT
        KLYRIDISGRVVDEDTGKVLHNLGKLAPT
Subjt:  KLYRIDISGRVVDEDTGKVLHNLGKLAPT

A0A6J1INE2 uncharacterized protein LOC111479066 isoform X1; e value 0.0e+00; identity 88.27%
Query:  MSGDQKVRVVRCPKCENLLPEPSEPPVYQCGGCGAVLRAKSKVPLNEKNDCMSSENYESLSEQGSSLGAASDTEWGSPSSKRTVFSNSPIRTNDREDIND
        MSGDQKVRVVRCPKCENLLPEPSE PVYQCGGCGAVLRAKSKVPLNEKND MSSENYESLSEQGSSLGAASDTEWGSPSSKRTVFSNSPIRTNDRE I D
Subjt:  MSGDQKVRVVRCPKCENLLPEPSEPPVYQCGGCGAVLRAKSKVPLNEKNDCMSSENYESLSEQGSSLGAASDTEWGSPSSKRTVFSNSPIRTNDREDIND

Query:  YEMKVGKETNGVWPIQRFGDQYIKNWVGRCNLEQDVNVYDLDYPSTAPYRTRIGAARSRASFEHRKVERDAYTRYSRNSMAVADRPSSSNFEGLNPNPAE
        YEMKVGKETNGVWPIQRFGDQYIKNWVGRCNLEQDVNVYD DYPSTAPYRTRIGAARSRASFEHRKVERDAYTRYSR+SMAVADRPSSSNFEGLNPNPAE
Subjt:  YEMKVGKETNGVWPIQRFGDQYIKNWVGRCNLEQDVNVYDLDYPSTAPYRTRIGAARSRASFEHRKVERDAYTRYSRNSMAVADRPSSSNFEGLNPNPAE

Query:  LLRRLDELKDQIIMSCDVRAPANQYYGRPTYNVPMQPSTKSQQLSHGSHYQRNSEEFLHPKEPIKMSAYYNENAIPIGLEASDLRRAGRFPHSRQSSEFS
        LLRRLDELKDQIIMSCDVRAPANQYYGRPTYNVPMQPSTKS QL HGSHYQRNSEEFLHPKEPIKMSAYYNENAIPIGLEASDLRRAG FPHSRQSSE S
Subjt:  LLRRLDELKDQIIMSCDVRAPANQYYGRPTYNVPMQPSTKSQQLSHGSHYQRNSEEFLHPKEPIKMSAYYNENAIPIGLEASDLRRAGRFPHSRQSSEFS

Query:  SGTDGYGLVQPRKAPLLQRNGNSCDAIAGGAPFIVCVSCLELLKLPRKLYKLQMDWQKLQCGACSVVIVVRVENGRLVVSVPSESKLKEVSLDDGSPKRA
        S TDGY L+QPRKAPLLQRNGNSCDAIAGGAPFIVCVSCLELLKLPRKLYKLQMDWQKLQCGACSVV+VVRVEN RLVVSVPSE+KLKEVS DDGSPKRA
Subjt:  SGTDGYGLVQPRKAPLLQRNGNSCDAIAGGAPFIVCVSCLELLKLPRKLYKLQMDWQKLQCGACSVVIVVRVENGRLVVSVPSESKLKEVSLDDGSPKRA

Query:  VNATNSLESSDDSGHKFISTDHNKQEQTSLKTTPAIKCEPSLLNDSADLPSKDVSKENSDSTSYQKASNYREGGDENKQNTVIDGNAEPIELDVSFEDYS
        VNATNSLESSDDS HK ISTDHNKQEQTSLKTTPAIKCEPSLL+DSADLPSKDVSKENSDSTSYQ+AS +REG DENKQNTVID +AEPIELDVSFEDYS
Subjt:  VNATNSLESSDDSGHKFISTDHNKQEQTSLKTTPAIKCEPSLLNDSADLPSKDVSKENSDSTSYQKASNYREGGDENKQNTVIDGNAEPIELDVSFEDYS

Query:  NIHVSQDFVETSKEEVEDQSKIKNSQESDTFFVGLSR------------------------------------------------YDYQAGFWGVMGHPC
        NIHVSQDFVET KEEVEDQSKIKNSQES+TFFVGLS+                                                YDYQAGFWGVMGHPC
Subjt:  NIHVSQDFVETSKEEVEDQSKIKNSQESDTFFVGLSR------------------------------------------------YDYQAGFWGVMGHPC

Query:  LGIIPPFIDEFTYPLSRNCAAGNTEIFVNGRELHKRDLELLSSRGLPTTPSKLYRIDISGRVVDEDTGKVLHNLGKLAPTSA
        LGIIPPFIDEFTYPLSRNCAAGNTEIFVNGRELHKRDLELLSSRGLPTTP+K YRIDISGRVVDEDTGKVLHNLGKLAPT A
Subjt:  LGIIPPFIDEFTYPLSRNCAAGNTEIFVNGRELHKRDLELLSSRGLPTTPSKLYRIDISGRVVDEDTGKVLHNLGKLAPTSA

A0A6J1ISW1 uncharacterized protein LOC111479066 isoform X2; e value 0.0e+00; identity 88.38%
Query:  MSGDQKVRVVRCPKCENLLPEPSEPPVYQCGGCGAVLRAKSKVPLNEKNDCMSSENYESLSEQGSSLGAASDTEWGSPSSKRTVFSNSPIRTNDREDIND
        MSGDQKVRVVRCPKCENLLPEPSE PVYQCGGCGAVLRAKSKVPLNEKND MSSENYESLSEQGSSLGAASDTEWGSPSSKRTVFSNSPIRTNDRE I D
Subjt:  MSGDQKVRVVRCPKCENLLPEPSEPPVYQCGGCGAVLRAKSKVPLNEKNDCMSSENYESLSEQGSSLGAASDTEWGSPSSKRTVFSNSPIRTNDREDIND

Query:  YEMKVGKETNGVWPIQRFGDQYIKNWVGRCNLEQDVNVYDLDYPSTAPYRTRIGAARSRASFEHRKVERDAYTRYSRNSMAVADRPSSSNFEGLNPNPAE
        YEMKVGKETNGVWPIQRFGDQYIKNWVGRCNLEQDVNVYD DYPSTAPYRTRIGAARSRASFEHRKVERDAYTRYSR+SMAVADRPSSSNFEGLNPNPAE
Subjt:  YEMKVGKETNGVWPIQRFGDQYIKNWVGRCNLEQDVNVYDLDYPSTAPYRTRIGAARSRASFEHRKVERDAYTRYSRNSMAVADRPSSSNFEGLNPNPAE

Query:  LLRRLDELKDQIIMSCDVRAPANQYYGRPTYNVPMQPSTKSQQLSHGSHYQRNSEEFLHPKEPIKMSAYYNENAIPIGLEASDLRRAGRFPHSRQSSEFS
        LLRRLDELKDQIIMSCDVRAPANQYYGRPTYNVPMQPSTKS QL HGSHYQRNSEEFLHPKEPIKMSAYYNENAIPIGLEASDLRRAG FPHSRQSSE S
Subjt:  LLRRLDELKDQIIMSCDVRAPANQYYGRPTYNVPMQPSTKSQQLSHGSHYQRNSEEFLHPKEPIKMSAYYNENAIPIGLEASDLRRAGRFPHSRQSSEFS

Query:  SGTDGYGLVQPRKAPLLQRNGNSCDAIAGGAPFIVCVSCLELLKLPRKLYKLQMDWQKLQCGACSVVIVVRVENGRLVVSVPSESKLKEVSLDDGSPKRA
        S TDGY L+QPRKAPLLQRNGNSCDAIAGGAPFIVCVSCLELLKLPRKLYKLQMDWQKLQCGACSVV+VVRVEN RLVVSVPSE+KLKEVS DDGSPKRA
Subjt:  SGTDGYGLVQPRKAPLLQRNGNSCDAIAGGAPFIVCVSCLELLKLPRKLYKLQMDWQKLQCGACSVVIVVRVENGRLVVSVPSESKLKEVSLDDGSPKRA

Query:  VNATNSLESSDDSGHKFISTDHNKQEQTSLKTTPAIKCEPSLLNDSADLPSKDVSKENSDSTSYQKASNYREGGDENKQNTVIDGNAEPIELDVSFEDYS
        VNATNSLESSDDS HK ISTDHNKQEQTSLKTTPAIKCEPSLL+DSADLPSKDVSKENSDSTSYQ+AS +REG DENKQNTVID +AEPIELDVSFEDYS
Subjt:  VNATNSLESSDDSGHKFISTDHNKQEQTSLKTTPAIKCEPSLLNDSADLPSKDVSKENSDSTSYQKASNYREGGDENKQNTVIDGNAEPIELDVSFEDYS

Query:  NIHVSQDFVETSKEEVEDQSKIKNSQESDTFFVGLSR------------------------------------------------YDYQAGFWGVMGHPC
        NIHVSQDFVET KEEVEDQSKIKNSQES+TFFVGLS+                                                YDYQAGFWGVMGHPC
Subjt:  NIHVSQDFVETSKEEVEDQSKIKNSQESDTFFVGLSR------------------------------------------------YDYQAGFWGVMGHPC

Query:  LGIIPPFIDEFTYPLSRNCAAGNTEIFVNGRELHKRDLELLSSRGLPTTPSKLYRIDISGRVVDEDTGKVLHNLGKLAPT
        LGIIPPFIDEFTYPLSRNCAAGNTEIFVNGRELHKRDLELLSSRGLPTTP+K YRIDISGRVVDEDTGKVLHNLGKLAPT
Subjt:  LGIIPPFIDEFTYPLSRNCAAGNTEIFVNGRELHKRDLELLSSRGLPTTPSKLYRIDISGRVVDEDTGKVLHNLGKLAPT

SwissProt top hits (description; e value; % identity; alignment follows each hit)
Q9SUP7 High mobility group B protein 6; e value 5.4e-140; identity 62.5%
Query:  LVTGEPVGRTKKPRNSRKALKDKNSSPEESQSMVTKVTQPSEEENLSQNQPKPKAAQKKQPAKQSFDKDLQEMQDMLQQLRLDKEKTEELLKEKDEMLKQ
        + T      TKKPRNSRKALK KN                     L +  P P + + K  + +SF++DL EMQ ML++++++K+KTEELLKEKDE+L++
Subjt:  LVTGEPVGRTKKPRNSRKALKDKNSSPEESQSMVTKVTQPSEEENLSQNQPKPKAAQKKQPAKQSFDKDLQEMQDMLQQLRLDKEKTEELLKEKDEMLKQ

Query:  KDEELKTRDKEQEKLQIELKKLQKLKEFKPNMNFPMIQ-ILKDKEQE---KKEKKKCSEKKRPSPPYILWCKDQWNEIKKENPEAEFKEISNILGAKWKS
        K+EEL+TRD EQEKL++ELKKLQK+KEFKPNM F   Q  L   EQE   KK+KK C E KRPS  Y+LWCKDQW E+KKENPEA+FKE SNILGAKWKS
Subjt:  KDEELKTRDKEQEKLQIELKKLQKLKEFKPNMNFPMIQ-ILKDKEQE---KKEKKKCSEKKRPSPPYILWCKDQWNEIKKENPEAEFKEISNILGAKWKS

Query:  VTAEEKKPYEERYQAEKEAYLQITSKEKRETEAMKLLEEEQKQKTAMELLDQYLQFKEEAEKDNKKK-KKERDPLKPKQPMSAFFLFSNERRGSLFAENK
        ++AE+KKPYEERYQ EKEAYLQ+ +KEKRE EAMKLLE++QKQ+TAMELLDQYL F +EAE+DNKKK KKE+DPLKPK P+SAF +++NERR +L  ENK
Subjt:  VTAEEKKPYEERYQAEKEAYLQITSKEKRETEAMKLLEEEQKQKTAMELLDQYLQFKEEAEKDNKKK-KKERDPLKPKQPMSAFFLFSNERRGSLFAENK

Query:  NVLEVAKITGGEWKNMTEEQRGPYEEMAKKKKEKYMQEMETYKQKKEEEAANLKKEEEEQMKLQKHEALLLLKKKEKTETIIKKTKEERQKKKKEGKKNV
        +V+EVAKITG EWKN++++++ PYE++AKK KE Y+Q ME YK+ KEEEA + KKEEEE +KL K EAL +LKKKEKT+ +IKK K  ++KK     +NV
Subjt:  NVLEVAKITGGEWKNMTEEQRGPYEEMAKKKKEKYMQEMETYKQKKEEEAANLKKEEEEQMKLQKHEALLLLKKKEKTETIIKKTKEERQKKKKEGKKNV

Query:  DPNKPKKPASSYILFSKEARKVVMEEKPGVNNSTVNALISVKWKELSEGERKIWNDKAAEAMEAYKKEVEEYNKTVAETT
        DPNKPKKPASSY LFSK+ RK + EE+PG NN+TV ALIS+KWKELSE E++++N KAA+ MEAYKKEVE YNK  A TT
Subjt:  DPNKPKKPASSYILFSKEARKVVMEEKPGVNNSTVNALISVKWKELSEGERKIWNDKAAEAMEAYKKEVEEYNKTVAETT

Q9T012 High mobility group B protein 13; e value 5.6e-137; identity 63%
Query:  VTGEPVGRTKKPRNSRKALKDKNSSPEESQSMVTKVTQPSEEENLSQNQPKPKAAQKKQPAKQSFDKDLQEMQDMLQQLRLDKEKTEELLKEKDEMLKQK
        V+ +P    KK RNSRKALK KN   E S                            K    +SF+KDL EMQ ML++++++KEKTE+LLKEKDE+L++K
Subjt:  VTGEPVGRTKKPRNSRKALKDKNSSPEESQSMVTKVTQPSEEENLSQNQPKPKAAQKKQPAKQSFDKDLQEMQDMLQQLRLDKEKTEELLKEKDEMLKQK

Query:  DEELKTRDKEQEKLQIELKKLQKLKEFKPNMNFPMIQILKDKEQEKKEKKK---CSEKKRPSPPYILWCKDQWNEIKKENPEAEFKEISNILGAKWKSVT
               + EQEKL+ ELKKLQK+KEFKPNM F   Q L   E+EKK KKK   C+E KRPS PYILWCKD WNE+KK+NPEA+FKE SNILGAKWK ++
Subjt:  DEELKTRDKEQEKLQIELKKLQKLKEFKPNMNFPMIQILKDKEQEKKEKKK---CSEKKRPSPPYILWCKDQWNEIKKENPEAEFKEISNILGAKWKSVT

Query:  AEEKKPYEERYQAEKEAYLQITSKEKRETEAMKLLEEEQKQKTAMELLDQYLQFKEEAEKDNKKK-KKERDPLKPKQPMSAFFLFSNERRGSLFAENKNV
        AEEKKPYEE+YQA+KEAYLQ+ +KEKRE EAMKLL++EQKQKTAMELLDQYL F +EAE DNKKK KK +DPLKPKQP+SA+ +++NERR +L  ENK+V
Subjt:  AEEKKPYEERYQAEKEAYLQITSKEKRETEAMKLLEEEQKQKTAMELLDQYLQFKEEAEKDNKKK-KKERDPLKPKQPMSAFFLFSNERRGSLFAENKNV

Query:  LEVAKITGGEWKNMTEEQRGPYEEMAKKKKEKYMQEMETYKQKKEEEAANLKKEEEEQMKLQKHEALLLLKKKEKTETIIKKTKEERQKKKKEGKKNVDP
        +EVAK+ G EWKN++EE++ PY++MAKK KE Y+QEME YK+ KEEEA + KKEEEE MKL K EAL LLKKKEKT+ IIKKTKE  + KKK   +NVDP
Subjt:  LEVAKITGGEWKNMTEEQRGPYEEMAKKKKEKYMQEMETYKQKKEEEAANLKKEEEEQMKLQKHEALLLLKKKEKTETIIKKTKEERQKKKKEGKKNVDP

Query:  NKPKKPASSYILFSKEARKVVMEEKPGVNNSTVNALISVKWKELSEGERKIWNDKAAEAMEAYKKEVEEYNKT
        NKPKKP SSY LF K+ARK V+EE PG+NNSTV A IS+KW EL E E++++N KAAE MEAYKKEVEEYNKT
Subjt:  NKPKKPASSYILFSKEARKVVMEEKPGVNNSTVNALISVKWKELSEGERKIWNDKAAEAMEAYKKEVEEYNKT

Arabidopsis top hits (description; e value; % identity; alignment follows each hit)
AT2G46380.1 Protein of unknown function (DUF3133); e value 2.3e-53; identity 27.99%
Query:  KVRVVRCPKCENLLPEPSEPPVYQCGGCGAVLRAKSKVPLNEKNDCMSSENYESLSEQGSSLGAASDTEWGSPSSKRTVFSNSPIRTNDREDINDYEMKV
        + R+VRCPKC+NLL EP + P +QCGGCG VL AK+K   + + D +S ++ E+ S++      +S+ E  S +S R        +T   +  ND   K+
Subjt:  KVRVVRCPKCENLLPEPSEPPVYQCGGCGAVLRAKSKVPLNEKNDCMSSENYESLSEQGSSLGAASDTEWGSPSSKRTVFSNSPIRTNDREDINDYEMKV

Query:  GKETNGVWPIQRFGDQYIKNWVGRCNLEQ---DVNVYDLDYPST--APYRTRIGAARSRASFEHRKVE-------------RDAYTRYSRNSMAVADRPS
            +G    Q   DQ  K    RC+LE      + Y  D  ST  A  + R G  R     + + V+             + A  R   +   +A  PS
Subjt:  GKETNGVWPIQRFGDQYIKNWVGRCNLEQ---DVNVYDLDYPST--APYRTRIGAARSRASFEHRKVE-------------RDAYTRYSRNSMAVADRPS

Query:  SSNFEGLNPNPAELLRRLDELKDQIIMSCDVRAPA-NQYYGRPTYNVPMQPS---TKSQQLSHGSH-----------YQRNSEEFLHPKEPIKMSAYYNE
           +    P P  L     ++     M      PA    +G P +   +QPS      Q + +  H           + +++    H  +P   ++    
Subjt:  SSNFEGLNPNPAELLRRLDELKDQIIMSCDVRAPA-NQYYGRPTYNVPMQPS---TKSQQLSHGSH-----------YQRNSEEFLHPKEPIKMSAYYNE

Query:  NAIPIGLEASDLRRAGRFPHSRQSSEFSSGT---------------------------------DGYGLVQPRKAPLLQRNGNSCDAIAGGAPFIVCVSC
           P GL  + LR  G +PH R    F  GT                                 +    V P K             +AGGAPFI C++C
Subjt:  NAIPIGLEASDLRRAGRFPHSRQSSEFSSGT---------------------------------DGYGLVQPRKAPLLQRNGNSCDAIAGGAPFIVCVSC

Query:  LELLKLPRKLYKLQMDWQKLQCGACSVVIVVRVENGRLVVSVPSESKLKEVSLDDGSPKRAVNATNSLESSDDSGHKFISTDHNKQEQTSLKTTPAIKCE
         +LLKLP K+       Q+++CGACS VI     + +L++S    S  K  +      +    A  S +  D++ ++F + D    + ++     +   +
Subjt:  LELLKLPRKLYKLQMDWQKLQCGACSVVIVVRVENGRLVVSVPSESKLKEVSLDDGSPKRAVNATNSLESSDDSGHKFISTDHNKQEQTSLKTTPAIKCE

Query:  PSLLNDSADLPSKDVSKENSDSTSYQKAS--------------NYREGGDENKQN------------TVIDGNA---EPIELDVSFEDYS-NIHVSQD--
           + D A  PS    + +SDS++ +K +              N R+      Q+            T +  N+     + +++   DYS N  VSQD  
Subjt:  PSLLNDSADLPSKDVSKENSDSTSYQKAS--------------NYREGGDENKQN------------TVIDGNA---EPIELDVSFEDYS-NIHVSQD--

Query:  --------------FVETSKEEVEDQSK-IKNSQESDTFF---------------------VGLSRYDYQAGFWGVMGHPCLGIIPPFIDEFTYPLSRNC
                      F    K   +D  K I+N   SD                         G   YDY+AGFWGV+G  CLGI+PPFI+E  YP+  NC
Subjt:  --------------FVETSKEEVEDQSK-IKNSQESDTFF---------------------VGLSRYDYQAGFWGVMGHPCLGIIPPFIDEFTYPLSRNC

Query:  AAGNTEIFVNGRELHKRDLELLSSRGLPTTPSKLYRIDISGRVVDEDTGKVLHNLGKLAPT
        A G T +FVNGRELH++DL LL++RGLP    + Y + ISGRV+DEDTG+ L +LGKLAPT
Subjt:  AAGNTEIFVNGRELHKRDLELLSSRGLPTTPSKLYRIDISGRVVDEDTGKVLHNLGKLAPT

AT3G61670.1 Protein of unknown function (DUF3133); e value 7.0e-58; identity 27.73%
Query:  MSGDQKVRVVRCPKCENLLPEPSEPPVYQCGGCGAVLRAKSKVPLNEKNDCMSSENYESLSEQGS----------SLGAASDTEWGS-------------
        M+   KVR+VRCPKCENLL EP + P +QCGGC  VLRAK+K     + D +S ++ E  ++  S          S   +SD++  S             
Subjt:  MSGDQKVRVVRCPKCENLLPEPSEPPVYQCGGCGAVLRAKSKVPLNEKNDCMSSENYESLSEQGS----------SLGAASDTEWGS-------------

Query:  --PSSKRTVF--SNSPIRTNDREDINDYEMKVGKETNGVWPIQRFGDQYIKNWVGRCNLEQDVNVYDLDYPSTAPYRTRIGAARSRASFEHRKVERDAYT
          P SK +     N  I   D++D+   + + G++ +  W      D++ K    RC+ +  +N       ST+ +    G + S   F    +E   + 
Subjt:  --PSSKRTVF--SNSPIRTNDREDINDYEMKVGKETNGVWPIQRFGDQYIKNWVGRCNLEQDVNVYDLDYPSTAPYRTRIGAARSRASFEHRKVERDAYT

Query:  RYSRNSMAVADRPSSSNFEGLNPNPAELLRRLDELKDQIIMSCDV-------RAPAN-------------------------QYYGRPTY--------NV
        +   N     DR             A LLR+L+++K+Q++ SC+V       +AP++                          YY +P +          
Subjt:  RYSRNSMAVADRPSSSNFEGLNPNPAELLRRLDELKDQIIMSCDV-------RAPAN-------------------------QYYGRPTY--------NV

Query:  PMQPS---------------------------------------TKSQQLSHGSH-----YQRNSEEFLHPKEPIKMSAYYNENAIP----IGLEASDLR
        PM  S                                          QQ  H  H     Y     ++     P+   A YN    P    +G       
Subjt:  PMQPS---------------------------------------TKSQQLSHGSH-----YQRNSEEFLHPKEPIKMSAYYNENAIP----IGLEASDLR

Query:  RA----GRFPHSRQSSEFSSG-TDGYGLVQPRKAPLLQRNGNSCDAIAGGAPFIVCVSCLELLKLPRKLYKLQMDWQKLQCGACSVVIVVRVENGRLVVS
        R     G  PH R  S FS    D    ++P K  +L         +AGGAPFI C +C ELL+LP+K        QK++CGACS +I + V N + V+S
Subjt:  RA----GRFPHSRQSSEFSSG-TDGYGLVQPRKAPLLQRNGNSCDAIAGGAPFIVCVSCLELLKLPRKLYKLQMDWQKLQCGACSVVIVVRVENGRLVVS

Query:  VPSESKLKEVSLDDGSPKRAVNATNSLESSDDSGHKFISTD-----------HNKQEQTSLKTTPAIKCEPSLLNDSADLPSKDVSKENSDSTSYQKASN
                  S   G  + A + T+  +  D  G+ F S D              Q+   + +  A   E  L +DS  L +K +++ + +   Y   ++
Subjt:  VPSESKLKEVSLDDGSPKRAVNATNSLESSDDSGHKFISTD-----------HNKQEQTSLKTTPAIKCEPSLLNDSADLPSKDVSKENSDSTSYQKASN

Query:  YREGGDEN-----------------KQNTVIDGNAEPIELDVSFEDYS--NIHVSQD---------FVETSKEEVEDQSKIKNSQESDTFFVGLSR----
         R G                     +QN++ + +    E++V+F DYS  N  VS+D         F    K+  +D +K   + E +   V ++     
Subjt:  YREGGDEN-----------------KQNTVIDGNAEPIELDVSFEDYS--NIHVSQD---------FVETSKEEVEDQSKIKNSQESDTFFVGLSR----

Query:  --------------------YDYQAGFWGVMGHPCLGIIPPFIDEFTYPLSRNCAAGNTEIFVNGRELHKRDLELLSSRGLPTTPSKLYRIDISGRVVDE
                            YDY+AGFWGVMG P LGI+PPFI+E  YP+  NC+ G T +FVNGRELH++DL+LL+ RGLP    + Y +DI+GRV+DE
Subjt:  --------------------YDYQAGFWGVMGHPCLGIIPPFIDEFTYPLSRNCAAGNTEIFVNGRELHKRDLELLSSRGLPTTPSKLYRIDISGRVVDE

Query:  DTGKVLHNLGKLAPT
        DTG+ L  LGKLAPT
Subjt:  DTGKVLHNLGKLAPT

AT4G11080.1 HMG (high mobility group) box protein; e value: 4.0e-138; % identity: 63
Query:  VTGEPVGRTKKPRNSRKALKDKNSSPEESQSMVTKVTQPSEEENLSQNQPKPKAAQKKQPAKQSFDKDLQEMQDMLQQLRLDKEKTEELLKEKDEMLKQK
        V+ +P    KK RNSRKALK KN   E S                            K    +SF+KDL EMQ ML++++++KEKTE+LLKEKDE+L++K
Subjt:  VTGEPVGRTKKPRNSRKALKDKNSSPEESQSMVTKVTQPSEEENLSQNQPKPKAAQKKQPAKQSFDKDLQEMQDMLQQLRLDKEKTEELLKEKDEMLKQK

Query:  DEELKTRDKEQEKLQIELKKLQKLKEFKPNMNFPMIQILKDKEQEKKEKKK---CSEKKRPSPPYILWCKDQWNEIKKENPEAEFKEISNILGAKWKSVT
               + EQEKL+ ELKKLQK+KEFKPNM F   Q L   E+EKK KKK   C+E KRPS PYILWCKD WNE+KK+NPEA+FKE SNILGAKWK ++
Subjt:  DEELKTRDKEQEKLQIELKKLQKLKEFKPNMNFPMIQILKDKEQEKKEKKK---CSEKKRPSPPYILWCKDQWNEIKKENPEAEFKEISNILGAKWKSVT

Query:  AEEKKPYEERYQAEKEAYLQITSKEKRETEAMKLLEEEQKQKTAMELLDQYLQFKEEAEKDNKKK-KKERDPLKPKQPMSAFFLFSNERRGSLFAENKNV
        AEEKKPYEE+YQA+KEAYLQ+ +KEKRE EAMKLL++EQKQKTAMELLDQYL F +EAE DNKKK KK +DPLKPKQP+SA+ +++NERR +L  ENK+V
Subjt:  AEEKKPYEERYQAEKEAYLQITSKEKRETEAMKLLEEEQKQKTAMELLDQYLQFKEEAEKDNKKK-KKERDPLKPKQPMSAFFLFSNERRGSLFAENKNV

Query:  LEVAKITGGEWKNMTEEQRGPYEEMAKKKKEKYMQEMETYKQKKEEEAANLKKEEEEQMKLQKHEALLLLKKKEKTETIIKKTKEERQKKKKEGKKNVDP
        +EVAK+ G EWKN++EE++ PY++MAKK KE Y+QEME YK+ KEEEA + KKEEEE MKL K EAL LLKKKEKT+ IIKKTKE  + KKK   +NVDP
Subjt:  LEVAKITGGEWKNMTEEQRGPYEEMAKKKKEKYMQEMETYKQKKEEEAANLKKEEEEQMKLQKHEALLLLKKKEKTETIIKKTKEERQKKKKEGKKNVDP

Query:  NKPKKPASSYILFSKEARKVVMEEKPGVNNSTVNALISVKWKELSEGERKIWNDKAAEAMEAYKKEVEEYNKT
        NKPKKP SSY LF K+ARK V+EE PG+NNSTV A IS+KW EL E E++++N KAAE MEAYKKEVEEYNKT
Subjt:  NKPKKPASSYILFSKEARKVVMEEKPGVNNSTVNALISVKWKELSEGERKIWNDKAAEAMEAYKKEVEEYNKT

AT4G23800.1 HMG (high mobility group) box protein; e value: 3.9e-141; % identity: 62.5
Query:  LVTGEPVGRTKKPRNSRKALKDKNSSPEESQSMVTKVTQPSEEENLSQNQPKPKAAQKKQPAKQSFDKDLQEMQDMLQQLRLDKEKTEELLKEKDEMLKQ
        + T      TKKPRNSRKALK KN                     L +  P P + + K  + +SF++DL EMQ ML++++++K+KTEELLKEKDE+L++
Subjt:  LVTGEPVGRTKKPRNSRKALKDKNSSPEESQSMVTKVTQPSEEENLSQNQPKPKAAQKKQPAKQSFDKDLQEMQDMLQQLRLDKEKTEELLKEKDEMLKQ

Query:  KDEELKTRDKEQEKLQIELKKLQKLKEFKPNMNFPMIQ-ILKDKEQE---KKEKKKCSEKKRPSPPYILWCKDQWNEIKKENPEAEFKEISNILGAKWKS
        K+EEL+TRD EQEKL++ELKKLQK+KEFKPNM F   Q  L   EQE   KK+KK C E KRPS  Y+LWCKDQW E+KKENPEA+FKE SNILGAKWKS
Subjt:  KDEELKTRDKEQEKLQIELKKLQKLKEFKPNMNFPMIQ-ILKDKEQE---KKEKKKCSEKKRPSPPYILWCKDQWNEIKKENPEAEFKEISNILGAKWKS

Query:  VTAEEKKPYEERYQAEKEAYLQITSKEKRETEAMKLLEEEQKQKTAMELLDQYLQFKEEAEKDNKKK-KKERDPLKPKQPMSAFFLFSNERRGSLFAENK
        ++AE+KKPYEERYQ EKEAYLQ+ +KEKRE EAMKLLE++QKQ+TAMELLDQYL F +EAE+DNKKK KKE+DPLKPK P+SAF +++NERR +L  ENK
Subjt:  VTAEEKKPYEERYQAEKEAYLQITSKEKRETEAMKLLEEEQKQKTAMELLDQYLQFKEEAEKDNKKK-KKERDPLKPKQPMSAFFLFSNERRGSLFAENK

Query:  NVLEVAKITGGEWKNMTEEQRGPYEEMAKKKKEKYMQEMETYKQKKEEEAANLKKEEEEQMKLQKHEALLLLKKKEKTETIIKKTKEERQKKKKEGKKNV
        +V+EVAKITG EWKN++++++ PYE++AKK KE Y+Q ME YK+ KEEEA + KKEEEE +KL K EAL +LKKKEKT+ +IKK K  ++KK     +NV
Subjt:  NVLEVAKITGGEWKNMTEEQRGPYEEMAKKKKEKYMQEMETYKQKKEEEAANLKKEEEEQMKLQKHEALLLLKKKEKTETIIKKTKEERQKKKKEGKKNV

Query:  DPNKPKKPASSYILFSKEARKVVMEEKPGVNNSTVNALISVKWKELSEGERKIWNDKAAEAMEAYKKEVEEYNKTVAETT
        DPNKPKKPASSY LFSK+ RK + EE+PG NN+TV ALIS+KWKELSE E++++N KAA+ MEAYKKEVE YNK  A TT
Subjt:  DPNKPKKPASSYILFSKEARKVVMEEKPGVNNSTVNALISVKWKELSEGERKIWNDKAAEAMEAYKKEVEEYNKTVAETT

AT4G23800.2 HMG (high mobility group) box protein; e value: 8.6e-141; % identity: 62.29
Query:  LVTGEPVGRTKKPRNSRKALKDKNSSPEESQSMVTKVTQPSEEENLSQNQPKPKAAQKKQPAKQSFDKDLQEMQDMLQQLRLDKEKTEELLKEKDEMLKQ
        + T      TKKPRNSRKALK KN                     L +  P P + + K  + +SF++DL EMQ ML++++++K+KTEELLKEKDE+L++
Subjt:  LVTGEPVGRTKKPRNSRKALKDKNSSPEESQSMVTKVTQPSEEENLSQNQPKPKAAQKKQPAKQSFDKDLQEMQDMLQQLRLDKEKTEELLKEKDEMLKQ

Query:  KDEELKTRDKEQEKLQIELKKLQKLKEFKPNMNFPMIQ-ILKDKEQE---KKEKKKCSEKKRPSPPYILWCKDQWNEIKKENPEAEFKEISNILGAKWKS
        K+EEL+TRD EQEKL++ELKKLQK+KEFKPNM F   Q  L   EQE   KK+KK C E KRPS  Y+LWCKDQW E+KKENPEA+FKE SNILGAKWKS
Subjt:  KDEELKTRDKEQEKLQIELKKLQKLKEFKPNMNFPMIQ-ILKDKEQE---KKEKKKCSEKKRPSPPYILWCKDQWNEIKKENPEAEFKEISNILGAKWKS

Query:  VTAEEKKPYEERYQAEKEAYLQITSKEKRETEAMKLLEEEQKQKTAMELLDQYLQFKEEAEKDNKKK-KKERDPLKPKQPMSAFFLFSNERRGSLFAENK
        ++AE+KKPYEERYQ EKEAYLQ+ +KEKRE EAMKLLE++QKQ+TAMELLDQYL F +EAE+DNKKK KKE+DPLKPK P+SAF +++NERR +L  ENK
Subjt:  VTAEEKKPYEERYQAEKEAYLQITSKEKRETEAMKLLEEEQKQKTAMELLDQYLQFKEEAEKDNKKK-KKERDPLKPKQPMSAFFLFSNERRGSLFAENK

Query:  NVLEVAKITGGEWKNMTEEQRGPYEEMAKKKKEKYMQEMETYKQKKEEEAANLKKEEEEQMKLQKHEALLLLKKKEKTETIIKKTKEERQKKKKEGKKNV
        +V+EVAKITG EWKN++++++ PYE++AKK KE Y+Q ME YK+ KEEEA + KKEEEE +KL K EAL +LKKKEKT+ +IKK K E          NV
Subjt:  NVLEVAKITGGEWKNMTEEQRGPYEEMAKKKKEKYMQEMETYKQKKEEEAANLKKEEEEQMKLQKHEALLLLKKKEKTETIIKKTKEERQKKKKEGKKNV

Query:  DPNKPKKPASSYILFSKEARKVVMEEKPGVNNSTVNALISVKWKELSEGERKIWNDKAAEAMEAYKKEVEEYNKTVAETT
        DPNKPKKPASSY LFSK+ RK + EE+PG NN+TV ALIS+KWKELSE E++++N KAA+ MEAYKKEVE YNK  A TT
Subjt:  DPNKPKKPASSYILFSKEARKVVMEEKPGVNNSTVNALISVKWKELSEGERKIWNDKAAEAMEAYKKEVEEYNKTVAETT


Sequences
CDS sequence
ATGTCTGGTGACCAGAAAGTTCGAGTAGTTCGTTGTCCCAAATGCGAGAATCTCTTGCCTGAGCCCTCGGAGCCCCCTGTTTATCAGTGTGGTGGCTGTGGGGCTGTTCT
TAGAGCAAAGAGCAAAGTTCCCCTAAATGAGAAAAATGATTGTATGAGCAGTGAAAATTATGAGTCCTTATCAGAACAAGGCAGTAGTTTAGGTGCTGCTTCTGACACTG
AGTGGGGCAGTCCGAGCTCTAAAAGGACTGTTTTCAGCAACAGCCCAATTAGAACAAATGATAGAGAGGATATAAATGATTATGAGATGAAAGTTGGGAAGGAAACTAAT
GGAGTTTGGCCAATCCAGAGGTTTGGAGATCAATATATCAAGAATTGGGTTGGTCGATGTAATCTTGAACAAGATGTGAACGTTTATGATTTGGATTATCCAAGTACAGC
ACCGTATCGTACTCGTATAGGAGCAGCAAGAAGCCGGGCGAGTTTCGAGCATCGAAAAGTTGAAAGAGATGCATATACAAGGTACTCTAGGAACTCTATGGCTGTTGCTG
ACAGACCTTCAAGTTCTAACTTTGAAGGTTTGAACCCAAATCCAGCTGAGCTGCTTAGAAGGTTGGATGAGTTGAAAGACCAAATTATTATGTCTTGTGATGTGAGAGCT
CCAGCCAATCAGTACTACGGTCGGCCTACTTACAATGTTCCAATGCAGCCTTCAACAAAGAGCCAACAGCTGAGCCATGGCTCTCATTACCAGAGAAATAGTGAGGAGTT
CTTACATCCAAAAGAGCCAATCAAAATGAGTGCTTATTACAATGAGAATGCTATTCCAATTGGGCTCGAGGCGTCTGATCTGCGACGTGCTGGTCGTTTTCCACACTCGA
GACAGTCTAGTGAGTTTAGTTCAGGGACTGATGGTTACGGTCTGGTTCAACCAAGAAAGGCTCCACTTTTGCAAAGAAATGGAAATTCTTGTGATGCCATTGCAGGTGGT
GCCCCTTTCATTGTATGTGTTAGTTGCTTGGAATTGCTTAAACTGCCAAGAAAGCTTTATAAGTTGCAAATGGATTGGCAGAAACTACAATGTGGTGCTTGTTCGGTTGT
CATCGTTGTACGAGTCGAGAACGGAAGGCTTGTTGTTAGCGTTCCATCGGAATCCAAGCTCAAAGAAGTTTCTCTTGATGATGGTTCCCCCAAACGAGCTGTCAATGCCA
CCAACTCCTTAGAAAGCTCTGATGATTCTGGTCACAAGTTCATCAGTACTGACCACAACAAGCAGGAGCAAACTTCATTGAAGACCACCCCAGCTATAAAATGTGAACCA
AGCCTTCTCAACGACTCAGCTGACCTGCCTTCAAAAGATGTTTCCAAGGAGAATTCTGATAGCACTTCTTATCAGAAAGCTAGCAATTACAGAGAGGGAGGTGATGAAAA
TAAGCAGAATACTGTGATAGACGGCAACGCCGAGCCGATCGAGTTGGACGTATCGTTTGAGGATTATTCGAACATTCATGTTTCTCAAGATTTTGTGGAAACAAGCAAAG
AAGAAGTGGAAGATCAAAGCAAGATCAAAAACAGTCAAGAATCAGACACCTTTTTTGTGGGTCTCAGCAGGTATGATTATCAAGCTGGATTCTGGGGCGTAATGGGGCAT
CCATGTCTTGGCATCATTCCTCCGTTCATCGACGAGTTCACCTATCCATTGTCAAGGAACTGTGCTGCTGGAAACACTGAAATCTTTGTGAATGGCAGAGAGCTTCACAA
AAGGGATTTGGAGCTGCTTTCTAGCAGAGGGTTGCCCACTACTCCAAGCAAGCTTTATAGAATCGACATCTCGGGAAGAGTTGTGGATGAAGATACTGGGAAAGTGTTGC
ACAATCTGGGAAAACTCGCCCCAACTTCCGCCACTGCTGAAGTTCTGGTCACCGGCGAGCCCGTCGGACGGACGAAGAAACCTAGAAACAGCCGGAAGGCTCTCAAGGAC
AAAAACTCGTCACCGGAGGAATCTCAATCTATGGTCACGAAGGTAACGCAGCCGTCGGAAGAGGAGAACCTCTCTCAGAATCAACCGAAGCCGAAAGCTGCGCAGAAGAA
GCAGCCGGCGAAGCAGTCCTTCGATAAAGATTTGCAGGAAATGCAGGACATGCTACAACAATTGAGGCTCGATAAGGAGAAGACTGAGGAGCTTTTGAAAGAAAAGGATG
AGATGCTTAAGCAGAAGGATGAAGAGCTTAAAACGAGGGATAAAGAACAGGAGAAGCTCCAGATCGAATTGAAGAAGCTGCAGAAATTGAAGGAATTCAAACCTAATATG
AACTTCCCTATGATTCAAATTTTGAAAGACAAGGAGCAAGAGAAGAAAGAGAAGAAGAAGTGCTCAGAAAAGAAGCGGCCCTCTCCACCTTACATATTGTGGTGCAAAGA
TCAGTGGAATGAGATCAAGAAGGAGAATCCAGAGGCGGAGTTCAAAGAAATCTCTAACATTTTGGGGGCAAAATGGAAGAGTGTTACTGCAGAGGAGAAGAAGCCATATG
AGGAAAGGTATCAGGCAGAGAAAGAAGCCTATTTACAAATCACTTCTAAAGAGAAGCGTGAGACTGAGGCGATGAAGCTGTTAGAAGAGGAGCAGAAGCAGAAGACAGCC
ATGGAGTTGCTTGATCAATACCTCCAATTCAAAGAGGAAGCAGAGAAGGATAACAAGAAGAAGAAGAAAGAGAGAGATCCACTGAAGCCCAAGCAACCGATGTCGGCGTT
TTTCCTCTTCTCAAATGAGAGGCGTGGATCCCTTTTTGCTGAGAACAAGAATGTCCTAGAGGTAGCTAAGATAACAGGAGGGGAGTGGAAGAACATGACAGAGGAGCAAA
GAGGTCCCTATGAAGAGATGGCGAAGAAGAAAAAGGAGAAATACATGCAGGAGATGGAAACTTACAAGCAGAAAAAGGAGGAAGAAGCAGCAAACCTCAAGAAGGAAGAG
GAAGAGCAAATGAAGCTTCAGAAACATGAAGCTTTGCTACTGCTAAAGAAGAAAGAGAAAACTGAGACAATTATAAAGAAAACAAAGGAGGAACGCCAGAAGAAGAAGAA
GGAAGGGAAGAAAAACGTTGATCCTAACAAGCCTAAGAAGCCTGCATCCTCTTACATCTTGTTCAGCAAAGAAGCAAGGAAAGTTGTAATGGAGGAGAAGCCTGGAGTGA
ACAACTCCACAGTCAATGCCCTGATTTCAGTGAAATGGAAGGAACTAAGTGAAGGGGAGAGAAAAATATGGAATGACAAAGCTGCAGAAGCCATGGAAGCTTACAAAAAG
GAAGTGGAGGAATACAACAAAACTGTTGCTGAAACAACAAAGGGTGAGGAGGAGGAGAAAGCCTGA
mRNA sequence
AAAAGTCCATTCTGTTCCTCTTTTAAGCTACTGGGAATTTGCATTTGTATTTGTATTTGTATTTGTATGAACTTCTAAGTTCTTTGTGTTGAAATTTATTGTGGGTTTGT
GATTTTGGAAATTGTTCTCGAGTTTGTTAATGGGCTGAAGAGTTTTTTTTCTTTTTATTCATTGATTTCCGGATTTGGGATTGTTTGATTGTTTCTTCTCTTTAATGGGT
TTGAAGAATTTCTAAGTTTGATCTTCTCTTGGGTTCTGAATTATGTCTGGTGACCAGAAAGTTCGAGTAGTTCGTTGTCCCAAATGCGAGAATCTCTTGCCTGAGCCCTC
GGAGCCCCCTGTTTATCAGTGTGGTGGCTGTGGGGCTGTTCTTAGAGCAAAGAGCAAAGTTCCCCTAAATGAGAAAAATGATTGTATGAGCAGTGAAAATTATGAGTCCT
TATCAGAACAAGGCAGTAGTTTAGGTGCTGCTTCTGACACTGAGTGGGGCAGTCCGAGCTCTAAAAGGACTGTTTTCAGCAACAGCCCAATTAGAACAAATGATAGAGAG
GATATAAATGATTATGAGATGAAAGTTGGGAAGGAAACTAATGGAGTTTGGCCAATCCAGAGGTTTGGAGATCAATATATCAAGAATTGGGTTGGTCGATGTAATCTTGA
ACAAGATGTGAACGTTTATGATTTGGATTATCCAAGTACAGCACCGTATCGTACTCGTATAGGAGCAGCAAGAAGCCGGGCGAGTTTCGAGCATCGAAAAGTTGAAAGAG
ATGCATATACAAGGTACTCTAGGAACTCTATGGCTGTTGCTGACAGACCTTCAAGTTCTAACTTTGAAGGTTTGAACCCAAATCCAGCTGAGCTGCTTAGAAGGTTGGAT
GAGTTGAAAGACCAAATTATTATGTCTTGTGATGTGAGAGCTCCAGCCAATCAGTACTACGGTCGGCCTACTTACAATGTTCCAATGCAGCCTTCAACAAAGAGCCAACA
GCTGAGCCATGGCTCTCATTACCAGAGAAATAGTGAGGAGTTCTTACATCCAAAAGAGCCAATCAAAATGAGTGCTTATTACAATGAGAATGCTATTCCAATTGGGCTCG
AGGCGTCTGATCTGCGACGTGCTGGTCGTTTTCCACACTCGAGACAGTCTAGTGAGTTTAGTTCAGGGACTGATGGTTACGGTCTGGTTCAACCAAGAAAGGCTCCACTT
TTGCAAAGAAATGGAAATTCTTGTGATGCCATTGCAGGTGGTGCCCCTTTCATTGTATGTGTTAGTTGCTTGGAATTGCTTAAACTGCCAAGAAAGCTTTATAAGTTGCA
AATGGATTGGCAGAAACTACAATGTGGTGCTTGTTCGGTTGTCATCGTTGTACGAGTCGAGAACGGAAGGCTTGTTGTTAGCGTTCCATCGGAATCCAAGCTCAAAGAAG
TTTCTCTTGATGATGGTTCCCCCAAACGAGCTGTCAATGCCACCAACTCCTTAGAAAGCTCTGATGATTCTGGTCACAAGTTCATCAGTACTGACCACAACAAGCAGGAG
CAAACTTCATTGAAGACCACCCCAGCTATAAAATGTGAACCAAGCCTTCTCAACGACTCAGCTGACCTGCCTTCAAAAGATGTTTCCAAGGAGAATTCTGATAGCACTTC
TTATCAGAAAGCTAGCAATTACAGAGAGGGAGGTGATGAAAATAAGCAGAATACTGTGATAGACGGCAACGCCGAGCCGATCGAGTTGGACGTATCGTTTGAGGATTATT
CGAACATTCATGTTTCTCAAGATTTTGTGGAAACAAGCAAAGAAGAAGTGGAAGATCAAAGCAAGATCAAAAACAGTCAAGAATCAGACACCTTTTTTGTGGGTCTCAGC
AGGTATGATTATCAAGCTGGATTCTGGGGCGTAATGGGGCATCCATGTCTTGGCATCATTCCTCCGTTCATCGACGAGTTCACCTATCCATTGTCAAGGAACTGTGCTGC
TGGAAACACTGAAATCTTTGTGAATGGCAGAGAGCTTCACAAAAGGGATTTGGAGCTGCTTTCTAGCAGAGGGTTGCCCACTACTCCAAGCAAGCTTTATAGAATCGACA
TCTCGGGAAGAGTTGTGGATGAAGATACTGGGAAAGTGTTGCACAATCTGGGAAAACTCGCCCCAACTTCCGCCACTGCTGAAGTTCTGGTCACCGGCGAGCCCGTCGGA
CGGACGAAGAAACCTAGAAACAGCCGGAAGGCTCTCAAGGACAAAAACTCGTCACCGGAGGAATCTCAATCTATGGTCACGAAGGTAACGCAGCCGTCGGAAGAGGAGAA
CCTCTCTCAGAATCAACCGAAGCCGAAAGCTGCGCAGAAGAAGCAGCCGGCGAAGCAGTCCTTCGATAAAGATTTGCAGGAAATGCAGGACATGCTACAACAATTGAGGC
TCGATAAGGAGAAGACTGAGGAGCTTTTGAAAGAAAAGGATGAGATGCTTAAGCAGAAGGATGAAGAGCTTAAAACGAGGGATAAAGAACAGGAGAAGCTCCAGATCGAA
TTGAAGAAGCTGCAGAAATTGAAGGAATTCAAACCTAATATGAACTTCCCTATGATTCAAATTTTGAAAGACAAGGAGCAAGAGAAGAAAGAGAAGAAGAAGTGCTCAGA
AAAGAAGCGGCCCTCTCCACCTTACATATTGTGGTGCAAAGATCAGTGGAATGAGATCAAGAAGGAGAATCCAGAGGCGGAGTTCAAAGAAATCTCTAACATTTTGGGGG
CAAAATGGAAGAGTGTTACTGCAGAGGAGAAGAAGCCATATGAGGAAAGGTATCAGGCAGAGAAAGAAGCCTATTTACAAATCACTTCTAAAGAGAAGCGTGAGACTGAG
GCGATGAAGCTGTTAGAAGAGGAGCAGAAGCAGAAGACAGCCATGGAGTTGCTTGATCAATACCTCCAATTCAAAGAGGAAGCAGAGAAGGATAACAAGAAGAAGAAGAA
AGAGAGAGATCCACTGAAGCCCAAGCAACCGATGTCGGCGTTTTTCCTCTTCTCAAATGAGAGGCGTGGATCCCTTTTTGCTGAGAACAAGAATGTCCTAGAGGTAGCTA
AGATAACAGGAGGGGAGTGGAAGAACATGACAGAGGAGCAAAGAGGTCCCTATGAAGAGATGGCGAAGAAGAAAAAGGAGAAATACATGCAGGAGATGGAAACTTACAAG
CAGAAAAAGGAGGAAGAAGCAGCAAACCTCAAGAAGGAAGAGGAAGAGCAAATGAAGCTTCAGAAACATGAAGCTTTGCTACTGCTAAAGAAGAAAGAGAAAACTGAGAC
AATTATAAAGAAAACAAAGGAGGAACGCCAGAAGAAGAAGAAGGAAGGGAAGAAAAACGTTGATCCTAACAAGCCTAAGAAGCCTGCATCCTCTTACATCTTGTTCAGCA
AAGAAGCAAGGAAAGTTGTAATGGAGGAGAAGCCTGGAGTGAACAACTCCACAGTCAATGCCCTGATTTCAGTGAAATGGAAGGAACTAAGTGAAGGGGAGAGAAAAATA
TGGAATGACAAAGCTGCAGAAGCCATGGAAGCTTACAAAAAGGAAGTGGAGGAATACAACAAAACTGTTGCTGAAACAACAAAGGGTGAGGAGGAGGAGAAAGCCTGA
Protein sequence
MSGDQKVRVVRCPKCENLLPEPSEPPVYQCGGCGAVLRAKSKVPLNEKNDCMSSENYESLSEQGSSLGAASDTEWGSPSSKRTVFSNSPIRTNDREDINDYEMKVGKETN
GVWPIQRFGDQYIKNWVGRCNLEQDVNVYDLDYPSTAPYRTRIGAARSRASFEHRKVERDAYTRYSRNSMAVADRPSSSNFEGLNPNPAELLRRLDELKDQIIMSCDVRA
PANQYYGRPTYNVPMQPSTKSQQLSHGSHYQRNSEEFLHPKEPIKMSAYYNENAIPIGLEASDLRRAGRFPHSRQSSEFSSGTDGYGLVQPRKAPLLQRNGNSCDAIAGG
APFIVCVSCLELLKLPRKLYKLQMDWQKLQCGACSVVIVVRVENGRLVVSVPSESKLKEVSLDDGSPKRAVNATNSLESSDDSGHKFISTDHNKQEQTSLKTTPAIKCEP
SLLNDSADLPSKDVSKENSDSTSYQKASNYREGGDENKQNTVIDGNAEPIELDVSFEDYSNIHVSQDFVETSKEEVEDQSKIKNSQESDTFFVGLSRYDYQAGFWGVMGH
PCLGIIPPFIDEFTYPLSRNCAAGNTEIFVNGRELHKRDLELLSSRGLPTTPSKLYRIDISGRVVDEDTGKVLHNLGKLAPTSATAEVLVTGEPVGRTKKPRNSRKALKD
KNSSPEESQSMVTKVTQPSEEENLSQNQPKPKAAQKKQPAKQSFDKDLQEMQDMLQQLRLDKEKTEELLKEKDEMLKQKDEELKTRDKEQEKLQIELKKLQKLKEFKPNM
NFPMIQILKDKEQEKKEKKKCSEKKRPSPPYILWCKDQWNEIKKENPEAEFKEISNILGAKWKSVTAEEKKPYEERYQAEKEAYLQITSKEKRETEAMKLLEEEQKQKTA
MELLDQYLQFKEEAEKDNKKKKKERDPLKPKQPMSAFFLFSNERRGSLFAENKNVLEVAKITGGEWKNMTEEQRGPYEEMAKKKKEKYMQEMETYKQKKEEEAANLKKEE
EEQMKLQKHEALLLLKKKEKTETIIKKTKEERQKKKKEGKKNVDPNKPKKPASSYILFSKEARKVVMEEKPGVNNSTVNALISVKWKELSEGERKIWNDKAAEAMEAYKK
EVEEYNKTVAETTKGEEEEKA