; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0007884 (gene) of Snake gourd v1 genome

Gene IDTan0007884
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionUnknown protein
Genome locationLG10:975486..985602
RNA-Seq ExpressionTan0007884
SyntenyTan0007884
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022946534.1 uncharacterized protein LOC111450568 isoform X1 [Cucurbita moschata]9.4e-20881.08Show/hide
Query:  MGPEIGSKVKEEAL-----MEDKNAVQDKQSASSGQEKIHDMEAPSVERTMMLDRSEDMELDIIGCTDNCEGGPSSECNVSTENSSSFGDTVSGTDYGLL
        M PE GSKV+E  +     +EDKNA QDK S SSGQEK H MEAPSVER+MM DR EDME+DIIGCTDNCEGGPSSECN STENSSSFGDTVSGTDYG L
Subjt:  MGPEIGSKVKEEAL-----MEDKNAVQDKQSASSGQEKIHDMEAPSVERTMMLDRSEDMELDIIGCTDNCEGGPSSECNVSTENSSSFGDTVSGTDYGLL

Query:  LDDEEVESQLYGDNNLQPMSNGYREVFPRKKKLTDHWRKFISPVMWRCRWLELQIKKLQSQSLKYDRELALYDQRKQSVYEHFSMEDFDVKSTGFSIHTQ
        LDDEEVESQLY DNNLQP+SNG REVFPRKKKLTDHW+KFISPV WRCRWLEL+I+KLQSQS KYDRELALYDQRKQSVYE FS EDFDVKSTGFS HTQ
Subjt:  LDDEEVESQLYGDNNLQPMSNGYREVFPRKKKLTDHWRKFISPVMWRCRWLELQIKKLQSQSLKYDRELALYDQRKQSVYEHFSMEDFDVKSTGFSIHTQ

Query:  RHRVMKRKRRKKTEETTEAASYMGHHNVFSYYEKKRPIADDITMEDTSLKLDKTRNMKHDAINDFGPIATDGWPSSMLGDNDNNLEEIFLKIEAAQSKVH
        R R+MKRK+RKK EETTE ASYM HHN+FSYYEKKR +ADDI+ E TSLKL+KT+NM+HD INDF  IATDGWPSSMLGDNDNNLEE+FLKIEAAQS+VH
Subjt:  RHRVMKRKRRKKTEETTEAASYMGHHNVFSYYEKKRPIADDITMEDTSLKLDKTRNMKHDAINDFGPIATDGWPSSMLGDNDNNLEEIFLKIEAAQSKVH

Query:  ELKNRIDKVVNENPMKFSSISQLYLLASSDDPASPEDGNDVFVRSLHEASQHMSEHALDVLMPESATRSHGEVMLLPDMIQSVDRG-STEKVLMQDSAVK
        ELKNRIDKVVNENPMKFSSI+Q+Y+LASSDDPASPEDGNDVFVRSLHEASQHMSE A DVLMPE+A  SHGEVMLLPDMIQ+ D G ST+KVL+QDSA K
Subjt:  ELKNRIDKVVNENPMKFSSISQLYLLASSDDPASPEDGNDVFVRSLHEASQHMSEHALDVLMPESATRSHGEVMLLPDMIQSVDRG-STEKVLMQDSAVK

Query:  EEVQIPEEVNTQFIEQTPKLEEQIISPAAASQADLASEDKEPDMQHKTKPPSAVKPSSSKRTRKRGRRKISSSKQKRKATG
        EE QI EEVN QFIEQT KLEEQI SP     ADLAS  +EPDMQHKT+ PSA KPSSSK+TRKRGRRK    KQKRK TG
Subjt:  EEVQIPEEVNTQFIEQTPKLEEQIISPAAASQADLASEDKEPDMQHKTKPPSAVKPSSSKRTRKRGRRKISSSKQKRKATG

XP_022946550.1 uncharacterized protein LOC111450568 isoform X3 [Cucurbita moschata]9.4e-20881.08Show/hide
Query:  MGPEIGSKVKEEAL-----MEDKNAVQDKQSASSGQEKIHDMEAPSVERTMMLDRSEDMELDIIGCTDNCEGGPSSECNVSTENSSSFGDTVSGTDYGLL
        M PE GSKV+E  +     +EDKNA QDK S SSGQEK H MEAPSVER+MM DR EDME+DIIGCTDNCEGGPSSECN STENSSSFGDTVSGTDYG L
Subjt:  MGPEIGSKVKEEAL-----MEDKNAVQDKQSASSGQEKIHDMEAPSVERTMMLDRSEDMELDIIGCTDNCEGGPSSECNVSTENSSSFGDTVSGTDYGLL

Query:  LDDEEVESQLYGDNNLQPMSNGYREVFPRKKKLTDHWRKFISPVMWRCRWLELQIKKLQSQSLKYDRELALYDQRKQSVYEHFSMEDFDVKSTGFSIHTQ
        LDDEEVESQLY DNNLQP+SNG REVFPRKKKLTDHW+KFISPV WRCRWLEL+I+KLQSQS KYDRELALYDQRKQSVYE FS EDFDVKSTGFS HTQ
Subjt:  LDDEEVESQLYGDNNLQPMSNGYREVFPRKKKLTDHWRKFISPVMWRCRWLELQIKKLQSQSLKYDRELALYDQRKQSVYEHFSMEDFDVKSTGFSIHTQ

Query:  RHRVMKRKRRKKTEETTEAASYMGHHNVFSYYEKKRPIADDITMEDTSLKLDKTRNMKHDAINDFGPIATDGWPSSMLGDNDNNLEEIFLKIEAAQSKVH
        R R+MKRK+RKK EETTE ASYM HHN+FSYYEKKR +ADDI+ E TSLKL+KT+NM+HD INDF  IATDGWPSSMLGDNDNNLEE+FLKIEAAQS+VH
Subjt:  RHRVMKRKRRKKTEETTEAASYMGHHNVFSYYEKKRPIADDITMEDTSLKLDKTRNMKHDAINDFGPIATDGWPSSMLGDNDNNLEEIFLKIEAAQSKVH

Query:  ELKNRIDKVVNENPMKFSSISQLYLLASSDDPASPEDGNDVFVRSLHEASQHMSEHALDVLMPESATRSHGEVMLLPDMIQSVDRG-STEKVLMQDSAVK
        ELKNRIDKVVNENPMKFSSI+Q+Y+LASSDDPASPEDGNDVFVRSLHEASQHMSE A DVLMPE+A  SHGEVMLLPDMIQ+ D G ST+KVL+QDSA K
Subjt:  ELKNRIDKVVNENPMKFSSISQLYLLASSDDPASPEDGNDVFVRSLHEASQHMSEHALDVLMPESATRSHGEVMLLPDMIQSVDRG-STEKVLMQDSAVK

Query:  EEVQIPEEVNTQFIEQTPKLEEQIISPAAASQADLASEDKEPDMQHKTKPPSAVKPSSSKRTRKRGRRKISSSKQKRKATG
        EE QI EEVN QFIEQT KLEEQI SP     ADLAS  +EPDMQHKT+ PSA KPSSSK+TRKRGRRK    KQKRK TG
Subjt:  EEVQIPEEVNTQFIEQTPKLEEQIISPAAASQADLASEDKEPDMQHKTKPPSAVKPSSSKRTRKRGRRKISSSKQKRKATG

XP_022974817.1 uncharacterized protein LOC111473601 isoform X1 [Cucurbita maxima]1.1e-20881.5Show/hide
Query:  MGPEIGSKVKEEAL-----MEDKNAVQDKQSASSGQEKIHDMEAPSVERTMMLDRSEDMELDIIGCTDNCEGGPSSECNVSTENSSSFGDTVSGTDYGLL
        M PE GSKV+E  +     +EDKNA QDKQS SSGQEK H MEAPSVE++MM +RSEDME+DIIGCTDNCEGGPSSECN STE SSSFGDTVSGTDYGLL
Subjt:  MGPEIGSKVKEEAL-----MEDKNAVQDKQSASSGQEKIHDMEAPSVERTMMLDRSEDMELDIIGCTDNCEGGPSSECNVSTENSSSFGDTVSGTDYGLL

Query:  LDDEEVESQLYGDNNLQPMSNGYREVFPRKKKLTDHWRKFISPVMWRCRWLELQIKKLQSQSLKYDRELALYDQRKQSVYEHFSMEDFDVKSTGFSIHTQ
        LDDEEVESQLY DNNLQP+SNG REVFPRKKKLTDHW+KFISPV WRCRWLEL+I+KLQSQS KYDRELALYDQRKQSVYE FS EDFDVKSTGFS HTQ
Subjt:  LDDEEVESQLYGDNNLQPMSNGYREVFPRKKKLTDHWRKFISPVMWRCRWLELQIKKLQSQSLKYDRELALYDQRKQSVYEHFSMEDFDVKSTGFSIHTQ

Query:  RHRVMKRKRRKKTEETTEAASYMGHHNVFSYYEKKRPIADDITMEDTSLKLDKTRNMKHDAINDFGPIATDGWPSSMLGDNDNNLEEIFLKIEAAQSKVH
        R R+MKRK+RKK EETTE ASYM HHN+FSYYEKKR +ADDI+ E TSLKL+KT+NM+HD INDF  IATDGWPSSMLGDNDNNLEE+FLKIEAAQS+VH
Subjt:  RHRVMKRKRRKKTEETTEAASYMGHHNVFSYYEKKRPIADDITMEDTSLKLDKTRNMKHDAINDFGPIATDGWPSSMLGDNDNNLEEIFLKIEAAQSKVH

Query:  ELKNRIDKVVNENPMKFSSISQLYLLASSDDPASPEDGNDVFVRSLHEASQHMSEHALDVLMPESATRSHGEVMLLPDMIQSVDRG-STEKVLMQDSAVK
        ELKNRIDKVVNENPMKFSSI+Q+Y+LASSDDPASPEDGNDVFVRSLHEASQHMSE A DVLMPE+A  SHGEVMLLPDMIQS D G ST+KVL+QDSAVK
Subjt:  ELKNRIDKVVNENPMKFSSISQLYLLASSDDPASPEDGNDVFVRSLHEASQHMSEHALDVLMPESATRSHGEVMLLPDMIQSVDRG-STEKVLMQDSAVK

Query:  EEVQIPEEVNTQFIEQTPKLEEQIISPAAASQADLASEDKEPDMQHKTKPPSAVKPSSSKRTRKRGRRKISSSKQKRKATG
        EE QI EEVN QFIEQT KLEEQI SP     ADLAS  +EPDMQHKT+ PSA KPSSSK+TRKRGRRK    KQKRK TG
Subjt:  EEVQIPEEVNTQFIEQTPKLEEQIISPAAASQADLASEDKEPDMQHKTKPPSAVKPSSSKRTRKRGRRKISSSKQKRKATG

XP_022974820.1 uncharacterized protein LOC111473601 isoform X3 [Cucurbita maxima]1.1e-20881.5Show/hide
Query:  MGPEIGSKVKEEAL-----MEDKNAVQDKQSASSGQEKIHDMEAPSVERTMMLDRSEDMELDIIGCTDNCEGGPSSECNVSTENSSSFGDTVSGTDYGLL
        M PE GSKV+E  +     +EDKNA QDKQS SSGQEK H MEAPSVE++MM +RSEDME+DIIGCTDNCEGGPSSECN STE SSSFGDTVSGTDYGLL
Subjt:  MGPEIGSKVKEEAL-----MEDKNAVQDKQSASSGQEKIHDMEAPSVERTMMLDRSEDMELDIIGCTDNCEGGPSSECNVSTENSSSFGDTVSGTDYGLL

Query:  LDDEEVESQLYGDNNLQPMSNGYREVFPRKKKLTDHWRKFISPVMWRCRWLELQIKKLQSQSLKYDRELALYDQRKQSVYEHFSMEDFDVKSTGFSIHTQ
        LDDEEVESQLY DNNLQP+SNG REVFPRKKKLTDHW+KFISPV WRCRWLEL+I+KLQSQS KYDRELALYDQRKQSVYE FS EDFDVKSTGFS HTQ
Subjt:  LDDEEVESQLYGDNNLQPMSNGYREVFPRKKKLTDHWRKFISPVMWRCRWLELQIKKLQSQSLKYDRELALYDQRKQSVYEHFSMEDFDVKSTGFSIHTQ

Query:  RHRVMKRKRRKKTEETTEAASYMGHHNVFSYYEKKRPIADDITMEDTSLKLDKTRNMKHDAINDFGPIATDGWPSSMLGDNDNNLEEIFLKIEAAQSKVH
        R R+MKRK+RKK EETTE ASYM HHN+FSYYEKKR +ADDI+ E TSLKL+KT+NM+HD INDF  IATDGWPSSMLGDNDNNLEE+FLKIEAAQS+VH
Subjt:  RHRVMKRKRRKKTEETTEAASYMGHHNVFSYYEKKRPIADDITMEDTSLKLDKTRNMKHDAINDFGPIATDGWPSSMLGDNDNNLEEIFLKIEAAQSKVH

Query:  ELKNRIDKVVNENPMKFSSISQLYLLASSDDPASPEDGNDVFVRSLHEASQHMSEHALDVLMPESATRSHGEVMLLPDMIQSVDRG-STEKVLMQDSAVK
        ELKNRIDKVVNENPMKFSSI+Q+Y+LASSDDPASPEDGNDVFVRSLHEASQHMSE A DVLMPE+A  SHGEVMLLPDMIQS D G ST+KVL+QDSAVK
Subjt:  ELKNRIDKVVNENPMKFSSISQLYLLASSDDPASPEDGNDVFVRSLHEASQHMSEHALDVLMPESATRSHGEVMLLPDMIQSVDRG-STEKVLMQDSAVK

Query:  EEVQIPEEVNTQFIEQTPKLEEQIISPAAASQADLASEDKEPDMQHKTKPPSAVKPSSSKRTRKRGRRKISSSKQKRKATG
        EE QI EEVN QFIEQT KLEEQI SP     ADLAS  +EPDMQHKT+ PSA KPSSSK+TRKRGRRK    KQKRK TG
Subjt:  EEVQIPEEVNTQFIEQTPKLEEQIISPAAASQADLASEDKEPDMQHKTKPPSAVKPSSSKRTRKRGRRKISSSKQKRKATG

XP_023540268.1 uncharacterized protein LOC111800693 isoform X3 [Cucurbita pepo subsp. pepo]1.0e-20680.87Show/hide
Query:  MGPEIGSKVKEEAL-----MEDKNAVQDKQSASSGQEKIHDMEAPSVERTMMLDRSEDMELDIIGCTDNCEGGPSSECNVSTENSSSFGDTVSGTDYGLL
        M PE GSKV+E  +     +EDKNA QDKQS SSGQEK H MEAPSVER+MM DRSEDME+DIIGCTDNCEGGPSSECN STENSSSFGDTVSGTDYG L
Subjt:  MGPEIGSKVKEEAL-----MEDKNAVQDKQSASSGQEKIHDMEAPSVERTMMLDRSEDMELDIIGCTDNCEGGPSSECNVSTENSSSFGDTVSGTDYGLL

Query:  LDDEEVESQLYGDNNLQPMSNGYREVFPRKKKLTDHWRKFISPVMWRCRWLELQIKKLQSQSLKYDRELALYDQRKQSVYEHFSMEDFDVKSTGFSIHTQ
        LDDEEVESQLY DNNLQP+SNG REVFPRKKKLTDHW+KFISPV WRCRWLEL+I+KLQSQS KYDRELALYDQRKQSVYE FS EDFDVKSTGF  HTQ
Subjt:  LDDEEVESQLYGDNNLQPMSNGYREVFPRKKKLTDHWRKFISPVMWRCRWLELQIKKLQSQSLKYDRELALYDQRKQSVYEHFSMEDFDVKSTGFSIHTQ

Query:  RHRVMKRKRRKKTEETTEAASYMGHHNVFSYYEKKRPIADDITMEDTSLKLDKTRNMKHDAINDFGPIATDGWPSSMLGDNDNNLEEIFLKIEAAQSKVH
        R R+MKRK+RKK EETTE ASYM HHN+FSYYEKKR +ADDI+ E TSLKL+KT+NM+HD INDF  IATDGWPSSMLGDNDNNLEE+FLKIEAAQS+VH
Subjt:  RHRVMKRKRRKKTEETTEAASYMGHHNVFSYYEKKRPIADDITMEDTSLKLDKTRNMKHDAINDFGPIATDGWPSSMLGDNDNNLEEIFLKIEAAQSKVH

Query:  ELKNRIDKVVNENPMKFSSISQLYLLASSDDPASPEDGNDVFVRSLHEASQHMSEHALDVLMPESATRSHGEVMLLPDMIQSVDRG-STEKVLMQDSAVK
        ELKNRIDKVVNENPMKFSSI+Q+Y+LASSDDPASPEDGNDVFVRSLHEASQHMSE A DVLMPE+A  SHGEVMLLPDMIQS D G ST+KVL+QDSA K
Subjt:  ELKNRIDKVVNENPMKFSSISQLYLLASSDDPASPEDGNDVFVRSLHEASQHMSEHALDVLMPESATRSHGEVMLLPDMIQSVDRG-STEKVLMQDSAVK

Query:  EEVQIPEEVNTQFIEQTPKLEEQIISPAAASQADLASEDKEPDMQHKTKPPSAVKPSSSKRTRKRGRRKISSSKQKRKATG
        EE QI EEV+ QFIE+T KLEEQI      S ADLAS  +EPDMQHKT+ PSA KPSSSK+TRKRGRRK    KQKRK TG
Subjt:  EEVQIPEEVNTQFIEQTPKLEEQIISPAAASQADLASEDKEPDMQHKTKPPSAVKPSSSKRTRKRGRRKISSSKQKRKATG

TrEMBL top hitse value%identityAlignment
A0A6J1G437 uncharacterized protein LOC111450568 isoform X14.6e-20881.08Show/hide
Query:  MGPEIGSKVKEEAL-----MEDKNAVQDKQSASSGQEKIHDMEAPSVERTMMLDRSEDMELDIIGCTDNCEGGPSSECNVSTENSSSFGDTVSGTDYGLL
        M PE GSKV+E  +     +EDKNA QDK S SSGQEK H MEAPSVER+MM DR EDME+DIIGCTDNCEGGPSSECN STENSSSFGDTVSGTDYG L
Subjt:  MGPEIGSKVKEEAL-----MEDKNAVQDKQSASSGQEKIHDMEAPSVERTMMLDRSEDMELDIIGCTDNCEGGPSSECNVSTENSSSFGDTVSGTDYGLL

Query:  LDDEEVESQLYGDNNLQPMSNGYREVFPRKKKLTDHWRKFISPVMWRCRWLELQIKKLQSQSLKYDRELALYDQRKQSVYEHFSMEDFDVKSTGFSIHTQ
        LDDEEVESQLY DNNLQP+SNG REVFPRKKKLTDHW+KFISPV WRCRWLEL+I+KLQSQS KYDRELALYDQRKQSVYE FS EDFDVKSTGFS HTQ
Subjt:  LDDEEVESQLYGDNNLQPMSNGYREVFPRKKKLTDHWRKFISPVMWRCRWLELQIKKLQSQSLKYDRELALYDQRKQSVYEHFSMEDFDVKSTGFSIHTQ

Query:  RHRVMKRKRRKKTEETTEAASYMGHHNVFSYYEKKRPIADDITMEDTSLKLDKTRNMKHDAINDFGPIATDGWPSSMLGDNDNNLEEIFLKIEAAQSKVH
        R R+MKRK+RKK EETTE ASYM HHN+FSYYEKKR +ADDI+ E TSLKL+KT+NM+HD INDF  IATDGWPSSMLGDNDNNLEE+FLKIEAAQS+VH
Subjt:  RHRVMKRKRRKKTEETTEAASYMGHHNVFSYYEKKRPIADDITMEDTSLKLDKTRNMKHDAINDFGPIATDGWPSSMLGDNDNNLEEIFLKIEAAQSKVH

Query:  ELKNRIDKVVNENPMKFSSISQLYLLASSDDPASPEDGNDVFVRSLHEASQHMSEHALDVLMPESATRSHGEVMLLPDMIQSVDRG-STEKVLMQDSAVK
        ELKNRIDKVVNENPMKFSSI+Q+Y+LASSDDPASPEDGNDVFVRSLHEASQHMSE A DVLMPE+A  SHGEVMLLPDMIQ+ D G ST+KVL+QDSA K
Subjt:  ELKNRIDKVVNENPMKFSSISQLYLLASSDDPASPEDGNDVFVRSLHEASQHMSEHALDVLMPESATRSHGEVMLLPDMIQSVDRG-STEKVLMQDSAVK

Query:  EEVQIPEEVNTQFIEQTPKLEEQIISPAAASQADLASEDKEPDMQHKTKPPSAVKPSSSKRTRKRGRRKISSSKQKRKATG
        EE QI EEVN QFIEQT KLEEQI SP     ADLAS  +EPDMQHKT+ PSA KPSSSK+TRKRGRRK    KQKRK TG
Subjt:  EEVQIPEEVNTQFIEQTPKLEEQIISPAAASQADLASEDKEPDMQHKTKPPSAVKPSSSKRTRKRGRRKISSSKQKRKATG

A0A6J1G468 uncharacterized protein LOC111450568 isoform X34.6e-20881.08Show/hide
Query:  MGPEIGSKVKEEAL-----MEDKNAVQDKQSASSGQEKIHDMEAPSVERTMMLDRSEDMELDIIGCTDNCEGGPSSECNVSTENSSSFGDTVSGTDYGLL
        M PE GSKV+E  +     +EDKNA QDK S SSGQEK H MEAPSVER+MM DR EDME+DIIGCTDNCEGGPSSECN STENSSSFGDTVSGTDYG L
Subjt:  MGPEIGSKVKEEAL-----MEDKNAVQDKQSASSGQEKIHDMEAPSVERTMMLDRSEDMELDIIGCTDNCEGGPSSECNVSTENSSSFGDTVSGTDYGLL

Query:  LDDEEVESQLYGDNNLQPMSNGYREVFPRKKKLTDHWRKFISPVMWRCRWLELQIKKLQSQSLKYDRELALYDQRKQSVYEHFSMEDFDVKSTGFSIHTQ
        LDDEEVESQLY DNNLQP+SNG REVFPRKKKLTDHW+KFISPV WRCRWLEL+I+KLQSQS KYDRELALYDQRKQSVYE FS EDFDVKSTGFS HTQ
Subjt:  LDDEEVESQLYGDNNLQPMSNGYREVFPRKKKLTDHWRKFISPVMWRCRWLELQIKKLQSQSLKYDRELALYDQRKQSVYEHFSMEDFDVKSTGFSIHTQ

Query:  RHRVMKRKRRKKTEETTEAASYMGHHNVFSYYEKKRPIADDITMEDTSLKLDKTRNMKHDAINDFGPIATDGWPSSMLGDNDNNLEEIFLKIEAAQSKVH
        R R+MKRK+RKK EETTE ASYM HHN+FSYYEKKR +ADDI+ E TSLKL+KT+NM+HD INDF  IATDGWPSSMLGDNDNNLEE+FLKIEAAQS+VH
Subjt:  RHRVMKRKRRKKTEETTEAASYMGHHNVFSYYEKKRPIADDITMEDTSLKLDKTRNMKHDAINDFGPIATDGWPSSMLGDNDNNLEEIFLKIEAAQSKVH

Query:  ELKNRIDKVVNENPMKFSSISQLYLLASSDDPASPEDGNDVFVRSLHEASQHMSEHALDVLMPESATRSHGEVMLLPDMIQSVDRG-STEKVLMQDSAVK
        ELKNRIDKVVNENPMKFSSI+Q+Y+LASSDDPASPEDGNDVFVRSLHEASQHMSE A DVLMPE+A  SHGEVMLLPDMIQ+ D G ST+KVL+QDSA K
Subjt:  ELKNRIDKVVNENPMKFSSISQLYLLASSDDPASPEDGNDVFVRSLHEASQHMSEHALDVLMPESATRSHGEVMLLPDMIQSVDRG-STEKVLMQDSAVK

Query:  EEVQIPEEVNTQFIEQTPKLEEQIISPAAASQADLASEDKEPDMQHKTKPPSAVKPSSSKRTRKRGRRKISSSKQKRKATG
        EE QI EEVN QFIEQT KLEEQI SP     ADLAS  +EPDMQHKT+ PSA KPSSSK+TRKRGRRK    KQKRK TG
Subjt:  EEVQIPEEVNTQFIEQTPKLEEQIISPAAASQADLASEDKEPDMQHKTKPPSAVKPSSSKRTRKRGRRKISSSKQKRKATG

A0A6J1IBA7 uncharacterized protein LOC111473601 isoform X15.4e-20981.5Show/hide
Query:  MGPEIGSKVKEEAL-----MEDKNAVQDKQSASSGQEKIHDMEAPSVERTMMLDRSEDMELDIIGCTDNCEGGPSSECNVSTENSSSFGDTVSGTDYGLL
        M PE GSKV+E  +     +EDKNA QDKQS SSGQEK H MEAPSVE++MM +RSEDME+DIIGCTDNCEGGPSSECN STE SSSFGDTVSGTDYGLL
Subjt:  MGPEIGSKVKEEAL-----MEDKNAVQDKQSASSGQEKIHDMEAPSVERTMMLDRSEDMELDIIGCTDNCEGGPSSECNVSTENSSSFGDTVSGTDYGLL

Query:  LDDEEVESQLYGDNNLQPMSNGYREVFPRKKKLTDHWRKFISPVMWRCRWLELQIKKLQSQSLKYDRELALYDQRKQSVYEHFSMEDFDVKSTGFSIHTQ
        LDDEEVESQLY DNNLQP+SNG REVFPRKKKLTDHW+KFISPV WRCRWLEL+I+KLQSQS KYDRELALYDQRKQSVYE FS EDFDVKSTGFS HTQ
Subjt:  LDDEEVESQLYGDNNLQPMSNGYREVFPRKKKLTDHWRKFISPVMWRCRWLELQIKKLQSQSLKYDRELALYDQRKQSVYEHFSMEDFDVKSTGFSIHTQ

Query:  RHRVMKRKRRKKTEETTEAASYMGHHNVFSYYEKKRPIADDITMEDTSLKLDKTRNMKHDAINDFGPIATDGWPSSMLGDNDNNLEEIFLKIEAAQSKVH
        R R+MKRK+RKK EETTE ASYM HHN+FSYYEKKR +ADDI+ E TSLKL+KT+NM+HD INDF  IATDGWPSSMLGDNDNNLEE+FLKIEAAQS+VH
Subjt:  RHRVMKRKRRKKTEETTEAASYMGHHNVFSYYEKKRPIADDITMEDTSLKLDKTRNMKHDAINDFGPIATDGWPSSMLGDNDNNLEEIFLKIEAAQSKVH

Query:  ELKNRIDKVVNENPMKFSSISQLYLLASSDDPASPEDGNDVFVRSLHEASQHMSEHALDVLMPESATRSHGEVMLLPDMIQSVDRG-STEKVLMQDSAVK
        ELKNRIDKVVNENPMKFSSI+Q+Y+LASSDDPASPEDGNDVFVRSLHEASQHMSE A DVLMPE+A  SHGEVMLLPDMIQS D G ST+KVL+QDSAVK
Subjt:  ELKNRIDKVVNENPMKFSSISQLYLLASSDDPASPEDGNDVFVRSLHEASQHMSEHALDVLMPESATRSHGEVMLLPDMIQSVDRG-STEKVLMQDSAVK

Query:  EEVQIPEEVNTQFIEQTPKLEEQIISPAAASQADLASEDKEPDMQHKTKPPSAVKPSSSKRTRKRGRRKISSSKQKRKATG
        EE QI EEVN QFIEQT KLEEQI SP     ADLAS  +EPDMQHKT+ PSA KPSSSK+TRKRGRRK    KQKRK TG
Subjt:  EEVQIPEEVNTQFIEQTPKLEEQIISPAAASQADLASEDKEPDMQHKTKPPSAVKPSSSKRTRKRGRRKISSSKQKRKATG

A0A6J1IEX6 uncharacterized protein LOC111473601 isoform X35.4e-20981.5Show/hide
Query:  MGPEIGSKVKEEAL-----MEDKNAVQDKQSASSGQEKIHDMEAPSVERTMMLDRSEDMELDIIGCTDNCEGGPSSECNVSTENSSSFGDTVSGTDYGLL
        M PE GSKV+E  +     +EDKNA QDKQS SSGQEK H MEAPSVE++MM +RSEDME+DIIGCTDNCEGGPSSECN STE SSSFGDTVSGTDYGLL
Subjt:  MGPEIGSKVKEEAL-----MEDKNAVQDKQSASSGQEKIHDMEAPSVERTMMLDRSEDMELDIIGCTDNCEGGPSSECNVSTENSSSFGDTVSGTDYGLL

Query:  LDDEEVESQLYGDNNLQPMSNGYREVFPRKKKLTDHWRKFISPVMWRCRWLELQIKKLQSQSLKYDRELALYDQRKQSVYEHFSMEDFDVKSTGFSIHTQ
        LDDEEVESQLY DNNLQP+SNG REVFPRKKKLTDHW+KFISPV WRCRWLEL+I+KLQSQS KYDRELALYDQRKQSVYE FS EDFDVKSTGFS HTQ
Subjt:  LDDEEVESQLYGDNNLQPMSNGYREVFPRKKKLTDHWRKFISPVMWRCRWLELQIKKLQSQSLKYDRELALYDQRKQSVYEHFSMEDFDVKSTGFSIHTQ

Query:  RHRVMKRKRRKKTEETTEAASYMGHHNVFSYYEKKRPIADDITMEDTSLKLDKTRNMKHDAINDFGPIATDGWPSSMLGDNDNNLEEIFLKIEAAQSKVH
        R R+MKRK+RKK EETTE ASYM HHN+FSYYEKKR +ADDI+ E TSLKL+KT+NM+HD INDF  IATDGWPSSMLGDNDNNLEE+FLKIEAAQS+VH
Subjt:  RHRVMKRKRRKKTEETTEAASYMGHHNVFSYYEKKRPIADDITMEDTSLKLDKTRNMKHDAINDFGPIATDGWPSSMLGDNDNNLEEIFLKIEAAQSKVH

Query:  ELKNRIDKVVNENPMKFSSISQLYLLASSDDPASPEDGNDVFVRSLHEASQHMSEHALDVLMPESATRSHGEVMLLPDMIQSVDRG-STEKVLMQDSAVK
        ELKNRIDKVVNENPMKFSSI+Q+Y+LASSDDPASPEDGNDVFVRSLHEASQHMSE A DVLMPE+A  SHGEVMLLPDMIQS D G ST+KVL+QDSAVK
Subjt:  ELKNRIDKVVNENPMKFSSISQLYLLASSDDPASPEDGNDVFVRSLHEASQHMSEHALDVLMPESATRSHGEVMLLPDMIQSVDRG-STEKVLMQDSAVK

Query:  EEVQIPEEVNTQFIEQTPKLEEQIISPAAASQADLASEDKEPDMQHKTKPPSAVKPSSSKRTRKRGRRKISSSKQKRKATG
        EE QI EEVN QFIEQT KLEEQI SP     ADLAS  +EPDMQHKT+ PSA KPSSSK+TRKRGRRK    KQKRK TG
Subjt:  EEVQIPEEVNTQFIEQTPKLEEQIISPAAASQADLASEDKEPDMQHKTKPPSAVKPSSSKRTRKRGRRKISSSKQKRKATG

A0A6J1IHF9 uncharacterized protein LOC111473601 isoform X23.6e-20580.67Show/hide
Query:  MGPEIGSKVKEEAL-----MEDKNAVQDKQSASSGQEKIHDMEAPSVERTMMLDRSEDMELDIIGCTDNCEGGPSSECNVSTENSSSFGDTVSGTDYGLL
        M PE GSKV+E  +     +EDKNA QDKQS SSGQEK H MEAPSVE++MM +RSEDME+DIIGCTDNCEGGPSSECN STE SSSFGDTVSGTDYGLL
Subjt:  MGPEIGSKVKEEAL-----MEDKNAVQDKQSASSGQEKIHDMEAPSVERTMMLDRSEDMELDIIGCTDNCEGGPSSECNVSTENSSSFGDTVSGTDYGLL

Query:  LDDEEVESQLYGDNNLQPMSNGYREVFPRKKKLTDHWRKFISPVMWRCRWLELQIKKLQSQSLKYDRELALYDQRKQSVYEHFSMEDFDVKSTGFSIHTQ
        LDDEEVESQLY DNNLQP+SNG REVFPRKKKLTDHW+KFISPV WRCRWLEL+I+KLQSQS KYDRELALYDQRKQSVYE FS EDFDVKSTGFS HTQ
Subjt:  LDDEEVESQLYGDNNLQPMSNGYREVFPRKKKLTDHWRKFISPVMWRCRWLELQIKKLQSQSLKYDRELALYDQRKQSVYEHFSMEDFDVKSTGFSIHTQ

Query:  RHRVMKRKRRKKTEETTEAASYMGHHNVFSYYEKKRPIADDITMEDTSLKLDKTRNMKHDAINDFGPIATDGWPSSMLGDNDNNLEEIFLKIEAAQSKVH
        R R+MKRK+RKK EETTE ASYM HHN+FSYYEKKR +ADDI+ E      DKT+NM+HD INDF  IATDGWPSSMLGDNDNNLEE+FLKIEAAQS+VH
Subjt:  RHRVMKRKRRKKTEETTEAASYMGHHNVFSYYEKKRPIADDITMEDTSLKLDKTRNMKHDAINDFGPIATDGWPSSMLGDNDNNLEEIFLKIEAAQSKVH

Query:  ELKNRIDKVVNENPMKFSSISQLYLLASSDDPASPEDGNDVFVRSLHEASQHMSEHALDVLMPESATRSHGEVMLLPDMIQSVDRG-STEKVLMQDSAVK
        ELKNRIDKVVNENPMKFSSI+Q+Y+LASSDDPASPEDGNDVFVRSLHEASQHMSE A DVLMPE+A  SHGEVMLLPDMIQS D G ST+KVL+QDSAVK
Subjt:  ELKNRIDKVVNENPMKFSSISQLYLLASSDDPASPEDGNDVFVRSLHEASQHMSEHALDVLMPESATRSHGEVMLLPDMIQSVDRG-STEKVLMQDSAVK

Query:  EEVQIPEEVNTQFIEQTPKLEEQIISPAAASQADLASEDKEPDMQHKTKPPSAVKPSSSKRTRKRGRRKISSSKQKRKATG
        EE QI EEVN QFIEQT KLEEQI SP     ADLAS  +EPDMQHKT+ PSA KPSSSK+TRKRGRRK    KQKRK TG
Subjt:  EEVQIPEEVNTQFIEQTPKLEEQIISPAAASQADLASEDKEPDMQHKTKPPSAVKPSSSKRTRKRGRRKISSSKQKRKATG

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G50040.1 unknown protein2.8e-3235.29Show/hide
Query:  EDMELDIIGCTDNCEGGPSSECNVSTENSSSFGDTV---SGTDYGLLLDDEEVESQLYGDNNLQ-PMSNGYREVFPRKKKLTDHWRKFISPVMWRCRWLE
        E+  +DI+   D+ E     E      +SSSFGD++    G D+G     +E +S L  D  L     +G   +   KKK  D WR+   P+MWRC+W+E
Subjt:  EDMELDIIGCTDNCEGGPSSECNVSTENSSSFGDTV---SGTDYGLLLDDEEVESQLYGDNNLQ-PMSNGYREVFPRKKKLTDHWRKFISPVMWRCRWLE

Query:  LQIKKLQSQSLKYDRELALYDQRKQSVYEHFSMEDFDVKSTGFSIHTQRHRVMKRKRRKKTEETTEAASYMGHHNVFSYYEKKRPIADDITMEDTSLKLD
        L++K++QSQ+  Y++E+  Y   KQ   E   +E FD KS  F  + QR  V KR RRK+ EETT+ A+YM +HN+FSY +K+ P+       D+     
Subjt:  LQIKKLQSQSLKYDRELALYDQRKQSVYEHFSMEDFDVKSTGFSIHTQRHRVMKRKRRKKTEETTEAASYMGHHNVFSYYEKKRPIADDITMEDTSLKLD

Query:  KTRNMKHDAINDFGPIATDGWPSSMLGDNDNNLEEIFLKIEAAQSKVHELKNRIDKVV-NENPMKFSSISQL
        +    K DAI D   I       S L  +D+ L +   KI+ AQ K   L+ R+D+++ +  P   SS+ Q+
Subjt:  KTRNMKHDAINDFGPIATDGWPSSMLGDNDNNLEEIFLKIEAAQSKVHELKNRIDKVV-NENPMKFSSISQL

AT3G59670.1 unknown protein7.1e-4434.19Show/hide
Query:  KVKEEALMEDKNAVQDKQSASSGQEKIH---DMEAPSVERTMMLDRSEDMELDIIGCTDNCEGGPSSECNVSTENSSSFGDTVSGTDYGLLLD----DEE
        ++ EE+   D   +  K+  S G E           S E    +   E++++DI+   +N       + N +TE SSSF DT S  +  +LLD    + E
Subjt:  KVKEEALMEDKNAVQDKQSASSGQEKIH---DMEAPSVERTMMLDRSEDMELDIIGCTDNCEGGPSSECNVSTENSSSFGDTVSGTDYGLLLD----DEE

Query:  VESQLYGDNNLQPMSNGYREVFP-RKKKLTDHWRKFISPVMWRCRWLELQIKKLQSQSLKYDRELALYDQRKQSVYEHFSMEDF---DVKSTGFSIHTQR
        VES  + + +L P  + +  +F  RKK+LT+HWR+FI P+MWR +W+EL+I++L+S++L+Y +EL LYDQ K       S+ +     +KS  FS    +
Subjt:  VESQLYGDNNLQPMSNGYREVFP-RKKKLTDHWRKFISPVMWRCRWLELQIKKLQSQSLKYDRELALYDQRKQSVYEHFSMEDF---DVKSTGFSIHTQR

Query:  HR-VMKRKRRKKTEETTEAASYMGHHNVFSYYEKKRPIADDITMEDTSLKLDKTRNMKHDAINDFGPIATDGWPSSM-LGDNDNNLEEIFLKIEAAQSKV
         R   KR++RKK E T + ASYM  HN+FSY E KR  +D + + D        R+  ++      P+  D   S     D D+ LEE+  KIE   S+V
Subjt:  HR-VMKRKRRKKTEETTEAASYMGHHNVFSYYEKKRPIADDITMEDTSLKLDKTRNMKHDAINDFGPIATDGWPSSM-LGDNDNNLEEIFLKIEAAQSKV

Query:  HELKNRIDKVVNENPMKFSSISQLYLLASSDDP----ASPEDGNDVFVRSLHEASQHMSEHALD--VLMPESATRSHGEVMLLPDMIQS
        H LK ++D V+++N  +FSS   L LLA+S  P    ++  +G+ +   +++ ASQHM+++ L   V   E    S+G+   +PD+I+S
Subjt:  HELKNRIDKVVNENPMKFSSISQLYLLASSDDP----ASPEDGNDVFVRSLHEASQHMSEHALD--VLMPESATRSHGEVMLLPDMIQS

AT4G37440.1 unknown protein3.6e-4031.78Show/hide
Query:  DMEAPSVERTMMLDRSEDMELDIIGCTDNCEGGPSSECNVSTEN-SSSFGDTVSGTDYGLLLDDEEVESQLYGDNNLQPMSNGYREVFPRKKKLTDHWRK
        D++A +  +  + +  ED E+DI+ C DN E   S  C+  T+  SSSFG T S  +     +D+EV+S +  + +L         ++ RK+KLTDHWR+
Subjt:  DMEAPSVERTMMLDRSEDMELDIIGCTDNCEGGPSSECNVSTEN-SSSFGDTVSGTDYGLLLDDEEVESQLYGDNNLQPMSNGYREVFPRKKKLTDHWRK

Query:  FISP-VMWRCRWLELQIKKLQSQSLKYDRELALYDQRKQSVYEHFSMEDFDVKS-TGFSIHTQRHRVMKRKRRKKTEETTEAASYMGHHNVFSYYEKKRP
        F+ P +MWRC+W+EL+ K+LQ+Q+ KYD+E+  Y Q K+   E+   E+  VK+      +TQ+ R+MKRK RK+ EET +  SY  +HN+FSYY+ ++ 
Subjt:  FISP-VMWRCRWLELQIKKLQSQSLKYDRELALYDQRKQSVYEHFSMEDFDVKS-TGFSIHTQRHRVMKRKRRKKTEETTEAASYMGHHNVFSYYEKKRP

Query:  IADDITMEDTSLKLDKTRNMKHDAINDFGPIATDGWPSSMLGDNDNNLEEIFLKIEAAQSKVHELKNRIDKVVNENPMKFSSISQLYLLASSDDPASPED
        +A DI + D S  LDK      D         ++  P     + D  LE+I LKIEAA+S+   LK R+DKV++ENP  F   + +  L ++D   S E 
Subjt:  IADDITMEDTSLKLDKTRNMKHDAINDFGPIATDGWPSSMLGDNDNNLEEIFLKIEAAQSKVHELKNRIDKVVNENPMKFSSISQLYLLASSDDPASPED

Query:  GNDVFVRSLHEASQHMSEHALDVLMPESATRSHGEV---------MLLPDMIQSVDRGSTEKVLMQDSAVKEEVQIPEEVNTQFIEQTPKLEEQIISPAA
           +      +    +SE        +SA+ S   V         +LL +++ S  R     +  ++    E+  I E  +    ++TP+  E I    +
Subjt:  GNDVFVRSLHEASQHMSEHALDVLMPESATRSHGEV---------MLLPDMIQSVDRGSTEKVLMQDSAVKEEVQIPEEVNTQFIEQTPKLEEQIISPAA

Query:  ASQADLASEDKEPDMQHKTKPPSAVKPSSSKRTRKRGRRKISSSKQKRKA
          +    S +       K K  + +    S R RKRG+R+  S+  +R++
Subjt:  ASQADLASEDKEPDMQHKTKPPSAVKPSSSKRTRKRGRRKISSSKQKRKA

AT4G37440.2 unknown protein3.0e-4234.22Show/hide
Query:  DMEAPSVERTMMLDRSEDMELDIIGCTDNCEGGPSSECNVSTEN-SSSFGDTVSGTDYGLLLDDEEVESQLYGDNNLQPMSNGYREVFPRKKKLTDHWRK
        D++A +  +  + +  ED E+DI+ C DN E   S  C+  T+  SSSFG T S  +     +D+EV+S +  + +L         ++ RK+KLTDHWR+
Subjt:  DMEAPSVERTMMLDRSEDMELDIIGCTDNCEGGPSSECNVSTEN-SSSFGDTVSGTDYGLLLDDEEVESQLYGDNNLQPMSNGYREVFPRKKKLTDHWRK

Query:  FISP-VMWRCRWLELQIKKLQSQSLKYDRELALYDQRKQSVYEHFSMEDFDVKS-TGFSIHTQRHRVMKRKRRKKTEETTEAASYMGHHNVFSYYEKKRP
        F+ P +MWRC+W+EL+ K+LQ+Q+ KYD+E+  Y Q K+   E+   E+  VK+      +TQ+ R+MKRK RK+ EET +  SY  +HN+FSYY+ ++ 
Subjt:  FISP-VMWRCRWLELQIKKLQSQSLKYDRELALYDQRKQSVYEHFSMEDFDVKS-TGFSIHTQRHRVMKRKRRKKTEETTEAASYMGHHNVFSYYEKKRP

Query:  IADDITMEDTSLKLDKTRNMKHDAINDFGPIATDGWPSSMLGDNDNNLEEIFLKIEAAQSKVHELKNRIDKVVNENPMKF----------------SSIS
        +A DI + D S  LDK      D         ++  P     + D  LE+I LKIEAA+S+   LK R+DKV++ENP  F                SS  
Subjt:  IADDITMEDTSLKLDKTRNMKHDAINDFGPIATDGWPSSMLGDNDNNLEEIFLKIEAAQSKVHELKNRIDKVVNENPMKF----------------SSIS

Query:  QLYLLASSDDPASPEDGNDVFVRSLHEASQHMS---EHALDVLMPE--SATRSHGEVMLLPDMIQSVDRGSTEK
        Q  LLA  ++        +  V+S   +S H+S   +   D+L+ E  ++ R  G+ ++    +   ++ S E+
Subjt:  QLYLLASSDDPASPEDGNDVFVRSLHEASQHMS---EHALDVLMPE--SATRSHGEVMLLPDMIQSVDRGSTEK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGACCTGAAATTGGATCAAAAGTGAAGGAGGAAGCTTTGATGGAGGACAAAAATGCTGTTCAGGATAAGCAGAGTGCAAGTAGTGGTCAGGAAAAAATTCATGACAT
GGAAGCTCCATCTGTTGAACGAACTATGATGTTAGATAGAAGTGAGGATATGGAGCTTGATATTATTGGGTGTACAGATAATTGTGAGGGAGGTCCTAGTAGTGAATGCA
ATGTTTCAACTGAAAATTCAAGCTCGTTTGGTGATACTGTTTCTGGGACAGATTATGGTTTGTTATTGGATGATGAAGAAGTTGAATCCCAATTATATGGAGATAATAAT
TTGCAGCCTATGTCTAATGGATACAGAGAAGTATTTCCAAGGAAGAAAAAATTGACAGATCACTGGAGGAAGTTTATAAGTCCTGTTATGTGGCGGTGTAGATGGTTAGA
ACTGCAAATTAAGAAACTTCAGTCTCAATCATTAAAATATGATAGAGAACTTGCATTATATGATCAAAGAAAGCAGTCTGTCTACGAACACTTCTCAATGGAAGATTTTG
ATGTGAAGTCAACAGGATTCTCAATTCACACTCAAAGACACAGGGTTATGAAAAGAAAGAGAAGGAAGAAAACTGAAGAGACAACTGAAGCAGCTTCATATATGGGACAC
CATAATGTGTTCTCCTACTATGAGAAGAAGAGGCCCATTGCTGATGACATAACTATGGAAGATACTTCTCTTAAATTAGACAAGACAAGGAATATGAAACATGATGCCAT
CAATGACTTCGGGCCAATTGCAACTGATGGATGGCCATCTTCTATGTTGGGAGATAATGATAATAATTTGGAAGAAATCTTTCTAAAAATTGAAGCTGCGCAGTCAAAAG
TTCACGAGTTGAAGAACAGAATTGACAAGGTGGTGAATGAAAATCCCATGAAGTTCTCCTCAATCAGTCAGCTATACTTGCTTGCATCAAGTGATGATCCCGCTTCACCT
GAAGACGGAAATGATGTGTTTGTTAGGTCTTTGCATGAAGCATCACAACACATGTCTGAGCATGCATTAGATGTACTTATGCCCGAAAGTGCGACTAGAAGTCATGGAGA
GGTCATGCTACTTCCTGATATGATTCAGAGCGTGGATCGTGGAAGTACTGAGAAAGTTCTGATGCAAGATTCCGCAGTCAAGGAAGAGGTGCAAATTCCTGAAGAGGTTA
ATACTCAGTTTATTGAGCAGACTCCGAAATTGGAGGAGCAAATCATTTCTCCAGCTGCAGCCTCTCAAGCTGACTTAGCCTCAGAAGACAAGGAGCCTGACATGCAACAC
AAAACAAAACCCCCTTCTGCTGTCAAACCTAGTTCATCTAAGAGAACAAGAAAGCGGGGAAGGCGAAAAATCAGTTCGAGTAAACAGAAACGGAAAGCAACAGGTTAG
mRNA sequenceShow/hide mRNA sequence
CGTAAATTCTTCCCACGTTCTGACTGTACAGTTTTTGCTCAATTCCGTTCACTTTCTTCCTCTTTTGATTCCTTTTCCGTTGAATTAACCCGATCTGACTTCTTCATCCA
GTCGTCGTCCGGCTATCTCCAGCCTTCCTTGCCCTTCTATTCTCTCTCCGATCGACGAGTTCAAGCAGAAGAACTGTTCGAAATCAATTCACCCCCAAACCCAGATGTTA
TTGCCTGATTGCACAAAAGAGGGCCTTAATTTGGGCCTATCTTAGTATGGGACCTGAAATTGGATCAAAAGTGAAGGAGGAAGCTTTGATGGAGGACAAAAATGCTGTTC
AGGATAAGCAGAGTGCAAGTAGTGGTCAGGAAAAAATTCATGACATGGAAGCTCCATCTGTTGAACGAACTATGATGTTAGATAGAAGTGAGGATATGGAGCTTGATATT
ATTGGGTGTACAGATAATTGTGAGGGAGGTCCTAGTAGTGAATGCAATGTTTCAACTGAAAATTCAAGCTCGTTTGGTGATACTGTTTCTGGGACAGATTATGGTTTGTT
ATTGGATGATGAAGAAGTTGAATCCCAATTATATGGAGATAATAATTTGCAGCCTATGTCTAATGGATACAGAGAAGTATTTCCAAGGAAGAAAAAATTGACAGATCACT
GGAGGAAGTTTATAAGTCCTGTTATGTGGCGGTGTAGATGGTTAGAACTGCAAATTAAGAAACTTCAGTCTCAATCATTAAAATATGATAGAGAACTTGCATTATATGAT
CAAAGAAAGCAGTCTGTCTACGAACACTTCTCAATGGAAGATTTTGATGTGAAGTCAACAGGATTCTCAATTCACACTCAAAGACACAGGGTTATGAAAAGAAAGAGAAG
GAAGAAAACTGAAGAGACAACTGAAGCAGCTTCATATATGGGACACCATAATGTGTTCTCCTACTATGAGAAGAAGAGGCCCATTGCTGATGACATAACTATGGAAGATA
CTTCTCTTAAATTAGACAAGACAAGGAATATGAAACATGATGCCATCAATGACTTCGGGCCAATTGCAACTGATGGATGGCCATCTTCTATGTTGGGAGATAATGATAAT
AATTTGGAAGAAATCTTTCTAAAAATTGAAGCTGCGCAGTCAAAAGTTCACGAGTTGAAGAACAGAATTGACAAGGTGGTGAATGAAAATCCCATGAAGTTCTCCTCAAT
CAGTCAGCTATACTTGCTTGCATCAAGTGATGATCCCGCTTCACCTGAAGACGGAAATGATGTGTTTGTTAGGTCTTTGCATGAAGCATCACAACACATGTCTGAGCATG
CATTAGATGTACTTATGCCCGAAAGTGCGACTAGAAGTCATGGAGAGGTCATGCTACTTCCTGATATGATTCAGAGCGTGGATCGTGGAAGTACTGAGAAAGTTCTGATG
CAAGATTCCGCAGTCAAGGAAGAGGTGCAAATTCCTGAAGAGGTTAATACTCAGTTTATTGAGCAGACTCCGAAATTGGAGGAGCAAATCATTTCTCCAGCTGCAGCCTC
TCAAGCTGACTTAGCCTCAGAAGACAAGGAGCCTGACATGCAACACAAAACAAAACCCCCTTCTGCTGTCAAACCTAGTTCATCTAAGAGAACAAGAAAGCGGGGAAGGC
GAAAAATCAGTTCGAGTAAACAGAAACGGAAAGCAACAGGTTAGCTGGATAAATGGGACTACAGAAGATGGTTATTTTTCTTCTTATGATGTGGTTGATGGAGGTTAGAA
TTTTGTTAATGGTTATTGCTCTACTTCTACTGTATATTTTCCATGTCATGGTCCTCTCTCGTCTGCTGCACATATCAACTCAATTATTTGTATATGCTGCAAATATTCAT
TAATTATGTTAATATGAAGAATAGCATTAATCCACAGTGGTTGGCCAC
Protein sequenceShow/hide protein sequence
MGPEIGSKVKEEALMEDKNAVQDKQSASSGQEKIHDMEAPSVERTMMLDRSEDMELDIIGCTDNCEGGPSSECNVSTENSSSFGDTVSGTDYGLLLDDEEVESQLYGDNN
LQPMSNGYREVFPRKKKLTDHWRKFISPVMWRCRWLELQIKKLQSQSLKYDRELALYDQRKQSVYEHFSMEDFDVKSTGFSIHTQRHRVMKRKRRKKTEETTEAASYMGH
HNVFSYYEKKRPIADDITMEDTSLKLDKTRNMKHDAINDFGPIATDGWPSSMLGDNDNNLEEIFLKIEAAQSKVHELKNRIDKVVNENPMKFSSISQLYLLASSDDPASP
EDGNDVFVRSLHEASQHMSEHALDVLMPESATRSHGEVMLLPDMIQSVDRGSTEKVLMQDSAVKEEVQIPEEVNTQFIEQTPKLEEQIISPAAASQADLASEDKEPDMQH
KTKPPSAVKPSSSKRTRKRGRRKISSSKQKRKATG