; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Carg01005 (gene) of Silver-seed gourd (SMH-JMG-627) v2 genome

Gene IDCarg01005
OrganismCucurbita argyrosperma subsp. argyrosperma cv. SMH-JMG-627 (Silver-seed gourd (SMH-JMG-627) v2)
DescriptionNuclear pore complex protein NUP62 isoform X2
Genome locationCarg_Chr06:1690572..1694461
RNA-Seq ExpressionCarg01005
SyntenyCarg01005
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6596453.1 hypothetical protein SDJN03_09633, partial [Cucurbita argyrosperma subsp. sororia]1.1e-27991.89Show/hide
Query:  MDAVYTDHDRRFSRLSLIDFASEDDLLISSPSCDFHDANSLGFAKEDEEQDNFERLETVESNRIEERTDGFEQREDEPQLLPSSEPERIERNGKYNLRKS
        MDAVYTDHDRRFSRLSLIDFASEDDLLISSPSCDFHDANSLGFAKEDEEQDNFERLETVESNRIEERTDGFEQREDEPQLLPSSEPERIERNGKYNLRKS
Subjt:  MDAVYTDHDRRFSRLSLIDFASEDDLLISSPSCDFHDANSLGFAKEDEEQDNFERLETVESNRIEERTDGFEQREDEPQLLPSSEPERIERNGKYNLRKS

Query:  LAWDSAFLTGAGFLDPEELTSMIAPVGRHEKRVLPIIPEDVLKSSDSISTLGSEIMPLESIEGNLFED--------------------------------
        LAWDSAFLTGAGFLDPEELTSMIAPVGRHEKRVLPIIPEDVLKSSDSISTLGSEIMPLESIEGNLFED                                
Subjt:  LAWDSAFLTGAGFLDPEELTSMIAPVGRHEKRVLPIIPEDVLKSSDSISTLGSEIMPLESIEGNLFED--------------------------------

Query:  ------------IKDGSASSKQSDGLQGPGRTTKQNSSQPREAQQLKAMGKLPSNSTLTKRPSLAATVNDSTTSGAGSADGQDSVGLRSTTHKLSRLPTA
                    IKDGSASSKQSDGLQGPGRTTKQNSSQPREAQ   AMGKLPSNSTLTKRPSLAATVNDSTTSGAGSADGQDSVGLRSTTHKLSRLPTA
Subjt:  ------------IKDGSASSKQSDGLQGPGRTTKQNSSQPREAQQLKAMGKLPSNSTLTKRPSLAATVNDSTTSGAGSADGQDSVGLRSTTHKLSRLPTA

Query:  GTREQKTSSEVSISSSNKVGKSSSKDARIKTDCKASPSSRSTQKTQSRVAPKVKTRIGNSRLPSYTISQSKHSSGISSASSKSEWSTESSSNSTHELQSN
        GTREQKTSSEVSISSSNKVGKSSSKDARIKTDCKASPSS STQKTQSRVAPKVKTRIGNSRLPSYTISQSKHSSGISSASSKSEWSTESSSNSTHELQSN
Subjt:  GTREQKTSSEVSISSSNKVGKSSSKDARIKTDCKASPSSRSTQKTQSRVAPKVKTRIGNSRLPSYTISQSKHSSGISSASSKSEWSTESSSNSTHELQSN

Query:  SSRASLHSISSKRISVDSDASHDGSNPAVGSHTQTTGSIKPSGLRLPSPKIGYFDGGKTSIMKSNVSVPGGTTKIGAGNVSAIGGQNKTKPSKLQPVTVL
        SSRASLHSISSKRISVDSDASHDGSNPAVGSHTQTTGSIKPSGLRLPSPKIGYFDGGKTSIMKSNVSVPGGTTKIGAGNVSAIGGQNKTKPSKLQPVTVL
Subjt:  SSRASLHSISSKRISVDSDASHDGSNPAVGSHTQTTGSIKPSGLRLPSPKIGYFDGGKTSIMKSNVSVPGGTTKIGAGNVSAIGGQNKTKPSKLQPVTVL

Query:  PKSTTRAVSQPNLNLKSHKTTATKMSKTNELDQEVKELGCDGSNTDMHNSDVCAISIATREMSLENEAGSNANETTSANTDGELNGLQNFGT
        PKSTTRAVSQPNLNLKSHKTTATKMSKTNELDQEVKELGCDGSNTDMHNSDVCAISIATREMSLENEAGSNANETTSANTDGELNGLQNFGT
Subjt:  PKSTTRAVSQPNLNLKSHKTTATKMSKTNELDQEVKELGCDGSNTDMHNSDVCAISIATREMSLENEAGSNANETTSANTDGELNGLQNFGT

KAG7027996.1 hypothetical protein SDJN02_09175 [Cucurbita argyrosperma subsp. argyrosperma]5.1e-290100Show/hide
Query:  MDAVYTDHDRRFSRLSLIDFASEDDLLISSPSCDFHDANSLGFAKEDEEQDNFERLETVESNRIEERTDGFEQREDEPQLLPSSEPERIERNGKYNLRKS
        MDAVYTDHDRRFSRLSLIDFASEDDLLISSPSCDFHDANSLGFAKEDEEQDNFERLETVESNRIEERTDGFEQREDEPQLLPSSEPERIERNGKYNLRKS
Subjt:  MDAVYTDHDRRFSRLSLIDFASEDDLLISSPSCDFHDANSLGFAKEDEEQDNFERLETVESNRIEERTDGFEQREDEPQLLPSSEPERIERNGKYNLRKS

Query:  LAWDSAFLTGAGFLDPEELTSMIAPVGRHEKRVLPIIPEDVLKSSDSISTLGSEIMPLESIEGNLFEDIKDGSASSKQSDGLQGPGRTTKQNSSQPREAQ
        LAWDSAFLTGAGFLDPEELTSMIAPVGRHEKRVLPIIPEDVLKSSDSISTLGSEIMPLESIEGNLFEDIKDGSASSKQSDGLQGPGRTTKQNSSQPREAQ
Subjt:  LAWDSAFLTGAGFLDPEELTSMIAPVGRHEKRVLPIIPEDVLKSSDSISTLGSEIMPLESIEGNLFEDIKDGSASSKQSDGLQGPGRTTKQNSSQPREAQ

Query:  QLKAMGKLPSNSTLTKRPSLAATVNDSTTSGAGSADGQDSVGLRSTTHKLSRLPTAGTREQKTSSEVSISSSNKVGKSSSKDARIKTDCKASPSSRSTQK
        QLKAMGKLPSNSTLTKRPSLAATVNDSTTSGAGSADGQDSVGLRSTTHKLSRLPTAGTREQKTSSEVSISSSNKVGKSSSKDARIKTDCKASPSSRSTQK
Subjt:  QLKAMGKLPSNSTLTKRPSLAATVNDSTTSGAGSADGQDSVGLRSTTHKLSRLPTAGTREQKTSSEVSISSSNKVGKSSSKDARIKTDCKASPSSRSTQK

Query:  TQSRVAPKVKTRIGNSRLPSYTISQSKHSSGISSASSKSEWSTESSSNSTHELQSNSSRASLHSISSKRISVDSDASHDGSNPAVGSHTQTTGSIKPSGL
        TQSRVAPKVKTRIGNSRLPSYTISQSKHSSGISSASSKSEWSTESSSNSTHELQSNSSRASLHSISSKRISVDSDASHDGSNPAVGSHTQTTGSIKPSGL
Subjt:  TQSRVAPKVKTRIGNSRLPSYTISQSKHSSGISSASSKSEWSTESSSNSTHELQSNSSRASLHSISSKRISVDSDASHDGSNPAVGSHTQTTGSIKPSGL

Query:  RLPSPKIGYFDGGKTSIMKSNVSVPGGTTKIGAGNVSAIGGQNKTKPSKLQPVTVLPKSTTRAVSQPNLNLKSHKTTATKMSKTNELDQEVKELGCDGSN
        RLPSPKIGYFDGGKTSIMKSNVSVPGGTTKIGAGNVSAIGGQNKTKPSKLQPVTVLPKSTTRAVSQPNLNLKSHKTTATKMSKTNELDQEVKELGCDGSN
Subjt:  RLPSPKIGYFDGGKTSIMKSNVSVPGGTTKIGAGNVSAIGGQNKTKPSKLQPVTVLPKSTTRAVSQPNLNLKSHKTTATKMSKTNELDQEVKELGCDGSN

Query:  TDMHNSDVCAISIATREMSLENEAGSNANETTSANTDGELNGLQNFGT
        TDMHNSDVCAISIATREMSLENEAGSNANETTSANTDGELNGLQNFGT
Subjt:  TDMHNSDVCAISIATREMSLENEAGSNANETTSANTDGELNGLQNFGT

XP_022934275.1 uncharacterized protein LOC111441484 isoform X1 [Cucurbita moschata]2.0e-27891.22Show/hide
Query:  MDAVYTDHDRRFSRLSLIDFASEDDLLISSPSCDFHDANSLGFAKEDEEQDNFERLETVESNRIEERTDGFEQREDEPQLLPSSEPERIERNGKYNLRKS
        MDAVYTDHDRRFSRLSLIDFASEDDLLISSPSCDFHDANSLGFAKEDEEQDNFERLETV+SNRIEERTDGFEQREDEPQLLPSSEPERIERNGKYNLRKS
Subjt:  MDAVYTDHDRRFSRLSLIDFASEDDLLISSPSCDFHDANSLGFAKEDEEQDNFERLETVESNRIEERTDGFEQREDEPQLLPSSEPERIERNGKYNLRKS

Query:  LAWDSAFLTGAGFLDPEELTSMIAPVGRHEKRVLPIIPEDVLKSSDSISTLGSEIMPLESIEGNLFED--------------------------------
        LAWDSAFLTGAGFLDPEELTSMIAPVGRHEKRVLPIIPEDVLKSSDSISTLGSEIMPLESIEGNLFED                                
Subjt:  LAWDSAFLTGAGFLDPEELTSMIAPVGRHEKRVLPIIPEDVLKSSDSISTLGSEIMPLESIEGNLFED--------------------------------

Query:  ------------IKDGSASSKQSDGLQGPGRTTKQNSSQPREAQQLKAMGKLPSNSTLTKRPSLAATVNDSTTSGAGSADGQDSVGLRSTTHKLSRLPTA
                    IKDGSASSKQSDGLQGP RTTKQNSSQPREAQQLKAMGKLPSNSTLTKRPSLAATVNDSTTSGAGSADGQDSVGLRSTTHKLSRLPTA
Subjt:  ------------IKDGSASSKQSDGLQGPGRTTKQNSSQPREAQQLKAMGKLPSNSTLTKRPSLAATVNDSTTSGAGSADGQDSVGLRSTTHKLSRLPTA

Query:  GTREQKTSSEVSISSSNKVGKSSSKDARIKTDCKASPSSRSTQKTQSRVAPKVKTRIGNSRLPSYTISQSKHSSGISSASSKSEWSTESSSNSTHELQSN
        GTREQKTSSEVS SSS KVGKSSSKDARIKTDCKASPSS STQKTQSRVAPKVKTRIGNSRLPSY+ISQSKHSSGISSASSKSEWSTESSSNSTHELQSN
Subjt:  GTREQKTSSEVSISSSNKVGKSSSKDARIKTDCKASPSSRSTQKTQSRVAPKVKTRIGNSRLPSYTISQSKHSSGISSASSKSEWSTESSSNSTHELQSN

Query:  SSRASLHSISSKRISVDSDASHDGSNPAVGSHTQTTGSIKPSGLRLPSPKIGYFDGGKTSIMKSNVSVPGGTTKIGAGNVSAIGGQNKTKPSKLQPVTVL
        SSRASLHSISSKRISVDSDASHDGSNPAVGSHTQTTGSIKPSGLRLPSPKIGYFDGGKTSIMKSNVSVPGGTT+IGAGNVSAIGGQNKTKPSKLQPVTVL
Subjt:  SSRASLHSISSKRISVDSDASHDGSNPAVGSHTQTTGSIKPSGLRLPSPKIGYFDGGKTSIMKSNVSVPGGTTKIGAGNVSAIGGQNKTKPSKLQPVTVL

Query:  PKSTTRAVSQPNLNLKSHKTTATKMSKTNELDQEVKELGCDGSNTDMHNSDVCAISIATREMSLENEAGSNANETTSANTDGELNGLQNFGT
        PKSTTRAVSQPNLNLKSHKTTATKMSKTNELDQEVKELGCDGSNTDMHNSDVCAISIATR+MSLENEAGSNANETTSANTDGELNGLQNFGT
Subjt:  PKSTTRAVSQPNLNLKSHKTTATKMSKTNELDQEVKELGCDGSNTDMHNSDVCAISIATREMSLENEAGSNANETTSANTDGELNGLQNFGT

XP_022934289.1 uncharacterized protein LOC111441484 isoform X2 [Cucurbita moschata]2.7e-27590.71Show/hide
Query:  MDAVYTDHDRRFSRLSLIDFASEDDLLISSPSCDFHDANSLGFAKEDEEQDNFERLETVESNRIEERTDGFEQREDEPQLLPSSEPERIERNGKYNLRKS
        MDAVYTDHDRRFSRLSLIDFASEDDLLISSPSCDFHDANSLGFAKEDEEQDNFERLETV+SNRIEERTDGFEQREDEPQLLPSSEPERIERNGKYNLRKS
Subjt:  MDAVYTDHDRRFSRLSLIDFASEDDLLISSPSCDFHDANSLGFAKEDEEQDNFERLETVESNRIEERTDGFEQREDEPQLLPSSEPERIERNGKYNLRKS

Query:  LAWDSAFLTGAGFLDPEELTSMIAPVGRHEKRVLPIIPEDVLKSSDSISTLGSEIMPLESIEGNLFED--------------------------------
        LAWDSAFLTGAGFLDPEELTSMIAPVGRHEKRVLPIIPEDVLKSSDSISTLGSEIMPLESIEGNLFED                                
Subjt:  LAWDSAFLTGAGFLDPEELTSMIAPVGRHEKRVLPIIPEDVLKSSDSISTLGSEIMPLESIEGNLFED--------------------------------

Query:  ------------IKDGSASSKQSDGLQGPGRTTKQNSSQPREAQQLKAMGKLPSNSTLTKRPSLAATVNDSTTSGAGSADGQDSVGLRSTTHKLSRLPTA
                    IKDGSASSKQSDGLQGP RTTKQNSSQPREAQ   AMGKLPSNSTLTKRPSLAATVNDSTTSGAGSADGQDSVGLRSTTHKLSRLPTA
Subjt:  ------------IKDGSASSKQSDGLQGPGRTTKQNSSQPREAQQLKAMGKLPSNSTLTKRPSLAATVNDSTTSGAGSADGQDSVGLRSTTHKLSRLPTA

Query:  GTREQKTSSEVSISSSNKVGKSSSKDARIKTDCKASPSSRSTQKTQSRVAPKVKTRIGNSRLPSYTISQSKHSSGISSASSKSEWSTESSSNSTHELQSN
        GTREQKTSSEVS SSS KVGKSSSKDARIKTDCKASPSS STQKTQSRVAPKVKTRIGNSRLPSY+ISQSKHSSGISSASSKSEWSTESSSNSTHELQSN
Subjt:  GTREQKTSSEVSISSSNKVGKSSSKDARIKTDCKASPSSRSTQKTQSRVAPKVKTRIGNSRLPSYTISQSKHSSGISSASSKSEWSTESSSNSTHELQSN

Query:  SSRASLHSISSKRISVDSDASHDGSNPAVGSHTQTTGSIKPSGLRLPSPKIGYFDGGKTSIMKSNVSVPGGTTKIGAGNVSAIGGQNKTKPSKLQPVTVL
        SSRASLHSISSKRISVDSDASHDGSNPAVGSHTQTTGSIKPSGLRLPSPKIGYFDGGKTSIMKSNVSVPGGTT+IGAGNVSAIGGQNKTKPSKLQPVTVL
Subjt:  SSRASLHSISSKRISVDSDASHDGSNPAVGSHTQTTGSIKPSGLRLPSPKIGYFDGGKTSIMKSNVSVPGGTTKIGAGNVSAIGGQNKTKPSKLQPVTVL

Query:  PKSTTRAVSQPNLNLKSHKTTATKMSKTNELDQEVKELGCDGSNTDMHNSDVCAISIATREMSLENEAGSNANETTSANTDGELNGLQNFGT
        PKSTTRAVSQPNLNLKSHKTTATKMSKTNELDQEVKELGCDGSNTDMHNSDVCAISIATR+MSLENEAGSNANETTSANTDGELNGLQNFGT
Subjt:  PKSTTRAVSQPNLNLKSHKTTATKMSKTNELDQEVKELGCDGSNTDMHNSDVCAISIATREMSLENEAGSNANETTSANTDGELNGLQNFGT

XP_023538747.1 SMY2 homolog 2-like [Cucurbita pepo subsp. pepo]3.9e-27490.22Show/hide
Query:  MDAVYTDHDRRFSRLSLIDFASEDDLLISSPSCDFHDANSLGFAKEDEEQDNFERLETVESNRIEERTDGFEQREDEPQLLPSSEPERIERNGKYNLRKS
        MDAVYTDHDRRFSRLSLIDFASEDDLLISSPSCDFHDANSLGFAKEDEEQDNFERLETV+SNRIEERTDGFEQREDEPQLLPSSEPERIERNGKYNLRKS
Subjt:  MDAVYTDHDRRFSRLSLIDFASEDDLLISSPSCDFHDANSLGFAKEDEEQDNFERLETVESNRIEERTDGFEQREDEPQLLPSSEPERIERNGKYNLRKS

Query:  LAWDSAFLTGAGFLDPEELTSMIAPVGRHEKRVLPIIPEDVLKSSDSISTLGSEIMPLESIEGNLFED--------------------------------
        LAWDSAFLTGAGFLDPEELTSMIAPVGRHEKRVLPIIPEDVLKSSDSISTLGSEIMPLESIEGNLFED                                
Subjt:  LAWDSAFLTGAGFLDPEELTSMIAPVGRHEKRVLPIIPEDVLKSSDSISTLGSEIMPLESIEGNLFED--------------------------------

Query:  ------------IKDGSASSKQSDGLQGPGRTTKQNSSQPREAQQLKAMGKLPSNSTLTKRPSLAATVNDSTTSGAGSADGQDSVGLRSTTHKLSRLPTA
                    IKDGSASSKQSDGLQGPGRTTK+NSSQPREAQQ KAMGKLPSNSTL+KRPSLAAT NDSTTSGAGSADGQDSV LRSTTHKLSRLPTA
Subjt:  ------------IKDGSASSKQSDGLQGPGRTTKQNSSQPREAQQLKAMGKLPSNSTLTKRPSLAATVNDSTTSGAGSADGQDSVGLRSTTHKLSRLPTA

Query:  GTREQKTSSEVSISSSNKVGKSSSKDARIKTDCKASPSSRSTQKTQSRVAPKVKTRIGNSRLPSYTISQSKHSSGISSASSKSEWSTESSSNSTHELQSN
        GTREQKTSSEVSISSSNKVGKSSSKDARIKTDCKASPSS STQKTQSRVAPKVKTRIGNSRLPSY+ISQSKHSSGISSASSKSEWSTE SSNSTHELQS 
Subjt:  GTREQKTSSEVSISSSNKVGKSSSKDARIKTDCKASPSSRSTQKTQSRVAPKVKTRIGNSRLPSYTISQSKHSSGISSASSKSEWSTESSSNSTHELQSN

Query:  SSRASLHSISSKRISVDSDASHDGSNPAVGSHTQTTGSIKPSGLRLPSPKIGYFDGGKTSIMKSNVSVPGGTTKIGAGNVSAIGGQNKTKPSKLQPVTVL
        SSRASLHSISSKRISVDSDASHDGSNPAVGSHTQTTGSIKPSGLRLPSPKIGYFDGGKTSI+KSNVSV GGTTKIGAGNVS IGGQNKTKPSKLQPVTVL
Subjt:  SSRASLHSISSKRISVDSDASHDGSNPAVGSHTQTTGSIKPSGLRLPSPKIGYFDGGKTSIMKSNVSVPGGTTKIGAGNVSAIGGQNKTKPSKLQPVTVL

Query:  PKSTTRAVSQPNLNLKSHKTTATKMSKTNELDQEVKELGCDGSNTDMHNSDVCAISIATRE-MSLENEAGSNANETTSANTDGELNGLQNFGT
        PKSTTRAVSQPNLNLKSHKTTATKMSKTNELDQEVKELGCDGSNTDMHNSDVCAISIATRE MSLENEAGSNANETTSANTDGELNGLQNFGT
Subjt:  PKSTTRAVSQPNLNLKSHKTTATKMSKTNELDQEVKELGCDGSNTDMHNSDVCAISIATRE-MSLENEAGSNANETTSANTDGELNGLQNFGT

TrEMBL top hitse value%identityAlignment
A0A6J1CV96 uncharacterized protein LOC111015105 isoform X26.1e-18063.78Show/hide
Query:  MDAVYTDHDRRFSRLSLIDFASEDDLLISSPSCDFHDANSLGFAKEDEEQDNFERLETVESNRIEERTDGFEQREDEPQLLPSSEPERIERNGKYNLRKS
        MD V TD+DRRFSRLSLIDFASEDD L+SSPSCD HD NSL   KEDEEQ+N  +LETV+S+RIE  TD  EQ+EDEPQLLPS +PER  +NGKYNLRKS
Subjt:  MDAVYTDHDRRFSRLSLIDFASEDDLLISSPSCDFHDANSLGFAKEDEEQDNFERLETVESNRIEERTDGFEQREDEPQLLPSSEPERIERNGKYNLRKS

Query:  LAWDSAFLTGAGFLDPEELTSMIAPVGRHEKRVLPIIPEDVLKSSDSISTLGSEIMPLESIEGNLFED--------------------------------
        LAWDSAF T AGFLDPEELTSMI  VG++EK VLPII EDV KSSDSISTL SEIMPLESIEGNLFED                                
Subjt:  LAWDSAFLTGAGFLDPEELTSMIAPVGRHEKRVLPIIPEDVLKSSDSISTLGSEIMPLESIEGNLFED--------------------------------

Query:  ------------IKDGSASSKQSDGLQGPGRTTKQNSSQPREAQQLKAMGKLPSNSTLTKRPSLA----ATVNDSTTSGAGSADGQDSVGLRSTTHKLSR
                    IKD S SSKQ+DGLQGPGRT K+NSSQPR  QQL A+ KLP NS LTKRPSL     A VN+ST+SGAG AD +DSV L++TT KL+R
Subjt:  ------------IKDGSASSKQSDGLQGPGRTTKQNSSQPREAQQLKAMGKLPSNSTLTKRPSLA----ATVNDSTTSGAGSADGQDSVGLRSTTHKLSR

Query:  LPTAGTREQKTSSEVSISSSNKVGKSSSKDARIKTDCKASPSSRSTQKTQSRVAPKV-KTRIGNSRLPSYTISQSKHSSGISSASSKSEWSTESSSNSTH
        + TA  REQ+TSSE S+SSS+KV KSSSKD + KTDCKASPS+   +KT SRVA +V +T  GNS + SY ISQ+KHSSGIS ASS SEWSTESSSNST 
Subjt:  LPTAGTREQKTSSEVSISSSNKVGKSSSKDARIKTDCKASPSSRSTQKTQSRVAPKV-KTRIGNSRLPSYTISQSKHSSGISSASSKSEWSTESSSNSTH

Query:  ELQSNSSRASLHSISSKRISVDSDASHDGSNPAVGSHTQTTG-------------------SIKPSGLRLPSPKIGYFDGGKTSIMKSNVSVPGGTTKIG
        E +SNSSRASLHSISSKRIS+DSD SH+G N +VG HTQTTG                   S+KPSGLRLPSPKIG+FDGGKTS MKSN++VPGG TK G
Subjt:  ELQSNSSRASLHSISSKRISVDSDASHDGSNPAVGSHTQTTG-------------------SIKPSGLRLPSPKIGYFDGGKTSIMKSNVSVPGGTTKIG

Query:  AGNVSAIGGQNKTKPSKLQPVTVLPKSTTRAVSQPNLNLKSHKTTATKMSKTNELDQEVKELGCDGSNTDMHNSDVCAISIATREMSLENEAGSNANETT
        AGNVS  GGQNKTKPSKLQPV +LPK+TTRA   P+ N KS+K +ATKMSKTN  D+++KE   +GSNTD+H+SDVCA S        + E G + NETT
Subjt:  AGNVSAIGGQNKTKPSKLQPVTVLPKSTTRAVSQPNLNLKSHKTTATKMSKTNELDQEVKELGCDGSNTDMHNSDVCAISIATREMSLENEAGSNANETT

Query:  SANTDGELNGLQN
        +A TD E N   N
Subjt:  SANTDGELNGLQN

A0A6J1F259 uncharacterized protein LOC111441484 isoform X21.3e-27590.71Show/hide
Query:  MDAVYTDHDRRFSRLSLIDFASEDDLLISSPSCDFHDANSLGFAKEDEEQDNFERLETVESNRIEERTDGFEQREDEPQLLPSSEPERIERNGKYNLRKS
        MDAVYTDHDRRFSRLSLIDFASEDDLLISSPSCDFHDANSLGFAKEDEEQDNFERLETV+SNRIEERTDGFEQREDEPQLLPSSEPERIERNGKYNLRKS
Subjt:  MDAVYTDHDRRFSRLSLIDFASEDDLLISSPSCDFHDANSLGFAKEDEEQDNFERLETVESNRIEERTDGFEQREDEPQLLPSSEPERIERNGKYNLRKS

Query:  LAWDSAFLTGAGFLDPEELTSMIAPVGRHEKRVLPIIPEDVLKSSDSISTLGSEIMPLESIEGNLFED--------------------------------
        LAWDSAFLTGAGFLDPEELTSMIAPVGRHEKRVLPIIPEDVLKSSDSISTLGSEIMPLESIEGNLFED                                
Subjt:  LAWDSAFLTGAGFLDPEELTSMIAPVGRHEKRVLPIIPEDVLKSSDSISTLGSEIMPLESIEGNLFED--------------------------------

Query:  ------------IKDGSASSKQSDGLQGPGRTTKQNSSQPREAQQLKAMGKLPSNSTLTKRPSLAATVNDSTTSGAGSADGQDSVGLRSTTHKLSRLPTA
                    IKDGSASSKQSDGLQGP RTTKQNSSQPREAQ   AMGKLPSNSTLTKRPSLAATVNDSTTSGAGSADGQDSVGLRSTTHKLSRLPTA
Subjt:  ------------IKDGSASSKQSDGLQGPGRTTKQNSSQPREAQQLKAMGKLPSNSTLTKRPSLAATVNDSTTSGAGSADGQDSVGLRSTTHKLSRLPTA

Query:  GTREQKTSSEVSISSSNKVGKSSSKDARIKTDCKASPSSRSTQKTQSRVAPKVKTRIGNSRLPSYTISQSKHSSGISSASSKSEWSTESSSNSTHELQSN
        GTREQKTSSEVS SSS KVGKSSSKDARIKTDCKASPSS STQKTQSRVAPKVKTRIGNSRLPSY+ISQSKHSSGISSASSKSEWSTESSSNSTHELQSN
Subjt:  GTREQKTSSEVSISSSNKVGKSSSKDARIKTDCKASPSSRSTQKTQSRVAPKVKTRIGNSRLPSYTISQSKHSSGISSASSKSEWSTESSSNSTHELQSN

Query:  SSRASLHSISSKRISVDSDASHDGSNPAVGSHTQTTGSIKPSGLRLPSPKIGYFDGGKTSIMKSNVSVPGGTTKIGAGNVSAIGGQNKTKPSKLQPVTVL
        SSRASLHSISSKRISVDSDASHDGSNPAVGSHTQTTGSIKPSGLRLPSPKIGYFDGGKTSIMKSNVSVPGGTT+IGAGNVSAIGGQNKTKPSKLQPVTVL
Subjt:  SSRASLHSISSKRISVDSDASHDGSNPAVGSHTQTTGSIKPSGLRLPSPKIGYFDGGKTSIMKSNVSVPGGTTKIGAGNVSAIGGQNKTKPSKLQPVTVL

Query:  PKSTTRAVSQPNLNLKSHKTTATKMSKTNELDQEVKELGCDGSNTDMHNSDVCAISIATREMSLENEAGSNANETTSANTDGELNGLQNFGT
        PKSTTRAVSQPNLNLKSHKTTATKMSKTNELDQEVKELGCDGSNTDMHNSDVCAISIATR+MSLENEAGSNANETTSANTDGELNGLQNFGT
Subjt:  PKSTTRAVSQPNLNLKSHKTTATKMSKTNELDQEVKELGCDGSNTDMHNSDVCAISIATREMSLENEAGSNANETTSANTDGELNGLQNFGT

A0A6J1F781 uncharacterized protein LOC111441484 isoform X19.7e-27991.22Show/hide
Query:  MDAVYTDHDRRFSRLSLIDFASEDDLLISSPSCDFHDANSLGFAKEDEEQDNFERLETVESNRIEERTDGFEQREDEPQLLPSSEPERIERNGKYNLRKS
        MDAVYTDHDRRFSRLSLIDFASEDDLLISSPSCDFHDANSLGFAKEDEEQDNFERLETV+SNRIEERTDGFEQREDEPQLLPSSEPERIERNGKYNLRKS
Subjt:  MDAVYTDHDRRFSRLSLIDFASEDDLLISSPSCDFHDANSLGFAKEDEEQDNFERLETVESNRIEERTDGFEQREDEPQLLPSSEPERIERNGKYNLRKS

Query:  LAWDSAFLTGAGFLDPEELTSMIAPVGRHEKRVLPIIPEDVLKSSDSISTLGSEIMPLESIEGNLFED--------------------------------
        LAWDSAFLTGAGFLDPEELTSMIAPVGRHEKRVLPIIPEDVLKSSDSISTLGSEIMPLESIEGNLFED                                
Subjt:  LAWDSAFLTGAGFLDPEELTSMIAPVGRHEKRVLPIIPEDVLKSSDSISTLGSEIMPLESIEGNLFED--------------------------------

Query:  ------------IKDGSASSKQSDGLQGPGRTTKQNSSQPREAQQLKAMGKLPSNSTLTKRPSLAATVNDSTTSGAGSADGQDSVGLRSTTHKLSRLPTA
                    IKDGSASSKQSDGLQGP RTTKQNSSQPREAQQLKAMGKLPSNSTLTKRPSLAATVNDSTTSGAGSADGQDSVGLRSTTHKLSRLPTA
Subjt:  ------------IKDGSASSKQSDGLQGPGRTTKQNSSQPREAQQLKAMGKLPSNSTLTKRPSLAATVNDSTTSGAGSADGQDSVGLRSTTHKLSRLPTA

Query:  GTREQKTSSEVSISSSNKVGKSSSKDARIKTDCKASPSSRSTQKTQSRVAPKVKTRIGNSRLPSYTISQSKHSSGISSASSKSEWSTESSSNSTHELQSN
        GTREQKTSSEVS SSS KVGKSSSKDARIKTDCKASPSS STQKTQSRVAPKVKTRIGNSRLPSY+ISQSKHSSGISSASSKSEWSTESSSNSTHELQSN
Subjt:  GTREQKTSSEVSISSSNKVGKSSSKDARIKTDCKASPSSRSTQKTQSRVAPKVKTRIGNSRLPSYTISQSKHSSGISSASSKSEWSTESSSNSTHELQSN

Query:  SSRASLHSISSKRISVDSDASHDGSNPAVGSHTQTTGSIKPSGLRLPSPKIGYFDGGKTSIMKSNVSVPGGTTKIGAGNVSAIGGQNKTKPSKLQPVTVL
        SSRASLHSISSKRISVDSDASHDGSNPAVGSHTQTTGSIKPSGLRLPSPKIGYFDGGKTSIMKSNVSVPGGTT+IGAGNVSAIGGQNKTKPSKLQPVTVL
Subjt:  SSRASLHSISSKRISVDSDASHDGSNPAVGSHTQTTGSIKPSGLRLPSPKIGYFDGGKTSIMKSNVSVPGGTTKIGAGNVSAIGGQNKTKPSKLQPVTVL

Query:  PKSTTRAVSQPNLNLKSHKTTATKMSKTNELDQEVKELGCDGSNTDMHNSDVCAISIATREMSLENEAGSNANETTSANTDGELNGLQNFGT
        PKSTTRAVSQPNLNLKSHKTTATKMSKTNELDQEVKELGCDGSNTDMHNSDVCAISIATR+MSLENEAGSNANETTSANTDGELNGLQNFGT
Subjt:  PKSTTRAVSQPNLNLKSHKTTATKMSKTNELDQEVKELGCDGSNTDMHNSDVCAISIATREMSLENEAGSNANETTSANTDGELNGLQNFGT

A0A6J1KYE8 uncharacterized protein LOC111498278 isoform X22.3e-24889.19Show/hide
Query:  MDAVYTDHDRRFSRLSLIDFASEDDLLISSPSCDFHDANSLGFAKEDEEQDNFERLETVESNRIEERTDGFEQREDEPQLLPSSEPERIERNGKYNLRKS
        MDAVYTDHDRRFSRLSLIDFASEDDLLISSPSCDFHDANSLGFAKEDEEQDNFERLETV+SNRIEERTDGFEQREDEPQLLPSSEPERIERNGKYNLRKS
Subjt:  MDAVYTDHDRRFSRLSLIDFASEDDLLISSPSCDFHDANSLGFAKEDEEQDNFERLETVESNRIEERTDGFEQREDEPQLLPSSEPERIERNGKYNLRKS

Query:  LAWDSAFLTGAGFLDPEELTSMIAPVGRHEKRVLPIIPEDVLKSSDSISTLGSEIMPLESIEGNLFED--------------------------------
        LAWDSAFLTGAGFLDPEELTSMIAPVGR+EKRVLPIIPEDVLKSSDSISTLGSEIMPLESIEGNLFED                                
Subjt:  LAWDSAFLTGAGFLDPEELTSMIAPVGRHEKRVLPIIPEDVLKSSDSISTLGSEIMPLESIEGNLFED--------------------------------

Query:  ------------IKDGSASSKQSDGLQGPGRTTKQNSSQPREAQQLKAMGKLPSNSTLTKRPSLAATVNDSTTSGAGSADGQDSVGLRSTTHKLSRLPTA
                    IKDGSASSKQSDGLQGPGRT KQNSSQPREAQ   AMGKLPSNSTLTKRPSLAATVNDSTTSGAGSADGQDSVGLRSTTHKLSRLPTA
Subjt:  ------------IKDGSASSKQSDGLQGPGRTTKQNSSQPREAQQLKAMGKLPSNSTLTKRPSLAATVNDSTTSGAGSADGQDSVGLRSTTHKLSRLPTA

Query:  GTREQKTSSEVSISSSNKVGKSSSKDARIKTDCKASPSSRSTQKTQSRVAPKVKTRIGNSRLPSYTISQSKHSSGISSASSKSEWSTESSSNSTHELQSN
        GTREQKTSSEVSISSSNKVGKSSSKDARIKTDCKASPSS STQKTQSRVAPKVKTRIGNSRLPSY+ISQSKHSSGISSASSKSEWSTESSSNSTHELQSN
Subjt:  GTREQKTSSEVSISSSNKVGKSSSKDARIKTDCKASPSSRSTQKTQSRVAPKVKTRIGNSRLPSYTISQSKHSSGISSASSKSEWSTESSSNSTHELQSN

Query:  SSRASLHSISSKRISVDSDASHDGSNPAVGSHTQTTGSIKPSGLRLPSPKIGYFDGGKTSIMKSNVSVPGGTTKIGAGNVSAIGGQNKTKPSKLQPVTVL
        SSRASLHSIS KRISVDSD SHDGSNPAVG+HTQT GSIKPSGLRLPSPKIGYFDGGKTSI+ SNVSVPGGTTKIGAGNVSAIGGQNKTKPSKLQPVTVL
Subjt:  SSRASLHSISSKRISVDSDASHDGSNPAVGSHTQTTGSIKPSGLRLPSPKIGYFDGGKTSIMKSNVSVPGGTTKIGAGNVSAIGGQNKTKPSKLQPVTVL

Query:  PKSTTRAVSQPNLNLKSHKTTATKMSKTNELDQEVKELGCDGSNTD
        PKSTTRAVSQPNLNLKSHKTTATKMSKTNELDQEVKE GCDGSNTD
Subjt:  PKSTTRAVSQPNLNLKSHKTTATKMSKTNELDQEVKELGCDGSNTD

A0A6J1L1F4 uncharacterized protein LOC111498278 isoform X15.0e-25189.56Show/hide
Query:  MDAVYTDHDRRFSRLSLIDFASEDDLLISSPSCDFHDANSLGFAKEDEEQDNFERLETVESNRIEERTDGFEQREDEPQLLPSSEPERIERNGKYNLRKS
        MDAVYTDHDRRFSRLSLIDFASEDDLLISSPSCDFHDANSLGFAKEDEEQDNFERLETV+SNRIEERTDGFEQREDEPQLLPSSEPERIERNGKYNLRKS
Subjt:  MDAVYTDHDRRFSRLSLIDFASEDDLLISSPSCDFHDANSLGFAKEDEEQDNFERLETVESNRIEERTDGFEQREDEPQLLPSSEPERIERNGKYNLRKS

Query:  LAWDSAFLTGAGFLDPEELTSMIAPVGRHEKRVLPIIPEDVLKSSDSISTLGSEIMPLESIEGNLFED--------------------------------
        LAWDSAFLTGAGFLDPEELTSMIAPVGR+EKRVLPIIPEDVLKSSDSISTLGSEIMPLESIEGNLFED                                
Subjt:  LAWDSAFLTGAGFLDPEELTSMIAPVGRHEKRVLPIIPEDVLKSSDSISTLGSEIMPLESIEGNLFED--------------------------------

Query:  ------------IKDGSASSKQSDGLQGPGRTTKQNSSQPREAQQLKAMGKLPSNSTLTKRPSLAATVNDSTTSGAGSADGQDSVGLRSTTHKLSRLPTA
                    IKDGSASSKQSDGLQGPGRT KQNSSQPREAQ+LKAMGKLPSNSTLTKRPSLAATVNDSTTSGAGSADGQDSVGLRSTTHKLSRLPTA
Subjt:  ------------IKDGSASSKQSDGLQGPGRTTKQNSSQPREAQQLKAMGKLPSNSTLTKRPSLAATVNDSTTSGAGSADGQDSVGLRSTTHKLSRLPTA

Query:  GTREQKTSSEVSISSSNKVGKSSSKDARIKTDCKASPSSRSTQKTQSRVAPKVKTRIGNSRLPSYTISQSKHSSGISSASSKSEWSTESSSNSTHELQSN
        GTREQKTSSEVSISSSNKVGKSSSKDARIKTDCKASPSS STQKTQSRVAPKVKTRIGNSRLPSY+ISQSKHSSGISSASSKSEWSTESSSNSTHELQSN
Subjt:  GTREQKTSSEVSISSSNKVGKSSSKDARIKTDCKASPSSRSTQKTQSRVAPKVKTRIGNSRLPSYTISQSKHSSGISSASSKSEWSTESSSNSTHELQSN

Query:  SSRASLHSISSKRISVDSDASHDGSNPAVGSHTQTTGSIKPSGLRLPSPKIGYFDGGKTSIMKSNVSVPGGTTKIGAGNVSAIGGQNKTKPSKLQPVTVL
        SSRASLHSIS KRISVDSD SHDGSNPAVG+HTQT GSIKPSGLRLPSPKIGYFDGGKTSI+ SNVSVPGGTTKIGAGNVSAIGGQNKTKPSKLQPVTVL
Subjt:  SSRASLHSISSKRISVDSDASHDGSNPAVGSHTQTTGSIKPSGLRLPSPKIGYFDGGKTSIMKSNVSVPGGTTKIGAGNVSAIGGQNKTKPSKLQPVTVL

Query:  PKSTTRAVSQPNLNLKSHKTTATKMSKTNELDQEVKELGCDGSNTD
        PKSTTRAVSQPNLNLKSHKTTATKMSKTNELDQEVKE GCDGSNTD
Subjt:  PKSTTRAVSQPNLNLKSHKTTATKMSKTNELDQEVKELGCDGSNTD

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G37070.1 unknown protein1.6e-1027.51Show/hide
Query:  LSLIDFASEDDLLISSPSCDFHDANSLGFAKEDEEQDNFERLETVESNRIEERTDGFEQREDEPQLLPSSEPERIERNGKYNLRKSLAWDSAFLTGAGFL
        LSLIDF++EDD L+ S    F D  +  F+  D+E D    L     N  +E T G    E++       EP +I    K NLRKSLAWD AF T AG L
Subjt:  LSLIDFASEDDLLISSPSCDFHDANSLGFAKEDEEQDNFERLETVESNRIEERTDGFEQREDEPQLLPSSEPERIERNGKYNLRKSLAWDSAFLTGAGFL

Query:  DPEELTSMIAPVGRHEKRVLPIIPEDVLKSSDSISTL--------GSEIMPLESIEGNLFEDIKDGSA-SSKQSDGLQGPGRTTKQNSSQPREAQQLKAM
        +P+EL+SM+       ++ LP + ED+ +S++S+STL        G E    ++   +  +D+    A  S  +  L  P    K   +  R+   +++ 
Subjt:  DPEELTSMIAPVGRHEKRVLPIIPEDVLKSSDSISTL--------GSEIMPLESIEGNLFEDIKDGSA-SSKQSDGLQGPGRTTKQNSSQPREAQQLKAM

Query:  G-----KLP----SNSTLTKRPSLAATVNDSTTSGAGSADGQDSVGLRSTTHK--------LSRLPTAGTREQKTSSEV---------SISSSNKVGKS-
        G     K P     ++T   RPS       S  S    A    +   + T  K         SR+P +       S+ V         S++S N++  S 
Subjt:  G-----KLP----SNSTLTKRPSLAATVNDSTTSGAGSADGQDSVGLRSTTHK--------LSRLPTAGTREQKTSSEV---------SISSSNKVGKS-

Query:  SSKDARIKTDCKAS--PSSRST--QKTQS-RVAPKVKTRIGNSRLPSYTISQSK---------HSSGISSASSKSEWSTESSSNSTHELQSNSSRASLHS
        SS ++ +     AS  PS  S   +K QS R+A         S   S  I Q K         +   +S  SS  +WS+ES    T    +  ++ S+H 
Subjt:  SSKDARIKTDCKAS--PSSRST--QKTQS-RVAPKVKTRIGNSRLPSYTISQSK---------HSSGISSASSKSEWSTESSSNSTHELQSNSSRASLHS

Query:  ISSKRISVDSDASHDGSNPAVGSHTQ-----TTGSIKPSGLRLPSPKIGYFDGGKTSIMKSNVSVPGGTTKIGAGNVSAIGGQNKTKPSKLQP------V
         +       +      +N    S  Q     +   +KP+GLR+PSPK+GYFDG + S+ ++    P G+     G VS +       P +  P       
Subjt:  ISSKRISVDSDASHDGSNPAVGSHTQ-----TTGSIKPSGLRLPSPKIGYFDGGKTSIMKSNVSVPGGTTKIGAGNVSAIGGQNKTKPSKLQP------V

Query:  TVLPKSTTRAVSQPN--LNLKSHKTTATKMSKTNELDQ
          + K+  R VS+ +  +   S K T  + SK +  +Q
Subjt:  TVLPKSTTRAVSQPN--LNLKSHKTTATKMSKTNELDQ

AT3G53320.1 unknown protein1.7e-2027.65Show/hide
Query:  LSLIDFASEDDLLISSPSCDFHDANSLGFAKEDEEQDNFERLETVESNRIEERTDGFEQREDEPQLLPSSEPERIERNGKYNLRKSLAWDSAFLTGAGFL
        L LID A EDD L+ S   +F + +      ++++  NF R      + I       E++E+  Q   S EPE++ + GKYNLRKSLAWD+ F T AG L
Subjt:  LSLIDFASEDDLLISSPSCDFHDANSLGFAKEDEEQDNFERLETVESNRIEERTDGFEQREDEPQLLPSSEPERIERNGKYNLRKSLAWDSAFLTGAGFL

Query:  DPEELTSMIAPVGRHEKRVLPIIPEDVLKSSDSISTLGSEIMPLESIEGNLFEDIK----------------------------------------DGSA
        +PEEL+SM+    +  K+ LP I ED+ +S++SIST  S+     S E  LFED++                                         G  
Subjt:  DPEELTSMIAPVGRHEKRVLPIIPEDVLKSSDSISTLGSEIMPLESIEGNLFEDIK----------------------------------------DGSA

Query:  SSKQS----DGLQGPGRTTKQNSSQPREAQQLKAMGKLPSNSTLTKRPSLAATVNDSTTSGAGSAD--------GQDSVGLRSTTHKLSR--LPTAGTRE
         SK S      +QGPG+ TK    QP   + L      P N     RP    + N S+   + +          G++ +G R +  + ++  LP  G   
Subjt:  SSKQS----DGLQGPGRTTKQNSSQPREAQQLKAMGKLPSNSTLTKRPSLAATVNDSTTSGAGSAD--------GQDSVGLRSTTHKLSR--LPTAGTRE

Query:  QKTSSEVSISSSNKV-----------GKSSSKDARIKTDCKASPSSRSTQKTQSRVAPKVKTR--IGNSRLP---SYTISQSKHSSGISSASSKSEWSTE
         K+SS  S +S N++             SSS   +   D     +  S++ +   +A +  +R  +G  R+P   +   S+ K SS + +A S S++S+E
Subjt:  QKTSSEVSISSSNKV-----------GKSSSKDARIKTDCKASPSSRSTQKTQSRVAPKVKTR--IGNSRLP---SYTISQSKHSSGISSASSKSEWSTE

Query:  SSSNSTHELQSNSSRASLHS----------------ISSKRISVDSDASHDGSN--PAVGSHTQTTGSIKPSGLRLPSPKIGYFDGGKTSIMKSNVSVPG
        SS  S     +N ++ ++                   +SK  SV    + +G+    A+      + S KPSGLR+PSPKIG+FDG +     S     G
Subjt:  SSSNSTHELQSNSSRASLHS----------------ISSKRISVDSDASHDGSN--PAVGSHTQTTGSIKPSGLRLPSPKIGYFDGGKTSIMKSNVSVPG

Query:  GTTKIGAGNVSAIGGQNKTKPSKLQPVT
        G T+     +           SK+  V+
Subjt:  GTTKIGAGNVSAIGGQNKTKPSKLQPVT


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGACGCCGTATATACTGATCACGATCGCCGTTTTAGCCGTCTCAGCCTCATCGATTTCGCTTCCGAGGACGATTTGCTAATCTCTTCTCCTTCCTGCGACTTCCATGA
CGCCAATTCTTTAGGCTTTGCAAAGGAGGATGAGGAACAGGACAATTTTGAACGATTAGAGACTGTGGAATCTAACAGAATAGAGGAAAGAACAGATGGCTTCGAACAGA
GAGAGGATGAACCTCAATTACTCCCATCTTCCGAACCGGAAAGGATCGAAAGAAATGGGAAATATAACTTGCGTAAGAGTTTAGCATGGGATAGTGCTTTCTTAACTGGT
GCAGGGTTTTTGGATCCTGAAGAGTTAACTAGCATGATTGCACCAGTGGGCAGGCATGAAAAACGTGTATTACCCATAATTCCAGAAGATGTTCTGAAATCTTCAGATTC
AATTTCTACATTGGGTAGTGAAATTATGCCATTGGAAAGCATTGAGGGCAATTTATTCGAAGATATAAAGGACGGGTCTGCGTCCAGTAAGCAAAGTGATGGGTTGCAAG
GACCTGGGAGAACCACTAAGCAGAATTCTTCACAACCACGCGAGGCACAGCAACTGAAGGCAATGGGCAAGCTTCCTTCAAACTCAACGTTGACCAAGAGGCCTTCACTT
GCCGCCACTGTGAATGATAGTACTACGAGTGGTGCAGGATCTGCAGATGGGCAGGATTCAGTTGGTCTTAGATCTACTACACACAAGCTTTCCCGACTTCCAACTGCCGG
CACGAGGGAGCAAAAGACTTCCTCTGAAGTTTCTATAAGTTCATCTAACAAGGTCGGTAAATCTTCCTCCAAAGATGCAAGGATAAAAACAGATTGTAAGGCTTCACCTT
CCTCTCGTAGCACTCAGAAAACACAATCTCGAGTTGCACCTAAGGTCAAGACTCGTATTGGGAATTCTCGTCTCCCTTCCTACACGATATCTCAAAGTAAGCATTCTTCA
GGAATATCATCTGCTAGCTCTAAAAGTGAGTGGTCGACAGAGTCGTCATCAAATTCCACCCATGAACTTCAATCTAATAGCTCAAGAGCCAGCCTTCACTCAATTTCGAG
CAAAAGAATCTCCGTAGACAGCGATGCATCTCATGATGGAAGTAACCCTGCTGTTGGATCCCATACTCAAACTACTGGCTCAATAAAGCCTTCGGGCCTTCGATTGCCAT
CACCTAAAATCGGGTACTTTGATGGGGGGAAAACTTCTATCATGAAATCTAATGTGTCTGTACCTGGTGGCACGACTAAGATTGGAGCTGGAAATGTCAGCGCAATTGGA
GGCCAAAATAAGACCAAGCCTTCGAAGCTTCAACCTGTTACAGTGTTGCCCAAGAGCACAACTCGTGCTGTTAGTCAGCCTAATTTGAATTTGAAATCTCATAAAACCAC
TGCAACTAAGATGTCCAAAACGAATGAACTCGACCAAGAAGTCAAAGAGCTTGGTTGTGATGGATCAAATACTGATATGCATAATTCAGATGTATGTGCGATTTCAATTG
CCACGAGGGAGATGAGCCTAGAAAATGAAGCAGGAAGTAATGCAAACGAAACGACATCAGCGAATACTGATGGAGAACTTAATGGATTACAAAACTTCGGGACCTGA
mRNA sequenceShow/hide mRNA sequence
TCCCCTAAACCTCCCGCCCCTCTCTCACCGCCTTCGATTTTTCAAATCATTCAATCCGAGTCGGACAGTTCCTCACCTCTTCCGCGCTTCTTCCAGATCTGTCTCATGGA
CGCCGTATATACTGATCACGATCGCCGTTTTAGCCGTCTCAGCCTCATCGATTTCGCTTCCGAGGACGATTTGCTAATCTCTTCTCCTTCCTGCGACTTCCATGACGCCA
ATTCTTTAGGCTTTGCAAAGGAGGATGAGGAACAGGACAATTTTGAACGATTAGAGACTGTGGAATCTAACAGAATAGAGGAAAGAACAGATGGCTTCGAACAGAGAGAG
GATGAACCTCAATTACTCCCATCTTCCGAACCGGAAAGGATCGAAAGAAATGGGAAATATAACTTGCGTAAGAGTTTAGCATGGGATAGTGCTTTCTTAACTGGTGCAGG
GTTTTTGGATCCTGAAGAGTTAACTAGCATGATTGCACCAGTGGGCAGGCATGAAAAACGTGTATTACCCATAATTCCAGAAGATGTTCTGAAATCTTCAGATTCAATTT
CTACATTGGGTAGTGAAATTATGCCATTGGAAAGCATTGAGGGCAATTTATTCGAAGATATAAAGGACGGGTCTGCGTCCAGTAAGCAAAGTGATGGGTTGCAAGGACCT
GGGAGAACCACTAAGCAGAATTCTTCACAACCACGCGAGGCACAGCAACTGAAGGCAATGGGCAAGCTTCCTTCAAACTCAACGTTGACCAAGAGGCCTTCACTTGCCGC
CACTGTGAATGATAGTACTACGAGTGGTGCAGGATCTGCAGATGGGCAGGATTCAGTTGGTCTTAGATCTACTACACACAAGCTTTCCCGACTTCCAACTGCCGGCACGA
GGGAGCAAAAGACTTCCTCTGAAGTTTCTATAAGTTCATCTAACAAGGTCGGTAAATCTTCCTCCAAAGATGCAAGGATAAAAACAGATTGTAAGGCTTCACCTTCCTCT
CGTAGCACTCAGAAAACACAATCTCGAGTTGCACCTAAGGTCAAGACTCGTATTGGGAATTCTCGTCTCCCTTCCTACACGATATCTCAAAGTAAGCATTCTTCAGGAAT
ATCATCTGCTAGCTCTAAAAGTGAGTGGTCGACAGAGTCGTCATCAAATTCCACCCATGAACTTCAATCTAATAGCTCAAGAGCCAGCCTTCACTCAATTTCGAGCAAAA
GAATCTCCGTAGACAGCGATGCATCTCATGATGGAAGTAACCCTGCTGTTGGATCCCATACTCAAACTACTGGCTCAATAAAGCCTTCGGGCCTTCGATTGCCATCACCT
AAAATCGGGTACTTTGATGGGGGGAAAACTTCTATCATGAAATCTAATGTGTCTGTACCTGGTGGCACGACTAAGATTGGAGCTGGAAATGTCAGCGCAATTGGAGGCCA
AAATAAGACCAAGCCTTCGAAGCTTCAACCTGTTACAGTGTTGCCCAAGAGCACAACTCGTGCTGTTAGTCAGCCTAATTTGAATTTGAAATCTCATAAAACCACTGCAA
CTAAGATGTCCAAAACGAATGAACTCGACCAAGAAGTCAAAGAGCTTGGTTGTGATGGATCAAATACTGATATGCATAATTCAGATGTATGTGCGATTTCAATTGCCACG
AGGGAGATGAGCCTAGAAAATGAAGCAGGAAGTAATGCAAACGAAACGACATCAGCGAATACTGATGGAGAACTTAATGGATTACAAAACTTCGGGACCTGA
Protein sequenceShow/hide protein sequence
MDAVYTDHDRRFSRLSLIDFASEDDLLISSPSCDFHDANSLGFAKEDEEQDNFERLETVESNRIEERTDGFEQREDEPQLLPSSEPERIERNGKYNLRKSLAWDSAFLTG
AGFLDPEELTSMIAPVGRHEKRVLPIIPEDVLKSSDSISTLGSEIMPLESIEGNLFEDIKDGSASSKQSDGLQGPGRTTKQNSSQPREAQQLKAMGKLPSNSTLTKRPSL
AATVNDSTTSGAGSADGQDSVGLRSTTHKLSRLPTAGTREQKTSSEVSISSSNKVGKSSSKDARIKTDCKASPSSRSTQKTQSRVAPKVKTRIGNSRLPSYTISQSKHSS
GISSASSKSEWSTESSSNSTHELQSNSSRASLHSISSKRISVDSDASHDGSNPAVGSHTQTTGSIKPSGLRLPSPKIGYFDGGKTSIMKSNVSVPGGTTKIGAGNVSAIG
GQNKTKPSKLQPVTVLPKSTTRAVSQPNLNLKSHKTTATKMSKTNELDQEVKELGCDGSNTDMHNSDVCAISIATREMSLENEAGSNANETTSANTDGELNGLQNFGT