CuGenDBv2

Spg025521 (gene) of Sponge gourd (cylindrica) v1 genome

Gene ID: Spg025521
Organism: Luffa cylindrica (Sponge gourd (cylindrica) v1)
Description: SAGA-Tad1 domain-containing protein
Genome location: scaffold13:31535575..31536831
RNA-Seq Expression: Spg025521
Synteny: Spg025521
Gene Ontology terms:
    GO:0006357 - regulation of transcription by RNA polymerase II (biological process)
    GO:0045893 - positive regulation of transcription, DNA-templated (biological process)
    GO:0000124 - SAGA complex (cellular component)
    GO:0003713 - transcription coactivator activity (molecular function)
InterPro domains: IPR024738 - Transcriptional coactivator Hfi1/Transcriptional adapter 1


Homology
GenBank top hits (e value, % identity, alignment)
XP_008445087.1 PREDICTED: uncharacterized protein LOC103488231 [Cucumis melo] | e value: 1.9e-207 | % identity: 88.07
Query:  MQPQQSLRIDLGELKSQIVKKLGTDRSKRYFFYLNRFLSQKLSKNEFDKLCCRVLGRENLWLHNQLIQSILKNACQAKAAPPIPVAGYPKTSTQSAKISP
        MQPQQSLRIDLGELKSQIVKKLG DRSKRYFFYLNRFLSQKLSKNEFDK CCRVLGRENLWLHNQLIQSILKNACQAKAAPPIPVAGYPKTSTQSAKISP
Subjt:  MQPQQSLRIDLGELKSQIVKKLGTDRSKRYFFYLNRFLSQKLSKNEFDKLCCRVLGRENLWLHNQLIQSILKNACQAKAAPPIPVAGYPKTSTQSAKISP

Query:  VIEDGNED-GAVFPTSTQSIPIWSNGGFPMSPRKSRSGIRDRKLKDRPSPLGPNGKVECISHQSAGKEDGSCRVTMDNGDATLCDYQRPVQHLQGVAELP
        ++EDGNED GAVFPTSTQ+IP WSNG   +SPRK RSGIRDRKLKDRPS LGPNGKVECISH SA          MDNGDATLCDY+RPVQHLQGVAELP
Subjt:  VIEDGNED-GAVFPTSTQSIPIWSNGGFPMSPRKSRSGIRDRKLKDRPSPLGPNGKVECISHQSAGKEDGSCRVTMDNGDATLCDYQRPVQHLQGVAELP

Query:  ENNVEARVQHPSGKQVLHNKIQVEGTKVEDREEAGRLNNSSLLRSRLLAPLGIPFCSASIGGARKARPVDCGSDFSFNDIGHLLDTESLKRRMEQIAAVQ
        ENN+E RV  PSGKQVLHNKIQVE TKVEDREEAG+ N+SSLLRSRLLAPLGIPFCSAS GG  K RPVDCG DFSF D+GHLLDTESL+RRMEQIAAVQ
Subjt:  ENNVEARVQHPSGKQVLHNKIQVEGTKVEDREEAGRLNNSSLLRSRLLAPLGIPFCSASIGGARKARPVDCGSDFSFNDIGHLLDTESLKRRMEQIAAVQ

Query:  GLGSVSADCANVLNKVLDVYLKQLIRSCVDLVGAWPAYEPEKPLAHRQQIQGKAINGMLPNNQLHGRHSNGNGEVMHEHRLQCSISLLDFKVAMELNPKQ
        GLGSVSADCAN+LNKVLDVYLKQLIRSCVDLVGAWPAYEPEKPLAH+QQIQGK INGMLPNNQLHGRHSNGN EV+HEHRLQCSISLLDFKVAMELNP Q
Subjt:  GLGSVSADCANVLNKVLDVYLKQLIRSCVDLVGAWPAYEPEKPLAHRQQIQGKAINGMLPNNQLHGRHSNGNGEVMHEHRLQCSISLLDFKVAMELNPKQ

Query:  LGEDWPLLMEKIRMRAFEE
        LGEDWPLL+EKI MRAF E
Subjt:  LGEDWPLLMEKIRMRAFEE

XP_022132327.1 uncharacterized protein LOC111005206 [Momordica charantia] | e value: 3.9e-205 | % identity: 87.17
Query:  MQPQQSLRIDLGELKSQIVKKLGTDRSKRYFFYLNRFLSQKLSKNEFDKLCCRVLGRENLWLHNQLIQSILKNACQAKAAPPIPVAGYPKTSTQSAKISP
        MQPQQSLRIDLGELKSQIVKKLGTDRSKRYFFYLNRFLSQKLSKNEFDKLC RVLGR+NLWLHNQLIQSILKNACQAKAAPP+PVAGYPKTSTQSAK+SP
Subjt:  MQPQQSLRIDLGELKSQIVKKLGTDRSKRYFFYLNRFLSQKLSKNEFDKLCCRVLGRENLWLHNQLIQSILKNACQAKAAPPIPVAGYPKTSTQSAKISP

Query:  VIEDGNED-GAVFPTSTQSIPIWSNGGFPMSPRKSRSGIRDRKLKDRPSPLGPNGKVECISHQSAGKEDGSCRVTMDNGDATLCDYQRPVQHLQGVAELP
        VIEDGNED GAV+PTSTQSIPIWSNGGFP SPRKSRSGIRDRKLKDRPSPLGPNGKVECISHQSAGK+DGSC++ M NGDATLCDYQRPVQHLQGVAELP
Subjt:  VIEDGNED-GAVFPTSTQSIPIWSNGGFPMSPRKSRSGIRDRKLKDRPSPLGPNGKVECISHQSAGKEDGSCRVTMDNGDATLCDYQRPVQHLQGVAELP

Query:  ENNVEARVQHPSGKQVLHNKIQVEGTKVEDREEAGRLNNSSLLRSRLLAPLGIPFCSASIGGARKARPVDCGSDF-SFNDIGHLLDTESLKRRMEQIAAV
        ENN+EAR++ P+GKQVL+NKI  EGTKV DREEAG   +S LL+SRLLAPLGIPFCSASIGGARKARP D G DF SF+DIGHL DTESL+RRMEQIAAV
Subjt:  ENNVEARVQHPSGKQVLHNKIQVEGTKVEDREEAGRLNNSSLLRSRLLAPLGIPFCSASIGGARKARPVDCGSDF-SFNDIGHLLDTESLKRRMEQIAAV

Query:  QGLGSVSADCANVLNKVLDVYLKQLIRSCVDLVGAWP-AYEPEKPLAHRQQIQGKAINGMLPNNQLHGRHSNGNGEVMHEHRLQCSISLLDFKVAMELNP
         GLGSVSAD AN+LNKVLDVYLKQLIRSCV LVG  P   EPEKPL  + Q+QGK INGMLPNNQLHGRHSNG+ EVMHEHRL+CS+SLLDFKVAMELNP
Subjt:  QGLGSVSADCANVLNKVLDVYLKQLIRSCVDLVGAWP-AYEPEKPLAHRQQIQGKAINGMLPNNQLHGRHSNGNGEVMHEHRLQCSISLLDFKVAMELNP

Query:  KQLGEDWPLLMEKIRMRAFEE
        KQLGEDWPLL+EKIRMRAFEE
Subjt:  KQLGEDWPLLMEKIRMRAFEE

XP_022962598.1 uncharacterized protein LOC111463000 [Cucurbita moschata] | e value: 3.0e-205 | % identity: 87.35
Query:  MQPQQSLRIDLGELKSQIVKKLGTDRSKRYFFYLNRFLSQKLSKNEFDKLCCRVLGRENLWLHNQLIQSILKNACQAKAAPPIPVAGYPKTSTQSAKISP
        MQPQQSLRIDL ELKSQIVKKLGTDRSKRYFFYLNRFLSQKLSKNEFDKLCCRVLGRENLWLHNQLIQSILKNACQAKAAPPIP AGYPKTSTQ+AKISP
Subjt:  MQPQQSLRIDLGELKSQIVKKLGTDRSKRYFFYLNRFLSQKLSKNEFDKLCCRVLGRENLWLHNQLIQSILKNACQAKAAPPIPVAGYPKTSTQSAKISP

Query:  VIEDGNED-GAVFPTSTQSIPIWSNGGFPMSPRKSRSGIRDRKLKDRPSPLGPNGKVECISHQSAGKEDGSCRVTMDNGDATLCDYQRPVQHLQGVAELP
        VIEDGNED GAVFPTSTQ IPIWSN GFP+SPRK RSGIRDRKLKDRPS L PN KVECIS QSA KEDGSCR+ MDNG+AT CDYQRPVQHLQGV ELP
Subjt:  VIEDGNED-GAVFPTSTQSIPIWSNGGFPMSPRKSRSGIRDRKLKDRPSPLGPNGKVECISHQSAGKEDGSCRVTMDNGDATLCDYQRPVQHLQGVAELP

Query:  ENNVEARVQHPSGKQVLHNKIQVEGTKVEDREEAGRLNNSSLLRSRLLAPLGIPFCSASIGGARKARPVDCGSDFSFNDIGHLLDTESLKRRMEQIAAVQ
        ENN+EARVQ PSGKQVL         +VEDREEA + N SSLLRSRLLAPLGIPFCSASIGGA K RPVDCG +FSF+D+GHLLDTESL+RRMEQIAAVQ
Subjt:  ENNVEARVQHPSGKQVLHNKIQVEGTKVEDREEAGRLNNSSLLRSRLLAPLGIPFCSASIGGARKARPVDCGSDFSFNDIGHLLDTESLKRRMEQIAAVQ

Query:  GLGSVSADCANVLNKVLDVYLKQLIRSCVDLVGAWPAYEPEKPLAHRQQIQGKAINGMLPNNQLHGRHSNGNGEVMHEHRLQCSISLLDFKVAMELNPKQ
        GLGSVSADCAN+LNKVLDVYLKQLIRSCVDLVGAWPA+EPEKPLAH QQIQGK INGMLPNNQLH  HSNGNGEV+HE RL CSISLLDFKVAMELNPKQ
Subjt:  GLGSVSADCANVLNKVLDVYLKQLIRSCVDLVGAWPAYEPEKPLAHRQQIQGKAINGMLPNNQLHGRHSNGNGEVMHEHRLQCSISLLDFKVAMELNPKQ

Query:  LGEDWPLLMEKIRMRAFEE
        LGEDWPLL+EKI MRAF E
Subjt:  LGEDWPLLMEKIRMRAFEE

XP_022997521.1 uncharacterized protein LOC111492414 [Cucurbita maxima] | e value: 4.6e-206 | % identity: 87.83
Query:  MQPQQSLRIDLGELKSQIVKKLGTDRSKRYFFYLNRFLSQKLSKNEFDKLCCRVLGRENLWLHNQLIQSILKNACQAKAAPPIPVAGYPKTSTQSAKISP
        MQPQQSLRIDL ELKSQIVKKLGTDRSKRYFFYLNRFLSQKLSKNEFDKLCCRVLGRENLWLHNQLIQSILKNACQAKAAPPIP AGYPKTSTQ+AKISP
Subjt:  MQPQQSLRIDLGELKSQIVKKLGTDRSKRYFFYLNRFLSQKLSKNEFDKLCCRVLGRENLWLHNQLIQSILKNACQAKAAPPIPVAGYPKTSTQSAKISP

Query:  VIEDGNED-GAVFPTSTQSIPIWSNGGFPMSPRKSRSGIRDRKLKDRPSPLGPNGKVECISHQSAGKEDGSCRVTMDNGDATLCDYQRPVQHLQGVAELP
        VIEDGNED GAVF TSTQ IPIWSN GF MSPRK RSGIRDRKLKDRPS L PN KVECIS QSA KEDGSCR+ MDNG+AT CDYQRPVQHLQGV ELP
Subjt:  VIEDGNED-GAVFPTSTQSIPIWSNGGFPMSPRKSRSGIRDRKLKDRPSPLGPNGKVECISHQSAGKEDGSCRVTMDNGDATLCDYQRPVQHLQGVAELP

Query:  ENNVEARVQHPSGKQVLHNKIQVEGTKVEDREEAGRLNNSSLLRSRLLAPLGIPFCSASIGGARKARPVDCGSDFSFNDIGHLLDTESLKRRMEQIAAVQ
        ENN+EARVQ PSGKQVL  ++QVEGTKVEDREEA + N SSLLRSRLLAPLGIPFCSASIGGA K RPVDCG +FSF+D+GHLLDTESL+RRMEQIAAVQ
Subjt:  ENNVEARVQHPSGKQVLHNKIQVEGTKVEDREEAGRLNNSSLLRSRLLAPLGIPFCSASIGGARKARPVDCGSDFSFNDIGHLLDTESLKRRMEQIAAVQ

Query:  GLGSVSADCANVLNKVLDVYLKQLIRSCVDLVGAWPAYEPEKPLAHRQQIQGKAINGMLPNNQLHGRHSNGNGEVMHEHRLQCSISLLDFKVAMELNPKQ
        GLGSVSADCAN+LNKVLDVYLKQLIRSCVDLVG WP +EPEKPLAH QQIQGK INGMLPNNQLH  HSNGN EV+HE RL CSISLLDFKVAMELNPKQ
Subjt:  GLGSVSADCANVLNKVLDVYLKQLIRSCVDLVGAWPAYEPEKPLAHRQQIQGKAINGMLPNNQLHGRHSNGNGEVMHEHRLQCSISLLDFKVAMELNPKQ

Query:  LGEDWPLLMEKIRMRAFEE
        LGEDWPLL+EKI MRAF E
Subjt:  LGEDWPLLMEKIRMRAFEE

XP_023546134.1 uncharacterized protein LOC111805335 [Cucurbita pepo subsp. pepo] | e value: 4.4e-209 | % identity: 88.54
Query:  MQPQQSLRIDLGELKSQIVKKLGTDRSKRYFFYLNRFLSQKLSKNEFDKLCCRVLGRENLWLHNQLIQSILKNACQAKAAPPIPVAGYPKTSTQSAKISP
        MQPQQSLRIDL ELKSQIVKKLGTDRSKRYFFYLNRFLSQKLSKNEFDKLCCRVLGRENLWLHNQLIQSILKNACQAKAAPPIP AGYPKTSTQ+AKISP
Subjt:  MQPQQSLRIDLGELKSQIVKKLGTDRSKRYFFYLNRFLSQKLSKNEFDKLCCRVLGRENLWLHNQLIQSILKNACQAKAAPPIPVAGYPKTSTQSAKISP

Query:  VIEDGNED-GAVFPTSTQSIPIWSNGGFPMSPRKSRSGIRDRKLKDRPSPLGPNGKVECISHQSAGKEDGSCRVTMDNGDATLCDYQRPVQHLQGVAELP
        VIEDGNED GAVFPTSTQ IPIWSN GFP+SPRK RSGIRDRKLKDRPS L PN KVECIS QSA KEDGSCR+ +DNG+AT CDYQRPVQHLQGV ELP
Subjt:  VIEDGNED-GAVFPTSTQSIPIWSNGGFPMSPRKSRSGIRDRKLKDRPSPLGPNGKVECISHQSAGKEDGSCRVTMDNGDATLCDYQRPVQHLQGVAELP

Query:  ENNVEARVQHPSGKQVLHNKIQVEGTKVEDREEAGRLNNSSLLRSRLLAPLGIPFCSASIGGARKARPVDCGSDFSFNDIGHLLDTESLKRRMEQIAAVQ
        ENN+EARVQ PSGKQVL  ++QVEGTKVEDREEA + N SSLLRSRLLAPLGIPFCSASIGGA K RPVDCG +FSF+D+GHLLDTESL+RRMEQIAAVQ
Subjt:  ENNVEARVQHPSGKQVLHNKIQVEGTKVEDREEAGRLNNSSLLRSRLLAPLGIPFCSASIGGARKARPVDCGSDFSFNDIGHLLDTESLKRRMEQIAAVQ

Query:  GLGSVSADCANVLNKVLDVYLKQLIRSCVDLVGAWPAYEPEKPLAHRQQIQGKAINGMLPNNQLHGRHSNGNGEVMHEHRLQCSISLLDFKVAMELNPKQ
        GLGSVSADCAN+LNKVLDVYLKQLIRSCVDLVGAWPA+EPEKPLAH QQIQGK INGMLPNNQLH  HSNGNGEV+HE RL CSISLLDFKVAMELNPKQ
Subjt:  GLGSVSADCANVLNKVLDVYLKQLIRSCVDLVGAWPAYEPEKPLAHRQQIQGKAINGMLPNNQLHGRHSNGNGEVMHEHRLQCSISLLDFKVAMELNPKQ

Query:  LGEDWPLLMEKIRMRAFEE
        LGEDWPLL+EKI MRAF E
Subjt:  LGEDWPLLMEKIRMRAFEE

TrEMBL top hits (e value, % identity, alignment)
A0A1S3BCQ5 uncharacterized protein LOC103488231 | e value: 9.0e-208 | % identity: 88.07
Query:  MQPQQSLRIDLGELKSQIVKKLGTDRSKRYFFYLNRFLSQKLSKNEFDKLCCRVLGRENLWLHNQLIQSILKNACQAKAAPPIPVAGYPKTSTQSAKISP
        MQPQQSLRIDLGELKSQIVKKLG DRSKRYFFYLNRFLSQKLSKNEFDK CCRVLGRENLWLHNQLIQSILKNACQAKAAPPIPVAGYPKTSTQSAKISP
Subjt:  MQPQQSLRIDLGELKSQIVKKLGTDRSKRYFFYLNRFLSQKLSKNEFDKLCCRVLGRENLWLHNQLIQSILKNACQAKAAPPIPVAGYPKTSTQSAKISP

Query:  VIEDGNED-GAVFPTSTQSIPIWSNGGFPMSPRKSRSGIRDRKLKDRPSPLGPNGKVECISHQSAGKEDGSCRVTMDNGDATLCDYQRPVQHLQGVAELP
        ++EDGNED GAVFPTSTQ+IP WSNG   +SPRK RSGIRDRKLKDRPS LGPNGKVECISH SA          MDNGDATLCDY+RPVQHLQGVAELP
Subjt:  VIEDGNED-GAVFPTSTQSIPIWSNGGFPMSPRKSRSGIRDRKLKDRPSPLGPNGKVECISHQSAGKEDGSCRVTMDNGDATLCDYQRPVQHLQGVAELP

Query:  ENNVEARVQHPSGKQVLHNKIQVEGTKVEDREEAGRLNNSSLLRSRLLAPLGIPFCSASIGGARKARPVDCGSDFSFNDIGHLLDTESLKRRMEQIAAVQ
        ENN+E RV  PSGKQVLHNKIQVE TKVEDREEAG+ N+SSLLRSRLLAPLGIPFCSAS GG  K RPVDCG DFSF D+GHLLDTESL+RRMEQIAAVQ
Subjt:  ENNVEARVQHPSGKQVLHNKIQVEGTKVEDREEAGRLNNSSLLRSRLLAPLGIPFCSASIGGARKARPVDCGSDFSFNDIGHLLDTESLKRRMEQIAAVQ

Query:  GLGSVSADCANVLNKVLDVYLKQLIRSCVDLVGAWPAYEPEKPLAHRQQIQGKAINGMLPNNQLHGRHSNGNGEVMHEHRLQCSISLLDFKVAMELNPKQ
        GLGSVSADCAN+LNKVLDVYLKQLIRSCVDLVGAWPAYEPEKPLAH+QQIQGK INGMLPNNQLHGRHSNGN EV+HEHRLQCSISLLDFKVAMELNP Q
Subjt:  GLGSVSADCANVLNKVLDVYLKQLIRSCVDLVGAWPAYEPEKPLAHRQQIQGKAINGMLPNNQLHGRHSNGNGEVMHEHRLQCSISLLDFKVAMELNPKQ

Query:  LGEDWPLLMEKIRMRAFEE
        LGEDWPLL+EKI MRAF E
Subjt:  LGEDWPLLMEKIRMRAFEE

A0A5A7VF96 SAGA-Tad1 domain-containing protein | e value: 9.0e-208 | % identity: 88.07
Query:  MQPQQSLRIDLGELKSQIVKKLGTDRSKRYFFYLNRFLSQKLSKNEFDKLCCRVLGRENLWLHNQLIQSILKNACQAKAAPPIPVAGYPKTSTQSAKISP
        MQPQQSLRIDLGELKSQIVKKLG DRSKRYFFYLNRFLSQKLSKNEFDK CCRVLGRENLWLHNQLIQSILKNACQAKAAPPIPVAGYPKTSTQSAKISP
Subjt:  MQPQQSLRIDLGELKSQIVKKLGTDRSKRYFFYLNRFLSQKLSKNEFDKLCCRVLGRENLWLHNQLIQSILKNACQAKAAPPIPVAGYPKTSTQSAKISP

Query:  VIEDGNED-GAVFPTSTQSIPIWSNGGFPMSPRKSRSGIRDRKLKDRPSPLGPNGKVECISHQSAGKEDGSCRVTMDNGDATLCDYQRPVQHLQGVAELP
        ++EDGNED GAVFPTSTQ+IP WSNG   +SPRK RSGIRDRKLKDRPS LGPNGKVECISH SA          MDNGDATLCDY+RPVQHLQGVAELP
Subjt:  VIEDGNED-GAVFPTSTQSIPIWSNGGFPMSPRKSRSGIRDRKLKDRPSPLGPNGKVECISHQSAGKEDGSCRVTMDNGDATLCDYQRPVQHLQGVAELP

Query:  ENNVEARVQHPSGKQVLHNKIQVEGTKVEDREEAGRLNNSSLLRSRLLAPLGIPFCSASIGGARKARPVDCGSDFSFNDIGHLLDTESLKRRMEQIAAVQ
        ENN+E RV  PSGKQVLHNKIQVE TKVEDREEAG+ N+SSLLRSRLLAPLGIPFCSAS GG  K RPVDCG DFSF D+GHLLDTESL+RRMEQIAAVQ
Subjt:  ENNVEARVQHPSGKQVLHNKIQVEGTKVEDREEAGRLNNSSLLRSRLLAPLGIPFCSASIGGARKARPVDCGSDFSFNDIGHLLDTESLKRRMEQIAAVQ

Query:  GLGSVSADCANVLNKVLDVYLKQLIRSCVDLVGAWPAYEPEKPLAHRQQIQGKAINGMLPNNQLHGRHSNGNGEVMHEHRLQCSISLLDFKVAMELNPKQ
        GLGSVSADCAN+LNKVLDVYLKQLIRSCVDLVGAWPAYEPEKPLAH+QQIQGK INGMLPNNQLHGRHSNGN EV+HEHRLQCSISLLDFKVAMELNP Q
Subjt:  GLGSVSADCANVLNKVLDVYLKQLIRSCVDLVGAWPAYEPEKPLAHRQQIQGKAINGMLPNNQLHGRHSNGNGEVMHEHRLQCSISLLDFKVAMELNPKQ

Query:  LGEDWPLLMEKIRMRAFEE
        LGEDWPLL+EKI MRAF E
Subjt:  LGEDWPLLMEKIRMRAFEE

A0A6J1BTJ5 uncharacterized protein LOC111005206 | e value: 1.9e-205 | % identity: 87.17
Query:  MQPQQSLRIDLGELKSQIVKKLGTDRSKRYFFYLNRFLSQKLSKNEFDKLCCRVLGRENLWLHNQLIQSILKNACQAKAAPPIPVAGYPKTSTQSAKISP
        MQPQQSLRIDLGELKSQIVKKLGTDRSKRYFFYLNRFLSQKLSKNEFDKLC RVLGR+NLWLHNQLIQSILKNACQAKAAPP+PVAGYPKTSTQSAK+SP
Subjt:  MQPQQSLRIDLGELKSQIVKKLGTDRSKRYFFYLNRFLSQKLSKNEFDKLCCRVLGRENLWLHNQLIQSILKNACQAKAAPPIPVAGYPKTSTQSAKISP

Query:  VIEDGNED-GAVFPTSTQSIPIWSNGGFPMSPRKSRSGIRDRKLKDRPSPLGPNGKVECISHQSAGKEDGSCRVTMDNGDATLCDYQRPVQHLQGVAELP
        VIEDGNED GAV+PTSTQSIPIWSNGGFP SPRKSRSGIRDRKLKDRPSPLGPNGKVECISHQSAGK+DGSC++ M NGDATLCDYQRPVQHLQGVAELP
Subjt:  VIEDGNED-GAVFPTSTQSIPIWSNGGFPMSPRKSRSGIRDRKLKDRPSPLGPNGKVECISHQSAGKEDGSCRVTMDNGDATLCDYQRPVQHLQGVAELP

Query:  ENNVEARVQHPSGKQVLHNKIQVEGTKVEDREEAGRLNNSSLLRSRLLAPLGIPFCSASIGGARKARPVDCGSDF-SFNDIGHLLDTESLKRRMEQIAAV
        ENN+EAR++ P+GKQVL+NKI  EGTKV DREEAG   +S LL+SRLLAPLGIPFCSASIGGARKARP D G DF SF+DIGHL DTESL+RRMEQIAAV
Subjt:  ENNVEARVQHPSGKQVLHNKIQVEGTKVEDREEAGRLNNSSLLRSRLLAPLGIPFCSASIGGARKARPVDCGSDF-SFNDIGHLLDTESLKRRMEQIAAV

Query:  QGLGSVSADCANVLNKVLDVYLKQLIRSCVDLVGAWP-AYEPEKPLAHRQQIQGKAINGMLPNNQLHGRHSNGNGEVMHEHRLQCSISLLDFKVAMELNP
         GLGSVSAD AN+LNKVLDVYLKQLIRSCV LVG  P   EPEKPL  + Q+QGK INGMLPNNQLHGRHSNG+ EVMHEHRL+CS+SLLDFKVAMELNP
Subjt:  QGLGSVSADCANVLNKVLDVYLKQLIRSCVDLVGAWP-AYEPEKPLAHRQQIQGKAINGMLPNNQLHGRHSNGNGEVMHEHRLQCSISLLDFKVAMELNP

Query:  KQLGEDWPLLMEKIRMRAFEE
        KQLGEDWPLL+EKIRMRAFEE
Subjt:  KQLGEDWPLLMEKIRMRAFEE

A0A6J1HF85 uncharacterized protein LOC111463000 | e value: 1.4e-205 | % identity: 87.35
Query:  MQPQQSLRIDLGELKSQIVKKLGTDRSKRYFFYLNRFLSQKLSKNEFDKLCCRVLGRENLWLHNQLIQSILKNACQAKAAPPIPVAGYPKTSTQSAKISP
        MQPQQSLRIDL ELKSQIVKKLGTDRSKRYFFYLNRFLSQKLSKNEFDKLCCRVLGRENLWLHNQLIQSILKNACQAKAAPPIP AGYPKTSTQ+AKISP
Subjt:  MQPQQSLRIDLGELKSQIVKKLGTDRSKRYFFYLNRFLSQKLSKNEFDKLCCRVLGRENLWLHNQLIQSILKNACQAKAAPPIPVAGYPKTSTQSAKISP

Query:  VIEDGNED-GAVFPTSTQSIPIWSNGGFPMSPRKSRSGIRDRKLKDRPSPLGPNGKVECISHQSAGKEDGSCRVTMDNGDATLCDYQRPVQHLQGVAELP
        VIEDGNED GAVFPTSTQ IPIWSN GFP+SPRK RSGIRDRKLKDRPS L PN KVECIS QSA KEDGSCR+ MDNG+AT CDYQRPVQHLQGV ELP
Subjt:  VIEDGNED-GAVFPTSTQSIPIWSNGGFPMSPRKSRSGIRDRKLKDRPSPLGPNGKVECISHQSAGKEDGSCRVTMDNGDATLCDYQRPVQHLQGVAELP

Query:  ENNVEARVQHPSGKQVLHNKIQVEGTKVEDREEAGRLNNSSLLRSRLLAPLGIPFCSASIGGARKARPVDCGSDFSFNDIGHLLDTESLKRRMEQIAAVQ
        ENN+EARVQ PSGKQVL         +VEDREEA + N SSLLRSRLLAPLGIPFCSASIGGA K RPVDCG +FSF+D+GHLLDTESL+RRMEQIAAVQ
Subjt:  ENNVEARVQHPSGKQVLHNKIQVEGTKVEDREEAGRLNNSSLLRSRLLAPLGIPFCSASIGGARKARPVDCGSDFSFNDIGHLLDTESLKRRMEQIAAVQ

Query:  GLGSVSADCANVLNKVLDVYLKQLIRSCVDLVGAWPAYEPEKPLAHRQQIQGKAINGMLPNNQLHGRHSNGNGEVMHEHRLQCSISLLDFKVAMELNPKQ
        GLGSVSADCAN+LNKVLDVYLKQLIRSCVDLVGAWPA+EPEKPLAH QQIQGK INGMLPNNQLH  HSNGNGEV+HE RL CSISLLDFKVAMELNPKQ
Subjt:  GLGSVSADCANVLNKVLDVYLKQLIRSCVDLVGAWPAYEPEKPLAHRQQIQGKAINGMLPNNQLHGRHSNGNGEVMHEHRLQCSISLLDFKVAMELNPKQ

Query:  LGEDWPLLMEKIRMRAFEE
        LGEDWPLL+EKI MRAF E
Subjt:  LGEDWPLLMEKIRMRAFEE

A0A6J1K7Q1 uncharacterized protein LOC111492414 | e value: 2.2e-206 | % identity: 87.83
Query:  MQPQQSLRIDLGELKSQIVKKLGTDRSKRYFFYLNRFLSQKLSKNEFDKLCCRVLGRENLWLHNQLIQSILKNACQAKAAPPIPVAGYPKTSTQSAKISP
        MQPQQSLRIDL ELKSQIVKKLGTDRSKRYFFYLNRFLSQKLSKNEFDKLCCRVLGRENLWLHNQLIQSILKNACQAKAAPPIP AGYPKTSTQ+AKISP
Subjt:  MQPQQSLRIDLGELKSQIVKKLGTDRSKRYFFYLNRFLSQKLSKNEFDKLCCRVLGRENLWLHNQLIQSILKNACQAKAAPPIPVAGYPKTSTQSAKISP

Query:  VIEDGNED-GAVFPTSTQSIPIWSNGGFPMSPRKSRSGIRDRKLKDRPSPLGPNGKVECISHQSAGKEDGSCRVTMDNGDATLCDYQRPVQHLQGVAELP
        VIEDGNED GAVF TSTQ IPIWSN GF MSPRK RSGIRDRKLKDRPS L PN KVECIS QSA KEDGSCR+ MDNG+AT CDYQRPVQHLQGV ELP
Subjt:  VIEDGNED-GAVFPTSTQSIPIWSNGGFPMSPRKSRSGIRDRKLKDRPSPLGPNGKVECISHQSAGKEDGSCRVTMDNGDATLCDYQRPVQHLQGVAELP

Query:  ENNVEARVQHPSGKQVLHNKIQVEGTKVEDREEAGRLNNSSLLRSRLLAPLGIPFCSASIGGARKARPVDCGSDFSFNDIGHLLDTESLKRRMEQIAAVQ
        ENN+EARVQ PSGKQVL  ++QVEGTKVEDREEA + N SSLLRSRLLAPLGIPFCSASIGGA K RPVDCG +FSF+D+GHLLDTESL+RRMEQIAAVQ
Subjt:  ENNVEARVQHPSGKQVLHNKIQVEGTKVEDREEAGRLNNSSLLRSRLLAPLGIPFCSASIGGARKARPVDCGSDFSFNDIGHLLDTESLKRRMEQIAAVQ

Query:  GLGSVSADCANVLNKVLDVYLKQLIRSCVDLVGAWPAYEPEKPLAHRQQIQGKAINGMLPNNQLHGRHSNGNGEVMHEHRLQCSISLLDFKVAMELNPKQ
        GLGSVSADCAN+LNKVLDVYLKQLIRSCVDLVG WP +EPEKPLAH QQIQGK INGMLPNNQLH  HSNGN EV+HE RL CSISLLDFKVAMELNPKQ
Subjt:  GLGSVSADCANVLNKVLDVYLKQLIRSCVDLVGAWPAYEPEKPLAHRQQIQGKAINGMLPNNQLHGRHSNGNGEVMHEHRLQCSISLLDFKVAMELNPKQ

Query:  LGEDWPLLMEKIRMRAFEE
        LGEDWPLL+EKI MRAF E
Subjt:  LGEDWPLLMEKIRMRAFEE

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
AT2G14850.1 unknown protein | e value: 2.5e-37 | % identity: 29.81
Query:  RIDLGELKSQIVKKLGTDRSKRYFFYLNRFLSQKLSKNEFDKLCCRVLGRENLWLHNQLIQSILKNACQAKAAPPIPVAGYPKTSTQSAKISPVIEDGNE
        R++  E+K+ I +K+G  R+  YF  L +FL+ ++SK+EFDKLC + +GREN+ LHN+L++SILKNA  AK+ PP     YPK S               
Subjt:  RIDLGELKSQIVKKLGTDRSKRYFFYLNRFLSQKLSKNEFDKLCCRVLGRENLWLHNQLIQSILKNACQAKAAPPIPVAGYPKTSTQSAKISPVIEDGNE

Query:  DGAVFPTSTQSIPIWSNGGFPMSPRKSRSGIRDRKLKDRPSPLGPNGKVECISHQSAGKEDGSCRVTMDNGDATLCDYQRPVQHLQGVAELPENNVEARV
                     ++ +  FP SPRK RS    RK +DRPSPLGP GK + ++                  D ++   QR                    
Subjt:  DGAVFPTSTQSIPIWSNGGFPMSPRKSRSGIRDRKLKDRPSPLGPNGKVECISHQSAGKEDGSCRVTMDNGDATLCDYQRPVQHLQGVAELPENNVEARV

Query:  QHPSGKQVLHNKIQVEGTKVEDREEAGRLNNSSLLRSR--LLAPLGIPF---CSASIGGARKARPVDCGSDFSFNDIGHLLDTESLKRRMEQIAAVQGLG
                    + +E   VED EE  ++  S  ++SR  L APLG+ F     A            C S       G L D  +L+ R+E+   ++G+ 
Subjt:  QHPSGKQVLHNKIQVEGTKVEDREEAGRLNNSSLLRSR--LLAPLGIPF---CSASIGGARKARPVDCGSDFSFNDIGHLLDTESLKRRMEQIAAVQGLG

Query:  SVSADCANVLNKVLDVYLKQLIRSCVDLVGAWPAYEPEKPLAHRQQIQGKAINGMLPNNQLHGRHSNGNGEVMHEHRLQCSISLLDFKVAMELNPKQLGE
         +S D AN+LN+ L+ Y+++LI  C+ L                                              + R   ++S+LDF  AME+NP+ LGE
Subjt:  SVSADCANVLNKVLDVYLKQLIRSCVDLVGAWPAYEPEKPLAHRQQIQGKAINGMLPNNQLHGRHSNGNGEVMHEHRLQCSISLLDFKVAMELNPKQLGE

Query:  DWPLLMEKIRMRAFEE
        +WP+ +EKI  RA EE
Subjt:  DWPLLMEKIRMRAFEE

AT2G24530.1 unknown protein | e value: 2.8e-97 | % identity: 48.23
Query:  MQPQQSLRIDLGELKSQIVKKLGTDRSKRYFFYLNRFLSQKLSKNEFDKLCCRVLGRENLWLHNQLIQSILKNACQAKAAPPIPVAGYPKTSTQSAKISP
        MQ  Q  RI L ELK  IVKK G +RS+RYF+YL RFLSQKL+K+EFDK C R+LGRENL LHNQLI+SIL+NA  AK+ PP   AG+   ST++     
Subjt:  MQPQQSLRIDLGELKSQIVKKLGTDRSKRYFFYLNRFLSQKLSKNEFDKLCCRVLGRENLWLHNQLIQSILKNACQAKAAPPIPVAGYPKTSTQSAKISP

Query:  VIEDGNEDGAVFPTSTQSIPIWSNGGFPMSPRKSRSGIRDRKLKDRPSPLGPNGKVECISHQSAGKEDGSCRVTMDNGDATLCDYQRPVQHLQGVAELPE
          +   + G + P  +Q  P+WSNG  P+SPRK RSG+++RK +DRPSPLG NGKVE + HQ   +ED    V M+NG     DYQR  +++        
Subjt:  VIEDGNEDGAVFPTSTQSIPIWSNGGFPMSPRKSRSGIRDRKLKDRPSPLGPNGKVECISHQSAGKEDGSCRVTMDNGDATLCDYQRPVQHLQGVAELPE

Query:  NNVEARVQHPSGKQVLHNKIQVEGTKVED---REEAGRLNNSSLLRSRLLAPLGIPFCSASIGGARKARPVDCGSD-FSFNDIGHLLDTESLKRRMEQIA
        +  +     P  K  + NK ++    + D   +EE  R+N   L  S L+APLGIPFCSAS+GG+ +  PV   ++  S  D G L D E L++RME IA
Subjt:  NNVEARVQHPSGKQVLHNKIQVEGTKVED---REEAGRLNNSSLLRSRLLAPLGIPFCSASIGGARKARPVDCGSD-FSFNDIGHLLDTESLKRRMEQIA

Query:  AVQGLGSVSADCANVLNKVLDVYLKQLIRSCVDLVGAWPAY-EPEKPLAHRQQIQGKAINGMLPNNQLHGRHSNGNGEVMHEHRLQCSISLLDFKVAMEL
          QGL  VS +CA  LN +LDVYLK+LI SC DLVGA     +P K    +QQ Q K +NG+ P N L  +  NG+ ++  +H    S+S+LDF+ AMEL
Subjt:  AVQGLGSVSADCANVLNKVLDVYLKQLIRSCVDLVGAWPAY-EPEKPLAHRQQIQGKAINGMLPNNQLHGRHSNGNGEVMHEHRLQCSISLLDFKVAMEL

Query:  NPKQLGEDWPLLMEKIRMRAFEE
        NP+QLGEDWP L E+I +R+FEE
Subjt:  NPKQLGEDWPLLMEKIRMRAFEE

AT4G31440.1 unknown protein | e value: 1.1e-75 | % identity: 44.21
Query:  MQPQQSLRIDLGELKSQIVKKLGTDRSKRYFFYLNRFLSQKLSKNEFDKLCCRVLGRENLWLHNQLIQSILKNACQAKAAPPIPVAGYPKTSTQSAKISP
        MQ  Q  RIDL ELK  IVKK+G +RS RYF+YL RFLSQKL+K+EFDK C R+LGRENL LHN+LI+SIL+NA  AK+ P +  +G+P  S    K   
Subjt:  MQPQQSLRIDLGELKSQIVKKLGTDRSKRYFFYLNRFLSQKLSKNEFDKLCCRVLGRENLWLHNQLIQSILKNACQAKAAPPIPVAGYPKTSTQSAKISP

Query:  VIEDGNEDG-AVFPTSTQSIPIWSNGGFPMSPRKSRSG-IRDRKLKDRPSPLGPNGKV-ECISHQSAGKEDGSCRVTMDNGDATLCDYQRPVQHLQGVAE
          EDG E+  ++ P   ++    SNG       K R G   DR ++D+P PLG NGKV    ++   G      R   +   A LC              
Subjt:  VIEDGNEDG-AVFPTSTQSIPIWSNGGFPMSPRKSRSG-IRDRKLKDRPSPLGPNGKV-ECISHQSAGKEDGSCRVTMDNGDATLCDYQRPVQHLQGVAE

Query:  LPENNVEARVQHPSGKQVLHNKIQVEGTKVEDREEAGRLNNSSLLRSRLLAPLGIPFCSASIGGARKARPVD-CGSDFSFNDIGHLLDTESLKRRMEQIA
                    P+ ++ +  K QV      D E   R+    L    ++APLGIPFCSAS+GG R+  PV    +  S  D G L DTE L++RME IA
Subjt:  LPENNVEARVQHPSGKQVLHNKIQVEGTKVEDREEAGRLNNSSLLRSRLLAPLGIPFCSASIGGARKARPVD-CGSDFSFNDIGHLLDTESLKRRMEQIA

Query:  AVQGLGSVSADCANVLNKVLDVYLKQLIRSCVDLVGAWPAY-EPEKPLAHRQQIQGKAINGMLPNNQLHGRHSNGNGEVMHEHRLQCSISLLDFKVAMEL
          QGLG VSA+C+ VLN +LD+YLK+L++SCVDL GA      P K    +QQ + + +NG+  NN  H + SN   ++  E   Q S+SLLDF+VAMEL
Subjt:  AVQGLGSVSADCANVLNKVLDVYLKQLIRSCVDLVGAWPAY-EPEKPLAHRQQIQGKAINGMLPNNQLHGRHSNGNGEVMHEHRLQCSISLLDFKVAMEL

Query:  NPKQLGEDWPLLMEKIRMRAFEE
        NP QLGEDWPLL E+I +  FEE
Subjt:  NPKQLGEDWPLLMEKIRMRAFEE

AT4G33890.1 unknown protein | e value: 6.0e-39 | % identity: 30.5
Query:  QQSLRIDLGELKSQIVKKLGTDRSKRYFFYLNRFLSQKLSKNEFDKLCCRVLGRENLWLHNQLIQSILKNACQAKAAPPIPVAGYPKTSTQSAKISPVIE
        Q S R+D  E+K+ I +++G  R++ YF  L RF + K++K+EFDKLC + +GR+N+ LHN+LI+SI+KNAC AK+ P I   G              + 
Subjt:  QQSLRIDLGELKSQIVKKLGTDRSKRYFFYLNRFLSQKLSKNEFDKLCCRVLGRENLWLHNQLIQSILKNACQAKAAPPIPVAGYPKTSTQSAKISPVIE

Query:  DGNEDGAVFPTSTQSIPIWSNGGFPMSPRKSRSGIRDRKLKDRPSPLGPNGKVECISHQSAGKEDGSCRVTMDNGDATLCDYQRPVQHLQGVAELPENNV
         GN D      ++Q  P+  +  F  S RK RS    RKL+DRPSPLGP GK   ++  +                      +  +   Q   EL     
Subjt:  DGNEDGAVFPTSTQSIPIWSNGGFPMSPRKSRSGIRDRKLKDRPSPLGPNGKVECISHQSAGKEDGSCRVTMDNGDATLCDYQRPVQHLQGVAELPENNV

Query:  EARVQHPSGKQVLHNKIQVEGTKVEDREEAGRL-NNSSLLRSR--LLAPLGIPFCSASIGGARKARPVDCGSDFSFN-----DIGHLLDTESLKRRMEQI
                    L ++  VE   VE+ EE  ++   S  ++SR  L APLG+   S   G  RK+         SFN     + G L DT +L+ R+E+ 
Subjt:  EARVQHPSGKQVLHNKIQVEGTKVEDREEAGRL-NNSSLLRSR--LLAPLGIPFCSASIGGARKARPVDCGSDFSFN-----DIGHLLDTESLKRRMEQI

Query:  AAVQGLGSVSADCANVLNKVLDVYLKQLIRSCVDLVGAWPAYEPEKPLAHRQQIQGKAINGMLPNNQLHGRHSNGNGEVMHEHRLQCSISLLDFKVAMEL
          ++GL  ++ D  ++LN  LDV++++LI  C+ L       +  + + ++   Q + ++                            +S+ DF+  MEL
Subjt:  AAVQGLGSVSADCANVLNKVLDVYLKQLIRSCVDLVGAWPAYEPEKPLAHRQQIQGKAINGMLPNNQLHGRHSNGNGEVMHEHRLQCSISLLDFKVAMEL

Query:  NPKQLGEDWPLLMEKIRMRAFEE
        N + LGEDWP+ MEKI  RA ++
Subjt:  NPKQLGEDWPLLMEKIRMRAFEE

AT4G33890.2 unknown protein | e value: 6.0e-39 | % identity: 30.5
Query:  QQSLRIDLGELKSQIVKKLGTDRSKRYFFYLNRFLSQKLSKNEFDKLCCRVLGRENLWLHNQLIQSILKNACQAKAAPPIPVAGYPKTSTQSAKISPVIE
        Q S R+D  E+K+ I +++G  R++ YF  L RF + K++K+EFDKLC + +GR+N+ LHN+LI+SI+KNAC AK+ P I   G              + 
Subjt:  QQSLRIDLGELKSQIVKKLGTDRSKRYFFYLNRFLSQKLSKNEFDKLCCRVLGRENLWLHNQLIQSILKNACQAKAAPPIPVAGYPKTSTQSAKISPVIE

Query:  DGNEDGAVFPTSTQSIPIWSNGGFPMSPRKSRSGIRDRKLKDRPSPLGPNGKVECISHQSAGKEDGSCRVTMDNGDATLCDYQRPVQHLQGVAELPENNV
         GN D      ++Q  P+  +  F  S RK RS    RKL+DRPSPLGP GK   ++  +                      +  +   Q   EL     
Subjt:  DGNEDGAVFPTSTQSIPIWSNGGFPMSPRKSRSGIRDRKLKDRPSPLGPNGKVECISHQSAGKEDGSCRVTMDNGDATLCDYQRPVQHLQGVAELPENNV

Query:  EARVQHPSGKQVLHNKIQVEGTKVEDREEAGRL-NNSSLLRSR--LLAPLGIPFCSASIGGARKARPVDCGSDFSFN-----DIGHLLDTESLKRRMEQI
                    L ++  VE   VE+ EE  ++   S  ++SR  L APLG+   S   G  RK+         SFN     + G L DT +L+ R+E+ 
Subjt:  EARVQHPSGKQVLHNKIQVEGTKVEDREEAGRL-NNSSLLRSR--LLAPLGIPFCSASIGGARKARPVDCGSDFSFN-----DIGHLLDTESLKRRMEQI

Query:  AAVQGLGSVSADCANVLNKVLDVYLKQLIRSCVDLVGAWPAYEPEKPLAHRQQIQGKAINGMLPNNQLHGRHSNGNGEVMHEHRLQCSISLLDFKVAMEL
          ++GL  ++ D  ++LN  LDV++++LI  C+ L       +  + + ++   Q + ++                            +S+ DF+  MEL
Subjt:  AAVQGLGSVSADCANVLNKVLDVYLKQLIRSCVDLVGAWPAYEPEKPLAHRQQIQGKAINGMLPNNQLHGRHSNGNGEVMHEHRLQCSISLLDFKVAMEL

Query:  NPKQLGEDWPLLMEKIRMRAFEE
        N + LGEDWP+ MEKI  RA ++
Subjt:  NPKQLGEDWPLLMEKIRMRAFEE
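
The % identity reported for a hit can be approximated from an alignment block by counting the positions where the midline (the unlabeled line between Query and Subjt) repeats the query residue. The sketch below assumes the usual BLAST midline convention: a letter marks an identical residue, `+` a conservative substitution, and a space a mismatch or gap. The function name `midline_identity` is illustrative and is not part of CuGenDB.

```python
def midline_identity(query: str, midline: str) -> float:
    """Return percent identity for one alignment segment.

    Assumes the BLAST midline convention: a letter means the subject
    residue is identical to the query, '+' is a conservative
    substitution, and a space is a mismatch or gap.
    """
    # Pad the midline in case trailing spaces were stripped during extraction.
    midline = midline.ljust(len(query))
    if len(midline) != len(query):
        raise ValueError("midline is longer than the query segment")
    # Count columns where the midline letter equals the query residue.
    identical = sum(1 for q, m in zip(query, midline) if m.isalpha() and m == q)
    return 100.0 * identical / len(query)


# Last segment of the Cucumis melo hit (XP_008445087.1) listed above.
query = "LGEDWPLLMEKIRMRAFEE"
midline = "LGEDWPLL+EKI MRAF E"
print(round(midline_identity(query, midline), 2))  # prints 84.21
```

This covers a single segment only. To compare against the per-hit figure, the identical-position counts and segment lengths would need to be summed over every segment of the hit, and the result is only meaningful if the midline spacing survived extraction intact.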


Sequences
CDS sequence
ATGCAACCTCAGCAGAGCTTGAGAATTGACTTGGGTGAATTGAAATCTCAGATAGTGAAGAAGCTCGGAACCGATCGGTCAAAGCGGTACTTCTTTTACTTGAATAGGTT
CTTGAGTCAAAAGCTGAGTAAGAATGAGTTCGATAAGTTATGTTGTCGTGTACTTGGGAGGGAGAATCTTTGGCTGCATAATCAATTGATACAGTCAATTTTGAAGAATG
CATGTCAAGCTAAGGCTGCACCACCAATACCTGTAGCAGGTTATCCGAAGACTTCGACACAATCTGCAAAGATTTCCCCTGTTATAGAAGATGGGAATGAGGATGGAGCT
GTTTTTCCTACTTCCACTCAAAGTATTCCCATTTGGTCCAATGGAGGATTTCCAATGTCCCCAAGAAAGAGCAGGTCTGGGATACGCGATCGCAAACTCAAAGACAGGCC
GAGTCCGCTAGGACCCAATGGGAAGGTTGAATGTATCTCACATCAATCAGCAGGCAAGGAAGATGGAAGCTGCCGAGTCACGATGGATAATGGTGATGCAACTTTGTGTG
ACTATCAGAGACCAGTGCAGCATCTGCAAGGAGTTGCTGAACTACCTGAAAACAATGTTGAGGCTAGAGTTCAGCATCCATCAGGAAAGCAAGTCCTACACAATAAGATC
CAGGTTGAAGGAACTAAGGTTGAAGACAGGGAAGAAGCTGGACGTTTGAATAACTCAAGTTTACTTCGGAGTCGATTACTTGCACCTCTTGGGATTCCTTTTTGCTCAGC
TAGTATCGGTGGGGCTCGCAAAGCTAGGCCTGTAGATTGTGGGAGTGATTTTAGCTTTAATGATATTGGTCATTTATTGGATACCGAGTCGTTGAAACGACGCATGGAAC
AAATTGCTGCAGTACAGGGCCTAGGCAGTGTTTCTGCAGATTGTGCTAATGTTTTGAATAAGGTGTTGGATGTATATTTGAAGCAGTTAATTAGGTCTTGTGTTGATTTG
GTTGGAGCATGGCCTGCATATGAGCCTGAGAAACCTCTTGCCCATAGGCAGCAGATTCAGGGGAAAGCTATCAATGGCATGTTGCCGAATAATCAATTACATGGACGACA
TAGCAATGGAAATGGAGAAGTTATGCACGAGCACAGATTACAGTGCTCGATATCGTTGCTTGACTTCAAGGTAGCAATGGAGCTTAACCCAAAGCAACTTGGGGAAGACT
GGCCTTTGCTAATGGAGAAAATTCGTATGCGTGCATTTGAGGAATGA
mRNA sequence
ATGCAACCTCAGCAGAGCTTGAGAATTGACTTGGGTGAATTGAAATCTCAGATAGTGAAGAAGCTCGGAACCGATCGGTCAAAGCGGTACTTCTTTTACTTGAATAGGTT
CTTGAGTCAAAAGCTGAGTAAGAATGAGTTCGATAAGTTATGTTGTCGTGTACTTGGGAGGGAGAATCTTTGGCTGCATAATCAATTGATACAGTCAATTTTGAAGAATG
CATGTCAAGCTAAGGCTGCACCACCAATACCTGTAGCAGGTTATCCGAAGACTTCGACACAATCTGCAAAGATTTCCCCTGTTATAGAAGATGGGAATGAGGATGGAGCT
GTTTTTCCTACTTCCACTCAAAGTATTCCCATTTGGTCCAATGGAGGATTTCCAATGTCCCCAAGAAAGAGCAGGTCTGGGATACGCGATCGCAAACTCAAAGACAGGCC
GAGTCCGCTAGGACCCAATGGGAAGGTTGAATGTATCTCACATCAATCAGCAGGCAAGGAAGATGGAAGCTGCCGAGTCACGATGGATAATGGTGATGCAACTTTGTGTG
ACTATCAGAGACCAGTGCAGCATCTGCAAGGAGTTGCTGAACTACCTGAAAACAATGTTGAGGCTAGAGTTCAGCATCCATCAGGAAAGCAAGTCCTACACAATAAGATC
CAGGTTGAAGGAACTAAGGTTGAAGACAGGGAAGAAGCTGGACGTTTGAATAACTCAAGTTTACTTCGGAGTCGATTACTTGCACCTCTTGGGATTCCTTTTTGCTCAGC
TAGTATCGGTGGGGCTCGCAAAGCTAGGCCTGTAGATTGTGGGAGTGATTTTAGCTTTAATGATATTGGTCATTTATTGGATACCGAGTCGTTGAAACGACGCATGGAAC
AAATTGCTGCAGTACAGGGCCTAGGCAGTGTTTCTGCAGATTGTGCTAATGTTTTGAATAAGGTGTTGGATGTATATTTGAAGCAGTTAATTAGGTCTTGTGTTGATTTG
GTTGGAGCATGGCCTGCATATGAGCCTGAGAAACCTCTTGCCCATAGGCAGCAGATTCAGGGGAAAGCTATCAATGGCATGTTGCCGAATAATCAATTACATGGACGACA
TAGCAATGGAAATGGAGAAGTTATGCACGAGCACAGATTACAGTGCTCGATATCGTTGCTTGACTTCAAGGTAGCAATGGAGCTTAACCCAAAGCAACTTGGGGAAGACT
GGCCTTTGCTAATGGAGAAAATTCGTATGCGTGCATTTGAGGAATGA
Protein sequence
MQPQQSLRIDLGELKSQIVKKLGTDRSKRYFFYLNRFLSQKLSKNEFDKLCCRVLGRENLWLHNQLIQSILKNACQAKAAPPIPVAGYPKTSTQSAKISPVIEDGNEDGA
VFPTSTQSIPIWSNGGFPMSPRKSRSGIRDRKLKDRPSPLGPNGKVECISHQSAGKEDGSCRVTMDNGDATLCDYQRPVQHLQGVAELPENNVEARVQHPSGKQVLHNKI
QVEGTKVEDREEAGRLNNSSLLRSRLLAPLGIPFCSASIGGARKARPVDCGSDFSFNDIGHLLDTESLKRRMEQIAAVQGLGSVSADCANVLNKVLDVYLKQLIRSCVDL
VGAWPAYEPEKPLAHRQQIQGKAINGMLPNNQLHGRHSNGNGEVMHEHRLQCSISLLDFKVAMELNPKQLGEDWPLLMEKIRMRAFEE
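
The protein sequence should follow from the CDS by codon-wise translation. A minimal sketch of that check is given here, using only the first ten and the last four codons of the CDS listed above. The `translate` helper is illustrative, and the use of the standard nuclear genetic code (NCBI translation table 1) is an assumption, not something the record states.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), assumed here.
# Codons are enumerated in TCAG order with the first base varying slowest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)


# First 30 nt and last 12 nt of the Spg025521 CDS listed above.
print(translate("ATGCAACCTCAGCAGAGCTTGAGAATTGAC"))  # prints MQPQQSLRID
print(translate("TTTGAGGAATGA"))                    # prints FEE
```

Both fragments agree with the start (`MQPQQSLRID`) and end (`FEE`) of the protein sequence above. Passing the full CDS, with its lines joined, to `translate` would allow the whole protein to be compared in the same way.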