; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0005148 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0005148
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionSAGA-Tad1 domain-containing protein
Genome locationchr6:11178783..11180039
RNA-Seq ExpressionLag0005148
SyntenyLag0005148
Gene Ontology termsGO:0006357 - regulation of transcription by RNA polymerase II (biological process)
GO:0045893 - positive regulation of transcription, DNA-templated (biological process)
GO:0000124 - SAGA complex (cellular component)
GO:0003713 - transcription coactivator activity (molecular function)
InterPro domainsIPR024738 - Transcriptional coactivator Hfi1/Transcriptional adapter 1


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7029751.1 hypothetical protein SDJN02_08093, partial [Cucurbita argyrosperma subsp. argyrosperma]4.0e-20286.63Show/hide
Query:  MQPQQSLRIDLGELKSQIVKKLGTDRSKRYFFYLNRFLSQKLSKNEFDKLCCRVLGRENLLLHNQLIQSILKNACQAKAAPPIPVAGYPKTSTQSAKISP
        MQPQQSLRIDL ELKSQIVKKLGTDRSKRYFFYLNRFLSQKLSKNEFDKLCCRVLGRENL LHNQLIQSILKNACQAKAAPPIP AGYPKTSTQ+AKISP
Subjt:  MQPQQSLRIDLGELKSQIVKKLGTDRSKRYFFYLNRFLSQKLSKNEFDKLCCRVLGRENLLLHNQLIQSILKNACQAKAAPPIPVAGYPKTSTQSAKISP

Query:  VIEDGNEE-GAVFPTSTQSIPIWSNGGFPMSPRKSRSGIRDRKLKDRPSPLGPNGKVECISHQSAGKEDGNFRITMDNGDATLCDYQRPVQHLQGVAELP
        VIEDGNE+ GAVFPTSTQ IPIWSN GFP+SPRK RSGIRDRKLKDRPS L PN KVECIS QSA KEDG+ RI MDNG+AT CDYQRPVQHLQGV ELP
Subjt:  VIEDGNEE-GAVFPTSTQSIPIWSNGGFPMSPRKSRSGIRDRKLKDRPSPLGPNGKVECISHQSAGKEDGNFRITMDNGDATLCDYQRPVQHLQGVAELP

Query:  ENNVEARVQRPSGKQVLRNKIQVEGTKVEDREEAGRLNNSSLLRSRLLAPLGIPFCSASIGGARKARPVDCGSDFSFNDIGHLLDTESLKRRMEQIAAVQ
        ENN+EARVQRP+GKQVL+        +VEDREEA + N SSLLRSRLLAPLGIPFCSASIGGA K RPVDCG +FSF+D+GHLLDTESL+RRMEQIAAVQ
Subjt:  ENNVEARVQRPSGKQVLRNKIQVEGTKVEDREEAGRLNNSSLLRSRLLAPLGIPFCSASIGGARKARPVDCGSDFSFNDIGHLLDTESLKRRMEQIAAVQ

Query:  GLGSVSADCANVLNKVLDVYLKQLIRSCVDLVGAWPAYEPEKPLAHRQQIQGKAINGMLPNNQLHGRHSNGNGDVMHERRLQCSISLLDFKVAMELNPKQ
        GLGSVSADCAN+LNKVLDVYLKQLIRSCVDLVGAWPA+EPEKPLAH QQIQGK INGMLPNNQLH  HSNGNG+V+HERRL CSISLLDFKVAMELNPKQ
Subjt:  GLGSVSADCANVLNKVLDVYLKQLIRSCVDLVGAWPAYEPEKPLAHRQQIQGKAINGMLPNNQLHGRHSNGNGDVMHERRLQCSISLLDFKVAMELNPKQ

Query:  LGEDWPLLMEKIRMRAFEE
        LGEDWPLL+EKI MRAF E
Subjt:  LGEDWPLLMEKIRMRAFEE

XP_008445087.1 PREDICTED: uncharacterized protein LOC103488231 [Cucumis melo]3.6e-20386.87Show/hide
Query:  MQPQQSLRIDLGELKSQIVKKLGTDRSKRYFFYLNRFLSQKLSKNEFDKLCCRVLGRENLLLHNQLIQSILKNACQAKAAPPIPVAGYPKTSTQSAKISP
        MQPQQSLRIDLGELKSQIVKKLG DRSKRYFFYLNRFLSQKLSKNEFDK CCRVLGRENL LHNQLIQSILKNACQAKAAPPIPVAGYPKTSTQSAKISP
Subjt:  MQPQQSLRIDLGELKSQIVKKLGTDRSKRYFFYLNRFLSQKLSKNEFDKLCCRVLGRENLLLHNQLIQSILKNACQAKAAPPIPVAGYPKTSTQSAKISP

Query:  VIEDGNEE-GAVFPTSTQSIPIWSNGGFPMSPRKSRSGIRDRKLKDRPSPLGPNGKVECISHQSAGKEDGNFRITMDNGDATLCDYQRPVQHLQGVAELP
        ++EDGNE+ GAVFPTSTQ+IP WSNG   +SPRK RSGIRDRKLKDRPS LGPNGKVECISH SA          MDNGDATLCDY+RPVQHLQGVAELP
Subjt:  VIEDGNEE-GAVFPTSTQSIPIWSNGGFPMSPRKSRSGIRDRKLKDRPSPLGPNGKVECISHQSAGKEDGNFRITMDNGDATLCDYQRPVQHLQGVAELP

Query:  ENNVEARVQRPSGKQVLRNKIQVEGTKVEDREEAGRLNNSSLLRSRLLAPLGIPFCSASIGGARKARPVDCGSDFSFNDIGHLLDTESLKRRMEQIAAVQ
        ENN+E RV +PSGKQVL NKIQVE TKVEDREEAG+ N+SSLLRSRLLAPLGIPFCSAS GG  K RPVDCG DFSF D+GHLLDTESL+RRMEQIAAVQ
Subjt:  ENNVEARVQRPSGKQVLRNKIQVEGTKVEDREEAGRLNNSSLLRSRLLAPLGIPFCSASIGGARKARPVDCGSDFSFNDIGHLLDTESLKRRMEQIAAVQ

Query:  GLGSVSADCANVLNKVLDVYLKQLIRSCVDLVGAWPAYEPEKPLAHRQQIQGKAINGMLPNNQLHGRHSNGNGDVMHERRLQCSISLLDFKVAMELNPKQ
        GLGSVSADCAN+LNKVLDVYLKQLIRSCVDLVGAWPAYEPEKPLAH+QQIQGK INGMLPNNQLHGRHSNGN +V+HE RLQCSISLLDFKVAMELNP Q
Subjt:  GLGSVSADCANVLNKVLDVYLKQLIRSCVDLVGAWPAYEPEKPLAHRQQIQGKAINGMLPNNQLHGRHSNGNGDVMHERRLQCSISLLDFKVAMELNPKQ

Query:  LGEDWPLLMEKIRMRAFEE
        LGEDWPLL+EKI MRAF E
Subjt:  LGEDWPLLMEKIRMRAFEE

XP_022962598.1 uncharacterized protein LOC111463000 [Cucurbita moschata]1.8e-20286.87Show/hide
Query:  MQPQQSLRIDLGELKSQIVKKLGTDRSKRYFFYLNRFLSQKLSKNEFDKLCCRVLGRENLLLHNQLIQSILKNACQAKAAPPIPVAGYPKTSTQSAKISP
        MQPQQSLRIDL ELKSQIVKKLGTDRSKRYFFYLNRFLSQKLSKNEFDKLCCRVLGRENL LHNQLIQSILKNACQAKAAPPIP AGYPKTSTQ+AKISP
Subjt:  MQPQQSLRIDLGELKSQIVKKLGTDRSKRYFFYLNRFLSQKLSKNEFDKLCCRVLGRENLLLHNQLIQSILKNACQAKAAPPIPVAGYPKTSTQSAKISP

Query:  VIEDGNEE-GAVFPTSTQSIPIWSNGGFPMSPRKSRSGIRDRKLKDRPSPLGPNGKVECISHQSAGKEDGNFRITMDNGDATLCDYQRPVQHLQGVAELP
        VIEDGNE+ GAVFPTSTQ IPIWSN GFP+SPRK RSGIRDRKLKDRPS L PN KVECIS QSA KEDG+ RI MDNG+AT CDYQRPVQHLQGV ELP
Subjt:  VIEDGNEE-GAVFPTSTQSIPIWSNGGFPMSPRKSRSGIRDRKLKDRPSPLGPNGKVECISHQSAGKEDGNFRITMDNGDATLCDYQRPVQHLQGVAELP

Query:  ENNVEARVQRPSGKQVLRNKIQVEGTKVEDREEAGRLNNSSLLRSRLLAPLGIPFCSASIGGARKARPVDCGSDFSFNDIGHLLDTESLKRRMEQIAAVQ
        ENN+EARVQRPSGKQVL+        +VEDREEA + N SSLLRSRLLAPLGIPFCSASIGGA K RPVDCG +FSF+D+GHLLDTESL+RRMEQIAAVQ
Subjt:  ENNVEARVQRPSGKQVLRNKIQVEGTKVEDREEAGRLNNSSLLRSRLLAPLGIPFCSASIGGARKARPVDCGSDFSFNDIGHLLDTESLKRRMEQIAAVQ

Query:  GLGSVSADCANVLNKVLDVYLKQLIRSCVDLVGAWPAYEPEKPLAHRQQIQGKAINGMLPNNQLHGRHSNGNGDVMHERRLQCSISLLDFKVAMELNPKQ
        GLGSVSADCAN+LNKVLDVYLKQLIRSCVDLVGAWPA+EPEKPLAH QQIQGK INGMLPNNQLH  HSNGNG+V+HERRL CSISLLDFKVAMELNPKQ
Subjt:  GLGSVSADCANVLNKVLDVYLKQLIRSCVDLVGAWPAYEPEKPLAHRQQIQGKAINGMLPNNQLHGRHSNGNGDVMHERRLQCSISLLDFKVAMELNPKQ

Query:  LGEDWPLLMEKIRMRAFEE
        LGEDWPLL+EKI MRAF E
Subjt:  LGEDWPLLMEKIRMRAFEE

XP_022997521.1 uncharacterized protein LOC111492414 [Cucurbita maxima]2.8e-20387.35Show/hide
Query:  MQPQQSLRIDLGELKSQIVKKLGTDRSKRYFFYLNRFLSQKLSKNEFDKLCCRVLGRENLLLHNQLIQSILKNACQAKAAPPIPVAGYPKTSTQSAKISP
        MQPQQSLRIDL ELKSQIVKKLGTDRSKRYFFYLNRFLSQKLSKNEFDKLCCRVLGRENL LHNQLIQSILKNACQAKAAPPIP AGYPKTSTQ+AKISP
Subjt:  MQPQQSLRIDLGELKSQIVKKLGTDRSKRYFFYLNRFLSQKLSKNEFDKLCCRVLGRENLLLHNQLIQSILKNACQAKAAPPIPVAGYPKTSTQSAKISP

Query:  VIEDGNEE-GAVFPTSTQSIPIWSNGGFPMSPRKSRSGIRDRKLKDRPSPLGPNGKVECISHQSAGKEDGNFRITMDNGDATLCDYQRPVQHLQGVAELP
        VIEDGNE+ GAVF TSTQ IPIWSN GF MSPRK RSGIRDRKLKDRPS L PN KVECIS QSA KEDG+ RI MDNG+AT CDYQRPVQHLQGV ELP
Subjt:  VIEDGNEE-GAVFPTSTQSIPIWSNGGFPMSPRKSRSGIRDRKLKDRPSPLGPNGKVECISHQSAGKEDGNFRITMDNGDATLCDYQRPVQHLQGVAELP

Query:  ENNVEARVQRPSGKQVLRNKIQVEGTKVEDREEAGRLNNSSLLRSRLLAPLGIPFCSASIGGARKARPVDCGSDFSFNDIGHLLDTESLKRRMEQIAAVQ
        ENN+EARVQRPSGKQVL  ++QVEGTKVEDREEA + N SSLLRSRLLAPLGIPFCSASIGGA K RPVDCG +FSF+D+GHLLDTESL+RRMEQIAAVQ
Subjt:  ENNVEARVQRPSGKQVLRNKIQVEGTKVEDREEAGRLNNSSLLRSRLLAPLGIPFCSASIGGARKARPVDCGSDFSFNDIGHLLDTESLKRRMEQIAAVQ

Query:  GLGSVSADCANVLNKVLDVYLKQLIRSCVDLVGAWPAYEPEKPLAHRQQIQGKAINGMLPNNQLHGRHSNGNGDVMHERRLQCSISLLDFKVAMELNPKQ
        GLGSVSADCAN+LNKVLDVYLKQLIRSCVDLVG WP +EPEKPLAH QQIQGK INGMLPNNQLH  HSNGN +V+HERRL CSISLLDFKVAMELNPKQ
Subjt:  GLGSVSADCANVLNKVLDVYLKQLIRSCVDLVGAWPAYEPEKPLAHRQQIQGKAINGMLPNNQLHGRHSNGNGDVMHERRLQCSISLLDFKVAMELNPKQ

Query:  LGEDWPLLMEKIRMRAFEE
        LGEDWPLL+EKI MRAF E
Subjt:  LGEDWPLLMEKIRMRAFEE

XP_023546134.1 uncharacterized protein LOC111805335 [Cucurbita pepo subsp. pepo]2.7e-20688.07Show/hide
Query:  MQPQQSLRIDLGELKSQIVKKLGTDRSKRYFFYLNRFLSQKLSKNEFDKLCCRVLGRENLLLHNQLIQSILKNACQAKAAPPIPVAGYPKTSTQSAKISP
        MQPQQSLRIDL ELKSQIVKKLGTDRSKRYFFYLNRFLSQKLSKNEFDKLCCRVLGRENL LHNQLIQSILKNACQAKAAPPIP AGYPKTSTQ+AKISP
Subjt:  MQPQQSLRIDLGELKSQIVKKLGTDRSKRYFFYLNRFLSQKLSKNEFDKLCCRVLGRENLLLHNQLIQSILKNACQAKAAPPIPVAGYPKTSTQSAKISP

Query:  VIEDGNEE-GAVFPTSTQSIPIWSNGGFPMSPRKSRSGIRDRKLKDRPSPLGPNGKVECISHQSAGKEDGNFRITMDNGDATLCDYQRPVQHLQGVAELP
        VIEDGNE+ GAVFPTSTQ IPIWSN GFP+SPRK RSGIRDRKLKDRPS L PN KVECIS QSA KEDG+ RI +DNG+AT CDYQRPVQHLQGV ELP
Subjt:  VIEDGNEE-GAVFPTSTQSIPIWSNGGFPMSPRKSRSGIRDRKLKDRPSPLGPNGKVECISHQSAGKEDGNFRITMDNGDATLCDYQRPVQHLQGVAELP

Query:  ENNVEARVQRPSGKQVLRNKIQVEGTKVEDREEAGRLNNSSLLRSRLLAPLGIPFCSASIGGARKARPVDCGSDFSFNDIGHLLDTESLKRRMEQIAAVQ
        ENN+EARVQRPSGKQVL  ++QVEGTKVEDREEA + N SSLLRSRLLAPLGIPFCSASIGGA K RPVDCG +FSF+D+GHLLDTESL+RRMEQIAAVQ
Subjt:  ENNVEARVQRPSGKQVLRNKIQVEGTKVEDREEAGRLNNSSLLRSRLLAPLGIPFCSASIGGARKARPVDCGSDFSFNDIGHLLDTESLKRRMEQIAAVQ

Query:  GLGSVSADCANVLNKVLDVYLKQLIRSCVDLVGAWPAYEPEKPLAHRQQIQGKAINGMLPNNQLHGRHSNGNGDVMHERRLQCSISLLDFKVAMELNPKQ
        GLGSVSADCAN+LNKVLDVYLKQLIRSCVDLVGAWPA+EPEKPLAH QQIQGK INGMLPNNQLH  HSNGNG+V+HERRL CSISLLDFKVAMELNPKQ
Subjt:  GLGSVSADCANVLNKVLDVYLKQLIRSCVDLVGAWPAYEPEKPLAHRQQIQGKAINGMLPNNQLHGRHSNGNGDVMHERRLQCSISLLDFKVAMELNPKQ

Query:  LGEDWPLLMEKIRMRAFEE
        LGEDWPLL+EKI MRAF E
Subjt:  LGEDWPLLMEKIRMRAFEE

TrEMBL top hitse value%identityAlignment
A0A1S3BCQ5 uncharacterized protein LOC1034882311.7e-20386.87Show/hide
Query:  MQPQQSLRIDLGELKSQIVKKLGTDRSKRYFFYLNRFLSQKLSKNEFDKLCCRVLGRENLLLHNQLIQSILKNACQAKAAPPIPVAGYPKTSTQSAKISP
        MQPQQSLRIDLGELKSQIVKKLG DRSKRYFFYLNRFLSQKLSKNEFDK CCRVLGRENL LHNQLIQSILKNACQAKAAPPIPVAGYPKTSTQSAKISP
Subjt:  MQPQQSLRIDLGELKSQIVKKLGTDRSKRYFFYLNRFLSQKLSKNEFDKLCCRVLGRENLLLHNQLIQSILKNACQAKAAPPIPVAGYPKTSTQSAKISP

Query:  VIEDGNEE-GAVFPTSTQSIPIWSNGGFPMSPRKSRSGIRDRKLKDRPSPLGPNGKVECISHQSAGKEDGNFRITMDNGDATLCDYQRPVQHLQGVAELP
        ++EDGNE+ GAVFPTSTQ+IP WSNG   +SPRK RSGIRDRKLKDRPS LGPNGKVECISH SA          MDNGDATLCDY+RPVQHLQGVAELP
Subjt:  VIEDGNEE-GAVFPTSTQSIPIWSNGGFPMSPRKSRSGIRDRKLKDRPSPLGPNGKVECISHQSAGKEDGNFRITMDNGDATLCDYQRPVQHLQGVAELP

Query:  ENNVEARVQRPSGKQVLRNKIQVEGTKVEDREEAGRLNNSSLLRSRLLAPLGIPFCSASIGGARKARPVDCGSDFSFNDIGHLLDTESLKRRMEQIAAVQ
        ENN+E RV +PSGKQVL NKIQVE TKVEDREEAG+ N+SSLLRSRLLAPLGIPFCSAS GG  K RPVDCG DFSF D+GHLLDTESL+RRMEQIAAVQ
Subjt:  ENNVEARVQRPSGKQVLRNKIQVEGTKVEDREEAGRLNNSSLLRSRLLAPLGIPFCSASIGGARKARPVDCGSDFSFNDIGHLLDTESLKRRMEQIAAVQ

Query:  GLGSVSADCANVLNKVLDVYLKQLIRSCVDLVGAWPAYEPEKPLAHRQQIQGKAINGMLPNNQLHGRHSNGNGDVMHERRLQCSISLLDFKVAMELNPKQ
        GLGSVSADCAN+LNKVLDVYLKQLIRSCVDLVGAWPAYEPEKPLAH+QQIQGK INGMLPNNQLHGRHSNGN +V+HE RLQCSISLLDFKVAMELNP Q
Subjt:  GLGSVSADCANVLNKVLDVYLKQLIRSCVDLVGAWPAYEPEKPLAHRQQIQGKAINGMLPNNQLHGRHSNGNGDVMHERRLQCSISLLDFKVAMELNPKQ

Query:  LGEDWPLLMEKIRMRAFEE
        LGEDWPLL+EKI MRAF E
Subjt:  LGEDWPLLMEKIRMRAFEE

A0A5A7VF96 SAGA-Tad1 domain-containing protein1.7e-20386.87Show/hide
Query:  MQPQQSLRIDLGELKSQIVKKLGTDRSKRYFFYLNRFLSQKLSKNEFDKLCCRVLGRENLLLHNQLIQSILKNACQAKAAPPIPVAGYPKTSTQSAKISP
        MQPQQSLRIDLGELKSQIVKKLG DRSKRYFFYLNRFLSQKLSKNEFDK CCRVLGRENL LHNQLIQSILKNACQAKAAPPIPVAGYPKTSTQSAKISP
Subjt:  MQPQQSLRIDLGELKSQIVKKLGTDRSKRYFFYLNRFLSQKLSKNEFDKLCCRVLGRENLLLHNQLIQSILKNACQAKAAPPIPVAGYPKTSTQSAKISP

Query:  VIEDGNEE-GAVFPTSTQSIPIWSNGGFPMSPRKSRSGIRDRKLKDRPSPLGPNGKVECISHQSAGKEDGNFRITMDNGDATLCDYQRPVQHLQGVAELP
        ++EDGNE+ GAVFPTSTQ+IP WSNG   +SPRK RSGIRDRKLKDRPS LGPNGKVECISH SA          MDNGDATLCDY+RPVQHLQGVAELP
Subjt:  VIEDGNEE-GAVFPTSTQSIPIWSNGGFPMSPRKSRSGIRDRKLKDRPSPLGPNGKVECISHQSAGKEDGNFRITMDNGDATLCDYQRPVQHLQGVAELP

Query:  ENNVEARVQRPSGKQVLRNKIQVEGTKVEDREEAGRLNNSSLLRSRLLAPLGIPFCSASIGGARKARPVDCGSDFSFNDIGHLLDTESLKRRMEQIAAVQ
        ENN+E RV +PSGKQVL NKIQVE TKVEDREEAG+ N+SSLLRSRLLAPLGIPFCSAS GG  K RPVDCG DFSF D+GHLLDTESL+RRMEQIAAVQ
Subjt:  ENNVEARVQRPSGKQVLRNKIQVEGTKVEDREEAGRLNNSSLLRSRLLAPLGIPFCSASIGGARKARPVDCGSDFSFNDIGHLLDTESLKRRMEQIAAVQ

Query:  GLGSVSADCANVLNKVLDVYLKQLIRSCVDLVGAWPAYEPEKPLAHRQQIQGKAINGMLPNNQLHGRHSNGNGDVMHERRLQCSISLLDFKVAMELNPKQ
        GLGSVSADCAN+LNKVLDVYLKQLIRSCVDLVGAWPAYEPEKPLAH+QQIQGK INGMLPNNQLHGRHSNGN +V+HE RLQCSISLLDFKVAMELNP Q
Subjt:  GLGSVSADCANVLNKVLDVYLKQLIRSCVDLVGAWPAYEPEKPLAHRQQIQGKAINGMLPNNQLHGRHSNGNGDVMHERRLQCSISLLDFKVAMELNPKQ

Query:  LGEDWPLLMEKIRMRAFEE
        LGEDWPLL+EKI MRAF E
Subjt:  LGEDWPLLMEKIRMRAFEE

A0A6J1BTJ5 uncharacterized protein LOC1110052061.1e-20085.99Show/hide
Query:  MQPQQSLRIDLGELKSQIVKKLGTDRSKRYFFYLNRFLSQKLSKNEFDKLCCRVLGRENLLLHNQLIQSILKNACQAKAAPPIPVAGYPKTSTQSAKISP
        MQPQQSLRIDLGELKSQIVKKLGTDRSKRYFFYLNRFLSQKLSKNEFDKLC RVLGR+NL LHNQLIQSILKNACQAKAAPP+PVAGYPKTSTQSAK+SP
Subjt:  MQPQQSLRIDLGELKSQIVKKLGTDRSKRYFFYLNRFLSQKLSKNEFDKLCCRVLGRENLLLHNQLIQSILKNACQAKAAPPIPVAGYPKTSTQSAKISP

Query:  VIEDGNEE-GAVFPTSTQSIPIWSNGGFPMSPRKSRSGIRDRKLKDRPSPLGPNGKVECISHQSAGKEDGNFRITMDNGDATLCDYQRPVQHLQGVAELP
        VIEDGNE+ GAV+PTSTQSIPIWSNGGFP SPRKSRSGIRDRKLKDRPSPLGPNGKVECISHQSAGK+DG+ ++ M NGDATLCDYQRPVQHLQGVAELP
Subjt:  VIEDGNEE-GAVFPTSTQSIPIWSNGGFPMSPRKSRSGIRDRKLKDRPSPLGPNGKVECISHQSAGKEDGNFRITMDNGDATLCDYQRPVQHLQGVAELP

Query:  ENNVEARVQRPSGKQVLRNKIQVEGTKVEDREEAGRLNNSSLLRSRLLAPLGIPFCSASIGGARKARPVDCGSDF-SFNDIGHLLDTESLKRRMEQIAAV
        ENN+EAR+ RP+GKQVL NKI  EGTKV DREEAG   +S LL+SRLLAPLGIPFCSASIGGARKARP D G DF SF+DIGHL DTESL+RRMEQIAAV
Subjt:  ENNVEARVQRPSGKQVLRNKIQVEGTKVEDREEAGRLNNSSLLRSRLLAPLGIPFCSASIGGARKARPVDCGSDF-SFNDIGHLLDTESLKRRMEQIAAV

Query:  QGLGSVSADCANVLNKVLDVYLKQLIRSCVDLVGAWP-AYEPEKPLAHRQQIQGKAINGMLPNNQLHGRHSNGNGDVMHERRLQCSISLLDFKVAMELNP
         GLGSVSAD AN+LNKVLDVYLKQLIRSCV LVG  P   EPEKPL  + Q+QGK INGMLPNNQLHGRHSNG+ +VMHE RL+CS+SLLDFKVAMELNP
Subjt:  QGLGSVSADCANVLNKVLDVYLKQLIRSCVDLVGAWP-AYEPEKPLAHRQQIQGKAINGMLPNNQLHGRHSNGNGDVMHERRLQCSISLLDFKVAMELNP

Query:  KQLGEDWPLLMEKIRMRAFEE
        KQLGEDWPLL+EKIRMRAFEE
Subjt:  KQLGEDWPLLMEKIRMRAFEE

A0A6J1HF85 uncharacterized protein LOC1114630008.7e-20386.87Show/hide
Query:  MQPQQSLRIDLGELKSQIVKKLGTDRSKRYFFYLNRFLSQKLSKNEFDKLCCRVLGRENLLLHNQLIQSILKNACQAKAAPPIPVAGYPKTSTQSAKISP
        MQPQQSLRIDL ELKSQIVKKLGTDRSKRYFFYLNRFLSQKLSKNEFDKLCCRVLGRENL LHNQLIQSILKNACQAKAAPPIP AGYPKTSTQ+AKISP
Subjt:  MQPQQSLRIDLGELKSQIVKKLGTDRSKRYFFYLNRFLSQKLSKNEFDKLCCRVLGRENLLLHNQLIQSILKNACQAKAAPPIPVAGYPKTSTQSAKISP

Query:  VIEDGNEE-GAVFPTSTQSIPIWSNGGFPMSPRKSRSGIRDRKLKDRPSPLGPNGKVECISHQSAGKEDGNFRITMDNGDATLCDYQRPVQHLQGVAELP
        VIEDGNE+ GAVFPTSTQ IPIWSN GFP+SPRK RSGIRDRKLKDRPS L PN KVECIS QSA KEDG+ RI MDNG+AT CDYQRPVQHLQGV ELP
Subjt:  VIEDGNEE-GAVFPTSTQSIPIWSNGGFPMSPRKSRSGIRDRKLKDRPSPLGPNGKVECISHQSAGKEDGNFRITMDNGDATLCDYQRPVQHLQGVAELP

Query:  ENNVEARVQRPSGKQVLRNKIQVEGTKVEDREEAGRLNNSSLLRSRLLAPLGIPFCSASIGGARKARPVDCGSDFSFNDIGHLLDTESLKRRMEQIAAVQ
        ENN+EARVQRPSGKQVL+        +VEDREEA + N SSLLRSRLLAPLGIPFCSASIGGA K RPVDCG +FSF+D+GHLLDTESL+RRMEQIAAVQ
Subjt:  ENNVEARVQRPSGKQVLRNKIQVEGTKVEDREEAGRLNNSSLLRSRLLAPLGIPFCSASIGGARKARPVDCGSDFSFNDIGHLLDTESLKRRMEQIAAVQ

Query:  GLGSVSADCANVLNKVLDVYLKQLIRSCVDLVGAWPAYEPEKPLAHRQQIQGKAINGMLPNNQLHGRHSNGNGDVMHERRLQCSISLLDFKVAMELNPKQ
        GLGSVSADCAN+LNKVLDVYLKQLIRSCVDLVGAWPA+EPEKPLAH QQIQGK INGMLPNNQLH  HSNGNG+V+HERRL CSISLLDFKVAMELNPKQ
Subjt:  GLGSVSADCANVLNKVLDVYLKQLIRSCVDLVGAWPAYEPEKPLAHRQQIQGKAINGMLPNNQLHGRHSNGNGDVMHERRLQCSISLLDFKVAMELNPKQ

Query:  LGEDWPLLMEKIRMRAFEE
        LGEDWPLL+EKI MRAF E
Subjt:  LGEDWPLLMEKIRMRAFEE

A0A6J1K7Q1 uncharacterized protein LOC1114924141.3e-20387.35Show/hide
Query:  MQPQQSLRIDLGELKSQIVKKLGTDRSKRYFFYLNRFLSQKLSKNEFDKLCCRVLGRENLLLHNQLIQSILKNACQAKAAPPIPVAGYPKTSTQSAKISP
        MQPQQSLRIDL ELKSQIVKKLGTDRSKRYFFYLNRFLSQKLSKNEFDKLCCRVLGRENL LHNQLIQSILKNACQAKAAPPIP AGYPKTSTQ+AKISP
Subjt:  MQPQQSLRIDLGELKSQIVKKLGTDRSKRYFFYLNRFLSQKLSKNEFDKLCCRVLGRENLLLHNQLIQSILKNACQAKAAPPIPVAGYPKTSTQSAKISP

Query:  VIEDGNEE-GAVFPTSTQSIPIWSNGGFPMSPRKSRSGIRDRKLKDRPSPLGPNGKVECISHQSAGKEDGNFRITMDNGDATLCDYQRPVQHLQGVAELP
        VIEDGNE+ GAVF TSTQ IPIWSN GF MSPRK RSGIRDRKLKDRPS L PN KVECIS QSA KEDG+ RI MDNG+AT CDYQRPVQHLQGV ELP
Subjt:  VIEDGNEE-GAVFPTSTQSIPIWSNGGFPMSPRKSRSGIRDRKLKDRPSPLGPNGKVECISHQSAGKEDGNFRITMDNGDATLCDYQRPVQHLQGVAELP

Query:  ENNVEARVQRPSGKQVLRNKIQVEGTKVEDREEAGRLNNSSLLRSRLLAPLGIPFCSASIGGARKARPVDCGSDFSFNDIGHLLDTESLKRRMEQIAAVQ
        ENN+EARVQRPSGKQVL  ++QVEGTKVEDREEA + N SSLLRSRLLAPLGIPFCSASIGGA K RPVDCG +FSF+D+GHLLDTESL+RRMEQIAAVQ
Subjt:  ENNVEARVQRPSGKQVLRNKIQVEGTKVEDREEAGRLNNSSLLRSRLLAPLGIPFCSASIGGARKARPVDCGSDFSFNDIGHLLDTESLKRRMEQIAAVQ

Query:  GLGSVSADCANVLNKVLDVYLKQLIRSCVDLVGAWPAYEPEKPLAHRQQIQGKAINGMLPNNQLHGRHSNGNGDVMHERRLQCSISLLDFKVAMELNPKQ
        GLGSVSADCAN+LNKVLDVYLKQLIRSCVDLVG WP +EPEKPLAH QQIQGK INGMLPNNQLH  HSNGN +V+HERRL CSISLLDFKVAMELNPKQ
Subjt:  GLGSVSADCANVLNKVLDVYLKQLIRSCVDLVGAWPAYEPEKPLAHRQQIQGKAINGMLPNNQLHGRHSNGNGDVMHERRLQCSISLLDFKVAMELNPKQ

Query:  LGEDWPLLMEKIRMRAFEE
        LGEDWPLL+EKI MRAF E
Subjt:  LGEDWPLLMEKIRMRAFEE

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G14850.1 unknown protein1.1e-3729.81Show/hide
Query:  RIDLGELKSQIVKKLGTDRSKRYFFYLNRFLSQKLSKNEFDKLCCRVLGRENLLLHNQLIQSILKNACQAKAAPPIPVAGYPKTSTQSAKISPVIEDGNE
        R++  E+K+ I +K+G  R+  YF  L +FL+ ++SK+EFDKLC + +GREN+ LHN+L++SILKNA  AK+ PP     YPK S               
Subjt:  RIDLGELKSQIVKKLGTDRSKRYFFYLNRFLSQKLSKNEFDKLCCRVLGRENLLLHNQLIQSILKNACQAKAAPPIPVAGYPKTSTQSAKISPVIEDGNE

Query:  EGAVFPTSTQSIPIWSNGGFPMSPRKSRSGIRDRKLKDRPSPLGPNGKVECISHQSAGKEDGNFRITMDNGDATLCDYQRPVQHLQGVAELPENNVEARV
                     ++ +  FP SPRK RS    RK +DRPSPLGP GK + ++                  D ++   QR                    
Subjt:  EGAVFPTSTQSIPIWSNGGFPMSPRKSRSGIRDRKLKDRPSPLGPNGKVECISHQSAGKEDGNFRITMDNGDATLCDYQRPVQHLQGVAELPENNVEARV

Query:  QRPSGKQVLRNKIQVEGTKVEDREEAGRLNNSSLLRSR--LLAPLGIPF---CSASIGGARKARPVDCGSDFSFNDIGHLLDTESLKRRMEQIAAVQGLG
                    + +E   VED EE  ++  S  ++SR  L APLG+ F     A            C S       G L D  +L+ R+E+   ++G+ 
Subjt:  QRPSGKQVLRNKIQVEGTKVEDREEAGRLNNSSLLRSR--LLAPLGIPF---CSASIGGARKARPVDCGSDFSFNDIGHLLDTESLKRRMEQIAAVQGLG

Query:  SVSADCANVLNKVLDVYLKQLIRSCVDLVGAWPAYEPEKPLAHRQQIQGKAINGMLPNNQLHGRHSNGNGDVMHERRLQCSISLLDFKVAMELNPKQLGE
         +S D AN+LN+ L+ Y+++LI  C+ L                                              ++R   ++S+LDF  AME+NP+ LGE
Subjt:  SVSADCANVLNKVLDVYLKQLIRSCVDLVGAWPAYEPEKPLAHRQQIQGKAINGMLPNNQLHGRHSNGNGDVMHERRLQCSISLLDFKVAMELNPKQLGE

Query:  DWPLLMEKIRMRAFEE
        +WP+ +EKI  RA EE
Subjt:  DWPLLMEKIRMRAFEE

AT2G24530.1 unknown protein2.8e-9748.23Show/hide
Query:  MQPQQSLRIDLGELKSQIVKKLGTDRSKRYFFYLNRFLSQKLSKNEFDKLCCRVLGRENLLLHNQLIQSILKNACQAKAAPPIPVAGYPKTSTQSAKISP
        MQ  Q  RI L ELK  IVKK G +RS+RYF+YL RFLSQKL+K+EFDK C R+LGRENL LHNQLI+SIL+NA  AK+ PP   AG+   ST++     
Subjt:  MQPQQSLRIDLGELKSQIVKKLGTDRSKRYFFYLNRFLSQKLSKNEFDKLCCRVLGRENLLLHNQLIQSILKNACQAKAAPPIPVAGYPKTSTQSAKISP

Query:  VIEDGNEEGAVFPTSTQSIPIWSNGGFPMSPRKSRSGIRDRKLKDRPSPLGPNGKVECISHQSAGKEDGNFRITMDNGDATLCDYQRPVQHLQGVAELPE
          +   + G + P  +Q  P+WSNG  P+SPRK RSG+++RK +DRPSPLG NGKVE + HQ   +ED    + M+NG     DYQR  +++        
Subjt:  VIEDGNEEGAVFPTSTQSIPIWSNGGFPMSPRKSRSGIRDRKLKDRPSPLGPNGKVECISHQSAGKEDGNFRITMDNGDATLCDYQRPVQHLQGVAELPE

Query:  NNVEARVQRPSGKQVLRNKIQVEGTKVED---REEAGRLNNSSLLRSRLLAPLGIPFCSASIGGARKARPVDCGSD-FSFNDIGHLLDTESLKRRMEQIA
        +  +    RP  K  + NK ++    + D   +EE  R+N   L  S L+APLGIPFCSAS+GG+ +  PV   ++  S  D G L D E L++RME IA
Subjt:  NNVEARVQRPSGKQVLRNKIQVEGTKVED---REEAGRLNNSSLLRSRLLAPLGIPFCSASIGGARKARPVDCGSD-FSFNDIGHLLDTESLKRRMEQIA

Query:  AVQGLGSVSADCANVLNKVLDVYLKQLIRSCVDLVGAWPAY-EPEKPLAHRQQIQGKAINGMLPNNQLHGRHSNGNGDVMHERRLQCSISLLDFKVAMEL
          QGL  VS +CA  LN +LDVYLK+LI SC DLVGA     +P K    +QQ Q K +NG+ P N L  +  NG+ D+  +     S+S+LDF+ AMEL
Subjt:  AVQGLGSVSADCANVLNKVLDVYLKQLIRSCVDLVGAWPAY-EPEKPLAHRQQIQGKAINGMLPNNQLHGRHSNGNGDVMHERRLQCSISLLDFKVAMEL

Query:  NPKQLGEDWPLLMEKIRMRAFEE
        NP+QLGEDWP L E+I +R+FEE
Subjt:  NPKQLGEDWPLLMEKIRMRAFEE

AT4G31440.1 unknown protein3.6e-7645.02Show/hide
Query:  MQPQQSLRIDLGELKSQIVKKLGTDRSKRYFFYLNRFLSQKLSKNEFDKLCCRVLGRENLLLHNQLIQSILKNACQAKAAPPIPVAGYPKTSTQSAKISP
        MQ  Q  RIDL ELK  IVKK+G +RS RYF+YL RFLSQKL+K+EFDK C R+LGRENL LHN+LI+SIL+NA  AK+ P +  +G+P  S    K   
Subjt:  MQPQQSLRIDLGELKSQIVKKLGTDRSKRYFFYLNRFLSQKLSKNEFDKLCCRVLGRENLLLHNQLIQSILKNACQAKAAPPIPVAGYPKTSTQSAKISP

Query:  VIEDGNEEG-AVFPTSTQSIPIWSNGGFPMSPRKSRSG-IRDRKLKDRPSPLGPNGKVECISHQSAGKEDGNFRITMDNGDATLCDYQRPVQHLQGVAEL
          EDG EE  ++ P   ++    SNG       K R G   DR ++D+P PLG NGKV            G F             Y RP ++       
Subjt:  VIEDGNEEG-AVFPTSTQSIPIWSNGGFPMSPRKSRSG-IRDRKLKDRPSPLGPNGKVECISHQSAGKEDGNFRITMDNGDATLCDYQRPVQHLQGVAEL

Query:  PENNVEARVQRPSGKQVLRNKIQVEGTKVEDREEAGRLNNSSLLRSRLLAPLGIPFCSASIGGARKARPVD-CGSDFSFNDIGHLLDTESLKRRMEQIAA
        P+   ++    P+ ++ +  K QV      D E   R+    L    ++APLGIPFCSAS+GG R+  PV    +  S  D G L DTE L++RME IA 
Subjt:  PENNVEARVQRPSGKQVLRNKIQVEGTKVEDREEAGRLNNSSLLRSRLLAPLGIPFCSASIGGARKARPVD-CGSDFSFNDIGHLLDTESLKRRMEQIAA

Query:  VQGLGSVSADCANVLNKVLDVYLKQLIRSCVDLVGAWPAY-EPEKPLAHRQQIQGKAINGMLPNNQLHGRHSNGNGDVMHERRLQCSISLLDFKVAMELN
         QGLG VSA+C+ VLN +LD+YLK+L++SCVDL GA      P K    +QQ + + +NG+  NN  H + SN   D+  E   Q S+SLLDF+VAMELN
Subjt:  VQGLGSVSADCANVLNKVLDVYLKQLIRSCVDLVGAWPAY-EPEKPLAHRQQIQGKAINGMLPNNQLHGRHSNGNGDVMHERRLQCSISLLDFKVAMELN

Query:  PKQLGEDWPLLMEKIRMRAFEE
        P QLGEDWPLL E+I +  FEE
Subjt:  PKQLGEDWPLLMEKIRMRAFEE

AT4G33890.1 unknown protein5.1e-3830.26Show/hide
Query:  QQSLRIDLGELKSQIVKKLGTDRSKRYFFYLNRFLSQKLSKNEFDKLCCRVLGRENLLLHNQLIQSILKNACQAKAAPPIPVAGYPKTSTQSAKISPVIE
        Q S R+D  E+K+ I +++G  R++ YF  L RF + K++K+EFDKLC + +GR+N+ LHN+LI+SI+KNAC AK+ P I   G              + 
Subjt:  QQSLRIDLGELKSQIVKKLGTDRSKRYFFYLNRFLSQKLSKNEFDKLCCRVLGRENLLLHNQLIQSILKNACQAKAAPPIPVAGYPKTSTQSAKISPVIE

Query:  DGNEEGAVFPTSTQSIPIWSNGGFPMSPRKSRSGIRDRKLKDRPSPLGPNGKVECISHQSAGKEDGNFRITMDNGDATLCDYQRPVQHLQGVAELPENNV
         GN +      ++Q  P+  +  F  S RK RS    RKL+DRPSPLGP GK   ++  +                      +  +   Q   EL     
Subjt:  DGNEEGAVFPTSTQSIPIWSNGGFPMSPRKSRSGIRDRKLKDRPSPLGPNGKVECISHQSAGKEDGNFRITMDNGDATLCDYQRPVQHLQGVAELPENNV

Query:  EARVQRPSGKQVLRNKIQVEGTKVEDREEAGRL-NNSSLLRSR--LLAPLGIPFCSASIGGARKARPVDCGSDFSFN-----DIGHLLDTESLKRRMEQI
                    L ++  VE   VE+ EE  ++   S  ++SR  L APLG+   S   G  RK+         SFN     + G L DT +L+ R+E+ 
Subjt:  EARVQRPSGKQVLRNKIQVEGTKVEDREEAGRL-NNSSLLRSR--LLAPLGIPFCSASIGGARKARPVDCGSDFSFN-----DIGHLLDTESLKRRMEQI

Query:  AAVQGLGSVSADCANVLNKVLDVYLKQLIRSCVDLVGAWPAYEPEKPLAHRQQIQGKAINGMLPNNQLHGRHSNGNGDVMHERRLQCSISLLDFKVAMEL
          ++GL  ++ D  ++LN  LDV++++LI  C+ L       +  + + ++   Q + ++                            +S+ DF+  MEL
Subjt:  AAVQGLGSVSADCANVLNKVLDVYLKQLIRSCVDLVGAWPAYEPEKPLAHRQQIQGKAINGMLPNNQLHGRHSNGNGDVMHERRLQCSISLLDFKVAMEL

Query:  NPKQLGEDWPLLMEKIRMRAFEE
        N + LGEDWP+ MEKI  RA ++
Subjt:  NPKQLGEDWPLLMEKIRMRAFEE

AT4G33890.2 unknown protein5.1e-3830.26Show/hide
Query:  QQSLRIDLGELKSQIVKKLGTDRSKRYFFYLNRFLSQKLSKNEFDKLCCRVLGRENLLLHNQLIQSILKNACQAKAAPPIPVAGYPKTSTQSAKISPVIE
        Q S R+D  E+K+ I +++G  R++ YF  L RF + K++K+EFDKLC + +GR+N+ LHN+LI+SI+KNAC AK+ P I   G              + 
Subjt:  QQSLRIDLGELKSQIVKKLGTDRSKRYFFYLNRFLSQKLSKNEFDKLCCRVLGRENLLLHNQLIQSILKNACQAKAAPPIPVAGYPKTSTQSAKISPVIE

Query:  DGNEEGAVFPTSTQSIPIWSNGGFPMSPRKSRSGIRDRKLKDRPSPLGPNGKVECISHQSAGKEDGNFRITMDNGDATLCDYQRPVQHLQGVAELPENNV
         GN +      ++Q  P+  +  F  S RK RS    RKL+DRPSPLGP GK   ++  +                      +  +   Q   EL     
Subjt:  DGNEEGAVFPTSTQSIPIWSNGGFPMSPRKSRSGIRDRKLKDRPSPLGPNGKVECISHQSAGKEDGNFRITMDNGDATLCDYQRPVQHLQGVAELPENNV

Query:  EARVQRPSGKQVLRNKIQVEGTKVEDREEAGRL-NNSSLLRSR--LLAPLGIPFCSASIGGARKARPVDCGSDFSFN-----DIGHLLDTESLKRRMEQI
                    L ++  VE   VE+ EE  ++   S  ++SR  L APLG+   S   G  RK+         SFN     + G L DT +L+ R+E+ 
Subjt:  EARVQRPSGKQVLRNKIQVEGTKVEDREEAGRL-NNSSLLRSR--LLAPLGIPFCSASIGGARKARPVDCGSDFSFN-----DIGHLLDTESLKRRMEQI

Query:  AAVQGLGSVSADCANVLNKVLDVYLKQLIRSCVDLVGAWPAYEPEKPLAHRQQIQGKAINGMLPNNQLHGRHSNGNGDVMHERRLQCSISLLDFKVAMEL
          ++GL  ++ D  ++LN  LDV++++LI  C+ L       +  + + ++   Q + ++                            +S+ DF+  MEL
Subjt:  AAVQGLGSVSADCANVLNKVLDVYLKQLIRSCVDLVGAWPAYEPEKPLAHRQQIQGKAINGMLPNNQLHGRHSNGNGDVMHERRLQCSISLLDFKVAMEL

Query:  NPKQLGEDWPLLMEKIRMRAFEE
        N + LGEDWP+ MEKI  RA ++
Subjt:  NPKQLGEDWPLLMEKIRMRAFEE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCAACCTCAGCAGAGCTTGAGAATTGACTTGGGTGAATTGAAATCTCAGATAGTGAAGAAGCTCGGAACTGATCGGTCAAAACGGTACTTCTTTTACTTGAATAGGTT
CTTGAGTCAAAAGCTGAGTAAGAATGAGTTCGATAAGTTATGTTGTCGTGTACTTGGGAGGGAGAATCTTTTGCTGCATAATCAATTGATACAGTCAATTTTGAAGAATG
CATGTCAAGCTAAGGCTGCACCACCAATACCTGTAGCAGGTTATCCGAAAACTTCGACACAATCTGCAAAAATTTCCCCTGTTATAGAAGATGGGAATGAGGAGGGAGCT
GTTTTTCCTACTTCCACTCAAAGTATTCCCATTTGGTCCAATGGAGGATTTCCAATGTCCCCAAGAAAGAGCAGGTCTGGGATACGCGACCGCAAACTCAAAGACAGGCC
GAGTCCGCTAGGACCCAATGGGAAGGTTGAATGTATATCACATCAATCAGCAGGCAAGGAAGATGGAAACTTTCGAATCACGATGGATAATGGTGATGCAACTCTGTGTG
ACTATCAGAGACCAGTGCAGCATCTGCAAGGAGTTGCTGAACTACCTGAAAACAATGTTGAGGCTAGAGTTCAGCGTCCATCAGGAAAGCAAGTCCTACGCAATAAGATC
CAGGTTGAAGGAACTAAGGTTGAAGACAGGGAAGAAGCTGGACGGTTGAATAACTCGAGTTTACTTCGGAGTCGATTACTTGCACCTCTTGGGATTCCTTTTTGCTCAGC
TAGTATCGGTGGGGCCCGCAAAGCTAGGCCTGTAGATTGTGGGAGTGATTTTAGCTTTAATGATATTGGTCATTTATTGGATACTGAGTCGTTGAAACGACGCATGGAAC
AAATTGCTGCAGTACAGGGCCTAGGCAGTGTTTCTGCAGATTGTGCTAATGTTTTGAATAAGGTGTTGGATGTATATTTGAAGCAGTTAATTAGGTCTTGTGTTGATTTG
GTTGGAGCATGGCCTGCATATGAGCCTGAGAAACCTCTTGCCCATAGGCAGCAGATTCAGGGGAAAGCTATCAATGGCATGTTGCCGAATAATCAATTACATGGACGGCA
TAGCAATGGAAATGGAGATGTTATGCACGAGCGCAGATTACAGTGCTCGATATCATTGCTTGACTTCAAGGTAGCAATGGAGCTTAACCCAAAGCAACTTGGGGAAGACT
GGCCTTTGCTAATGGAGAAAATTCGTATGCGTGCATTTGAGGAATGA
mRNA sequenceShow/hide mRNA sequence
ATGCAACCTCAGCAGAGCTTGAGAATTGACTTGGGTGAATTGAAATCTCAGATAGTGAAGAAGCTCGGAACTGATCGGTCAAAACGGTACTTCTTTTACTTGAATAGGTT
CTTGAGTCAAAAGCTGAGTAAGAATGAGTTCGATAAGTTATGTTGTCGTGTACTTGGGAGGGAGAATCTTTTGCTGCATAATCAATTGATACAGTCAATTTTGAAGAATG
CATGTCAAGCTAAGGCTGCACCACCAATACCTGTAGCAGGTTATCCGAAAACTTCGACACAATCTGCAAAAATTTCCCCTGTTATAGAAGATGGGAATGAGGAGGGAGCT
GTTTTTCCTACTTCCACTCAAAGTATTCCCATTTGGTCCAATGGAGGATTTCCAATGTCCCCAAGAAAGAGCAGGTCTGGGATACGCGACCGCAAACTCAAAGACAGGCC
GAGTCCGCTAGGACCCAATGGGAAGGTTGAATGTATATCACATCAATCAGCAGGCAAGGAAGATGGAAACTTTCGAATCACGATGGATAATGGTGATGCAACTCTGTGTG
ACTATCAGAGACCAGTGCAGCATCTGCAAGGAGTTGCTGAACTACCTGAAAACAATGTTGAGGCTAGAGTTCAGCGTCCATCAGGAAAGCAAGTCCTACGCAATAAGATC
CAGGTTGAAGGAACTAAGGTTGAAGACAGGGAAGAAGCTGGACGGTTGAATAACTCGAGTTTACTTCGGAGTCGATTACTTGCACCTCTTGGGATTCCTTTTTGCTCAGC
TAGTATCGGTGGGGCCCGCAAAGCTAGGCCTGTAGATTGTGGGAGTGATTTTAGCTTTAATGATATTGGTCATTTATTGGATACTGAGTCGTTGAAACGACGCATGGAAC
AAATTGCTGCAGTACAGGGCCTAGGCAGTGTTTCTGCAGATTGTGCTAATGTTTTGAATAAGGTGTTGGATGTATATTTGAAGCAGTTAATTAGGTCTTGTGTTGATTTG
GTTGGAGCATGGCCTGCATATGAGCCTGAGAAACCTCTTGCCCATAGGCAGCAGATTCAGGGGAAAGCTATCAATGGCATGTTGCCGAATAATCAATTACATGGACGGCA
TAGCAATGGAAATGGAGATGTTATGCACGAGCGCAGATTACAGTGCTCGATATCATTGCTTGACTTCAAGGTAGCAATGGAGCTTAACCCAAAGCAACTTGGGGAAGACT
GGCCTTTGCTAATGGAGAAAATTCGTATGCGTGCATTTGAGGAATGA
Protein sequenceShow/hide protein sequence
MQPQQSLRIDLGELKSQIVKKLGTDRSKRYFFYLNRFLSQKLSKNEFDKLCCRVLGRENLLLHNQLIQSILKNACQAKAAPPIPVAGYPKTSTQSAKISPVIEDGNEEGA
VFPTSTQSIPIWSNGGFPMSPRKSRSGIRDRKLKDRPSPLGPNGKVECISHQSAGKEDGNFRITMDNGDATLCDYQRPVQHLQGVAELPENNVEARVQRPSGKQVLRNKI
QVEGTKVEDREEAGRLNNSSLLRSRLLAPLGIPFCSASIGGARKARPVDCGSDFSFNDIGHLLDTESLKRRMEQIAAVQGLGSVSADCANVLNKVLDVYLKQLIRSCVDL
VGAWPAYEPEKPLAHRQQIQGKAINGMLPNNQLHGRHSNGNGDVMHERRLQCSISLLDFKVAMELNPKQLGEDWPLLMEKIRMRAFEE