; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MS001255 (gene) of Bitter gourd (TR) v1 genome

Gene IDMS001255
OrganismMomordica charantia cv. TR (Bitter gourd (TR) v1)
DescriptionSAGA-Tad1 domain-containing protein
Genome locationscaffold36:2687404..2688663
RNA-Seq ExpressionMS001255
SyntenyMS001255
Gene Ontology termsGO:0006357 - regulation of transcription by RNA polymerase II (biological process)
GO:0045893 - positive regulation of transcription, DNA-templated (biological process)
GO:0000124 - SAGA complex (cellular component)
GO:0003713 - transcription coactivator activity (molecular function)
InterPro domainsIPR024738 - Transcriptional coactivator Hfi1/Transcriptional adapter 1


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7029751.1 hypothetical protein SDJN02_08093, partial [Cucurbita argyrosperma subsp. argyrosperma]1.6e-18782.42Show/hide
Query:  MQPQQSLRIDLGELKSQIVKKLGTDRSKRYFFYLNRFLSQKLSKNEFDKLCRRVLGRDNLWLHNQLIQSILKNACQAKAAPPLPVAGYPKTSTQSAKVSP
        MQPQQSLRIDL ELKSQIVKKLGTDRSKRYFFYLNRFLSQKLSKNEFDKLC RVLGR+NLWLHNQLIQSILKNACQAKAAPP+P AGYPKTSTQ+AK+SP
Subjt:  MQPQQSLRIDLGELKSQIVKKLGTDRSKRYFFYLNRFLSQKLSKNEFDKLCRRVLGRDNLWLHNQLIQSILKNACQAKAAPPLPVAGYPKTSTQSAKVSP

Query:  VIEDGNGDTGAVYPTSTQSIPIWSNGGFPASPRKSRSGIRDRKLKDRPSPLGPNGKVECISHQSAGKKDGSCKMMMVNGDATLCDYQRPVQHLQGVAELP
        VIEDGN D GAV+PTSTQ IPIWSN GFP SPRK RSGIRDRKLKDRPS L PN KVECIS QSA K+DGSC++MM NG+AT CDYQRPVQHLQGV ELP
Subjt:  VIEDGNGDTGAVYPTSTQSIPIWSNGGFPASPRKSRSGIRDRKLKDRPSPLGPNGKVECISHQSAGKKDGSCKMMMVNGDATLCDYQRPVQHLQGVAELP

Query:  ENNIEARI-RPAGKQVLNNKIHDEGTKVGDREEAGHSIHSGLLRSRLLAPLGIPFCSASIGGARKARPADFGGDFVSFSDIGHLSDTESLRRRMEQIAAV
        ENNIEAR+ RPAGKQVL         +V DREEA  S  S LLRSRLLAPLGIPFCSASIGGA K RP D GG+F SFSD+GHL DTESLRRRMEQIAAV
Subjt:  ENNIEARI-RPAGKQVLNNKIHDEGTKVGDREEAGHSIHSGLLRSRLLAPLGIPFCSASIGGARKARPADFGGDFVSFSDIGHLSDTESLRRRMEQIAAV

Query:  HGLGSVSADSANILNKVLDVYLKQLIRSCVGLVGTWPMPCEPEKPLTDKLQVQGKVINGMLPNNQLHGRHSNGSREVMHEHRLRCSVSLLDFKVAMELNP
         GLGSVSAD ANILNKVLDVYLKQLIRSCV LVG WP   EPEKPL    Q+QGKVINGMLPNNQLH  HSNG+ EV+HE RL CS+SLLDFKVAMELNP
Subjt:  HGLGSVSADSANILNKVLDVYLKQLIRSCVGLVGTWPMPCEPEKPLTDKLQVQGKVINGMLPNNQLHGRHSNGSREVMHEHRLRCSVSLLDFKVAMELNP

Query:  KQLGEDWPLLLEKIRMRAFEE
        KQLGEDWPLLLEKI MRAF E
Subjt:  KQLGEDWPLLLEKIRMRAFEE

XP_008445087.1 PREDICTED: uncharacterized protein LOC103488231 [Cucumis melo]8.7e-18982.42Show/hide
Query:  MQPQQSLRIDLGELKSQIVKKLGTDRSKRYFFYLNRFLSQKLSKNEFDKLCRRVLGRDNLWLHNQLIQSILKNACQAKAAPPLPVAGYPKTSTQSAKVSP
        MQPQQSLRIDLGELKSQIVKKLG DRSKRYFFYLNRFLSQKLSKNEFDK C RVLGR+NLWLHNQLIQSILKNACQAKAAPP+PVAGYPKTSTQSAK+SP
Subjt:  MQPQQSLRIDLGELKSQIVKKLGTDRSKRYFFYLNRFLSQKLSKNEFDKLCRRVLGRDNLWLHNQLIQSILKNACQAKAAPPLPVAGYPKTSTQSAKVSP

Query:  VIEDGNGDTGAVYPTSTQSIPIWSNGGFPASPRKSRSGIRDRKLKDRPSPLGPNGKVECISHQSAGKKDGSCKMMMVNGDATLCDYQRPVQHLQGVAELP
        ++EDGN D GAV+PTSTQ+IP WSNG    SPRK RSGIRDRKLKDRPS LGPNGKVECISH SA          M NGDATLCDY+RPVQHLQGVAELP
Subjt:  VIEDGNGDTGAVYPTSTQSIPIWSNGGFPASPRKSRSGIRDRKLKDRPSPLGPNGKVECISHQSAGKKDGSCKMMMVNGDATLCDYQRPVQHLQGVAELP

Query:  ENNIEARI-RPAGKQVLNNKIHDEGTKVGDREEAGHSIHSGLLRSRLLAPLGIPFCSASIGGARKARPADFGGDFVSFSDIGHLSDTESLRRRMEQIAAV
        ENNIE R+ +P+GKQVL+NKI  E TKV DREEAG S HS LLRSRLLAPLGIPFCSAS GG  K RP D GGDF SF D+GHL DTESLRRRMEQIAAV
Subjt:  ENNIEARI-RPAGKQVLNNKIHDEGTKVGDREEAGHSIHSGLLRSRLLAPLGIPFCSASIGGARKARPADFGGDFVSFSDIGHLSDTESLRRRMEQIAAV

Query:  HGLGSVSADSANILNKVLDVYLKQLIRSCVGLVGTWPMPCEPEKPLTDKLQVQGKVINGMLPNNQLHGRHSNGSREVMHEHRLRCSVSLLDFKVAMELNP
         GLGSVSAD ANILNKVLDVYLKQLIRSCV LVG WP   EPEKPL  K Q+QGKVINGMLPNNQLHGRHSNG+ EV+HEHRL+CS+SLLDFKVAMELNP
Subjt:  HGLGSVSADSANILNKVLDVYLKQLIRSCVGLVGTWPMPCEPEKPLTDKLQVQGKVINGMLPNNQLHGRHSNGSREVMHEHRLRCSVSLLDFKVAMELNP

Query:  KQLGEDWPLLLEKIRMRAFEE
         QLGEDWPLLLEKI MRAF E
Subjt:  KQLGEDWPLLLEKIRMRAFEE

XP_022132327.1 uncharacterized protein LOC111005206 [Momordica charantia]1.8e-23999.29Show/hide
Query:  MQPQQSLRIDLGELKSQIVKKLGTDRSKRYFFYLNRFLSQKLSKNEFDKLCRRVLGRDNLWLHNQLIQSILKNACQAKAAPPLPVAGYPKTSTQSAKVSP
        MQPQQSLRIDLGELKSQIVKKLGTDRSKRYFFYLNRFLSQKLSKNEFDKLCRRVLGRDNLWLHNQLIQSILKNACQAKAAPPLPVAGYPKTSTQSAKVSP
Subjt:  MQPQQSLRIDLGELKSQIVKKLGTDRSKRYFFYLNRFLSQKLSKNEFDKLCRRVLGRDNLWLHNQLIQSILKNACQAKAAPPLPVAGYPKTSTQSAKVSP

Query:  VIEDGNGDTGAVYPTSTQSIPIWSNGGFPASPRKSRSGIRDRKLKDRPSPLGPNGKVECISHQSAGKKDGSCKMMMVNGDATLCDYQRPVQHLQGVAELP
        VIEDGN DTGAVYPTSTQSIPIWSNGGFPASPRKSRSGIRDRKLKDRPSPLGPNGKVECISHQSAGKKDGSCKMMMVNGDATLCDYQRPVQHLQGVAELP
Subjt:  VIEDGNGDTGAVYPTSTQSIPIWSNGGFPASPRKSRSGIRDRKLKDRPSPLGPNGKVECISHQSAGKKDGSCKMMMVNGDATLCDYQRPVQHLQGVAELP

Query:  ENNIEARIRPAGKQVLNNKIHDEGTKVGDREEAGHSIHSGLLRSRLLAPLGIPFCSASIGGARKARPADFGGDFVSFSDIGHLSDTESLRRRMEQIAAVH
        ENNIEARIRPAGKQVLNNKIHDEGTKVGDREEAGHSIHSGLL+SRLLAPLGIPFCSASIGGARKARPADFGGDFVSFSDIGHLSDTESLRRRMEQIAAVH
Subjt:  ENNIEARIRPAGKQVLNNKIHDEGTKVGDREEAGHSIHSGLLRSRLLAPLGIPFCSASIGGARKARPADFGGDFVSFSDIGHLSDTESLRRRMEQIAAVH

Query:  GLGSVSADSANILNKVLDVYLKQLIRSCVGLVGTWPMPCEPEKPLTDKLQVQGKVINGMLPNNQLHGRHSNGSREVMHEHRLRCSVSLLDFKVAMELNPK
        GLGSVSADSANILNKVLDVYLKQLIRSCVGLVGT PMPCEPEKPLTDKLQVQGKVINGMLPNNQLHGRHSNGSREVMHEHRLRCSVSLLDFKVAMELNPK
Subjt:  GLGSVSADSANILNKVLDVYLKQLIRSCVGLVGTWPMPCEPEKPLTDKLQVQGKVINGMLPNNQLHGRHSNGSREVMHEHRLRCSVSLLDFKVAMELNPK

Query:  QLGEDWPLLLEKIRMRAFEE
        QLGEDWPLLLEKIRMRAFEE
Subjt:  QLGEDWPLLLEKIRMRAFEE

XP_022997521.1 uncharacterized protein LOC111492414 [Cucurbita maxima]3.0e-18982.9Show/hide
Query:  MQPQQSLRIDLGELKSQIVKKLGTDRSKRYFFYLNRFLSQKLSKNEFDKLCRRVLGRDNLWLHNQLIQSILKNACQAKAAPPLPVAGYPKTSTQSAKVSP
        MQPQQSLRIDL ELKSQIVKKLGTDRSKRYFFYLNRFLSQKLSKNEFDKLC RVLGR+NLWLHNQLIQSILKNACQAKAAPP+P AGYPKTSTQ+AK+SP
Subjt:  MQPQQSLRIDLGELKSQIVKKLGTDRSKRYFFYLNRFLSQKLSKNEFDKLCRRVLGRDNLWLHNQLIQSILKNACQAKAAPPLPVAGYPKTSTQSAKVSP

Query:  VIEDGNGDTGAVYPTSTQSIPIWSNGGFPASPRKSRSGIRDRKLKDRPSPLGPNGKVECISHQSAGKKDGSCKMMMVNGDATLCDYQRPVQHLQGVAELP
        VIEDGN D GAV+ TSTQ IPIWSN GF  SPRK RSGIRDRKLKDRPS L PN KVECIS QSA K+DGSC++MM NG+AT CDYQRPVQHLQGV ELP
Subjt:  VIEDGNGDTGAVYPTSTQSIPIWSNGGFPASPRKSRSGIRDRKLKDRPSPLGPNGKVECISHQSAGKKDGSCKMMMVNGDATLCDYQRPVQHLQGVAELP

Query:  ENNIEARI-RPAGKQVLNNKIHDEGTKVGDREEAGHSIHSGLLRSRLLAPLGIPFCSASIGGARKARPADFGGDFVSFSDIGHLSDTESLRRRMEQIAAV
        ENNIEAR+ RP+GKQVL  ++  EGTKV DREEA  S  S LLRSRLLAPLGIPFCSASIGGA K RP D GG+F SFSD+GHL DTESLRRRMEQIAAV
Subjt:  ENNIEARI-RPAGKQVLNNKIHDEGTKVGDREEAGHSIHSGLLRSRLLAPLGIPFCSASIGGARKARPADFGGDFVSFSDIGHLSDTESLRRRMEQIAAV

Query:  HGLGSVSADSANILNKVLDVYLKQLIRSCVGLVGTWPMPCEPEKPLTDKLQVQGKVINGMLPNNQLHGRHSNGSREVMHEHRLRCSVSLLDFKVAMELNP
         GLGSVSAD ANILNKVLDVYLKQLIRSCV LVG WP+  EPEKPL    Q+QGKVINGMLPNNQLH  HSNG+REV+HE RL CS+SLLDFKVAMELNP
Subjt:  HGLGSVSADSANILNKVLDVYLKQLIRSCVGLVGTWPMPCEPEKPLTDKLQVQGKVINGMLPNNQLHGRHSNGSREVMHEHRLRCSVSLLDFKVAMELNP

Query:  KQLGEDWPLLLEKIRMRAFEE
        KQLGEDWPLLLEKI MRAF E
Subjt:  KQLGEDWPLLLEKIRMRAFEE

XP_023546134.1 uncharacterized protein LOC111805335 [Cucurbita pepo subsp. pepo]6.0e-19082.9Show/hide
Query:  MQPQQSLRIDLGELKSQIVKKLGTDRSKRYFFYLNRFLSQKLSKNEFDKLCRRVLGRDNLWLHNQLIQSILKNACQAKAAPPLPVAGYPKTSTQSAKVSP
        MQPQQSLRIDL ELKSQIVKKLGTDRSKRYFFYLNRFLSQKLSKNEFDKLC RVLGR+NLWLHNQLIQSILKNACQAKAAPP+P AGYPKTSTQ+AK+SP
Subjt:  MQPQQSLRIDLGELKSQIVKKLGTDRSKRYFFYLNRFLSQKLSKNEFDKLCRRVLGRDNLWLHNQLIQSILKNACQAKAAPPLPVAGYPKTSTQSAKVSP

Query:  VIEDGNGDTGAVYPTSTQSIPIWSNGGFPASPRKSRSGIRDRKLKDRPSPLGPNGKVECISHQSAGKKDGSCKMMMVNGDATLCDYQRPVQHLQGVAELP
        VIEDGN D GAV+PTSTQ IPIWSN GFP SPRK RSGIRDRKLKDRPS L PN KVECIS QSA K+DGSC++M+ NG+AT CDYQRPVQHLQGV ELP
Subjt:  VIEDGNGDTGAVYPTSTQSIPIWSNGGFPASPRKSRSGIRDRKLKDRPSPLGPNGKVECISHQSAGKKDGSCKMMMVNGDATLCDYQRPVQHLQGVAELP

Query:  ENNIEARI-RPAGKQVLNNKIHDEGTKVGDREEAGHSIHSGLLRSRLLAPLGIPFCSASIGGARKARPADFGGDFVSFSDIGHLSDTESLRRRMEQIAAV
        ENNIEAR+ RP+GKQVL  ++  EGTKV DREEA  S  S LLRSRLLAPLGIPFCSASIGGA K RP D GG+F SFSD+GHL DTESLRRRMEQIAAV
Subjt:  ENNIEARI-RPAGKQVLNNKIHDEGTKVGDREEAGHSIHSGLLRSRLLAPLGIPFCSASIGGARKARPADFGGDFVSFSDIGHLSDTESLRRRMEQIAAV

Query:  HGLGSVSADSANILNKVLDVYLKQLIRSCVGLVGTWPMPCEPEKPLTDKLQVQGKVINGMLPNNQLHGRHSNGSREVMHEHRLRCSVSLLDFKVAMELNP
         GLGSVSAD ANILNKVLDVYLKQLIRSCV LVG WP   EPEKPL    Q+QGKVINGMLPNNQLH  HSNG+ EV+HE RL CS+SLLDFKVAMELNP
Subjt:  HGLGSVSADSANILNKVLDVYLKQLIRSCVGLVGTWPMPCEPEKPLTDKLQVQGKVINGMLPNNQLHGRHSNGSREVMHEHRLRCSVSLLDFKVAMELNP

Query:  KQLGEDWPLLLEKIRMRAFEE
        KQLGEDWPLLLEKI MRAF E
Subjt:  KQLGEDWPLLLEKIRMRAFEE

TrEMBL top hitse value%identityAlignment
A0A0A0LM32 Uncharacterized protein1.0e-18782.19Show/hide
Query:  MQPQQSLRIDLGELKSQIVKKLGTDRSKRYFFYLNRFLSQKLSKNEFDKLCRRVLGRDNLWLHNQLIQSILKNACQAKAAPPLPVAGYPKTSTQSAKVSP
        MQPQQSLRIDLGELKSQIVKKLG DRSKRYFFYLNRFLSQKLSKNEFDK C RVLGR+NLWLHNQLIQSILKNACQAK APP+PVAGYPKTSTQSAK+SP
Subjt:  MQPQQSLRIDLGELKSQIVKKLGTDRSKRYFFYLNRFLSQKLSKNEFDKLCRRVLGRDNLWLHNQLIQSILKNACQAKAAPPLPVAGYPKTSTQSAKVSP

Query:  VIEDGNGDTGAVYPTSTQSIPIWSNGGFPASPRKSRSGIRDRKLKDRPSPLGPNGKVECISHQSAGKKDGSCKMMMVNGDATLCDYQRPVQHLQGVAELP
        ++EDGN D GAV+PTSTQ+IP WSNG    SPRK RSGIRDRKLKDRPS LGPNGKVECISH SA          M NGDATLCDY+RPVQ+LQG+AELP
Subjt:  VIEDGNGDTGAVYPTSTQSIPIWSNGGFPASPRKSRSGIRDRKLKDRPSPLGPNGKVECISHQSAGKKDGSCKMMMVNGDATLCDYQRPVQHLQGVAELP

Query:  ENNIEARI-RPAGKQVLNNKIHDEGTKVGDREEAGHSIHSGLLRSRLLAPLGIPFCSASIGGARKARPADFGGDFVSFSDIGHLSDTESLRRRMEQIAAV
        ENNIE R+ +P+GKQ L NKI  E TKV DREEAG S HS LLRSRLLAPLGIPFCSASIGGARK RP D GGDF S SD+GHL DTESLRRRMEQIAAV
Subjt:  ENNIEARI-RPAGKQVLNNKIHDEGTKVGDREEAGHSIHSGLLRSRLLAPLGIPFCSASIGGARKARPADFGGDFVSFSDIGHLSDTESLRRRMEQIAAV

Query:  HGLGSVSADSANILNKVLDVYLKQLIRSCVGLVGTWPMPCEPEKPLTDKLQVQGKVINGMLPNNQLHGRHSNGSREVMHEHRLRCSVSLLDFKVAMELNP
         GLGSVSAD ANILNKVLDVYLKQLIRSCV LVG WP   EPEKPL+ K Q QGKVINGMLPNNQLHGRHSNGS EV+HEHRL+CS+SLLDFKVAMELNP
Subjt:  HGLGSVSADSANILNKVLDVYLKQLIRSCVGLVGTWPMPCEPEKPLTDKLQVQGKVINGMLPNNQLHGRHSNGSREVMHEHRLRCSVSLLDFKVAMELNP

Query:  KQLGEDWPLLLEKIRMRAFEE
         QLGEDWPLLLEKI MR F E
Subjt:  KQLGEDWPLLLEKIRMRAFEE

A0A1S3BCQ5 uncharacterized protein LOC1034882314.2e-18982.42Show/hide
Query:  MQPQQSLRIDLGELKSQIVKKLGTDRSKRYFFYLNRFLSQKLSKNEFDKLCRRVLGRDNLWLHNQLIQSILKNACQAKAAPPLPVAGYPKTSTQSAKVSP
        MQPQQSLRIDLGELKSQIVKKLG DRSKRYFFYLNRFLSQKLSKNEFDK C RVLGR+NLWLHNQLIQSILKNACQAKAAPP+PVAGYPKTSTQSAK+SP
Subjt:  MQPQQSLRIDLGELKSQIVKKLGTDRSKRYFFYLNRFLSQKLSKNEFDKLCRRVLGRDNLWLHNQLIQSILKNACQAKAAPPLPVAGYPKTSTQSAKVSP

Query:  VIEDGNGDTGAVYPTSTQSIPIWSNGGFPASPRKSRSGIRDRKLKDRPSPLGPNGKVECISHQSAGKKDGSCKMMMVNGDATLCDYQRPVQHLQGVAELP
        ++EDGN D GAV+PTSTQ+IP WSNG    SPRK RSGIRDRKLKDRPS LGPNGKVECISH SA          M NGDATLCDY+RPVQHLQGVAELP
Subjt:  VIEDGNGDTGAVYPTSTQSIPIWSNGGFPASPRKSRSGIRDRKLKDRPSPLGPNGKVECISHQSAGKKDGSCKMMMVNGDATLCDYQRPVQHLQGVAELP

Query:  ENNIEARI-RPAGKQVLNNKIHDEGTKVGDREEAGHSIHSGLLRSRLLAPLGIPFCSASIGGARKARPADFGGDFVSFSDIGHLSDTESLRRRMEQIAAV
        ENNIE R+ +P+GKQVL+NKI  E TKV DREEAG S HS LLRSRLLAPLGIPFCSAS GG  K RP D GGDF SF D+GHL DTESLRRRMEQIAAV
Subjt:  ENNIEARI-RPAGKQVLNNKIHDEGTKVGDREEAGHSIHSGLLRSRLLAPLGIPFCSASIGGARKARPADFGGDFVSFSDIGHLSDTESLRRRMEQIAAV

Query:  HGLGSVSADSANILNKVLDVYLKQLIRSCVGLVGTWPMPCEPEKPLTDKLQVQGKVINGMLPNNQLHGRHSNGSREVMHEHRLRCSVSLLDFKVAMELNP
         GLGSVSAD ANILNKVLDVYLKQLIRSCV LVG WP   EPEKPL  K Q+QGKVINGMLPNNQLHGRHSNG+ EV+HEHRL+CS+SLLDFKVAMELNP
Subjt:  HGLGSVSADSANILNKVLDVYLKQLIRSCVGLVGTWPMPCEPEKPLTDKLQVQGKVINGMLPNNQLHGRHSNGSREVMHEHRLRCSVSLLDFKVAMELNP

Query:  KQLGEDWPLLLEKIRMRAFEE
         QLGEDWPLLLEKI MRAF E
Subjt:  KQLGEDWPLLLEKIRMRAFEE

A0A5A7VF96 SAGA-Tad1 domain-containing protein4.2e-18982.42Show/hide
Query:  MQPQQSLRIDLGELKSQIVKKLGTDRSKRYFFYLNRFLSQKLSKNEFDKLCRRVLGRDNLWLHNQLIQSILKNACQAKAAPPLPVAGYPKTSTQSAKVSP
        MQPQQSLRIDLGELKSQIVKKLG DRSKRYFFYLNRFLSQKLSKNEFDK C RVLGR+NLWLHNQLIQSILKNACQAKAAPP+PVAGYPKTSTQSAK+SP
Subjt:  MQPQQSLRIDLGELKSQIVKKLGTDRSKRYFFYLNRFLSQKLSKNEFDKLCRRVLGRDNLWLHNQLIQSILKNACQAKAAPPLPVAGYPKTSTQSAKVSP

Query:  VIEDGNGDTGAVYPTSTQSIPIWSNGGFPASPRKSRSGIRDRKLKDRPSPLGPNGKVECISHQSAGKKDGSCKMMMVNGDATLCDYQRPVQHLQGVAELP
        ++EDGN D GAV+PTSTQ+IP WSNG    SPRK RSGIRDRKLKDRPS LGPNGKVECISH SA          M NGDATLCDY+RPVQHLQGVAELP
Subjt:  VIEDGNGDTGAVYPTSTQSIPIWSNGGFPASPRKSRSGIRDRKLKDRPSPLGPNGKVECISHQSAGKKDGSCKMMMVNGDATLCDYQRPVQHLQGVAELP

Query:  ENNIEARI-RPAGKQVLNNKIHDEGTKVGDREEAGHSIHSGLLRSRLLAPLGIPFCSASIGGARKARPADFGGDFVSFSDIGHLSDTESLRRRMEQIAAV
        ENNIE R+ +P+GKQVL+NKI  E TKV DREEAG S HS LLRSRLLAPLGIPFCSAS GG  K RP D GGDF SF D+GHL DTESLRRRMEQIAAV
Subjt:  ENNIEARI-RPAGKQVLNNKIHDEGTKVGDREEAGHSIHSGLLRSRLLAPLGIPFCSASIGGARKARPADFGGDFVSFSDIGHLSDTESLRRRMEQIAAV

Query:  HGLGSVSADSANILNKVLDVYLKQLIRSCVGLVGTWPMPCEPEKPLTDKLQVQGKVINGMLPNNQLHGRHSNGSREVMHEHRLRCSVSLLDFKVAMELNP
         GLGSVSAD ANILNKVLDVYLKQLIRSCV LVG WP   EPEKPL  K Q+QGKVINGMLPNNQLHGRHSNG+ EV+HEHRL+CS+SLLDFKVAMELNP
Subjt:  HGLGSVSADSANILNKVLDVYLKQLIRSCVGLVGTWPMPCEPEKPLTDKLQVQGKVINGMLPNNQLHGRHSNGSREVMHEHRLRCSVSLLDFKVAMELNP

Query:  KQLGEDWPLLLEKIRMRAFEE
         QLGEDWPLLLEKI MRAF E
Subjt:  KQLGEDWPLLLEKIRMRAFEE

A0A6J1BTJ5 uncharacterized protein LOC1110052068.9e-24099.29Show/hide
Query:  MQPQQSLRIDLGELKSQIVKKLGTDRSKRYFFYLNRFLSQKLSKNEFDKLCRRVLGRDNLWLHNQLIQSILKNACQAKAAPPLPVAGYPKTSTQSAKVSP
        MQPQQSLRIDLGELKSQIVKKLGTDRSKRYFFYLNRFLSQKLSKNEFDKLCRRVLGRDNLWLHNQLIQSILKNACQAKAAPPLPVAGYPKTSTQSAKVSP
Subjt:  MQPQQSLRIDLGELKSQIVKKLGTDRSKRYFFYLNRFLSQKLSKNEFDKLCRRVLGRDNLWLHNQLIQSILKNACQAKAAPPLPVAGYPKTSTQSAKVSP

Query:  VIEDGNGDTGAVYPTSTQSIPIWSNGGFPASPRKSRSGIRDRKLKDRPSPLGPNGKVECISHQSAGKKDGSCKMMMVNGDATLCDYQRPVQHLQGVAELP
        VIEDGN DTGAVYPTSTQSIPIWSNGGFPASPRKSRSGIRDRKLKDRPSPLGPNGKVECISHQSAGKKDGSCKMMMVNGDATLCDYQRPVQHLQGVAELP
Subjt:  VIEDGNGDTGAVYPTSTQSIPIWSNGGFPASPRKSRSGIRDRKLKDRPSPLGPNGKVECISHQSAGKKDGSCKMMMVNGDATLCDYQRPVQHLQGVAELP

Query:  ENNIEARIRPAGKQVLNNKIHDEGTKVGDREEAGHSIHSGLLRSRLLAPLGIPFCSASIGGARKARPADFGGDFVSFSDIGHLSDTESLRRRMEQIAAVH
        ENNIEARIRPAGKQVLNNKIHDEGTKVGDREEAGHSIHSGLL+SRLLAPLGIPFCSASIGGARKARPADFGGDFVSFSDIGHLSDTESLRRRMEQIAAVH
Subjt:  ENNIEARIRPAGKQVLNNKIHDEGTKVGDREEAGHSIHSGLLRSRLLAPLGIPFCSASIGGARKARPADFGGDFVSFSDIGHLSDTESLRRRMEQIAAVH

Query:  GLGSVSADSANILNKVLDVYLKQLIRSCVGLVGTWPMPCEPEKPLTDKLQVQGKVINGMLPNNQLHGRHSNGSREVMHEHRLRCSVSLLDFKVAMELNPK
        GLGSVSADSANILNKVLDVYLKQLIRSCVGLVGT PMPCEPEKPLTDKLQVQGKVINGMLPNNQLHGRHSNGSREVMHEHRLRCSVSLLDFKVAMELNPK
Subjt:  GLGSVSADSANILNKVLDVYLKQLIRSCVGLVGTWPMPCEPEKPLTDKLQVQGKVINGMLPNNQLHGRHSNGSREVMHEHRLRCSVSLLDFKVAMELNPK

Query:  QLGEDWPLLLEKIRMRAFEE
        QLGEDWPLLLEKIRMRAFEE
Subjt:  QLGEDWPLLLEKIRMRAFEE

A0A6J1K7Q1 uncharacterized protein LOC1114924141.4e-18982.9Show/hide
Query:  MQPQQSLRIDLGELKSQIVKKLGTDRSKRYFFYLNRFLSQKLSKNEFDKLCRRVLGRDNLWLHNQLIQSILKNACQAKAAPPLPVAGYPKTSTQSAKVSP
        MQPQQSLRIDL ELKSQIVKKLGTDRSKRYFFYLNRFLSQKLSKNEFDKLC RVLGR+NLWLHNQLIQSILKNACQAKAAPP+P AGYPKTSTQ+AK+SP
Subjt:  MQPQQSLRIDLGELKSQIVKKLGTDRSKRYFFYLNRFLSQKLSKNEFDKLCRRVLGRDNLWLHNQLIQSILKNACQAKAAPPLPVAGYPKTSTQSAKVSP

Query:  VIEDGNGDTGAVYPTSTQSIPIWSNGGFPASPRKSRSGIRDRKLKDRPSPLGPNGKVECISHQSAGKKDGSCKMMMVNGDATLCDYQRPVQHLQGVAELP
        VIEDGN D GAV+ TSTQ IPIWSN GF  SPRK RSGIRDRKLKDRPS L PN KVECIS QSA K+DGSC++MM NG+AT CDYQRPVQHLQGV ELP
Subjt:  VIEDGNGDTGAVYPTSTQSIPIWSNGGFPASPRKSRSGIRDRKLKDRPSPLGPNGKVECISHQSAGKKDGSCKMMMVNGDATLCDYQRPVQHLQGVAELP

Query:  ENNIEARI-RPAGKQVLNNKIHDEGTKVGDREEAGHSIHSGLLRSRLLAPLGIPFCSASIGGARKARPADFGGDFVSFSDIGHLSDTESLRRRMEQIAAV
        ENNIEAR+ RP+GKQVL  ++  EGTKV DREEA  S  S LLRSRLLAPLGIPFCSASIGGA K RP D GG+F SFSD+GHL DTESLRRRMEQIAAV
Subjt:  ENNIEARI-RPAGKQVLNNKIHDEGTKVGDREEAGHSIHSGLLRSRLLAPLGIPFCSASIGGARKARPADFGGDFVSFSDIGHLSDTESLRRRMEQIAAV

Query:  HGLGSVSADSANILNKVLDVYLKQLIRSCVGLVGTWPMPCEPEKPLTDKLQVQGKVINGMLPNNQLHGRHSNGSREVMHEHRLRCSVSLLDFKVAMELNP
         GLGSVSAD ANILNKVLDVYLKQLIRSCV LVG WP+  EPEKPL    Q+QGKVINGMLPNNQLH  HSNG+REV+HE RL CS+SLLDFKVAMELNP
Subjt:  HGLGSVSADSANILNKVLDVYLKQLIRSCVGLVGTWPMPCEPEKPLTDKLQVQGKVINGMLPNNQLHGRHSNGSREVMHEHRLRCSVSLLDFKVAMELNP

Query:  KQLGEDWPLLLEKIRMRAFEE
        KQLGEDWPLLLEKI MRAF E
Subjt:  KQLGEDWPLLLEKIRMRAFEE

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G14850.1 unknown protein1.1e-3730.92Show/hide
Query:  RIDLGELKSQIVKKLGTDRSKRYFFYLNRFLSQKLSKNEFDKLCRRVLGRDNLWLHNQLIQSILKNACQAKAAPPLPVAGYPKTSTQSAKVSPVIEDGNG
        R++  E+K+ I +K+G  R+  YF  L +FL+ ++SK+EFDKLC + +GR+N+ LHN+L++SILKNA  AK+ PP     YPK S               
Subjt:  RIDLGELKSQIVKKLGTDRSKRYFFYLNRFLSQKLSKNEFDKLCRRVLGRDNLWLHNQLIQSILKNACQAKAAPPLPVAGYPKTSTQSAKVSPVIEDGNG

Query:  DTGAVYPTSTQSIPIWSNGGFPASPRKSRSGIRDRKLKDRPSPLGPNGKVECISHQSAGKKDGSCKMMMVNGDATLCDYQRPVQHLQGVAELPENNIEAR
                      ++ +  FP SPRK RS    RK +DRPSPLGP GK + ++                  D ++   QR          LP   +   
Subjt:  DTGAVYPTSTQSIPIWSNGGFPASPRKSRSGIRDRKLKDRPSPLGPNGKVECISHQSAGKKDGSCKMMMVNGDATLCDYQRPVQHLQGVAELPENNIEAR

Query:  IRPAGKQVLNNKIHDEGTKVGDREEAGHSIHSGLLRSRLLAPLGIPFCSASIGGARKARPADFGG-DFVSFSDIGHLSDTESLRRRMEQIAAVHGLGSVS
                      ++G +V ++     S+ S   RS L APLG+ F   S     KAR + + G +  +    G L D  +LR R+E+   + G+  +S
Subjt:  IRPAGKQVLNNKIHDEGTKVGDREEAGHSIHSGLLRSRLLAPLGIPFCSASIGGARKARPADFGG-DFVSFSDIGHLSDTESLRRRMEQIAAVHGLGSVS

Query:  ADSANILNKVLDVYLKQLIRSCVGLVGTWPMPCEPEKPLTDKLQVQGKVINGMLPNNQLHGRHSNGSREVMHEHRLRCSVSLLDFKVAMELNPKQLGEDW
         DSAN+LN+ L+ Y+++LI  C+ L                                               + R   +VS+LDF  AME+NP+ LGE+W
Subjt:  ADSANILNKVLDVYLKQLIRSCVGLVGTWPMPCEPEKPLTDKLQVQGKVINGMLPNNQLHGRHSNGSREVMHEHRLRCSVSLLDFKVAMELNPKQLGEDW

Query:  PLLLEKIRMRAFEE
        P+ LEKI  RA EE
Subjt:  PLLLEKIRMRAFEE

AT2G24530.1 unknown protein2.7e-9547.38Show/hide
Query:  MQPQQSLRIDLGELKSQIVKKLGTDRSKRYFFYLNRFLSQKLSKNEFDKLCRRVLGRDNLWLHNQLIQSILKNACQAKAAPPLPVAGYPKTSTQSAKVSP
        MQ  Q  RI L ELK  IVKK G +RS+RYF+YL RFLSQKL+K+EFDK C R+LGR+NL LHNQLI+SIL+NA  AK+ PP   AG+    +  A    
Subjt:  MQPQQSLRIDLGELKSQIVKKLGTDRSKRYFFYLNRFLSQKLSKNEFDKLCRRVLGRDNLWLHNQLIQSILKNACQAKAAPPLPVAGYPKTSTQSAKVSP

Query:  VIEDGNGDTGAVYPTSTQSIPIWSNGGFPASPRKSRSGIRDRKLKDRPSPLGPNGKVECISHQSAGKKDGSCKMMMVNGDATLCDYQRPVQHLQGVAELP
           DG   +G + P  +Q  P+WSNG  P SPRK RSG+++RK +DRPSPLG NGKVE + HQ   ++D    + M NG     DYQR  ++   VA+  
Subjt:  VIEDGNGDTGAVYPTSTQSIPIWSNGGFPASPRKSRSGIRDRKLKDRPSPLGPNGKVECISHQSAGKKDGSCKMMMVNGDATLCDYQRPVQHLQGVAELP

Query:  ENNIEARIRPAGKQVLNNKIHDEGTKVGDREEAGHSIHSGLLRSRLLAPLGIPFCSASIGGARKARPADFGGDFVSFSDIGHLSDTESLRRRMEQIAAVH
        +      +RP  K  + NK       + D +         L  S L+APLGIPFCSAS+GG+ +  P     + +S  D G L D E LR+RME IA   
Subjt:  ENNIEARIRPAGKQVLNNKIHDEGTKVGDREEAGHSIHSGLLRSRLLAPLGIPFCSASIGGARKARPADFGGDFVSFSDIGHLSDTESLRRRMEQIAAVH

Query:  GLGSVSADSANILNKVLDVYLKQLIRSCVGLVGTWPMPCEPEKPLTDKLQVQGKVINGMLPNNQLHGRHSNGSREVMHEHRLRCSVSLLDFKVAMELNPK
        GL  VS + A  LN +LDVYLK+LI SC  LVG      +P K    K Q Q K++NG+ P N L  +  NGS ++  +H    SVS+LDF+ AMELNP+
Subjt:  GLGSVSADSANILNKVLDVYLKQLIRSCVGLVGTWPMPCEPEKPLTDKLQVQGKVINGMLPNNQLHGRHSNGSREVMHEHRLRCSVSLLDFKVAMELNPK

Query:  QLGEDWPLLLEKIRMRAFEE
        QLGEDWP L E+I +R+FEE
Subjt:  QLGEDWPLLLEKIRMRAFEE

AT4G31440.1 unknown protein1.4e-7542.99Show/hide
Query:  MQPQQSLRIDLGELKSQIVKKLGTDRSKRYFFYLNRFLSQKLSKNEFDKLCRRVLGRDNLWLHNQLIQSILKNACQAKAAPPLPVAGYPKTSTQSAKVSP
        MQ  Q  RIDL ELK  IVKK+G +RS RYF+YL RFLSQKL+K+EFDK C R+LGR+NL LHN+LI+SIL+NA  AK+ P +  +G+P  S    K   
Subjt:  MQPQQSLRIDLGELKSQIVKKLGTDRSKRYFFYLNRFLSQKLSKNEFDKLCRRVLGRDNLWLHNQLIQSILKNACQAKAAPPLPVAGYPKTSTQSAKVSP

Query:  VIEDGNGDTGAVYPTSTQSIPIWSNGGFPASPRKSRSG-IRDRKLKDRPSPLGPNGKVECISHQSAGKKDGSCKMMMVNGDATLCDYQRPVQHLQGVAEL
          EDG  ++ ++ P   ++    SNG       K R G   DR ++D+P PLG NGKV                            Y RP ++       
Subjt:  VIEDGNGDTGAVYPTSTQSIPIWSNGGFPASPRKSRSG-IRDRKLKDRPSPLGPNGKVECISHQSAGKKDGSCKMMMVNGDATLCDYQRPVQHLQGVAEL

Query:  PENNIEARIRPAGKQVLNNKIHDEGTKVGDREEAGHSIHSGLLRSRLLAPLGIPFCSASIGGARKARPADFGGDFVSFSDIGHLSDTESLRRRMEQIAAV
        P+    A + PA ++ ++ K       +   +EA   I   L    ++APLGIPFCSAS+GG R+  P       +S  D G LSDTE LR+RME IA  
Subjt:  PENNIEARIRPAGKQVLNNKIHDEGTKVGDREEAGHSIHSGLLRSRLLAPLGIPFCSASIGGARKARPADFGGDFVSFSDIGHLSDTESLRRRMEQIAAV

Query:  HGLGSVSADSANILNKVLDVYLKQLIRSCVGLVGTWPMPCEPEKPLTDKLQVQGKVINGMLPNNQLHGRHSNGSREVMHEHRLRCSVSLLDFKVAMELNP
         GLG VSA+ + +LN +LD+YLK+L++SCV L G   M   P K   +K Q + +++NG+  NN  H + SN   ++  E     SVSLLDF+VAMELNP
Subjt:  HGLGSVSADSANILNKVLDVYLKQLIRSCVGLVGTWPMPCEPEKPLTDKLQVQGKVINGMLPNNQLHGRHSNGSREVMHEHRLRCSVSLLDFKVAMELNP

Query:  KQLGEDWPLLLEKIRMRAFEE
         QLGEDWPLL E+I +  FEE
Subjt:  KQLGEDWPLLLEKIRMRAFEE

AT4G33890.1 unknown protein1.6e-3930.99Show/hide
Query:  QQSLRIDLGELKSQIVKKLGTDRSKRYFFYLNRFLSQKLSKNEFDKLCRRVLGRDNLWLHNQLIQSILKNACQAKAAPPLPVAGYPKTSTQSAKVSPVIE
        Q S R+D  E+K+ I +++G  R++ YF  L RF + K++K+EFDKLC + +GR N+ LHN+LI+SI+KNAC AK+ P +   G              + 
Subjt:  QQSLRIDLGELKSQIVKKLGTDRSKRYFFYLNRFLSQKLSKNEFDKLCRRVLGRDNLWLHNQLIQSILKNACQAKAAPPLPVAGYPKTSTQSAKVSPVIE

Query:  DGNGDTGAVYPTSTQSIPIWSNGGFPASPRKSRSGIRDRKLKDRPSPLGPNGKVECI---SHQSAGKKDGSCKMMMVNGDATLCDYQRPVQHLQGVAELP
         GNGD+      ++Q  P+  +  F  S RK RS    RKL+DRPSPLGP GK   +   + +S  K   + +++ +          RP   +  V    
Subjt:  DGNGDTGAVYPTSTQSIPIWSNGGFPASPRKSRSGIRDRKLKDRPSPLGPNGKVECI---SHQSAGKKDGSCKMMMVNGDATLCDYQRPVQHLQGVAELP

Query:  ENNIEARIRPAGKQVLNNKIHDEGTKVGDREEAGHSIHSGLLRSRLLAPLGIPFCSASI-GGARKARPADFGGDFVSFS-----DIGHLSDTESLRRRME
                             +EG +V        S+ S   R  L APLG+   S S+  GA +   ++      SF+     + G L DT +LR R+E
Subjt:  ENNIEARIRPAGKQVLNNKIHDEGTKVGDREEAGHSIHSGLLRSRLLAPLGIPFCSASI-GGARKARPADFGGDFVSFS-----DIGHLSDTESLRRRME

Query:  QIAAVHGLGSVSADSANILNKVLDVYLKQLIRSCVGLVGTWPMPCEPEKPLTDKLQVQGKVINGMLPNNQLHGRHSNGSREVMHEHRLRCSVSLLDFKVA
        +   + GL  ++ DS ++LN  LDV++++LI  C+ L  T        +  TD+++             +++ +++  SR + +       VS+ DF+  
Subjt:  QIAAVHGLGSVSADSANILNKVLDVYLKQLIRSCVGLVGTWPMPCEPEKPLTDKLQVQGKVINGMLPNNQLHGRHSNGSREVMHEHRLRCSVSLLDFKVA

Query:  MELNPKQLGEDWPLLLEKIRMRAFEE
        MELN + LGEDWP+ +EKI  RA ++
Subjt:  MELNPKQLGEDWPLLLEKIRMRAFEE

AT4G33890.2 unknown protein1.6e-3930.99Show/hide
Query:  QQSLRIDLGELKSQIVKKLGTDRSKRYFFYLNRFLSQKLSKNEFDKLCRRVLGRDNLWLHNQLIQSILKNACQAKAAPPLPVAGYPKTSTQSAKVSPVIE
        Q S R+D  E+K+ I +++G  R++ YF  L RF + K++K+EFDKLC + +GR N+ LHN+LI+SI+KNAC AK+ P +   G              + 
Subjt:  QQSLRIDLGELKSQIVKKLGTDRSKRYFFYLNRFLSQKLSKNEFDKLCRRVLGRDNLWLHNQLIQSILKNACQAKAAPPLPVAGYPKTSTQSAKVSPVIE

Query:  DGNGDTGAVYPTSTQSIPIWSNGGFPASPRKSRSGIRDRKLKDRPSPLGPNGKVECI---SHQSAGKKDGSCKMMMVNGDATLCDYQRPVQHLQGVAELP
         GNGD+      ++Q  P+  +  F  S RK RS    RKL+DRPSPLGP GK   +   + +S  K   + +++ +          RP   +  V    
Subjt:  DGNGDTGAVYPTSTQSIPIWSNGGFPASPRKSRSGIRDRKLKDRPSPLGPNGKVECI---SHQSAGKKDGSCKMMMVNGDATLCDYQRPVQHLQGVAELP

Query:  ENNIEARIRPAGKQVLNNKIHDEGTKVGDREEAGHSIHSGLLRSRLLAPLGIPFCSASI-GGARKARPADFGGDFVSFS-----DIGHLSDTESLRRRME
                             +EG +V        S+ S   R  L APLG+   S S+  GA +   ++      SF+     + G L DT +LR R+E
Subjt:  ENNIEARIRPAGKQVLNNKIHDEGTKVGDREEAGHSIHSGLLRSRLLAPLGIPFCSASI-GGARKARPADFGGDFVSFS-----DIGHLSDTESLRRRME

Query:  QIAAVHGLGSVSADSANILNKVLDVYLKQLIRSCVGLVGTWPMPCEPEKPLTDKLQVQGKVINGMLPNNQLHGRHSNGSREVMHEHRLRCSVSLLDFKVA
        +   + GL  ++ DS ++LN  LDV++++LI  C+ L  T        +  TD+++             +++ +++  SR + +       VS+ DF+  
Subjt:  QIAAVHGLGSVSADSANILNKVLDVYLKQLIRSCVGLVGTWPMPCEPEKPLTDKLQVQGKVINGMLPNNQLHGRHSNGSREVMHEHRLRCSVSLLDFKVA

Query:  MELNPKQLGEDWPLLLEKIRMRAFEE
        MELN + LGEDWP+ +EKI  RA ++
Subjt:  MELNPKQLGEDWPLLLEKIRMRAFEE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCAACCTCAGCAGAGCTTGAGAATTGACTTGGGTGAATTGAAATCTCAGATAGTGAAGAAGCTTGGAACTGATCGGTCAAAACGGTATTTCTTTTACTTGAACAGGTT
CTTGAGTCAGAAGCTGAGTAAGAATGAGTTTGATAAGTTATGTCGTCGGGTGCTCGGGAGAGACAATCTTTGGCTGCATAATCAGTTGATACAGTCAATTTTGAAGAATG
CGTGCCAAGCTAAGGCTGCGCCGCCGTTACCTGTAGCAGGCTATCCGAAAACTTCAACACAATCTGCAAAAGTTTCCCCTGTTATAGAAGATGGGAATGGGGACACTGGA
GCTGTTTATCCTACTTCTACTCAAAGTATTCCCATTTGGTCTAATGGAGGATTTCCAGCGTCCCCAAGAAAGAGCAGGTCTGGGATACGCGACCGCAAACTCAAGGACAG
GCCGAGTCCACTGGGGCCAAATGGGAAGGTCGAATGTATCTCACATCAATCAGCAGGCAAGAAAGATGGAAGCTGTAAAATGATGATGGTTAATGGTGATGCAACTCTAT
GTGACTATCAGAGACCAGTGCAACATCTGCAAGGAGTCGCTGAACTGCCTGAAAACAATATCGAGGCTAGAATTCGACCAGCAGGAAAGCAAGTCCTAAACAATAAGATC
CATGATGAAGGAACTAAGGTTGGAGACCGGGAAGAAGCGGGACACTCAATCCACTCGGGTTTGCTTCGAAGTCGCTTACTGGCACCCCTTGGGATTCCTTTCTGCTCAGC
TAGTATCGGCGGGGCCCGCAAAGCGAGGCCTGCGGATTTTGGGGGTGATTTCGTTAGCTTTAGCGATATTGGTCATTTATCGGATACAGAGTCATTGAGACGACGCATGG
AACAAATTGCTGCAGTACATGGCCTAGGCAGTGTTTCTGCAGACTCTGCTAATATTTTGAATAAGGTGTTGGATGTATATTTGAAGCAGTTAATAAGGTCTTGTGTTGGC
TTGGTTGGAACATGGCCTATGCCATGTGAGCCCGAGAAGCCTCTTACCGATAAGCTACAGGTTCAGGGGAAGGTGATCAATGGTATGTTGCCGAATAATCAATTACACGG
GCGACATAGCAATGGAAGCAGAGAAGTTATGCACGAGCACAGATTACGGTGCTCGGTATCGTTGCTTGATTTCAAAGTAGCAATGGAGCTTAACCCAAAGCAACTTGGGG
AAGACTGGCCTTTGCTGCTGGAGAAAATTCGTATGCGTGCATTCGAGGAA
mRNA sequenceShow/hide mRNA sequence
ATGCAACCTCAGCAGAGCTTGAGAATTGACTTGGGTGAATTGAAATCTCAGATAGTGAAGAAGCTTGGAACTGATCGGTCAAAACGGTATTTCTTTTACTTGAACAGGTT
CTTGAGTCAGAAGCTGAGTAAGAATGAGTTTGATAAGTTATGTCGTCGGGTGCTCGGGAGAGACAATCTTTGGCTGCATAATCAGTTGATACAGTCAATTTTGAAGAATG
CGTGCCAAGCTAAGGCTGCGCCGCCGTTACCTGTAGCAGGCTATCCGAAAACTTCAACACAATCTGCAAAAGTTTCCCCTGTTATAGAAGATGGGAATGGGGACACTGGA
GCTGTTTATCCTACTTCTACTCAAAGTATTCCCATTTGGTCTAATGGAGGATTTCCAGCGTCCCCAAGAAAGAGCAGGTCTGGGATACGCGACCGCAAACTCAAGGACAG
GCCGAGTCCACTGGGGCCAAATGGGAAGGTCGAATGTATCTCACATCAATCAGCAGGCAAGAAAGATGGAAGCTGTAAAATGATGATGGTTAATGGTGATGCAACTCTAT
GTGACTATCAGAGACCAGTGCAACATCTGCAAGGAGTCGCTGAACTGCCTGAAAACAATATCGAGGCTAGAATTCGACCAGCAGGAAAGCAAGTCCTAAACAATAAGATC
CATGATGAAGGAACTAAGGTTGGAGACCGGGAAGAAGCGGGACACTCAATCCACTCGGGTTTGCTTCGAAGTCGCTTACTGGCACCCCTTGGGATTCCTTTCTGCTCAGC
TAGTATCGGCGGGGCCCGCAAAGCGAGGCCTGCGGATTTTGGGGGTGATTTCGTTAGCTTTAGCGATATTGGTCATTTATCGGATACAGAGTCATTGAGACGACGCATGG
AACAAATTGCTGCAGTACATGGCCTAGGCAGTGTTTCTGCAGACTCTGCTAATATTTTGAATAAGGTGTTGGATGTATATTTGAAGCAGTTAATAAGGTCTTGTGTTGGC
TTGGTTGGAACATGGCCTATGCCATGTGAGCCCGAGAAGCCTCTTACCGATAAGCTACAGGTTCAGGGGAAGGTGATCAATGGTATGTTGCCGAATAATCAATTACACGG
GCGACATAGCAATGGAAGCAGAGAAGTTATGCACGAGCACAGATTACGGTGCTCGGTATCGTTGCTTGATTTCAAAGTAGCAATGGAGCTTAACCCAAAGCAACTTGGGG
AAGACTGGCCTTTGCTGCTGGAGAAAATTCGTATGCGTGCATTCGAGGAA
Protein sequenceShow/hide protein sequence
MQPQQSLRIDLGELKSQIVKKLGTDRSKRYFFYLNRFLSQKLSKNEFDKLCRRVLGRDNLWLHNQLIQSILKNACQAKAAPPLPVAGYPKTSTQSAKVSPVIEDGNGDTG
AVYPTSTQSIPIWSNGGFPASPRKSRSGIRDRKLKDRPSPLGPNGKVECISHQSAGKKDGSCKMMMVNGDATLCDYQRPVQHLQGVAELPENNIEARIRPAGKQVLNNKI
HDEGTKVGDREEAGHSIHSGLLRSRLLAPLGIPFCSASIGGARKARPADFGGDFVSFSDIGHLSDTESLRRRMEQIAAVHGLGSVSADSANILNKVLDVYLKQLIRSCVG
LVGTWPMPCEPEKPLTDKLQVQGKVINGMLPNNQLHGRHSNGSREVMHEHRLRCSVSLLDFKVAMELNPKQLGEDWPLLLEKIRMRAFEE