; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sed0011899 (gene) of Chayote v1 genome

Gene IDSed0011899
OrganismSechium edule (Chayote v1)
DescriptionSAGA-Tad1 domain-containing protein
Genome locationLG01:9756662..9759184
RNA-Seq ExpressionSed0011899
SyntenySed0011899
Gene Ontology termsGO:0006357 - regulation of transcription by RNA polymerase II (biological process)
GO:0045893 - positive regulation of transcription, DNA-templated (biological process)
GO:0000124 - SAGA complex (cellular component)
GO:0003713 - transcription coactivator activity (molecular function)
InterPro domainsIPR024738 - Transcriptional coactivator Hfi1/Transcriptional adapter 1


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7029751.1 hypothetical protein SDJN02_08093, partial [Cucurbita argyrosperma subsp. argyrosperma]1.4e-18682.89Show/hide
Query:  MQAQQILRIDLGGLKSQIVKKLGTDRSKWYFFYLNRFLSQKLSKNEFDKLCCRVLGKDNLWLHNQLIHSILKNACQAKAAPPIPVAGYPKTSTQSAKNSP
        MQ QQ LRIDL  LKSQIVKKLGTDRSK YFFYLNRFLSQKLSKNEFDKLCCRVLG++NLWLHNQLI SILKNACQAKAAPPIP AGYPKTSTQ+AK SP
Subjt:  MQAQQILRIDLGGLKSQIVKKLGTDRSKWYFFYLNRFLSQKLSKNEFDKLCCRVLGKDNLWLHNQLIHSILKNACQAKAAPPIPVAGYPKTSTQSAKNSP

Query:  VIEDGNGDGAAVFPSSNQSIPIWSNGGFSTSPRKSRSRIHDRKIKDRPSPLEPNGKVECISYQSASKEDGSCQIVMDNGDAAPCDFRRPVQHLQGVAELP
        VIEDGN DG AVFP+S Q IPIWSN GF  SPRK RS I DRK+KDRPS L PN KVECIS QSA KEDGSC+I+MDNG+A  CD++RPVQHLQGV ELP
Subjt:  VIEDGNGDGAAVFPSSNQSIPIWSNGGFSTSPRKSRSRIHDRKIKDRPSPLEPNGKVECISYQSASKEDGSCQIVMDNGDAAPCDFRRPVQHLQGVAELP

Query:  ENNIEARVQQPAGKQVLPNKIQVEDREEGAQSNQLSLLRSRLLAPLGIPFCSASIGGACRARPVDCGDDFSFSDIGHLLDTESLRRRMEQIAAVQGLGNV
        ENNIEARVQ+PAGKQVL  ++QVEDREE  QSN+ SLLRSRLLAPLGIPFCSASIGGA + RPVDCG +FSFSD+GHLLDTESLRRRMEQIAAVQGLG+V
Subjt:  ENNIEARVQQPAGKQVLPNKIQVEDREEGAQSNQLSLLRSRLLAPLGIPFCSASIGGACRARPVDCGDDFSFSDIGHLLDTESLRRRMEQIAAVQGLGNV

Query:  SADCANILNNVLDVYLKQLIRSCVNLVGTW----PEKPLAHKQQIQGKVINGMLPNIQLHGRHSNGNGEAMPEHRLQCSISLLDFKVAMELNPKQLGEDW
        SADCANILN VLDVYLKQLIRSCV+LVG W    PEKPLAH QQIQGKVINGMLPN QLH  HSNGNGE + E RL CSISLLDFKVAMELNPKQLGEDW
Subjt:  SADCANILNNVLDVYLKQLIRSCVNLVGTW----PEKPLAHKQQIQGKVINGMLPNIQLHGRHSNGNGEAMPEHRLQCSISLLDFKVAMELNPKQLGEDW

Query:  PLLMEKICM
        PLL+EKI M
Subjt:  PLLMEKICM

XP_008445087.1 PREDICTED: uncharacterized protein LOC103488231 [Cucumis melo]1.3e-18180.68Show/hide
Query:  MQAQQILRIDLGGLKSQIVKKLGTDRSKWYFFYLNRFLSQKLSKNEFDKLCCRVLGKDNLWLHNQLIHSILKNACQAKAAPPIPVAGYPKTSTQSAKNSP
        MQ QQ LRIDLG LKSQIVKKLG DRSK YFFYLNRFLSQKLSKNEFDK CCRVLG++NLWLHNQLI SILKNACQAKAAPPIPVAGYPKTSTQSAK SP
Subjt:  MQAQQILRIDLGGLKSQIVKKLGTDRSKWYFFYLNRFLSQKLSKNEFDKLCCRVLGKDNLWLHNQLIHSILKNACQAKAAPPIPVAGYPKTSTQSAKNSP

Query:  VIEDGNGDGAAVFPSSNQSIPIWSNGGFSTSPRKSRSRIHDRKIKDRPSPLEPNGKVECISYQSASKEDGSCQIVMDNGDAAPCDFRRPVQHLQGVAELP
        ++EDGN DG AVFP+S Q+IP WSNG    SPRK RS I DRK+KDRPS L PNGKVECIS+ SA+         MDNGDA  CD++RPVQHLQGVAELP
Subjt:  VIEDGNGDGAAVFPSSNQSIPIWSNGGFSTSPRKSRSRIHDRKIKDRPSPLEPNGKVECISYQSASKEDGSCQIVMDNGDAAPCDFRRPVQHLQGVAELP

Query:  ENNIEARVQQPAGKQVLPNKIQ-----VEDREEGAQSNQLSLLRSRLLAPLGIPFCSASIGGACRARPVDCGDDFSFSDIGHLLDTESLRRRMEQIAAVQ
        ENNIE RV QP+GKQVL NKIQ     VEDREE  QSN  SLLRSRLLAPLGIPFCSAS GG  + RPVDCG DFSF D+GHLLDTESLRRRMEQIAAVQ
Subjt:  ENNIEARVQQPAGKQVLPNKIQ-----VEDREEGAQSNQLSLLRSRLLAPLGIPFCSASIGGACRARPVDCGDDFSFSDIGHLLDTESLRRRMEQIAAVQ

Query:  GLGNVSADCANILNNVLDVYLKQLIRSCVNLVGTW----PEKPLAHKQQIQGKVINGMLPNIQLHGRHSNGNGEAMPEHRLQCSISLLDFKVAMELNPKQ
        GLG+VSADCANILN VLDVYLKQLIRSCV+LVG W    PEKPLAHKQQIQGKVINGMLPN QLHGRHSNGN E + EHRLQCSISLLDFKVAMELNP Q
Subjt:  GLGNVSADCANILNNVLDVYLKQLIRSCVNLVGTW----PEKPLAHKQQIQGKVINGMLPNIQLHGRHSNGNGEAMPEHRLQCSISLLDFKVAMELNPKQ

Query:  LGEDWPLLMEKICM
        LGEDWPLL+EKICM
Subjt:  LGEDWPLLMEKICM

XP_022962598.1 uncharacterized protein LOC111463000 [Cucurbita moschata]3.0e-18682.64Show/hide
Query:  MQAQQILRIDLGGLKSQIVKKLGTDRSKWYFFYLNRFLSQKLSKNEFDKLCCRVLGKDNLWLHNQLIHSILKNACQAKAAPPIPVAGYPKTSTQSAKNSP
        MQ QQ LRIDL  LKSQIVKKLGTDRSK YFFYLNRFLSQKLSKNEFDKLCCRVLG++NLWLHNQLI SILKNACQAKAAPPIP AGYPKTSTQ+AK SP
Subjt:  MQAQQILRIDLGGLKSQIVKKLGTDRSKWYFFYLNRFLSQKLSKNEFDKLCCRVLGKDNLWLHNQLIHSILKNACQAKAAPPIPVAGYPKTSTQSAKNSP

Query:  VIEDGNGDGAAVFPSSNQSIPIWSNGGFSTSPRKSRSRIHDRKIKDRPSPLEPNGKVECISYQSASKEDGSCQIVMDNGDAAPCDFRRPVQHLQGVAELP
        VIEDGN DG AVFP+S Q IPIWSN GF  SPRK RS I DRK+KDRPS L PN KVECIS QSA KEDGSC+I+MDNG+A  CD++RPVQHLQGV ELP
Subjt:  VIEDGNGDGAAVFPSSNQSIPIWSNGGFSTSPRKSRSRIHDRKIKDRPSPLEPNGKVECISYQSASKEDGSCQIVMDNGDAAPCDFRRPVQHLQGVAELP

Query:  ENNIEARVQQPAGKQVLPNKIQVEDREEGAQSNQLSLLRSRLLAPLGIPFCSASIGGACRARPVDCGDDFSFSDIGHLLDTESLRRRMEQIAAVQGLGNV
        ENNIEARVQ+P+GKQVL  ++QVEDREE  QSN+ SLLRSRLLAPLGIPFCSASIGGA + RPVDCG +FSFSD+GHLLDTESLRRRMEQIAAVQGLG+V
Subjt:  ENNIEARVQQPAGKQVLPNKIQVEDREEGAQSNQLSLLRSRLLAPLGIPFCSASIGGACRARPVDCGDDFSFSDIGHLLDTESLRRRMEQIAAVQGLGNV

Query:  SADCANILNNVLDVYLKQLIRSCVNLVGTW----PEKPLAHKQQIQGKVINGMLPNIQLHGRHSNGNGEAMPEHRLQCSISLLDFKVAMELNPKQLGEDW
        SADCANILN VLDVYLKQLIRSCV+LVG W    PEKPLAH QQIQGKVINGMLPN QLH  HSNGNGE + E RL CSISLLDFKVAMELNPKQLGEDW
Subjt:  SADCANILNNVLDVYLKQLIRSCVNLVGTW----PEKPLAHKQQIQGKVINGMLPNIQLHGRHSNGNGEAMPEHRLQCSISLLDFKVAMELNPKQLGEDW

Query:  PLLMEKICM
        PLL+EKI M
Subjt:  PLLMEKICM

XP_022997521.1 uncharacterized protein LOC111492414 [Cucurbita maxima]2.8e-18481.55Show/hide
Query:  MQAQQILRIDLGGLKSQIVKKLGTDRSKWYFFYLNRFLSQKLSKNEFDKLCCRVLGKDNLWLHNQLIHSILKNACQAKAAPPIPVAGYPKTSTQSAKNSP
        MQ QQ LRIDL  LKSQIVKKLGTDRSK YFFYLNRFLSQKLSKNEFDKLCCRVLG++NLWLHNQLI SILKNACQAKAAPPIP AGYPKTSTQ+AK SP
Subjt:  MQAQQILRIDLGGLKSQIVKKLGTDRSKWYFFYLNRFLSQKLSKNEFDKLCCRVLGKDNLWLHNQLIHSILKNACQAKAAPPIPVAGYPKTSTQSAKNSP

Query:  VIEDGNGDGAAVFPSSNQSIPIWSNGGFSTSPRKSRSRIHDRKIKDRPSPLEPNGKVECISYQSASKEDGSCQIVMDNGDAAPCDFRRPVQHLQGVAELP
        VIEDGN DG AVF +S Q IPIWSN GFS SPRK RS I DRK+KDRPS L PN KVECIS QSA KEDGSC+I+MDNG+A  CD++RPVQHLQGV ELP
Subjt:  VIEDGNGDGAAVFPSSNQSIPIWSNGGFSTSPRKSRSRIHDRKIKDRPSPLEPNGKVECISYQSASKEDGSCQIVMDNGDAAPCDFRRPVQHLQGVAELP

Query:  ENNIEARVQQPAGKQVLPNKIQ---VEDREEGAQSNQLSLLRSRLLAPLGIPFCSASIGGACRARPVDCGDDFSFSDIGHLLDTESLRRRMEQIAAVQGL
        ENNIEARVQ+P+GKQVL  +++   VEDREE  QSN+ SLLRSRLLAPLGIPFCSASIGGA + RPVDCG +FSFSD+GHLLDTESLRRRMEQIAAVQGL
Subjt:  ENNIEARVQQPAGKQVLPNKIQ---VEDREEGAQSNQLSLLRSRLLAPLGIPFCSASIGGACRARPVDCGDDFSFSDIGHLLDTESLRRRMEQIAAVQGL

Query:  GNVSADCANILNNVLDVYLKQLIRSCVNLVGTW----PEKPLAHKQQIQGKVINGMLPNIQLHGRHSNGNGEAMPEHRLQCSISLLDFKVAMELNPKQLG
        G+VSADCANILN VLDVYLKQLIRSCV+LVG W    PEKPLAH QQIQGKVINGMLPN QLH  HSNGN E + E RL CSISLLDFKVAMELNPKQLG
Subjt:  GNVSADCANILNNVLDVYLKQLIRSCVNLVGTW----PEKPLAHKQQIQGKVINGMLPNIQLHGRHSNGNGEAMPEHRLQCSISLLDFKVAMELNPKQLG

Query:  EDWPLLMEKICM
        EDWPLL+EKI M
Subjt:  EDWPLLMEKICM

XP_023546134.1 uncharacterized protein LOC111805335 [Cucurbita pepo subsp. pepo]2.6e-18581.55Show/hide
Query:  MQAQQILRIDLGGLKSQIVKKLGTDRSKWYFFYLNRFLSQKLSKNEFDKLCCRVLGKDNLWLHNQLIHSILKNACQAKAAPPIPVAGYPKTSTQSAKNSP
        MQ QQ LRIDL  LKSQIVKKLGTDRSK YFFYLNRFLSQKLSKNEFDKLCCRVLG++NLWLHNQLI SILKNACQAKAAPPIP AGYPKTSTQ+AK SP
Subjt:  MQAQQILRIDLGGLKSQIVKKLGTDRSKWYFFYLNRFLSQKLSKNEFDKLCCRVLGKDNLWLHNQLIHSILKNACQAKAAPPIPVAGYPKTSTQSAKNSP

Query:  VIEDGNGDGAAVFPSSNQSIPIWSNGGFSTSPRKSRSRIHDRKIKDRPSPLEPNGKVECISYQSASKEDGSCQIVMDNGDAAPCDFRRPVQHLQGVAELP
        VIEDGN DG AVFP+S Q IPIWSN GF  SPRK RS I DRK+KDRPS L PN KVECIS QSA KEDGSC+I++DNG+A  CD++RPVQHLQGV ELP
Subjt:  VIEDGNGDGAAVFPSSNQSIPIWSNGGFSTSPRKSRSRIHDRKIKDRPSPLEPNGKVECISYQSASKEDGSCQIVMDNGDAAPCDFRRPVQHLQGVAELP

Query:  ENNIEARVQQPAGKQVLPNKIQ---VEDREEGAQSNQLSLLRSRLLAPLGIPFCSASIGGACRARPVDCGDDFSFSDIGHLLDTESLRRRMEQIAAVQGL
        ENNIEARVQ+P+GKQVL  +++   VEDREE  QSN+ SLLRSRLLAPLGIPFCSASIGGA + RPVDCG +FSFSD+GHLLDTESLRRRMEQIAAVQGL
Subjt:  ENNIEARVQQPAGKQVLPNKIQ---VEDREEGAQSNQLSLLRSRLLAPLGIPFCSASIGGACRARPVDCGDDFSFSDIGHLLDTESLRRRMEQIAAVQGL

Query:  GNVSADCANILNNVLDVYLKQLIRSCVNLVGTW----PEKPLAHKQQIQGKVINGMLPNIQLHGRHSNGNGEAMPEHRLQCSISLLDFKVAMELNPKQLG
        G+VSADCANILN VLDVYLKQLIRSCV+LVG W    PEKPLAH QQIQGKVINGMLPN QLH  HSNGNGE + E RL CSISLLDFKVAMELNPKQLG
Subjt:  GNVSADCANILNNVLDVYLKQLIRSCVNLVGTW----PEKPLAHKQQIQGKVINGMLPNIQLHGRHSNGNGEAMPEHRLQCSISLLDFKVAMELNPKQLG

Query:  EDWPLLMEKICM
        EDWPLL+EKI M
Subjt:  EDWPLLMEKICM

TrEMBL top hitse value%identityAlignment
A0A1S3BCQ5 uncharacterized protein LOC1034882316.3e-18280.68Show/hide
Query:  MQAQQILRIDLGGLKSQIVKKLGTDRSKWYFFYLNRFLSQKLSKNEFDKLCCRVLGKDNLWLHNQLIHSILKNACQAKAAPPIPVAGYPKTSTQSAKNSP
        MQ QQ LRIDLG LKSQIVKKLG DRSK YFFYLNRFLSQKLSKNEFDK CCRVLG++NLWLHNQLI SILKNACQAKAAPPIPVAGYPKTSTQSAK SP
Subjt:  MQAQQILRIDLGGLKSQIVKKLGTDRSKWYFFYLNRFLSQKLSKNEFDKLCCRVLGKDNLWLHNQLIHSILKNACQAKAAPPIPVAGYPKTSTQSAKNSP

Query:  VIEDGNGDGAAVFPSSNQSIPIWSNGGFSTSPRKSRSRIHDRKIKDRPSPLEPNGKVECISYQSASKEDGSCQIVMDNGDAAPCDFRRPVQHLQGVAELP
        ++EDGN DG AVFP+S Q+IP WSNG    SPRK RS I DRK+KDRPS L PNGKVECIS+ SA+         MDNGDA  CD++RPVQHLQGVAELP
Subjt:  VIEDGNGDGAAVFPSSNQSIPIWSNGGFSTSPRKSRSRIHDRKIKDRPSPLEPNGKVECISYQSASKEDGSCQIVMDNGDAAPCDFRRPVQHLQGVAELP

Query:  ENNIEARVQQPAGKQVLPNKIQ-----VEDREEGAQSNQLSLLRSRLLAPLGIPFCSASIGGACRARPVDCGDDFSFSDIGHLLDTESLRRRMEQIAAVQ
        ENNIE RV QP+GKQVL NKIQ     VEDREE  QSN  SLLRSRLLAPLGIPFCSAS GG  + RPVDCG DFSF D+GHLLDTESLRRRMEQIAAVQ
Subjt:  ENNIEARVQQPAGKQVLPNKIQ-----VEDREEGAQSNQLSLLRSRLLAPLGIPFCSASIGGACRARPVDCGDDFSFSDIGHLLDTESLRRRMEQIAAVQ

Query:  GLGNVSADCANILNNVLDVYLKQLIRSCVNLVGTW----PEKPLAHKQQIQGKVINGMLPNIQLHGRHSNGNGEAMPEHRLQCSISLLDFKVAMELNPKQ
        GLG+VSADCANILN VLDVYLKQLIRSCV+LVG W    PEKPLAHKQQIQGKVINGMLPN QLHGRHSNGN E + EHRLQCSISLLDFKVAMELNP Q
Subjt:  GLGNVSADCANILNNVLDVYLKQLIRSCVNLVGTW----PEKPLAHKQQIQGKVINGMLPNIQLHGRHSNGNGEAMPEHRLQCSISLLDFKVAMELNPKQ

Query:  LGEDWPLLMEKICM
        LGEDWPLL+EKICM
Subjt:  LGEDWPLLMEKICM

A0A5A7VF96 SAGA-Tad1 domain-containing protein6.3e-18280.68Show/hide
Query:  MQAQQILRIDLGGLKSQIVKKLGTDRSKWYFFYLNRFLSQKLSKNEFDKLCCRVLGKDNLWLHNQLIHSILKNACQAKAAPPIPVAGYPKTSTQSAKNSP
        MQ QQ LRIDLG LKSQIVKKLG DRSK YFFYLNRFLSQKLSKNEFDK CCRVLG++NLWLHNQLI SILKNACQAKAAPPIPVAGYPKTSTQSAK SP
Subjt:  MQAQQILRIDLGGLKSQIVKKLGTDRSKWYFFYLNRFLSQKLSKNEFDKLCCRVLGKDNLWLHNQLIHSILKNACQAKAAPPIPVAGYPKTSTQSAKNSP

Query:  VIEDGNGDGAAVFPSSNQSIPIWSNGGFSTSPRKSRSRIHDRKIKDRPSPLEPNGKVECISYQSASKEDGSCQIVMDNGDAAPCDFRRPVQHLQGVAELP
        ++EDGN DG AVFP+S Q+IP WSNG    SPRK RS I DRK+KDRPS L PNGKVECIS+ SA+         MDNGDA  CD++RPVQHLQGVAELP
Subjt:  VIEDGNGDGAAVFPSSNQSIPIWSNGGFSTSPRKSRSRIHDRKIKDRPSPLEPNGKVECISYQSASKEDGSCQIVMDNGDAAPCDFRRPVQHLQGVAELP

Query:  ENNIEARVQQPAGKQVLPNKIQ-----VEDREEGAQSNQLSLLRSRLLAPLGIPFCSASIGGACRARPVDCGDDFSFSDIGHLLDTESLRRRMEQIAAVQ
        ENNIE RV QP+GKQVL NKIQ     VEDREE  QSN  SLLRSRLLAPLGIPFCSAS GG  + RPVDCG DFSF D+GHLLDTESLRRRMEQIAAVQ
Subjt:  ENNIEARVQQPAGKQVLPNKIQ-----VEDREEGAQSNQLSLLRSRLLAPLGIPFCSASIGGACRARPVDCGDDFSFSDIGHLLDTESLRRRMEQIAAVQ

Query:  GLGNVSADCANILNNVLDVYLKQLIRSCVNLVGTW----PEKPLAHKQQIQGKVINGMLPNIQLHGRHSNGNGEAMPEHRLQCSISLLDFKVAMELNPKQ
        GLG+VSADCANILN VLDVYLKQLIRSCV+LVG W    PEKPLAHKQQIQGKVINGMLPN QLHGRHSNGN E + EHRLQCSISLLDFKVAMELNP Q
Subjt:  GLGNVSADCANILNNVLDVYLKQLIRSCVNLVGTW----PEKPLAHKQQIQGKVINGMLPNIQLHGRHSNGNGEAMPEHRLQCSISLLDFKVAMELNPKQ

Query:  LGEDWPLLMEKICM
        LGEDWPLL+EKICM
Subjt:  LGEDWPLLMEKICM

A0A6J1BTJ5 uncharacterized protein LOC1110052067.8e-18079.57Show/hide
Query:  MQAQQILRIDLGGLKSQIVKKLGTDRSKWYFFYLNRFLSQKLSKNEFDKLCCRVLGKDNLWLHNQLIHSILKNACQAKAAPPIPVAGYPKTSTQSAKNSP
        MQ QQ LRIDLG LKSQIVKKLGTDRSK YFFYLNRFLSQKLSKNEFDKLC RVLG+DNLWLHNQLI SILKNACQAKAAPP+PVAGYPKTSTQSAK SP
Subjt:  MQAQQILRIDLGGLKSQIVKKLGTDRSKWYFFYLNRFLSQKLSKNEFDKLCCRVLGKDNLWLHNQLIHSILKNACQAKAAPPIPVAGYPKTSTQSAKNSP

Query:  VIEDGNGDGAAVFPSSNQSIPIWSNGGFSTSPRKSRSRIHDRKIKDRPSPLEPNGKVECISYQSASKEDGSCQIVMDNGDAAPCDFRRPVQHLQGVAELP
        VIEDGN D  AV+P+S QSIPIWSNGGF  SPRKSRS I DRK+KDRPSPL PNGKVECIS+QSA K+DGSC+++M NGDA  CD++RPVQHLQGVAELP
Subjt:  VIEDGNGDGAAVFPSSNQSIPIWSNGGFSTSPRKSRSRIHDRKIKDRPSPLEPNGKVECISYQSASKEDGSCQIVMDNGDAAPCDFRRPVQHLQGVAELP

Query:  ENNIEARVQQPAGKQVLPNKI-----QVEDREEGAQSNQLSLLRSRLLAPLGIPFCSASIGGACRARPVDCGDDF-SFSDIGHLLDTESLRRRMEQIAAV
        ENNIEAR+ +PAGKQVL NKI     +V DREE   S    LL+SRLLAPLGIPFCSASIGGA +ARP D G DF SFSDIGHL DTESLRRRMEQIAAV
Subjt:  ENNIEARVQQPAGKQVLPNKI-----QVEDREEGAQSNQLSLLRSRLLAPLGIPFCSASIGGACRARPVDCGDDF-SFSDIGHLLDTESLRRRMEQIAAV

Query:  QGLGNVSADCANILNNVLDVYLKQLIRSCVNLVGTW-----PEKPLAHKQQIQGKVINGMLPNIQLHGRHSNGNGEAMPEHRLQCSISLLDFKVAMELNP
         GLG+VSAD ANILN VLDVYLKQLIRSCV LVGT      PEKPL  K Q+QGKVINGMLPN QLHGRHSNG+ E M EHRL+CS+SLLDFKVAMELNP
Subjt:  QGLGNVSADCANILNNVLDVYLKQLIRSCVNLVGTW-----PEKPLAHKQQIQGKVINGMLPNIQLHGRHSNGNGEAMPEHRLQCSISLLDFKVAMELNP

Query:  KQLGEDWPLLMEKICMCTSKE
        KQLGEDWPLL+EKI M   +E
Subjt:  KQLGEDWPLLMEKICMCTSKE

A0A6J1HF85 uncharacterized protein LOC1114630001.5e-18682.64Show/hide
Query:  MQAQQILRIDLGGLKSQIVKKLGTDRSKWYFFYLNRFLSQKLSKNEFDKLCCRVLGKDNLWLHNQLIHSILKNACQAKAAPPIPVAGYPKTSTQSAKNSP
        MQ QQ LRIDL  LKSQIVKKLGTDRSK YFFYLNRFLSQKLSKNEFDKLCCRVLG++NLWLHNQLI SILKNACQAKAAPPIP AGYPKTSTQ+AK SP
Subjt:  MQAQQILRIDLGGLKSQIVKKLGTDRSKWYFFYLNRFLSQKLSKNEFDKLCCRVLGKDNLWLHNQLIHSILKNACQAKAAPPIPVAGYPKTSTQSAKNSP

Query:  VIEDGNGDGAAVFPSSNQSIPIWSNGGFSTSPRKSRSRIHDRKIKDRPSPLEPNGKVECISYQSASKEDGSCQIVMDNGDAAPCDFRRPVQHLQGVAELP
        VIEDGN DG AVFP+S Q IPIWSN GF  SPRK RS I DRK+KDRPS L PN KVECIS QSA KEDGSC+I+MDNG+A  CD++RPVQHLQGV ELP
Subjt:  VIEDGNGDGAAVFPSSNQSIPIWSNGGFSTSPRKSRSRIHDRKIKDRPSPLEPNGKVECISYQSASKEDGSCQIVMDNGDAAPCDFRRPVQHLQGVAELP

Query:  ENNIEARVQQPAGKQVLPNKIQVEDREEGAQSNQLSLLRSRLLAPLGIPFCSASIGGACRARPVDCGDDFSFSDIGHLLDTESLRRRMEQIAAVQGLGNV
        ENNIEARVQ+P+GKQVL  ++QVEDREE  QSN+ SLLRSRLLAPLGIPFCSASIGGA + RPVDCG +FSFSD+GHLLDTESLRRRMEQIAAVQGLG+V
Subjt:  ENNIEARVQQPAGKQVLPNKIQVEDREEGAQSNQLSLLRSRLLAPLGIPFCSASIGGACRARPVDCGDDFSFSDIGHLLDTESLRRRMEQIAAVQGLGNV

Query:  SADCANILNNVLDVYLKQLIRSCVNLVGTW----PEKPLAHKQQIQGKVINGMLPNIQLHGRHSNGNGEAMPEHRLQCSISLLDFKVAMELNPKQLGEDW
        SADCANILN VLDVYLKQLIRSCV+LVG W    PEKPLAH QQIQGKVINGMLPN QLH  HSNGNGE + E RL CSISLLDFKVAMELNPKQLGEDW
Subjt:  SADCANILNNVLDVYLKQLIRSCVNLVGTW----PEKPLAHKQQIQGKVINGMLPNIQLHGRHSNGNGEAMPEHRLQCSISLLDFKVAMELNPKQLGEDW

Query:  PLLMEKICM
        PLL+EKI M
Subjt:  PLLMEKICM

A0A6J1K7Q1 uncharacterized protein LOC1114924141.4e-18481.55Show/hide
Query:  MQAQQILRIDLGGLKSQIVKKLGTDRSKWYFFYLNRFLSQKLSKNEFDKLCCRVLGKDNLWLHNQLIHSILKNACQAKAAPPIPVAGYPKTSTQSAKNSP
        MQ QQ LRIDL  LKSQIVKKLGTDRSK YFFYLNRFLSQKLSKNEFDKLCCRVLG++NLWLHNQLI SILKNACQAKAAPPIP AGYPKTSTQ+AK SP
Subjt:  MQAQQILRIDLGGLKSQIVKKLGTDRSKWYFFYLNRFLSQKLSKNEFDKLCCRVLGKDNLWLHNQLIHSILKNACQAKAAPPIPVAGYPKTSTQSAKNSP

Query:  VIEDGNGDGAAVFPSSNQSIPIWSNGGFSTSPRKSRSRIHDRKIKDRPSPLEPNGKVECISYQSASKEDGSCQIVMDNGDAAPCDFRRPVQHLQGVAELP
        VIEDGN DG AVF +S Q IPIWSN GFS SPRK RS I DRK+KDRPS L PN KVECIS QSA KEDGSC+I+MDNG+A  CD++RPVQHLQGV ELP
Subjt:  VIEDGNGDGAAVFPSSNQSIPIWSNGGFSTSPRKSRSRIHDRKIKDRPSPLEPNGKVECISYQSASKEDGSCQIVMDNGDAAPCDFRRPVQHLQGVAELP

Query:  ENNIEARVQQPAGKQVLPNKIQ---VEDREEGAQSNQLSLLRSRLLAPLGIPFCSASIGGACRARPVDCGDDFSFSDIGHLLDTESLRRRMEQIAAVQGL
        ENNIEARVQ+P+GKQVL  +++   VEDREE  QSN+ SLLRSRLLAPLGIPFCSASIGGA + RPVDCG +FSFSD+GHLLDTESLRRRMEQIAAVQGL
Subjt:  ENNIEARVQQPAGKQVLPNKIQ---VEDREEGAQSNQLSLLRSRLLAPLGIPFCSASIGGACRARPVDCGDDFSFSDIGHLLDTESLRRRMEQIAAVQGL

Query:  GNVSADCANILNNVLDVYLKQLIRSCVNLVGTW----PEKPLAHKQQIQGKVINGMLPNIQLHGRHSNGNGEAMPEHRLQCSISLLDFKVAMELNPKQLG
        G+VSADCANILN VLDVYLKQLIRSCV+LVG W    PEKPLAH QQIQGKVINGMLPN QLH  HSNGN E + E RL CSISLLDFKVAMELNPKQLG
Subjt:  GNVSADCANILNNVLDVYLKQLIRSCVNLVGTW----PEKPLAHKQQIQGKVINGMLPNIQLHGRHSNGNGEAMPEHRLQCSISLLDFKVAMELNPKQLG

Query:  EDWPLLMEKICM
        EDWPLL+EKI M
Subjt:  EDWPLLMEKICM

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G14850.1 unknown protein5.7e-3430Show/hide
Query:  LKSQIVKKLGTDRSKWYFFYLNRFLSQKLSKNEFDKLCCRVLGKDNLWLHNQLIHSILKNACQAKAAPPIPVAGYPKTSTQSAKNSPVIEDGNGDGAAVF
        +K+ I +K+G  R+  YF  L +FL+ ++SK+EFDKLC + +G++N+ LHN+L+ SILKNA  AK+ PP     YPK S                G  VF
Subjt:  LKSQIVKKLGTDRSKWYFFYLNRFLSQKLSKNEFDKLCCRVLGKDNLWLHNQLIHSILKNACQAKAAPPIPVAGYPKTSTQSAKNSPVIEDGNGDGAAVF

Query:  PSSNQSIPIWSNGGFSTSPRKSRSRIHDRKIKDRPSPLEPNGKVECISYQSASKEDGSCQIVMDNGDAAPCDFRRPVQHLQGVAELPENNIEARVQQPAG
        P                SPRK RS    RK +DRPSPL P GK                                           P++      +  + 
Subjt:  PSSNQSIPIWSNGGFSTSPRKSRSRIHDRKIKDRPSPLEPNGKVECISYQSASKEDGSCQIVMDNGDAAPCDFRRPVQHLQGVAELPENNIEARVQQPAG

Query:  KQVLPNK-IQVEDREEGAQSNQLSLLRSR--LLAPLGIPFCSASIGGACRARPVDCGDDFSFSDIGHLLDTESLRRRMEQIAAVQGLGNVSADCANILNN
         Q LP + + VED EE  Q      ++SR  L APLG+ F    +    R    +  +  +    G L D  +LR R+E+   ++G+  +S D AN+LN 
Subjt:  KQVLPNK-IQVEDREEGAQSNQLSLLRSR--LLAPLGIPFCSASIGGACRARPVDCGDDFSFSDIGHLLDTESLRRRMEQIAAVQGLGNVSADCANILNN

Query:  VLDVYLKQLIRSCVNLVGTWPEKPLAHKQQIQGKVINGMLPNIQLHGRHSNGNGEAMPEHRLQCSISLLDFKVAMELNPKQLGEDWPLLMEKICMCTSKE
         L+ Y+++LI  C++L                                       A  + R   ++S+LDF  AME+NP+ LGE+WP+ +EKIC   S+E
Subjt:  VLDVYLKQLIRSCVNLVGTWPEKPLAHKQQIQGKVINGMLPNIQLHGRHSNGNGEAMPEHRLQCSISLLDFKVAMELNPKQLGEDWPLLMEKICMCTSKE

AT2G24530.1 unknown protein2.2e-8644.42Show/hide
Query:  MQAQQILRIDLGGLKSQIVKKLGTDRSKWYFFYLNRFLSQKLSKNEFDKLCCRVLGKDNLWLHNQLIHSILKNACQAKAAPPIPVAGYPKTSTQSAKNSP
        MQ  Q  RI L  LK  IVKK G +RS+ YF+YL RFLSQKL+K+EFDK C R+LG++NL LHNQLI SIL+NA  AK+ PP   AG+   +        
Subjt:  MQAQQILRIDLGGLKSQIVKKLGTDRSKWYFFYLNRFLSQKLSKNEFDKLCCRVLGKDNLWLHNQLIHSILKNACQAKAAPPIPVAGYPKTSTQSAKNSP

Query:  VIEDGNGDGAAVFPSSNQSIPIWSNGGFSTSPRKSRSRIHDRKIKDRPSPLEPNGKVECISYQSASKEDGSCQIVMDNGDAAPCDFRRPVQHLQGVAELP
           DG      + P+ +Q  P+WSNG    SPRK RS + +RK +DRPSPL  NGKVE + +Q   +ED    + M+NG     D++R  +++       
Subjt:  VIEDGNGDGAAVFPSSNQSIPIWSNGGFSTSPRKSRSRIHDRKIKDRPSPLEPNGKVECISYQSASKEDGSCQIVMDNGDAAPCDFRRPVQHLQGVAELP

Query:  ENNIEARVQQPAGKQVLPNK-----IQVEDREEGAQSNQLSLLRSRLLAPLGIPFCSASIGGACRARPVDCGDD-FSFSDIGHLLDTESLRRRMEQIAAV
         +  +    +P  K  +PNK     + + D +   +  +++L  S L+APLGIPFCSAS+GG+ R  PV    +  S  D G L D E LR+RME IA  
Subjt:  ENNIEARVQQPAGKQVLPNK-----IQVEDREEGAQSNQLSLLRSRLLAPLGIPFCSASIGGACRARPVDCGDD-FSFSDIGHLLDTESLRRRMEQIAAV

Query:  QGLGNVSADCANILNNVLDVYLKQLIRSCVNLVGTW-----PEKPLAHKQQIQGKVINGMLPNIQLHGRHSNGNGEAMPEHRLQCSISLLDFKVAMELNP
        QGL  VS +CA  LNN+LDVYLK+LI SC +LVG       P K    KQQ Q K++NG+ P   L  +  NG+ +   +H    S+S+LDF+ AMELNP
Subjt:  QGLGNVSADCANILNNVLDVYLKQLIRSCVNLVGTW-----PEKPLAHKQQIQGKVINGMLPNIQLHGRHSNGNGEAMPEHRLQCSISLLDFKVAMELNP

Query:  KQLGEDWPLLMEKICMCTSKE
        +QLGEDWP L E+I + + +E
Subjt:  KQLGEDWPLLMEKICMCTSKE

AT4G31440.1 unknown protein8.5e-7042.24Show/hide
Query:  MQAQQILRIDLGGLKSQIVKKLGTDRSKWYFFYLNRFLSQKLSKNEFDKLCCRVLGKDNLWLHNQLIHSILKNACQAKAAPPIPVAGYPKTSTQSAKNSP
        MQ  Q  RIDL  LK  IVKK+G +RS  YF+YL RFLSQKL+K+EFDK C R+LG++NL LHN+LI SIL+NA  AK+ P +  +G+P  S    K   
Subjt:  MQAQQILRIDLGGLKSQIVKKLGTDRSKWYFFYLNRFLSQKLSKNEFDKLCCRVLGKDNLWLHNQLIHSILKNACQAKAAPPIPVAGYPKTSTQSAKNSP

Query:  VIEDGNGDGAAVFPSSNQSIPIWSNGGFSTSPRKSRSRIHDRKIKDRPSPLEPNGKVECISYQSASKEDGSCQIVMDNGDAAPCDFRRPVQHLQGVAELP
          EDG  +  ++ P   ++    SNG  +   +       DR I+D+P PL  NGKV                         P  + RP ++       P
Subjt:  VIEDGNGDGAAVFPSSNQSIPIWSNGGFSTSPRKSRSRIHDRKIKDRPSPLEPNGKVECISYQSASKEDGSCQIVMDNGDAAPCDFRRPVQHLQGVAELP

Query:  ENNIEARVQQPAGKQVLPNKIQVE---DREEGAQSNQLSLLRSRLLAPLGIPFCSASIGGACRARPVD-CGDDFSFSDIGHLLDTESLRRRMEQIAAVQG
        +   ++    PA ++ +  K QV     R++ AQ   LS     ++APLGIPFCSAS+GG  R  PV       S  D G L DTE LR+RME IA  QG
Subjt:  ENNIEARVQQPAGKQVLPNKIQVE---DREEGAQSNQLSLLRSRLLAPLGIPFCSASIGGACRARPVD-CGDDFSFSDIGHLLDTESLRRRMEQIAAVQG

Query:  LGNVSADCANILNNVLDVYLKQLIRSCVNLVGT-----WPEKPLAHKQQIQGKVINGMLPNIQLHGRHSNGNGEAMPEHRLQCSISLLDFKVAMELNPKQ
        LG VSA+C+ +LNN+LD+YLK+L++SCV+L G       P K    KQQ + +++NG+  N   H + SN   +   E   Q S+SLLDF+VAMELNP Q
Subjt:  LGNVSADCANILNNVLDVYLKQLIRSCVNLVGT-----WPEKPLAHKQQIQGKVINGMLPNIQLHGRHSNGNGEAMPEHRLQCSISLLDFKVAMELNPKQ

Query:  LGEDWPLLMEKICMCTSKE
        LGEDWPLL E+I +   +E
Subjt:  LGEDWPLLMEKICMCTSKE

AT4G33890.1 unknown protein3.8e-3831.45Show/hide
Query:  RIDLGGLKSQIVKKLGTDRSKWYFFYLNRFLSQKLSKNEFDKLCCRVLGKDNLWLHNQLIHSILKNACQAKAAPPIPVAGYPKTSTQSAKNSPVIEDGNG
        R+D   +K+ I +++G  R++ YF  L RF + K++K+EFDKLC + +G+ N+ LHN+LI SI+KNAC AK+ P I             K    +  GNG
Subjt:  RIDLGGLKSQIVKKLGTDRSKWYFFYLNRFLSQKLSKNEFDKLCCRVLGKDNLWLHNQLIHSILKNACQAKAAPPIPVAGYPKTSTQSAKNSPVIEDGNG

Query:  DGAAVFPSSNQSIPIWSNGGFSTSPRKSRSRIHDRKIKDRPSPLEPNGKVECI---SYQSASKEDGSCQIVMDNGDAAPCDFRRPVQHLQGVAELPENNI
        D       ++Q  P+  +  FS S RK RS    RK++DRPSPL P GK   +   + +S SK   + +++            RP   +  V E  E   
Subjt:  DGAAVFPSSNQSIPIWSNGGFSTSPRKSRSRIHDRKIKDRPSPLEPNGKVECI---SYQSASKEDGSCQIVMDNGDAAPCDFRRPVQHLQGVAELPENNI

Query:  EARVQQPAGKQVLPNKIQVEDREEGAQSNQLSLLRSRLLAPLGIPFCSASIGGACRARPVD----CGDDF---SFSDIGHLLDTESLRRRMEQIAAVQGL
           V+Q AG               G+ S Q    R  L APLG+   S S+      + V     C   F   +  + G L DT +LR R+E+   ++GL
Subjt:  EARVQQPAGKQVLPNKIQVEDREEGAQSNQLSLLRSRLLAPLGIPFCSASIGGACRARPVD----CGDDF---SFSDIGHLLDTESLRRRMEQIAAVQGL

Query:  GNVSADCANILNNVLDVYLKQLIRSCVNLVGTWPEKPLAHKQQIQGKVINGMLPNIQLHGRHSNGNGEAMPEHRLQCSISLLDFKVAMELNPKQLGEDWP
          ++ D  ++LN+ LDV++++LI  C++L  T        +                        N +   + R    +S+ DF+  MELN + LGEDWP
Subjt:  GNVSADCANILNNVLDVYLKQLIRSCVNLVGTWPEKPLAHKQQIQGKVINGMLPNIQLHGRHSNGNGEAMPEHRLQCSISLLDFKVAMELNPKQLGEDWP

Query:  LLMEKIC
        + MEKIC
Subjt:  LLMEKIC

AT4G33890.2 unknown protein3.8e-3831.45Show/hide
Query:  RIDLGGLKSQIVKKLGTDRSKWYFFYLNRFLSQKLSKNEFDKLCCRVLGKDNLWLHNQLIHSILKNACQAKAAPPIPVAGYPKTSTQSAKNSPVIEDGNG
        R+D   +K+ I +++G  R++ YF  L RF + K++K+EFDKLC + +G+ N+ LHN+LI SI+KNAC AK+ P I             K    +  GNG
Subjt:  RIDLGGLKSQIVKKLGTDRSKWYFFYLNRFLSQKLSKNEFDKLCCRVLGKDNLWLHNQLIHSILKNACQAKAAPPIPVAGYPKTSTQSAKNSPVIEDGNG

Query:  DGAAVFPSSNQSIPIWSNGGFSTSPRKSRSRIHDRKIKDRPSPLEPNGKVECI---SYQSASKEDGSCQIVMDNGDAAPCDFRRPVQHLQGVAELPENNI
        D       ++Q  P+  +  FS S RK RS    RK++DRPSPL P GK   +   + +S SK   + +++            RP   +  V E  E   
Subjt:  DGAAVFPSSNQSIPIWSNGGFSTSPRKSRSRIHDRKIKDRPSPLEPNGKVECI---SYQSASKEDGSCQIVMDNGDAAPCDFRRPVQHLQGVAELPENNI

Query:  EARVQQPAGKQVLPNKIQVEDREEGAQSNQLSLLRSRLLAPLGIPFCSASIGGACRARPVD----CGDDF---SFSDIGHLLDTESLRRRMEQIAAVQGL
           V+Q AG               G+ S Q    R  L APLG+   S S+      + V     C   F   +  + G L DT +LR R+E+   ++GL
Subjt:  EARVQQPAGKQVLPNKIQVEDREEGAQSNQLSLLRSRLLAPLGIPFCSASIGGACRARPVD----CGDDF---SFSDIGHLLDTESLRRRMEQIAAVQGL

Query:  GNVSADCANILNNVLDVYLKQLIRSCVNLVGTWPEKPLAHKQQIQGKVINGMLPNIQLHGRHSNGNGEAMPEHRLQCSISLLDFKVAMELNPKQLGEDWP
          ++ D  ++LN+ LDV++++LI  C++L  T        +                        N +   + R    +S+ DF+  MELN + LGEDWP
Subjt:  GNVSADCANILNNVLDVYLKQLIRSCVNLVGTWPEKPLAHKQQIQGKVINGMLPNIQLHGRHSNGNGEAMPEHRLQCSISLLDFKVAMELNPKQLGEDWP

Query:  LLMEKIC
        + MEKIC
Subjt:  LLMEKIC


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCAAGCTCAGCAGATCTTGAGAATTGATTTAGGTGGATTGAAATCTCAGATAGTGAAGAAGCTTGGAACCGATCGGTCAAAATGGTACTTCTTTTACTTAAATAGGTT
CTTGAGTCAAAAGCTGAGTAAGAATGAGTTTGACAAGTTATGCTGTCGTGTGCTTGGGAAGGACAATCTTTGGCTGCATAATCAATTGATACACTCAATTTTGAAGAATG
CTTGTCAAGCTAAGGCTGCACCTCCAATACCTGTAGCAGGCTATCCGAAAACTTCGACACAATCTGCAAAAAATTCCCCTGTTATAGAAGATGGAAATGGGGATGGTGCA
GCTGTTTTTCCTTCTTCAAATCAAAGTATTCCCATTTGGTCCAATGGAGGTTTTTCGACGTCTCCAAGAAAGAGCAGGTCTAGGATTCATGACCGCAAAATCAAGGACAG
GCCGAGTCCACTCGAACCGAATGGGAAGGTTGAATGTATCTCATATCAATCAGCAAGCAAGGAAGATGGCAGCTGCCAAATCGTGATGGATAATGGTGATGCAGCTCCGT
GCGACTTTCGAAGACCAGTGCAGCATTTGCAAGGAGTTGCTGAACTACCTGAAAACAATATTGAGGCCAGAGTTCAGCAACCAGCAGGTAAGCAAGTCCTACCCAACAAA
ATCCAGGTTGAAGATAGGGAAGAAGGGGCACAGTCAAATCAGTTGAGTTTACTTCGGAGTCGATTACTTGCACCGCTTGGGATTCCTTTTTGCTCAGCTAGTATCGGTGG
GGCTTGTAGAGCAAGGCCTGTGGATTGTGGGGACGATTTTAGCTTTAGTGATATCGGTCATTTGTTGGATACTGAATCGTTGAGACGACGCATGGAACAAATTGCTGCAG
TACAGGGCCTAGGCAATGTTTCTGCAGATTGTGCTAATATTTTGAATAACGTGTTGGATGTATATTTGAAGCAGCTAATCAGGTCTTGTGTTAACTTGGTTGGAACATGG
CCTGAGAAACCTCTTGCCCACAAGCAGCAGATTCAGGGGAAGGTTATAAATGGCATGTTGCCGAATATTCAATTACATGGACGACATAGCAATGGAAATGGCGAAGCTAT
GCCCGAGCACAGATTACAGTGCTCGATATCGTTGCTCGACTTCAAGGTAGCTATGGAGCTTAATCCAAAGCAACTTGGGGAAGACTGGCCTTTGCTAATGGAGAAAATTT
GTATGTGCACATCCAAGGAATGA
mRNA sequenceShow/hide mRNA sequence
AGAAAATCTAAATTATAAGAGAGAGAGAGAGAGAGACACAGAGGTTGAAGGACAGAGAAAAGAAAGCAAACGAAAATCCGAAGCTCCGAAGATTCGGACCAAGCCATTCT
CAGCTTACAATTTCTCTTCAATTCCTCAAGTTCTTCAAAGGTTAGAGCTCAATCAATCCCTATGTTCTTGTTCTTCAAATGCTCTCGACCTTAATCGCCCAGATTCAATG
TGCTTTACACAACTCTGCAATTCGTGCTCTGTTCAAGCAATTCACTGATCTCTTTCTTCGACTTCCAAATTTCGCTGTTCCAACTTTTGAGCTTCTGAATTCTGCTGAAA
TAATGATGGGTTTCCATTGGTTCGCCGACCCAATTCGCGTTTTCGTTAGTTTTTGAATTTTTATGCGTGTTTTCAAATTGGGTTTCTTTGTTCGTAATTAGGGTTTTGAT
TTGCGATTGATTTGCAATAGGTTTGATTCGGTTTTGAAATTTTGGCGGGATTTAGTGAGCCATGTAGTGGGTTTAACTTCTAGGGTTGTGGGTTTTGAGAAGTTGCATTT
TGTGATTTGTAGTTTGGTAAAGAAAGGATCAGTTGGACATAGGATTGCTGCAATTCTGTTGAAATCATTTTTGGGGTTTCTTTGAGGAACTTGCTGACTTTGACAATTAT
TATGCATTGATTGATACATGTGGAAGTGAGAATTTTGTGGGGTGTTTTTTTTTCCCTTTTTGGTGAATTTCAATAGTAATTCTGTTCCTCTTACATAGTCTGGATTTGAG
TGATTTTGTAAAGGGGGTTTGAATCTTTCTGCTGAGAAATGCAAGCTCAGCAGATCTTGAGAATTGATTTAGGTGGATTGAAATCTCAGATAGTGAAGAAGCTTGGAACC
GATCGGTCAAAATGGTACTTCTTTTACTTAAATAGGTTCTTGAGTCAAAAGCTGAGTAAGAATGAGTTTGACAAGTTATGCTGTCGTGTGCTTGGGAAGGACAATCTTTG
GCTGCATAATCAATTGATACACTCAATTTTGAAGAATGCTTGTCAAGCTAAGGCTGCACCTCCAATACCTGTAGCAGGCTATCCGAAAACTTCGACACAATCTGCAAAAA
ATTCCCCTGTTATAGAAGATGGAAATGGGGATGGTGCAGCTGTTTTTCCTTCTTCAAATCAAAGTATTCCCATTTGGTCCAATGGAGGTTTTTCGACGTCTCCAAGAAAG
AGCAGGTCTAGGATTCATGACCGCAAAATCAAGGACAGGCCGAGTCCACTCGAACCGAATGGGAAGGTTGAATGTATCTCATATCAATCAGCAAGCAAGGAAGATGGCAG
CTGCCAAATCGTGATGGATAATGGTGATGCAGCTCCGTGCGACTTTCGAAGACCAGTGCAGCATTTGCAAGGAGTTGCTGAACTACCTGAAAACAATATTGAGGCCAGAG
TTCAGCAACCAGCAGGTAAGCAAGTCCTACCCAACAAAATCCAGGTTGAAGATAGGGAAGAAGGGGCACAGTCAAATCAGTTGAGTTTACTTCGGAGTCGATTACTTGCA
CCGCTTGGGATTCCTTTTTGCTCAGCTAGTATCGGTGGGGCTTGTAGAGCAAGGCCTGTGGATTGTGGGGACGATTTTAGCTTTAGTGATATCGGTCATTTGTTGGATAC
TGAATCGTTGAGACGACGCATGGAACAAATTGCTGCAGTACAGGGCCTAGGCAATGTTTCTGCAGATTGTGCTAATATTTTGAATAACGTGTTGGATGTATATTTGAAGC
AGCTAATCAGGTCTTGTGTTAACTTGGTTGGAACATGGCCTGAGAAACCTCTTGCCCACAAGCAGCAGATTCAGGGGAAGGTTATAAATGGCATGTTGCCGAATATTCAA
TTACATGGACGACATAGCAATGGAAATGGCGAAGCTATGCCCGAGCACAGATTACAGTGCTCGATATCGTTGCTCGACTTCAAGGTAGCTATGGAGCTTAATCCAAAGCA
ACTTGGGGAAGACTGGCCTTTGCTAATGGAGAAAATTTGTATGTGCACATCCAAGGAATGAAATGACTGATATATCTGTTTATCCCATTCCAAAGTTTGATTGTCCTGTC
AATGATTCAAAGGGATCGAGAATGGATCTCAAAGACATATTGGCTATGCTCATTTGTGTGAGTAAACAAGGTTGCTCACAAAGATACCGAGCAAGTCAAACATCAAACTC
GGCATTTGTTCTCAGGAATCATTTAACTTCGACTAAACTTGAGCACCGTCAGTCGTAATGCCCTTCGGTGATTGATCTTCGTTTCACTATCTAACTGGGAGTAGCCGGGG
TTGAAGGATCTCAAGCTTTTCGTGTAAATTTAGATCACTGAAGAAATTTACAAAATTTTGCAGAGATACAGTTTGTGTAGCTTGGTGGGAGGTAATTACTGGTGCAAGAG
CTTAGAGTTGCTCTTTGTGTATTTAAGTCTGAGGATTGATTTGAATGCCGTATGAAGAACACATCAACAGACAACAATCTGCAGATAGTATCCTCTGTATTTC
Protein sequenceShow/hide protein sequence
MQAQQILRIDLGGLKSQIVKKLGTDRSKWYFFYLNRFLSQKLSKNEFDKLCCRVLGKDNLWLHNQLIHSILKNACQAKAAPPIPVAGYPKTSTQSAKNSPVIEDGNGDGA
AVFPSSNQSIPIWSNGGFSTSPRKSRSRIHDRKIKDRPSPLEPNGKVECISYQSASKEDGSCQIVMDNGDAAPCDFRRPVQHLQGVAELPENNIEARVQQPAGKQVLPNK
IQVEDREEGAQSNQLSLLRSRLLAPLGIPFCSASIGGACRARPVDCGDDFSFSDIGHLLDTESLRRRMEQIAAVQGLGNVSADCANILNNVLDVYLKQLIRSCVNLVGTW
PEKPLAHKQQIQGKVINGMLPNIQLHGRHSNGNGEAMPEHRLQCSISLLDFKVAMELNPKQLGEDWPLLMEKICMCTSKE