; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Pay0000448 (gene) of Melon (Payzawat) v1 genome

Gene IDPay0000448
OrganismCucumis melo var. inodorus cv. Payzawat (Melon (Payzawat) v1)
DescriptionSAGA-Tad1 domain-containing protein
Genome locationchr03:26552078..26554685
RNA-Seq ExpressionPay0000448
SyntenyPay0000448
Gene Ontology termsGO:0006357 - regulation of transcription by RNA polymerase II (biological process)
GO:0045893 - positive regulation of transcription, DNA-templated (biological process)
GO:0000124 - SAGA complex (cellular component)
GO:0003713 - transcription coactivator activity (molecular function)
InterPro domainsIPR024738 - Transcriptional coactivator Hfi1/Transcriptional adapter 1


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004138880.1 uncharacterized protein LOC101213741 [Cucumis sativus]2.4e-22896.31Show/hide
Query:  MQPQQSLRIDLGELKSQIVKKLGADRSKRYFFYLNRFLSQKLSKNEFDKSCCRVLGRENLWLHNQLIQSILKNACQAKAAPPIPVAGYPKTSTQSAKISP
        MQPQQSLRIDLGELKSQIVKKLGADRSKRYFFYLNRFLSQKLSKNEFDKSCCRVLGRENLWLHNQLIQSILKNACQAK APPIPVAGYPKTSTQSAKISP
Subjt:  MQPQQSLRIDLGELKSQIVKKLGADRSKRYFFYLNRFLSQKLSKNEFDKSCCRVLGRENLWLHNQLIQSILKNACQAKAAPPIPVAGYPKTSTQSAKISP

Query:  LVEDGNEDGGAVFPTSTQNIPGWSNGVSPRKCRSGIRDRKLKDRPSVLGPNGKVECISHLSANMDNGDATLCDYKRPVQHLQGVAELPENNIEVRVPQPS
        LVEDGNEDGGAVFPTSTQNIPGWSNGVSPRKCRSGIRDRKLKDRPS+LGPNGKVECISHLSANMDNGDATLCDYKRPVQ+LQG+AELPENNIEVRVPQPS
Subjt:  LVEDGNEDGGAVFPTSTQNIPGWSNGVSPRKCRSGIRDRKLKDRPSVLGPNGKVECISHLSANMDNGDATLCDYKRPVQHLQGVAELPENNIEVRVPQPS

Query:  GKQVLHNKIQVEATKVEDREEAGQSNHSSLLRSRLLAPLGIPFCSASTGGPHKTRPVDCGGDFSFGDVGHLLDTESLRRRMEQIAAVQGLGSVSADCANI
        GKQ L NKIQVEATKVEDREEAGQSNHSSLLRSRLLAPLGIPFCSAS GG  KTRPVDCGGDFS  DVGHLLDTESLRRRMEQIAAVQGLGSVSADCANI
Subjt:  GKQVLHNKIQVEATKVEDREEAGQSNHSSLLRSRLLAPLGIPFCSASTGGPHKTRPVDCGGDFSFGDVGHLLDTESLRRRMEQIAAVQGLGSVSADCANI

Query:  LNKVLDVYLKQLIRSCVDLVGAWPAYEPEKPLAHKQQIQGKVINGMLPNNQLHGRHSNGNEEVVHEHRLQCSISLLDFKVAMELNPTQLGEDWPLLLEKI
        LNKVLDVYLKQLIRSCVDLVGAWPAYEPEKPL+HKQQ QGKVINGMLPNNQLHGRHSNG+EEVVHEHRLQCSISLLDFKVAMELNPTQLGEDWPLLLEKI
Subjt:  LNKVLDVYLKQLIRSCVDLVGAWPAYEPEKPLAHKQQIQGKVINGMLPNNQLHGRHSNGNEEVVHEHRLQCSISLLDFKVAMELNPTQLGEDWPLLLEKI

Query:  CMRAFGE
        CMR FGE
Subjt:  CMRAFGE

XP_008445087.1 PREDICTED: uncharacterized protein LOC103488231 [Cucumis melo]2.2e-237100Show/hide
Query:  MQPQQSLRIDLGELKSQIVKKLGADRSKRYFFYLNRFLSQKLSKNEFDKSCCRVLGRENLWLHNQLIQSILKNACQAKAAPPIPVAGYPKTSTQSAKISP
        MQPQQSLRIDLGELKSQIVKKLGADRSKRYFFYLNRFLSQKLSKNEFDKSCCRVLGRENLWLHNQLIQSILKNACQAKAAPPIPVAGYPKTSTQSAKISP
Subjt:  MQPQQSLRIDLGELKSQIVKKLGADRSKRYFFYLNRFLSQKLSKNEFDKSCCRVLGRENLWLHNQLIQSILKNACQAKAAPPIPVAGYPKTSTQSAKISP

Query:  LVEDGNEDGGAVFPTSTQNIPGWSNGVSPRKCRSGIRDRKLKDRPSVLGPNGKVECISHLSANMDNGDATLCDYKRPVQHLQGVAELPENNIEVRVPQPS
        LVEDGNEDGGAVFPTSTQNIPGWSNGVSPRKCRSGIRDRKLKDRPSVLGPNGKVECISHLSANMDNGDATLCDYKRPVQHLQGVAELPENNIEVRVPQPS
Subjt:  LVEDGNEDGGAVFPTSTQNIPGWSNGVSPRKCRSGIRDRKLKDRPSVLGPNGKVECISHLSANMDNGDATLCDYKRPVQHLQGVAELPENNIEVRVPQPS

Query:  GKQVLHNKIQVEATKVEDREEAGQSNHSSLLRSRLLAPLGIPFCSASTGGPHKTRPVDCGGDFSFGDVGHLLDTESLRRRMEQIAAVQGLGSVSADCANI
        GKQVLHNKIQVEATKVEDREEAGQSNHSSLLRSRLLAPLGIPFCSASTGGPHKTRPVDCGGDFSFGDVGHLLDTESLRRRMEQIAAVQGLGSVSADCANI
Subjt:  GKQVLHNKIQVEATKVEDREEAGQSNHSSLLRSRLLAPLGIPFCSASTGGPHKTRPVDCGGDFSFGDVGHLLDTESLRRRMEQIAAVQGLGSVSADCANI

Query:  LNKVLDVYLKQLIRSCVDLVGAWPAYEPEKPLAHKQQIQGKVINGMLPNNQLHGRHSNGNEEVVHEHRLQCSISLLDFKVAMELNPTQLGEDWPLLLEKI
        LNKVLDVYLKQLIRSCVDLVGAWPAYEPEKPLAHKQQIQGKVINGMLPNNQLHGRHSNGNEEVVHEHRLQCSISLLDFKVAMELNPTQLGEDWPLLLEKI
Subjt:  LNKVLDVYLKQLIRSCVDLVGAWPAYEPEKPLAHKQQIQGKVINGMLPNNQLHGRHSNGNEEVVHEHRLQCSISLLDFKVAMELNPTQLGEDWPLLLEKI

Query:  CMRAFGE
        CMRAFGE
Subjt:  CMRAFGE

XP_022962598.1 uncharacterized protein LOC111463000 [Cucurbita moschata]3.8e-19785.2Show/hide
Query:  MQPQQSLRIDLGELKSQIVKKLGADRSKRYFFYLNRFLSQKLSKNEFDKSCCRVLGRENLWLHNQLIQSILKNACQAKAAPPIPVAGYPKTSTQSAKISP
        MQPQQSLRIDL ELKSQIVKKLG DRSKRYFFYLNRFLSQKLSKNEFDK CCRVLGRENLWLHNQLIQSILKNACQAKAAPPIP AGYPKTSTQ+AKISP
Subjt:  MQPQQSLRIDLGELKSQIVKKLGADRSKRYFFYLNRFLSQKLSKNEFDKSCCRVLGRENLWLHNQLIQSILKNACQAKAAPPIPVAGYPKTSTQSAKISP

Query:  LVEDGNEDGGAVFPTSTQNIPGWSN---GVSPRKCRSGIRDRKLKDRPSVLGPNGKVECISHLSA---------NMDNGDATLCDYKRPVQHLQGVAELP
        ++EDGNEDGGAVFPTSTQ IP WSN    VSPRKCRSGIRDRKLKDRPS+L PN KVECIS  SA          MDNG+AT CDY+RPVQHLQGV ELP
Subjt:  LVEDGNEDGGAVFPTSTQNIPGWSN---GVSPRKCRSGIRDRKLKDRPSVLGPNGKVECISHLSA---------NMDNGDATLCDYKRPVQHLQGVAELP

Query:  ENNIEVRVPQPSGKQVLHNKIQVEATKVEDREEAGQSNHSSLLRSRLLAPLGIPFCSASTGGPHKTRPVDCGGDFSFGDVGHLLDTESLRRRMEQIAAVQ
        ENNIE RV +PSGKQVL         +VEDREEA QSN SSLLRSRLLAPLGIPFCSAS GG HKTRPVDCGG+FSF D+GHLLDTESLRRRMEQIAAVQ
Subjt:  ENNIEVRVPQPSGKQVLHNKIQVEATKVEDREEAGQSNHSSLLRSRLLAPLGIPFCSASTGGPHKTRPVDCGGDFSFGDVGHLLDTESLRRRMEQIAAVQ

Query:  GLGSVSADCANILNKVLDVYLKQLIRSCVDLVGAWPAYEPEKPLAHKQQIQGKVINGMLPNNQLHGRHSNGNEEVVHEHRLQCSISLLDFKVAMELNPTQ
        GLGSVSADCANILNKVLDVYLKQLIRSCVDLVGAWPA+EPEKPLAH QQIQGKVINGMLPNNQLH  HSNGN EVVHE RL CSISLLDFKVAMELNP Q
Subjt:  GLGSVSADCANILNKVLDVYLKQLIRSCVDLVGAWPAYEPEKPLAHKQQIQGKVINGMLPNNQLHGRHSNGNEEVVHEHRLQCSISLLDFKVAMELNPTQ

Query:  LGEDWPLLLEKICMRAFGE
        LGEDWPLLLEKI MRAF E
Subjt:  LGEDWPLLLEKICMRAFGE

XP_022997521.1 uncharacterized protein LOC111492414 [Cucurbita maxima]9.0e-19985.44Show/hide
Query:  MQPQQSLRIDLGELKSQIVKKLGADRSKRYFFYLNRFLSQKLSKNEFDKSCCRVLGRENLWLHNQLIQSILKNACQAKAAPPIPVAGYPKTSTQSAKISP
        MQPQQSLRIDL ELKSQIVKKLG DRSKRYFFYLNRFLSQKLSKNEFDK CCRVLGRENLWLHNQLIQSILKNACQAKAAPPIP AGYPKTSTQ+AKISP
Subjt:  MQPQQSLRIDLGELKSQIVKKLGADRSKRYFFYLNRFLSQKLSKNEFDKSCCRVLGRENLWLHNQLIQSILKNACQAKAAPPIPVAGYPKTSTQSAKISP

Query:  LVEDGNEDGGAVFPTSTQNIPGWSN---GVSPRKCRSGIRDRKLKDRPSVLGPNGKVECISHLSA---------NMDNGDATLCDYKRPVQHLQGVAELP
        ++EDGNEDGGAVF TSTQ IP WSN    +SPRKCRSGIRDRKLKDRPS+L PN KVECIS  SA          MDNG+AT CDY+RPVQHLQGV ELP
Subjt:  LVEDGNEDGGAVFPTSTQNIPGWSN---GVSPRKCRSGIRDRKLKDRPSVLGPNGKVECISHLSA---------NMDNGDATLCDYKRPVQHLQGVAELP

Query:  ENNIEVRVPQPSGKQVLHNKIQVEATKVEDREEAGQSNHSSLLRSRLLAPLGIPFCSASTGGPHKTRPVDCGGDFSFGDVGHLLDTESLRRRMEQIAAVQ
        ENNIE RV +PSGKQVL  ++QVE TKVEDREEA QSN SSLLRSRLLAPLGIPFCSAS GG HKTRPVDCGG+FSF D+GHLLDTESLRRRMEQIAAVQ
Subjt:  ENNIEVRVPQPSGKQVLHNKIQVEATKVEDREEAGQSNHSSLLRSRLLAPLGIPFCSASTGGPHKTRPVDCGGDFSFGDVGHLLDTESLRRRMEQIAAVQ

Query:  GLGSVSADCANILNKVLDVYLKQLIRSCVDLVGAWPAYEPEKPLAHKQQIQGKVINGMLPNNQLHGRHSNGNEEVVHEHRLQCSISLLDFKVAMELNPTQ
        GLGSVSADCANILNKVLDVYLKQLIRSCVDLVG WP +EPEKPLAH QQIQGKVINGMLPNNQLH  HSNGN EVVHE RL CSISLLDFKVAMELNP Q
Subjt:  GLGSVSADCANILNKVLDVYLKQLIRSCVDLVGAWPAYEPEKPLAHKQQIQGKVINGMLPNNQLHGRHSNGNEEVVHEHRLQCSISLLDFKVAMELNPTQ

Query:  LGEDWPLLLEKICMRAFGE
        LGEDWPLLLEKI MRAF E
Subjt:  LGEDWPLLLEKICMRAFGE

XP_023546134.1 uncharacterized protein LOC111805335 [Cucurbita pepo subsp. pepo]2.8e-20086.16Show/hide
Query:  MQPQQSLRIDLGELKSQIVKKLGADRSKRYFFYLNRFLSQKLSKNEFDKSCCRVLGRENLWLHNQLIQSILKNACQAKAAPPIPVAGYPKTSTQSAKISP
        MQPQQSLRIDL ELKSQIVKKLG DRSKRYFFYLNRFLSQKLSKNEFDK CCRVLGRENLWLHNQLIQSILKNACQAKAAPPIP AGYPKTSTQ+AKISP
Subjt:  MQPQQSLRIDLGELKSQIVKKLGADRSKRYFFYLNRFLSQKLSKNEFDKSCCRVLGRENLWLHNQLIQSILKNACQAKAAPPIPVAGYPKTSTQSAKISP

Query:  LVEDGNEDGGAVFPTSTQNIPGWSN---GVSPRKCRSGIRDRKLKDRPSVLGPNGKVECISHLSA---------NMDNGDATLCDYKRPVQHLQGVAELP
        ++EDGNEDGGAVFPTSTQ IP WSN    VSPRKCRSGIRDRKLKDRPS+L PN KVECIS  SA          +DNG+AT CDY+RPVQHLQGV ELP
Subjt:  LVEDGNEDGGAVFPTSTQNIPGWSN---GVSPRKCRSGIRDRKLKDRPSVLGPNGKVECISHLSA---------NMDNGDATLCDYKRPVQHLQGVAELP

Query:  ENNIEVRVPQPSGKQVLHNKIQVEATKVEDREEAGQSNHSSLLRSRLLAPLGIPFCSASTGGPHKTRPVDCGGDFSFGDVGHLLDTESLRRRMEQIAAVQ
        ENNIE RV +PSGKQVL  ++QVE TKVEDREEA QSN SSLLRSRLLAPLGIPFCSAS GG HKTRPVDCGG+FSF D+GHLLDTESLRRRMEQIAAVQ
Subjt:  ENNIEVRVPQPSGKQVLHNKIQVEATKVEDREEAGQSNHSSLLRSRLLAPLGIPFCSASTGGPHKTRPVDCGGDFSFGDVGHLLDTESLRRRMEQIAAVQ

Query:  GLGSVSADCANILNKVLDVYLKQLIRSCVDLVGAWPAYEPEKPLAHKQQIQGKVINGMLPNNQLHGRHSNGNEEVVHEHRLQCSISLLDFKVAMELNPTQ
        GLGSVSADCANILNKVLDVYLKQLIRSCVDLVGAWPA+EPEKPLAH QQIQGKVINGMLPNNQLH  HSNGN EVVHE RL CSISLLDFKVAMELNP Q
Subjt:  GLGSVSADCANILNKVLDVYLKQLIRSCVDLVGAWPAYEPEKPLAHKQQIQGKVINGMLPNNQLHGRHSNGNEEVVHEHRLQCSISLLDFKVAMELNPTQ

Query:  LGEDWPLLLEKICMRAFGE
        LGEDWPLLLEKI MRAF E
Subjt:  LGEDWPLLLEKICMRAFGE

TrEMBL top hitse value%identityAlignment
A0A0A0LM32 Uncharacterized protein1.2e-22896.31Show/hide
Query:  MQPQQSLRIDLGELKSQIVKKLGADRSKRYFFYLNRFLSQKLSKNEFDKSCCRVLGRENLWLHNQLIQSILKNACQAKAAPPIPVAGYPKTSTQSAKISP
        MQPQQSLRIDLGELKSQIVKKLGADRSKRYFFYLNRFLSQKLSKNEFDKSCCRVLGRENLWLHNQLIQSILKNACQAK APPIPVAGYPKTSTQSAKISP
Subjt:  MQPQQSLRIDLGELKSQIVKKLGADRSKRYFFYLNRFLSQKLSKNEFDKSCCRVLGRENLWLHNQLIQSILKNACQAKAAPPIPVAGYPKTSTQSAKISP

Query:  LVEDGNEDGGAVFPTSTQNIPGWSNGVSPRKCRSGIRDRKLKDRPSVLGPNGKVECISHLSANMDNGDATLCDYKRPVQHLQGVAELPENNIEVRVPQPS
        LVEDGNEDGGAVFPTSTQNIPGWSNGVSPRKCRSGIRDRKLKDRPS+LGPNGKVECISHLSANMDNGDATLCDYKRPVQ+LQG+AELPENNIEVRVPQPS
Subjt:  LVEDGNEDGGAVFPTSTQNIPGWSNGVSPRKCRSGIRDRKLKDRPSVLGPNGKVECISHLSANMDNGDATLCDYKRPVQHLQGVAELPENNIEVRVPQPS

Query:  GKQVLHNKIQVEATKVEDREEAGQSNHSSLLRSRLLAPLGIPFCSASTGGPHKTRPVDCGGDFSFGDVGHLLDTESLRRRMEQIAAVQGLGSVSADCANI
        GKQ L NKIQVEATKVEDREEAGQSNHSSLLRSRLLAPLGIPFCSAS GG  KTRPVDCGGDFS  DVGHLLDTESLRRRMEQIAAVQGLGSVSADCANI
Subjt:  GKQVLHNKIQVEATKVEDREEAGQSNHSSLLRSRLLAPLGIPFCSASTGGPHKTRPVDCGGDFSFGDVGHLLDTESLRRRMEQIAAVQGLGSVSADCANI

Query:  LNKVLDVYLKQLIRSCVDLVGAWPAYEPEKPLAHKQQIQGKVINGMLPNNQLHGRHSNGNEEVVHEHRLQCSISLLDFKVAMELNPTQLGEDWPLLLEKI
        LNKVLDVYLKQLIRSCVDLVGAWPAYEPEKPL+HKQQ QGKVINGMLPNNQLHGRHSNG+EEVVHEHRLQCSISLLDFKVAMELNPTQLGEDWPLLLEKI
Subjt:  LNKVLDVYLKQLIRSCVDLVGAWPAYEPEKPLAHKQQIQGKVINGMLPNNQLHGRHSNGNEEVVHEHRLQCSISLLDFKVAMELNPTQLGEDWPLLLEKI

Query:  CMRAFGE
        CMR FGE
Subjt:  CMRAFGE

A0A1S3BCQ5 uncharacterized protein LOC1034882311.1e-237100Show/hide
Query:  MQPQQSLRIDLGELKSQIVKKLGADRSKRYFFYLNRFLSQKLSKNEFDKSCCRVLGRENLWLHNQLIQSILKNACQAKAAPPIPVAGYPKTSTQSAKISP
        MQPQQSLRIDLGELKSQIVKKLGADRSKRYFFYLNRFLSQKLSKNEFDKSCCRVLGRENLWLHNQLIQSILKNACQAKAAPPIPVAGYPKTSTQSAKISP
Subjt:  MQPQQSLRIDLGELKSQIVKKLGADRSKRYFFYLNRFLSQKLSKNEFDKSCCRVLGRENLWLHNQLIQSILKNACQAKAAPPIPVAGYPKTSTQSAKISP

Query:  LVEDGNEDGGAVFPTSTQNIPGWSNGVSPRKCRSGIRDRKLKDRPSVLGPNGKVECISHLSANMDNGDATLCDYKRPVQHLQGVAELPENNIEVRVPQPS
        LVEDGNEDGGAVFPTSTQNIPGWSNGVSPRKCRSGIRDRKLKDRPSVLGPNGKVECISHLSANMDNGDATLCDYKRPVQHLQGVAELPENNIEVRVPQPS
Subjt:  LVEDGNEDGGAVFPTSTQNIPGWSNGVSPRKCRSGIRDRKLKDRPSVLGPNGKVECISHLSANMDNGDATLCDYKRPVQHLQGVAELPENNIEVRVPQPS

Query:  GKQVLHNKIQVEATKVEDREEAGQSNHSSLLRSRLLAPLGIPFCSASTGGPHKTRPVDCGGDFSFGDVGHLLDTESLRRRMEQIAAVQGLGSVSADCANI
        GKQVLHNKIQVEATKVEDREEAGQSNHSSLLRSRLLAPLGIPFCSASTGGPHKTRPVDCGGDFSFGDVGHLLDTESLRRRMEQIAAVQGLGSVSADCANI
Subjt:  GKQVLHNKIQVEATKVEDREEAGQSNHSSLLRSRLLAPLGIPFCSASTGGPHKTRPVDCGGDFSFGDVGHLLDTESLRRRMEQIAAVQGLGSVSADCANI

Query:  LNKVLDVYLKQLIRSCVDLVGAWPAYEPEKPLAHKQQIQGKVINGMLPNNQLHGRHSNGNEEVVHEHRLQCSISLLDFKVAMELNPTQLGEDWPLLLEKI
        LNKVLDVYLKQLIRSCVDLVGAWPAYEPEKPLAHKQQIQGKVINGMLPNNQLHGRHSNGNEEVVHEHRLQCSISLLDFKVAMELNPTQLGEDWPLLLEKI
Subjt:  LNKVLDVYLKQLIRSCVDLVGAWPAYEPEKPLAHKQQIQGKVINGMLPNNQLHGRHSNGNEEVVHEHRLQCSISLLDFKVAMELNPTQLGEDWPLLLEKI

Query:  CMRAFGE
        CMRAFGE
Subjt:  CMRAFGE

A0A5A7VF96 SAGA-Tad1 domain-containing protein1.1e-237100Show/hide
Query:  MQPQQSLRIDLGELKSQIVKKLGADRSKRYFFYLNRFLSQKLSKNEFDKSCCRVLGRENLWLHNQLIQSILKNACQAKAAPPIPVAGYPKTSTQSAKISP
        MQPQQSLRIDLGELKSQIVKKLGADRSKRYFFYLNRFLSQKLSKNEFDKSCCRVLGRENLWLHNQLIQSILKNACQAKAAPPIPVAGYPKTSTQSAKISP
Subjt:  MQPQQSLRIDLGELKSQIVKKLGADRSKRYFFYLNRFLSQKLSKNEFDKSCCRVLGRENLWLHNQLIQSILKNACQAKAAPPIPVAGYPKTSTQSAKISP

Query:  LVEDGNEDGGAVFPTSTQNIPGWSNGVSPRKCRSGIRDRKLKDRPSVLGPNGKVECISHLSANMDNGDATLCDYKRPVQHLQGVAELPENNIEVRVPQPS
        LVEDGNEDGGAVFPTSTQNIPGWSNGVSPRKCRSGIRDRKLKDRPSVLGPNGKVECISHLSANMDNGDATLCDYKRPVQHLQGVAELPENNIEVRVPQPS
Subjt:  LVEDGNEDGGAVFPTSTQNIPGWSNGVSPRKCRSGIRDRKLKDRPSVLGPNGKVECISHLSANMDNGDATLCDYKRPVQHLQGVAELPENNIEVRVPQPS

Query:  GKQVLHNKIQVEATKVEDREEAGQSNHSSLLRSRLLAPLGIPFCSASTGGPHKTRPVDCGGDFSFGDVGHLLDTESLRRRMEQIAAVQGLGSVSADCANI
        GKQVLHNKIQVEATKVEDREEAGQSNHSSLLRSRLLAPLGIPFCSASTGGPHKTRPVDCGGDFSFGDVGHLLDTESLRRRMEQIAAVQGLGSVSADCANI
Subjt:  GKQVLHNKIQVEATKVEDREEAGQSNHSSLLRSRLLAPLGIPFCSASTGGPHKTRPVDCGGDFSFGDVGHLLDTESLRRRMEQIAAVQGLGSVSADCANI

Query:  LNKVLDVYLKQLIRSCVDLVGAWPAYEPEKPLAHKQQIQGKVINGMLPNNQLHGRHSNGNEEVVHEHRLQCSISLLDFKVAMELNPTQLGEDWPLLLEKI
        LNKVLDVYLKQLIRSCVDLVGAWPAYEPEKPLAHKQQIQGKVINGMLPNNQLHGRHSNGNEEVVHEHRLQCSISLLDFKVAMELNPTQLGEDWPLLLEKI
Subjt:  LNKVLDVYLKQLIRSCVDLVGAWPAYEPEKPLAHKQQIQGKVINGMLPNNQLHGRHSNGNEEVVHEHRLQCSISLLDFKVAMELNPTQLGEDWPLLLEKI

Query:  CMRAFGE
        CMRAFGE
Subjt:  CMRAFGE

A0A6J1HF85 uncharacterized protein LOC1114630001.8e-19785.2Show/hide
Query:  MQPQQSLRIDLGELKSQIVKKLGADRSKRYFFYLNRFLSQKLSKNEFDKSCCRVLGRENLWLHNQLIQSILKNACQAKAAPPIPVAGYPKTSTQSAKISP
        MQPQQSLRIDL ELKSQIVKKLG DRSKRYFFYLNRFLSQKLSKNEFDK CCRVLGRENLWLHNQLIQSILKNACQAKAAPPIP AGYPKTSTQ+AKISP
Subjt:  MQPQQSLRIDLGELKSQIVKKLGADRSKRYFFYLNRFLSQKLSKNEFDKSCCRVLGRENLWLHNQLIQSILKNACQAKAAPPIPVAGYPKTSTQSAKISP

Query:  LVEDGNEDGGAVFPTSTQNIPGWSN---GVSPRKCRSGIRDRKLKDRPSVLGPNGKVECISHLSA---------NMDNGDATLCDYKRPVQHLQGVAELP
        ++EDGNEDGGAVFPTSTQ IP WSN    VSPRKCRSGIRDRKLKDRPS+L PN KVECIS  SA          MDNG+AT CDY+RPVQHLQGV ELP
Subjt:  LVEDGNEDGGAVFPTSTQNIPGWSN---GVSPRKCRSGIRDRKLKDRPSVLGPNGKVECISHLSA---------NMDNGDATLCDYKRPVQHLQGVAELP

Query:  ENNIEVRVPQPSGKQVLHNKIQVEATKVEDREEAGQSNHSSLLRSRLLAPLGIPFCSASTGGPHKTRPVDCGGDFSFGDVGHLLDTESLRRRMEQIAAVQ
        ENNIE RV +PSGKQVL         +VEDREEA QSN SSLLRSRLLAPLGIPFCSAS GG HKTRPVDCGG+FSF D+GHLLDTESLRRRMEQIAAVQ
Subjt:  ENNIEVRVPQPSGKQVLHNKIQVEATKVEDREEAGQSNHSSLLRSRLLAPLGIPFCSASTGGPHKTRPVDCGGDFSFGDVGHLLDTESLRRRMEQIAAVQ

Query:  GLGSVSADCANILNKVLDVYLKQLIRSCVDLVGAWPAYEPEKPLAHKQQIQGKVINGMLPNNQLHGRHSNGNEEVVHEHRLQCSISLLDFKVAMELNPTQ
        GLGSVSADCANILNKVLDVYLKQLIRSCVDLVGAWPA+EPEKPLAH QQIQGKVINGMLPNNQLH  HSNGN EVVHE RL CSISLLDFKVAMELNP Q
Subjt:  GLGSVSADCANILNKVLDVYLKQLIRSCVDLVGAWPAYEPEKPLAHKQQIQGKVINGMLPNNQLHGRHSNGNEEVVHEHRLQCSISLLDFKVAMELNPTQ

Query:  LGEDWPLLLEKICMRAFGE
        LGEDWPLLLEKI MRAF E
Subjt:  LGEDWPLLLEKICMRAFGE

A0A6J1K7Q1 uncharacterized protein LOC1114924144.3e-19985.44Show/hide
Query:  MQPQQSLRIDLGELKSQIVKKLGADRSKRYFFYLNRFLSQKLSKNEFDKSCCRVLGRENLWLHNQLIQSILKNACQAKAAPPIPVAGYPKTSTQSAKISP
        MQPQQSLRIDL ELKSQIVKKLG DRSKRYFFYLNRFLSQKLSKNEFDK CCRVLGRENLWLHNQLIQSILKNACQAKAAPPIP AGYPKTSTQ+AKISP
Subjt:  MQPQQSLRIDLGELKSQIVKKLGADRSKRYFFYLNRFLSQKLSKNEFDKSCCRVLGRENLWLHNQLIQSILKNACQAKAAPPIPVAGYPKTSTQSAKISP

Query:  LVEDGNEDGGAVFPTSTQNIPGWSN---GVSPRKCRSGIRDRKLKDRPSVLGPNGKVECISHLSA---------NMDNGDATLCDYKRPVQHLQGVAELP
        ++EDGNEDGGAVF TSTQ IP WSN    +SPRKCRSGIRDRKLKDRPS+L PN KVECIS  SA          MDNG+AT CDY+RPVQHLQGV ELP
Subjt:  LVEDGNEDGGAVFPTSTQNIPGWSN---GVSPRKCRSGIRDRKLKDRPSVLGPNGKVECISHLSA---------NMDNGDATLCDYKRPVQHLQGVAELP

Query:  ENNIEVRVPQPSGKQVLHNKIQVEATKVEDREEAGQSNHSSLLRSRLLAPLGIPFCSASTGGPHKTRPVDCGGDFSFGDVGHLLDTESLRRRMEQIAAVQ
        ENNIE RV +PSGKQVL  ++QVE TKVEDREEA QSN SSLLRSRLLAPLGIPFCSAS GG HKTRPVDCGG+FSF D+GHLLDTESLRRRMEQIAAVQ
Subjt:  ENNIEVRVPQPSGKQVLHNKIQVEATKVEDREEAGQSNHSSLLRSRLLAPLGIPFCSASTGGPHKTRPVDCGGDFSFGDVGHLLDTESLRRRMEQIAAVQ

Query:  GLGSVSADCANILNKVLDVYLKQLIRSCVDLVGAWPAYEPEKPLAHKQQIQGKVINGMLPNNQLHGRHSNGNEEVVHEHRLQCSISLLDFKVAMELNPTQ
        GLGSVSADCANILNKVLDVYLKQLIRSCVDLVG WP +EPEKPLAH QQIQGKVINGMLPNNQLH  HSNGN EVVHE RL CSISLLDFKVAMELNP Q
Subjt:  GLGSVSADCANILNKVLDVYLKQLIRSCVDLVGAWPAYEPEKPLAHKQQIQGKVINGMLPNNQLHGRHSNGNEEVVHEHRLQCSISLLDFKVAMELNPTQ

Query:  LGEDWPLLLEKICMRAFGE
        LGEDWPLLLEKI MRAF E
Subjt:  LGEDWPLLLEKICMRAFGE

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G14850.1 unknown protein3.4e-3931.68Show/hide
Query:  RIDLGELKSQIVKKLGADRSKRYFFYLNRFLSQKLSKNEFDKSCCRVLGRENLWLHNQLIQSILKNACQAKAAPPIPVAGYPKTSTQSAKISPLVEDGNE
        R++  E+K+ I +K+G  R+  YF  L +FL+ ++SK+EFDK C + +GREN+ LHN+L++SILKNA  AK+ PP     YPK S               
Subjt:  RIDLGELKSQIVKKLGADRSKRYFFYLNRFLSQKLSKNEFDKSCCRVLGRENLWLHNQLIQSILKNACQAKAAPPIPVAGYPKTSTQSAKISPLVEDGNE

Query:  DGGAVFPTSTQNIPGWSNGVSPRKCRSGIRDRKLKDRPSVLGPNGKVECISHLSANMDNGDATLCDYKRPVQHLQGVAELPENNIEVRVPQPSGKQVLHN
         G  VFP             SPRKCRS    RK +DRPS LGP GK + ++         D ++   +R                               
Subjt:  DGGAVFPTSTQNIPGWSNGVSPRKCRSGIRDRKLKDRPSVLGPNGKVECISHLSANMDNGDATLCDYKRPVQHLQGVAELPENNIEVRVPQPSGKQVLHN

Query:  KIQVEATKVEDREEAGQSNHSSLLRSR--LLAPLGIPFCSASTGGPHKTRPVDCGG--DFSFGDVGHLLDTESLRRRMEQIAAVQGLGSVSADCANILNK
         + +E   VED EE  Q   S  ++SR  L APLG+ F   S     K R     G    +    G L D  +LR R+E+   ++G+  +S D AN+LN+
Subjt:  KIQVEATKVEDREEAGQSNHSSLLRSR--LLAPLGIPFCSASTGGPHKTRPVDCGG--DFSFGDVGHLLDTESLRRRMEQIAAVQGLGSVSADCANILNK

Query:  VLDVYLKQLIRSCVDLVGAWPAYEPEKPLAHKQQIQGKVINGMLPNNQLHGRHSNGNEEVVHEHRLQCSISLLDFKVAMELNPTQLGEDWPLLLEKICMR
         L+ Y+++LI  C+ L                                              + R   ++S+LDF  AME+NP  LGE+WP+ LEKIC R
Subjt:  VLDVYLKQLIRSCVDLVGAWPAYEPEKPLAHKQQIQGKVINGMLPNNQLHGRHSNGNEEVVHEHRLQCSISLLDFKVAMELNPTQLGEDWPLLLEKICMR

Query:  AFGE
        A  E
Subjt:  AFGE

AT2G24530.1 unknown protein9.2e-9347.98Show/hide
Query:  MQPQQSLRIDLGELKSQIVKKLGADRSKRYFFYLNRFLSQKLSKNEFDKSCCRVLGRENLWLHNQLIQSILKNACQAKAAPPIPVAGYPKTSTQSAKISP
        MQ  Q  RI L ELK  IVKK G +RS+RYF+YL RFLSQKL+K+EFDK+C R+LGRENL LHNQLI+SIL+NA  AK+ PP   AG+    +  A    
Subjt:  MQPQQSLRIDLGELKSQIVKKLGADRSKRYFFYLNRFLSQKLSKNEFDKSCCRVLGRENLWLHNQLIQSILKNACQAKAAPPIPVAGYPKTSTQSAKISP

Query:  LVEDGNEDGGAVFPTSTQNIPGWSNGV---SPRKCRSGIRDRKLKDRPSVLGPNGKVECISHL---------SANMDNGDATLCDYKRPVQHLQGVAELP
           DG E  G + P  +Q+ P WSNGV   SPRK RSG+++RK +DRPS LG NGKVE + H          S  M+NG     DY+R  ++   VA+  
Subjt:  LVEDGNEDGGAVFPTSTQNIPGWSNGV---SPRKCRSGIRDRKLKDRPSVLGPNGKVECISHL---------SANMDNGDATLCDYKRPVQHLQGVAELP

Query:  ENNIEVRVPQPSGKQVLHNKIQVEATKVEDREEAGQSNHSSLLRSRLLAPLGIPFCSASTGGPHKTRPVDCGGD-FSFGDVGHLLDTESLRRRMEQIAAV
        +        +P  K  + NK ++ A  + D +   +    +L  S L+APLGIPFCSAS GG  +T PV    +  S  D G L D E LR+RME IA  
Subjt:  ENNIEVRVPQPSGKQVLHNKIQVEATKVEDREEAGQSNHSSLLRSRLLAPLGIPFCSASTGGPHKTRPVDCGGD-FSFGDVGHLLDTESLRRRMEQIAAV

Query:  QGLGSVSADCANILNKVLDVYLKQLIRSCVDLVGAWPAY-EPEKPLAHKQQIQGKVINGMLPNNQLHGRHSNGNEEVVHEHRLQCSISLLDFKVAMELNP
        QGL  VS +CA  LN +LDVYLK+LI SC DLVGA     +P K    KQQ Q K++NG+ P N L  +  NG+ ++  +H    S+S+LDF+ AMELNP
Subjt:  QGLGSVSADCANILNKVLDVYLKQLIRSCVDLVGAWPAY-EPEKPLAHKQQIQGKVINGMLPNNQLHGRHSNGNEEVVHEHRLQCSISLLDFKVAMELNP

Query:  TQLGEDWPLLLEKICMRAFGE
         QLGEDWP L E+I +R+F E
Subjt:  TQLGEDWPLLLEKICMRAFGE

AT4G31440.1 unknown protein7.6e-7945.23Show/hide
Query:  MQPQQSLRIDLGELKSQIVKKLGADRSKRYFFYLNRFLSQKLSKNEFDKSCCRVLGRENLWLHNQLIQSILKNACQAKAAPPIPVAGYPKTSTQSAKISP
        MQ  Q  RIDL ELK  IVKK+G +RS RYF+YL RFLSQKL+K+EFDKSC R+LGRENL LHN+LI+SIL+NA  AK+ P +  +G+P  S    K   
Subjt:  MQPQQSLRIDLGELKSQIVKKLGADRSKRYFFYLNRFLSQKLSKNEFDKSCCRVLGRENLWLHNQLIQSILKNACQAKAAPPIPVAGYPKTSTQSAKISP

Query:  LVEDGNEDGGAVFPTSTQNIPGWSNGVSPRKCRSGIRDRKLKDRPSVLGPNGKVECISHLSANMDNGDATLCDYKRPVQHLQGVAELPENNIEVRVPQPS
          EDG E+  ++ P   +N    SNGV  +       DR ++D+P  LG NGKV                   Y RP ++       P+   +     P+
Subjt:  LVEDGNEDGGAVFPTSTQNIPGWSNGVSPRKCRSGIRDRKLKDRPSVLGPNGKVECISHLSANMDNGDATLCDYKRPVQHLQGVAELPENNIEVRVPQPS

Query:  GKQVLHNKIQVEATKVEDREEAGQSNHSSLLRSRLLAPLGIPFCSASTGGPHKTRPVD-CGGDFSFGDVGHLLDTESLRRRMEQIAAVQGLGSVSADCAN
         ++ +  K QV A    D E    +    L    ++APLGIPFCSAS GG  +T PV       S  D G L DTE LR+RME IA  QGLG VSA+C+ 
Subjt:  GKQVLHNKIQVEATKVEDREEAGQSNHSSLLRSRLLAPLGIPFCSASTGGPHKTRPVD-CGGDFSFGDVGHLLDTESLRRRMEQIAAVQGLGSVSADCAN

Query:  ILNKVLDVYLKQLIRSCVDLVGAWPAY-EPEKPLAHKQQIQGKVINGMLPNNQLHGRHSNGNEEVVHEHRLQCSISLLDFKVAMELNPTQLGEDWPLLLE
        +LN +LD+YLK+L++SCVDL GA      P K    KQQ + +++NG+  NN  H + SN   ++  E   Q S+SLLDF+VAMELNP QLGEDWPLL E
Subjt:  ILNKVLDVYLKQLIRSCVDLVGAWPAY-EPEKPLAHKQQIQGKVINGMLPNNQLHGRHSNGNEEVVHEHRLQCSISLLDFKVAMELNPTQLGEDWPLLLE

Query:  KICMRAFGE
        +I +  F E
Subjt:  KICMRAFGE

AT4G33890.1 unknown protein2.0e-3931.4Show/hide
Query:  QQSLRIDLGELKSQIVKKLGADRSKRYFFYLNRFLSQKLSKNEFDKSCCRVLGRENLWLHNQLIQSILKNACQAKAAPPIPVAGY-----PKTSTQSAKI
        Q S R+D  E+K+ I +++G  R++ YF  L RF + K++K+EFDK C + +GR+N+ LHN+LI+SI+KNAC AK+ P I   G         S ++++I
Subjt:  QQSLRIDLGELKSQIVKKLGADRSKRYFFYLNRFLSQKLSKNEFDKSCCRVLGRENLWLHNQLIQSILKNACQAKAAPPIPVAGY-----PKTSTQSAKI

Query:  SPLVEDGNEDGGAVFPTSTQNIPGWSNGVSPRKCRSGIRDRKLKDRPSVLGPNGKVECISHLSANMDNGDATLCDYKRPVQHLQGVAELPENNIEVRVPQ
         PL       G + F  ST            RKCRS    RKL+DRPS LGP GK   ++  +             +  +   Q   EL           
Subjt:  SPLVEDGNEDGGAVFPTSTQNIPGWSNGVSPRKCRSGIRDRKLKDRPSVLGPNGKVECISHLSANMDNGDATLCDYKRPVQHLQGVAELPENNIEVRVPQ

Query:  PSGKQVLHNKIQVEATKVEDREEAGQ-SNHSSLLRSR--LLAPLGIPFCSASTGGPHK--TRPVDCGGDF---SFGDVGHLLDTESLRRRMEQIAAVQGL
              L ++  VE   VE+ EE  Q +  S  ++SR  L APLG+   S   G   K  +    C   F   +  + G L DT +LR R+E+   ++GL
Subjt:  PSGKQVLHNKIQVEATKVEDREEAGQ-SNHSSLLRSR--LLAPLGIPFCSASTGGPHK--TRPVDCGGDF---SFGDVGHLLDTESLRRRMEQIAAVQGL

Query:  GSVSADCANILNKVLDVYLKQLIRSCVDLVGAWPAYEPEKPLAHKQQIQGKVINGMLPNNQLHGRHSNGNEEVVHEHRLQCSISLLDFKVAMELNPTQLG
          ++ D  ++LN  LDV++++LI  C+ L       +  + +                           N +   + R    +S+ DF+  MELN   LG
Subjt:  GSVSADCANILNKVLDVYLKQLIRSCVDLVGAWPAYEPEKPLAHKQQIQGKVINGMLPNNQLHGRHSNGNEEVVHEHRLQCSISLLDFKVAMELNPTQLG

Query:  EDWPLLLEKICMRA
        EDWP+ +EKIC RA
Subjt:  EDWPLLLEKICMRA

AT4G33890.2 unknown protein2.0e-3931.4Show/hide
Query:  QQSLRIDLGELKSQIVKKLGADRSKRYFFYLNRFLSQKLSKNEFDKSCCRVLGRENLWLHNQLIQSILKNACQAKAAPPIPVAGY-----PKTSTQSAKI
        Q S R+D  E+K+ I +++G  R++ YF  L RF + K++K+EFDK C + +GR+N+ LHN+LI+SI+KNAC AK+ P I   G         S ++++I
Subjt:  QQSLRIDLGELKSQIVKKLGADRSKRYFFYLNRFLSQKLSKNEFDKSCCRVLGRENLWLHNQLIQSILKNACQAKAAPPIPVAGY-----PKTSTQSAKI

Query:  SPLVEDGNEDGGAVFPTSTQNIPGWSNGVSPRKCRSGIRDRKLKDRPSVLGPNGKVECISHLSANMDNGDATLCDYKRPVQHLQGVAELPENNIEVRVPQ
         PL       G + F  ST            RKCRS    RKL+DRPS LGP GK   ++  +             +  +   Q   EL           
Subjt:  SPLVEDGNEDGGAVFPTSTQNIPGWSNGVSPRKCRSGIRDRKLKDRPSVLGPNGKVECISHLSANMDNGDATLCDYKRPVQHLQGVAELPENNIEVRVPQ

Query:  PSGKQVLHNKIQVEATKVEDREEAGQ-SNHSSLLRSR--LLAPLGIPFCSASTGGPHK--TRPVDCGGDF---SFGDVGHLLDTESLRRRMEQIAAVQGL
              L ++  VE   VE+ EE  Q +  S  ++SR  L APLG+   S   G   K  +    C   F   +  + G L DT +LR R+E+   ++GL
Subjt:  PSGKQVLHNKIQVEATKVEDREEAGQ-SNHSSLLRSR--LLAPLGIPFCSASTGGPHK--TRPVDCGGDF---SFGDVGHLLDTESLRRRMEQIAAVQGL

Query:  GSVSADCANILNKVLDVYLKQLIRSCVDLVGAWPAYEPEKPLAHKQQIQGKVINGMLPNNQLHGRHSNGNEEVVHEHRLQCSISLLDFKVAMELNPTQLG
          ++ D  ++LN  LDV++++LI  C+ L       +  + +                           N +   + R    +S+ DF+  MELN   LG
Subjt:  GSVSADCANILNKVLDVYLKQLIRSCVDLVGAWPAYEPEKPLAHKQQIQGKVINGMLPNNQLHGRHSNGNEEVVHEHRLQCSISLLDFKVAMELNPTQLG

Query:  EDWPLLLEKICMRA
        EDWP+ +EKIC RA
Subjt:  EDWPLLLEKICMRA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCAACCTCAGCAGAGCTTGAGAATTGACTTGGGTGAATTGAAATCTCAGATAGTGAAGAAACTCGGAGCCGATCGGTCGAAACGGTACTTCTTTTACTTGAACAGGTT
CCTGAGTCAAAAGCTGAGTAAGAATGAGTTTGATAAGTCATGTTGTCGTGTACTTGGGAGGGAGAATCTCTGGCTGCATAATCAATTGATACAGTCAATTTTGAAGAATG
CTTGTCAAGCTAAGGCTGCACCACCAATACCTGTAGCAGGCTATCCGAAAACATCTACGCAATCTGCAAAAATTTCCCCTCTCGTAGAAGATGGGAATGAAGATGGTGGA
GCTGTTTTTCCTACTTCCACGCAAAATATTCCCGGTTGGTCCAATGGAGTTTCACCAAGAAAGTGCAGGTCTGGGATACGCGACCGCAAACTCAAAGACAGGCCGAGTGT
ACTGGGGCCAAATGGGAAGGTTGAATGTATCTCACATCTATCCGCAAACATGGATAATGGTGATGCAACACTATGTGACTATAAGAGACCAGTGCAGCATCTGCAAGGAG
TTGCTGAACTACCTGAAAACAATATTGAGGTTAGAGTTCCACAACCGTCAGGAAAGCAAGTCCTACATAATAAGATCCAAGTTGAAGCAACCAAGGTTGAAGACAGAGAA
GAAGCAGGACAGTCAAATCACTCGAGTTTACTTCGGAGCCGATTACTTGCACCTCTTGGGATTCCTTTTTGCTCAGCTAGTACTGGCGGGCCCCACAAAACAAGGCCGGT
GGATTGTGGGGGTGATTTTAGCTTTGGTGATGTTGGTCATTTGTTGGATACTGAGTCGTTGAGACGACGCATGGAACAAATTGCTGCTGTACAGGGCCTAGGCAGTGTTT
CTGCAGATTGTGCTAATATTTTGAATAAGGTGTTGGATGTATATTTGAAGCAGTTAATTAGGTCTTGTGTTGACTTGGTCGGAGCGTGGCCTGCATATGAGCCTGAGAAA
CCTCTTGCTCATAAGCAGCAGATTCAGGGGAAGGTTATCAATGGCATGTTGCCAAATAATCAATTACACGGACGACATAGCAATGGAAATGAAGAAGTTGTGCATGAGCA
CAGGTTACAGTGTTCAATATCGTTGCTCGACTTCAAGGTAGCAATGGAGCTTAATCCAACACAACTAGGGGAAGACTGGCCTTTGTTACTGGAGAAAATTTGTATGCGTG
CCTTCGGGGAATGA
mRNA sequenceShow/hide mRNA sequence
TTATAAGAGAGAGAGAGAGAGAGAAATAAGAGAGAGAGAGAGAGAGAGACAGAGGTTGAAGGACAGAGAGAAAGCAAACGAAAGTTCGAAGCTCCGAAGATTCGGACCGA
GCCACTTAGCTTCAATTTTCTCTTCAATTTGTCAAGTTTGTCAAAGGTCAGAGCTGAAGCAATCGCTATGTCTTTGTTGTTCAAATGCTATTGAGCTTAGTCGGCGAGAT
TCAATGTGCTTCATATCTCTGCAATTTGTGCTCTGTTCACTCTACTCAACTTCGCTCAACTTACTGATCTCTTTATTTGGCTTCGAAATTTCGTGTTTCGAACTTCGGAG
CTTGCGAATTCTGTTCAAAGAATGATGGGTTTGTATTGGTTCTTCGACCCAATTCGCGTTTTTCGGTAGTTGTTGATTTCTTTGTTTGTTCTCGTGTTCTGAACTTGGGT
TTCGTTATTTTTCGAAGTTAGGGTTTTTATTTGCTAATGTTTTGCAATAGGTTTGATTTGGTTTTTGAAATTTTGGCGGGATTTAGTGAGCCATGTAGTGGGTTTAACTT
CTAGGGTTGTGGTTTTTGAGAAGTTGCGATTTCGAGTTGCATTTTGTGCTTTGTAGTTGGGGAAAGAAAAGGATCAGTTGGAAATATGGTTGCTGCAATTCTGATGAAAT
CCGTTCTGGGATTTCCTTGATCTGTTGTGGTTGAATTTTGGGAACTGGCTCGAATGATGCCGACCTTGAAAAATTAATCTGCATTGATTGATATGTGGGGGTGAAAAATT
TGTGTTTCTTTTTTCTTTTTCTTCGTTTGTTGGCAAGGTTTTGTGTCTGTAATTGAGTGATTTTGTAAAGGGTGTGTGAATCTGTCTGCTGAGAAATGCAACCTCAGCAG
AGCTTGAGAATTGACTTGGGTGAATTGAAATCTCAGATAGTGAAGAAACTCGGAGCCGATCGGTCGAAACGGTACTTCTTTTACTTGAACAGGTTCCTGAGTCAAAAGCT
GAGTAAGAATGAGTTTGATAAGTCATGTTGTCGTGTACTTGGGAGGGAGAATCTCTGGCTGCATAATCAATTGATACAGTCAATTTTGAAGAATGCTTGTCAAGCTAAGG
CTGCACCACCAATACCTGTAGCAGGCTATCCGAAAACATCTACGCAATCTGCAAAAATTTCCCCTCTCGTAGAAGATGGGAATGAAGATGGTGGAGCTGTTTTTCCTACT
TCCACGCAAAATATTCCCGGTTGGTCCAATGGAGTTTCACCAAGAAAGTGCAGGTCTGGGATACGCGACCGCAAACTCAAAGACAGGCCGAGTGTACTGGGGCCAAATGG
GAAGGTTGAATGTATCTCACATCTATCCGCAAACATGGATAATGGTGATGCAACACTATGTGACTATAAGAGACCAGTGCAGCATCTGCAAGGAGTTGCTGAACTACCTG
AAAACAATATTGAGGTTAGAGTTCCACAACCGTCAGGAAAGCAAGTCCTACATAATAAGATCCAAGTTGAAGCAACCAAGGTTGAAGACAGAGAAGAAGCAGGACAGTCA
AATCACTCGAGTTTACTTCGGAGCCGATTACTTGCACCTCTTGGGATTCCTTTTTGCTCAGCTAGTACTGGCGGGCCCCACAAAACAAGGCCGGTGGATTGTGGGGGTGA
TTTTAGCTTTGGTGATGTTGGTCATTTGTTGGATACTGAGTCGTTGAGACGACGCATGGAACAAATTGCTGCTGTACAGGGCCTAGGCAGTGTTTCTGCAGATTGTGCTA
ATATTTTGAATAAGGTGTTGGATGTATATTTGAAGCAGTTAATTAGGTCTTGTGTTGACTTGGTCGGAGCGTGGCCTGCATATGAGCCTGAGAAACCTCTTGCTCATAAG
CAGCAGATTCAGGGGAAGGTTATCAATGGCATGTTGCCAAATAATCAATTACACGGACGACATAGCAATGGAAATGAAGAAGTTGTGCATGAGCACAGGTTACAGTGTTC
AATATCGTTGCTCGACTTCAAGGTAGCAATGGAGCTTAATCCAACACAACTAGGGGAAGACTGGCCTTTGTTACTGGAGAAAATTTGTATGCGTGCCTTCGGGGAATGAA
ACCACTCTGATATTTCTGTTTATCCCATTCCACAGTGTAGCTGACCACCTGTCACAGTTCAAAGGGGTTGAGAATTGATCTCAAAGACTTCTCAGCTCTGCTCATTTGTG
AGTAAACAAAGATGGCCCTAATCTTGATAAAAGCCCTGGCTTGCAAGAATACAAGCAGATCAAACATCAATATCGATGTTCATTCTTGGGAATCATTTAACTGTGACTAA
GCTCGGGCACTGTCAGCCAGAATGCCCATCAGCGATTGATCTACGTTTCATCTTTAGCCATGATTTAAGGGTCTCAAGCTTTTCGTGTAAATTTAGATCACTGAGGAATT
ATACAAAATTTTGCAGAGATACAGTTTGTAGCTTGGTGAGAGGTAACCACTGGTGCAAGAACTTAGTGTTGCTATTTGTGTATTTAAATCTGAGAATTGATTTGAATTTC
ATATGAATAACTCATCAACAGACCACAATCTGCAGATAGTATCCTCTGTATTTCATGCTTACCTTTTTTTTGTTTTTT
Protein sequenceShow/hide protein sequence
MQPQQSLRIDLGELKSQIVKKLGADRSKRYFFYLNRFLSQKLSKNEFDKSCCRVLGRENLWLHNQLIQSILKNACQAKAAPPIPVAGYPKTSTQSAKISPLVEDGNEDGG
AVFPTSTQNIPGWSNGVSPRKCRSGIRDRKLKDRPSVLGPNGKVECISHLSANMDNGDATLCDYKRPVQHLQGVAELPENNIEVRVPQPSGKQVLHNKIQVEATKVEDRE
EAGQSNHSSLLRSRLLAPLGIPFCSASTGGPHKTRPVDCGGDFSFGDVGHLLDTESLRRRMEQIAAVQGLGSVSADCANILNKVLDVYLKQLIRSCVDLVGAWPAYEPEK
PLAHKQQIQGKVINGMLPNNQLHGRHSNGNEEVVHEHRLQCSISLLDFKVAMELNPTQLGEDWPLLLEKICMRAFGE