; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

IVF0006674 (gene) of Melon (IVF77) v1 genome

Gene IDIVF0006674
OrganismCucumis melo ssp. agrestis cv. IVF77 (Melon (IVF77) v1)
DescriptionSAGA-Tad1 domain-containing protein
Genome locationchr03:23751775..23752996
RNA-Seq ExpressionIVF0006674
SyntenyIVF0006674
Gene Ontology termsGO:0006357 - regulation of transcription by RNA polymerase II (biological process)
GO:0045893 - positive regulation of transcription, DNA-templated (biological process)
GO:0000124 - SAGA complex (cellular component)
GO:0003713 - transcription coactivator activity (molecular function)
InterPro domainsIPR024738 - Transcriptional coactivator Hfi1/Transcriptional adapter 1


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004138880.1 uncharacterized protein LOC101213741 [Cucumis sativus]2.10e-26791.15Show/hide
Query:  MQPQQSLRIDLGELKSQIVKKLGADR----------FLSQKLSKNEFDKSCCRVLGRENLWLHNQLIQSILKNA-----------SGYPKTSTQSAKISP
        MQPQQSLRIDLGELKSQIVKKLGADR          FLSQKLSKNEFDKSCCRVLGRENLWLHNQLIQSILKNA           +GYPKTSTQSAKISP
Subjt:  MQPQQSLRIDLGELKSQIVKKLGADR----------FLSQKLSKNEFDKSCCRVLGRENLWLHNQLIQSILKNA-----------SGYPKTSTQSAKISP

Query:  LVEDGNEDGGAVFPTSTQNIPGWSNGVSPRKCRSGIRDRKLKDRPSVLGPNGKVECISHLSANMDNGDATLCDYKRPVQHLQGVAELPENNIEVRVPQPS
        LVEDGNEDGGAVFPTSTQNIPGWSNGVSPRKCRSGIRDRKLKDRPS+LGPNGKVECISHLSANMDNGDATLCDYKRPVQ+LQG+AELPENNIEVRVPQPS
Subjt:  LVEDGNEDGGAVFPTSTQNIPGWSNGVSPRKCRSGIRDRKLKDRPSVLGPNGKVECISHLSANMDNGDATLCDYKRPVQHLQGVAELPENNIEVRVPQPS

Query:  GKQVLHNKIQVEATKVEDREEAGQSNHSSLLRSRLLAPLGIPFCSASTGGPHKTRPVDCGGDFSFGDVGHLLDTESLRRRMEQIAAVQGLGSVSADCANI
        GKQ L NKIQVEATKVEDREEAGQSNHSSLLRSRLLAPLGIPFCSAS GG  KTRPVDCGGDFS  DVGHLLDTESLRRRMEQIAAVQGLGSVSADCANI
Subjt:  GKQVLHNKIQVEATKVEDREEAGQSNHSSLLRSRLLAPLGIPFCSASTGGPHKTRPVDCGGDFSFGDVGHLLDTESLRRRMEQIAAVQGLGSVSADCANI

Query:  LNKVLDVYLKQLIRSCVDLVGAWPAYEPEKPLAHKQQIQGKVINGMLPNNQLHGRHSNGNEEVVHEHRLQCSISLLDFKVAMELNPTQLGEDWPLLLEKI
        LNKVLDVYLKQLIRSCVDLVGAWPAYEPEKPL+HKQQ QGKVINGMLPNNQLHGRHSNG+EEVVHEHRLQCSISLLDFKVAMELNPTQLGEDWPLLLEKI
Subjt:  LNKVLDVYLKQLIRSCVDLVGAWPAYEPEKPLAHKQQIQGKVINGMLPNNQLHGRHSNGNEEVVHEHRLQCSISLLDFKVAMELNPTQLGEDWPLLLEKI

Query:  CMRAFGE
        CMR FGE
Subjt:  CMRAFGE

XP_008445087.1 PREDICTED: uncharacterized protein LOC103488231 [Cucumis melo]1.11e-27894.59Show/hide
Query:  MQPQQSLRIDLGELKSQIVKKLGADR----------FLSQKLSKNEFDKSCCRVLGRENLWLHNQLIQSILKNA-----------SGYPKTSTQSAKISP
        MQPQQSLRIDLGELKSQIVKKLGADR          FLSQKLSKNEFDKSCCRVLGRENLWLHNQLIQSILKNA           +GYPKTSTQSAKISP
Subjt:  MQPQQSLRIDLGELKSQIVKKLGADR----------FLSQKLSKNEFDKSCCRVLGRENLWLHNQLIQSILKNA-----------SGYPKTSTQSAKISP

Query:  LVEDGNEDGGAVFPTSTQNIPGWSNGVSPRKCRSGIRDRKLKDRPSVLGPNGKVECISHLSANMDNGDATLCDYKRPVQHLQGVAELPENNIEVRVPQPS
        LVEDGNEDGGAVFPTSTQNIPGWSNGVSPRKCRSGIRDRKLKDRPSVLGPNGKVECISHLSANMDNGDATLCDYKRPVQHLQGVAELPENNIEVRVPQPS
Subjt:  LVEDGNEDGGAVFPTSTQNIPGWSNGVSPRKCRSGIRDRKLKDRPSVLGPNGKVECISHLSANMDNGDATLCDYKRPVQHLQGVAELPENNIEVRVPQPS

Query:  GKQVLHNKIQVEATKVEDREEAGQSNHSSLLRSRLLAPLGIPFCSASTGGPHKTRPVDCGGDFSFGDVGHLLDTESLRRRMEQIAAVQGLGSVSADCANI
        GKQVLHNKIQVEATKVEDREEAGQSNHSSLLRSRLLAPLGIPFCSASTGGPHKTRPVDCGGDFSFGDVGHLLDTESLRRRMEQIAAVQGLGSVSADCANI
Subjt:  GKQVLHNKIQVEATKVEDREEAGQSNHSSLLRSRLLAPLGIPFCSASTGGPHKTRPVDCGGDFSFGDVGHLLDTESLRRRMEQIAAVQGLGSVSADCANI

Query:  LNKVLDVYLKQLIRSCVDLVGAWPAYEPEKPLAHKQQIQGKVINGMLPNNQLHGRHSNGNEEVVHEHRLQCSISLLDFKVAMELNPTQLGEDWPLLLEKI
        LNKVLDVYLKQLIRSCVDLVGAWPAYEPEKPLAHKQQIQGKVINGMLPNNQLHGRHSNGNEEVVHEHRLQCSISLLDFKVAMELNPTQLGEDWPLLLEKI
Subjt:  LNKVLDVYLKQLIRSCVDLVGAWPAYEPEKPLAHKQQIQGKVINGMLPNNQLHGRHSNGNEEVVHEHRLQCSISLLDFKVAMELNPTQLGEDWPLLLEKI

Query:  CMRAFGE
        CMRAFGE
Subjt:  CMRAFGE

XP_022962598.1 uncharacterized protein LOC111463000 [Cucurbita moschata]2.38e-22680.19Show/hide
Query:  MQPQQSLRIDLGELKSQIVKKLGADR----------FLSQKLSKNEFDKSCCRVLGRENLWLHNQLIQSILKNA-----------SGYPKTSTQSAKISP
        MQPQQSLRIDL ELKSQIVKKLG DR          FLSQKLSKNEFDK CCRVLGRENLWLHNQLIQSILKNA           +GYPKTSTQ+AKISP
Subjt:  MQPQQSLRIDLGELKSQIVKKLGADR----------FLSQKLSKNEFDKSCCRVLGRENLWLHNQLIQSILKNA-----------SGYPKTSTQSAKISP

Query:  LVEDGNEDGGAVFPTSTQNIPGWSNG---VSPRKCRSGIRDRKLKDRPSVLGPNGKVECISHLSA---------NMDNGDATLCDYKRPVQHLQGVAELP
        ++EDGNEDGGAVFPTSTQ IP WSN    VSPRKCRSGIRDRKLKDRPS+L PN KVECIS  SA          MDNG+AT CDY+RPVQHLQGV ELP
Subjt:  LVEDGNEDGGAVFPTSTQNIPGWSNG---VSPRKCRSGIRDRKLKDRPSVLGPNGKVECISHLSA---------NMDNGDATLCDYKRPVQHLQGVAELP

Query:  ENNIEVRVPQPSGKQVLHNKIQVEATKVEDREEAGQSNHSSLLRSRLLAPLGIPFCSASTGGPHKTRPVDCGGDFSFGDVGHLLDTESLRRRMEQIAAVQ
        ENNIE RV +PSGKQVL         +VEDREEA QSN SSLLRSRLLAPLGIPFCSAS GG HKTRPVDCGG+FSF D+GHLLDTESLRRRMEQIAAVQ
Subjt:  ENNIEVRVPQPSGKQVLHNKIQVEATKVEDREEAGQSNHSSLLRSRLLAPLGIPFCSASTGGPHKTRPVDCGGDFSFGDVGHLLDTESLRRRMEQIAAVQ

Query:  GLGSVSADCANILNKVLDVYLKQLIRSCVDLVGAWPAYEPEKPLAHKQQIQGKVINGMLPNNQLHGRHSNGNEEVVHEHRLQCSISLLDFKVAMELNPTQ
        GLGSVSADCANILNKVLDVYLKQLIRSCVDLVGAWPA+EPEKPLAH QQIQGKVINGMLPNNQLH  HSNGN EVVHE RL CSISLLDFKVAMELNP Q
Subjt:  GLGSVSADCANILNKVLDVYLKQLIRSCVDLVGAWPAYEPEKPLAHKQQIQGKVINGMLPNNQLHGRHSNGNEEVVHEHRLQCSISLLDFKVAMELNPTQ

Query:  LGEDWPLLLEKICMRAFGE
        LGEDWPLLLEKI MRAF E
Subjt:  LGEDWPLLLEKICMRAFGE

XP_022997521.1 uncharacterized protein LOC111492414 [Cucurbita maxima]3.01e-22880.43Show/hide
Query:  MQPQQSLRIDLGELKSQIVKKLGADR----------FLSQKLSKNEFDKSCCRVLGRENLWLHNQLIQSILKNA-----------SGYPKTSTQSAKISP
        MQPQQSLRIDL ELKSQIVKKLG DR          FLSQKLSKNEFDK CCRVLGRENLWLHNQLIQSILKNA           +GYPKTSTQ+AKISP
Subjt:  MQPQQSLRIDLGELKSQIVKKLGADR----------FLSQKLSKNEFDKSCCRVLGRENLWLHNQLIQSILKNA-----------SGYPKTSTQSAKISP

Query:  LVEDGNEDGGAVFPTSTQNIPGWSN---GVSPRKCRSGIRDRKLKDRPSVLGPNGKVECISHLSA---------NMDNGDATLCDYKRPVQHLQGVAELP
        ++EDGNEDGGAVF TSTQ IP WSN    +SPRKCRSGIRDRKLKDRPS+L PN KVECIS  SA          MDNG+AT CDY+RPVQHLQGV ELP
Subjt:  LVEDGNEDGGAVFPTSTQNIPGWSN---GVSPRKCRSGIRDRKLKDRPSVLGPNGKVECISHLSA---------NMDNGDATLCDYKRPVQHLQGVAELP

Query:  ENNIEVRVPQPSGKQVLHNKIQVEATKVEDREEAGQSNHSSLLRSRLLAPLGIPFCSASTGGPHKTRPVDCGGDFSFGDVGHLLDTESLRRRMEQIAAVQ
        ENNIE RV +PSGKQVL   +QVE TKVEDREEA QSN SSLLRSRLLAPLGIPFCSAS GG HKTRPVDCGG+FSF D+GHLLDTESLRRRMEQIAAVQ
Subjt:  ENNIEVRVPQPSGKQVLHNKIQVEATKVEDREEAGQSNHSSLLRSRLLAPLGIPFCSASTGGPHKTRPVDCGGDFSFGDVGHLLDTESLRRRMEQIAAVQ

Query:  GLGSVSADCANILNKVLDVYLKQLIRSCVDLVGAWPAYEPEKPLAHKQQIQGKVINGMLPNNQLHGRHSNGNEEVVHEHRLQCSISLLDFKVAMELNPTQ
        GLGSVSADCANILNKVLDVYLKQLIRSCVDLVG WP +EPEKPLAH QQIQGKVINGMLPNNQLH  HSNGN EVVHE RL CSISLLDFKVAMELNP Q
Subjt:  GLGSVSADCANILNKVLDVYLKQLIRSCVDLVGAWPAYEPEKPLAHKQQIQGKVINGMLPNNQLHGRHSNGNEEVVHEHRLQCSISLLDFKVAMELNPTQ

Query:  LGEDWPLLLEKICMRAFGE
        LGEDWPLLLEKI MRAF E
Subjt:  LGEDWPLLLEKICMRAFGE

XP_023546134.1 uncharacterized protein LOC111805335 [Cucurbita pepo subsp. pepo]3.15e-23081.15Show/hide
Query:  MQPQQSLRIDLGELKSQIVKKLGADR----------FLSQKLSKNEFDKSCCRVLGRENLWLHNQLIQSILKNA-----------SGYPKTSTQSAKISP
        MQPQQSLRIDL ELKSQIVKKLG DR          FLSQKLSKNEFDK CCRVLGRENLWLHNQLIQSILKNA           +GYPKTSTQ+AKISP
Subjt:  MQPQQSLRIDLGELKSQIVKKLGADR----------FLSQKLSKNEFDKSCCRVLGRENLWLHNQLIQSILKNA-----------SGYPKTSTQSAKISP

Query:  LVEDGNEDGGAVFPTSTQNIPGWSNG---VSPRKCRSGIRDRKLKDRPSVLGPNGKVECISHLSA---------NMDNGDATLCDYKRPVQHLQGVAELP
        ++EDGNEDGGAVFPTSTQ IP WSN    VSPRKCRSGIRDRKLKDRPS+L PN KVECIS  SA          +DNG+AT CDY+RPVQHLQGV ELP
Subjt:  LVEDGNEDGGAVFPTSTQNIPGWSNG---VSPRKCRSGIRDRKLKDRPSVLGPNGKVECISHLSA---------NMDNGDATLCDYKRPVQHLQGVAELP

Query:  ENNIEVRVPQPSGKQVLHNKIQVEATKVEDREEAGQSNHSSLLRSRLLAPLGIPFCSASTGGPHKTRPVDCGGDFSFGDVGHLLDTESLRRRMEQIAAVQ
        ENNIE RV +PSGKQVL   +QVE TKVEDREEA QSN SSLLRSRLLAPLGIPFCSAS GG HKTRPVDCGG+FSF D+GHLLDTESLRRRMEQIAAVQ
Subjt:  ENNIEVRVPQPSGKQVLHNKIQVEATKVEDREEAGQSNHSSLLRSRLLAPLGIPFCSASTGGPHKTRPVDCGGDFSFGDVGHLLDTESLRRRMEQIAAVQ

Query:  GLGSVSADCANILNKVLDVYLKQLIRSCVDLVGAWPAYEPEKPLAHKQQIQGKVINGMLPNNQLHGRHSNGNEEVVHEHRLQCSISLLDFKVAMELNPTQ
        GLGSVSADCANILNKVLDVYLKQLIRSCVDLVGAWPA+EPEKPLAH QQIQGKVINGMLPNNQLH  HSNGN EVVHE RL CSISLLDFKVAMELNP Q
Subjt:  GLGSVSADCANILNKVLDVYLKQLIRSCVDLVGAWPAYEPEKPLAHKQQIQGKVINGMLPNNQLHGRHSNGNEEVVHEHRLQCSISLLDFKVAMELNPTQ

Query:  LGEDWPLLLEKICMRAFGE
        LGEDWPLLLEKI MRAF E
Subjt:  LGEDWPLLLEKICMRAFGE

TrEMBL top hitse value%identityAlignment
A0A0A0LM32 Uncharacterized protein1.0e-21091.15Show/hide
Query:  MQPQQSLRIDLGELKSQIVKKLGAD----------RFLSQKLSKNEFDKSCCRVLGRENLWLHNQLIQSILKNA-----------SGYPKTSTQSAKISP
        MQPQQSLRIDLGELKSQIVKKLGAD          RFLSQKLSKNEFDKSCCRVLGRENLWLHNQLIQSILKNA           +GYPKTSTQSAKISP
Subjt:  MQPQQSLRIDLGELKSQIVKKLGAD----------RFLSQKLSKNEFDKSCCRVLGRENLWLHNQLIQSILKNA-----------SGYPKTSTQSAKISP

Query:  LVEDGNEDGGAVFPTSTQNIPGWSNGVSPRKCRSGIRDRKLKDRPSVLGPNGKVECISHLSANMDNGDATLCDYKRPVQHLQGVAELPENNIEVRVPQPS
        LVEDGNEDGGAVFPTSTQNIPGWSNGVSPRKCRSGIRDRKLKDRPS+LGPNGKVECISHLSANMDNGDATLCDYKRPVQ+LQG+AELPENNIEVRVPQPS
Subjt:  LVEDGNEDGGAVFPTSTQNIPGWSNGVSPRKCRSGIRDRKLKDRPSVLGPNGKVECISHLSANMDNGDATLCDYKRPVQHLQGVAELPENNIEVRVPQPS

Query:  GKQVLHNKIQVEATKVEDREEAGQSNHSSLLRSRLLAPLGIPFCSASTGGPHKTRPVDCGGDFSFGDVGHLLDTESLRRRMEQIAAVQGLGSVSADCANI
        GKQ L NKIQVEATKVEDREEAGQSNHSSLLRSRLLAPLGIPFCSAS GG  KTRPVDCGGDFS  DVGHLLDTESLRRRMEQIAAVQGLGSVSADCANI
Subjt:  GKQVLHNKIQVEATKVEDREEAGQSNHSSLLRSRLLAPLGIPFCSASTGGPHKTRPVDCGGDFSFGDVGHLLDTESLRRRMEQIAAVQGLGSVSADCANI

Query:  LNKVLDVYLKQLIRSCVDLVGAWPAYEPEKPLAHKQQIQGKVINGMLPNNQLHGRHSNGNEEVVHEHRLQCSISLLDFKVAMELNPTQLGEDWPLLLEKI
        LNKVLDVYLKQLIRSCVDLVGAWPAYEPEKPL+HKQQ QGKVINGMLPNNQLHGRHSNG+EEVVHEHRLQCSISLLDFKVAMELNPTQLGEDWPLLLEKI
Subjt:  LNKVLDVYLKQLIRSCVDLVGAWPAYEPEKPLAHKQQIQGKVINGMLPNNQLHGRHSNGNEEVVHEHRLQCSISLLDFKVAMELNPTQLGEDWPLLLEKI

Query:  CMRAFGE
        CMR FGE
Subjt:  CMRAFGE

A0A1S3BCQ5 uncharacterized protein LOC1034882312.7e-21994.59Show/hide
Query:  MQPQQSLRIDLGELKSQIVKKLGAD----------RFLSQKLSKNEFDKSCCRVLGRENLWLHNQLIQSILKNA-----------SGYPKTSTQSAKISP
        MQPQQSLRIDLGELKSQIVKKLGAD          RFLSQKLSKNEFDKSCCRVLGRENLWLHNQLIQSILKNA           +GYPKTSTQSAKISP
Subjt:  MQPQQSLRIDLGELKSQIVKKLGAD----------RFLSQKLSKNEFDKSCCRVLGRENLWLHNQLIQSILKNA-----------SGYPKTSTQSAKISP

Query:  LVEDGNEDGGAVFPTSTQNIPGWSNGVSPRKCRSGIRDRKLKDRPSVLGPNGKVECISHLSANMDNGDATLCDYKRPVQHLQGVAELPENNIEVRVPQPS
        LVEDGNEDGGAVFPTSTQNIPGWSNGVSPRKCRSGIRDRKLKDRPSVLGPNGKVECISHLSANMDNGDATLCDYKRPVQHLQGVAELPENNIEVRVPQPS
Subjt:  LVEDGNEDGGAVFPTSTQNIPGWSNGVSPRKCRSGIRDRKLKDRPSVLGPNGKVECISHLSANMDNGDATLCDYKRPVQHLQGVAELPENNIEVRVPQPS

Query:  GKQVLHNKIQVEATKVEDREEAGQSNHSSLLRSRLLAPLGIPFCSASTGGPHKTRPVDCGGDFSFGDVGHLLDTESLRRRMEQIAAVQGLGSVSADCANI
        GKQVLHNKIQVEATKVEDREEAGQSNHSSLLRSRLLAPLGIPFCSASTGGPHKTRPVDCGGDFSFGDVGHLLDTESLRRRMEQIAAVQGLGSVSADCANI
Subjt:  GKQVLHNKIQVEATKVEDREEAGQSNHSSLLRSRLLAPLGIPFCSASTGGPHKTRPVDCGGDFSFGDVGHLLDTESLRRRMEQIAAVQGLGSVSADCANI

Query:  LNKVLDVYLKQLIRSCVDLVGAWPAYEPEKPLAHKQQIQGKVINGMLPNNQLHGRHSNGNEEVVHEHRLQCSISLLDFKVAMELNPTQLGEDWPLLLEKI
        LNKVLDVYLKQLIRSCVDLVGAWPAYEPEKPLAHKQQIQGKVINGMLPNNQLHGRHSNGNEEVVHEHRLQCSISLLDFKVAMELNPTQLGEDWPLLLEKI
Subjt:  LNKVLDVYLKQLIRSCVDLVGAWPAYEPEKPLAHKQQIQGKVINGMLPNNQLHGRHSNGNEEVVHEHRLQCSISLLDFKVAMELNPTQLGEDWPLLLEKI

Query:  CMRAFGE
        CMRAFGE
Subjt:  CMRAFGE

A0A5A7VF96 SAGA-Tad1 domain-containing protein2.7e-21994.59Show/hide
Query:  MQPQQSLRIDLGELKSQIVKKLGAD----------RFLSQKLSKNEFDKSCCRVLGRENLWLHNQLIQSILKNA-----------SGYPKTSTQSAKISP
        MQPQQSLRIDLGELKSQIVKKLGAD          RFLSQKLSKNEFDKSCCRVLGRENLWLHNQLIQSILKNA           +GYPKTSTQSAKISP
Subjt:  MQPQQSLRIDLGELKSQIVKKLGAD----------RFLSQKLSKNEFDKSCCRVLGRENLWLHNQLIQSILKNA-----------SGYPKTSTQSAKISP

Query:  LVEDGNEDGGAVFPTSTQNIPGWSNGVSPRKCRSGIRDRKLKDRPSVLGPNGKVECISHLSANMDNGDATLCDYKRPVQHLQGVAELPENNIEVRVPQPS
        LVEDGNEDGGAVFPTSTQNIPGWSNGVSPRKCRSGIRDRKLKDRPSVLGPNGKVECISHLSANMDNGDATLCDYKRPVQHLQGVAELPENNIEVRVPQPS
Subjt:  LVEDGNEDGGAVFPTSTQNIPGWSNGVSPRKCRSGIRDRKLKDRPSVLGPNGKVECISHLSANMDNGDATLCDYKRPVQHLQGVAELPENNIEVRVPQPS

Query:  GKQVLHNKIQVEATKVEDREEAGQSNHSSLLRSRLLAPLGIPFCSASTGGPHKTRPVDCGGDFSFGDVGHLLDTESLRRRMEQIAAVQGLGSVSADCANI
        GKQVLHNKIQVEATKVEDREEAGQSNHSSLLRSRLLAPLGIPFCSASTGGPHKTRPVDCGGDFSFGDVGHLLDTESLRRRMEQIAAVQGLGSVSADCANI
Subjt:  GKQVLHNKIQVEATKVEDREEAGQSNHSSLLRSRLLAPLGIPFCSASTGGPHKTRPVDCGGDFSFGDVGHLLDTESLRRRMEQIAAVQGLGSVSADCANI

Query:  LNKVLDVYLKQLIRSCVDLVGAWPAYEPEKPLAHKQQIQGKVINGMLPNNQLHGRHSNGNEEVVHEHRLQCSISLLDFKVAMELNPTQLGEDWPLLLEKI
        LNKVLDVYLKQLIRSCVDLVGAWPAYEPEKPLAHKQQIQGKVINGMLPNNQLHGRHSNGNEEVVHEHRLQCSISLLDFKVAMELNPTQLGEDWPLLLEKI
Subjt:  LNKVLDVYLKQLIRSCVDLVGAWPAYEPEKPLAHKQQIQGKVINGMLPNNQLHGRHSNGNEEVVHEHRLQCSISLLDFKVAMELNPTQLGEDWPLLLEKI

Query:  CMRAFGE
        CMRAFGE
Subjt:  CMRAFGE

A0A6J1HF85 uncharacterized protein LOC1114630002.1e-17980.19Show/hide
Query:  MQPQQSLRIDLGELKSQIVKKLGAD----------RFLSQKLSKNEFDKSCCRVLGRENLWLHNQLIQSILKN-----------ASGYPKTSTQSAKISP
        MQPQQSLRIDL ELKSQIVKKLG D          RFLSQKLSKNEFDK CCRVLGRENLWLHNQLIQSILKN           A+GYPKTSTQ+AKISP
Subjt:  MQPQQSLRIDLGELKSQIVKKLGAD----------RFLSQKLSKNEFDKSCCRVLGRENLWLHNQLIQSILKN-----------ASGYPKTSTQSAKISP

Query:  LVEDGNEDGGAVFPTSTQNIPGWSN---GVSPRKCRSGIRDRKLKDRPSVLGPNGKVECISHLSA---------NMDNGDATLCDYKRPVQHLQGVAELP
        ++EDGNEDGGAVFPTSTQ IP WSN    VSPRKCRSGIRDRKLKDRPS+L PN KVECIS  SA          MDNG+AT CDY+RPVQHLQGV ELP
Subjt:  LVEDGNEDGGAVFPTSTQNIPGWSN---GVSPRKCRSGIRDRKLKDRPSVLGPNGKVECISHLSA---------NMDNGDATLCDYKRPVQHLQGVAELP

Query:  ENNIEVRVPQPSGKQVLHNKIQVEATKVEDREEAGQSNHSSLLRSRLLAPLGIPFCSASTGGPHKTRPVDCGGDFSFGDVGHLLDTESLRRRMEQIAAVQ
        ENNIE RV +PSGKQVL         +VEDREEA QSN SSLLRSRLLAPLGIPFCSAS GG HKTRPVDCGG+FSF D+GHLLDTESLRRRMEQIAAVQ
Subjt:  ENNIEVRVPQPSGKQVLHNKIQVEATKVEDREEAGQSNHSSLLRSRLLAPLGIPFCSASTGGPHKTRPVDCGGDFSFGDVGHLLDTESLRRRMEQIAAVQ

Query:  GLGSVSADCANILNKVLDVYLKQLIRSCVDLVGAWPAYEPEKPLAHKQQIQGKVINGMLPNNQLHGRHSNGNEEVVHEHRLQCSISLLDFKVAMELNPTQ
        GLGSVSADCANILNKVLDVYLKQLIRSCVDLVGAWPA+EPEKPLAH QQIQGKVINGMLPNNQLH  HSNGN EVVHE RL CSISLLDFKVAMELNP Q
Subjt:  GLGSVSADCANILNKVLDVYLKQLIRSCVDLVGAWPAYEPEKPLAHKQQIQGKVINGMLPNNQLHGRHSNGNEEVVHEHRLQCSISLLDFKVAMELNPTQ

Query:  LGEDWPLLLEKICMRAFGE
        LGEDWPLLLEKI MRAF E
Subjt:  LGEDWPLLLEKICMRAFGE

A0A6J1K7Q1 uncharacterized protein LOC1114924145.1e-18180.43Show/hide
Query:  MQPQQSLRIDLGELKSQIVKKLGAD----------RFLSQKLSKNEFDKSCCRVLGRENLWLHNQLIQSILKN-----------ASGYPKTSTQSAKISP
        MQPQQSLRIDL ELKSQIVKKLG D          RFLSQKLSKNEFDK CCRVLGRENLWLHNQLIQSILKN           A+GYPKTSTQ+AKISP
Subjt:  MQPQQSLRIDLGELKSQIVKKLGAD----------RFLSQKLSKNEFDKSCCRVLGRENLWLHNQLIQSILKN-----------ASGYPKTSTQSAKISP

Query:  LVEDGNEDGGAVFPTSTQNIPGWSN---GVSPRKCRSGIRDRKLKDRPSVLGPNGKVECISHLSA---------NMDNGDATLCDYKRPVQHLQGVAELP
        ++EDGNEDGGAVF TSTQ IP WSN    +SPRKCRSGIRDRKLKDRPS+L PN KVECIS  SA          MDNG+AT CDY+RPVQHLQGV ELP
Subjt:  LVEDGNEDGGAVFPTSTQNIPGWSN---GVSPRKCRSGIRDRKLKDRPSVLGPNGKVECISHLSA---------NMDNGDATLCDYKRPVQHLQGVAELP

Query:  ENNIEVRVPQPSGKQVLHNKIQVEATKVEDREEAGQSNHSSLLRSRLLAPLGIPFCSASTGGPHKTRPVDCGGDFSFGDVGHLLDTESLRRRMEQIAAVQ
        ENNIE RV +PSGKQVL  ++QVE TKVEDREEA QSN SSLLRSRLLAPLGIPFCSAS GG HKTRPVDCGG+FSF D+GHLLDTESLRRRMEQIAAVQ
Subjt:  ENNIEVRVPQPSGKQVLHNKIQVEATKVEDREEAGQSNHSSLLRSRLLAPLGIPFCSASTGGPHKTRPVDCGGDFSFGDVGHLLDTESLRRRMEQIAAVQ

Query:  GLGSVSADCANILNKVLDVYLKQLIRSCVDLVGAWPAYEPEKPLAHKQQIQGKVINGMLPNNQLHGRHSNGNEEVVHEHRLQCSISLLDFKVAMELNPTQ
        GLGSVSADCANILNKVLDVYLKQLIRSCVDLVG WP +EPEKPLAH QQIQGKVINGMLPNNQLH  HSNGN EVVHE RL CSISLLDFKVAMELNP Q
Subjt:  GLGSVSADCANILNKVLDVYLKQLIRSCVDLVGAWPAYEPEKPLAHKQQIQGKVINGMLPNNQLHGRHSNGNEEVVHEHRLQCSISLLDFKVAMELNPTQ

Query:  LGEDWPLLLEKICMRAFGE
        LGEDWPLLLEKI MRAF E
Subjt:  LGEDWPLLLEKICMRAFGE

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G14850.1 unknown protein4.6e-3330.5Show/hide
Query:  RIDLGELKSQIVKKLGADR----------FLSQKLSKNEFDKSCCRVLGRENLWLHNQLIQSILKNAS-------GYPKTSTQSAKISPLVEDGNEDGGA
        R++  E+K+ I +K+G  R          FL+ ++SK+EFDK C + +GREN+ LHN+L++SILKNAS        YPK S                G  
Subjt:  RIDLGELKSQIVKKLGADR----------FLSQKLSKNEFDKSCCRVLGRENLWLHNQLIQSILKNAS-------GYPKTSTQSAKISPLVEDGNEDGGA

Query:  VFPTSTQNIPGWSNGVSPRKCRSGIRDRKLKDRPSVLGPNGKVECISHLSANMDNGDATLCDYKRPVQHLQGVAELPENNIEVRVPQPSGKQVLHNKIQV
        VFP             SPRKCRS    RK +DRPS LGP GK + ++         D ++   +R                                + +
Subjt:  VFPTSTQNIPGWSNGVSPRKCRSGIRDRKLKDRPSVLGPNGKVECISHLSANMDNGDATLCDYKRPVQHLQGVAELPENNIEVRVPQPSGKQVLHNKIQV

Query:  EATKVEDREEAGQSNHSSLLRSR--LLAPLGIPFCSASTGGPHKTRPVDCGG--DFSFGDVGHLLDTESLRRRMEQIAAVQGLGSVSADCANILNKVLDV
        E   VED EE  Q   S  ++SR  L APLG+ F   S     K R     G    +    G L D  +LR R+E+   ++G+  +S D AN+LN+ L+ 
Subjt:  EATKVEDREEAGQSNHSSLLRSR--LLAPLGIPFCSASTGGPHKTRPVDCGG--DFSFGDVGHLLDTESLRRRMEQIAAVQGLGSVSADCANILNKVLDV

Query:  YLKQLIRSCVDLVGAWPAYEPEKPLAHKQQIQGKVINGMLPNNQLHGRHSNGNEEVVHEHRLQCSISLLDFKVAMELNPTQLGEDWPLLLEKICMRAFGE
        Y+++LI  C+ L                                              + R   ++S+LDF  AME+NP  LGE+WP+ LEKIC RA  E
Subjt:  YLKQLIRSCVDLVGAWPAYEPEKPLAHKQQIQGKVINGMLPNNQLHGRHSNGNEEVVHEHRLQCSISLLDFKVAMELNPTQLGEDWPLLLEKICMRAFGE

AT2G24530.1 unknown protein6.9e-8245.32Show/hide
Query:  MQPQQSLRIDLGELKSQIVKKLGAD----------RFLSQKLSKNEFDKSCCRVLGRENLWLHNQLIQSILKNAS-------GYPKTSTQSAKISPLVED
        MQ  Q  RI L ELK  IVKK G +          RFLSQKL+K+EFDK+C R+LGRENL LHNQLI+SIL+NA+        +    +  A       D
Subjt:  MQPQQSLRIDLGELKSQIVKKLGAD----------RFLSQKLSKNEFDKSCCRVLGRENLWLHNQLIQSILKNAS-------GYPKTSTQSAKISPLVED

Query:  GNEDGGAVFPTSTQNIPGWSNGV---SPRKCRSGIRDRKLKDRPSVLGPNGKVECISHL---------SANMDNGDATLCDYKRPVQHLQGVAELPENNI
        G E  G + P  +Q+ P WSNGV   SPRK RSG+++RK +DRPS LG NGKVE + H          S  M+NG     DY+R  ++   VA+  +   
Subjt:  GNEDGGAVFPTSTQNIPGWSNGV---SPRKCRSGIRDRKLKDRPSVLGPNGKVECISHL---------SANMDNGDATLCDYKRPVQHLQGVAELPENNI

Query:  EVRVPQPSGKQVLHNKIQVEATKVEDREEAGQSNHSSLLRSRLLAPLGIPFCSASTGGPHKTRPVDCGGD-FSFGDVGHLLDTESLRRRMEQIAAVQGLG
             +P  K  + NK ++ A  + D +   +    +L  S L+APLGIPFCSAS GG  +T PV    +  S  D G L D E LR+RME IA  QGL 
Subjt:  EVRVPQPSGKQVLHNKIQVEATKVEDREEAGQSNHSSLLRSRLLAPLGIPFCSASTGGPHKTRPVDCGGD-FSFGDVGHLLDTESLRRRMEQIAAVQGLG

Query:  SVSADCANILNKVLDVYLKQLIRSCVDLVGAWPAY-EPEKPLAHKQQIQGKVINGMLPNNQLHGRHSNGNEEVVHEHRLQCSISLLDFKVAMELNPTQLG
         VS +CA  LN +LDVYLK+LI SC DLVGA     +P K    KQQ Q K++NG+ P N L  +  NG+ ++  +H    S+S+LDF+ AMELNP QLG
Subjt:  SVSADCANILNKVLDVYLKQLIRSCVDLVGAWPAY-EPEKPLAHKQQIQGKVINGMLPNNQLHGRHSNGNEEVVHEHRLQCSISLLDFKVAMELNPTQLG

Query:  EDWPLLLEKICMRAFGE
        EDWP L E+I +R+F E
Subjt:  EDWPLLLEKICMRAFGE

AT4G31440.1 unknown protein2.0e-6843.03Show/hide
Query:  MQPQQSLRIDLGELKSQIVKKLGAD----------RFLSQKLSKNEFDKSCCRVLGRENLWLHNQLIQSILKNA-----------SGYPKTSTQSAKISP
        MQ  Q  RIDL ELK  IVKK+G +          RFLSQKL+K+EFDKSC R+LGRENL LHN+LI+SIL+NA           SG+P  S    K   
Subjt:  MQPQQSLRIDLGELKSQIVKKLGAD----------RFLSQKLSKNEFDKSCCRVLGRENLWLHNQLIQSILKNA-----------SGYPKTSTQSAKISP

Query:  LVEDGNEDGGAVFPTSTQNIPGWSNGVSPRKCRSGIRDRKLKDRPSVLGPNGKVECISHLSANMDNGDATLCDYKRPVQHLQGVAELPENNIEVRVPQPS
          EDG E+  ++ P   +N    SNGV  +       DR ++D+P  LG NGKV                   Y RP ++       P+   +     P+
Subjt:  LVEDGNEDGGAVFPTSTQNIPGWSNGVSPRKCRSGIRDRKLKDRPSVLGPNGKVECISHLSANMDNGDATLCDYKRPVQHLQGVAELPENNIEVRVPQPS

Query:  GKQVLHNKIQVEATKVEDREEAGQSNHSSLLRSRLLAPLGIPFCSASTGGPHKTRPVD-CGGDFSFGDVGHLLDTESLRRRMEQIAAVQGLGSVSADCAN
         ++ +  K QV A    D E    +    L    ++APLGIPFCSAS GG  +T PV       S  D G L DTE LR+RME IA  QGLG VSA+C+ 
Subjt:  GKQVLHNKIQVEATKVEDREEAGQSNHSSLLRSRLLAPLGIPFCSASTGGPHKTRPVD-CGGDFSFGDVGHLLDTESLRRRMEQIAAVQGLGSVSADCAN

Query:  ILNKVLDVYLKQLIRSCVDLVGAWPAY-EPEKPLAHKQQIQGKVINGMLPNNQLHGRHSNGNEEVVHEHRLQCSISLLDFKVAMELNPTQLGEDWPLLLE
        +LN +LD+YLK+L++SCVDL GA      P K    KQQ + +++NG+  NN  H + SN   ++  E   Q S+SLLDF+VAMELNP QLGEDWPLL E
Subjt:  ILNKVLDVYLKQLIRSCVDLVGAWPAY-EPEKPLAHKQQIQGKVINGMLPNNQLHGRHSNGNEEVVHEHRLQCSISLLDFKVAMELNPTQLGEDWPLLLE

Query:  KICMRAFGE
        +I +  F E
Subjt:  KICMRAFGE

AT4G33890.1 unknown protein2.3e-3229.95Show/hide
Query:  QQSLRIDLGELKSQIVKKLG----------ADRFLSQKLSKNEFDKSCCRVLGRENLWLHNQLIQSILKNASGYPKTSTQSAKISPLVEDGNE-----DG
        Q S R+D  E+K+ I +++G            RF + K++K+EFDK C + +GR+N+ LHN+LI+SI+KNA          AK  P ++ G       +G
Subjt:  QQSLRIDLGELKSQIVKKLG----------ADRFLSQKLSKNEFDKSCCRVLGRENLWLHNQLIQSILKNASGYPKTSTQSAKISPLVEDGNE-----DG

Query:  GAVFPTSTQNIPGWSN-GVSPRKCRSGIRDRKLKDRPSVLGPNGKVECISHLSANMDNGDATLCDYKRPVQHLQGVAELPENNIEVRVPQPSGKQVLHNK
         +   +  Q + G S    S RKCRS    RKL+DRPS LGP GK   ++  +             +  +   Q   EL                 L ++
Subjt:  GAVFPTSTQNIPGWSN-GVSPRKCRSGIRDRKLKDRPSVLGPNGKVECISHLSANMDNGDATLCDYKRPVQHLQGVAELPENNIEVRVPQPSGKQVLHNK

Query:  IQVEATKVEDREEAGQ-SNHSSLLRSR--LLAPLGIPFCSASTGGPHK--TRPVDCGGDF---SFGDVGHLLDTESLRRRMEQIAAVQGLGSVSADCANI
          VE   VE+ EE  Q +  S  ++SR  L APLG+   S   G   K  +    C   F   +  + G L DT +LR R+E+   ++GL  ++ D  ++
Subjt:  IQVEATKVEDREEAGQ-SNHSSLLRSR--LLAPLGIPFCSASTGGPHK--TRPVDCGGDF---SFGDVGHLLDTESLRRRMEQIAAVQGLGSVSADCANI

Query:  LNKVLDVYLKQLIRSCVDLVGAWPAYEPEKPLAHKQQIQGKVINGMLPNNQLHGRHSNGNEEVVHEHRLQCSISLLDFKVAMELNPTQLGEDWPLLLEKI
        LN  LDV++++LI  C+ L       +  + +                           N +   + R    +S+ DF+  MELN   LGEDWP+ +EKI
Subjt:  LNKVLDVYLKQLIRSCVDLVGAWPAYEPEKPLAHKQQIQGKVINGMLPNNQLHGRHSNGNEEVVHEHRLQCSISLLDFKVAMELNPTQLGEDWPLLLEKI

Query:  CMRA
        C RA
Subjt:  CMRA

AT4G33890.2 unknown protein2.3e-3229.95Show/hide
Query:  QQSLRIDLGELKSQIVKKLG----------ADRFLSQKLSKNEFDKSCCRVLGRENLWLHNQLIQSILKNASGYPKTSTQSAKISPLVEDGNE-----DG
        Q S R+D  E+K+ I +++G            RF + K++K+EFDK C + +GR+N+ LHN+LI+SI+KNA          AK  P ++ G       +G
Subjt:  QQSLRIDLGELKSQIVKKLG----------ADRFLSQKLSKNEFDKSCCRVLGRENLWLHNQLIQSILKNASGYPKTSTQSAKISPLVEDGNE-----DG

Query:  GAVFPTSTQNIPGWSN-GVSPRKCRSGIRDRKLKDRPSVLGPNGKVECISHLSANMDNGDATLCDYKRPVQHLQGVAELPENNIEVRVPQPSGKQVLHNK
         +   +  Q + G S    S RKCRS    RKL+DRPS LGP GK   ++  +             +  +   Q   EL                 L ++
Subjt:  GAVFPTSTQNIPGWSN-GVSPRKCRSGIRDRKLKDRPSVLGPNGKVECISHLSANMDNGDATLCDYKRPVQHLQGVAELPENNIEVRVPQPSGKQVLHNK

Query:  IQVEATKVEDREEAGQ-SNHSSLLRSR--LLAPLGIPFCSASTGGPHK--TRPVDCGGDF---SFGDVGHLLDTESLRRRMEQIAAVQGLGSVSADCANI
          VE   VE+ EE  Q +  S  ++SR  L APLG+   S   G   K  +    C   F   +  + G L DT +LR R+E+   ++GL  ++ D  ++
Subjt:  IQVEATKVEDREEAGQ-SNHSSLLRSR--LLAPLGIPFCSASTGGPHK--TRPVDCGGDF---SFGDVGHLLDTESLRRRMEQIAAVQGLGSVSADCANI

Query:  LNKVLDVYLKQLIRSCVDLVGAWPAYEPEKPLAHKQQIQGKVINGMLPNNQLHGRHSNGNEEVVHEHRLQCSISLLDFKVAMELNPTQLGEDWPLLLEKI
        LN  LDV++++LI  C+ L       +  + +                           N +   + R    +S+ DF+  MELN   LGEDWP+ +EKI
Subjt:  LNKVLDVYLKQLIRSCVDLVGAWPAYEPEKPLAHKQQIQGKVINGMLPNNQLHGRHSNGNEEVVHEHRLQCSISLLDFKVAMELNPTQLGEDWPLLLEKI

Query:  CMRA
        C RA
Subjt:  CMRA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCAACCTCAGCAGAGCTTGAGAATTGACTTGGGTGAATTGAAATCTCAGATAGTGAAGAAACTCGGAGCCGATCGGTTCCTGAGTCAAAAGCTGAGTAAGAATGAGTT
TGATAAGTCATGTTGTCGTGTACTTGGGAGGGAGAATCTCTGGCTGCATAATCAATTGATACAGTCAATTTTGAAGAATGCTTCAGGCTATCCGAAAACATCTACGCAAT
CTGCAAAAATTTCCCCTCTCGTAGAAGATGGGAATGAAGATGGTGGAGCTGTTTTTCCTACTTCCACGCAAAATATTCCCGGTTGGTCCAATGGAGTTTCACCAAGAAAG
TGCAGGTCTGGGATACGCGACCGCAAACTCAAAGACAGGCCGAGTGTACTGGGGCCAAATGGGAAGGTTGAATGTATCTCACATCTATCCGCAAACATGGATAATGGTGA
TGCAACACTATGTGACTATAAGAGACCAGTGCAGCATCTGCAAGGAGTTGCTGAACTACCTGAAAACAATATTGAGGTTAGAGTTCCACAACCGTCAGGAAAGCAAGTCC
TACATAATAAGATCCAAGTTGAAGCAACCAAGGTTGAAGACAGAGAAGAAGCAGGACAGTCAAATCACTCGAGTTTACTTCGGAGCCGATTACTTGCACCTCTTGGGATT
CCTTTTTGCTCAGCTAGTACTGGCGGGCCCCACAAAACAAGGCCGGTGGATTGTGGGGGTGATTTTAGCTTTGGTGATGTTGGTCATTTGTTGGATACTGAGTCATTGAG
ACGACGCATGGAACAAATTGCTGCTGTACAGGGCCTAGGCAGTGTTTCTGCAGATTGTGCTAATATTTTGAATAAGGTGTTGGATGTATATTTGAAGCAGTTAATTAGGT
CTTGTGTTGACTTGGTCGGAGCGTGGCCTGCATATGAGCCTGAGAAACCTCTTGCTCATAAGCAGCAGATTCAGGGGAAGGTTATCAATGGCATGTTGCCAAATAATCAA
TTACACGGACGACATAGCAATGGAAATGAAGAAGTTGTGCATGAGCACAGGTTACAGTGTTCAATATCGTTGCTCGACTTCAAGGTAGCAATGGAGCTTAATCCAACACA
ACTAGGGGAAGACTGGCCTTTGTTACTGGAGAAAATTTGTATGCGTGCCTTCGGGGAATGA
mRNA sequenceShow/hide mRNA sequence
ATGCAACCTCAGCAGAGCTTGAGAATTGACTTGGGTGAATTGAAATCTCAGATAGTGAAGAAACTCGGAGCCGATCGGTTCCTGAGTCAAAAGCTGAGTAAGAATGAGTT
TGATAAGTCATGTTGTCGTGTACTTGGGAGGGAGAATCTCTGGCTGCATAATCAATTGATACAGTCAATTTTGAAGAATGCTTCAGGCTATCCGAAAACATCTACGCAAT
CTGCAAAAATTTCCCCTCTCGTAGAAGATGGGAATGAAGATGGTGGAGCTGTTTTTCCTACTTCCACGCAAAATATTCCCGGTTGGTCCAATGGAGTTTCACCAAGAAAG
TGCAGGTCTGGGATACGCGACCGCAAACTCAAAGACAGGCCGAGTGTACTGGGGCCAAATGGGAAGGTTGAATGTATCTCACATCTATCCGCAAACATGGATAATGGTGA
TGCAACACTATGTGACTATAAGAGACCAGTGCAGCATCTGCAAGGAGTTGCTGAACTACCTGAAAACAATATTGAGGTTAGAGTTCCACAACCGTCAGGAAAGCAAGTCC
TACATAATAAGATCCAAGTTGAAGCAACCAAGGTTGAAGACAGAGAAGAAGCAGGACAGTCAAATCACTCGAGTTTACTTCGGAGCCGATTACTTGCACCTCTTGGGATT
CCTTTTTGCTCAGCTAGTACTGGCGGGCCCCACAAAACAAGGCCGGTGGATTGTGGGGGTGATTTTAGCTTTGGTGATGTTGGTCATTTGTTGGATACTGAGTCATTGAG
ACGACGCATGGAACAAATTGCTGCTGTACAGGGCCTAGGCAGTGTTTCTGCAGATTGTGCTAATATTTTGAATAAGGTGTTGGATGTATATTTGAAGCAGTTAATTAGGT
CTTGTGTTGACTTGGTCGGAGCGTGGCCTGCATATGAGCCTGAGAAACCTCTTGCTCATAAGCAGCAGATTCAGGGGAAGGTTATCAATGGCATGTTGCCAAATAATCAA
TTACACGGACGACATAGCAATGGAAATGAAGAAGTTGTGCATGAGCACAGGTTACAGTGTTCAATATCGTTGCTCGACTTCAAGGTAGCAATGGAGCTTAATCCAACACA
ACTAGGGGAAGACTGGCCTTTGTTACTGGAGAAAATTTGTATGCGTGCCTTCGGGGAATGA
Protein sequenceShow/hide protein sequence
MQPQQSLRIDLGELKSQIVKKLGADRFLSQKLSKNEFDKSCCRVLGRENLWLHNQLIQSILKNASGYPKTSTQSAKISPLVEDGNEDGGAVFPTSTQNIPGWSNGVSPRK
CRSGIRDRKLKDRPSVLGPNGKVECISHLSANMDNGDATLCDYKRPVQHLQGVAELPENNIEVRVPQPSGKQVLHNKIQVEATKVEDREEAGQSNHSSLLRSRLLAPLGI
PFCSASTGGPHKTRPVDCGGDFSFGDVGHLLDTESLRRRMEQIAAVQGLGSVSADCANILNKVLDVYLKQLIRSCVDLVGAWPAYEPEKPLAHKQQIQGKVINGMLPNNQ
LHGRHSNGNEEVVHEHRLQCSISLLDFKVAMELNPTQLGEDWPLLLEKICMRAFGE