; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

PI0025500 (gene) of Melon (PI 482460) v1 genome

Gene IDPI0025500
OrganismCucumis metuliferus PI 482460 (Melon (PI 482460) v1)
DescriptionSAGA-Tad1 domain-containing protein
Genome locationchr03:3812651..3815345
RNA-Seq ExpressionPI0025500
SyntenyPI0025500
Gene Ontology termsGO:0006357 - regulation of transcription by RNA polymerase II (biological process)
GO:0045893 - positive regulation of transcription, DNA-templated (biological process)
GO:0000124 - SAGA complex (cellular component)
GO:0003713 - transcription coactivator activity (molecular function)
InterPro domainsIPR024738 - Transcriptional coactivator Hfi1/Transcriptional adapter 1


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004138880.1 uncharacterized protein LOC101213741 [Cucumis sativus]9.6e-22595.61Show/hide
Query:  MQPQQSLRIDLGELKSQIVKKLGADRSKRYFFYLNRFLSQKLSKNEFDKSCCRVLGRENLWLHNQLIQSILKNACQAKAAPPIPVAGYPKTSTQSAKISP
        MQPQQSLRIDLGELKSQIVKKLGADRSKRYFFYLNRFLSQKLSKNEFDKSCCRVLGRENLWLHNQLIQSILKNACQAK APPIPVAGYPKTSTQSAKISP
Subjt:  MQPQQSLRIDLGELKSQIVKKLGADRSKRYFFYLNRFLSQKLSKNEFDKSCCRVLGRENLWLHNQLIQSILKNACQAKAAPPIPVAGYPKTSTQSAKISP

Query:  LVEDGNEDGGAVFPTSTQNIPGWSNGGFPVSPRKCRSGIRDRKLKDRPSVLGPNGKGECISHLSANMDNGDATLCDYKRPVQHLQGVAELPENNIEVRVP
        LVEDGNEDGGAVFPTSTQNIPGWSNG   VSPRKCRSGIRDRKLKDRPS+LGPNGK ECISHLSANMDNGDATLCDYKRPVQ+LQG+AELPENNIEVRVP
Subjt:  LVEDGNEDGGAVFPTSTQNIPGWSNGGFPVSPRKCRSGIRDRKLKDRPSVLGPNGKGECISHLSANMDNGDATLCDYKRPVQHLQGVAELPENNIEVRVP

Query:  QQSGKQVLQNKIQVEATKVEDREEAGQSHHSSLLRSRLLAPLGIPFCSASTGGARKTRPVDCGGDFSFSDVGHLLDTESLRRRMEQIAAVQGLGSVSADC
        Q SGKQ LQNKIQVEATKVEDREEAGQS+HSSLLRSRLLAPLGIPFCSAS GGARKTRPVDCGGDFS SDVGHLLDTESLRRRMEQIAAVQGLGSVSADC
Subjt:  QQSGKQVLQNKIQVEATKVEDREEAGQSHHSSLLRSRLLAPLGIPFCSASTGGARKTRPVDCGGDFSFSDVGHLLDTESLRRRMEQIAAVQGLGSVSADC

Query:  ANILNKVLDVYLKQLIRSCVDLVGAWPAYEPEKPLAHKQQIQGKVINGMLPNNQLHGRHSNGNEEVVHEHRLQCSISLLDFKVAMELNPTQLGEDWPLLL
        ANILNKVLDVYLKQLIRSCVDLVGAWPAYEPEKPL+HKQQ QGKVINGMLPNNQLHGRHSNG+EEVVHEHRLQCSISLLDFKVAMELNPTQLGEDWPLLL
Subjt:  ANILNKVLDVYLKQLIRSCVDLVGAWPAYEPEKPLAHKQQIQGKVINGMLPNNQLHGRHSNGNEEVVHEHRLQCSISLLDFKVAMELNPTQLGEDWPLLL

Query:  EKICMRAFEE
        EKICMR F E
Subjt:  EKICMRAFEE

XP_008445087.1 PREDICTED: uncharacterized protein LOC103488231 [Cucumis melo]4.2e-22897.32Show/hide
Query:  MQPQQSLRIDLGELKSQIVKKLGADRSKRYFFYLNRFLSQKLSKNEFDKSCCRVLGRENLWLHNQLIQSILKNACQAKAAPPIPVAGYPKTSTQSAKISP
        MQPQQSLRIDLGELKSQIVKKLGADRSKRYFFYLNRFLSQKLSKNEFDKSCCRVLGRENLWLHNQLIQSILKNACQAKAAPPIPVAGYPKTSTQSAKISP
Subjt:  MQPQQSLRIDLGELKSQIVKKLGADRSKRYFFYLNRFLSQKLSKNEFDKSCCRVLGRENLWLHNQLIQSILKNACQAKAAPPIPVAGYPKTSTQSAKISP

Query:  LVEDGNEDGGAVFPTSTQNIPGWSNGGFPVSPRKCRSGIRDRKLKDRPSVLGPNGKGECISHLSANMDNGDATLCDYKRPVQHLQGVAELPENNIEVRVP
        LVEDGNEDGGAVFPTSTQNIPGWSNG   VSPRKCRSGIRDRKLKDRPSVLGPNGK ECISHLSANMDNGDATLCDYKRPVQHLQGVAELPENNIEVRVP
Subjt:  LVEDGNEDGGAVFPTSTQNIPGWSNGGFPVSPRKCRSGIRDRKLKDRPSVLGPNGKGECISHLSANMDNGDATLCDYKRPVQHLQGVAELPENNIEVRVP

Query:  QQSGKQVLQNKIQVEATKVEDREEAGQSHHSSLLRSRLLAPLGIPFCSASTGGARKTRPVDCGGDFSFSDVGHLLDTESLRRRMEQIAAVQGLGSVSADC
        Q SGKQVL NKIQVEATKVEDREEAGQS+HSSLLRSRLLAPLGIPFCSASTGG  KTRPVDCGGDFSF DVGHLLDTESLRRRMEQIAAVQGLGSVSADC
Subjt:  QQSGKQVLQNKIQVEATKVEDREEAGQSHHSSLLRSRLLAPLGIPFCSASTGGARKTRPVDCGGDFSFSDVGHLLDTESLRRRMEQIAAVQGLGSVSADC

Query:  ANILNKVLDVYLKQLIRSCVDLVGAWPAYEPEKPLAHKQQIQGKVINGMLPNNQLHGRHSNGNEEVVHEHRLQCSISLLDFKVAMELNPTQLGEDWPLLL
        ANILNKVLDVYLKQLIRSCVDLVGAWPAYEPEKPLAHKQQIQGKVINGMLPNNQLHGRHSNGNEEVVHEHRLQCSISLLDFKVAMELNPTQLGEDWPLLL
Subjt:  ANILNKVLDVYLKQLIRSCVDLVGAWPAYEPEKPLAHKQQIQGKVINGMLPNNQLHGRHSNGNEEVVHEHRLQCSISLLDFKVAMELNPTQLGEDWPLLL

Query:  EKICMRAFEE
        EKICMRAF E
Subjt:  EKICMRAFEE

XP_022962598.1 uncharacterized protein LOC111463000 [Cucurbita moschata]1.5e-19885.68Show/hide
Query:  MQPQQSLRIDLGELKSQIVKKLGADRSKRYFFYLNRFLSQKLSKNEFDKSCCRVLGRENLWLHNQLIQSILKNACQAKAAPPIPVAGYPKTSTQSAKISP
        MQPQQSLRIDL ELKSQIVKKLG DRSKRYFFYLNRFLSQKLSKNEFDK CCRVLGRENLWLHNQLIQSILKNACQAKAAPPIP AGYPKTSTQ+AKISP
Subjt:  MQPQQSLRIDLGELKSQIVKKLGADRSKRYFFYLNRFLSQKLSKNEFDKSCCRVLGRENLWLHNQLIQSILKNACQAKAAPPIPVAGYPKTSTQSAKISP

Query:  LVEDGNEDGGAVFPTSTQNIPGWSNGGFPVSPRKCRSGIRDRKLKDRPSVLGPNGKGECISHLSA---------NMDNGDATLCDYKRPVQHLQGVAELP
        ++EDGNEDGGAVFPTSTQ IP WSN GFPVSPRKCRSGIRDRKLKDRPS+L PN K ECIS  SA          MDNG+AT CDY+RPVQHLQGV ELP
Subjt:  LVEDGNEDGGAVFPTSTQNIPGWSNGGFPVSPRKCRSGIRDRKLKDRPSVLGPNGKGECISHLSA---------NMDNGDATLCDYKRPVQHLQGVAELP

Query:  ENNIEVRVPQQSGKQVLQNKIQVEATKVEDREEAGQSHHSSLLRSRLLAPLGIPFCSASTGGARKTRPVDCGGDFSFSDVGHLLDTESLRRRMEQIAAVQ
        ENNIE RV + SGKQVLQ        +VEDREEA QS+ SSLLRSRLLAPLGIPFCSAS GGA KTRPVDCGG+FSFSD+GHLLDTESLRRRMEQIAAVQ
Subjt:  ENNIEVRVPQQSGKQVLQNKIQVEATKVEDREEAGQSHHSSLLRSRLLAPLGIPFCSASTGGARKTRPVDCGGDFSFSDVGHLLDTESLRRRMEQIAAVQ

Query:  GLGSVSADCANILNKVLDVYLKQLIRSCVDLVGAWPAYEPEKPLAHKQQIQGKVINGMLPNNQLHGRHSNGNEEVVHEHRLQCSISLLDFKVAMELNPTQ
        GLGSVSADCANILNKVLDVYLKQLIRSCVDLVGAWPA+EPEKPLAH QQIQGKVINGMLPNNQLH  HSNGN EVVHE RL CSISLLDFKVAMELNP Q
Subjt:  GLGSVSADCANILNKVLDVYLKQLIRSCVDLVGAWPAYEPEKPLAHKQQIQGKVINGMLPNNQLHGRHSNGNEEVVHEHRLQCSISLLDFKVAMELNPTQ

Query:  LGEDWPLLLEKICMRAFEE
        LGEDWPLLLEKI MRAF E
Subjt:  LGEDWPLLLEKICMRAFEE

XP_022997521.1 uncharacterized protein LOC111492414 [Cucurbita maxima]6.9e-19985.68Show/hide
Query:  MQPQQSLRIDLGELKSQIVKKLGADRSKRYFFYLNRFLSQKLSKNEFDKSCCRVLGRENLWLHNQLIQSILKNACQAKAAPPIPVAGYPKTSTQSAKISP
        MQPQQSLRIDL ELKSQIVKKLG DRSKRYFFYLNRFLSQKLSKNEFDK CCRVLGRENLWLHNQLIQSILKNACQAKAAPPIP AGYPKTSTQ+AKISP
Subjt:  MQPQQSLRIDLGELKSQIVKKLGADRSKRYFFYLNRFLSQKLSKNEFDKSCCRVLGRENLWLHNQLIQSILKNACQAKAAPPIPVAGYPKTSTQSAKISP

Query:  LVEDGNEDGGAVFPTSTQNIPGWSNGGFPVSPRKCRSGIRDRKLKDRPSVLGPNGKGECISHLSA---------NMDNGDATLCDYKRPVQHLQGVAELP
        ++EDGNEDGGAVF TSTQ IP WSN GF +SPRKCRSGIRDRKLKDRPS+L PN K ECIS  SA          MDNG+AT CDY+RPVQHLQGV ELP
Subjt:  LVEDGNEDGGAVFPTSTQNIPGWSNGGFPVSPRKCRSGIRDRKLKDRPSVLGPNGKGECISHLSA---------NMDNGDATLCDYKRPVQHLQGVAELP

Query:  ENNIEVRVPQQSGKQVLQNKIQVEATKVEDREEAGQSHHSSLLRSRLLAPLGIPFCSASTGGARKTRPVDCGGDFSFSDVGHLLDTESLRRRMEQIAAVQ
        ENNIE RV + SGKQVLQ  +QVE TKVEDREEA QS+ SSLLRSRLLAPLGIPFCSAS GGA KTRPVDCGG+FSFSD+GHLLDTESLRRRMEQIAAVQ
Subjt:  ENNIEVRVPQQSGKQVLQNKIQVEATKVEDREEAGQSHHSSLLRSRLLAPLGIPFCSASTGGARKTRPVDCGGDFSFSDVGHLLDTESLRRRMEQIAAVQ

Query:  GLGSVSADCANILNKVLDVYLKQLIRSCVDLVGAWPAYEPEKPLAHKQQIQGKVINGMLPNNQLHGRHSNGNEEVVHEHRLQCSISLLDFKVAMELNPTQ
        GLGSVSADCANILNKVLDVYLKQLIRSCVDLVG WP +EPEKPLAH QQIQGKVINGMLPNNQLH  HSNGN EVVHE RL CSISLLDFKVAMELNP Q
Subjt:  GLGSVSADCANILNKVLDVYLKQLIRSCVDLVGAWPAYEPEKPLAHKQQIQGKVINGMLPNNQLHGRHSNGNEEVVHEHRLQCSISLLDFKVAMELNPTQ

Query:  LGEDWPLLLEKICMRAFEE
        LGEDWPLLLEKI MRAF E
Subjt:  LGEDWPLLLEKICMRAFEE

XP_023546134.1 uncharacterized protein LOC111805335 [Cucurbita pepo subsp. pepo]1.5e-20186.63Show/hide
Query:  MQPQQSLRIDLGELKSQIVKKLGADRSKRYFFYLNRFLSQKLSKNEFDKSCCRVLGRENLWLHNQLIQSILKNACQAKAAPPIPVAGYPKTSTQSAKISP
        MQPQQSLRIDL ELKSQIVKKLG DRSKRYFFYLNRFLSQKLSKNEFDK CCRVLGRENLWLHNQLIQSILKNACQAKAAPPIP AGYPKTSTQ+AKISP
Subjt:  MQPQQSLRIDLGELKSQIVKKLGADRSKRYFFYLNRFLSQKLSKNEFDKSCCRVLGRENLWLHNQLIQSILKNACQAKAAPPIPVAGYPKTSTQSAKISP

Query:  LVEDGNEDGGAVFPTSTQNIPGWSNGGFPVSPRKCRSGIRDRKLKDRPSVLGPNGKGECISHLSA---------NMDNGDATLCDYKRPVQHLQGVAELP
        ++EDGNEDGGAVFPTSTQ IP WSN GFPVSPRKCRSGIRDRKLKDRPS+L PN K ECIS  SA          +DNG+AT CDY+RPVQHLQGV ELP
Subjt:  LVEDGNEDGGAVFPTSTQNIPGWSNGGFPVSPRKCRSGIRDRKLKDRPSVLGPNGKGECISHLSA---------NMDNGDATLCDYKRPVQHLQGVAELP

Query:  ENNIEVRVPQQSGKQVLQNKIQVEATKVEDREEAGQSHHSSLLRSRLLAPLGIPFCSASTGGARKTRPVDCGGDFSFSDVGHLLDTESLRRRMEQIAAVQ
        ENNIE RV + SGKQVLQ  +QVE TKVEDREEA QS+ SSLLRSRLLAPLGIPFCSAS GGA KTRPVDCGG+FSFSD+GHLLDTESLRRRMEQIAAVQ
Subjt:  ENNIEVRVPQQSGKQVLQNKIQVEATKVEDREEAGQSHHSSLLRSRLLAPLGIPFCSASTGGARKTRPVDCGGDFSFSDVGHLLDTESLRRRMEQIAAVQ

Query:  GLGSVSADCANILNKVLDVYLKQLIRSCVDLVGAWPAYEPEKPLAHKQQIQGKVINGMLPNNQLHGRHSNGNEEVVHEHRLQCSISLLDFKVAMELNPTQ
        GLGSVSADCANILNKVLDVYLKQLIRSCVDLVGAWPA+EPEKPLAH QQIQGKVINGMLPNNQLH  HSNGN EVVHE RL CSISLLDFKVAMELNP Q
Subjt:  GLGSVSADCANILNKVLDVYLKQLIRSCVDLVGAWPAYEPEKPLAHKQQIQGKVINGMLPNNQLHGRHSNGNEEVVHEHRLQCSISLLDFKVAMELNPTQ

Query:  LGEDWPLLLEKICMRAFEE
        LGEDWPLLLEKI MRAF E
Subjt:  LGEDWPLLLEKICMRAFEE

TrEMBL top hitse value%identityAlignment
A0A0A0LM32 Uncharacterized protein4.6e-22595.61Show/hide
Query:  MQPQQSLRIDLGELKSQIVKKLGADRSKRYFFYLNRFLSQKLSKNEFDKSCCRVLGRENLWLHNQLIQSILKNACQAKAAPPIPVAGYPKTSTQSAKISP
        MQPQQSLRIDLGELKSQIVKKLGADRSKRYFFYLNRFLSQKLSKNEFDKSCCRVLGRENLWLHNQLIQSILKNACQAK APPIPVAGYPKTSTQSAKISP
Subjt:  MQPQQSLRIDLGELKSQIVKKLGADRSKRYFFYLNRFLSQKLSKNEFDKSCCRVLGRENLWLHNQLIQSILKNACQAKAAPPIPVAGYPKTSTQSAKISP

Query:  LVEDGNEDGGAVFPTSTQNIPGWSNGGFPVSPRKCRSGIRDRKLKDRPSVLGPNGKGECISHLSANMDNGDATLCDYKRPVQHLQGVAELPENNIEVRVP
        LVEDGNEDGGAVFPTSTQNIPGWSNG   VSPRKCRSGIRDRKLKDRPS+LGPNGK ECISHLSANMDNGDATLCDYKRPVQ+LQG+AELPENNIEVRVP
Subjt:  LVEDGNEDGGAVFPTSTQNIPGWSNGGFPVSPRKCRSGIRDRKLKDRPSVLGPNGKGECISHLSANMDNGDATLCDYKRPVQHLQGVAELPENNIEVRVP

Query:  QQSGKQVLQNKIQVEATKVEDREEAGQSHHSSLLRSRLLAPLGIPFCSASTGGARKTRPVDCGGDFSFSDVGHLLDTESLRRRMEQIAAVQGLGSVSADC
        Q SGKQ LQNKIQVEATKVEDREEAGQS+HSSLLRSRLLAPLGIPFCSAS GGARKTRPVDCGGDFS SDVGHLLDTESLRRRMEQIAAVQGLGSVSADC
Subjt:  QQSGKQVLQNKIQVEATKVEDREEAGQSHHSSLLRSRLLAPLGIPFCSASTGGARKTRPVDCGGDFSFSDVGHLLDTESLRRRMEQIAAVQGLGSVSADC

Query:  ANILNKVLDVYLKQLIRSCVDLVGAWPAYEPEKPLAHKQQIQGKVINGMLPNNQLHGRHSNGNEEVVHEHRLQCSISLLDFKVAMELNPTQLGEDWPLLL
        ANILNKVLDVYLKQLIRSCVDLVGAWPAYEPEKPL+HKQQ QGKVINGMLPNNQLHGRHSNG+EEVVHEHRLQCSISLLDFKVAMELNPTQLGEDWPLLL
Subjt:  ANILNKVLDVYLKQLIRSCVDLVGAWPAYEPEKPLAHKQQIQGKVINGMLPNNQLHGRHSNGNEEVVHEHRLQCSISLLDFKVAMELNPTQLGEDWPLLL

Query:  EKICMRAFEE
        EKICMR F E
Subjt:  EKICMRAFEE

A0A1S3BCQ5 uncharacterized protein LOC1034882312.0e-22897.32Show/hide
Query:  MQPQQSLRIDLGELKSQIVKKLGADRSKRYFFYLNRFLSQKLSKNEFDKSCCRVLGRENLWLHNQLIQSILKNACQAKAAPPIPVAGYPKTSTQSAKISP
        MQPQQSLRIDLGELKSQIVKKLGADRSKRYFFYLNRFLSQKLSKNEFDKSCCRVLGRENLWLHNQLIQSILKNACQAKAAPPIPVAGYPKTSTQSAKISP
Subjt:  MQPQQSLRIDLGELKSQIVKKLGADRSKRYFFYLNRFLSQKLSKNEFDKSCCRVLGRENLWLHNQLIQSILKNACQAKAAPPIPVAGYPKTSTQSAKISP

Query:  LVEDGNEDGGAVFPTSTQNIPGWSNGGFPVSPRKCRSGIRDRKLKDRPSVLGPNGKGECISHLSANMDNGDATLCDYKRPVQHLQGVAELPENNIEVRVP
        LVEDGNEDGGAVFPTSTQNIPGWSNG   VSPRKCRSGIRDRKLKDRPSVLGPNGK ECISHLSANMDNGDATLCDYKRPVQHLQGVAELPENNIEVRVP
Subjt:  LVEDGNEDGGAVFPTSTQNIPGWSNGGFPVSPRKCRSGIRDRKLKDRPSVLGPNGKGECISHLSANMDNGDATLCDYKRPVQHLQGVAELPENNIEVRVP

Query:  QQSGKQVLQNKIQVEATKVEDREEAGQSHHSSLLRSRLLAPLGIPFCSASTGGARKTRPVDCGGDFSFSDVGHLLDTESLRRRMEQIAAVQGLGSVSADC
        Q SGKQVL NKIQVEATKVEDREEAGQS+HSSLLRSRLLAPLGIPFCSASTGG  KTRPVDCGGDFSF DVGHLLDTESLRRRMEQIAAVQGLGSVSADC
Subjt:  QQSGKQVLQNKIQVEATKVEDREEAGQSHHSSLLRSRLLAPLGIPFCSASTGGARKTRPVDCGGDFSFSDVGHLLDTESLRRRMEQIAAVQGLGSVSADC

Query:  ANILNKVLDVYLKQLIRSCVDLVGAWPAYEPEKPLAHKQQIQGKVINGMLPNNQLHGRHSNGNEEVVHEHRLQCSISLLDFKVAMELNPTQLGEDWPLLL
        ANILNKVLDVYLKQLIRSCVDLVGAWPAYEPEKPLAHKQQIQGKVINGMLPNNQLHGRHSNGNEEVVHEHRLQCSISLLDFKVAMELNPTQLGEDWPLLL
Subjt:  ANILNKVLDVYLKQLIRSCVDLVGAWPAYEPEKPLAHKQQIQGKVINGMLPNNQLHGRHSNGNEEVVHEHRLQCSISLLDFKVAMELNPTQLGEDWPLLL

Query:  EKICMRAFEE
        EKICMRAF E
Subjt:  EKICMRAFEE

A0A5A7VF96 SAGA-Tad1 domain-containing protein2.0e-22897.32Show/hide
Query:  MQPQQSLRIDLGELKSQIVKKLGADRSKRYFFYLNRFLSQKLSKNEFDKSCCRVLGRENLWLHNQLIQSILKNACQAKAAPPIPVAGYPKTSTQSAKISP
        MQPQQSLRIDLGELKSQIVKKLGADRSKRYFFYLNRFLSQKLSKNEFDKSCCRVLGRENLWLHNQLIQSILKNACQAKAAPPIPVAGYPKTSTQSAKISP
Subjt:  MQPQQSLRIDLGELKSQIVKKLGADRSKRYFFYLNRFLSQKLSKNEFDKSCCRVLGRENLWLHNQLIQSILKNACQAKAAPPIPVAGYPKTSTQSAKISP

Query:  LVEDGNEDGGAVFPTSTQNIPGWSNGGFPVSPRKCRSGIRDRKLKDRPSVLGPNGKGECISHLSANMDNGDATLCDYKRPVQHLQGVAELPENNIEVRVP
        LVEDGNEDGGAVFPTSTQNIPGWSNG   VSPRKCRSGIRDRKLKDRPSVLGPNGK ECISHLSANMDNGDATLCDYKRPVQHLQGVAELPENNIEVRVP
Subjt:  LVEDGNEDGGAVFPTSTQNIPGWSNGGFPVSPRKCRSGIRDRKLKDRPSVLGPNGKGECISHLSANMDNGDATLCDYKRPVQHLQGVAELPENNIEVRVP

Query:  QQSGKQVLQNKIQVEATKVEDREEAGQSHHSSLLRSRLLAPLGIPFCSASTGGARKTRPVDCGGDFSFSDVGHLLDTESLRRRMEQIAAVQGLGSVSADC
        Q SGKQVL NKIQVEATKVEDREEAGQS+HSSLLRSRLLAPLGIPFCSASTGG  KTRPVDCGGDFSF DVGHLLDTESLRRRMEQIAAVQGLGSVSADC
Subjt:  QQSGKQVLQNKIQVEATKVEDREEAGQSHHSSLLRSRLLAPLGIPFCSASTGGARKTRPVDCGGDFSFSDVGHLLDTESLRRRMEQIAAVQGLGSVSADC

Query:  ANILNKVLDVYLKQLIRSCVDLVGAWPAYEPEKPLAHKQQIQGKVINGMLPNNQLHGRHSNGNEEVVHEHRLQCSISLLDFKVAMELNPTQLGEDWPLLL
        ANILNKVLDVYLKQLIRSCVDLVGAWPAYEPEKPLAHKQQIQGKVINGMLPNNQLHGRHSNGNEEVVHEHRLQCSISLLDFKVAMELNPTQLGEDWPLLL
Subjt:  ANILNKVLDVYLKQLIRSCVDLVGAWPAYEPEKPLAHKQQIQGKVINGMLPNNQLHGRHSNGNEEVVHEHRLQCSISLLDFKVAMELNPTQLGEDWPLLL

Query:  EKICMRAFEE
        EKICMRAF E
Subjt:  EKICMRAFEE

A0A6J1HF85 uncharacterized protein LOC1114630007.5e-19985.68Show/hide
Query:  MQPQQSLRIDLGELKSQIVKKLGADRSKRYFFYLNRFLSQKLSKNEFDKSCCRVLGRENLWLHNQLIQSILKNACQAKAAPPIPVAGYPKTSTQSAKISP
        MQPQQSLRIDL ELKSQIVKKLG DRSKRYFFYLNRFLSQKLSKNEFDK CCRVLGRENLWLHNQLIQSILKNACQAKAAPPIP AGYPKTSTQ+AKISP
Subjt:  MQPQQSLRIDLGELKSQIVKKLGADRSKRYFFYLNRFLSQKLSKNEFDKSCCRVLGRENLWLHNQLIQSILKNACQAKAAPPIPVAGYPKTSTQSAKISP

Query:  LVEDGNEDGGAVFPTSTQNIPGWSNGGFPVSPRKCRSGIRDRKLKDRPSVLGPNGKGECISHLSA---------NMDNGDATLCDYKRPVQHLQGVAELP
        ++EDGNEDGGAVFPTSTQ IP WSN GFPVSPRKCRSGIRDRKLKDRPS+L PN K ECIS  SA          MDNG+AT CDY+RPVQHLQGV ELP
Subjt:  LVEDGNEDGGAVFPTSTQNIPGWSNGGFPVSPRKCRSGIRDRKLKDRPSVLGPNGKGECISHLSA---------NMDNGDATLCDYKRPVQHLQGVAELP

Query:  ENNIEVRVPQQSGKQVLQNKIQVEATKVEDREEAGQSHHSSLLRSRLLAPLGIPFCSASTGGARKTRPVDCGGDFSFSDVGHLLDTESLRRRMEQIAAVQ
        ENNIE RV + SGKQVLQ        +VEDREEA QS+ SSLLRSRLLAPLGIPFCSAS GGA KTRPVDCGG+FSFSD+GHLLDTESLRRRMEQIAAVQ
Subjt:  ENNIEVRVPQQSGKQVLQNKIQVEATKVEDREEAGQSHHSSLLRSRLLAPLGIPFCSASTGGARKTRPVDCGGDFSFSDVGHLLDTESLRRRMEQIAAVQ

Query:  GLGSVSADCANILNKVLDVYLKQLIRSCVDLVGAWPAYEPEKPLAHKQQIQGKVINGMLPNNQLHGRHSNGNEEVVHEHRLQCSISLLDFKVAMELNPTQ
        GLGSVSADCANILNKVLDVYLKQLIRSCVDLVGAWPA+EPEKPLAH QQIQGKVINGMLPNNQLH  HSNGN EVVHE RL CSISLLDFKVAMELNP Q
Subjt:  GLGSVSADCANILNKVLDVYLKQLIRSCVDLVGAWPAYEPEKPLAHKQQIQGKVINGMLPNNQLHGRHSNGNEEVVHEHRLQCSISLLDFKVAMELNPTQ

Query:  LGEDWPLLLEKICMRAFEE
        LGEDWPLLLEKI MRAF E
Subjt:  LGEDWPLLLEKICMRAFEE

A0A6J1K7Q1 uncharacterized protein LOC1114924143.3e-19985.68Show/hide
Query:  MQPQQSLRIDLGELKSQIVKKLGADRSKRYFFYLNRFLSQKLSKNEFDKSCCRVLGRENLWLHNQLIQSILKNACQAKAAPPIPVAGYPKTSTQSAKISP
        MQPQQSLRIDL ELKSQIVKKLG DRSKRYFFYLNRFLSQKLSKNEFDK CCRVLGRENLWLHNQLIQSILKNACQAKAAPPIP AGYPKTSTQ+AKISP
Subjt:  MQPQQSLRIDLGELKSQIVKKLGADRSKRYFFYLNRFLSQKLSKNEFDKSCCRVLGRENLWLHNQLIQSILKNACQAKAAPPIPVAGYPKTSTQSAKISP

Query:  LVEDGNEDGGAVFPTSTQNIPGWSNGGFPVSPRKCRSGIRDRKLKDRPSVLGPNGKGECISHLSA---------NMDNGDATLCDYKRPVQHLQGVAELP
        ++EDGNEDGGAVF TSTQ IP WSN GF +SPRKCRSGIRDRKLKDRPS+L PN K ECIS  SA          MDNG+AT CDY+RPVQHLQGV ELP
Subjt:  LVEDGNEDGGAVFPTSTQNIPGWSNGGFPVSPRKCRSGIRDRKLKDRPSVLGPNGKGECISHLSA---------NMDNGDATLCDYKRPVQHLQGVAELP

Query:  ENNIEVRVPQQSGKQVLQNKIQVEATKVEDREEAGQSHHSSLLRSRLLAPLGIPFCSASTGGARKTRPVDCGGDFSFSDVGHLLDTESLRRRMEQIAAVQ
        ENNIE RV + SGKQVLQ  +QVE TKVEDREEA QS+ SSLLRSRLLAPLGIPFCSAS GGA KTRPVDCGG+FSFSD+GHLLDTESLRRRMEQIAAVQ
Subjt:  ENNIEVRVPQQSGKQVLQNKIQVEATKVEDREEAGQSHHSSLLRSRLLAPLGIPFCSASTGGARKTRPVDCGGDFSFSDVGHLLDTESLRRRMEQIAAVQ

Query:  GLGSVSADCANILNKVLDVYLKQLIRSCVDLVGAWPAYEPEKPLAHKQQIQGKVINGMLPNNQLHGRHSNGNEEVVHEHRLQCSISLLDFKVAMELNPTQ
        GLGSVSADCANILNKVLDVYLKQLIRSCVDLVG WP +EPEKPLAH QQIQGKVINGMLPNNQLH  HSNGN EVVHE RL CSISLLDFKVAMELNP Q
Subjt:  GLGSVSADCANILNKVLDVYLKQLIRSCVDLVGAWPAYEPEKPLAHKQQIQGKVINGMLPNNQLHGRHSNGNEEVVHEHRLQCSISLLDFKVAMELNPTQ

Query:  LGEDWPLLLEKICMRAFEE
        LGEDWPLLLEKI MRAF E
Subjt:  LGEDWPLLLEKICMRAFEE

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G14850.1 unknown protein1.6e-3930.62Show/hide
Query:  RIDLGELKSQIVKKLGADRSKRYFFYLNRFLSQKLSKNEFDKSCCRVLGRENLWLHNQLIQSILKNACQAKAAPPIPVAGYPKTSTQSAKISPLVEDGNE
        R++  E+K+ I +K+G  R+  YF  L +FL+ ++SK+EFDK C + +GREN+ LHN+L++SILKNA  AK+ PP     YPK S     +         
Subjt:  RIDLGELKSQIVKKLGADRSKRYFFYLNRFLSQKLSKNEFDKSCCRVLGRENLWLHNQLIQSILKNACQAKAAPPIPVAGYPKTSTQSAKISPLVEDGNE

Query:  DGGAVFPTSTQNIPGWSNGGFPVSPRKCRSGIRDRKLKDRPSVLGPNGKGECISHLSANMDNGDATLCDYKRPVQHLQGVAELPENNIEVRVPQQSGKQV
                            FP SPRKCRS    RK +DRPS LGP GK + ++         D ++   +R                            
Subjt:  DGGAVFPTSTQNIPGWSNGGFPVSPRKCRSGIRDRKLKDRPSVLGPNGKGECISHLSANMDNGDATLCDYKRPVQHLQGVAELPENNIEVRVPQQSGKQV

Query:  LQNKIQVEATKVEDREEAGQSHHSSLLRSR--LLAPLGIPFCSASTGGARKTRPVDCGGDFSFSDVGHLLDTESLRRRMEQIAAVQGLGSVSADCANILN
            + +E   VED EE  Q   S  ++SR  L APLG+ F   S         ++     +    G L D  +LR R+E+   ++G+  +S D AN+LN
Subjt:  LQNKIQVEATKVEDREEAGQSHHSSLLRSR--LLAPLGIPFCSASTGGARKTRPVDCGGDFSFSDVGHLLDTESLRRRMEQIAAVQGLGSVSADCANILN

Query:  KVLDVYLKQLIRSCVDLVGAWPAYEPEKPLAHKQQIQGKVINGMLPNNQLHGRHSNGNEEVVHEHRLQCSISLLDFKVAMELNPTQLGEDWPLLLEKICM
        + L+ Y+++LI  C+ L                                              + R   ++S+LDF  AME+NP  LGE+WP+ LEKIC 
Subjt:  KVLDVYLKQLIRSCVDLVGAWPAYEPEKPLAHKQQIQGKVINGMLPNNQLHGRHSNGNEEVVHEHRLQCSISLLDFKVAMELNPTQLGEDWPLLLEKICM

Query:  RAFEE
        RA EE
Subjt:  RAFEE

AT2G24530.1 unknown protein6.4e-9447.98Show/hide
Query:  MQPQQSLRIDLGELKSQIVKKLGADRSKRYFFYLNRFLSQKLSKNEFDKSCCRVLGRENLWLHNQLIQSILKNACQAKAAPPIPVAGYPKTSTQSAKISP
        MQ  Q  RI L ELK  IVKK G +RS+RYF+YL RFLSQKL+K+EFDK+C R+LGRENL LHNQLI+SIL+NA  AK+ PP   AG+    +  A    
Subjt:  MQPQQSLRIDLGELKSQIVKKLGADRSKRYFFYLNRFLSQKLSKNEFDKSCCRVLGRENLWLHNQLIQSILKNACQAKAAPPIPVAGYPKTSTQSAKISP

Query:  LVEDGNEDGGAVFPTSTQNIPGWSNGGFPVSPRKCRSGIRDRKLKDRPSVLGPNGKGECISHL---------SANMDNGDATLCDYKRPVQHLQGVAELP
           DG E  G + P  +Q+ P WSNG  P+SPRK RSG+++RK +DRPS LG NGK E + H          S  M+NG     DY+R  +++       
Subjt:  LVEDGNEDGGAVFPTSTQNIPGWSNGGFPVSPRKCRSGIRDRKLKDRPSVLGPNGKGECISHL---------SANMDNGDATLCDYKRPVQHLQGVAELP

Query:  ENNIEVRVPQQSGKQVLQNKIQVEATKVEDREEAGQSHHSSLLRSRLLAPLGIPFCSASTGGARKTRPVDCGGD-FSFSDVGHLLDTESLRRRMEQIAAV
        E + E   P +  K  + NK ++ A  + D +   +    +L  S L+APLGIPFCSAS GG+ +T PV    +  S  D G L D E LR+RME IA  
Subjt:  ENNIEVRVPQQSGKQVLQNKIQVEATKVEDREEAGQSHHSSLLRSRLLAPLGIPFCSASTGGARKTRPVDCGGD-FSFSDVGHLLDTESLRRRMEQIAAV

Query:  QGLGSVSADCANILNKVLDVYLKQLIRSCVDLVGAWPAY-EPEKPLAHKQQIQGKVINGMLPNNQLHGRHSNGNEEVVHEHRLQCSISLLDFKVAMELNP
        QGL  VS +CA  LN +LDVYLK+LI SC DLVGA     +P K    KQQ Q K++NG+ P N L  +  NG+ ++  +H    S+S+LDF+ AMELNP
Subjt:  QGLGSVSADCANILNKVLDVYLKQLIRSCVDLVGAWPAY-EPEKPLAHKQQIQGKVINGMLPNNQLHGRHSNGNEEVVHEHRLQCSISLLDFKVAMELNP

Query:  TQLGEDWPLLLEKICMRAFEE
         QLGEDWP L E+I +R+FEE
Subjt:  TQLGEDWPLLLEKICMRAFEE

AT4G31440.1 unknown protein8.4e-7845.52Show/hide
Query:  MQPQQSLRIDLGELKSQIVKKLGADRSKRYFFYLNRFLSQKLSKNEFDKSCCRVLGRENLWLHNQLIQSILKNACQAKAAPPIPVAGYPKTSTQSAKISP
        MQ  Q  RIDL ELK  IVKK+G +RS RYF+YL RFLSQKL+K+EFDKSC R+LGRENL LHN+LI+SIL+NA  AK+ P +  +G+P  S    K   
Subjt:  MQPQQSLRIDLGELKSQIVKKLGADRSKRYFFYLNRFLSQKLSKNEFDKSCCRVLGRENLWLHNQLIQSILKNACQAKAAPPIPVAGYPKTSTQSAKISP

Query:  LVEDGNEDGGAVFPTSTQNIPGWSNGGF-PVSPRKCRSGIRDRKLKDRPSVLGPNGKGECISHLSANMDNGDATLCDYKRPVQHLQGVAELPENNIEVRV
          EDG E+  ++ P   +N    SNG    V P  C     DR ++D+P  LG NGK   +   +            Y RP ++        E +     
Subjt:  LVEDGNEDGGAVFPTSTQNIPGWSNGGF-PVSPRKCRSGIRDRKLKDRPSVLGPNGKGECISHLSANMDNGDATLCDYKRPVQHLQGVAELPENNIEVRV

Query:  PQQSGKQVLQNKIQVEATKVEDREEAGQSHHSSLLRSRLLAPLGIPFCSASTGGARKTRPVD-CGGDFSFSDVGHLLDTESLRRRMEQIAAVQGLGSVSA
        P +  ++ +  K QV A    D E    +    L    ++APLGIPFCSAS GG R+T PV       S  D G L DTE LR+RME IA  QGLG VSA
Subjt:  PQQSGKQVLQNKIQVEATKVEDREEAGQSHHSSLLRSRLLAPLGIPFCSASTGGARKTRPVD-CGGDFSFSDVGHLLDTESLRRRMEQIAAVQGLGSVSA

Query:  DCANILNKVLDVYLKQLIRSCVDLVGAWPAY-EPEKPLAHKQQIQGKVINGMLPNNQLHGRHSNGNEEVVHEHRLQCSISLLDFKVAMELNPTQLGEDWP
        +C+ +LN +LD+YLK+L++SCVDL GA      P K    KQQ + +++NG+  NN  H + SN   ++  E   Q S+SLLDF+VAMELNP QLGEDWP
Subjt:  DCANILNKVLDVYLKQLIRSCVDLVGAWPAY-EPEKPLAHKQQIQGKVINGMLPNNQLHGRHSNGNEEVVHEHRLQCSISLLDFKVAMELNPTQLGEDWP

Query:  LLLEKICMRAFEE
        LL E+I +  FEE
Subjt:  LLLEKICMRAFEE

AT4G33890.1 unknown protein5.3e-4031.89Show/hide
Query:  QQSLRIDLGELKSQIVKKLGADRSKRYFFYLNRFLSQKLSKNEFDKSCCRVLGRENLWLHNQLIQSILKNACQAKAAPPIPVAGYPKTSTQSAKISPLVE
        Q S R+D  E+K+ I +++G  R++ YF  L RF + K++K+EFDK C + +GR+N+ LHN+LI+SI+KNAC AK+ P I   G              V 
Subjt:  QQSLRIDLGELKSQIVKKLGADRSKRYFFYLNRFLSQKLSKNEFDKSCCRVLGRENLWLHNQLIQSILKNACQAKAAPPIPVAGYPKTSTQSAKISPLVE

Query:  DGNEDGGAVFPTSTQNIPGWSNGGFPVSPRKCRSGIRDRKLKDRPSVLGPNGKGECISHLSANMDNGDATLCDYKRPVQHLQGVAELPENNIEVRVPQQS
         GN D       ++Q  P   +  F  S RKCRS    RKL+DRPS LGP GK                                 L   N E     QS
Subjt:  DGNEDGGAVFPTSTQNIPGWSNGGFPVSPRKCRSGIRDRKLKDRPSVLGPNGKGECISHLSANMDNGDATLCDYKRPVQHLQGVAELPENNIEVRVPQQS

Query:  GKQVLQ--NKIQVEATKVEDREEAGQ-SHHSSLLRSR--LLAPLGIPFCSASTGGARK--TRPVDCGGDF---SFSDVGHLLDTESLRRRMEQIAAVQGL
          ++L   ++  VE   VE+ EE  Q +  S  ++SR  L APLG+   S   G  RK  +    C   F   +  + G L DT +LR R+E+   ++GL
Subjt:  GKQVLQ--NKIQVEATKVEDREEAGQ-SHHSSLLRSR--LLAPLGIPFCSASTGGARK--TRPVDCGGDF---SFSDVGHLLDTESLRRRMEQIAAVQGL

Query:  GSVSADCANILNKVLDVYLKQLIRSCVDLVGAWPAYEPEKPLAHKQQIQGKVINGMLPNNQLHGRHSNGNEEVVHEHRLQCSISLLDFKVAMELNPTQLG
          ++ D  ++LN  LDV++++LI  C+ L       +  + +                           N +   + R    +S+ DF+  MELN   LG
Subjt:  GSVSADCANILNKVLDVYLKQLIRSCVDLVGAWPAYEPEKPLAHKQQIQGKVINGMLPNNQLHGRHSNGNEEVVHEHRLQCSISLLDFKVAMELNPTQLG

Query:  EDWPLLLEKICMRAFEE
        EDWP+ +EKIC RA ++
Subjt:  EDWPLLLEKICMRAFEE

AT4G33890.2 unknown protein5.3e-4031.89Show/hide
Query:  QQSLRIDLGELKSQIVKKLGADRSKRYFFYLNRFLSQKLSKNEFDKSCCRVLGRENLWLHNQLIQSILKNACQAKAAPPIPVAGYPKTSTQSAKISPLVE
        Q S R+D  E+K+ I +++G  R++ YF  L RF + K++K+EFDK C + +GR+N+ LHN+LI+SI+KNAC AK+ P I   G              V 
Subjt:  QQSLRIDLGELKSQIVKKLGADRSKRYFFYLNRFLSQKLSKNEFDKSCCRVLGRENLWLHNQLIQSILKNACQAKAAPPIPVAGYPKTSTQSAKISPLVE

Query:  DGNEDGGAVFPTSTQNIPGWSNGGFPVSPRKCRSGIRDRKLKDRPSVLGPNGKGECISHLSANMDNGDATLCDYKRPVQHLQGVAELPENNIEVRVPQQS
         GN D       ++Q  P   +  F  S RKCRS    RKL+DRPS LGP GK                                 L   N E     QS
Subjt:  DGNEDGGAVFPTSTQNIPGWSNGGFPVSPRKCRSGIRDRKLKDRPSVLGPNGKGECISHLSANMDNGDATLCDYKRPVQHLQGVAELPENNIEVRVPQQS

Query:  GKQVLQ--NKIQVEATKVEDREEAGQ-SHHSSLLRSR--LLAPLGIPFCSASTGGARK--TRPVDCGGDF---SFSDVGHLLDTESLRRRMEQIAAVQGL
          ++L   ++  VE   VE+ EE  Q +  S  ++SR  L APLG+   S   G  RK  +    C   F   +  + G L DT +LR R+E+   ++GL
Subjt:  GKQVLQ--NKIQVEATKVEDREEAGQ-SHHSSLLRSR--LLAPLGIPFCSASTGGARK--TRPVDCGGDF---SFSDVGHLLDTESLRRRMEQIAAVQGL

Query:  GSVSADCANILNKVLDVYLKQLIRSCVDLVGAWPAYEPEKPLAHKQQIQGKVINGMLPNNQLHGRHSNGNEEVVHEHRLQCSISLLDFKVAMELNPTQLG
          ++ D  ++LN  LDV++++LI  C+ L       +  + +                           N +   + R    +S+ DF+  MELN   LG
Subjt:  GSVSADCANILNKVLDVYLKQLIRSCVDLVGAWPAYEPEKPLAHKQQIQGKVINGMLPNNQLHGRHSNGNEEVVHEHRLQCSISLLDFKVAMELNPTQLG

Query:  EDWPLLLEKICMRAFEE
        EDWP+ +EKIC RA ++
Subjt:  EDWPLLLEKICMRAFEE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCAACCTCAGCAGAGCTTGAGAATTGACTTGGGTGAATTGAAATCTCAGATAGTGAAGAAACTTGGAGCCGATCGGTCGAAACGGTACTTCTTTTACTTGAATAGGTT
CCTGAGTCAAAAGCTGAGTAAGAATGAGTTTGATAAGTCATGTTGTCGTGTACTTGGGAGGGAGAATCTTTGGCTGCATAATCAATTGATACAGTCAATTTTGAAGAATG
CTTGTCAAGCTAAGGCTGCACCACCAATACCTGTAGCAGGCTATCCGAAAACTTCGACGCAATCTGCAAAAATTTCCCCTCTCGTAGAAGATGGGAATGAAGATGGTGGA
GCTGTTTTTCCTACTTCCACGCAAAATATTCCTGGTTGGTCCAATGGAGGTTTTCCAGTTTCACCAAGAAAGTGCAGGTCTGGGATACGTGACCGCAAACTCAAAGACAG
GCCGAGTGTACTGGGGCCAAATGGGAAGGGTGAATGTATCTCACATCTATCAGCAAACATGGATAATGGTGATGCAACACTATGTGACTATAAGAGACCAGTGCAGCATC
TGCAAGGAGTTGCTGAACTACCTGAAAACAATATTGAGGTTAGAGTTCCACAACAGTCAGGAAAGCAAGTCCTACAAAATAAGATCCAGGTTGAAGCAACCAAGGTTGAA
GACAGGGAAGAAGCGGGACAGTCACATCACTCGAGTTTACTTCGGAGCCGATTACTTGCACCTCTTGGGATTCCTTTTTGCTCAGCTAGTACCGGCGGGGCCCGCAAAAC
AAGGCCGGTGGATTGTGGGGGAGATTTTAGCTTTAGTGATGTTGGTCATTTGTTGGATACTGAGTCGTTGAGACGACGCATGGAACAAATTGCTGCTGTACAGGGCCTAG
GCAGTGTTTCTGCAGATTGTGCTAATATTTTGAATAAGGTGTTGGATGTATATTTGAAGCAGTTAATTAGGTCTTGTGTTGACTTGGTCGGAGCGTGGCCTGCTTATGAG
CCTGAGAAACCTCTTGCTCATAAGCAGCAGATTCAGGGGAAGGTTATCAATGGCATGTTGCCGAATAATCAATTACACGGACGACATAGCAATGGAAACGAAGAAGTCGT
GCACGAGCACAGGTTACAGTGTTCGATATCGTTGCTTGACTTCAAGGTTGCAATGGAGCTTAATCCAACACAACTAGGGGAAGACTGGCCTTTGTTACTGGAGAAAATTT
GTATGCGTGCCTTCGAGGAATGA
mRNA sequenceShow/hide mRNA sequence
TGAATGCAGATTATAGATAGCTACGAAGAAGCAAAACTTGAATAATAATTATAAAAAGAAAAAAAGAAAGAAAAAAAAGGCGATCGTTAAAATAAAAAATAAAACTAAAT
TATAAGAGAGAGAGAAAAATAAGAGAGAGAGAGAGAGAGAGAGAGAGAGACAGAGGTTGGAGGACAGAGAGAAAGCAAACGAAAGTTCGAAGCTCCGAAGATTCGGACCG
AGCCACTTAGCTTCAATTTCTCTTCAATTTGTCAAGTTTGTCAAAGGTCAGAGCTGAAGCAATCGCTATGTCTTTGTTGTTCAAATGCTATTGAGCTTAGTCGGCGAGAT
TCAATGTGCTTCATATCTCTGCAATTTGTGCTCTGTTCACTCTACTCAACTTCGCTCAACTTACTGATCTCTTTATTCGACTTCGAAATTTCGTGTTTCGAACTTCAGAG
CTTGCAAATTCTGTTCAAAGAATGATGGGTTTGTATTGGTTCTTCGACCCAATTCGCGTTTTTCGGTAGTTGTTGATTTCTTTGTTTGTTCCCGTGTTCTGAACTTGGGT
TTCGTTGTTTTTCCAAGTTAGGGTTTTTATTTGCGATTGTTTTGCAATAGGTTTGATTTGGTTTTTGAAATTTGGCGGGATTTAGTGAGCCATGTAGTGGGTTTAACTTC
TAGGGTTGTGGTTTTTGAGAAGTTGCGATTTCGAGTTGCATTTTGTGCTTTGTAGTTGGGGAAAGAAAAGGATCAGTTGGAAATATGATTGCTGCAATTCTGATGAAATC
CGTTCTGGGATTTCCTTGATCTGTTGGGGTCGAATTTTGGGAACTGGCTCAAATGATGCCGACCTTGAAAAATTAATCTGCATTGATTGATATGTGGGAGTGAAAATTTT
GTGTTTCTTTTTTCTTTTTCTTCGTTTGTTGGCAAAGTTTCGTGTCTGTAATTGAGTGATTTTGTAAAGGGTGTGTGAATCTGTCTGCTGAGAAATGCAACCTCAGCAGA
GCTTGAGAATTGACTTGGGTGAATTGAAATCTCAGATAGTGAAGAAACTTGGAGCCGATCGGTCGAAACGGTACTTCTTTTACTTGAATAGGTTCCTGAGTCAAAAGCTG
AGTAAGAATGAGTTTGATAAGTCATGTTGTCGTGTACTTGGGAGGGAGAATCTTTGGCTGCATAATCAATTGATACAGTCAATTTTGAAGAATGCTTGTCAAGCTAAGGC
TGCACCACCAATACCTGTAGCAGGCTATCCGAAAACTTCGACGCAATCTGCAAAAATTTCCCCTCTCGTAGAAGATGGGAATGAAGATGGTGGAGCTGTTTTTCCTACTT
CCACGCAAAATATTCCTGGTTGGTCCAATGGAGGTTTTCCAGTTTCACCAAGAAAGTGCAGGTCTGGGATACGTGACCGCAAACTCAAAGACAGGCCGAGTGTACTGGGG
CCAAATGGGAAGGGTGAATGTATCTCACATCTATCAGCAAACATGGATAATGGTGATGCAACACTATGTGACTATAAGAGACCAGTGCAGCATCTGCAAGGAGTTGCTGA
ACTACCTGAAAACAATATTGAGGTTAGAGTTCCACAACAGTCAGGAAAGCAAGTCCTACAAAATAAGATCCAGGTTGAAGCAACCAAGGTTGAAGACAGGGAAGAAGCGG
GACAGTCACATCACTCGAGTTTACTTCGGAGCCGATTACTTGCACCTCTTGGGATTCCTTTTTGCTCAGCTAGTACCGGCGGGGCCCGCAAAACAAGGCCGGTGGATTGT
GGGGGAGATTTTAGCTTTAGTGATGTTGGTCATTTGTTGGATACTGAGTCGTTGAGACGACGCATGGAACAAATTGCTGCTGTACAGGGCCTAGGCAGTGTTTCTGCAGA
TTGTGCTAATATTTTGAATAAGGTGTTGGATGTATATTTGAAGCAGTTAATTAGGTCTTGTGTTGACTTGGTCGGAGCGTGGCCTGCTTATGAGCCTGAGAAACCTCTTG
CTCATAAGCAGCAGATTCAGGGGAAGGTTATCAATGGCATGTTGCCGAATAATCAATTACACGGACGACATAGCAATGGAAACGAAGAAGTCGTGCACGAGCACAGGTTA
CAGTGTTCGATATCGTTGCTTGACTTCAAGGTTGCAATGGAGCTTAATCCAACACAACTAGGGGAAGACTGGCCTTTGTTACTGGAGAAAATTTGTATGCGTGCCTTCGA
GGAATGAAACCACTCTGGTATTTCTGTTTATCCCATTCCACAGTGTAGCTGACCTGTCACAGTTCAAAGGGGTTGAGAATAGATCTCAAAGACCTCTCAGCTCTGCTCAT
TTGTGAGTAAACAAAGATTGCTCAAATCTTGATAAAAGCCCTGGCTTGCAAGAATACAAGCAGATCAAACATCAATACCGATGTTCATTCTTGGGAATCATTTAACTGTG
ACTAAGCTTGGGCACTGTCAGTCAGAATGCCCATCAGCGATTGATCTACGTTTCATCTTTAGCCATGATTTAAGGGCCTCAAGCTTTTCATGTAAATTTAGATCACTGAG
GAATTATACAAAATTTTGCAGAGATACAGTTTGTAGCTTGGTGAGAGGTAACCACTGGAGCAAGAACTTAGTGTTGCTATTTGTGTATTTAAATCTGAGAATTGATTTGA
ATTTCATATGAATAACTCATCAACAGACAACAATCTGCAGATAGTATCCTCTGTA
Protein sequenceShow/hide protein sequence
MQPQQSLRIDLGELKSQIVKKLGADRSKRYFFYLNRFLSQKLSKNEFDKSCCRVLGRENLWLHNQLIQSILKNACQAKAAPPIPVAGYPKTSTQSAKISPLVEDGNEDGG
AVFPTSTQNIPGWSNGGFPVSPRKCRSGIRDRKLKDRPSVLGPNGKGECISHLSANMDNGDATLCDYKRPVQHLQGVAELPENNIEVRVPQQSGKQVLQNKIQVEATKVE
DREEAGQSHHSSLLRSRLLAPLGIPFCSASTGGARKTRPVDCGGDFSFSDVGHLLDTESLRRRMEQIAAVQGLGSVSADCANILNKVLDVYLKQLIRSCVDLVGAWPAYE
PEKPLAHKQQIQGKVINGMLPNNQLHGRHSNGNEEVVHEHRLQCSISLLDFKVAMELNPTQLGEDWPLLLEKICMRAFEE