CuGenDBv2

CsGy2G022490 (gene) of Cucumber (Gy14) v2.1 genome

Gene ID: CsGy2G022490
Organism: Cucumis sativus L. var. sativus cv. Gy14 (Cucumber (Gy14) v2.1)
Description: SAGA-Tad1 domain-containing protein
Genome location: Gy14Chr2:30826384..30828975
RNA-Seq Expression: CsGy2G022490
Synteny: CsGy2G022490
Gene Ontology terms:
GO:0006357 - regulation of transcription by RNA polymerase II (biological process)
GO:0045893 - positive regulation of transcription, DNA-templated (biological process)
GO:0000124 - SAGA complex (cellular component)
GO:0003713 - transcription coactivator activity (molecular function)
InterPro domains: IPR024738 - Transcriptional coactivator Hfi1/Transcriptional adapter 1
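
The genome location listed above uses a `chromosome:start..end` notation. As a minimal sketch of how such a string could be split into its parts, the following uses two hypothetical helpers, `parse_location` and `span_length`. Treating the coordinates as 1-based and inclusive is an assumption; this page does not state the convention.

```python
import re


def parse_location(location):
    """Split a 'chromosome:start..end' string into (chromosome, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    chrom, start, end = match.groups()
    return chrom, int(start), int(end)


def span_length(start, end):
    # Assumption: coordinates are 1-based and inclusive at both ends.
    return end - start + 1


chrom, start, end = parse_location("Gy14Chr2:30826384..30828975")
print(chrom, start, end, span_length(start, end))
```

Under the inclusive-coordinate assumption, the span covers 2592 positions on Gy14Chr2.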


Homology
GenBank top hits (e value, % identity, alignment)
XP_004138880.1 uncharacterized protein LOC101213741 [Cucumis sativus] (e value: 1.82e-301; % identity: 100)
Query:  MQPQQSLRIDLGELKSQIVKKLGADRSKRYFFYLNRFLSQKLSKNEFDKSCCRVLGRENLWLHNQLIQSILKNACQAKVAPPIPVAGYPKTSTQSAKISP
        MQPQQSLRIDLGELKSQIVKKLGADRSKRYFFYLNRFLSQKLSKNEFDKSCCRVLGRENLWLHNQLIQSILKNACQAKVAPPIPVAGYPKTSTQSAKISP
Subjt:  MQPQQSLRIDLGELKSQIVKKLGADRSKRYFFYLNRFLSQKLSKNEFDKSCCRVLGRENLWLHNQLIQSILKNACQAKVAPPIPVAGYPKTSTQSAKISP

Query:  LVEDGNEDGGAVFPTSTQNIPGWSNGVSPRKCRSGIRDRKLKDRPSILGPNGKVECISHLSANMDNGDATLCDYKRPVQNLQGIAELPENNIEVRVPQPS
        LVEDGNEDGGAVFPTSTQNIPGWSNGVSPRKCRSGIRDRKLKDRPSILGPNGKVECISHLSANMDNGDATLCDYKRPVQNLQGIAELPENNIEVRVPQPS
Subjt:  LVEDGNEDGGAVFPTSTQNIPGWSNGVSPRKCRSGIRDRKLKDRPSILGPNGKVECISHLSANMDNGDATLCDYKRPVQNLQGIAELPENNIEVRVPQPS

Query:  GKQDLQNKIQVEATKVEDREEAGQSNHSSLLRSRLLAPLGIPFCSASIGGARKTRPVDCGGDFSLSDVGHLLDTESLRRRMEQIAAVQGLGSVSADCANI
        GKQDLQNKIQVEATKVEDREEAGQSNHSSLLRSRLLAPLGIPFCSASIGGARKTRPVDCGGDFSLSDVGHLLDTESLRRRMEQIAAVQGLGSVSADCANI
Subjt:  GKQDLQNKIQVEATKVEDREEAGQSNHSSLLRSRLLAPLGIPFCSASIGGARKTRPVDCGGDFSLSDVGHLLDTESLRRRMEQIAAVQGLGSVSADCANI

Query:  LNKVLDVYLKQLIRSCVDLVGAWPAYEPEKPLSHKQQFQGKVINGMLPNNQLHGRHSNGSEEVVHEHRLQCSISLLDFKVAMELNPTQLGEDWPLLLEKI
        LNKVLDVYLKQLIRSCVDLVGAWPAYEPEKPLSHKQQFQGKVINGMLPNNQLHGRHSNGSEEVVHEHRLQCSISLLDFKVAMELNPTQLGEDWPLLLEKI
Subjt:  LNKVLDVYLKQLIRSCVDLVGAWPAYEPEKPLSHKQQFQGKVINGMLPNNQLHGRHSNGSEEVVHEHRLQCSISLLDFKVAMELNPTQLGEDWPLLLEKI

Query:  CMRTFGE
        CMRTFGE
Subjt:  CMRTFGE

XP_008445087.1 PREDICTED: uncharacterized protein LOC103488231 [Cucumis melo] (e value: 1.48e-291; % identity: 96.31)
Query:  MQPQQSLRIDLGELKSQIVKKLGADRSKRYFFYLNRFLSQKLSKNEFDKSCCRVLGRENLWLHNQLIQSILKNACQAKVAPPIPVAGYPKTSTQSAKISP
        MQPQQSLRIDLGELKSQIVKKLGADRSKRYFFYLNRFLSQKLSKNEFDKSCCRVLGRENLWLHNQLIQSILKNACQAK APPIPVAGYPKTSTQSAKISP
Subjt:  MQPQQSLRIDLGELKSQIVKKLGADRSKRYFFYLNRFLSQKLSKNEFDKSCCRVLGRENLWLHNQLIQSILKNACQAKVAPPIPVAGYPKTSTQSAKISP

Query:  LVEDGNEDGGAVFPTSTQNIPGWSNGVSPRKCRSGIRDRKLKDRPSILGPNGKVECISHLSANMDNGDATLCDYKRPVQNLQGIAELPENNIEVRVPQPS
        LVEDGNEDGGAVFPTSTQNIPGWSNGVSPRKCRSGIRDRKLKDRPS+LGPNGKVECISHLSANMDNGDATLCDYKRPVQ+LQG+AELPENNIEVRVPQPS
Subjt:  LVEDGNEDGGAVFPTSTQNIPGWSNGVSPRKCRSGIRDRKLKDRPSILGPNGKVECISHLSANMDNGDATLCDYKRPVQNLQGIAELPENNIEVRVPQPS

Query:  GKQDLQNKIQVEATKVEDREEAGQSNHSSLLRSRLLAPLGIPFCSASIGGARKTRPVDCGGDFSLSDVGHLLDTESLRRRMEQIAAVQGLGSVSADCANI
        GKQ L NKIQVEATKVEDREEAGQSNHSSLLRSRLLAPLGIPFCSAS GG  KTRPVDCGGDFS  DVGHLLDTESLRRRMEQIAAVQGLGSVSADCANI
Subjt:  GKQDLQNKIQVEATKVEDREEAGQSNHSSLLRSRLLAPLGIPFCSASIGGARKTRPVDCGGDFSLSDVGHLLDTESLRRRMEQIAAVQGLGSVSADCANI

Query:  LNKVLDVYLKQLIRSCVDLVGAWPAYEPEKPLSHKQQFQGKVINGMLPNNQLHGRHSNGSEEVVHEHRLQCSISLLDFKVAMELNPTQLGEDWPLLLEKI
        LNKVLDVYLKQLIRSCVDLVGAWPAYEPEKPL+HKQQ QGKVINGMLPNNQLHGRHSNG+EEVVHEHRLQCSISLLDFKVAMELNPTQLGEDWPLLLEKI
Subjt:  LNKVLDVYLKQLIRSCVDLVGAWPAYEPEKPLSHKQQFQGKVINGMLPNNQLHGRHSNGSEEVVHEHRLQCSISLLDFKVAMELNPTQLGEDWPLLLEKI

Query:  CMRTFGE
        CMR FGE
Subjt:  CMRTFGE

XP_022962598.1 uncharacterized protein LOC111463000 [Cucurbita moschata] (e value: 3.19e-246; % identity: 83.77)
Query:  MQPQQSLRIDLGELKSQIVKKLGADRSKRYFFYLNRFLSQKLSKNEFDKSCCRVLGRENLWLHNQLIQSILKNACQAKVAPPIPVAGYPKTSTQSAKISP
        MQPQQSLRIDL ELKSQIVKKLG DRSKRYFFYLNRFLSQKLSKNEFDK CCRVLGRENLWLHNQLIQSILKNACQAK APPIP AGYPKTSTQ+AKISP
Subjt:  MQPQQSLRIDLGELKSQIVKKLGADRSKRYFFYLNRFLSQKLSKNEFDKSCCRVLGRENLWLHNQLIQSILKNACQAKVAPPIPVAGYPKTSTQSAKISP

Query:  LVEDGNEDGGAVFPTSTQNIPGWSNG---VSPRKCRSGIRDRKLKDRPSILGPNGKVECISHLSA---------NMDNGDATLCDYKRPVQNLQGIAELP
        ++EDGNEDGGAVFPTSTQ IP WSN    VSPRKCRSGIRDRKLKDRPS+L PN KVECIS  SA          MDNG+AT CDY+RPVQ+LQG+ ELP
Subjt:  LVEDGNEDGGAVFPTSTQNIPGWSNG---VSPRKCRSGIRDRKLKDRPSILGPNGKVECISHLSA---------NMDNGDATLCDYKRPVQNLQGIAELP

Query:  ENNIEVRVPQPSGKQDLQNKIQVEATKVEDREEAGQSNHSSLLRSRLLAPLGIPFCSASIGGARKTRPVDCGGDFSLSDVGHLLDTESLRRRMEQIAAVQ
        ENNIE RV +PSGKQ LQ        +VEDREEA QSN SSLLRSRLLAPLGIPFCSASIGGA KTRPVDCGG+FS SD+GHLLDTESLRRRMEQIAAVQ
Subjt:  ENNIEVRVPQPSGKQDLQNKIQVEATKVEDREEAGQSNHSSLLRSRLLAPLGIPFCSASIGGARKTRPVDCGGDFSLSDVGHLLDTESLRRRMEQIAAVQ

Query:  GLGSVSADCANILNKVLDVYLKQLIRSCVDLVGAWPAYEPEKPLSHKQQFQGKVINGMLPNNQLHGRHSNGSEEVVHEHRLQCSISLLDFKVAMELNPTQ
        GLGSVSADCANILNKVLDVYLKQLIRSCVDLVGAWPA+EPEKPL+H QQ QGKVINGMLPNNQLH  HSNG+ EVVHE RL CSISLLDFKVAMELNP Q
Subjt:  GLGSVSADCANILNKVLDVYLKQLIRSCVDLVGAWPAYEPEKPLSHKQQFQGKVINGMLPNNQLHGRHSNGSEEVVHEHRLQCSISLLDFKVAMELNPTQ

Query:  LGEDWPLLLEKICMRTFGE
        LGEDWPLLLEKI MR F E
Subjt:  LGEDWPLLLEKICMRTFGE

XP_022997521.1 uncharacterized protein LOC111492414 [Cucurbita maxima] (e value: 4.02e-248; % identity: 84.01)
Query:  MQPQQSLRIDLGELKSQIVKKLGADRSKRYFFYLNRFLSQKLSKNEFDKSCCRVLGRENLWLHNQLIQSILKNACQAKVAPPIPVAGYPKTSTQSAKISP
        MQPQQSLRIDL ELKSQIVKKLG DRSKRYFFYLNRFLSQKLSKNEFDK CCRVLGRENLWLHNQLIQSILKNACQAK APPIP AGYPKTSTQ+AKISP
Subjt:  MQPQQSLRIDLGELKSQIVKKLGADRSKRYFFYLNRFLSQKLSKNEFDKSCCRVLGRENLWLHNQLIQSILKNACQAKVAPPIPVAGYPKTSTQSAKISP

Query:  LVEDGNEDGGAVFPTSTQNIPGWSN---GVSPRKCRSGIRDRKLKDRPSILGPNGKVECISHLSA---------NMDNGDATLCDYKRPVQNLQGIAELP
        ++EDGNEDGGAVF TSTQ IP WSN    +SPRKCRSGIRDRKLKDRPS+L PN KVECIS  SA          MDNG+AT CDY+RPVQ+LQG+ ELP
Subjt:  LVEDGNEDGGAVFPTSTQNIPGWSN---GVSPRKCRSGIRDRKLKDRPSILGPNGKVECISHLSA---------NMDNGDATLCDYKRPVQNLQGIAELP

Query:  ENNIEVRVPQPSGKQDLQNKIQVEATKVEDREEAGQSNHSSLLRSRLLAPLGIPFCSASIGGARKTRPVDCGGDFSLSDVGHLLDTESLRRRMEQIAAVQ
        ENNIE RV +PSGKQ LQ  +QVE TKVEDREEA QSN SSLLRSRLLAPLGIPFCSASIGGA KTRPVDCGG+FS SD+GHLLDTESLRRRMEQIAAVQ
Subjt:  ENNIEVRVPQPSGKQDLQNKIQVEATKVEDREEAGQSNHSSLLRSRLLAPLGIPFCSASIGGARKTRPVDCGGDFSLSDVGHLLDTESLRRRMEQIAAVQ

Query:  GLGSVSADCANILNKVLDVYLKQLIRSCVDLVGAWPAYEPEKPLSHKQQFQGKVINGMLPNNQLHGRHSNGSEEVVHEHRLQCSISLLDFKVAMELNPTQ
        GLGSVSADCANILNKVLDVYLKQLIRSCVDLVG WP +EPEKPL+H QQ QGKVINGMLPNNQLH  HSNG+ EVVHE RL CSISLLDFKVAMELNP Q
Subjt:  GLGSVSADCANILNKVLDVYLKQLIRSCVDLVGAWPAYEPEKPLSHKQQFQGKVINGMLPNNQLHGRHSNGSEEVVHEHRLQCSISLLDFKVAMELNPTQ

Query:  LGEDWPLLLEKICMRTFGE
        LGEDWPLLLEKI MR F E
Subjt:  LGEDWPLLLEKICMRTFGE

XP_023546134.1 uncharacterized protein LOC111805335 [Cucurbita pepo subsp. pepo] (e value: 4.21e-250; % identity: 84.73)
Query:  MQPQQSLRIDLGELKSQIVKKLGADRSKRYFFYLNRFLSQKLSKNEFDKSCCRVLGRENLWLHNQLIQSILKNACQAKVAPPIPVAGYPKTSTQSAKISP
        MQPQQSLRIDL ELKSQIVKKLG DRSKRYFFYLNRFLSQKLSKNEFDK CCRVLGRENLWLHNQLIQSILKNACQAK APPIP AGYPKTSTQ+AKISP
Subjt:  MQPQQSLRIDLGELKSQIVKKLGADRSKRYFFYLNRFLSQKLSKNEFDKSCCRVLGRENLWLHNQLIQSILKNACQAKVAPPIPVAGYPKTSTQSAKISP

Query:  LVEDGNEDGGAVFPTSTQNIPGWSNG---VSPRKCRSGIRDRKLKDRPSILGPNGKVECISHLSA---------NMDNGDATLCDYKRPVQNLQGIAELP
        ++EDGNEDGGAVFPTSTQ IP WSN    VSPRKCRSGIRDRKLKDRPS+L PN KVECIS  SA          +DNG+AT CDY+RPVQ+LQG+ ELP
Subjt:  LVEDGNEDGGAVFPTSTQNIPGWSNG---VSPRKCRSGIRDRKLKDRPSILGPNGKVECISHLSA---------NMDNGDATLCDYKRPVQNLQGIAELP

Query:  ENNIEVRVPQPSGKQDLQNKIQVEATKVEDREEAGQSNHSSLLRSRLLAPLGIPFCSASIGGARKTRPVDCGGDFSLSDVGHLLDTESLRRRMEQIAAVQ
        ENNIE RV +PSGKQ LQ  +QVE TKVEDREEA QSN SSLLRSRLLAPLGIPFCSASIGGA KTRPVDCGG+FS SD+GHLLDTESLRRRMEQIAAVQ
Subjt:  ENNIEVRVPQPSGKQDLQNKIQVEATKVEDREEAGQSNHSSLLRSRLLAPLGIPFCSASIGGARKTRPVDCGGDFSLSDVGHLLDTESLRRRMEQIAAVQ

Query:  GLGSVSADCANILNKVLDVYLKQLIRSCVDLVGAWPAYEPEKPLSHKQQFQGKVINGMLPNNQLHGRHSNGSEEVVHEHRLQCSISLLDFKVAMELNPTQ
        GLGSVSADCANILNKVLDVYLKQLIRSCVDLVGAWPA+EPEKPL+H QQ QGKVINGMLPNNQLH  HSNG+ EVVHE RL CSISLLDFKVAMELNP Q
Subjt:  GLGSVSADCANILNKVLDVYLKQLIRSCVDLVGAWPAYEPEKPLSHKQQFQGKVINGMLPNNQLHGRHSNGSEEVVHEHRLQCSISLLDFKVAMELNPTQ

Query:  LGEDWPLLLEKICMRTFGE
        LGEDWPLLLEKI MR F E
Subjt:  LGEDWPLLLEKICMRTFGE

TrEMBL top hits (e value, % identity, alignment)
A0A0A0LM32 Uncharacterized protein (e value: 8.83e-302; % identity: 100)
Query:  MQPQQSLRIDLGELKSQIVKKLGADRSKRYFFYLNRFLSQKLSKNEFDKSCCRVLGRENLWLHNQLIQSILKNACQAKVAPPIPVAGYPKTSTQSAKISP
        MQPQQSLRIDLGELKSQIVKKLGADRSKRYFFYLNRFLSQKLSKNEFDKSCCRVLGRENLWLHNQLIQSILKNACQAKVAPPIPVAGYPKTSTQSAKISP
Subjt:  MQPQQSLRIDLGELKSQIVKKLGADRSKRYFFYLNRFLSQKLSKNEFDKSCCRVLGRENLWLHNQLIQSILKNACQAKVAPPIPVAGYPKTSTQSAKISP

Query:  LVEDGNEDGGAVFPTSTQNIPGWSNGVSPRKCRSGIRDRKLKDRPSILGPNGKVECISHLSANMDNGDATLCDYKRPVQNLQGIAELPENNIEVRVPQPS
        LVEDGNEDGGAVFPTSTQNIPGWSNGVSPRKCRSGIRDRKLKDRPSILGPNGKVECISHLSANMDNGDATLCDYKRPVQNLQGIAELPENNIEVRVPQPS
Subjt:  LVEDGNEDGGAVFPTSTQNIPGWSNGVSPRKCRSGIRDRKLKDRPSILGPNGKVECISHLSANMDNGDATLCDYKRPVQNLQGIAELPENNIEVRVPQPS

Query:  GKQDLQNKIQVEATKVEDREEAGQSNHSSLLRSRLLAPLGIPFCSASIGGARKTRPVDCGGDFSLSDVGHLLDTESLRRRMEQIAAVQGLGSVSADCANI
        GKQDLQNKIQVEATKVEDREEAGQSNHSSLLRSRLLAPLGIPFCSASIGGARKTRPVDCGGDFSLSDVGHLLDTESLRRRMEQIAAVQGLGSVSADCANI
Subjt:  GKQDLQNKIQVEATKVEDREEAGQSNHSSLLRSRLLAPLGIPFCSASIGGARKTRPVDCGGDFSLSDVGHLLDTESLRRRMEQIAAVQGLGSVSADCANI

Query:  LNKVLDVYLKQLIRSCVDLVGAWPAYEPEKPLSHKQQFQGKVINGMLPNNQLHGRHSNGSEEVVHEHRLQCSISLLDFKVAMELNPTQLGEDWPLLLEKI
        LNKVLDVYLKQLIRSCVDLVGAWPAYEPEKPLSHKQQFQGKVINGMLPNNQLHGRHSNGSEEVVHEHRLQCSISLLDFKVAMELNPTQLGEDWPLLLEKI
Subjt:  LNKVLDVYLKQLIRSCVDLVGAWPAYEPEKPLSHKQQFQGKVINGMLPNNQLHGRHSNGSEEVVHEHRLQCSISLLDFKVAMELNPTQLGEDWPLLLEKI

Query:  CMRTFGE
        CMRTFGE
Subjt:  CMRTFGE

A0A1S3BCQ5 uncharacterized protein LOC103488231 (e value: 7.15e-292; % identity: 96.31)
Query:  MQPQQSLRIDLGELKSQIVKKLGADRSKRYFFYLNRFLSQKLSKNEFDKSCCRVLGRENLWLHNQLIQSILKNACQAKVAPPIPVAGYPKTSTQSAKISP
        MQPQQSLRIDLGELKSQIVKKLGADRSKRYFFYLNRFLSQKLSKNEFDKSCCRVLGRENLWLHNQLIQSILKNACQAK APPIPVAGYPKTSTQSAKISP
Subjt:  MQPQQSLRIDLGELKSQIVKKLGADRSKRYFFYLNRFLSQKLSKNEFDKSCCRVLGRENLWLHNQLIQSILKNACQAKVAPPIPVAGYPKTSTQSAKISP

Query:  LVEDGNEDGGAVFPTSTQNIPGWSNGVSPRKCRSGIRDRKLKDRPSILGPNGKVECISHLSANMDNGDATLCDYKRPVQNLQGIAELPENNIEVRVPQPS
        LVEDGNEDGGAVFPTSTQNIPGWSNGVSPRKCRSGIRDRKLKDRPS+LGPNGKVECISHLSANMDNGDATLCDYKRPVQ+LQG+AELPENNIEVRVPQPS
Subjt:  LVEDGNEDGGAVFPTSTQNIPGWSNGVSPRKCRSGIRDRKLKDRPSILGPNGKVECISHLSANMDNGDATLCDYKRPVQNLQGIAELPENNIEVRVPQPS

Query:  GKQDLQNKIQVEATKVEDREEAGQSNHSSLLRSRLLAPLGIPFCSASIGGARKTRPVDCGGDFSLSDVGHLLDTESLRRRMEQIAAVQGLGSVSADCANI
        GKQ L NKIQVEATKVEDREEAGQSNHSSLLRSRLLAPLGIPFCSAS GG  KTRPVDCGGDFS  DVGHLLDTESLRRRMEQIAAVQGLGSVSADCANI
Subjt:  GKQDLQNKIQVEATKVEDREEAGQSNHSSLLRSRLLAPLGIPFCSASIGGARKTRPVDCGGDFSLSDVGHLLDTESLRRRMEQIAAVQGLGSVSADCANI

Query:  LNKVLDVYLKQLIRSCVDLVGAWPAYEPEKPLSHKQQFQGKVINGMLPNNQLHGRHSNGSEEVVHEHRLQCSISLLDFKVAMELNPTQLGEDWPLLLEKI
        LNKVLDVYLKQLIRSCVDLVGAWPAYEPEKPL+HKQQ QGKVINGMLPNNQLHGRHSNG+EEVVHEHRLQCSISLLDFKVAMELNPTQLGEDWPLLLEKI
Subjt:  LNKVLDVYLKQLIRSCVDLVGAWPAYEPEKPLSHKQQFQGKVINGMLPNNQLHGRHSNGSEEVVHEHRLQCSISLLDFKVAMELNPTQLGEDWPLLLEKI

Query:  CMRTFGE
        CMR FGE
Subjt:  CMRTFGE

A0A5A7VF96 SAGA-Tad1 domain-containing protein (e value: 7.15e-292; % identity: 96.31)
Query:  MQPQQSLRIDLGELKSQIVKKLGADRSKRYFFYLNRFLSQKLSKNEFDKSCCRVLGRENLWLHNQLIQSILKNACQAKVAPPIPVAGYPKTSTQSAKISP
        MQPQQSLRIDLGELKSQIVKKLGADRSKRYFFYLNRFLSQKLSKNEFDKSCCRVLGRENLWLHNQLIQSILKNACQAK APPIPVAGYPKTSTQSAKISP
Subjt:  MQPQQSLRIDLGELKSQIVKKLGADRSKRYFFYLNRFLSQKLSKNEFDKSCCRVLGRENLWLHNQLIQSILKNACQAKVAPPIPVAGYPKTSTQSAKISP

Query:  LVEDGNEDGGAVFPTSTQNIPGWSNGVSPRKCRSGIRDRKLKDRPSILGPNGKVECISHLSANMDNGDATLCDYKRPVQNLQGIAELPENNIEVRVPQPS
        LVEDGNEDGGAVFPTSTQNIPGWSNGVSPRKCRSGIRDRKLKDRPS+LGPNGKVECISHLSANMDNGDATLCDYKRPVQ+LQG+AELPENNIEVRVPQPS
Subjt:  LVEDGNEDGGAVFPTSTQNIPGWSNGVSPRKCRSGIRDRKLKDRPSILGPNGKVECISHLSANMDNGDATLCDYKRPVQNLQGIAELPENNIEVRVPQPS

Query:  GKQDLQNKIQVEATKVEDREEAGQSNHSSLLRSRLLAPLGIPFCSASIGGARKTRPVDCGGDFSLSDVGHLLDTESLRRRMEQIAAVQGLGSVSADCANI
        GKQ L NKIQVEATKVEDREEAGQSNHSSLLRSRLLAPLGIPFCSAS GG  KTRPVDCGGDFS  DVGHLLDTESLRRRMEQIAAVQGLGSVSADCANI
Subjt:  GKQDLQNKIQVEATKVEDREEAGQSNHSSLLRSRLLAPLGIPFCSASIGGARKTRPVDCGGDFSLSDVGHLLDTESLRRRMEQIAAVQGLGSVSADCANI

Query:  LNKVLDVYLKQLIRSCVDLVGAWPAYEPEKPLSHKQQFQGKVINGMLPNNQLHGRHSNGSEEVVHEHRLQCSISLLDFKVAMELNPTQLGEDWPLLLEKI
        LNKVLDVYLKQLIRSCVDLVGAWPAYEPEKPL+HKQQ QGKVINGMLPNNQLHGRHSNG+EEVVHEHRLQCSISLLDFKVAMELNPTQLGEDWPLLLEKI
Subjt:  LNKVLDVYLKQLIRSCVDLVGAWPAYEPEKPLSHKQQFQGKVINGMLPNNQLHGRHSNGSEEVVHEHRLQCSISLLDFKVAMELNPTQLGEDWPLLLEKI

Query:  CMRTFGE
        CMR FGE
Subjt:  CMRTFGE

A0A6J1HF85 uncharacterized protein LOC111463000 (e value: 1.54e-246; % identity: 83.77)
Query:  MQPQQSLRIDLGELKSQIVKKLGADRSKRYFFYLNRFLSQKLSKNEFDKSCCRVLGRENLWLHNQLIQSILKNACQAKVAPPIPVAGYPKTSTQSAKISP
        MQPQQSLRIDL ELKSQIVKKLG DRSKRYFFYLNRFLSQKLSKNEFDK CCRVLGRENLWLHNQLIQSILKNACQAK APPIP AGYPKTSTQ+AKISP
Subjt:  MQPQQSLRIDLGELKSQIVKKLGADRSKRYFFYLNRFLSQKLSKNEFDKSCCRVLGRENLWLHNQLIQSILKNACQAKVAPPIPVAGYPKTSTQSAKISP

Query:  LVEDGNEDGGAVFPTSTQNIPGWSNG---VSPRKCRSGIRDRKLKDRPSILGPNGKVECISHLSA---------NMDNGDATLCDYKRPVQNLQGIAELP
        ++EDGNEDGGAVFPTSTQ IP WSN    VSPRKCRSGIRDRKLKDRPS+L PN KVECIS  SA          MDNG+AT CDY+RPVQ+LQG+ ELP
Subjt:  LVEDGNEDGGAVFPTSTQNIPGWSNG---VSPRKCRSGIRDRKLKDRPSILGPNGKVECISHLSA---------NMDNGDATLCDYKRPVQNLQGIAELP

Query:  ENNIEVRVPQPSGKQDLQNKIQVEATKVEDREEAGQSNHSSLLRSRLLAPLGIPFCSASIGGARKTRPVDCGGDFSLSDVGHLLDTESLRRRMEQIAAVQ
        ENNIE RV +PSGKQ LQ        +VEDREEA QSN SSLLRSRLLAPLGIPFCSASIGGA KTRPVDCGG+FS SD+GHLLDTESLRRRMEQIAAVQ
Subjt:  ENNIEVRVPQPSGKQDLQNKIQVEATKVEDREEAGQSNHSSLLRSRLLAPLGIPFCSASIGGARKTRPVDCGGDFSLSDVGHLLDTESLRRRMEQIAAVQ

Query:  GLGSVSADCANILNKVLDVYLKQLIRSCVDLVGAWPAYEPEKPLSHKQQFQGKVINGMLPNNQLHGRHSNGSEEVVHEHRLQCSISLLDFKVAMELNPTQ
        GLGSVSADCANILNKVLDVYLKQLIRSCVDLVGAWPA+EPEKPL+H QQ QGKVINGMLPNNQLH  HSNG+ EVVHE RL CSISLLDFKVAMELNP Q
Subjt:  GLGSVSADCANILNKVLDVYLKQLIRSCVDLVGAWPAYEPEKPLSHKQQFQGKVINGMLPNNQLHGRHSNGSEEVVHEHRLQCSISLLDFKVAMELNPTQ

Query:  LGEDWPLLLEKICMRTFGE
        LGEDWPLLLEKI MR F E
Subjt:  LGEDWPLLLEKICMRTFGE

A0A6J1K7Q1 uncharacterized protein LOC111492414 (e value: 1.95e-248; % identity: 84.01)
Query:  MQPQQSLRIDLGELKSQIVKKLGADRSKRYFFYLNRFLSQKLSKNEFDKSCCRVLGRENLWLHNQLIQSILKNACQAKVAPPIPVAGYPKTSTQSAKISP
        MQPQQSLRIDL ELKSQIVKKLG DRSKRYFFYLNRFLSQKLSKNEFDK CCRVLGRENLWLHNQLIQSILKNACQAK APPIP AGYPKTSTQ+AKISP
Subjt:  MQPQQSLRIDLGELKSQIVKKLGADRSKRYFFYLNRFLSQKLSKNEFDKSCCRVLGRENLWLHNQLIQSILKNACQAKVAPPIPVAGYPKTSTQSAKISP

Query:  LVEDGNEDGGAVFPTSTQNIPGWSN---GVSPRKCRSGIRDRKLKDRPSILGPNGKVECISHLSA---------NMDNGDATLCDYKRPVQNLQGIAELP
        ++EDGNEDGGAVF TSTQ IP WSN    +SPRKCRSGIRDRKLKDRPS+L PN KVECIS  SA          MDNG+AT CDY+RPVQ+LQG+ ELP
Subjt:  LVEDGNEDGGAVFPTSTQNIPGWSN---GVSPRKCRSGIRDRKLKDRPSILGPNGKVECISHLSA---------NMDNGDATLCDYKRPVQNLQGIAELP

Query:  ENNIEVRVPQPSGKQDLQNKIQVEATKVEDREEAGQSNHSSLLRSRLLAPLGIPFCSASIGGARKTRPVDCGGDFSLSDVGHLLDTESLRRRMEQIAAVQ
        ENNIE RV +PSGKQ LQ  +QVE TKVEDREEA QSN SSLLRSRLLAPLGIPFCSASIGGA KTRPVDCGG+FS SD+GHLLDTESLRRRMEQIAAVQ
Subjt:  ENNIEVRVPQPSGKQDLQNKIQVEATKVEDREEAGQSNHSSLLRSRLLAPLGIPFCSASIGGARKTRPVDCGGDFSLSDVGHLLDTESLRRRMEQIAAVQ

Query:  GLGSVSADCANILNKVLDVYLKQLIRSCVDLVGAWPAYEPEKPLSHKQQFQGKVINGMLPNNQLHGRHSNGSEEVVHEHRLQCSISLLDFKVAMELNPTQ
        GLGSVSADCANILNKVLDVYLKQLIRSCVDLVG WP +EPEKPL+H QQ QGKVINGMLPNNQLH  HSNG+ EVVHE RL CSISLLDFKVAMELNP Q
Subjt:  GLGSVSADCANILNKVLDVYLKQLIRSCVDLVGAWPAYEPEKPLSHKQQFQGKVINGMLPNNQLHGRHSNGSEEVVHEHRLQCSISLLDFKVAMELNPTQ

Query:  LGEDWPLLLEKICMRTFGE
        LGEDWPLLLEKI MR F E
Subjt:  LGEDWPLLLEKICMRTFGE

SwissProt top hits (e value, % identity, alignment)
A6QR06 Transcriptional adapter 1 (e value: 6.0e-04; % identity: 34.09)
Query:  DLGELKSQIVKKLGADRSKRYFFYLNRFLSQKLSKNEFDKSCCRVLGRENLWLHNQLIQSILKNACQAKVAPPIPVAGYPKTSTQSAK
        +L   K  + + LG D  K+Y+  L  +  QK+SK EFD    R+L ++N+  HN  + +IL   CQ  V+ P      P T   +AK
Subjt:  DLGELKSQIVKKLGADRSKRYFFYLNRFLSQKLSKNEFDKSCCRVLGRENLWLHNQLIQSILKNACQAKVAPPIPVAGYPKTSTQSAK

Q5BJQ7 Transcriptional adapter 1 (e value: 6.0e-04; % identity: 34.09)
Query:  DLGELKSQIVKKLGADRSKRYFFYLNRFLSQKLSKNEFDKSCCRVLGRENLWLHNQLIQSILKNACQAKVAPPIPVAGYPKTSTQSAK
        +L   K  + + LG D  K+Y+  L  +  QK+SK EFD    R+L ++N+  HN  + +IL   CQ  V+ P      P T   +AK
Subjt:  DLGELKSQIVKKLGADRSKRYFFYLNRFLSQKLSKNEFDKSCCRVLGRENLWLHNQLIQSILKNACQAKVAPPIPVAGYPKTSTQSAK

Q99LM9 Transcriptional adapter 1 (e value: 6.0e-04; % identity: 34.09)
Query:  DLGELKSQIVKKLGADRSKRYFFYLNRFLSQKLSKNEFDKSCCRVLGRENLWLHNQLIQSILKNACQAKVAPPIPVAGYPKTSTQSAK
        +L   K  + + LG D  K+Y+  L  +  QK+SK EFD    R+L ++N+  HN  + +IL   CQ  V+ P      P T   +AK
Subjt:  DLGELKSQIVKKLGADRSKRYFFYLNRFLSQKLSKNEFDKSCCRVLGRENLWLHNQLIQSILKNACQAKVAPPIPVAGYPKTSTQSAK

Arabidopsis top hits (e value, % identity, alignment)
AT2G14850.1 unknown protein (e value: 1.3e-38; % identity: 31.44)
Query:  RIDLGELKSQIVKKLGADRSKRYFFYLNRFLSQKLSKNEFDKSCCRVLGRENLWLHNQLIQSILKNACQAKVAPPIPVAGYPKTSTQSAKISPLVEDGNE
        R++  E+K+ I +K+G  R+  YF  L +FL+ ++SK+EFDK C + +GREN+ LHN+L++SILKNA  AK  PP     YPK S               
Subjt:  RIDLGELKSQIVKKLGADRSKRYFFYLNRFLSQKLSKNEFDKSCCRVLGRENLWLHNQLIQSILKNACQAKVAPPIPVAGYPKTSTQSAKISPLVEDGNE

Query:  DGGAVFPTSTQNIPGWSNGVSPRKCRSGIRDRKLKDRPSILGPNGKVECISHLSANMDNGDATLCDYKRPVQNLQGIAELPENNIEVRVPQPSGKQDLQN
         G  VFP             SPRKCRS    RK +DRPS LGP GK + ++         D ++   +R                               
Subjt:  DGGAVFPTSTQNIPGWSNGVSPRKCRSGIRDRKLKDRPSILGPNGKVECISHLSANMDNGDATLCDYKRPVQNLQGIAELPENNIEVRVPQPSGKQDLQN

Query:  KIQVEATKVEDREEAGQSNHSSLLRSR--LLAPLGIPFCSASIGGARKTRPVDCGG--DFSLSDVGHLLDTESLRRRMEQIAAVQGLGSVSADCANILNK
         + +E   VED EE  Q   S  ++SR  L APLG+ F   S     K R     G    +    G L D  +LR R+E+   ++G+  +S D AN+LN+
Subjt:  KIQVEATKVEDREEAGQSNHSSLLRSR--LLAPLGIPFCSASIGGARKTRPVDCGG--DFSLSDVGHLLDTESLRRRMEQIAAVQGLGSVSADCANILNK

Query:  VLDVYLKQLIRSCVDLVGAWPAYEPEKPLSHKQQFQGKVINGMLPNNQLHGRHSNGSEEVVHEHRLQCSISLLDFKVAMELNPTQLGEDWPLLLEKICMR
         L+ Y+++LI  C+ L                                              + R   ++S+LDF  AME+NP  LGE+WP+ LEKIC R
Subjt:  VLDVYLKQLIRSCVDLVGAWPAYEPEKPLSHKQQFQGKVINGMLPNNQLHGRHSNGSEEVVHEHRLQCSISLLDFKVAMELNPTQLGEDWPLLLEKICMR

Query:  TFGE
           E
Subjt:  TFGE

AT2G24530.1 unknown protein (e value: 4.1e-93; % identity: 48.46)
Query:  MQPQQSLRIDLGELKSQIVKKLGADRSKRYFFYLNRFLSQKLSKNEFDKSCCRVLGRENLWLHNQLIQSILKNACQAKVAPPIPVAGYPKTSTQSAKISP
        MQ  Q  RI L ELK  IVKK G +RS+RYF+YL RFLSQKL+K+EFDK+C R+LGRENL LHNQLI+SIL+NA  AK  PP   AG+    +  A    
Subjt:  MQPQQSLRIDLGELKSQIVKKLGADRSKRYFFYLNRFLSQKLSKNEFDKSCCRVLGRENLWLHNQLIQSILKNACQAKVAPPIPVAGYPKTSTQSAKISP

Query:  LVEDGNEDGGAVFPTSTQNIPGWSNGV---SPRKCRSGIRDRKLKDRPSILGPNGKVECISHL---------SANMDNGDATLCDYKRPVQNLQGIAELP
           DG E  G + P  +Q+ P WSNGV   SPRK RSG+++RK +DRPS LG NGKVE + H          S  M+NG     DY+R      G     
Subjt:  LVEDGNEDGGAVFPTSTQNIPGWSNGV---SPRKCRSGIRDRKLKDRPSILGPNGKVECISHL---------SANMDNGDATLCDYKRPVQNLQGIAELP

Query:  ENNIEVRVPQPSGKQDLQNKIQVEATKVEDREEAGQSNHSSLLRSRLLAPLGIPFCSASIGGARKTRPVDCGGD-FSLSDVGHLLDTESLRRRMEQIAAV
        E + E    +P  K  + NK ++ A  + D +   +    +L  S L+APLGIPFCSAS+GG+ +T PV    +  S  D G L D E LR+RME IA  
Subjt:  ENNIEVRVPQPSGKQDLQNKIQVEATKVEDREEAGQSNHSSLLRSRLLAPLGIPFCSASIGGARKTRPVDCGGD-FSLSDVGHLLDTESLRRRMEQIAAV

Query:  QGLGSVSADCANILNKVLDVYLKQLIRSCVDLVGAWPAY-EPEKPLSHKQQFQGKVINGMLPNNQLHGRHSNGSEEVVHEHRLQCSISLLDFKVAMELNP
        QGL  VS +CA  LN +LDVYLK+LI SC DLVGA     +P K    KQQ Q K++NG+ P N L  +  NGS ++  +H    S+S+LDF+ AMELNP
Subjt:  QGLGSVSADCANILNKVLDVYLKQLIRSCVDLVGAWPAY-EPEKPLSHKQQFQGKVINGMLPNNQLHGRHSNGSEEVVHEHRLQCSISLLDFKVAMELNP

Query:  TQLGEDWPLLLEKICMRTFGE
         QLGEDWP L E+I +R+F E
Subjt:  TQLGEDWPLLLEKICMRTFGE

AT4G31440.1 unknown protein (e value: 3.4e-79; % identity: 45.48)
Query:  MQPQQSLRIDLGELKSQIVKKLGADRSKRYFFYLNRFLSQKLSKNEFDKSCCRVLGRENLWLHNQLIQSILKNACQAKVAPPIPVAGYPKTSTQSAKISP
        MQ  Q  RIDL ELK  IVKK+G +RS RYF+YL RFLSQKL+K+EFDKSC R+LGRENL LHN+LI+SIL+NA  AK  P +  +G+P  S    K   
Subjt:  MQPQQSLRIDLGELKSQIVKKLGADRSKRYFFYLNRFLSQKLSKNEFDKSCCRVLGRENLWLHNQLIQSILKNACQAKVAPPIPVAGYPKTSTQSAKISP

Query:  LVEDGNEDGGAVFPTSTQNIPGWSNGVSPRKCRSGIRDRKLKDRPSILGPNGKVECISHLSANMDNGDATLCDYKRPVQNLQGIAELPENNIEVRVPQPS
          EDG E+  ++ P   +N    SNGV  +       DR ++D+P  LG NGKV                   Y RP          P+   +     P+
Subjt:  LVEDGNEDGGAVFPTSTQNIPGWSNGVSPRKCRSGIRDRKLKDRPSILGPNGKVECISHLSANMDNGDATLCDYKRPVQNLQGIAELPENNIEVRVPQPS

Query:  GKQDLQNKIQVEATKVEDREEAGQSNHSSLLRSRLLAPLGIPFCSASIGGARKTRPVD-CGGDFSLSDVGHLLDTESLRRRMEQIAAVQGLGSVSADCAN
         ++ +  K QV A    D E    +    L    ++APLGIPFCSAS+GG R+T PV       S  D G L DTE LR+RME IA  QGLG VSA+C+ 
Subjt:  GKQDLQNKIQVEATKVEDREEAGQSNHSSLLRSRLLAPLGIPFCSASIGGARKTRPVD-CGGDFSLSDVGHLLDTESLRRRMEQIAAVQGLGSVSADCAN

Query:  ILNKVLDVYLKQLIRSCVDLVGAWPAY-EPEKPLSHKQQFQGKVINGMLPNNQLHGRHSNGSEEVVHEHRLQCSISLLDFKVAMELNPTQLGEDWPLLLE
        +LN +LD+YLK+L++SCVDL GA      P K    KQQ + +++NG+  NN  H + SN   ++  E   Q S+SLLDF+VAMELNP QLGEDWPLL E
Subjt:  ILNKVLDVYLKQLIRSCVDLVGAWPAY-EPEKPLSHKQQFQGKVINGMLPNNQLHGRHSNGSEEVVHEHRLQCSISLLDFKVAMELNPTQLGEDWPLLLE

Query:  KICMRTFGE
        +I +  F E
Subjt:  KICMRTFGE

AT4G33890.1 unknown protein (e value: 2.6e-39; % identity: 31.23)
Query:  QQSLRIDLGELKSQIVKKLGADRSKRYFFYLNRFLSQKLSKNEFDKSCCRVLGRENLWLHNQLIQSILKNACQAKVAPPIPVAGY-----PKTSTQSAKI
        Q S R+D  E+K+ I +++G  R++ YF  L RF + K++K+EFDK C + +GR+N+ LHN+LI+SI+KNAC AK  P I   G         S ++++I
Subjt:  QQSLRIDLGELKSQIVKKLGADRSKRYFFYLNRFLSQKLSKNEFDKSCCRVLGRENLWLHNQLIQSILKNACQAKVAPPIPVAGY-----PKTSTQSAKI

Query:  SPLVEDGNEDGGAVFPTSTQNIPGWSNGVSPRKCRSGIRDRKLKDRPSILGPNGKVECISHLSANMDNGDATLCDYKRPVQNLQGIAELPENNIEVRVPQ
         PL       G + F  ST            RKCRS    RKL+DRPS LGP GK   ++  +             +  +   Q   EL           
Subjt:  SPLVEDGNEDGGAVFPTSTQNIPGWSNGVSPRKCRSGIRDRKLKDRPSILGPNGKVECISHLSANMDNGDATLCDYKRPVQNLQGIAELPENNIEVRVPQ

Query:  PSGKQDLQNKIQVEATKVEDREEAGQ-SNHSSLLRSR--LLAPLGIPFCSASIGGARK--TRPVDCGGDF---SLSDVGHLLDTESLRRRMEQIAAVQGL
              L ++  VE   VE+ EE  Q +  S  ++SR  L APLG+   S   G  RK  +    C   F   +  + G L DT +LR R+E+   ++GL
Subjt:  PSGKQDLQNKIQVEATKVEDREEAGQ-SNHSSLLRSR--LLAPLGIPFCSASIGGARK--TRPVDCGGDF---SLSDVGHLLDTESLRRRMEQIAAVQGL

Query:  GSVSADCANILNKVLDVYLKQLIRSCVDLVGAWPAYEPEKPLSHKQQFQGKVINGMLPNNQLHGRHSNGSEEVVHEHRLQCSISLLDFKVAMELNPTQLG
          ++ D  ++LN  LDV++++LI  C+ L       +  + ++++   Q + ++                            +S+ DF+  MELN   LG
Subjt:  GSVSADCANILNKVLDVYLKQLIRSCVDLVGAWPAYEPEKPLSHKQQFQGKVINGMLPNNQLHGRHSNGSEEVVHEHRLQCSISLLDFKVAMELNPTQLG

Query:  EDWPLLLEKICMR
        EDWP+ +EKIC R
Subjt:  EDWPLLLEKICMR

AT4G33890.2 unknown protein (e value: 2.6e-39; % identity: 31.23)
Query:  QQSLRIDLGELKSQIVKKLGADRSKRYFFYLNRFLSQKLSKNEFDKSCCRVLGRENLWLHNQLIQSILKNACQAKVAPPIPVAGY-----PKTSTQSAKI
        Q S R+D  E+K+ I +++G  R++ YF  L RF + K++K+EFDK C + +GR+N+ LHN+LI+SI+KNAC AK  P I   G         S ++++I
Subjt:  QQSLRIDLGELKSQIVKKLGADRSKRYFFYLNRFLSQKLSKNEFDKSCCRVLGRENLWLHNQLIQSILKNACQAKVAPPIPVAGY-----PKTSTQSAKI

Query:  SPLVEDGNEDGGAVFPTSTQNIPGWSNGVSPRKCRSGIRDRKLKDRPSILGPNGKVECISHLSANMDNGDATLCDYKRPVQNLQGIAELPENNIEVRVPQ
         PL       G + F  ST            RKCRS    RKL+DRPS LGP GK   ++  +             +  +   Q   EL           
Subjt:  SPLVEDGNEDGGAVFPTSTQNIPGWSNGVSPRKCRSGIRDRKLKDRPSILGPNGKVECISHLSANMDNGDATLCDYKRPVQNLQGIAELPENNIEVRVPQ

Query:  PSGKQDLQNKIQVEATKVEDREEAGQ-SNHSSLLRSR--LLAPLGIPFCSASIGGARK--TRPVDCGGDF---SLSDVGHLLDTESLRRRMEQIAAVQGL
              L ++  VE   VE+ EE  Q +  S  ++SR  L APLG+   S   G  RK  +    C   F   +  + G L DT +LR R+E+   ++GL
Subjt:  PSGKQDLQNKIQVEATKVEDREEAGQ-SNHSSLLRSR--LLAPLGIPFCSASIGGARK--TRPVDCGGDF---SLSDVGHLLDTESLRRRMEQIAAVQGL

Query:  GSVSADCANILNKVLDVYLKQLIRSCVDLVGAWPAYEPEKPLSHKQQFQGKVINGMLPNNQLHGRHSNGSEEVVHEHRLQCSISLLDFKVAMELNPTQLG
          ++ D  ++LN  LDV++++LI  C+ L       +  + ++++   Q + ++                            +S+ DF+  MELN   LG
Subjt:  GSVSADCANILNKVLDVYLKQLIRSCVDLVGAWPAYEPEKPLSHKQQFQGKVINGMLPNNQLHGRHSNGSEEVVHEHRLQCSISLLDFKVAMELNPTQLG

Query:  EDWPLLLEKICMR
        EDWP+ +EKIC R
Subjt:  EDWPLLLEKICMR
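
In the alignment blocks above, the unlabeled middle row follows the usual BLAST-style convention: a letter marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. The sketch below shows one way a % identity figure could be recomputed from those rows. The function name `percent_identity` is hypothetical, and it assumes each row has already been stripped of its `Query:` label and leading indentation. The values reported on this page may have been computed differently.

```python
def percent_identity(query_rows, midline_rows):
    """Percent of aligned columns whose midline shows an identical residue."""
    aligned = 0
    identical = 0
    for query, midline in zip(query_rows, midline_rows):
        # Trailing spaces on the midline are often lost in plain text,
        # so pad it back to the length of the query row.
        midline = midline.ljust(len(query))
        for q, m in zip(query, midline):
            aligned += 1
            # A letter equal to the query residue means an identical column;
            # '+', spaces, and gap columns do not count as identical.
            if m.isalpha() and m == q:
                identical += 1
    return 100.0 * identical / aligned


# Last block of the Cucumis melo hit: 6 of 7 columns are identical.
print(round(percent_identity(["CMRTFGE"], ["CMR FGE"]), 2))
```

Passing every block of one hit, in order, gives the identity over the full alignment rather than over a single block.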


Sequences
CDS sequence
ATGCAACCTCAGCAGAGCTTGAGAATTGACTTGGGTGAATTGAAATCTCAGATAGTGAAGAAACTTGGAGCCGATCGGTCAAAACGGTACTTCTTTTACTTGAATAGGTT
CCTGAGTCAAAAGCTGAGTAAGAATGAGTTTGATAAGTCATGTTGTCGTGTACTCGGAAGGGAGAATCTTTGGCTGCATAATCAATTGATACAGTCAATTTTGAAGAATG
CTTGTCAAGCTAAGGTGGCACCACCAATACCTGTTGCAGGCTATCCGAAAACATCAACGCAATCTGCAAAAATTTCCCCTCTCGTAGAAGATGGGAATGAAGATGGTGGA
GCTGTTTTTCCTACTTCAACGCAAAATATTCCCGGTTGGTCCAATGGAGTTTCACCAAGAAAGTGCAGGTCTGGGATACGTGATCGCAAGCTCAAAGACAGACCCAGTAT
ACTGGGGCCAAATGGGAAGGTTGAATGTATCTCACATCTATCAGCCAACATGGATAATGGTGATGCAACACTATGTGACTATAAGAGACCAGTGCAGAATCTGCAAGGAA
TTGCTGAACTACCTGAAAACAATATTGAGGTCAGAGTTCCACAACCGTCAGGAAAGCAAGACCTACAAAATAAGATCCAGGTTGAAGCAACCAAGGTTGAAGACAGGGAA
GAAGCAGGACAGTCAAATCACTCGAGTTTGCTTCGGAGCCGATTACTTGCACCTCTTGGGATTCCTTTTTGCTCAGCTAGTATCGGTGGGGCTCGCAAAACAAGGCCAGT
GGATTGTGGGGGAGATTTTAGCTTAAGTGATGTTGGTCATTTGTTGGATACTGAGTCGTTGAGACGACGCATGGAACAAATTGCTGCTGTACAGGGCCTGGGCAGTGTTT
CTGCAGATTGTGCTAATATTTTGAATAAGGTGTTGGATGTATATTTGAAGCAGTTAATTAGGTCTTGTGTTGACTTAGTTGGAGCGTGGCCTGCATATGAGCCTGAGAAA
CCTCTTTCTCATAAGCAGCAGTTTCAGGGGAAGGTTATTAATGGCATGTTGCCAAATAATCAATTACACGGACGACATAGCAATGGAAGCGAAGAAGTTGTGCACGAGCA
CAGGTTACAATGTTCGATATCGTTGCTTGACTTCAAGGTAGCAATGGAGCTTAATCCAACACAATTAGGGGAAGACTGGCCTTTGTTACTGGAGAAAATTTGTATGCGTA
CCTTCGGGGAATGA
mRNA sequence
GAGAGAGAGAGAGAGGAAAATAAGAGAGAGAGAGACAGAGGTTGAAGGACAGAGAGAAAGCAAACGAAAGTTCGAAGCTCCGAAGATTCGGACCGAGCCACTTAGCTTCA
ATTTTCTCTTCAATTTGTCAAGTTTGTCAAAGGTCAGAGCTGAAGCAATCGCTATGTCTTTGTTGTTCAAATGCTATTGAGCTTAGTCGGCGAGATTCAATGTGCTTCAT
ATCTCTGCAATTTGTGCTCTGTTCACTCCACTCGACTTTGCTCAACTTACTGATCTCTTTATTTGGCTTCGAAATTTCGTGTTTCGAACTTCAGGGCTTGCGAATTCTGT
TCAAAGAATGATGGGTTTGTATTGGTTCTTCGACCCAATTCGCGTTTTTCAGTAGTTGTTGATTTCTTTGTTTGTTCCCGTGTTCTGAACTTGGGTTTCGTTGTTTTTCG
AAGTTAGGGTTTTTATTTGCTATTGTTTTGCAATAGGTTTGATTTGGTTTTTGAAATTTTGGCGGGATTTAGTGAGCCATGTAGTGGGTTTAACTTCTAGGGTTGTGGTT
TTTGAGAAGTTGCGATTTCGAGTTGCATTTTGTGCTTTGTAGTTGGGGAAAGAAAAGGATCAGTTGGAAATATGATTGCTGCAATTCTGATGAAATCCGTCCTGGGATTT
CCTTGATCTGTTGGGGTTGAATTTTGGGAACTGGTTCAAATGATGCCGACCTTGAAAAATTAACCTGCATTGATTGATATGTGGGGGTGAAATTTTCGTGTTTCTTTTTT
CTTTTTCTTCGATTGTTGGCCAAGTTTCGTGTCTGTAATTGAGTGATTCTGTAAAGGGTGTGTGAATCTGTCTGCTGAGAAATGCAACCTCAGCAGAGCTTGAGAATTGA
CTTGGGTGAATTGAAATCTCAGATAGTGAAGAAACTTGGAGCCGATCGGTCAAAACGGTACTTCTTTTACTTGAATAGGTTCCTGAGTCAAAAGCTGAGTAAGAATGAGT
TTGATAAGTCATGTTGTCGTGTACTCGGAAGGGAGAATCTTTGGCTGCATAATCAATTGATACAGTCAATTTTGAAGAATGCTTGTCAAGCTAAGGTGGCACCACCAATA
CCTGTTGCAGGCTATCCGAAAACATCAACGCAATCTGCAAAAATTTCCCCTCTCGTAGAAGATGGGAATGAAGATGGTGGAGCTGTTTTTCCTACTTCAACGCAAAATAT
TCCCGGTTGGTCCAATGGAGTTTCACCAAGAAAGTGCAGGTCTGGGATACGTGATCGCAAGCTCAAAGACAGACCCAGTATACTGGGGCCAAATGGGAAGGTTGAATGTA
TCTCACATCTATCAGCCAACATGGATAATGGTGATGCAACACTATGTGACTATAAGAGACCAGTGCAGAATCTGCAAGGAATTGCTGAACTACCTGAAAACAATATTGAG
GTCAGAGTTCCACAACCGTCAGGAAAGCAAGACCTACAAAATAAGATCCAGGTTGAAGCAACCAAGGTTGAAGACAGGGAAGAAGCAGGACAGTCAAATCACTCGAGTTT
GCTTCGGAGCCGATTACTTGCACCTCTTGGGATTCCTTTTTGCTCAGCTAGTATCGGTGGGGCTCGCAAAACAAGGCCAGTGGATTGTGGGGGAGATTTTAGCTTAAGTG
ATGTTGGTCATTTGTTGGATACTGAGTCGTTGAGACGACGCATGGAACAAATTGCTGCTGTACAGGGCCTGGGCAGTGTTTCTGCAGATTGTGCTAATATTTTGAATAAG
GTGTTGGATGTATATTTGAAGCAGTTAATTAGGTCTTGTGTTGACTTAGTTGGAGCGTGGCCTGCATATGAGCCTGAGAAACCTCTTTCTCATAAGCAGCAGTTTCAGGG
GAAGGTTATTAATGGCATGTTGCCAAATAATCAATTACACGGACGACATAGCAATGGAAGCGAAGAAGTTGTGCACGAGCACAGGTTACAATGTTCGATATCGTTGCTTG
ACTTCAAGGTAGCAATGGAGCTTAATCCAACACAATTAGGGGAAGACTGGCCTTTGTTACTGGAGAAAATTTGTATGCGTACCTTCGGGGAATGAAACCACTCTGATATT
TCTGTTTATCCCATTCCACAGTGTAGCTGACCTGTCACAGTTCAAAGGGGTTGAGAATTGAGCTCAAAGACCTCTCAGCTCTACTCATTTGTGAGTAAACAAAAATTGCC
CAAATCTTGATAGAAGCCCTGGCTTGCAAGAATTCAAGCAGATCAAACATCAATATTGATTTTCATTCTTGGGAATCATTTAACTGTGACTAAGCTTGGACACTGTCAGC
CAGAATGCCCATCAGCGATTGATCTACGTTTCATCTTTAACCATGATTTAAGGGTCTCAAGCTTTTCATGTAAATTTAGATCACTGAGGAATTATACGAAATTTTGCAGA
GATACAGTTTGTAGCTTGGTGAGAGGTAACCACTGGTGCAAGAACTTAGTGATGCTATTTGTGTATTTAAATCTGAGAATTGATTTGAATTTCATATGAATAACTCATCA
ACAGACAACAATCTGCAGATAGCTCTGTATTTCGTGCTTACCTTTTGTTTTTTTTTTTTTTT
Protein sequence
MQPQQSLRIDLGELKSQIVKKLGADRSKRYFFYLNRFLSQKLSKNEFDKSCCRVLGRENLWLHNQLIQSILKNACQAKVAPPIPVAGYPKTSTQSAKISPLVEDGNEDGG
AVFPTSTQNIPGWSNGVSPRKCRSGIRDRKLKDRPSILGPNGKVECISHLSANMDNGDATLCDYKRPVQNLQGIAELPENNIEVRVPQPSGKQDLQNKIQVEATKVEDRE
EAGQSNHSSLLRSRLLAPLGIPFCSASIGGARKTRPVDCGGDFSLSDVGHLLDTESLRRRMEQIAAVQGLGSVSADCANILNKVLDVYLKQLIRSCVDLVGAWPAYEPEK
PLSHKQQFQGKVINGMLPNNQLHGRHSNGSEEVVHEHRLQCSISLLDFKVAMELNPTQLGEDWPLLLEKICMRTFGE
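
The CDS and protein sequences above can be cross-checked by translating the CDS. The sketch below is a minimal illustration that assumes the standard genetic code applies to this nuclear gene. The helper name `translate` is hypothetical, and only the first 42 and last 12 nucleotides of the CDS are used, to keep the example short.

```python
BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: aa
    for (a, b, c), aa in zip(
        ((a, b, c) for a in BASES for b in BASES for c in BASES),
        AMINO_ACIDS,
    )
}


def translate(cds):
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        residue = CODON_TABLE[cds[i:i + 3]]
        if residue == "*":  # stop codon: end of the protein
            break
        protein.append(residue)
    return "".join(protein)


# First 42 nt of the CDS, then its final 12 nt (which end in the stop codon TGA).
print(translate("ATGCAACCTCAGCAGAGCTTGAGAATTGACTTGGGTGAATTG"))
print(translate("TTCGGGGAATGA"))
```

The two fragments translate to `MQPQQSLRIDLGEL` and `FGE`, matching the start and end of the protein sequence listed above. Joining all CDS lines into one string before calling `translate` would check the full sequence.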