; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lsi08G011290 (gene) of Bottle gourd (USVL1VR-Ls) v1 genome

Gene IDLsi08G011290
OrganismLagenaria siceraria USVL1VR-Ls (Bottle gourd (USVL1VR-Ls) v1)
DescriptionSAGA-Tad1 domain-containing protein
Genome locationchr08:19929935..19932339
RNA-Seq ExpressionLsi08G011290
SyntenyLsi08G011290
Gene Ontology termsGO:0006357 - regulation of transcription by RNA polymerase II (biological process)
GO:0045893 - positive regulation of transcription, DNA-templated (biological process)
GO:0000124 - SAGA complex (cellular component)
GO:0003713 - transcription coactivator activity (molecular function)
InterPro domainsIPR024738 - Transcriptional coactivator Hfi1/Transcriptional adapter 1


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7029751.1 hypothetical protein SDJN02_08093, partial [Cucurbita argyrosperma subsp. argyrosperma]7.9e-19587.66Show/hide
Query:  IVKKLGADRSKRYFFYLNRFLSQKLSKNEFDKLCCRVLGRENLWLHNQLIQSILKNACQAKAAPPIPVAGCPKTSTQSAKISPVIEDGNEDGGAVFPTST
        IVKKLG DRSKRYFFYLNRFLSQKLSKNEFDKLCCRVLGRENLWLHNQLIQSILKNACQAKAAPPIP AG PKTSTQ+AKISPVIEDGNEDGGAVFPTST
Subjt:  IVKKLGADRSKRYFFYLNRFLSQKLSKNEFDKLCCRVLGRENLWLHNQLIQSILKNACQAKAAPPIPVAGCPKTSTQSAKISPVIEDGNEDGGAVFPTST

Query:  QGIP-WSNGGFPASPRKCRTGIRDRKLKDRPSVLGPNGKVECNSHQSAGKEDGSCRIIVDNGDATPCDYKRPVQHLQGVAEQPENNTEARVQQPSGKQVL
        QGIP WSN GFP SPRKCR+GIRDRKLKDRPS+L PN KVEC S QSA KEDGSCRI++DNG+AT CDY+RPVQHLQGV E PENN EARVQ+P+GKQVL
Subjt:  QGIP-WSNGGFPASPRKCRTGIRDRKLKDRPSVLGPNGKVECNSHQSAGKEDGSCRIIVDNGDATPCDYKRPVQHLQGVAEQPENNTEARVQQPSGKQVL

Query:  QNKIQVEDREEAGQSNRLSVLRSGLLAPLGIPFCSASTGGARKTRPVDFGGDFSFSDIGHLLDTESLKRRMEQIAAVQGLGSVSADCANILNKVLDVYLK
        Q  +QVEDREEA QSNR S+LRS LLAPLGIPFCSAS GGA KTRPVD GG+FSFSD+GHLLDTESL+RRMEQIAAVQGLGSVSADCANILNKVLDVYLK
Subjt:  QNKIQVEDREEAGQSNRLSVLRSGLLAPLGIPFCSASTGGARKTRPVDFGGDFSFSDIGHLLDTESLKRRMEQIAAVQGLGSVSADCANILNKVLDVYLK

Query:  QLIRSCVDLVGAWPAFEPEKSLAHKQQVQGKVINGMLPNNQLHGRHSNGNGEVEHEHRLQCSVSLLDFKVAMELNPTQLGEDWPLLLEKICMRAFEE
        QLIRSCVDLVGAWPAFEPEK LAH QQ+QGKVINGMLPNNQLH  HSNGNGEV HE RL CS+SLLDFKVAMELNP QLGEDWPLLLEKI MRAF E
Subjt:  QLIRSCVDLVGAWPAFEPEKSLAHKQQVQGKVINGMLPNNQLHGRHSNGNGEVEHEHRLQCSVSLLDFKVAMELNPTQLGEDWPLLLEKICMRAFEE

XP_008445087.1 PREDICTED: uncharacterized protein LOC103488231 [Cucumis melo]1.9e-19387.06Show/hide
Query:  IVKKLGADRSKRYFFYLNRFLSQKLSKNEFDKLCCRVLGRENLWLHNQLIQSILKNACQAKAAPPIPVAGCPKTSTQSAKISPVIEDGNEDGGAVFPTST
        IVKKLGADRSKRYFFYLNRFLSQKLSKNEFDK CCRVLGRENLWLHNQLIQSILKNACQAKAAPPIPVAG PKTSTQSAKISP++EDGNEDGGAVFPTST
Subjt:  IVKKLGADRSKRYFFYLNRFLSQKLSKNEFDKLCCRVLGRENLWLHNQLIQSILKNACQAKAAPPIPVAGCPKTSTQSAKISPVIEDGNEDGGAVFPTST

Query:  QGIP-WSNGGFPASPRKCRTGIRDRKLKDRPSVLGPNGKVECNSHQSAGKEDGSCRIIVDNGDATPCDYKRPVQHLQGVAEQPENNTEARVQQPSGKQVL
        Q IP WSNG    SPRKCR+GIRDRKLKDRPSVLGPNGKVEC SH SA          +DNGDAT CDYKRPVQHLQGVAE PENN E RV QPSGKQVL
Subjt:  QGIP-WSNGGFPASPRKCRTGIRDRKLKDRPSVLGPNGKVECNSHQSAGKEDGSCRIIVDNGDATPCDYKRPVQHLQGVAEQPENNTEARVQQPSGKQVL

Query:  QNKIQ-----VEDREEAGQSNRLSVLRSGLLAPLGIPFCSASTGGARKTRPVDFGGDFSFSDIGHLLDTESLKRRMEQIAAVQGLGSVSADCANILNKVL
         NKIQ     VEDREEAGQSN  S+LRS LLAPLGIPFCSASTGG  KTRPVD GGDFSF D+GHLLDTESL+RRMEQIAAVQGLGSVSADCANILNKVL
Subjt:  QNKIQ-----VEDREEAGQSNRLSVLRSGLLAPLGIPFCSASTGGARKTRPVDFGGDFSFSDIGHLLDTESLKRRMEQIAAVQGLGSVSADCANILNKVL

Query:  DVYLKQLIRSCVDLVGAWPAFEPEKSLAHKQQVQGKVINGMLPNNQLHGRHSNGNGEVEHEHRLQCSVSLLDFKVAMELNPTQLGEDWPLLLEKICMRAF
        DVYLKQLIRSCVDLVGAWPA+EPEK LAHKQQ+QGKVINGMLPNNQLHGRHSNGN EV HEHRLQCS+SLLDFKVAMELNPTQLGEDWPLLLEKICMRAF
Subjt:  DVYLKQLIRSCVDLVGAWPAFEPEKSLAHKQQVQGKVINGMLPNNQLHGRHSNGNGEVEHEHRLQCSVSLLDFKVAMELNPTQLGEDWPLLLEKICMRAF

Query:  EE
         E
Subjt:  EE

XP_022962598.1 uncharacterized protein LOC111463000 [Cucurbita moschata]3.6e-19587.91Show/hide
Query:  IVKKLGADRSKRYFFYLNRFLSQKLSKNEFDKLCCRVLGRENLWLHNQLIQSILKNACQAKAAPPIPVAGCPKTSTQSAKISPVIEDGNEDGGAVFPTST
        IVKKLG DRSKRYFFYLNRFLSQKLSKNEFDKLCCRVLGRENLWLHNQLIQSILKNACQAKAAPPIP AG PKTSTQ+AKISPVIEDGNEDGGAVFPTST
Subjt:  IVKKLGADRSKRYFFYLNRFLSQKLSKNEFDKLCCRVLGRENLWLHNQLIQSILKNACQAKAAPPIPVAGCPKTSTQSAKISPVIEDGNEDGGAVFPTST

Query:  QGIP-WSNGGFPASPRKCRTGIRDRKLKDRPSVLGPNGKVECNSHQSAGKEDGSCRIIVDNGDATPCDYKRPVQHLQGVAEQPENNTEARVQQPSGKQVL
        QGIP WSN GFP SPRKCR+GIRDRKLKDRPS+L PN KVEC S QSA KEDGSCRI++DNG+AT CDY+RPVQHLQGV E PENN EARVQ+PSGKQVL
Subjt:  QGIP-WSNGGFPASPRKCRTGIRDRKLKDRPSVLGPNGKVECNSHQSAGKEDGSCRIIVDNGDATPCDYKRPVQHLQGVAEQPENNTEARVQQPSGKQVL

Query:  QNKIQVEDREEAGQSNRLSVLRSGLLAPLGIPFCSASTGGARKTRPVDFGGDFSFSDIGHLLDTESLKRRMEQIAAVQGLGSVSADCANILNKVLDVYLK
        Q  +QVEDREEA QSNR S+LRS LLAPLGIPFCSAS GGA KTRPVD GG+FSFSD+GHLLDTESL+RRMEQIAAVQGLGSVSADCANILNKVLDVYLK
Subjt:  QNKIQVEDREEAGQSNRLSVLRSGLLAPLGIPFCSASTGGARKTRPVDFGGDFSFSDIGHLLDTESLKRRMEQIAAVQGLGSVSADCANILNKVLDVYLK

Query:  QLIRSCVDLVGAWPAFEPEKSLAHKQQVQGKVINGMLPNNQLHGRHSNGNGEVEHEHRLQCSVSLLDFKVAMELNPTQLGEDWPLLLEKICMRAFEE
        QLIRSCVDLVGAWPAFEPEK LAH QQ+QGKVINGMLPNNQLH  HSNGNGEV HE RL CS+SLLDFKVAMELNP QLGEDWPLLLEKI MRAF E
Subjt:  QLIRSCVDLVGAWPAFEPEKSLAHKQQVQGKVINGMLPNNQLHGRHSNGNGEVEHEHRLQCSVSLLDFKVAMELNPTQLGEDWPLLLEKICMRAFEE

XP_022997521.1 uncharacterized protein LOC111492414 [Cucurbita maxima]9.1e-19185.75Show/hide
Query:  IVKKLGADRSKRYFFYLNRFLSQKLSKNEFDKLCCRVLGRENLWLHNQLIQSILKNACQAKAAPPIPVAGCPKTSTQSAKISPVIEDGNEDGGAVFPTST
        IVKKLG DRSKRYFFYLNRFLSQKLSKNEFDKLCCRVLGRENLWLHNQLIQSILKNACQAKAAPPIP AG PKTSTQ+AKISPVIEDGNEDGGAVF TST
Subjt:  IVKKLGADRSKRYFFYLNRFLSQKLSKNEFDKLCCRVLGRENLWLHNQLIQSILKNACQAKAAPPIPVAGCPKTSTQSAKISPVIEDGNEDGGAVFPTST

Query:  QGIP-WSNGGFPASPRKCRTGIRDRKLKDRPSVLGPNGKVECNSHQSAGKEDGSCRIIVDNGDATPCDYKRPVQHLQGVAEQPENNTEARVQQPSGKQVL
        QGIP WSN GF  SPRKCR+GIRDRKLKDRPS+L PN KVEC S QSA KEDGSCRI++DNG+AT CDY+RPVQHLQGV E PENN EARVQ+PSGKQVL
Subjt:  QGIP-WSNGGFPASPRKCRTGIRDRKLKDRPSVLGPNGKVECNSHQSAGKEDGSCRIIVDNGDATPCDYKRPVQHLQGVAEQPENNTEARVQQPSGKQVL

Query:  QNKIQ---VEDREEAGQSNRLSVLRSGLLAPLGIPFCSASTGGARKTRPVDFGGDFSFSDIGHLLDTESLKRRMEQIAAVQGLGSVSADCANILNKVLDV
        Q +++   VEDREEA QSNR S+LRS LLAPLGIPFCSAS GGA KTRPVD GG+FSFSD+GHLLDTESL+RRMEQIAAVQGLGSVSADCANILNKVLDV
Subjt:  QNKIQ---VEDREEAGQSNRLSVLRSGLLAPLGIPFCSASTGGARKTRPVDFGGDFSFSDIGHLLDTESLKRRMEQIAAVQGLGSVSADCANILNKVLDV

Query:  YLKQLIRSCVDLVGAWPAFEPEKSLAHKQQVQGKVINGMLPNNQLHGRHSNGNGEVEHEHRLQCSVSLLDFKVAMELNPTQLGEDWPLLLEKICMRAFEE
        YLKQLIRSCVDLVG WP FEPEK LAH QQ+QGKVINGMLPNNQLH  HSNGN EV HE RL CS+SLLDFKVAMELNP QLGEDWPLLLEKI MRAF E
Subjt:  YLKQLIRSCVDLVGAWPAFEPEKSLAHKQQVQGKVINGMLPNNQLHGRHSNGNGEVEHEHRLQCSVSLLDFKVAMELNPTQLGEDWPLLLEKICMRAFEE

XP_023546134.1 uncharacterized protein LOC111805335 [Cucurbita pepo subsp. pepo]1.0e-19487Show/hide
Query:  IVKKLGADRSKRYFFYLNRFLSQKLSKNEFDKLCCRVLGRENLWLHNQLIQSILKNACQAKAAPPIPVAGCPKTSTQSAKISPVIEDGNEDGGAVFPTST
        IVKKLG DRSKRYFFYLNRFLSQKLSKNEFDKLCCRVLGRENLWLHNQLIQSILKNACQAKAAPPIP AG PKTSTQ+AKISPVIEDGNEDGGAVFPTST
Subjt:  IVKKLGADRSKRYFFYLNRFLSQKLSKNEFDKLCCRVLGRENLWLHNQLIQSILKNACQAKAAPPIPVAGCPKTSTQSAKISPVIEDGNEDGGAVFPTST

Query:  QGIP-WSNGGFPASPRKCRTGIRDRKLKDRPSVLGPNGKVECNSHQSAGKEDGSCRIIVDNGDATPCDYKRPVQHLQGVAEQPENNTEARVQQPSGKQVL
        QGIP WSN GFP SPRKCR+GIRDRKLKDRPS+L PN KVEC S QSA KEDGSCRI++DNG+AT CDY+RPVQHLQGV E PENN EARVQ+PSGKQVL
Subjt:  QGIP-WSNGGFPASPRKCRTGIRDRKLKDRPSVLGPNGKVECNSHQSAGKEDGSCRIIVDNGDATPCDYKRPVQHLQGVAEQPENNTEARVQQPSGKQVL

Query:  QNKIQ---VEDREEAGQSNRLSVLRSGLLAPLGIPFCSASTGGARKTRPVDFGGDFSFSDIGHLLDTESLKRRMEQIAAVQGLGSVSADCANILNKVLDV
        Q +++   VEDREEA QSNR S+LRS LLAPLGIPFCSAS GGA KTRPVD GG+FSFSD+GHLLDTESL+RRMEQIAAVQGLGSVSADCANILNKVLDV
Subjt:  QNKIQ---VEDREEAGQSNRLSVLRSGLLAPLGIPFCSASTGGARKTRPVDFGGDFSFSDIGHLLDTESLKRRMEQIAAVQGLGSVSADCANILNKVLDV

Query:  YLKQLIRSCVDLVGAWPAFEPEKSLAHKQQVQGKVINGMLPNNQLHGRHSNGNGEVEHEHRLQCSVSLLDFKVAMELNPTQLGEDWPLLLEKICMRAFEE
        YLKQLIRSCVDLVGAWPAFEPEK LAH QQ+QGKVINGMLPNNQLH  HSNGNGEV HE RL CS+SLLDFKVAMELNP QLGEDWPLLLEKI MRAF E
Subjt:  YLKQLIRSCVDLVGAWPAFEPEKSLAHKQQVQGKVINGMLPNNQLHGRHSNGNGEVEHEHRLQCSVSLLDFKVAMELNPTQLGEDWPLLLEKICMRAFEE

TrEMBL top hitse value%identityAlignment
A0A0A0LM32 Uncharacterized protein2.2e-19085.57Show/hide
Query:  IVKKLGADRSKRYFFYLNRFLSQKLSKNEFDKLCCRVLGRENLWLHNQLIQSILKNACQAKAAPPIPVAGCPKTSTQSAKISPVIEDGNEDGGAVFPTST
        IVKKLGADRSKRYFFYLNRFLSQKLSKNEFDK CCRVLGRENLWLHNQLIQSILKNACQAK APPIPVAG PKTSTQSAKISP++EDGNEDGGAVFPTST
Subjt:  IVKKLGADRSKRYFFYLNRFLSQKLSKNEFDKLCCRVLGRENLWLHNQLIQSILKNACQAKAAPPIPVAGCPKTSTQSAKISPVIEDGNEDGGAVFPTST

Query:  QGIP-WSNGGFPASPRKCRTGIRDRKLKDRPSVLGPNGKVECNSHQSAGKEDGSCRIIVDNGDATPCDYKRPVQHLQGVAEQPENNTEARVQQPSGKQVL
        Q IP WSNG    SPRKCR+GIRDRKLKDRPS+LGPNGKVEC SH SA          +DNGDAT CDYKRPVQ+LQG+AE PENN E RV QPSGKQ L
Subjt:  QGIP-WSNGGFPASPRKCRTGIRDRKLKDRPSVLGPNGKVECNSHQSAGKEDGSCRIIVDNGDATPCDYKRPVQHLQGVAEQPENNTEARVQQPSGKQVL

Query:  QNKIQ-----VEDREEAGQSNRLSVLRSGLLAPLGIPFCSASTGGARKTRPVDFGGDFSFSDIGHLLDTESLKRRMEQIAAVQGLGSVSADCANILNKVL
        QNKIQ     VEDREEAGQSN  S+LRS LLAPLGIPFCSAS GGARKTRPVD GGDFS SD+GHLLDTESL+RRMEQIAAVQGLGSVSADCANILNKVL
Subjt:  QNKIQ-----VEDREEAGQSNRLSVLRSGLLAPLGIPFCSASTGGARKTRPVDFGGDFSFSDIGHLLDTESLKRRMEQIAAVQGLGSVSADCANILNKVL

Query:  DVYLKQLIRSCVDLVGAWPAFEPEKSLAHKQQVQGKVINGMLPNNQLHGRHSNGNGEVEHEHRLQCSVSLLDFKVAMELNPTQLGEDWPLLLEKICMRAF
        DVYLKQLIRSCVDLVGAWPA+EPEK L+HKQQ QGKVINGMLPNNQLHGRHSNG+ EV HEHRLQCS+SLLDFKVAMELNPTQLGEDWPLLLEKICMR F
Subjt:  DVYLKQLIRSCVDLVGAWPAFEPEKSLAHKQQVQGKVINGMLPNNQLHGRHSNGNGEVEHEHRLQCSVSLLDFKVAMELNPTQLGEDWPLLLEKICMRAF

Query:  EE
         E
Subjt:  EE

A0A1S3BCQ5 uncharacterized protein LOC1034882319.4e-19487.06Show/hide
Query:  IVKKLGADRSKRYFFYLNRFLSQKLSKNEFDKLCCRVLGRENLWLHNQLIQSILKNACQAKAAPPIPVAGCPKTSTQSAKISPVIEDGNEDGGAVFPTST
        IVKKLGADRSKRYFFYLNRFLSQKLSKNEFDK CCRVLGRENLWLHNQLIQSILKNACQAKAAPPIPVAG PKTSTQSAKISP++EDGNEDGGAVFPTST
Subjt:  IVKKLGADRSKRYFFYLNRFLSQKLSKNEFDKLCCRVLGRENLWLHNQLIQSILKNACQAKAAPPIPVAGCPKTSTQSAKISPVIEDGNEDGGAVFPTST

Query:  QGIP-WSNGGFPASPRKCRTGIRDRKLKDRPSVLGPNGKVECNSHQSAGKEDGSCRIIVDNGDATPCDYKRPVQHLQGVAEQPENNTEARVQQPSGKQVL
        Q IP WSNG    SPRKCR+GIRDRKLKDRPSVLGPNGKVEC SH SA          +DNGDAT CDYKRPVQHLQGVAE PENN E RV QPSGKQVL
Subjt:  QGIP-WSNGGFPASPRKCRTGIRDRKLKDRPSVLGPNGKVECNSHQSAGKEDGSCRIIVDNGDATPCDYKRPVQHLQGVAEQPENNTEARVQQPSGKQVL

Query:  QNKIQ-----VEDREEAGQSNRLSVLRSGLLAPLGIPFCSASTGGARKTRPVDFGGDFSFSDIGHLLDTESLKRRMEQIAAVQGLGSVSADCANILNKVL
         NKIQ     VEDREEAGQSN  S+LRS LLAPLGIPFCSASTGG  KTRPVD GGDFSF D+GHLLDTESL+RRMEQIAAVQGLGSVSADCANILNKVL
Subjt:  QNKIQ-----VEDREEAGQSNRLSVLRSGLLAPLGIPFCSASTGGARKTRPVDFGGDFSFSDIGHLLDTESLKRRMEQIAAVQGLGSVSADCANILNKVL

Query:  DVYLKQLIRSCVDLVGAWPAFEPEKSLAHKQQVQGKVINGMLPNNQLHGRHSNGNGEVEHEHRLQCSVSLLDFKVAMELNPTQLGEDWPLLLEKICMRAF
        DVYLKQLIRSCVDLVGAWPA+EPEK LAHKQQ+QGKVINGMLPNNQLHGRHSNGN EV HEHRLQCS+SLLDFKVAMELNPTQLGEDWPLLLEKICMRAF
Subjt:  DVYLKQLIRSCVDLVGAWPAFEPEKSLAHKQQVQGKVINGMLPNNQLHGRHSNGNGEVEHEHRLQCSVSLLDFKVAMELNPTQLGEDWPLLLEKICMRAF

Query:  EE
         E
Subjt:  EE

A0A5A7VF96 SAGA-Tad1 domain-containing protein9.4e-19487.06Show/hide
Query:  IVKKLGADRSKRYFFYLNRFLSQKLSKNEFDKLCCRVLGRENLWLHNQLIQSILKNACQAKAAPPIPVAGCPKTSTQSAKISPVIEDGNEDGGAVFPTST
        IVKKLGADRSKRYFFYLNRFLSQKLSKNEFDK CCRVLGRENLWLHNQLIQSILKNACQAKAAPPIPVAG PKTSTQSAKISP++EDGNEDGGAVFPTST
Subjt:  IVKKLGADRSKRYFFYLNRFLSQKLSKNEFDKLCCRVLGRENLWLHNQLIQSILKNACQAKAAPPIPVAGCPKTSTQSAKISPVIEDGNEDGGAVFPTST

Query:  QGIP-WSNGGFPASPRKCRTGIRDRKLKDRPSVLGPNGKVECNSHQSAGKEDGSCRIIVDNGDATPCDYKRPVQHLQGVAEQPENNTEARVQQPSGKQVL
        Q IP WSNG    SPRKCR+GIRDRKLKDRPSVLGPNGKVEC SH SA          +DNGDAT CDYKRPVQHLQGVAE PENN E RV QPSGKQVL
Subjt:  QGIP-WSNGGFPASPRKCRTGIRDRKLKDRPSVLGPNGKVECNSHQSAGKEDGSCRIIVDNGDATPCDYKRPVQHLQGVAEQPENNTEARVQQPSGKQVL

Query:  QNKIQ-----VEDREEAGQSNRLSVLRSGLLAPLGIPFCSASTGGARKTRPVDFGGDFSFSDIGHLLDTESLKRRMEQIAAVQGLGSVSADCANILNKVL
         NKIQ     VEDREEAGQSN  S+LRS LLAPLGIPFCSASTGG  KTRPVD GGDFSF D+GHLLDTESL+RRMEQIAAVQGLGSVSADCANILNKVL
Subjt:  QNKIQ-----VEDREEAGQSNRLSVLRSGLLAPLGIPFCSASTGGARKTRPVDFGGDFSFSDIGHLLDTESLKRRMEQIAAVQGLGSVSADCANILNKVL

Query:  DVYLKQLIRSCVDLVGAWPAFEPEKSLAHKQQVQGKVINGMLPNNQLHGRHSNGNGEVEHEHRLQCSVSLLDFKVAMELNPTQLGEDWPLLLEKICMRAF
        DVYLKQLIRSCVDLVGAWPA+EPEK LAHKQQ+QGKVINGMLPNNQLHGRHSNGN EV HEHRLQCS+SLLDFKVAMELNPTQLGEDWPLLLEKICMRAF
Subjt:  DVYLKQLIRSCVDLVGAWPAFEPEKSLAHKQQVQGKVINGMLPNNQLHGRHSNGNGEVEHEHRLQCSVSLLDFKVAMELNPTQLGEDWPLLLEKICMRAF

Query:  EE
         E
Subjt:  EE

A0A6J1HF85 uncharacterized protein LOC1114630001.7e-19587.91Show/hide
Query:  IVKKLGADRSKRYFFYLNRFLSQKLSKNEFDKLCCRVLGRENLWLHNQLIQSILKNACQAKAAPPIPVAGCPKTSTQSAKISPVIEDGNEDGGAVFPTST
        IVKKLG DRSKRYFFYLNRFLSQKLSKNEFDKLCCRVLGRENLWLHNQLIQSILKNACQAKAAPPIP AG PKTSTQ+AKISPVIEDGNEDGGAVFPTST
Subjt:  IVKKLGADRSKRYFFYLNRFLSQKLSKNEFDKLCCRVLGRENLWLHNQLIQSILKNACQAKAAPPIPVAGCPKTSTQSAKISPVIEDGNEDGGAVFPTST

Query:  QGIP-WSNGGFPASPRKCRTGIRDRKLKDRPSVLGPNGKVECNSHQSAGKEDGSCRIIVDNGDATPCDYKRPVQHLQGVAEQPENNTEARVQQPSGKQVL
        QGIP WSN GFP SPRKCR+GIRDRKLKDRPS+L PN KVEC S QSA KEDGSCRI++DNG+AT CDY+RPVQHLQGV E PENN EARVQ+PSGKQVL
Subjt:  QGIP-WSNGGFPASPRKCRTGIRDRKLKDRPSVLGPNGKVECNSHQSAGKEDGSCRIIVDNGDATPCDYKRPVQHLQGVAEQPENNTEARVQQPSGKQVL

Query:  QNKIQVEDREEAGQSNRLSVLRSGLLAPLGIPFCSASTGGARKTRPVDFGGDFSFSDIGHLLDTESLKRRMEQIAAVQGLGSVSADCANILNKVLDVYLK
        Q  +QVEDREEA QSNR S+LRS LLAPLGIPFCSAS GGA KTRPVD GG+FSFSD+GHLLDTESL+RRMEQIAAVQGLGSVSADCANILNKVLDVYLK
Subjt:  QNKIQVEDREEAGQSNRLSVLRSGLLAPLGIPFCSASTGGARKTRPVDFGGDFSFSDIGHLLDTESLKRRMEQIAAVQGLGSVSADCANILNKVLDVYLK

Query:  QLIRSCVDLVGAWPAFEPEKSLAHKQQVQGKVINGMLPNNQLHGRHSNGNGEVEHEHRLQCSVSLLDFKVAMELNPTQLGEDWPLLLEKICMRAFEE
        QLIRSCVDLVGAWPAFEPEK LAH QQ+QGKVINGMLPNNQLH  HSNGNGEV HE RL CS+SLLDFKVAMELNP QLGEDWPLLLEKI MRAF E
Subjt:  QLIRSCVDLVGAWPAFEPEKSLAHKQQVQGKVINGMLPNNQLHGRHSNGNGEVEHEHRLQCSVSLLDFKVAMELNPTQLGEDWPLLLEKICMRAFEE

A0A6J1K7Q1 uncharacterized protein LOC1114924144.4e-19185.75Show/hide
Query:  IVKKLGADRSKRYFFYLNRFLSQKLSKNEFDKLCCRVLGRENLWLHNQLIQSILKNACQAKAAPPIPVAGCPKTSTQSAKISPVIEDGNEDGGAVFPTST
        IVKKLG DRSKRYFFYLNRFLSQKLSKNEFDKLCCRVLGRENLWLHNQLIQSILKNACQAKAAPPIP AG PKTSTQ+AKISPVIEDGNEDGGAVF TST
Subjt:  IVKKLGADRSKRYFFYLNRFLSQKLSKNEFDKLCCRVLGRENLWLHNQLIQSILKNACQAKAAPPIPVAGCPKTSTQSAKISPVIEDGNEDGGAVFPTST

Query:  QGIP-WSNGGFPASPRKCRTGIRDRKLKDRPSVLGPNGKVECNSHQSAGKEDGSCRIIVDNGDATPCDYKRPVQHLQGVAEQPENNTEARVQQPSGKQVL
        QGIP WSN GF  SPRKCR+GIRDRKLKDRPS+L PN KVEC S QSA KEDGSCRI++DNG+AT CDY+RPVQHLQGV E PENN EARVQ+PSGKQVL
Subjt:  QGIP-WSNGGFPASPRKCRTGIRDRKLKDRPSVLGPNGKVECNSHQSAGKEDGSCRIIVDNGDATPCDYKRPVQHLQGVAEQPENNTEARVQQPSGKQVL

Query:  QNKIQ---VEDREEAGQSNRLSVLRSGLLAPLGIPFCSASTGGARKTRPVDFGGDFSFSDIGHLLDTESLKRRMEQIAAVQGLGSVSADCANILNKVLDV
        Q +++   VEDREEA QSNR S+LRS LLAPLGIPFCSAS GGA KTRPVD GG+FSFSD+GHLLDTESL+RRMEQIAAVQGLGSVSADCANILNKVLDV
Subjt:  QNKIQ---VEDREEAGQSNRLSVLRSGLLAPLGIPFCSASTGGARKTRPVDFGGDFSFSDIGHLLDTESLKRRMEQIAAVQGLGSVSADCANILNKVLDV

Query:  YLKQLIRSCVDLVGAWPAFEPEKSLAHKQQVQGKVINGMLPNNQLHGRHSNGNGEVEHEHRLQCSVSLLDFKVAMELNPTQLGEDWPLLLEKICMRAFEE
        YLKQLIRSCVDLVG WP FEPEK LAH QQ+QGKVINGMLPNNQLH  HSNGN EV HE RL CS+SLLDFKVAMELNP QLGEDWPLLLEKI MRAF E
Subjt:  YLKQLIRSCVDLVGAWPAFEPEKSLAHKQQVQGKVINGMLPNNQLHGRHSNGNGEVEHEHRLQCSVSLLDFKVAMELNPTQLGEDWPLLLEKICMRAFEE

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G14850.1 unknown protein4.0e-3532.17Show/hide
Query:  IVKKLGADRSKRYFFYLNRFLSQKLSKNEFDKLCCRVLGRENLWLHNQLIQSILKNACQAKAAPPIPVAGCPKTSTQSAKISPVIEDGNEDGGAVFPTST
        I +K+G  R+  YF  L +FL+ ++SK+EFDKLC + +GREN+ LHN+L++SILKNA  AK+ PP      PK S       PV                
Subjt:  IVKKLGADRSKRYFFYLNRFLSQKLSKNEFDKLCCRVLGRENLWLHNQLIQSILKNACQAKAAPPIPVAGCPKTSTQSAKISPVIEDGNEDGGAVFPTST

Query:  QGIPWSNGGFPASPRKCRTGIRDRKLKDRPSVLGPNGKVECNSHQSAGKEDGSCRIIVDNGDATPCDYKRPVQHLQGVAEQPENNTEARVQQPSGKQVL-
                 FP SPRKCR+    RK +DRPS LGP GK                                           P++ T    +  S  Q L 
Subjt:  QGIPWSNGGFPASPRKCRTGIRDRKLKDRPSVLGPNGKVECNSHQSAGKEDGSCRIIVDNGDATPCDYKRPVQHLQGVAEQPENNTEARVQQPSGKQVL-

Query:  QNKIQVEDREEAGQSNRLSVL--RSGLLAPLGIPFCSASTGGARKTRPVDFGG--DFSFSDIGHLLDTESLKRRMEQIAAVQGLGSVSADCANILNKVLD
           + VED EE  Q      +  RS L APLG+ F   S     K R   + G    +    G L D  +L+ R+E+   ++G+  +S D AN+LN+ L+
Subjt:  QNKIQVEDREEAGQSNRLSVL--RSGLLAPLGIPFCSASTGGARKTRPVDFGG--DFSFSDIGHLLDTESLKRRMEQIAAVQGLGSVSADCANILNKVLD

Query:  VYLKQLIRSCVDLVGAWPAFEPEKSLAHKQQVQGKVINGMLPNNQLHGRHSNGNGEVEHEHRLQCSVSLLDFKVAMELNPTQLGEDWPLLLEKICMRAFE
         Y+++LI  C+             SLA +Q+                              R   +VS+LDF  AME+NP  LGE+WP+ LEKIC RA E
Subjt:  VYLKQLIRSCVDLVGAWPAFEPEKSLAHKQQVQGKVINGMLPNNQLHGRHSNGNGEVEHEHRLQCSVSLLDFKVAMELNPTQLGEDWPLLLEKICMRAFE

Query:  E
        E
Subjt:  E

AT2G24530.1 unknown protein2.8e-8947.77Show/hide
Query:  IVKKLGADRSKRYFFYLNRFLSQKLSKNEFDKLCCRVLGRENLWLHNQLIQSILKNACQAKAAPPIPVAGCPKTSTQSAKISPVIEDGNEDGGAVFPTST
        IVKK G +RS+RYF+YL RFLSQKL+K+EFDK C R+LGRENL LHNQLI+SIL+NA  AK+ PP   AG     +  A       DG E  G + P  +
Subjt:  IVKKLGADRSKRYFFYLNRFLSQKLSKNEFDKLCCRVLGRENLWLHNQLIQSILKNACQAKAAPPIPVAGCPKTSTQSAKISPVIEDGNEDGGAVFPTST

Query:  QGIP-WSNGGFPASPRKCRTGIRDRKLKDRPSVLGPNGKVECNSHQSAGKEDGSCRIIVDNGDATPCDYKRPVQHLQGVAEQPENNTEARVQQPSGKQVL
        Q  P WSNG  P SPRK R+G+++RK +DRPS LG NGKVE   HQ   +ED    + ++NG     DY+R  ++   VA++ +      V++P     +
Subjt:  QGIP-WSNGGFPASPRKCRTGIRDRKLKDRPSVLGPNGKVECNSHQSAGKEDGSCRIIVDNGDATPCDYKRPVQHLQGVAEQPENNTEARVQQPSGKQVL

Query:  QNK-----IQVEDREEAGQSNRLSVLRSGLLAPLGIPFCSASTGGARKTRPVDFGGD-FSFSDIGHLLDTESLKRRMEQIAAVQGLGSVSADCANILNKV
         NK     + + D +   +  R+++  S L+APLGIPFCSAS GG+ +T PV    +  S  D G L D E L++RME IA  QGL  VS +CA  LN +
Subjt:  QNK-----IQVEDREEAGQSNRLSVLRSGLLAPLGIPFCSASTGGARKTRPVDFGGD-FSFSDIGHLLDTESLKRRMEQIAAVQGLGSVSADCANILNKV

Query:  LDVYLKQLIRSCVDLVGAWPA-FEPEKSLAHKQQVQGKVINGMLPNNQLHGRHSNGNGEVEHEHRLQCSVSLLDFKVAMELNPTQLGEDWPLLLEKICMR
        LDVYLK+LI SC DLVGA     +P K    KQQ Q K++NG+ P N L  +  NG+ ++  +H    SVS+LDF+ AMELNP QLGEDWP L E+I +R
Subjt:  LDVYLKQLIRSCVDLVGAWPA-FEPEKSLAHKQQVQGKVINGMLPNNQLHGRHSNGNGEVEHEHRLQCSVSLLDFKVAMELNPTQLGEDWPLLLEKICMR

Query:  AFEE
        +FEE
Subjt:  AFEE

AT4G31440.1 unknown protein1.2e-7144.2Show/hide
Query:  IAIVKKLGADRSKRYFFYLNRFLSQKLSKNEFDKLCCRVLGRENLWLHNQLIQSILKNACQAKAAPPIPVAGCPKTSTQSAKISPVIEDGNEDGGAVFPT
        + IVKK+G +RS RYF+YL RFLSQKL+K+EFDK C R+LGRENL LHN+LI+SIL+NA  AK+ P +  +G P  S    K     EDG E+  ++ P 
Subjt:  IAIVKKLGADRSKRYFFYLNRFLSQKLSKNEFDKLCCRVLGRENLWLHNQLIQSILKNACQAKAAPPIPVAGCPKTSTQSAKISPVIEDGNEDGGAVFPT

Query:  STQG-IPWSNGGFPASPRKCRTG-IRDRKLKDRPSVLGPNGKVECNSHQSAGKEDGSCRIIVDNGDATPCDYKRPVQHLQGVAEQPENNTEARVQQPSGK
          +  +  SNG       K R G   DR ++D+P  LG NGKV                         P  Y RP ++       P+    A +  P+ +
Subjt:  STQG-IPWSNGGFPASPRKCRTG-IRDRKLKDRPSVLGPNGKVECNSHQSAGKEDGSCRIIVDNGDATPCDYKRPVQHLQGVAEQPENNTEARVQQPSGK

Query:  QVLQNKIQVE---DREEAGQSNRLSVLRSGLLAPLGIPFCSASTGGARKTRPVD-FGGDFSFSDIGHLLDTESLKRRMEQIAAVQGLGSVSADCANILNK
        + +  K QV     R++  Q   LS     ++APLGIPFCSAS GG R+T PV       S  D G L DTE L++RME IA  QGLG VSA+C+ +LN 
Subjt:  QVLQNKIQVE---DREEAGQSNRLSVLRSGLLAPLGIPFCSASTGGARKTRPVD-FGGDFSFSDIGHLLDTESLKRRMEQIAAVQGLGSVSADCANILNK

Query:  VLDVYLKQLIRSCVDLVGAWPA-FEPEKSLAHKQQVQGKVINGMLPNNQLHGRHSNGNGEVEHEHRLQCSVSLLDFKVAMELNPTQLGEDWPLLLEKICM
        +LD+YLK+L++SCVDL GA      P K    KQQ + +++NG+  NN  H + SN   ++  E   Q SVSLLDF+VAMELNP QLGEDWPLL E+I +
Subjt:  VLDVYLKQLIRSCVDLVGAWPA-FEPEKSLAHKQQVQGKVINGMLPNNQLHGRHSNGNGEVEHEHRLQCSVSLLDFKVAMELNPTQLGEDWPLLLEKICM

Query:  RAFEE
          FEE
Subjt:  RAFEE

AT4G33890.1 unknown protein1.3e-3330.37Show/hide
Query:  IVKKLGADRSKRYFFYLNRFLSQKLSKNEFDKLCCRVLGRENLWLHNQLIQSILKNACQAKAAPPIPVAGCPKTSTQSAKISPVIEDGNEDGGAVFPTST
        I +++G  R++ YF  L RF + K++K+EFDKLC + +GR+N+ LHN+LI+SI+KNAC AK+ P I   G              +  GN D       ++
Subjt:  IVKKLGADRSKRYFFYLNRFLSQKLSKNEFDKLCCRVLGRENLWLHNQLIQSILKNACQAKAAPPIPVAGCPKTSTQSAKISPVIEDGNEDGGAVFPTST

Query:  QGIP-WSNGGFPASPRKCRTGIRDRKLKDRPSVLGPNGK---VECNSHQSAGKEDGSCRIIVDNGDATPCDYKRPVQHLQGVAEQPENNTEARVQQPSGK
        Q  P   +  F  S RKCR+    RKL+DRPS LGP GK   +   + +S  K   +  ++            RP   +  V E  E      V+Q +G 
Subjt:  QGIP-WSNGGFPASPRKCRTGIRDRKLKDRPSVLGPNGK---VECNSHQSAGKEDGSCRIIVDNGDATPCDYKRPVQHLQGVAEQPENNTEARVQQPSGK

Query:  QVLQNKIQVEDREEAGQSNRLSVLRSGLLAPLGIPFCSASTGGARKTRPVDFGGDFSFS-----DIGHLLDTESLKRRMEQIAAVQGLGSVSADCANILN
                       G  +  S  R  L APLG+   S   G  RK+         SF+     + G L DT +L+ R+E+   ++GL  ++ D  ++LN
Subjt:  QVLQNKIQVEDREEAGQSNRLSVLRSGLLAPLGIPFCSASTGGARKTRPVDFGGDFSFS-----DIGHLLDTESLKRRMEQIAAVQGLGSVSADCANILN

Query:  KVLDVYLKQLIRSCVDLVGAWPAFEPEKSLAHKQQVQGKVINGMLPNNQLHGRHSNGNGEVEHEHRLQCSVSLLDFKVAMELNPTQLGEDWPLLLEKICM
          LDV++++LI  C+ L       +  + +                           N +   + R    VS+ DF+  MELN   LGEDWP+ +EKIC 
Subjt:  KVLDVYLKQLIRSCVDLVGAWPAFEPEKSLAHKQQVQGKVINGMLPNNQLHGRHSNGNGEVEHEHRLQCSVSLLDFKVAMELNPTQLGEDWPLLLEKICM

Query:  RAFEE
        RA ++
Subjt:  RAFEE

AT4G33890.2 unknown protein1.3e-3330.37Show/hide
Query:  IVKKLGADRSKRYFFYLNRFLSQKLSKNEFDKLCCRVLGRENLWLHNQLIQSILKNACQAKAAPPIPVAGCPKTSTQSAKISPVIEDGNEDGGAVFPTST
        I +++G  R++ YF  L RF + K++K+EFDKLC + +GR+N+ LHN+LI+SI+KNAC AK+ P I   G              +  GN D       ++
Subjt:  IVKKLGADRSKRYFFYLNRFLSQKLSKNEFDKLCCRVLGRENLWLHNQLIQSILKNACQAKAAPPIPVAGCPKTSTQSAKISPVIEDGNEDGGAVFPTST

Query:  QGIP-WSNGGFPASPRKCRTGIRDRKLKDRPSVLGPNGK---VECNSHQSAGKEDGSCRIIVDNGDATPCDYKRPVQHLQGVAEQPENNTEARVQQPSGK
        Q  P   +  F  S RKCR+    RKL+DRPS LGP GK   +   + +S  K   +  ++            RP   +  V E  E      V+Q +G 
Subjt:  QGIP-WSNGGFPASPRKCRTGIRDRKLKDRPSVLGPNGK---VECNSHQSAGKEDGSCRIIVDNGDATPCDYKRPVQHLQGVAEQPENNTEARVQQPSGK

Query:  QVLQNKIQVEDREEAGQSNRLSVLRSGLLAPLGIPFCSASTGGARKTRPVDFGGDFSFS-----DIGHLLDTESLKRRMEQIAAVQGLGSVSADCANILN
                       G  +  S  R  L APLG+   S   G  RK+         SF+     + G L DT +L+ R+E+   ++GL  ++ D  ++LN
Subjt:  QVLQNKIQVEDREEAGQSNRLSVLRSGLLAPLGIPFCSASTGGARKTRPVDFGGDFSFS-----DIGHLLDTESLKRRMEQIAAVQGLGSVSADCANILN

Query:  KVLDVYLKQLIRSCVDLVGAWPAFEPEKSLAHKQQVQGKVINGMLPNNQLHGRHSNGNGEVEHEHRLQCSVSLLDFKVAMELNPTQLGEDWPLLLEKICM
          LDV++++LI  C+ L       +  + +                           N +   + R    VS+ DF+  MELN   LGEDWP+ +EKIC 
Subjt:  KVLDVYLKQLIRSCVDLVGAWPAFEPEKSLAHKQQVQGKVINGMLPNNQLHGRHSNGNGEVEHEHRLQCSVSLLDFKVAMELNPTQLGEDWPLLLEKICM

Query:  RAFEE
        RA ++
Subjt:  RAFEE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTTGGGAAAAGAAAAGGATCAGTTGGAAATATGATTGCTATAGTGAAGAAGCTCGGAGCCGATCGGTCGAAACGGTACTTCTTTTACTTGAATAGGTTCTTGAGTCA
AAAGCTGAGTAAGAATGAGTTTGATAAGTTATGTTGTCGTGTACTTGGGAGAGAAAATCTTTGGCTGCATAATCAATTGATACAGTCAATTTTAAAGAATGCTTGTCAAG
CTAAGGCTGCACCACCAATACCTGTAGCAGGCTGTCCGAAAACTTCGACGCAATCTGCAAAAATTTCCCCTGTCATAGAAGATGGGAATGAGGATGGTGGAGCTGTTTTT
CCTACTTCCACTCAAGGTATTCCTTGGTCCAATGGAGGTTTTCCGGCGTCCCCGAGAAAGTGCAGGACTGGGATACGCGACCGCAAACTCAAAGACAGGCCTAGTGTACT
AGGACCAAATGGGAAGGTTGAATGTAACTCACATCAATCAGCAGGCAAGGAAGACGGCAGCTGTAGAATCATCGTGGATAATGGTGATGCAACACCGTGTGACTATAAGA
GACCAGTGCAGCATCTGCAAGGAGTTGCTGAACAACCTGAAAACAATACTGAGGCTAGAGTTCAGCAACCATCAGGAAAGCAAGTCTTACAAAATAAGATCCAGGTTGAA
GACAGGGAAGAAGCGGGACAGTCAAATCGCTTGAGTGTACTTCGGAGTGGATTACTTGCACCTCTCGGGATTCCTTTTTGCTCAGCCAGTACAGGCGGGGCCCGCAAAAC
GAGGCCGGTGGATTTTGGGGGAGATTTTAGCTTCAGTGATATTGGTCATTTGTTGGATACTGAGTCGTTGAAACGACGCATGGAACAAATTGCTGCAGTACAGGGCCTGG
GCAGTGTTTCTGCAGATTGTGCTAATATTTTGAATAAGGTGTTGGATGTATATTTGAAGCAGCTAATTAGGTCTTGTGTTGACTTGGTTGGAGCATGGCCTGCATTTGAG
CCTGAGAAATCCCTTGCCCATAAGCAGCAGGTTCAGGGGAAGGTTATCAATGGCATGTTGCCGAATAATCAATTACATGGACGACATAGCAACGGAAATGGAGAAGTTGA
ACACGAGCACAGATTACAGTGCTCGGTATCATTGCTCGACTTCAAGGTAGCAATGGAGCTTAATCCAACACAACTTGGGGAAGACTGGCCTTTGCTACTGGAGAAAATTT
GTATGCGAGCATTCGAGGAATGA
mRNA sequenceShow/hide mRNA sequence
ATGGTTGGGAAAAGAAAAGGATCAGTTGGAAATATGATTGCTATAGTGAAGAAGCTCGGAGCCGATCGGTCGAAACGGTACTTCTTTTACTTGAATAGGTTCTTGAGTCA
AAAGCTGAGTAAGAATGAGTTTGATAAGTTATGTTGTCGTGTACTTGGGAGAGAAAATCTTTGGCTGCATAATCAATTGATACAGTCAATTTTAAAGAATGCTTGTCAAG
CTAAGGCTGCACCACCAATACCTGTAGCAGGCTGTCCGAAAACTTCGACGCAATCTGCAAAAATTTCCCCTGTCATAGAAGATGGGAATGAGGATGGTGGAGCTGTTTTT
CCTACTTCCACTCAAGGTATTCCTTGGTCCAATGGAGGTTTTCCGGCGTCCCCGAGAAAGTGCAGGACTGGGATACGCGACCGCAAACTCAAAGACAGGCCTAGTGTACT
AGGACCAAATGGGAAGGTTGAATGTAACTCACATCAATCAGCAGGCAAGGAAGACGGCAGCTGTAGAATCATCGTGGATAATGGTGATGCAACACCGTGTGACTATAAGA
GACCAGTGCAGCATCTGCAAGGAGTTGCTGAACAACCTGAAAACAATACTGAGGCTAGAGTTCAGCAACCATCAGGAAAGCAAGTCTTACAAAATAAGATCCAGGTTGAA
GACAGGGAAGAAGCGGGACAGTCAAATCGCTTGAGTGTACTTCGGAGTGGATTACTTGCACCTCTCGGGATTCCTTTTTGCTCAGCCAGTACAGGCGGGGCCCGCAAAAC
GAGGCCGGTGGATTTTGGGGGAGATTTTAGCTTCAGTGATATTGGTCATTTGTTGGATACTGAGTCGTTGAAACGACGCATGGAACAAATTGCTGCAGTACAGGGCCTGG
GCAGTGTTTCTGCAGATTGTGCTAATATTTTGAATAAGGTGTTGGATGTATATTTGAAGCAGCTAATTAGGTCTTGTGTTGACTTGGTTGGAGCATGGCCTGCATTTGAG
CCTGAGAAATCCCTTGCCCATAAGCAGCAGGTTCAGGGGAAGGTTATCAATGGCATGTTGCCGAATAATCAATTACATGGACGACATAGCAACGGAAATGGAGAAGTTGA
ACACGAGCACAGATTACAGTGCTCGGTATCATTGCTCGACTTCAAGGTAGCAATGGAGCTTAATCCAACACAACTTGGGGAAGACTGGCCTTTGCTACTGGAGAAAATTT
GTATGCGAGCATTCGAGGAATGAAACGGCTCTGATATTTCTGTTTATCCCATTCCACAGTTTAGCTGACCTGTCACAGTTCAGAGGGGTTGAGTATGGATCTCGAAGACC
TCTCCGCTCTGCTCATTTGTGAGTAAACAAATATTGCTCAAAACTTGATAAAAGCTCTGGCTTGCAAGATGACAAGCAGATCAAACATCAAAATCGATGTTCGTTCTTGG
GAATCATTTAACTGTGACTAAGCTTGGGCGCTGTCAGCCAGATTGCCCATCAGTGATAGATCTACGTTTCACATTGAAACTGGCTGTAGCCGTGATTTAAGGGTCTCGAG
CTTTTCGTGTAAATTTAGATCACTGAGGAAATATACAAAATTTTGCAGAGATACAGTTTGTAGCTTGGTGAGAGGTAATTACTGGTGCAAGAACTTAGTGTTGCTATTTG
TGTATTTAAATCTGAGAATTGATTTGAATTTCATATGAATAACTCATCAACAGACAACAATCTGCAGATGGTATCCTCTGTATCTTATGTTTTCTTTTTTCTTTTTTTTG
GTTAGATAGAAAGATCAGATGCTATAAAATGATATAGCATTTTTATTCTCTAAACATCTATGTTCAAAACGTTCCACATTTATAATGCAATAGTCAGCCACATTTAGATT
C
Protein sequenceShow/hide protein sequence
MVGKRKGSVGNMIAIVKKLGADRSKRYFFYLNRFLSQKLSKNEFDKLCCRVLGRENLWLHNQLIQSILKNACQAKAAPPIPVAGCPKTSTQSAKISPVIEDGNEDGGAVF
PTSTQGIPWSNGGFPASPRKCRTGIRDRKLKDRPSVLGPNGKVECNSHQSAGKEDGSCRIIVDNGDATPCDYKRPVQHLQGVAEQPENNTEARVQQPSGKQVLQNKIQVE
DREEAGQSNRLSVLRSGLLAPLGIPFCSASTGGARKTRPVDFGGDFSFSDIGHLLDTESLKRRMEQIAAVQGLGSVSADCANILNKVLDVYLKQLIRSCVDLVGAWPAFE
PEKSLAHKQQVQGKVINGMLPNNQLHGRHSNGNGEVEHEHRLQCSVSLLDFKVAMELNPTQLGEDWPLLLEKICMRAFEE