CuGenDBv2

CmaCh05G003360 (gene) of Cucurbita maxima (Rimu) v1.1 genome

Gene ID: CmaCh05G003360
Organism: Cucurbita maxima Rimu (Cucurbita maxima (Rimu) v1.1)
Description: NAD(P)-bd_dom domain-containing protein
Genome location: Cma_Chr05:1462726..1469993
RNA-Seq Expression: CmaCh05G003360
Synteny: CmaCh05G003360
Gene Ontology terms:
GO:0006352 - DNA-templated transcription, initiation (biological process)
GO:1901006 - ubiquinone-6 biosynthetic process (biological process)
GO:0005634 - nucleus (cellular component)
GO:0005739 - mitochondrion (cellular component)
GO:0016021 - integral component of membrane (cellular component)
GO:0044877 - protein-containing complex binding (molecular function)
InterPro domains:
IPR003923 - Transcription initiation factor TFIID, 23-30kDa subunit
IPR016040 - NAD(P)-binding domain
IPR036291 - NAD(P)-binding domain superfamily
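The genome location in this record is a single `chromosome:start..end` string. The following is a minimal Python sketch of splitting it into its parts. The function name `parse_location` is hypothetical, and the span calculation assumes the coordinates are 1-based and inclusive, which the page itself does not state.

```python
import re

def parse_location(location):
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))

chrom, start, end = parse_location("Cma_Chr05:1462726..1469993")
# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end - start + 1
print(chrom, start, end, span)
```

Under that assumption the locus spans 7268 bp, which is longer than the mRNA listed under Sequences.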


Homology
GenBank top hits (e value, % identity, alignment)
BBG95024.1 TBP-associated factor II 15 [Prunus dulcis] | e value: 4.0e-144 | % identity: 65.68
Query:  TDSNKVDEPFKVEEAETVNVPPPPTEKLLVLGGNGFVGSHICQEAINRGLTVASLSRSGRSSIRDSWANNVIWHQGNLLSPDSLNEAFDGVTAV------
        T   +VDEPFKV EAETVN+PPPPTEKLLVLGGNGFVGSH+C+EA++RGL+VASLSRSGRSS+ D WA+NV WH+GNLLSP+SL +A DGVT+V      
Subjt:  TDSNKVDEPFKVEEAETVNVPPPPTEKLLVLGGNGFVGSHICQEAINRGLTVASLSRSGRSSIRDSWANNVIWHQGNLLSPDSLNEAFDGVTAV------

Query:  ------MYKINGTANINAIRVAADKGAVFMWQYQSLAGVKRYVYISAADFGLVNYLLQGYYEGKRAAETELLTKFPYGGILPVVRVILRPGFIYGTRNVG
              MYKINGTANINAIR A+++            GVKR+VY+SAADFG+            RAAETELLTKFPYGG      VILRPGFIYGTR+VG
Subjt:  ------MYKINGTANINAIRVAADKGAVFMWQYQSLAGVKRYVYISAADFGLVNYLLQGYYEGKRAAETELLTKFPYGGILPVVRVILRPGFIYGTRNVG

Query:  SMKLPLGVIGSPLEMVLQHAKPLHQLPLVGPLLTPPVSVTSVARVSVRAATDPVFPPGIIDVYGIQRSN--SNELRSKR----GKTKKMNHS-------Q
        S+KLPLGVIGSPLEM+ QH +PL+QLPLVGPL TPPV+VT+VA V+VRAATDPVFPPGI+DVYGIQR N  SN  R        + +KMNH+       Q
Subjt:  SMKLPLGVIGSPLEMVLQHAKPLHQLPLVGPLLTPPVSVTSVARVSVRAATDPVFPPGIIDVYGIQRSN--SNELRSKR----GKTKKMNHS-------Q

Query:  QATGSRHDDDAALSEFLASLMEYTPTVGRDMCLPGLFCSSVIEHEDGLFRVYSVLSRIRLVAVATQKFVADVASDALQHCKARQAAVVKDKRDKQQKDKR
         + GSRHDDDAAL+EFLASLM+YTPT+  D  +      S  +  D        +  IRLVAVATQKFV++VA+DALQ CKARQA+VVKDKRDKQQKDKR
Subjt:  QATGSRHDDDAALSEFLASLMEYTPTVGRDMCLPGLFCSSVIEHEDGLFRVYSVLSRIRLVAVATQKFVADVASDALQHCKARQAAVVKDKRDKQQKDKR

Query:  LILTMDDLSKALREYGVNVKHQEYFADSPSTGVDSTSREE
        LILTM+DLS+ALREYGVNVKHQEYFADSPSTG+D  SREE
Subjt:  LILTMDDLSKALREYGVNVKHQEYFADSPSTGVDSTSREE

KAA0064867.1 uncharacterized protein E6C27_scaffold82G001800 [Cucumis melo var. makuwa] | e value: 1.2e-169 | % identity: 72.71
Query:  TVTAFQSGRPFSTDSNKVDEPFKVEEAETVNVPPPPTEKLLVLGGNGFVGSHICQEAINRGLTVASLSRSGRSSIRDSWANNVIWHQ-------------
        T+ A +SGRPFSTDSNK+DEPFKVEEAETVNVPPPPTEKLLVLGGNGFVGSHICQEA+NRGLTVASLSRSGRSSIRDSWAN+VIWHQ             
Subjt:  TVTAFQSGRPFSTDSNKVDEPFKVEEAETVNVPPPPTEKLLVLGGNGFVGSHICQEAINRGLTVASLSRSGRSSIRDSWANNVIWHQ-------------

Query:  ---------------GNLLSPDSLNEAFDGVTAV------------MYKINGTANINAIRVAADKGAVFMWQYQSLAGVKRYVYISAADFGLVNYLLQGY
                       GNLLSPDSLNEAFDGVTAV            MYKINGTANINAIRVA+DK            GVKR+VYISAADFGL NYLLQGY
Subjt:  ---------------GNLLSPDSLNEAFDGVTAV------------MYKINGTANINAIRVAADKGAVFMWQYQSLAGVKRYVYISAADFGLVNYLLQGY

Query:  YEGKRAAETELLTKFPYGGILPVVRVILRPGFIYGTRNVGSMKLPLGVIGSPLEMVLQHAKPLHQLPLVGPLLTPPVSVTSVARVSVRAATDPVFPPGII
        YEGKRAAETELLTKFPYGG      VILRPGFIYGTRNVGS+KLPLGVIGSPLEMVLQHAKPLHQLPL+GPL TPPVSVTSVARVSVRAATDPVFPPGII
Subjt:  YEGKRAAETELLTKFPYGGILPVVRVILRPGFIYGTRNVGSMKLPLGVIGSPLEMVLQHAKPLHQLPLVGPLLTPPVSVTSVARVSVRAATDPVFPPGII

Query:  DVYGIQRSNSNELRSKRGKTKKMNHSQQATGSRHDDDAALSEFLASLMEYTPTVGRDMCLPGLFCSSVIEHEDGL--FRVYSVLSRIRLVAVATQKFVAD
        D+Y                           GSRHDDDAALSEFLASLMEYTPT+  ++          +EH  G   F+   V   IRLVAVATQKFVAD
Subjt:  DVYGIQRSNSNELRSKRGKTKKMNHSQQATGSRHDDDAALSEFLASLMEYTPTVGRDMCLPGLFCSSVIEHEDGL--FRVYSVLSRIRLVAVATQKFVAD

Query:  VASDALQHCKARQAAVVKDKRDKQQKDKRLILTMDDLSKALREYGVNVKHQEYFADSPSTGVDSTSREE
        VASDALQHCKARQAAVVKDKRDKQQKDKRLILTM+DLSKALREYGVNVKHQEYFADSPSTGVDSTSREE
Subjt:  VASDALQHCKARQAAVVKDKRDKQQKDKRLILTMDDLSKALREYGVNVKHQEYFADSPSTGVDSTSREE

KAG6598532.1 Transcription initiation factor TFIID subunit 10, partial [Cucurbita argyrosperma subsp. sororia] | e value: 3.5e-180 | % identity: 80.18
Query:  TVTAFQSGRPFSTDSNKVDEPFKVEEAETVNVPPPPTEKLLVLGGNGFVGSHICQEAINRGLTVASLSRSGRSSIRDSWANNVIWHQGNLLSPDSLNEAF
        TVTAFQSGRPFSTDSNKVDEPFKVEEAETVNVPPPPTEKLLVLGGNGFVGSHICQEA+NRGLTVASLSRSGRSSIRDSWANNVIWHQGNLLSPDSLNEAF
Subjt:  TVTAFQSGRPFSTDSNKVDEPFKVEEAETVNVPPPPTEKLLVLGGNGFVGSHICQEAINRGLTVASLSRSGRSSIRDSWANNVIWHQGNLLSPDSLNEAF

Query:  DGVTAV------------MYKINGTANINAIRVAADKGAVFMWQYQSLAGVKRYVYISAADFGLVNYLLQGYYEGKRAAETELLTKFPYGGILPVVRVIL
        DGVTAV            MYKINGTANINAIRVAADK            GVKRYVYISAADFGLVNYLLQGYYEGKRAAETELLTKFPYGG      VIL
Subjt:  DGVTAV------------MYKINGTANINAIRVAADKGAVFMWQYQSLAGVKRYVYISAADFGLVNYLLQGYYEGKRAAETELLTKFPYGGILPVVRVIL

Query:  RPGFIYGTRNVGSMKLPLGVIGSPLEMVLQHAKPLHQLPLVGPLLTPP---VSVTSVARVSVRAATDPVFPPGIIDVYGIQRSNSNELRSKRGKTKKMNH
        RPGFIYGTRNVGSMKLPLGVIGSPLEMVLQHAKPLHQLPLVGPLLTPP   V V S+  + +    + VF         +  SNSNELRSKRGKTKKMNH
Subjt:  RPGFIYGTRNVGSMKLPLGVIGSPLEMVLQHAKPLHQLPLVGPLLTPP---VSVTSVARVSVRAATDPVFPPGIIDVYGIQRSNSNELRSKRGKTKKMNH

Query:  SQQATGSRHDDDAALSEFLASLMEYTPTVGRDMCLPGLFCSSVIEHEDGL--FRVYSVLSRIRLVAVATQKFVADVASDALQHCKARQAAVVKDKRDKQQ
        SQQATGSRHDDDAALSEFLASLMEYTPT+  ++          +EH  G   F+   V   IRLVAVATQKFVADVASDALQHCKARQAAVVKDKRDKQQ
Subjt:  SQQATGSRHDDDAALSEFLASLMEYTPTVGRDMCLPGLFCSSVIEHEDGL--FRVYSVLSRIRLVAVATQKFVADVASDALQHCKARQAAVVKDKRDKQQ

Query:  KDKRLILTMDDLSKALREYGVNVKHQEYFADSPSTGVDSTSREE
        KDKRLILTMDDLSKALREYGVNVKHQEYFADSPSTGVDSTSREE
Subjt:  KDKRLILTMDDLSKALREYGVNVKHQEYFADSPSTGVDSTSREE

KAG7029463.1 putative protein, chloroplastic, partial [Cucurbita argyrosperma subsp. argyrosperma] | e value: 4.4e-215 | % identity: 87.64
Query:  IVRTVTAFQSGRPFSTDSNKVDEPFKVEEAETVNVPPPPTEKLLVLGGNGFVGSHICQEAINRGLTVASLSRSGRSSIRDSWANNVIWHQGNLLSPDSLN
        +  TVTAFQSGRPFSTDSNKVDEPFKVEEAETVNVPPPPTEKLLVLGGNGFVGSHICQEA+NRGLTVASLSRSGRSSIRDSWANNVIWHQGNLLSPDSLN
Subjt:  IVRTVTAFQSGRPFSTDSNKVDEPFKVEEAETVNVPPPPTEKLLVLGGNGFVGSHICQEAINRGLTVASLSRSGRSSIRDSWANNVIWHQGNLLSPDSLN

Query:  EAFDGVTAV------------MYKINGTANINAIRVAADKGAVFMWQYQSLAGVKRYVYISAADFGLVNYLLQGYYEGKRAAETELLTKFPYGGILPVVR
        EAFDGVTAV            MYKINGTANINAIRVAADK            GVKRYVYISAADFGLVNYLLQGYYEGKRAAETELLTKFPYGG      
Subjt:  EAFDGVTAV------------MYKINGTANINAIRVAADKGAVFMWQYQSLAGVKRYVYISAADFGLVNYLLQGYYEGKRAAETELLTKFPYGGILPVVR

Query:  VILRPGFIYGTRNVGSMKLPLGVIGSPLEMVLQHAKPLHQLPLVGPLLTPPVSVTSVARVSVRAATDPVFPPGIIDVYGIQRSNSNELRSKRGKTKKMNH
        VILRPGFIYGTRNVGSMKLPLGVIGSPLEMVLQHAKPLHQLPLVGPLLTPPVSVTSVARVSVRAATDPVFPPGIIDVYGIQRSNSNELRSKRGKTKKMNH
Subjt:  VILRPGFIYGTRNVGSMKLPLGVIGSPLEMVLQHAKPLHQLPLVGPLLTPPVSVTSVARVSVRAATDPVFPPGIIDVYGIQRSNSNELRSKRGKTKKMNH

Query:  SQQATGSRHDDDAALSEFLASLMEYTPT-------------------VGRDMCLPGLFCSSVIEHEDGLFRVYSVLSRIRLVAVATQKFVADVASDALQH
        SQQATGSRHDDDAALSEFLASLMEYTPT                   VGRDMCLPGLFCSSVIEHEDGLF VYS  S IRLVAVATQKFVADVASDALQH
Subjt:  SQQATGSRHDDDAALSEFLASLMEYTPT-------------------VGRDMCLPGLFCSSVIEHEDGLFRVYSVLSRIRLVAVATQKFVADVASDALQH

Query:  CKARQAAVVKDKRDKQQKDKRLILTMDDLSKALREYGVNVKHQEYFADSPSTGVDSTSREE
        CKARQAAVVKDKRDKQQKDKRLILTMDDLSKALREYGVNVKHQEYFADSPSTGVDSTSREE
Subjt:  CKARQAAVVKDKRDKQQKDKRLILTMDDLSKALREYGVNVKHQEYFADSPSTGVDSTSREE

XP_022997337.1 uncharacterized protein At1g32220, chloroplastic [Cucurbita maxima] | e value: 1.3e-129 | % identity: 89.25
Query:  TVTAFQSGRPFSTDSNKVDEPFKVEEAETVNVPPPPTEKLLVLGGNGFVGSHICQEAINRGLTVASLSRSGRSSIRDSWANNVIWHQGNLLSPDSLNEAF
        TVTAFQSGRPFSTDSNKVDEPFKVEEAETVNVPPPPTEKLLVLGGNGFVGSHICQEAINRGLTVASLSRSGRSSIRDSWANNVIWHQGNLLSPDSLNEAF
Subjt:  TVTAFQSGRPFSTDSNKVDEPFKVEEAETVNVPPPPTEKLLVLGGNGFVGSHICQEAINRGLTVASLSRSGRSSIRDSWANNVIWHQGNLLSPDSLNEAF

Query:  DGVTAV------------MYKINGTANINAIRVAADKGAVFMWQYQSLAGVKRYVYISAADFGLVNYLLQGYYEGKRAAETELLTKFPYGGILPVVRVIL
        DGVTAV            MYKINGTANINAIRVAADK            GVKRYVYISAADFGLVNYLLQGYYEGKRAAETELLTKFPYGG      VIL
Subjt:  DGVTAV------------MYKINGTANINAIRVAADKGAVFMWQYQSLAGVKRYVYISAADFGLVNYLLQGYYEGKRAAETELLTKFPYGGILPVVRVIL

Query:  RPGFIYGTRNVGSMKLPLGVIGSPLEMVLQHAKPLHQLPLVGPLLTPPVSVTSVARVSVRAATDPVFPPGIIDVYGIQR
        RPGFIYGTRNVGSMKLPLGVIGSPLEMVLQHAKPLHQLPLVGPLLTPPVSVTSVARVSVRAATDPVFPPGIIDVYGIQR
Subjt:  RPGFIYGTRNVGSMKLPLGVIGSPLEMVLQHAKPLHQLPLVGPLLTPPVSVTSVARVSVRAATDPVFPPGIIDVYGIQR

TrEMBL top hits (e value, % identity, alignment)
A0A4Y1QT38 TBP-associated factor II 15 | e value: 2.0e-144 | % identity: 65.68
Query:  TDSNKVDEPFKVEEAETVNVPPPPTEKLLVLGGNGFVGSHICQEAINRGLTVASLSRSGRSSIRDSWANNVIWHQGNLLSPDSLNEAFDGVTAV------
        T   +VDEPFKV EAETVN+PPPPTEKLLVLGGNGFVGSH+C+EA++RGL+VASLSRSGRSS+ D WA+NV WH+GNLLSP+SL +A DGVT+V      
Subjt:  TDSNKVDEPFKVEEAETVNVPPPPTEKLLVLGGNGFVGSHICQEAINRGLTVASLSRSGRSSIRDSWANNVIWHQGNLLSPDSLNEAFDGVTAV------

Query:  ------MYKINGTANINAIRVAADKGAVFMWQYQSLAGVKRYVYISAADFGLVNYLLQGYYEGKRAAETELLTKFPYGGILPVVRVILRPGFIYGTRNVG
              MYKINGTANINAIR A+++            GVKR+VY+SAADFG+            RAAETELLTKFPYGG      VILRPGFIYGTR+VG
Subjt:  ------MYKINGTANINAIRVAADKGAVFMWQYQSLAGVKRYVYISAADFGLVNYLLQGYYEGKRAAETELLTKFPYGGILPVVRVILRPGFIYGTRNVG

Query:  SMKLPLGVIGSPLEMVLQHAKPLHQLPLVGPLLTPPVSVTSVARVSVRAATDPVFPPGIIDVYGIQRSN--SNELRSKR----GKTKKMNHS-------Q
        S+KLPLGVIGSPLEM+ QH +PL+QLPLVGPL TPPV+VT+VA V+VRAATDPVFPPGI+DVYGIQR N  SN  R        + +KMNH+       Q
Subjt:  SMKLPLGVIGSPLEMVLQHAKPLHQLPLVGPLLTPPVSVTSVARVSVRAATDPVFPPGIIDVYGIQRSN--SNELRSKR----GKTKKMNHS-------Q

Query:  QATGSRHDDDAALSEFLASLMEYTPTVGRDMCLPGLFCSSVIEHEDGLFRVYSVLSRIRLVAVATQKFVADVASDALQHCKARQAAVVKDKRDKQQKDKR
         + GSRHDDDAAL+EFLASLM+YTPT+  D  +      S  +  D        +  IRLVAVATQKFV++VA+DALQ CKARQA+VVKDKRDKQQKDKR
Subjt:  QATGSRHDDDAALSEFLASLMEYTPTVGRDMCLPGLFCSSVIEHEDGLFRVYSVLSRIRLVAVATQKFVADVASDALQHCKARQAAVVKDKRDKQQKDKR

Query:  LILTMDDLSKALREYGVNVKHQEYFADSPSTGVDSTSREE
        LILTM+DLS+ALREYGVNVKHQEYFADSPSTG+D  SREE
Subjt:  LILTMDDLSKALREYGVNVKHQEYFADSPSTGVDSTSREE

A0A5A7VCX0 NAD(P)-bd_dom domain-containing protein | e value: 6.0e-170 | % identity: 72.71
Query:  TVTAFQSGRPFSTDSNKVDEPFKVEEAETVNVPPPPTEKLLVLGGNGFVGSHICQEAINRGLTVASLSRSGRSSIRDSWANNVIWHQ-------------
        T+ A +SGRPFSTDSNK+DEPFKVEEAETVNVPPPPTEKLLVLGGNGFVGSHICQEA+NRGLTVASLSRSGRSSIRDSWAN+VIWHQ             
Subjt:  TVTAFQSGRPFSTDSNKVDEPFKVEEAETVNVPPPPTEKLLVLGGNGFVGSHICQEAINRGLTVASLSRSGRSSIRDSWANNVIWHQ-------------

Query:  ---------------GNLLSPDSLNEAFDGVTAV------------MYKINGTANINAIRVAADKGAVFMWQYQSLAGVKRYVYISAADFGLVNYLLQGY
                       GNLLSPDSLNEAFDGVTAV            MYKINGTANINAIRVA+DK            GVKR+VYISAADFGL NYLLQGY
Subjt:  ---------------GNLLSPDSLNEAFDGVTAV------------MYKINGTANINAIRVAADKGAVFMWQYQSLAGVKRYVYISAADFGLVNYLLQGY

Query:  YEGKRAAETELLTKFPYGGILPVVRVILRPGFIYGTRNVGSMKLPLGVIGSPLEMVLQHAKPLHQLPLVGPLLTPPVSVTSVARVSVRAATDPVFPPGII
        YEGKRAAETELLTKFPYGG      VILRPGFIYGTRNVGS+KLPLGVIGSPLEMVLQHAKPLHQLPL+GPL TPPVSVTSVARVSVRAATDPVFPPGII
Subjt:  YEGKRAAETELLTKFPYGGILPVVRVILRPGFIYGTRNVGSMKLPLGVIGSPLEMVLQHAKPLHQLPLVGPLLTPPVSVTSVARVSVRAATDPVFPPGII

Query:  DVYGIQRSNSNELRSKRGKTKKMNHSQQATGSRHDDDAALSEFLASLMEYTPTVGRDMCLPGLFCSSVIEHEDGL--FRVYSVLSRIRLVAVATQKFVAD
        D+Y                           GSRHDDDAALSEFLASLMEYTPT+  ++          +EH  G   F+   V   IRLVAVATQKFVAD
Subjt:  DVYGIQRSNSNELRSKRGKTKKMNHSQQATGSRHDDDAALSEFLASLMEYTPTVGRDMCLPGLFCSSVIEHEDGL--FRVYSVLSRIRLVAVATQKFVAD

Query:  VASDALQHCKARQAAVVKDKRDKQQKDKRLILTMDDLSKALREYGVNVKHQEYFADSPSTGVDSTSREE
        VASDALQHCKARQAAVVKDKRDKQQKDKRLILTM+DLSKALREYGVNVKHQEYFADSPSTGVDSTSREE
Subjt:  VASDALQHCKARQAAVVKDKRDKQQKDKRLILTMDDLSKALREYGVNVKHQEYFADSPSTGVDSTSREE

A0A5H2XKG1 TBP-associated factor II 15 | e value: 1.0e-129 | % identity: 64.11
Query:  KLLVLGGNGFVGSHICQEAINRGLTVASLSRSGRSSIRDSWANNVIWHQGNLLSPDSLNEAFDGVTAV------------MYKINGTANINAIRVAADKG
        +LLVLGGNGFVGSH+C+EA++RGL+VASLSRSGRSS+ D WA+NV WH+GNLLSP+SL +A DGVT+V            MYKINGTANINAIR A+++ 
Subjt:  KLLVLGGNGFVGSHICQEAINRGLTVASLSRSGRSSIRDSWANNVIWHQGNLLSPDSLNEAFDGVTAV------------MYKINGTANINAIRVAADKG

Query:  AVFMWQYQSLAGVKRYVYISAADFGLVNYLLQGYYEGKRAAETELLTKFPYGGILPVVRVILRPGFIYGTRNVGSMKLPLGVIGSPLEM----VLQHAKP
                   GVKR+VY+SAADFG+            RAAETELLTKFPYGG      VILRPGFIYGTR+VGS+KLPLGVIGSPLEM    + QH +P
Subjt:  AVFMWQYQSLAGVKRYVYISAADFGLVNYLLQGYYEGKRAAETELLTKFPYGGILPVVRVILRPGFIYGTRNVGSMKLPLGVIGSPLEM----VLQHAKP

Query:  LHQLPLVGPLLTPPVSVTSVARVSVRAATDPVFPPGIIDVYGIQRSN--SNELRSKR----GKTKKMNHS-------QQATGSRHDDDAALSEFLASLME
        L+QLPLVGPL TPPV+VT+VA V+VRAATDPVFPPGI+DVYGIQR N  SN  R        + +KMNH+       Q + GSRHDDDAAL+EFLASLM+
Subjt:  LHQLPLVGPLLTPPVSVTSVARVSVRAATDPVFPPGIIDVYGIQRSN--SNELRSKR----GKTKKMNHS-------QQATGSRHDDDAALSEFLASLME

Query:  YTPTVGRDMCLPGLFCSSVIEHEDGLFRVYSVLSRIRLVAVATQKFVADVASDALQHCKARQAAVVKDKRDKQQKDKRLILTMDDLSKALREYGVNVKHQ
        YTPT+  D  +      S  +  D        +  IRLVAVATQKFV++VA+DALQ CKARQA+VVKDKRDKQQKDKRLILTM+DLS+ALREYGVNVKHQ
Subjt:  YTPTVGRDMCLPGLFCSSVIEHEDGLFRVYSVLSRIRLVAVATQKFVADVASDALQHCKARQAAVVKDKRDKQQKDKRLILTMDDLSKALREYGVNVKHQ

Query:  EYFADSPSTGVDSTSREE
        EYFADSPSTG+D  SREE
Subjt:  EYFADSPSTGVDSTSREE

A0A6J1HD94 uncharacterized protein At1g32220, chloroplastic | e value: 1.0e-129 | % identity: 88.89
Query:  TVTAFQSGRPFSTDSNKVDEPFKVEEAETVNVPPPPTEKLLVLGGNGFVGSHICQEAINRGLTVASLSRSGRSSIRDSWANNVIWHQGNLLSPDSLNEAF
        TVTAFQSGRPFSTDSNKVDEPFKVEEAETVNVPPPPTEKLLVLGGNGFVGSHICQEA+NRGLTVASLSRSGRSSIRDSWANNVIWHQGNLLSPDSLNEAF
Subjt:  TVTAFQSGRPFSTDSNKVDEPFKVEEAETVNVPPPPTEKLLVLGGNGFVGSHICQEAINRGLTVASLSRSGRSSIRDSWANNVIWHQGNLLSPDSLNEAF

Query:  DGVTAV------------MYKINGTANINAIRVAADKGAVFMWQYQSLAGVKRYVYISAADFGLVNYLLQGYYEGKRAAETELLTKFPYGGILPVVRVIL
        DGVTAV            MYKINGTANINAIRVAADK            GVKRYVYISAADFGLVNYLLQGYYEGKRAAETELLTKFPYGG      VIL
Subjt:  DGVTAV------------MYKINGTANINAIRVAADKGAVFMWQYQSLAGVKRYVYISAADFGLVNYLLQGYYEGKRAAETELLTKFPYGGILPVVRVIL

Query:  RPGFIYGTRNVGSMKLPLGVIGSPLEMVLQHAKPLHQLPLVGPLLTPPVSVTSVARVSVRAATDPVFPPGIIDVYGIQR
        RPGFIYGTRNVGSMKLPLGVIGSPLEMVLQHAKPLHQLPLVGPLLTPPVSVTSVARVSVRAATDPVFPPGIIDVYGIQR
Subjt:  RPGFIYGTRNVGSMKLPLGVIGSPLEMVLQHAKPLHQLPLVGPLLTPPVSVTSVARVSVRAATDPVFPPGIIDVYGIQR

A0A6J1K4Q1 uncharacterized protein At1g32220, chloroplastic | e value: 6.1e-130 | % identity: 89.25
Query:  TVTAFQSGRPFSTDSNKVDEPFKVEEAETVNVPPPPTEKLLVLGGNGFVGSHICQEAINRGLTVASLSRSGRSSIRDSWANNVIWHQGNLLSPDSLNEAF
        TVTAFQSGRPFSTDSNKVDEPFKVEEAETVNVPPPPTEKLLVLGGNGFVGSHICQEAINRGLTVASLSRSGRSSIRDSWANNVIWHQGNLLSPDSLNEAF
Subjt:  TVTAFQSGRPFSTDSNKVDEPFKVEEAETVNVPPPPTEKLLVLGGNGFVGSHICQEAINRGLTVASLSRSGRSSIRDSWANNVIWHQGNLLSPDSLNEAF

Query:  DGVTAV------------MYKINGTANINAIRVAADKGAVFMWQYQSLAGVKRYVYISAADFGLVNYLLQGYYEGKRAAETELLTKFPYGGILPVVRVIL
        DGVTAV            MYKINGTANINAIRVAADK            GVKRYVYISAADFGLVNYLLQGYYEGKRAAETELLTKFPYGG      VIL
Subjt:  DGVTAV------------MYKINGTANINAIRVAADKGAVFMWQYQSLAGVKRYVYISAADFGLVNYLLQGYYEGKRAAETELLTKFPYGGILPVVRVIL

Query:  RPGFIYGTRNVGSMKLPLGVIGSPLEMVLQHAKPLHQLPLVGPLLTPPVSVTSVARVSVRAATDPVFPPGIIDVYGIQR
        RPGFIYGTRNVGSMKLPLGVIGSPLEMVLQHAKPLHQLPLVGPLLTPPVSVTSVARVSVRAATDPVFPPGIIDVYGIQR
Subjt:  RPGFIYGTRNVGSMKLPLGVIGSPLEMVLQHAKPLHQLPLVGPLLTPPVSVTSVARVSVRAATDPVFPPGIIDVYGIQR

SwissProt top hits (e value, % identity, alignment)
O04173 Transcription initiation factor TFIID subunit 10 | e value: 1.8e-41 | % identity: 66.21
Query:  MNHSQQATGSRHDDDAALSEFLASLMEYTPTVGRDMCLPGLFCSSVIEHEDGLFRVYSVLSRIRLVAVATQKFVADVASDALQHCKARQAAVVKDKRDKQ
        MNH QQ+  ++H+DDAAL+EFLASLM+YTPT+  D+ +      S  +  D        +  IRLVAVATQKFVADVASDALQHCKAR A VVKDK  KQ
Subjt:  MNHSQQATGSRHDDDAALSEFLASLMEYTPTVGRDMCLPGLFCSSVIEHEDGLFRVYSVLSRIRLVAVATQKFVADVASDALQHCKARQAAVVKDKRDKQ

Query:  QKDKRLILTMDDLSKALREYGVNVKHQEYFADSPSTGVDSTSREE
        QKDKRL+LTM+DLSKALREYGVNVKH EYFADSPSTG+D  +R+E
Subjt:  QKDKRLILTMDDLSKALREYGVNVKHQEYFADSPSTGVDSTSREE

Q05892 MIOREX complex component 2 | e value: 1.6e-10 | % identity: 27.21
Query:  KLLVLGGNGFVGSHICQEAINRGLTVASLSRSGR----SSIRD-SWANNVIWHQGNLLSPDSLNEAFDGVTAVMYK------------------------
        KL+V GGNGF+G  ICQEA+  G  V S+SRSG+    + + D  W   V W   ++  PDS +E  +  T V++                         
Subjt:  KLLVLGGNGFVGSHICQEAINRGLTVASLSRSGR----SSIRD-SWANNVIWHQGNLLSPDSLNEAFDGVTAVMYK------------------------

Query:  --INGTANINAIRVAADKGAVFMWQYQSL---------------------AGVKRYVYISAADFGLVNYLLQGYYEGKRAAETELLTKFPYGGILPVVRV
          ++  A  N ++ ++      M   QS                      A  + + YIS AD G    +  GY   KR AE EL     Y        +
Subjt:  --INGTANINAIRVAADKGAVFMWQYQSL---------------------AGVKRYVYISAADFGLVNYLLQGYYEGKRAAETELLTKFPYGGILPVVRV

Query:  ILRPGFIYGT-RNVGSMKLPLGVIGSPLEMVLQHAKPL--HQLPLVGPLLTPPVSVTSVARVSVRAATDPVF
        I+RPGF++   RN      P   I + LE++    K L  ++L L+  L+ P VS   V++  ++   +P F
Subjt:  ILRPGFIYGT-RNVGSMKLPLGVIGSPLEMVLQHAKPL--HQLPLVGPLLTPPVSVTSVARVSVRAATDPVF

Q12962 Transcription initiation factor TFIID subunit 10 | e value: 3.8e-12 | % identity: 39.47
Query:  LSEFLASLMEYTPTVGRDMCLPGLFCSSV-IEHEDGLFRVYSVLSRIRLVAVATQKFVADVASDALQHCKARQAAVVKDKRDKQQKDKRLILTMDDLSKA
        L +FL  L +YTPT+     + G + +    E  D           IRL+++A QKF++D+A+DALQHCK +  A    +   + KD++  LTM+DL+ A
Subjt:  LSEFLASLMEYTPTVGRDMCLPGLFCSSV-IEHEDGLFRVYSVLSRIRLVAVATQKFVADVASDALQHCKARQAAVVKDKRDKQQKDKRLILTMDDLSKA

Query:  LREYGVNVKHQEYF
        L EYG+NVK   YF
Subjt:  LREYGVNVKHQEYF

Q8K0H5 Transcription initiation factor TFIID subunit 10 | e value: 3.8e-12 | % identity: 39.47
Query:  LSEFLASLMEYTPTVGRDMCLPGLFCSSV-IEHEDGLFRVYSVLSRIRLVAVATQKFVADVASDALQHCKARQAAVVKDKRDKQQKDKRLILTMDDLSKA
        L +FL  L +YTPT+     + G + +    E  D           IRL+++A QKF++D+A+DALQHCK +  A    +   + KD++  LTM+DL+ A
Subjt:  LSEFLASLMEYTPTVGRDMCLPGLFCSSV-IEHEDGLFRVYSVLSRIRLVAVATQKFVADVASDALQHCKARQAAVVKDKRDKQQKDKRLILTMDDLSKA

Query:  LREYGVNVKHQEYF
        L EYG+NVK   YF
Subjt:  LREYGVNVKHQEYF

Q9FVR6 Uncharacterized protein At1g32220, chloroplastic | e value: 2.2e-36 | % identity: 39.41
Query:  TEKLLVLGGNGFVGSHICQEAINRGLTVASLSRSGRSSIRDSWANNVIWHQGNLLSPDSLNEAFDGVTAV------------MYKINGTANINAIRVAAD
        +E+++VLGGNGFVGS IC+ AI+ G+ V S+SRSGR +  DSW + V W  G++    + +E   G TAV            M +ING AN+ A+  A D
Subjt:  TEKLLVLGGNGFVGSHICQEAINRGLTVASLSRSGRSSIRDSWANNVIWHQGNLLSPDSLNEAFDGVTAV------------MYKINGTANINAIRVAAD

Query:  KGAVFMWQYQSLAGVKRYVYISAADFGLVNYLL-QGYYEGKRAAETELLTKFPYGGILPVVRVILRPGFIYGTRNVGSMKLPLGVIGSPLEMVLQHA---
                     GV ++V I+  D+ L  ++L  GY+ GKR AE ELL+K+P  G      V+LRPGFIYG R V  +++PL ++G PL+ +   A   
Subjt:  KGAVFMWQYQSLAGVKRYVYISAADFGLVNYLL-QGYYEGKRAAETELLTKFPYGGILPVVRVILRPGFIYGTRNVGSMKLPLGVIGSPLEMVLQHA---

Query:  -KPLHQLPLVGPLLTPPVSVTSVARVSVRAATDPVF
         +PL  LP    +L PPV+V  +A   + A  D  F
Subjt:  -KPLHQLPLVGPLLTPPVSVTSVARVSVRAATDPVF

Arabidopsis top hits (e value, % identity, alignment)
AT1G32220.1 NAD(P)-binding Rossmann-fold superfamily protein | e value: 1.6e-37 | % identity: 39.41
Query:  TEKLLVLGGNGFVGSHICQEAINRGLTVASLSRSGRSSIRDSWANNVIWHQGNLLSPDSLNEAFDGVTAV------------MYKINGTANINAIRVAAD
        +E+++VLGGNGFVGS IC+ AI+ G+ V S+SRSGR +  DSW + V W  G++    + +E   G TAV            M +ING AN+ A+  A D
Subjt:  TEKLLVLGGNGFVGSHICQEAINRGLTVASLSRSGRSSIRDSWANNVIWHQGNLLSPDSLNEAFDGVTAV------------MYKINGTANINAIRVAAD

Query:  KGAVFMWQYQSLAGVKRYVYISAADFGLVNYLL-QGYYEGKRAAETELLTKFPYGGILPVVRVILRPGFIYGTRNVGSMKLPLGVIGSPLEMVLQHA---
                     GV ++V I+  D+ L  ++L  GY+ GKR AE ELL+K+P  G      V+LRPGFIYG R V  +++PL ++G PL+ +   A   
Subjt:  KGAVFMWQYQSLAGVKRYVYISAADFGLVNYLL-QGYYEGKRAAETELLTKFPYGGILPVVRVILRPGFIYGTRNVGSMKLPLGVIGSPLEMVLQHA---

Query:  -KPLHQLPLVGPLLTPPVSVTSVARVSVRAATDPVF
         +PL  LP    +L PPV+V  +A   + A  D  F
Subjt:  -KPLHQLPLVGPLLTPPVSVTSVARVSVRAATDPVF

AT4G31720.1 TBP-associated factor II 15 | e value: 1.2e-42 | % identity: 66.21
Query:  MNHSQQATGSRHDDDAALSEFLASLMEYTPTVGRDMCLPGLFCSSVIEHEDGLFRVYSVLSRIRLVAVATQKFVADVASDALQHCKARQAAVVKDKRDKQ
        MNH QQ+  ++H+DDAAL+EFLASLM+YTPT+  D+ +      S  +  D        +  IRLVAVATQKFVADVASDALQHCKAR A VVKDK  KQ
Subjt:  MNHSQQATGSRHDDDAALSEFLASLMEYTPTVGRDMCLPGLFCSSVIEHEDGLFRVYSVLSRIRLVAVATQKFVADVASDALQHCKARQAAVVKDKRDKQ

Query:  QKDKRLILTMDDLSKALREYGVNVKHQEYFADSPSTGVDSTSREE
        QKDKRL+LTM+DLSKALREYGVNVKH EYFADSPSTG+D  +R+E
Subjt:  QKDKRLILTMDDLSKALREYGVNVKHQEYFADSPSTGVDSTSREE

AT4G31720.2 TBP-associated factor II 15 | e value: 1.2e-42 | % identity: 66.21
Query:  MNHSQQATGSRHDDDAALSEFLASLMEYTPTVGRDMCLPGLFCSSVIEHEDGLFRVYSVLSRIRLVAVATQKFVADVASDALQHCKARQAAVVKDKRDKQ
        MNH QQ+  ++H+DDAAL+EFLASLM+YTPT+  D+ +      S  +  D        +  IRLVAVATQKFVADVASDALQHCKAR A VVKDK  KQ
Subjt:  MNHSQQATGSRHDDDAALSEFLASLMEYTPTVGRDMCLPGLFCSSVIEHEDGLFRVYSVLSRIRLVAVATQKFVADVASDALQHCKARQAAVVKDKRDKQ

Query:  QKDKRLILTMDDLSKALREYGVNVKHQEYFADSPSTGVDSTSREE
        QKDKRL+LTM+DLSKALREYGVNVKH EYFADSPSTG+D  +R+E
Subjt:  QKDKRLILTMDDLSKALREYGVNVKHQEYFADSPSTGVDSTSREE

AT5G10730.1 NAD(P)-binding Rossmann-fold superfamily protein | e value: 7.7e-109 | % identity: 69.79
Query:  VRTVTAFQSGRPFSTDSNKVDEPFKVEEAETVNVPPPPTEKLLVLGGNGFVGSHICQEAINRGLTVASLSRSGRSSIRDSWANNVIWHQGNLLSPDSLNE
        +R V+A   GR  STDSNK+DEPF VEEAETV+VPPPPTEKLLVLGGNGFVGSH+C+EA++RGL+V+SLSRSGRSS+++SWA+ V WHQGNLLS D L +
Subjt:  VRTVTAFQSGRPFSTDSNKVDEPFKVEEAETVNVPPPPTEKLLVLGGNGFVGSHICQEAINRGLTVASLSRSGRSSIRDSWANNVIWHQGNLLSPDSLNE

Query:  AFDGVTAV------------MYKINGTANINAIRVAADKGAVFMWQYQSLAGVKRYVYISAADFGLVNYLLQGYYEGKRAAETELLTKFPYGGILPVVRV
        A +GVT+V            MYKINGTANINAIR A++K            GVKR+VYISAADFGL NYLL+GYYEGKRAAETELLT+F YGGI      
Subjt:  AFDGVTAV------------MYKINGTANINAIRVAADKGAVFMWQYQSLAGVKRYVYISAADFGLVNYLLQGYYEGKRAAETELLTKFPYGGILPVVRV

Query:  ILRPGFIYGTRNVGSMKLPLGVIGSPLEMVLQHAKPLHQLPLVGPLLTPPVSVTSVARVSVRAATDPVFPPGIIDVYGIQRSNSNELR
        ILRPGFIYGTR+VGSMK+PLGV GSP+EMVLQ AKPL+QLPLVGPL TPPV+V SVA+V+VRAATDPVFPPGI+DV+GIQR +  + R
Subjt:  ILRPGFIYGTRNVGSMKLPLGVIGSPLEMVLQHAKPLHQLPLVGPLLTPPVSVTSVARVSVRAATDPVFPPGIIDVYGIQRSNSNELR

AT5G15910.1 NAD(P)-binding Rossmann-fold superfamily protein | e value: 6.0e-69 | % identity: 56.25
Query:  KLLVLGGNGFVGSHICQEAINRGLTVASLSRSGRSSIRDSWANNVIWHQGNLLSPDSLNEAFDGVTAV------------MYKINGTANINAIRVAADKG
        K+LVLGGNG+VGSHIC+EA+ +G +V+SLSRSGRSS+ DSW ++V WHQG+LLSPDSL  A +G+T+V            M +INGTANINA++ AA++ 
Subjt:  KLLVLGGNGFVGSHICQEAINRGLTVASLSRSGRSSIRDSWANNVIWHQGNLLSPDSLNEAFDGVTAV------------MYKINGTANINAIRVAADKG

Query:  AVFMWQYQSLAGVKRYVYISAADFGLVNYLLQGYYEGKRAAETELLTKFPYGGILPVVRVILRPGFIYGTRNVGSMKLPLGVIGSPLEMVLQ-HAKPLHQ
                   GVKR+VYISAADFG++N L++GY+EGKRA E E+L KF   G       +LRPGFI+GTR VGS+KLPL +IG+PLEMVL+   K + +
Subjt:  AVFMWQYQSLAGVKRYVYISAADFGLVNYLLQGYYEGKRAAETELLTKFPYGGILPVVRVILRPGFIYGTRNVGSMKLPLGVIGSPLEMVLQ-HAKPLHQ

Query:  LPLVGPLLTPPVSVTSVARVSVRAATDPVFPPGIIDVYGI
        +P++GPLL PPV+V SVA  +V+AA DP F  G+IDVY I
Subjt:  LPLVGPLLTPPVSVTSVARVSVRAATDPVFPPGIIDVYGI
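In the alignment blocks above, the unlabeled middle line appears to follow the usual BLAST convention: a letter marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. Assuming that convention holds here, the sketch below counts identical and positive columns for one block. The function name `midline_stats` is hypothetical, and the example uses only the final block of the Q12962 alignment, so its ratio is not expected to match the whole-alignment % identity reported for that hit.

```python
def midline_stats(query, midline):
    """Count identical and positive columns in one BLAST-style alignment block.

    Assumes letters in the midline mark identities and '+' marks
    conservative substitutions; the block length is taken from the query line.
    """
    length = len(query)
    padded = midline.ljust(length)[:length]  # restore trailing spaces if stripped
    identical = sum(1 for ch in padded if ch.isalpha())
    positive = identical + padded.count("+")
    return identical, positive, length

identical, positive, length = midline_stats("LREYGVNVKHQEYF", "L EYG+NVK   YF")
print(f"{identical}/{length} identical, {positive}/{length} positive")
```

Summing these counts over every block of a hit and dividing by the total alignment length would give a per-hit percentage comparable to the % identity column.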


Sequences
CDS sequence
ATGACTTCCTCGAGAGTTGTAAACTTTGATTATCTTGACTGTAGAATTGTTCGCACAGTGACTGCTTTTCAGAGCGGGAGACCATTTTCAACAGACTCTAACAAG
GTCGACGAACCGTTCAAGGTCGAGGAAGCTGAAACAGTTAATGTTCCCCCACCTCCAACTGAGAAGTTGCTGGTGCTGGGTGGAAATGGATTTGTGGGTTCTCAT
ATTTGTCAAGAAGCCATAAATCGTGGTCTTACAGTTGCTAGCCTTAGCAGGTCTGGTAGATCATCAATACGTGATTCTTGGGCGAACAATGTAATTTGGCATCAA
GGAAACCTTCTTTCACCTGATTCACTGAATGAAGCTTTTGACGGTGTTACGGCTGTTATGTACAAGATCAATGGGACGGCAAATATCAATGCAATCAGAGTTGCT
GCAGATAAAGGTGCAGTTTTTATGTGGCAGTACCAATCGCTTGCAGGTGTTAAGAGATATGTCTATATCTCTGCTGCTGATTTTGGCTTGGTTAATTACTTGCTA
CAGGGATATTATGAGGGAAAGCGAGCGGCTGAGACAGAACTCCTCACTAAATTTCCTTATGGAGGTATATTGCCTGTAGTACGAGTGATTTTGAGGCCGGGATTT
ATTTACGGGACCCGTAATGTCGGGAGCATGAAGTTACCTCTGGGAGTAATTGGCTCTCCTTTGGAAATGGTTCTTCAACACGCCAAACCACTACACCAGCTACCA
CTCGTTGGTCCTTTATTAACTCCTCCAGTAAGCGTGACTTCGGTGGCAAGGGTTTCTGTTCGAGCAGCAACGGACCCCGTCTTTCCTCCTGGCATCATCGACGTC
TATGGCATTCAACGGAGTAATTCGAACGAGCTTCGTTCGAAGAGAGGGAAGACGAAGAAGATGAACCATAGCCAACAAGCAACAGGGAGCAGGCACGACGATGAC
GCAGCGCTCTCCGAGTTTCTAGCATCTCTAATGGAGTATACTCCCACTGTCGGGCGTGACATGTGCTTACCTGGTTTGTTCTGTAGTAGTGTTATTGAACATGAA
GACGGCCTTTTTAGAGTTTATAGTGTGCTTTCTAGGATCAGGCTTGTCGCTGTTGCTACACAGAAGTTCGTTGCAGATGTTGCAAGTGACGCTCTCCAGCATTGT
AAGGCGAGACAGGCAGCAGTAGTGAAAGACAAAAGGGATAAACAACAAAAGGATAAGCGCTTAATATTGACCATGGATGATCTCTCGAAGGCACTTCGTGAGTAT
GGCGTGAATGTGAAACATCAAGAATATTTTGCTGATAGCCCTTCAACCGGAGTGGATTCTACTTCCAGAGAGGAATGA
mRNA sequence
ATGACTTCCTCGAGAGTTGTAAACTTTGATTATCTTGACTGTAGAATTGTTCGCACAGTGACTGCTTTTCAGAGCGGGAGACCATTTTCAACAGACTCTAACAAG
GTCGACGAACCGTTCAAGGTCGAGGAAGCTGAAACAGTTAATGTTCCCCCACCTCCAACTGAGAAGTTGCTGGTGCTGGGTGGAAATGGATTTGTGGGTTCTCAT
ATTTGTCAAGAAGCCATAAATCGTGGTCTTACAGTTGCTAGCCTTAGCAGGTCTGGTAGATCATCAATACGTGATTCTTGGGCGAACAATGTAATTTGGCATCAA
GGAAACCTTCTTTCACCTGATTCACTGAATGAAGCTTTTGACGGTGTTACGGCTGTTATGTACAAGATCAATGGGACGGCAAATATCAATGCAATCAGAGTTGCT
GCAGATAAAGGTGCAGTTTTTATGTGGCAGTACCAATCGCTTGCAGGTGTTAAGAGATATGTCTATATCTCTGCTGCTGATTTTGGCTTGGTTAATTACTTGCTA
CAGGGATATTATGAGGGAAAGCGAGCGGCTGAGACAGAACTCCTCACTAAATTTCCTTATGGAGGTATATTGCCTGTAGTACGAGTGATTTTGAGGCCGGGATTT
ATTTACGGGACCCGTAATGTCGGGAGCATGAAGTTACCTCTGGGAGTAATTGGCTCTCCTTTGGAAATGGTTCTTCAACACGCCAAACCACTACACCAGCTACCA
CTCGTTGGTCCTTTATTAACTCCTCCAGTAAGCGTGACTTCGGTGGCAAGGGTTTCTGTTCGAGCAGCAACGGACCCCGTCTTTCCTCCTGGCATCATCGACGTC
TATGGCATTCAACGGAGTAATTCGAACGAGCTTCGTTCGAAGAGAGGGAAGACGAAGAAGATGAACCATAGCCAACAAGCAACAGGGAGCAGGCACGACGATGAC
GCAGCGCTCTCCGAGTTTCTAGCATCTCTAATGGAGTATACTCCCACTGTCGGGCGTGACATGTGCTTACCTGGTTTGTTCTGTAGTAGTGTTATTGAACATGAA
GACGGCCTTTTTAGAGTTTATAGTGTGCTTTCTAGGATCAGGCTTGTCGCTGTTGCTACACAGAAGTTCGTTGCAGATGTTGCAAGTGACGCTCTCCAGCATTGT
AAGGCGAGACAGGCAGCAGTAGTGAAAGACAAAAGGGATAAACAACAAAAGGATAAGCGCTTAATATTGACCATGGATGATCTCTCGAAGGCACTTCGTGAGTAT
GGCGTGAATGTGAAACATCAAGAATATTTTGCTGATAGCCCTTCAACCGGAGTGGATTCTACTTCCAGAGAGGAATGAGTTGGAATAACAGTTAAAATGTCTCAT
CTTTGTGTTCGTGAATATACTGATGCATCATAGTTCAGGTTGGAGTACTCTTGCTGCCTGCGTCATGTTAGTATTGCTATCATTAAGTAGCCCGTCTGTGAACTG
TCATAGCGGTAGATATGTTAAGCGTTTTCTACATGTCTACTAAATGAAACTGTGCATAATGAAACCCAAAACATCATTCTTGTTCATACAAGGTTGTGTTATCAT
GATGCATTCTCCATGGATTTGCAACTTCAAATTAATCGTCCATGTCCGTTTATGTTTGTGTTTATGAGTTCGGTAGGACGACTTCATTTTTAAATAAACTCAATT
GTTTTGATTGAAATCTTTTTCATTGAAGTAAACTATT
Protein sequence
MTSSRVVNFDYLDCRIVRTVTAFQSGRPFSTDSNKVDEPFKVEEAETVNVPPPPTEKLLVLGGNGFVGSHICQEAINRGLTVASLSRSGRSSIRDSWANNVIWHQ
GNLLSPDSLNEAFDGVTAVMYKINGTANINAIRVAADKGAVFMWQYQSLAGVKRYVYISAADFGLVNYLLQGYYEGKRAAETELLTKFPYGGILPVVRVILRPGF
IYGTRNVGSMKLPLGVIGSPLEMVLQHAKPLHQLPLVGPLLTPPVSVTSVARVSVRAATDPVFPPGIIDVYGIQRSNSNELRSKRGKTKKMNHSQQATGSRHDDD
AALSEFLASLMEYTPTVGRDMCLPGLFCSSVIEHEDGLFRVYSVLSRIRLVAVATQKFVADVASDALQHCKARQAAVVKDKRDKQQKDKRLILTMDDLSKALREY
GVNVKHQEYFADSPSTGVDSTSREE
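The CDS and protein records above can be checked against each other by translating the CDS. The sketch below is a minimal illustration that assumes the standard genetic code (NCBI translation table 1), which the page does not state explicitly. The `translate` helper is hypothetical, and the two inputs are short in-frame fragments copied from the start and end of the CDS rather than the full sequence.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate an in-frame DNA sequence, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)

print(translate("ATGACTTCCTCGAGAGTTGTA"))  # first seven codons -> MTSSRVV
print(translate("TCCAGAGAGGAATGA"))        # last five codons   -> SREE
```

Both fragments agree with the start (`MTSSRVV`) and end (`SREE`) of the protein sequence listed above. Ambiguous bases such as `N` are not handled and would raise a `KeyError` in this sketch.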