; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0013068 (gene) of Snake gourd v1 genome

Gene IDTan0013068
OrganismTrichosanthes anguina (Snake gourd v1)
Descriptiontranscription factor MYB117
Genome locationLG04:9588189..9589992
RNA-Seq ExpressionTan0013068
SyntenyTan0013068
Gene Ontology termsGO:0006355 - regulation of transcription, DNA-templated (biological process)
GO:0005634 - nucleus (cellular component)
GO:0000978 - RNA polymerase II proximal promoter sequence-specific DNA binding (molecular function)
GO:0000981 - DNA-binding transcription factor activity, RNA polymerase II-specific (molecular function)
InterPro domainsIPR001005 - SANT/Myb domain
IPR009057 - Homeobox-like domain superfamily
IPR017930 - Myb domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6574076.1 Transcription factor MYB105, partial [Cucurbita argyrosperma subsp. sororia]1.1e-14274.81Show/hide
Query:  MGMLDD--DDDDDFGLKVEESLNLDLNCAAIVSSSQESCEVIENGRSGFWNFPFSCESMTRSSETDVGDNDFSDGFVENINNNNNIANPTSCSNTPSASG
        MGMLDD  DD DDFG+ VEESLNLDLN  AIVSSSQES EV+ENGR GFWNFPFSCES+TRSSET     DFSDGFVEN+N   N            ASG
Subjt:  MGMLDD--DDDDDFGLKVEESLNLDLNCAAIVSSSQESCEVIENGRSGFWNFPFSCESMTRSSETDVGDNDFSDGFVENINNNNNIANPTSCSNTPSASG

Query:  AQSRLCARGHWRPAEDTKLRELVAHYGPQNWNLIAEKLEGRSGKSCRLRWFNQLDPRINRRAFSEEEEERLMQAHRLYGNKWAMIARLFPGRTDNAVKNH
        AQSRLCARGHWRPAEDTKLRELVA YGPQNWNLIA+KLEGRSGKSCRLRWFNQLDPRINRR FSEEEEERLMQAHR+YGNKWAMIARLFPGRTDNAVKNH
Subjt:  AQSRLCARGHWRPAEDTKLRELVAHYGPQNWNLIAEKLEGRSGKSCRLRWFNQLDPRINRRAFSEEEEERLMQAHRLYGNKWAMIARLFPGRTDNAVKNH

Query:  WHVIMARKYREQSRSYRRRKLSQSVYRKMEQDLSFLNLPKDNNGDNRHTTTSSPFGNFHGSSVGAVDYGFLTQMAIAGGEPISTNH----PFFPSCSQLS
        WHVIMARKYREQSRSYRRRKLSQSVYRKME+DLSFL LPK+   DNRH T  SPF        GAVDYGFLTQMA AGGE ++++H    PFF SCS LS
Subjt:  WHVIMARKYREQSRSYRRRKLSQSVYRKMEQDLSFLNLPKDNNGDNRHTTTSSPFGNFHGSSVGAVDYGFLTQMAIAGGEPISTNH----PFFPSCSQLS

Query:  TLNNLLPDPKSRFWEGTGNGFVVPRSHQQYKPYNTAAPAT------------------SSSVTAETTTKQQSPPPFIDFLGVGAT
        TL N+LPDPKSRFWEGTG+GFV PRSH QY+PYNTAA A+                  SSSVTAETT   +  PPFIDFLGVGAT
Subjt:  TLNNLLPDPKSRFWEGTGNGFVVPRSHQQYKPYNTAAPAT------------------SSSVTAETTTKQQSPPPFIDFLGVGAT

XP_022151206.1 transcriptional activator Myb [Momordica charantia]8.4e-14073.9Show/hide
Query:  ESLNLDLNCAAI---VSSSQE-------SCEVIENGRSGFWNFPFSCESMTRSSETDVGD-----NDFSDGFV--ENINNN--NNI--ANPTSCSNTPSA
        E+ +LDLNCA +   V  SQE       SCEV ENG  GFW FPFSCESMTRSSETDV D     +DFSDGFV   NIN +  N+I   NP+SCSN  +A
Subjt:  ESLNLDLNCAAI---VSSSQE-------SCEVIENGRSGFWNFPFSCESMTRSSETDVGD-----NDFSDGFV--ENINNN--NNI--ANPTSCSNTPSA

Query:  SGAQSRLCARGHWRPAEDTKLRELVAHYGPQNWNLIAEKLEGRSGKSCRLRWFNQLDPRINRRAFSEEEEERLMQAHRLYGNKWAMIARLFPGRTDNAVK
        SGAQSRLCARGHWRPAEDTKLRELVA YGPQNWNLIAEKLEGRSGKSCRLRWFNQLDPRINRRAF+EEEE+RLMQAHR+YGNKWAMIARLFPGRTDNAVK
Subjt:  SGAQSRLCARGHWRPAEDTKLRELVAHYGPQNWNLIAEKLEGRSGKSCRLRWFNQLDPRINRRAFSEEEEERLMQAHRLYGNKWAMIARLFPGRTDNAVK

Query:  NHWHVIMARKYREQSRSYRRRKLSQSVYRKMEQDLSFLNLPKDNNGDNRHTTT-SSPFGNFHGSSVGAVDYGFLTQMAIAG-GE-PISTNHPFFPSCSQL
        NHWHVIMARKYREQSRSYRRRKLSQSVYRKMEQDLSFLNLPKD  GDN HT+T SSPFGNF  +SVG +DYGFLTQM I G GE PIS NHP+F SC+QL
Subjt:  NHWHVIMARKYREQSRSYRRRKLSQSVYRKMEQDLSFLNLPKDNNGDNRHTTT-SSPFGNFHGSSVGAVDYGFLTQMAIAG-GE-PISTNHPFFPSCSQL

Query:  STLNNLLPDPKSRFWEGTGNGFVVPRSHQQYKPYNTAAPAT-------------------SSSVTAETTTKQQSPPPFIDFLGVGAT
        S   N LPDPKSR W+GTGNGF+ PRSH QY+P NTAAP +                   SSSVTAETT   +SPPPFIDFLGVGAT
Subjt:  STLNNLLPDPKSRFWEGTGNGFVVPRSHQQYKPYNTAAPAT-------------------SSSVTAETTTKQQSPPPFIDFLGVGAT

XP_022945320.1 transcription factor CSA-like isoform X2 [Cucurbita moschata]1.7e-14074.03Show/hide
Query:  MGMLDD--DDDDDFGLKVEESLNLDLNCAAIVSSSQESCEVIENGRSGFWNFPFSCESMTRSSETDVGDNDFSDGFVENINNNNNIANPTSCSNTPSASG
        MGMLDD  DD DDFG+ VEESLNLDLN  AIVSSSQES EV+ENGR G WNFPFSCES+TRSSET     DFSDGFVEN+N   N            ASG
Subjt:  MGMLDD--DDDDDFGLKVEESLNLDLNCAAIVSSSQESCEVIENGRSGFWNFPFSCESMTRSSETDVGDNDFSDGFVENINNNNNIANPTSCSNTPSASG

Query:  AQSRLCARGHWRPAEDTKLRELVAHYGPQNWNLIAEKLEGRSGKSCRLRWFNQLDPRINRRAFSEEEEERLMQAHRLYGNKWAMIARLFPGRTDNAVKNH
        AQSRLCARGHWRPAEDTKLRELVA YGPQNWNLIA+KLEGRSGKSCRLRWFNQLDPRINRR FSEEEEERLMQAHR+YGNKWAMIARLFPGRTDNAVKNH
Subjt:  AQSRLCARGHWRPAEDTKLRELVAHYGPQNWNLIAEKLEGRSGKSCRLRWFNQLDPRINRRAFSEEEEERLMQAHRLYGNKWAMIARLFPGRTDNAVKNH

Query:  WHVIMARKYREQSRSYRRRKLSQSVYRKMEQDLSFLNLPKDNNGDNRHTTTSSPFGNFHGSSVGAVDYGFLTQMAIAGGEPISTNH----PFFPSCSQLS
        WHVIMARKYREQSRSYRRRKLSQSVYRKME+DLSFL LPK+   DNRH T  SPF        GAVDYGFLTQMA AGGE ++++H    PFF SCS LS
Subjt:  WHVIMARKYREQSRSYRRRKLSQSVYRKMEQDLSFLNLPKDNNGDNRHTTTSSPFGNFHGSSVGAVDYGFLTQMAIAGGEPISTNH----PFFPSCSQLS

Query:  TLNNLLPDPKSRFWEGTGNGFVVPRSHQQYKPYNTAAPAT------------------SSSVTAETTTKQQSPPPFIDFLGVGAT
        TL N+L DPKSRFWEGTG+GF+ PRSH QY+PYNT A A+                  SSSVTAETT K    PPFIDFLGVGAT
Subjt:  TLNNLLPDPKSRFWEGTGNGFVVPRSHQQYKPYNTAAPAT------------------SSSVTAETTTKQQSPPPFIDFLGVGAT

XP_022968071.1 transcription factor CSA-like isoform X2 [Cucurbita maxima]1.5e-14174.55Show/hide
Query:  MGMLDD--DDDDDFGLKVEESLNLDLNCAAIVSSSQESCEVIENGRSGFWNFPFSCESMTRSSETDVGDNDFSDGFVENINNNNNIANPTSCSNTPSASG
        MGMLDD  DD DDFG+ VEESL LDLN  AIVSSSQES EV+ENGR GFWNFPFSCESMTRSSET     DFSDGFVEN+N   N            ASG
Subjt:  MGMLDD--DDDDDFGLKVEESLNLDLNCAAIVSSSQESCEVIENGRSGFWNFPFSCESMTRSSETDVGDNDFSDGFVENINNNNNIANPTSCSNTPSASG

Query:  AQSRLCARGHWRPAEDTKLRELVAHYGPQNWNLIAEKLEGRSGKSCRLRWFNQLDPRINRRAFSEEEEERLMQAHRLYGNKWAMIARLFPGRTDNAVKNH
        AQSRLCARGHWRPAEDTKLRELVA YGPQNWNLIAEKLEGRSGKSCRLRWFNQLDPRINRR FSEEEEERLMQAHR+YGNKWAMIARLFPGRTDNAVKNH
Subjt:  AQSRLCARGHWRPAEDTKLRELVAHYGPQNWNLIAEKLEGRSGKSCRLRWFNQLDPRINRRAFSEEEEERLMQAHRLYGNKWAMIARLFPGRTDNAVKNH

Query:  WHVIMARKYREQSRSYRRRKLSQSVYRKMEQDLSFLNLPKDNNGDNRHTTTSSPFGNFHGSSVGAVDYGFLTQMAIAGGEPISTNH----PFFPSCSQLS
        WHVIMARKYREQSRSYRRRKL+QSVYRKME+DLSFL LP  N  DNRH T  SPF        GAVDYGFLTQMA  GGE ++++H    PFF SCS LS
Subjt:  WHVIMARKYREQSRSYRRRKLSQSVYRKMEQDLSFLNLPKDNNGDNRHTTTSSPFGNFHGSSVGAVDYGFLTQMAIAGGEPISTNH----PFFPSCSQLS

Query:  TLNNLLPDPKSRFWEGTGNGFVVPRSHQQYKPYNTAAPAT------------------SSSVTAETTTKQQSPPPFIDFLGVGAT
        TL N+LPDPKSRFWEGTG+GFV PRSH QY+PYNTAA A+                  SSSVTAETT   +  PPFIDFLGVGAT
Subjt:  TLNNLLPDPKSRFWEGTGNGFVVPRSHQQYKPYNTAAPAT------------------SSSVTAETTTKQQSPPPFIDFLGVGAT

XP_023542831.1 transcription factor CSA-like [Cucurbita pepo subsp. pepo]3.1e-14274.55Show/hide
Query:  MGMLDD--DDDDDFGLKVEESLNLDLNCAAIVSSSQESCEVIENGRSGFWNFPFSCESMTRSSETDVGDNDFSDGFVENINNNNNIANPTSCSNTPSASG
        MGMLDD  DD DDFG+ VEESLNLDLN  AIVSSSQES EV+ENGR GFWNFPFSCES+TRSSET     DFSDGFVEN+N   N            ASG
Subjt:  MGMLDD--DDDDDFGLKVEESLNLDLNCAAIVSSSQESCEVIENGRSGFWNFPFSCESMTRSSETDVGDNDFSDGFVENINNNNNIANPTSCSNTPSASG

Query:  AQSRLCARGHWRPAEDTKLRELVAHYGPQNWNLIAEKLEGRSGKSCRLRWFNQLDPRINRRAFSEEEEERLMQAHRLYGNKWAMIARLFPGRTDNAVKNH
        AQSRLCARGHWRPAEDTKLRELVA YGPQNWNLIAEKLEGRSGKSCRLRWFNQLDPRINRR FSEEEEERLMQAHR+YGNKWAMIARLFPGRTDNAVKNH
Subjt:  AQSRLCARGHWRPAEDTKLRELVAHYGPQNWNLIAEKLEGRSGKSCRLRWFNQLDPRINRRAFSEEEEERLMQAHRLYGNKWAMIARLFPGRTDNAVKNH

Query:  WHVIMARKYREQSRSYRRRKLSQSVYRKMEQDLSFLNLPKDNNGDNRHTTTSSPFGNFHGSSVGAVDYGFLTQMAIAGGEPISTNH----PFFPSCSQLS
        WHVIMARKYREQSRSYRRRKLSQSVYRKME+DLSFL LPK+   DNRH T  SPF        GAVDYGFLTQMA AGGE ++++H    PFF SCS LS
Subjt:  WHVIMARKYREQSRSYRRRKLSQSVYRKMEQDLSFLNLPKDNNGDNRHTTTSSPFGNFHGSSVGAVDYGFLTQMAIAGGEPISTNH----PFFPSCSQLS

Query:  TLNNLLPDPKSRFWEGTGNGFVVPRSHQQYKPYNTAAPAT------------------SSSVTAETTTKQQSPPPFIDFLGVGAT
        TL N+LPDPK RFWEGTG+GFV PRSH+QY+PYNTAA A+                  SSSVT ETT   +  PPFIDFLGVGAT
Subjt:  TLNNLLPDPKSRFWEGTGNGFVVPRSHQQYKPYNTAAPAT------------------SSSVTAETTTKQQSPPPFIDFLGVGAT

TrEMBL top hitse value%identityAlignment
A0A6J1DDW8 transcriptional activator Myb4.1e-14073.9Show/hide
Query:  ESLNLDLNCAAI---VSSSQE-------SCEVIENGRSGFWNFPFSCESMTRSSETDVGD-----NDFSDGFV--ENINNN--NNI--ANPTSCSNTPSA
        E+ +LDLNCA +   V  SQE       SCEV ENG  GFW FPFSCESMTRSSETDV D     +DFSDGFV   NIN +  N+I   NP+SCSN  +A
Subjt:  ESLNLDLNCAAI---VSSSQE-------SCEVIENGRSGFWNFPFSCESMTRSSETDVGD-----NDFSDGFV--ENINNN--NNI--ANPTSCSNTPSA

Query:  SGAQSRLCARGHWRPAEDTKLRELVAHYGPQNWNLIAEKLEGRSGKSCRLRWFNQLDPRINRRAFSEEEEERLMQAHRLYGNKWAMIARLFPGRTDNAVK
        SGAQSRLCARGHWRPAEDTKLRELVA YGPQNWNLIAEKLEGRSGKSCRLRWFNQLDPRINRRAF+EEEE+RLMQAHR+YGNKWAMIARLFPGRTDNAVK
Subjt:  SGAQSRLCARGHWRPAEDTKLRELVAHYGPQNWNLIAEKLEGRSGKSCRLRWFNQLDPRINRRAFSEEEEERLMQAHRLYGNKWAMIARLFPGRTDNAVK

Query:  NHWHVIMARKYREQSRSYRRRKLSQSVYRKMEQDLSFLNLPKDNNGDNRHTTT-SSPFGNFHGSSVGAVDYGFLTQMAIAG-GE-PISTNHPFFPSCSQL
        NHWHVIMARKYREQSRSYRRRKLSQSVYRKMEQDLSFLNLPKD  GDN HT+T SSPFGNF  +SVG +DYGFLTQM I G GE PIS NHP+F SC+QL
Subjt:  NHWHVIMARKYREQSRSYRRRKLSQSVYRKMEQDLSFLNLPKDNNGDNRHTTT-SSPFGNFHGSSVGAVDYGFLTQMAIAG-GE-PISTNHPFFPSCSQL

Query:  STLNNLLPDPKSRFWEGTGNGFVVPRSHQQYKPYNTAAPAT-------------------SSSVTAETTTKQQSPPPFIDFLGVGAT
        S   N LPDPKSR W+GTGNGF+ PRSH QY+P NTAAP +                   SSSVTAETT   +SPPPFIDFLGVGAT
Subjt:  STLNNLLPDPKSRFWEGTGNGFVVPRSHQQYKPYNTAAPAT-------------------SSSVTAETTTKQQSPPPFIDFLGVGAT

A0A6J1G0L6 transcription factor CSA-like isoform X28.2e-14174.03Show/hide
Query:  MGMLDD--DDDDDFGLKVEESLNLDLNCAAIVSSSQESCEVIENGRSGFWNFPFSCESMTRSSETDVGDNDFSDGFVENINNNNNIANPTSCSNTPSASG
        MGMLDD  DD DDFG+ VEESLNLDLN  AIVSSSQES EV+ENGR G WNFPFSCES+TRSSET     DFSDGFVEN+N   N            ASG
Subjt:  MGMLDD--DDDDDFGLKVEESLNLDLNCAAIVSSSQESCEVIENGRSGFWNFPFSCESMTRSSETDVGDNDFSDGFVENINNNNNIANPTSCSNTPSASG

Query:  AQSRLCARGHWRPAEDTKLRELVAHYGPQNWNLIAEKLEGRSGKSCRLRWFNQLDPRINRRAFSEEEEERLMQAHRLYGNKWAMIARLFPGRTDNAVKNH
        AQSRLCARGHWRPAEDTKLRELVA YGPQNWNLIA+KLEGRSGKSCRLRWFNQLDPRINRR FSEEEEERLMQAHR+YGNKWAMIARLFPGRTDNAVKNH
Subjt:  AQSRLCARGHWRPAEDTKLRELVAHYGPQNWNLIAEKLEGRSGKSCRLRWFNQLDPRINRRAFSEEEEERLMQAHRLYGNKWAMIARLFPGRTDNAVKNH

Query:  WHVIMARKYREQSRSYRRRKLSQSVYRKMEQDLSFLNLPKDNNGDNRHTTTSSPFGNFHGSSVGAVDYGFLTQMAIAGGEPISTNH----PFFPSCSQLS
        WHVIMARKYREQSRSYRRRKLSQSVYRKME+DLSFL LPK+   DNRH T  SPF        GAVDYGFLTQMA AGGE ++++H    PFF SCS LS
Subjt:  WHVIMARKYREQSRSYRRRKLSQSVYRKMEQDLSFLNLPKDNNGDNRHTTTSSPFGNFHGSSVGAVDYGFLTQMAIAGGEPISTNH----PFFPSCSQLS

Query:  TLNNLLPDPKSRFWEGTGNGFVVPRSHQQYKPYNTAAPAT------------------SSSVTAETTTKQQSPPPFIDFLGVGAT
        TL N+L DPKSRFWEGTG+GF+ PRSH QY+PYNT A A+                  SSSVTAETT K    PPFIDFLGVGAT
Subjt:  TLNNLLPDPKSRFWEGTGNGFVVPRSHQQYKPYNTAAPAT------------------SSSVTAETTTKQQSPPPFIDFLGVGAT

A0A6J1G0P1 transcription factor CSA-like isoform X19.0e-14073.08Show/hide
Query:  MGMLDD--DDDDDFGLKVEESLNLDLNCAAIVSSSQESCEVIENGRSGFWNFPFSCESMTRSSETDVGDNDFSDGFVENINNNNNIANPTSCSNTPSASG
        MGMLDD  DD DDFG+ VEESLNLDLN  AIVSSSQES EV+ENGR G WNFPFSCES+TRSSET     DFSDGFVEN+N   N            ASG
Subjt:  MGMLDD--DDDDDFGLKVEESLNLDLNCAAIVSSSQESCEVIENGRSGFWNFPFSCESMTRSSETDVGDNDFSDGFVENINNNNNIANPTSCSNTPSASG

Query:  AQSRLCARGHWRPAEDTKLRELVAHYGPQNWNLIAEKLEGRSGKSCRLRWFNQLDPRINRRAFSEEEEERLMQAHRLYGNKWAMIARLFPGRTDNAVKNH
        AQSRLCARGHWRPAEDTKLRELVA YGPQNWNLIA+KLEGRSGKSCRLRWFNQLDPRINRR FSEEEEERLMQAHR+YGNKWAMIARLFPGRTDNAVKNH
Subjt:  AQSRLCARGHWRPAEDTKLRELVAHYGPQNWNLIAEKLEGRSGKSCRLRWFNQLDPRINRRAFSEEEEERLMQAHRLYGNKWAMIARLFPGRTDNAVKNH

Query:  WHVIMARKYREQSRSYRRRKLSQSVYRKMEQDLSFLNLPKDNNGDNRHTTTSSPFGNFHGSSVGAVDYGFLTQMAIAGGEPISTNH----PFFPSCSQLS
        WHVIMARKYREQSRSYRRRKLSQSVYRKME+DLSFL LPK+   DNRH T  SPF        GAVDYGFLTQMA AGGE ++++H    PFF SCS LS
Subjt:  WHVIMARKYREQSRSYRRRKLSQSVYRKMEQDLSFLNLPKDNNGDNRHTTTSSPFGNFHGSSVGAVDYGFLTQMAIAGGEPISTNH----PFFPSCSQLS

Query:  TLNNL-----LPDPKSRFWEGTGNGFVVPRSHQQYKPYNTAAPAT------------------SSSVTAETTTKQQSPPPFIDFLGVGAT
        TLN L       DPKSRFWEGTG+GF+ PRSH QY+PYNT A A+                  SSSVTAETT K    PPFIDFLGVGAT
Subjt:  TLNNL-----LPDPKSRFWEGTGNGFVVPRSHQQYKPYNTAAPAT------------------SSSVTAETTTKQQSPPPFIDFLGVGAT

A0A6J1HTU7 transcription factor CSA-like isoform X27.4e-14274.55Show/hide
Query:  MGMLDD--DDDDDFGLKVEESLNLDLNCAAIVSSSQESCEVIENGRSGFWNFPFSCESMTRSSETDVGDNDFSDGFVENINNNNNIANPTSCSNTPSASG
        MGMLDD  DD DDFG+ VEESL LDLN  AIVSSSQES EV+ENGR GFWNFPFSCESMTRSSET     DFSDGFVEN+N   N            ASG
Subjt:  MGMLDD--DDDDDFGLKVEESLNLDLNCAAIVSSSQESCEVIENGRSGFWNFPFSCESMTRSSETDVGDNDFSDGFVENINNNNNIANPTSCSNTPSASG

Query:  AQSRLCARGHWRPAEDTKLRELVAHYGPQNWNLIAEKLEGRSGKSCRLRWFNQLDPRINRRAFSEEEEERLMQAHRLYGNKWAMIARLFPGRTDNAVKNH
        AQSRLCARGHWRPAEDTKLRELVA YGPQNWNLIAEKLEGRSGKSCRLRWFNQLDPRINRR FSEEEEERLMQAHR+YGNKWAMIARLFPGRTDNAVKNH
Subjt:  AQSRLCARGHWRPAEDTKLRELVAHYGPQNWNLIAEKLEGRSGKSCRLRWFNQLDPRINRRAFSEEEEERLMQAHRLYGNKWAMIARLFPGRTDNAVKNH

Query:  WHVIMARKYREQSRSYRRRKLSQSVYRKMEQDLSFLNLPKDNNGDNRHTTTSSPFGNFHGSSVGAVDYGFLTQMAIAGGEPISTNH----PFFPSCSQLS
        WHVIMARKYREQSRSYRRRKL+QSVYRKME+DLSFL LP  N  DNRH T  SPF        GAVDYGFLTQMA  GGE ++++H    PFF SCS LS
Subjt:  WHVIMARKYREQSRSYRRRKLSQSVYRKMEQDLSFLNLPKDNNGDNRHTTTSSPFGNFHGSSVGAVDYGFLTQMAIAGGEPISTNH----PFFPSCSQLS

Query:  TLNNLLPDPKSRFWEGTGNGFVVPRSHQQYKPYNTAAPAT------------------SSSVTAETTTKQQSPPPFIDFLGVGAT
        TL N+LPDPKSRFWEGTG+GFV PRSH QY+PYNTAA A+                  SSSVTAETT   +  PPFIDFLGVGAT
Subjt:  TLNNLLPDPKSRFWEGTGNGFVVPRSHQQYKPYNTAAPAT------------------SSSVTAETTTKQQSPPPFIDFLGVGAT

A0A6J1HW56 transcription factor CSA-like isoform X16.9e-14073.33Show/hide
Query:  MGMLDD--DDDDDFGLKVEESLNLDLNCAAIVSSSQESCEVIENGRSGFWNFPFSCESMTRSSETDVGDNDFSDGFVENINNNNNIANPTSCSNTPSASG
        MGMLDD  DD DDFG+ VEESL LDLN  AIVSSSQES EV+ENGR GFWNFPFSCESMTRSSET     DFSDGFVEN+N   N            ASG
Subjt:  MGMLDD--DDDDDFGLKVEESLNLDLNCAAIVSSSQESCEVIENGRSGFWNFPFSCESMTRSSETDVGDNDFSDGFVENINNNNNIANPTSCSNTPSASG

Query:  AQSRLCARGHWRPAEDTKLRELVAHYGPQNWNLIAEKLEGRSGKSCRLRWFNQLDPRINRRAFSEEEEERLMQAHRLYGNKWAMIARLFPGRTDNAVKNH
        AQSRLCARGHWRPAEDTKLRELVA YGPQNWNLIAEKLEGRSGKSCRLRWFNQLDPRINRR FSEEEEERLMQAHR+YGNKWAMIARLFPGRTDNAVKNH
Subjt:  AQSRLCARGHWRPAEDTKLRELVAHYGPQNWNLIAEKLEGRSGKSCRLRWFNQLDPRINRRAFSEEEEERLMQAHRLYGNKWAMIARLFPGRTDNAVKNH

Query:  WHVIMARKYREQSRSYRRRKLSQSVYRKMEQDLSFLNLPKDNNGDNRHTTTSSPFGNFHGSSVGAVDYGFLTQMAIAGGEPISTNH----PFFPSCSQLS
        WHVIMARKYREQSRSYRRRKL+QSVYRKME+DLSFL LP  N  DNRH T  SPF        GAVDYGFLTQMA  GGE ++++H    PFF SCS LS
Subjt:  WHVIMARKYREQSRSYRRRKLSQSVYRKMEQDLSFLNLPKDNNGDNRHTTTSSPFGNFHGSSVGAVDYGFLTQMAIAGGEPISTNH----PFFPSCSQLS

Query:  TLNNL-----LPDPKSRFWEGTGNGFVVPRSHQQYKPYNTAAPAT------------------SSSVTAETTTKQQSPPPFIDFLGVGAT
        TLN L       DPKSRFWEGTG+GFV PRSH QY+PYNTAA A+                  SSSVTAETT   +  PPFIDFLGVGAT
Subjt:  TLNNL-----LPDPKSRFWEGTGNGFVVPRSHQQYKPYNTAAPAT------------------SSSVTAETTTKQQSPPPFIDFLGVGAT

SwissProt top hitse value%identityAlignment
Q5NBM8 Transcription factor CSA6.4e-5846.64Show/hide
Query:  GAQSRLCARGHWRPAEDTKLRELVAHYGPQNWNLIAEKLEGRSGKSCRLRWFNQLDPRINRRAFSEEEEERLMQAHRLYGNKWAMIARLFPGRTDNAVKN
        G Q +LCARGHWRPAED KL++LVA YGPQNWNLIAEKL+GRSGKSCRLRWFNQLDPRINRRAF+EEEEERLM AHR YGNKWA+IARLFPGRTDNAVKN
Subjt:  GAQSRLCARGHWRPAEDTKLRELVAHYGPQNWNLIAEKLEGRSGKSCRLRWFNQLDPRINRRAFSEEEEERLMQAHRLYGNKWAMIARLFPGRTDNAVKN

Query:  HWHVIMARKYREQSRSYRRRKLSQSVYRKMEQDLSFLNLPKDNNGDNRHTTTSSPFGNFHGSSVGAVDYGFLT--------QMAIAGGEPISTNHP--FF
        HWHV+MAR++REQS ++RRRK S S                     + H   S  +  + G++         T          A A     + N P  F+
Subjt:  HWHVIMARKYREQSRSYRRRKLSQSVYRKMEQDLSFLNLPKDNNGDNRHTTTSSPFGNFHGSSVGAVDYGFLT--------QMAIAGGEPISTNHP--FF

Query:  ---PSCSQLSTLNNLLPDPKS-----RFWEGTGNGFVVPRSHQQYKPYNTAAPATSSSVTAET----------------TTKQQSPPPFIDFLGVGAT
           P  S  ST     P   +      F+ G G  F            +T AP+  S+ +A                   T      PF DFLGVGAT
Subjt:  ---PSCSQLSTLNNLLPDPKS-----RFWEGTGNGFVVPRSHQQYKPYNTAAPATSSSVTAET----------------TTKQQSPPPFIDFLGVGAT

Q6R053 Transcription factor MYB562.1e-4850.48Show/hide
Query:  NCAAIVSSSQESCEVIENGRSGFWNFPFSCESMTRSSETDVGDNDFSDGFVENINNNNNIANPTSCSNTPSASGAQSRLCARGHWRPAEDTKLRELVAHY
        N  ++ +S   +C+   N  S   N P +  +    SE + G+        E   N  ++            SG  +++C+RGHWRP ED KL+ELVA +
Subjt:  NCAAIVSSSQESCEVIENGRSGFWNFPFSCESMTRSSETDVGDNDFSDGFVENINNNNNIANPTSCSNTPSASGAQSRLCARGHWRPAEDTKLRELVAHY

Query:  GPQNWNLIAEKLEGRSGKSCRLRWFNQLDPRINRRAFSEEEEERLMQAHRLYGNKWAMIARLFPGRTDNAVKNHWHVIMARKYREQSRSYRRRKLSQSVY
        GPQNWNLI+  L GRSGKSCRLRWFNQLDPRIN+RAF+EEEE RL+ AHR YGNKWA+I+RLFPGRTDNAVKNHWHVIMAR+ RE  R  +R++   ++ 
Subjt:  GPQNWNLIAEKLEGRSGKSCRLRWFNQLDPRINRRAFSEEEEERLMQAHRLYGNKWAMIARLFPGRTDNAVKNHWHVIMARKYREQSRSYRRRKLSQSVY

Query:  RKMEQDLS
        R  E  +S
Subjt:  RKMEQDLS

Q6R0C4 Transcription factor MYB523.5e-4876.36Show/hide
Query:  LCARGHWRPAEDTKLRELVAHYGPQNWNLIAEKLEGRSGKSCRLRWFNQLDPRINRRAFSEEEEERLMQAHRLYGNKWAMIARLFPGRTDNAVKNHWHVI
        +C+RGHWRPAED KLRELV  +GP NWN IA+KL GRSGKSCRLRWFNQLDPRINR  F+EEEEERL+ +HR++GN+W++IAR FPGRTDNAVKNHWHVI
Subjt:  LCARGHWRPAEDTKLRELVAHYGPQNWNLIAEKLEGRSGKSCRLRWFNQLDPRINRRAFSEEEEERLMQAHRLYGNKWAMIARLFPGRTDNAVKNHWHVI

Query:  MARKYREQSR
        MAR+ RE+S+
Subjt:  MARKYREQSR

Q9LQX5 Transcription factor MYB1175.2e-6043.09Show/hide
Query:  EVIENGRSGFWNFPFSCESM--------TRSSETDVGDNDFSDGFV---ENINNNNNIAN-------PTSCSNTPSASGAQSRLCARGHWRPAEDTKLRE
        E++    S  W+FPF+  ++          S E ++  +   D  V   E+ NNN N +N        T    + S+S ++  +  RGHWRPAED KL+E
Subjt:  EVIENGRSGFWNFPFSCESM--------TRSSETDVGDNDFSDGFV---ENINNNNNIAN-------PTSCSNTPSASGAQSRLCARGHWRPAEDTKLRE

Query:  LVAHYGPQNWNLIAEKLEGRSGKSCRLRWFNQLDPRINRRAFSEEEEERLMQAHRLYGNKWAMIARLFPGRTDNAVKNHWHVIMARKYREQSRSYRRRKL
        LV+ YGPQNWNLIAEKL+GRSGKSCRLRWFNQLDPRINRRAF+EEEEERLMQAHRLYGNKWAMIARLFPGRTDN+VKNHWHV+MARKYRE S +YRRRKL
Subjt:  LVAHYGPQNWNLIAEKLEGRSGKSCRLRWFNQLDPRINRRAFSEEEEERLMQAHRLYGNKWAMIARLFPGRTDNAVKNHWHVIMARKYREQSRSYRRRKL

Query:  SQSVYRKMEQDLSFLNLPKDNNGDNRHTTTSSPFGNFHGSSVGAVDY------GFLTQMAIAGGEPISTNH-----PFFPSCSQLSTLNNLLPDPKSRFW
                       N P   +  N H    +P  N+H S +    Y       F     +    PI+++H     PF   C Q    NN  P   S F 
Subjt:  SQSVYRKMEQDLSFLNLPKDNNGDNRHTTTSSPFGNFHGSSVGAVDY------GFLTQMAIAGGEPISTNH-----PFFPSCSQLSTLNNLLPDPKSRFW

Query:  EGTGNGFVV------------------PRSHQQYKPYNTAAPATSSSVTAETTTKQQSPPPFIDFLGVG
           GN  +V                  P + ++ +P +        +V  E   K +  P F DFLG+G
Subjt:  EGTGNGFVV------------------PRSHQQYKPYNTAAPATSSSVTAETTTKQQSPPPFIDFLGVG

Q9SEZ4 Transcription factor MYB1051.9e-5770.48Show/hide
Query:  RSSETDVGDNDFSDGFVENIN-NNNNIANPTSCSNTPSASGAQSRLCARGHWRPAEDTKLRELVAHYGPQNWNLIAEKLEGRSGKSCRLRWFNQLDPRIN
        RS  T   + D +    +  N N  +     SC ++  AS       +RGHWRPAEDTKL+ELVA YGPQNWNLIAEKL+GRSGKSCRLRWFNQLDPRIN
Subjt:  RSSETDVGDNDFSDGFVENIN-NNNNIANPTSCSNTPSASGAQSRLCARGHWRPAEDTKLRELVAHYGPQNWNLIAEKLEGRSGKSCRLRWFNQLDPRIN

Query:  RRAFSEEEEERLMQAHRLYGNKWAMIARLFPGRTDNAVKNHWHVIMARKYREQSRSYRRRKLSQSV
        RRAF+EEEEERLMQAHRLYGNKWAMIARLFPGRTDN+VKNHWHVIMARK+REQS SYRRRK   S+
Subjt:  RRAFSEEEEERLMQAHRLYGNKWAMIARLFPGRTDNAVKNHWHVIMARKYREQSRSYRRRKLSQSV

Arabidopsis top hitse value%identityAlignment
AT1G26780.1 myb domain protein 1176.3e-6161Show/hide
Query:  EVIENGRSGFWNFPFSCESM--------TRSSETDVGDNDFSDGFV---ENINNNNNIAN-------PTSCSNTPSASGAQSRLCARGHWRPAEDTKLRE
        E++    S  W+FPF+  ++          S E ++  +   D  V   E+ NNN N +N        T    + S+S ++  +  RGHWRPAED KL+E
Subjt:  EVIENGRSGFWNFPFSCESM--------TRSSETDVGDNDFSDGFV---ENINNNNNIAN-------PTSCSNTPSASGAQSRLCARGHWRPAEDTKLRE

Query:  LVAHYGPQNWNLIAEKLEGRSGKSCRLRWFNQLDPRINRRAFSEEEEERLMQAHRLYGNKWAMIARLFPGRTDNAVKNHWHVIMARKYREQSRSYRRRKL
        LV+ YGPQNWNLIAEKL+GRSGKSCRLRWFNQLDPRINRRAF+EEEEERLMQAHRLYGNKWAMIARLFPGRTDN+VKNHWHV+MARKYRE S +YRRRKL
Subjt:  LVAHYGPQNWNLIAEKLEGRSGKSCRLRWFNQLDPRINRRAFSEEEEERLMQAHRLYGNKWAMIARLFPGRTDNAVKNHWHVIMARKYREQSRSYRRRKL

AT1G26780.2 myb domain protein 1173.7e-6143.09Show/hide
Query:  EVIENGRSGFWNFPFSCESM--------TRSSETDVGDNDFSDGFV---ENINNNNNIAN-------PTSCSNTPSASGAQSRLCARGHWRPAEDTKLRE
        E++    S  W+FPF+  ++          S E ++  +   D  V   E+ NNN N +N        T    + S+S ++  +  RGHWRPAED KL+E
Subjt:  EVIENGRSGFWNFPFSCESM--------TRSSETDVGDNDFSDGFV---ENINNNNNIAN-------PTSCSNTPSASGAQSRLCARGHWRPAEDTKLRE

Query:  LVAHYGPQNWNLIAEKLEGRSGKSCRLRWFNQLDPRINRRAFSEEEEERLMQAHRLYGNKWAMIARLFPGRTDNAVKNHWHVIMARKYREQSRSYRRRKL
        LV+ YGPQNWNLIAEKL+GRSGKSCRLRWFNQLDPRINRRAF+EEEEERLMQAHRLYGNKWAMIARLFPGRTDN+VKNHWHV+MARKYRE S +YRRRKL
Subjt:  LVAHYGPQNWNLIAEKLEGRSGKSCRLRWFNQLDPRINRRAFSEEEEERLMQAHRLYGNKWAMIARLFPGRTDNAVKNHWHVIMARKYREQSRSYRRRKL

Query:  SQSVYRKMEQDLSFLNLPKDNNGDNRHTTTSSPFGNFHGSSVGAVDY------GFLTQMAIAGGEPISTNH-----PFFPSCSQLSTLNNLLPDPKSRFW
                       N P   +  N H    +P  N+H S +    Y       F     +    PI+++H     PF   C Q    NN  P   S F 
Subjt:  SQSVYRKMEQDLSFLNLPKDNNGDNRHTTTSSPFGNFHGSSVGAVDY------GFLTQMAIAGGEPISTNH-----PFFPSCSQLSTLNNLLPDPKSRFW

Query:  EGTGNGFVV------------------PRSHQQYKPYNTAAPATSSSVTAETTTKQQSPPPFIDFLGVG
           GN  +V                  P + ++ +P +        +V  E   K +  P F DFLG+G
Subjt:  EGTGNGFVV------------------PRSHQQYKPYNTAAPATSSSVTAETTTKQQSPPPFIDFLGVG

AT1G69560.1 myb domain protein 1051.3e-5870.48Show/hide
Query:  RSSETDVGDNDFSDGFVENIN-NNNNIANPTSCSNTPSASGAQSRLCARGHWRPAEDTKLRELVAHYGPQNWNLIAEKLEGRSGKSCRLRWFNQLDPRIN
        RS  T   + D +    +  N N  +     SC ++  AS       +RGHWRPAEDTKL+ELVA YGPQNWNLIAEKL+GRSGKSCRLRWFNQLDPRIN
Subjt:  RSSETDVGDNDFSDGFVENIN-NNNNIANPTSCSNTPSASGAQSRLCARGHWRPAEDTKLRELVAHYGPQNWNLIAEKLEGRSGKSCRLRWFNQLDPRIN

Query:  RRAFSEEEEERLMQAHRLYGNKWAMIARLFPGRTDNAVKNHWHVIMARKYREQSRSYRRRKLSQSV
        RRAF+EEEEERLMQAHRLYGNKWAMIARLFPGRTDN+VKNHWHVIMARK+REQS SYRRRK   S+
Subjt:  RRAFSEEEEERLMQAHRLYGNKWAMIARLFPGRTDNAVKNHWHVIMARKYREQSRSYRRRKLSQSV

AT3G29020.2 myb domain protein 1101.7e-5040.25Show/hide
Query:  FPFSCESMTRSSETDVGDNDFSDGFVENINNNNNIANPTSCSNTPSASGAQSRLCARGHWRPAEDTKLRELVAHYGPQNWNLIAEKLEGRSGKSCRLRWF
        F F C   T +   +  +N  S+   E  N                   + SR+C+RGHWR +EDT+L ELV+ YGPQNWN IAE ++GR+GKSCRLRWF
Subjt:  FPFSCESMTRSSETDVGDNDFSDGFVENINNNNNIANPTSCSNTPSASGAQSRLCARGHWRPAEDTKLRELVAHYGPQNWNLIAEKLEGRSGKSCRLRWF

Query:  NQLDPRINRRAFSEEEEERLMQAHRLYGNKWAMIARLFPGRTDNAVKNHWHVIMARKYREQSRSYRRRKLSQSVYRKMEQDLSFLNLPKDNNGDNRHTTT
        NQLDPRIN+RAFS+EEEERL+ AHR +GNKWAMIA+LF GRTDNA+KNHWHV+MARK R+QS SY +R    +   +   D    NL   N  D+     
Subjt:  NQLDPRINRRAFSEEEEERLMQAHRLYGNKWAMIARLFPGRTDNAVKNHWHVIMARKYREQSRSYRRRKLSQSVYRKMEQDLSFLNLPKDNNGDNRHTTT

Query:  SSPFGNFHGSSVGAVDYGFLTQMAIAGGEPISTNH---PFFPSCSQLSTLNNLLPDPKSRFWEGTGNGFVVPRSHQQYKPYNTAAPATSSSVTAETTTKQ
                      +   +L +   +   P+   H     FP+ S   TL+  + +P S                      +++    SSS T E T   
Subjt:  SSPFGNFHGSSVGAVDYGFLTQMAIAGGEPISTNH---PFFPSCSQLSTLNNLLPDPKSRFWEGTGNGFVVPRSHQQYKPYNTAAPATSSSVTAETTTKQ

Query:  Q-----SPPPFIDFLGVG
        +      PP FIDFLGVG
Subjt:  Q-----SPPPFIDFLGVG

AT5G17800.1 myb domain protein 561.5e-4950.48Show/hide
Query:  NCAAIVSSSQESCEVIENGRSGFWNFPFSCESMTRSSETDVGDNDFSDGFVENINNNNNIANPTSCSNTPSASGAQSRLCARGHWRPAEDTKLRELVAHY
        N  ++ +S   +C+   N  S   N P +  +    SE + G+        E   N  ++            SG  +++C+RGHWRP ED KL+ELVA +
Subjt:  NCAAIVSSSQESCEVIENGRSGFWNFPFSCESMTRSSETDVGDNDFSDGFVENINNNNNIANPTSCSNTPSASGAQSRLCARGHWRPAEDTKLRELVAHY

Query:  GPQNWNLIAEKLEGRSGKSCRLRWFNQLDPRINRRAFSEEEEERLMQAHRLYGNKWAMIARLFPGRTDNAVKNHWHVIMARKYREQSRSYRRRKLSQSVY
        GPQNWNLI+  L GRSGKSCRLRWFNQLDPRIN+RAF+EEEE RL+ AHR YGNKWA+I+RLFPGRTDNAVKNHWHVIMAR+ RE  R  +R++   ++ 
Subjt:  GPQNWNLIAEKLEGRSGKSCRLRWFNQLDPRINRRAFSEEEEERLMQAHRLYGNKWAMIARLFPGRTDNAVKNHWHVIMARKYREQSRSYRRRKLSQSVY

Query:  RKMEQDLS
        R  E  +S
Subjt:  RKMEQDLS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGGATGTTGGATGATGATGATGATGATGATTTTGGTCTCAAGGTTGAGGAAAGTTTGAATCTTGATCTCAACTGTGCAGCAATTGTTTCTTCTTCTCAAGAGAGTTG
TGAGGTCATTGAAAATGGGAGATCGGGTTTTTGGAACTTCCCATTTTCTTGTGAATCCATGACTAGAAGCTCAGAGACTGATGTTGGGGATAATGATTTCAGTGATGGGT
TTGTTGAAAACATTAATAATAATAATAATATTGCTAACCCAACTTCTTGTTCCAATACACCCAGTGCCAGTGGGGCTCAGTCTAGACTCTGTGCTAGAGGCCATTGGAGG
CCTGCTGAAGACACCAAGTTGAGAGAGCTTGTAGCCCATTATGGTCCCCAAAACTGGAACCTCATTGCAGAGAAGCTTGAGGGAAGATCTGGTAAAAGTTGCAGACTGCG
ATGGTTTAACCAGTTGGATCCGAGGATAAACAGAAGGGCATTTAGTGAGGAAGAAGAAGAGAGGCTAATGCAGGCTCATAGGTTATATGGAAACAAATGGGCGATGATTG
CAAGGCTTTTTCCTGGGAGGACTGATAATGCAGTGAAGAATCATTGGCATGTTATAATGGCAAGGAAATATAGAGAACAGTCTCGTTCATACAGAAGGAGGAAGCTGAGT
CAATCTGTTTACAGAAAAATGGAACAAGATTTGAGTTTCCTTAATCTTCCCAAAGATAATAATGGAGACAACAGACACACCACAACTTCTTCTCCTTTTGGAAACTTTCA
TGGTTCTTCTGTTGGAGCTGTTGATTATGGCTTTTTAACACAAATGGCCATTGCTGGTGGAGAACCAATCTCAACTAACCATCCTTTCTTCCCTTCATGTTCTCAGCTTT
CAACTCTTAATAATCTCCTCCCAGATCCCAAGAGCAGATTTTGGGAAGGAACAGGCAATGGGTTTGTGGTACCACGAAGCCACCAGCAGTACAAGCCATATAACACGGCG
GCGCCGGCGACATCTTCATCAGTAACGGCGGAGACAACAACAAAACAACAATCACCACCACCATTTATCGACTTTCTTGGGGTTGGAGCCACATGA
mRNA sequenceShow/hide mRNA sequence
ATGGGGATGTTGGATGATGATGATGATGATGATTTTGGTCTCAAGGTTGAGGAAAGTTTGAATCTTGATCTCAACTGTGCAGCAATTGTTTCTTCTTCTCAAGAGAGTTG
TGAGGTCATTGAAAATGGGAGATCGGGTTTTTGGAACTTCCCATTTTCTTGTGAATCCATGACTAGAAGCTCAGAGACTGATGTTGGGGATAATGATTTCAGTGATGGGT
TTGTTGAAAACATTAATAATAATAATAATATTGCTAACCCAACTTCTTGTTCCAATACACCCAGTGCCAGTGGGGCTCAGTCTAGACTCTGTGCTAGAGGCCATTGGAGG
CCTGCTGAAGACACCAAGTTGAGAGAGCTTGTAGCCCATTATGGTCCCCAAAACTGGAACCTCATTGCAGAGAAGCTTGAGGGAAGATCTGGTAAAAGTTGCAGACTGCG
ATGGTTTAACCAGTTGGATCCGAGGATAAACAGAAGGGCATTTAGTGAGGAAGAAGAAGAGAGGCTAATGCAGGCTCATAGGTTATATGGAAACAAATGGGCGATGATTG
CAAGGCTTTTTCCTGGGAGGACTGATAATGCAGTGAAGAATCATTGGCATGTTATAATGGCAAGGAAATATAGAGAACAGTCTCGTTCATACAGAAGGAGGAAGCTGAGT
CAATCTGTTTACAGAAAAATGGAACAAGATTTGAGTTTCCTTAATCTTCCCAAAGATAATAATGGAGACAACAGACACACCACAACTTCTTCTCCTTTTGGAAACTTTCA
TGGTTCTTCTGTTGGAGCTGTTGATTATGGCTTTTTAACACAAATGGCCATTGCTGGTGGAGAACCAATCTCAACTAACCATCCTTTCTTCCCTTCATGTTCTCAGCTTT
CAACTCTTAATAATCTCCTCCCAGATCCCAAGAGCAGATTTTGGGAAGGAACAGGCAATGGGTTTGTGGTACCACGAAGCCACCAGCAGTACAAGCCATATAACACGGCG
GCGCCGGCGACATCTTCATCAGTAACGGCGGAGACAACAACAAAACAACAATCACCACCACCATTTATCGACTTTCTTGGGGTTGGAGCCACATGA
Protein sequenceShow/hide protein sequence
MGMLDDDDDDDFGLKVEESLNLDLNCAAIVSSSQESCEVIENGRSGFWNFPFSCESMTRSSETDVGDNDFSDGFVENINNNNNIANPTSCSNTPSASGAQSRLCARGHWR
PAEDTKLRELVAHYGPQNWNLIAEKLEGRSGKSCRLRWFNQLDPRINRRAFSEEEEERLMQAHRLYGNKWAMIARLFPGRTDNAVKNHWHVIMARKYREQSRSYRRRKLS
QSVYRKMEQDLSFLNLPKDNNGDNRHTTTSSPFGNFHGSSVGAVDYGFLTQMAIAGGEPISTNHPFFPSCSQLSTLNNLLPDPKSRFWEGTGNGFVVPRSHQQYKPYNTA
APATSSSVTAETTTKQQSPPPFIDFLGVGAT