; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0022138 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0022138
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionMyb-related protein 2-like
Genome locationchr7:19588519..19590425
RNA-Seq ExpressionLag0022138
SyntenyLag0022138
Gene Ontology termsGO:0003677 - DNA binding (molecular function)
GO:0003700 - DNA-binding transcription factor activity (molecular function)
InterPro domainsIPR006447 - Myb domain, plants
IPR009057 - Homeobox-like domain superfamily
IPR017930 - Myb domain
IPR044848 - PHR1-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022972812.1 myb-related protein 2-like isoform X1 [Cucurbita maxima]1.1e-10765.32Show/hide
Query:  YHHQHRGKSIHSSERHLFLQGGNGP-GDSGLVLSTDAKPRLKWTPDLHERFVEAVNQLGGADKATPKTVMKIMSIPGLTLYHLKSHLQKYRLSKNVHGQT
        +HHQHRGKSIHSSERHLFLQGGNGP GDSGLVLSTDAKPRLKWTPDLH+RFVEAVNQLGG DKATPKTVMKIM I GLTLYHLKSHLQKYRLSKN+HGQ 
Subjt:  YHHQHRGKSIHSSERHLFLQGGNGP-GDSGLVLSTDAKPRLKWTPDLHERFVEAVNQLGGADKATPKTVMKIMSIPGLTLYHLKSHLQKYRLSKNVHGQT

Query:  NGGSGSN-------KTGTVAVSGDQRLAEPNGAARTNNIVVAPHSSSQPNNYNSLMIVSYYNYTPLILQKPSNQRNNTDANRSTTALAAANRSTREVSPN
        NGGSG+N       + GTVAVS DQRL E NGAARTN+I VAP +SSQPN   SL I S      + +QK  +++     +     + A  +  + V   
Subjt:  NGGSGSN-------KTGTVAVSGDQRLAEPNGAARTNNIVVAPHSSSQPNNYNSLMIVSYYNYTPLILQKPSNQRNNTDANRSTTALAAANRSTREVSPN

Query:  SAGKAQETLGRQNLGTVGLEAAKVQLSELVSKVSTQSLTAAFPELHQQQQQTQRLPDLLRGLQGPRPPP----------EGAPPQQ----PLGPPAL---
           KAQETLGRQNLGTVGLEAAKVQLSELVSKVSTQ LTAAFPELH QQQQ Q  P  L   Q  +PP           EG+  QQ     L P A+   
Subjt:  SAGKAQETLGRQNLGTVGLEAAKVQLSELVSKVSTQSLTAAFPELHQQQQQTQRLPDLLRGLQGPRPPP----------EGAPPQQ----PLGPPAL---

Query:  ----RRDPHHGLSMSIGLVQGDKGEGYNGYSPSEAQRSFGSKRKE-VEKET----AFRYRMTPSSLDLNAGEDQLSNNDHAPSTTCKMFDLNGFS
              D  HGLSM +GLVQG+KGEGYNGYSPSE QR FGS RKE VEKE      FRYRM     DLNAGEDQLS+NDHA S TCKMFDLNGFS
Subjt:  ----RRDPHHGLSMSIGLVQGDKGEGYNGYSPSEAQRSFGSKRKE-VEKET----AFRYRMTPSSLDLNAGEDQLSNNDHAPSTTCKMFDLNGFS

XP_022972813.1 myb-related protein 2-like isoform X2 [Cucurbita maxima]2.2e-10864.81Show/hide
Query:  YHHQHRGKSIHSSERHLFLQGGNGP-GDSGLVLSTDAKPRLKWTPDLHERFVEAVNQLGGADKATPKTVMKIMSIPGLTLYHLKSHLQKYRLSKNVHGQT
        +HHQHRGKSIHSSERHLFLQGGNGP GDSGLVLSTDAKPRLKWTPDLH+RFVEAVNQLGG DKATPKTVMKIM I GLTLYHLKSHLQKYRLSKN+HGQ 
Subjt:  YHHQHRGKSIHSSERHLFLQGGNGP-GDSGLVLSTDAKPRLKWTPDLHERFVEAVNQLGGADKATPKTVMKIMSIPGLTLYHLKSHLQKYRLSKNVHGQT

Query:  NGGSGSN-------KTGTVAVSGDQRLAEPNGAARTNNIVVAPHSSSQPNNYNSLMIVSYYNYTPLILQKPSNQRNNTDANRSTTALAAANRSTREVSPN
        NGGSG+N       + GTVAVS DQRL E NGAARTN+I VAP +SSQPNN    + +S      + +QK  +++     +     + A  +  + V   
Subjt:  NGGSGSN-------KTGTVAVSGDQRLAEPNGAARTNNIVVAPHSSSQPNNYNSLMIVSYYNYTPLILQKPSNQRNNTDANRSTTALAAANRSTREVSPN

Query:  SAGKAQETLGRQNLGTVGLEAAKVQLSELVSKVSTQSLTAAFPELHQQQQQTQRLPDLLRGLQGPRPPP----------EGAPPQQ----PLGPPAL---
           KAQETLGRQNLGTVGLEAAKVQLSELVSKVSTQ LTAAFPELH QQQQ Q  P  L   Q  +PP           EG+  QQ     L P A+   
Subjt:  SAGKAQETLGRQNLGTVGLEAAKVQLSELVSKVSTQSLTAAFPELHQQQQQTQRLPDLLRGLQGPRPPP----------EGAPPQQ----PLGPPAL---

Query:  ----RRDPHHGLSMSIGLVQGDKGEGYNGYSPSEAQRSFGSKRKE-VEKET----AFRYRMTPSSLDLNAGEDQLSNNDHAPSTTCKMFDLNGFS
              D  HGLSM +GLVQG+KGEGYNGYSPSE QR FGS RKE VEKE      FRYRM     DLNAGEDQLS+NDHA S TCKMFDLNGFS
Subjt:  ----RRDPHHGLSMSIGLVQGDKGEGYNGYSPSEAQRSFGSKRKE-VEKET----AFRYRMTPSSLDLNAGEDQLSNNDHAPSTTCKMFDLNGFS

XP_022972814.1 myb-related protein 2-like isoform X3 [Cucurbita maxima]2.6e-10966.84Show/hide
Query:  YHHQHRGKSIHSSERHLFLQGGNGP-GDSGLVLSTDAKPRLKWTPDLHERFVEAVNQLGGADKATPKTVMKIMSIPGLTLYHLKSHLQKYRLSKNVHGQT
        +HHQHRGKSIHSSERHLFLQGGNGP GDSGLVLSTDAKPRLKWTPDLH+RFVEAVNQLGG DKATPKTVMKIM I GLTLYHLKSHLQKYRLSKN+HGQ 
Subjt:  YHHQHRGKSIHSSERHLFLQGGNGP-GDSGLVLSTDAKPRLKWTPDLHERFVEAVNQLGGADKATPKTVMKIMSIPGLTLYHLKSHLQKYRLSKNVHGQT

Query:  NGGSG-SNKTGTVAVSGDQRLAEPNGAARTNNIVVAPHSSSQPNNYNSLMIVSYYNYTPLILQKPSNQRNNTDANRSTTALAAANRSTREVSPNSAGKAQ
        NGGSG +NKTGTVAVS DQRL E NGAARTN+I VAP +SSQPN   SL I S      + +QK  +++     +     + A  +  + V      KAQ
Subjt:  NGGSG-SNKTGTVAVSGDQRLAEPNGAARTNNIVVAPHSSSQPNNYNSLMIVSYYNYTPLILQKPSNQRNNTDANRSTTALAAANRSTREVSPNSAGKAQ

Query:  ETLGRQNLGTVGLEAAKVQLSELVSKVSTQSLTAAFPELHQQQQQTQRLPDLLRGLQGPRPPP----------EGAPPQQ----PLGPPAL-------RR
        ETLGRQNLGTVGLEAAKVQLSELVSKVSTQ LTAAFPELH QQQQ Q  P  L   Q  +PP           EG+  QQ     L P A+         
Subjt:  ETLGRQNLGTVGLEAAKVQLSELVSKVSTQSLTAAFPELHQQQQQTQRLPDLLRGLQGPRPPP----------EGAPPQQ----PLGPPAL-------RR

Query:  DPHHGLSMSIGLVQGDKGEGYNGYSPSEAQRSFGSKRKE-VEKET----AFRYRMTPSSLDLNAGEDQLSNNDHAPSTTCKMFDLNGFS
        D  HGLSM +GLVQG+KGEGYNGYSPSE QR FGS RKE VEKE      FRYRM     DLNAGEDQLS+NDHA S TCKMFDLNGFS
Subjt:  DPHHGLSMSIGLVQGDKGEGYNGYSPSEAQRSFGSKRKE-VEKET----AFRYRMTPSSLDLNAGEDQLSNNDHAPSTTCKMFDLNGFS

XP_038903034.1 myb-related protein 2 isoform X1 [Benincasa hispida]2.2e-11166Show/hide
Query:  MYHHQHRGKSIHSSERHLFLQGGNGPGDSGLVLSTDAKPRLKWTPDLHERFVEAVNQLGGADKATPKTVMKIMSIPGLTLYHLKSHLQKYRLSKNVHGQT
        MYHHQHRGKSIHSSERH+FLQGGNGPGDSGLVLSTDAKPRLKWTPDLH+RFVEAVNQLGGADKATPKTVMKIM IPGLTLYHLKSHLQKYRLSKN+HGQ 
Subjt:  MYHHQHRGKSIHSSERHLFLQGGNGPGDSGLVLSTDAKPRLKWTPDLHERFVEAVNQLGGADKATPKTVMKIMSIPGLTLYHLKSHLQKYRLSKNVHGQT

Query:  NGGSGSNKT------GTVAVSGDQRLAEPNGAARTNNIVVAPHSSSQPNNYNSLMIVSYYNYTPLILQKPSNQRNNTDANRSTTALAAANRSTREVSPNS
        NGGSG+NKT      GTV VS DQRLAE NGAARTNNIVVAP  SSQ N   SL I S      + +QK  +++     +     + A  +  + V    
Subjt:  NGGSGSNKT------GTVAVSGDQRLAEPNGAARTNNIVVAPHSSSQPNNYNSLMIVSYYNYTPLILQKPSNQRNNTDANRSTTALAAANRSTREVSPNS

Query:  AGKAQETLGRQNLGTVGLEAAKVQLSELVSKVSTQSLTAAFPELHQQQQQTQRLPDLLRGLQGPRPPP----------EGAPPQQ------------PLG
          KAQETLGRQNLGTVGLEAAKVQLSELVSKVSTQ LTAAFPELH Q Q   +        Q  +PP           EG    Q             L 
Subjt:  AGKAQETLGRQNLGTVGLEAAKVQLSELVSKVSTQSLTAAFPELHQQQQQTQRLPDLLRGLQGPRPPP----------EGAPPQQ------------PLG

Query:  PPALRR----DPHHGLSMSIGLVQGDKGEGYNGYSPSEAQRSFGSKRKE-VEK----ETAFRYRMTPSSLDLNAGEDQL-SNNDHAPSTTCKMFDLNGFS
        P A R     D  HGLSMSIGLVQG+KGEGYNGYSPSE QR FGSKRKE VEK    ET FRYRM     DLNAGEDQL SNNDH  STTCKMFDLNGFS
Subjt:  PPALRR----DPHHGLSMSIGLVQGDKGEGYNGYSPSEAQRSFGSKRKE-VEK----ETAFRYRMTPSSLDLNAGEDQL-SNNDHAPSTTCKMFDLNGFS

XP_038903036.1 myb-related protein 2 isoform X2 [Benincasa hispida]2.3e-11367.01Show/hide
Query:  MYHHQHRGKSIHSSERHLFLQGGNGPGDSGLVLSTDAKPRLKWTPDLHERFVEAVNQLGGADKATPKTVMKIMSIPGLTLYHLKSHLQKYRLSKNVHGQT
        MYHHQHRGKSIHSSERH+FLQGGNGPGDSGLVLSTDAKPRLKWTPDLH+RFVEAVNQLGGADKATPKTVMKIM IPGLTLYHLKSHLQKYRLSKN+HGQ 
Subjt:  MYHHQHRGKSIHSSERHLFLQGGNGPGDSGLVLSTDAKPRLKWTPDLHERFVEAVNQLGGADKATPKTVMKIMSIPGLTLYHLKSHLQKYRLSKNVHGQT

Query:  NGGSGSNKTGTVAVSGDQRLAEPNGAARTNNIVVAPHSSSQPNNYNSLMIVSYYNYTPLILQKPSNQRNNTDANRSTTALAAANRSTREVSPNSAGKAQE
        NGGSG+NKTGTV VS DQRLAE NGAARTNNIVVAP  SSQ N   SL I S      + +QK  +++     +     + A  +  + V      KAQE
Subjt:  NGGSGSNKTGTVAVSGDQRLAEPNGAARTNNIVVAPHSSSQPNNYNSLMIVSYYNYTPLILQKPSNQRNNTDANRSTTALAAANRSTREVSPNSAGKAQE

Query:  TLGRQNLGTVGLEAAKVQLSELVSKVSTQSLTAAFPELHQQQQQTQRLPDLLRGLQGPRPPP----------EGAPPQQ------------PLGPPALRR
        TLGRQNLGTVGLEAAKVQLSELVSKVSTQ LTAAFPELH Q Q   +        Q  +PP           EG    Q             L P A R 
Subjt:  TLGRQNLGTVGLEAAKVQLSELVSKVSTQSLTAAFPELHQQQQQTQRLPDLLRGLQGPRPPP----------EGAPPQQ------------PLGPPALRR

Query:  ----DPHHGLSMSIGLVQGDKGEGYNGYSPSEAQRSFGSKRKE-VEK----ETAFRYRMTPSSLDLNAGEDQL-SNNDHAPSTTCKMFDLNGFS
            D  HGLSMSIGLVQG+KGEGYNGYSPSE QR FGSKRKE VEK    ET FRYRM     DLNAGEDQL SNNDH  STTCKMFDLNGFS
Subjt:  ----DPHHGLSMSIGLVQGDKGEGYNGYSPSEAQRSFGSKRKE-VEK----ETAFRYRMTPSSLDLNAGEDQL-SNNDHAPSTTCKMFDLNGFS

TrEMBL top hitse value%identityAlignment
A0A6J1EY96 myb-related protein 2-like isoform X35.4e-10866.07Show/hide
Query:  YHHQHRGKSIHSSERHLFLQGGNGP-GDSGLVLSTDAKPRLKWTPDLHERFVEAVNQLGGADKATPKTVMKIMSIPGLTLYHLKSHLQKYRLSKNVHGQT
        +HHQHRGKSIHSSERHLFLQGGNGP GDSGLVLSTDAKPRLKWTPDLH+RFVEAVNQLGG DKATPKTVMKIM I GLTLYHLKSHLQKYRLSKN+HGQ 
Subjt:  YHHQHRGKSIHSSERHLFLQGGNGP-GDSGLVLSTDAKPRLKWTPDLHERFVEAVNQLGGADKATPKTVMKIMSIPGLTLYHLKSHLQKYRLSKNVHGQT

Query:  NGGSG-SNKTGTVAVSGDQRLAEPNGAARTNN-IVVAPHSSSQPNNYNSLMIVSYYNYTPLILQKPSNQRNNTDANRSTTALAAANRSTREVSPNSAGKA
        NGGSG +NKTGTVAVS DQRL E NGA RTNN I VAP +SSQPN   SL I S      + +QK  +++     +     + A  +  + V      KA
Subjt:  NGGSG-SNKTGTVAVSGDQRLAEPNGAARTNN-IVVAPHSSSQPNNYNSLMIVSYYNYTPLILQKPSNQRNNTDANRSTTALAAANRSTREVSPNSAGKA

Query:  QETLGRQNLGTVGLEAAKVQLSELVSKVSTQSLTAAFPELHQQQQQTQRLPDLLRGLQGPRPPP----------EGAPPQQ----PLGPPAL-------R
        QETLGRQNLGTVGLEAAKVQLSELVSKVSTQ LTAAFPELH  QQQ Q+        Q  +PP           E +  QQ     L P A+        
Subjt:  QETLGRQNLGTVGLEAAKVQLSELVSKVSTQSLTAAFPELHQQQQQTQRLPDLLRGLQGPRPPP----------EGAPPQQ----PLGPPAL-------R

Query:  RDPHHGLSMSIGLVQGDKGEGYNGYSPSEAQRSFGSKRKE-VEKET---AFRYRMTPSSLDLNAGEDQLSNNDHAPSTTCKMFDLNGFS
         D  HGLSM +GLVQG+KGEGYNGYSPSE QR FGS RKE VEKE    AFRYRM     DLNAGEDQLS+NDHA S TCKMFDLNGFS
Subjt:  RDPHHGLSMSIGLVQGDKGEGYNGYSPSEAQRSFGSKRKE-VEKET---AFRYRMTPSSLDLNAGEDQLSNNDHAPSTTCKMFDLNGFS

A0A6J1F419 myb-related protein 2-like isoform X24.5e-10764.05Show/hide
Query:  YHHQHRGKSIHSSERHLFLQGGNGP-GDSGLVLSTDAKPRLKWTPDLHERFVEAVNQLGGADKATPKTVMKIMSIPGLTLYHLKSHLQKYRLSKNVHGQT
        +HHQHRGKSIHSSERHLFLQGGNGP GDSGLVLSTDAKPRLKWTPDLH+RFVEAVNQLGG DKATPKTVMKIM I GLTLYHLKSHLQKYRLSKN+HGQ 
Subjt:  YHHQHRGKSIHSSERHLFLQGGNGP-GDSGLVLSTDAKPRLKWTPDLHERFVEAVNQLGGADKATPKTVMKIMSIPGLTLYHLKSHLQKYRLSKNVHGQT

Query:  NGGSGSN-------KTGTVAVSGDQRLAEPNGAARTNN-IVVAPHSSSQPNNYNSLMIVSYYNYTPLILQKPSNQRNNTDANRSTTALAAANRSTREVSP
        NGGSG+N       + GTVAVS DQRL E NGA RTNN I VAP +SSQPNN    + +S      + +QK  +++     +     + A  +  + V  
Subjt:  NGGSGSN-------KTGTVAVSGDQRLAEPNGAARTNN-IVVAPHSSSQPNNYNSLMIVSYYNYTPLILQKPSNQRNNTDANRSTTALAAANRSTREVSP

Query:  NSAGKAQETLGRQNLGTVGLEAAKVQLSELVSKVSTQSLTAAFPELHQQQQQTQRLPDLLRGLQGPRPPP----------EGAPPQQ----PLGPPAL--
            KAQETLGRQNLGTVGLEAAKVQLSELVSKVSTQ LTAAFPELH  QQQ Q+        Q  +PP           E +  QQ     L P A+  
Subjt:  NSAGKAQETLGRQNLGTVGLEAAKVQLSELVSKVSTQSLTAAFPELHQQQQQTQRLPDLLRGLQGPRPPP----------EGAPPQQ----PLGPPAL--

Query:  -----RRDPHHGLSMSIGLVQGDKGEGYNGYSPSEAQRSFGSKRKE-VEKET---AFRYRMTPSSLDLNAGEDQLSNNDHAPSTTCKMFDLNGFS
               D  HGLSM +GLVQG+KGEGYNGYSPSE QR FGS RKE VEKE    AFRYRM     DLNAGEDQLS+NDHA S TCKMFDLNGFS
Subjt:  -----RRDPHHGLSMSIGLVQGDKGEGYNGYSPSEAQRSFGSKRKE-VEKET---AFRYRMTPSSLDLNAGEDQLSNNDHAPSTTCKMFDLNGFS

A0A6J1I5V0 myb-related protein 2-like isoform X15.4e-10865.32Show/hide
Query:  YHHQHRGKSIHSSERHLFLQGGNGP-GDSGLVLSTDAKPRLKWTPDLHERFVEAVNQLGGADKATPKTVMKIMSIPGLTLYHLKSHLQKYRLSKNVHGQT
        +HHQHRGKSIHSSERHLFLQGGNGP GDSGLVLSTDAKPRLKWTPDLH+RFVEAVNQLGG DKATPKTVMKIM I GLTLYHLKSHLQKYRLSKN+HGQ 
Subjt:  YHHQHRGKSIHSSERHLFLQGGNGP-GDSGLVLSTDAKPRLKWTPDLHERFVEAVNQLGGADKATPKTVMKIMSIPGLTLYHLKSHLQKYRLSKNVHGQT

Query:  NGGSGSN-------KTGTVAVSGDQRLAEPNGAARTNNIVVAPHSSSQPNNYNSLMIVSYYNYTPLILQKPSNQRNNTDANRSTTALAAANRSTREVSPN
        NGGSG+N       + GTVAVS DQRL E NGAARTN+I VAP +SSQPN   SL I S      + +QK  +++     +     + A  +  + V   
Subjt:  NGGSGSN-------KTGTVAVSGDQRLAEPNGAARTNNIVVAPHSSSQPNNYNSLMIVSYYNYTPLILQKPSNQRNNTDANRSTTALAAANRSTREVSPN

Query:  SAGKAQETLGRQNLGTVGLEAAKVQLSELVSKVSTQSLTAAFPELHQQQQQTQRLPDLLRGLQGPRPPP----------EGAPPQQ----PLGPPAL---
           KAQETLGRQNLGTVGLEAAKVQLSELVSKVSTQ LTAAFPELH QQQQ Q  P  L   Q  +PP           EG+  QQ     L P A+   
Subjt:  SAGKAQETLGRQNLGTVGLEAAKVQLSELVSKVSTQSLTAAFPELHQQQQQTQRLPDLLRGLQGPRPPP----------EGAPPQQ----PLGPPAL---

Query:  ----RRDPHHGLSMSIGLVQGDKGEGYNGYSPSEAQRSFGSKRKE-VEKET----AFRYRMTPSSLDLNAGEDQLSNNDHAPSTTCKMFDLNGFS
              D  HGLSM +GLVQG+KGEGYNGYSPSE QR FGS RKE VEKE      FRYRM     DLNAGEDQLS+NDHA S TCKMFDLNGFS
Subjt:  ----RRDPHHGLSMSIGLVQGDKGEGYNGYSPSEAQRSFGSKRKE-VEKET----AFRYRMTPSSLDLNAGEDQLSNNDHAPSTTCKMFDLNGFS

A0A6J1IB69 myb-related protein 2-like isoform X21.1e-10864.81Show/hide
Query:  YHHQHRGKSIHSSERHLFLQGGNGP-GDSGLVLSTDAKPRLKWTPDLHERFVEAVNQLGGADKATPKTVMKIMSIPGLTLYHLKSHLQKYRLSKNVHGQT
        +HHQHRGKSIHSSERHLFLQGGNGP GDSGLVLSTDAKPRLKWTPDLH+RFVEAVNQLGG DKATPKTVMKIM I GLTLYHLKSHLQKYRLSKN+HGQ 
Subjt:  YHHQHRGKSIHSSERHLFLQGGNGP-GDSGLVLSTDAKPRLKWTPDLHERFVEAVNQLGGADKATPKTVMKIMSIPGLTLYHLKSHLQKYRLSKNVHGQT

Query:  NGGSGSN-------KTGTVAVSGDQRLAEPNGAARTNNIVVAPHSSSQPNNYNSLMIVSYYNYTPLILQKPSNQRNNTDANRSTTALAAANRSTREVSPN
        NGGSG+N       + GTVAVS DQRL E NGAARTN+I VAP +SSQPNN    + +S      + +QK  +++     +     + A  +  + V   
Subjt:  NGGSGSN-------KTGTVAVSGDQRLAEPNGAARTNNIVVAPHSSSQPNNYNSLMIVSYYNYTPLILQKPSNQRNNTDANRSTTALAAANRSTREVSPN

Query:  SAGKAQETLGRQNLGTVGLEAAKVQLSELVSKVSTQSLTAAFPELHQQQQQTQRLPDLLRGLQGPRPPP----------EGAPPQQ----PLGPPAL---
           KAQETLGRQNLGTVGLEAAKVQLSELVSKVSTQ LTAAFPELH QQQQ Q  P  L   Q  +PP           EG+  QQ     L P A+   
Subjt:  SAGKAQETLGRQNLGTVGLEAAKVQLSELVSKVSTQSLTAAFPELHQQQQQTQRLPDLLRGLQGPRPPP----------EGAPPQQ----PLGPPAL---

Query:  ----RRDPHHGLSMSIGLVQGDKGEGYNGYSPSEAQRSFGSKRKE-VEKET----AFRYRMTPSSLDLNAGEDQLSNNDHAPSTTCKMFDLNGFS
              D  HGLSM +GLVQG+KGEGYNGYSPSE QR FGS RKE VEKE      FRYRM     DLNAGEDQLS+NDHA S TCKMFDLNGFS
Subjt:  ----RRDPHHGLSMSIGLVQGDKGEGYNGYSPSEAQRSFGSKRKE-VEKET----AFRYRMTPSSLDLNAGEDQLSNNDHAPSTTCKMFDLNGFS

A0A6J1ICN3 myb-related protein 2-like isoform X31.3e-10966.84Show/hide
Query:  YHHQHRGKSIHSSERHLFLQGGNGP-GDSGLVLSTDAKPRLKWTPDLHERFVEAVNQLGGADKATPKTVMKIMSIPGLTLYHLKSHLQKYRLSKNVHGQT
        +HHQHRGKSIHSSERHLFLQGGNGP GDSGLVLSTDAKPRLKWTPDLH+RFVEAVNQLGG DKATPKTVMKIM I GLTLYHLKSHLQKYRLSKN+HGQ 
Subjt:  YHHQHRGKSIHSSERHLFLQGGNGP-GDSGLVLSTDAKPRLKWTPDLHERFVEAVNQLGGADKATPKTVMKIMSIPGLTLYHLKSHLQKYRLSKNVHGQT

Query:  NGGSG-SNKTGTVAVSGDQRLAEPNGAARTNNIVVAPHSSSQPNNYNSLMIVSYYNYTPLILQKPSNQRNNTDANRSTTALAAANRSTREVSPNSAGKAQ
        NGGSG +NKTGTVAVS DQRL E NGAARTN+I VAP +SSQPN   SL I S      + +QK  +++     +     + A  +  + V      KAQ
Subjt:  NGGSG-SNKTGTVAVSGDQRLAEPNGAARTNNIVVAPHSSSQPNNYNSLMIVSYYNYTPLILQKPSNQRNNTDANRSTTALAAANRSTREVSPNSAGKAQ

Query:  ETLGRQNLGTVGLEAAKVQLSELVSKVSTQSLTAAFPELHQQQQQTQRLPDLLRGLQGPRPPP----------EGAPPQQ----PLGPPAL-------RR
        ETLGRQNLGTVGLEAAKVQLSELVSKVSTQ LTAAFPELH QQQQ Q  P  L   Q  +PP           EG+  QQ     L P A+         
Subjt:  ETLGRQNLGTVGLEAAKVQLSELVSKVSTQSLTAAFPELHQQQQQTQRLPDLLRGLQGPRPPP----------EGAPPQQ----PLGPPAL-------RR

Query:  DPHHGLSMSIGLVQGDKGEGYNGYSPSEAQRSFGSKRKE-VEKET----AFRYRMTPSSLDLNAGEDQLSNNDHAPSTTCKMFDLNGFS
        D  HGLSM +GLVQG+KGEGYNGYSPSE QR FGS RKE VEKE      FRYRM     DLNAGEDQLS+NDHA S TCKMFDLNGFS
Subjt:  DPHHGLSMSIGLVQGDKGEGYNGYSPSEAQRSFGSKRKE-VEKET----AFRYRMTPSSLDLNAGEDQLSNNDHAPSTTCKMFDLNGFS

SwissProt top hitse value%identityAlignment
E5L8F7 Myb family transcription factor IPN21.3e-2661.05Show/hide
Query:  SIHSSERHLFLQGGNGPGDSGLVLSTDAKPRLKWTPDLHERFVEAVNQLGGADKATPKTVMKIMSIPGLTLYHLKSHLQKYRLSKNVHGQTNGGS
        +++S +R + +Q     GDSGLVL+TD KPRL+WT +LHERFV+AV QLGG DKATPKT+M++M + GLTLYHLKSHLQK+RL K  H + N  S
Subjt:  SIHSSERHLFLQGGNGPGDSGLVLSTDAKPRLKWTPDLHERFVEAVNQLGGADKATPKTVMKIMSIPGLTLYHLKSHLQKYRLSKNVHGQTNGGS

F4I274 Myb family transcription factor PHL85.5e-2536.77Show/hide
Query:  NGPGDSGLVLSTDAKPRLKWTPDLHERFVEAVNQLGGADKATPKTVMKIMSIPGLTLYHLKSHLQKYRLSKNVHGQTNGGSGSNKTGTVAVSGDQRLAEP
        N      LVLSTDAKPRLKWT DLH +F+EAVNQLGG +KATPK +MK+M IPGLTLYHLKSHLQKYRL K++          NK    + S +Q +   
Subjt:  NGPGDSGLVLSTDAKPRLKWTPDLHERFVEAVNQLGGADKATPKTVMKIMSIPGLTLYHLKSHLQKYRLSKNVHGQTNGGSGSNKTGTVAVSGDQRLAEP

Query:  NGAARTNNIVVAPHSSSQPNNYNSLMIVSYYNYTPLILQKPSNQRNNTDANRSTTALAAANRSTREVSPNSAGKAQETLGRQNLGTVGLEAAKVQLSELV
        N +       V   +S+       L I      T  +  +   Q+   +       L     +  +   +   KAQ+TL   +   +G++ A+ +LS L 
Subjt:  NGAARTNNIVVAPHSSSQPNNYNSLMIVSYYNYTPLILQKPSNQRNNTDANRSTTALAAANRSTREVSPNSAGKAQETLGRQNLGTVGLEAAKVQLSELV

Query:  SKVSTQSLTAAFPELHQQQQQTQ
        S V+    + +F EL Q +++ +
Subjt:  SKVSTQSLTAAFPELHQQQQQTQ

Q94A57 Protein PHR1-LIKE 21.1e-2537.79Show/hide
Query:  LQGGNGPGDSGLVLSTDAKPRLKWTPDLHERFVEAVNQLGGADKATPKTVMKIMSIPGLTLYHLKSHLQKYRLSKNVHGQTNGGSGSNKTGTVAVSGDQR
        L G N PGD+ LVL+TD KPRL+WT +LHERFV+AV QLGG DKATPKT+M+ M + GLTLYHLKSHLQK+RL     G+  G   +  +   +  G+  
Subjt:  LQGGNGPGDSGLVLSTDAKPRLKWTPDLHERFVEAVNQLGGADKATPKTVMKIMSIPGLTLYHLKSHLQKYRLSKNVHGQTNGGSGSNKTGTVAVSGDQR

Query:  LAEPNGAARTNNIVVAPHSSSQPNNYNSLMIVSYYNYTPLILQKPSNQRNNTDANRSTTALAAANRSTREVSPNSAGKAQETLGRQNLGTVGLEAAKVQL
         ++  G++ T+++ +A    ++            Y  T  +  +   QR   D       L     +  +   +   KA +    Q     GLEAA+ +L
Subjt:  LAEPNGAARTNNIVVAPHSSSQPNNYNSLMIVSYYNYTPLILQKPSNQRNNTDANRSTTALAAANRSTREVSPNSAGKAQETLGRQNLGTVGLEAAKVQL

Query:  SELVSKVSTQSLTAAFP
        SEL  KVS  S   + P
Subjt:  SELVSKVSTQSLTAAFP

Q9FK47 Myb-related protein 13.2e-4948.68Show/hide
Query:  YHHQHRGKSI-------HSSERHLFLQGGNGPGDSGLVLSTDAKPRLKWTPDLHERFVEAVNQLGGADKATPKTVMKIMSIPGLTLYHLKSHLQKYRLSK
        YH+QH+GKSI        SSERH FL+ GNG GDSGL+LSTDAKPRLKWTPDLHERFVEAVNQLGG DKATPKT+MK+M IPGLTLYHLKSHLQKYRLSK
Subjt:  YHHQHRGKSI-------HSSERHLFLQGGNGPGDSGLVLSTDAKPRLKWTPDLHERFVEAVNQLGGADKATPKTVMKIMSIPGLTLYHLKSHLQKYRLSK

Query:  NVHGQTNGGSGSNKTGTVAVSGDQRLAEPNGAARTNNIVVAPHSSSQPNNYNSLMI---VSYYNYTPLILQKPSNQRNNTDANRSTTALAAANRSTREVS
        N++GQ N         T+       + E    + + ++ + P  S      ++L +   V    +  L +Q+    R         + L           
Subjt:  NVHGQTNGGSGSNKTGTVAVSGDQRLAEPNGAARTNNIVVAPHSSSQPNNYNSLMI---VSYYNYTPLILQKPSNQRNNTDANRSTTALAAANRSTREVS

Query:  PNSAGKAQETLGRQNLGTVGLEAAKVQLSELVSKVSTQSLTAAFPE------LHQQQQQTQRLPD
             KAQETLGRQNLG  G+EA K QLSELVSKVS     ++F E      LH QQ Q    P+
Subjt:  PNSAGKAQETLGRQNLGTVGLEAAKVQLSELVSKVSTQSLTAAFPE------LHQQQQQTQRLPD

Q9SQQ9 Myb-related protein 21.1e-4950Show/hide
Query:  YHHQHRGKSIHS-------SERHLFLQGGNGPGDSGLVLSTDAKPRLKWTPDLHERFVEAVNQLGGADKATPKTVMKIMSIPGLTLYHLKSHLQKYRLSK
        Y +QH+GK+I S       SERH FL+ GN PGDSGL+LSTDAKPRLKWTPDLHERF+EAVNQLGGADKATPKT+MK+M IPGLTLYHLKSHLQKYRLSK
Subjt:  YHHQHRGKSIHS-------SERHLFLQGGNGPGDSGLVLSTDAKPRLKWTPDLHERFVEAVNQLGGADKATPKTVMKIMSIPGLTLYHLKSHLQKYRLSK

Query:  NVHGQTNGGSGSNKTGTVAVSGDQRLAEPNGAARTNNIVVAPHSSSQPNNYNSLMIVSYYNYTPLILQKPSNQRNNTDANRSTTALAAANRSTREVSPNS
        N++GQ N  +  NK G + +  ++         ++ N+ + P    QPN  + +           +  +   QR   +       L     +  +   + 
Subjt:  NVHGQTNGGSGSNKTGTVAVSGDQRLAEPNGAARTNNIVVAPHSSSQPNNYNSLMIVSYYNYTPLILQKPSNQRNNTDANRSTTALAAANRSTREVSPNS

Query:  AGKAQETLGRQNLGTVGLEAAKVQLSELVSKVSTQSLTAAFPE------LHQQQQQTQRLPD
          KAQETLGRQNLG  G+EAAKVQLSELVSKVS +   ++F E      L  QQ QT   PD
Subjt:  AGKAQETLGRQNLGTVGLEAAKVQLSELVSKVSTQSLTAAFPE------LHQQQQQTQRLPD

Arabidopsis top hitse value%identityAlignment
AT3G04030.1 Homeodomain-like superfamily protein1.6e-5151.15Show/hide
Query:  YHHQHRGKSIHS-------SERHLFLQGGNGPGDSGLVLSTDAKPRLKWTPDLHERFVEAVNQLGGADKATPKTVMKIMSIPGLTLYHLKSHLQKYRLSK
        Y +QH+GK+I S       SERH FL+ GN PGDSGL+LSTDAKPRLKWTPDLHERF+EAVNQLGGADKATPKT+MK+M IPGLTLYHLKSHLQKYRLSK
Subjt:  YHHQHRGKSIHS-------SERHLFLQGGNGPGDSGLVLSTDAKPRLKWTPDLHERFVEAVNQLGGADKATPKTVMKIMSIPGLTLYHLKSHLQKYRLSK

Query:  NVHGQTNGGSGSNKTGTVAVSGDQRLAEPNGAARTNNIVVAPHSSSQPNNYNSLMIVSYYNYTPLILQKPSNQRNNTDANRSTTALAAANRSTREVSPNS
        N++GQ N  +  NK G + +  ++         ++ N+ + P    QPN  + +          L +Q    +R +        A     +S  E     
Subjt:  NVHGQTNGGSGSNKTGTVAVSGDQRLAEPNGAARTNNIVVAPHSSSQPNNYNSLMIVSYYNYTPLILQKPSNQRNNTDANRSTTALAAANRSTREVSPNS

Query:  AGKAQETLGRQNLGTVGLEAAKVQLSELVSKVSTQSLTAAFPE------LHQQQQQTQRLPD
          KAQETLGRQNLG  G+EAAKVQLSELVSKVS +   ++F E      L  QQ QT   PD
Subjt:  AGKAQETLGRQNLGTVGLEAAKVQLSELVSKVSTQSLTAAFPE------LHQQQQQTQRLPD

AT3G04030.3 Homeodomain-like superfamily protein7.8e-5150Show/hide
Query:  YHHQHRGKSIHS-------SERHLFLQGGNGPGDSGLVLSTDAKPRLKWTPDLHERFVEAVNQLGGADKATPKTVMKIMSIPGLTLYHLKSHLQKYRLSK
        Y +QH+GK+I S       SERH FL+ GN PGDSGL+LSTDAKPRLKWTPDLHERF+EAVNQLGGADKATPKT+MK+M IPGLTLYHLKSHLQKYRLSK
Subjt:  YHHQHRGKSIHS-------SERHLFLQGGNGPGDSGLVLSTDAKPRLKWTPDLHERFVEAVNQLGGADKATPKTVMKIMSIPGLTLYHLKSHLQKYRLSK

Query:  NVHGQTNGGSGSNKTGTVAVSGDQRLAEPNGAARTNNIVVAPHSSSQPNNYNSLMIVSYYNYTPLILQKPSNQRNNTDANRSTTALAAANRSTREVSPNS
        N++GQ N  +  NK G + +  ++         ++ N+ + P    QPN  + +           +  +   QR   +       L     +  +   + 
Subjt:  NVHGQTNGGSGSNKTGTVAVSGDQRLAEPNGAARTNNIVVAPHSSSQPNNYNSLMIVSYYNYTPLILQKPSNQRNNTDANRSTTALAAANRSTREVSPNS

Query:  AGKAQETLGRQNLGTVGLEAAKVQLSELVSKVSTQSLTAAFPE------LHQQQQQTQRLPD
          KAQETLGRQNLG  G+EAAKVQLSELVSKVS +   ++F E      L  QQ QT   PD
Subjt:  AGKAQETLGRQNLGTVGLEAAKVQLSELVSKVSTQSLTAAFPE------LHQQQQQTQRLPD

AT5G18240.1 myb-related protein 12.3e-5048.68Show/hide
Query:  YHHQHRGKSI-------HSSERHLFLQGGNGPGDSGLVLSTDAKPRLKWTPDLHERFVEAVNQLGGADKATPKTVMKIMSIPGLTLYHLKSHLQKYRLSK
        YH+QH+GKSI        SSERH FL+ GNG GDSGL+LSTDAKPRLKWTPDLHERFVEAVNQLGG DKATPKT+MK+M IPGLTLYHLKSHLQKYRLSK
Subjt:  YHHQHRGKSI-------HSSERHLFLQGGNGPGDSGLVLSTDAKPRLKWTPDLHERFVEAVNQLGGADKATPKTVMKIMSIPGLTLYHLKSHLQKYRLSK

Query:  NVHGQTNGGSGSNKTGTVAVSGDQRLAEPNGAARTNNIVVAPHSSSQPNNYNSLMI---VSYYNYTPLILQKPSNQRNNTDANRSTTALAAANRSTREVS
        N++GQ N         T+       + E    + + ++ + P  S      ++L +   V    +  L +Q+    R         + L           
Subjt:  NVHGQTNGGSGSNKTGTVAVSGDQRLAEPNGAARTNNIVVAPHSSSQPNNYNSLMI---VSYYNYTPLILQKPSNQRNNTDANRSTTALAAANRSTREVS

Query:  PNSAGKAQETLGRQNLGTVGLEAAKVQLSELVSKVSTQSLTAAFPE------LHQQQQQTQRLPD
             KAQETLGRQNLG  G+EA K QLSELVSKVS     ++F E      LH QQ Q    P+
Subjt:  PNSAGKAQETLGRQNLGTVGLEAAKVQLSELVSKVSTQSLTAAFPE------LHQQQQQTQRLPD

AT5G18240.4 myb-related protein 12.3e-5048.68Show/hide
Query:  YHHQHRGKSI-------HSSERHLFLQGGNGPGDSGLVLSTDAKPRLKWTPDLHERFVEAVNQLGGADKATPKTVMKIMSIPGLTLYHLKSHLQKYRLSK
        YH+QH+GKSI        SSERH FL+ GNG GDSGL+LSTDAKPRLKWTPDLHERFVEAVNQLGG DKATPKT+MK+M IPGLTLYHLKSHLQKYRLSK
Subjt:  YHHQHRGKSI-------HSSERHLFLQGGNGPGDSGLVLSTDAKPRLKWTPDLHERFVEAVNQLGGADKATPKTVMKIMSIPGLTLYHLKSHLQKYRLSK

Query:  NVHGQTNGGSGSNKTGTVAVSGDQRLAEPNGAARTNNIVVAPHSSSQPNNYNSLMI---VSYYNYTPLILQKPSNQRNNTDANRSTTALAAANRSTREVS
        N++GQ N         T+       + E    + + ++ + P  S      ++L +   V    +  L +Q+    R         + L           
Subjt:  NVHGQTNGGSGSNKTGTVAVSGDQRLAEPNGAARTNNIVVAPHSSSQPNNYNSLMI---VSYYNYTPLILQKPSNQRNNTDANRSTTALAAANRSTREVS

Query:  PNSAGKAQETLGRQNLGTVGLEAAKVQLSELVSKVSTQSLTAAFPE------LHQQQQQTQRLPD
             KAQETLGRQNLG  G+EA K QLSELVSKVS     ++F E      LH QQ Q    P+
Subjt:  PNSAGKAQETLGRQNLGTVGLEAAKVQLSELVSKVSTQSLTAAFPE------LHQQQQQTQRLPD

AT5G18240.5 myb-related protein 13.0e-5048.85Show/hide
Query:  YHHQHRGKSI-------HSSERHLFLQGGNGPGDSGLVLSTDAKPRLKWTPDLHERFVEAVNQLGGADKATPKTVMKIMSIPGLTLYHLKSHLQKYRLSK
        YH+QH+GKSI        SSERH FL+ GNG GDSGL+LSTDAKPRLKWTPDLHERFVEAVNQLGG DKATPKT+MK+M IPGLTLYHLKSHLQKYRLSK
Subjt:  YHHQHRGKSI-------HSSERHLFLQGGNGPGDSGLVLSTDAKPRLKWTPDLHERFVEAVNQLGGADKATPKTVMKIMSIPGLTLYHLKSHLQKYRLSK

Query:  NVHGQTNGGSGSNKTGTVAVSGDQRLAEPNGAARTNNIVVAPHSSSQPNNYNSLMIVSYYNYTPLILQKPSNQRNNTDANRSTTALAAANRSTREVSPNS
        N++GQ N         T+       + E    + + ++ + P  S      ++L +        + +Q+  +++          A     +S  E     
Subjt:  NVHGQTNGGSGSNKTGTVAVSGDQRLAEPNGAARTNNIVVAPHSSSQPNNYNSLMIVSYYNYTPLILQKPSNQRNNTDANRSTTALAAANRSTREVSPNS

Query:  AGKAQETLGRQNLGTVGLEAAKVQLSELVSKVSTQSLTAAFPE------LHQQQQQTQRLPD
          KAQETLGRQNLG  G+EA K QLSELVSKVS     ++F E      LH QQ Q    P+
Subjt:  AGKAQETLGRQNLGTVGLEAAKVQLSELVSKVSTQSLTAAFPE------LHQQQQQTQRLPD


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTACCATCATCAGCACAGAGGGAAGAGCATCCACTCCTCTGAGAGGCATTTGTTTTTACAAGGTGGGAATGGTCCTGGAGATTCAGGTCTTGTTCTTTCCACCGATGC
CAAGCCCAGGCTTAAATGGACTCCTGATTTGCATGAACGTTTTGTCGAAGCTGTCAACCAGCTTGGAGGGGCTGACAAGGCAACTCCTAAAACTGTCATGAAAATCATGA
GCATTCCTGGACTTACCTTGTACCACTTGAAAAGCCATCTCCAGAAATACAGGCTGAGCAAGAACGTGCATGGACAAACAAATGGTGGAAGTGGAAGCAACAAAACTGGC
ACTGTGGCTGTTTCTGGTGATCAGAGATTAGCGGAGCCTAATGGAGCAGCTCGCACTAACAACATAGTCGTCGCCCCACACTCCTCCTCCCAGCCCAACAATTACAATTC
CTTGATGATAGTTTCTTACTATAATTACACTCCCTTAATTCTCCAGAAGCCTTCAAATCAGCGAAACAATACAGATGCAAATCGAAGTACAACGGCACTTGCAGCTGCGA
ATAGAAGCACAAGGGAAGTATCTCCAAACAGTGCTGGAAAAGCACAAGAGACGTTAGGAAGACAGAACTTAGGCACAGTGGGACTTGAAGCCGCAAAAGTTCAGCTCTCA
GAATTGGTCTCAAAAGTGTCCACTCAATCCCTAACCGCAGCATTTCCAGAGCTCCACCAACAACAACAGCAAACGCAAAGGTTGCCTGACCTCCTGCGAGGGCTCCAAGG
ACCAAGACCACCACCAGAAGGTGCTCCTCCACAACAGCCACTTGGCCCTCCGGCCCTACGCCGGGACCCCCATCATGGCCTGTCCATGAGCATTGGGCTGGTGCAGGGGG
ACAAGGGGGAAGGCTATAACGGGTATTCGCCATCCGAGGCTCAGAGATCATTTGGGAGTAAAAGAAAGGAAGTGGAGAAAGAGACGGCGTTTAGGTATAGAATGACGCCT
TCTTCGTTGGATTTGAATGCTGGTGAGGATCAACTTAGTAATAATGATCATGCGCCTTCTACTACTTGCAAGATGTTTGATCTTAATGGCTTTAGCTGA
mRNA sequenceShow/hide mRNA sequence
ATGTACCATCATCAGCACAGAGGGAAGAGCATCCACTCCTCTGAGAGGCATTTGTTTTTACAAGGTGGGAATGGTCCTGGAGATTCAGGTCTTGTTCTTTCCACCGATGC
CAAGCCCAGGCTTAAATGGACTCCTGATTTGCATGAACGTTTTGTCGAAGCTGTCAACCAGCTTGGAGGGGCTGACAAGGCAACTCCTAAAACTGTCATGAAAATCATGA
GCATTCCTGGACTTACCTTGTACCACTTGAAAAGCCATCTCCAGAAATACAGGCTGAGCAAGAACGTGCATGGACAAACAAATGGTGGAAGTGGAAGCAACAAAACTGGC
ACTGTGGCTGTTTCTGGTGATCAGAGATTAGCGGAGCCTAATGGAGCAGCTCGCACTAACAACATAGTCGTCGCCCCACACTCCTCCTCCCAGCCCAACAATTACAATTC
CTTGATGATAGTTTCTTACTATAATTACACTCCCTTAATTCTCCAGAAGCCTTCAAATCAGCGAAACAATACAGATGCAAATCGAAGTACAACGGCACTTGCAGCTGCGA
ATAGAAGCACAAGGGAAGTATCTCCAAACAGTGCTGGAAAAGCACAAGAGACGTTAGGAAGACAGAACTTAGGCACAGTGGGACTTGAAGCCGCAAAAGTTCAGCTCTCA
GAATTGGTCTCAAAAGTGTCCACTCAATCCCTAACCGCAGCATTTCCAGAGCTCCACCAACAACAACAGCAAACGCAAAGGTTGCCTGACCTCCTGCGAGGGCTCCAAGG
ACCAAGACCACCACCAGAAGGTGCTCCTCCACAACAGCCACTTGGCCCTCCGGCCCTACGCCGGGACCCCCATCATGGCCTGTCCATGAGCATTGGGCTGGTGCAGGGGG
ACAAGGGGGAAGGCTATAACGGGTATTCGCCATCCGAGGCTCAGAGATCATTTGGGAGTAAAAGAAAGGAAGTGGAGAAAGAGACGGCGTTTAGGTATAGAATGACGCCT
TCTTCGTTGGATTTGAATGCTGGTGAGGATCAACTTAGTAATAATGATCATGCGCCTTCTACTACTTGCAAGATGTTTGATCTTAATGGCTTTAGCTGA
Protein sequenceShow/hide protein sequence
MYHHQHRGKSIHSSERHLFLQGGNGPGDSGLVLSTDAKPRLKWTPDLHERFVEAVNQLGGADKATPKTVMKIMSIPGLTLYHLKSHLQKYRLSKNVHGQTNGGSGSNKTG
TVAVSGDQRLAEPNGAARTNNIVVAPHSSSQPNNYNSLMIVSYYNYTPLILQKPSNQRNNTDANRSTTALAAANRSTREVSPNSAGKAQETLGRQNLGTVGLEAAKVQLS
ELVSKVSTQSLTAAFPELHQQQQQTQRLPDLLRGLQGPRPPPEGAPPQQPLGPPALRRDPHHGLSMSIGLVQGDKGEGYNGYSPSEAQRSFGSKRKEVEKETAFRYRMTP
SSLDLNAGEDQLSNNDHAPSTTCKMFDLNGFS