; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr027788 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr027788
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Descriptionhomeobox-leucine zipper protein HAT9-like
Genome locationtig00153055:2592622..2604608
RNA-Seq ExpressionSgr027788
SyntenySgr027788
Gene Ontology termsGO:0006355 - regulation of transcription, DNA-templated (biological process)
GO:0000981 - DNA-binding transcription factor activity, RNA polymerase II-specific (molecular function)
GO:0003677 - DNA binding (molecular function)
InterPro domainsIPR001356 - Homeobox domain
IPR009057 - Homeobox-like domain superfamily
IPR017970 - Homeobox, conserved site


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6591449.1 Homeobox-leucine zipper protein HAT9, partial [Cucurbita argyrosperma subsp. sororia]3.2e-4045.79Show/hide
Query:  MDGDCNTGLLLGLGRASD--NSLRSFDPEV-DVKKKL-VLKFDDILPSLTLGLSAQTCLSVAEIIVQKTAGES----ADDLLQQASSNASPVSSFSNSSG
        MD DCNTGLLLGLGR  D  NS+R   P+V  VKKKL VLKFDDILPSLTLGLS         ++V+K+ GES    A++L+ Q SS  SPVSSFSNSSG
Subjt:  MDGDCNTGLLLGLGRASD--NSLRSFDPEV-DVKKKL-VLKFDDILPSLTLGLSAQTCLSVAEIIVQKTAGES----ADDLLQQASSNASPVSSFSNSSG

Query:  FKRDRDVWVAGAGEEKEAEAEMYSERASRKL--------PRKTM------------------------MEDLAGQLNLRPRQVEVWFQNRRARTKLKQTE
        FKR+RD    GAGEE   EAE++ ER S K+        PRK +                         +DLA QLNLRPRQVEVWFQNRRARTKLKQTE
Subjt:  FKRDRDVWVAGAGEEKEAEAEMYSERASRKL--------PRKTM------------------------MEDLAGQLNLRPRQVEVWFQNRRARTKLKQTE

Query:  MDCELLKKCSTGGHSHRVPFLREIHLWRRRWRRFVSDHHLLDWAEASL-------------------------------------LKFPFNHPSAAC
        MDCELLKKC            +E  L   +  +  +    +    A+L                                     LKFPFNHPSAAC
Subjt:  MDCELLKKCSTGGHSHRVPFLREIHLWRRRWRRFVSDHHLLDWAEASL-------------------------------------LKFPFNHPSAAC

XP_022137752.1 homeobox-leucine zipper protein HAT9-like [Momordica charantia]1.4e-4650.34Show/hide
Query:  MDGDCN--TGLLLGLGRASDNSLRSFDPEVDVKKKLVLKFDDILPSLTLGLSAQT--CLSVAEIIVQKTAGESADDLLQQASSNASPVSSFSNSSGFKRD
        MDGDC+  TGLLLGLGR+  N LRS  PEVDVKKKLVLKFDDILP LTLGLS  T    SVAEIIV KT   +A++LLQQ SS ASPVSSFSNSSG KRD
Subjt:  MDGDCN--TGLLLGLGRASDNSLRSFDPEVDVKKKLVLKFDDILPSLTLGLSAQT--CLSVAEIIVQKTAGESADDLLQQASSNASPVSSFSNSSGFKRD

Query:  R-DVWVAGAGEEKEAE-AEMYSERASRKL---------PRKTM------------------------MEDLAGQLNLRPRQVEVWFQNRRARTKLKQTEM
        R D W+ G GEE+EAE AE+Y ER S K+         PRK +                          DLA QL+LRPRQVEVWFQNRRARTKLKQTEM
Subjt:  R-DVWVAGAGEEKEAE-AEMYSERASRKL---------PRKTM------------------------MEDLAGQLNLRPRQVEVWFQNRRARTKLKQTEM

Query:  DCELLKKCSTGGHSHRVPFLREIHLWRRRWRRFVSDHHLLDWAEASL---------------------------------LKFPFNHPSAAC
        DCELLKKC            +E  L   +  +  +    +    A+L                                 LKFPFNHPSAAC
Subjt:  DCELLKKCSTGGHSHRVPFLREIHLWRRRWRRFVSDHHLLDWAEASL---------------------------------LKFPFNHPSAAC

XP_022935802.1 homeobox-leucine zipper protein HAT9-like [Cucurbita moschata]7.2e-4058.37Show/hide
Query:  MDGDCNTGLLLGLGRASD--NSLRSFDPEV-DVKKKL-VLKFDDILPSLTLGLSAQTCLSVAEIIVQKTAGES----ADDLLQQASSNASPVSSFSNSSG
        MD DCNTGLLLGLGR  D  NS+R   P+V  VKKKL VLKFDDILPSLTLGLS         ++V K+ GES    AD+L+QQ SS  SPVSSFS+SSG
Subjt:  MDGDCNTGLLLGLGRASD--NSLRSFDPEV-DVKKKL-VLKFDDILPSLTLGLSAQTCLSVAEIIVQKTAGES----ADDLLQQASSNASPVSSFSNSSG

Query:  FKRDRDVWVAGAGEEKEAEAEMYSERASRKL--------PRKTM------------------------MEDLAGQLNLRPRQVEVWFQNRRARTKLKQTE
        FKR+RD    GAGEE  AEAE++ ER S K+        PRK +                         +DLA QLNLRPRQVEVWFQNRRARTKLKQTE
Subjt:  FKRDRDVWVAGAGEEKEAEAEMYSERASRKL--------PRKTM------------------------MEDLAGQLNLRPRQVEVWFQNRRARTKLKQTE

Query:  MDCELLKKC
        MDCELLKKC
Subjt:  MDCELLKKC

XP_022977087.1 homeobox-leucine zipper protein HAT9-like [Cucurbita maxima]3.8e-4146.76Show/hide
Query:  MDGDCNTGLLLGLGRASD--NSLRSFDPEV-DVKKKL-VLKFDDILPSLTLGLSAQTCLSVAEIIVQKTAGES-ADDLLQQASSNASPVSSFSNSSGFKR
        MD DCN GLLLGLGR  D  NS+R   P+V  VKKKL VLKFDDILPSLTLGLS         ++V+K+ GES ADDL+ Q SS  SP SSFSNSSGFKR
Subjt:  MDGDCNTGLLLGLGRASD--NSLRSFDPEV-DVKKKL-VLKFDDILPSLTLGLSAQTCLSVAEIIVQKTAGES-ADDLLQQASSNASPVSSFSNSSGFKR

Query:  DRDVWVAGAGEEKEAEAEMYSERASRKL--------PRKTM------------------------MEDLAGQLNLRPRQVEVWFQNRRARTKLKQTEMDC
        +RD    GAGEE  AEAE++ ER S K+        PRK +                         +DLA QLNLRPRQVEVWFQNRRARTKLKQTEMDC
Subjt:  DRDVWVAGAGEEKEAEAEMYSERASRKL--------PRKTM------------------------MEDLAGQLNLRPRQVEVWFQNRRARTKLKQTEMDC

Query:  ELLKKCSTGGHSHRVPFLREIHLWRRRWRRFVSDHHLLDWAEASL------------------------------------LKFPFNHPSAAC
        ELLKKC            +E  L   +  +  +    +    A+L                                    LKFPFNHPSAAC
Subjt:  ELLKKCSTGGHSHRVPFLREIHLWRRRWRRFVSDHHLLDWAEASL------------------------------------LKFPFNHPSAAC

XP_023536403.1 homeobox-leucine zipper protein HAT9-like [Cucurbita pepo subsp. pepo]5.5e-4058.37Show/hide
Query:  MDGDCNTGLLLGLGRASD--NSLRSFDPEV-DVKKKL-VLKFDDILPSLTLGLSAQTCLSVAEIIVQKTAGES----ADDLLQQASSNASPVSSFSNSSG
        MD DCNTGLLLGLGR  D  NS+R   P+V  VKKKL VLKFDDILPSLTLGLS         ++V K+ GES    AD+L+QQ SS  SPVSSFSNSSG
Subjt:  MDGDCNTGLLLGLGRASD--NSLRSFDPEV-DVKKKL-VLKFDDILPSLTLGLSAQTCLSVAEIIVQKTAGES----ADDLLQQASSNASPVSSFSNSSG

Query:  FKRDRDVWVAGAGEEKEAEAEMYSERASRKL--------PRKTM------------------------MEDLAGQLNLRPRQVEVWFQNRRARTKLKQTE
        FKR+RD    GAGEE  AE E++ ER S K+        PRK +                         +DLA QLNLRPRQVEVWFQNRRARTKLKQTE
Subjt:  FKRDRDVWVAGAGEEKEAEAEMYSERASRKL--------PRKTM------------------------MEDLAGQLNLRPRQVEVWFQNRRARTKLKQTE

Query:  MDCELLKKC
        MDCELLKKC
Subjt:  MDCELLKKC

TrEMBL top hitse value%identityAlignment
A0A0A0L3X3 Homeobox domain-containing protein5.6e-3042.35Show/hide
Query:  MDGDCNTGLLLGLGRASDN----SLRSFDPEVDVKK-KLVLKF-DDILPSLTLGLSAQTCLSVAEIIVQKTAGESADDLLQQASSNASPVSSFSNSSGFK
        MD DCNTGLLLGLGR S +    S+RS  P ++ KK + VLKF DDILPSLTLGLS           V  TA E           + SPVSSFSNSSGFK
Subjt:  MDGDCNTGLLLGLGRASDN----SLRSFDPEVDVKK-KLVLKF-DDILPSLTLGLSAQTCLSVAEIIVQKTAGESADDLLQQASSNASPVSSFSNSSGFK

Query:  RDRDVWVAGAGEEKEAEAEMYSERASRKLPRKT--------------------MMEDLAGQLNLRPRQVEVWFQNRRARTKLKQTEMDCELLKKCSTGGH
        R+R        EE     E   E + RK  R T                      +DLA QLNLRPRQVEVWFQNRRARTKLKQTEMDCELLKKC     
Subjt:  RDRDVWVAGAGEEKEAEAEMYSERASRKLPRKT--------------------MMEDLAGQLNLRPRQVEVWFQNRRARTKLKQTEMDCELLKKCSTGGH

Query:  SHRVPFLREIHLWRRRWRRFVSDHHLLDWAEASL----------------------------------LKFPFNHPSAACN
               +E  L   +  +       +    A+L                                  LKFPFNHPSAACN
Subjt:  SHRVPFLREIHLWRRRWRRFVSDHHLLDWAEASL----------------------------------LKFPFNHPSAACN

A0A6J1C954 homeobox-leucine zipper protein HAT9-like6.5e-4750.34Show/hide
Query:  MDGDCN--TGLLLGLGRASDNSLRSFDPEVDVKKKLVLKFDDILPSLTLGLSAQT--CLSVAEIIVQKTAGESADDLLQQASSNASPVSSFSNSSGFKRD
        MDGDC+  TGLLLGLGR+  N LRS  PEVDVKKKLVLKFDDILP LTLGLS  T    SVAEIIV KT   +A++LLQQ SS ASPVSSFSNSSG KRD
Subjt:  MDGDCN--TGLLLGLGRASDNSLRSFDPEVDVKKKLVLKFDDILPSLTLGLSAQT--CLSVAEIIVQKTAGESADDLLQQASSNASPVSSFSNSSGFKRD

Query:  R-DVWVAGAGEEKEAE-AEMYSERASRKL---------PRKTM------------------------MEDLAGQLNLRPRQVEVWFQNRRARTKLKQTEM
        R D W+ G GEE+EAE AE+Y ER S K+         PRK +                          DLA QL+LRPRQVEVWFQNRRARTKLKQTEM
Subjt:  R-DVWVAGAGEEKEAE-AEMYSERASRKL---------PRKTM------------------------MEDLAGQLNLRPRQVEVWFQNRRARTKLKQTEM

Query:  DCELLKKCSTGGHSHRVPFLREIHLWRRRWRRFVSDHHLLDWAEASL---------------------------------LKFPFNHPSAAC
        DCELLKKC            +E  L   +  +  +    +    A+L                                 LKFPFNHPSAAC
Subjt:  DCELLKKCSTGGHSHRVPFLREIHLWRRRWRRFVSDHHLLDWAEASL---------------------------------LKFPFNHPSAAC

A0A6J1F6L5 homeobox-leucine zipper protein HAT9-like3.5e-4058.37Show/hide
Query:  MDGDCNTGLLLGLGRASD--NSLRSFDPEV-DVKKKL-VLKFDDILPSLTLGLSAQTCLSVAEIIVQKTAGES----ADDLLQQASSNASPVSSFSNSSG
        MD DCNTGLLLGLGR  D  NS+R   P+V  VKKKL VLKFDDILPSLTLGLS         ++V K+ GES    AD+L+QQ SS  SPVSSFS+SSG
Subjt:  MDGDCNTGLLLGLGRASD--NSLRSFDPEV-DVKKKL-VLKFDDILPSLTLGLSAQTCLSVAEIIVQKTAGES----ADDLLQQASSNASPVSSFSNSSG

Query:  FKRDRDVWVAGAGEEKEAEAEMYSERASRKL--------PRKTM------------------------MEDLAGQLNLRPRQVEVWFQNRRARTKLKQTE
        FKR+RD    GAGEE  AEAE++ ER S K+        PRK +                         +DLA QLNLRPRQVEVWFQNRRARTKLKQTE
Subjt:  FKRDRDVWVAGAGEEKEAEAEMYSERASRKL--------PRKTM------------------------MEDLAGQLNLRPRQVEVWFQNRRARTKLKQTE

Query:  MDCELLKKC
        MDCELLKKC
Subjt:  MDCELLKKC

A0A6J1IHG6 homeobox-leucine zipper protein HAT9-like1.8e-4146.76Show/hide
Query:  MDGDCNTGLLLGLGRASD--NSLRSFDPEV-DVKKKL-VLKFDDILPSLTLGLSAQTCLSVAEIIVQKTAGES-ADDLLQQASSNASPVSSFSNSSGFKR
        MD DCN GLLLGLGR  D  NS+R   P+V  VKKKL VLKFDDILPSLTLGLS         ++V+K+ GES ADDL+ Q SS  SP SSFSNSSGFKR
Subjt:  MDGDCNTGLLLGLGRASD--NSLRSFDPEV-DVKKKL-VLKFDDILPSLTLGLSAQTCLSVAEIIVQKTAGES-ADDLLQQASSNASPVSSFSNSSGFKR

Query:  DRDVWVAGAGEEKEAEAEMYSERASRKL--------PRKTM------------------------MEDLAGQLNLRPRQVEVWFQNRRARTKLKQTEMDC
        +RD    GAGEE  AEAE++ ER S K+        PRK +                         +DLA QLNLRPRQVEVWFQNRRARTKLKQTEMDC
Subjt:  DRDVWVAGAGEEKEAEAEMYSERASRKL--------PRKTM------------------------MEDLAGQLNLRPRQVEVWFQNRRARTKLKQTEMDC

Query:  ELLKKCSTGGHSHRVPFLREIHLWRRRWRRFVSDHHLLDWAEASL------------------------------------LKFPFNHPSAAC
        ELLKKC            +E  L   +  +  +    +    A+L                                    LKFPFNHPSAAC
Subjt:  ELLKKCSTGGHSHRVPFLREIHLWRRRWRRFVSDHHLLDWAEASL------------------------------------LKFPFNHPSAAC

A0A6P4AAE9 homeobox-leucine zipper protein HAT22-like7.0e-2545.71Show/hide
Query:  MDGDCNTGLLLGLGRASDNSLRSFDPEVDVKKKLVLKFDDILPSLTLGLSAQTCLSVAEIIVQKT------AGESADDLLQQASSNASPVSSFSNSSGFK
        +D  C  GL LGL   +           + K++L LK+D ++PSLTLG S +     A   ++ +      AGESA DL  QASS  S VSSFSNSS  K
Subjt:  MDGDCNTGLLLGLGRASDNSLRSFDPEVDVKKKLVLKFDDILPSLTLGLSAQTCLSVAEIIVQKT------AGESADDLLQQASSNASPVSSFSNSSGFK

Query:  RDRDVWVAGAGEEKEAEAEMYSERASRKL-----------PRKTM------------------------MEDLAGQLNLRPRQVEVWFQNRRARTKLKQT
        RDRD+    AGEE EAEAE+  ER S ++           PRK +                         ++LA QLNLRPRQVEVWFQNRRARTKLKQT
Subjt:  RDRDVWVAGAGEEKEAEAEMYSERASRKL-----------PRKTM------------------------MEDLAGQLNLRPRQVEVWFQNRRARTKLKQT

Query:  EMDCELLKKC
        E DCELLKKC
Subjt:  EMDCELLKKC

SwissProt top hitse value%identityAlignment
A2X674 Homeobox-leucine zipper protein HOX76.6e-1287.18Show/hide
Query:  DLAGQLNLRPRQVEVWFQNRRARTKLKQTEMDCELLKKC
        DLA +LNLRPRQVEVWFQNRRARTKLKQTE+DCE LK+C
Subjt:  DLAGQLNLRPRQVEVWFQNRRARTKLKQTEMDCELLKKC

A2Z4C4 Homeobox-leucine zipper protein HOX153.9e-1292.11Show/hide
Query:  LAGQLNLRPRQVEVWFQNRRARTKLKQTEMDCELLKKC
        LA QLNLRPRQVEVWFQNRRARTKLKQTE+DCELLK+C
Subjt:  LAGQLNLRPRQVEVWFQNRRARTKLKQTEMDCELLKKC

P46603 Homeobox-leucine zipper protein HAT94.3e-1942.5Show/hide
Query:  DGDCNTGLLLGLGRAS-DNSLRSFDPEVDVKKKLVLKFDDILPSLTLGLSAQTCLSVAEIIVQKTAGESADDLLQQASSNASPVSSFSNSSGFKRDRDVW
        D  CNTGL+LGLG +   N+  S      +++  V K +   PSLTL LS    ++V            AD L +Q SS+ S VSSFS+    KR+RD  
Subjt:  DGDCNTGLLLGLGRAS-DNSLRSFDPEVDVKKKLVLKFDDILPSLTLGLSAQTCLSVAEIIVQKTAGESADDLLQQASSNASPVSSFSNSSGFKRDRDVW

Query:  VAGAGEEKEAEAEM-------YSER----ASRKLPRKTMMED--------------------LAGQLNLRPRQVEVWFQNRRARTKLKQTEMDCELLKKC
            GEE   E EM       Y E     ++RK  R T  +                     LA QLNLRPRQVEVWFQNRRARTKLKQTE+DCE LKKC
Subjt:  VAGAGEEKEAEAEM-------YSER----ASRKLPRKTMMED--------------------LAGQLNLRPRQVEVWFQNRRARTKLKQTEMDCELLKKC

P46604 Homeobox-leucine zipper protein HAT227.3e-1940Show/hide
Query:  MDGDCNTGLLLGLGRAS-----DNSLRSFDPEVDVKKKLVLKFDDILPSLTLGLSAQTCLSVAEIIVQKTAGESADDLLQQASSNASPVSSFSNSSGFKR
        +D  CNTGL+LGLG +      +++++     VD       +F  + PSLTL LS        E    KT   + D + +Q SS+ S +SSFS S   KR
Subjt:  MDGDCNTGLLLGLGRAS-----DNSLRSFDPEVDVKKKLVLKFDDILPSLTLGLSAQTCLSVAEIIVQKTAGESADDLLQQASSNASPVSSFSNSSGFKR

Query:  DRDVWVAGAGEEKEAEAEMYSER----------------ASRKLPRKT-----MMED---------------LAGQLNLRPRQVEVWFQNRRARTKLKQT
        +R++    +G + E EAE  +ER                ++RK  R T     ++ED               LA QLNLRPRQVEVWFQNRRARTKLKQT
Subjt:  DRDVWVAGAGEEKEAEAEMYSER----------------ASRKLPRKT-----MMED---------------LAGQLNLRPRQVEVWFQNRRARTKLKQT

Query:  EMDCELLKKC
        E+DCE LKKC
Subjt:  EMDCELLKKC

Q7G737 Homeobox-leucine zipper protein HOX153.9e-1292.11Show/hide
Query:  LAGQLNLRPRQVEVWFQNRRARTKLKQTEMDCELLKKC
        LA QLNLRPRQVEVWFQNRRARTKLKQTE+DCELLK+C
Subjt:  LAGQLNLRPRQVEVWFQNRRARTKLKQTEMDCELLKKC

Arabidopsis top hitse value%identityAlignment
AT2G22800.1 Homeobox-leucine zipper protein family3.0e-2042.5Show/hide
Query:  DGDCNTGLLLGLGRAS-DNSLRSFDPEVDVKKKLVLKFDDILPSLTLGLSAQTCLSVAEIIVQKTAGESADDLLQQASSNASPVSSFSNSSGFKRDRDVW
        D  CNTGL+LGLG +   N+  S      +++  V K +   PSLTL LS    ++V            AD L +Q SS+ S VSSFS+    KR+RD  
Subjt:  DGDCNTGLLLGLGRAS-DNSLRSFDPEVDVKKKLVLKFDDILPSLTLGLSAQTCLSVAEIIVQKTAGESADDLLQQASSNASPVSSFSNSSGFKRDRDVW

Query:  VAGAGEEKEAEAEM-------YSER----ASRKLPRKTMMED--------------------LAGQLNLRPRQVEVWFQNRRARTKLKQTEMDCELLKKC
            GEE   E EM       Y E     ++RK  R T  +                     LA QLNLRPRQVEVWFQNRRARTKLKQTE+DCE LKKC
Subjt:  VAGAGEEKEAEAEM-------YSER----ASRKLPRKTMMED--------------------LAGQLNLRPRQVEVWFQNRRARTKLKQTEMDCELLKKC

AT2G44910.1 homeobox-leucine zipper protein 41.4e-1244.36Show/hide
Query:  DLLQQASSNASPVSSFSNSSGFKRDRDVWVAGAGEEKEAEAEMYSE---------------RASRKLPR----------KTMMED----------LAGQL
        DL ++A+  +SP S+ S+ SG K  RD+ VA  G+E EAE    S                  SRK  R          +T  E           LA QL
Subjt:  DLLQQASSNASPVSSFSNSSGFKRDRDVWVAGAGEEKEAEAEMYSE---------------RASRKLPR----------KTMMED----------LAGQL

Query:  NLRPRQVEVWFQNRRARTKLKQTEMDCELLKKC
        NLR RQVEVWFQNRRARTKLKQTE+DCE LK+C
Subjt:  NLRPRQVEVWFQNRRARTKLKQTEMDCELLKKC

AT3G60390.1 homeobox-leucine zipper protein 33.4e-1144.66Show/hide
Query:  GESADDLLQQASSNASPVSSFSNSSGFKRDRDVWVAGAGEEKEAEAEMYSERASRKLPRKTMMEDLAGQLNLRPRQVEVWFQNRRARTKLKQTEMDCELL
        G   D+ +++AS +    S   + SG   D         +E+    E   +  S   P++ M   LA QLNLR RQVEVWFQNRRARTKLKQTE+DCE L
Subjt:  GESADDLLQQASSNASPVSSFSNSSGFKRDRDVWVAGAGEEKEAEAEMYSERASRKLPRKTMMEDLAGQLNLRPRQVEVWFQNRRARTKLKQTEMDCELL

Query:  KKC
        K+C
Subjt:  KKC

AT4G37790.1 Homeobox-leucine zipper protein family5.2e-2040Show/hide
Query:  MDGDCNTGLLLGLGRAS-----DNSLRSFDPEVDVKKKLVLKFDDILPSLTLGLSAQTCLSVAEIIVQKTAGESADDLLQQASSNASPVSSFSNSSGFKR
        +D  CNTGL+LGLG +      +++++     VD       +F  + PSLTL LS        E    KT   + D + +Q SS+ S +SSFS S   KR
Subjt:  MDGDCNTGLLLGLGRAS-----DNSLRSFDPEVDVKKKLVLKFDDILPSLTLGLSAQTCLSVAEIIVQKTAGESADDLLQQASSNASPVSSFSNSSGFKR

Query:  DRDVWVAGAGEEKEAEAEMYSER----------------ASRKLPRKT-----MMED---------------LAGQLNLRPRQVEVWFQNRRARTKLKQT
        +R++    +G + E EAE  +ER                ++RK  R T     ++ED               LA QLNLRPRQVEVWFQNRRARTKLKQT
Subjt:  DRDVWVAGAGEEKEAEAEMYSER----------------ASRKLPRKT-----MMED---------------LAGQLNLRPRQVEVWFQNRRARTKLKQT

Query:  EMDCELLKKC
        E+DCE LKKC
Subjt:  EMDCELLKKC

AT5G06710.1 homeobox from Arabidopsis thaliana1.0e-1289.47Show/hide
Query:  LAGQLNLRPRQVEVWFQNRRARTKLKQTEMDCELLKKC
        LA QLNLRPRQVEVWFQNRRARTKLKQTE+DCE LK+C
Subjt:  LAGQLNLRPRQVEVWFQNRRARTKLKQTEMDCELLKKC


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGACGGCGATTGCAATACCGGCCTTCTTCTTGGGCTCGGCCGGGCTTCGGATAATTCTCTGCGATCTTTCGATCCCGAAGTGGATGTGAAGAAGAAGCTGGTTTTGAA
GTTTGATGACATTTTGCCTTCTTTGACGCTGGGGTTATCGGCCCAGACCTGCCTCTCGGTGGCTGAGATTATTGTCCAGAAGACCGCCGGCGAATCCGCCGACGACTTGC
TGCAGCAGGCTTCTTCTAACGCCAGTCCGGTGTCGTCGTTTTCCAACTCGTCGGGATTCAAGAGGGACAGAGACGTCTGGGTCGCTGGCGCCGGTGAAGAGAAGGAGGCT
GAGGCGGAGATGTATTCAGAGAGAGCCTCTCGAAAGTTGCCGAGGAAGACGATGATGGAAGATTTGGCTGGACAGTTAAACCTGAGGCCAAGACAAGTGGAAGTATGGTT
TCAAAACAGAAGAGCCAGAACCAAGCTGAAGCAGACGGAGATGGACTGTGAATTACTGAAGAAATGCTCTACAGGCGGCCACTCTCACCGTGTGCCCTTCCTGCGAGAGA
TCCATTTGTGGCGGCGGCGGTGGCGGCGATTCGTCTCCGACCACCACCTTCTCGATTGGGCCGAAGCCTCACTTCTCAAATTCCCTTTTAACCACCCATCGGCGGCTTGT
AACCACTGTACTTCCTCAGTAGGTGGTTCTTCAATTCTCTATCTTCAGCACGGGGTCGATCTCGGGCAGTTCAGTTTCTCCGCCGCTGTTATCCTGATCCTCCTCGGACG
ATCCGATCCTTCACACTTCTCATCTGTAGATTTAGAAGAGTTTACCGGAGTTGAAGATCCGAACGGGGCCGTTGCAGAGAGAACCGAGCTGAGATTCGATCCTGCGCATG
AAATCCATGGCTTCATGTATCGGCCTTCCTATAGCCTATGGTTGTCACACCATGAATTGATCGAGTTCTGGATCCTTCGTGGTCTCTCCCGATACCATGGAAGATCGCTG
CCGGGCTTCGAATTCTTGCCGTGCCGCCACGAGCCGCTCCACCACTTGCGGCGGAGCACCCACCTTTGTTTTTTTGCCACAAATCACAGCAGAAGAAAAGGCTTCGAGGA
GGCTGGAGTATTGAGGGTGAGCGATGATTTTGGCCTTAATGGCTTCAACGTCGACGGCGTTGTCGTTGTTTCCATGCGGCGACCGACATGTCGGTTGTCCTCCTCCTCCT
CCTCTCATCAGAGGTTACCTGAAGAAGAATTGGCCGCAAGAACTGGCGAGGCGTACAAGAAATTGCCTCTGGAGCTGTGTTCTCGTTTAAGTGATTATATTCCTCCATTT
TTGGAGCAAACCCAGTTGAAAACCAGTGAAGAAAAACAACCCAGCGAGGGCATTTTGTTTCATATATATATGGGCACTGAAGATATGAATAATAAGAAAAGGGAAAATTG
GGAAGCTTTCAGAAGCAGATTTTACGGAGGCCTTGAAGCGAAGCTGCTGCATGGAAGAGGCATGCTAGAGGGTGGGGAAGAAGAAGAAGAGAGGTCGGATTTCACGTGGA
GAAGGAAAAGGGAAAACGAGAGGGTGCATGCAAGGCCAAAGCGAAAAGGCATTGCAGTTGCAGCAGGGGAGAGAAGGGCAAAGGCAAATGGGTGGGCGAGTTTGAATTTG
GGCTGGAACACAGCTGAGCCTGAGGGAGCGCCCCCACTAGGAATGGGGTGTGGGGCTGATGCTGATGCTGACACCGCCCCCCGCACTACGAGCGGGGCTGGCGAGTGTGT
TTAG
mRNA sequenceShow/hide mRNA sequence
ATGGACGGCGATTGCAATACCGGCCTTCTTCTTGGGCTCGGCCGGGCTTCGGATAATTCTCTGCGATCTTTCGATCCCGAAGTGGATGTGAAGAAGAAGCTGGTTTTGAA
GTTTGATGACATTTTGCCTTCTTTGACGCTGGGGTTATCGGCCCAGACCTGCCTCTCGGTGGCTGAGATTATTGTCCAGAAGACCGCCGGCGAATCCGCCGACGACTTGC
TGCAGCAGGCTTCTTCTAACGCCAGTCCGGTGTCGTCGTTTTCCAACTCGTCGGGATTCAAGAGGGACAGAGACGTCTGGGTCGCTGGCGCCGGTGAAGAGAAGGAGGCT
GAGGCGGAGATGTATTCAGAGAGAGCCTCTCGAAAGTTGCCGAGGAAGACGATGATGGAAGATTTGGCTGGACAGTTAAACCTGAGGCCAAGACAAGTGGAAGTATGGTT
TCAAAACAGAAGAGCCAGAACCAAGCTGAAGCAGACGGAGATGGACTGTGAATTACTGAAGAAATGCTCTACAGGCGGCCACTCTCACCGTGTGCCCTTCCTGCGAGAGA
TCCATTTGTGGCGGCGGCGGTGGCGGCGATTCGTCTCCGACCACCACCTTCTCGATTGGGCCGAAGCCTCACTTCTCAAATTCCCTTTTAACCACCCATCGGCGGCTTGT
AACCACTGTACTTCCTCAGTAGGTGGTTCTTCAATTCTCTATCTTCAGCACGGGGTCGATCTCGGGCAGTTCAGTTTCTCCGCCGCTGTTATCCTGATCCTCCTCGGACG
ATCCGATCCTTCACACTTCTCATCTGTAGATTTAGAAGAGTTTACCGGAGTTGAAGATCCGAACGGGGCCGTTGCAGAGAGAACCGAGCTGAGATTCGATCCTGCGCATG
AAATCCATGGCTTCATGTATCGGCCTTCCTATAGCCTATGGTTGTCACACCATGAATTGATCGAGTTCTGGATCCTTCGTGGTCTCTCCCGATACCATGGAAGATCGCTG
CCGGGCTTCGAATTCTTGCCGTGCCGCCACGAGCCGCTCCACCACTTGCGGCGGAGCACCCACCTTTGTTTTTTTGCCACAAATCACAGCAGAAGAAAAGGCTTCGAGGA
GGCTGGAGTATTGAGGGTGAGCGATGATTTTGGCCTTAATGGCTTCAACGTCGACGGCGTTGTCGTTGTTTCCATGCGGCGACCGACATGTCGGTTGTCCTCCTCCTCCT
CCTCTCATCAGAGGTTACCTGAAGAAGAATTGGCCGCAAGAACTGGCGAGGCGTACAAGAAATTGCCTCTGGAGCTGTGTTCTCGTTTAAGTGATTATATTCCTCCATTT
TTGGAGCAAACCCAGTTGAAAACCAGTGAAGAAAAACAACCCAGCGAGGGCATTTTGTTTCATATATATATGGGCACTGAAGATATGAATAATAAGAAAAGGGAAAATTG
GGAAGCTTTCAGAAGCAGATTTTACGGAGGCCTTGAAGCGAAGCTGCTGCATGGAAGAGGCATGCTAGAGGGTGGGGAAGAAGAAGAAGAGAGGTCGGATTTCACGTGGA
GAAGGAAAAGGGAAAACGAGAGGGTGCATGCAAGGCCAAAGCGAAAAGGCATTGCAGTTGCAGCAGGGGAGAGAAGGGCAAAGGCAAATGGGTGGGCGAGTTTGAATTTG
GGCTGGAACACAGCTGAGCCTGAGGGAGCGCCCCCACTAGGAATGGGGTGTGGGGCTGATGCTGATGCTGACACCGCCCCCCGCACTACGAGCGGGGCTGGCGAGTGTGT
TTAG
Protein sequenceShow/hide protein sequence
MDGDCNTGLLLGLGRASDNSLRSFDPEVDVKKKLVLKFDDILPSLTLGLSAQTCLSVAEIIVQKTAGESADDLLQQASSNASPVSSFSNSSGFKRDRDVWVAGAGEEKEA
EAEMYSERASRKLPRKTMMEDLAGQLNLRPRQVEVWFQNRRARTKLKQTEMDCELLKKCSTGGHSHRVPFLREIHLWRRRWRRFVSDHHLLDWAEASLLKFPFNHPSAAC
NHCTSSVGGSSILYLQHGVDLGQFSFSAAVILILLGRSDPSHFSSVDLEEFTGVEDPNGAVAERTELRFDPAHEIHGFMYRPSYSLWLSHHELIEFWILRGLSRYHGRSL
PGFEFLPCRHEPLHHLRRSTHLCFFATNHSRRKGFEEAGVLRVSDDFGLNGFNVDGVVVVSMRRPTCRLSSSSSSHQRLPEEELAARTGEAYKKLPLELCSRLSDYIPPF
LEQTQLKTSEEKQPSEGILFHIYMGTEDMNNKKRENWEAFRSRFYGGLEAKLLHGRGMLEGGEEEEERSDFTWRRKRENERVHARPKRKGIAVAAGERRAKANGWASLNL
GWNTAEPEGAPPLGMGCGADADADTAPRTTSGAGECV