; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CsGy4G013980 (gene) of Cucumber (Gy14) v2.1 genome

Gene IDCsGy4G013980
OrganismCucumis sativus L. var. sativus cv. Gy14 (Cucumber (Gy14) v2.1)
DescriptionSANTA domain-containing protein
Genome locationGy14Chr4:18882415..18887619
RNA-Seq ExpressionCsGy4G013980
SyntenyCsGy4G013980
Gene Ontology termsNA
InterPro domainsIPR015216 - SANT associated
IPR039110 - KNL2-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0032623.1 protein EMBRYO DEFECTIVE 1674-like isoform X1 [Cucumis melo var. makuwa]5.93e-15976.9Show/hide
Query:  MVSSPEFHGTTNPTSGGASVSYFQRTVRLLDWWLITAGNDSNGKTLAVAGLTSKPGQPVRVFSSAPIVKRHDVFTLETADGICVVLKGFLNKLRATDNGF
        M SSPEF GTT P S GASVS+FQ+TVRLLDWWL  A ++SNGKTLAVAGLTS PGQPVRVFSSAPIVKRHDVFTLETADGICVVLKGF+NKLR TDNGF
Subjt:  MVSSPEFHGTTNPTSGGASVSYFQRTVRLLDWWLITAGNDSNGKTLAVAGLTSKPGQPVRVFSSAPIVKRHDVFTLETADGICVVLKGFLNKLRATDNGF

Query:  TPQVFKHFVFGFPPNWETHAANCFEAGASNSTAAGGNASDTDNLSCRSRSVT---------------------GNGLDHGDSMAEETMQTTSATETPAPF
        TPQVFKHF+FGFPPNWETHAA CFE GAS+STA GGN S  DNL C S+SVT                      NGLDHGDSMAEETMQTTSATETP PF
Subjt:  TPQVFKHFVFGFPPNWETHAANCFEAGASNSTAAGGNASDTDNLSCRSRSVT---------------------GNGLDHGDSMAEETMQTTSATETPAPF

Query:  TGADVQDEEVENKGKKERESRKKVTKKIISDSPGSGVSKNTRGRKEKECLVSPECRSYGRSRSGRVLLPTMEFWRNQLPVYDSDRKLRGIREEQQDKKTT
        TGA+VQDEEV+NKGKKE ESRKKV KKII  SPGS VS NTRGR  KECL+SPECRSYGRSRSGRVLLPTMEFWRNQLPVYDSDRKLRGIREEQQDKK  
Subjt:  TGADVQDEEVENKGKKERESRKKVTKKIISDSPGSGVSKNTRGRKEKECLVSPECRSYGRSRSGRVLLPTMEFWRNQLPVYDSDRKLRGIREEQQDKKTT

Query:  LTTKRKEEHRSKKAKR
           + ++    KKAKR
Subjt:  LTTKRKEEHRSKKAKR

XP_022949907.1 uncharacterized protein LOC111453163 [Cucurbita moschata]5.40e-13973.57Show/hide
Query:  MVSSPEFHGTTNPTSGG-----ASVSYFQRTVRLLDWWLITAGNDSNGKTLAVAGLTSKPGQPVRVFSSAPIVKRHDVFTLETADGICVVLKGFLNKLRA
        M S+PE H  T P S       AS+SYFQ+TV LLDWWLI A NDSNGK+LAVAGLTS PGQPVRVFSSAPIVKR DVFTLETAD ICVVLKGF+NKLR 
Subjt:  MVSSPEFHGTTNPTSGG-----ASVSYFQRTVRLLDWWLITAGNDSNGKTLAVAGLTSKPGQPVRVFSSAPIVKRHDVFTLETADGICVVLKGFLNKLRA

Query:  TDNGFTPQVFKHFVFGFPPNWETHAANCFEAGASNSTAAGGNASDTDNLSCRSRSVTGNGLDHGDSMAEETMQTTSATETPAPFTGADVQDEEVENKGKK
        TD+GFT +VFKHFVFGFPPNWET+AANCF+  A ++TAAGGN SDTD L CRS+S   NGLD GDSMA++ MQTT+ATE P PFTG+D+Q+E+VENK +K
Subjt:  TDNGFTPQVFKHFVFGFPPNWETHAANCFEAGASNSTAAGGNASDTDNLSCRSRSVTGNGLDHGDSMAEETMQTTSATETPAPFTGADVQDEEVENKGKK

Query:  ERESRKKVTKKIISDSPGSGVSKNTRGRKEKECLVSPECRSYGRSRSGRVLLPTMEFWRNQLPVYDSDRKLRGIREEQQD
        ERESR+KV KKI+ DSPGSGV    R R+EK C++SPEC SYGRSRSGR+LLPTMEFWRNQLPVYDSDRKLRGI+E+  D
Subjt:  ERESRKKVTKKIISDSPGSGVSKNTRGRKEKECLVSPECRSYGRSRSGRVLLPTMEFWRNQLPVYDSDRKLRGIREEQQD

XP_023544371.1 mis18-binding protein 1 [Cucurbita pepo subsp. pepo]1.89e-13974.64Show/hide
Query:  MVSSPEFHGTTNPTSGG-----ASVSYFQRTVRLLDWWLITAGNDSNGKTLAVAGLTSKPGQPVRVFSSAPIVKRHDVFTLETADGICVVLKGFLNKLRA
        M S+PE H  T P S       ASVSYFQ+TV LLDWWLI A NDSNGK+LAVAGLTS PGQPVRVFSSAPIVKR DVFTLETADGICVVLKGF+NKLR 
Subjt:  MVSSPEFHGTTNPTSGG-----ASVSYFQRTVRLLDWWLITAGNDSNGKTLAVAGLTSKPGQPVRVFSSAPIVKRHDVFTLETADGICVVLKGFLNKLRA

Query:  TDNGFTPQVFKHFVFGFPPNWETHAANCFEAGASNSTAAGGNASDTDNLSCRSRSVTGNGLDHGDSMAEETMQTTSATETPAPFTGADVQDEEVENKGKK
        TD+GFT +VFKHFVFGFPPNWET+AANCF+  A ++TAAGGN SDTD+L CRS+S   NGLD GDSMA++ MQTTSA E P PFTG+D+Q+E+VENK +K
Subjt:  TDNGFTPQVFKHFVFGFPPNWETHAANCFEAGASNSTAAGGNASDTDNLSCRSRSVTGNGLDHGDSMAEETMQTTSATETPAPFTGADVQDEEVENKGKK

Query:  ERESRKKVTKKIISDSPGSGVSKNTRGRKEKECLVSPECRSYGRSRSGRVLLPTMEFWRNQLPVYDSDRKLRGIRE
        E+ESR+KV KKI+ DSPGSGV    R R+EK+C++SPEC SYGRSRSGR+LLPTMEFWRNQLPVYDSDRKLRGI+E
Subjt:  ERESRKKVTKKIISDSPGSGVSKNTRGRKEKECLVSPECRSYGRSRSGRVLLPTMEFWRNQLPVYDSDRKLRGIRE

XP_031740231.1 kinetochore-associated protein KNL-2 homolog [Cucumis sativus]2.61e-208100Show/hide
Query:  MVSSPEFHGTTNPTSGGASVSYFQRTVRLLDWWLITAGNDSNGKTLAVAGLTSKPGQPVRVFSSAPIVKRHDVFTLETADGICVVLKGFLNKLRATDNGF
        MVSSPEFHGTTNPTSGGASVSYFQRTVRLLDWWLITAGNDSNGKTLAVAGLTSKPGQPVRVFSSAPIVKRHDVFTLETADGICVVLKGFLNKLRATDNGF
Subjt:  MVSSPEFHGTTNPTSGGASVSYFQRTVRLLDWWLITAGNDSNGKTLAVAGLTSKPGQPVRVFSSAPIVKRHDVFTLETADGICVVLKGFLNKLRATDNGF

Query:  TPQVFKHFVFGFPPNWETHAANCFEAGASNSTAAGGNASDTDNLSCRSRSVTGNGLDHGDSMAEETMQTTSATETPAPFTGADVQDEEVENKGKKERESR
        TPQVFKHFVFGFPPNWETHAANCFEAGASNSTAAGGNASDTDNLSCRSRSVTGNGLDHGDSMAEETMQTTSATETPAPFTGADVQDEEVENKGKKERESR
Subjt:  TPQVFKHFVFGFPPNWETHAANCFEAGASNSTAAGGNASDTDNLSCRSRSVTGNGLDHGDSMAEETMQTTSATETPAPFTGADVQDEEVENKGKKERESR

Query:  KKVTKKIISDSPGSGVSKNTRGRKEKECLVSPECRSYGRSRSGRVLLPTMEFWRNQLPVYDSDRKLRGIREEQQDKKTTLTTKRKEEHRSKKAKR
        KKVTKKIISDSPGSGVSKNTRGRKEKECLVSPECRSYGRSRSGRVLLPTMEFWRNQLPVYDSDRKLRGIREEQQDKKTTLTTKRKEEHRSKKAKR
Subjt:  KKVTKKIISDSPGSGVSKNTRGRKEKECLVSPECRSYGRSRSGRVLLPTMEFWRNQLPVYDSDRKLRGIREEQQDKKTTLTTKRKEEHRSKKAKR

XP_038882906.1 kinetochore-associated protein KNL-2 homolog isoform X3 [Benincasa hispida]2.13e-14674.41Show/hide
Query:  MVSSPEFHGTTNPTSGGASVSYFQRTVRLLDWWLITAGNDSNGKTLAVAGLTSKPGQPVRVFSSAPIVKRHDVFTLETADGICVVLKGFLNKLRATDNGF
        M SSPEFH  T P SGGASVSYFQ+TVRLLDWWLI A ND NGKTLAVAGLTS PGQPVR+FSSAPIVKR+DVFTLETADGICV+LKGF+NKLR  DNGF
Subjt:  MVSSPEFHGTTNPTSGGASVSYFQRTVRLLDWWLITAGNDSNGKTLAVAGLTSKPGQPVRVFSSAPIVKRHDVFTLETADGICVVLKGFLNKLRATDNGF

Query:  TPQVFKHFVFGFPPNWETHAANCFEAGASNSTAAGGNASDTDNLSCRSRSVT---------------------GNGLDHGDSMAEETMQTTSATETPAPF
        TPQVFKHFVFGFPPNWE HAANCFE  AS S AAGGN SDTDN  C S+S                        +G DHGDS+AE+ MQ TSAT+ PAPF
Subjt:  TPQVFKHFVFGFPPNWETHAANCFEAGASNSTAAGGNASDTDNLSCRSRSVT---------------------GNGLDHGDSMAEETMQTTSATETPAPF

Query:  TGADVQDEEVENKGKKERESRKKVTKKIISDSPGS-GVSKNTRGRKEKECLVSPECRSYGRSRSGRVLLPTMEFWRNQLPVYDSDRKLRGIREEQQD
        T AD+Q+E VENKG KE ESRKKV KKII DSPGS G+S NTRGRKEK+C++SPECRSYGRSRSGR+LLPTMEFWRNQLPVYDSDRKLR I+EE+QD
Subjt:  TGADVQDEEVENKGKKERESRKKVTKKIISDSPGS-GVSKNTRGRKEKECLVSPECRSYGRSRSGRVLLPTMEFWRNQLPVYDSDRKLRGIREEQQD

TrEMBL top hitse value%identityAlignment
A0A0A0KX65 SANTA domain-containing protein1.26e-208100Show/hide
Query:  MVSSPEFHGTTNPTSGGASVSYFQRTVRLLDWWLITAGNDSNGKTLAVAGLTSKPGQPVRVFSSAPIVKRHDVFTLETADGICVVLKGFLNKLRATDNGF
        MVSSPEFHGTTNPTSGGASVSYFQRTVRLLDWWLITAGNDSNGKTLAVAGLTSKPGQPVRVFSSAPIVKRHDVFTLETADGICVVLKGFLNKLRATDNGF
Subjt:  MVSSPEFHGTTNPTSGGASVSYFQRTVRLLDWWLITAGNDSNGKTLAVAGLTSKPGQPVRVFSSAPIVKRHDVFTLETADGICVVLKGFLNKLRATDNGF

Query:  TPQVFKHFVFGFPPNWETHAANCFEAGASNSTAAGGNASDTDNLSCRSRSVTGNGLDHGDSMAEETMQTTSATETPAPFTGADVQDEEVENKGKKERESR
        TPQVFKHFVFGFPPNWETHAANCFEAGASNSTAAGGNASDTDNLSCRSRSVTGNGLDHGDSMAEETMQTTSATETPAPFTGADVQDEEVENKGKKERESR
Subjt:  TPQVFKHFVFGFPPNWETHAANCFEAGASNSTAAGGNASDTDNLSCRSRSVTGNGLDHGDSMAEETMQTTSATETPAPFTGADVQDEEVENKGKKERESR

Query:  KKVTKKIISDSPGSGVSKNTRGRKEKECLVSPECRSYGRSRSGRVLLPTMEFWRNQLPVYDSDRKLRGIREEQQDKKTTLTTKRKEEHRSKKAKR
        KKVTKKIISDSPGSGVSKNTRGRKEKECLVSPECRSYGRSRSGRVLLPTMEFWRNQLPVYDSDRKLRGIREEQQDKKTTLTTKRKEEHRSKKAKR
Subjt:  KKVTKKIISDSPGSGVSKNTRGRKEKECLVSPECRSYGRSRSGRVLLPTMEFWRNQLPVYDSDRKLRGIREEQQDKKTTLTTKRKEEHRSKKAKR

A0A5D3DIU6 Protein EMBRYO DEFECTIVE 1674-like isoform X12.87e-15976.9Show/hide
Query:  MVSSPEFHGTTNPTSGGASVSYFQRTVRLLDWWLITAGNDSNGKTLAVAGLTSKPGQPVRVFSSAPIVKRHDVFTLETADGICVVLKGFLNKLRATDNGF
        M SSPEF GTT P S GASVS+FQ+TVRLLDWWL  A ++SNGKTLAVAGLTS PGQPVRVFSSAPIVKRHDVFTLETADGICVVLKGF+NKLR TDNGF
Subjt:  MVSSPEFHGTTNPTSGGASVSYFQRTVRLLDWWLITAGNDSNGKTLAVAGLTSKPGQPVRVFSSAPIVKRHDVFTLETADGICVVLKGFLNKLRATDNGF

Query:  TPQVFKHFVFGFPPNWETHAANCFEAGASNSTAAGGNASDTDNLSCRSRSVT---------------------GNGLDHGDSMAEETMQTTSATETPAPF
        TPQVFKHF+FGFPPNWETHAA CFE GAS+STA GGN S  DNL C S+SVT                      NGLDHGDSMAEETMQTTSATETP PF
Subjt:  TPQVFKHFVFGFPPNWETHAANCFEAGASNSTAAGGNASDTDNLSCRSRSVT---------------------GNGLDHGDSMAEETMQTTSATETPAPF

Query:  TGADVQDEEVENKGKKERESRKKVTKKIISDSPGSGVSKNTRGRKEKECLVSPECRSYGRSRSGRVLLPTMEFWRNQLPVYDSDRKLRGIREEQQDKKTT
        TGA+VQDEEV+NKGKKE ESRKKV KKII  SPGS VS NTRGR  KECL+SPECRSYGRSRSGRVLLPTMEFWRNQLPVYDSDRKLRGIREEQQDKK  
Subjt:  TGADVQDEEVENKGKKERESRKKVTKKIISDSPGSGVSKNTRGRKEKECLVSPECRSYGRSRSGRVLLPTMEFWRNQLPVYDSDRKLRGIREEQQDKKTT

Query:  LTTKRKEEHRSKKAKR
           + ++    KKAKR
Subjt:  LTTKRKEEHRSKKAKR

A0A6J1CQY8 uncharacterized protein LOC111013765 isoform X47.15e-11662.37Show/hide
Query:  MVSSPEFHGTTNP-----TSGGASVSYFQRTVRLLDWWLITAGNDSNGKTLAVAGLTSKPGQPVRVFSSAPIVKRHDVFTLETADGICVVLKGFLNKLRA
        M S+P+ H TT P     T GGA+ SYFQ+TV L DWWLI A ND N KTLAVAGLTS+P QPVRVFSSAPIVKR+DVFTLETADGICVV+KGF+NKLR 
Subjt:  MVSSPEFHGTTNP-----TSGGASVSYFQRTVRLLDWWLITAGNDSNGKTLAVAGLTSKPGQPVRVFSSAPIVKRHDVFTLETADGICVVLKGFLNKLRA

Query:  TDNGFTPQVFKHFVFGFPPNWETHAANCFEAGASNSTAAGGNASDTDNLSCRSRSVTGNG----------LDHGDSMAEETMQTTSATETPAPF------
        TDNGFT +VFKHFVFGFPPNWET+A N FE  A +S +A GN SDTDNL C+S+    NG          LDHGD+MA++ MQ  SAT+T  P       
Subjt:  TDNGFTPQVFKHFVFGFPPNWETHAANCFEAGASNSTAAGGNASDTDNLSCRSRSVTGNG----------LDHGDSMAEETMQTTSATETPAPF------

Query:  ---TGADVQDEEVENKGKKERESRKKVTKKIISDSPGSGVSKNTRGRKEKECLVSPECRSYGRSRSGRVLLPTMEFWRNQLPVYDSDRKLRGIRE
           TGA++Q EE+ENK K E++  K+  +KI+  SPGSG+ +  R R+EK C+ SPE  SYGRSRSGR+LLP MEFWRNQLPVYD+DR++RGI+E
Subjt:  ---TGADVQDEEVENKGKKERESRKKVTKKIISDSPGSGVSKNTRGRKEKECLVSPECRSYGRSRSGRVLLPTMEFWRNQLPVYDSDRKLRGIRE

A0A6J1GDE0 uncharacterized protein LOC1114531632.61e-13973.57Show/hide
Query:  MVSSPEFHGTTNPTSGG-----ASVSYFQRTVRLLDWWLITAGNDSNGKTLAVAGLTSKPGQPVRVFSSAPIVKRHDVFTLETADGICVVLKGFLNKLRA
        M S+PE H  T P S       AS+SYFQ+TV LLDWWLI A NDSNGK+LAVAGLTS PGQPVRVFSSAPIVKR DVFTLETAD ICVVLKGF+NKLR 
Subjt:  MVSSPEFHGTTNPTSGG-----ASVSYFQRTVRLLDWWLITAGNDSNGKTLAVAGLTSKPGQPVRVFSSAPIVKRHDVFTLETADGICVVLKGFLNKLRA

Query:  TDNGFTPQVFKHFVFGFPPNWETHAANCFEAGASNSTAAGGNASDTDNLSCRSRSVTGNGLDHGDSMAEETMQTTSATETPAPFTGADVQDEEVENKGKK
        TD+GFT +VFKHFVFGFPPNWET+AANCF+  A ++TAAGGN SDTD L CRS+S   NGLD GDSMA++ MQTT+ATE P PFTG+D+Q+E+VENK +K
Subjt:  TDNGFTPQVFKHFVFGFPPNWETHAANCFEAGASNSTAAGGNASDTDNLSCRSRSVTGNGLDHGDSMAEETMQTTSATETPAPFTGADVQDEEVENKGKK

Query:  ERESRKKVTKKIISDSPGSGVSKNTRGRKEKECLVSPECRSYGRSRSGRVLLPTMEFWRNQLPVYDSDRKLRGIREEQQD
        ERESR+KV KKI+ DSPGSGV    R R+EK C++SPEC SYGRSRSGR+LLPTMEFWRNQLPVYDSDRKLRGI+E+  D
Subjt:  ERESRKKVTKKIISDSPGSGVSKNTRGRKEKECLVSPECRSYGRSRSGRVLLPTMEFWRNQLPVYDSDRKLRGIREEQQD

A0A6J1IT09 uncharacterized protein LOC1114781483.65e-13772.5Show/hide
Query:  MVSSPEFHGTTNPTSGG-----ASVSYFQRTVRLLDWWLITAGNDSNGKTLAVAGLTSKPGQPVRVFSSAPIVKRHDVFTLETADGICVVLKGFLNKLRA
        M S+PE H  T P S       A++SYFQ+TV LLDWWLI A NDSN K+LAVAGLTS PGQPVRVFSSAPIVKR DVFTLETADGICVVLKGF+NKLR 
Subjt:  MVSSPEFHGTTNPTSGG-----ASVSYFQRTVRLLDWWLITAGNDSNGKTLAVAGLTSKPGQPVRVFSSAPIVKRHDVFTLETADGICVVLKGFLNKLRA

Query:  TDNGFTPQVFKHFVFGFPPNWETHAANCFEAGASNSTAAGGNASDTDNLSCRSRSVTGNGLDHGDSMAEETMQTTSATETPAPFTGADVQDEEVENKGKK
        TDNGFT +VFKHFVFGFPPNWET+AANCF+  A ++TAA GN SDTD+L CRS+S   NGLD GDSMA++ MQTTSA E P PFTG+D+Q+E+VENK +K
Subjt:  TDNGFTPQVFKHFVFGFPPNWETHAANCFEAGASNSTAAGGNASDTDNLSCRSRSVTGNGLDHGDSMAEETMQTTSATETPAPFTGADVQDEEVENKGKK

Query:  ERESRKKVTKKIISDSPGSGVSKNTRGRKEKECLVSPECRSYGRSRSGRVLLPTMEFWRNQLPVYDSDRKLRGIREEQQD
        E+ESR+K  KKI+ DSPGSGV    R R+EK+C++SPEC SYGRSRSGR+LLPTMEFWRNQLPVYDSDRKLRGI+E+  D
Subjt:  ERESRKKVTKKIISDSPGSGVSKNTRGRKEKECLVSPECRSYGRSRSGRVLLPTMEFWRNQLPVYDSDRKLRGIREEQQD

SwissProt top hitse value%identityAlignment
F4KCE9 Kinetochore-associated protein KNL-2 homolog4.9e-2450.93Show/hide
Query:  SVSYFQRTVRLLDWWLITAGNDSNGKTLAVAGL-TSKPGQPVRVFSSAPIVKRHDVFTLETADGICVVLKGFLNKLRATDNGFTPQVFKHFVFGFPPNWE
        S S FQ+TV L DWWLI    +  GK   VAG   S   + +RVF+S+PI K  DVFTL  +DGI + L+GFLNK R   NGF P++ + F+FGFPP WE
Subjt:  SVSYFQRTVRLLDWWLITAGNDSNGKTLAVAGL-TSKPGQPVRVFSSAPIVKRHDVFTLETADGICVVLKGFLNKLRATDNGFTPQVFKHFVFGFPPNWE

Query:  THAANCFE
            +CFE
Subjt:  THAANCFE

F4KCE9 Kinetochore-associated protein KNL-2 homolog1.1e-0428.79Show/hide
Query:  AEETMQTTSATETPAPFTGADVQDEEVENKGKKERESRKKVTKKIISDSPGSGVSKNTRGRKEKECLVSPECRSYGRSRSGRVLLPTMEFWRNQLPVYDS
        +E+T+Q+ S    P     ++ ++ E       E  S +K+ +KI  D     V+   + +++K    S +     RSRSGRVL+ ++EFWRNQ+PVYD 
Subjt:  AEETMQTTSATETPAPFTGADVQDEEVENKGKKERESRKKVTKKIISDSPGSGVSKNTRGRKEKECLVSPECRSYGRSRSGRVLLPTMEFWRNQLPVYDS

Query:  DRKLRGIREEQQDKKTTLTTKRKEEHRSKKAK
        DR L  +++  +        K  +  + +  K
Subjt:  DRKLRGIREEQQDKKTTLTTKRKEEHRSKKAK

Q8RWD7 Protein EMBRYO DEFECTIVE 16741.6e-1441.05Show/hide
Query:  RTVRLLDWWLITAGNDSNGKTLAVAGLTSKPGQPVRVFSSAPIVKRHDVFTLETADGICVVLKGFLNKLRATDNGFTPQVFKHFVFGFPPNWETH
        ++V L DWWL        GK L + G  S     VR+FSS  I KRH+  TLE  DGI + + GF+N+ R  +NG + +V   F  GFP +WE +
Subjt:  RTVRLLDWWLITAGNDSNGKTLAVAGLTSKPGQPVRVFSSAPIVKRHDVFTLETADGICVVLKGFLNKLRATDNGFTPQVFKHFVFGFPPNWETH

Arabidopsis top hitse value%identityAlignment
AT1G58210.1 kinase interacting family protein1.1e-1541.05Show/hide
Query:  RTVRLLDWWLITAGNDSNGKTLAVAGLTSKPGQPVRVFSSAPIVKRHDVFTLETADGICVVLKGFLNKLRATDNGFTPQVFKHFVFGFPPNWETH
        ++V L DWWL        GK L + G  S     VR+FSS  I KRH+  TLE  DGI + + GF+N+ R  +NG + +V   F  GFP +WE +
Subjt:  RTVRLLDWWLITAGNDSNGKTLAVAGLTSKPGQPVRVFSSAPIVKRHDVFTLETADGICVVLKGFLNKLRATDNGFTPQVFKHFVFGFPPNWETH

AT5G02520.1 CONTAINS InterPro DOMAIN/s: SANT associated (InterPro:IPR015216)3.5e-2550.93Show/hide
Query:  SVSYFQRTVRLLDWWLITAGNDSNGKTLAVAGL-TSKPGQPVRVFSSAPIVKRHDVFTLETADGICVVLKGFLNKLRATDNGFTPQVFKHFVFGFPPNWE
        S S FQ+TV L DWWLI    +  GK   VAG   S   + +RVF+S+PI K  DVFTL  +DGI + L+GFLNK R   NGF P++ + F+FGFPP WE
Subjt:  SVSYFQRTVRLLDWWLITAGNDSNGKTLAVAGL-TSKPGQPVRVFSSAPIVKRHDVFTLETADGICVVLKGFLNKLRATDNGFTPQVFKHFVFGFPPNWE

Query:  THAANCFE
            +CFE
Subjt:  THAANCFE

AT5G02520.1 CONTAINS InterPro DOMAIN/s: SANT associated (InterPro:IPR015216)8.1e-0628.79Show/hide
Query:  AEETMQTTSATETPAPFTGADVQDEEVENKGKKERESRKKVTKKIISDSPGSGVSKNTRGRKEKECLVSPECRSYGRSRSGRVLLPTMEFWRNQLPVYDS
        +E+T+Q+ S    P     ++ ++ E       E  S +K+ +KI  D     V+   + +++K    S +     RSRSGRVL+ ++EFWRNQ+PVYD 
Subjt:  AEETMQTTSATETPAPFTGADVQDEEVENKGKKERESRKKVTKKIISDSPGSGVSKNTRGRKEKECLVSPECRSYGRSRSGRVLLPTMEFWRNQLPVYDS

Query:  DRKLRGIREEQQDKKTTLTTKRKEEHRSKKAK
        DR L  +++  +        K  +  + +  K
Subjt:  DRKLRGIREEQQDKKTTLTTKRKEEHRSKKAK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTTTCTTCTCCAGAATTCCATGGAACAACAAACCCCACCAGCGGCGGCGCTTCAGTTTCTTACTTCCAGAGAACAGTTCGTTTGCTCGATTGGTGGTTAATCACAGC
TGGAAATGACTCCAATGGGAAAACCCTAGCCGTCGCCGGCTTAACATCTAAACCGGGACAACCTGTTCGAGTATTTTCTTCTGCTCCCATCGTTAAGAGACACGATGTTT
TCACTCTGGAGACTGCTGATGGAATCTGTGTTGTTCTCAAGGGTTTCTTAAACAAACTGCGTGCCACTGATAATGGGTTCACGCCTCAGGTTTTCAAGCATTTTGTGTTT
GGGTTTCCTCCCAACTGGGAAACTCATGCAGCTAATTGCTTTGAGGCAGGAGCTTCTAATAGTACTGCTGCTGGGGGAAATGCTTCTGATACAGATAATCTGTCCTGTAG
ATCGAGAAGTGTTACTGGTAATGGATTGGATCATGGAGATTCTATGGCCGAAGAAACAATGCAGACTACAAGTGCAACTGAAACTCCAGCACCATTCACTGGTGCTGATG
TTCAAGATGAAGAAGTTGAAAACAAAGGAAAAAAAGAGAGAGAATCTCGGAAAAAAGTCACGAAGAAAATTATTTCCGATTCACCTGGAAGTGGTGTTAGTAAGAATACT
AGAGGAAGGAAGGAGAAGGAGTGTCTAGTATCTCCAGAATGTAGGAGTTATGGGCGATCCCGATCAGGGCGAGTACTTCTGCCAACAATGGAGTTTTGGCGCAACCAATT
ACCTGTTTACGACTCGGATCGAAAGTTAAGAGGAATACGTGAAGAGCAGCAAGACAAAAAAACCACACTCACAACCAAGAGGAAGGAAGAGCATAGGTCGAAGAAAGCGA
AAAGATAG
mRNA sequenceShow/hide mRNA sequence
TTTCCATCTTCTCCTCATCTAAGAATTTCAAAATTTTCCGAAGAAAAATCCTAAGGTAAACCAAATTCGCTTTCGAAATCTCTACAGAATTTATCAACCGATCATGGTTT
CTTCTCCAGAATTCCATGGAACAACAAACCCCACCAGCGGCGGCGCTTCAGTTTCTTACTTCCAGAGAACAGTTCGTTTGCTCGATTGGTGGTTAATCACAGCTGGAAAT
GACTCCAATGGGAAAACCCTAGCCGTCGCCGGCTTAACATCTAAACCGGGACAACCTGTTCGAGTATTTTCTTCTGCTCCCATCGTTAAGAGACACGATGTTTTCACTCT
GGAGACTGCTGATGGAATCTGTGTTGTTCTCAAGGGTTTCTTAAACAAACTGCGTGCCACTGATAATGGGTTCACGCCTCAGGTTTTCAAGCATTTTGTGTTTGGGTTTC
CTCCCAACTGGGAAACTCATGCAGCTAATTGCTTTGAGGCAGGAGCTTCTAATAGTACTGCTGCTGGGGGAAATGCTTCTGATACAGATAATCTGTCCTGTAGATCGAGA
AGTGTTACTGGTAATGGATTGGATCATGGAGATTCTATGGCCGAAGAAACAATGCAGACTACAAGTGCAACTGAAACTCCAGCACCATTCACTGGTGCTGATGTTCAAGA
TGAAGAAGTTGAAAACAAAGGAAAAAAAGAGAGAGAATCTCGGAAAAAAGTCACGAAGAAAATTATTTCCGATTCACCTGGAAGTGGTGTTAGTAAGAATACTAGAGGAA
GGAAGGAGAAGGAGTGTCTAGTATCTCCAGAATGTAGGAGTTATGGGCGATCCCGATCAGGGCGAGTACTTCTGCCAACAATGGAGTTTTGGCGCAACCAATTACCTGTT
TACGACTCGGATCGAAAGTTAAGAGGAATACGTGAAGAGCAGCAAGACAAAAAAACCACACTCACAACCAAGAGGAAGGAAGAGCATAGGTCGAAGAAAGCGAAAAGATA
GTTAAAGTGATAATGGTATGGGAATTAGGATGTTATTGTAGATATGGGTTTCTACTTTTTTTTCCCCTACTGTGAAGTACAAACCAAACGTATATCTATGTTTATATATT
TTGCTCGCTATGTTCTGCCCCCTTCTAGTTTCTTCTGACCCTGATATTCTTAGTGAATCATGAATCAATATCATGACCTTTTTAAATTAGTTATCGAGTCGATCTTTTCC
TTCACTAAATGACCCACTGAGATGTTTACACGCCGGTATTTTGTTTCATACGACTCTTTGACGTTTTACTAATGAGCCGAGTTCGAGTTAATTCCAAAATTAAAACATTT
TGATAATTCAGACGATCCTGTAAAAGAATGTTCTTTGCTTTATCAAAAATACTCAAAAGGTAAATATTAAAAACATTCTTCAAAAGTTGAGTCCTTTCAAATCAAGACGA
TTTCCCAATACAAAATAACTGTTTCAATCTATTATTTAAAAACGCTTCCACTGTTCTAAGTTTGAATGTCTAGCAAAGAATATGAATACTTTTTCAAACATCACCAAAAT
GAAATAACAAAACTACTAACATTTTGCTCTTACTGACAAATAAAAGTCCTATGAATTCCTCGATCTCAGCCAGTTGTTTCAAGGAATTAGTCCAGATGAGAGTGTTACCT
AATCTAACTATTCATGTAAGCTTTTATACGACAAGTACTAATCAGTACATGAACAAGTTTCTACCATTCCAAATTAAAGAAGGGAAAAGCTTTCAATCCAACGATGATAC
GAAAAAGAATTCATGTTCATATATAAGTAAACAACATCACAAAAAATCCATCCAAGTTTTGATACATACTGAGAAGTACCAGTAAAACTTGATTCAAATGGCTGGTAAAC
TAACTGCAGCATCTGTTATTTTGCACAAATGAAAAACATCTCACATTTTCCAACGCCAGTAATACCCCCATTATAGGATTTACCTGCAGCAGGCACAGCATTAACTTGTC
AAGCACATTCCTATGTTCTAAATCAACAATCTATCTGTCCGCTGTAGAAATATAAGTATGCTCGAATGCAAAAACAATATATGTGATTCTACAATAACATCTAGAGAAAA
AGAAGCTTAATCACTACAAGATTATGGTAAAAAAGCTTAATTCTCACTAGAAATGACTAAACTTATGTGATGGATTTTTAGCCTTCACTGAATCACCACCAAATCAAGAT
TTATACTAAAAAGAATGGCCATTAGTGAGAAATAAGCAGCATAAAAACAACTTGTGACTGCCAGACGCAAGTAAATGTAGAAATCCCAAATCAGAAATCAGAAACCCAGT
ATCAAAATCTACAATTATCTTCAAATGATTTGTGGATTTTTCAGACTAAAACCACAAATCCATAGACAAACCGAAACCATATGAGATACTAGATCTAATTGTTATGCAAA
TCGAAGAATATGTGCGGAGATTAGGGTTCCGAGAAAATACCGATTGTGCCTCTGCACGAAGGGAGAATTCTTCACCTCTTCCTCTGAAAGTTAATCCACGAGAGAATCAG
TAACATACAGAAGAGAAGGATTC
Protein sequenceShow/hide protein sequence
MVSSPEFHGTTNPTSGGASVSYFQRTVRLLDWWLITAGNDSNGKTLAVAGLTSKPGQPVRVFSSAPIVKRHDVFTLETADGICVVLKGFLNKLRATDNGFTPQVFKHFVF
GFPPNWETHAANCFEAGASNSTAAGGNASDTDNLSCRSRSVTGNGLDHGDSMAEETMQTTSATETPAPFTGADVQDEEVENKGKKERESRKKVTKKIISDSPGSGVSKNT
RGRKEKECLVSPECRSYGRSRSGRVLLPTMEFWRNQLPVYDSDRKLRGIREEQQDKKTTLTTKRKEEHRSKKAKR