; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MS015905 (gene) of Bitter gourd (TR) v1 genome

Gene IDMS015905
OrganismMomordica charantia cv. TR (Bitter gourd (TR) v1)
DescriptionSANTA domain-containing protein
Genome locationscaffold943_2:788944..791564
RNA-Seq ExpressionMS015905
SyntenyMS015905
Gene Ontology termsNA
InterPro domainsIPR015216 - SANT associated


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022143981.1 uncharacterized protein LOC111013765 isoform X1 [Momordica charantia]4.7e-14685.99Show/hide
Query:  MASTPQSHQTTTPNGDPKTRGGAAASYFQKTVCLHDWWLIRAENDFNRKTLAVAGLTSRPEQPVRVFSSAPIVKRYDVFTLETADGICVVIKGFINKLRT
        MASTPQSHQTTTPNGDPKTRGGAAASYFQKTVCLHDWWLIRAENDFNRKTLAVAGLTSRPEQPVRVFSSAPIVKRYDVFTLETADGICVVIKGFINKLRT
Subjt:  MASTPQSHQTTTPNGDPKTRGGAAASYFQKTVCLHDWWLIRAENDFNRKTLAVAGLTSRPEQPVRVFSSAPIVKRYDVFTLETADGICVVIKGFINKLRT

Query:  TDNGFTDEVFKHFVFGFPPNWETYAENYFEGEAFDSASAAGNISDTDNLLCKSK-----------------------------------------KLDHG
        TDNGFTDEVFKHFVFGFPPNWETYAENYFEGEAFDSASAAGNISDTDNLLCKSK                                         +LDHG
Subjt:  TDNGFTDEVFKHFVFGFPPNWETYAENYFEGEAFDSASAAGNISDTDNLLCKSK-----------------------------------------KLDHG

Query:  DTMAQDVMQNASATKTAVPLKNLVGSPHKTGANIQGEEIENKRKGEQDCCKQAKRKILFISPGSGIRQRQRQRQREEKICIKSPESLSYGRSRSGRLLLP
        DTMAQDVMQNASATKTAVPLKNLVGSPHKTGANIQGEEIENKRKGEQDCCKQAKRKILFISPGSGI  RQRQRQREEKICIKSPESLSYGRSRSGRLLLP
Subjt:  DTMAQDVMQNASATKTAVPLKNLVGSPHKTGANIQGEEIENKRKGEQDCCKQAKRKILFISPGSGIRQRQRQRQREEKICIKSPESLSYGRSRSGRLLLP

Query:  AMEFWRNQLPVYDA
        AMEFWRNQLPVYDA
Subjt:  AMEFWRNQLPVYDA

XP_022143982.1 uncharacterized protein LOC111013765 isoform X2 [Momordica charantia]4.3e-14788.52Show/hide
Query:  MASTPQSHQTTTPNGDPKTRGGAAASYFQKTVCLHDWWLIRAENDFNRKTLAVAGLTSRPEQPVRVFSSAPIVKRYDVFTLETADGICVVIKGFINKLRT
        MASTPQSHQTTTPNGDPKTRGGAAASYFQKTVCLHDWWLIRAENDFNRKTLAVAGLTSRPEQPVRVFSSAPIVKRYDVFTLETADGICVVIKGFINKLRT
Subjt:  MASTPQSHQTTTPNGDPKTRGGAAASYFQKTVCLHDWWLIRAENDFNRKTLAVAGLTSRPEQPVRVFSSAPIVKRYDVFTLETADGICVVIKGFINKLRT

Query:  TDNGFTDEVFKHFVFGFPPNWETYAENYFEGEAFDSASAAGNISDTDNLLCKSK--------------------------------KLDHGDTMAQDVMQ
        TDNGFTDEVFKHFVFGFPPNWETYAENYFEGEAFDSASAAGNISDTDNLLCKSK                                +LDHGDTMAQDVMQ
Subjt:  TDNGFTDEVFKHFVFGFPPNWETYAENYFEGEAFDSASAAGNISDTDNLLCKSK--------------------------------KLDHGDTMAQDVMQ

Query:  NASATKTAVPLKNLVGSPHKTGANIQGEEIENKRKGEQDCCKQAKRKILFISPGSGIRQRQRQRQREEKICIKSPESLSYGRSRSGRLLLPAMEFWRNQL
        NASATKTAVPLKNLVGSPHKTGANIQGEEIENKRKGEQDCCKQAKRKILFISPGSGI  RQRQRQREEKICIKSPESLSYGRSRSGRLLLPAMEFWRNQL
Subjt:  NASATKTAVPLKNLVGSPHKTGANIQGEEIENKRKGEQDCCKQAKRKILFISPGSGIRQRQRQRQREEKICIKSPESLSYGRSRSGRLLLPAMEFWRNQL

Query:  PVYDA
        PVYDA
Subjt:  PVYDA

XP_022143983.1 uncharacterized protein LOC111013765 isoform X3 [Momordica charantia]1.7e-14892.15Show/hide
Query:  MASTPQSHQTTTPNGDPKTRGGAAASYFQKTVCLHDWWLIRAENDFNRKTLAVAGLTSRPEQPVRVFSSAPIVKRYDVFTLETADGICVVIKGFINKLRT
        MASTPQSHQTTTPNGDPKTRGGAAASYFQKTVCLHDWWLIRAENDFNRKTLAVAGLTSRPEQPVRVFSSAPIVKRYDVFTLETADGICVVIKGFINKLRT
Subjt:  MASTPQSHQTTTPNGDPKTRGGAAASYFQKTVCLHDWWLIRAENDFNRKTLAVAGLTSRPEQPVRVFSSAPIVKRYDVFTLETADGICVVIKGFINKLRT

Query:  TDNGFTDEVFKHFVFGFPPNWETYAENYFEGEAFDSASAAGNISDTDNLLCKSK--------------------KLDHGDTMAQDVMQNASATKTAVPLK
        TDNGFTDEVFKHFVFGFPPNWETYAENYFEGEAFDSASAAGNISDTDNLLCKSK                    +LDHGDTMAQDVMQNASATKTAVPLK
Subjt:  TDNGFTDEVFKHFVFGFPPNWETYAENYFEGEAFDSASAAGNISDTDNLLCKSK--------------------KLDHGDTMAQDVMQNASATKTAVPLK

Query:  NLVGSPHKTGANIQGEEIENKRKGEQDCCKQAKRKILFISPGSGIRQRQRQRQREEKICIKSPESLSYGRSRSGRLLLPAMEFWRNQLPVYDA
        NLVGSPHKTGANIQGEEIENKRKGEQDCCKQAKRKILFISPGSGI  RQRQRQREEKICIKSPESLSYGRSRSGRLLLPAMEFWRNQLPVYDA
Subjt:  NLVGSPHKTGANIQGEEIENKRKGEQDCCKQAKRKILFISPGSGIRQRQRQRQREEKICIKSPESLSYGRSRSGRLLLPAMEFWRNQLPVYDA

XP_022143984.1 uncharacterized protein LOC111013765 isoform X4 [Momordica charantia]1.6e-14995.07Show/hide
Query:  MASTPQSHQTTTPNGDPKTRGGAAASYFQKTVCLHDWWLIRAENDFNRKTLAVAGLTSRPEQPVRVFSSAPIVKRYDVFTLETADGICVVIKGFINKLRT
        MASTPQSHQTTTPNGDPKTRGGAAASYFQKTVCLHDWWLIRAENDFNRKTLAVAGLTSRPEQPVRVFSSAPIVKRYDVFTLETADGICVVIKGFINKLRT
Subjt:  MASTPQSHQTTTPNGDPKTRGGAAASYFQKTVCLHDWWLIRAENDFNRKTLAVAGLTSRPEQPVRVFSSAPIVKRYDVFTLETADGICVVIKGFINKLRT

Query:  TDNGFTDEVFKHFVFGFPPNWETYAENYFEGEAFDSASAAGNISDTDNLLCKSK-----------KLDHGDTMAQDVMQNASATKTAVPLKNLVGSPHKT
        TDNGFTDEVFKHFVFGFPPNWETYAENYFEGEAFDSASAAGNISDTDNLLCKSK           +LDHGDTMAQDVMQNASATKTAVPLKNLVGSPHKT
Subjt:  TDNGFTDEVFKHFVFGFPPNWETYAENYFEGEAFDSASAAGNISDTDNLLCKSK-----------KLDHGDTMAQDVMQNASATKTAVPLKNLVGSPHKT

Query:  GANIQGEEIENKRKGEQDCCKQAKRKILFISPGSGIRQRQRQRQREEKICIKSPESLSYGRSRSGRLLLPAMEFWRNQLPVYDA
        GANIQGEEIENKRKGEQDCCKQAKRKILFISPGSGI  RQRQRQREEKICIKSPESLSYGRSRSGRLLLPAMEFWRNQLPVYDA
Subjt:  GANIQGEEIENKRKGEQDCCKQAKRKILFISPGSGIRQRQRQRQREEKICIKSPESLSYGRSRSGRLLLPAMEFWRNQLPVYDA

XP_022978049.1 uncharacterized protein LOC111478148 [Cucurbita maxima]3.5e-10170.55Show/hide
Query:  MASTPQSHQTTTPNGDPKTRGGAAASYFQKTVCLHDWWLIRAENDFNRKTLAVAGLTSRPEQPVRVFSSAPIVKRYDVFTLETADGICVVIKGFINKLRT
        MASTP+ H+ TTP  D K    AA SYFQKTVCL DWWLIRAEND N K+LAVAGLTS P QPVRVFSSAPIVKR+DVFTLETADGICVV+KGFINKLRT
Subjt:  MASTPQSHQTTTPNGDPKTRGGAAASYFQKTVCLHDWWLIRAENDFNRKTLAVAGLTSRPEQPVRVFSSAPIVKRYDVFTLETADGICVVIKGFINKLRT

Query:  TDNGFTDEVFKHFVFGFPPNWETYAENYFEGEAFDSASAAGNISDTDNLLCKSKK--LDHGDTMAQDVMQNASATKTAVPLKNLVGSPHKTGANIQGEEI
        TDNGFT EVFKHFVFGFPPNWETYA N F+GEAFD+ +A GNISDTD+L C+SK   LD GD+MAQD+MQ  SA +  VP          TG++IQ E++
Subjt:  TDNGFTDEVFKHFVFGFPPNWETYAENYFEGEAFDSASAAGNISDTDNLLCKSKK--LDHGDTMAQDVMQNASATKTAVPLKNLVGSPHKTGANIQGEEI

Query:  ENKRKGEQDCCKQAKRKILFISPGSGIRQRQRQRQREEKICIKSPESLSYGRSRSGRLLLPAMEFWRNQLPVYDA
        ENKR+ E++  ++AK+KI+F SPGSG+    R RQREEK C+ SPE LSYGRSRSGRLLLP MEFWRNQLPVYD+
Subjt:  ENKRKGEQDCCKQAKRKILFISPGSGIRQRQRQRQREEKICIKSPESLSYGRSRSGRLLLPAMEFWRNQLPVYDA

TrEMBL top hitse value%identityAlignment
A0A6J1CQY8 uncharacterized protein LOC111013765 isoform X47.6e-15095.07Show/hide
Query:  MASTPQSHQTTTPNGDPKTRGGAAASYFQKTVCLHDWWLIRAENDFNRKTLAVAGLTSRPEQPVRVFSSAPIVKRYDVFTLETADGICVVIKGFINKLRT
        MASTPQSHQTTTPNGDPKTRGGAAASYFQKTVCLHDWWLIRAENDFNRKTLAVAGLTSRPEQPVRVFSSAPIVKRYDVFTLETADGICVVIKGFINKLRT
Subjt:  MASTPQSHQTTTPNGDPKTRGGAAASYFQKTVCLHDWWLIRAENDFNRKTLAVAGLTSRPEQPVRVFSSAPIVKRYDVFTLETADGICVVIKGFINKLRT

Query:  TDNGFTDEVFKHFVFGFPPNWETYAENYFEGEAFDSASAAGNISDTDNLLCKSK-----------KLDHGDTMAQDVMQNASATKTAVPLKNLVGSPHKT
        TDNGFTDEVFKHFVFGFPPNWETYAENYFEGEAFDSASAAGNISDTDNLLCKSK           +LDHGDTMAQDVMQNASATKTAVPLKNLVGSPHKT
Subjt:  TDNGFTDEVFKHFVFGFPPNWETYAENYFEGEAFDSASAAGNISDTDNLLCKSK-----------KLDHGDTMAQDVMQNASATKTAVPLKNLVGSPHKT

Query:  GANIQGEEIENKRKGEQDCCKQAKRKILFISPGSGIRQRQRQRQREEKICIKSPESLSYGRSRSGRLLLPAMEFWRNQLPVYDA
        GANIQGEEIENKRKGEQDCCKQAKRKILFISPGSGI  RQRQRQREEKICIKSPESLSYGRSRSGRLLLPAMEFWRNQLPVYDA
Subjt:  GANIQGEEIENKRKGEQDCCKQAKRKILFISPGSGIRQRQRQRQREEKICIKSPESLSYGRSRSGRLLLPAMEFWRNQLPVYDA

A0A6J1CRX9 uncharacterized protein LOC111013765 isoform X12.3e-14685.99Show/hide
Query:  MASTPQSHQTTTPNGDPKTRGGAAASYFQKTVCLHDWWLIRAENDFNRKTLAVAGLTSRPEQPVRVFSSAPIVKRYDVFTLETADGICVVIKGFINKLRT
        MASTPQSHQTTTPNGDPKTRGGAAASYFQKTVCLHDWWLIRAENDFNRKTLAVAGLTSRPEQPVRVFSSAPIVKRYDVFTLETADGICVVIKGFINKLRT
Subjt:  MASTPQSHQTTTPNGDPKTRGGAAASYFQKTVCLHDWWLIRAENDFNRKTLAVAGLTSRPEQPVRVFSSAPIVKRYDVFTLETADGICVVIKGFINKLRT

Query:  TDNGFTDEVFKHFVFGFPPNWETYAENYFEGEAFDSASAAGNISDTDNLLCKSK-----------------------------------------KLDHG
        TDNGFTDEVFKHFVFGFPPNWETYAENYFEGEAFDSASAAGNISDTDNLLCKSK                                         +LDHG
Subjt:  TDNGFTDEVFKHFVFGFPPNWETYAENYFEGEAFDSASAAGNISDTDNLLCKSK-----------------------------------------KLDHG

Query:  DTMAQDVMQNASATKTAVPLKNLVGSPHKTGANIQGEEIENKRKGEQDCCKQAKRKILFISPGSGIRQRQRQRQREEKICIKSPESLSYGRSRSGRLLLP
        DTMAQDVMQNASATKTAVPLKNLVGSPHKTGANIQGEEIENKRKGEQDCCKQAKRKILFISPGSGI  RQRQRQREEKICIKSPESLSYGRSRSGRLLLP
Subjt:  DTMAQDVMQNASATKTAVPLKNLVGSPHKTGANIQGEEIENKRKGEQDCCKQAKRKILFISPGSGIRQRQRQRQREEKICIKSPESLSYGRSRSGRLLLP

Query:  AMEFWRNQLPVYDA
        AMEFWRNQLPVYDA
Subjt:  AMEFWRNQLPVYDA

A0A6J1CS24 uncharacterized protein LOC111013765 isoform X38.4e-14992.15Show/hide
Query:  MASTPQSHQTTTPNGDPKTRGGAAASYFQKTVCLHDWWLIRAENDFNRKTLAVAGLTSRPEQPVRVFSSAPIVKRYDVFTLETADGICVVIKGFINKLRT
        MASTPQSHQTTTPNGDPKTRGGAAASYFQKTVCLHDWWLIRAENDFNRKTLAVAGLTSRPEQPVRVFSSAPIVKRYDVFTLETADGICVVIKGFINKLRT
Subjt:  MASTPQSHQTTTPNGDPKTRGGAAASYFQKTVCLHDWWLIRAENDFNRKTLAVAGLTSRPEQPVRVFSSAPIVKRYDVFTLETADGICVVIKGFINKLRT

Query:  TDNGFTDEVFKHFVFGFPPNWETYAENYFEGEAFDSASAAGNISDTDNLLCKSK--------------------KLDHGDTMAQDVMQNASATKTAVPLK
        TDNGFTDEVFKHFVFGFPPNWETYAENYFEGEAFDSASAAGNISDTDNLLCKSK                    +LDHGDTMAQDVMQNASATKTAVPLK
Subjt:  TDNGFTDEVFKHFVFGFPPNWETYAENYFEGEAFDSASAAGNISDTDNLLCKSK--------------------KLDHGDTMAQDVMQNASATKTAVPLK

Query:  NLVGSPHKTGANIQGEEIENKRKGEQDCCKQAKRKILFISPGSGIRQRQRQRQREEKICIKSPESLSYGRSRSGRLLLPAMEFWRNQLPVYDA
        NLVGSPHKTGANIQGEEIENKRKGEQDCCKQAKRKILFISPGSGI  RQRQRQREEKICIKSPESLSYGRSRSGRLLLPAMEFWRNQLPVYDA
Subjt:  NLVGSPHKTGANIQGEEIENKRKGEQDCCKQAKRKILFISPGSGIRQRQRQRQREEKICIKSPESLSYGRSRSGRLLLPAMEFWRNQLPVYDA

A0A6J1CSE0 uncharacterized protein LOC111013765 isoform X22.1e-14788.52Show/hide
Query:  MASTPQSHQTTTPNGDPKTRGGAAASYFQKTVCLHDWWLIRAENDFNRKTLAVAGLTSRPEQPVRVFSSAPIVKRYDVFTLETADGICVVIKGFINKLRT
        MASTPQSHQTTTPNGDPKTRGGAAASYFQKTVCLHDWWLIRAENDFNRKTLAVAGLTSRPEQPVRVFSSAPIVKRYDVFTLETADGICVVIKGFINKLRT
Subjt:  MASTPQSHQTTTPNGDPKTRGGAAASYFQKTVCLHDWWLIRAENDFNRKTLAVAGLTSRPEQPVRVFSSAPIVKRYDVFTLETADGICVVIKGFINKLRT

Query:  TDNGFTDEVFKHFVFGFPPNWETYAENYFEGEAFDSASAAGNISDTDNLLCKSK--------------------------------KLDHGDTMAQDVMQ
        TDNGFTDEVFKHFVFGFPPNWETYAENYFEGEAFDSASAAGNISDTDNLLCKSK                                +LDHGDTMAQDVMQ
Subjt:  TDNGFTDEVFKHFVFGFPPNWETYAENYFEGEAFDSASAAGNISDTDNLLCKSK--------------------------------KLDHGDTMAQDVMQ

Query:  NASATKTAVPLKNLVGSPHKTGANIQGEEIENKRKGEQDCCKQAKRKILFISPGSGIRQRQRQRQREEKICIKSPESLSYGRSRSGRLLLPAMEFWRNQL
        NASATKTAVPLKNLVGSPHKTGANIQGEEIENKRKGEQDCCKQAKRKILFISPGSGI  RQRQRQREEKICIKSPESLSYGRSRSGRLLLPAMEFWRNQL
Subjt:  NASATKTAVPLKNLVGSPHKTGANIQGEEIENKRKGEQDCCKQAKRKILFISPGSGIRQRQRQRQREEKICIKSPESLSYGRSRSGRLLLPAMEFWRNQL

Query:  PVYDA
        PVYDA
Subjt:  PVYDA

A0A6J1IT09 uncharacterized protein LOC1114781481.7e-10170.55Show/hide
Query:  MASTPQSHQTTTPNGDPKTRGGAAASYFQKTVCLHDWWLIRAENDFNRKTLAVAGLTSRPEQPVRVFSSAPIVKRYDVFTLETADGICVVIKGFINKLRT
        MASTP+ H+ TTP  D K    AA SYFQKTVCL DWWLIRAEND N K+LAVAGLTS P QPVRVFSSAPIVKR+DVFTLETADGICVV+KGFINKLRT
Subjt:  MASTPQSHQTTTPNGDPKTRGGAAASYFQKTVCLHDWWLIRAENDFNRKTLAVAGLTSRPEQPVRVFSSAPIVKRYDVFTLETADGICVVIKGFINKLRT

Query:  TDNGFTDEVFKHFVFGFPPNWETYAENYFEGEAFDSASAAGNISDTDNLLCKSKK--LDHGDTMAQDVMQNASATKTAVPLKNLVGSPHKTGANIQGEEI
        TDNGFT EVFKHFVFGFPPNWETYA N F+GEAFD+ +A GNISDTD+L C+SK   LD GD+MAQD+MQ  SA +  VP          TG++IQ E++
Subjt:  TDNGFTDEVFKHFVFGFPPNWETYAENYFEGEAFDSASAAGNISDTDNLLCKSKK--LDHGDTMAQDVMQNASATKTAVPLKNLVGSPHKTGANIQGEEI

Query:  ENKRKGEQDCCKQAKRKILFISPGSGIRQRQRQRQREEKICIKSPESLSYGRSRSGRLLLPAMEFWRNQLPVYDA
        ENKR+ E++  ++AK+KI+F SPGSG+    R RQREEK C+ SPE LSYGRSRSGRLLLP MEFWRNQLPVYD+
Subjt:  ENKRKGEQDCCKQAKRKILFISPGSGIRQRQRQRQREEKICIKSPESLSYGRSRSGRLLLPAMEFWRNQLPVYDA

SwissProt top hitse value%identityAlignment
F4KCE9 Kinetochore-associated protein KNL-2 homolog1.9e-2545.83Show/hide
Query:  DPKTRGGAAASYFQKTVCLHDWWLIRAENDFNRKTLAVAGLTSRPE-QPVRVFSSAPIVKRYDVFTLETADGICVVIKGFINKLRTTDNGFTDEVFKHFV
        +P      + S FQKTV L DWWLI+   +F  K   VAG     E + +RVF+S+PI K  DVFTL  +DGI + ++GF+NK R   NGF  E+ + F+
Subjt:  DPKTRGGAAASYFQKTVCLHDWWLIRAENDFNRKTLAVAGLTSRPE-QPVRVFSSAPIVKRYDVFTLETADGICVVIKGFINKLRTTDNGFTDEVFKHFV

Query:  FGFPPNWETYAENYFEGEAF
        FGFPP WE    + FEG++F
Subjt:  FGFPPNWETYAENYFEGEAF

F4KCE9 Kinetochore-associated protein KNL-2 homolog4.9e-0541.27Show/hide
Query:  KQAKRKILFISPGSGIRQRQRQRQREEKICIKSPESLSYGRSRSGRLLLPAMEFWRNQLPVYD
        ++ KRKI F      +     ++ +++K    S +SL   RSRSGR+L+ ++EFWRNQ+PVYD
Subjt:  KQAKRKILFISPGSGIRQRQRQRQREEKICIKSPESLSYGRSRSGRLLLPAMEFWRNQLPVYD

Q8RWD7 Protein EMBRYO DEFECTIVE 16742.2e-1329.39Show/hide
Query:  STPQSHQTT--TPNGDPKTRGGAAASY---------FQKTVCLHDWWLIRAENDFNRKTLAVAGLTSRPEQPVRVFSSAPIVKRYDVFTLETADGICVVI
        ++P ++  T   PN  P T G     +           K+V L DWWL +   D     L + G  S     VR+FSS  I KR++  TLE  DGI + I
Subjt:  STPQSHQTT--TPNGDPKTRGGAAASY---------FQKTVCLHDWWLIRAENDFNRKTLAVAGLTSRPEQPVRVFSSAPIVKRYDVFTLETADGICVVI

Query:  KGFINKLRTTDNGFTDEVFKHFVFGFPPNWETYAENYFEGEAFDSASAAGNISDTDNLLCKSKKLDHGDTMAQDVMQNASATKTAVPLKNLVGSPHKTGA
         GFIN+ R  +NG + EV   F  GFP +WE Y E     E  +      +IS  D  +             QD+       K  + L ++VGS      
Subjt:  KGFINKLRTTDNGFTDEVFKHFVFGFPPNWETYAENYFEGEAFDSASAAGNISDTDNLLCKSKKLDHGDTMAQDVMQNASATKTAVPLKNLVGSPHKTGA

Query:  NIQGEEIENKRKGEQDCCKQAKRKILFISPGSGIRQRQRQRQREE
            +  E  R G+ D           +S   G++ R   R+REE
Subjt:  NIQGEEIENKRKGEQDCCKQAKRKILFISPGSGIRQRQRQRQREE

Arabidopsis top hitse value%identityAlignment
AT1G58210.1 kinase interacting family protein1.6e-1429.39Show/hide
Query:  STPQSHQTT--TPNGDPKTRGGAAASY---------FQKTVCLHDWWLIRAENDFNRKTLAVAGLTSRPEQPVRVFSSAPIVKRYDVFTLETADGICVVI
        ++P ++  T   PN  P T G     +           K+V L DWWL +   D     L + G  S     VR+FSS  I KR++  TLE  DGI + I
Subjt:  STPQSHQTT--TPNGDPKTRGGAAASY---------FQKTVCLHDWWLIRAENDFNRKTLAVAGLTSRPEQPVRVFSSAPIVKRYDVFTLETADGICVVI

Query:  KGFINKLRTTDNGFTDEVFKHFVFGFPPNWETYAENYFEGEAFDSASAAGNISDTDNLLCKSKKLDHGDTMAQDVMQNASATKTAVPLKNLVGSPHKTGA
         GFIN+ R  +NG + EV   F  GFP +WE Y E     E  +      +IS  D  +             QD+       K  + L ++VGS      
Subjt:  KGFINKLRTTDNGFTDEVFKHFVFGFPPNWETYAENYFEGEAFDSASAAGNISDTDNLLCKSKKLDHGDTMAQDVMQNASATKTAVPLKNLVGSPHKTGA

Query:  NIQGEEIENKRKGEQDCCKQAKRKILFISPGSGIRQRQRQRQREE
            +  E  R G+ D           +S   G++ R   R+REE
Subjt:  NIQGEEIENKRKGEQDCCKQAKRKILFISPGSGIRQRQRQRQREE

AT5G02520.1 CONTAINS InterPro DOMAIN/s: SANT associated (InterPro:IPR015216)1.4e-2645.83Show/hide
Query:  DPKTRGGAAASYFQKTVCLHDWWLIRAENDFNRKTLAVAGLTSRPE-QPVRVFSSAPIVKRYDVFTLETADGICVVIKGFINKLRTTDNGFTDEVFKHFV
        +P      + S FQKTV L DWWLI+   +F  K   VAG     E + +RVF+S+PI K  DVFTL  +DGI + ++GF+NK R   NGF  E+ + F+
Subjt:  DPKTRGGAAASYFQKTVCLHDWWLIRAENDFNRKTLAVAGLTSRPE-QPVRVFSSAPIVKRYDVFTLETADGICVVIKGFINKLRTTDNGFTDEVFKHFV

Query:  FGFPPNWETYAENYFEGEAF
        FGFPP WE    + FEG++F
Subjt:  FGFPPNWETYAENYFEGEAF

AT5G02520.1 CONTAINS InterPro DOMAIN/s: SANT associated (InterPro:IPR015216)3.5e-0641.27Show/hide
Query:  KQAKRKILFISPGSGIRQRQRQRQREEKICIKSPESLSYGRSRSGRLLLPAMEFWRNQLPVYD
        ++ KRKI F      +     ++ +++K    S +SL   RSRSGR+L+ ++EFWRNQ+PVYD
Subjt:  KQAKRKILFISPGSGIRQRQRQRQREEKICIKSPESLSYGRSRSGRLLLPAMEFWRNQLPVYD


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTTCTACTCCACAATCTCATCAAACTACAACCCCTAACGGCGACCCCAAAACCCGCGGCGGCGCCGCAGCTTCTTACTTCCAGAAAACAGTCTGTTTGCACGATTG
GTGGTTAATTAGAGCTGAAAATGACTTCAATCGAAAAACGCTAGCCGTCGCTGGGCTCACTTCCAGACCGGAACAACCTGTTCGAGTATTTTCTTCTGCGCCGATTGTTA
AGAGGTACGATGTTTTCACTCTCGAGACTGCGGATGGAATCTGTGTTGTTATTAAGGGTTTCATAAACAAACTCCGTACTACTGATAATGGGTTCACAGATGAGGTTTTT
AAGCATTTTGTGTTTGGGTTTCCTCCTAACTGGGAAACTTATGCGGAGAATTACTTTGAGGGAGAAGCTTTTGATAGTGCTTCTGCTGCGGGAAATATTTCTGATACAGA
CAATTTGCTATGTAAATCGAAAAAATTGGATCATGGAGATACTATGGCCCAAGATGTGATGCAGAATGCAAGTGCAACTAAAACTGCTGTGCCTCTCAAAAATCTGGTTG
GTTCACCCCATAAAACTGGTGCTAATATTCAAGGTGAAGAAATTGAAAACAAAAGAAAAGGAGAGCAAGATTGTTGCAAGCAAGCCAAGAGGAAAATTCTCTTCATTTCA
CCTGGAAGTGGTATTAGGCAAAGGCAAAGGCAAAGGCAAAGGGAGGAGAAAATCTGCATTAAATCTCCAGAAAGTTTGAGTTATGGGCGATCTCGATCAGGGAGATTACT
TCTGCCGGCGATGGAATTTTGGCGCAACCAATTACCTGTTTACGATGCGGTTAGTGCTGTGGTTTTCATTTTTATCTGTTTTGTT
mRNA sequenceShow/hide mRNA sequence
ATGGCTTCTACTCCACAATCTCATCAAACTACAACCCCTAACGGCGACCCCAAAACCCGCGGCGGCGCCGCAGCTTCTTACTTCCAGAAAACAGTCTGTTTGCACGATTG
GTGGTTAATTAGAGCTGAAAATGACTTCAATCGAAAAACGCTAGCCGTCGCTGGGCTCACTTCCAGACCGGAACAACCTGTTCGAGTATTTTCTTCTGCGCCGATTGTTA
AGAGGTACGATGTTTTCACTCTCGAGACTGCGGATGGAATCTGTGTTGTTATTAAGGGTTTCATAAACAAACTCCGTACTACTGATAATGGGTTCACAGATGAGGTTTTT
AAGCATTTTGTGTTTGGGTTTCCTCCTAACTGGGAAACTTATGCGGAGAATTACTTTGAGGGAGAAGCTTTTGATAGTGCTTCTGCTGCGGGAAATATTTCTGATACAGA
CAATTTGCTATGTAAATCGAAAAAATTGGATCATGGAGATACTATGGCCCAAGATGTGATGCAGAATGCAAGTGCAACTAAAACTGCTGTGCCTCTCAAAAATCTGGTTG
GTTCACCCCATAAAACTGGTGCTAATATTCAAGGTGAAGAAATTGAAAACAAAAGAAAAGGAGAGCAAGATTGTTGCAAGCAAGCCAAGAGGAAAATTCTCTTCATTTCA
CCTGGAAGTGGTATTAGGCAAAGGCAAAGGCAAAGGCAAAGGGAGGAGAAAATCTGCATTAAATCTCCAGAAAGTTTGAGTTATGGGCGATCTCGATCAGGGAGATTACT
TCTGCCGGCGATGGAATTTTGGCGCAACCAATTACCTGTTTACGATGCGGTTAGTGCTGTGGTTTTCATTTTTATCTGTTTTGTT
Protein sequenceShow/hide protein sequence
MASTPQSHQTTTPNGDPKTRGGAAASYFQKTVCLHDWWLIRAENDFNRKTLAVAGLTSRPEQPVRVFSSAPIVKRYDVFTLETADGICVVIKGFINKLRTTDNGFTDEVF
KHFVFGFPPNWETYAENYFEGEAFDSASAAGNISDTDNLLCKSKKLDHGDTMAQDVMQNASATKTAVPLKNLVGSPHKTGANIQGEEIENKRKGEQDCCKQAKRKILFIS
PGSGIRQRQRQRQREEKICIKSPESLSYGRSRSGRLLLPAMEFWRNQLPVYDAVSAVVFIFICFV