; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MC09g0458 (gene) of Bitter gourd (Dali-11) v1 genome

Gene IDMC09g0458
OrganismMomordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
Descriptionprotein FRA10AC1
Genome locationMC09:4072651..4078802
RNA-Seq ExpressionMC09g0458
SyntenyMC09g0458
Gene Ontology termsGO:0016311 - dephosphorylation (biological process)
GO:0016791 - phosphatase activity (molecular function)
InterPro domainsIPR019129 - Folate-sensitive fragile site protein Fra10Ac1


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022152761.1 protein FRA10AC1 [Momordica charantia]1.83e-16493.1Show/hide
Query:  MASFGSLKNAIFEREEKKQQYQAHVRGLNAYDRHKKFIHDYVHFYGKNKSREEKFPIKTDQDTLREGYRFIRSEEDDMDTSWEQKLVKRYYDKLFKEYCI
        MASFGSLKNAIFEREEKKQQYQAHVRGLNAYDRHKKFIHDYVHFYGKNKSREEKFPIKTDQDTLREGYRFIRSEEDDMDTSWEQKLVKRYYDKLFKEYCI
Subjt:  MASFGSLKNAIFEREEKKQQYQAHVRGLNAYDRHKKFIHDYVHFYGKNKSREEKFPIKTDQDTLREGYRFIRSEEDDMDTSWEQKLVKRYYDKLFKEYCI

Query:  ADMTHYKSGKIGLRWRTEKEVISGKGQFICGNKHCDEKDGLASYEASNFSYSEAGENKQALVKLVTCERSIYQLFLPSLTIWNRCSKKLHYKRKKEKEKQ
        ADMTHYKSGKIGLRWRTEKEVISGKGQFICGNKHCDEKDGLASYE  NFSYSEAGENKQALVKLVTCER               CSKKLHYKRKKEKEKQ
Subjt:  ADMTHYKSGKIGLRWRTEKEVISGKGQFICGNKHCDEKDGLASYEASNFSYSEAGENKQALVKLVTCERSIYQLFLPSLTIWNRCSKKLHYKRKKEKEKQ

Query:  ERMEQELSKRKRWPSDGSSSDSEDEGSRTRRRKGKKASTSHVDHKVDDKEDFDEFLEGMFP
        ERMEQELSKRKR PSDGSSSDSEDEGSRTRRRKGKKASTSHVDHKVDDKEDFDEFLEGMFP
Subjt:  ERMEQELSKRKRWPSDGSSSDSEDEGSRTRRRKGKKASTSHVDHKVDDKEDFDEFLEGMFP

XP_022960519.1 protein FRA10AC1 isoform X1 [Cucurbita moschata]5.05e-14886.21Show/hide
Query:  MASFGSLKNAIFEREEKKQQYQAHVRGLNAYDRHKKFIHDYVHFYGKNKSREEKFPIKTDQDTLREGYRFIRSEEDDMDTSWEQKLVKRYYDKLFKEYCI
        MASFGSLKNAIFEREE+KQQYQAHVRGLNAYDRHKKF+HDYVHFYGKNKSREE FPIKTDQDTLREGYRFIRSEEDDMDTSWEQKLVKRYYDKLFKEYCI
Subjt:  MASFGSLKNAIFEREEKKQQYQAHVRGLNAYDRHKKFIHDYVHFYGKNKSREEKFPIKTDQDTLREGYRFIRSEEDDMDTSWEQKLVKRYYDKLFKEYCI

Query:  ADMTHYKSGKIGLRWRTEKEVISGKGQFICGNKHCDEKDGLASYEASNFSYSEAGENKQALVKLVTCERSIYQLFLPSLTIWNRCSKKLHYKRKKEKEKQ
        ADMT YKSGKIGLRWR EKEV+SGKGQFICGNKHCDEKDGLASYE  NFSY EAGENKQALVKLVTCER               CSKKLHYKR KEKEK 
Subjt:  ADMTHYKSGKIGLRWRTEKEVISGKGQFICGNKHCDEKDGLASYEASNFSYSEAGENKQALVKLVTCERSIYQLFLPSLTIWNRCSKKLHYKRKKEKEKQ

Query:  ERMEQELSKRKRWPSDGSSSDSEDEGSRTRRRKGKKASTSHVDHKVDDKEDFDEFLEGMFP
        ERMEQE+SKRKR PSD SS DSED GSRTRRRKGKKASTS  D K DDKEDFDE+LEGMFP
Subjt:  ERMEQELSKRKRWPSDGSSSDSEDEGSRTRRRKGKKASTSHVDHKVDDKEDFDEFLEGMFP

XP_023004639.1 protein FRA10AC1 isoform X2 [Cucurbita maxima]5.05e-14886.21Show/hide
Query:  MASFGSLKNAIFEREEKKQQYQAHVRGLNAYDRHKKFIHDYVHFYGKNKSREEKFPIKTDQDTLREGYRFIRSEEDDMDTSWEQKLVKRYYDKLFKEYCI
        MASFGSLKNAIFEREE+KQQYQAHVRGLNAYDRHKKF+HDYVHFYGKNKSREE FPIKTDQDTLREGYRFIRSEEDDMDTSWEQKLVKRYYDKLFKEYCI
Subjt:  MASFGSLKNAIFEREEKKQQYQAHVRGLNAYDRHKKFIHDYVHFYGKNKSREEKFPIKTDQDTLREGYRFIRSEEDDMDTSWEQKLVKRYYDKLFKEYCI

Query:  ADMTHYKSGKIGLRWRTEKEVISGKGQFICGNKHCDEKDGLASYEASNFSYSEAGENKQALVKLVTCERSIYQLFLPSLTIWNRCSKKLHYKRKKEKEKQ
        ADMT YKSGKIGLRWR EKEV+SGKGQFICGNKHCDEKDGLASYE  NFSY EAGENKQALVKLVTCER               CSKKLHYKR KEKEK 
Subjt:  ADMTHYKSGKIGLRWRTEKEVISGKGQFICGNKHCDEKDGLASYEASNFSYSEAGENKQALVKLVTCERSIYQLFLPSLTIWNRCSKKLHYKRKKEKEKQ

Query:  ERMEQELSKRKRWPSDGSSSDSEDEGSRTRRRKGKKASTSHVDHKVDDKEDFDEFLEGMFP
        ERMEQE+SKRKR PSD SS DSED GSRTRRRKGKKASTS  D K DDKEDFDE+LEGMFP
Subjt:  ERMEQELSKRKRWPSDGSSSDSEDEGSRTRRRKGKKASTSHVDHKVDDKEDFDEFLEGMFP

XP_023515219.1 protein FRA10AC1 isoform X2 [Cucurbita pepo subsp. pepo]1.19e-14685.44Show/hide
Query:  MASFGSLKNAIFEREEKKQQYQAHVRGLNAYDRHKKFIHDYVHFYGKNKSREEKFPIKTDQDTLREGYRFIRSEEDDMDTSWEQKLVKRYYDKLFKEYCI
        MASFGSLKNAIFEREE+KQQYQAHVRGLNAYDRHKK +HDYVHFYGKNKSREE FPIKTDQDTLREGYRFIRSEEDDMDTSWEQKLVKRYYDKLFKEYCI
Subjt:  MASFGSLKNAIFEREEKKQQYQAHVRGLNAYDRHKKFIHDYVHFYGKNKSREEKFPIKTDQDTLREGYRFIRSEEDDMDTSWEQKLVKRYYDKLFKEYCI

Query:  ADMTHYKSGKIGLRWRTEKEVISGKGQFICGNKHCDEKDGLASYEASNFSYSEAGENKQALVKLVTCERSIYQLFLPSLTIWNRCSKKLHYKRKKEKEKQ
        ADMT YKSGKIGLRWR EKEV+SGKGQFICGNKHCDEKDGLASYE  NFSY EAGENKQALVKLVTCER               CSKKLHYKR K+KEK 
Subjt:  ADMTHYKSGKIGLRWRTEKEVISGKGQFICGNKHCDEKDGLASYEASNFSYSEAGENKQALVKLVTCERSIYQLFLPSLTIWNRCSKKLHYKRKKEKEKQ

Query:  ERMEQELSKRKRWPSDGSSSDSEDEGSRTRRRKGKKASTSHVDHKVDDKEDFDEFLEGMFP
        ERMEQE+SKRKR PSD SS DSED GSRTRRRKGKKASTS  D K DDKEDFDE+LEGMFP
Subjt:  ERMEQELSKRKRWPSDGSSSDSEDEGSRTRRRKGKKASTSHVDHKVDDKEDFDEFLEGMFP

XP_031745033.1 protein FRA10AC1 isoform X1 [Cucumis sativus]6.44e-14684.67Show/hide
Query:  MASFGSLKNAIFEREEKKQQYQAHVRGLNAYDRHKKFIHDYVHFYGKNKSREEKFPIKTDQDTLREGYRFIRSEEDDMDTSWEQKLVKRYYDKLFKEYCI
        MASFGSLKNAIFEREE+KQQYQAHVRGLNAYDRHKKF+HDYVHFYGKNK+ EEK PIKTDQDTLREGYRFIRSEEDDMDTSWEQKLVKRYYDKLFKEYCI
Subjt:  MASFGSLKNAIFEREEKKQQYQAHVRGLNAYDRHKKFIHDYVHFYGKNKSREEKFPIKTDQDTLREGYRFIRSEEDDMDTSWEQKLVKRYYDKLFKEYCI

Query:  ADMTHYKSGKIGLRWRTEKEVISGKGQFICGNKHCDEKDGLASYEASNFSYSEAGENKQALVKLVTCERSIYQLFLPSLTIWNRCSKKLHYKRKKEKEKQ
        ADMT YKSGKIGLRWR EKEV+SGKGQFICGNKHCDEK GLASYE  NFSY EAGENKQALVKLVTC RS+   ++P      RCSKKLHYKR+KEKEK 
Subjt:  ADMTHYKSGKIGLRWRTEKEVISGKGQFICGNKHCDEKDGLASYEASNFSYSEAGENKQALVKLVTCERSIYQLFLPSLTIWNRCSKKLHYKRKKEKEKQ

Query:  ERMEQELSKRKRWPSDGSSSDSEDEGSRTRRRKGKKASTSHVDHKVDDKEDFDEFLEGMFP
        ER EQE+SKRKR PSD SS D+EDEGSRTRR KGKKASTS  DHK D KE+FDE+LEGMFP
Subjt:  ERMEQELSKRKRWPSDGSSSDSEDEGSRTRRRKGKKASTSHVDHKVDDKEDFDEFLEGMFP

TrEMBL top hitse value%identityAlignment
A0A1S3CFL8 protein FRA10AC11.45e-14284.29Show/hide
Query:  MASFGSLKNAIFEREEKKQQYQAHVRGLNAYDRHKKFIHDYVHFYGKNKSREEKFPIKTDQDTLREGYRFIRSEEDDMDTSWEQKLVKRYYDKLFKEYCI
        MASFGSLK AIFEREE+KQQYQAHVRGLNAYDRHKKF+HDYVHFYGKNK+ EEK PIKTDQDTLREGYRFIRSEEDDMDTSWEQKLVKRYYDKLFKEYCI
Subjt:  MASFGSLKNAIFEREEKKQQYQAHVRGLNAYDRHKKFIHDYVHFYGKNKSREEKFPIKTDQDTLREGYRFIRSEEDDMDTSWEQKLVKRYYDKLFKEYCI

Query:  ADMTHYKSGKIGLRWRTEKEVISGKGQFICGNKHCDEKDGLASYEASNFSYSEAGENKQALVKLVTCERSIYQLFLPSLTIWNRCSKKLHYKRKKEKEKQ
        ADMT YKSGKIGLRWR EKEVISGKGQFICGNKHCDEK+GLASYE  NFSY EAGENKQALVKLVTCER               CSKKLHYKRKKEKEK 
Subjt:  ADMTHYKSGKIGLRWRTEKEVISGKGQFICGNKHCDEKDGLASYEASNFSYSEAGENKQALVKLVTCERSIYQLFLPSLTIWNRCSKKLHYKRKKEKEKQ

Query:  ERMEQELSKRKRWPSDGSSSDSEDEGSRTRRRKGKKASTSHVDHKVDDKEDFDEFLEGMFP
        ER EQE+SKRKR PSD SS DSEDEGSRTRR KG KASTS  DHK D+KE+FDE+LEGMFP
Subjt:  ERMEQELSKRKRWPSDGSSSDSEDEGSRTRRRKGKKASTSHVDHKVDDKEDFDEFLEGMFP

A0A6J1DIS1 protein FRA10AC18.84e-16593.1Show/hide
Query:  MASFGSLKNAIFEREEKKQQYQAHVRGLNAYDRHKKFIHDYVHFYGKNKSREEKFPIKTDQDTLREGYRFIRSEEDDMDTSWEQKLVKRYYDKLFKEYCI
        MASFGSLKNAIFEREEKKQQYQAHVRGLNAYDRHKKFIHDYVHFYGKNKSREEKFPIKTDQDTLREGYRFIRSEEDDMDTSWEQKLVKRYYDKLFKEYCI
Subjt:  MASFGSLKNAIFEREEKKQQYQAHVRGLNAYDRHKKFIHDYVHFYGKNKSREEKFPIKTDQDTLREGYRFIRSEEDDMDTSWEQKLVKRYYDKLFKEYCI

Query:  ADMTHYKSGKIGLRWRTEKEVISGKGQFICGNKHCDEKDGLASYEASNFSYSEAGENKQALVKLVTCERSIYQLFLPSLTIWNRCSKKLHYKRKKEKEKQ
        ADMTHYKSGKIGLRWRTEKEVISGKGQFICGNKHCDEKDGLASYE  NFSYSEAGENKQALVKLVTCER               CSKKLHYKRKKEKEKQ
Subjt:  ADMTHYKSGKIGLRWRTEKEVISGKGQFICGNKHCDEKDGLASYEASNFSYSEAGENKQALVKLVTCERSIYQLFLPSLTIWNRCSKKLHYKRKKEKEKQ

Query:  ERMEQELSKRKRWPSDGSSSDSEDEGSRTRRRKGKKASTSHVDHKVDDKEDFDEFLEGMFP
        ERMEQELSKRKR PSDGSSSDSEDEGSRTRRRKGKKASTSHVDHKVDDKEDFDEFLEGMFP
Subjt:  ERMEQELSKRKRWPSDGSSSDSEDEGSRTRRRKGKKASTSHVDHKVDDKEDFDEFLEGMFP

A0A6J1HB92 protein FRA10AC1 isoform X12.45e-14886.21Show/hide
Query:  MASFGSLKNAIFEREEKKQQYQAHVRGLNAYDRHKKFIHDYVHFYGKNKSREEKFPIKTDQDTLREGYRFIRSEEDDMDTSWEQKLVKRYYDKLFKEYCI
        MASFGSLKNAIFEREE+KQQYQAHVRGLNAYDRHKKF+HDYVHFYGKNKSREE FPIKTDQDTLREGYRFIRSEEDDMDTSWEQKLVKRYYDKLFKEYCI
Subjt:  MASFGSLKNAIFEREEKKQQYQAHVRGLNAYDRHKKFIHDYVHFYGKNKSREEKFPIKTDQDTLREGYRFIRSEEDDMDTSWEQKLVKRYYDKLFKEYCI

Query:  ADMTHYKSGKIGLRWRTEKEVISGKGQFICGNKHCDEKDGLASYEASNFSYSEAGENKQALVKLVTCERSIYQLFLPSLTIWNRCSKKLHYKRKKEKEKQ
        ADMT YKSGKIGLRWR EKEV+SGKGQFICGNKHCDEKDGLASYE  NFSY EAGENKQALVKLVTCER               CSKKLHYKR KEKEK 
Subjt:  ADMTHYKSGKIGLRWRTEKEVISGKGQFICGNKHCDEKDGLASYEASNFSYSEAGENKQALVKLVTCERSIYQLFLPSLTIWNRCSKKLHYKRKKEKEKQ

Query:  ERMEQELSKRKRWPSDGSSSDSEDEGSRTRRRKGKKASTSHVDHKVDDKEDFDEFLEGMFP
        ERMEQE+SKRKR PSD SS DSED GSRTRRRKGKKASTS  D K DDKEDFDE+LEGMFP
Subjt:  ERMEQELSKRKRWPSDGSSSDSEDEGSRTRRRKGKKASTSHVDHKVDDKEDFDEFLEGMFP

A0A6J1KWV6 protein FRA10AC1 isoform X22.45e-14886.21Show/hide
Query:  MASFGSLKNAIFEREEKKQQYQAHVRGLNAYDRHKKFIHDYVHFYGKNKSREEKFPIKTDQDTLREGYRFIRSEEDDMDTSWEQKLVKRYYDKLFKEYCI
        MASFGSLKNAIFEREE+KQQYQAHVRGLNAYDRHKKF+HDYVHFYGKNKSREE FPIKTDQDTLREGYRFIRSEEDDMDTSWEQKLVKRYYDKLFKEYCI
Subjt:  MASFGSLKNAIFEREEKKQQYQAHVRGLNAYDRHKKFIHDYVHFYGKNKSREEKFPIKTDQDTLREGYRFIRSEEDDMDTSWEQKLVKRYYDKLFKEYCI

Query:  ADMTHYKSGKIGLRWRTEKEVISGKGQFICGNKHCDEKDGLASYEASNFSYSEAGENKQALVKLVTCERSIYQLFLPSLTIWNRCSKKLHYKRKKEKEKQ
        ADMT YKSGKIGLRWR EKEV+SGKGQFICGNKHCDEKDGLASYE  NFSY EAGENKQALVKLVTCER               CSKKLHYKR KEKEK 
Subjt:  ADMTHYKSGKIGLRWRTEKEVISGKGQFICGNKHCDEKDGLASYEASNFSYSEAGENKQALVKLVTCERSIYQLFLPSLTIWNRCSKKLHYKRKKEKEKQ

Query:  ERMEQELSKRKRWPSDGSSSDSEDEGSRTRRRKGKKASTSHVDHKVDDKEDFDEFLEGMFP
        ERMEQE+SKRKR PSD SS DSED GSRTRRRKGKKASTS  D K DDKEDFDE+LEGMFP
Subjt:  ERMEQELSKRKRWPSDGSSSDSEDEGSRTRRRKGKKASTSHVDHKVDDKEDFDEFLEGMFP

A0A6J1L051 protein FRA10AC1 isoform X11.88e-13286.75Show/hide
Query:  MASFGSLKNAIFEREEKKQQYQAHVRGLNAYDRHKKFIHDYVHFYGKNKSREEKFPIKTDQDTLREGYRFIRSEEDDMDTSWEQKLVKRYYDKLFKEYCI
        MASFGSLKNAIFEREE+KQQYQAHVRGLNAYDRHKKF+HDYVHFYGKNKSREE FPIKTDQDTLREGYRFIRSEEDDMDTSWEQKLVKRYYDKLFKEYCI
Subjt:  MASFGSLKNAIFEREEKKQQYQAHVRGLNAYDRHKKFIHDYVHFYGKNKSREEKFPIKTDQDTLREGYRFIRSEEDDMDTSWEQKLVKRYYDKLFKEYCI

Query:  ADMTHYKSGKIGLRWRTEKEVISGKGQFICGNKHCDEKDGLASYEASNFSYSEAGENKQALVKLVTCERSIYQLFLPSLTIWNRCSKKLHYKRKKEKEKQ
        ADMT YKSGKIGLRWR EKEV+SGKGQFICGNKHCDEKDGLASYE  NFSY EAGENKQALVKLVTCER               CSKKLHYKR KEKEK 
Subjt:  ADMTHYKSGKIGLRWRTEKEVISGKGQFICGNKHCDEKDGLASYEASNFSYSEAGENKQALVKLVTCERSIYQLFLPSLTIWNRCSKKLHYKRKKEKEKQ

Query:  ERMEQELSKRKRWPSDGSSSDSEDEGSRTRRRKG
        ERMEQE+SKRKR PSD SS DSED GSRTRRRKG
Subjt:  ERMEQELSKRKRWPSDGSSSDSEDEGSRTRRRKG

SwissProt top hitse value%identityAlignment
Q5FVF1 Protein FRA10AC1 homolog4.2e-4340.15Show/hide
Query:  IFEREEKKQQYQAHVRGLNAYDRHKKFIHDYVHFYGKNKSREEKF--PIKTDQDTLREGYRFIRSEEDDMDTSWEQKLVKRYYDKLFKEYCIADMTHYKS
        + +REE + + + H+  ++AY RH KF++DY+ +YG  +   ++     KTD D +RE +RF+ +EED+ D +WE++L K+YYDKLFKEYCIAD++ YK 
Subjt:  IFEREEKKQQYQAHVRGLNAYDRHKKFIHDYVHFYGKNKSREEKF--PIKTDQDTLREGYRFIRSEEDDMDTSWEQKLVKRYYDKLFKEYCIADMTHYKS

Query:  GKIGLRWRTEKEVISGKGQFICGNKHCDEKDGLASYEASNFSYSEAGENKQALVKLVTCERSIYQLFLPSLTIWNRCSKKLHYKRKKEKEKQERMEQELS
         K G RWR EKEVISGKGQF CGNK CDEK+GL S+E  NF Y+E GE + ALVKL  C+   ++L       ++   K++   +K+ K K E  ++   
Subjt:  GKIGLRWRTEKEVISGKGQFICGNKHCDEKDGLASYEASNFSYSEAGENKQALVKLVTCERSIYQLFLPSLTIWNRCSKKLHYKRKKEKEKQERMEQELS

Query:  KRKRWPSDGSSSDSEDEGSRTRRRK--------GKKASTSHVD------HKVDDK---EDFDEFLEGMF
        K  R  S   +S  +DEG  + +R         G++ S S  +       + D+K   E+FD++ + +F
Subjt:  KRKRWPSDGSSSDSEDEGSRTRRRK--------GKKASTSHVD------HKVDDK---EDFDEFLEGMF

Q70Z53 Protein FRA10AC11.4e-4341.18Show/hide
Query:  IFEREEKKQQYQAHVRGLNAYDRHKKFIHDYVHFYGKNKSREEKF--PIKTDQDTLREGYRFIRSEEDDMDTSWEQKLVKRYYDKLFKEYCIADMTHYKS
        + +REE + + + H+  ++AY RH KF++DY+ +YG  K   ++     KTD D +RE +RF+ +EED+MD +WE++L K+YYDKLFKEYCIAD++ YK 
Subjt:  IFEREEKKQQYQAHVRGLNAYDRHKKFIHDYVHFYGKNKSREEKF--PIKTDQDTLREGYRFIRSEEDDMDTSWEQKLVKRYYDKLFKEYCIADMTHYKS

Query:  GKIGLRWRTEKEVISGKGQFICGNKHCDEKDGLASYEASNFSYSEAGENKQALVKLVTCERSIYQLFLPSLTIWNRCSKKLHYKRKKEKEKQERME----
         K G RWR EKEVISGKGQF CGNK+CD+K+GL S+E  NF Y E GE + ALVKL  C+    +L       ++   K++  K++K+K K++  E    
Subjt:  GKIGLRWRTEKEVISGKGQFICGNKHCDEKDGLASYEASNFSYSEAGENKQALVKLVTCERSIYQLFLPSLTIWNRCSKKLHYKRKKEKEKQERME----

Query:  -------QELSKRKRWPSDGS---------SSDSEDEGSRTRRRKGKKASTSHVDHKVDDKEDFDEFLEGMF
               +E SK+K      S         +SD E+  S +   KG    T   D K   +E+FDE+ + +F
Subjt:  -------QELSKRKRWPSDGS---------SSDSEDEGSRTRRRKGKKASTSHVDHKVDDKEDFDEFLEGMF

Q8BP78 Protein FRA10AC1 homolog1.2e-4240.15Show/hide
Query:  IFEREEKKQQYQAHVRGLNAYDRHKKFIHDYVHFYGKNKSREEKF--PIKTDQDTLREGYRFIRSEEDDMDTSWEQKLVKRYYDKLFKEYCIADMTHYKS
        + +REE + + + H+  ++AY RH KF++DY+ +YG  +   ++     KTD D +RE +RF+ +EED+ D +WE++L K+YYDKLFKEYCIAD++ YK 
Subjt:  IFEREEKKQQYQAHVRGLNAYDRHKKFIHDYVHFYGKNKSREEKF--PIKTDQDTLREGYRFIRSEEDDMDTSWEQKLVKRYYDKLFKEYCIADMTHYKS

Query:  GKIGLRWRTEKEVISGKGQFICGNKHCDEKDGLASYEASNFSYSEAGENKQALVKLVTCERSIYQLFLPSLTIWNRCSKKLHYKRKKEKEKQERMEQELS
         K G RWR EKEVISGKGQF CGNK C+EK+GL S+E  NF Y+E GE + ALVKL  C+   ++L       ++   K++   +KK K   E  E    
Subjt:  GKIGLRWRTEKEVISGKGQFICGNKHCDEKDGLASYEASNFSYSEAGENKQALVKLVTCERSIYQLFLPSLTIWNRCSKKLHYKRKKEKEKQERMEQELS

Query:  KRKRWPS--------DGSSSDSEDEGSRTRRRKGK---------KASTSHVDHKVDDKEDFDEFLEGMF
        K +  PS        +G SS  + E SR R  + +         K      D K   +E+FD++ + +F
Subjt:  KRKRWPS--------DGSSSDSEDEGSRTRRRKGK---------KASTSHVDHKVDDKEDFDEFLEGMF

Arabidopsis top hitse value%identityAlignment
AT4G15030.1 CONTAINS InterPro DOMAIN/s: Folate-sensitive fragile site protein Fra10Ac1 (InterPro:IPR019129); Has 8455 Blast hits to 5700 proteins in 376 species: Archae - 6; Bacteria - 264; Metazoa - 3820; Fungi - 744; Plants - 645; Viruses - 76; Other Eukaryotes - 2900 (source: NCBI BLink).4.2e-8357.04Show/hide
Query:  MASFGSLKNAIFEREEKKQQYQAHVRGLNAYDRHKKFIHDYVHFYGKNKSREEKFPIKTDQDTLREGYRFIRSEEDDMDTSWEQKLVKRYYDKLFKEYCI
        M S  ++++ I++REE+KQQYQAH+RGLNAY+RHKKF+ DYV FYGK+K  E K P+KTDQDTLREGYRFIRSEEDD+D SWEQ+LVKRYYDKLFKEYCI
Subjt:  MASFGSLKNAIFEREEKKQQYQAHVRGLNAYDRHKKFIHDYVHFYGKNKSREEKFPIKTDQDTLREGYRFIRSEEDDMDTSWEQKLVKRYYDKLFKEYCI

Query:  ADMTHYKSGKIGLRWRTEKEVISGKGQFICGNKHCDEKDGLASYEASNFSYSEAGENKQALVKLVTCERSIYQLFLPSLTIWNRCSKKLHYKRKKEKEKQ
        ADM+ YK+GK+GLRWRTEKEV++GKGQF+CG+KHCDEK+GLASYE  NFSY EAGE+KQALVKLV CE               RC++KL+YK++KE E+ 
Subjt:  ADMTHYKSGKIGLRWRTEKEVISGKGQFICGNKHCDEKDGLASYEASNFSYSEAGENKQALVKLVTCERSIYQLFLPSLTIWNRCSKKLHYKRKKEKEKQ

Query:  ERMEQELSKRKRWPS-------------DGSSSDSEDEGSRTR---------------RRKGK--KASTSHVDHKVDDKEDFDEFLEGMFP
        E  E++  KRKR  S             +G  S+S ++  + R               RRKGK  K+     D +  D E+FDE++EGMFP
Subjt:  ERMEQELSKRKRWPS-------------DGSSSDSEDEGSRTR---------------RRKGK--KASTSHVDHKVDDKEDFDEFLEGMFP

AT4G15030.2 FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: cellular_component unknown; EXPRESSED IN: 23 plant structures; EXPRESSED DURING: 13 growth stages; CONTAINS InterPro DOMAIN/s: Folate-sensitive fragile site protein Fra10Ac1 (InterPro:IPR019129); Has 8498 Blast hits to 5699 proteins in 376 species: Archae - 6; Bacteria - 264; Metazoa - 3838; Fungi - 743; Plants - 650; Viruses - 76; Other Eukaryotes - 2921 (source: NCBI BLink).6.8e-8155.93Show/hide
Query:  MASFGSLKNAIFEREEKK----QQYQAHVRGLNAYDRHKKFIHDYVHFYGKNKSREEKFPIKTDQDTLREGYRFIRSEEDDMDTSWEQKLVKRYYDKLFK
        M S  ++++ I++REE+K    +QYQAH+RGLNAY+RHKKF+ DYV FYGK+K  E K P+KTDQDTLREGYRFIRSEEDD+D SWEQ+LVKRYYDKLFK
Subjt:  MASFGSLKNAIFEREEKK----QQYQAHVRGLNAYDRHKKFIHDYVHFYGKNKSREEKFPIKTDQDTLREGYRFIRSEEDDMDTSWEQKLVKRYYDKLFK

Query:  EYCIADMTHYKSGKIGLRWRTEKEVISGKGQFICGNKHCDEKDGLASYEASNFSYSEAGENKQALVKLVTCERSIYQLFLPSLTIWNRCSKKLHYKRKKE
        EYCIADM+ YK+GK+GLRWRTEKEV++GKGQF+CG+KHCDEK+GLASYE  NFSY EAGE+KQALVKLV CE               RC++KL+YK++KE
Subjt:  EYCIADMTHYKSGKIGLRWRTEKEVISGKGQFICGNKHCDEKDGLASYEASNFSYSEAGENKQALVKLVTCERSIYQLFLPSLTIWNRCSKKLHYKRKKE

Query:  KEKQERMEQELSKRKRWPS-------------DGSSSDSEDEGSRTR---------------RRKGK--KASTSHVDHKVDDKEDFDEFLEGMFP
         E+ E  E++  KRKR  S             +G  S+S ++  + R               RRKGK  K+     D +  D E+FDE++EGMFP
Subjt:  KEKQERMEQELSKRKRWPS-------------DGSSSDSEDEGSRTR---------------RRKGK--KASTSHVDHKVDDKEDFDEFLEGMFP


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGTCCTTCGGCTCTCTGAAGAATGCCATCTTCGAAAGAGAAGAGAAAAAACAACAGTACCAGGCTCATGTGCGAGGGCTCAATGCTTACGATCGCCACAAGAAGTT
TATACATGATTACGTTCATTTCTACGGGAAAAATAAATCTAGGGAAGAAAAATTTCCTATTAAGACCGATCAAGACACCTTGAGGGAGGGCTACCGGTTCATTCGTTCTG
AAGAAGATGACATGGATACATCATGGGAGCAAAAGCTGGTGAAGCGATATTATGATAAGCTTTTCAAAGAATACTGCATAGCTGATATGACACATTACAAAAGTGGGAAG
ATTGGTCTCAGGTGGCGTACTGAGAAGGAAGTAATATCTGGGAAAGGGCAGTTCATATGCGGTAACAAGCATTGTGATGAAAAGGATGGTTTAGCAAGCTATGAGGCAAG
TAACTTTTCTTATTCTGAAGCAGGAGAGAACAAGCAGGCCCTTGTAAAGCTGGTAACTTGTGAGAGATCTATTTACCAACTTTTTCTCCCATCTTTGACAATTTGGAATA
GATGCTCAAAGAAGCTTCACTACAAGAGAAAGAAAGAAAAGGAGAAACAGGAAAGGATGGAGCAAGAATTGTCCAAAAGGAAGAGGTGGCCAAGTGATGGCAGTTCTTCA
GACAGTGAGGATGAGGGGAGCAGAACGAGAAGAAGAAAAGGAAAGAAGGCTTCAACATCACATGTGGATCATAAAGTTGATGACAAAGAAGACTTTGATGAATTTCTGGA
GGGAATGTTTCCATAA
mRNA sequenceShow/hide mRNA sequence
CCGGGGACGGACCTTTATGAGTCACAAACAACATCACAGCGAATCGAATACCATTGCTTGGCGCGGCCATGGCGTCCTTCGGCTCTCTGAAGAATGCCATCTTCGAAAGA
GAAGAGAAAAAACAACAGTACCAGGCTCATGTGCGAGGGCTCAATGCTTACGATCGCCACAAGAAGTTTATACATGATTACGTTCATTTCTACGGGAAAAATAAATCTAG
GGAAGAAAAATTTCCTATTAAGACCGATCAAGACACCTTGAGGGAGGGCTACCGGTTCATTCGTTCTGAAGAAGATGACATGGATACATCATGGGAGCAAAAGCTGGTGA
AGCGATATTATGATAAGCTTTTCAAAGAATACTGCATAGCTGATATGACACATTACAAAAGTGGGAAGATTGGTCTCAGGTGGCGTACTGAGAAGGAAGTAATATCTGGG
AAAGGGCAGTTCATATGCGGTAACAAGCATTGTGATGAAAAGGATGGTTTAGCAAGCTATGAGGCAAGTAACTTTTCTTATTCTGAAGCAGGAGAGAACAAGCAGGCCCT
TGTAAAGCTGGTAACTTGTGAGAGATCTATTTACCAACTTTTTCTCCCATCTTTGACAATTTGGAATAGATGCTCAAAGAAGCTTCACTACAAGAGAAAGAAAGAAAAGG
AGAAACAGGAAAGGATGGAGCAAGAATTGTCCAAAAGGAAGAGGTGGCCAAGTGATGGCAGTTCTTCAGACAGTGAGGATGAGGGGAGCAGAACGAGAAGAAGAAAAGGA
AAGAAGGCTTCAACATCACATGTGGATCATAAAGTTGATGACAAAGAAGACTTTGATGAATTTCTGGAGGGAATGTTTCCATAATGTTGATGCCATTGGGAGACGACAAA
ATGCCTTTGTTTGTACGTATGCACTTTTTTTCTGGCAATTATTAATATCTTAACATAAAACATCATTTGGTGGTTTCCTTAATTGTTTGATTGTAGCCTTGGCATTATCT
ATATAGTCTCTTCTGTTCTATGAGCTGTGGAGCACTGGTTTTTGCTCATACCTTCCCCTTGGAGTGAAATAGGAAAATCGAAAAACCAACCATCTTGACTGTAACCGAAC
GAACGTTGGTTGGTCGATTTTGAATCATCGAGAACCCGAAAGTCTCCCATCTGTTTAGTGTACATGCACTTCCCTCCCTTCTGACCATCGTCTCTCCTCACCACCACCAA
CTGCCCCTTCCGACCTTTTGAACAATGTCAATGTCCAACGAATTTCGACTGATTGTGATTGCATTATATGAAAAGGCATAGTCAAGTTACCTGGCTATTAGGTTGGGGTG
TTAGAACATCAAATCCGGAGTTGCCATTTTAATTGGCGGGTACGAAACTTGTGTCTATCCTTTTGTTTTTAATATAGCGTTACAAACTTCATTGTATAGAACTATATAAT
AAAAAAATAATTGATGTGCGTGATCATTTATAGTTCTTTCTAACTTTTCATCTAGGTAGATATCCCGTTCTTTAAATGTTAAAAATATTACACAGTTGATGCAAATGTAA
ACGCACATTGTGGAAAAGATATAATCTGACCA
Protein sequenceShow/hide protein sequence
MASFGSLKNAIFEREEKKQQYQAHVRGLNAYDRHKKFIHDYVHFYGKNKSREEKFPIKTDQDTLREGYRFIRSEEDDMDTSWEQKLVKRYYDKLFKEYCIADMTHYKSGK
IGLRWRTEKEVISGKGQFICGNKHCDEKDGLASYEASNFSYSEAGENKQALVKLVTCERSIYQLFLPSLTIWNRCSKKLHYKRKKEKEKQERMEQELSKRKRWPSDGSSS
DSEDEGSRTRRRKGKKASTSHVDHKVDDKEDFDEFLEGMFP