; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10023126 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10023126
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Descriptionprotein FRA10AC1
Genome locationChr05:31426662..31433433
RNA-Seq ExpressionHG10023126
SyntenyHG10023126
Gene Ontology termsGO:0016311 - dephosphorylation (biological process)
GO:0016791 - phosphatase activity (molecular function)
InterPro domainsIPR019129 - Folate-sensitive fragile site protein Fra10Ac1


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004135932.1 protein FRA10AC1 isoform X2 [Cucumis sativus]2.0e-10076.01Show/hide
Query:  MASFGSLKNAIFEREERKQQYQAHVRGLNAYDRHKKFMHDYVHFYGKNKTGEEKFPIKTDQDTLREGYRFIRSEEDDMDTSWEQKL--------------
        MASFGSLKNAIFEREERKQQYQAHVRGLNAYDRHKKFMHDYVHFYGKNKTGEEK PIKTDQDTLREGYRFIRSEEDDMDTSWEQKL              
Subjt:  MASFGSLKNAIFEREERKQQYQAHVRGLNAYDRHKKFMHDYVHFYGKNKTGEEKFPIKTDQDTLREGYRFIRSEEDDMDTSWEQKL--------------

Query:  ----------IGLRWRTEKEVISGKGQFICGNKHCDEKDGLASYEAYLRAIRAEEKYNMDILSFWIGPIVQWILVNFSYFEAGENKQALVKLVTCERCSK
                  IGLRWR EKEV+SGKGQFICGNKHCDEK GLASYE                             VNFSYFEAGENKQALVKLVTC RCSK
Subjt:  ----------IGLRWRTEKEVISGKGQFICGNKHCDEKDGLASYEAYLRAIRAEEKYNMDILSFWIGPIVQWILVNFSYFEAGENKQALVKLVTCERCSK

Query:  KLHYKRKKEKEKQERKEQEMSKRKRSSDDNSDSEDEGSRTRRKGKKASTSFGDHKADDKEDFDEYLEGMFP
        KLHYKR+KEKEK ERKEQEMSKRKR SDD+SD+EDEGSRTRRKGKKASTSFGDHKAD KE+FDEYLEGMFP
Subjt:  KLHYKRKKEKEKQERKEQEMSKRKRSSDDNSDSEDEGSRTRRKGKKASTSFGDHKADDKEDFDEYLEGMFP

XP_008461295.1 PREDICTED: protein FRA10AC1 [Cucumis melo]6.8e-10176.75Show/hide
Query:  MASFGSLKNAIFEREERKQQYQAHVRGLNAYDRHKKFMHDYVHFYGKNKTGEEKFPIKTDQDTLREGYRFIRSEEDDMDTSWEQKL--------------
        MASFGSLK AIFEREERKQQYQAHVRGLNAYDRHKKFMHDYVHFYGKNKTGEEK PIKTDQDTLREGYRFIRSEEDDMDTSWEQKL              
Subjt:  MASFGSLKNAIFEREERKQQYQAHVRGLNAYDRHKKFMHDYVHFYGKNKTGEEKFPIKTDQDTLREGYRFIRSEEDDMDTSWEQKL--------------

Query:  ----------IGLRWRTEKEVISGKGQFICGNKHCDEKDGLASYEAYLRAIRAEEKYNMDILSFWIGPIVQWILVNFSYFEAGENKQALVKLVTCERCSK
                  IGLRWR EKEVISGKGQFICGNKHCDEK+GLASYE                             VNFSYFEAGENKQALVKLVTCERCSK
Subjt:  ----------IGLRWRTEKEVISGKGQFICGNKHCDEKDGLASYEAYLRAIRAEEKYNMDILSFWIGPIVQWILVNFSYFEAGENKQALVKLVTCERCSK

Query:  KLHYKRKKEKEKQERKEQEMSKRKRSSDDNSDSEDEGSRTRRKGKKASTSFGDHKADDKEDFDEYLEGMFP
        KLHYKRKKEKEK ERKEQEMSKRKR SDD+SDSEDEGSRTRRKG KASTSFGDHKAD+KE+FDEYLEGMFP
Subjt:  KLHYKRKKEKEKQERKEQEMSKRKRSSDDNSDSEDEGSRTRRKGKKASTSFGDHKADDKEDFDEYLEGMFP

XP_031745033.1 protein FRA10AC1 isoform X1 [Cucumis sativus]3.2e-9873.57Show/hide
Query:  MASFGSLKNAIFEREERKQQYQAHVRGLNAYDRHKKFMHDYVHFYGKNKTGEEKFPIKTDQDTLREGYRFIRSEEDDMDTSWEQKL--------------
        MASFGSLKNAIFEREERKQQYQAHVRGLNAYDRHKKFMHDYVHFYGKNKTGEEK PIKTDQDTLREGYRFIRSEEDDMDTSWEQKL              
Subjt:  MASFGSLKNAIFEREERKQQYQAHVRGLNAYDRHKKFMHDYVHFYGKNKTGEEKFPIKTDQDTLREGYRFIRSEEDDMDTSWEQKL--------------

Query:  ----------IGLRWRTEKEVISGKGQFICGNKHCDEKDGLASYEAYLRAIRAEEKYNMDILSFWIGPIVQWILVNFSYFEAGENKQALVKLVTC-----
                  IGLRWR EKEV+SGKGQFICGNKHCDEK GLASYE                             VNFSYFEAGENKQALVKLVTC     
Subjt:  ----------IGLRWRTEKEVISGKGQFICGNKHCDEKDGLASYEAYLRAIRAEEKYNMDILSFWIGPIVQWILVNFSYFEAGENKQALVKLVTC-----

Query:  ----ERCSKKLHYKRKKEKEKQERKEQEMSKRKRSSDDNSDSEDEGSRTRRKGKKASTSFGDHKADDKEDFDEYLEGMFP
             RCSKKLHYKR+KEKEK ERKEQEMSKRKR SDD+SD+EDEGSRTRRKGKKASTSFGDHKAD KE+FDEYLEGMFP
Subjt:  ----ERCSKKLHYKRKKEKEKQERKEQEMSKRKRSSDDNSDSEDEGSRTRRKGKKASTSFGDHKADDKEDFDEYLEGMFP

XP_031745034.1 protein FRA10AC1 isoform X3 [Cucumis sativus]7.1e-9876.81Show/hide
Query:  MASFGSLKNAIFEREERKQQYQAHVRGLNAYDRHKKFMHDYVHFYGKNKTGEEKFPIKTDQDTLREGYRFIRSEEDDMDTSWEQKLIG-------LRWRT
        MASFGSLKNAIFEREERKQQYQAHVRGLNAYDRHKKFMHDYVHFYGKNKTGEEK PIKTDQDTLREGYRFIRSEEDDMDTSWEQKL+          WR 
Subjt:  MASFGSLKNAIFEREERKQQYQAHVRGLNAYDRHKKFMHDYVHFYGKNKTGEEKFPIKTDQDTLREGYRFIRSEEDDMDTSWEQKLIG-------LRWRT

Query:  EKEVISGKGQFICGNKHCDEKDGLASYEAYLRAIRAEEKYNMDILSFWIGPIVQWILVNFSYFEAGENKQALVKLVTC---------ERCSKKLHYKRKK
        EKEV+SGKGQFICGNKHCDEK GLASYE                             VNFSYFEAGENKQALVKLVTC          RCSKKLHYKR+K
Subjt:  EKEVISGKGQFICGNKHCDEKDGLASYEAYLRAIRAEEKYNMDILSFWIGPIVQWILVNFSYFEAGENKQALVKLVTC---------ERCSKKLHYKRKK

Query:  EKEKQERKEQEMSKRKRSSDDNSDSEDEGSRTRRKGKKASTSFGDHKADDKEDFDEYLEGMFP
        EKEK ERKEQEMSKRKR SDD+SD+EDEGSRTRRKGKKASTSFGDHKAD KE+FDEYLEGMFP
Subjt:  EKEKQERKEQEMSKRKRSSDDNSDSEDEGSRTRRKGKKASTSFGDHKADDKEDFDEYLEGMFP

XP_038898608.1 protein FRA10AC1 isoform X1 [Benincasa hispida]1.7e-10478.6Show/hide
Query:  MASFGSLKNAIFEREERKQQYQAHVRGLNAYDRHKKFMHDYVHFYGKNKTGEEKFPIKTDQDTLREGYRFIRSEEDDMDTSWEQKL--------------
        MASFGSLKNAIFEREERKQQYQAHVRGLNAYDRHKKFMHDYVHFYGKNKTG EKFPIKTDQDTLREGYRFIRSEEDDMDTSWEQKL              
Subjt:  MASFGSLKNAIFEREERKQQYQAHVRGLNAYDRHKKFMHDYVHFYGKNKTGEEKFPIKTDQDTLREGYRFIRSEEDDMDTSWEQKL--------------

Query:  ----------IGLRWRTEKEVISGKGQFICGNKHCDEKDGLASYEAYLRAIRAEEKYNMDILSFWIGPIVQWILVNFSYFEAGENKQALVKLVTCERCSK
                  IGLRWRTEKEVISGKGQFICGNKHCDEKDGLASYE                             VNFSYFEAGENKQALVKLVTCERC K
Subjt:  ----------IGLRWRTEKEVISGKGQFICGNKHCDEKDGLASYEAYLRAIRAEEKYNMDILSFWIGPIVQWILVNFSYFEAGENKQALVKLVTCERCSK

Query:  KLHYKRKKEKEKQERKEQEMSKRKRSSDDNSDSEDEGSRTRRKGKKASTSFGDHKADDKEDFDEYLEGMFP
        KLHYKRKKEKEKQERKEQ+MSKRKRSSDDNSDSEDEGSRTRRKGKKASTS+GDHKA DKEDFDEYLEGMFP
Subjt:  KLHYKRKKEKEKQERKEQEMSKRKRSSDDNSDSEDEGSRTRRKGKKASTSFGDHKADDKEDFDEYLEGMFP

TrEMBL top hitse value%identityAlignment
A0A1S3CFL8 protein FRA10AC13.3e-10176.75Show/hide
Query:  MASFGSLKNAIFEREERKQQYQAHVRGLNAYDRHKKFMHDYVHFYGKNKTGEEKFPIKTDQDTLREGYRFIRSEEDDMDTSWEQKL--------------
        MASFGSLK AIFEREERKQQYQAHVRGLNAYDRHKKFMHDYVHFYGKNKTGEEK PIKTDQDTLREGYRFIRSEEDDMDTSWEQKL              
Subjt:  MASFGSLKNAIFEREERKQQYQAHVRGLNAYDRHKKFMHDYVHFYGKNKTGEEKFPIKTDQDTLREGYRFIRSEEDDMDTSWEQKL--------------

Query:  ----------IGLRWRTEKEVISGKGQFICGNKHCDEKDGLASYEAYLRAIRAEEKYNMDILSFWIGPIVQWILVNFSYFEAGENKQALVKLVTCERCSK
                  IGLRWR EKEVISGKGQFICGNKHCDEK+GLASYE                             VNFSYFEAGENKQALVKLVTCERCSK
Subjt:  ----------IGLRWRTEKEVISGKGQFICGNKHCDEKDGLASYEAYLRAIRAEEKYNMDILSFWIGPIVQWILVNFSYFEAGENKQALVKLVTCERCSK

Query:  KLHYKRKKEKEKQERKEQEMSKRKRSSDDNSDSEDEGSRTRRKGKKASTSFGDHKADDKEDFDEYLEGMFP
        KLHYKRKKEKEK ERKEQEMSKRKR SDD+SDSEDEGSRTRRKG KASTSFGDHKAD+KE+FDEYLEGMFP
Subjt:  KLHYKRKKEKEKQERKEQEMSKRKRSSDDNSDSEDEGSRTRRKGKKASTSFGDHKADDKEDFDEYLEGMFP

A0A5A7UTX2 Protein FRA10AC18.2e-9275.1Show/hide
Query:  QQYQAHVRGLNAYDRHKKFMHDYVHFYGKNKTGEEKFPIKTDQDTLREGYRFIRSEEDDMDTSWEQKL------------------------IGLRWRTE
        +QYQAHVRGLNAYDRHKKFMHDYVHFYGKNKTGEEK PIKTDQDTLREGYRFIRSEEDDMDTSWEQKL                        IGLRWR E
Subjt:  QQYQAHVRGLNAYDRHKKFMHDYVHFYGKNKTGEEKFPIKTDQDTLREGYRFIRSEEDDMDTSWEQKL------------------------IGLRWRTE

Query:  KEVISGKGQFICGNKHCDEKDGLASYEAYLRAIRAEEKYNMDILSFWIGPIVQWILVNFSYFEAGENKQALVKLVTCERCSKKLHYKRKKEKEKQERKEQ
        KEVISGKGQFICGNKHCDEK+GLASYE                             VNFSYFEAGENKQALVKLVTCERCSKKLHYKRKKEKEK ERKEQ
Subjt:  KEVISGKGQFICGNKHCDEKDGLASYEAYLRAIRAEEKYNMDILSFWIGPIVQWILVNFSYFEAGENKQALVKLVTCERCSKKLHYKRKKEKEKQERKEQ

Query:  EMSKRKRSSDDNSDSEDEGSRTRRKGKKASTSFGDHKADDKEDFDEYLEGMFP
        EMSKRKR SDD+SDSEDEGSRTRRKG KASTSFGDHKAD+KE+FDEYLEGMFP
Subjt:  EMSKRKRSSDDNSDSEDEGSRTRRKGKKASTSFGDHKADDKEDFDEYLEGMFP

A0A6J1DIS1 protein FRA10AC13.2e-9674.73Show/hide
Query:  MASFGSLKNAIFEREERKQQYQAHVRGLNAYDRHKKFMHDYVHFYGKNKTGEEKFPIKTDQDTLREGYRFIRSEEDDMDTSWEQKL--------------
        MASFGSLKNAIFEREE+KQQYQAHVRGLNAYDRHKKF+HDYVHFYGKNK+ EEKFPIKTDQDTLREGYRFIRSEEDDMDTSWEQKL              
Subjt:  MASFGSLKNAIFEREERKQQYQAHVRGLNAYDRHKKFMHDYVHFYGKNKTGEEKFPIKTDQDTLREGYRFIRSEEDDMDTSWEQKL--------------

Query:  ----------IGLRWRTEKEVISGKGQFICGNKHCDEKDGLASYEAYLRAIRAEEKYNMDILSFWIGPIVQWILVNFSYFEAGENKQALVKLVTCERCSK
                  IGLRWRTEKEVISGKGQFICGNKHCDEKDGLASYE                             VNFSY EAGENKQALVKLVTCERCSK
Subjt:  ----------IGLRWRTEKEVISGKGQFICGNKHCDEKDGLASYEAYLRAIRAEEKYNMDILSFWIGPIVQWILVNFSYFEAGENKQALVKLVTCERCSK

Query:  KLHYKRKKEKEKQERKEQEMSKRKRSSD-DNSDSEDEGSRT-RRKGKKASTSFGDHKADDKEDFDEYLEGMFP
        KLHYKRKKEKEKQER EQE+SKRKR SD  +SDSEDEGSRT RRKGKKASTS  DHK DDKEDFDE+LEGMFP
Subjt:  KLHYKRKKEKEKQERKEQEMSKRKRSSD-DNSDSEDEGSRT-RRKGKKASTSFGDHKADDKEDFDEYLEGMFP

A0A6J1HB92 protein FRA10AC1 isoform X11.4e-9675Show/hide
Query:  MASFGSLKNAIFEREERKQQYQAHVRGLNAYDRHKKFMHDYVHFYGKNKTGEEKFPIKTDQDTLREGYRFIRSEEDDMDTSWEQKL--------------
        MASFGSLKNAIFEREERKQQYQAHVRGLNAYDRHKKFMHDYVHFYGKNK+ EE FPIKTDQDTLREGYRFIRSEEDDMDTSWEQKL              
Subjt:  MASFGSLKNAIFEREERKQQYQAHVRGLNAYDRHKKFMHDYVHFYGKNKTGEEKFPIKTDQDTLREGYRFIRSEEDDMDTSWEQKL--------------

Query:  ----------IGLRWRTEKEVISGKGQFICGNKHCDEKDGLASYEAYLRAIRAEEKYNMDILSFWIGPIVQWILVNFSYFEAGENKQALVKLVTCERCSK
                  IGLRWR EKEV+SGKGQFICGNKHCDEKDGLASYE                             VNFSYFEAGENKQALVKLVTCERCSK
Subjt:  ----------IGLRWRTEKEVISGKGQFICGNKHCDEKDGLASYEAYLRAIRAEEKYNMDILSFWIGPIVQWILVNFSYFEAGENKQALVKLVTCERCSK

Query:  KLHYKRKKEKEKQERKEQEMSKRKRSSDDNSDSEDEGSRT-RRKGKKASTSFGDHKADDKEDFDEYLEGMFP
        KLHYKR KEKEK ER EQEMSKRKR SDD+SDSED GSRT RRKGKKASTS  D KADDKEDFDEYLEGMFP
Subjt:  KLHYKRKKEKEKQERKEQEMSKRKRSSDDNSDSEDEGSRT-RRKGKKASTSFGDHKADDKEDFDEYLEGMFP

A0A6J1KWV6 protein FRA10AC1 isoform X24.2e-9674.63Show/hide
Query:  MASFGSLKNAIFEREERKQQYQAHVRGLNAYDRHKKFMHDYVHFYGKNKTGEEKFPIKTDQDTLREGYRFIRSEEDDMDTSWEQKL--------------
        MASFGSLKNAIFEREERKQQYQAHVRGLNAYDRHKKFMHDYVHFYGKNK+ EE FPIKTDQDTLREGYRFIRSEEDDMDTSWEQKL              
Subjt:  MASFGSLKNAIFEREERKQQYQAHVRGLNAYDRHKKFMHDYVHFYGKNKTGEEKFPIKTDQDTLREGYRFIRSEEDDMDTSWEQKL--------------

Query:  ----------IGLRWRTEKEVISGKGQFICGNKHCDEKDGLASYEAYLRAIRAEEKYNMDILSFWIGPIVQWILVNFSYFEAGENKQALVKLVTCERCSK
                  IGLRWR EKEV+SGKGQFICGNKHCDEKDGLASYE                             VNFSYFEAGENKQALVKLVTCERCSK
Subjt:  ----------IGLRWRTEKEVISGKGQFICGNKHCDEKDGLASYEAYLRAIRAEEKYNMDILSFWIGPIVQWILVNFSYFEAGENKQALVKLVTCERCSK

Query:  KLHYKRKKEKEKQERKEQEMSKRKRSSDDNSDSEDEGSRT-RRKGKKASTSFGDHKADDKEDFDEYLEGMFP
        KLHYKR KEKEK ER EQEMSKRKR SDD+SDSED GSRT RRKGKKASTS  D K DDKEDFDEYLEGMFP
Subjt:  KLHYKRKKEKEKQERKEQEMSKRKRSSDDNSDSEDEGSRT-RRKGKKASTSFGDHKADDKEDFDEYLEGMFP

SwissProt top hitse value%identityAlignment
Q5FVF1 Protein FRA10AC1 homolog5.2e-2733.79Show/hide
Query:  IFEREERKQQYQAHVRGLNAYDRHKKFMHDYVHFYGKNKTGEEKF--PIKTDQDTLREGYRFIRSEEDDMDTSWEQKL----------------------
        + +REE + + + H+  ++AY RH KF++DY+ +YG  +   ++     KTD D +RE +RF+ +EED+ D +WE++L                      
Subjt:  IFEREERKQQYQAHVRGLNAYDRHKKFMHDYVHFYGKNKTGEEKF--PIKTDQDTLREGYRFIRSEEDDMDTSWEQKL----------------------

Query:  --IGLRWRTEKEVISGKGQFICGNKHCDEKDGLASYEAYLRAIRAEEKYNMDILSFWIGPIVQWILVNFSYFEAGENKQALVKLVTCERCSKKL--HYKR
           G RWR EKEVISGKGQF CGNK CDEK+GL S+E                             VNF Y E GE + ALVKL  C+ CS KL  H++R
Subjt:  --IGLRWRTEKEVISGKGQFICGNKHCDEKDGLASYEAYLRAIRAEEKYNMDILSFWIGPIVQWILVNFSYFEAGENKQALVKLVTCERCSKKL--HYKR

Query:  KKEKEKQERKEQE--------MSKRKRSSDDNSDSEDEG---------SRTRRKGKKASTSFGD------HKADDK---EDFDEYLEGMF
        K+ K  ++R + +         + R  SS++ S  +DEG         SR R  G++ S S  +       + D+K   E+FD+Y + +F
Subjt:  KKEKEKQERKEQE--------MSKRKRSSDDNSDSEDEG---------SRTRRKGKKASTSFGD------HKADDK---EDFDEYLEGMF

Q70Z53 Protein FRA10AC11.0e-2733.1Show/hide
Query:  IFEREERKQQYQAHVRGLNAYDRHKKFMHDYVHFYGKNKTGEEKF--PIKTDQDTLREGYRFIRSEEDDMDTSWEQKL----------------------
        + +REE + + + H+  ++AY RH KF++DY+ +YG  K   ++     KTD D +RE +RF+ +EED+MD +WE++L                      
Subjt:  IFEREERKQQYQAHVRGLNAYDRHKKFMHDYVHFYGKNKTGEEKF--PIKTDQDTLREGYRFIRSEEDDMDTSWEQKL----------------------

Query:  --IGLRWRTEKEVISGKGQFICGNKHCDEKDGLASYEAYLRAIRAEEKYNMDILSFWIGPIVQWILVNFSYFEAGENKQALVKLVTCERCSKKLHY----
           G RWR EKEVISGKGQF CGNK+CD+K+GL S+E                             VNF Y E GE + ALVKL  C+ CS KL++    
Subjt:  --IGLRWRTEKEVISGKGQFICGNKHCDEKDGLASYEAYLRAIRAEEKYNMDILSFWIGPIVQWILVNFSYFEAGENKQALVKLVTCERCSKKLHY----

Query:  ---KRKKEKEKQERKEQEMSKRK---RSSDDNSDSEDEGSRTRRKGKKASTSFGDH---------------KADDK---EDFDEYLEGMF
           K KK K+K ++  +E S +K    S+++ S  +D+G  + +K + +     D                + D+K   E+FDEY + +F
Subjt:  ---KRKKEKEKQERKEQEMSKRK---RSSDDNSDSEDEGSRTRRKGKKASTSFGDH---------------KADDK---EDFDEYLEGMF

Q8BP78 Protein FRA10AC1 homolog3.7e-2533.79Show/hide
Query:  IFEREERKQQYQAHVRGLNAYDRHKKFMHDYVHFYGKNKTGEEKF--PIKTDQDTLREGYRFIRSEEDDMDTSWEQKL----------------------
        + +REE + + + H+  ++AY RH KF++DY+ +YG  +   ++     KTD D +RE +RF+ +EED+ D +WE++L                      
Subjt:  IFEREERKQQYQAHVRGLNAYDRHKKFMHDYVHFYGKNKTGEEKF--PIKTDQDTLREGYRFIRSEEDDMDTSWEQKL----------------------

Query:  --IGLRWRTEKEVISGKGQFICGNKHCDEKDGLASYEAYLRAIRAEEKYNMDILSFWIGPIVQWILVNFSYFEAGENKQALVKLVTCERCSKKL--HYKR
           G RWR EKEVISGKGQF CGNK C+EK+GL S+E                             VNF Y E GE + ALVKL  C+ CS KL  H++R
Subjt:  --IGLRWRTEKEVISGKGQFICGNKHCDEKDGLASYEAYLRAIRAEEKYNMDILSFWIGPIVQWILVNFSYFEAGENKQALVKLVTCERCSKKL--HYKR

Query:  KK----EKEKQERKEQEMSKRKRS----SDDNSDSEDEG---------SRTRRKGKKASTSFGD------HKADDK---EDFDEYLEGMF
        K+    +K+ +   E + S RK+S    S++ S  +DEG         SR R   ++ S S  +       + D+K   E+FD+Y + +F
Subjt:  KK----EKEKQERKEQEMSKRKRS----SDDNSDSEDEG---------SRTRRKGKKASTSFGD------HKADDK---EDFDEYLEGMF

Arabidopsis top hitse value%identityAlignment
AT4G15030.1 CONTAINS InterPro DOMAIN/s: Folate-sensitive fragile site protein Fra10Ac1 (InterPro:IPR019129); Has 8455 Blast hits to 5700 proteins in 376 species: Archae - 6; Bacteria - 264; Metazoa - 3820; Fungi - 744; Plants - 645; Viruses - 76; Other Eukaryotes - 2900 (source: NCBI BLink).8.1e-6850Show/hide
Query:  MASFGSLKNAIFEREERKQQYQAHVRGLNAYDRHKKFMHDYVHFYGKNKTGEEKFPIKTDQDTLREGYRFIRSEEDDMDTSWEQKLI-------------
        M S  ++++ I++REERKQQYQAH+RGLNAY+RHKKF+ DYV FYGK+K  E K P+KTDQDTLREGYRFIRSEEDD+D SWEQ+L+             
Subjt:  MASFGSLKNAIFEREERKQQYQAHVRGLNAYDRHKKFMHDYVHFYGKNKTGEEKFPIKTDQDTLREGYRFIRSEEDDMDTSWEQKLI-------------

Query:  -----------GLRWRTEKEVISGKGQFICGNKHCDEKDGLASYEAYLRAIRAEEKYNMDILSFWIGPIVQWILVNFSYFEAGENKQALVKLVTCERCSK
                   GLRWRTEKEV++GKGQF+CG+KHCDEK+GLASYE                             VNFSY EAGE+KQALVKLV CERC++
Subjt:  -----------GLRWRTEKEVISGKGQFICGNKHCDEKDGLASYEAYLRAIRAEEKYNMDILSFWIGPIVQWILVNFSYFEAGENKQALVKLVTCERCSK

Query:  KLHYKRKKEKEKQERKEQEMSKRKRS---SDDNSDSED---EGSRT-------------------------RRKGK--KASTSFGDHKADDKEDFDEYLE
        KL+YK++KE E+ E KE++  KRKRS   S+D++D E+   EG R+                         RRKGK  K+     D +  D E+FDEY+E
Subjt:  KLHYKRKKEKEKQERKEQEMSKRKRS---SDDNSDSED---EGSRT-------------------------RRKGK--KASTSFGDHKADDKEDFDEYLE

Query:  GMFP
        GMFP
Subjt:  GMFP

AT4G15030.2 FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: cellular_component unknown; EXPRESSED IN: 23 plant structures; EXPRESSED DURING: 13 growth stages; CONTAINS InterPro DOMAIN/s: Folate-sensitive fragile site protein Fra10Ac1 (InterPro:IPR019129); Has 8498 Blast hits to 5699 proteins in 376 species: Archae - 6; Bacteria - 264; Metazoa - 3838; Fungi - 743; Plants - 650; Viruses - 76; Other Eukaryotes - 2921 (source: NCBI BLink).1.3e-6549.03Show/hide
Query:  MASFGSLKNAIFEREERK----QQYQAHVRGLNAYDRHKKFMHDYVHFYGKNKTGEEKFPIKTDQDTLREGYRFIRSEEDDMDTSWEQKLI---------
        M S  ++++ I++REERK    +QYQAH+RGLNAY+RHKKF+ DYV FYGK+K  E K P+KTDQDTLREGYRFIRSEEDD+D SWEQ+L+         
Subjt:  MASFGSLKNAIFEREERK----QQYQAHVRGLNAYDRHKKFMHDYVHFYGKNKTGEEKFPIKTDQDTLREGYRFIRSEEDDMDTSWEQKLI---------

Query:  ---------------GLRWRTEKEVISGKGQFICGNKHCDEKDGLASYEAYLRAIRAEEKYNMDILSFWIGPIVQWILVNFSYFEAGENKQALVKLVTCE
                       GLRWRTEKEV++GKGQF+CG+KHCDEK+GLASYE                             VNFSY EAGE+KQALVKLV CE
Subjt:  ---------------GLRWRTEKEVISGKGQFICGNKHCDEKDGLASYEAYLRAIRAEEKYNMDILSFWIGPIVQWILVNFSYFEAGENKQALVKLVTCE

Query:  RCSKKLHYKRKKEKEKQERKEQEMSKRKRS---SDDNSDSED---EGSRT-------------------------RRKGK--KASTSFGDHKADDKEDFD
        RC++KL+YK++KE E+ E KE++  KRKRS   S+D++D E+   EG R+                         RRKGK  K+     D +  D E+FD
Subjt:  RCSKKLHYKRKKEKEKQERKEQEMSKRKRS---SDDNSDSED---EGSRT-------------------------RRKGK--KASTSFGDHKADDKEDFD

Query:  EYLEGMFP
        EY+EGMFP
Subjt:  EYLEGMFP


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGTCGTTCGGCTCTCTGAAGAATGCCATCTTCGAAAGAGAAGAGAGAAAACAACAGTACCAGGCTCATGTGCGAGGGCTCAATGCTTATGATCGCCACAAGAAGTT
TATGCATGATTATGTTCATTTCTACGGGAAAAATAAAACTGGGGAAGAAAAATTTCCTATCAAGACTGATCAAGACACCCTGAGGGAAGGCTACCGGTTCATTCGTTCGG
AAGAAGATGACATGGATACATCATGGGAGCAGAAGCTGATTGGTCTCAGGTGGCGTACAGAGAAGGAAGTAATATCTGGGAAAGGGCAGTTCATATGTGGAAACAAGCAT
TGTGATGAGAAGGATGGTTTAGCAAGCTATGAGGCTTATTTGAGGGCAATCAGAGCAGAGGAGAAATATAACATGGATATATTAAGTTTTTGGATTGGACCCATTGTGCA
GTGGATTCTGGTGAACTTTTCTTATTTTGAAGCAGGAGAAAACAAACAAGCCCTTGTAAAGCTGGTAACTTGTGAGAGATGCTCAAAGAAACTTCACTACAAGAGAAAGA
AAGAAAAGGAGAAACAGGAAAGGAAGGAGCAAGAAATGTCCAAAAGGAAGAGGTCAAGTGATGACAATTCAGATAGTGAGGATGAAGGGAGCAGAACGAGGAGAAAAGGA
AAGAAGGCCTCAACATCATTTGGGGATCATAAAGCTGATGACAAAGAAGACTTTGATGAATATTTGGAGGGAATGTTTCCATAA
mRNA sequenceShow/hide mRNA sequence
ATGGCGTCGTTCGGCTCTCTGAAGAATGCCATCTTCGAAAGAGAAGAGAGAAAACAACAGTACCAGGCTCATGTGCGAGGGCTCAATGCTTATGATCGCCACAAGAAGTT
TATGCATGATTATGTTCATTTCTACGGGAAAAATAAAACTGGGGAAGAAAAATTTCCTATCAAGACTGATCAAGACACCCTGAGGGAAGGCTACCGGTTCATTCGTTCGG
AAGAAGATGACATGGATACATCATGGGAGCAGAAGCTGATTGGTCTCAGGTGGCGTACAGAGAAGGAAGTAATATCTGGGAAAGGGCAGTTCATATGTGGAAACAAGCAT
TGTGATGAGAAGGATGGTTTAGCAAGCTATGAGGCTTATTTGAGGGCAATCAGAGCAGAGGAGAAATATAACATGGATATATTAAGTTTTTGGATTGGACCCATTGTGCA
GTGGATTCTGGTGAACTTTTCTTATTTTGAAGCAGGAGAAAACAAACAAGCCCTTGTAAAGCTGGTAACTTGTGAGAGATGCTCAAAGAAACTTCACTACAAGAGAAAGA
AAGAAAAGGAGAAACAGGAAAGGAAGGAGCAAGAAATGTCCAAAAGGAAGAGGTCAAGTGATGACAATTCAGATAGTGAGGATGAAGGGAGCAGAACGAGGAGAAAAGGA
AAGAAGGCCTCAACATCATTTGGGGATCATAAAGCTGATGACAAAGAAGACTTTGATGAATATTTGGAGGGAATGTTTCCATAA
Protein sequenceShow/hide protein sequence
MASFGSLKNAIFEREERKQQYQAHVRGLNAYDRHKKFMHDYVHFYGKNKTGEEKFPIKTDQDTLREGYRFIRSEEDDMDTSWEQKLIGLRWRTEKEVISGKGQFICGNKH
CDEKDGLASYEAYLRAIRAEEKYNMDILSFWIGPIVQWILVNFSYFEAGENKQALVKLVTCERCSKKLHYKRKKEKEKQERKEQEMSKRKRSSDDNSDSEDEGSRTRRKG
KKASTSFGDHKADDKEDFDEYLEGMFP