; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0021472 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0021472
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionNucleic acid-binding proteins superfamily isoform 1
Genome locationchr7:8108561..8118359
RNA-Seq ExpressionLag0021472
SyntenyLag0021472
Gene Ontology termsNA
InterPro domainsIPR035200 - Cell division control protein 24, OB domain 2
IPR035201 - Cell division control protein 24, OB domain 1
IPR035203 - Cell division control protein 24, OB domain 3


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0049545.1 Nucleic acid-binding proteins superfamily isoform 1 [Cucumis melo var. makuwa]3.6e-24873.31Show/hide
Query:  MSSRGRHFN---AGGSSAMELDDRRRLQEEDDDDPFLKFVDYARSVLAFEDEEDYDPNVNGTETNTPGWSWIASRVLRTCIAYSSSVTPAILLSELSQCL
        MSS  +HFN   AG  SAMELDD R+LQEE DDDPFLKFVDYARSVLAFED+ED+DPNVNGTET+TPGWSWIASRVLRTCIAYSSSVTPAILLSELSQ  
Subjt:  MSSRGRHFN---AGGSSAMELDDRRRLQEEDDDDPFLKFVDYARSVLAFEDEEDYDPNVNGTETNTPGWSWIASRVLRTCIAYSSSVTPAILLSELSQCL

Query:  ISQEAVSTVHHAIIQLRNDRLRPKLHILDWEQEFHRGQRISPDLDGQDKAQLLTDIVQIHDFVLKAWYEQHRVGAPKKIPECINQLKKKNRRKKLPKTAT
                                                                         AWYEQHRVGAPKKIPECINQLKKKNRRKKLPKT T
Subjt:  ISQEAVSTVHHAIIQLRNDRLRPKLHILDWEQEFHRGQRISPDLDGQDKAQLLTDIVQIHDFVLKAWYEQHRVGAPKKIPECINQLKKKNRRKKLPKTAT

Query:  IDSIYEKNFLSLSSVLEAVIIEEFILPDFIPKHQLPSLYFHLDLRLCYNSTNCTVLAIFMFHGRTSKTRIFDGRFYDLVDGILKKGRQIFLTGCYLRAAS
        IDSIYEKNFLS+SSVLEAVI++EFILP                       TN  +L +  F    +       RFYDLVDGILKKGRQIF+TGCYLRAAS
Subjt:  IDSIYEKNFLSLSSVLEAVIIEEFILPDFIPKHQLPSLYFHLDLRLCYNSTNCTVLAIFMFHGRTSKTRIFDGRFYDLVDGILKKGRQIFLTGCYLRAAS

Query:  GGSGHPRLLPTEYLIILLDEEEDDDVILLGAQFCSDSFSSVSLDAVNKGTTYSLYARIESIGPMEIHEKTNGLQMIQIILVDNDGFKLKFLLWGEQVILA
        GGSG+PRLLPTEYL+ILLDEEEDDDV+LLGAQFCSD+FSSVSLD+VN+GTTYSLYARIESIGP+EIHEK NGL+MIQIILVDNDGFKLKFLLWGEQV+LA
Subjt:  GGSGHPRLLPTEYLIILLDEEEDDDVILLGAQFCSDSFSSVSLDAVNKGTTYSLYARIESIGPMEIHEKTNGLQMIQIILVDNDGFKLKFLLWGEQVILA

Query:  NLLSVGSLLALDRPYIATVNENGIGTSDELCLEYGSATQLYLVPCIQHEEQVCVLTQNINQASRMLSTSYPTQGPRISQVSLPCDSQGTIDFGNYPYRSF
        NLLSVGS+LALDRPY+ATVNENG+GTS+ELCLEYGSATQLYLVPCIQHEEQVCVLTQNINQASR +S SYPTQGP++SQVSLPCDS G IDFGNYP+RSF
Subjt:  NLLSVGSLLALDRPYIATVNENGIGTSDELCLEYGSATQLYLVPCIQHEEQVCVLTQNINQASRMLSTSYPTQGPRISQVSLPCDSQGTIDFGNYPYRSF

Query:  VIDLQDKMTGISLYGIISEIVNERNATEAVFSMIIEDKTGEILAKLHFARSWSLGRVGVGHTVYISGLTCTMNKNRLEALWIENHVGASFVNLSCLPALL
        VIDLQDKMTGISLYG + +I NERN TEA FSM IED TGEILAKL F RSWSLGRV VGHTV+ISGLTCT NKNRLEALWIENHVGASFVNLSCLPALL
Subjt:  VIDLQDKMTGISLYGIISEIVNERNATEAVFSMIIEDKTGEILAKLHFARSWSLGRVGVGHTVYISGLTCTMNKNRLEALWIENHVGASFVNLSCLPALL

Query:  TSSCLHKLSRLSDLTCNAHGTK
        TSSCLHKLSRLSDLT N HGTK
Subjt:  TSSCLHKLSRLSDLTCNAHGTK

KAG7029015.1 hypothetical protein SDJN02_10198 [Cucurbita argyrosperma subsp. argyrosperma]1.8e-25575.85Show/hide
Query:  SSRGRHF---NAGGSSAMELDDRRRLQEEDDDDPFLKFVDYARSVLAFEDEEDYDPNVNGTETNTPGWSWIASRVLRTCIAYSSSVTPAILLSELSQCLI
        SSRGRHF    AGG+SAMEL+DRRRLQEE+DDDPFLKF+DYARSVLAFEDEED+DPNV GTET TPGWSWIASRVLRTCIAYSSSVTPAILLSELSQ   
Subjt:  SSRGRHF---NAGGSSAMELDDRRRLQEEDDDDPFLKFVDYARSVLAFEDEEDYDPNVNGTETNTPGWSWIASRVLRTCIAYSSSVTPAILLSELSQCLI

Query:  SQEAVSTVHHAIIQLRNDRLRPKLHILDWEQEFHRGQRISPDLDGQDKAQLLTDIVQIHDFVLKAWYEQHRVGAPKKIPECINQLKKKNRRKKLPKTATI
                                                                        AW EQHR+GAPKKIPECINQLKKKNRRKKLPKT TI
Subjt:  SQEAVSTVHHAIIQLRNDRLRPKLHILDWEQEFHRGQRISPDLDGQDKAQLLTDIVQIHDFVLKAWYEQHRVGAPKKIPECINQLKKKNRRKKLPKTATI

Query:  DSIYEKNFLSLSSVLEAVIIEEFILPDFIPKHQLPSLYFHLDLRLCYNSTNCTVLAIFMFHGRTSKTRIFDGRFYDLVDGILKKGRQIFLTGCYLRAASG
        DSIYEKNFLSLSSVLEAVI+EEFILP                       TN  +L +  F    +       RFYDLV GILKKGRQIFLTGCYLRAASG
Subjt:  DSIYEKNFLSLSSVLEAVIIEEFILPDFIPKHQLPSLYFHLDLRLCYNSTNCTVLAIFMFHGRTSKTRIFDGRFYDLVDGILKKGRQIFLTGCYLRAASG

Query:  GSGHPRLLPTEYLIILLDEEEDDDVILLGAQFCSDSFSSVSLDAVNKGTTYSLYARIESIGPMEIHEKTNGLQMIQIILVDNDGFKLKFLLWGEQVILAN
        GSGHPRLLPTEYLI LLDEEEDDDVILLGAQFCSDSFSSVSLDAVNKGTTYSLYARIESIGP+EIHEKTNGLQMIQI L+DNDGFKLKFLLWGEQVILAN
Subjt:  GSGHPRLLPTEYLIILLDEEEDDDVILLGAQFCSDSFSSVSLDAVNKGTTYSLYARIESIGPMEIHEKTNGLQMIQIILVDNDGFKLKFLLWGEQVILAN

Query:  LLSVGSLLALDRPYIATVNENGIGTSDELCLEYGSATQLYLVPCIQHEEQVCVLTQNINQASRMLSTSYPTQGPRISQVSLPCDSQGTIDFGNYPYRSFV
        LLSVGSLLALDRPYIATVNENGIGTSDELCLEYGSATQLYLVPCIQHEEQVCVLTQNI+QASR L TSYPTQ PR+SQVSLPCDS GTIDFGNYP+RSFV
Subjt:  LLSVGSLLALDRPYIATVNENGIGTSDELCLEYGSATQLYLVPCIQHEEQVCVLTQNINQASRMLSTSYPTQGPRISQVSLPCDSQGTIDFGNYPYRSFV

Query:  IDLQDKMTGISLYGIISEIVNERNATEAVFSMIIEDKTGEILAKLHFARSWSLGRVGVGHTVYISGLTCTMNKNRLEALWIENHVGASFVNLSCLPALLT
        +DLQDKMTGISLYGII +IVNERN TEAVFSM IED TG+I AKLHF RSWSLGRVGVGHTVYISGLTCT+ KN LEALWIENHVGASFVNLSCLPALLT
Subjt:  IDLQDKMTGISLYGIISEIVNERNATEAVFSMIIEDKTGEILAKLHFARSWSLGRVGVGHTVYISGLTCTMNKNRLEALWIENHVGASFVNLSCLPALLT

Query:  SSCLHKLSRLSDLTCNAHGTK
        SSCLHK+SRLSDLTCN+HGTK
Subjt:  SSCLHKLSRLSDLTCNAHGTK

XP_022938337.1 uncharacterized protein LOC111444466 [Cucurbita moschata]5.7e-25475.52Show/hide
Query:  SSRGRHF---NAGGSSAMELDDRRRLQEEDDDDPFLKFVDYARSVLAFEDEEDYDPNVNGTETNTPGWSWIASRVLRTCIAYSSSVTPAILLSELSQCLI
        SSRGRHF    AGG+SAMEL+DRRRLQEE+DDDPFLKF+DYARSVLAFEDEED+DPNV GTET TPGWSWIASRVLRTCIAYSSSVTPAILLSELSQ   
Subjt:  SSRGRHF---NAGGSSAMELDDRRRLQEEDDDDPFLKFVDYARSVLAFEDEEDYDPNVNGTETNTPGWSWIASRVLRTCIAYSSSVTPAILLSELSQCLI

Query:  SQEAVSTVHHAIIQLRNDRLRPKLHILDWEQEFHRGQRISPDLDGQDKAQLLTDIVQIHDFVLKAWYEQHRVGAPKKIPECINQLKKKNRRKKLPKTATI
                                                                        AW EQHR+GAPKKIPECINQLKKKNRRKKLPKT TI
Subjt:  SQEAVSTVHHAIIQLRNDRLRPKLHILDWEQEFHRGQRISPDLDGQDKAQLLTDIVQIHDFVLKAWYEQHRVGAPKKIPECINQLKKKNRRKKLPKTATI

Query:  DSIYEKNFLSLSSVLEAVIIEEFILPDFIPKHQLPSLYFHLDLRLCYNSTNCTVLAIFMFHGRTSKTRIFDGRFYDLVDGILKKGRQIFLTGCYLRAASG
        DSIYEKNFLSLSSVLEAVI+EEFILP                       TN  +L +  F    +       RFYDLV GILKKGRQIFLTGCYLRAASG
Subjt:  DSIYEKNFLSLSSVLEAVIIEEFILPDFIPKHQLPSLYFHLDLRLCYNSTNCTVLAIFMFHGRTSKTRIFDGRFYDLVDGILKKGRQIFLTGCYLRAASG

Query:  GSGHPRLLPTEYLIILLDEEEDDDVILLGAQFCSDSFSSVSLDAVNKGTTYSLYARIESIGPMEIHEKTNGLQMIQIILVDNDGFKLKFLLWGEQVILAN
        GSGHPRLLPTEYLIILLDEEEDDDVILLGAQFCSDSFSSVSLDAVNKGT YSLYARIESIGP+EIHEKTNGLQMIQI L+DNDGFKLKFLLWGEQVILAN
Subjt:  GSGHPRLLPTEYLIILLDEEEDDDVILLGAQFCSDSFSSVSLDAVNKGTTYSLYARIESIGPMEIHEKTNGLQMIQIILVDNDGFKLKFLLWGEQVILAN

Query:  LLSVGSLLALDRPYIATVNENGIGTSDELCLEYGSATQLYLVPCIQHEEQVCVLTQNINQASRMLSTSYPTQGPRISQVSLPCDSQGTIDFGNYPYRSFV
        LLSVGSLLALDRPYIATVNENGIGTSDELCLEYGSATQLYLVPCIQHEEQVCVLTQNI+QASR L TSYPTQ PR+SQVSLPCDS GTIDFGNYP+RSFV
Subjt:  LLSVGSLLALDRPYIATVNENGIGTSDELCLEYGSATQLYLVPCIQHEEQVCVLTQNINQASRMLSTSYPTQGPRISQVSLPCDSQGTIDFGNYPYRSFV

Query:  IDLQDKMTGISLYGIISEIVNERNATEAVFSMIIEDKTGEILAKLHFARSWSLGRVGVGHTVYISGLTCTMNKNRLEALWIENHVGASFVNLSCLPALLT
        +DLQDKMTGISLYGII +IVNERN TEAVFSM IED TG+I AKLHF +SWSLGRVGVGHTVYISGLTCT+ KN LEALWIENHVGASFVNLSCLPALLT
Subjt:  IDLQDKMTGISLYGIISEIVNERNATEAVFSMIIEDKTGEILAKLHFARSWSLGRVGVGHTVYISGLTCTMNKNRLEALWIENHVGASFVNLSCLPALLT

Query:  SSCLHKLSRLSDLTCNAHGTK
        SSCLHK+SRLSDLT N+HGTK
Subjt:  SSCLHKLSRLSDLTCNAHGTK

XP_022972298.1 uncharacterized protein LOC111470879 isoform X1 [Cucurbita maxima]1.8e-25275.04Show/hide
Query:  SSRGRHF---NAGGSSAMELDDRRRLQEEDDDDPFLKFVDYARSVLAFEDEEDYDPNVNGTETNTPGWSWIASRVLRTCIAYSSSVTPAILLSELSQCLI
        SSR R+F    AGG SAMEL+DRRRLQEE+DDDPFLKF+DYARSVLAFEDEED+DPNV GT+T TPGWSWIASRVLRTCIAYSSSVTPAILLSELSQ   
Subjt:  SSRGRHF---NAGGSSAMELDDRRRLQEEDDDDPFLKFVDYARSVLAFEDEEDYDPNVNGTETNTPGWSWIASRVLRTCIAYSSSVTPAILLSELSQCLI

Query:  SQEAVSTVHHAIIQLRNDRLRPKLHILDWEQEFHRGQRISPDLDGQDKAQLLTDIVQIHDFVLKAWYEQHRVGAPKKIPECINQLKKKNRRKKLPKTATI
                                                                        AW EQHR+GAPKKIPECINQLKKKNRRKKLPKT TI
Subjt:  SQEAVSTVHHAIIQLRNDRLRPKLHILDWEQEFHRGQRISPDLDGQDKAQLLTDIVQIHDFVLKAWYEQHRVGAPKKIPECINQLKKKNRRKKLPKTATI

Query:  DSIYEKNFLSLSSVLEAVIIEEFILPDFIPKHQLPSLYFHLDLRLCYNSTNCTVLAIFMFHGRTSKTRIFDGRFYDLVDGILKKGRQIFLTGCYLRAASG
        DSIYEKNFLSLSSVLEAVI+EEFILP                       TN  +L +  F    +       RFYDLV GILKKGRQIFLTGCYLRAASG
Subjt:  DSIYEKNFLSLSSVLEAVIIEEFILPDFIPKHQLPSLYFHLDLRLCYNSTNCTVLAIFMFHGRTSKTRIFDGRFYDLVDGILKKGRQIFLTGCYLRAASG

Query:  GSGHPRLLPTEYLIILLDEEEDDDVILLGAQFCSDSFSSVSLDAVNKGTTYSLYARIESIGPMEIHEKTNGLQMIQIILVDNDGFKLKFLLWGEQVILAN
        GSGHPRLLPTEYLIILLDEEEDDDVILLGAQFCSDSFSSVSLDAV+KGTTYSLYARIESIGP EIHEKTNGLQMIQI+L+DNDGFKLKFLLWGEQVILAN
Subjt:  GSGHPRLLPTEYLIILLDEEEDDDVILLGAQFCSDSFSSVSLDAVNKGTTYSLYARIESIGPMEIHEKTNGLQMIQIILVDNDGFKLKFLLWGEQVILAN

Query:  LLSVGSLLALDRPYIATVNENGIGTSDELCLEYGSATQLYLVPCIQHEEQVCVLTQNINQASRMLSTSYPTQGPRISQVSLPCDSQGTIDFGNYPYRSFV
        LLSVGSLLALDRPYIATVNENGIG+SDELCLEYGSATQLYLVPCIQHEEQVCVLTQNINQASR L TSYPTQ PR+SQVSLPCDS GTIDFGNYP+RSFV
Subjt:  LLSVGSLLALDRPYIATVNENGIGTSDELCLEYGSATQLYLVPCIQHEEQVCVLTQNINQASRMLSTSYPTQGPRISQVSLPCDSQGTIDFGNYPYRSFV

Query:  IDLQDKMTGISLYGIISEIVNERNATEAVFSMIIEDKTGEILAKLHFARSWSLGRVGVGHTVYISGLTCTMNKNRLEALWIENHVGASFVNLSCLPALLT
        +DLQDKMTGISLYGI+ +IVNERN TEAVFSM IED TG+I AKLHF RSWSLGRVGVGHTVYISGLTCT+ KN LEALWIENHVGASFVNLSCLPALLT
Subjt:  IDLQDKMTGISLYGIISEIVNERNATEAVFSMIIEDKTGEILAKLHFARSWSLGRVGVGHTVYISGLTCTMNKNRLEALWIENHVGASFVNLSCLPALLT

Query:  SSCLHKLSRLSDLTCNAHGTK
        SSCLHK+SRLSDLT N+HGTK
Subjt:  SSCLHKLSRLSDLTCNAHGTK

XP_023538883.1 uncharacterized protein LOC111799677 [Cucurbita pepo subsp. pepo]2.3e-25575.85Show/hide
Query:  SSRGRHFN---AGGSSAMELDDRRRLQEEDDDDPFLKFVDYARSVLAFEDEEDYDPNVNGTETNTPGWSWIASRVLRTCIAYSSSVTPAILLSELSQCLI
        SSRGRHFN   AGG+SAMEL+DRRRLQEE+DDDPFLKF+DYARSVLAFEDEED+DPNV GTET TPGWSWIASRVLRTCIAYSSSVTPAILLSELSQ   
Subjt:  SSRGRHFN---AGGSSAMELDDRRRLQEEDDDDPFLKFVDYARSVLAFEDEEDYDPNVNGTETNTPGWSWIASRVLRTCIAYSSSVTPAILLSELSQCLI

Query:  SQEAVSTVHHAIIQLRNDRLRPKLHILDWEQEFHRGQRISPDLDGQDKAQLLTDIVQIHDFVLKAWYEQHRVGAPKKIPECINQLKKKNRRKKLPKTATI
                                                                        AW EQHR+GAPKKIPECINQLKKKNRRKKLPKT TI
Subjt:  SQEAVSTVHHAIIQLRNDRLRPKLHILDWEQEFHRGQRISPDLDGQDKAQLLTDIVQIHDFVLKAWYEQHRVGAPKKIPECINQLKKKNRRKKLPKTATI

Query:  DSIYEKNFLSLSSVLEAVIIEEFILPDFIPKHQLPSLYFHLDLRLCYNSTNCTVLAIFMFHGRTSKTRIFDGRFYDLVDGILKKGRQIFLTGCYLRAASG
        DSIYEKNFLSLSSVLEAVI+EEFILP                       TN  +L +  F    +       RFYDLV GILKKGRQIFLTGCYLRAASG
Subjt:  DSIYEKNFLSLSSVLEAVIIEEFILPDFIPKHQLPSLYFHLDLRLCYNSTNCTVLAIFMFHGRTSKTRIFDGRFYDLVDGILKKGRQIFLTGCYLRAASG

Query:  GSGHPRLLPTEYLIILLDEEEDDDVILLGAQFCSDSFSSVSLDAVNKGTTYSLYARIESIGPMEIHEKTNGLQMIQIILVDNDGFKLKFLLWGEQVILAN
        GSGHPRLLPTEYLIILLDEEEDDDVILLGAQFCSDSFSSVSLDAVNKGTTYSLYARIESIGP+EIHEKTNGLQMIQI L+DNDGFKLKFLLWGEQVILAN
Subjt:  GSGHPRLLPTEYLIILLDEEEDDDVILLGAQFCSDSFSSVSLDAVNKGTTYSLYARIESIGPMEIHEKTNGLQMIQIILVDNDGFKLKFLLWGEQVILAN

Query:  LLSVGSLLALDRPYIATVNENGIGTSDELCLEYGSATQLYLVPCIQHEEQVCVLTQNINQASRMLSTSYPTQGPRISQVSLPCDSQGTIDFGNYPYRSFV
        LLSVGSLLALDRPYIATVNENG+GTSDELCLEYGSATQLYLVPCIQHEEQVCVLTQNI+QASR L TSYPTQ PR+SQVSLPCDS GTIDFGNYP+RSFV
Subjt:  LLSVGSLLALDRPYIATVNENGIGTSDELCLEYGSATQLYLVPCIQHEEQVCVLTQNINQASRMLSTSYPTQGPRISQVSLPCDSQGTIDFGNYPYRSFV

Query:  IDLQDKMTGISLYGIISEIVNERNATEAVFSMIIEDKTGEILAKLHFARSWSLGRVGVGHTVYISGLTCTMNKNRLEALWIENHVGASFVNLSCLPALLT
        +DLQDKMTGISLYGII +IVNERN TEAVFSM IED TG+I AKLHF RSWSLGRVGVGHTVYISGLTCT+ KN LEALWIENHVGASFVNLSCLPALLT
Subjt:  IDLQDKMTGISLYGIISEIVNERNATEAVFSMIIEDKTGEILAKLHFARSWSLGRVGVGHTVYISGLTCTMNKNRLEALWIENHVGASFVNLSCLPALLT

Query:  SSCLHKLSRLSDLTCNAHGTK
        SSCLHK+SRLSDLT N+HGTK
Subjt:  SSCLHKLSRLSDLTCNAHGTK

TrEMBL top hitse value%identityAlignment
A0A1S3AX73 uncharacterized protein LOC103483891 isoform X25.1e-24873.31Show/hide
Query:  MSSRGRHFN---AGGSSAMELDDRRRLQEEDDDDPFLKFVDYARSVLAFEDEEDYDPNVNGTETNTPGWSWIASRVLRTCIAYSSSVTPAILLSELSQCL
        MSS  +HFN   AG  SAMELDD R+LQEE DDDPFLKFVDYARSVLAFED+ED+DPNVNGTET+TPGWSWIASRVLRTCIAYSSSVTPAILLSELSQ  
Subjt:  MSSRGRHFN---AGGSSAMELDDRRRLQEEDDDDPFLKFVDYARSVLAFEDEEDYDPNVNGTETNTPGWSWIASRVLRTCIAYSSSVTPAILLSELSQCL

Query:  ISQEAVSTVHHAIIQLRNDRLRPKLHILDWEQEFHRGQRISPDLDGQDKAQLLTDIVQIHDFVLKAWYEQHRVGAPKKIPECINQLKKKNRRKKLPKTAT
                                                                         AWYEQHRVGAPKKIPECINQLKKKNRRKKLPKT T
Subjt:  ISQEAVSTVHHAIIQLRNDRLRPKLHILDWEQEFHRGQRISPDLDGQDKAQLLTDIVQIHDFVLKAWYEQHRVGAPKKIPECINQLKKKNRRKKLPKTAT

Query:  IDSIYEKNFLSLSSVLEAVIIEEFILPDFIPKHQLPSLYFHLDLRLCYNSTNCTVLAIFMFHGRTSKTRIFDGRFYDLVDGILKKGRQIFLTGCYLRAAS
        IDSIYEKNFLSLSSVLEAVI++EFILP                       TN  +L +  F    +       RFYDLVDGILKKGRQIF+TGCYLRAAS
Subjt:  IDSIYEKNFLSLSSVLEAVIIEEFILPDFIPKHQLPSLYFHLDLRLCYNSTNCTVLAIFMFHGRTSKTRIFDGRFYDLVDGILKKGRQIFLTGCYLRAAS

Query:  GGSGHPRLLPTEYLIILLDEEEDDDVILLGAQFCSDSFSSVSLDAVNKGTTYSLYARIESIGPMEIHEKTNGLQMIQIILVDNDGFKLKFLLWGEQVILA
        GGSG+PRLLPTEYL+ILLDEEEDDDV+LLGAQFCSD+FSSVSLD+VN+GTTYSLYARIESIGP+EIHEK NGL+MIQIILVDNDGFKLKFLLWGEQV+LA
Subjt:  GGSGHPRLLPTEYLIILLDEEEDDDVILLGAQFCSDSFSSVSLDAVNKGTTYSLYARIESIGPMEIHEKTNGLQMIQIILVDNDGFKLKFLLWGEQVILA

Query:  NLLSVGSLLALDRPYIATVNENGIGTSDELCLEYGSATQLYLVPCIQHEEQVCVLTQNINQASRMLSTSYPTQGPRISQVSLPCDSQGTIDFGNYPYRSF
         LLSVGS+LALDRPY+ATVNENG+GTS+ELCLEYGSATQLYLVPCIQHEEQVCVLTQNINQASR +S SYPTQGP++SQVSLPCDS G IDFGNYP+RSF
Subjt:  NLLSVGSLLALDRPYIATVNENGIGTSDELCLEYGSATQLYLVPCIQHEEQVCVLTQNINQASRMLSTSYPTQGPRISQVSLPCDSQGTIDFGNYPYRSF

Query:  VIDLQDKMTGISLYGIISEIVNERNATEAVFSMIIEDKTGEILAKLHFARSWSLGRVGVGHTVYISGLTCTMNKNRLEALWIENHVGASFVNLSCLPALL
        VIDLQDKMTGISLYG + +I NERN TEA FSM IED TGEILAKL F RSWSLGRV VGHTV+ISGLTCT NKNRLEALWIENHVGASFVNLSCLPALL
Subjt:  VIDLQDKMTGISLYGIISEIVNERNATEAVFSMIIEDKTGEILAKLHFARSWSLGRVGVGHTVYISGLTCTMNKNRLEALWIENHVGASFVNLSCLPALL

Query:  TSSCLHKLSRLSDLTCNAHGTK
        TSSCLHKLSRLSDLT N HGTK
Subjt:  TSSCLHKLSRLSDLTCNAHGTK

A0A1S4DSK5 uncharacterized protein LOC103483891 isoform X13.6e-24672.73Show/hide
Query:  MSSRGRHFN---AGGSSAMELDDRRRLQEEDDDDPFLKFVDYARSVLAFEDEEDYDPNVNGTETNTPGWSWIASRVLRTCIAYSSSVTPAILLSELSQCL
        MSS  +HFN   AG  SAMELDD R+LQEE DDDPFLKFVDYARSVLAFED+ED+DPNVNGTET+TPGWSWIASRVLRTCIAYSSSVTPAILLSELSQ  
Subjt:  MSSRGRHFN---AGGSSAMELDDRRRLQEEDDDDPFLKFVDYARSVLAFEDEEDYDPNVNGTETNTPGWSWIASRVLRTCIAYSSSVTPAILLSELSQCL

Query:  ISQEAVSTVHHAIIQLRNDRLRPKLHILDWEQEFHRGQRISPDLDGQDKAQLLTDIVQIHDFVLKAWYEQHRVGAPKKIPECINQLKKKNRRKKLPKTAT
                                                                         AWYEQHRVGAPKKIPECINQLKKKNRRKKLPKT T
Subjt:  ISQEAVSTVHHAIIQLRNDRLRPKLHILDWEQEFHRGQRISPDLDGQDKAQLLTDIVQIHDFVLKAWYEQHRVGAPKKIPECINQLKKKNRRKKLPKTAT

Query:  IDSIYEKNFLSLSSVLEAVIIEEFILPDFIPKHQLPSLYFHLDLRLCYNSTNCTVLAIFMFHGRTSKTRIFDGRFYDLVDGILKKGRQIFLTGCYLRAAS
        IDSIYEKNFLSLSSVLEAVI++EFILP                       TN  +L +  F    +       RFYDLVDGILKKGRQIF+TGCYLRAAS
Subjt:  IDSIYEKNFLSLSSVLEAVIIEEFILPDFIPKHQLPSLYFHLDLRLCYNSTNCTVLAIFMFHGRTSKTRIFDGRFYDLVDGILKKGRQIFLTGCYLRAAS

Query:  GGSGHPRLLPTEYLIILLDEEEDDDVILLGAQFCSDSFSSVSLDAVNKGTTYSLYARIESIGPMEIHEKTNGLQMIQIILVDNDGFKLKFLLWGEQVILA
        GGSG+PRLLPTEYL+ILLDEEEDDDV+LLGAQFCSD+FSSVSLD+VN+GTTYSLYARIESIGP+EIHEK NGL+MIQIILVDNDGFKLKFLLWGEQV+LA
Subjt:  GGSGHPRLLPTEYLIILLDEEEDDDVILLGAQFCSDSFSSVSLDAVNKGTTYSLYARIESIGPMEIHEKTNGLQMIQIILVDNDGFKLKFLLWGEQVILA

Query:  NLL-----SVGSLLALDRPYIATVNENGIGTSDELCLEYGSATQLYLVPCIQHEEQVCVLTQNINQASRMLSTSYPTQGPRISQVSLPCDSQGTIDFGNY
         LL     SVGS+LALDRPY+ATVNENG+GTS+ELCLEYGSATQLYLVPCIQHEEQVCVLTQNINQASR +S SYPTQGP++SQVSLPCDS G IDFGNY
Subjt:  NLL-----SVGSLLALDRPYIATVNENGIGTSDELCLEYGSATQLYLVPCIQHEEQVCVLTQNINQASRMLSTSYPTQGPRISQVSLPCDSQGTIDFGNY

Query:  PYRSFVIDLQDKMTGISLYGIISEIVNERNATEAVFSMIIEDKTGEILAKLHFARSWSLGRVGVGHTVYISGLTCTMNKNRLEALWIENHVGASFVNLSC
        P+RSFVIDLQDKMTGISLYG + +I NERN TEA FSM IED TGEILAKL F RSWSLGRV VGHTV+ISGLTCT NKNRLEALWIENHVGASFVNLSC
Subjt:  PYRSFVIDLQDKMTGISLYGIISEIVNERNATEAVFSMIIEDKTGEILAKLHFARSWSLGRVGVGHTVYISGLTCTMNKNRLEALWIENHVGASFVNLSC

Query:  LPALLTSSCLHKLSRLSDLTCNAHGTK
        LPALLTSSCLHKLSRLSDLT N HGTK
Subjt:  LPALLTSSCLHKLSRLSDLTCNAHGTK

A0A5A7U7H0 Nucleic acid-binding proteins superfamily isoform 11.7e-24873.31Show/hide
Query:  MSSRGRHFN---AGGSSAMELDDRRRLQEEDDDDPFLKFVDYARSVLAFEDEEDYDPNVNGTETNTPGWSWIASRVLRTCIAYSSSVTPAILLSELSQCL
        MSS  +HFN   AG  SAMELDD R+LQEE DDDPFLKFVDYARSVLAFED+ED+DPNVNGTET+TPGWSWIASRVLRTCIAYSSSVTPAILLSELSQ  
Subjt:  MSSRGRHFN---AGGSSAMELDDRRRLQEEDDDDPFLKFVDYARSVLAFEDEEDYDPNVNGTETNTPGWSWIASRVLRTCIAYSSSVTPAILLSELSQCL

Query:  ISQEAVSTVHHAIIQLRNDRLRPKLHILDWEQEFHRGQRISPDLDGQDKAQLLTDIVQIHDFVLKAWYEQHRVGAPKKIPECINQLKKKNRRKKLPKTAT
                                                                         AWYEQHRVGAPKKIPECINQLKKKNRRKKLPKT T
Subjt:  ISQEAVSTVHHAIIQLRNDRLRPKLHILDWEQEFHRGQRISPDLDGQDKAQLLTDIVQIHDFVLKAWYEQHRVGAPKKIPECINQLKKKNRRKKLPKTAT

Query:  IDSIYEKNFLSLSSVLEAVIIEEFILPDFIPKHQLPSLYFHLDLRLCYNSTNCTVLAIFMFHGRTSKTRIFDGRFYDLVDGILKKGRQIFLTGCYLRAAS
        IDSIYEKNFLS+SSVLEAVI++EFILP                       TN  +L +  F    +       RFYDLVDGILKKGRQIF+TGCYLRAAS
Subjt:  IDSIYEKNFLSLSSVLEAVIIEEFILPDFIPKHQLPSLYFHLDLRLCYNSTNCTVLAIFMFHGRTSKTRIFDGRFYDLVDGILKKGRQIFLTGCYLRAAS

Query:  GGSGHPRLLPTEYLIILLDEEEDDDVILLGAQFCSDSFSSVSLDAVNKGTTYSLYARIESIGPMEIHEKTNGLQMIQIILVDNDGFKLKFLLWGEQVILA
        GGSG+PRLLPTEYL+ILLDEEEDDDV+LLGAQFCSD+FSSVSLD+VN+GTTYSLYARIESIGP+EIHEK NGL+MIQIILVDNDGFKLKFLLWGEQV+LA
Subjt:  GGSGHPRLLPTEYLIILLDEEEDDDVILLGAQFCSDSFSSVSLDAVNKGTTYSLYARIESIGPMEIHEKTNGLQMIQIILVDNDGFKLKFLLWGEQVILA

Query:  NLLSVGSLLALDRPYIATVNENGIGTSDELCLEYGSATQLYLVPCIQHEEQVCVLTQNINQASRMLSTSYPTQGPRISQVSLPCDSQGTIDFGNYPYRSF
        NLLSVGS+LALDRPY+ATVNENG+GTS+ELCLEYGSATQLYLVPCIQHEEQVCVLTQNINQASR +S SYPTQGP++SQVSLPCDS G IDFGNYP+RSF
Subjt:  NLLSVGSLLALDRPYIATVNENGIGTSDELCLEYGSATQLYLVPCIQHEEQVCVLTQNINQASRMLSTSYPTQGPRISQVSLPCDSQGTIDFGNYPYRSF

Query:  VIDLQDKMTGISLYGIISEIVNERNATEAVFSMIIEDKTGEILAKLHFARSWSLGRVGVGHTVYISGLTCTMNKNRLEALWIENHVGASFVNLSCLPALL
        VIDLQDKMTGISLYG + +I NERN TEA FSM IED TGEILAKL F RSWSLGRV VGHTV+ISGLTCT NKNRLEALWIENHVGASFVNLSCLPALL
Subjt:  VIDLQDKMTGISLYGIISEIVNERNATEAVFSMIIEDKTGEILAKLHFARSWSLGRVGVGHTVYISGLTCTMNKNRLEALWIENHVGASFVNLSCLPALL

Query:  TSSCLHKLSRLSDLTCNAHGTK
        TSSCLHKLSRLSDLT N HGTK
Subjt:  TSSCLHKLSRLSDLTCNAHGTK

A0A6J1FDS0 uncharacterized protein LOC1114444662.8e-25475.52Show/hide
Query:  SSRGRHF---NAGGSSAMELDDRRRLQEEDDDDPFLKFVDYARSVLAFEDEEDYDPNVNGTETNTPGWSWIASRVLRTCIAYSSSVTPAILLSELSQCLI
        SSRGRHF    AGG+SAMEL+DRRRLQEE+DDDPFLKF+DYARSVLAFEDEED+DPNV GTET TPGWSWIASRVLRTCIAYSSSVTPAILLSELSQ   
Subjt:  SSRGRHF---NAGGSSAMELDDRRRLQEEDDDDPFLKFVDYARSVLAFEDEEDYDPNVNGTETNTPGWSWIASRVLRTCIAYSSSVTPAILLSELSQCLI

Query:  SQEAVSTVHHAIIQLRNDRLRPKLHILDWEQEFHRGQRISPDLDGQDKAQLLTDIVQIHDFVLKAWYEQHRVGAPKKIPECINQLKKKNRRKKLPKTATI
                                                                        AW EQHR+GAPKKIPECINQLKKKNRRKKLPKT TI
Subjt:  SQEAVSTVHHAIIQLRNDRLRPKLHILDWEQEFHRGQRISPDLDGQDKAQLLTDIVQIHDFVLKAWYEQHRVGAPKKIPECINQLKKKNRRKKLPKTATI

Query:  DSIYEKNFLSLSSVLEAVIIEEFILPDFIPKHQLPSLYFHLDLRLCYNSTNCTVLAIFMFHGRTSKTRIFDGRFYDLVDGILKKGRQIFLTGCYLRAASG
        DSIYEKNFLSLSSVLEAVI+EEFILP                       TN  +L +  F    +       RFYDLV GILKKGRQIFLTGCYLRAASG
Subjt:  DSIYEKNFLSLSSVLEAVIIEEFILPDFIPKHQLPSLYFHLDLRLCYNSTNCTVLAIFMFHGRTSKTRIFDGRFYDLVDGILKKGRQIFLTGCYLRAASG

Query:  GSGHPRLLPTEYLIILLDEEEDDDVILLGAQFCSDSFSSVSLDAVNKGTTYSLYARIESIGPMEIHEKTNGLQMIQIILVDNDGFKLKFLLWGEQVILAN
        GSGHPRLLPTEYLIILLDEEEDDDVILLGAQFCSDSFSSVSLDAVNKGT YSLYARIESIGP+EIHEKTNGLQMIQI L+DNDGFKLKFLLWGEQVILAN
Subjt:  GSGHPRLLPTEYLIILLDEEEDDDVILLGAQFCSDSFSSVSLDAVNKGTTYSLYARIESIGPMEIHEKTNGLQMIQIILVDNDGFKLKFLLWGEQVILAN

Query:  LLSVGSLLALDRPYIATVNENGIGTSDELCLEYGSATQLYLVPCIQHEEQVCVLTQNINQASRMLSTSYPTQGPRISQVSLPCDSQGTIDFGNYPYRSFV
        LLSVGSLLALDRPYIATVNENGIGTSDELCLEYGSATQLYLVPCIQHEEQVCVLTQNI+QASR L TSYPTQ PR+SQVSLPCDS GTIDFGNYP+RSFV
Subjt:  LLSVGSLLALDRPYIATVNENGIGTSDELCLEYGSATQLYLVPCIQHEEQVCVLTQNINQASRMLSTSYPTQGPRISQVSLPCDSQGTIDFGNYPYRSFV

Query:  IDLQDKMTGISLYGIISEIVNERNATEAVFSMIIEDKTGEILAKLHFARSWSLGRVGVGHTVYISGLTCTMNKNRLEALWIENHVGASFVNLSCLPALLT
        +DLQDKMTGISLYGII +IVNERN TEAVFSM IED TG+I AKLHF +SWSLGRVGVGHTVYISGLTCT+ KN LEALWIENHVGASFVNLSCLPALLT
Subjt:  IDLQDKMTGISLYGIISEIVNERNATEAVFSMIIEDKTGEILAKLHFARSWSLGRVGVGHTVYISGLTCTMNKNRLEALWIENHVGASFVNLSCLPALLT

Query:  SSCLHKLSRLSDLTCNAHGTK
        SSCLHK+SRLSDLT N+HGTK
Subjt:  SSCLHKLSRLSDLTCNAHGTK

A0A6J1IB36 uncharacterized protein LOC111470879 isoform X18.9e-25375.04Show/hide
Query:  SSRGRHF---NAGGSSAMELDDRRRLQEEDDDDPFLKFVDYARSVLAFEDEEDYDPNVNGTETNTPGWSWIASRVLRTCIAYSSSVTPAILLSELSQCLI
        SSR R+F    AGG SAMEL+DRRRLQEE+DDDPFLKF+DYARSVLAFEDEED+DPNV GT+T TPGWSWIASRVLRTCIAYSSSVTPAILLSELSQ   
Subjt:  SSRGRHF---NAGGSSAMELDDRRRLQEEDDDDPFLKFVDYARSVLAFEDEEDYDPNVNGTETNTPGWSWIASRVLRTCIAYSSSVTPAILLSELSQCLI

Query:  SQEAVSTVHHAIIQLRNDRLRPKLHILDWEQEFHRGQRISPDLDGQDKAQLLTDIVQIHDFVLKAWYEQHRVGAPKKIPECINQLKKKNRRKKLPKTATI
                                                                        AW EQHR+GAPKKIPECINQLKKKNRRKKLPKT TI
Subjt:  SQEAVSTVHHAIIQLRNDRLRPKLHILDWEQEFHRGQRISPDLDGQDKAQLLTDIVQIHDFVLKAWYEQHRVGAPKKIPECINQLKKKNRRKKLPKTATI

Query:  DSIYEKNFLSLSSVLEAVIIEEFILPDFIPKHQLPSLYFHLDLRLCYNSTNCTVLAIFMFHGRTSKTRIFDGRFYDLVDGILKKGRQIFLTGCYLRAASG
        DSIYEKNFLSLSSVLEAVI+EEFILP                       TN  +L +  F    +       RFYDLV GILKKGRQIFLTGCYLRAASG
Subjt:  DSIYEKNFLSLSSVLEAVIIEEFILPDFIPKHQLPSLYFHLDLRLCYNSTNCTVLAIFMFHGRTSKTRIFDGRFYDLVDGILKKGRQIFLTGCYLRAASG

Query:  GSGHPRLLPTEYLIILLDEEEDDDVILLGAQFCSDSFSSVSLDAVNKGTTYSLYARIESIGPMEIHEKTNGLQMIQIILVDNDGFKLKFLLWGEQVILAN
        GSGHPRLLPTEYLIILLDEEEDDDVILLGAQFCSDSFSSVSLDAV+KGTTYSLYARIESIGP EIHEKTNGLQMIQI+L+DNDGFKLKFLLWGEQVILAN
Subjt:  GSGHPRLLPTEYLIILLDEEEDDDVILLGAQFCSDSFSSVSLDAVNKGTTYSLYARIESIGPMEIHEKTNGLQMIQIILVDNDGFKLKFLLWGEQVILAN

Query:  LLSVGSLLALDRPYIATVNENGIGTSDELCLEYGSATQLYLVPCIQHEEQVCVLTQNINQASRMLSTSYPTQGPRISQVSLPCDSQGTIDFGNYPYRSFV
        LLSVGSLLALDRPYIATVNENGIG+SDELCLEYGSATQLYLVPCIQHEEQVCVLTQNINQASR L TSYPTQ PR+SQVSLPCDS GTIDFGNYP+RSFV
Subjt:  LLSVGSLLALDRPYIATVNENGIGTSDELCLEYGSATQLYLVPCIQHEEQVCVLTQNINQASRMLSTSYPTQGPRISQVSLPCDSQGTIDFGNYPYRSFV

Query:  IDLQDKMTGISLYGIISEIVNERNATEAVFSMIIEDKTGEILAKLHFARSWSLGRVGVGHTVYISGLTCTMNKNRLEALWIENHVGASFVNLSCLPALLT
        +DLQDKMTGISLYGI+ +IVNERN TEAVFSM IED TG+I AKLHF RSWSLGRVGVGHTVYISGLTCT+ KN LEALWIENHVGASFVNLSCLPALLT
Subjt:  IDLQDKMTGISLYGIISEIVNERNATEAVFSMIIEDKTGEILAKLHFARSWSLGRVGVGHTVYISGLTCTMNKNRLEALWIENHVGASFVNLSCLPALLT

Query:  SSCLHKLSRLSDLTCNAHGTK
        SSCLHK+SRLSDLT N+HGTK
Subjt:  SSCLHKLSRLSDLTCNAHGTK

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G17030.1 Nucleic acid-binding proteins superfamily1.2e-14547.32Show/hide
Query:  NAGGSSAMELDDRRRLQEEDDDDPFLKFVDYARSVLAFEDEED------YDPNVNGTETNTPGWSWIASRVLRTCIAYSSSVTPAILLSELSQCLISQEA
        +  G+S +E+ D     +E+ +DPFL F+DYAR+V++ ED+ED        P    TE + PGW W+ASR+L+TC AYSS VT AILLS+LSQ       
Subjt:  NAGGSSAMELDDRRRLQEEDDDDPFLKFVDYARSVLAFEDEED------YDPNVNGTETNTPGWSWIASRVLRTCIAYSSSVTPAILLSELSQCLISQEA

Query:  VSTVHHAIIQLRNDRLRPKLHILDWEQEFHRGQRISPDLDGQDKAQLLTDIVQIHDFVLKAWYEQHRVGAPKKIPECINQLKKKNRRKKLPKTATIDSIY
                                                                    AW+EQ++ G  KK PE I+QLKK +RR++L  T TIDSIY
Subjt:  VSTVHHAIIQLRNDRLRPKLHILDWEQEFHRGQRISPDLDGQDKAQLLTDIVQIHDFVLKAWYEQHRVGAPKKIPECINQLKKKNRRKKLPKTATIDSIY

Query:  EKNFLSLSSVLEAVIIEEFILPDFIPKHQLPSLYFHLDLRLCYNSTNCTVLAIFMFHGRTSKTRIFDGRFYDLVD---GILKKGRQIFLTGCYLRAASGG
        EKNFLS++SVLEAVII   +LP                       TN  +L +  F    +       R+Y+LV+   GIL+KGR++ +TGCYLR A  G
Subjt:  EKNFLSLSSVLEAVIIEEFILPDFIPKHQLPSLYFHLDLRLCYNSTNCTVLAIFMFHGRTSKTRIFDGRFYDLVD---GILKKGRQIFLTGCYLRAASGG

Query:  SGHPRLLPTEYLIILLDEEEDDDVILLGAQFCSDSFSSVSLDAVNKGTTYSLYARIESIGPMEIHEKTNGLQMIQIILVDNDGFKLKFLLWGEQVILANL
         G PRLLPTEYL++LLDE++DDD IL+ AQFCSD+FSSVSLDA N G +YSLYARIESIGP+E     +  +  QI LVD DG +LKF+LWGEQVI+ANL
Subjt:  SGHPRLLPTEYLIILLDEEEDDDVILLGAQFCSDSFSSVSLDAVNKGTTYSLYARIESIGPMEIHEKTNGLQMIQIILVDNDGFKLKFLLWGEQVILANL

Query:  LSVGSLLALDRPYIATVNENGIGTSDELCLEYGSATQLYLVPCIQHEEQVCV-LTQNINQASRMLSTSYPTQGPRISQVSLPCDSQGTIDFGNYPYRSFV
        LSVGS+L ++RPYI+++ E+ +  + E CLEYGSAT LYLVP    EE+VCV L+Q+  Q S++L +        +SQV+LP D+ G++DF NYP+R+ +
Subjt:  LSVGSLLALDRPYIATVNENGIGTSDELCLEYGSATQLYLVPCIQHEEQVCV-LTQNINQASRMLSTSYPTQGPRISQVSLPCDSQGTIDFGNYPYRSFV

Query:  IDLQDKMTGISLYGIISEIVNERNATEAVFSMIIEDKTGEILAKLHFARSWSLGRVGVGHTVYISGLTCTMNK-NRLEALWIENHVGASFVNLSCLPALL
         D++DK TGISLYG++++I  + NAT  VFS+ IED TG I AKLHF   WSLGR+G+GH VY+SGL+C + K N +E LW E    A+FVNLSCLPA L
Subjt:  IDLQDKMTGISLYGIISEIVNERNATEAVFSMIIEDKTGEILAKLHFARSWSLGRVGVGHTVYISGLTCTMNK-NRLEALWIENHVGASFVNLSCLPALL

Query:  TSSCLHKLSRLSDLT
        TSSC+H +S LS ++
Subjt:  TSSCLHKLSRLSDLT


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCTTCTCGGGGCCGACATTTCAACGCCGGCGGAAGCTCAGCCATGGAGTTAGATGATCGCCGACGGCTGCAGGAAGAAGATGATGATGATCCGTTTCTTAAATTTGT
CGATTACGCGAGGTCTGTGCTAGCATTCGAAGACGAAGAAGACTACGATCCCAATGTCAATGGAACGGAGACCAATACGCCGGGTTGGAGTTGGATCGCCTCTCGGGTCC
TCAGAACTTGTATCGCCTACTCCAGTTCTGTTACGCCTGCGATCTTGCTATCTGAGCTCTCGCAGTGTCTCATCAGCCAAGAGGCTGTTTCAACTGTCCATCATGCAATT
ATCCAATTGAGAAATGATCGGTTGCGTCCCAAGCTTCATATTCTGGATTGGGAACAAGAATTCCACAGGGGTCAGAGGATTTCACCTGATTTAGATGGACAAGACAAAGC
ACAGTTGTTGACAGATATTGTACAAATTCATGATTTTGTTTTGAAGGCCTGGTATGAGCAGCACAGAGTTGGGGCTCCCAAGAAAATACCTGAATGTATTAATCAGTTGA
AGAAGAAGAATAGGAGAAAGAAGCTCCCAAAAACAGCTACTATCGACTCCATATATGAGAAGAATTTCCTATCTTTAAGTAGTGTATTGGAAGCTGTTATTATTGAGGAG
TTTATTCTTCCAGATTTTATCCCTAAGCATCAACTTCCATCTCTGTATTTTCATCTTGATTTGAGGTTGTGTTACAACTCTACAAACTGTACAGTACTTGCAATCTTCAT
GTTTCATGGAAGAACAAGTAAAACAAGAATCTTTGATGGCAGATTCTATGACTTAGTGGATGGGATTTTGAAGAAAGGGAGGCAAATATTTTTAACTGGATGCTATCTTC
GAGCTGCCAGTGGCGGATCTGGTCATCCACGACTTCTGCCAACTGAATACCTTATCATATTGTTGGATGAGGAAGAAGACGATGATGTAATACTTCTAGGGGCTCAATTT
TGTTCTGACTCCTTTTCTTCTGTTTCTCTTGATGCCGTCAATAAAGGGACTACATATTCATTATATGCAAGGATTGAATCTATTGGTCCAATGGAAATTCATGAGAAGAC
TAACGGCTTACAGATGATACAAATCATTCTTGTTGATAATGATGGTTTCAAGTTAAAGTTTCTCTTATGGGGTGAACAGGTGATACTAGCCAATCTTTTAAGTGTTGGTA
GCTTGCTTGCACTTGATAGACCATATATTGCAACTGTAAACGAGAATGGCATTGGAACAAGTGATGAACTTTGTCTTGAATATGGTAGTGCAACACAGCTGTATTTGGTG
CCTTGCATTCAGCATGAGGAGCAAGTATGTGTTTTAACACAGAATATAAACCAAGCTTCAAGGATGCTTAGTACATCGTATCCTACTCAGGGTCCCCGAATTTCTCAAGT
TTCCTTGCCCTGTGATTCACAGGGGACAATTGATTTTGGTAATTATCCTTATCGGTCTTTTGTGATCGACCTTCAAGACAAGATGACTGGCATTAGCTTATATGGTATCA
TTTCAGAAATAGTTAATGAAAGAAATGCCACAGAAGCTGTTTTCTCTATGATAATTGAAGATAAAACTGGAGAAATTTTGGCAAAGTTACACTTCGCGAGATCTTGGTCG
CTGGGAAGGGTAGGCGTTGGACATACAGTATATATAAGTGGCCTGACATGCACCATGAACAAGAATCGCTTGGAGGCTTTATGGATTGAGAATCATGTTGGAGCTTCTTT
TGTCAACCTTAGCTGCTTGCCAGCGTTGTTAACTTCATCTTGTCTTCATAAACTTTCACGACTTTCAGATCTTACCTGCAATGCTCATGGTACAAAGGACCACAAGGGAG
CTCTAATCGTATATATCCATGAAACCCTAGCTCTCTCAACAAGCTACCAAGCTGAAAGGATGTCTGGTAGCCATTCATCTATATCCTTTGGAGTACAAGTGAAGCAAAAC
TCCATCCCTTTGTTGCTTAGGACCAGCGAATTAGGAATTCCTCATTCTATAACAGAAATTATGTCAAATGGGCTATTTTCAAAACAATATTACACAAATGTCCAAATCTC
TTAA
mRNA sequenceShow/hide mRNA sequence
ATGTCTTCTCGGGGCCGACATTTCAACGCCGGCGGAAGCTCAGCCATGGAGTTAGATGATCGCCGACGGCTGCAGGAAGAAGATGATGATGATCCGTTTCTTAAATTTGT
CGATTACGCGAGGTCTGTGCTAGCATTCGAAGACGAAGAAGACTACGATCCCAATGTCAATGGAACGGAGACCAATACGCCGGGTTGGAGTTGGATCGCCTCTCGGGTCC
TCAGAACTTGTATCGCCTACTCCAGTTCTGTTACGCCTGCGATCTTGCTATCTGAGCTCTCGCAGTGTCTCATCAGCCAAGAGGCTGTTTCAACTGTCCATCATGCAATT
ATCCAATTGAGAAATGATCGGTTGCGTCCCAAGCTTCATATTCTGGATTGGGAACAAGAATTCCACAGGGGTCAGAGGATTTCACCTGATTTAGATGGACAAGACAAAGC
ACAGTTGTTGACAGATATTGTACAAATTCATGATTTTGTTTTGAAGGCCTGGTATGAGCAGCACAGAGTTGGGGCTCCCAAGAAAATACCTGAATGTATTAATCAGTTGA
AGAAGAAGAATAGGAGAAAGAAGCTCCCAAAAACAGCTACTATCGACTCCATATATGAGAAGAATTTCCTATCTTTAAGTAGTGTATTGGAAGCTGTTATTATTGAGGAG
TTTATTCTTCCAGATTTTATCCCTAAGCATCAACTTCCATCTCTGTATTTTCATCTTGATTTGAGGTTGTGTTACAACTCTACAAACTGTACAGTACTTGCAATCTTCAT
GTTTCATGGAAGAACAAGTAAAACAAGAATCTTTGATGGCAGATTCTATGACTTAGTGGATGGGATTTTGAAGAAAGGGAGGCAAATATTTTTAACTGGATGCTATCTTC
GAGCTGCCAGTGGCGGATCTGGTCATCCACGACTTCTGCCAACTGAATACCTTATCATATTGTTGGATGAGGAAGAAGACGATGATGTAATACTTCTAGGGGCTCAATTT
TGTTCTGACTCCTTTTCTTCTGTTTCTCTTGATGCCGTCAATAAAGGGACTACATATTCATTATATGCAAGGATTGAATCTATTGGTCCAATGGAAATTCATGAGAAGAC
TAACGGCTTACAGATGATACAAATCATTCTTGTTGATAATGATGGTTTCAAGTTAAAGTTTCTCTTATGGGGTGAACAGGTGATACTAGCCAATCTTTTAAGTGTTGGTA
GCTTGCTTGCACTTGATAGACCATATATTGCAACTGTAAACGAGAATGGCATTGGAACAAGTGATGAACTTTGTCTTGAATATGGTAGTGCAACACAGCTGTATTTGGTG
CCTTGCATTCAGCATGAGGAGCAAGTATGTGTTTTAACACAGAATATAAACCAAGCTTCAAGGATGCTTAGTACATCGTATCCTACTCAGGGTCCCCGAATTTCTCAAGT
TTCCTTGCCCTGTGATTCACAGGGGACAATTGATTTTGGTAATTATCCTTATCGGTCTTTTGTGATCGACCTTCAAGACAAGATGACTGGCATTAGCTTATATGGTATCA
TTTCAGAAATAGTTAATGAAAGAAATGCCACAGAAGCTGTTTTCTCTATGATAATTGAAGATAAAACTGGAGAAATTTTGGCAAAGTTACACTTCGCGAGATCTTGGTCG
CTGGGAAGGGTAGGCGTTGGACATACAGTATATATAAGTGGCCTGACATGCACCATGAACAAGAATCGCTTGGAGGCTTTATGGATTGAGAATCATGTTGGAGCTTCTTT
TGTCAACCTTAGCTGCTTGCCAGCGTTGTTAACTTCATCTTGTCTTCATAAACTTTCACGACTTTCAGATCTTACCTGCAATGCTCATGGTACAAAGGACCACAAGGGAG
CTCTAATCGTATATATCCATGAAACCCTAGCTCTCTCAACAAGCTACCAAGCTGAAAGGATGTCTGGTAGCCATTCATCTATATCCTTTGGAGTACAAGTGAAGCAAAAC
TCCATCCCTTTGTTGCTTAGGACCAGCGAATTAGGAATTCCTCATTCTATAACAGAAATTATGTCAAATGGGCTATTTTCAAAACAATATTACACAAATGTCCAAATCTC
TTAA
Protein sequenceShow/hide protein sequence
MSSRGRHFNAGGSSAMELDDRRRLQEEDDDDPFLKFVDYARSVLAFEDEEDYDPNVNGTETNTPGWSWIASRVLRTCIAYSSSVTPAILLSELSQCLISQEAVSTVHHAI
IQLRNDRLRPKLHILDWEQEFHRGQRISPDLDGQDKAQLLTDIVQIHDFVLKAWYEQHRVGAPKKIPECINQLKKKNRRKKLPKTATIDSIYEKNFLSLSSVLEAVIIEE
FILPDFIPKHQLPSLYFHLDLRLCYNSTNCTVLAIFMFHGRTSKTRIFDGRFYDLVDGILKKGRQIFLTGCYLRAASGGSGHPRLLPTEYLIILLDEEEDDDVILLGAQF
CSDSFSSVSLDAVNKGTTYSLYARIESIGPMEIHEKTNGLQMIQIILVDNDGFKLKFLLWGEQVILANLLSVGSLLALDRPYIATVNENGIGTSDELCLEYGSATQLYLV
PCIQHEEQVCVLTQNINQASRMLSTSYPTQGPRISQVSLPCDSQGTIDFGNYPYRSFVIDLQDKMTGISLYGIISEIVNERNATEAVFSMIIEDKTGEILAKLHFARSWS
LGRVGVGHTVYISGLTCTMNKNRLEALWIENHVGASFVNLSCLPALLTSSCLHKLSRLSDLTCNAHGTKDHKGALIVYIHETLALSTSYQAERMSGSHSSISFGVQVKQN
SIPLLLRTSELGIPHSITEIMSNGLFSKQYYTNVQIS