; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0004521 (gene) of Snake gourd v1 genome

Gene IDTan0004521
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionNUFIP1 domain-containing protein
Genome locationLG11:8218820..8223491
RNA-Seq ExpressionTan0004521
SyntenyTan0004521
Gene Ontology termsGO:0000492 - box C/D snoRNP assembly (biological process)
GO:0005634 - nucleus (cellular component)
InterPro domainsIPR019496 - Nuclear fragile X mental retardation-interacting protein 1, conserved domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6578837.1 hypothetical protein SDJN03_23285, partial [Cucurbita argyrosperma subsp. sororia]1.0e-25981.16Show/hide
Query:  PPDSSSQQLPNSSLAISVNGLQNPIQIQPQNQAPFCNPNAHMNNLHGNAVPNMAPPMFQPGAMMNLQNPLMALHNNPLGASPFVPGHMGFANSAGNFQAQ
        PP +SSQQLPNSSLA S NGLQNPIQIQPQNQAPFCNPNAH+NNLHGN VPNM PPMFQPG MMNLQNPLMAL NNPLGASPF PGHMGFAN+A NF AQ
Subjt:  PPDSSSQQLPNSSLAISVNGLQNPIQIQPQNQAPFCNPNAHMNNLHGNAVPNMAPPMFQPGAMMNLQNPLMALHNNPLGASPFVPGHMGFANSAGNFQAQ

Query:  GQFNVVPSLNQMNMNSCLPLAQFFGQNMPNLVQQLNQNMGLSNGQFCLPFQNMNQHVIPGQMLNM-SQVP-HNSYGGPN-QAVPMAF--------QPFGV
        GQFN++P++NQMNMNSCLPLAQFFGQNMPNLVQQL QNMGL+NGQFCLPFQNMNQHVIPGQMLNM SQVP H SYG PN QAVPM F        QPFGV
Subjt:  GQFNVVPSLNQMNMNSCLPLAQFFGQNMPNLVQQLNQNMGLSNGQFCLPFQNMNQHVIPGQMLNM-SQVP-HNSYGGPN-QAVPMAF--------QPFGV

Query:  NQTMLPVNQNPQNLMPQAMGGAGSNQLSGSALPLQGNSTMPFNSSTQPQQARNLQSPAFVGSQGNTSINDGGNGPNSFSNNLAHRNFTRNSKQGFQKNQT
        NQ M PVNQNPQN +PQAMGGAGSNQL GSA PLQGNSTMPFNS TQPQQARNLQSPA VGSQGN+SI+DGGNG NSFSNNLAHRNFTRNS +GFQK+Q 
Subjt:  NQTMLPVNQNPQNLMPQAMGGAGSNQLSGSALPLQGNSTMPFNSSTQPQQARNLQSPAFVGSQGNTSINDGGNGPNSFSNNLAHRNFTRNSKQGFQKNQT

Query:  HHMKNEKKKFGFPGGQKGKGFHNERRNTFGSASSTDQAKYQKRSLSLVYSEQEIRQWREARRKNYPSSTNIQKKLTEKQTDCTLVDKEAQLLRQELKEIL
        HHMKNEKKKFG PGG KGKGFHNERRN FG A+ST+  K QKRSLSLVY++QEI QWREARRKN+PSSTNIQKKLTEKQTDCTLVDKEAQLLRQELKEIL
Subjt:  HHMKNEKKKFGFPGGQKGKGFHNERRNTFGSASSTDQAKYQKRSLSLVYSEQEIRQWREARRKNYPSSTNIQKKLTEKQTDCTLVDKEAQLLRQELKEIL

Query:  AKQAELGVEVAEIPPEYLSYSEKRDNRKRRGDLS-VGEEAEGASAGKEKSRNRFNKRGRPEKKNRLRKKGKSEKHFSNTTPPNKREPTLLQKLLKADVKR
        AKQAELGVEVAEIP EYLSYSEK D+ KR GDLS +GEEAEGAS GKEK+RNRFNKR RPEKKNR RKK K +KH SN +P  KREPTL QKLL+ADVKR
Subjt:  AKQAELGVEVAEIPPEYLSYSEKRDNRKRRGDLS-VGEEAEGASAGKEKSRNRFNKRGRPEKKNRLRKKGKSEKHFSNTTPPNKREPTLLQKLLKADVKR

Query:  DKSQLLQALRFMVMNSFFKEWPSKPLKFPLVMVKENGGEINVVDEKSLSTRSSNLQGTKNLMVENNDNHDIDNDGENDDDDNDNNEKFKGDVIQVLEEEE
        DKS +LQALRFMVMNSFF EWP+KPL FP V+VKE+G EI VVDEKSLST S NLQ T + MVEN  +  I +D E+DDD++DN+EKF+GD I +LEEEE
Subjt:  DKSQLLQALRFMVMNSFFKEWPSKPLKFPLVMVKENGGEINVVDEKSLSTRSSNLQGTKNLMVENNDNHDIDNDGENDDDDNDNNEKFKGDVIQVLEEEE

Query:  GEIID
        GEIID
Subjt:  GEIID

XP_022939720.1 uncharacterized protein LOC111445522 [Cucurbita moschata]4.2e-26181.49Show/hide
Query:  PPDSSSQQLPNSSLAISVNGLQNPIQIQPQNQAPFCNPNAHMNNLHGNAVPNMAPPMFQPGAMMNLQNPLMALHNNPLGASPFVPGHMGFANSAGNFQAQ
        PP +SSQQLPNSSLA S NGLQNPIQIQPQNQAPFCNPNAH+NNLHGN VPNM PPMFQPG MMNLQNPLMAL NNPLGASPF PGHMGFANSA NF  Q
Subjt:  PPDSSSQQLPNSSLAISVNGLQNPIQIQPQNQAPFCNPNAHMNNLHGNAVPNMAPPMFQPGAMMNLQNPLMALHNNPLGASPFVPGHMGFANSAGNFQAQ

Query:  GQFNVVPSLNQMNMNSCLPLAQFFGQNMPNLVQQLNQNMGLSNGQFCLPFQNMNQHVIPGQMLNM-SQVP-HNSYGGPN-QAVPMAF--------QPFGV
        GQFN++P++NQMNMNSCLPLAQFFGQNMPNLVQQL QNMGL+NGQFCLPFQNMNQHVIPGQMLNM SQVP H SYG PN QAVPM F        QPFGV
Subjt:  GQFNVVPSLNQMNMNSCLPLAQFFGQNMPNLVQQLNQNMGLSNGQFCLPFQNMNQHVIPGQMLNM-SQVP-HNSYGGPN-QAVPMAF--------QPFGV

Query:  NQTMLPVNQNPQNLMPQAMGGAGSNQLSGSALPLQGNSTMPFNSSTQPQQARNLQSPAFVGSQGNTSINDGGNGPNSFSNNLAHRNFTRNSKQGFQKNQT
        NQ M PVNQNPQN +PQAMGGAGSNQL GSA PLQGNSTMPFNS TQPQQARNLQSPAFVGSQGN+SI+DGGNG NSFSNNLAHRNFTRNS +GFQK+Q 
Subjt:  NQTMLPVNQNPQNLMPQAMGGAGSNQLSGSALPLQGNSTMPFNSSTQPQQARNLQSPAFVGSQGNTSINDGGNGPNSFSNNLAHRNFTRNSKQGFQKNQT

Query:  HHMKNEKKKFGFPGGQKGKGFHNERRNTFGSASSTDQAKYQKRSLSLVYSEQEIRQWREARRKNYPSSTNIQKKLTEKQTDCTLVDKEAQLLRQELKEIL
        HHMKNEKKKFG PGG KGKGFHNERRN FG A+ST+  K QKRSLSLVY++QEI QWREARRKN+PSSTNIQKKLTEKQTDCTLVDKEAQLLRQELKEIL
Subjt:  HHMKNEKKKFGFPGGQKGKGFHNERRNTFGSASSTDQAKYQKRSLSLVYSEQEIRQWREARRKNYPSSTNIQKKLTEKQTDCTLVDKEAQLLRQELKEIL

Query:  AKQAELGVEVAEIPPEYLSYSEKRDNRKRRGDLS-VGEEAEGASAGKEKSRNRFNKRGRPEKKNRLRKKGKSEKHFSNTTPPNKREPTLLQKLLKADVKR
        AKQAELGVEVAEIP EYLSYSEK D+ KR GDLS +GEEAEGAS GKEK+RNRFNKR RPEKKNR RKK K +KH SN +P  KREPTL QKLL+ADVKR
Subjt:  AKQAELGVEVAEIPPEYLSYSEKRDNRKRRGDLS-VGEEAEGASAGKEKSRNRFNKRGRPEKKNRLRKKGKSEKHFSNTTPPNKREPTLLQKLLKADVKR

Query:  DKSQLLQALRFMVMNSFFKEWPSKPLKFPLVMVKENGGEINVVDEKSLSTRSSNLQGTKNLMVENNDNHDIDNDGENDDDDNDNNEKFKGDVIQVLEEEE
        DKS +LQALRFMVMNSFF EWP+KPL FP V+VKE+G EI VVDEKSLST S NLQ T + MVEN  +  I +D E+DDD++DNNEKF+GD I +LEEEE
Subjt:  DKSQLLQALRFMVMNSFFKEWPSKPLKFPLVMVKENGGEINVVDEKSLSTRSSNLQGTKNLMVENNDNHDIDNDGENDDDDNDNNEKFKGDVIQVLEEEE

Query:  GEIID
        GEIID
Subjt:  GEIID

XP_022992649.1 uncharacterized protein LOC111488934 [Cucurbita maxima]1.6e-25781.16Show/hide
Query:  PPDSSSQQLPNSSLAISVNGLQNPIQIQPQNQAPFCNPNAHMNNLHGNAVPNMAPPMFQPGAMMNLQNPLMALHNNPLGASPFVPGHMGFANSAGNFQAQ
        PP +SSQQLPNSSLA S NGLQNPIQIQ QNQA FCNPNAH+NNLHGN VPNM PPMFQPG MMNLQNPLMAL NNPLGASPF PGHMGFANSA NF AQ
Subjt:  PPDSSSQQLPNSSLAISVNGLQNPIQIQPQNQAPFCNPNAHMNNLHGNAVPNMAPPMFQPGAMMNLQNPLMALHNNPLGASPFVPGHMGFANSAGNFQAQ

Query:  GQFNVVPSLNQMNMNSCLPLAQFFGQNMPNLVQQLNQNMGLSNGQFCLPFQNMNQHVIPGQMLNM-SQVP-HNSYGGPN-QAVPMAF--------QPFGV
        GQFN+VP++NQMNMNSCLP AQFFGQNMPNLVQQL QNMGLSNGQFCLPFQNMNQHVIPGQMLNM SQVP H SYG PN QAVPM F        QPFGV
Subjt:  GQFNVVPSLNQMNMNSCLPLAQFFGQNMPNLVQQLNQNMGLSNGQFCLPFQNMNQHVIPGQMLNM-SQVP-HNSYGGPN-QAVPMAF--------QPFGV

Query:  NQTMLPVNQNPQNLMPQAMGGAGSNQLSGSALPLQGNSTMPFNSSTQPQQARNLQSPAFVGSQGNTSINDGGNGPNSFSNNLAHRNFTRNSKQGFQKNQT
        NQTM PVNQNPQN  PQAMGGAGSNQL GSA PLQ NSTMPFNS TQPQQ RNLQSPAFVGSQGN+SI+DGGNG NSFSNNLAHRNFTRNS +GFQK+Q 
Subjt:  NQTMLPVNQNPQNLMPQAMGGAGSNQLSGSALPLQGNSTMPFNSSTQPQQARNLQSPAFVGSQGNTSINDGGNGPNSFSNNLAHRNFTRNSKQGFQKNQT

Query:  HHMKNEKKKFGFPGGQKGKGFHNERRNTFGSASSTDQAKYQKRSLSLVYSEQEIRQWREARRKNYPSSTNIQKKLTEKQTDCTLVDKEAQLLRQELKEIL
        HHMKNEKKKFG PGG KGKGFHNERRN FG  +ST+  K QKRSLSLVY++QEI QWREARRKN+PSSTNIQKKLTEKQTDCTLVDKEAQLLRQELKEIL
Subjt:  HHMKNEKKKFGFPGGQKGKGFHNERRNTFGSASSTDQAKYQKRSLSLVYSEQEIRQWREARRKNYPSSTNIQKKLTEKQTDCTLVDKEAQLLRQELKEIL

Query:  AKQAELGVEVAEIPPEYLSYSEKRDNRKRRGDLS-VGEEAEGASAGKEKSRNRFNKRGRPEKKNRLRKKGKSEKHFSNTTPPNKREPTLLQKLLKADVKR
        AKQAELGVEVAEIP EYLSYSEK D+ KR GDLS +GEEAEGAS GKEK+RNRFNKR RPEKKNR RKKGK +KH SN +P  KRE TL QKLL+ADVKR
Subjt:  AKQAELGVEVAEIPPEYLSYSEKRDNRKRRGDLS-VGEEAEGASAGKEKSRNRFNKRGRPEKKNRLRKKGKSEKHFSNTTPPNKREPTLLQKLLKADVKR

Query:  DKSQLLQALRFMVMNSFFKEWPSKPLKFPLVMVKENGGEINVVDEKSLSTRSSNLQGTKNLMVENNDNHDIDNDGENDDDDNDNNEKFKGDVIQVLEEEE
        DKS +LQALRFMVMNSFF EWP+KPL FP V+VKE+G EI VVDEKSLST S NLQ T N MVEN     I +D E+DDD++DN+EKF+GD I +LEEEE
Subjt:  DKSQLLQALRFMVMNSFFKEWPSKPLKFPLVMVKENGGEINVVDEKSLSTRSSNLQGTKNLMVENNDNHDIDNDGENDDDDNDNNEKFKGDVIQVLEEEE

Query:  GEIID
        GEIID
Subjt:  GEIID

XP_023550423.1 uncharacterized protein LOC111808573 [Cucurbita pepo subsp. pepo]3.8e-26281.82Show/hide
Query:  PPDSSSQQLPNSSLAISVNGLQNPIQIQPQNQAPFCNPNAHMNNLHGNAVPNMAPPMFQPGAMMNLQNPLMALHNNPLGASPFVPGHMGFANSAGNFQAQ
        PP +SSQQLPNSSLA S NGLQNPIQIQPQNQAPFCNPNAH+NNLHGN VPNM PPMFQPG MMNLQNPLMAL NNPLGASPF PGHMGFANSA NF AQ
Subjt:  PPDSSSQQLPNSSLAISVNGLQNPIQIQPQNQAPFCNPNAHMNNLHGNAVPNMAPPMFQPGAMMNLQNPLMALHNNPLGASPFVPGHMGFANSAGNFQAQ

Query:  GQFNVVPSLNQMNMNSCLPLAQFFGQNMPNLVQQLNQNMGLSNGQFCLPFQNMNQHVIPGQMLNM-SQVP-HNSYGGPN-QAVPMAF--------QPFGV
        GQFN+VP++NQMNMNSCLPLAQFFGQNMPNLVQQL QNMGLSNGQFCLPFQNMNQHVIPGQMLNM SQVP H SYG PN QAVPM F        QPFGV
Subjt:  GQFNVVPSLNQMNMNSCLPLAQFFGQNMPNLVQQLNQNMGLSNGQFCLPFQNMNQHVIPGQMLNM-SQVP-HNSYGGPN-QAVPMAF--------QPFGV

Query:  NQTMLPVNQNPQNLMPQAMGGAGSNQLSGSALPLQGNSTMPFNSSTQPQQARNLQSPAFVGSQGNTSINDGGNGPNSFSNNLAHRNFTRNSKQGFQKNQT
        NQ M PVNQNPQN +PQAMGGAGSNQL GSA PLQGNSTMPFNS TQPQQARNLQSPAFVGSQGN+SI+DGGNG NSFSNNLAHRNFTRNS +GFQK+Q 
Subjt:  NQTMLPVNQNPQNLMPQAMGGAGSNQLSGSALPLQGNSTMPFNSSTQPQQARNLQSPAFVGSQGNTSINDGGNGPNSFSNNLAHRNFTRNSKQGFQKNQT

Query:  HHMKNEKKKFGFPGGQKGKGFHNERRNTFGSASSTDQAKYQKRSLSLVYSEQEIRQWREARRKNYPSSTNIQKKLTEKQTDCTLVDKEAQLLRQELKEIL
        HHMKNEKKKFG PGG KGKGFHNERRN FG A+ST+  K QKRSLSLVY++QEI QWREARRKN+PSSTNIQKKLTEKQ+DCT+VDKEAQLLRQELKEIL
Subjt:  HHMKNEKKKFGFPGGQKGKGFHNERRNTFGSASSTDQAKYQKRSLSLVYSEQEIRQWREARRKNYPSSTNIQKKLTEKQTDCTLVDKEAQLLRQELKEIL

Query:  AKQAELGVEVAEIPPEYLSYSEKRDNRKRRGDLS-VGEEAEGASAGKEKSRNRFNKRGRPEKKNRLRKKGKSEKHFSNTTPPNKREPTLLQKLLKADVKR
        AKQAELGVEVAEIP EYLSYSEK D+ KR GDLS +GEEAEGAS GKEK+RNRFNKR RPEKKNR RKK K +KH SN +P  KREPTL QKLL+ADVKR
Subjt:  AKQAELGVEVAEIPPEYLSYSEKRDNRKRRGDLS-VGEEAEGASAGKEKSRNRFNKRGRPEKKNRLRKKGKSEKHFSNTTPPNKREPTLLQKLLKADVKR

Query:  DKSQLLQALRFMVMNSFFKEWPSKPLKFPLVMVKENGGEINVVDEKSLSTRSSNLQGTKNLMVENNDNHDIDNDGENDDDDNDNNEKFKGDVIQVLEEEE
        DKS +LQALRFMVMNSFF EWP+KPL FP V+VKE+G EI VVDEKSLST S NLQ T + MVEN  +H I +D E+DDD++DN+EKFKGD I +LEEEE
Subjt:  DKSQLLQALRFMVMNSFFKEWPSKPLKFPLVMVKENGGEINVVDEKSLSTRSSNLQGTKNLMVENNDNHDIDNDGENDDDDNDNNEKFKGDVIQVLEEEE

Query:  GEIID
        GEIID
Subjt:  GEIID

XP_038885674.1 uncharacterized protein LOC120075982 [Benincasa hispida]3.5e-26081.46Show/hide
Query:  PPDSSSQQLPNSSLAISVNGLQNPIQIQPQNQAPFCNPNAHMNNLHGNAVPNMAPPMFQPGAMMNLQNPLMALHNNPLGASPFVPGHMGFANSAGNFQAQ
        PP  SSQQLPN+SLA S NG         QNQAPFCNPN H+NNLHGN VPNM PPMFQPG MMNLQNPLM L NNPL ASPF PGH+GFANSA N+ AQ
Subjt:  PPDSSSQQLPNSSLAISVNGLQNPIQIQPQNQAPFCNPNAHMNNLHGNAVPNMAPPMFQPGAMMNLQNPLMALHNNPLGASPFVPGHMGFANSAGNFQAQ

Query:  GQFNVVPSLNQMNMNSCLPLAQFFGQNMPNLVQQLNQNMGLSNGQFCLPFQNMNQHVIPGQMLNMSQVP-HNSYGGPN-QAVPMAF--------QPFGVN
        GQFN+VP++NQMNMN+CLPLAQFFGQNMPNLVQQL QNMGL+NGQFCLPFQNMNQHVIPGQM+NMSQVP H SYGGPN QA+PM F        QPFGVN
Subjt:  GQFNVVPSLNQMNMNSCLPLAQFFGQNMPNLVQQLNQNMGLSNGQFCLPFQNMNQHVIPGQMLNMSQVP-HNSYGGPN-QAVPMAF--------QPFGVN

Query:  QTMLPVNQNPQNLMPQAMGGAGSNQLSGSALPLQGNSTMPFNSSTQPQQARNLQSPAFVGSQGNTSINDGGNGPNSFSNNLAHRNFTRNSKQGFQKNQTH
        Q M PVNQNPQN  PQAMGGAGSNQL  SA PLQGNSTM  NSSTQPQQARNLQSPAFVGSQGN+SI+DGGNG NSFSNNLAHRNFTRNSK+GFQKNQ H
Subjt:  QTMLPVNQNPQNLMPQAMGGAGSNQLSGSALPLQGNSTMPFNSSTQPQQARNLQSPAFVGSQGNTSINDGGNGPNSFSNNLAHRNFTRNSKQGFQKNQTH

Query:  HMKNEKKKFGFPGGQKGKGFHNERRNTFGSASSTDQAKYQKRSLSLVYSEQEIRQWREARRKNYPSSTNIQKKLTEKQTDCTLVDKEAQLLRQELKEILA
        HMKNEKKKFGFPGGQKGKGFHNERRN FG A+STDQ K QKRSLSLVY++QEIRQWREARRKNYPSSTN+QKKLTEKQTDCTLVDKEAQLLR+ELKEILA
Subjt:  HMKNEKKKFGFPGGQKGKGFHNERRNTFGSASSTDQAKYQKRSLSLVYSEQEIRQWREARRKNYPSSTNIQKKLTEKQTDCTLVDKEAQLLRQELKEILA

Query:  KQAELGVEVAEIPPEYLSYSEKRDNRKRRGDLS-VGEEAEGASAGKEKSRNRFNKRGRPEKKNRLRKKGKSEKHFSNTTPPNKREPTLLQKLLKADVKRD
        KQAELGVEVAEIPPEYLSYSEK +NRK R D S +GEE +GAS GKEKSRNRFNKRGRPEKKNR RKKGKSEKH SN     KREPTLLQKLL+ADV+R+
Subjt:  KQAELGVEVAEIPPEYLSYSEKRDNRKRRGDLS-VGEEAEGASAGKEKSRNRFNKRGRPEKKNRLRKKGKSEKHFSNTTPPNKREPTLLQKLLKADVKRD

Query:  KSQLLQALRFMVMNSFFKEWPSKPLKFPLVMVKENGGEINVVDEKSLSTRSSNLQGTKNLMVENNDNHDIDNDGENDDDDNDNNEKFKGDVIQVLEEEEG
        KSQLLQALRFMVMNSFFKEWP+KPLKFP VMVKEN  EIN+VDE SLS  + NLQ T N +VENN +H+ID+D ENDDDD DNNEKFKGD IQVL EEEG
Subjt:  KSQLLQALRFMVMNSFFKEWPSKPLKFPLVMVKENGGEINVVDEKSLSTRSSNLQGTKNLMVENNDNHDIDNDGENDDDDNDNNEKFKGDVIQVLEEEEG

Query:  EIID
        EIID
Subjt:  EIID

TrEMBL top hitse value%identityAlignment
A0A1S3C3B2 uncharacterized protein LOC103496534 isoform X26.3e-24778.36Show/hide
Query:  MLRPPPPDSSSQQLPNSSLAISVNGLQNPIQIQPQNQAPFCNPNAHMNNLHGNAVPNMAPPMFQPGAMMNLQNPLMALHNNPLGASPFVPGHMGFANSAG
        M+RPP P  SSQQ+PNSSLA S NG         QNQAPFCNPN H NNL GN VP M PPMFQPG MMNLQNPLM L NNPLGASPF PGHMGFANSA 
Subjt:  MLRPPPPDSSSQQLPNSSLAISVNGLQNPIQIQPQNQAPFCNPNAHMNNLHGNAVPNMAPPMFQPGAMMNLQNPLMALHNNPLGASPFVPGHMGFANSAG

Query:  NFQAQGQFNVVPSLNQMNMNSCLPLAQFFGQNMPNLVQQLNQNMGLSNGQFCLPFQNMNQHVIPGQMLNMSQVP-HNSYGGPN-QAVPMAF--------Q
        NF AQGQFN++P++NQMNMNSCLPLAQFFGQNMPNLVQQL QNMGL+NGQFCLPFQNMNQHVIPGQM+NMSQVP H SYGGPN QAVPM F        Q
Subjt:  NFQAQGQFNVVPSLNQMNMNSCLPLAQFFGQNMPNLVQQLNQNMGLSNGQFCLPFQNMNQHVIPGQMLNMSQVP-HNSYGGPN-QAVPMAF--------Q

Query:  PFGVNQTMLPVNQNPQNLMPQAMGGAGSNQLSGSALPLQGNSTMPFNSSTQPQQARNLQSPAFVGSQGNTSINDGGNGPNSFSNNLAHRNFTRNSKQGFQ
        PFGVNQ M PVNQNPQN +PQAMGG+GSNQ   S  PLQGNSTMP NSSTQPQQARNLQSPAF G+QGN+SI+DGGNG NS SNN AHRNF RNSK+GFQ
Subjt:  PFGVNQTMLPVNQNPQNLMPQAMGGAGSNQLSGSALPLQGNSTMPFNSSTQPQQARNLQSPAFVGSQGNTSINDGGNGPNSFSNNLAHRNFTRNSKQGFQ

Query:  KNQTHHMKNEKKKFGFPGGQKGKGFHNERRNTFGSASSTDQAKYQKRSLSLVYSEQEIRQWREARRKNYPSSTNIQKKLTEKQTDCTLVDKEAQLLRQEL
        KNQTHHMKNEKK+FGFPGGQK KGFHNERRN F   +STDQ K QKRSLSLVY++QEIRQWREARRKNYPSSTNIQKKL EKQT+CTLV++EAQLLRQEL
Subjt:  KNQTHHMKNEKKKFGFPGGQKGKGFHNERRNTFGSASSTDQAKYQKRSLSLVYSEQEIRQWREARRKNYPSSTNIQKKLTEKQTDCTLVDKEAQLLRQEL

Query:  KEILAKQAELGVEVAEIPPEYLSYSEKRDNRKRRGDLS-VGEEAEGASAGKEKSRNRFNKRGRPEKKNRLRKKGKSEKHFSNTTPPNKREPTLLQKLLKA
        KEILAKQAELGVEVAEIPPEYLSYSEK DNRK+RG  S +GEEA+GAS  KEKS+NR NKRGR +KKNR RKKGK EKH SN  P  KREPTLLQKLLKA
Subjt:  KEILAKQAELGVEVAEIPPEYLSYSEKRDNRKRRGDLS-VGEEAEGASAGKEKSRNRFNKRGRPEKKNRLRKKGKSEKHFSNTTPPNKREPTLLQKLLKA

Query:  DVKRDKSQLLQALRFMVMNSFFKEWPSKPLKFPLVMVKENGGEINVVDEKSLSTRSSNLQGT-KNLMVENNDNHDIDNDGENDDDDNDNNEKFKGDVIQV
        DV++DKSQLLQALRFMVMNSFFKEWP+KPLKFP V VKEN GE NVVDE  LST + NLQ T  N +VENN  HDI++D END +D+DN+EK KGD  QV
Subjt:  DVKRDKSQLLQALRFMVMNSFFKEWPSKPLKFPLVMVKENGGEINVVDEKSLSTRSSNLQGT-KNLMVENNDNHDIDNDGENDDDDNDNNEKFKGDVIQV

Query:  LEEEEGEIID
        L EEEGEIID
Subjt:  LEEEEGEIID

A0A5A7SKM0 Putative basic-leucine zipper transcription factor F isoform X16.3e-24778.36Show/hide
Query:  MLRPPPPDSSSQQLPNSSLAISVNGLQNPIQIQPQNQAPFCNPNAHMNNLHGNAVPNMAPPMFQPGAMMNLQNPLMALHNNPLGASPFVPGHMGFANSAG
        M+RPP P  SSQQ+PNSSLA S NG         QNQAPFCNPN H NNL GN VP M PPMFQPG MMNLQNPLM L NNPLGASPF PGHMGFANSA 
Subjt:  MLRPPPPDSSSQQLPNSSLAISVNGLQNPIQIQPQNQAPFCNPNAHMNNLHGNAVPNMAPPMFQPGAMMNLQNPLMALHNNPLGASPFVPGHMGFANSAG

Query:  NFQAQGQFNVVPSLNQMNMNSCLPLAQFFGQNMPNLVQQLNQNMGLSNGQFCLPFQNMNQHVIPGQMLNMSQVP-HNSYGGPN-QAVPMAF--------Q
        NF AQGQFN++P++NQMNMNSCLPLAQFFGQNMPNLVQQL QNMGL+NGQFCLPFQNMNQHVIPGQM+NMSQVP H SYGGPN QAVPM F        Q
Subjt:  NFQAQGQFNVVPSLNQMNMNSCLPLAQFFGQNMPNLVQQLNQNMGLSNGQFCLPFQNMNQHVIPGQMLNMSQVP-HNSYGGPN-QAVPMAF--------Q

Query:  PFGVNQTMLPVNQNPQNLMPQAMGGAGSNQLSGSALPLQGNSTMPFNSSTQPQQARNLQSPAFVGSQGNTSINDGGNGPNSFSNNLAHRNFTRNSKQGFQ
        PFGVNQ M PVNQNPQN +PQAMGG+GSNQ   S  PLQGNSTMP NSSTQPQQARNLQSPAF G+QGN+SI+DGGNG NS SNN AHRNF RNSK+GFQ
Subjt:  PFGVNQTMLPVNQNPQNLMPQAMGGAGSNQLSGSALPLQGNSTMPFNSSTQPQQARNLQSPAFVGSQGNTSINDGGNGPNSFSNNLAHRNFTRNSKQGFQ

Query:  KNQTHHMKNEKKKFGFPGGQKGKGFHNERRNTFGSASSTDQAKYQKRSLSLVYSEQEIRQWREARRKNYPSSTNIQKKLTEKQTDCTLVDKEAQLLRQEL
        KNQTHHMKNEKK+FGFPGGQK KGFHNERRN F   +STDQ K QKRSLSLVY++QEIRQWREARRKNYPSSTNIQKKL EKQT+CTLV++EAQLLRQEL
Subjt:  KNQTHHMKNEKKKFGFPGGQKGKGFHNERRNTFGSASSTDQAKYQKRSLSLVYSEQEIRQWREARRKNYPSSTNIQKKLTEKQTDCTLVDKEAQLLRQEL

Query:  KEILAKQAELGVEVAEIPPEYLSYSEKRDNRKRRGDLS-VGEEAEGASAGKEKSRNRFNKRGRPEKKNRLRKKGKSEKHFSNTTPPNKREPTLLQKLLKA
        KEILAKQAELGVEVAEIPPEYLSYSEK DNRK+RG  S +GEEA+GAS  KEKS+NR NKRGR +KKNR RKKGK EKH SN  P  KREPTLLQKLLKA
Subjt:  KEILAKQAELGVEVAEIPPEYLSYSEKRDNRKRRGDLS-VGEEAEGASAGKEKSRNRFNKRGRPEKKNRLRKKGKSEKHFSNTTPPNKREPTLLQKLLKA

Query:  DVKRDKSQLLQALRFMVMNSFFKEWPSKPLKFPLVMVKENGGEINVVDEKSLSTRSSNLQGT-KNLMVENNDNHDIDNDGENDDDDNDNNEKFKGDVIQV
        DV++DKSQLLQALRFMVMNSFFKEWP+KPLKFP V VKEN GE NVVDE  LST + NLQ T  N +VENN  HDI++D END +D+DN+EK KGD  QV
Subjt:  DVKRDKSQLLQALRFMVMNSFFKEWPSKPLKFPLVMVKENGGEINVVDEKSLSTRSSNLQGT-KNLMVENNDNHDIDNDGENDDDDNDNNEKFKGDVIQV

Query:  LEEEEGEIID
        L EEEGEIID
Subjt:  LEEEEGEIID

A0A6J1BY94 uncharacterized protein LOC111006547 isoform X11.7e-24776.27Show/hide
Query:  MLRPPPPDSSSQQLPNSSLAISVNGLQNPIQIQPQNQAPFCNPNAHMNNLHGNAVPNMAPPMFQPGAMMNLQNPLMALHNNPLGASPFVPGHMGFA----
        MLR PP  SSSQQLPN SLA S NGLQNP+QIQPQNQ  FCNPN HM+N+HGN VPNM PPMFQPG MMN  NPLMALHNNPL A+ F PGHMGFA    
Subjt:  MLRPPPPDSSSQQLPNSSLAISVNGLQNPIQIQPQNQAPFCNPNAHMNNLHGNAVPNMAPPMFQPGAMMNLQNPLMALHNNPLGASPFVPGHMGFA----

Query:  ----NSAGNFQAQGQFNVVPSLNQMNMNSCLPLAQFFGQNMPNLVQQLNQNMGLSNGQFCLPFQNMNQHVIPGQMLNMSQV-----PHNSYGGPNQAVPM
            N A NFQAQGQFN+VP +NQMNMN CLPLAQ FGQNMPNLVQQLNQNMG SNGQFCLP+QNMNQHVIPGQMLNMSQV     PH SYGGPNQAVPM
Subjt:  ----NSAGNFQAQGQFNVVPSLNQMNMNSCLPLAQFFGQNMPNLVQQLNQNMGLSNGQFCLPFQNMNQHVIPGQMLNMSQV-----PHNSYGGPNQAVPM

Query:  --------AFQPFGVNQTMLPVNQNPQNLMPQAMGGAGSNQLSGSALPLQGNSTMPFNSSTQPQQARNLQSPAFVGSQGNTSINDGGNGPNSFSNNLAHR
                  QPFGVNQ M  +NQNPQN  P AMGGAG  Q  GSA PLQGNSTMPFNSS QPQQARNLQSPA VGSQGN+SIN GGNGPNSFS NL  +
Subjt:  --------AFQPFGVNQTMLPVNQNPQNLMPQAMGGAGSNQLSGSALPLQGNSTMPFNSSTQPQQARNLQSPAFVGSQGNTSINDGGNGPNSFSNNLAHR

Query:  NFTRNSKQGFQKNQTHHMKNEKKKFGFPGGQKGKGFHNERRNTFGSASSTDQAKYQKRSLSLVYSEQEIRQWREARRKNYPSSTNIQKKLTEKQTDCTLV
        NFTRNSK+GFQKNQ HHMKNEKKKFGFPGGQKGKGFHN+RRN FG A STDQ K QKRSLS VY+EQEI+QWREARRKNYPSSTNI KKLTEKQ DCTLV
Subjt:  NFTRNSKQGFQKNQTHHMKNEKKKFGFPGGQKGKGFHNERRNTFGSASSTDQAKYQKRSLSLVYSEQEIRQWREARRKNYPSSTNIQKKLTEKQTDCTLV

Query:  DKEAQLLRQELKEILAKQAELGVEVAEIPPEYLSYSEKRDNRKRRGDLSVGEEAEGASAGKEKSRNRFNKRGRPEKKNRLRKKGKSEKHF------SNTT
        DKEA LLRQELKEILAKQAELGVEVAEIPPEYLSYSEKRDN+KRRGDL   EEAEG   GKEKSRNR NKRGRP+KK+R RKKGKSE+H       +N  
Subjt:  DKEAQLLRQELKEILAKQAELGVEVAEIPPEYLSYSEKRDNRKRRGDLSVGEEAEGASAGKEKSRNRFNKRGRPEKKNRLRKKGKSEKHF------SNTT

Query:  PPNKREPTLLQKLLKADVKRDKSQLLQALRFMVMNSFFKEWPSKPLKFPLVMVKENGGEINVVDE-KSLSTRSSNLQGTKNLMVE--NNDNHDIDNDGEN
        P  KREPTLLQKLLK DVKR+KSQLLQALRFMVMNSFFKEWP+KPLKFP V++KENGGEINVVDE  SLS+ +  LQ T N +VE   NDN +  ND ++
Subjt:  PPNKREPTLLQKLLKADVKRDKSQLLQALRFMVMNSFFKEWPSKPLKFPLVMVKENGGEINVVDE-KSLSTRSSNLQGTKNLMVE--NNDNHDIDNDGEN

Query:  DDDDNDNNEKFKGDVIQVLEEEEGEIID
        D+  ND+ EKFKG  + V EEEEGEIID
Subjt:  DDDDNDNNEKFKGDVIQVLEEEEGEIID

A0A6J1FNJ1 uncharacterized protein LOC1114455222.0e-26181.49Show/hide
Query:  PPDSSSQQLPNSSLAISVNGLQNPIQIQPQNQAPFCNPNAHMNNLHGNAVPNMAPPMFQPGAMMNLQNPLMALHNNPLGASPFVPGHMGFANSAGNFQAQ
        PP +SSQQLPNSSLA S NGLQNPIQIQPQNQAPFCNPNAH+NNLHGN VPNM PPMFQPG MMNLQNPLMAL NNPLGASPF PGHMGFANSA NF  Q
Subjt:  PPDSSSQQLPNSSLAISVNGLQNPIQIQPQNQAPFCNPNAHMNNLHGNAVPNMAPPMFQPGAMMNLQNPLMALHNNPLGASPFVPGHMGFANSAGNFQAQ

Query:  GQFNVVPSLNQMNMNSCLPLAQFFGQNMPNLVQQLNQNMGLSNGQFCLPFQNMNQHVIPGQMLNM-SQVP-HNSYGGPN-QAVPMAF--------QPFGV
        GQFN++P++NQMNMNSCLPLAQFFGQNMPNLVQQL QNMGL+NGQFCLPFQNMNQHVIPGQMLNM SQVP H SYG PN QAVPM F        QPFGV
Subjt:  GQFNVVPSLNQMNMNSCLPLAQFFGQNMPNLVQQLNQNMGLSNGQFCLPFQNMNQHVIPGQMLNM-SQVP-HNSYGGPN-QAVPMAF--------QPFGV

Query:  NQTMLPVNQNPQNLMPQAMGGAGSNQLSGSALPLQGNSTMPFNSSTQPQQARNLQSPAFVGSQGNTSINDGGNGPNSFSNNLAHRNFTRNSKQGFQKNQT
        NQ M PVNQNPQN +PQAMGGAGSNQL GSA PLQGNSTMPFNS TQPQQARNLQSPAFVGSQGN+SI+DGGNG NSFSNNLAHRNFTRNS +GFQK+Q 
Subjt:  NQTMLPVNQNPQNLMPQAMGGAGSNQLSGSALPLQGNSTMPFNSSTQPQQARNLQSPAFVGSQGNTSINDGGNGPNSFSNNLAHRNFTRNSKQGFQKNQT

Query:  HHMKNEKKKFGFPGGQKGKGFHNERRNTFGSASSTDQAKYQKRSLSLVYSEQEIRQWREARRKNYPSSTNIQKKLTEKQTDCTLVDKEAQLLRQELKEIL
        HHMKNEKKKFG PGG KGKGFHNERRN FG A+ST+  K QKRSLSLVY++QEI QWREARRKN+PSSTNIQKKLTEKQTDCTLVDKEAQLLRQELKEIL
Subjt:  HHMKNEKKKFGFPGGQKGKGFHNERRNTFGSASSTDQAKYQKRSLSLVYSEQEIRQWREARRKNYPSSTNIQKKLTEKQTDCTLVDKEAQLLRQELKEIL

Query:  AKQAELGVEVAEIPPEYLSYSEKRDNRKRRGDLS-VGEEAEGASAGKEKSRNRFNKRGRPEKKNRLRKKGKSEKHFSNTTPPNKREPTLLQKLLKADVKR
        AKQAELGVEVAEIP EYLSYSEK D+ KR GDLS +GEEAEGAS GKEK+RNRFNKR RPEKKNR RKK K +KH SN +P  KREPTL QKLL+ADVKR
Subjt:  AKQAELGVEVAEIPPEYLSYSEKRDNRKRRGDLS-VGEEAEGASAGKEKSRNRFNKRGRPEKKNRLRKKGKSEKHFSNTTPPNKREPTLLQKLLKADVKR

Query:  DKSQLLQALRFMVMNSFFKEWPSKPLKFPLVMVKENGGEINVVDEKSLSTRSSNLQGTKNLMVENNDNHDIDNDGENDDDDNDNNEKFKGDVIQVLEEEE
        DKS +LQALRFMVMNSFF EWP+KPL FP V+VKE+G EI VVDEKSLST S NLQ T + MVEN  +  I +D E+DDD++DNNEKF+GD I +LEEEE
Subjt:  DKSQLLQALRFMVMNSFFKEWPSKPLKFPLVMVKENGGEINVVDEKSLSTRSSNLQGTKNLMVENNDNHDIDNDGENDDDDNDNNEKFKGDVIQVLEEEE

Query:  GEIID
        GEIID
Subjt:  GEIID

A0A6J1JY42 uncharacterized protein LOC1114889347.9e-25881.16Show/hide
Query:  PPDSSSQQLPNSSLAISVNGLQNPIQIQPQNQAPFCNPNAHMNNLHGNAVPNMAPPMFQPGAMMNLQNPLMALHNNPLGASPFVPGHMGFANSAGNFQAQ
        PP +SSQQLPNSSLA S NGLQNPIQIQ QNQA FCNPNAH+NNLHGN VPNM PPMFQPG MMNLQNPLMAL NNPLGASPF PGHMGFANSA NF AQ
Subjt:  PPDSSSQQLPNSSLAISVNGLQNPIQIQPQNQAPFCNPNAHMNNLHGNAVPNMAPPMFQPGAMMNLQNPLMALHNNPLGASPFVPGHMGFANSAGNFQAQ

Query:  GQFNVVPSLNQMNMNSCLPLAQFFGQNMPNLVQQLNQNMGLSNGQFCLPFQNMNQHVIPGQMLNM-SQVP-HNSYGGPN-QAVPMAF--------QPFGV
        GQFN+VP++NQMNMNSCLP AQFFGQNMPNLVQQL QNMGLSNGQFCLPFQNMNQHVIPGQMLNM SQVP H SYG PN QAVPM F        QPFGV
Subjt:  GQFNVVPSLNQMNMNSCLPLAQFFGQNMPNLVQQLNQNMGLSNGQFCLPFQNMNQHVIPGQMLNM-SQVP-HNSYGGPN-QAVPMAF--------QPFGV

Query:  NQTMLPVNQNPQNLMPQAMGGAGSNQLSGSALPLQGNSTMPFNSSTQPQQARNLQSPAFVGSQGNTSINDGGNGPNSFSNNLAHRNFTRNSKQGFQKNQT
        NQTM PVNQNPQN  PQAMGGAGSNQL GSA PLQ NSTMPFNS TQPQQ RNLQSPAFVGSQGN+SI+DGGNG NSFSNNLAHRNFTRNS +GFQK+Q 
Subjt:  NQTMLPVNQNPQNLMPQAMGGAGSNQLSGSALPLQGNSTMPFNSSTQPQQARNLQSPAFVGSQGNTSINDGGNGPNSFSNNLAHRNFTRNSKQGFQKNQT

Query:  HHMKNEKKKFGFPGGQKGKGFHNERRNTFGSASSTDQAKYQKRSLSLVYSEQEIRQWREARRKNYPSSTNIQKKLTEKQTDCTLVDKEAQLLRQELKEIL
        HHMKNEKKKFG PGG KGKGFHNERRN FG  +ST+  K QKRSLSLVY++QEI QWREARRKN+PSSTNIQKKLTEKQTDCTLVDKEAQLLRQELKEIL
Subjt:  HHMKNEKKKFGFPGGQKGKGFHNERRNTFGSASSTDQAKYQKRSLSLVYSEQEIRQWREARRKNYPSSTNIQKKLTEKQTDCTLVDKEAQLLRQELKEIL

Query:  AKQAELGVEVAEIPPEYLSYSEKRDNRKRRGDLS-VGEEAEGASAGKEKSRNRFNKRGRPEKKNRLRKKGKSEKHFSNTTPPNKREPTLLQKLLKADVKR
        AKQAELGVEVAEIP EYLSYSEK D+ KR GDLS +GEEAEGAS GKEK+RNRFNKR RPEKKNR RKKGK +KH SN +P  KRE TL QKLL+ADVKR
Subjt:  AKQAELGVEVAEIPPEYLSYSEKRDNRKRRGDLS-VGEEAEGASAGKEKSRNRFNKRGRPEKKNRLRKKGKSEKHFSNTTPPNKREPTLLQKLLKADVKR

Query:  DKSQLLQALRFMVMNSFFKEWPSKPLKFPLVMVKENGGEINVVDEKSLSTRSSNLQGTKNLMVENNDNHDIDNDGENDDDDNDNNEKFKGDVIQVLEEEE
        DKS +LQALRFMVMNSFF EWP+KPL FP V+VKE+G EI VVDEKSLST S NLQ T N MVEN     I +D E+DDD++DN+EKF+GD I +LEEEE
Subjt:  DKSQLLQALRFMVMNSFFKEWPSKPLKFPLVMVKENGGEINVVDEKSLSTRSSNLQGTKNLMVENNDNHDIDNDGENDDDDNDNNEKFKGDVIQVLEEEE

Query:  GEIID
        GEIID
Subjt:  GEIID

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT5G18440.1 CONTAINS InterPro DOMAIN/s: Nuclear fragile X mental retardation-interacting protein 1, conserved region (InterPro:IPR019496); Has 1333 Blast hits to 1211 proteins in 205 species: Archae - 0; Bacteria - 137; Metazoa - 339; Fungi - 162; Plants - 70; Viruses - 6; Other Eukaryotes - 619 (source: NCBI BLink).6.6e-4735.59Show/hide
Query:  VQQLNQNMGLSNGQ------FCLPFQNMNQHVIPGQMLNMSQVPHNSYGGPNQAVPMAF---QPFGVNQTMLP--VNQNPQNLMPQAMGGAGSNQLSGSA
        +QQ  Q  G SN Q      +  P       ++  QM+N   + HN    PN  +   F    P  + Q  +P  +NQ   NL+        ++ L G +
Subjt:  VQQLNQNMGLSNGQ------FCLPFQNMNQHVIPGQMLNMSQVPHNSYGGPNQAVPMAF---QPFGVNQTMLP--VNQNPQNLMPQAMGGAGSNQLSGSA

Query:  LP-----------------LQGNSTMPFNSSTQPQQARNLQSPAF--VGSQGNTSINDGGNGP--NSFSNNL-AHRNFTRNSKQGFQKNQTHHMKNEKKK
        LP                 L   +++P+     P Q      P F     QG +  N  G+GP  N F N    H+NF +   QGFQ+ Q H   N K+K
Subjt:  LP-----------------LQGNSTMPFNSSTQPQQARNLQSPAF--VGSQGNTSINDGGNGP--NSFSNNL-AHRNFTRNSKQGFQKNQTHHMKNEKKK

Query:  FGFPGGQKGKGFHNERRNTFGSASSTDQAKYQKRSLSLVYSEQEIRQWREARRKNYPSSTNIQKKLTEKQTDCTLVDKEAQLLRQELKEILAKQAELGVE
         GF    +GKG +N+ +     + + + AK +KRS +L+Y+ +E++QWREARRKNYP+   ++KK+ +K    +++D+EA++ RQ+L+E+LAKQAELGVE
Subjt:  FGFPGGQKGKGFHNERRNTFGSASSTDQAKYQKRSLSLVYSEQEIRQWREARRKNYPSSTNIQKKLTEKQTDCTLVDKEAQLLRQELKEILAKQAELGVE

Query:  VAEIPPEYLSYSEKRDNRKRRGDLSVGEEAEGASAGKEKSRNRFNKRGR-PEKKNRLRKKGKSEKHFSNTTPPNKREPTLLQKLLKADVKRDKSQLLQAL
        VAE+P  YLS ++++ N    GD       +G      +++ R +++ +   KK RL  K KS +  S TT    R+PTLL+KLL AD+KRDKSQLLQ  
Subjt:  VAEIPPEYLSYSEKRDNRKRRGDLSVGEEAEGASAGKEKSRNRFNKRGR-PEKKNRLRKKGKSEKHFSNTTPPNKREPTLLQKLLKADVKRDKSQLLQAL

Query:  RFMVMNSFFKEWPSKPLKFPLVMVKENGGEINVVDEKSLSTRSSNLQGTKNLMVENNDNHDIDNDGENDDDD
        RFMVMNS  KE+P +PLK PL+ VKE G E + +++ S+      L          +D  D+D D  + D+D
Subjt:  RFMVMNSFFKEWPSKPLKFPLVMVKENGGEINVVDEKSLSTRSSNLQGTKNLMVENNDNHDIDNDGENDDDD

AT5G18440.2 FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; EXPRESSED IN: 22 plant structures; EXPRESSED DURING: 13 growth stages; CONTAINS InterPro DOMAIN/s: Nuclear fragile X mental retardation-interacting protein 1, conserved region (InterPro:IPR019496); Has 1333 Blast hits to 1211 proteins in 205 species: Archae - 0; Bacteria - 137; Metazoa - 339; Fungi - 162; Plants - 70; Viruses - 6; Other Eukaryotes - 619 (source: NCBI BLink).6.6e-4735.59Show/hide
Query:  VQQLNQNMGLSNGQ------FCLPFQNMNQHVIPGQMLNMSQVPHNSYGGPNQAVPMAF---QPFGVNQTMLP--VNQNPQNLMPQAMGGAGSNQLSGSA
        +QQ  Q  G SN Q      +  P       ++  QM+N   + HN    PN  +   F    P  + Q  +P  +NQ   NL+        ++ L G +
Subjt:  VQQLNQNMGLSNGQ------FCLPFQNMNQHVIPGQMLNMSQVPHNSYGGPNQAVPMAF---QPFGVNQTMLP--VNQNPQNLMPQAMGGAGSNQLSGSA

Query:  LP-----------------LQGNSTMPFNSSTQPQQARNLQSPAF--VGSQGNTSINDGGNGP--NSFSNNL-AHRNFTRNSKQGFQKNQTHHMKNEKKK
        LP                 L   +++P+     P Q      P F     QG +  N  G+GP  N F N    H+NF +   QGFQ+ Q H   N K+K
Subjt:  LP-----------------LQGNSTMPFNSSTQPQQARNLQSPAF--VGSQGNTSINDGGNGP--NSFSNNL-AHRNFTRNSKQGFQKNQTHHMKNEKKK

Query:  FGFPGGQKGKGFHNERRNTFGSASSTDQAKYQKRSLSLVYSEQEIRQWREARRKNYPSSTNIQKKLTEKQTDCTLVDKEAQLLRQELKEILAKQAELGVE
         GF    +GKG +N+ +     + + + AK +KRS +L+Y+ +E++QWREARRKNYP+   ++KK+ +K    +++D+EA++ RQ+L+E+LAKQAELGVE
Subjt:  FGFPGGQKGKGFHNERRNTFGSASSTDQAKYQKRSLSLVYSEQEIRQWREARRKNYPSSTNIQKKLTEKQTDCTLVDKEAQLLRQELKEILAKQAELGVE

Query:  VAEIPPEYLSYSEKRDNRKRRGDLSVGEEAEGASAGKEKSRNRFNKRGR-PEKKNRLRKKGKSEKHFSNTTPPNKREPTLLQKLLKADVKRDKSQLLQAL
        VAE+P  YLS ++++ N    GD       +G      +++ R +++ +   KK RL  K KS +  S TT    R+PTLL+KLL AD+KRDKSQLLQ  
Subjt:  VAEIPPEYLSYSEKRDNRKRRGDLSVGEEAEGASAGKEKSRNRFNKRGR-PEKKNRLRKKGKSEKHFSNTTPPNKREPTLLQKLLKADVKRDKSQLLQAL

Query:  RFMVMNSFFKEWPSKPLKFPLVMVKENGGEINVVDEKSLSTRSSNLQGTKNLMVENNDNHDIDNDGENDDDD
        RFMVMNS  KE+P +PLK PL+ VKE G E + +++ S+      L          +D  D+D D  + D+D
Subjt:  RFMVMNSFFKEWPSKPLKFPLVMVKENGGEINVVDEKSLSTRSSNLQGTKNLMVENNDNHDIDNDGENDDDD


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCTTCGTCCTCCTCCTCCTGATTCTTCTTCACAACAGTTACCCAATTCATCTCTCGCCATCTCCGTCAATGGGCTTCAAAATCCCATTCAAATTCAGCCCCAAAACCA
GGCCCCTTTCTGCAATCCCAACGCCCACATGAACAATCTCCATGGAAACGCTGTCCCCAACATGGCGCCCCCCATGTTTCAGCCGGGTGCGATGATGAATTTGCAGAACC
CTCTCATGGCGTTGCATAATAATCCTCTTGGTGCTTCCCCGTTTGTTCCGGGCCATATGGGTTTTGCAAATTCTGCTGGTAATTTTCAGGCTCAGGGGCAGTTCAATGTG
GTGCCGAGTTTGAATCAGATGAATATGAACTCGTGTTTGCCTCTAGCGCAGTTTTTTGGGCAGAACATGCCGAATTTGGTTCAGCAATTGAATCAGAATATGGGTTTGTC
TAATGGGCAGTTTTGCTTGCCGTTTCAAAATATGAATCAGCATGTGATTCCTGGACAGATGCTGAATATGTCGCAAGTTCCTCATAATTCATATGGTGGTCCAAATCAAG
CTGTTCCAATGGCTTTTCAGCCTTTTGGTGTCAATCAGACAATGCTTCCTGTCAACCAGAATCCCCAAAACCTCATGCCACAAGCAATGGGTGGTGCTGGATCAAATCAA
TTGTCGGGTTCGGCTCTACCATTGCAGGGGAATTCAACCATGCCGTTTAACTCTTCGACTCAACCACAACAAGCTAGGAACCTGCAGTCACCTGCTTTCGTTGGGTCACA
GGGGAATACTTCAATAAATGATGGTGGAAATGGACCAAATTCATTCTCGAATAATTTAGCTCACAGGAACTTCACAAGAAACTCAAAGCAAGGATTTCAGAAGAATCAAA
CTCATCATATGAAAAATGAGAAGAAAAAGTTTGGGTTTCCTGGCGGACAGAAAGGAAAAGGTTTTCACAATGAGAGGAGGAACACATTTGGTAGCGCCAGCTCCACGGAT
CAAGCGAAATACCAGAAGAGATCTCTCTCTCTGGTCTATTCGGAGCAAGAAATCCGGCAATGGCGTGAAGCACGCCGGAAGAATTACCCATCATCAACCAACATACAGAA
GAAACTTACTGAAAAGCAAACTGACTGCACATTGGTCGATAAGGAGGCTCAGCTTTTGCGACAAGAACTGAAAGAGATTTTAGCAAAGCAGGCTGAATTAGGAGTCGAAG
TAGCAGAAATCCCACCCGAGTATCTCTCATATTCAGAGAAACGCGACAATCGAAAACGACGTGGAGATCTATCAGTAGGAGAGGAAGCCGAAGGAGCCTCAGCAGGGAAA
GAAAAATCTCGAAACAGGTTCAACAAGAGGGGGAGACCCGAGAAGAAGAATCGTTTGAGAAAGAAGGGCAAATCTGAGAAGCATTTTTCGAACACGACGCCACCAAACAA
GAGAGAGCCAACGTTACTGCAGAAGCTCTTGAAGGCAGATGTGAAGAGAGACAAAAGCCAGTTGTTACAAGCTTTGAGATTCATGGTGATGAATTCTTTCTTCAAAGAAT
GGCCCAGTAAACCCTTGAAGTTTCCTTTAGTCATGGTGAAGGAGAATGGTGGGGAGATCAATGTGGTTGATGAGAAATCTCTGTCTACTAGGAGTTCCAATCTCCAAGGG
ACCAAAAATTTAATGGTTGAGAACAATGATAATCATGACATTGACAACGATGGCGAAAATGATGACGATGACAACGACAACAACGAGAAGTTCAAAGGAGATGTAATACA
GGTACTCGAAGAGGAAGAAGGAGAAATTATTGATTAA
mRNA sequenceShow/hide mRNA sequence
ATGCTTCGTCCTCCTCCTCCTGATTCTTCTTCACAACAGTTACCCAATTCATCTCTCGCCATCTCCGTCAATGGGCTTCAAAATCCCATTCAAATTCAGCCCCAAAACCA
GGCCCCTTTCTGCAATCCCAACGCCCACATGAACAATCTCCATGGAAACGCTGTCCCCAACATGGCGCCCCCCATGTTTCAGCCGGGTGCGATGATGAATTTGCAGAACC
CTCTCATGGCGTTGCATAATAATCCTCTTGGTGCTTCCCCGTTTGTTCCGGGCCATATGGGTTTTGCAAATTCTGCTGGTAATTTTCAGGCTCAGGGGCAGTTCAATGTG
GTGCCGAGTTTGAATCAGATGAATATGAACTCGTGTTTGCCTCTAGCGCAGTTTTTTGGGCAGAACATGCCGAATTTGGTTCAGCAATTGAATCAGAATATGGGTTTGTC
TAATGGGCAGTTTTGCTTGCCGTTTCAAAATATGAATCAGCATGTGATTCCTGGACAGATGCTGAATATGTCGCAAGTTCCTCATAATTCATATGGTGGTCCAAATCAAG
CTGTTCCAATGGCTTTTCAGCCTTTTGGTGTCAATCAGACAATGCTTCCTGTCAACCAGAATCCCCAAAACCTCATGCCACAAGCAATGGGTGGTGCTGGATCAAATCAA
TTGTCGGGTTCGGCTCTACCATTGCAGGGGAATTCAACCATGCCGTTTAACTCTTCGACTCAACCACAACAAGCTAGGAACCTGCAGTCACCTGCTTTCGTTGGGTCACA
GGGGAATACTTCAATAAATGATGGTGGAAATGGACCAAATTCATTCTCGAATAATTTAGCTCACAGGAACTTCACAAGAAACTCAAAGCAAGGATTTCAGAAGAATCAAA
CTCATCATATGAAAAATGAGAAGAAAAAGTTTGGGTTTCCTGGCGGACAGAAAGGAAAAGGTTTTCACAATGAGAGGAGGAACACATTTGGTAGCGCCAGCTCCACGGAT
CAAGCGAAATACCAGAAGAGATCTCTCTCTCTGGTCTATTCGGAGCAAGAAATCCGGCAATGGCGTGAAGCACGCCGGAAGAATTACCCATCATCAACCAACATACAGAA
GAAACTTACTGAAAAGCAAACTGACTGCACATTGGTCGATAAGGAGGCTCAGCTTTTGCGACAAGAACTGAAAGAGATTTTAGCAAAGCAGGCTGAATTAGGAGTCGAAG
TAGCAGAAATCCCACCCGAGTATCTCTCATATTCAGAGAAACGCGACAATCGAAAACGACGTGGAGATCTATCAGTAGGAGAGGAAGCCGAAGGAGCCTCAGCAGGGAAA
GAAAAATCTCGAAACAGGTTCAACAAGAGGGGGAGACCCGAGAAGAAGAATCGTTTGAGAAAGAAGGGCAAATCTGAGAAGCATTTTTCGAACACGACGCCACCAAACAA
GAGAGAGCCAACGTTACTGCAGAAGCTCTTGAAGGCAGATGTGAAGAGAGACAAAAGCCAGTTGTTACAAGCTTTGAGATTCATGGTGATGAATTCTTTCTTCAAAGAAT
GGCCCAGTAAACCCTTGAAGTTTCCTTTAGTCATGGTGAAGGAGAATGGTGGGGAGATCAATGTGGTTGATGAGAAATCTCTGTCTACTAGGAGTTCCAATCTCCAAGGG
ACCAAAAATTTAATGGTTGAGAACAATGATAATCATGACATTGACAACGATGGCGAAAATGATGACGATGACAACGACAACAACGAGAAGTTCAAAGGAGATGTAATACA
GGTACTCGAAGAGGAAGAAGGAGAAATTATTGATTAA
Protein sequenceShow/hide protein sequence
MLRPPPPDSSSQQLPNSSLAISVNGLQNPIQIQPQNQAPFCNPNAHMNNLHGNAVPNMAPPMFQPGAMMNLQNPLMALHNNPLGASPFVPGHMGFANSAGNFQAQGQFNV
VPSLNQMNMNSCLPLAQFFGQNMPNLVQQLNQNMGLSNGQFCLPFQNMNQHVIPGQMLNMSQVPHNSYGGPNQAVPMAFQPFGVNQTMLPVNQNPQNLMPQAMGGAGSNQ
LSGSALPLQGNSTMPFNSSTQPQQARNLQSPAFVGSQGNTSINDGGNGPNSFSNNLAHRNFTRNSKQGFQKNQTHHMKNEKKKFGFPGGQKGKGFHNERRNTFGSASSTD
QAKYQKRSLSLVYSEQEIRQWREARRKNYPSSTNIQKKLTEKQTDCTLVDKEAQLLRQELKEILAKQAELGVEVAEIPPEYLSYSEKRDNRKRRGDLSVGEEAEGASAGK
EKSRNRFNKRGRPEKKNRLRKKGKSEKHFSNTTPPNKREPTLLQKLLKADVKRDKSQLLQALRFMVMNSFFKEWPSKPLKFPLVMVKENGGEINVVDEKSLSTRSSNLQG
TKNLMVENNDNHDIDNDGENDDDDNDNNEKFKGDVIQVLEEEEGEIID