; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0016946 (gene) of Snake gourd v1 genome

Gene IDTan0016946
OrganismTrichosanthes anguina (Snake gourd v1)
Descriptionnucleolar complex protein 4 homolog
Genome locationLG02:5009018..5018568
RNA-Seq ExpressionTan0016946
SyntenyTan0016946
Gene Ontology termsGO:0006364 - rRNA processing (biological process)
GO:0009793 - embryo development ending in seed dormancy (biological process)
GO:0005654 - nucleoplasm (cellular component)
GO:0005730 - nucleolus (cellular component)
GO:0016021 - integral component of membrane (cellular component)
GO:0030692 - Noc4p-Nop14p complex (cellular component)
GO:0032040 - small-subunit processome (cellular component)
InterPro domainsIPR005612 - CCAAT-binding factor
IPR027193 - Nucleolar complex protein 4


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7011815.1 Nucleolar complex protein 4-like B, partial [Cucurbita argyrosperma subsp. argyrosperma]1.5e-29087.21Show/hide
Query:  MASILSKNQNSKKKRKKNNENHTLSDLKTLGLQLLSSRAHINNLPLLLTYVSPSSPPHYVLESLLSLQSFFITVLPSLPSSSKPTATASDDPQDDAELIY
        MAS+ S NQ S+KK+KKN +NH LSDLKTLGLQLLSSRAHINNLPLLLT+VSPS PP YVLE+LLSLQSFFITVLPSLPSSSKP A    D QDDAELIY
Subjt:  MASILSKNQNSKKKRKKNNENHTLSDLKTLGLQLLSSRAHINNLPLLLTYVSPSSPPHYVLESLLSLQSFFITVLPSLPSSSKPTATASDDPQDDAELIY

Query:  RTWLRSKFDELVKSLIDVAVSSECDDTLKEIVLDAIMEFVKVGNKGKFHSAVYHRFLQSITNSSTPVNTLIALLVNKYFSHVDVRYFTYISIDKLAKTFE
        RTWLRSKFDELVKSLIDVAVSSECDDTLKEIVLDAIMEFVKVGN+GKFHSAVYHRFLQSI +SSTPVNTLIALLV KYF+H+DVRYFTYISI+KL +TFE
Subjt:  RTWLRSKFDELVKSLIDVAVSSECDDTLKEIVLDAIMEFVKVGNKGKFHSAVYHRFLQSITNSSTPVNTLIALLVNKYFSHVDVRYFTYISIDKLAKTFE

Query:  AEYISGDRSVRINDNDGGHSREGVEFIHIVHSILSSIPPLENSNQSDYTMWIESGDGKGLSDNQEAKQLKMR--EKEVLSASKIVRRMKLKFTKAWISFL
        AEY+SGDRSVRI+ +DG HSREGVEFIHIVHSI+SSIPPLENSNQSDYTMW+ESGD K LSDNQEAKQLKMR  ++EVL+ASKIVR+MK KFTKAWISFL
Subjt:  AEYISGDRSVRINDNDGGHSREGVEFIHIVHSILSSIPPLENSNQSDYTMWIESGDGKGLSDNQEAKQLKMR--EKEVLSASKIVRRMKLKFTKAWISFL

Query:  RLPLPVDVYKEVLVILDQEVIPYLSNPIILCDFLTKSYNIGGVVSVMALSSLFLLMTKYGLEYPNFYEKLYALLVPSIFMAKHRAKFFQLLDSCLKSPLL
        RLPLP+DVYKEVLVILDQEVIPYLSNPIILCDFLTKSY++GGVVSVMALSSLFLLMTKYGLEYPNFYEKLYALLVPSIFMAKHRAKFFQLLDSCLKSPLL
Subjt:  RLPLPVDVYKEVLVILDQEVIPYLSNPIILCDFLTKSYNIGGVVSVMALSSLFLLMTKYGLEYPNFYEKLYALLVPSIFMAKHRAKFFQLLDSCLKSPLL

Query:  PAYLAAAFAKKLSRLSLVVPPSGSLVIVALIHNLLRRHPSINCLVHRENVGESKNDDSKDEWVARGADVSEVDAD--NMKLGIDHFNYEETDPIKSSALR
        PAYLAAAFAKKLSRLSLVVPPSG+L+I+ALIHNLLRRHPSINCLVHRENV ESKNDDS  + VA+G D SEV+AD  NMK GID FNYEETDPIKSSALR
Subjt:  PAYLAAAFAKKLSRLSLVVPPSGSLVIVALIHNLLRRHPSINCLVHRENVGESKNDDSKDEWVARGADVSEVDAD--NMKLGIDHFNYEETDPIKSSALR

Query:  SSLWEIDSLRHHYCPPVSRLVLSLENDLTVRSKTTEIDVKDFVSGSYATILGQELKKKLKRVPLAFYQAIPTTLFSDSDFAGWSFDRENSEKNTDSSNHL
        SSLWEID LRHHYCPPVSRLVLSLENDLTVRSKTTEIDVKDFV+GSYATILGQELKKK+KRVPLAFYQAIPT+LFS+SDF GWSF+ E+SEKN D  +HL
Subjt:  SSLWEIDSLRHHYCPPVSRLVLSLENDLTVRSKTTEIDVKDFVSGSYATILGQELKKKLKRVPLAFYQAIPTTLFSDSDFAGWSFDRENSEKNTDSSNHL

Query:  STKRQRVESS
          KRQRVESS
Subjt:  STKRQRVESS

XP_022952163.1 nucleolar complex protein 4 homolog [Cucurbita moschata]6.2e-28986.72Show/hide
Query:  MASILSKNQNSKKKRKKNNENHTLSDLKTLGLQLLSSRAHINNLPLLLTYVSPSSPPHYVLESLLSLQSFFITVLPSLPSSSKPTATASDDPQDDAELIY
        MAS+ S NQ S+KK+KKN +NH LSDLKTLGLQLLSSRAHINNLPLLLT++SPS PP YVLE+LLSLQSFFITVLPSLPSSSKP A    D QDDAELIY
Subjt:  MASILSKNQNSKKKRKKNNENHTLSDLKTLGLQLLSSRAHINNLPLLLTYVSPSSPPHYVLESLLSLQSFFITVLPSLPSSSKPTATASDDPQDDAELIY

Query:  RTWLRSKFDELVKSLIDVAVSSECDDTLKEIVLDAIMEFVKVGNKGKFHSAVYHRFLQSITNSSTPVNTLIALLVNKYFSHVDVRYFTYISIDKLAKTFE
        RTWLRSKFDELVKSLIDVAVSSECDDTLKEIVLDAIMEFVKVGN+GKFHSA+YHRFLQSI +SSTPVNTLIALLV KYF+H+DVRYFTYISI+KL +TFE
Subjt:  RTWLRSKFDELVKSLIDVAVSSECDDTLKEIVLDAIMEFVKVGNKGKFHSAVYHRFLQSITNSSTPVNTLIALLVNKYFSHVDVRYFTYISIDKLAKTFE

Query:  AEYISGDRSVRINDNDGGHSREGVEFIHIVHSILSSIPPLENSNQSDYTMWIESGDGKGLSDNQEAKQLKMR--EKEVLSASKIVRRMKLKFTKAWISFL
        AEY+ GDRSVRI+ +DG HSR+GVEFIHIVHSI+SSIPPLENSNQSDYTMW+ESGD K LSDNQEAKQLKMR  ++EVL+ASKIVR+MK KFTKAWISFL
Subjt:  AEYISGDRSVRINDNDGGHSREGVEFIHIVHSILSSIPPLENSNQSDYTMWIESGDGKGLSDNQEAKQLKMR--EKEVLSASKIVRRMKLKFTKAWISFL

Query:  RLPLPVDVYKEVLVILDQEVIPYLSNPIILCDFLTKSYNIGGVVSVMALSSLFLLMTKYGLEYPNFYEKLYALLVPSIFMAKHRAKFFQLLDSCLKSPLL
        RLPLP+DVYKEVLVILDQEVIPYLSNPIILCDFLTKSY++GGVVSVMALSSLFLLMTKYGLEYPNFYEKLYALLVPSIFMAKHRAKFFQLLDSCLKSPLL
Subjt:  RLPLPVDVYKEVLVILDQEVIPYLSNPIILCDFLTKSYNIGGVVSVMALSSLFLLMTKYGLEYPNFYEKLYALLVPSIFMAKHRAKFFQLLDSCLKSPLL

Query:  PAYLAAAFAKKLSRLSLVVPPSGSLVIVALIHNLLRRHPSINCLVHRENVGESKNDDSKDEWVARGADVSEVDAD--NMKLGIDHFNYEETDPIKSSALR
        PAYLAAAFAKKLSRLSLVVPPSG+L+I+ALIHNLLRRHPSINCLVHRENV ESKNDDS  + VA+G D SEV+AD  NMK GID FNYEETDPIKSSALR
Subjt:  PAYLAAAFAKKLSRLSLVVPPSGSLVIVALIHNLLRRHPSINCLVHRENVGESKNDDSKDEWVARGADVSEVDAD--NMKLGIDHFNYEETDPIKSSALR

Query:  SSLWEIDSLRHHYCPPVSRLVLSLENDLTVRSKTTEIDVKDFVSGSYATILGQELKKKLKRVPLAFYQAIPTTLFSDSDFAGWSFDRENSEKNTDSSNHL
        SSLWEID LRHHYCPPVSRLVLSLENDLTVRSKTTEIDVKDFV+GSYATILGQELKKK+KRVPLAFYQAIPT+LFS SDF GWSF+ E+SEKN D S+HL
Subjt:  SSLWEIDSLRHHYCPPVSRLVLSLENDLTVRSKTTEIDVKDFVSGSYATILGQELKKKLKRVPLAFYQAIPTTLFSDSDFAGWSFDRENSEKNTDSSNHL

Query:  STKRQRVESS
          KRQRVESS
Subjt:  STKRQRVESS

XP_022969033.1 nucleolar complex protein 4 homolog B [Cucurbita maxima]6.7e-29187.38Show/hide
Query:  MASILSKNQNSKKKRKKNNENHTLSDLKTLGLQLLSSRAHINNLPLLLTYVSPSSPPHYVLESLLSLQSFFITVLPSLPSSSKPTATASDDPQDDAELIY
        MAS+ S NQN KKK KKN +NH LSDLKTLGLQLLSS+AHINNLPLLLT+VSPS PP YVLE+LLSLQSFFITVLPSLPSSSKP A    D QDDAELIY
Subjt:  MASILSKNQNSKKKRKKNNENHTLSDLKTLGLQLLSSRAHINNLPLLLTYVSPSSPPHYVLESLLSLQSFFITVLPSLPSSSKPTATASDDPQDDAELIY

Query:  RTWLRSKFDELVKSLIDVAVSSECDDTLKEIVLDAIMEFVKVGNKGKFHSAVYHRFLQSITNSSTPVNTLIALLVNKYFSHVDVRYFTYISIDKLAKTFE
        RTWLRSKFDELVKSLIDVAVSSECDDTLKEIVLDAIMEFVKVGN+GKFHSAVYHRFLQSI +SS PVNTLIALLV KYF+H+DVRYFTYISI+KL +TFE
Subjt:  RTWLRSKFDELVKSLIDVAVSSECDDTLKEIVLDAIMEFVKVGNKGKFHSAVYHRFLQSITNSSTPVNTLIALLVNKYFSHVDVRYFTYISIDKLAKTFE

Query:  AEYISGDRSVRINDNDGGHSREGVEFIHIVHSILSSIPPLENSNQSDYTMWIESGDGKGLSDNQEAKQLKMR--EKEVLSASKIVRRMKLKFTKAWISFL
        AEY+SGDRSVRIN +DG HSREGVEFIHIVHSI+SSIPPLENSNQSDYTMW+ESGD K LSDNQEAKQLKMR  ++EVLSAS+IVR+MK KFTKAWISFL
Subjt:  AEYISGDRSVRINDNDGGHSREGVEFIHIVHSILSSIPPLENSNQSDYTMWIESGDGKGLSDNQEAKQLKMR--EKEVLSASKIVRRMKLKFTKAWISFL

Query:  RLPLPVDVYKEVLVILDQEVIPYLSNPIILCDFLTKSYNIGGVVSVMALSSLFLLMTKYGLEYPNFYEKLYALLVPSIFMAKHRAKFFQLLDSCLKSPLL
        RLPLP+DVYKEVLVILDQEVIPYLSNPIILCDFLTKSY++GGVVSVMALSSLFLLMTKYGLEYPNFYEKLYALLVPSIFMAKHRAKFFQLLDSCLKSPLL
Subjt:  RLPLPVDVYKEVLVILDQEVIPYLSNPIILCDFLTKSYNIGGVVSVMALSSLFLLMTKYGLEYPNFYEKLYALLVPSIFMAKHRAKFFQLLDSCLKSPLL

Query:  PAYLAAAFAKKLSRLSLVVPPSGSLVIVALIHNLLRRHPSINCLVHRENVGESKNDDSKDEWVARGADVSEVDAD--NMKLGIDHFNYEETDPIKSSALR
        PAYLAAAFAKKLSRLSLVVPPSG+L+I+ALIHNLLRRHPSINCLVHRENV ESKNDDS  + VA+G D SEV+AD  NMK GID FNYEETDPIKSSALR
Subjt:  PAYLAAAFAKKLSRLSLVVPPSGSLVIVALIHNLLRRHPSINCLVHRENVGESKNDDSKDEWVARGADVSEVDAD--NMKLGIDHFNYEETDPIKSSALR

Query:  SSLWEIDSLRHHYCPPVSRLVLSLENDLTVRSKTTEIDVKDFVSGSYATILGQELKKKLKRVPLAFYQAIPTTLFSDSDFAGWSFDRENSEKNTDSSNHL
        SSLWEID LRHHYCPPVSRLVLSLENDLTVRSKTTEIDVKDFV+GSYATILGQELKKK+KRVPLAFYQAIPT+LFS+SDF GWSF+ E+SEKN D S+HL
Subjt:  SSLWEIDSLRHHYCPPVSRLVLSLENDLTVRSKTTEIDVKDFVSGSYATILGQELKKKLKRVPLAFYQAIPTTLFSDSDFAGWSFDRENSEKNTDSSNHL

Query:  STKRQRVESS
          KRQRVESS
Subjt:  STKRQRVESS

XP_023554434.1 nucleolar complex protein 4 homolog B [Cucurbita pepo subsp. pepo]6.0e-29287.21Show/hide
Query:  MASILSKNQNSKKKRKKNNENHTLSDLKTLGLQLLSSRAHINNLPLLLTYVSPSSPPHYVLESLLSLQSFFITVLPSLPSSSKPTATASDDPQDDAELIY
        MAS+ S NQ+ KKK+KKN +NH LSDLKTLGLQLLSSRAHINNLPLLLT+VSPS PP YVLE+LLSLQSFFITVLPSLPSSSKP A    D QDDAELIY
Subjt:  MASILSKNQNSKKKRKKNNENHTLSDLKTLGLQLLSSRAHINNLPLLLTYVSPSSPPHYVLESLLSLQSFFITVLPSLPSSSKPTATASDDPQDDAELIY

Query:  RTWLRSKFDELVKSLIDVAVSSECDDTLKEIVLDAIMEFVKVGNKGKFHSAVYHRFLQSITNSSTPVNTLIALLVNKYFSHVDVRYFTYISIDKLAKTFE
        RTWLRSKFDELVKSLIDVAVSSECDDTLKEIVLDAIMEFVKVGNKGKFHSAVYHRFLQSI +SSTPVNTLIALLV KYF+H+DVRYFTYISI+KL +TFE
Subjt:  RTWLRSKFDELVKSLIDVAVSSECDDTLKEIVLDAIMEFVKVGNKGKFHSAVYHRFLQSITNSSTPVNTLIALLVNKYFSHVDVRYFTYISIDKLAKTFE

Query:  AEYISGDRSVRINDNDGGHSREGVEFIHIVHSILSSIPPLENSNQSDYTMWIESGDGKGLSDNQEAKQLKMR--EKEVLSASKIVRRMKLKFTKAWISFL
        AEY+SGDRSVRIN +DG HSREGVEFIHIVHSI+SSIPPLENSNQSDYTMW+ESGD K LSDNQEAKQLKMR  ++EVL+ASKIVR+MK KFTKAWISFL
Subjt:  AEYISGDRSVRINDNDGGHSREGVEFIHIVHSILSSIPPLENSNQSDYTMWIESGDGKGLSDNQEAKQLKMR--EKEVLSASKIVRRMKLKFTKAWISFL

Query:  RLPLPVDVYKEVLVILDQEVIPYLSNPIILCDFLTKSYNIGGVVSVMALSSLFLLMTKYGLEYPNFYEKLYALLVPSIFMAKHRAKFFQLLDSCLKSPLL
        RLPLP+DVYKEVLVILDQEVIPYLSNPIILCDFLTKSY++GGVVSVMALSSLFLLMTKYGLEYPNFYEKLYALLVPSIFMAKHRAKFFQLLDSCLKSPLL
Subjt:  RLPLPVDVYKEVLVILDQEVIPYLSNPIILCDFLTKSYNIGGVVSVMALSSLFLLMTKYGLEYPNFYEKLYALLVPSIFMAKHRAKFFQLLDSCLKSPLL

Query:  PAYLAAAFAKKLSRLSLVVPPSGSLVIVALIHNLLRRHPSINCLVHRENVGESKNDDSKDEWVARGADVSEVDAD--NMKLGIDHFNYEETDPIKSSALR
        PAYLAAAFAKKLSRLSLV+PPSGSL+I+ALIHNLLRRHPSINCLVHRENV ESKNDDS  + VA+G D SEV+AD  NMK GID FNYEETDPIKSSALR
Subjt:  PAYLAAAFAKKLSRLSLVVPPSGSLVIVALIHNLLRRHPSINCLVHRENVGESKNDDSKDEWVARGADVSEVDAD--NMKLGIDHFNYEETDPIKSSALR

Query:  SSLWEIDSLRHHYCPPVSRLVLSLENDLTVRSKTTEIDVKDFVSGSYATILGQELKKKLKRVPLAFYQAIPTTLFSDSDFAGWSFDRENSEKNTDSSNHL
        SSLWEID LRHHYCPPVSRLVLSLENDLTVRSKTTEIDVKDFV+GSY TILGQELKKK+KRVPLAFYQAIPT+LFS+ DF GWSF+ E+SEKN + S+HL
Subjt:  SSLWEIDSLRHHYCPPVSRLVLSLENDLTVRSKTTEIDVKDFVSGSYATILGQELKKKLKRVPLAFYQAIPTTLFSDSDFAGWSFDRENSEKNTDSSNHL

Query:  STKRQRVESS
          KRQRVESS
Subjt:  STKRQRVESS

XP_038887732.1 protein NUCLEOLAR COMPLEX ASSOCIATED 4 isoform X1 [Benincasa hispida]1.8e-28887.07Show/hide
Query:  MASILSKNQNSKKKRKKNNENHTLSDLKTLGLQLLSSRAHINNLPLLLTYVSPSSPPHYVLESLLSLQSFFITVLPSLPSSSKPTATASDDPQDDAELIY
        MASI S N N KKK+K    +H LSDLKTLGLQLLSSRAHINNLPLLLTYVSPSSPP YVLE+LLSLQSFFIT LPSLPSSS   A A DD Q DAE IY
Subjt:  MASILSKNQNSKKKRKKNNENHTLSDLKTLGLQLLSSRAHINNLPLLLTYVSPSSPPHYVLESLLSLQSFFITVLPSLPSSSKPTATASDDPQDDAELIY

Query:  RTWLRSKFDELVKSLIDVAVSSECDDTLKEIVLDAIMEFVKVGNKGKFHSAVYHRFLQSITNSSTPVNTLIALLVNKYFSHVDVRYFTYISIDKLAKTFE
        RTWLRSKFDELVKSLIDVAVSSECDDTLKEIVLDAIMEFVKVGNKGKFHSAVYHRFLQSI +S TPV+TLIALLV KYF+H+DVRYFTYISI +LAKTF+
Subjt:  RTWLRSKFDELVKSLIDVAVSSECDDTLKEIVLDAIMEFVKVGNKGKFHSAVYHRFLQSITNSSTPVNTLIALLVNKYFSHVDVRYFTYISIDKLAKTFE

Query:  AEYISGDRSVRINDNDGGHSREGVEFIHIVHSILSSIPPLENSNQSDYTMWIESG-DGKGLSDNQEAKQLKMR--EKEVLSASKIVRRMKLKFTKAWISF
        AEY+SGDR+VRIN +DGGHSREGVEFIHIVHSILSSIPPLENSN+SDYTMW+ESG D   LSDNQEAKQLKM+  ++EVLSASKIVRRMKLKF+KAWISF
Subjt:  AEYISGDRSVRINDNDGGHSREGVEFIHIVHSILSSIPPLENSNQSDYTMWIESG-DGKGLSDNQEAKQLKMR--EKEVLSASKIVRRMKLKFTKAWISF

Query:  LRLPLPVDVYKEVLVILDQEVIPYLSNPIILCDFLTKSYNIGGVVSVMALSSLFLLMTKYGLEYPNFYEKLYALLVPSIFMAKHRAKFFQLLDSCLKSPL
        L+LPLP+DVYKEVLVILDQEVIPYLS PIILCDFL KSY+IGGV+SVMALSSLFLLMTKYGLEYPNFYEKLYALLVPSIFMAKHRAKFFQLLDSCLKSPL
Subjt:  LRLPLPVDVYKEVLVILDQEVIPYLSNPIILCDFLTKSYNIGGVVSVMALSSLFLLMTKYGLEYPNFYEKLYALLVPSIFMAKHRAKFFQLLDSCLKSPL

Query:  LPAYLAAAFAKKLSRLSLVVPPSGSLVIVALIHNLLRRHPSINCLVHRENVGESKNDDSKDEWVARGADVSEVDAD--NMKLGIDHFNYEETDPIKSSAL
        LPAYLAAAFAKKLSRLSLVVPPSG+LVI+ALIHNLLRRHPSINCLVHRENV ESKNDDSK E V +GAD SEVDAD  NMK GIDHFNYEETDPIKSSAL
Subjt:  LPAYLAAAFAKKLSRLSLVVPPSGSLVIVALIHNLLRRHPSINCLVHRENVGESKNDDSKDEWVARGADVSEVDAD--NMKLGIDHFNYEETDPIKSSAL

Query:  RSSLWEIDSLRHHYCPPVSRLVLSLENDLTVRSKTTEIDVKDFVSGSYATILGQELKKKLKRVPLAFYQAIPTTLFSDSDFAGWSFDRENSEKNTDSSNH
        RSSLWEIDSLRHHYCPPVSRLVLSLENDLTVRSKTTEIDVKDFV+GSY+TILGQELKKKLKRVPLAFYQ  PTTLFS+SDFAGWSF+ E+SEKN D S+H
Subjt:  RSSLWEIDSLRHHYCPPVSRLVLSLENDLTVRSKTTEIDVKDFVSGSYATILGQELKKKLKRVPLAFYQAIPTTLFSDSDFAGWSFDRENSEKNTDSSNH

Query:  LSTKRQRVESS
        LS KRQR+ SS
Subjt:  LSTKRQRVESS

TrEMBL top hitse value%identityAlignment
A0A0A0K685 CBF domain-containing protein1.2e-27784.5Show/hide
Query:  MASILSKNQNSKKKRK-----KNNENHTLSDLKTLGLQLLSSRAHINNLPLLLTYVSPSSPPHYVLESLLSLQSFFITVLPSLPSSSKPTATASDDPQDD
        MASI S N N K+K+K     KN   H+LSDLKTLGLQLLSSRAHINNLPLLLT+VSPSSPP YVLE+LLSLQSFFIT LPSLPSSSKP   A DD Q D
Subjt:  MASILSKNQNSKKKRK-----KNNENHTLSDLKTLGLQLLSSRAHINNLPLLLTYVSPSSPPHYVLESLLSLQSFFITVLPSLPSSSKPTATASDDPQDD

Query:  AELIYRTWLRSKFDELVKSLIDVAVSSECDDTLKEIVLDAIMEFVKVGNKGKFHSAVYHRFLQSITNSSTPVNTLIALLVNKYFSHVDVRYFTYISIDKL
        AE IYRTWLRSKFDELVKSLIDVAVSSECDDTLKEIVLDAIMEFVKVGNKGKFHSAVYHRFLQSI +SSTPV+TLIALLV KYF ++DVRYFTYISI +L
Subjt:  AELIYRTWLRSKFDELVKSLIDVAVSSECDDTLKEIVLDAIMEFVKVGNKGKFHSAVYHRFLQSITNSSTPVNTLIALLVNKYFSHVDVRYFTYISIDKL

Query:  AKTFEAEYISGDRSVRINDNDGGHSREGVEFIHIVHSILSSIPPLENSNQSDYTMWIESGDGKGLSDNQEAKQLKMR--EKEVLSASKIVRRMKLKFTKA
        AKTF+AEY+SGD         GGHS+EGVEFIHIVHSI+SSIPPLENSNQSDYTMW+ESGD K LSD+QEAKQLKM+  ++EVL++SKIVRRMKLKF+KA
Subjt:  AKTFEAEYISGDRSVRINDNDGGHSREGVEFIHIVHSILSSIPPLENSNQSDYTMWIESGDGKGLSDNQEAKQLKMR--EKEVLSASKIVRRMKLKFTKA

Query:  WISFLRLPLPVDVYKEVLVILDQEVIPYLSNPIILCDFLTKSYNIGGVVSVMALSSLFLLMTKYGLEYPNFYEKLYALLVPSIFMAKHRAKFFQLLDSCL
        WISFL+LPLP+DVYKEVLVILDQEVIPYLS PIIL DFLTKSY+IGGV+SVMALSSLFLLMTKYGLEYPNFYEKLYALLVPSIFMAKHRAKFFQLLDSCL
Subjt:  WISFLRLPLPVDVYKEVLVILDQEVIPYLSNPIILCDFLTKSYNIGGVVSVMALSSLFLLMTKYGLEYPNFYEKLYALLVPSIFMAKHRAKFFQLLDSCL

Query:  KSPLLPAYLAAAFAKKLSRLSLVVPPSGSLVIVALIHNLLRRHPSINCLVHRENVGESKNDDSKDEWVARGADVSEVDADNMKLGIDHFNYEETDPIKSS
        KSPLLPAYLAAAFAKKLSRLSLVVPPSG+LVI+ALIHNLLRRHPSINCLVHRENV ESKND+S  E  A+G D    D   MK GIDHFNYEE DPIKSS
Subjt:  KSPLLPAYLAAAFAKKLSRLSLVVPPSGSLVIVALIHNLLRRHPSINCLVHRENVGESKNDDSKDEWVARGADVSEVDADNMKLGIDHFNYEETDPIKSS

Query:  ALRSSLWEIDSLRHHYCPPVSRLVLSLENDLTVRSKTTEIDVKDFVSGSYATILGQELKKKLKRVPLAFYQAIPTTLFSDSDFAGWSFDRENSEKNTDSS
        ALRSSLWEIDSLRHHYCPPVSRLVLSLENDLTVRSKTTEIDVKDFV+GSY+TILGQELKKKLKRVPLAFYQA PTTLFS+SDFAGWSFD E+SEKN DSS
Subjt:  ALRSSLWEIDSLRHHYCPPVSRLVLSLENDLTVRSKTTEIDVKDFVSGSYATILGQELKKKLKRVPLAFYQAIPTTLFSDSDFAGWSFDRENSEKNTDSS

Query:  NHLSTKRQRVESS
        +HLS KRQRV SS
Subjt:  NHLSTKRQRVESS

A0A5D3DJ82 Nucleolar complex protein 4-like protein4.1e-27884.97Show/hide
Query:  MASILSKNQNSKKKR-KKNNENHTLSDLKTLGLQLLSSRAHINNLPLLLTYVSPSSPPHYVLESLLSLQSFFITVLPSLPSSSKPTATASDDPQDDAELI
        MASI S + N KKK+  KN + H+LSDLKTLGLQLLSSRAHINNLPLLLT+VSPSSPP YVLE+LLSLQSFFIT LPSLPSSSKP   A DD Q DAE I
Subjt:  MASILSKNQNSKKKR-KKNNENHTLSDLKTLGLQLLSSRAHINNLPLLLTYVSPSSPPHYVLESLLSLQSFFITVLPSLPSSSKPTATASDDPQDDAELI

Query:  YRTWLRSKFDELVKSLIDVAVSSECDDTLKEIVLDAIMEFVKVGNKGKFHSAVYHRFLQSITNSSTPVNTLIALLVNKYFSHVDVRYFTYISIDKLAKTF
        YRTWLRSKFDELVKSLIDVAVSSECDDTLKEIVLDAIMEFVKVGNKGKFHSAVYHRFLQSI  SSTPV+TLIALLV KYF ++DVRYFTYISI +LAK F
Subjt:  YRTWLRSKFDELVKSLIDVAVSSECDDTLKEIVLDAIMEFVKVGNKGKFHSAVYHRFLQSITNSSTPVNTLIALLVNKYFSHVDVRYFTYISIDKLAKTF

Query:  EAEYISGDRSVRINDNDGGHSREGVEFIHIVHSILSSIPPLENSNQSDYTMWIESGDGKGLSDNQEAKQLKMR--EKEVLSASKIVRRMKLKFTKAWISF
        +AEY+SGD         GGHS+EGVEFIHIVHSI+SSIPPLENSNQSDYTMW+ESGD K LSD+QEAKQLKM+  ++EVL+ASKIVRRMKLKF+KAWISF
Subjt:  EAEYISGDRSVRINDNDGGHSREGVEFIHIVHSILSSIPPLENSNQSDYTMWIESGDGKGLSDNQEAKQLKMR--EKEVLSASKIVRRMKLKFTKAWISF

Query:  LRLPLPVDVYKEVLVILDQEVIPYLSNPIILCDFLTKSYNIGGVVSVMALSSLFLLMTKYGLEYPNFYEKLYALLVPSIFMAKHRAKFFQLLDSCLKSPL
        L+LPLP+DVYKEVLVILDQEVIPYLS PIIL DFLTKSY+IGGV+SVMALSSLFLLMTKYGLEYPNFYEKLYALLVPSIFMAKHRAKFFQLLDSCLKSPL
Subjt:  LRLPLPVDVYKEVLVILDQEVIPYLSNPIILCDFLTKSYNIGGVVSVMALSSLFLLMTKYGLEYPNFYEKLYALLVPSIFMAKHRAKFFQLLDSCLKSPL

Query:  LPAYLAAAFAKKLSRLSLVVPPSGSLVIVALIHNLLRRHPSINCLVHRENVGESKNDDSKDEWVARGADVSEVDAD--NMKLGIDHFNYEETDPIKSSAL
        LPAYLAAAFAKKLSRLSLVVPPSG+LVI+ALIHNLLRRHPSINCLVHRENVGESKNDDS  E  A+G D SEVDAD   MK GIDHFNYEETDPIKSSAL
Subjt:  LPAYLAAAFAKKLSRLSLVVPPSGSLVIVALIHNLLRRHPSINCLVHRENVGESKNDDSKDEWVARGADVSEVDAD--NMKLGIDHFNYEETDPIKSSAL

Query:  RSSLWEIDSLRHHYCPPVSRLVLSLENDLTVRSKTTEIDVKDFVSGSYATILGQELKKKLKRVPLAFYQAIPTTLFSDSDFAGWSFDRENSEKNT-DSSN
        RSSLWEIDSLRHHYCPPVSRLVLSLENDLTVRSKTTEIDVKDFV+GSY+TILGQELKKKLKRVPLAFYQA PTTLFS+SDF GWSFD E+S+KN  D S+
Subjt:  RSSLWEIDSLRHHYCPPVSRLVLSLENDLTVRSKTTEIDVKDFVSGSYATILGQELKKKLKRVPLAFYQAIPTTLFSDSDFAGWSFDRENSEKNT-DSSN

Query:  HLSTKRQRVESS
        HLS KRQ + SS
Subjt:  HLSTKRQRVESS

A0A6J1C6T4 nucleolar complex protein 4 homolog1.8e-28685.27Show/hide
Query:  MASILSKNQNSKKKRKKNNENHTLSDLKTLGLQLLSSRAHINNLPLLLTYVSPSSPPHYVLESLLSLQSFFITVLPSLPSSSKPTATASDDPQDDAELIY
        MAS+LSK            E HTLS+LKTLGLQLLSSRAHINNLPLLLT+VSP+SPPHYVLE+LLSLQSFF+TVLPSLPSSSKP   A+DDP DDAELIY
Subjt:  MASILSKNQNSKKKRKKNNENHTLSDLKTLGLQLLSSRAHINNLPLLLTYVSPSSPPHYVLESLLSLQSFFITVLPSLPSSSKPTATASDDPQDDAELIY

Query:  RTWLRSKFDELVKSLIDVAVSSECDDTLKEIVLDAIMEFVKVGNKGKFHSAVYHRFLQSITNSSTPVNTLIALLVNKYFSHVDVRYFTYISIDKLAKTFE
        +TWLRSKFDE VKSLIDVAVSS+CDDTLKEIVLDAIMEFVKVGNKGKFHSAVYH+FLQ+I +S+TPVNTLIALLV KYF H+DVRYFTYISI+KLAK FE
Subjt:  RTWLRSKFDELVKSLIDVAVSSECDDTLKEIVLDAIMEFVKVGNKGKFHSAVYHRFLQSITNSSTPVNTLIALLVNKYFSHVDVRYFTYISIDKLAKTFE

Query:  AEYISGDRSVRINDNDGGHSREGVEFIHIVHSILSSIPPLENSNQ-SDYTMWIESGDGKGLSDNQEAKQLKM--REKEVLSASKIVRRMKLKFTKAWISF
        AEY+SGD +VR+ND+DGGHS EGVE IHIVHSI+SSIPPLENSNQ SDYTMW+ESGD K + DNQE KQLKM   +KEVLSASKIV+RMK+KFT+AWISF
Subjt:  AEYISGDRSVRINDNDGGHSREGVEFIHIVHSILSSIPPLENSNQ-SDYTMWIESGDGKGLSDNQEAKQLKM--REKEVLSASKIVRRMKLKFTKAWISF

Query:  LRLPLPVDVYKEVLVILDQEVIPYLSNPIILCDFLTKSYNIGGVVSVMALSSLFLLMTKYGLEYPNFYEKLYALLVPSIFMAKHRAKFFQLLDSCLKSPL
        LRLPLP+DVYKEVLVILDQEVIPYLSNPIILCDFLTKSY+IGGVVSVMALSSL+LLMTKYGLEYPNFYEKLYALLVPSIFMAKHRAKFFQLLDSCLKSPL
Subjt:  LRLPLPVDVYKEVLVILDQEVIPYLSNPIILCDFLTKSYNIGGVVSVMALSSLFLLMTKYGLEYPNFYEKLYALLVPSIFMAKHRAKFFQLLDSCLKSPL

Query:  LPAYLAAAFAKKLSRLSLVVPPSGSLVIVALIHNLLRRHPSINCLVHRENVGESKNDDSKDEWVARGADVSEVDAD--NMKLGIDHFNYEETDPIKSSAL
        LPAYLAAAFAKKLSRLSLVVPPSG+LVI+ALIHNLLRRHPSINCLVHREN+ ESK D+S DE VARG D S VDAD  N K GIDHFNYEETDPIKSSAL
Subjt:  LPAYLAAAFAKKLSRLSLVVPPSGSLVIVALIHNLLRRHPSINCLVHRENVGESKNDDSKDEWVARGADVSEVDAD--NMKLGIDHFNYEETDPIKSSAL

Query:  RSSLWEIDSLRHHYCPPVSRLVLSLENDLTVRSKTTEIDVKDFVSGSYATILGQELKKKLKRVPLAFYQAIPTTLFSDSDFAGWSFDRENSEKNTDSSNH
        RSSLWEIDSLRHHYCPPVSRLVLSLENDLTVRSKTTEIDVKDFV+GSYATILGQELKKK+KRVPLAFYQAIPTTLFS+SDF GWSFD ++SE N D S+H
Subjt:  RSSLWEIDSLRHHYCPPVSRLVLSLENDLTVRSKTTEIDVKDFVSGSYATILGQELKKKLKRVPLAFYQAIPTTLFSDSDFAGWSFDRENSEKNTDSSNH

Query:  LSTKRQRVESS
        LS KRQR+ESS
Subjt:  LSTKRQRVESS

A0A6J1GJR4 nucleolar complex protein 4 homolog3.0e-28986.72Show/hide
Query:  MASILSKNQNSKKKRKKNNENHTLSDLKTLGLQLLSSRAHINNLPLLLTYVSPSSPPHYVLESLLSLQSFFITVLPSLPSSSKPTATASDDPQDDAELIY
        MAS+ S NQ S+KK+KKN +NH LSDLKTLGLQLLSSRAHINNLPLLLT++SPS PP YVLE+LLSLQSFFITVLPSLPSSSKP A    D QDDAELIY
Subjt:  MASILSKNQNSKKKRKKNNENHTLSDLKTLGLQLLSSRAHINNLPLLLTYVSPSSPPHYVLESLLSLQSFFITVLPSLPSSSKPTATASDDPQDDAELIY

Query:  RTWLRSKFDELVKSLIDVAVSSECDDTLKEIVLDAIMEFVKVGNKGKFHSAVYHRFLQSITNSSTPVNTLIALLVNKYFSHVDVRYFTYISIDKLAKTFE
        RTWLRSKFDELVKSLIDVAVSSECDDTLKEIVLDAIMEFVKVGN+GKFHSA+YHRFLQSI +SSTPVNTLIALLV KYF+H+DVRYFTYISI+KL +TFE
Subjt:  RTWLRSKFDELVKSLIDVAVSSECDDTLKEIVLDAIMEFVKVGNKGKFHSAVYHRFLQSITNSSTPVNTLIALLVNKYFSHVDVRYFTYISIDKLAKTFE

Query:  AEYISGDRSVRINDNDGGHSREGVEFIHIVHSILSSIPPLENSNQSDYTMWIESGDGKGLSDNQEAKQLKMR--EKEVLSASKIVRRMKLKFTKAWISFL
        AEY+ GDRSVRI+ +DG HSR+GVEFIHIVHSI+SSIPPLENSNQSDYTMW+ESGD K LSDNQEAKQLKMR  ++EVL+ASKIVR+MK KFTKAWISFL
Subjt:  AEYISGDRSVRINDNDGGHSREGVEFIHIVHSILSSIPPLENSNQSDYTMWIESGDGKGLSDNQEAKQLKMR--EKEVLSASKIVRRMKLKFTKAWISFL

Query:  RLPLPVDVYKEVLVILDQEVIPYLSNPIILCDFLTKSYNIGGVVSVMALSSLFLLMTKYGLEYPNFYEKLYALLVPSIFMAKHRAKFFQLLDSCLKSPLL
        RLPLP+DVYKEVLVILDQEVIPYLSNPIILCDFLTKSY++GGVVSVMALSSLFLLMTKYGLEYPNFYEKLYALLVPSIFMAKHRAKFFQLLDSCLKSPLL
Subjt:  RLPLPVDVYKEVLVILDQEVIPYLSNPIILCDFLTKSYNIGGVVSVMALSSLFLLMTKYGLEYPNFYEKLYALLVPSIFMAKHRAKFFQLLDSCLKSPLL

Query:  PAYLAAAFAKKLSRLSLVVPPSGSLVIVALIHNLLRRHPSINCLVHRENVGESKNDDSKDEWVARGADVSEVDAD--NMKLGIDHFNYEETDPIKSSALR
        PAYLAAAFAKKLSRLSLVVPPSG+L+I+ALIHNLLRRHPSINCLVHRENV ESKNDDS  + VA+G D SEV+AD  NMK GID FNYEETDPIKSSALR
Subjt:  PAYLAAAFAKKLSRLSLVVPPSGSLVIVALIHNLLRRHPSINCLVHRENVGESKNDDSKDEWVARGADVSEVDAD--NMKLGIDHFNYEETDPIKSSALR

Query:  SSLWEIDSLRHHYCPPVSRLVLSLENDLTVRSKTTEIDVKDFVSGSYATILGQELKKKLKRVPLAFYQAIPTTLFSDSDFAGWSFDRENSEKNTDSSNHL
        SSLWEID LRHHYCPPVSRLVLSLENDLTVRSKTTEIDVKDFV+GSYATILGQELKKK+KRVPLAFYQAIPT+LFS SDF GWSF+ E+SEKN D S+HL
Subjt:  SSLWEIDSLRHHYCPPVSRLVLSLENDLTVRSKTTEIDVKDFVSGSYATILGQELKKKLKRVPLAFYQAIPTTLFSDSDFAGWSFDRENSEKNTDSSNHL

Query:  STKRQRVESS
          KRQRVESS
Subjt:  STKRQRVESS

A0A6J1HZT8 nucleolar complex protein 4 homolog B3.2e-29187.38Show/hide
Query:  MASILSKNQNSKKKRKKNNENHTLSDLKTLGLQLLSSRAHINNLPLLLTYVSPSSPPHYVLESLLSLQSFFITVLPSLPSSSKPTATASDDPQDDAELIY
        MAS+ S NQN KKK KKN +NH LSDLKTLGLQLLSS+AHINNLPLLLT+VSPS PP YVLE+LLSLQSFFITVLPSLPSSSKP A    D QDDAELIY
Subjt:  MASILSKNQNSKKKRKKNNENHTLSDLKTLGLQLLSSRAHINNLPLLLTYVSPSSPPHYVLESLLSLQSFFITVLPSLPSSSKPTATASDDPQDDAELIY

Query:  RTWLRSKFDELVKSLIDVAVSSECDDTLKEIVLDAIMEFVKVGNKGKFHSAVYHRFLQSITNSSTPVNTLIALLVNKYFSHVDVRYFTYISIDKLAKTFE
        RTWLRSKFDELVKSLIDVAVSSECDDTLKEIVLDAIMEFVKVGN+GKFHSAVYHRFLQSI +SS PVNTLIALLV KYF+H+DVRYFTYISI+KL +TFE
Subjt:  RTWLRSKFDELVKSLIDVAVSSECDDTLKEIVLDAIMEFVKVGNKGKFHSAVYHRFLQSITNSSTPVNTLIALLVNKYFSHVDVRYFTYISIDKLAKTFE

Query:  AEYISGDRSVRINDNDGGHSREGVEFIHIVHSILSSIPPLENSNQSDYTMWIESGDGKGLSDNQEAKQLKMR--EKEVLSASKIVRRMKLKFTKAWISFL
        AEY+SGDRSVRIN +DG HSREGVEFIHIVHSI+SSIPPLENSNQSDYTMW+ESGD K LSDNQEAKQLKMR  ++EVLSAS+IVR+MK KFTKAWISFL
Subjt:  AEYISGDRSVRINDNDGGHSREGVEFIHIVHSILSSIPPLENSNQSDYTMWIESGDGKGLSDNQEAKQLKMR--EKEVLSASKIVRRMKLKFTKAWISFL

Query:  RLPLPVDVYKEVLVILDQEVIPYLSNPIILCDFLTKSYNIGGVVSVMALSSLFLLMTKYGLEYPNFYEKLYALLVPSIFMAKHRAKFFQLLDSCLKSPLL
        RLPLP+DVYKEVLVILDQEVIPYLSNPIILCDFLTKSY++GGVVSVMALSSLFLLMTKYGLEYPNFYEKLYALLVPSIFMAKHRAKFFQLLDSCLKSPLL
Subjt:  RLPLPVDVYKEVLVILDQEVIPYLSNPIILCDFLTKSYNIGGVVSVMALSSLFLLMTKYGLEYPNFYEKLYALLVPSIFMAKHRAKFFQLLDSCLKSPLL

Query:  PAYLAAAFAKKLSRLSLVVPPSGSLVIVALIHNLLRRHPSINCLVHRENVGESKNDDSKDEWVARGADVSEVDAD--NMKLGIDHFNYEETDPIKSSALR
        PAYLAAAFAKKLSRLSLVVPPSG+L+I+ALIHNLLRRHPSINCLVHRENV ESKNDDS  + VA+G D SEV+AD  NMK GID FNYEETDPIKSSALR
Subjt:  PAYLAAAFAKKLSRLSLVVPPSGSLVIVALIHNLLRRHPSINCLVHRENVGESKNDDSKDEWVARGADVSEVDAD--NMKLGIDHFNYEETDPIKSSALR

Query:  SSLWEIDSLRHHYCPPVSRLVLSLENDLTVRSKTTEIDVKDFVSGSYATILGQELKKKLKRVPLAFYQAIPTTLFSDSDFAGWSFDRENSEKNTDSSNHL
        SSLWEID LRHHYCPPVSRLVLSLENDLTVRSKTTEIDVKDFV+GSYATILGQELKKK+KRVPLAFYQAIPT+LFS+SDF GWSF+ E+SEKN D S+HL
Subjt:  SSLWEIDSLRHHYCPPVSRLVLSLENDLTVRSKTTEIDVKDFVSGSYATILGQELKKKLKRVPLAFYQAIPTTLFSDSDFAGWSFDRENSEKNTDSSNHL

Query:  STKRQRVESS
          KRQRVESS
Subjt:  STKRQRVESS

SwissProt top hitse value%identityAlignment
F4IMH3 Protein NUCLEOLAR COMPLEX ASSOCIATED 41.1e-17957.51Show/hide
Query:  MASILSKNQNSKKKRKKNNENHTLSDLKTLGLQLLSSRAHINNLPLLLTYVSPSSPPHYVLESLLSLQSFFITVLPSL-PSSSKPTATASDDPQDDAELI
        MASILSK Q       K NE +TL +LK+LG  LL+SR+HINNLPLLLT+VSP SPP +V+ESLLSLQSFF  +L  L P+SS P++T ++DP    E++
Subjt:  MASILSKNQNSKKKRKKNNENHTLSDLKTLGLQLLSSRAHINNLPLLLTYVSPSSPPHYVLESLLSLQSFFITVLPSL-PSSSKPTATASDDPQDDAELI

Query:  YRTWLRSKFDELVKSLIDVAVSSECDDTLKEIVLDAIMEFVKVGNKGKFHSAVYHRFLQSITNSSTPVNTLIALLVNKYFSHVDVRYFTYISIDKLAKTF
        ++ WLRSKFDE VK L+DV VS + +D+L+ IVL  +MEFVK+ N G+FHS++YHR L +I +S   +   + +L +KYF ++DVRYFTYIS++K  KT 
Subjt:  YRTWLRSKFDELVKSLIDVAVSSECDDTLKEIVLDAIMEFVKVGNKGKFHSAVYHRFLQSITNSSTPVNTLIALLVNKYFSHVDVRYFTYISIDKLAKTF

Query:  EAEYISGDRSVRINDNDGGHSREGVEF-IHIVHSILSSIPPLE-NSNQSDYTMWIESGDGKGLSDNQEAKQLKMREKEVLSASKIVRRMKLKFTKAWISF
        EA  +S DR+V  N+     S+E +E  +  ++ +LS IPP E  + +S + MW  S +        + K+ +  +  +LS + I +RMKLKFTKAWISF
Subjt:  EAEYISGDRSVRINDNDGGHSREGVEF-IHIVHSILSSIPPLE-NSNQSDYTMWIESGDGKGLSDNQEAKQLKMREKEVLSASKIVRRMKLKFTKAWISF

Query:  LRLPLPVDVYKEVLVILDQEVIPYLSNPIILCDFLTKSYNIGGVVSVMALSSLFLLMTKYGLEYPNFYEKLYALLVPSIFMAKHRAKFFQLLDSCLKSPL
        LRLPLP+DVYKEVL  +   VIP+LSNP +LCDFLTKSY+IGGVVSVMALSSLF+LMT++GLEYP FYEKLYALLVPS+F+AKHRAKF QLLD+CLKS +
Subjt:  LRLPLPVDVYKEVLVILDQEVIPYLSNPIILCDFLTKSYNIGGVVSVMALSSLFLLMTKYGLEYPNFYEKLYALLVPSIFMAKHRAKFFQLLDSCLKSPL

Query:  LPAYLAAAFAKKLSRLSLVVPPSGSLVIVALIHNLLRRHPSINCLVHR--ENVGESKNDDSKDEWVARGADVSEVDADNMKLGIDHFNYEETDPIKSSAL
        LPAYLAA+F KKLSRLSL +PP+GSLVI ALI+NLLRR+P+IN LV    EN  E+  +  +        +         KLGID+FN +E+DP KS AL
Subjt:  LPAYLAAAFAKKLSRLSLVVPPSGSLVIVALIHNLLRRHPSINCLVHR--ENVGESKNDDSKDEWVARGADVSEVDADNMKLGIDHFNYEETDPIKSSAL

Query:  RSSLWEIDSLRHHYCPPVSRLVLSLENDLTVRSKTTEIDVKDFVSGSYATILGQELKKKLKRVPLAFYQAIPTTLFSDSDFAGWSF
        +SSLWEID+LRHHYCPPVSR + SLE +LT+RSKTTE+ ++DF SGSYATI G E+++++K+VPLAFY+ +PT+LF+DSDF GW+F
Subjt:  RSSLWEIDSLRHHYCPPVSRLVLSLENDLTVRSKTTEIDVKDFVSGSYATILGQELKKKLKRVPLAFYQAIPTTLFSDSDFAGWSF

Q5ZJC7 Nucleolar complex protein 4 homolog7.2e-5431.94Show/hide
Query:  AELIYRTWLRSKFDELVKSLIDVAVSSECDDTLKEIVLDAIMEFVK-------VGNKGKFHSAVYHRFLQSITNSSTPVNTLIALLVNK---YFSHVDVR
        AE  Y+ W+R ++++ V+SL ++         +KE  L  +M+FV+       V  + K   A     L+ + N   P++   +LL+++   Y  + DVR
Subjt:  AELIYRTWLRSKFDELVKSLIDVAVSSECDDTLKEIVLDAIMEFVK-------VGNKGKFHSAVYHRFLQSITNSSTPVNTLIALLVNK---YFSHVDVR

Query:  YFTYISIDKLAKTFEAEYISGDRSVRINDNDGGHSREGVEFIHIVHSILSSIPPLENSNQSDYTMWIESGDGKGLSDNQEAKQLKMREKEVLSASKIVRR
        YF    + +           G    +I +         + F     ++ S I P+   N+                 +     +K   +E    SK+ + 
Subjt:  YFTYISIDKLAKTFEAEYISGDRSVRINDNDGGHSREGVEFIHIVHSILSSIPPLENSNQSDYTMWIESGDGKGLSDNQEAKQLKMREKEVLSASKIVRR

Query:  MKLKFTKAWISFLRLPLPVDVYKEVLVILDQEVIPYLSNPIILCDFLTKSYNIGGVVSVMALSSLFLLMTKYGLEYPNFYEKLYALLVPSIFMAKHRAKF
         K  F + W++FL+  LP  +YK+VLVIL   ++PY++ P ++ DFLT +Y +GG +S++AL+ LF+L+ ++ LEYP+FY+KLY+LL PSI+  K+RA+F
Subjt:  MKLKFTKAWISFLRLPLPVDVYKEVLVILDQEVIPYLSNPIILCDFLTKSYNIGGVVSVMALSSLFLLMTKYGLEYPNFYEKLYALLVPSIFMAKHRAKF

Query:  FQLLDSCLKSPLLPAYLAAAFAKKLSRLSLVVPPSGSLVIVALIHNLLRRHPSINCLVHRENVGESKNDDSKDEWVARGADVSEVDADNMKLGIDHFNYE
        F L D  L S  LPAYL AAF K+LSRL+L  PP   L+++  I NL RRHP+   L+HR N  +               D+SE          D +  E
Subjt:  FQLLDSCLKSPLLPAYLAAAFAKKLSRLSLVVPPSGSLVIVALIHNLLRRHPSINCLVHRENVGESKNDDSKDEWVARGADVSEVDADNMKLGIDHFNYE

Query:  ETDPIKSSALRSSLWEIDSLRHHYCPPVSRLVLSLENDLTVRSKTTEIDVKDFVSGSYATILGQELKKKLKRVPLAFYQ
        + +P +S AL SSLWE+ SL++HY P V++    L   L+      E D+   +  S + +  +E+KK    VPL F Q
Subjt:  ETDPIKSSALRSSLWEIDSLRHHYCPPVSRLVLSLENDLTVRSKTTEIDVKDFVSGSYATILGQELKKKLKRVPLAFYQ

Q6NRQ2 Nucleolar complex protein 4 homolog A3.9e-5229.53Show/hide
Query:  VLESLLSLQSFFITVLPSLPSSSKPTATASDDPQDDAELIYRTWLRSKFDELVKSLIDVAVSSECDDTLKEIVLDAIMEFVKVGNKGKFHSAVY---HRF
        + E +L  +  +I  LP+   +   T +A D         Y+ W+R +++     ++D+   S   +  +E+ L  +M+F+++  K    ++ +   +RF
Subjt:  VLESLLSLQSFFITVLPSLPSSSKPTATASDDPQDDAELIYRTWLRSKFDELVKSLIDVAVSSECDDTLKEIVLDAIMEFVKVGNKGKFHSAVY---HRF

Query:  LQSITN-------SSTPVNTLIALLVNKYFSHVDVRYFTY-ISIDKLAKTFEAEYISGDRSVRINDNDGGHSREGVEFIHIVHSILSSIP-PLENSNQSD
         + +               TL+     +Y  + DVRY+T  ++ D +++  +   +      + N                V  +LSSI  P+E S   +
Subjt:  LQSITN-------SSTPVNTLIALLVNKYFSHVDVRYFTY-ISIDKLAKTFEAEYISGDRSVRINDNDGGHSREGVEFIHIVHSILSSIP-PLENSNQSD

Query:  YTMWIESGDGKGLSDNQEAKQLKMREKEVLSASKIVRRMKLKFTKAWISFLRLPLPVDVYKEVLVILDQEVIPYLSNPIILCDFLTKSYNIGGVVSVMAL
        + +           +N++ K  K+++             K  F + W+ FL+  L V +YK+VL+IL + ++P++S P ++ DFLT +Y++GG +S++AL
Subjt:  YTMWIESGDGKGLSDNQEAKQLKMREKEVLSASKIVRRMKLKFTKAWISFLRLPLPVDVYKEVLVILDQEVIPYLSNPIILCDFLTKSYNIGGVVSVMAL

Query:  SSLFLLMTKYGLEYPNFYEKLYALLVPSIFMAKHRAKFFQLLDSCLKSPLLPAYLAAAFAKKLSRLSLVVPPSGSLVIVALIHNLLRRHPSINCLVHREN
        + LF+L+ ++ LEYP+FY+KLY+LL PSIF  K+RA+FF L +  L S  LP YL AAFAK+L+RL+L  PP   L+I+  I NL+RRHP+   L+HR +
Subjt:  SSLFLLMTKYGLEYPNFYEKLYALLVPSIFMAKHRAKFFQLLDSCLKSPLLPAYLAAAFAKKLSRLSLVVPPSGSLVIVALIHNLLRRHPSINCLVHREN

Query:  VGESKNDDSKDEWVARGADVSEVDADNMKLGIDHFNYEETDPIKSSALRSSLWEIDSLRHHYCPPVSRLVLSLENDLTVRSKTTEIDVKDFVSGSYATIL
         G+                          L  D +  EE DP KS AL SSLWE++ L+ HY   V R    +   L+ +    E D+   +  S   + 
Subjt:  VGESKNDDSKDEWVARGADVSEVDADNMKLGIDHFNYEETDPIKSSALRSSLWEIDSLRHHYCPPVSRLVLSLENDLTVRSKTTEIDVKDFVSGSYATIL

Query:  GQEL-KKKLKRVPLAFYQAIPTTLFSDSDFAGWSF
         +E+ KKK K VPL  Y+ +   L   SD     F
Subjt:  GQEL-KKKLKRVPLAFYQAIPTTLFSDSDFAGWSF

Q6NU91 Nucleolar complex protein 4 homolog B1.5e-5631.71Show/hide
Query:  VLESLLSLQSFFITVLPSLPSSSKPTATASDDPQDDAELIYRTWLRSKFDELVKSLIDVAVSSECDDTLKEIVLDAIMEFVKVGNKGKFHSAVY---HRF
        + E LL  +  +I  LP+   S   T +A D         Y+ W+R++++  V  L+D+   S    +++E+VL  +M+F+++  K    ++ +   +RF
Subjt:  VLESLLSLQSFFITVLPSLPSSSKPTATASDDPQDDAELIYRTWLRSKFDELVKSLIDVAVSSECDDTLKEIVLDAIMEFVKVGNKGKFHSAVY---HRF

Query:  ----LQSITNSSTPVNTLIALLVNK---YFSHVDVRYFTYISIDKLAKTFEAEYISGDRSVRINDNDGGHSREGVEFIHIVHSILSSI-PPLENSNQSDY
            L+ + ++         LL+ +   Y  + DVRY+T         T   E +S     RI   +         F   V  +LSSI  P+E S   ++
Subjt:  ----LQSITNSSTPVNTLIALLVNK---YFSHVDVRYFTYISIDKLAKTFEAEYISGDRSVRINDNDGGHSREGVEFIHIVHSILSSI-PPLENSNQSDY

Query:  TMWIESGDGKGLSDNQEAKQLKMREKEVLSASKIVRRMKLKFTKAWISFLRLPLPVDVYKEVLVILDQEVIPYLSNPIILCDFLTKSYNIGGVVSVMALS
         +           +++E K  K++E+            K  F + W+SFL+  L V +YK+VL+IL + ++P++S P ++ DFLT +Y++GG +S++AL+
Subjt:  TMWIESGDGKGLSDNQEAKQLKMREKEVLSASKIVRRMKLKFTKAWISFLRLPLPVDVYKEVLVILDQEVIPYLSNPIILCDFLTKSYNIGGVVSVMALS

Query:  SLFLLMTKYGLEYPNFYEKLYALLVPSIFMAKHRAKFFQLLDSCLKSPLLPAYLAAAFAKKLSRLSLVVPPSGSLVIVALIHNLLRRHPSINCLVHRENV
         LF+L+ ++ LEYP+FY+KLY+LL PS+F  K+RA+FF L +  L S  LP YL AAFAK+L+RL+L  PP   L+I+  I NL+RRHP+   L+HR + 
Subjt:  SLFLLMTKYGLEYPNFYEKLYALLVPSIFMAKHRAKFFQLLDSCLKSPLLPAYLAAAFAKKLSRLSLVVPPSGSLVIVALIHNLLRRHPSINCLVHRENV

Query:  GESKNDDSKDEWVARGADVSEVDADNMKLGIDHFNYEETDPIKSSALRSSLWEIDSLRHHYCPPVSRLVLSLENDLTVRSKTTEIDVKDFVSGSYATILG
        G+                          L  D +  EE DP KS AL S LWE++ L+ HY   V R    +   L+ +    E DV   +  S   +  
Subjt:  GESKNDDSKDEWVARGADVSEVDADNMKLGIDHFNYEETDPIKSSALRSSLWEIDSLRHHYCPPVSRLVLSLENDLTVRSKTTEIDVKDFVSGSYATILG

Query:  QELKKKLKRVPLAFYQAIPTTLFSDSDFAGWSF
        +E+KKK K VPL  Y+ +   L   SD     F
Subjt:  QELKKKLKRVPLAFYQAIPTTLFSDSDFAGWSF

Q8BHY2 Nucleolar complex protein 4 homolog8.2e-5040.14Show/hide
Query:  VRRMKLKFTKAWISFLRLPLPVDVYKEVLVILDQEVIPYLSNPIILCDFLTKSYNIGGVVSVMALSSLFLLMTKYGLEYPNFYEKLYALLVPSIFMAKHR
        ++  K  F + W+ FL+  LP+ +YK+VLV +   ++P+L+ P ++ DFLT + ++GG +S++AL+ LF+L+ K+ LEYP+FY+KLY LL PSIF  K+R
Subjt:  VRRMKLKFTKAWISFLRLPLPVDVYKEVLVILDQEVIPYLSNPIILCDFLTKSYNIGGVVSVMALSSLFLLMTKYGLEYPNFYEKLYALLVPSIFMAKHR

Query:  AKFFQLLDSCLKSPLLPAYLAAAFAKKLSRLSLVVPPSGSLVIVALIHNLLRRHPSINCLVHRENVGESKNDDSKDEWVARGADVSEVDADNMKLGIDHF
        A+FF L D  L S  LPAYL AAFAK+L+RL+L  PP   L+++ LI NLLRRHP+   +VHR    E                          L  D +
Subjt:  AKFFQLLDSCLKSPLLPAYLAAAFAKKLSRLSLVVPPSGSLVIVALIHNLLRRHPSINCLVHRENVGESKNDDSKDEWVARGADVSEVDADNMKLGIDHF

Query:  NYEETDPIKSSALRSSLWEIDSLRHHYCPPVSRLVLSLENDLTVRSKTTEIDVKDFVSGSYATILGQELKKKL-KRVPLAFYQA
        +  E DP +S AL S LWE+ +L+ HY P VS+    +   L+V     E+ +   +  +   I  Q+LKKK+ + VPL F  A
Subjt:  NYEETDPIKSSALRSSLWEIDSLRHHYCPPVSRLVLSLENDLTVRSKTTEIDVKDFVSGSYATILGQELKKKL-KRVPLAFYQA

Arabidopsis top hitse value%identityAlignment
AT2G17250.1 CCAAT-binding factor7.8e-18157.51Show/hide
Query:  MASILSKNQNSKKKRKKNNENHTLSDLKTLGLQLLSSRAHINNLPLLLTYVSPSSPPHYVLESLLSLQSFFITVLPSL-PSSSKPTATASDDPQDDAELI
        MASILSK Q       K NE +TL +LK+LG  LL+SR+HINNLPLLLT+VSP SPP +V+ESLLSLQSFF  +L  L P+SS P++T ++DP    E++
Subjt:  MASILSKNQNSKKKRKKNNENHTLSDLKTLGLQLLSSRAHINNLPLLLTYVSPSSPPHYVLESLLSLQSFFITVLPSL-PSSSKPTATASDDPQDDAELI

Query:  YRTWLRSKFDELVKSLIDVAVSSECDDTLKEIVLDAIMEFVKVGNKGKFHSAVYHRFLQSITNSSTPVNTLIALLVNKYFSHVDVRYFTYISIDKLAKTF
        ++ WLRSKFDE VK L+DV VS + +D+L+ IVL  +MEFVK+ N G+FHS++YHR L +I +S   +   + +L +KYF ++DVRYFTYIS++K  KT 
Subjt:  YRTWLRSKFDELVKSLIDVAVSSECDDTLKEIVLDAIMEFVKVGNKGKFHSAVYHRFLQSITNSSTPVNTLIALLVNKYFSHVDVRYFTYISIDKLAKTF

Query:  EAEYISGDRSVRINDNDGGHSREGVEF-IHIVHSILSSIPPLE-NSNQSDYTMWIESGDGKGLSDNQEAKQLKMREKEVLSASKIVRRMKLKFTKAWISF
        EA  +S DR+V  N+     S+E +E  +  ++ +LS IPP E  + +S + MW  S +        + K+ +  +  +LS + I +RMKLKFTKAWISF
Subjt:  EAEYISGDRSVRINDNDGGHSREGVEF-IHIVHSILSSIPPLE-NSNQSDYTMWIESGDGKGLSDNQEAKQLKMREKEVLSASKIVRRMKLKFTKAWISF

Query:  LRLPLPVDVYKEVLVILDQEVIPYLSNPIILCDFLTKSYNIGGVVSVMALSSLFLLMTKYGLEYPNFYEKLYALLVPSIFMAKHRAKFFQLLDSCLKSPL
        LRLPLP+DVYKEVL  +   VIP+LSNP +LCDFLTKSY+IGGVVSVMALSSLF+LMT++GLEYP FYEKLYALLVPS+F+AKHRAKF QLLD+CLKS +
Subjt:  LRLPLPVDVYKEVLVILDQEVIPYLSNPIILCDFLTKSYNIGGVVSVMALSSLFLLMTKYGLEYPNFYEKLYALLVPSIFMAKHRAKFFQLLDSCLKSPL

Query:  LPAYLAAAFAKKLSRLSLVVPPSGSLVIVALIHNLLRRHPSINCLVHR--ENVGESKNDDSKDEWVARGADVSEVDADNMKLGIDHFNYEETDPIKSSAL
        LPAYLAA+F KKLSRLSL +PP+GSLVI ALI+NLLRR+P+IN LV    EN  E+  +  +        +         KLGID+FN +E+DP KS AL
Subjt:  LPAYLAAAFAKKLSRLSLVVPPSGSLVIVALIHNLLRRHPSINCLVHR--ENVGESKNDDSKDEWVARGADVSEVDADNMKLGIDHFNYEETDPIKSSAL

Query:  RSSLWEIDSLRHHYCPPVSRLVLSLENDLTVRSKTTEIDVKDFVSGSYATILGQELKKKLKRVPLAFYQAIPTTLFSDSDFAGWSF
        +SSLWEID+LRHHYCPPVSR + SLE +LT+RSKTTE+ ++DF SGSYATI G E+++++K+VPLAFY+ +PT+LF+DSDF GW+F
Subjt:  RSSLWEIDSLRHHYCPPVSRLVLSLENDLTVRSKTTEIDVKDFVSGSYATILGQELKKKLKRVPLAFYQAIPTTLFSDSDFAGWSF


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGTCCATTCTCTCAAAGAATCAGAACTCGAAGAAGAAGAGGAAGAAGAACAATGAGAATCACACGCTTTCAGACCTCAAAACCCTAGGCCTCCAACTTCTCTCCTC
TCGAGCTCACATCAACAATCTCCCTCTTCTTCTTACCTACGTTTCTCCCTCTTCTCCTCCTCACTATGTCCTCGAATCCCTCCTCTCCCTCCAGTCCTTCTTCATCACTG
TCCTCCCCTCCCTCCCTTCCTCCTCCAAGCCCACCGCCACTGCCTCCGACGACCCCCAGGACGACGCCGAGTTGATTTACCGGACCTGGCTCCGTTCCAAGTTTGATGAA
CTCGTCAAGTCTCTCATTGATGTAGCGGTTTCTTCTGAATGCGATGACACTCTCAAGGAGATTGTGTTGGATGCGATTATGGAGTTTGTTAAAGTTGGTAACAAGGGGAA
ATTTCACTCTGCTGTATATCACAGATTTTTGCAGAGTATCACTAATTCTTCTACGCCAGTTAATACTCTGATAGCCTTGCTTGTAAACAAGTACTTCAGTCACGTTGACG
TCCGTTATTTTACGTATATTAGCATCGACAAACTTGCCAAGACTTTTGAGGCTGAGTACATTTCAGGTGATAGAAGTGTGAGGATTAACGACAATGATGGTGGTCATTCA
AGAGAAGGAGTGGAGTTCATTCACATTGTGCACTCTATCTTATCCTCCATTCCCCCTTTGGAAAACTCAAATCAATCTGACTACACTATGTGGATTGAATCAGGCGATGG
CAAAGGGCTCTCTGACAATCAAGAAGCAAAGCAGCTTAAAATGAGGGAAAAAGAGGTCTTATCAGCATCGAAGATTGTTAGAAGAATGAAACTAAAGTTTACAAAAGCAT
GGATTTCGTTTCTCAGGTTACCACTTCCAGTTGATGTGTACAAGGAGGTTCTTGTAATTCTTGATCAGGAAGTCATTCCTTATCTTTCTAATCCAATCATTTTATGTGAC
TTCTTAACAAAATCCTATAATATTGGCGGTGTTGTCAGTGTTATGGCTCTCAGCAGCCTCTTCCTTCTTATGACAAAATATGGTTTAGAGTATCCAAACTTCTATGAAAA
ACTTTATGCCCTATTGGTTCCTTCAATATTCATGGCAAAACATCGGGCCAAGTTTTTTCAGCTTCTTGATTCTTGCTTGAAGTCACCACTTCTTCCAGCATACTTGGCTG
CTGCTTTTGCTAAGAAATTAAGTAGGCTGTCACTTGTTGTTCCTCCATCAGGATCACTTGTCATTGTAGCTCTTATTCACAATCTCTTACGAAGGCATCCCTCAATCAAC
TGTTTGGTTCACCGGGAAAATGTTGGCGAGAGTAAGAACGATGATTCAAAAGATGAATGGGTTGCTAGAGGCGCAGATGTTTCTGAAGTTGATGCTGACAACATGAAGCT
AGGCATTGACCATTTTAACTACGAGGAAACTGATCCAATTAAATCTAGTGCCTTGAGAAGTTCACTTTGGGAAATTGACAGTCTTCGACACCATTATTGTCCTCCCGTTT
CTAGGTTAGTTTTGTCGCTTGAGAATGATCTGACTGTGAGATCAAAAACAACTGAAATTGATGTTAAAGATTTTGTTTCTGGTTCATACGCCACGATACTCGGGCAAGAG
TTGAAAAAGAAATTGAAGCGAGTCCCTTTGGCATTCTACCAAGCAATCCCCACCACCTTGTTCTCGGACTCCGATTTCGCTGGTTGGAGTTTCGATCGCGAAAATAGTGA
GAAGAATACTGATAGTAGCAATCATCTTTCGACAAAAAGACAGCGTGTAGAAAGCTCATAA
mRNA sequenceShow/hide mRNA sequence
AAACGGGTCCACTGTTCCAGTTCAAAAAACCTAACCAGAGTTTTATGCTCTTGCAGTTTTGTTCCCGCCGAGAGCTTCATACCAGCTTCAATGGCGTCCATTCTCTCAAA
GAATCAGAACTCGAAGAAGAAGAGGAAGAAGAACAATGAGAATCACACGCTTTCAGACCTCAAAACCCTAGGCCTCCAACTTCTCTCCTCTCGAGCTCACATCAACAATC
TCCCTCTTCTTCTTACCTACGTTTCTCCCTCTTCTCCTCCTCACTATGTCCTCGAATCCCTCCTCTCCCTCCAGTCCTTCTTCATCACTGTCCTCCCCTCCCTCCCTTCC
TCCTCCAAGCCCACCGCCACTGCCTCCGACGACCCCCAGGACGACGCCGAGTTGATTTACCGGACCTGGCTCCGTTCCAAGTTTGATGAACTCGTCAAGTCTCTCATTGA
TGTAGCGGTTTCTTCTGAATGCGATGACACTCTCAAGGAGATTGTGTTGGATGCGATTATGGAGTTTGTTAAAGTTGGTAACAAGGGGAAATTTCACTCTGCTGTATATC
ACAGATTTTTGCAGAGTATCACTAATTCTTCTACGCCAGTTAATACTCTGATAGCCTTGCTTGTAAACAAGTACTTCAGTCACGTTGACGTCCGTTATTTTACGTATATT
AGCATCGACAAACTTGCCAAGACTTTTGAGGCTGAGTACATTTCAGGTGATAGAAGTGTGAGGATTAACGACAATGATGGTGGTCATTCAAGAGAAGGAGTGGAGTTCAT
TCACATTGTGCACTCTATCTTATCCTCCATTCCCCCTTTGGAAAACTCAAATCAATCTGACTACACTATGTGGATTGAATCAGGCGATGGCAAAGGGCTCTCTGACAATC
AAGAAGCAAAGCAGCTTAAAATGAGGGAAAAAGAGGTCTTATCAGCATCGAAGATTGTTAGAAGAATGAAACTAAAGTTTACAAAAGCATGGATTTCGTTTCTCAGGTTA
CCACTTCCAGTTGATGTGTACAAGGAGGTTCTTGTAATTCTTGATCAGGAAGTCATTCCTTATCTTTCTAATCCAATCATTTTATGTGACTTCTTAACAAAATCCTATAA
TATTGGCGGTGTTGTCAGTGTTATGGCTCTCAGCAGCCTCTTCCTTCTTATGACAAAATATGGTTTAGAGTATCCAAACTTCTATGAAAAACTTTATGCCCTATTGGTTC
CTTCAATATTCATGGCAAAACATCGGGCCAAGTTTTTTCAGCTTCTTGATTCTTGCTTGAAGTCACCACTTCTTCCAGCATACTTGGCTGCTGCTTTTGCTAAGAAATTA
AGTAGGCTGTCACTTGTTGTTCCTCCATCAGGATCACTTGTCATTGTAGCTCTTATTCACAATCTCTTACGAAGGCATCCCTCAATCAACTGTTTGGTTCACCGGGAAAA
TGTTGGCGAGAGTAAGAACGATGATTCAAAAGATGAATGGGTTGCTAGAGGCGCAGATGTTTCTGAAGTTGATGCTGACAACATGAAGCTAGGCATTGACCATTTTAACT
ACGAGGAAACTGATCCAATTAAATCTAGTGCCTTGAGAAGTTCACTTTGGGAAATTGACAGTCTTCGACACCATTATTGTCCTCCCGTTTCTAGGTTAGTTTTGTCGCTT
GAGAATGATCTGACTGTGAGATCAAAAACAACTGAAATTGATGTTAAAGATTTTGTTTCTGGTTCATACGCCACGATACTCGGGCAAGAGTTGAAAAAGAAATTGAAGCG
AGTCCCTTTGGCATTCTACCAAGCAATCCCCACCACCTTGTTCTCGGACTCCGATTTCGCTGGTTGGAGTTTCGATCGCGAAAATAGTGAGAAGAATACTGATAGTAGCA
ATCATCTTTCGACAAAAAGACAGCGTGTAGAAAGCTCATAATCCAGGTTTCTTTACCTCCTTCAAATTCGCCACCTTCCAATGGGAACATTGCGACGCCATTTCTTTCTT
CTAATGAACTGCATTTCAGGAAGCATCGAAATATGGTAGAACCACCAATTGTTTATCTTGGATTACCCATCTTTGTCACTGAAGATTATATTACCTCCTTGAAAGTAGTT
CATTCATTACTCAGAGGGGAAGGTGAATTTTTGCATCCTTTCCATGTAACAGCCCTAATTGAAAATCAGTTTGCACAATGGAGAGGTCATGTTTACTCTTTTTTTTTTTT
ACCCCACCAGTTTTGCTGTAATTTATTCAGGTAACTCATTGGCGTCTCTAATGCAAACTTAAAGATCTGCAATCTTATTGTGATTCAGGTGTGGTAGCCAGTGGCAGGCA
TTTTTGTAGTTTCAAATGCTATTTTATTGTTCAACTATATATGAATTTGAAGCTCTGTACTGTAAAGAAATTTAATGAATTCAGTTTCAGTCGGTCA
Protein sequenceShow/hide protein sequence
MASILSKNQNSKKKRKKNNENHTLSDLKTLGLQLLSSRAHINNLPLLLTYVSPSSPPHYVLESLLSLQSFFITVLPSLPSSSKPTATASDDPQDDAELIYRTWLRSKFDE
LVKSLIDVAVSSECDDTLKEIVLDAIMEFVKVGNKGKFHSAVYHRFLQSITNSSTPVNTLIALLVNKYFSHVDVRYFTYISIDKLAKTFEAEYISGDRSVRINDNDGGHS
REGVEFIHIVHSILSSIPPLENSNQSDYTMWIESGDGKGLSDNQEAKQLKMREKEVLSASKIVRRMKLKFTKAWISFLRLPLPVDVYKEVLVILDQEVIPYLSNPIILCD
FLTKSYNIGGVVSVMALSSLFLLMTKYGLEYPNFYEKLYALLVPSIFMAKHRAKFFQLLDSCLKSPLLPAYLAAAFAKKLSRLSLVVPPSGSLVIVALIHNLLRRHPSIN
CLVHRENVGESKNDDSKDEWVARGADVSEVDADNMKLGIDHFNYEETDPIKSSALRSSLWEIDSLRHHYCPPVSRLVLSLENDLTVRSKTTEIDVKDFVSGSYATILGQE
LKKKLKRVPLAFYQAIPTTLFSDSDFAGWSFDRENSEKNTDSSNHLSTKRQRVESS