CuGenDBv2

Lsi04G010420 (gene) of Bottle gourd (USVL1VR-Ls) v1 genome

Gene ID: Lsi04G010420
Organism: Lagenaria siceraria USVL1VR-Ls (Bottle gourd (USVL1VR-Ls) v1)
Description: Ribosome biogenesis protein NOP53
Genome location: chr04:12206603..12244354
RNA-Seq Expression: Lsi04G010420
Synteny: Lsi04G010420
Gene Ontology terms:
GO:0000027 - ribosomal large subunit assembly (biological process)
GO:0006364 - rRNA processing (biological process)
GO:0005654 - nucleoplasm (cellular component)
GO:0005730 - nucleolus (cellular component)
GO:0008097 - 5S rRNA binding (molecular function)
InterPro domains: IPR011687 - Ribosome biogenesis protein Nop53/GLTSCR2


Homology
GenBank top hits
KAB1214105.1 hypothetical protein CJ030_MR5G017368 [Morella rubra] (e value: 1.8e-142; % identity: 59.84)
Query:  IITAEIEDFFEKSTKDALSGGSLSAVPSDSLFVVDKSRDLSVKRKIEKKREK---------VLYCDSILTKNPFVQAKSKPSIIPAVEVEPPGCSFNPSH
        I T EIEDFFEKSTKDALSGG LS+ P+DSLFVVDKSRD+S+KRKIEK REK             D     N   + K KPS+I AVEVEPPGCSFNP  
Subjt:  IITAEIEDFFEKSTKDALSGGSLSAVPSDSLFVVDKSRDLSVKRKIEKKREK---------VLYCDSILTKNPFVQAKSKPSIIPAVEVEPPGCSFNPSH

Query:  ESHQDVLAQAVAQEMQKVYRNELGPAPVPLTVPGEVISEEDMLFLDADNNTDDETNLDEMDQDEDNELEKRPLKMRRVTRVEFNKRARHKEKVRKEAEVK
        ESHQDVLA AVA+EMQKVY+NELGP PVPLTVPGEV+ EE+M F++ADN +DD+   + + ++ED  L++R  K +RVTRV  NKRARHKE++RKE+E +
Subjt:  ESHQDVLAQAVAQEMQKVYRNELGPAPVPLTVPGEVISEEDMLFLDADNNTDDETNLDEMDQDEDNELEKRPLKMRRVTRVEFNKRARHKEKVRKEAEVK

Query:  KLEGISKEIDSLPDIIQEIAKEDEERENRRIRRTIAKQERLKSCPPRLGKHKFEPAPVQVLLSEEITGSFRKLKGCCTLVRDRYKSLEKRGIIAPTAKSR
        K+E +SKEIDSLPDIIQEIAK+DEE++ R +RR +AKQERLKSCPPRLG+HKFEPAPVQVLLSEEI+GS RKLKGCCTL +DRYKSLEKRG+I PTAK+R
Subjt:  KLEGISKEIDSLPDIIQEIAKEDEERENRRIRRTIAKQERLKSCPPRLGKHKFEPAPVQVLLSEEITGSFRKLKGCCTLVRDRYKSLEKRGIIAPTAKSR

Query:  RLQAQLPYKVKLFFFPIGSTGNRIRFVLLLNLRDVYGEFDEIIAMEFEHLCRREPVRTLQFRNFSAYDERGDNLLQQSVSLLQVKDPLFKRMGASRLARF
            +               G R   + L ++       D+I+ +                         GDNL+QQSVSLL VKDPLFKRMGASRLARF
Subjt:  RLQAQLPYKVKLFFFPIGSTGNRIRFVLLLNLRDVYGEFDEIIAMEFEHLCRREPVRTLQFRNFSAYDERGDNLLQQSVSLLQVKDPLFKRMGASRLARF

Query:  SIDDERRMKIVEIGGAQELLNMLGAAKDDRT--------------HEAVGALHKAGAILVIKSTPDSAEDMKVNEFKSDLMKRFSDLRYDVSS
        +IDDERRMKIVE+GGAQ+LLNML  AKDDRT               EAV ALH+AGA+ VI+STPDS+ D ++ ++KS L+KRF DLR+DV S
Subjt:  SIDDERRMKIVEIGGAQELLNMLGAAKDDRT--------------HEAVGALHKAGAILVIKSTPDSAEDMKVNEFKSDLMKRFSDLRYDVSS

KAF9676987.1 hypothetical protein SADUNF_Sadunf08G0060500 [Salix dunnii] (e value: 2.6e-141; % identity: 52.98)
Query:  IITAEIEDFFEKSTKDALSGGSLSAVPSDSLFVVDKSRDLSVKRKIEKKREKVLYCDSILTKNPFV-------------------QAK------------
        I T +I++FFEKSTKDAL+GGSL+ V +DSLF VDKS+DLSVKRKIEK REKVL CDS+L KNPFV                   +AK            
Subjt:  IITAEIEDFFEKSTKDALSGGSLSAVPSDSLFVVDKSRDLSVKRKIEKKREKVLYCDSILTKNPFV-------------------QAK------------

Query:  ----------------------SKPSIIPAVEVEPPGCSFNPSHESHQDVLAQAVAQEMQKVYRNELGPAPVPLTVPGEVISEEDMLFLDADN-NTDDET
                              +KPS+IPAVEVEPPGCSFNPS E+HQD LA+AVA EMQKVY+NELGP PVPLTV G+VI EEDM FLDADN N DD+T
Subjt:  ----------------------SKPSIIPAVEVEPPGCSFNPSHESHQDVLAQAVAQEMQKVYRNELGPAPVPLTVPGEVISEEDMLFLDADN-NTDDET

Query:  NLDEMDQDEDNELEKRPLKMRRVTRVEFNKRARHKEKVRKEAEVKKLEGISKEIDSLPDIIQEIAKEDEERENRRIRRTIAKQERLKSCPPRLGKHKFEP
        N + +++DED+  ++R ++ +RVTRVE NKRAR KE+ +KEAEVKK + +SK IDSLPDIIQEIAKEDEE+  R IRR +AKQERLK+ PPRLG+HKFEP
Subjt:  NLDEMDQDEDNELEKRPLKMRRVTRVEFNKRARHKEKVRKEAEVKKLEGISKEIDSLPDIIQEIAKEDEERENRRIRRTIAKQERLKSCPPRLGKHKFEP

Query:  APVQVLLSEEITGSFRKLKGCCTLVRDRYKSLEKRGIIAPT--AKSRRLQAQLPYKVKLFFFPIGSTGNRIRFVLLLNLRDVYGEFDE--------IIAM
        AP+QV LSEE+TGS RK+KGCCTLV+DR+KSLEKRG++ PT   K++RL A            + ++    R   L  L D YGE +         +   
Subjt:  APVQVLLSEEITGSFRKLKGCCTLVRDRYKSLEKRGIIAPT--AKSRRLQAQLPYKVKLFFFPIGSTGNRIRFVLLLNLRDVYGEFDE--------IIAM

Query:  EFEHLCRREP---------------------------------------VRTLQFRNFSAYDERGDNLLQQSVSLLQVKDPLFKRMGASRLARFSIDDER
        E E+LCR  P                                       V T  F  +  +   GDN+L+QSVSLL VKDP FKR GASRLAR+++DDER
Subjt:  EFEHLCRREP---------------------------------------VRTLQFRNFSAYDERGDNLLQQSVSLLQVKDPLFKRMGASRLARFSIDDER

Query:  RMKIVEIGGAQELLNMLGAAKDDRTH--------------EAVGALHKAGAILVIKSTPDSAEDMKVNEFKSDLMKRFSDLRYDVSS
        RMKIVEIGGAQELL ML AAKDDRT               EAVGALH AGAI VIKS PDS+E+ ++ +FK  L+KRF DL+Y+ SS
Subjt:  RMKIVEIGGAQELLNMLGAAKDDRTH--------------EAVGALHKAGAILVIKSTPDSAEDMKVNEFKSDLMKRFSDLRYDVSS

RVW42452.1 Ribosome biogenesis protein NOP53 [Vitis vinifera] (e value: 7.1e-147; % identity: 55.61)
Query:  TAEIEDFFEKSTKDALSGGSLSAVPSDSLFVVDKSRDLSVKRKIEKKREKVLYCDSILTKNPFVQAK---------------------------------
        T +IEDFFEKSTKDALSGGSL+AVPSDSLF VDKS DLSVKRKIEK REKVL  DS+L +N FVQ                                   
Subjt:  TAEIEDFFEKSTKDALSGGSLSAVPSDSLFVVDKSRDLSVKRKIEKKREKVLYCDSILTKNPFVQAK---------------------------------

Query:  --------------------SKPSIIPAVEVEPPGCSFNPSHESHQDVLAQAVAQEMQKVYRNELGPAPVPLTVPGEVISEEDMLFLDADNNTDDETNLD
                            SKPS+IPAVEVE PGCSFNPS ESHQD LA AVA EMQKVY+NELGP PVPLTV GE + EEDM F++AD+ +DD+ N  
Subjt:  --------------------SKPSIIPAVEVEPPGCSFNPSHESHQDVLAQAVAQEMQKVYRNELGPAPVPLTVPGEVISEEDMLFLDADNNTDDETNLD

Query:  EMDQDEDNELEKRPLKMRRVTRVEFNKRARHKEKVRKEAEVKKLEGISKEIDSLPDIIQEIAKEDEERENRRIRRTIAKQERLKSCPPRLGKHKFEPAPV
        E  ++ED   EKR  K++RVTRVE N+RAR K+ +R EAE K++E +SKEID LPDIIQEIAKEDEE+  R  RR +AKQ+RLKS PPRLGKHKFEPAPV
Subjt:  EMDQDEDNELEKRPLKMRRVTRVEFNKRARHKEKVRKEAEVKKLEGISKEIDSLPDIIQEIAKEDEERENRRIRRTIAKQERLKSCPPRLGKHKFEPAPV

Query:  QVLLSEEITGSFRKLKGCCTLVRDRYKSLEKRGIIAPTAKSRRLQAQLPYKVKLFFFPIGSTGNRIRFVLLLNLRDVYGEFDEIIAMEFEHLCRREP--V
        QVLLSEEITGS RKLKGC TL RDRYKSL+KRG++ PTAK  R+   + +K         S+ + I FV       +       IA +  +LCRR+P  V
Subjt:  QVLLSEEITGSFRKLKGCCTLVRDRYKSLEKRGIIAPTAKSRRLQAQLPYKVKLFFFPIGSTGNRIRFVLLLNLRDVYGEFDEIIAMEFEHLCRREP--V

Query:  RTLQFRNFSAYDER---------------------------------GDNLLQQSVSLLQVKDPLFKRMGASRLARFSIDDERRMKIVEIGGAQELLNML
        R+  F +++  DER                                 GDNLLQQSVSLL VKDPLFKRMGASRLARF+IDDERRMKIVE+GGAQEL+NML
Subjt:  RTLQFRNFSAYDER---------------------------------GDNLLQQSVSLLQVKDPLFKRMGASRLARFSIDDERRMKIVEIGGAQELLNML

Query:  GAAKDDRT--------------HEAVGALHKAGAILVIKSTPDSAEDMKVNEFKSDLMKRFSDLRYDVSS
        GAAKDDRT               EAVGALH AGAI V+KSTPDS E  ++ ++K +L+KRF DLRYD+ S
Subjt:  GAAKDDRT--------------HEAVGALHKAGAILVIKSTPDSAEDMKVNEFKSDLMKRFSDLRYDVSS

XP_004146478.1 ribosome biogenesis protein NOP53 [Cucumis sativus] (e value: 1.1e-139; % identity: 81.47)
Query:  IITAEIEDFFEKSTKDALSGGSLSAVPSDSLFVVDKSRDLSVKRKIEKKREKVLYCDSILTKNPFVQA--------------------------------
        I TAEIEDFFEKSTKDALSGGSLSA+PSDSLFVVDKS+DLSVKRKIEKKR+KVLYCDS+LTKNPFVQA                                
Subjt:  IITAEIEDFFEKSTKDALSGGSLSAVPSDSLFVVDKSRDLSVKRKIEKKREKVLYCDSILTKNPFVQA--------------------------------

Query:  ----------------KSKPSIIPAVEVEPPGCSFNPSHESHQDVLAQAVAQEMQKVYRNELGPAPVPLTVPGEVISEEDMLFLDADNNTDDETNLDEMD
                        KSKPSIIPAVEVEPPGCSFNPSHESHQDVLAQAVAQEMQKVYR EL PAPVPLTVPGEVISEEDMLFLDAD NTDDETNLDEMD
Subjt:  ----------------KSKPSIIPAVEVEPPGCSFNPSHESHQDVLAQAVAQEMQKVYRNELGPAPVPLTVPGEVISEEDMLFLDADNNTDDETNLDEMD

Query:  QDEDNELEKRPLKMRRVTRVEFNKRARHKEKVRKEAEVKKLEGISKEIDSLPDIIQEIAKEDEERENRRIRRTIAKQERLKSCPPRLGKHKFEPAPVQVL
        QDEDNELEKRPLKMRRVTRVE NKRARHKEKVRKEAE KK+EG+SKEIDSLPDIIQEIAKEDEER+NRRIRRTIAKQE+LKSCPPRLGKHKFEPAPVQVL
Subjt:  QDEDNELEKRPLKMRRVTRVEFNKRARHKEKVRKEAEVKKLEGISKEIDSLPDIIQEIAKEDEERENRRIRRTIAKQERLKSCPPRLGKHKFEPAPVQVL

Query:  LSEEITGSFRKLKGCCTLVRDRYKSLEKRGIIAPTAKSRR
        LSEEITGS RKLKGCCTLVRDRYKSLEKRGIIAPTAKSRR
Subjt:  LSEEITGSFRKLKGCCTLVRDRYKSLEKRGIIAPTAKSRR

XP_038878460.1 ribosome biogenesis protein NOP53 [Benincasa hispida] (e value: 3.4e-141; % identity: 82.94)
Query:  IITAEIEDFFEKSTKDALSGGSLSAVPSDSLFVVDKSRDLSVKRKIEKKREKVLYCDSILTKNPFVQA--------------------------------
        I TAEIEDFFEKSTKDALSGGSLSA+PSDSLFVVDKSRDLSVKRKIEKKREKVLYCDSILTKNPFVQA                                
Subjt:  IITAEIEDFFEKSTKDALSGGSLSAVPSDSLFVVDKSRDLSVKRKIEKKREKVLYCDSILTKNPFVQA--------------------------------

Query:  ----------------KSKPSIIPAVEVEPPGCSFNPSHESHQDVLAQAVAQEMQKVYRNELGPAPVPLTVPGEVISEEDMLFLDADNNTDDETNLDEMD
                        KSKPSIIPAVEVE PGCSFNPSHESHQDVLAQAVAQEMQKVYRNELGPAPVPLTVPGEVI EEDMLFLD DNNTDDETNLDEMD
Subjt:  ----------------KSKPSIIPAVEVEPPGCSFNPSHESHQDVLAQAVAQEMQKVYRNELGPAPVPLTVPGEVISEEDMLFLDADNNTDDETNLDEMD

Query:  QDEDNELEKRPLKMRRVTRVEFNKRARHKEKVRKEAEVKKLEGISKEIDSLPDIIQEIAKEDEERENRRIRRTIAKQERLKSCPPRLGKHKFEPAPVQVL
        QDED+ELEKRPLKMRRVTRVE NKRARHKEKVRKEAE KKLEGISKEIDSLPDIIQEIAKEDEERENRRIRRTIAKQERL+SCPPRLGKHKFEPAPVQVL
Subjt:  QDEDNELEKRPLKMRRVTRVEFNKRARHKEKVRKEAEVKKLEGISKEIDSLPDIIQEIAKEDEERENRRIRRTIAKQERLKSCPPRLGKHKFEPAPVQVL

Query:  LSEEITGSFRKLKGCCTLVRDRYKSLEKRGIIAPTAKSRR
        LSEEITGS RKLKGCCTLVRDRYKSLEKRGIIAPTAKSRR
Subjt:  LSEEITGSFRKLKGCCTLVRDRYKSLEKRGIIAPTAKSRR

TrEMBL top hits
A0A0A0KP89 Ribosome biogenesis protein NOP53 (e value: 5.3e-140; % identity: 81.47)
Query:  IITAEIEDFFEKSTKDALSGGSLSAVPSDSLFVVDKSRDLSVKRKIEKKREKVLYCDSILTKNPFVQA--------------------------------
        I TAEIEDFFEKSTKDALSGGSLSA+PSDSLFVVDKS+DLSVKRKIEKKR+KVLYCDS+LTKNPFVQA                                
Subjt:  IITAEIEDFFEKSTKDALSGGSLSAVPSDSLFVVDKSRDLSVKRKIEKKREKVLYCDSILTKNPFVQA--------------------------------

Query:  ----------------KSKPSIIPAVEVEPPGCSFNPSHESHQDVLAQAVAQEMQKVYRNELGPAPVPLTVPGEVISEEDMLFLDADNNTDDETNLDEMD
                        KSKPSIIPAVEVEPPGCSFNPSHESHQDVLAQAVAQEMQKVYR EL PAPVPLTVPGEVISEEDMLFLDAD NTDDETNLDEMD
Subjt:  ----------------KSKPSIIPAVEVEPPGCSFNPSHESHQDVLAQAVAQEMQKVYRNELGPAPVPLTVPGEVISEEDMLFLDADNNTDDETNLDEMD

Query:  QDEDNELEKRPLKMRRVTRVEFNKRARHKEKVRKEAEVKKLEGISKEIDSLPDIIQEIAKEDEERENRRIRRTIAKQERLKSCPPRLGKHKFEPAPVQVL
        QDEDNELEKRPLKMRRVTRVE NKRARHKEKVRKEAE KK+EG+SKEIDSLPDIIQEIAKEDEER+NRRIRRTIAKQE+LKSCPPRLGKHKFEPAPVQVL
Subjt:  QDEDNELEKRPLKMRRVTRVEFNKRARHKEKVRKEAEVKKLEGISKEIDSLPDIIQEIAKEDEERENRRIRRTIAKQERLKSCPPRLGKHKFEPAPVQVL

Query:  LSEEITGSFRKLKGCCTLVRDRYKSLEKRGIIAPTAKSRR
        LSEEITGS RKLKGCCTLVRDRYKSLEKRGIIAPTAKSRR
Subjt:  LSEEITGSFRKLKGCCTLVRDRYKSLEKRGIIAPTAKSRR

A0A1S3C5J4 Ribosome biogenesis protein NOP53 (e value: 6.9e-140; % identity: 82.35)
Query:  IITAEIEDFFEKSTKDALSGGSLSAVPSDSLFVVDKSRDLSVKRKIEKKREKVLYCDSILTKNPFVQAK-------------------------------
        I TAEIEDFFEKSTKDALSGGSLSA+PSDSLFVVDKSRDLSVKRKIEKKR++VLYCDSILTKNPFVQA                                
Subjt:  IITAEIEDFFEKSTKDALSGGSLSAVPSDSLFVVDKSRDLSVKRKIEKKREKVLYCDSILTKNPFVQAK-------------------------------

Query:  -----------------SKPSIIPAVEVEPPGCSFNPSHESHQDVLAQAVAQEMQKVYRNELGPAPVPLTVPGEVISEEDMLFLDADNNTDDETNLDEMD
                         SKPSIIPAVEVEPPGCSFNPSHESHQDVLAQAVAQEMQKVYRNELGPAPVPLTVPGEVISEEDMLFLDAD NTD ETNLDEMD
Subjt:  -----------------SKPSIIPAVEVEPPGCSFNPSHESHQDVLAQAVAQEMQKVYRNELGPAPVPLTVPGEVISEEDMLFLDADNNTDDETNLDEMD

Query:  QDEDNELEKRPLKMRRVTRVEFNKRARHKEKVRKEAEVKKLEGISKEIDSLPDIIQEIAKEDEERENRRIRRTIAKQERLKSCPPRLGKHKFEPAPVQVL
        QDEDNELEKRPLKMRRVTRVE NKRAR KEKVRKEAE KKLEGISKEIDSLPDIIQEIAKEDEERENRRIRRTIAKQERLKSCPPRLGKHKFEPAPVQVL
Subjt:  QDEDNELEKRPLKMRRVTRVEFNKRARHKEKVRKEAEVKKLEGISKEIDSLPDIIQEIAKEDEERENRRIRRTIAKQERLKSCPPRLGKHKFEPAPVQVL

Query:  LSEEITGSFRKLKGCCTLVRDRYKSLEKRGIIAPTAKSRR
        LSEEITGS RKLKGCCTLVRDRYKSLEKRGIIAPTAK+RR
Subjt:  LSEEITGSFRKLKGCCTLVRDRYKSLEKRGIIAPTAKSRR

A0A438E461 Ribosome biogenesis protein NOP53 (e value: 3.4e-147; % identity: 55.61)
Query:  TAEIEDFFEKSTKDALSGGSLSAVPSDSLFVVDKSRDLSVKRKIEKKREKVLYCDSILTKNPFVQAK---------------------------------
        T +IEDFFEKSTKDALSGGSL+AVPSDSLF VDKS DLSVKRKIEK REKVL  DS+L +N FVQ                                   
Subjt:  TAEIEDFFEKSTKDALSGGSLSAVPSDSLFVVDKSRDLSVKRKIEKKREKVLYCDSILTKNPFVQAK---------------------------------

Query:  --------------------SKPSIIPAVEVEPPGCSFNPSHESHQDVLAQAVAQEMQKVYRNELGPAPVPLTVPGEVISEEDMLFLDADNNTDDETNLD
                            SKPS+IPAVEVE PGCSFNPS ESHQD LA AVA EMQKVY+NELGP PVPLTV GE + EEDM F++AD+ +DD+ N  
Subjt:  --------------------SKPSIIPAVEVEPPGCSFNPSHESHQDVLAQAVAQEMQKVYRNELGPAPVPLTVPGEVISEEDMLFLDADNNTDDETNLD

Query:  EMDQDEDNELEKRPLKMRRVTRVEFNKRARHKEKVRKEAEVKKLEGISKEIDSLPDIIQEIAKEDEERENRRIRRTIAKQERLKSCPPRLGKHKFEPAPV
        E  ++ED   EKR  K++RVTRVE N+RAR K+ +R EAE K++E +SKEID LPDIIQEIAKEDEE+  R  RR +AKQ+RLKS PPRLGKHKFEPAPV
Subjt:  EMDQDEDNELEKRPLKMRRVTRVEFNKRARHKEKVRKEAEVKKLEGISKEIDSLPDIIQEIAKEDEERENRRIRRTIAKQERLKSCPPRLGKHKFEPAPV

Query:  QVLLSEEITGSFRKLKGCCTLVRDRYKSLEKRGIIAPTAKSRRLQAQLPYKVKLFFFPIGSTGNRIRFVLLLNLRDVYGEFDEIIAMEFEHLCRREP--V
        QVLLSEEITGS RKLKGC TL RDRYKSL+KRG++ PTAK  R+   + +K         S+ + I FV       +       IA +  +LCRR+P  V
Subjt:  QVLLSEEITGSFRKLKGCCTLVRDRYKSLEKRGIIAPTAKSRRLQAQLPYKVKLFFFPIGSTGNRIRFVLLLNLRDVYGEFDEIIAMEFEHLCRREP--V

Query:  RTLQFRNFSAYDER---------------------------------GDNLLQQSVSLLQVKDPLFKRMGASRLARFSIDDERRMKIVEIGGAQELLNML
        R+  F +++  DER                                 GDNLLQQSVSLL VKDPLFKRMGASRLARF+IDDERRMKIVE+GGAQEL+NML
Subjt:  RTLQFRNFSAYDER---------------------------------GDNLLQQSVSLLQVKDPLFKRMGASRLARFSIDDERRMKIVEIGGAQELLNML

Query:  GAAKDDRT--------------HEAVGALHKAGAILVIKSTPDSAEDMKVNEFKSDLMKRFSDLRYDVSS
        GAAKDDRT               EAVGALH AGAI V+KSTPDS E  ++ ++K +L+KRF DLRYD+ S
Subjt:  GAAKDDRT--------------HEAVGALHKAGAILVIKSTPDSAEDMKVNEFKSDLMKRFSDLRYDVSS

A0A5D3DQ80 Ribosome biogenesis protein NOP53 (e value: 6.9e-140; % identity: 82.35)
Query:  IITAEIEDFFEKSTKDALSGGSLSAVPSDSLFVVDKSRDLSVKRKIEKKREKVLYCDSILTKNPFVQAK-------------------------------
        I TAEIEDFFEKSTKDALSGGSLSA+PSDSLFVVDKSRDLSVKRKIEKKR++VLYCDSILTKNPFVQA                                
Subjt:  IITAEIEDFFEKSTKDALSGGSLSAVPSDSLFVVDKSRDLSVKRKIEKKREKVLYCDSILTKNPFVQAK-------------------------------

Query:  -----------------SKPSIIPAVEVEPPGCSFNPSHESHQDVLAQAVAQEMQKVYRNELGPAPVPLTVPGEVISEEDMLFLDADNNTDDETNLDEMD
                         SKPSIIPAVEVEPPGCSFNPSHESHQDVLAQAVAQEMQKVYRNELGPAPVPLTVPGEVISEEDMLFLDAD NTD ETNLDEMD
Subjt:  -----------------SKPSIIPAVEVEPPGCSFNPSHESHQDVLAQAVAQEMQKVYRNELGPAPVPLTVPGEVISEEDMLFLDADNNTDDETNLDEMD

Query:  QDEDNELEKRPLKMRRVTRVEFNKRARHKEKVRKEAEVKKLEGISKEIDSLPDIIQEIAKEDEERENRRIRRTIAKQERLKSCPPRLGKHKFEPAPVQVL
        QDEDNELEKRPLKMRRVTRVE NKRAR KEKVRKEAE KKLEGISKEIDSLPDIIQEIAKEDEERENRRIRRTIAKQERLKSCPPRLGKHKFEPAPVQVL
Subjt:  QDEDNELEKRPLKMRRVTRVEFNKRARHKEKVRKEAEVKKLEGISKEIDSLPDIIQEIAKEDEERENRRIRRTIAKQERLKSCPPRLGKHKFEPAPVQVL

Query:  LSEEITGSFRKLKGCCTLVRDRYKSLEKRGIIAPTAKSRR
        LSEEITGS RKLKGCCTLVRDRYKSLEKRGIIAPTAK+RR
Subjt:  LSEEITGSFRKLKGCCTLVRDRYKSLEKRGIIAPTAKSRR

A0A6A1VN69 Ribosome biogenesis protein NOP53 (e value: 8.7e-143; % identity: 59.84)
Query:  IITAEIEDFFEKSTKDALSGGSLSAVPSDSLFVVDKSRDLSVKRKIEKKREK---------VLYCDSILTKNPFVQAKSKPSIIPAVEVEPPGCSFNPSH
        I T EIEDFFEKSTKDALSGG LS+ P+DSLFVVDKSRD+S+KRKIEK REK             D     N   + K KPS+I AVEVEPPGCSFNP  
Subjt:  IITAEIEDFFEKSTKDALSGGSLSAVPSDSLFVVDKSRDLSVKRKIEKKREK---------VLYCDSILTKNPFVQAKSKPSIIPAVEVEPPGCSFNPSH

Query:  ESHQDVLAQAVAQEMQKVYRNELGPAPVPLTVPGEVISEEDMLFLDADNNTDDETNLDEMDQDEDNELEKRPLKMRRVTRVEFNKRARHKEKVRKEAEVK
        ESHQDVLA AVA+EMQKVY+NELGP PVPLTVPGEV+ EE+M F++ADN +DD+   + + ++ED  L++R  K +RVTRV  NKRARHKE++RKE+E +
Subjt:  ESHQDVLAQAVAQEMQKVYRNELGPAPVPLTVPGEVISEEDMLFLDADNNTDDETNLDEMDQDEDNELEKRPLKMRRVTRVEFNKRARHKEKVRKEAEVK

Query:  KLEGISKEIDSLPDIIQEIAKEDEERENRRIRRTIAKQERLKSCPPRLGKHKFEPAPVQVLLSEEITGSFRKLKGCCTLVRDRYKSLEKRGIIAPTAKSR
        K+E +SKEIDSLPDIIQEIAK+DEE++ R +RR +AKQERLKSCPPRLG+HKFEPAPVQVLLSEEI+GS RKLKGCCTL +DRYKSLEKRG+I PTAK+R
Subjt:  KLEGISKEIDSLPDIIQEIAKEDEERENRRIRRTIAKQERLKSCPPRLGKHKFEPAPVQVLLSEEITGSFRKLKGCCTLVRDRYKSLEKRGIIAPTAKSR

Query:  RLQAQLPYKVKLFFFPIGSTGNRIRFVLLLNLRDVYGEFDEIIAMEFEHLCRREPVRTLQFRNFSAYDERGDNLLQQSVSLLQVKDPLFKRMGASRLARF
            +               G R   + L ++       D+I+ +                         GDNL+QQSVSLL VKDPLFKRMGASRLARF
Subjt:  RLQAQLPYKVKLFFFPIGSTGNRIRFVLLLNLRDVYGEFDEIIAMEFEHLCRREPVRTLQFRNFSAYDERGDNLLQQSVSLLQVKDPLFKRMGASRLARF

Query:  SIDDERRMKIVEIGGAQELLNMLGAAKDDRT--------------HEAVGALHKAGAILVIKSTPDSAEDMKVNEFKSDLMKRFSDLRYDVSS
        +IDDERRMKIVE+GGAQ+LLNML  AKDDRT               EAV ALH+AGA+ VI+STPDS+ D ++ ++KS L+KRF DLR+DV S
Subjt:  SIDDERRMKIVEIGGAQELLNMLGAAKDDRT--------------HEAVGALHKAGAILVIKSTPDSAEDMKVNEFKSDLMKRFSDLRYDVSS

SwissProt top hits
O22892 Ribosome biogenesis protein NOP53 (e value: 2.1e-77; % identity: 45.86)
Query:  IITAEIEDFFEKSTKDALSGGSLSAVPSDSLFVVDKSRDLSVKRKIEKKREKVLYCDSILTKNPFVQ---------AKSK--------------------
        I + +IEDFFEK+T+DALSGG+LSA PS+ LF VDKS DL VKRKIEK RE+VL  DSIL KNPFVQ          KSK                    
Subjt:  IITAEIEDFFEKSTKDALSGGSLSAVPSDSLFVVDKSRDLSVKRKIEKKREKVLYCDSILTKNPFVQ---------AKSK--------------------

Query:  -------------------------PSIIPAVEVEPPGCSFNPSHESHQDVLAQAVAQEMQKVYRNELGPAPVPLTVPGEVISEEDMLFLDADNNTDDE-
                                 PSII AVE+E PGCS+NP+ ESHQD+LA+AVAQEMQKVY+ ELGPAPVPLT+ G+ +SE++  FLD DN ++ E 
Subjt:  -------------------------PSIIPAVEVEPPGCSFNPSHESHQDVLAQAVAQEMQKVYRNELGPAPVPLTVPGEVISEEDMLFLDADNNTDDE-

Query:  ----------------------------TNL--------------------------DEMDQD-------------EDNELEKRPL---------KMRRV
                                    TNL                          D++ +D             EDN+  K  +         K +RV
Subjt:  ----------------------------TNL--------------------------DEMDQD-------------EDNELEKRPL---------KMRRV

Query:  TRVEFNKRARHKEKVRKEAEVKKLEGISKEIDSLPDIIQEIAKEDEERENRRIRRTIAKQERLKSCPPRLGKHKFEPAPVQVLLSEEITGSFRKLKGCCT
        TRVE NKR R K   +KE + K  E I  EIDSLP+I++EIAKEDE+++N+ +RR IAKQE LK  PPRLGK+KFE  PVQVLL+EE+TGS RKLK CCT
Subjt:  TRVEFNKRARHKEKVRKEAEVKKLEGISKEIDSLPDIIQEIAKEDEERENRRIRRTIAKQERLKSCPPRLGKHKFEPAPVQVLLSEEITGSFRKLKGCCT

Query:  LVRDRYKSLEKRGIIAPTAKSRR
        L RDR+KSLEKRGI+ P+ + RR
Subjt:  LVRDRYKSLEKRGIIAPTAKSRR

Q8BK35 Ribosome biogenesis protein NOP53 (e value: 4.0e-12; % identity: 29.96)
Query:  KPSIIPAVEVEPPGCSFNPSHESHQDVLAQAVAQEMQKVYRNELGPAPVPLTVPGEVISEEDM-------LFLDADNNTDDETNLDEMDQDEDNELEKRP
        KPS +PAVEV P G S+NP+ E HQ +L +A   E+Q+    E     + L    +  ++E +       L  ++D   + E       +  D   E  P
Subjt:  KPSIIPAVEVEPPGCSFNPSHESHQDVLAQAVAQEMQKVYRNELGPAPVPLTVPGEVISEEDM-------LFLDADNNTDDETNLDEMDQDEDNELEKRP

Query:  L-------KMRRVTRVEFNK-RARHKEKVRKEA---------EVKKLEGISKEIDSLPDIIQEIAKEDEERENRRIRRTIAKQERLKSCPPRLGKHKFEP
                +M + T  +  + +A  K +V++ A         E+ +L GI  +      + + +A+    RE RRIRR +A+ ++    P RLG+ K++ 
Subjt:  L-------KMRRVTRVEFNK-RARHKEKVRKEA---------EVKKLEGISKEIDSLPDIIQEIAKEDEERENRRIRRTIAKQERLKSCPPRLGKHKFEP

Query:  APVQVLLSEEITGSFRKLKGCCTLVRDRYKSLEKRGIIAPTAKSRRLQAQLPYKVKL
          + V LS E++GS R LK    ++RDR+KS +KR +I P     R + +  YKVKL
Subjt:  APVQVLLSEEITGSFRKLKGCCTLVRDRYKSLEKRGIIAPTAKSRRLQAQLPYKVKL

Q9NEU5 Ribosome biogenesis protein NOP53 (e value: 4.9e-10; % identity: 28.93)
Query:  KRKIEKKREKVLYCDSILTKNPFVQAKSKPSIIPAVEVEPPGCSFNPSHESHQDVLAQAVAQEMQKVYRNELGPAPVPLTVPGEVISEEDMLFLD-----
        K K+E +     +   +  K P    KS  S++PAV++   G S+NP    +Q+ +A+ +A E QK+  +E       +    E ++ E   FL+     
Subjt:  KRKIEKKREKVLYCDSILTKNPFVQAKSKPSIIPAVEVEPPGCSFNPSHESHQDVLAQAVAQEMQKVYRNELGPAPVPLTVPGEVISEEDMLFLD-----

Query:  -------ADNNTDDETNLDEMDQDEDNELEKRPLKMR--RVTRVEFNKRARHKEKVRKEAEVKKLEGISKEIDSLPDI-IQEIAKE-DEERENRRIRRTI
                D+  ++E    E       E E +  ++   R+T+ +  K+A+  +K+ KE E ++LE  +KE DS      +++ KE DEE + R     +
Subjt:  -------ADNNTDDETNLDEMDQDEDNELEKRPLKMR--RVTRVEFNKRARHKEKVRKEAEVKKLEGISKEIDSLPDI-IQEIAKE-DEERENRRIRRTI

Query:  AKQERL---KSCPPRLGKHKFEPAPVQVLLSEEITGSFRKLKGCCTLVRDRYKSLEKRGI--IAPTAKSRRLQAQLPYKV
         K+E+L    +   +LGK KF  A    LL EE+TG+ R+LK    ++ DR KSL++R +  I    + RR++ +L  KV
Subjt:  AKQERL---KSCPPRLGKHKFEPAPVQVLLSEEITGSFRKLKGCCTLVRDRYKSLEKRGI--IAPTAKSRRLQAQLPYKV

Q9NZM5 Ribosome biogenesis protein NOP53 (e value: 5.3e-12; % identity: 30.99)
Query:  SKPSIIPAVEVEPPGCSFNPSHESHQDVLAQAVAQEMQKVYRNELGPAPVPLTVPGEVISEEDM---LFLDADNNTDDETNLDEMDQDEDNELEKRPLKM
        +KPS  PAVEV P G S+NPS E HQ +L+ A   E+Q+    E     + L    +  ++E     L       +D E    + +  E  + E  P   
Subjt:  SKPSIIPAVEVEPPGCSFNPSHESHQDVLAQAVAQEMQKVYRNELGPAPVPLTVPGEVISEEDM---LFLDADNNTDDETNLDEMDQDEDNELEKRPLKM

Query:  RRVTRVEFNKRARHKEKV-----RKEAEVKKLEGISKEIDSLPDIIQEIAKEDEERENRRIRRTIAKQERLKSCPPRLGKHKFEPAPVQVLLSEEITGSF
        R  T  +  ++ R +EK       ++A ++      +E+  L  I  ++A    E   RR RR  A++E     P RLG+ K++   + V LS E+T S 
Subjt:  RRVTRVEFNKRARHKEKV-----RKEAEVKKLEGISKEIDSLPDIIQEIAKEDEERENRRIRRTIAKQERLKSCPPRLGKHKFEPAPVQVLLSEEITGSF

Query:  RKLKGCCTLVRDRYKSLEKRGIIAPTAKSRRLQAQLPYKVKL
        R LK    ++RDR+KS ++R +I P     R + +  YKVKL
Subjt:  RKLKGCCTLVRDRYKSLEKRGIIAPTAKSRRLQAQLPYKVKL

Q9W3C2 Ribosome biogenesis protein NOP53 (e value: 1.9e-09; % identity: 27.5)
Query:  EVEPPGCSFNPSHESHQDVLAQAVAQEMQKVYRNELGPAPVPLTVPGEVISEE-DMLFLDADNNTDDETNLDEMDQDEDNELEKR-------------PL
        E+  PG S+NP+ E HQ ++ Q V +E + + + E     V  ++  +V  EE D   L+  +   DE    E+D+D     +K+             P+
Subjt:  EVEPPGCSFNPSHESHQDVLAQAVAQEMQKVYRNELGPAPVPLTVPGEVISEE-DMLFLDADNNTDDETNLDEMDQDEDNELEKR-------------PL

Query:  KMRRVTRVEFNKRARHKEKVRKEAEVKKLEGISKEIDSLPDIIQEIAKEDEERENRRIRRTIAKQERLKSCPPRLGKHKFEPAPVQVLLSEEITGSFRKL
        + ++ ++       + KE  R+    +KL+  + ++  +  I  E+  E+E+  + + RR   + E+ K  P RLG+HKFE   + V L E+I G+ R +
Subjt:  KMRRVTRVEFNKRARHKEKVRKEAEVKKLEGISKEIDSLPDIIQEIAKEDEERENRRIRRTIAKQERLKSCPPRLGKHKFEPAPVQVLLSEEITGSFRKL

Query:  KGCCTLVRDRYKSLEKRGIIAPTAKSRRLQAQLPYKVKLF
        K   +L++DR+ +L++  ++ PT K   L ++   KVK F
Subjt:  KGCCTLVRDRYKSLEKRGIIAPTAKSRRLQAQLPYKVKLF

Arabidopsis top hits
AT2G40430.1 CONTAINS InterPro DOMAIN/s: P60-like (InterPro:IPR011687), Tumour suppressor protein Gltscr2 (InterPro:IPR011211); Has 709 Blast hits to 643 proteins in 201 species: Archae - 0; Bacteria - 32; Metazoa - 224; Fungi - 154; Plants - 45; Viruses - 0; Other Eukaryotes - 254 (source: NCBI BLink). (e value: 1.5e-78; % identity: 45.86)
Query:  IITAEIEDFFEKSTKDALSGGSLSAVPSDSLFVVDKSRDLSVKRKIEKKREKVLYCDSILTKNPFVQ---------AKSK--------------------
        I + +IEDFFEK+T+DALSGG+LSA PS+ LF VDKS DL VKRKIEK RE+VL  DSIL KNPFVQ          KSK                    
Subjt:  IITAEIEDFFEKSTKDALSGGSLSAVPSDSLFVVDKSRDLSVKRKIEKKREKVLYCDSILTKNPFVQ---------AKSK--------------------

Query:  -------------------------PSIIPAVEVEPPGCSFNPSHESHQDVLAQAVAQEMQKVYRNELGPAPVPLTVPGEVISEEDMLFLDADNNTDDE-
                                 PSII AVE+E PGCS+NP+ ESHQD+LA+AVAQEMQKVY+ ELGPAPVPLT+ G+ +SE++  FLD DN ++ E 
Subjt:  -------------------------PSIIPAVEVEPPGCSFNPSHESHQDVLAQAVAQEMQKVYRNELGPAPVPLTVPGEVISEEDMLFLDADNNTDDE-

Query:  ----------------------------TNL--------------------------DEMDQD-------------EDNELEKRPL---------KMRRV
                                    TNL                          D++ +D             EDN+  K  +         K +RV
Subjt:  ----------------------------TNL--------------------------DEMDQD-------------EDNELEKRPL---------KMRRV

Query:  TRVEFNKRARHKEKVRKEAEVKKLEGISKEIDSLPDIIQEIAKEDEERENRRIRRTIAKQERLKSCPPRLGKHKFEPAPVQVLLSEEITGSFRKLKGCCT
        TRVE NKR R K   +KE + K  E I  EIDSLP+I++EIAKEDE+++N+ +RR IAKQE LK  PPRLGK+KFE  PVQVLL+EE+TGS RKLK CCT
Subjt:  TRVEFNKRARHKEKVRKEAEVKKLEGISKEIDSLPDIIQEIAKEDEERENRRIRRTIAKQERLKSCPPRLGKHKFEPAPVQVLLSEEITGSFRKLKGCCT

Query:  LVRDRYKSLEKRGIIAPTAKSRR
        L RDR+KSLEKRGI+ P+ + RR
Subjt:  LVRDRYKSLEKRGIIAPTAKSRR

AT2G40430.2 FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: cellular_component unknown; EXPRESSED IN: 22 plant structures; EXPRESSED DURING: 13 growth stages; CONTAINS InterPro DOMAIN/s: P60-like (InterPro:IPR011687), Tumour suppressor protein Gltscr2 (InterPro:IPR011211); Has 706 Blast hits to 639 proteins in 197 species: Archae - 0; Bacteria - 32; Metazoa - 228; Fungi - 147; Plants - 47; Viruses - 0; Other Eukaryotes - 252 (source: NCBI BLink). (e value: 5.1e-79; % identity: 45.99)
Query:  IITAEIEDFFEKSTKDALSGGSLSAVPSDSLFVVDKSRDLSVKRKIEKKREKVLYCDSILTKNPFVQ---------AKSK--------------------
        I + +IEDFFEK+T+DALSGG+LSA PS+ LF VDKS DL VKRKIEK RE+VL  DSIL KNPFVQ          KSK                    
Subjt:  IITAEIEDFFEKSTKDALSGGSLSAVPSDSLFVVDKSRDLSVKRKIEKKREKVLYCDSILTKNPFVQ---------AKSK--------------------

Query:  -------------------------PSIIPAVEVEPPGCSFNPSHESHQDVLAQAVAQEMQKVYRNELGPAPVPLTVPGEVISEEDMLFLDADNNTDDE-
                                 PSII AVE+E PGCS+NP+ ESHQD+LA+AVAQEMQKVY+ ELGPAPVPLT+ G+ +SE++  FLD DN ++ E 
Subjt:  -------------------------PSIIPAVEVEPPGCSFNPSHESHQDVLAQAVAQEMQKVYRNELGPAPVPLTVPGEVISEEDMLFLDADNNTDDE-

Query:  ----------------------------TNL--------------------------DEMDQD-------------EDNELEKRPL---------KMRRV
                                    TNL                          D++ +D             EDN+  K  +         K +RV
Subjt:  ----------------------------TNL--------------------------DEMDQD-------------EDNELEKRPL---------KMRRV

Query:  TRVEFNKRARHKEKVRKEAEVKKLEGISKEIDSLPDIIQEIAKEDEERENRRIRRTIAKQERLKSCPPRLGKHKFEPAPVQVLLSEEITGSFRKLKGCCT
        TRVE NKR R K   +KE + K  E I  EIDSLP+I++EIAKEDE+++N+ +RR IAKQE LK  PPRLGK+KFE  PVQVLL+EE+TGS RKLK CCT
Subjt:  TRVEFNKRARHKEKVRKEAEVKKLEGISKEIDSLPDIIQEIAKEDEERENRRIRRTIAKQERLKSCPPRLGKHKFEPAPVQVLLSEEITGSFRKLKGCCT

Query:  LVRDRYKSLEKRGIIAPTAKSRRL
        L RDR+KSLEKRGI+ P+ + RRL
Subjt:  LVRDRYKSLEKRGIIAPTAKSRRL

AT2G40430.3 FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: cellular_component unknown; EXPRESSED IN: 22 plant structures; EXPRESSED DURING: 13 growth stages; CONTAINS InterPro DOMAIN/s: Tumour suppressor protein Gltscr2 (InterPro:IPR011211), P60-like (InterPro:IPR011687). (e value: 1.5e-78; % identity: 45.5)
Query:  IITAEIEDFFEKSTKDALSGGSLSAVPSDSLFVVDKSRDLSVKRKIEKKREKVLYCDSILTKNPFVQ---------AKSK--------------------
        I + +IEDFFEK+T+DALSGG+LSA PS+ LF VDKS DL VKRKIEK RE+VL  DSIL KNPFVQ          KSK                    
Subjt:  IITAEIEDFFEKSTKDALSGGSLSAVPSDSLFVVDKSRDLSVKRKIEKKREKVLYCDSILTKNPFVQ---------AKSK--------------------

Query:  -------------------------PSIIPAVEVEPPGCSFNPSHESHQDVLAQAVAQEMQKVYRNELGPAPVPLTVPGEVISEEDMLFLDADNNTDDE-
                                 PSII AVE+E PGCS+NP+ ESHQD+LA+AVAQEMQKVY+ ELGPAPVPLT+ G+ +SE++  FLD DN ++ E 
Subjt:  -------------------------PSIIPAVEVEPPGCSFNPSHESHQDVLAQAVAQEMQKVYRNELGPAPVPLTVPGEVISEEDMLFLDADNNTDDE-

Query:  ----------------------------TNLDEMD--------------------------------------QDEDNELEKRPL---------KMRRVT
                                    TNL +++                                      + EDN+  K  +         K +RVT
Subjt:  ----------------------------TNLDEMD--------------------------------------QDEDNELEKRPL---------KMRRVT

Query:  RVEFNKRARHKEKVRKEAEVKKLEGISKEIDSLPDIIQEIAKEDEERENRRIRRTIAKQERLKSCPPRLGKHKFEPAPVQVLLSEEITGSFRKLKGCCTL
        RVE NKR R K   +KE + K  E I  EIDSLP+I++EIAKEDE+++N+ +RR IAKQE LK  PPRLGK+KFE  PVQVLL+EE+TGS RKLK CCTL
Subjt:  RVEFNKRARHKEKVRKEAEVKKLEGISKEIDSLPDIIQEIAKEDEERENRRIRRTIAKQERLKSCPPRLGKHKFEPAPVQVLLSEEITGSFRKLKGCCTL

Query:  VRDRYKSLEKRGIIAPTAKSRR
         RDR+KSLEKRGI+ P+ + RR
Subjt:  VRDRYKSLEKRGIIAPTAKSRR

AT3G56210.1 ARM repeat superfamily protein (e value: 1.2e-30; % identity: 50.74)
Query:  TLQFRNFSAYDERGDNLLQQSVSLLQVKDPLFKRMGASRLARFSIDDERRMKIVEIGGAQELLNMLGAAKDDRTH----EAVGALHKA----------GA
        T  +  +  +   GDNL+ QS+SLL VKDPLFKRMGASRL+RF+IDDERRMK+VE+GGAQELL+MLG+AKDD+T     +A+ AL K+          GA
Subjt:  TLQFRNFSAYDERGDNLLQQSVSLLQVKDPLFKRMGASRLARFSIDDERRMKIVEIGGAQELLNMLGAAKDDRTH----EAVGALHKA----------GA

Query:  ILVIKSTPDSAEDMKVNEFKSDLMKRFSDLRYDVSS
        + ++KSTP+S ED  ++ +KS+++++  +    VSS
Subjt:  ILVIKSTPDSAEDMKVNEFKSDLMKRFSDLRYDVSS

AT3G56210.2 ARM repeat superfamily protein (e value: 1.2e-30; % identity: 50.74)
Query:  TLQFRNFSAYDERGDNLLQQSVSLLQVKDPLFKRMGASRLARFSIDDERRMKIVEIGGAQELLNMLGAAKDDRTH----EAVGALHKA----------GA
        T  +  +  +   GDNL+ QS+SLL VKDPLFKRMGASRL+RF+IDDERRMK+VE+GGAQELL+MLG+AKDD+T     +A+ AL K+          GA
Subjt:  TLQFRNFSAYDERGDNLLQQSVSLLQVKDPLFKRMGASRLARFSIDDERRMKIVEIGGAQELLNMLGAAKDDRTH----EAVGALHKA----------GA

Query:  ILVIKSTPDSAEDMKVNEFKSDLMKRFSDLRYDVSS
        + ++KSTP+S ED  ++ +KS+++++  +    VSS
Subjt:  ILVIKSTPDSAEDMKVNEFKSDLMKRFSDLRYDVSS


Sequences
CDS sequence
ATGATAATCACTGCTGAAATTGAGGATTTCTTCGAGAAGTCCACAAAGGATGCTCTCTCCGGCGGCTCTCTTTCCGCCGTTCCCAGCGATTCTCTTTTCGTCGTCGATAA
GTCTCGAGATCTTTCAGTGAAGCGGAAAATAGAAAAGAAGCGGGAAAAAGTCCTTTACTGTGACAGCATATTAACGAAGAACCCATTTGTCCAAGCTAAATCAAAGCCAT
CTATTATTCCAGCAGTAGAAGTTGAGCCCCCGGGATGTTCGTTCAATCCATCGCATGAGAGTCATCAGGATGTACTTGCCCAAGCTGTTGCACAAGAAATGCAGAAAGTA
TATCGTAATGAACTGGGGCCTGCGCCAGTTCCTTTAACAGTCCCAGGAGAAGTTATTAGTGAAGAAGATATGTTGTTTTTGGATGCTGATAATAACACTGATGATGAAAC
CAATTTGGACGAAATGGATCAGGATGAAGATAATGAATTAGAAAAAAGGCCTTTGAAGATGAGAAGGGTGACTCGAGTTGAGTTCAATAAGAGAGCTAGGCATAAAGAAA
AGGTCAGAAAGGAAGCCGAAGTAAAGAAGTTGGAAGGAATTTCTAAAGAGATTGACAGCTTGCCGGATATCATTCAAGAAATAGCCAAAGAGGACGAGGAGAGAGAAAAT
AGACGTATCCGACGAACAATAGCCAAACAGGAGCGATTAAAGTCATGCCCACCTCGCTTGGGAAAGCACAAGTTTGAGCCCGCTCCAGTTCAAGTTCTCCTATCCGAGGA
AATAACTGGATCGTTTCGTAAGCTGAAGGGCTGTTGCACCCTTGTGAGGGACAGGTATAAGAGCTTAGAGAAAAGAGGAATAATTGCCCCTACTGCCAAGAGCAGAAGGC
TTCAAGCACAACTTCCTTACAAAGTTAAACTCTTCTTTTTTCCGATTGGTTCAACAGGAAATAGGATCAGATTTGTTCTATTGTTGAACTTGAGAGACGTGTATGGCGAA
TTCGACGAGATTATCGCAATGGAGTTTGAACATCTGTGCAGAAGGGAACCTGTGCGGACCCTGCAATTTCGCAATTTTTCAGCTTACGACGAAAGAGGGGATAACTTGTT
GCAGCAATCTGTGTCGCTCTTGCAAGTCAAGGATCCATTGTTTAAGAGGATGGGAGCGTCTAGATTGGCTCGCTTTTCAATTGATGATGAAAGAAGGATGAAAATAGTGG
AGATAGGTGGCGCTCAAGAGCTCTTAAACATGCTCGGCGCTGCCAAAGATGACCGGACACATGAAGCTGTTGGTGCCTTGCATAAAGCAGGGGCAATATTGGTTATTAAA
TCTACTCCAGATTCAGCTGAAGATATGAAAGTGAATGAGTTCAAGTCGGACCTAATGAAGAGATTTAGTGATCTTAGATATGATGTTTCATCTTGA
mRNA sequence
ATGATAATCACTGCTGAAATTGAGGATTTCTTCGAGAAGTCCACAAAGGATGCTCTCTCCGGCGGCTCTCTTTCCGCCGTTCCCAGCGATTCTCTTTTCGTCGTCGATAA
GTCTCGAGATCTTTCAGTGAAGCGGAAAATAGAAAAGAAGCGGGAAAAAGTCCTTTACTGTGACAGCATATTAACGAAGAACCCATTTGTCCAAGCTAAATCAAAGCCAT
CTATTATTCCAGCAGTAGAAGTTGAGCCCCCGGGATGTTCGTTCAATCCATCGCATGAGAGTCATCAGGATGTACTTGCCCAAGCTGTTGCACAAGAAATGCAGAAAGTA
TATCGTAATGAACTGGGGCCTGCGCCAGTTCCTTTAACAGTCCCAGGAGAAGTTATTAGTGAAGAAGATATGTTGTTTTTGGATGCTGATAATAACACTGATGATGAAAC
CAATTTGGACGAAATGGATCAGGATGAAGATAATGAATTAGAAAAAAGGCCTTTGAAGATGAGAAGGGTGACTCGAGTTGAGTTCAATAAGAGAGCTAGGCATAAAGAAA
AGGTCAGAAAGGAAGCCGAAGTAAAGAAGTTGGAAGGAATTTCTAAAGAGATTGACAGCTTGCCGGATATCATTCAAGAAATAGCCAAAGAGGACGAGGAGAGAGAAAAT
AGACGTATCCGACGAACAATAGCCAAACAGGAGCGATTAAAGTCATGCCCACCTCGCTTGGGAAAGCACAAGTTTGAGCCCGCTCCAGTTCAAGTTCTCCTATCCGAGGA
AATAACTGGATCGTTTCGTAAGCTGAAGGGCTGTTGCACCCTTGTGAGGGACAGGTATAAGAGCTTAGAGAAAAGAGGAATAATTGCCCCTACTGCCAAGAGCAGAAGGC
TTCAAGCACAACTTCCTTACAAAGTTAAACTCTTCTTTTTTCCGATTGGTTCAACAGGAAATAGGATCAGATTTGTTCTATTGTTGAACTTGAGAGACGTGTATGGCGAA
TTCGACGAGATTATCGCAATGGAGTTTGAACATCTGTGCAGAAGGGAACCTGTGCGGACCCTGCAATTTCGCAATTTTTCAGCTTACGACGAAAGAGGGGATAACTTGTT
GCAGCAATCTGTGTCGCTCTTGCAAGTCAAGGATCCATTGTTTAAGAGGATGGGAGCGTCTAGATTGGCTCGCTTTTCAATTGATGATGAAAGAAGGATGAAAATAGTGG
AGATAGGTGGCGCTCAAGAGCTCTTAAACATGCTCGGCGCTGCCAAAGATGACCGGACACATGAAGCTGTTGGTGCCTTGCATAAAGCAGGGGCAATATTGGTTATTAAA
TCTACTCCAGATTCAGCTGAAGATATGAAAGTGAATGAGTTCAAGTCGGACCTAATGAAGAGATTTAGTGATCTTAGATATGATGTTTCATCTTGACAAGAGGGCGGAGG
CAGCTTCGGCGTGAGAGATAGAATGAGGGAGATCGGATCAAGCGAGGAACCGTTAGAGTGACATGTGGAGGGGAGGGTCTGGGGAAGGTTTTTGAAAGAGAAACAACCTC
CCCTTTGACTAATTTCGATCTTGCTTTTGCTCATACTTATTCTTCTCGTTCTCACGTCCTCTTCATTTTTCTCTTGCTTTCGTTTCTTGTCTGGCTTGCACACTGGTCTC
ATGATTCTTAAAAAAAAAGTCGATCTTGCTTTTGCTATCATGCTTA
Protein sequence
MIITAEIEDFFEKSTKDALSGGSLSAVPSDSLFVVDKSRDLSVKRKIEKKREKVLYCDSILTKNPFVQAKSKPSIIPAVEVEPPGCSFNPSHESHQDVLAQAVAQEMQKV
YRNELGPAPVPLTVPGEVISEEDMLFLDADNNTDDETNLDEMDQDEDNELEKRPLKMRRVTRVEFNKRARHKEKVRKEAEVKKLEGISKEIDSLPDIIQEIAKEDEEREN
RRIRRTIAKQERLKSCPPRLGKHKFEPAPVQVLLSEEITGSFRKLKGCCTLVRDRYKSLEKRGIIAPTAKSRRLQAQLPYKVKLFFFPIGSTGNRIRFVLLLNLRDVYGE
FDEIIAMEFEHLCRREPVRTLQFRNFSAYDERGDNLLQQSVSLLQVKDPLFKRMGASRLARFSIDDERRMKIVEIGGAQELLNMLGAAKDDRTHEAVGALHKAGAILVIK
STPDSAEDMKVNEFKSDLMKRFSDLRYDVSS