; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MC11g0504 (gene) of Bitter gourd (Dali-11) v1 genome

Gene IDMC11g0504
OrganismMomordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
Descriptionribosomal RNA small subunit methyltransferase H
Genome locationMC11:3920915..3928332
RNA-Seq ExpressionMC11g0504
SyntenyMC11g0504
Gene Ontology termsGO:0070475 - rRNA base methylation (biological process)
GO:0009507 - chloroplast (cellular component)
GO:0071424 - rRNA (cytosine-N4-)-methyltransferase activity (molecular function)
InterPro domainsIPR002903 - Ribosomal RNA small subunit methyltransferase H
IPR023397 - S-adenosyl-L-methionine-dependent methyltransferase, MraW, recognition domain superfamily
IPR029063 - S-adenosyl-L-methionine-dependent methyltransferase


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6608197.1 hypothetical protein SDJN03_01539, partial [Cucurbita argyrosperma subsp. sororia]1.16e-24181.84Show/hide
Query:  MAKLPLFSPPPSQSLRFRSLISAGTSAPPRFLRCCSSISTATDVRNKANKEKQKKSRNLKASTA-------SSSSLKLALVKEKRRTRSTKEFDENAIQF
        M KL LFS  PSQS R R LIS   +  P   RCCS ISTATD  NK  K KQKK++N+KASTA       SSSSLKLALVKEKRRTRSTKEFDE A+QF
Subjt:  MAKLPLFSPPPSQSLRFRSLISAGTSAPPRFLRCCSSISTATDVRNKANKEKQKKSRNLKASTA-------SSSSLKLALVKEKRRTRSTKEFDENAIQF

Query:  GDTAAHIPVMLAEVLDVFSASSGRRLHSFVDCTVGAAGHSSAIIQAHPELKFYMGMDVDPIAVDKAEDRISALFSEDSDLKAYTVLKNFKFTKSLLTEAD
        GDTAAHIPVMLAEVLDVFSASSGR L SFVDCTVGA GHSSAIIQAHPELKFYMGMDVDPIA+DKA+DRIS LF EDSDLKAYTVLKNFK+ KSLL ++D
Subjt:  GDTAAHIPVMLAEVLDVFSASSGRRLHSFVDCTVGAAGHSSAIIQAHPELKFYMGMDVDPIAVDKAEDRISALFSEDSDLKAYTVLKNFKFTKSLLTEAD

Query:  EKPLDPGVDGILMDLGMSSMQVDDPGRGFSVLCDGPLDMRMDPQASVRAEDILNSWPEIEVGRILRVYGEESNWYSLQNKIMKARSQGGLHSTTQLVDLI
        EKPLDPGVDGILMDLGMSSMQVDDP RGFSVLCDGPLDMRMDPQAS++AEDILNSWPEIEVGR+LRVYGEESNWYSLQNKI+KARSQGGLH+T++LVDLI
Subjt:  EKPLDPGVDGILMDLGMSSMQVDDPGRGFSVLCDGPLDMRMDPQASVRAEDILNSWPEIEVGRILRVYGEESNWYSLQNKIMKARSQGGLHSTTQLVDLI

Query:  RKSTPPFKGGRQGWIKTATRVFQALRIAVNDELKVLEDSLYSCFDCLAPGGRLAVISFHSLEDRVVKQTFLNIINNPEME---KDEDIR---EDIVENEE
        RKSTP FKGGRQGWIKTATRVFQALRIAVNDELKVLE +LYS F+CLAPGGRLAVISFHSLEDRVVKQTFL+IINNP+ E   ++E IR   ++  E +E
Subjt:  RKSTPPFKGGRQGWIKTATRVFQALRIAVNDELKVLEDSLYSCFDCLAPGGRLAVISFHSLEDRVVKQTFLNIINNPEME---KDEDIR---EDIVENEE

Query:  EEWIKQTVKGSVGTILTKRPITPSEEEERLNRRSRSAKLRVIQKNN
        EEWIKQ VKGS G ILTKRPITPSEEEERLNRR RSAKLRVIQKNN
Subjt:  EEWIKQTVKGSVGTILTKRPITPSEEEERLNRRSRSAKLRVIQKNN

KAG7037555.1 rsmH, partial [Cucurbita argyrosperma subsp. argyrosperma]4.35e-24382.1Show/hide
Query:  SMAKLPLFSPPPSQSLRFRSLISAGTSAPPRFLRCCSSISTATDVRNKANKEKQKKSRNLKASTA-------SSSSLKLALVKEKRRTRSTKEFDENAIQ
        SM KL LFS  PSQS R R LIS   +  P   RCCS ISTATD  NK  K KQKK++N+KASTA       SSSSLKLALVKEKRRTRSTKEFDE A+Q
Subjt:  SMAKLPLFSPPPSQSLRFRSLISAGTSAPPRFLRCCSSISTATDVRNKANKEKQKKSRNLKASTA-------SSSSLKLALVKEKRRTRSTKEFDENAIQ

Query:  FGDTAAHIPVMLAEVLDVFSASSGRRLHSFVDCTVGAAGHSSAIIQAHPELKFYMGMDVDPIAVDKAEDRISALFSEDSDLKAYTVLKNFKFTKSLLTEA
        FGDTAAHIPVMLAEVLDVFSASSGR L SFVDCTVGA GHSSAIIQAHPELKFYMGMDVDPIA+DKA+DRIS LF EDSDLKAYTVLKNFK+ KSLL ++
Subjt:  FGDTAAHIPVMLAEVLDVFSASSGRRLHSFVDCTVGAAGHSSAIIQAHPELKFYMGMDVDPIAVDKAEDRISALFSEDSDLKAYTVLKNFKFTKSLLTEA

Query:  DEKPLDPGVDGILMDLGMSSMQVDDPGRGFSVLCDGPLDMRMDPQASVRAEDILNSWPEIEVGRILRVYGEESNWYSLQNKIMKARSQGGLHSTTQLVDL
        DEKPLDPGVDGILMDLGMSSMQVDDP RGFSVLCDGPLDMRMDPQAS++AEDILNSWPEIEVGR+LRVYGEESNWYSLQNKI+KARSQGGLH+T++LVDL
Subjt:  DEKPLDPGVDGILMDLGMSSMQVDDPGRGFSVLCDGPLDMRMDPQASVRAEDILNSWPEIEVGRILRVYGEESNWYSLQNKIMKARSQGGLHSTTQLVDL

Query:  IRKSTPPFKGGRQGWIKTATRVFQALRIAVNDELKVLEDSLYSCFDCLAPGGRLAVISFHSLEDRVVKQTFLNIINNPEME---KDEDIR---EDIVENE
        IRKSTP FKGGRQGWIKTATRVFQALRIAVNDELKVLED+LYS F+CLAPGGRLAVISFHSLEDRVVKQTFL+IINNP+ E   ++E IR   ++  E +
Subjt:  IRKSTPPFKGGRQGWIKTATRVFQALRIAVNDELKVLEDSLYSCFDCLAPGGRLAVISFHSLEDRVVKQTFLNIINNPEME---KDEDIR---EDIVENE

Query:  EEEWIKQTVKGSVGTILTKRPITPSEEEERLNRRSRSAKLRVIQKNN
        EEEWIKQ VKGS G ILTKRPITPSEEEERLNRR RSAKLRVIQKNN
Subjt:  EEEWIKQTVKGSVGTILTKRPITPSEEEERLNRRSRSAKLRVIQKNN

XP_022156174.1 uncharacterized protein LOC111023099 [Momordica charantia]2.36e-308100Show/hide
Query:  MALKSMAKLPLFSPPPSQSLRFRSLISAGTSAPPRFLRCCSSISTATDVRNKANKEKQKKSRNLKASTASSSSLKLALVKEKRRTRSTKEFDENAIQFGD
        MALKSMAKLPLFSPPPSQSLRFRSLISAGTSAPPRFLRCCSSISTATDVRNKANKEKQKKSRNLKASTASSSSLKLALVKEKRRTRSTKEFDENAIQFGD
Subjt:  MALKSMAKLPLFSPPPSQSLRFRSLISAGTSAPPRFLRCCSSISTATDVRNKANKEKQKKSRNLKASTASSSSLKLALVKEKRRTRSTKEFDENAIQFGD

Query:  TAAHIPVMLAEVLDVFSASSGRRLHSFVDCTVGAAGHSSAIIQAHPELKFYMGMDVDPIAVDKAEDRISALFSEDSDLKAYTVLKNFKFTKSLLTEADEK
        TAAHIPVMLAEVLDVFSASSGRRLHSFVDCTVGAAGHSSAIIQAHPELKFYMGMDVDPIAVDKAEDRISALFSEDSDLKAYTVLKNFKFTKSLLTEADEK
Subjt:  TAAHIPVMLAEVLDVFSASSGRRLHSFVDCTVGAAGHSSAIIQAHPELKFYMGMDVDPIAVDKAEDRISALFSEDSDLKAYTVLKNFKFTKSLLTEADEK

Query:  PLDPGVDGILMDLGMSSMQVDDPGRGFSVLCDGPLDMRMDPQASVRAEDILNSWPEIEVGRILRVYGEESNWYSLQNKIMKARSQGGLHSTTQLVDLIRK
        PLDPGVDGILMDLGMSSMQVDDPGRGFSVLCDGPLDMRMDPQASVRAEDILNSWPEIEVGRILRVYGEESNWYSLQNKIMKARSQGGLHSTTQLVDLIRK
Subjt:  PLDPGVDGILMDLGMSSMQVDDPGRGFSVLCDGPLDMRMDPQASVRAEDILNSWPEIEVGRILRVYGEESNWYSLQNKIMKARSQGGLHSTTQLVDLIRK

Query:  STPPFKGGRQGWIKTATRVFQALRIAVNDELKVLEDSLYSCFDCLAPGGRLAVISFHSLEDRVVKQTFLNIINNPEMEKDEDIREDIVENEEEEWIKQTV
        STPPFKGGRQGWIKTATRVFQALRIAVNDELKVLEDSLYSCFDCLAPGGRLAVISFHSLEDRVVKQTFLNIINNPEMEKDEDIREDIVENEEEEWIKQTV
Subjt:  STPPFKGGRQGWIKTATRVFQALRIAVNDELKVLEDSLYSCFDCLAPGGRLAVISFHSLEDRVVKQTFLNIINNPEMEKDEDIREDIVENEEEEWIKQTV

Query:  KGSVGTILTKRPITPSEEEERLNRRSRSAKLRVIQKNN
        KGSVGTILTKRPITPSEEEERLNRRSRSAKLRVIQKNN
Subjt:  KGSVGTILTKRPITPSEEEERLNRRSRSAKLRVIQKNN

XP_022940206.1 uncharacterized protein LOC111445901 [Cucurbita moschata]3.06e-24382.1Show/hide
Query:  SMAKLPLFSPPPSQSLRFRSLISAGTSAPPRFLRCCSSISTATDVRNKANKEKQKKSRNLKASTA-------SSSSLKLALVKEKRRTRSTKEFDENAIQ
        SM KL LFS  PSQS R R LIS   +  P   RCCS ISTATD  NK  K KQKK++N+KASTA       SSSSLKLALVKEKRRTRSTKEFDE A+Q
Subjt:  SMAKLPLFSPPPSQSLRFRSLISAGTSAPPRFLRCCSSISTATDVRNKANKEKQKKSRNLKASTA-------SSSSLKLALVKEKRRTRSTKEFDENAIQ

Query:  FGDTAAHIPVMLAEVLDVFSASSGRRLHSFVDCTVGAAGHSSAIIQAHPELKFYMGMDVDPIAVDKAEDRISALFSEDSDLKAYTVLKNFKFTKSLLTEA
        FGDTAAHIPVMLAEVLDVFSASSGR L SFVDCTVGA GHSSAIIQAHPELKFYMGMDVDPIA+DKA+DRIS LF EDSDLKAYTVLKNFK+ KSLL ++
Subjt:  FGDTAAHIPVMLAEVLDVFSASSGRRLHSFVDCTVGAAGHSSAIIQAHPELKFYMGMDVDPIAVDKAEDRISALFSEDSDLKAYTVLKNFKFTKSLLTEA

Query:  DEKPLDPGVDGILMDLGMSSMQVDDPGRGFSVLCDGPLDMRMDPQASVRAEDILNSWPEIEVGRILRVYGEESNWYSLQNKIMKARSQGGLHSTTQLVDL
        DEKPLDPGVDGILMDLGMSSMQVDDP RGFSVLCDGPLDMRMDPQAS++AEDILNSWPEIEVGR+LRVYGEESNWYSLQNKI+KARSQGGLH+T++LVDL
Subjt:  DEKPLDPGVDGILMDLGMSSMQVDDPGRGFSVLCDGPLDMRMDPQASVRAEDILNSWPEIEVGRILRVYGEESNWYSLQNKIMKARSQGGLHSTTQLVDL

Query:  IRKSTPPFKGGRQGWIKTATRVFQALRIAVNDELKVLEDSLYSCFDCLAPGGRLAVISFHSLEDRVVKQTFLNIINNPEME---KDEDIR---EDIVENE
        IRKSTP FKGGRQGWIKTATRVFQALRIAVNDELKVLED+LYS F+CLAPGGRLAVISFHSLEDRVVKQTFL+IINNP+ E   ++E IR   ++  E +
Subjt:  IRKSTPPFKGGRQGWIKTATRVFQALRIAVNDELKVLEDSLYSCFDCLAPGGRLAVISFHSLEDRVVKQTFLNIINNPEME---KDEDIR---EDIVENE

Query:  EEEWIKQTVKGSVGTILTKRPITPSEEEERLNRRSRSAKLRVIQKNN
        EEEWIKQ VKGS G ILTKRPITPSEEEERLNRR RSAKLRVIQKNN
Subjt:  EEEWIKQTVKGSVGTILTKRPITPSEEEERLNRRSRSAKLRVIQKNN

XP_023524199.1 uncharacterized protein LOC111788179 [Cucurbita pepo subsp. pepo]5.31e-24481.86Show/hide
Query:  MALKS-MAKLPLFSPPPSQSLRFRSLISAGTSAPPRFLRCCSSISTATDVRNKANKEKQKKSRNLKASTA-------SSSSLKLALVKEKRRTRSTKEFD
        MAL+S M KL LFS  PSQS R R LIS   +  P   RCCS ISTATD  NK  K KQKK++N+KASTA       SSSSLKLALVKEKRRTRSTKEFD
Subjt:  MALKS-MAKLPLFSPPPSQSLRFRSLISAGTSAPPRFLRCCSSISTATDVRNKANKEKQKKSRNLKASTA-------SSSSLKLALVKEKRRTRSTKEFD

Query:  ENAIQFGDTAAHIPVMLAEVLDVFSASSGRRLHSFVDCTVGAAGHSSAIIQAHPELKFYMGMDVDPIAVDKAEDRISALFSEDSDLKAYTVLKNFKFTKS
        E A+QFGDTAAHIPVMLAEVLDVFSASSGR L SFVDCTVGA GHSSAIIQAHPELKFYMGMDVDPIA+DKA+DRIS LF EDSDLKAYTVLKNFK+ KS
Subjt:  ENAIQFGDTAAHIPVMLAEVLDVFSASSGRRLHSFVDCTVGAAGHSSAIIQAHPELKFYMGMDVDPIAVDKAEDRISALFSEDSDLKAYTVLKNFKFTKS

Query:  LLTEADEKPLDPGVDGILMDLGMSSMQVDDPGRGFSVLCDGPLDMRMDPQASVRAEDILNSWPEIEVGRILRVYGEESNWYSLQNKIMKARSQGGLHSTT
        LL ++DEKPL+PGVDGILMDLGMSSMQVDDP RGFSVLCDGPLDMRMDPQAS++AEDILNSWPEIEVGR+LRVYGEESNWYSLQNKI+KARSQGGLH+T+
Subjt:  LLTEADEKPLDPGVDGILMDLGMSSMQVDDPGRGFSVLCDGPLDMRMDPQASVRAEDILNSWPEIEVGRILRVYGEESNWYSLQNKIMKARSQGGLHSTT

Query:  QLVDLIRKSTPPFKGGRQGWIKTATRVFQALRIAVNDELKVLEDSLYSCFDCLAPGGRLAVISFHSLEDRVVKQTFLNIINNPEME---KDEDIR---ED
        +LVDLIRKSTP FKGGRQGWIKTATRVFQALRIAVNDELKVLED+LYS F+CLAPGGRLAVISFHSLEDRVVKQTFL+IINNP+ E   ++E IR   ++
Subjt:  QLVDLIRKSTPPFKGGRQGWIKTATRVFQALRIAVNDELKVLEDSLYSCFDCLAPGGRLAVISFHSLEDRVVKQTFLNIINNPEME---KDEDIR---ED

Query:  IVENEEEEWIKQTVKGSVGTILTKRPITPSEEEERLNRRSRSAKLRVIQKNN
          E +EEEWIKQTVKGS G ILTKRPITPSEEEERLNRR RSAKLRVIQKNN
Subjt:  IVENEEEEWIKQTVKGSVGTILTKRPITPSEEEERLNRRSRSAKLRVIQKNN

TrEMBL top hitse value%identityAlignment
A0A0A0LDT0 Uncharacterized protein4.95e-23480.05Show/hide
Query:  SMAKLPLFSPPPSQSLRFRSLISAGTSAPPRFLRCCSSISTATDVRNKANKEKQKKSRNLKASTA-------SSSSLKLALVKEKRRTRSTKEFDENAIQ
        S+AK  LFS   SQSL F+      T+ PPR  R CS ISTA+ V+NKA KEKQKKS+NLKAST+       +S S KLALVKEKRRTRSTKEFDENAI 
Subjt:  SMAKLPLFSPPPSQSLRFRSLISAGTSAPPRFLRCCSSISTATDVRNKANKEKQKKSRNLKASTA-------SSSSLKLALVKEKRRTRSTKEFDENAIQ

Query:  FGDTAAHIPVMLAEVLDVFSASSGRRLHSFVDCTVGAAGHSSAIIQAHPELKFYMGMDVDPIAVDKAEDRISALFSEDSDLKAYTVLKNFKFTKSLLTEA
         GDTAAHIPVMLAEVLDVF++SSGR L SFVDCTVGA GHSSAIIQAHPEL FYMGMDVDPIA+DKA+DRIS  FSEDSDLKAY VLKNFK+ KSLL ++
Subjt:  FGDTAAHIPVMLAEVLDVFSASSGRRLHSFVDCTVGAAGHSSAIIQAHPELKFYMGMDVDPIAVDKAEDRISALFSEDSDLKAYTVLKNFKFTKSLLTEA

Query:  DEKPLDPGVDGILMDLGMSSMQVDDPGRGFSVLCDGPLDMRMDPQASVRAEDILNSWPEIEVGRILRVYGEESNWYSLQNKIMKARSQGGLHSTTQLVDL
        DEKPLDPGVDG++MDLGMSSMQVDDP RGF VL DGPLDMRMDPQAS++AEDILN+WPE EVGRILRVYGEESNWYSLQNKI+KARSQGGLHSTTQL+DL
Subjt:  DEKPLDPGVDGILMDLGMSSMQVDDPGRGFSVLCDGPLDMRMDPQASVRAEDILNSWPEIEVGRILRVYGEESNWYSLQNKIMKARSQGGLHSTTQLVDL

Query:  IRKSTPPFKGGRQGWIKTATRVFQALRIAVNDELKVLEDSLYSCFDCLAPGGRLAVISFHSLEDRVVKQTFLNIINNPEMEKDEDIREDI-VENEEEEWI
        IRKST  FKGGRQGWIKTATRVFQALRIAVNDEL VL++SLY+CFDCLAPGGRLAVISFHSLEDRVVKQTFL+IIN P+ E DED  E   +  E EEWI
Subjt:  IRKSTPPFKGGRQGWIKTATRVFQALRIAVNDELKVLEDSLYSCFDCLAPGGRLAVISFHSLEDRVVKQTFLNIINNPEMEKDEDIREDI-VENEEEEWI

Query:  KQTVKGSVGTILTKRPITPSEEEERLNRRSRSAKLRVIQKN
        KQTVKG  GT+LTKRPITPSEEEERLNRRSRSAKLRVIQKN
Subjt:  KQTVKGSVGTILTKRPITPSEEEERLNRRSRSAKLRVIQKN

A0A5A7T697 Ribosomal RNA small subunit methyltransferase H1.18e-23380.45Show/hide
Query:  SMAKLPLFSPPPSQSLRFRSLISAGTSAPPRFLRCCSSISTATDVRNKANKEKQKKSRNLKAST-------ASSSSLKLALVKEKRRTRSTKEFDENAIQ
        S+AK  LFS   SQSL F  L    T+ PPR  R CS ISTA+ V+NK  KEK KKS+NLKAST       ++S S KLALVKEKRRTRSTKEFDENAI 
Subjt:  SMAKLPLFSPPPSQSLRFRSLISAGTSAPPRFLRCCSSISTATDVRNKANKEKQKKSRNLKAST-------ASSSSLKLALVKEKRRTRSTKEFDENAIQ

Query:  FGDTAAHIPVMLAEVLDVFSASSGRRLHSFVDCTVGAAGHSSAIIQAHPELKFYMGMDVDPIAVDKAEDRISALFSEDSDLKAYTVLKNFKFTKSLLTEA
         GDTAAHIPVMLAEVLDVFS+SSGR L SFVDCTVGA GHSSAIIQAHPEL FYMGMDVDPIA+DKA+DRIS  FSEDSDLKAY VLKNFK+ KSLL ++
Subjt:  FGDTAAHIPVMLAEVLDVFSASSGRRLHSFVDCTVGAAGHSSAIIQAHPELKFYMGMDVDPIAVDKAEDRISALFSEDSDLKAYTVLKNFKFTKSLLTEA

Query:  DEKPLDPGVDGILMDLGMSSMQVDDPGRGFSVLCDGPLDMRMDPQASVRAEDILNSWPEIEVGRILRVYGEESNWYSLQNKIMKARSQGGLHSTTQLVDL
        DEKPLDPGVDGILMDLGMSSMQVDDP RGF VL DGPLDMRMDPQAS++AEDILN+WPEIEVGRILR YGEESNWYSLQNKI+KARSQGGLHSTTQL+DL
Subjt:  DEKPLDPGVDGILMDLGMSSMQVDDPGRGFSVLCDGPLDMRMDPQASVRAEDILNSWPEIEVGRILRVYGEESNWYSLQNKIMKARSQGGLHSTTQLVDL

Query:  IRKSTPPFKGGRQGWIKTATRVFQALRIAVNDELKVLEDSLYSCFDCLAPGGRLAVISFHSLEDRVVKQTFLNIINNPEMEKDEDIREDIVENEEEEWIK
        IRKSTP FKGGRQGWIKTATRVFQALRIAVNDEL VL++SLY+CFDCLAPGGRLAVISFHSLEDRVVKQTFL+IIN P+ E DE  +   + +E+EEWIK
Subjt:  IRKSTPPFKGGRQGWIKTATRVFQALRIAVNDELKVLEDSLYSCFDCLAPGGRLAVISFHSLEDRVVKQTFLNIINNPEMEKDEDIREDIVENEEEEWIK

Query:  QTVKGSVGTILTKRPITPSEEEERLNRRSRSAKLRVIQKN
        QTVKGS G +LTKRPITPSEEEERLNRRSRSAKLRVIQKN
Subjt:  QTVKGSVGTILTKRPITPSEEEERLNRRSRSAKLRVIQKN

A0A6J1DSK2 uncharacterized protein LOC1110230991.14e-308100Show/hide
Query:  MALKSMAKLPLFSPPPSQSLRFRSLISAGTSAPPRFLRCCSSISTATDVRNKANKEKQKKSRNLKASTASSSSLKLALVKEKRRTRSTKEFDENAIQFGD
        MALKSMAKLPLFSPPPSQSLRFRSLISAGTSAPPRFLRCCSSISTATDVRNKANKEKQKKSRNLKASTASSSSLKLALVKEKRRTRSTKEFDENAIQFGD
Subjt:  MALKSMAKLPLFSPPPSQSLRFRSLISAGTSAPPRFLRCCSSISTATDVRNKANKEKQKKSRNLKASTASSSSLKLALVKEKRRTRSTKEFDENAIQFGD

Query:  TAAHIPVMLAEVLDVFSASSGRRLHSFVDCTVGAAGHSSAIIQAHPELKFYMGMDVDPIAVDKAEDRISALFSEDSDLKAYTVLKNFKFTKSLLTEADEK
        TAAHIPVMLAEVLDVFSASSGRRLHSFVDCTVGAAGHSSAIIQAHPELKFYMGMDVDPIAVDKAEDRISALFSEDSDLKAYTVLKNFKFTKSLLTEADEK
Subjt:  TAAHIPVMLAEVLDVFSASSGRRLHSFVDCTVGAAGHSSAIIQAHPELKFYMGMDVDPIAVDKAEDRISALFSEDSDLKAYTVLKNFKFTKSLLTEADEK

Query:  PLDPGVDGILMDLGMSSMQVDDPGRGFSVLCDGPLDMRMDPQASVRAEDILNSWPEIEVGRILRVYGEESNWYSLQNKIMKARSQGGLHSTTQLVDLIRK
        PLDPGVDGILMDLGMSSMQVDDPGRGFSVLCDGPLDMRMDPQASVRAEDILNSWPEIEVGRILRVYGEESNWYSLQNKIMKARSQGGLHSTTQLVDLIRK
Subjt:  PLDPGVDGILMDLGMSSMQVDDPGRGFSVLCDGPLDMRMDPQASVRAEDILNSWPEIEVGRILRVYGEESNWYSLQNKIMKARSQGGLHSTTQLVDLIRK

Query:  STPPFKGGRQGWIKTATRVFQALRIAVNDELKVLEDSLYSCFDCLAPGGRLAVISFHSLEDRVVKQTFLNIINNPEMEKDEDIREDIVENEEEEWIKQTV
        STPPFKGGRQGWIKTATRVFQALRIAVNDELKVLEDSLYSCFDCLAPGGRLAVISFHSLEDRVVKQTFLNIINNPEMEKDEDIREDIVENEEEEWIKQTV
Subjt:  STPPFKGGRQGWIKTATRVFQALRIAVNDELKVLEDSLYSCFDCLAPGGRLAVISFHSLEDRVVKQTFLNIINNPEMEKDEDIREDIVENEEEEWIKQTV

Query:  KGSVGTILTKRPITPSEEEERLNRRSRSAKLRVIQKNN
        KGSVGTILTKRPITPSEEEERLNRRSRSAKLRVIQKNN
Subjt:  KGSVGTILTKRPITPSEEEERLNRRSRSAKLRVIQKNN

A0A6J1FNN0 uncharacterized protein LOC1114459011.48e-24382.1Show/hide
Query:  SMAKLPLFSPPPSQSLRFRSLISAGTSAPPRFLRCCSSISTATDVRNKANKEKQKKSRNLKASTA-------SSSSLKLALVKEKRRTRSTKEFDENAIQ
        SM KL LFS  PSQS R R LIS   +  P   RCCS ISTATD  NK  K KQKK++N+KASTA       SSSSLKLALVKEKRRTRSTKEFDE A+Q
Subjt:  SMAKLPLFSPPPSQSLRFRSLISAGTSAPPRFLRCCSSISTATDVRNKANKEKQKKSRNLKASTA-------SSSSLKLALVKEKRRTRSTKEFDENAIQ

Query:  FGDTAAHIPVMLAEVLDVFSASSGRRLHSFVDCTVGAAGHSSAIIQAHPELKFYMGMDVDPIAVDKAEDRISALFSEDSDLKAYTVLKNFKFTKSLLTEA
        FGDTAAHIPVMLAEVLDVFSASSGR L SFVDCTVGA GHSSAIIQAHPELKFYMGMDVDPIA+DKA+DRIS LF EDSDLKAYTVLKNFK+ KSLL ++
Subjt:  FGDTAAHIPVMLAEVLDVFSASSGRRLHSFVDCTVGAAGHSSAIIQAHPELKFYMGMDVDPIAVDKAEDRISALFSEDSDLKAYTVLKNFKFTKSLLTEA

Query:  DEKPLDPGVDGILMDLGMSSMQVDDPGRGFSVLCDGPLDMRMDPQASVRAEDILNSWPEIEVGRILRVYGEESNWYSLQNKIMKARSQGGLHSTTQLVDL
        DEKPLDPGVDGILMDLGMSSMQVDDP RGFSVLCDGPLDMRMDPQAS++AEDILNSWPEIEVGR+LRVYGEESNWYSLQNKI+KARSQGGLH+T++LVDL
Subjt:  DEKPLDPGVDGILMDLGMSSMQVDDPGRGFSVLCDGPLDMRMDPQASVRAEDILNSWPEIEVGRILRVYGEESNWYSLQNKIMKARSQGGLHSTTQLVDL

Query:  IRKSTPPFKGGRQGWIKTATRVFQALRIAVNDELKVLEDSLYSCFDCLAPGGRLAVISFHSLEDRVVKQTFLNIINNPEME---KDEDIR---EDIVENE
        IRKSTP FKGGRQGWIKTATRVFQALRIAVNDELKVLED+LYS F+CLAPGGRLAVISFHSLEDRVVKQTFL+IINNP+ E   ++E IR   ++  E +
Subjt:  IRKSTPPFKGGRQGWIKTATRVFQALRIAVNDELKVLEDSLYSCFDCLAPGGRLAVISFHSLEDRVVKQTFLNIINNPEME---KDEDIR---EDIVENE

Query:  EEEWIKQTVKGSVGTILTKRPITPSEEEERLNRRSRSAKLRVIQKNN
        EEEWIKQ VKGS G ILTKRPITPSEEEERLNRR RSAKLRVIQKNN
Subjt:  EEEWIKQTVKGSVGTILTKRPITPSEEEERLNRRSRSAKLRVIQKNN

A0A6J1IWR1 uncharacterized protein LOC1114812934.18e-23680.54Show/hide
Query:  SMAKLPLFSPPPSQSLRFRSLISAGTSAPPRFLRCCSSISTATDVRNKANKEKQKKSRNLKASTA-------SSSSLKLALVKEKRRTRSTKEFDENAIQ
        SM KL LFS  PSQS R R LIS   +  P   RCCS ISTATD + K  K KQKK++N+KASTA       SSSSLKLALVKEKRRTRSTKEFDE A+Q
Subjt:  SMAKLPLFSPPPSQSLRFRSLISAGTSAPPRFLRCCSSISTATDVRNKANKEKQKKSRNLKASTA-------SSSSLKLALVKEKRRTRSTKEFDENAIQ

Query:  FGDTAAHIPVMLAEVLDVFSASSGRRLHSFVDCTVGAAGHSSAIIQAHPELKFYMGMDVDPIAVDKAEDRISALFSEDSDLKAYTVLKNFKFTKSLLTEA
        FGDTAAHIPVMLAEVLDVFSASSGR L SFVDCTVGA GHSSAIIQAH ELKFYMGMDVDPIA+DKA+DRIS LF EDSDLK YTVLKNFK+ K LL ++
Subjt:  FGDTAAHIPVMLAEVLDVFSASSGRRLHSFVDCTVGAAGHSSAIIQAHPELKFYMGMDVDPIAVDKAEDRISALFSEDSDLKAYTVLKNFKFTKSLLTEA

Query:  DEKPLDPGVDGILMDLGMSSMQVDDPGRGFSVLCDGPLDMRMDPQASVRAEDILNSWPEIEVGRILRVYGEESNWYSLQNKIMKARSQGGLHSTTQLVDL
        DEKPL+PGVDGILMDLGMSSMQVDDP RGFSVL DGPLDMRMDPQAS++AEDILNSWPE+EVGR+LRVYGEESNWYSLQNKI+KARSQGGLH+T++LVDL
Subjt:  DEKPLDPGVDGILMDLGMSSMQVDDPGRGFSVLCDGPLDMRMDPQASVRAEDILNSWPEIEVGRILRVYGEESNWYSLQNKIMKARSQGGLHSTTQLVDL

Query:  IRKSTPPFKGGRQGWIKTATRVFQALRIAVNDELKVLEDSLYSCFDCLAPGGRLAVISFHSLEDRVVKQTFLNIINNPEME---KDEDIR---EDIVENE
        IRKSTP  KGGRQGWIKTATRVFQALRIAVNDELKVLEDSLYS F+CLAPGGRLAVISFHSLEDRVVKQTFL+IIN P+ E   ++E IR   ++  E +
Subjt:  IRKSTPPFKGGRQGWIKTATRVFQALRIAVNDELKVLEDSLYSCFDCLAPGGRLAVISFHSLEDRVVKQTFLNIINNPEME---KDEDIR---EDIVENE

Query:  EEEWIKQTVKGSVGTILTKRPITPSEEEERLNRRSRSAKLRVIQKNN
        EEEWIKQTVKGS G ILTKRPITPSEEEERLNRR RSAKLRVIQKNN
Subjt:  EEEWIKQTVKGSVGTILTKRPITPSEEEERLNRRSRSAKLRVIQKNN

SwissProt top hitse value%identityAlignment
A8FCX3 Ribosomal RNA small subunit methyltransferase H7.0e-5137.54Show/hide
Query:  HIPVMLAEVLDVFSASSGRRLHSFVDCTVGAAGHSSAIIQAHPELKFYMGMDVDPIAVDKAEDRISALFSEDSDLKAYTVLKNFKFTKSLLTEADEKPLD
        H  V+L E +D  +        ++VDCT+G AGHSS ++    E    +G D D  A+D A ++++      S      +  NF++ K  L E       
Subjt:  HIPVMLAEVLDVFSASSGRRLHSFVDCTVGAAGHSSAIIQAHPELKFYMGMDVDPIAVDKAEDRISALFSEDSDLKAYTVLKNFKFTKSLLTEADEKPLD

Query:  PGVDGILMDLGMSSMQVDDPGRGFSVLCDGPLDMRMDPQASVRAEDILNSWPEIEVGRILRVYGEESNWYSLQNKIMKARSQGGLHSTTQLVDLIRKSTP
          VDG++ DLG+SS Q+D P RGFS   D PLDMRMD  A++ A+ ++N WP  ++ RI   YGEE     +  KI +AR +  + +T +LVD+I++  P
Subjt:  PGVDGILMDLGMSSMQVDDPGRGFSVLCDGPLDMRMDPQASVRAEDILNSWPEIEVGRILRVYGEESNWYSLQNKIMKARSQGGLHSTTQLVDLIRKSTP

Query:  PFKGGRQGWIKTATRVFQALRIAVNDELKVLEDSLYSCFDCLAPGGRLAVISFHSLEDRVVKQTFLNIINNPEMEKDEDIREDIVENEEEEWIKQTVKGS
             R+     A RVFQA+RIAVNDELKV E++L    + L P GR++VI+FHSLEDR+ K TF  + + PE+     +  + +E + +          
Subjt:  PFKGGRQGWIKTATRVFQALRIAVNDELKVLEDSLYSCFDCLAPGGRLAVISFHSLEDRVVKQTFLNIINNPEMEKDEDIREDIVENEEEEWIKQTVKGS

Query:  VGTILTKRPITPSEEEERLNRRSRSAKLRVIQK
           ++T++PI  SE+E   N R+RSAKLR+ +K
Subjt:  VGTILTKRPITPSEEEERLNRRSRSAKLRVIQK

C6D563 Ribosomal RNA small subunit methyltransferase H1.4e-5439.76Show/hide
Query:  HIPVMLAEVLDVFSASSGRRLHSFVDCTVGAAGHSSAIIQAHPELKFYMGMDVDPIAVDKAEDRISALFSEDSDLKAYTVLKNFKFTKSLLTEADEKPLD
        HI V+L E +D  +   G     +VDCT+G AGHS  I+         +  D D  A+D A  R++         +   V  NF++ + +L   D   +D
Subjt:  HIPVMLAEVLDVFSASSGRRLHSFVDCTVGAAGHSSAIIQAHPELKFYMGMDVDPIAVDKAEDRISALFSEDSDLKAYTVLKNFKFTKSLLTEADEKPLD

Query:  --PGVDGILMDLGMSSMQVDDPGRGFSVLCDGPLDMRMDPQASVRAEDILNSWPEIEVGRILRVYGEESNWYSLQNKIMKARSQGGLHSTTQLVDLIRKS
          P VDGIL DLG+SS Q+D+  RGFS   D PLDMRMD   ++ A DI+NSW E E+ RIL VYGEE    S+  KI++AR    + +T +L +L++  
Subjt:  --PGVDGILMDLGMSSMQVDDPGRGFSVLCDGPLDMRMDPQASVRAEDILNSWPEIEVGRILRVYGEESNWYSLQNKIMKARSQGGLHSTTQLVDLIRKS

Query:  TPPFKGGRQGWIKTATRVFQALRIAVNDELKVLEDSLYSCFDCLAPGGRLAVISFHSLEDRVVKQTFLNIINNPEMEKDEDIREDIVENEEEEWIKQTVK
         P     R+     A R FQALRIAVNDEL   ED+L     C+ PGGR++VI+FHSLEDR+ KQ F + +       D                 + V 
Subjt:  TPPFKGGRQGWIKTATRVFQALRIAVNDELKVLEDSLYSCFDCLAPGGRLAVISFHSLEDRVVKQTFLNIINNPEMEKDEDIREDIVENEEEEWIKQTVK

Query:  GSVG--TILTKRPITPSEEEERLNRRSRSAKLRVIQK
        G  G   ++ ++PI P+E+E  +N RSRSAKLRV +K
Subjt:  GSVG--TILTKRPITPSEEEERLNRRSRSAKLRVIQK

Q65JY8 Ribosomal RNA small subunit methyltransferase H1.7e-5238.14Show/hide
Query:  HIPVMLAEVLDVFSASSGRRLHSFVDCTVGAAGHSSAIIQAHPELKFYMGMDVDPIAVDKAEDRISALFSEDSDLKAYTVLKNFKFTKSLLTEADEKPLD
        H  V+L E +D  +        ++VDCT+G AGHS  ++    E    +  D D  A+  A+++++     D + +   +  NF++ K  L E       
Subjt:  HIPVMLAEVLDVFSASSGRRLHSFVDCTVGAAGHSSAIIQAHPELKFYMGMDVDPIAVDKAEDRISALFSEDSDLKAYTVLKNFKFTKSLLTEADEKPLD

Query:  PGVDGILMDLGMSSMQVDDPGRGFSVLCDGPLDMRMDPQASVRAEDILNSWPEIEVGRILRVYGEESNWYSLQNKIMKARSQGGLHSTTQLVDLIRKSTP
          VDG+L DLG+SS Q+D P RGFS   D PLDMRMD  A + A++++N WP  ++ +I   YGEE     +  KI KAR Q  + +T QLVD+I+++ P
Subjt:  PGVDGILMDLGMSSMQVDDPGRGFSVLCDGPLDMRMDPQASVRAEDILNSWPEIEVGRILRVYGEESNWYSLQNKIMKARSQGGLHSTTQLVDLIRKSTP

Query:  PFKGGRQGWIKTATRVFQALRIAVNDELKVLEDSLYSCFDCLAPGGRLAVISFHSLEDRVVKQTFLNIINNPEMEKDEDIREDIVENEEEEWIKQTVKGS
             R+     A R+FQA+RIAVNDEL+V E++L    + L PGGR++VI+FHSLEDR+ K TF ++ + PE+         ++  E E  +K      
Subjt:  PFKGGRQGWIKTATRVFQALRIAVNDELKVLEDSLYSCFDCLAPGGRLAVISFHSLEDRVVKQTFLNIINNPEMEKDEDIREDIVENEEEEWIKQTVKGS

Query:  VGTILTKRPITPSEEEERLNRRSRSAKLRVIQK
           ++ ++PIT SEEE   N R+RSAKLR+ +K
Subjt:  VGTILTKRPITPSEEEERLNRRSRSAKLRVIQK

Q6MEG4 Ribosomal RNA small subunit methyltransferase H3.0e-5439.88Show/hide
Query:  HIPVMLAEVLDVFSASSGRRLHSFVDCTVGAAGHSSAIIQAHPELKFYMGMDVDPIAVDKAEDRISALFSEDSDLKAYTVLKNFKFTKSLLTEADEKPLD
        H  V+L EV++ F      +L  F+D T+GA GH+ AI++ HPE++ Y+G+D DP A++ A  R+     E    K      NF      L E       
Subjt:  HIPVMLAEVLDVFSASSGRRLHSFVDCTVGAAGHSSAIIQAHPELKFYMGMDVDPIAVDKAEDRISALFSEDSDLKAYTVLKNFKFTKSLLTEADEKPLD

Query:  PGVDGILMDLGMSSMQVDDPGRGFSVLCDGPLDMRMDPQASVRAEDILNSWPEIEVGRILRVYGEESNWYSLQNKIMKARSQGGLHSTTQLVDLIRKSTP
          +DG+L+DLG+SSMQ+D P RGFS   DGPLDMRM+P+  + A DI+N+W E ++G+I R YGEE  W      I++AR    + +TT L +L++   P
Subjt:  PGVDGILMDLGMSSMQVDDPGRGFSVLCDGPLDMRMDPQASVRAEDILNSWPEIEVGRILRVYGEESNWYSLQNKIMKARSQGGLHSTTQLVDLIRKSTP

Query:  PFKGGRQGWIKTATRVFQALRIAVNDELKVLEDSLYSCFDCLAPGGRLAVISFHSLEDRVVKQTFLNIINNPEMEKDEDIREDIVENEEEEWIKQTV---
         F    +  I   T +FQALRI VN EL VLE  +   FD L PGGR+AVISFHSLEDR+VK                ++R    +  E   +   +   
Subjt:  PFKGGRQGWIKTATRVFQALRIAVNDELKVLEDSLYSCFDCLAPGGRLAVISFHSLEDRVVKQTFLNIINNPEMEKDEDIREDIVENEEEEWIKQTV---

Query:  KGSVGTILTKRPITPSEEEERLNRRSRSAKLRVIQK
        K  V  ++ ++PI P E+E + N RSRSAK R+ +K
Subjt:  KGSVGTILTKRPITPSEEEERLNRRSRSAKLRVIQK

Q8GE08 Ribosomal RNA small subunit methyltransferase H (Fragment)3.7e-5242.94Show/hide
Query:  HIPVMLAEVLDVFSASSGRRLHSFVDCTVGAAGHSSAIIQAHPELKFYMGMDVDPIAVDKAEDRISALFSEDSDLKAYTVLKNFKFTKSLLTEADEKPLD
        HIPV+L EVL+   A   R    ++D TVG  GHS+AI++        +G+D DP A+  A  ++   F++   L    V  NF+   S++ E  +K   
Subjt:  HIPVMLAEVLDVFSASSGRRLHSFVDCTVGAAGHSSAIIQAHPELKFYMGMDVDPIAVDKAEDRISALFSEDSDLKAYTVLKNFKFTKSLLTEADEKPLD

Query:  PGVDGILMDLGMSSMQVDDPGRGFSVLCDGPLDMRMDPQASVRAEDILNSWPEIEVGRILRVYGEESNWYSLQNKIMKARSQGGLHSTTQLVDLIRKSTP
          +DGIL+D+G+SS Q+D+  RGF+   + PLDMRM+P  +V A  ILN +PE E+ RIL  YGEE     +   I+  R++  L STT LVD+IR + P
Subjt:  PGVDGILMDLGMSSMQVDDPGRGFSVLCDGPLDMRMDPQASVRAEDILNSWPEIEVGRILRVYGEESNWYSLQNKIMKARSQGGLHSTTQLVDLIRKSTP

Query:  PFKGGRQGWIKTATRVFQALRIAVNDELKVLEDSLYSCFDCLAPGGRLAVISFHSLEDRVVKQTFLNIINNPEMEKDEDIREDIVENEEEEWIKQTVKGS
             RQ     A R FQALRIAVNDEL  L+++L +  D LAP GRLAVISFHSLEDR+VK  F       E  K      D+        +    K +
Subjt:  PFKGGRQGWIKTATRVFQALRIAVNDELKVLEDSLYSCFDCLAPGGRLAVISFHSLEDRVVKQTFLNIINNPEMEKDEDIREDIVENEEEEWIKQTVKGS

Query:  VGTILTKRPITPSEEEERLNRRSRSAKLRVIQK
           I+T++PIT SEEE ++N RS+SAKLRV QK
Subjt:  VGTILTKRPITPSEEEERLNRRSRSAKLRVIQK

Arabidopsis top hitse value%identityAlignment
AT5G10910.1 mraW methylase family protein1.8e-12662Show/hide
Query:  KANKEKQKKSRNLKASTASSSSLKLALVKEKRRTRSTKEFDENAIQFGD--TAAHIPVMLAEVLDVFSASSGRRLHSFVDCTVGAAGHSSAIIQAHPELK
        K  KEK+K+ + ++   A++ ++   + KEKRRTRS++ ++   +  GD   ++H+PVML EVLD+FS+    RL SFVDCT+GAAGHSS+IIQ+H ELK
Subjt:  KANKEKQKKSRNLKASTASSSSLKLALVKEKRRTRSTKEFDENAIQFGD--TAAHIPVMLAEVLDVFSASSGRRLHSFVDCTVGAAGHSSAIIQAHPELK

Query:  FYMGMDVDPIAVDKAEDRISALFSEDSDLKAYTVLKNFKFTKSLLTEADEKPLDPGVDGILMDLGMSSMQVDDPGRGFSVLCDGPLDMRMDPQASVRAED
         ++GMDVDP+A       I +L      LKA  VLKNFK+ KS++ +   + LD GVDGILMDLGMSSMQV++P RGFSVL +GPLDMRMDPQA++ AED
Subjt:  FYMGMDVDPIAVDKAEDRISALFSEDSDLKAYTVLKNFKFTKSLLTEADEKPLDPGVDGILMDLGMSSMQVDDPGRGFSVLCDGPLDMRMDPQASVRAED

Query:  ILNSWPEIEVGRILRVYGEESNWYSLQNKIMKARSQGGLHSTTQLVDLIRKSTPPFKGGRQGWIKTATRVFQALRIAVNDELKVLEDSLYSCFDCLAPGG
        I+NSWPE E+GR+LR YGEESNWY LQN+I+KAR  GGLHST +LVDLIR ++P  +GGRQGWIKTATRVFQ LRIAVNDELK L++SLYS FD LAPGG
Subjt:  ILNSWPEIEVGRILRVYGEESNWYSLQNKIMKARSQGGLHSTTQLVDLIRKSTPPFKGGRQGWIKTATRVFQALRIAVNDELKVLEDSLYSCFDCLAPGG

Query:  RLAVISFHSLEDRVVKQTFLNIIN-------------NPEMEKDEDIREDIVENEEEEWIKQTVKGSVGTILTKRPITPSEEEERLNRRSRSAKLRVIQK
        RLAVISFHSLEDRVVKQTFL+I+               PE + +E + +++   E+EEWIKQTV  S G ILTKRPITPSEEEERLNRR+RSAKLRVIQK
Subjt:  RLAVISFHSLEDRVVKQTFLNIIN-------------NPEMEKDEDIREDIVENEEEEWIKQTVKGSVGTILTKRPITPSEEEERLNRRSRSAKLRVIQK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTTTGAAATCAATGGCGAAGCTCCCGTTGTTTTCTCCTCCTCCGTCACAATCTCTCCGTTTTCGGTCTTTGATCTCCGCCGGTACTTCTGCGCCGCCGAGATTTCT
ACGGTGCTGTTCTTCCATATCTACCGCAACTGATGTTAGGAACAAGGCGAATAAGGAGAAGCAGAAGAAATCCAGAAATCTTAAGGCGAGTACTGCTTCGTCTTCGTCTT
TGAAATTAGCTTTGGTGAAGGAGAAAAGGAGGACTCGTTCCACTAAGGAGTTTGATGAAAACGCCATCCAGTTCGGGGATACGGCAGCTCACATTCCGGTGATGCTTGCG
GAGGTATTGGACGTGTTCTCTGCTTCTTCCGGAAGGCGGCTACATTCCTTTGTAGATTGCACCGTCGGTGCCGCTGGACACTCCTCTGCTATAATTCAAGCGCATCCAGA
GTTGAAATTTTACATGGGAATGGATGTCGATCCAATTGCAGTTGATAAGGCTGAAGATCGGATAAGTGCTCTCTTTAGTGAGGATTCTGATCTGAAAGCATATACAGTTC
TGAAGAACTTTAAATTTACGAAATCGCTACTGACTGAGGCAGATGAGAAACCTTTAGATCCTGGAGTTGATGGGATTTTGATGGATTTGGGAATGTCATCCATGCAGGTG
GACGATCCTGGAAGGGGCTTTAGTGTCCTTTGTGATGGACCTCTTGATATGCGAATGGATCCTCAGGCAAGTGTGAGAGCGGAGGACATATTGAACAGTTGGCCAGAAAT
AGAAGTGGGGCGTATCTTGCGGGTTTATGGAGAGGAGAGCAATTGGTATTCACTCCAGAATAAAATAATGAAAGCTCGATCACAGGGTGGATTGCATTCCACCACTCAAT
TAGTGGATCTCATACGCAAATCTACTCCTCCCTTCAAAGGAGGAAGGCAAGGTTGGATAAAGACAGCAACCAGGGTATTCCAAGCTCTGAGAATTGCTGTTAATGACGAA
TTGAAGGTATTAGAAGACTCCCTATATAGTTGTTTCGACTGTCTCGCCCCGGGAGGTCGACTTGCAGTCATCTCCTTCCACAGTCTGGAGGATAGGGTTGTGAAACAAAC
ATTTCTCAACATCATTAATAACCCGGAAATGGAAAAAGATGAAGATATAAGAGAAGATATCGTTGAGAATGAAGAAGAAGAATGGATTAAGCAAACTGTAAAAGGTTCAG
TGGGAACAATCCTAACTAAAAGACCGATAACTCCATCTGAAGAGGAAGAGAGGTTGAATCGACGTAGCAGGAGCGCTAAGCTAAGGGTTATTCAAAAGAATAATTAA
mRNA sequenceShow/hide mRNA sequence
ACAAAATCTATTTCTACGTTTGACCACTATTATTATTGTTTTCTAAAATATTTATATTTTTATATTTTCCACTTCTAATTATCAAATATTCAATGGTCTAAGCACAACAT
AGATCCTCACAGTAACGGCCTACGGAATAATTGAAACAAAACAAAAGTGAAATTCCAAACAGAAAATAATATGTGACAAACAATGATGAAAAATTTCTTAAAATATAAGT
AAGAATCGTAATGAGAATTACTGAACCCAAAAACTTTTACATGACGAAACGTTGCCGTTTCGGTTCCCCCGAGTCTTCTCCTTTCTTCTGCACATTCTGGCGGCAACCGA
TCTCAAACACCTGAAAATCAATGGCGATCGCGGGAATGGATCTCTCTCTCTCTCTTTTCGGAATCAGATCCACCGTCTGTGGTTGCGGCTGCCGTGCGCTCCTCGGAACC
ACTGATCCCAATTGGCGTCCGTCGGGCCTCTTGTTCTGTAATTTGCATCATTTTTTAGTTGATTAGGCCGAGAAATGATCATTGATATTGATGATTGATGATAAGAAATG
TTAATGGCGACAATCTTACTCTGGAAGCAGCGGATCAGGAAACGACTCGACGCTCGAGACGAGTCTGTACGTAAACAGACGGTGGAGGCTTCGCCTATGCTCTCGAGCTT
CTGCAACTCATCCTGCATCACGATTTATCATCGTCTTCAATTTCTGAGGAAAATAATTGGTGCAAACATGAACTGCGTTACTAGATATTAGATGAGGAACGCCAATCACC
TCGATGATACTGATTTCATTCTGGAGTCTGGTTATGGCCGCTGTGATTCTGTGTCTTCCAAACAAATTATTGTTGTTGTTGTTGCATCGAACTGCAGATTTAGGTTTGGT
TTCAGAATTGGCCTCTTGTTCTCTCTTCGCCGCTTCTTCTCCTCCACCTCCCGGCGGCGACGGCGACGGAAACGGCTGCGATTGAGGTTGGTCGTCCATTTGCCACTACA
GAAGGAAGAGACGAGATTATGATGAAAAAGAATAAAGTGGGCAGTCAAAGTTTCAAACTCTGCGGAAAAGGGGAAGCAAAAAAGGCCCAAAACAGCAATTTCTGGACTGC
AGACTACAGTTGTAAGGTAAAAGAGGATAAGCCGGTCCAATCGGGAAATTAGGGCTCGGGAACGGAAGTGTCACAACACAACTTCCCTCCATTTTCAGAGTTTATGGAAG
ACGAACACTCTCCATTTGACGAACTGCAAGATTTCCGCATTTTGCACCTCCACAAATTTCGTTTTCTGATCGGAACGGCACTGAAATGGCTTTGAAATCAATGGCGAAGC
TCCCGTTGTTTTCTCCTCCTCCGTCACAATCTCTCCGTTTTCGGTCTTTGATCTCCGCCGGTACTTCTGCGCCGCCGAGATTTCTACGGTGCTGTTCTTCCATATCTACC
GCAACTGATGTTAGGAACAAGGCGAATAAGGAGAAGCAGAAGAAATCCAGAAATCTTAAGGCGAGTACTGCTTCGTCTTCGTCTTTGAAATTAGCTTTGGTGAAGGAGAA
AAGGAGGACTCGTTCCACTAAGGAGTTTGATGAAAACGCCATCCAGTTCGGGGATACGGCAGCTCACATTCCGGTGATGCTTGCGGAGGTATTGGACGTGTTCTCTGCTT
CTTCCGGAAGGCGGCTACATTCCTTTGTAGATTGCACCGTCGGTGCCGCTGGACACTCCTCTGCTATAATTCAAGCGCATCCAGAGTTGAAATTTTACATGGGAATGGAT
GTCGATCCAATTGCAGTTGATAAGGCTGAAGATCGGATAAGTGCTCTCTTTAGTGAGGATTCTGATCTGAAAGCATATACAGTTCTGAAGAACTTTAAATTTACGAAATC
GCTACTGACTGAGGCAGATGAGAAACCTTTAGATCCTGGAGTTGATGGGATTTTGATGGATTTGGGAATGTCATCCATGCAGGTGGACGATCCTGGAAGGGGCTTTAGTG
TCCTTTGTGATGGACCTCTTGATATGCGAATGGATCCTCAGGCAAGTGTGAGAGCGGAGGACATATTGAACAGTTGGCCAGAAATAGAAGTGGGGCGTATCTTGCGGGTT
TATGGAGAGGAGAGCAATTGGTATTCACTCCAGAATAAAATAATGAAAGCTCGATCACAGGGTGGATTGCATTCCACCACTCAATTAGTGGATCTCATACGCAAATCTAC
TCCTCCCTTCAAAGGAGGAAGGCAAGGTTGGATAAAGACAGCAACCAGGGTATTCCAAGCTCTGAGAATTGCTGTTAATGACGAATTGAAGGTATTAGAAGACTCCCTAT
ATAGTTGTTTCGACTGTCTCGCCCCGGGAGGTCGACTTGCAGTCATCTCCTTCCACAGTCTGGAGGATAGGGTTGTGAAACAAACATTTCTCAACATCATTAATAACCCG
GAAATGGAAAAAGATGAAGATATAAGAGAAGATATCGTTGAGAATGAAGAAGAAGAATGGATTAAGCAAACTGTAAAAGGTTCAGTGGGAACAATCCTAACTAAAAGACC
GATAACTCCATCTGAAGAGGAAGAGAGGTTGAATCGACGTAGCAGGAGCGCTAAGCTAAGGGTTATTCAAAAGAATAATTAACGAGTGAGCATTTCTCTCTATATCTTTA
TATGAAATCATTTGCATTGCATTTTATTACTATTCTGATTGCTTTTATAATAGCCTTTCTACCTCCATAAATTCGGAAGCTCCTTAGCAATGCCTAAAAGACCCCATGCT
TTTTTGGTTGGACCAACTATTTTTGACAAGTCTTTATTCATTGTTCAAGTAAGAAGGGAAACTACAGAGATTATTTACAAAATGTCATTTTCTCATTGAATGGGAAACTT
TTAAAACAGCAGAGAAGAGAGTAGAAAATGGAGGTGCATTGCATGCACCAGCAACCATGTCAAATATTAATTCTACATAACAATACTTGAGATATAGAGAGACTAAAATA
CTACAATTTCCGCCGTTCTTATAGGCCGACCCAATATGAACGGCCGTTACCGTTATGAAGCGGAGACTGTGACGGAGATGACACACACCACTCGTACCAAACCTGCACGT
GCACCCACACATTTAATTTATTATATGATTAAACAAATAGGAAAAAAAAAAAAGAATCACCTTATTAGGACTACAACATCGCCAGAAATGTACTTCTAGAGGAGAGCCAG
GTGGCACGCATATCGGACTCCTTAATGGGAAAAATATAGCGAACCTGAATAACAACAAATATTAAGTAATTAGAAGATATTATGGAATGAACAACAGTTCAAAAACAAGT
GAGAGATCATACCAACTGAACATGTTTGGTGTGGCTGTTGATGGTTCAATGCCC
Protein sequenceShow/hide protein sequence
MALKSMAKLPLFSPPPSQSLRFRSLISAGTSAPPRFLRCCSSISTATDVRNKANKEKQKKSRNLKASTASSSSLKLALVKEKRRTRSTKEFDENAIQFGDTAAHIPVMLA
EVLDVFSASSGRRLHSFVDCTVGAAGHSSAIIQAHPELKFYMGMDVDPIAVDKAEDRISALFSEDSDLKAYTVLKNFKFTKSLLTEADEKPLDPGVDGILMDLGMSSMQV
DDPGRGFSVLCDGPLDMRMDPQASVRAEDILNSWPEIEVGRILRVYGEESNWYSLQNKIMKARSQGGLHSTTQLVDLIRKSTPPFKGGRQGWIKTATRVFQALRIAVNDE
LKVLEDSLYSCFDCLAPGGRLAVISFHSLEDRVVKQTFLNIINNPEMEKDEDIREDIVENEEEEWIKQTVKGSVGTILTKRPITPSEEEERLNRRSRSAKLRVIQKNN