; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lsi06G010070 (gene) of Bottle gourd (USVL1VR-Ls) v1 genome

Gene IDLsi06G010070
OrganismLagenaria siceraria USVL1VR-Ls (Bottle gourd (USVL1VR-Ls) v1)
DescriptionN-alpha-acetyltransferase 35, NatC auxiliary subunit isoform X1
Genome locationchr06:20138048..20161814
RNA-Seq ExpressionLsi06G010070
SyntenyLsi06G010070
Gene Ontology termsGO:0017196 - N-terminal peptidyl-methionine acetylation (biological process)
GO:0048268 - clathrin coat assembly (biological process)
GO:0012505 - endomembrane system (cellular component)
GO:0030136 - clathrin-coated vesicle (cellular component)
GO:0031417 - NatC complex (cellular component)
GO:0005545 - 1-phosphatidylinositol binding (molecular function)
GO:0016740 - transferase activity (molecular function)
GO:0030276 - clathrin binding (molecular function)
InterPro domainsIPR007244 - -alpha-acetyltransferase 35, NatC auxiliary subunit
IPR008942 - ENTH/VHS
IPR011417 - AP180 N-terminal homology (ANTH) domain
IPR013809 - ENTH domain
IPR014712 - ANTH domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG5559472.1 hypothetical protein RHGRI_009117 [Rhododendron griersonianum]0.0e+0064.13Show/hide
Query:  IPSGEHTVWADVSPLLEAACGD----------------------------------------------------LQDGELIHGDNFNLFSAMSALEIMDP
        IPSGE TVWAD+S LL++AC                                                      L+DGELIHG+NFNLF+AMSALEIMDP
Subjt:  IPSGEHTVWADVSPLLEAACGD----------------------------------------------------LQDGELIHGDNFNLFSAMSALEIMDP

Query:  KMDSGMICKYYSVDEAIENGAAPIPLSFDRTVDVQCTIDIMDHLLSCEATWHKGHSLAQTVFSCIYLLRPDRTSSHALLHSYCSVIRATCKAVIAVVSDA
        KMDSG++ +YYSVDEAI++GAAPIPLS DR VDVQC IDIMDHLL+CEATWHKGHSLAQTVFSCIYLL+PDRTS HALLHSYC VIRATC  V++VVSDA
Subjt:  KMDSGMICKYYSVDEAIENGAAPIPLSFDRTVDVQCTIDIMDHLLSCEATWHKGHSLAQTVFSCIYLLRPDRTSSHALLHSYCSVIRATCKAVIAVVSDA

Query:  RTHEEEDLFIMAYGLPLSGDGDDKCLSMLHAVEETICRQLRAY------------VEPLQSTFDLEEHYCRVLLCRLRFRKHFYHVLACMRRPQGRGLDL
        RT+EEEDLF M YGLPL GDGD+KCLSMLHAVEETI RQLRA             +EPLQ+  DLEE +C+ LLCR+RFRKHFYH L  M+RPQGRGL+L
Subjt:  RTHEEEDLFIMAYGLPLSGDGDDKCLSMLHAVEETICRQLRAY------------VEPLQSTFDLEEHYCRVLLCRLRFRKHFYHVLACMRRPQGRGLDL

Query:  ARKHIASCLLELDLIRNSSTFLSNNSFGISKEDLEDMTTASGRQPLGFDSSLNCRLSAPTPPRAIKLLSWKKALDYFVKLLHDLDNICSYSLDPFLEGVF
        ARKHI SCL ELD I  SS FL +N+ G  ++ LED TTASG QP+GFD+SLNCRLSAPTPPRAIK+LSW KA++YF KLLHDLD +CSYSLDP LEGV 
Subjt:  ARKHIASCLLELDLIRNSSTFLSNNSFGISKEDLEDMTTASGRQPLGFDSSLNCRLSAPTPPRAIKLLSWKKALDYFVKLLHDLDNICSYSLDPFLEGVF

Query:  RFVVQFQKTQPDLVARSLLQFLLVQDGKLYGKDPLYAVITKAAGLPESAKNHENQKNQYIVQLGQLVINLLKVLCTNSAWQRRKLGKILQDWRVIYMQME
        RFVV+FQK QPDLVAR+ LQ LL+ DGKLYG+DP++ VI+KA  LPE  K+HE QKN+ + QLGQLVI+LLK+LCTN+AWQRRKLGKILQDWRV YMQ+E
Subjt:  RFVVQFQKTQPDLVARSLLQFLLVQDGKLYGKDPLYAVITKAAGLPESAKNHENQKNQYIVQLGQLVINLLKVLCTNSAWQRRKLGKILQDWRVIYMQME

Query:  MAFKKDFAEILSISNGENAWMKIFQHILVWVEEQTYWISSRFLVLGFELELYSPSEYCMVYWYLSVVLIKLVEKIHLRALMSNETGECANLAAMLGKRKG
        +AFKK+F +I S    E+   KI +HILVWVEEQTYWI+SRFLVLGFELELYSPSEYCMVYWY+ VV IKL EK HL+ L S+ET          GKRK 
Subjt:  MAFKKDFAEILSISNGENAWMKIFQHILVWVEEQTYWISSRFLVLGFELELYSPSEYCMVYWYLSVVLIKLVEKIHLRALMSNETGECANLAAMLGKRKG

Query:  KKKGAS-KDIGKDFRIPPAVSFLQCQICLAEGLVVMLAALRNEHMIAQSPSPFNSEYE---------------------------------NLVTYNCFK
        KKK  S KD+ +D++IPP V  LQC I LAE L +MLAALRNEH +  S  PFN+EYE                                 N V +NCFK
Subjt:  KKKGAS-KDIGKDFRIPPAVSFLQCQICLAEGLVVMLAALRNEHMIAQSPSPFNSEYE---------------------------------NLVTYNCFK

Query:  DAQKIAKELKSSFSNDREKLVELRRIEQVAEHNSVALNLIHKLKDLEAFSLLVCPVFARRGCLVGGFAAQDVLKDNMVDLTTFGCQENFQDIKFQVGLKL
        DA +IAKEL+SSFS+D E++ ELRRIEQVAEHN VALNLI +L  LE     + P    R       AAQ+    N   +  F C               
Subjt:  DAQKIAKELKSSFSNDREKLVELRRIEQVAEHNSVALNLIHKLKDLEAFSLLVCPVFARRGCLVGGFAAQDVLKDNMVDLTTFGCQENFQDIKFQVGLKL

Query:  SNKIQEIKESWMTDHHLSQHVARSEDRMQEPNENALTTDRPVFSLSRFSLRFINSEGERMSGGGTQNSFRKALGALKDTTTVSLAKVNSDYKELDIAIVK
                    +D H                                   F +  G RMS GGTQNS RKALGA+KDTTTVSLAKVNSDYKELDIAIVK
Subjt:  SNKIQEIKESWMTDHHLSQHVARSEDRMQEPNENALTTDRPVFSLSRFSLRFINSEGERMSGGGTQNSFRKALGALKDTTTVSLAKVNSDYKELDIAIVK

Query:  STNHVERPAKEKHIRAIFAAISATRPRADVAYCIHALARRLSKTHNWAVALKTLVVIHRALREVDPTFHEELINYGRRRNHMLNLSHFKDDSSANAWDYS
        +TNHVERPAKEK+I+AIFAA+SATRPRADVAYCIHALARRLSKT NWAVALKTL+VIHRALREVDPTFHEEL NYGR R+HMLNL+HFKDDSS +AWDYS
Subjt:  STNHVERPAKEKHIRAIFAAISATRPRADVAYCIHALARRLSKTHNWAVALKTLVVIHRALREVDPTFHEELINYGRRRNHMLNLSHFKDDSSANAWDYS

Query:  AWVRSYALFLEERLECFRVLKYDVETDRARTKDLDTAELLEQLPALQELLYRVLGCQPQGAAVHNFVIQLALSLVASESVKIYQAISDGTINLVDKFFEM
        AWVR+YALFLEER+EC+RVLKYDVE DR RTKDLDT  LLEQLPALQ+LL RVLGCQP+GAAV NFVIQLALS+VASES+ IY AISDGT+NLVDKFFEM
Subjt:  AWVRSYALFLEERLECFRVLKYDVETDRARTKDLDTAELLEQLPALQELLYRVLGCQPQGAAVHNFVIQLALSLVASESVKIYQAISDGTINLVDKFFEM

Query:  RRQDALKALDIYRRAGQQAERLSEFYEVCKSLDIGRGEKFIKIEQPPASFIQAMEEYVREAPQVSTVRKEQVADNKLAAPKEVLAIEYKKEPAAQVEQTV
           DAL+ALDIYR+AGQQAERLSEFYE+CK LDI RGE FIKIEQPPASF+QAMEEY REAP+ ST RK+QV   K A+PK +LAIE  K  A  +E   
Subjt:  RRQDALKALDIYRRAGQQAERLSEFYEVCKSLDIGRGEKFIKIEQPPASFIQAMEEYVREAPQVSTVRKEQVADNKLAAPKEVLAIEYKKEPAAQVEQTV

Query:  APPPAPSP---PPPEPVKVEA-VVTEQPDLLGLNDPVPEATSNLDEKNSLALAIVPVADQQTSSAPSQVNGTTTTGWELALVTAPSSNESVAATSKLAGG
          PP P P   P PEPVKVEA V    PDLLGLNDP P A S LDEKN++ALAIVPV DQ T++     NG   TGWELALVTAPSSN S + TS+L GG
Subjt:  APPPAPSP---PPPEPVKVEA-VVTEQPDLLGLNDPVPEATSNLDEKNSLALAIVPVADQQTSSAPSQVNGTTTTGWELALVTAPSSNESVAATSKLAGG

Query:  LDLLTLDSLYDDAIRRNNQNVSYNPWEPVPVHGAMMQQPIHDPFFASTAVAAPHSVQMAAMANQQQAYMLQQQQQQQMMMMAPPPQQ-----SNPFGNPH
        LD LTLDSLYDDAIRR+NQN SYNPWE  P+ G +M Q  HDPFFAS +VAAPHSVQMAAMANQQQA+M QQQQQQQMMMM    QQ     SNPF N +
Subjt:  LDLLTLDSLYDDAIRRNNQNVSYNPWEPVPVHGAMMQQPIHDPFFASTAVAAPHSVQMAAMANQQQAYMLQQQQQQQMMMMAPPPQQ-----SNPFGNPH

Query:  GTNAYHYGPGMPVHASNPY
        G N + YGPGMPV A NP+
Subjt:  GTNAYHYGPGMPVHASNPY

KAG5559473.1 hypothetical protein RHGRI_009117 [Rhododendron griersonianum]0.0e+0067.78Show/hide
Query:  DLQDGELIHGDNFNLFSAMSALEIMDPKMDSGMICKYYSVDEAIENGAAPIPLSFDRTVDVQCTIDIMDHLLSCEATWHKGHSLAQTVFSCIYLLRPDRT
        DL+DGELIHG+NFNLF+AMSALEIMDPKMDSG++ +YYSVDEAI++GAAPIPLS DR VDVQC IDIMDHLL+CEATWHKGHSLAQTVFSCIYLL+PDRT
Subjt:  DLQDGELIHGDNFNLFSAMSALEIMDPKMDSGMICKYYSVDEAIENGAAPIPLSFDRTVDVQCTIDIMDHLLSCEATWHKGHSLAQTVFSCIYLLRPDRT

Query:  SSHALLHSYCSVIRATCKAVIAVVSDARTHEEEDLFIMAYGLPLSGDGDDKCLSMLHAVEETICRQLRAY------------VEPLQSTFDLEEHYCRVL
        S HALLHSYC VIRATC  V++VVSDART+EEEDLF M YGLPL GDGD+KCLSMLHAVEETI RQLRA             +EPLQ+  DLEE +C+ L
Subjt:  SSHALLHSYCSVIRATCKAVIAVVSDARTHEEEDLFIMAYGLPLSGDGDDKCLSMLHAVEETICRQLRAY------------VEPLQSTFDLEEHYCRVL

Query:  LCRLRFRKHFYHVLACMRRPQGRGLDLARKHIASCLLELDLIRNSSTFLSNNSFGISKEDLEDMTTASGRQPLGFDSSLNCRLSAPTPPRAIKLLSWKKA
        LCR+RFRKHFYH L  M+RPQGRGL+LARKHI SCL ELD I  SS FL +N+ G  ++ LED TTASG QP+GFD+SLNCRLSAPTPPRAIK+LSW KA
Subjt:  LCRLRFRKHFYHVLACMRRPQGRGLDLARKHIASCLLELDLIRNSSTFLSNNSFGISKEDLEDMTTASGRQPLGFDSSLNCRLSAPTPPRAIKLLSWKKA

Query:  LDYFVKLLHDLDNICSYSLDPFLEGVFRFVVQFQKTQPDLVARSLLQFLLVQDGKLYGKDPLYAVITKAAGLPESAKNHENQKNQYIVQLGQLVINLLKV
        ++YF KLLHDLD +CSYSLDP LEGV RFVV+FQK QPDLVAR+ LQ LL+ DGKLYG+DP++ VI+KA  LPE  K+HE QKN+ + QLGQLVI+LLK+
Subjt:  LDYFVKLLHDLDNICSYSLDPFLEGVFRFVVQFQKTQPDLVARSLLQFLLVQDGKLYGKDPLYAVITKAAGLPESAKNHENQKNQYIVQLGQLVINLLKV

Query:  LCTNSAWQRRKLGKILQDWRVIYMQMEMAFKKDFAEILSISNGENAWMKIFQHILVWVEEQTYWISSRFLVLGFELELYSPSEYCMVYWYLSVVLIKLVE
        LCTN+AWQRRKLGKILQDWRV YMQ+E+AFKK+F +I S    E+   KI +HILVWVEEQTYWI+SRFLVLGFELELYSPSEYCMVYWY+ VV IKL E
Subjt:  LCTNSAWQRRKLGKILQDWRVIYMQMEMAFKKDFAEILSISNGENAWMKIFQHILVWVEEQTYWISSRFLVLGFELELYSPSEYCMVYWYLSVVLIKLVE

Query:  KIHLRALMSNETGECANLAAMLGKRKGKKKGAS-KDIGKDFRIPPAVSFLQCQICLAEGLVVMLAALRNEHMIAQSPSPFNSEYENLVTYNCFKDAQKIA
        K HL+ L S+ET          GKRK KKK  S KD+ +D++IPP V  LQC I LAE L +MLAALRNEH +  S  PFN+EYEN V +NCFKDA +IA
Subjt:  KIHLRALMSNETGECANLAAMLGKRKGKKKGAS-KDIGKDFRIPPAVSFLQCQICLAEGLVVMLAALRNEHMIAQSPSPFNSEYENLVTYNCFKDAQKIA

Query:  KELKSSFSNDREKLVELRRIEQVAEHNSVALNLIHKLKDLEAFSLLVCPVFARRGCLVGGFAAQDVLKDNMVDLTTFGCQENFQDIKFQVGLKLSNKIQE
        KEL+SSFS+D E++ ELRRIEQVAEHN VALNLI +L  LE     + P    R       AAQ+    N   +  F C                     
Subjt:  KELKSSFSNDREKLVELRRIEQVAEHNSVALNLIHKLKDLEAFSLLVCPVFARRGCLVGGFAAQDVLKDNMVDLTTFGCQENFQDIKFQVGLKLSNKIQE

Query:  IKESWMTDHHLSQHVARSEDRMQEPNENALTTDRPVFSLSRFSLRFINSEGERMSGGGTQNSFRKALGALKDTTTVSLAKVNSDYKELDIAIVKSTNHVE
              +D H                                   F +  G RMS GGTQNS RKALGA+KDTTTVSLAKVNSDYKELDIAIVK+TNHVE
Subjt:  IKESWMTDHHLSQHVARSEDRMQEPNENALTTDRPVFSLSRFSLRFINSEGERMSGGGTQNSFRKALGALKDTTTVSLAKVNSDYKELDIAIVKSTNHVE

Query:  RPAKEKHIR---------AIFAAISATRPRADVAYCIHALARRLSKTHNWAVALKTLVVIHRALREVDPTFHEELINYGRRRNHMLNLSHFKDDSSANAW
        RPAKEK+I+         AIFAA+SATRPRADVAYCIHALARRLSKT NWAVALKTL+VIHRALREVDPTFHEEL NYGR R+HMLNL+HFKDDSS +AW
Subjt:  RPAKEKHIR---------AIFAAISATRPRADVAYCIHALARRLSKTHNWAVALKTLVVIHRALREVDPTFHEELINYGRRRNHMLNLSHFKDDSSANAW

Query:  DYSAWVRSYALFLEERLECFRVLKYDVETDRARTKDLDTAELLEQLPALQELLYRVLGCQPQGAAVHNFVIQLALSLVASESVKIYQAISDGTINLVDKF
        DYSAWVR+YALFLEER+EC+RVLKYDVE DR RTKDLDT  LLEQLPALQ+LL RVLGCQP+GAAV NFVIQLALS+VASES+ IY AISDGT+NLVDKF
Subjt:  DYSAWVRSYALFLEERLECFRVLKYDVETDRARTKDLDTAELLEQLPALQELLYRVLGCQPQGAAVHNFVIQLALSLVASESVKIYQAISDGTINLVDKF

Query:  FEMRRQDALKALDIYRRAGQQAERLSEFYEVCKSLDIGRGEKFIKIEQPPASFIQAMEEYVREAPQVSTVRKEQVADNKLAAPKEVLAIEYKKEPAAQVE
        FEM   DAL+ALDIYR+AGQQAERLSEFYE+CK LDI RGE FIKIEQPPASF+QAMEEY REAP+ ST RK+QV   K A+PK +LAIE  K  A  +E
Subjt:  FEMRRQDALKALDIYRRAGQQAERLSEFYEVCKSLDIGRGEKFIKIEQPPASFIQAMEEYVREAPQVSTVRKEQVADNKLAAPKEVLAIEYKKEPAAQVE

Query:  QTVAPPPAPSP---PPPEPVKVEA-VVTEQPDLLGLNDPVPEATSNLDEKNSLALAIVPVADQQTSSAPSQVNGTTTTGWELALVTAPSSNESVAATSKL
             PP P P   P PEPVKVEA V    PDLLGLNDP P A S LDEKN++ALAIVPV DQ T++     NG   TGWELALVTAPSSN S + TS+L
Subjt:  QTVAPPPAPSP---PPPEPVKVEA-VVTEQPDLLGLNDPVPEATSNLDEKNSLALAIVPVADQQTSSAPSQVNGTTTTGWELALVTAPSSNESVAATSKL

Query:  AGGLDLLTLDSLYDDAIRRNNQNVSYNPWEPVPVHGAMMQQPIHDPFFASTAVAAPHSVQMAAMANQQQAYMLQQQQQQQMMMMAPPPQQ-----SNPFG
         GGLD LTLDSLYDDAIRR+NQN SYNPWE  P+ G +M Q  HDPFFAS +VAAPHSVQMAAMANQQQA+M QQQQQQQMMMM    QQ     SNPF 
Subjt:  AGGLDLLTLDSLYDDAIRRNNQNVSYNPWEPVPVHGAMMQQPIHDPFFASTAVAAPHSVQMAAMANQQQAYMLQQQQQQQMMMMAPPPQQ-----SNPFG

Query:  NPHGTNAYHYGPGMPVHASNPY
        N +G N + YGPGMPV A NP+
Subjt:  NPHGTNAYHYGPGMPVHASNPY

KAG5559474.1 hypothetical protein RHGRI_009117 [Rhododendron griersonianum]0.0e+0068.24Show/hide
Query:  DLQDGELIHGDNFNLFSAMSALEIMDPKMDSGMICKYYSVDEAIENGAAPIPLSFDRTVDVQCTIDIMDHLLSCEATWHKGHSLAQTVFSCIYLLRPDRT
        DL+DGELIHG+NFNLF+AMSALEIMDPKMDSG++ +YYSVDEAI++GAAPIPLS DR VDVQC IDIMDHLL+CEATWHKGHSLAQTVFSCIYLL+PDRT
Subjt:  DLQDGELIHGDNFNLFSAMSALEIMDPKMDSGMICKYYSVDEAIENGAAPIPLSFDRTVDVQCTIDIMDHLLSCEATWHKGHSLAQTVFSCIYLLRPDRT

Query:  SSHALLHSYCSVIRATCKAVIAVVSDARTHEEEDLFIMAYGLPLSGDGDDKCLSMLHAVEETICRQLRAY------------VEPLQSTFDLEEHYCRVL
        S HALLHSYC VIRATC  V++VVSDART+EEEDLF M YGLPL GDGD+KCLSMLHAVEETI RQLRA             +EPLQ+  DLEE +C+ L
Subjt:  SSHALLHSYCSVIRATCKAVIAVVSDARTHEEEDLFIMAYGLPLSGDGDDKCLSMLHAVEETICRQLRAY------------VEPLQSTFDLEEHYCRVL

Query:  LCRLRFRKHFYHVLACMRRPQGRGLDLARKHIASCLLELDLIRNSSTFLSNNSFGISKEDLEDMTTASGRQPLGFDSSLNCRLSAPTPPRAIKLLSWKKA
        LCR+RFRKHFYH L  M+RPQGRGL+LARKHI SCL ELD I  SS FL +N+ G  ++ LED TTASG QP+GFD+SLNCRLSAPTPPRAIK+LSW KA
Subjt:  LCRLRFRKHFYHVLACMRRPQGRGLDLARKHIASCLLELDLIRNSSTFLSNNSFGISKEDLEDMTTASGRQPLGFDSSLNCRLSAPTPPRAIKLLSWKKA

Query:  LDYFVKLLHDLDNICSYSLDPFLEGVFRFVVQFQKTQPDLVARSLLQFLLVQDGKLYGKDPLYAVITKAAGLPESAKNHENQKNQYIVQLGQLVINLLKV
        ++YF KLLHDLD +CSYSLDP LEGV RFVV+FQK QPDLVAR+ LQ LL+ DGKLYG+DP++ VI+KA  LPE  K+HE QKN+ + QLGQLVI+LLK+
Subjt:  LDYFVKLLHDLDNICSYSLDPFLEGVFRFVVQFQKTQPDLVARSLLQFLLVQDGKLYGKDPLYAVITKAAGLPESAKNHENQKNQYIVQLGQLVINLLKV

Query:  LCTNSAWQRRKLGKILQDWRVIYMQMEMAFKKDFAEILSISNGENAWMKIFQHILVWVEEQTYWISSRFLVLGFELELYSPSEYCMVYWYLSVVLIKLVE
        LCTN+AWQRRKLGKILQDWRV YMQ+E+AFKK+F +I S    E+   KI +HILVWVEEQTYWI+SRFLVLGFELELYSPSEYCMVYWY+ VV IKL E
Subjt:  LCTNSAWQRRKLGKILQDWRVIYMQMEMAFKKDFAEILSISNGENAWMKIFQHILVWVEEQTYWISSRFLVLGFELELYSPSEYCMVYWYLSVVLIKLVE

Query:  KIHLRALMSNETGECANLAAMLGKRKGKKKGAS-KDIGKDFRIPPAVSFLQCQICLAEGLVVMLAALRNEHMIAQSPSPFNSEYENLVTYNCFKDAQKIA
        K HL+ L S+ET          GKRK KKK  S KD+ +D++IPP V  LQC I LAE L +MLAALRNEH +  S  PFN+EYEN V +NCFKDA +IA
Subjt:  KIHLRALMSNETGECANLAAMLGKRKGKKKGAS-KDIGKDFRIPPAVSFLQCQICLAEGLVVMLAALRNEHMIAQSPSPFNSEYENLVTYNCFKDAQKIA

Query:  KELKSSFSNDREKLVELRRIEQVAEHNSVALNLIHKLKDLEAFSLLVCPVFARRGCLVGGFAAQDVLKDNMVDLTTFGCQENFQDIKFQVGLKLSNKIQE
        KEL+SSFS+D E++ ELRRIEQVAEHN VALNLI +L  LE     + P    R       AAQ+    N   +  F C                     
Subjt:  KELKSSFSNDREKLVELRRIEQVAEHNSVALNLIHKLKDLEAFSLLVCPVFARRGCLVGGFAAQDVLKDNMVDLTTFGCQENFQDIKFQVGLKLSNKIQE

Query:  IKESWMTDHHLSQHVARSEDRMQEPNENALTTDRPVFSLSRFSLRFINSEGERMSGGGTQNSFRKALGALKDTTTVSLAKVNSDYKELDIAIVKSTNHVE
              +D H                                   F +  G RMS GGTQNS RKALGA+KDTTTVSLAKVNSDYKELDIAIVK+TNHVE
Subjt:  IKESWMTDHHLSQHVARSEDRMQEPNENALTTDRPVFSLSRFSLRFINSEGERMSGGGTQNSFRKALGALKDTTTVSLAKVNSDYKELDIAIVKSTNHVE

Query:  RPAKEKHIRAIFAAISATRPRADVAYCIHALARRLSKTHNWAVALKTLVVIHRALREVDPTFHEELINYGRRRNHMLNLSHFKDDSSANAWDYSAWVRSY
        RPAKEK+I+AIFAA+SATRPRADVAYCIHALARRLSKT NWAVALKTL+VIHRALREVDPTFHEEL NYGR R+HMLNL+HFKDDSS +AWDYSAWVR+Y
Subjt:  RPAKEKHIRAIFAAISATRPRADVAYCIHALARRLSKTHNWAVALKTLVVIHRALREVDPTFHEELINYGRRRNHMLNLSHFKDDSSANAWDYSAWVRSY

Query:  ALFLEERLECFRVLKYDVETDRARTKDLDTAELLEQLPALQELLYRVLGCQPQGAAVHNFVIQLALSLVASESVKIYQAISDGTINLVDKFFEMRRQDAL
        ALFLEER+EC+RVLKYDVE DR RTKDLDT  LLEQLPALQ+LL RVLGCQP+GAAV NFVIQLALS+VASES+ IY AISDGT+NLVDKFFEM   DAL
Subjt:  ALFLEERLECFRVLKYDVETDRARTKDLDTAELLEQLPALQELLYRVLGCQPQGAAVHNFVIQLALSLVASESVKIYQAISDGTINLVDKFFEMRRQDAL

Query:  KALDIYRRAGQQAERLSEFYEVCKSLDIGRGEKFIKIEQPPASFIQAMEEYVREAPQVSTVRKEQVADNKLAAPKEVLAIEYKKEPAAQVEQTVAPPPAP
        +ALDIYR+AGQQAERLSEFYE+CK LDI RGE FIKIEQPPASF+QAMEEY REAP+ ST RK+QV   K A+PK +LAIE  K  A  +E     PP P
Subjt:  KALDIYRRAGQQAERLSEFYEVCKSLDIGRGEKFIKIEQPPASFIQAMEEYVREAPQVSTVRKEQVADNKLAAPKEVLAIEYKKEPAAQVEQTVAPPPAP

Query:  SP---PPPEPVKVEA-VVTEQPDLLGLNDPVPEATSNLDEKNSLALAIVPVADQQTSSAPSQVNGTTTTGWELALVTAPSSNESVAATSKLAGGLDLLTL
         P   P PEPVKVEA V    PDLLGLNDP P A S LDEKN++ALAIVPV DQ T++     NG   TGWELALVTAPSSN S + TS+L GGLD LTL
Subjt:  SP---PPPEPVKVEA-VVTEQPDLLGLNDPVPEATSNLDEKNSLALAIVPVADQQTSSAPSQVNGTTTTGWELALVTAPSSNESVAATSKLAGGLDLLTL

Query:  DSLYDDAIRRNNQNVSYNPWEPVPVHGAMMQQPIHDPFFASTAVAAPHSVQMAAMANQQQAYMLQQQQQQQMMMMAPPPQQ-----SNPFGNPHGTNAYH
        DSLYDDAIRR+NQN SYNPWE  P+ G +M Q  HDPFFAS +VAAPHSVQMAAMANQQQA+M QQQQQQQMMMM    QQ     SNPF N +G N + 
Subjt:  DSLYDDAIRRNNQNVSYNPWEPVPVHGAMMQQPIHDPFFASTAVAAPHSVQMAAMANQQQAYMLQQQQQQQMMMMAPPPQQ-----SNPFGNPHGTNAYH

Query:  YGPGMPVHASNPY
        YGPGMPV A NP+
Subjt:  YGPGMPVHASNPY

KAG5559475.1 hypothetical protein RHGRI_009117 [Rhododendron griersonianum]0.0e+0065.66Show/hide
Query:  IPSGEHTVWADVSPLLEAACGD----------------------------------------------------LQDGELIHGDNFNLFSAMSALEIMDP
        IPSGE TVWAD+S LL++AC                                                      L+DGELIHG+NFNLF+AMSALEIMDP
Subjt:  IPSGEHTVWADVSPLLEAACGD----------------------------------------------------LQDGELIHGDNFNLFSAMSALEIMDP

Query:  KMDSGMICKYYSVDEAIENGAAPIPLSFDRTVDVQCTIDIMDHLLSCEATWHKGHSLAQTVFSCIYLLRPDRTSSHALLHSYCSVIRATCKAVIAVVSDA
        KMDSG++ +YYSVDEAI++GAAPIPLS DR VDVQC IDIMDHLL+CEATWHKGHSLAQTVFSCIYLL+PDRTS HALLHSYC VIRATC  V++VVSDA
Subjt:  KMDSGMICKYYSVDEAIENGAAPIPLSFDRTVDVQCTIDIMDHLLSCEATWHKGHSLAQTVFSCIYLLRPDRTSSHALLHSYCSVIRATCKAVIAVVSDA

Query:  RTHEEEDLFIMAYGLPLSGDGDDKCLSMLHAVEETICRQLRAY------------VEPLQSTFDLEEHYCRVLLCRLRFRKHFYHVLACMRRPQGRGLDL
        RT+EEEDLF M YGLPL GDGD+KCLSMLHAVEETI RQLRA             +EPLQ+  DLEE +C+ LLCR+RFRKHFYH L  M+RPQGRGL+L
Subjt:  RTHEEEDLFIMAYGLPLSGDGDDKCLSMLHAVEETICRQLRAY------------VEPLQSTFDLEEHYCRVLLCRLRFRKHFYHVLACMRRPQGRGLDL

Query:  ARKHIASCLLELDLIRNSSTFLSNNSFGISKEDLEDMTTASGRQPLGFDSSLNCRLSAPTPPRAIKLLSWKKALDYFVKLLHDLDNICSYSLDPFLEGVF
        ARKHI SCL ELD I  SS FL +N+ G  ++ LED TTASG QP+GFD+SLNCRLSAPTPPRAIK+LSW KA++YF KLLHDLD +CSYSLDP LEGV 
Subjt:  ARKHIASCLLELDLIRNSSTFLSNNSFGISKEDLEDMTTASGRQPLGFDSSLNCRLSAPTPPRAIKLLSWKKALDYFVKLLHDLDNICSYSLDPFLEGVF

Query:  RFVVQFQKTQPDLVARSLLQFLLVQDGKLYGKDPLYAVITKAAGLPESAKNHENQKNQYIVQLGQLVINLLKVLCTNSAWQRRKLGKILQDWRVIYMQME
        RFVV+FQK QPDLVAR+ LQ LL+ DGKLYG+DP++ VI+KA  LPE  K+HE QKN+ + QLGQLVI+LLK+LCTN+AWQRRKLGKILQDWRV YMQ+E
Subjt:  RFVVQFQKTQPDLVARSLLQFLLVQDGKLYGKDPLYAVITKAAGLPESAKNHENQKNQYIVQLGQLVINLLKVLCTNSAWQRRKLGKILQDWRVIYMQME

Query:  MAFKKDFAEILSISNGENAWMKIFQHILVWVEEQTYWISSRFLVLGFELELYSPSEYCMVYWYLSVVLIKLVEKIHLRALMSNETGECANLAAMLGKRKG
        +AFKK+F +I S    E+   KI +HILVWVEEQTYWI+SRFLVLGFELELYSPSEYCMVYWY+ VV IKL EK HL+ L S+ET          GKRK 
Subjt:  MAFKKDFAEILSISNGENAWMKIFQHILVWVEEQTYWISSRFLVLGFELELYSPSEYCMVYWYLSVVLIKLVEKIHLRALMSNETGECANLAAMLGKRKG

Query:  KKKGAS-KDIGKDFRIPPAVSFLQCQICLAEGLVVMLAALRNEHMIAQSPSPFNSEYENLVTYNCFKDAQKIAKELKSSFSNDREKLVELRRIEQVAEHN
        KKK  S KD+ +D++IPP V  LQC I LAE L +MLAALRNEH +  S  PFN+EYEN V +NCFKDA +IAKEL+SSFS+D E++ ELRRIEQVAEHN
Subjt:  KKKGAS-KDIGKDFRIPPAVSFLQCQICLAEGLVVMLAALRNEHMIAQSPSPFNSEYENLVTYNCFKDAQKIAKELKSSFSNDREKLVELRRIEQVAEHN

Query:  SVALNLIHKLKDLEAFSLLVCPVFARRGCLVGGFAAQDVLKDNMVDLTTFGCQENFQDIKFQVGLKLSNKIQEIKESWMTDHHLSQHVARSEDRMQEPNE
         VALNLI +L  LE     + P    R       AAQ+    N   +  F C                           +D H                 
Subjt:  SVALNLIHKLKDLEAFSLLVCPVFARRGCLVGGFAAQDVLKDNMVDLTTFGCQENFQDIKFQVGLKLSNKIQEIKESWMTDHHLSQHVARSEDRMQEPNE

Query:  NALTTDRPVFSLSRFSLRFINSEGERMSGGGTQNSFRKALGALKDTTTVSLAKVNSDYKELDIAIVKSTNHVERPAKEKHIRAIFAAISATRPRADVAYC
                          F +  G RMS GGTQNS RKALGA+KDTTTVSLAKVNSDYKELDIAIVK+TNHVERPAKEK+I+AIFAA+SATRPRADVAYC
Subjt:  NALTTDRPVFSLSRFSLRFINSEGERMSGGGTQNSFRKALGALKDTTTVSLAKVNSDYKELDIAIVKSTNHVERPAKEKHIRAIFAAISATRPRADVAYC

Query:  IHALARRLSKTHNWAVALKTLVVIHRALREVDPTFHEELINYGRRRNHMLNLSHFKDDSSANAWDYSAWVRSYALFLEERLECFRVLKYDVETDRARTKD
        IHALARRLSKT NWAVALKTL+VIHRALREVDPTFHEEL NYGR R+HMLNL+HFKDDSS +AWDYSAWVR+YALFLEER+EC+RVLKYDVE DR RTKD
Subjt:  IHALARRLSKTHNWAVALKTLVVIHRALREVDPTFHEELINYGRRRNHMLNLSHFKDDSSANAWDYSAWVRSYALFLEERLECFRVLKYDVETDRARTKD

Query:  LDTAELLEQLPALQELLYRVLGCQPQGAAVHNFVIQLALSLVASESVKIYQAISDGTINLVDKFFEMRRQDALKALDIYRRAGQQAERLSEFYEVCKSLD
        LDT  LLEQLPALQ+LL RVLGCQP+GAAV NFVIQLALS+VASES+ IY AISDGT+NLVDKFFEM   DAL+ALDIYR+AGQQAERLSEFYE+CK LD
Subjt:  LDTAELLEQLPALQELLYRVLGCQPQGAAVHNFVIQLALSLVASESVKIYQAISDGTINLVDKFFEMRRQDALKALDIYRRAGQQAERLSEFYEVCKSLD

Query:  IGRGEKFIKIEQPPASFIQAMEEYVREAPQVSTVRKEQVADNKLAAPKEVLAIEYKKEPAAQVEQTVAPPPAPSP---PPPEPVKVEA-VVTEQPDLLGL
        I RGE FIKIEQPPASF+QAMEEY REAP+ ST RK+QV   K A+PK +LAIE  K  A  +E     PP P P   P PEPVKVEA V    PDLLGL
Subjt:  IGRGEKFIKIEQPPASFIQAMEEYVREAPQVSTVRKEQVADNKLAAPKEVLAIEYKKEPAAQVEQTVAPPPAPSP---PPPEPVKVEA-VVTEQPDLLGL

Query:  NDPVPEATSNLDEKNSLALAIVPVADQQTSSAPSQVNGTTTTGWELALVTAPSSNESVAATSKLAGGLDLLTLDSLYDDAIRRNNQNVSYNPWEPVPVHG
        NDP P A S LDEKN++ALAIVPV DQ T++     NG   TGWELALVTAPSSN S + TS+L GGLD LTLDSLYDDAIRR+NQN SYNPWE  P+ G
Subjt:  NDPVPEATSNLDEKNSLALAIVPVADQQTSSAPSQVNGTTTTGWELALVTAPSSNESVAATSKLAGGLDLLTLDSLYDDAIRRNNQNVSYNPWEPVPVHG

Query:  AMMQQPIHDPFFASTAVAAPHSVQMAAMANQQQAYMLQQQQQQQMMMMAPPPQQ-----SNPFGNPHGTNAYHYGPGMPVHASNPY
         +M Q  HDPFFAS +VAAPHSVQMAAMANQQQA+M QQQQQQQMMMM    QQ     SNPF N +G N + YGPGMPV A NP+
Subjt:  AMMQQPIHDPFFASTAVAAPHSVQMAAMANQQQAYMLQQQQQQQMMMMAPPPQQ-----SNPFGNPHGTNAYHYGPGMPVHASNPY

RXI05691.1 hypothetical protein DVH24_017733 [Malus domestica]0.0e+0065.49Show/hide
Query:  TEPEEGTMATHAHQTASPIPSGEHTVWADVSPLLEAACGDLQDGELIHGDNFNLFSAMSALEIMDPKMDSGMICKYYSVDEAIENGAAPIPLSFDRTVDV
        T  E+ T+        +PIPSG+ TVWADVSPLL+AAC DLQDG LI+GDNFNLF+AMSALEIMDPKMDSGM+CKYYSVDEAI++GAAP+P+SFD+TVDV
Subjt:  TEPEEGTMATHAHQTASPIPSGEHTVWADVSPLLEAACGDLQDGELIHGDNFNLFSAMSALEIMDPKMDSGMICKYYSVDEAIENGAAPIPLSFDRTVDV

Query:  QCTIDIMDHLLSCEATWHKGHSLAQTVFSCIYLLRPDRTSSHALLHSYCSVIRATCKAVIAVVSDARTHEEEDLFIMAYGLPLSGDGDDKCLSMLHAVEE
        Q TIDIMDHLL+CEATWHKGHSLAQTVFSCIYLLR DRTSSHALLHSYC VIRATCKAVI+VVSDARTHEEEDLF M YGLPL+GDGD+KCLSMLHAVEE
Subjt:  QCTIDIMDHLLSCEATWHKGHSLAQTVFSCIYLLRPDRTSSHALLHSYCSVIRATCKAVIAVVSDARTHEEEDLFIMAYGLPLSGDGDDKCLSMLHAVEE

Query:  TICRQLRAYVEP-------------------------LQSTFDLEEHYCRVLLCRLRFRKHFYHVLACMRRPQGRGLDLARKHIASCLLELDLIRNSSTF
        TI RQLRA   P                          Q+  +LEE YC+ LLCRLRFRKHFYHVL  M+RPQGRGL+LARKHIASC+ EL+ I +SS F
Subjt:  TICRQLRAYVEP-------------------------LQSTFDLEEHYCRVLLCRLRFRKHFYHVLACMRRPQGRGLDLARKHIASCLLELDLIRNSSTF

Query:  LSNNSFGISKEDLEDMTTASGRQPLGFDSSLNCRLSAPTPPRAIKLLSWKKALDYFVKLLHDLDNICSYSLDPFLEGVFRFVVQFQKTQPDLVARSLLQF
        L +++FG  + +LED TTASG +P+GFD+SLNCRLSAPTPPRAIK+LSWKK ++YF+KLL DLD ICSY LDP LE V  F+V FQK+QPDLVAR+ LQ 
Subjt:  LSNNSFGISKEDLEDMTTASGRQPLGFDSSLNCRLSAPTPPRAIKLLSWKKALDYFVKLLHDLDNICSYSLDPFLEGVFRFVVQFQKTQPDLVARSLLQF

Query:  LLVQDGKLYGKDPLYAVITKAAGLPESAKNHENQKNQYIVQLGQLVINLLKVLCTNSAWQRRKLGKILQDWRVIYMQMEMAFKKDFAEILSISNGENAWM
        LLVQDGKLYG+DP++AVI +AA LP++ +NH+ QK + I+QLGQLVI+LLKVLCTN AWQRRKLGKILQDWRV+YMQ+EMAF+K+F +  +ISN EN  +
Subjt:  LLVQDGKLYGKDPLYAVITKAAGLPESAKNHENQKNQYIVQLGQLVINLLKVLCTNSAWQRRKLGKILQDWRVIYMQMEMAFKKDFAEILSISNGENAWM

Query:  KIFQHILVWVEEQTYWISSRFLVLGFELELYSPSEYCMVYWYLSVVLIKLVEKIHLRALMSNETGECANLAAMLGKRKGKKKGAS-KDIGKDFRIPPAVS
        K+FQHIL+WVE+QTYWI+SRFL+LGFELELYS SEYCMVYWY+ VVL+KL +K +L+ ++ N +G+C          KGKKK  S KD+ +D+ IPPAV 
Subjt:  KIFQHILVWVEEQTYWISSRFLVLGFELELYSPSEYCMVYWYLSVVLIKLVEKIHLRALMSNETGECANLAAMLGKRKGKKKGAS-KDIGKDFRIPPAVS

Query:  FLQCQICLAEGLVVMLAALRNEHMIAQSPSPFNSEYE---------------------------------NLVTYNCFKDAQKIAKELKSSFSNDREKLV
        FLQCQICLAEGL +MLAAL+N+HM+ QS SPFN+E E                                 NLV ++ FKDAQKIAK +KSSFSND EKL 
Subjt:  FLQCQICLAEGLVVMLAALRNEHMIAQSPSPFNSEYE---------------------------------NLVTYNCFKDAQKIAKELKSSFSNDREKLV

Query:  ELRRIEQVAEHNSVALNLIHKLKDLEAFSLLVCPVFARRGCLVGGFAAQDVLKDNMVDLTTFGCQENFQDIKFQVGLKLSNKIQEIKESWMTDHHLSQHV
        ELRR+EQVAEHNS+ALN+I +                          A D                                   +K S+  +HH     
Subjt:  ELRRIEQVAEHNSVALNLIHKLKDLEAFSLLVCPVFARRGCLVGGFAAQDVLKDNMVDLTTFGCQENFQDIKFQVGLKLSNKIQEIKESWMTDHHLSQHV

Query:  ARSEDRMQEPNENALTTDRPVFSLSRFSLRFINSEGERMSGGGTQNSFRKALGALKDTTTVSLAKVNSDYKELDIAIVKSTNHVERPAKEKHIRAIFAAI
                           P F+ +      +     RMS GGTQ S RKALGALKDTTTVSLAKVNSDYKELDIAIVK+TNHVERPAKEK+IRAIFAA+
Subjt:  ARSEDRMQEPNENALTTDRPVFSLSRFSLRFINSEGERMSGGGTQNSFRKALGALKDTTTVSLAKVNSDYKELDIAIVKSTNHVERPAKEKHIRAIFAAI

Query:  SATRPRADVAYCIHALARRLSKTHNWAVALKTLVVIHRALREVDPTFHEELINYGRRRNHMLNLSHFKDDSSANAWDYSAWVRSYALFLEERLECFRVLK
        SATRPRADVAYCIHALARRL+KTHNWAVALKTLVVIHRALREVDPTF+EEL+NYGR R+HMLNL+HFKDDSS NAWDYS W+R+YALFLEERLECFRVLK
Subjt:  SATRPRADVAYCIHALARRLSKTHNWAVALKTLVVIHRALREVDPTFHEELINYGRRRNHMLNLSHFKDDSSANAWDYSAWVRSYALFLEERLECFRVLK

Query:  YDVETDRARTKDLDTAELLEQLPALQELLYRVLGCQPQGAAVHNFVIQLALSLVASESVKIYQAISDGTINLVDKFFEMRRQDALKALDIYRRAGQQAER
        YDVE DR                             PQGAAVHNFV+QLALS+VASES+KIYQAISDGT+NLVDKFFEM+R DA++ALDIYRRAGQQAER
Subjt:  YDVETDRARTKDLDTAELLEQLPALQELLYRVLGCQPQGAAVHNFVIQLALSLVASESVKIYQAISDGTINLVDKFFEMRRQDALKALDIYRRAGQQAER

Query:  LSEFYEVCKSLDIGRGEKFIKIEQPPASFIQAMEEYVREAPQVSTVRKEQVADNKLAAPKEVLAIEYKKEPAAQVEQTVAPPPAPSPPPPEPVKVEAVVT
        LSEFYEVCKSLDIGRGEKF KIEQPPASF+QAMEEYV+EAPQ ST RK+QV      APKE+LAIEYKK P  + E     PP P P   EPVKVEA V 
Subjt:  LSEFYEVCKSLDIGRGEKFIKIEQPPASFIQAMEEYVREAPQVSTVRKEQVADNKLAAPKEVLAIEYKKEPAAQVEQTVAPPPAPSPPPPEPVKVEAVVT

Query:  EQPDLLGLNDPVPEATSNLDEKNSLALAIVPVADQQTSSAPSQVNGTTTTGWELALVTAPSSNESVAATSKLAGGLDLLTLDSLYDDAIRRNNQNVSYNP
        E PDLLGLNDPVP  T  LD+KN+LALAIVPV+DQ TS+AP+  NG   TGWELALVTAPSSNES  A+SKLAGGLDLLTLDSLYDDAIRRNNQNVSYNP
Subjt:  EQPDLLGLNDPVPEATSNLDEKNSLALAIVPVADQQTSSAPSQVNGTTTTGWELALVTAPSSNESVAATSKLAGGLDLLTLDSLYDDAIRRNNQNVSYNP

Query:  WEPVPVHGAMMQQPIHDPFFASTAVAAPHSVQMAAMANQQQAYMLQQQQQQQ----MMMMAPPPQQS-NPFGNPHGTNAYHYGPGMPVHASNPY
        WEP PV+G MMQQ +HDPF+AS  +AAPHSVQMAAMANQQQA+MLQQQQQQQ    MMMM    QQS NPFG PHG   + YGPG+PV A NPY
Subjt:  WEPVPVHGAMMQQPIHDPFFASTAVAAPHSVQMAAMANQQQAYMLQQQQQQQ----MMMMAPPPQQS-NPFGNPHGTNAYHYGPGMPVHASNPY

TrEMBL top hitse value%identityAlignment
A0A1S3B966 N-alpha-acetyltransferase 35, NatC auxiliary subunit isoform X10.0e+0086.99Show/hide
Query:  MADTEPEEGTMATHAHQTASPIPSGEHTVWADVSPLLEAACGDLQDGELIHGDNFNLFSAMSALEIMDPKMDSGMICKYYSVDEAIENGAAPIPLSFDRT
        MAD EPE+GTMATH+ Q ASPIPSGE+TVWADVSPLLEAAC DLQDGELIHG+ FNLFSAMSALEIMDPKMDSGMICKYYSVDEAIENGAAPIPLSFDRT
Subjt:  MADTEPEEGTMATHAHQTASPIPSGEHTVWADVSPLLEAACGDLQDGELIHGDNFNLFSAMSALEIMDPKMDSGMICKYYSVDEAIENGAAPIPLSFDRT

Query:  VDVQCTIDIMDHLLSCEATWHKGHSLAQTVFSCIYLLRPDRTSSHALLHSYCSVIRATCKAVIAVVSDARTHEEEDLFIMAYGLPLSGDGDDKCLSMLHA
        VDVQC IDIMDHLLSCEATWHKGHSLAQTVFSCIYLLRPDRTSSHALLHSYCSVIRATCKAVIAVVSDARTHEEEDLFIMAYGLPLSGDGDDKCLS L+A
Subjt:  VDVQCTIDIMDHLLSCEATWHKGHSLAQTVFSCIYLLRPDRTSSHALLHSYCSVIRATCKAVIAVVSDARTHEEEDLFIMAYGLPLSGDGDDKCLSMLHA

Query:  VEETICRQLRAY------------VEPLQSTFDLEEHYCRVLLCRLRFRKHFYHVLACMRRPQGRGLDLARKHIASCLLELDLIRNSSTFLSNNSFGISK
        VEETICRQLRA             VEPLQ++FDLEEHYCR LLCRLRFRKHFYHVLACMRRPQGRGL+LARKHIASCLLELDLIRNSSTFLSN+SFGISK
Subjt:  VEETICRQLRAY------------VEPLQSTFDLEEHYCRVLLCRLRFRKHFYHVLACMRRPQGRGLDLARKHIASCLLELDLIRNSSTFLSNNSFGISK

Query:  EDLEDMTTASGRQPLGFDSSLNCRLSAPTPPRAIKLLSWKKALDYFVKLLHDLDNICSYSLDPFLEGVFRFVVQFQKTQPDLVARSLLQFLLVQDGKLYG
        +DLED TTASGRQP+GFDSSLNCRLSAPTPPRAIKLLSWKKALDYFVKLL DLDNICSYSLD FLEGVFRFVVQFQK+QPDLVARSLLQFLLVQDGKLYG
Subjt:  EDLEDMTTASGRQPLGFDSSLNCRLSAPTPPRAIKLLSWKKALDYFVKLLHDLDNICSYSLDPFLEGVFRFVVQFQKTQPDLVARSLLQFLLVQDGKLYG

Query:  KDPLYAVITKAAGLPESAKNHENQKNQYIVQLGQLVINLLKVLCTNSAWQRRKLGKILQDWRVIYMQMEMAFKKDFAEILSISNGENAWMKIFQHILVWV
        ++PLYAVITKAAGLPESAKNHEN KNQYIVQLGQLVINLLKVLCTNSAWQRRKLGKILQDWRVIYMQMEMAFKKD AEI+SISNGENAWMKIFQHIL+WV
Subjt:  KDPLYAVITKAAGLPESAKNHENQKNQYIVQLGQLVINLLKVLCTNSAWQRRKLGKILQDWRVIYMQMEMAFKKDFAEILSISNGENAWMKIFQHILVWV

Query:  EEQTYWISSRFLVLGFELELYSPSEYCMVYWYLSVVLIKLVEKIHLRALMSNETGECANLAAMLGKRKGKKKGASKDIGKDFRIPPAVSFLQCQICLAEG
        EEQTYWISSRFLVLGFELELYSP +YCMVYWYLSVVLIKLVEKIHLRALMSNET          GKRKGKKKGASKDIGKDFRIPPAVSFLQCQICLAEG
Subjt:  EEQTYWISSRFLVLGFELELYSPSEYCMVYWYLSVVLIKLVEKIHLRALMSNETGECANLAAMLGKRKGKKKGASKDIGKDFRIPPAVSFLQCQICLAEG

Query:  LVVMLAALRNEHMIAQSPSPFNSEYE---------------------------------NLVTYNCFKDAQKIAKELKSSFSNDREKLVELRRIEQVAEH
        LV+MLAALRNEHMIAQSPSPFNSEYE                                 NLVTYNCFKDAQKIAKELKSSFSND EKLVELRRIEQVAEH
Subjt:  LVVMLAALRNEHMIAQSPSPFNSEYE---------------------------------NLVTYNCFKDAQKIAKELKSSFSNDREKLVELRRIEQVAEH

Query:  NSVALNLIHKLKDLE
        NSVALNLIHK+  L+
Subjt:  NSVALNLIHKLKDLE

A0A498KED9 ENTH domain-containing protein0.0e+0065.49Show/hide
Query:  TEPEEGTMATHAHQTASPIPSGEHTVWADVSPLLEAACGDLQDGELIHGDNFNLFSAMSALEIMDPKMDSGMICKYYSVDEAIENGAAPIPLSFDRTVDV
        T  E+ T+        +PIPSG+ TVWADVSPLL+AAC DLQDG LI+GDNFNLF+AMSALEIMDPKMDSGM+CKYYSVDEAI++GAAP+P+SFD+TVDV
Subjt:  TEPEEGTMATHAHQTASPIPSGEHTVWADVSPLLEAACGDLQDGELIHGDNFNLFSAMSALEIMDPKMDSGMICKYYSVDEAIENGAAPIPLSFDRTVDV

Query:  QCTIDIMDHLLSCEATWHKGHSLAQTVFSCIYLLRPDRTSSHALLHSYCSVIRATCKAVIAVVSDARTHEEEDLFIMAYGLPLSGDGDDKCLSMLHAVEE
        Q TIDIMDHLL+CEATWHKGHSLAQTVFSCIYLLR DRTSSHALLHSYC VIRATCKAVI+VVSDARTHEEEDLF M YGLPL+GDGD+KCLSMLHAVEE
Subjt:  QCTIDIMDHLLSCEATWHKGHSLAQTVFSCIYLLRPDRTSSHALLHSYCSVIRATCKAVIAVVSDARTHEEEDLFIMAYGLPLSGDGDDKCLSMLHAVEE

Query:  TICRQLRAYVEP-------------------------LQSTFDLEEHYCRVLLCRLRFRKHFYHVLACMRRPQGRGLDLARKHIASCLLELDLIRNSSTF
        TI RQLRA   P                          Q+  +LEE YC+ LLCRLRFRKHFYHVL  M+RPQGRGL+LARKHIASC+ EL+ I +SS F
Subjt:  TICRQLRAYVEP-------------------------LQSTFDLEEHYCRVLLCRLRFRKHFYHVLACMRRPQGRGLDLARKHIASCLLELDLIRNSSTF

Query:  LSNNSFGISKEDLEDMTTASGRQPLGFDSSLNCRLSAPTPPRAIKLLSWKKALDYFVKLLHDLDNICSYSLDPFLEGVFRFVVQFQKTQPDLVARSLLQF
        L +++FG  + +LED TTASG +P+GFD+SLNCRLSAPTPPRAIK+LSWKK ++YF+KLL DLD ICSY LDP LE V  F+V FQK+QPDLVAR+ LQ 
Subjt:  LSNNSFGISKEDLEDMTTASGRQPLGFDSSLNCRLSAPTPPRAIKLLSWKKALDYFVKLLHDLDNICSYSLDPFLEGVFRFVVQFQKTQPDLVARSLLQF

Query:  LLVQDGKLYGKDPLYAVITKAAGLPESAKNHENQKNQYIVQLGQLVINLLKVLCTNSAWQRRKLGKILQDWRVIYMQMEMAFKKDFAEILSISNGENAWM
        LLVQDGKLYG+DP++AVI +AA LP++ +NH+ QK + I+QLGQLVI+LLKVLCTN AWQRRKLGKILQDWRV+YMQ+EMAF+K+F +  +ISN EN  +
Subjt:  LLVQDGKLYGKDPLYAVITKAAGLPESAKNHENQKNQYIVQLGQLVINLLKVLCTNSAWQRRKLGKILQDWRVIYMQMEMAFKKDFAEILSISNGENAWM

Query:  KIFQHILVWVEEQTYWISSRFLVLGFELELYSPSEYCMVYWYLSVVLIKLVEKIHLRALMSNETGECANLAAMLGKRKGKKKGAS-KDIGKDFRIPPAVS
        K+FQHIL+WVE+QTYWI+SRFL+LGFELELYS SEYCMVYWY+ VVL+KL +K +L+ ++ N +G+C          KGKKK  S KD+ +D+ IPPAV 
Subjt:  KIFQHILVWVEEQTYWISSRFLVLGFELELYSPSEYCMVYWYLSVVLIKLVEKIHLRALMSNETGECANLAAMLGKRKGKKKGAS-KDIGKDFRIPPAVS

Query:  FLQCQICLAEGLVVMLAALRNEHMIAQSPSPFNSEYE---------------------------------NLVTYNCFKDAQKIAKELKSSFSNDREKLV
        FLQCQICLAEGL +MLAAL+N+HM+ QS SPFN+E E                                 NLV ++ FKDAQKIAK +KSSFSND EKL 
Subjt:  FLQCQICLAEGLVVMLAALRNEHMIAQSPSPFNSEYE---------------------------------NLVTYNCFKDAQKIAKELKSSFSNDREKLV

Query:  ELRRIEQVAEHNSVALNLIHKLKDLEAFSLLVCPVFARRGCLVGGFAAQDVLKDNMVDLTTFGCQENFQDIKFQVGLKLSNKIQEIKESWMTDHHLSQHV
        ELRR+EQVAEHNS+ALN+I +                          A D                                   +K S+  +HH     
Subjt:  ELRRIEQVAEHNSVALNLIHKLKDLEAFSLLVCPVFARRGCLVGGFAAQDVLKDNMVDLTTFGCQENFQDIKFQVGLKLSNKIQEIKESWMTDHHLSQHV

Query:  ARSEDRMQEPNENALTTDRPVFSLSRFSLRFINSEGERMSGGGTQNSFRKALGALKDTTTVSLAKVNSDYKELDIAIVKSTNHVERPAKEKHIRAIFAAI
                           P F+ +      +     RMS GGTQ S RKALGALKDTTTVSLAKVNSDYKELDIAIVK+TNHVERPAKEK+IRAIFAA+
Subjt:  ARSEDRMQEPNENALTTDRPVFSLSRFSLRFINSEGERMSGGGTQNSFRKALGALKDTTTVSLAKVNSDYKELDIAIVKSTNHVERPAKEKHIRAIFAAI

Query:  SATRPRADVAYCIHALARRLSKTHNWAVALKTLVVIHRALREVDPTFHEELINYGRRRNHMLNLSHFKDDSSANAWDYSAWVRSYALFLEERLECFRVLK
        SATRPRADVAYCIHALARRL+KTHNWAVALKTLVVIHRALREVDPTF+EEL+NYGR R+HMLNL+HFKDDSS NAWDYS W+R+YALFLEERLECFRVLK
Subjt:  SATRPRADVAYCIHALARRLSKTHNWAVALKTLVVIHRALREVDPTFHEELINYGRRRNHMLNLSHFKDDSSANAWDYSAWVRSYALFLEERLECFRVLK

Query:  YDVETDRARTKDLDTAELLEQLPALQELLYRVLGCQPQGAAVHNFVIQLALSLVASESVKIYQAISDGTINLVDKFFEMRRQDALKALDIYRRAGQQAER
        YDVE DR                             PQGAAVHNFV+QLALS+VASES+KIYQAISDGT+NLVDKFFEM+R DA++ALDIYRRAGQQAER
Subjt:  YDVETDRARTKDLDTAELLEQLPALQELLYRVLGCQPQGAAVHNFVIQLALSLVASESVKIYQAISDGTINLVDKFFEMRRQDALKALDIYRRAGQQAER

Query:  LSEFYEVCKSLDIGRGEKFIKIEQPPASFIQAMEEYVREAPQVSTVRKEQVADNKLAAPKEVLAIEYKKEPAAQVEQTVAPPPAPSPPPPEPVKVEAVVT
        LSEFYEVCKSLDIGRGEKF KIEQPPASF+QAMEEYV+EAPQ ST RK+QV      APKE+LAIEYKK P  + E     PP P P   EPVKVEA V 
Subjt:  LSEFYEVCKSLDIGRGEKFIKIEQPPASFIQAMEEYVREAPQVSTVRKEQVADNKLAAPKEVLAIEYKKEPAAQVEQTVAPPPAPSPPPPEPVKVEAVVT

Query:  EQPDLLGLNDPVPEATSNLDEKNSLALAIVPVADQQTSSAPSQVNGTTTTGWELALVTAPSSNESVAATSKLAGGLDLLTLDSLYDDAIRRNNQNVSYNP
        E PDLLGLNDPVP  T  LD+KN+LALAIVPV+DQ TS+AP+  NG   TGWELALVTAPSSNES  A+SKLAGGLDLLTLDSLYDDAIRRNNQNVSYNP
Subjt:  EQPDLLGLNDPVPEATSNLDEKNSLALAIVPVADQQTSSAPSQVNGTTTTGWELALVTAPSSNESVAATSKLAGGLDLLTLDSLYDDAIRRNNQNVSYNP

Query:  WEPVPVHGAMMQQPIHDPFFASTAVAAPHSVQMAAMANQQQAYMLQQQQQQQ----MMMMAPPPQQS-NPFGNPHGTNAYHYGPGMPVHASNPY
        WEP PV+G MMQQ +HDPF+AS  +AAPHSVQMAAMANQQQA+MLQQQQQQQ    MMMM    QQS NPFG PHG   + YGPG+PV A NPY
Subjt:  WEPVPVHGAMMQQPIHDPFFASTAVAAPHSVQMAAMANQQQAYMLQQQQQQQ----MMMMAPPPQQS-NPFGNPHGTNAYHYGPGMPVHASNPY

A0A5D3C7R5 N-alpha-acetyltransferase 35, NatC auxiliary subunit isoform X10.0e+0086.99Show/hide
Query:  MADTEPEEGTMATHAHQTASPIPSGEHTVWADVSPLLEAACGDLQDGELIHGDNFNLFSAMSALEIMDPKMDSGMICKYYSVDEAIENGAAPIPLSFDRT
        MAD EPE+GTMATH+ Q ASPIPSGE+TVWADVSPLLEAAC DLQDGELIHG+ FNLFSAMSALEIMDPKMDSGMICKYYSVDEAIENGAAPIPLSFDRT
Subjt:  MADTEPEEGTMATHAHQTASPIPSGEHTVWADVSPLLEAACGDLQDGELIHGDNFNLFSAMSALEIMDPKMDSGMICKYYSVDEAIENGAAPIPLSFDRT

Query:  VDVQCTIDIMDHLLSCEATWHKGHSLAQTVFSCIYLLRPDRTSSHALLHSYCSVIRATCKAVIAVVSDARTHEEEDLFIMAYGLPLSGDGDDKCLSMLHA
        VDVQC IDIMDHLLSCEATWHKGHSLAQTVFSCIYLLRPDRTSSHALLHSYCSVIRATCKAVIAVVSDARTHEEEDLFIMAYGLPLSGDGDDKCLS L+A
Subjt:  VDVQCTIDIMDHLLSCEATWHKGHSLAQTVFSCIYLLRPDRTSSHALLHSYCSVIRATCKAVIAVVSDARTHEEEDLFIMAYGLPLSGDGDDKCLSMLHA

Query:  VEETICRQLRAY------------VEPLQSTFDLEEHYCRVLLCRLRFRKHFYHVLACMRRPQGRGLDLARKHIASCLLELDLIRNSSTFLSNNSFGISK
        VEETICRQLRA             VEPLQ++FDLEEHYCR LLCRLRFRKHFYHVLACMRRPQGRGL+LARKHIASCLLELDLIRNSSTFLSN+SFGISK
Subjt:  VEETICRQLRAY------------VEPLQSTFDLEEHYCRVLLCRLRFRKHFYHVLACMRRPQGRGLDLARKHIASCLLELDLIRNSSTFLSNNSFGISK

Query:  EDLEDMTTASGRQPLGFDSSLNCRLSAPTPPRAIKLLSWKKALDYFVKLLHDLDNICSYSLDPFLEGVFRFVVQFQKTQPDLVARSLLQFLLVQDGKLYG
        +DLED TTASGRQP+GFDSSLNCRLSAPTPPRAIKLLSWKKALDYFVKLL DLDNICSYSLD FLEGVFRFVVQFQK+QPDLVARSLLQFLLVQDGKLYG
Subjt:  EDLEDMTTASGRQPLGFDSSLNCRLSAPTPPRAIKLLSWKKALDYFVKLLHDLDNICSYSLDPFLEGVFRFVVQFQKTQPDLVARSLLQFLLVQDGKLYG

Query:  KDPLYAVITKAAGLPESAKNHENQKNQYIVQLGQLVINLLKVLCTNSAWQRRKLGKILQDWRVIYMQMEMAFKKDFAEILSISNGENAWMKIFQHILVWV
        ++PLYAVITKAAGLPESAKNHEN KNQYIVQLGQLVINLLKVLCTNSAWQRRKLGKILQDWRVIYMQMEMAFKKD AEI+SISNGENAWMKIFQHIL+WV
Subjt:  KDPLYAVITKAAGLPESAKNHENQKNQYIVQLGQLVINLLKVLCTNSAWQRRKLGKILQDWRVIYMQMEMAFKKDFAEILSISNGENAWMKIFQHILVWV

Query:  EEQTYWISSRFLVLGFELELYSPSEYCMVYWYLSVVLIKLVEKIHLRALMSNETGECANLAAMLGKRKGKKKGASKDIGKDFRIPPAVSFLQCQICLAEG
        EEQTYWISSRFLVLGFELELYSP +YCMVYWYLSVVLIKLVEKIHLRALMSNET          GKRKGKKKGASKDIGKDFRIPPAVSFLQCQICLAEG
Subjt:  EEQTYWISSRFLVLGFELELYSPSEYCMVYWYLSVVLIKLVEKIHLRALMSNETGECANLAAMLGKRKGKKKGASKDIGKDFRIPPAVSFLQCQICLAEG

Query:  LVVMLAALRNEHMIAQSPSPFNSEYE---------------------------------NLVTYNCFKDAQKIAKELKSSFSNDREKLVELRRIEQVAEH
        LV+MLAALRNEHMIAQSPSPFNSEYE                                 NLVTYNCFKDAQKIAKELKSSFSND EKLVELRRIEQVAEH
Subjt:  LVVMLAALRNEHMIAQSPSPFNSEYE---------------------------------NLVTYNCFKDAQKIAKELKSSFSNDREKLVELRRIEQVAEH

Query:  NSVALNLIHKLKDLE
        NSVALNLIHK+  L+
Subjt:  NSVALNLIHKLKDLE

A0A6G1F997 ENTH domain-containing protein0.0e+0052.93Show/hide
Query:  MATHAHQTASPIPSGE---HTVWADVSPLLEAACGDLQDGELIHGDNFNLFSAMSALEIMDPKMDSGM-ICKYYSVDEAIENGAAPIPLSFDRTVDVQCT
        M   +  +A+P P       + WAD SPLL AAC DLQDGEL+HG+NF+LF AMSALEIMDPKMDSG+    YYS++EAIE+G AP+PLS DRT+D+QCT
Subjt:  MATHAHQTASPIPSGE---HTVWADVSPLLEAACGDLQDGELIHGDNFNLFSAMSALEIMDPKMDSGM-ICKYYSVDEAIENGAAPIPLSFDRTVDVQCT

Query:  IDIMDHLLSCEATWHKGHSLAQTVFSCIYLLRPDRTSSHALLHSYCSVIRATCKAVIAVVSDARTHEEEDLFIMAYGLPLSGDGDDKCLSMLHAVEETIC
        +D+MDHL SCEATWHKGH+LAQTVF+CIYL+R +RTSSHA+L+S+C ++RATC AV++VVS ARTHEEEDLF M++GLPL  +GD+KCLS+L++VEETI 
Subjt:  IDIMDHLLSCEATWHKGHSLAQTVFSCIYLLRPDRTSSHALLHSYCSVIRATCKAVIAVVSDARTHEEEDLFIMAYGLPLSGDGDDKCLSMLHAVEETIC

Query:  RQLRAY-------------VEPLQSTFDLEEHYCRVLLCRLRFRKHFYHVLACMRRPQGRGLDLARKHIASCLLELDLIRNSSTFLSNNSFGISKEDLED
        RQLRA              +E LQ   DLEE YCR LLCRLRFRKHFYHV+ C+R+P GRGL+LARKH+ASCL EL L+  S  FL + S    ++  E 
Subjt:  RQLRAY-------------VEPLQSTFDLEEHYCRVLLCRLRFRKHFYHVLACMRRPQGRGLDLARKHIASCLLELDLIRNSSTFLSNNSFGISKEDLED

Query:  MTTASGRQPLGFDSSLNCRLSAPTPPRAIKLLSWKKALDYFVKLLHDLDNICSYSLDPFLEGVFRFVVQFQKTQPDLVARSLLQFLLVQDGKLYGKDPLY
         TTASG  P+GFD+SLN RL +P PPRA+K+LSW  A+          +    + L+ F    F +  +  ++ P +        LLVQDGKLYG+D   
Subjt:  MTTASGRQPLGFDSSLNCRLSAPTPPRAIKLLSWKKALDYFVKLLHDLDNICSYSLDPFLEGVFRFVVQFQKTQPDLVARSLLQFLLVQDGKLYGKDPLY

Query:  AVITKAAGLPESAKNHENQKNQYIVQLGQLVINLLKVLCTNSAWQRRKLGKILQDWRVIYMQMEMAFKKDFAEILSISNGENAWMKIFQHILVWVEEQTY
         VI++A  LP+   + E Q N+++VQLGQLVINLLK+LCTN+AWQRRKLGK LQDW  I +Q+E+A K++F E  ++   EN  M++ + +LVW +E TY
Subjt:  AVITKAAGLPESAKNHENQKNQYIVQLGQLVINLLKVLCTNSAWQRRKLGKILQDWRVIYMQMEMAFKKDFAEILSISNGENAWMKIFQHILVWVEEQTY

Query:  WISSRFLVLGFELELYSPSEYCMVYWYLSVVLIKLVEKIHLRALMSNETGECANLAAMLGKRKG-KKKGASKDIGKDFRIPPAVSFLQCQICLAEGLVVM
        W++ RFL+LGFEL+LYSPSEYCMVYWY+ VVL+KL+E++ LR L SNE            +RKG KKK  SKD  +D   P +   LQC + L+EGL +M
Subjt:  WISSRFLVLGFELELYSPSEYCMVYWYLSVVLIKLVEKIHLRALMSNETGECANLAAMLGKRKG-KKKGASKDIGKDFRIPPAVSFLQCQICLAEGLVVM

Query:  LAALRNEHMIAQSPSPFNSEYENLVTYNCFKDAQKIAKELKSSFSNDREKLVELRRIEQVAEHNSVALNLIHKLKDLEAFSLLVCPVFARRGCLVGGFAA
        LAALRNE    Q PS FNSE E  + +  F   QK       ++ + RE           A H S+A                                 
Subjt:  LAALRNEHMIAQSPSPFNSEYENLVTYNCFKDAQKIAKELKSSFSNDREKLVELRRIEQVAEHNSVALNLIHKLKDLEAFSLLVCPVFARRGCLVGGFAA

Query:  QDVLKDNMVDLTTFGCQENFQDIKFQVGLKLSNKIQEIKESWMTDHHLSQHVARSEDRMQEPNENALTTDRPVFSLSRFSLRFINSEGERMSGGGTQNSF
         D++K N            F++I                      H ++  +                  R  F+     L  I    ++M+GGGT  S 
Subjt:  QDVLKDNMVDLTTFGCQENFQDIKFQVGLKLSNKIQEIKESWMTDHHLSQHVARSEDRMQEPNENALTTDRPVFSLSRFSLRFINSEGERMSGGGTQNSF

Query:  RKALGALKDTTTVSLAKVNSDYKELDIAIVKSTNHVERPAKEKHIRAIFAAISATRPRADVAYCIHALARRLSKTHNWAVALKTLVVIHRALREVDPTFH
        RK +GALKD+TTVS+AKVNSDYKELDIAIVK+TNHVE  AKEK+IR IF  +SA RPRADVAYCI AL RRLSKT NWAVALKTL+VIHRALREVD TF 
Subjt:  RKALGALKDTTTVSLAKVNSDYKELDIAIVKSTNHVERPAKEKHIRAIFAAISATRPRADVAYCIHALARRLSKTHNWAVALKTLVVIHRALREVDPTFH

Query:  EELINYGRRRNHMLNLSHFKDDSSANAWDYSAWVRSYALFLEERLECFRVLKYDVETDRARTKDLDTAELLEQLPALQELLYRVLGCQPQGAAVHNFVIQ
        +ELI+YGR   HML+LS+FKDDSSA AWDYSAWVR+YAL+LEERLE FRVLKYDVETD  RT+DL+T  LLEQLPALQ+LL+R+LGCQPQG++ +N +IQ
Subjt:  EELINYGRRRNHMLNLSHFKDDSSANAWDYSAWVRSYALFLEERLECFRVLKYDVETDRARTKDLDTAELLEQLPALQELLYRVLGCQPQGAAVHNFVIQ

Query:  LALSLVASESVKIYQAISDGTINLVDKFFEMRRQDALKALDIYRRAGQQAERLSEFYEVCKSLDIGRGEKFIKIEQPPASFIQAMEEYVREAPQVSTVRK
         A S+VA ESV+I+ AI+DG +NLVDKFF+M+R DA++ALD+++RA  QA +LSEFYEVCK++ IGRGE+F+KIEQPP SF+QAMEEYVR+AP  S  ++
Subjt:  LALSLVASESVKIYQAISDGTINLVDKFFEMRRQDALKALDIYRRAGQQAERLSEFYEVCKSLDIGRGEKFIKIEQPPASFIQAMEEYVREAPQVSTVRK

Query:  EQVADNKLAAPKEVLAIEYKKEPAAQVEQTVAPPPAP----SPPPP--EPVKVEAVVTEQPDLLGLNDPVPEATSNLDEKNSLALAIVPVADQQTSSAPS
         Q           V AIEY+++P  +   + AP P P    S P P  EP K  + + E  DLLG+N+P P+ TS +D+KN+LALAIV   +   ++AP 
Subjt:  EQVADNKLAAPKEVLAIEYKKEPAAQVEQTVAPPPAP----SPPPP--EPVKVEAVVTEQPDLLGLNDPVPEATSNLDEKNSLALAIVPVADQQTSSAPS

Query:  QVNGTTTTGWELALVTAPSSNESVAATSKLAGGLDLLTLDSLYDDAIRRNNQNVSYNPWEPVPVHGAMM-QQPIHDPFFASTAVAAPHSVQMAAMANQQQ
              T+ WELALVT PSSN +   +SKLAGGLDLLTLDSLY++A R+  QN SYNPWE  P  G MM QQP+H+PF+AS A+A P +VQMAAMA QQQ
Subjt:  QVNGTTTTGWELALVTAPSSNESVAATSKLAGGLDLLTLDSLYDDAIRRNNQNVSYNPWEPVPVHGAMM-QQPIHDPFFASTAVAAPHSVQMAAMANQQQ

Query:  AYMLQQQQQQQMMMMAPPPQQSNPFGNPHGTN
        A+ML  QQQQ MMM         P+  P  TN
Subjt:  AYMLQQQQQQQMMMMAPPPQQSNPFGNPHGTN

A0A6J1KQ10 N-alpha-acetyltransferase 35, NatC auxiliary subunit isoform X20.0e+0087.2Show/hide
Query:  MADTEPEEGTMATHAHQTASPIPSGEHTVWADVSPLLEAACGDLQDGELIHGDNFNLFSAMSALEIMDPKMDSGMICKYYSVDEAIENGAAPIPLSFDRT
        MAD E EE T+  HA Q +SPIPSGEHTVWADVSPLLEAACGDLQDGELIHG+NFNLFSAMSALEIMDPKMDSGMICKYYSVDEAIENGAAPIPLSFD T
Subjt:  MADTEPEEGTMATHAHQTASPIPSGEHTVWADVSPLLEAACGDLQDGELIHGDNFNLFSAMSALEIMDPKMDSGMICKYYSVDEAIENGAAPIPLSFDRT

Query:  VDVQCTIDIMDHLLSCEATWHKGHSLAQTVFSCIYLLRPDRTSSHALLHSYCSVIRATCKAVIAVVSDARTHEEEDLFIMAYGLPLSGDGDDKCLSMLHA
        VDVQCTIDIMDHLLSCEATWHKGHSLAQTVFSCIYLLRPDRTSSHALLHSYCSVIRATCKAVIAVVSDARTHEEEDLFIMAYGLPLSGDGD+KCLSML+A
Subjt:  VDVQCTIDIMDHLLSCEATWHKGHSLAQTVFSCIYLLRPDRTSSHALLHSYCSVIRATCKAVIAVVSDARTHEEEDLFIMAYGLPLSGDGDDKCLSMLHA

Query:  VEETICRQLRA-------YVEPLQSTFDLEEHYCRVLLCRLRFRKHFYHVLACMRRPQGRGLDLARKHIASCLLELDLIRNSSTFLSNNSFGISKEDLED
        VEETICRQLRA         EPLQS FDLEE YCR LLCRLRFRKHFYHVL CMRRPQGRGL+LARKHIASCLLELDLIRNSSTFLSNNSFG+SK+ LED
Subjt:  VEETICRQLRA-------YVEPLQSTFDLEEHYCRVLLCRLRFRKHFYHVLACMRRPQGRGLDLARKHIASCLLELDLIRNSSTFLSNNSFGISKEDLED

Query:  MTTASGRQPLGFDSSLNCRLSAPTPPRAIKLLSWKKALDYFVKLLHDLDNICSYSLDPFLEGVFRFVVQFQKTQPDLVARSLLQFLLVQDGKLYGKDPLY
        MTTASGRQPLGFDSSLNCRLSAPTPPRAIKLLSWKKA+DYFVKLLHDLDNICSYSLDP LEGVFRFVVQFQKTQPDLVARSLLQFLLVQDGKLYG+DPLY
Subjt:  MTTASGRQPLGFDSSLNCRLSAPTPPRAIKLLSWKKALDYFVKLLHDLDNICSYSLDPFLEGVFRFVVQFQKTQPDLVARSLLQFLLVQDGKLYGKDPLY

Query:  AVITKAAGLPESAKNHENQKNQYIVQLGQLVINLLKVLCTNSAWQRRKLGKILQDWRVIYMQMEMAFKKDFAEILSISNGENAWMKIFQHILVWVEEQTY
        AVITKAAGLPESAKNHENQKNQYIVQLGQLVINLLKVLCTNSAWQRRKLGKILQDWRVIYMQMEMAFK+DF+EILSISNGENAWMKIFQHILVWVEEQTY
Subjt:  AVITKAAGLPESAKNHENQKNQYIVQLGQLVINLLKVLCTNSAWQRRKLGKILQDWRVIYMQMEMAFKKDFAEILSISNGENAWMKIFQHILVWVEEQTY

Query:  WISSRFLVLGFELELYSPSEYCMVYWYLSVVLIKLVEKIHLRALMSNETGECANLAAMLGKRKGKKKGA-SKDIGKDFRIPPAVSFLQCQICLAEGLVVM
        WIS+RFLVLGFELELYSPSEYCMVYWYLSVVLIKLVEKIHLRALMSNET          GKRKGKKKGA SKDIGKDFRIPPAVSFLQCQICLAEGL++M
Subjt:  WISSRFLVLGFELELYSPSEYCMVYWYLSVVLIKLVEKIHLRALMSNETGECANLAAMLGKRKGKKKGA-SKDIGKDFRIPPAVSFLQCQICLAEGLVVM

Query:  LAALRNEHMIAQSPSPFNSEYE---------------------------------NLVTYNCFKDAQKIAKELKSSFSNDREKLVELRRIEQVAEHNSVA
        LAALRNEHMI QSPSPFNSEYE                                 NLVTYNCFKDAQKIAKE+KSSFSND EKL ELRRIE VAEHNSVA
Subjt:  LAALRNEHMIAQSPSPFNSEYE---------------------------------NLVTYNCFKDAQKIAKELKSSFSNDREKLVELRRIEQVAEHNSVA

Query:  LNLIHKLKDLE
        LNLIHK+  L+
Subjt:  LNLIHKLKDLE

SwissProt top hitse value%identityAlignment
P94017 Putative clathrin assembly protein At1g149103.6e-12249.36Show/hide
Query:  GTQNSFRKALGALKDTTTVSLAKVNSDYKELDIAIVKSTNHVERPAKEKHIRAIFAAISATRPRADVAYCIHALARRLSKTHNWAVALKTLVVIHRALRE
        GT  S+R+A GALKDTT V L +VNSDY ELD+AIVK+TNHVE P K++H+R IF A SA RPRADVAYCIHAL+RRL KT NW VALK L+VIHR LR+
Subjt:  GTQNSFRKALGALKDTTTVSLAKVNSDYKELDIAIVKSTNHVERPAKEKHIRAIFAAISATRPRADVAYCIHALARRLSKTHNWAVALKTLVVIHRALRE

Query:  VDPTFHEELINYGRRRNHMLNLSHFKDDSSANAWDYSAWVRSYALFLEERLECFRVLKYDVETDR------------ARTKDLDTAELLEQLPALQELLY
         DPTF EEL+N+  ++  ++ +S+FKDDSS  AWD S WVR+YALFLEERLECFRVLKYD+E +R            ++T+DLD  +LLEQLPALQ+LL+
Subjt:  VDPTFHEELINYGRRRNHMLNLSHFKDDSSANAWDYSAWVRSYALFLEERLECFRVLKYDVETDR------------ARTKDLDTAELLEQLPALQELLY

Query:  RVLGCQPQGAAVHNFVIQLALSLVASESVKIYQAISDGTINLVDKFFEMRRQDALKALDIYRRAGQQAERLSEFYEVCKSLDIGRGEKFIKIEQPPASFI
        R++GC+P+GAA HN +IQ ALSLV  ES K+Y AI++G INLV+KFFEM R +A+KAL+IY+RAG QA  LS FYEVCK L++ R  +F  + +PP SF+
Subjt:  RVLGCQPQGAAVHNFVIQLALSLVASESVKIYQAISDGTINLVDKFFEMRRQDALKALDIYRRAGQQAERLSEFYEVCKSLDIGRGEKFIKIEQPPASFI

Query:  QAMEEYVREAPQVSTVRK-----EQVADNKLAAPKEVLAIEYKKEPAAQVEQTVAPPPAP--SPPPPEPVKVEAVVTEQPDLLGLNDPVPEATSNLDEKN
          MEEY+R+APQ+  V           D+ L + ++V     + E ++  +  V P      S   P  V+      +  DLLGL+D  P+  + LD+ N
Subjt:  QAMEEYVREAPQVSTVRK-----EQVADNKLAAPKEVLAIEYKKEPAAQVEQTVAPPPAP--SPPPPEPVKVEAVVTEQPDLLGLNDPVPEATSNLDEKN

Query:  SLALAIVPVADQQTSSAPSQVNGTTTTGWELALVTAPSSNESVAATSKLAGGLDLLTLDSLYDDAIRRNNQNVSYNPWEPVPVHGAMMQQP--IHDPFFA
        +LALA+V      +  +  Q      +GWELALVT PS++ S A   +LAGGLD LTL+SLYDD   R  Q          P +G     P  + D F  
Subjt:  SLALAIVPVADQQTSSAPSQVNGTTTTGWELALVTAPSSNESVAATSKLAGGLDLLTLDSLYDDAIRRNNQNVSYNPWEPVPVHGAMMQQP--IHDPFFA

Query:  STAVAAPHSVQMAAMANQQQAYMLQQQQQQQMMMMAPPPQQSNPFGN
        S +V+ P +V        +  Y   QQ+QQ  + +AP P  +NPFG+
Subjt:  STAVAAPHSVQMAAMANQQQAYMLQQQQQQQMMMMAPPPQQSNPFGN

Q8LBH2 Putative clathrin assembly protein At2g016006.5e-13251.43Show/hide
Query:  GTQNSFRKALGALKDTTTVSLAKVNSDYKELDIAIVKSTNHVERPAKEKHIRAIFAAISATRPRADVAYCIHALARRLSKTHNWAVALKTLVVIHRALRE
        GT  S+RKA GALKD+T V L +VNS+Y +LD+AIVK+TNHVE P K++H+R IFAA S TR RADVAYCIHAL+RRL KT NW VALKTL+VIHR LRE
Subjt:  GTQNSFRKALGALKDTTTVSLAKVNSDYKELDIAIVKSTNHVERPAKEKHIRAIFAAISATRPRADVAYCIHALARRLSKTHNWAVALKTLVVIHRALRE

Query:  VDPTFHEELINYGRRRNHMLNLSHFKDDSSANAWDYSAWVRSYALFLEERLECFRVLKYDVETDR------------ARTKDLDTAELLEQLPALQELLY
         DPTF EEL+N+  +R  +L LS+FKDDSS  AWD SAWVR+YALFLEERLECFRVLKYD E +R            +RT+DLD  ELLEQLPALQ+LLY
Subjt:  VDPTFHEELINYGRRRNHMLNLSHFKDDSSANAWDYSAWVRSYALFLEERLECFRVLKYDVETDR------------ARTKDLDTAELLEQLPALQELLY

Query:  RVLGCQPQGAAVHNFVIQLALSLVASESVKIYQAISDGTINLVDKFFEMRRQDALKALDIYRRAGQQAERLSEFYEVCKSLDIGRGEKFIKIEQPPASFI
        R++GC+P+GAA HN VIQ AL+LV  ES K+Y AI+DG INL+DKFFEM + +A+ +L+IY+RAGQQA  LS+FYE CK L++ R  +F  + +PP SF+
Subjt:  RVLGCQPQGAAVHNFVIQLALSLVASESVKIYQAISDGTINLVDKFFEMRRQDALKALDIYRRAGQQAERLSEFYEVCKSLDIGRGEKFIKIEQPPASFI

Query:  QAMEEYVREAPQVSTVRKEQVADNKLAAPKEVLAIEYKKEPAAQVEQT-----------------VAPPPAPSPPPPEPVKVEAVVTEQPDLLGLNDPVP
          MEEY++EAP+V  V            P E L + Y+ +     E T                 V+    PSPPPP     +  + +  DL GLN   P
Subjt:  QAMEEYVREAPQVSTVRKEQVADNKLAAPKEVLAIEYKKEPAAQVEQT-----------------VAPPPAPSPPPPEPVKVEAVVTEQPDLLGLNDPVP

Query:  EATSNLDEKNSLALAIVPVADQQTSSAPSQVNGTTTTGWELALVTAPSSNESVAATSKLAGGLDLLTLDSLYDDAIRRNNQNVSYNPWEPVPVHGAMMQQ
        + TS ++++N+LALAIV       +    Q N    TGWELALVTAPSS+ S +   KLAGGLD LTL SLYDD     +Q          PV+GA    
Subjt:  EATSNLDEKNSLALAIVPVADQQTSSAPSQVNGTTTTGWELALVTAPSSNESVAATSKLAGGLDLLTLDSLYDDAIRRNNQNVSYNPWEPVPVHGAMMQQ

Query:  PI--HDPFFASTAVAAPHSVQMAAMANQQQAYMLQQQQQQQ---MMMMAPPPQQSNPFGN
        P   HDPF +S   A P   Q  A+ N   AY    Q Q Q        PP   SNPFG+
Subjt:  PI--HDPFFASTAVAAPHSVQMAAMANQQQAYMLQQQQQQQ---MMMMAPPPQQSNPFGN

Q8VYT2 Putative clathrin assembly protein At4g259403.3e-12849.33Show/hide
Query:  TQNSFRKALGALKDTTTVSLAKVNSDYKELDIAIVKSTNHVERPAKEKHIRAIFAAISATRPRADVAYCIHALARRLSKTHNWAVALKTLVVIHRALREV
        T NSFRKA+GA+KD+TTVS+AKVNS++K+LD+AIVK+TNHVE   KE+HIR IF+A S  +PRADVAYCIHALA+RLSKT NW VA+K L+VIHR LRE 
Subjt:  TQNSFRKALGALKDTTTVSLAKVNSDYKELDIAIVKSTNHVERPAKEKHIRAIFAAISATRPRADVAYCIHALARRLSKTHNWAVALKTLVVIHRALREV

Query:  DPTFHEELINYGRRRNHMLNLSHFKDDSSANAWDYSAWVRSYALFLEERLECFRVLKYDVETDRA-------------------RTKDLDTAELLEQLPA
        DPTF EEL+NY   R H+L +S+FKDD+S  AWD SAW+R+YALFLEERLEC+RVLKYD+E +R                    RT+ L   ELLEQLPA
Subjt:  DPTFHEELINYGRRRNHMLNLSHFKDDSSANAWDYSAWVRSYALFLEERLECFRVLKYDVETDRA-------------------RTKDLDTAELLEQLPA

Query:  LQELLYRVLGCQPQGAAVHNFVIQLALSLVASESVKIYQAISDGTINLVDKFFEMRRQDALKALDIYRRAGQQAERLSEFYEVCKSLDIGRGEKFIKIEQ
        LQ+LLYR++GCQP+G+A  N++IQ AL+LV  ES KIY AI+DG INLVD FFEM R DA+KAL+IY+RAGQQAE L++FYE CK L++ R  +F  + Q
Subjt:  LQELLYRVLGCQPQGAAVHNFVIQLALSLVASESVKIYQAISDGTINLVDKFFEMRRQDALKALDIYRRAGQQAERLSEFYEVCKSLDIGRGEKFIKIEQ

Query:  PPASFIQAMEEYVREAPQVSTVRKEQVADNKLAAPKEVLAIEYK---KEPAA---QVEQTVAPPPAPSPPPPEPVKVE------AVVTEQPDLLGLNDPV
        PP SF+  ME+Y++EAPQ  +V+K+     K    +E    E+    +EPA    Q E +    P       +  K+E      + + +  DLLGLN+  
Subjt:  PPASFIQAMEEYVREAPQVSTVRKEQVADNKLAAPKEVLAIEYK---KEPAA---QVEQTVAPPPAPSPPPPEPVKVE------AVVTEQPDLLGLNDPV

Query:  PEATSNLDEKNSLALAIVPVADQQTSSAPSQVNG---TTTTGWELALVTAPSSNES----VAATSKLAGGLDLLTLDSLY-DDAIRRNNQ--NVSYN---
        P+A + ++++N+LALAI P   +  +  PS +     T  +GWELALVT  ++N +     A  +KLAGG D L LDSLY DD+ RR  Q  N  Y    
Subjt:  PEATSNLDEKNSLALAIVPVADQQTSSAPSQVNG---TTTTGWELALVTAPSSNES----VAATSKLAGGLDLLTLDSLY-DDAIRRNNQ--NVSYN---

Query:  -PWEPVPVHGAMMQQPIHDPFFASTAVAAPHSVQMAAMANQQQAYMLQQQQQQQMMMMAPPPQQSNPFGNPHG--TNAYHYGPGMPVHASNPYG
              P +   MQQ   DPF  S  +A P +VQMA          +QQQQQQQM MM   P     + +PH    N +H+        SNP+G
Subjt:  -PWEPVPVHGAMMQQPIHDPFFASTAVAAPHSVQMAAMANQQQAYMLQQQQQQQMMMMAPPPQQSNPFGNPHG--TNAYHYGPGMPVHASNPYG

Q9LHS0 Putative clathrin assembly protein At5g352001.5e-18967.68Show/hide
Query:  GGGTQNSFRKALGALKDTTTVSLAKVNSDYKELDIAIVKSTNHVERPAKEKHIRAIFAAISATRPRADVAYCIHALARRLSKTHNWAVALKTLVVIHRAL
        GGG+Q+S R+ LGA+KDTTTVSLAKVNSDYKELDIAIVK+TNHVERP+KE++IRAIF AISATRPRADVAYCIHALARRLS+THNWAVALKTL+VIHRAL
Subjt:  GGGTQNSFRKALGALKDTTTVSLAKVNSDYKELDIAIVKSTNHVERPAKEKHIRAIFAAISATRPRADVAYCIHALARRLSKTHNWAVALKTLVVIHRAL

Query:  REVDPTFHEELINYGRRRNHMLNLSHFKDDSSANAWDYSAWVRSYALFLEERLECFRVLKYDVETDRARTKDLDTAELLEQLPALQELLYRVLGCQPQGA
        REVD TFHEE+INY R R+HMLN+SHFKDDS  NAW YSAWVR YALFLEERLECFRVLKYDVE D  RTKDLDT +LLEQLPALQELL+RVL CQP+GA
Subjt:  REVDPTFHEELINYGRRRNHMLNLSHFKDDSSANAWDYSAWVRSYALFLEERLECFRVLKYDVETDRARTKDLDTAELLEQLPALQELLYRVLGCQPQGA

Query:  AVHNFVIQLALSLVASESVKIYQAISDGTINLVDKFFEMRRQDALKALDIYRRAGQQAERLSEFYEVCKSLDIGRGEKFIKIEQPPASFIQAMEEYVREA
        AV N +IQLALS+V SES KIYQA++DG  NLVDKFF+M+R DA+KALD+YRRA +QA RLSEF+EVCKS+++GRGE+FIKIEQPP SF+QAMEEYV+EA
Subjt:  AVHNFVIQLALSLVASESVKIYQAISDGTINLVDKFFEMRRQDALKALDIYRRAGQQAERLSEFYEVCKSLDIGRGEKFIKIEQPPASFIQAMEEYVREA

Query:  PQVSTVRKEQVADNKLAAPKEVLAIEYKKEPAAQVEQTVAPPPAPSPPPPEPVKVEA--VVTEQPDLLGLNDPVPEATSNLDEKNSLALAIVPVADQQTS
        P  + V+KEQV + KL APKE+LAIEY+  P    E+         P  PEPVK EA   V +QPDLL ++DP P   S L+EKN+LALAIVPV+ +Q  
Subjt:  PQVSTVRKEQVADNKLAAPKEVLAIEYKKEPAAQVEQTVAPPPAPSPPPPEPVKVEA--VVTEQPDLLGLNDPVPEATSNLDEKNSLALAIVPVADQQTS

Query:  SAPSQVNGTTTTGWELALVTAPSSNESVAATSKLAGGLDLLTLDSLYDDAIR-RNNQNVSYNPWEPVPVHGAMMQQPIHDPFFASTAVAAPHSVQMAAMA
        S     NG  +TGWELALVTAPSSNE  AA SKLAGGLD LTLDSLY+DAIR    QN SYNPWE  PVH   M   +H PF+AS  VAAP   QMA   
Subjt:  SAPSQVNGTTTTGWELALVTAPSSNESVAATSKLAGGLDLLTLDSLYDDAIR-RNNQNVSYNPWEPVPVHGAMMQQPIHDPFFASTAVAAPHSVQMAAMA

Query:  NQQQAYMLQQQQQQQMMMMAP---PPQQ-----SNPFGNPHGTNAYHYGPGMPVHASNPY
        +Q   Y    Q Q   MMM P   P QQ     +NPFGNP  +N     P  P    NPY
Subjt:  NQQQAYMLQQQQQQQMMMMAP---PPQQ-----SNPFGNPHGTNAYHYGPGMPVHASNPY

Q9LVD8 Putative clathrin assembly protein At5g572009.1e-12650.25Show/hide
Query:  GTQNSFRKALGALKDTTTVSLAKVNSDYKELDIAIVKSTNHVERPAKEKHIRAIFAAISATRPRADVAYCIHALARRLSKTHNWAVALKTLVVIHRALRE
        GT  SFRKA GALKDTTTV LAKVNS++K+LDIAIVK+TNHVE P KE+H+R IF+A S  +PRADVAYCIHAL++RLSKT NW VA+K L+VIHR LRE
Subjt:  GTQNSFRKALGALKDTTTVSLAKVNSDYKELDIAIVKSTNHVERPAKEKHIRAIFAAISATRPRADVAYCIHALARRLSKTHNWAVALKTLVVIHRALRE

Query:  VDPTFHEELINYGRRRNHMLNLSHFKDDSSANAWDYSAWVRSYALFLEERLECFRVLKYDVETDRA-----------RTKDLDTAELLEQLPALQELLYR
         DPTF EEL+NY  RR H+L +S+FKDD+S  AWD SAWVR+YALFLEERLEC+RVLKYD+E +R            RT+ L   +LLEQLPALQ+LLYR
Subjt:  VDPTFHEELINYGRRRNHMLNLSHFKDDSSANAWDYSAWVRSYALFLEERLECFRVLKYDVETDRA-----------RTKDLDTAELLEQLPALQELLYR

Query:  VLGCQPQGAAVHNFVIQLALSLVASESVKIYQAISDGTINLVDKFFEMRRQDALKALDIYRRAGQQAERLSEFYEVCKSLDIGRGEKFIKIEQPPASFIQ
        ++GCQP+GAA  N++IQ AL+LV  ES KIY AI+DG INLVD FFEM R DA+KAL+IY+RAGQQAE L+EFY+ CK L++ R  +F  + QPP SF+ 
Subjt:  VLGCQPQGAAVHNFVIQLALSLVASESVKIYQAISDGTINLVDKFFEMRRQDALKALDIYRRAGQQAERLSEFYEVCKSLDIGRGEKFIKIEQPPASFIQ

Query:  AMEEYVREAPQVSTVRKEQVADNKLAAPKEVLAIEYKKEPAA---QVEQTVAPPP---APSPPPPEPVKVE----AVVTEQPDLLGLNDPVPEATSNLDE
         MEEY++EAPQ  +V+K+     K    +E    E  +EPA    Q E T    P        P E ++VE    + + +  DLLGL++  P+A + +++
Subjt:  AMEEYVREAPQVSTVRKEQVADNKLAAPKEVLAIEYKKEPAA---QVEQTVAPPP---APSPPPPEPVKVE----AVVTEQPDLLGLNDPVPEATSNLDE

Query:  KNSLALAIVPVADQQTSSAPS---QVNGTTTTGWELALVTAPSSNES------VAATSKLAGGLDLLTLDSLY-DDAIRRNNQ--NVSYNPWEPVPVHGA
         N+ +LAI P   +  +SAPS    +     +GWELALVT  ++N +      V AT KL GG D L LDSLY DD  RR  Q  N  Y  +    + GA
Subjt:  KNSLALAIVPVADQQTSSAPS---QVNGTTTTGWELALVTAPSSNES------VAATSKLAGGLDLLTLDSLY-DDAIRRNNQ--NVSYNPWEPVPVHGA

Query:  MMQ---QPI---HDPFFASTAVAAPHSVQMAAMANQQQAYMLQQQQQQQMMMMAPPPQQSNPFGNPHGTNAYH-YGPGMPVHAS-NPYG
        +      P     DPF  S  +A P +VQMA              QQQQMMMM       +P+ N +    +H + P     +S NP+G
Subjt:  MMQ---QPI---HDPFFASTAVAAPHSVQMAAMANQQQAYMLQQQQQQQMMMMAPPPQQSNPFGNPHGTNAYH-YGPGMPVHAS-NPYG

Arabidopsis top hitse value%identityAlignment
AT2G01600.1 ENTH/ANTH/VHS superfamily protein4.6e-13351.43Show/hide
Query:  GTQNSFRKALGALKDTTTVSLAKVNSDYKELDIAIVKSTNHVERPAKEKHIRAIFAAISATRPRADVAYCIHALARRLSKTHNWAVALKTLVVIHRALRE
        GT  S+RKA GALKD+T V L +VNS+Y +LD+AIVK+TNHVE P K++H+R IFAA S TR RADVAYCIHAL+RRL KT NW VALKTL+VIHR LRE
Subjt:  GTQNSFRKALGALKDTTTVSLAKVNSDYKELDIAIVKSTNHVERPAKEKHIRAIFAAISATRPRADVAYCIHALARRLSKTHNWAVALKTLVVIHRALRE

Query:  VDPTFHEELINYGRRRNHMLNLSHFKDDSSANAWDYSAWVRSYALFLEERLECFRVLKYDVETDR------------ARTKDLDTAELLEQLPALQELLY
         DPTF EEL+N+  +R  +L LS+FKDDSS  AWD SAWVR+YALFLEERLECFRVLKYD E +R            +RT+DLD  ELLEQLPALQ+LLY
Subjt:  VDPTFHEELINYGRRRNHMLNLSHFKDDSSANAWDYSAWVRSYALFLEERLECFRVLKYDVETDR------------ARTKDLDTAELLEQLPALQELLY

Query:  RVLGCQPQGAAVHNFVIQLALSLVASESVKIYQAISDGTINLVDKFFEMRRQDALKALDIYRRAGQQAERLSEFYEVCKSLDIGRGEKFIKIEQPPASFI
        R++GC+P+GAA HN VIQ AL+LV  ES K+Y AI+DG INL+DKFFEM + +A+ +L+IY+RAGQQA  LS+FYE CK L++ R  +F  + +PP SF+
Subjt:  RVLGCQPQGAAVHNFVIQLALSLVASESVKIYQAISDGTINLVDKFFEMRRQDALKALDIYRRAGQQAERLSEFYEVCKSLDIGRGEKFIKIEQPPASFI

Query:  QAMEEYVREAPQVSTVRKEQVADNKLAAPKEVLAIEYKKEPAAQVEQT-----------------VAPPPAPSPPPPEPVKVEAVVTEQPDLLGLNDPVP
          MEEY++EAP+V  V            P E L + Y+ +     E T                 V+    PSPPPP     +  + +  DL GLN   P
Subjt:  QAMEEYVREAPQVSTVRKEQVADNKLAAPKEVLAIEYKKEPAAQVEQT-----------------VAPPPAPSPPPPEPVKVEAVVTEQPDLLGLNDPVP

Query:  EATSNLDEKNSLALAIVPVADQQTSSAPSQVNGTTTTGWELALVTAPSSNESVAATSKLAGGLDLLTLDSLYDDAIRRNNQNVSYNPWEPVPVHGAMMQQ
        + TS ++++N+LALAIV       +    Q N    TGWELALVTAPSS+ S +   KLAGGLD LTL SLYDD     +Q          PV+GA    
Subjt:  EATSNLDEKNSLALAIVPVADQQTSSAPSQVNGTTTTGWELALVTAPSSNESVAATSKLAGGLDLLTLDSLYDDAIRRNNQNVSYNPWEPVPVHGAMMQQ

Query:  PI--HDPFFASTAVAAPHSVQMAAMANQQQAYMLQQQQQQQ---MMMMAPPPQQSNPFGN
        P   HDPF +S   A P   Q  A+ N   AY    Q Q Q        PP   SNPFG+
Subjt:  PI--HDPFFASTAVAAPHSVQMAAMANQQQAYMLQQQQQQQ---MMMMAPPPQQSNPFGN

AT2G11000.1 MAK10 homologue1.7e-22056.41Show/hide
Query:  HAHQTASPIPSGE-HTVWADVSPLLEAACGDLQDGELIHGDNFNLFSAMSALEIMDPKMDSGMICKYYSVDEAIENGAAPIPLSFDRTVDVQCTIDIMDH
        H  +T+S IPSG+ ++VWADVSPLL AAC DLQ+GELI+GDNFNLF+AMSALEIMDPKMDSGM+  +YS+DEAIE+G AP+P+S D TV+VQ  IDIMDH
Subjt:  HAHQTASPIPSGE-HTVWADVSPLLEAACGDLQDGELIHGDNFNLFSAMSALEIMDPKMDSGMICKYYSVDEAIENGAAPIPLSFDRTVDVQCTIDIMDH

Query:  LLSCEATWHKGHSLAQTVFSCIYLLRPDRTSSHALLHSYCSVIRATCKAVIAVVSDARTHEEEDLFIMAYGLPLSGDGDDKCLSMLHAVEETICRQLR--
        LL+CEATWH GHSLAQTVFSCIY+LRP+RTSS ALLHSYC VIRATC+AV++VVSDART+EEEDLF M YGLP SG+ D K L +L+AVEETICRQLR  
Subjt:  LLSCEATWHKGHSLAQTVFSCIYLLRPDRTSSHALLHSYCSVIRATCKAVIAVVSDARTHEEEDLFIMAYGLPLSGDGDDKCLSMLHAVEETICRQLR--

Query:  ----------AYVEPLQSTFDLEEHYCRVLLCRLRFRKHFYHVLACMRRPQGRGLDLARKHIASCLLELDLIRNSSTFLSNNSFGISKEDLEDMTTASGR
                  A +EPLQS   LEE +C+ LLCR+RFRKHF H L CMRRPQGRGL+LARKHI  C+ ELD + +S+ FL  + F     ++E+ TTASGR
Subjt:  ----------AYVEPLQSTFDLEEHYCRVLLCRLRFRKHFYHVLACMRRPQGRGLDLARKHIASCLLELDLIRNSSTFLSNNSFGISKEDLEDMTTASGR

Query:  QPLGFDSSLNCRLSAPTPPRAIKLLSWKKALDYFVKLLHDLDNICSYSLDPFLEGVFRFVVQFQKTQPDLVARSLLQFLLVQDGKLYGKDPLYAVITKAA
         P+GFD +LN RLSAPTPPRAIKLLSWKKA+DY+VKLLH+LD IC++SL+P LE V  FV+QFQK++PDLVAR+ LQ LLVQDGKLYG+D    +  ++ 
Subjt:  QPLGFDSSLNCRLSAPTPPRAIKLLSWKKALDYFVKLLHDLDNICSYSLDPFLEGVFRFVVQFQKTQPDLVARSLLQFLLVQDGKLYGKDPLYAVITKAA

Query:  GLPESAKNHENQKNQYIVQLGQLVINLLKVLCTNSAWQRRKLGKILQDWRVIYMQMEMAFKKDFAEILSIS--NGENAWMKIFQHILVWVEEQTYWISSR
         L + +KNH    N+YI+QL QL++NLLK+LC N+ WQRRKLGKIL DW V ++QM +A  +   +  S S  NG+ + + I  HI   +EEQ  W++ R
Subjt:  GLPESAKNHENQKNQYIVQLGQLVINLLKVLCTNSAWQRRKLGKILQDWRVIYMQMEMAFKKDFAEILSIS--NGENAWMKIFQHILVWVEEQTYWISSR

Query:  FLVLGFELELYSPSEYCMVYWYLSVVLIKLVEKIHLRALMSNETGECANLAAMLGKRKGKK-KGASKDIGKDFRIPPAVSFLQCQICLAEGLVVMLAALR
        FL+LGF+L+LYSPSEYCMVYWY+ ++L KL E+   R L+   T E          RK K+ K  S+D+ ++ RI   V FL+CQ CLA+GL VM+AALR
Subjt:  FLVLGFELELYSPSEYCMVYWYLSVVLIKLVEKIHLRALMSNETGECANLAAMLGKRKGKK-KGASKDIGKDFRIPPAVSFLQCQICLAEGLVVMLAALR

Query:  NEHMIAQSPSPFNSEYENLV---------------------------------TYNCFKDAQKIAKELKSSFSNDREKLVELRRIEQVAEHNSVALNLIH
        NE M  +S  PFN+E E  +                                  Y  F DAQKIAK++K  ++ND +KL E+  +E+VAE N VA+NL  
Subjt:  NEHMIAQSPSPFNSEYENLV---------------------------------TYNCFKDAQKIAKELKSSFSNDREKLVELRRIEQVAEHNSVALNLIH

Query:  KLKDLE-AFSLLVCPVFA
        + + L+ +F     P FA
Subjt:  KLKDLE-AFSLLVCPVFA

AT2G11000.2 MAK10 homologue1.8e-20954.74Show/hide
Query:  HAHQTASPIPSGE-HTVWADVSPLLEAACGDLQDGELIHGDNFNLFSAMSALEIMDPKMDSGMICKYYSVDEAIENGAAPIPLSFDRTVDVQCTIDIMDH
        H  +T+S IPSG+ ++VWADVSPLL AAC DLQ+GELI+GDNFNLF+AMSALEIMDPKMDSGM+  +YS+DEAIE+G AP+P+S D TV+VQ  IDIMDH
Subjt:  HAHQTASPIPSGE-HTVWADVSPLLEAACGDLQDGELIHGDNFNLFSAMSALEIMDPKMDSGMICKYYSVDEAIENGAAPIPLSFDRTVDVQCTIDIMDH

Query:  LLSCEATWHKGHSLAQTVFSCIYLLRPDRTSSHALLHSYCSVIRATCKAVIAVVSDARTHEEEDLFIMAYGLPLSGDGDDKCLSMLHAVEETICRQLR--
        LL+CEATWH GHSLAQTVFSCIY+LRP+RTSS ALLHSYC VIRATC+AV++VVSDART+EEEDLF M YGLP SG+ D K L +L+AVEETICRQLR  
Subjt:  LLSCEATWHKGHSLAQTVFSCIYLLRPDRTSSHALLHSYCSVIRATCKAVIAVVSDARTHEEEDLFIMAYGLPLSGDGDDKCLSMLHAVEETICRQLR--

Query:  ----------AYVEPLQSTFDLEEHYCRVLLCRLRFRKHFYHVLACMRRPQGRGLDLARKHIASCLLELDLIRNSSTFLSNNSFGISKEDLEDMTTASGR
                  A +EPLQS   LEE +C+ LLCR+RFR               RGL+LARKHI  C+ ELD + +S+ FL  + F     ++E+ TTASGR
Subjt:  ----------AYVEPLQSTFDLEEHYCRVLLCRLRFRKHFYHVLACMRRPQGRGLDLARKHIASCLLELDLIRNSSTFLSNNSFGISKEDLEDMTTASGR

Query:  QPLGFDSSLNCRLSAPTPPRAIKLLSWKKALDYFVKLLHDLDNICSYSLDPFLEGVFRFVVQFQKTQPDLVARSLLQFLLVQDGKLYGKDPLYAVITKAA
         P+GFD +LN RLSAPTPPRAIKLLSWKKA+DY+VKLLH+LD IC++SL+P LE V  FV+QFQK++PDLVAR+ LQ LLVQDGKLYG+D    +  ++ 
Subjt:  QPLGFDSSLNCRLSAPTPPRAIKLLSWKKALDYFVKLLHDLDNICSYSLDPFLEGVFRFVVQFQKTQPDLVARSLLQFLLVQDGKLYGKDPLYAVITKAA

Query:  GLPESAKNHENQKNQYIVQLGQLVINLLKVLCTNSAWQRRKLGKILQDWRVIYMQMEMAFKKDFAEILSIS--NGENAWMKIFQHILVWVEEQTYWISSR
         L + +KNH    N+YI+QL QL++NLLK+LC N+ WQRRKLGKIL DW V ++QM +A  +   +  S S  NG+ + + I  HI   +EEQ  W++ R
Subjt:  GLPESAKNHENQKNQYIVQLGQLVINLLKVLCTNSAWQRRKLGKILQDWRVIYMQMEMAFKKDFAEILSIS--NGENAWMKIFQHILVWVEEQTYWISSR

Query:  FLVLGFELELYSPSEYCMVYWYLSVVLIKLVEKIHLRALMSNETGECANLAAMLGKRKGKK-KGASKDIGKDFRIPPAVSFLQCQICLAEGLVVMLAALR
        FL+LGF+L+LYSPSEYCMVYWY+ ++L KL E+   R L+   T E          RK K+ K  S+D+ ++ RI   V FL+CQ CLA+GL VM+AALR
Subjt:  FLVLGFELELYSPSEYCMVYWYLSVVLIKLVEKIHLRALMSNETGECANLAAMLGKRKGKK-KGASKDIGKDFRIPPAVSFLQCQICLAEGLVVMLAALR

Query:  NEHMIAQSPSPFNSEYENLV---------------------------------TYNCFKDAQKIAKELKSSFSNDREKLVELRRIEQVAEHNSVALNLIH
        NE M  +S  PFN+E E  +                                  Y  F DAQKIAK++K  ++ND +KL E+  +E+VAE N VA+NL  
Subjt:  NEHMIAQSPSPFNSEYENLV---------------------------------TYNCFKDAQKIAKELKSSFSNDREKLVELRRIEQVAEHNSVALNLIH

Query:  KLKDLE-AFSLLVCPVFA
        + + L+ +F     P FA
Subjt:  KLKDLE-AFSLLVCPVFA

AT4G25940.1 ENTH/ANTH/VHS superfamily protein2.4e-12949.33Show/hide
Query:  TQNSFRKALGALKDTTTVSLAKVNSDYKELDIAIVKSTNHVERPAKEKHIRAIFAAISATRPRADVAYCIHALARRLSKTHNWAVALKTLVVIHRALREV
        T NSFRKA+GA+KD+TTVS+AKVNS++K+LD+AIVK+TNHVE   KE+HIR IF+A S  +PRADVAYCIHALA+RLSKT NW VA+K L+VIHR LRE 
Subjt:  TQNSFRKALGALKDTTTVSLAKVNSDYKELDIAIVKSTNHVERPAKEKHIRAIFAAISATRPRADVAYCIHALARRLSKTHNWAVALKTLVVIHRALREV

Query:  DPTFHEELINYGRRRNHMLNLSHFKDDSSANAWDYSAWVRSYALFLEERLECFRVLKYDVETDRA-------------------RTKDLDTAELLEQLPA
        DPTF EEL+NY   R H+L +S+FKDD+S  AWD SAW+R+YALFLEERLEC+RVLKYD+E +R                    RT+ L   ELLEQLPA
Subjt:  DPTFHEELINYGRRRNHMLNLSHFKDDSSANAWDYSAWVRSYALFLEERLECFRVLKYDVETDRA-------------------RTKDLDTAELLEQLPA

Query:  LQELLYRVLGCQPQGAAVHNFVIQLALSLVASESVKIYQAISDGTINLVDKFFEMRRQDALKALDIYRRAGQQAERLSEFYEVCKSLDIGRGEKFIKIEQ
        LQ+LLYR++GCQP+G+A  N++IQ AL+LV  ES KIY AI+DG INLVD FFEM R DA+KAL+IY+RAGQQAE L++FYE CK L++ R  +F  + Q
Subjt:  LQELLYRVLGCQPQGAAVHNFVIQLALSLVASESVKIYQAISDGTINLVDKFFEMRRQDALKALDIYRRAGQQAERLSEFYEVCKSLDIGRGEKFIKIEQ

Query:  PPASFIQAMEEYVREAPQVSTVRKEQVADNKLAAPKEVLAIEYK---KEPAA---QVEQTVAPPPAPSPPPPEPVKVE------AVVTEQPDLLGLNDPV
        PP SF+  ME+Y++EAPQ  +V+K+     K    +E    E+    +EPA    Q E +    P       +  K+E      + + +  DLLGLN+  
Subjt:  PPASFIQAMEEYVREAPQVSTVRKEQVADNKLAAPKEVLAIEYK---KEPAA---QVEQTVAPPPAPSPPPPEPVKVE------AVVTEQPDLLGLNDPV

Query:  PEATSNLDEKNSLALAIVPVADQQTSSAPSQVNG---TTTTGWELALVTAPSSNES----VAATSKLAGGLDLLTLDSLY-DDAIRRNNQ--NVSYN---
        P+A + ++++N+LALAI P   +  +  PS +     T  +GWELALVT  ++N +     A  +KLAGG D L LDSLY DD+ RR  Q  N  Y    
Subjt:  PEATSNLDEKNSLALAIVPVADQQTSSAPSQVNG---TTTTGWELALVTAPSSNES----VAATSKLAGGLDLLTLDSLY-DDAIRRNNQ--NVSYN---

Query:  -PWEPVPVHGAMMQQPIHDPFFASTAVAAPHSVQMAAMANQQQAYMLQQQQQQQMMMMAPPPQQSNPFGNPHG--TNAYHYGPGMPVHASNPYG
              P +   MQQ   DPF  S  +A P +VQMA          +QQQQQQQM MM   P     + +PH    N +H+        SNP+G
Subjt:  -PWEPVPVHGAMMQQPIHDPFFASTAVAAPHSVQMAAMANQQQAYMLQQQQQQQMMMMAPPPQQSNPFGNPHG--TNAYHYGPGMPVHASNPYG

AT5G35200.1 ENTH/ANTH/VHS superfamily protein1.1e-19067.68Show/hide
Query:  GGGTQNSFRKALGALKDTTTVSLAKVNSDYKELDIAIVKSTNHVERPAKEKHIRAIFAAISATRPRADVAYCIHALARRLSKTHNWAVALKTLVVIHRAL
        GGG+Q+S R+ LGA+KDTTTVSLAKVNSDYKELDIAIVK+TNHVERP+KE++IRAIF AISATRPRADVAYCIHALARRLS+THNWAVALKTL+VIHRAL
Subjt:  GGGTQNSFRKALGALKDTTTVSLAKVNSDYKELDIAIVKSTNHVERPAKEKHIRAIFAAISATRPRADVAYCIHALARRLSKTHNWAVALKTLVVIHRAL

Query:  REVDPTFHEELINYGRRRNHMLNLSHFKDDSSANAWDYSAWVRSYALFLEERLECFRVLKYDVETDRARTKDLDTAELLEQLPALQELLYRVLGCQPQGA
        REVD TFHEE+INY R R+HMLN+SHFKDDS  NAW YSAWVR YALFLEERLECFRVLKYDVE D  RTKDLDT +LLEQLPALQELL+RVL CQP+GA
Subjt:  REVDPTFHEELINYGRRRNHMLNLSHFKDDSSANAWDYSAWVRSYALFLEERLECFRVLKYDVETDRARTKDLDTAELLEQLPALQELLYRVLGCQPQGA

Query:  AVHNFVIQLALSLVASESVKIYQAISDGTINLVDKFFEMRRQDALKALDIYRRAGQQAERLSEFYEVCKSLDIGRGEKFIKIEQPPASFIQAMEEYVREA
        AV N +IQLALS+V SES KIYQA++DG  NLVDKFF+M+R DA+KALD+YRRA +QA RLSEF+EVCKS+++GRGE+FIKIEQPP SF+QAMEEYV+EA
Subjt:  AVHNFVIQLALSLVASESVKIYQAISDGTINLVDKFFEMRRQDALKALDIYRRAGQQAERLSEFYEVCKSLDIGRGEKFIKIEQPPASFIQAMEEYVREA

Query:  PQVSTVRKEQVADNKLAAPKEVLAIEYKKEPAAQVEQTVAPPPAPSPPPPEPVKVEA--VVTEQPDLLGLNDPVPEATSNLDEKNSLALAIVPVADQQTS
        P  + V+KEQV + KL APKE+LAIEY+  P    E+         P  PEPVK EA   V +QPDLL ++DP P   S L+EKN+LALAIVPV+ +Q  
Subjt:  PQVSTVRKEQVADNKLAAPKEVLAIEYKKEPAAQVEQTVAPPPAPSPPPPEPVKVEA--VVTEQPDLLGLNDPVPEATSNLDEKNSLALAIVPVADQQTS

Query:  SAPSQVNGTTTTGWELALVTAPSSNESVAATSKLAGGLDLLTLDSLYDDAIR-RNNQNVSYNPWEPVPVHGAMMQQPIHDPFFASTAVAAPHSVQMAAMA
        S     NG  +TGWELALVTAPSSNE  AA SKLAGGLD LTLDSLY+DAIR    QN SYNPWE  PVH   M   +H PF+AS  VAAP   QMA   
Subjt:  SAPSQVNGTTTTGWELALVTAPSSNESVAATSKLAGGLDLLTLDSLYDDAIR-RNNQNVSYNPWEPVPVHGAMMQQPIHDPFFASTAVAAPHSVQMAAMA

Query:  NQQQAYMLQQQQQQQMMMMAP---PPQQ-----SNPFGNPHGTNAYHYGPGMPVHASNPY
        +Q   Y    Q Q   MMM P   P QQ     +NPFGNP  +N     P  P    NPY
Subjt:  NQQQAYMLQQQQQQQMMMMAP---PPQQ-----SNPFGNPHGTNAYHYGPGMPVHASNPY


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCCGATACAGAACCAGAAGAAGGAACTATGGCCACTCATGCACATCAAACTGCCTCTCCGATTCCTTCCGGCGAGCATACTGTTTGGGCCGATGTGTCGCCTCTTCT
CGAAGCTGCCTGCGGAGATCTTCAAGATGGAGAACTTATTCATGGGGATAATTTCAATCTTTTTTCTGCTATGTCTGCTTTAGAGATAATGGATCCAAAAATGGATTCGG
GCATGATATGTAAGTACTATTCTGTTGACGAAGCTATTGAGAATGGTGCCGCTCCAATTCCTCTGAGCTTTGATAGAACAGTTGATGTTCAATGCACTATTGATATCATG
GACCATCTTCTATCTTGTGAGGCAACATGGCACAAGGGTCACTCATTGGCACAGACTGTCTTCTCGTGCATTTATCTTCTGAGGCCGGATAGAACAAGTTCACATGCATT
ATTGCATTCCTATTGCAGTGTCATACGTGCAACTTGCAAGGCAGTTATTGCAGTTGTCTCAGATGCACGAACACATGAAGAAGAGGATCTTTTCATAATGGCATATGGTC
TTCCTTTGAGTGGGGATGGAGACGACAAATGCTTATCAATGCTACATGCTGTTGAAGAAACAATTTGCCGCCAATTGCGGGCTTATGTAGAGCCATTACAAAGTACTTTC
GATCTGGAAGAGCACTACTGTAGGGTTCTTTTATGTCGCCTGCGCTTCCGTAAGCATTTCTACCATGTCCTTGCGTGTATGAGGCGGCCTCAGGGCAGAGGTTTAGACTT
GGCAAGAAAACATATAGCTTCCTGTTTGTTGGAGCTAGACTTGATCCGCAATTCATCCACTTTTCTTAGCAATAATAGTTTTGGAATTAGCAAAGAGGATTTGGAGGATA
TGACAACTGCTTCAGGTCGTCAACCACTAGGATTTGATTCTAGTTTGAATTGTAGATTGTCTGCCCCTACTCCACCACGTGCAATCAAACTACTAAGTTGGAAAAAGGCA
TTAGATTATTTTGTGAAACTGCTTCATGACCTAGATAATATATGTTCGTATTCACTGGACCCATTTCTGGAAGGTGTTTTCCGCTTTGTGGTTCAGTTCCAGAAAACTCA
GCCTGATTTGGTTGCTAGATCTCTCCTTCAGTTCCTTCTGGTTCAAGATGGAAAGCTCTATGGGAAGGATCCTCTATATGCAGTGATTACCAAAGCTGCAGGATTGCCAG
AATCTGCCAAAAACCATGAAAATCAGAAGAACCAATACATTGTACAACTAGGACAGTTAGTGATAAATCTTCTCAAGGTACTCTGCACAAATTCTGCATGGCAAAGACGC
AAGCTTGGTAAGATATTACAGGACTGGCGTGTGATCTACATGCAGATGGAGATGGCTTTCAAAAAGGATTTTGCTGAAATCTTGAGCATATCAAATGGAGAGAATGCATG
GATGAAGATTTTCCAACACATCCTCGTTTGGGTAGAGGAGCAAACTTATTGGATTTCTTCGCGGTTCCTTGTCTTAGGTTTTGAATTAGAGCTGTACTCCCCCAGCGAGT
ATTGCATGGTATATTGGTACCTTTCTGTTGTCTTAATCAAGCTCGTGGAGAAAATACATCTGAGAGCATTGATGAGCAATGAAACAGGTGAGTGTGCAAACTTGGCTGCC
ATGTTAGGTAAACGAAAAGGAAAGAAAAAAGGAGCCTCAAAAGATATTGGGAAAGATTTCCGAATTCCACCAGCAGTGTCATTTCTTCAGTGCCAAATATGTCTCGCTGA
AGGGCTTGTAGTGATGCTTGCTGCTTTGAGGAATGAACATATGATCGCACAGAGTCCAAGCCCCTTCAATAGCGAGTATGAGAACCTGGTTACATATAATTGCTTTAAGG
ATGCACAGAAGATTGCAAAGGAATTGAAGAGTAGCTTTTCAAATGACCGTGAAAAACTTGTTGAACTCCGAAGGATCGAGCAAGTTGCAGAGCACAACAGTGTTGCCTTG
AACCTGATCCACAAGTTAAAAGATCTTGAAGCTTTTTCTCTATTGGTCTGTCCCGTTTTCGCACGTAGAGGTTGCTTAGTCGGAGGTTTTGCAGCACAGGATGTTTTAAA
GGACAACATGGTGGATCTTACAACATTTGGATGTCAAGAAAATTTCCAGGATATTAAATTTCAGGTAGGTCTGAAGCTTAGCAATAAAATCCAGGAAATCAAAGAGTCGT
GGATGACAGATCATCATCTTTCTCAACACGTGGCAAGATCTGAAGACAGGATGCAAGAACCAAATGAGAACGCACTAACAACCGATCGCCCTGTCTTCTCTCTTTCTCGA
TTCAGTTTGAGGTTTATTAACAGCGAGGGAGAGAGAATGTCAGGTGGGGGTACACAGAACAGCTTTAGAAAAGCACTGGGAGCCCTGAAGGATACTACCACAGTTTCATT
AGCTAAAGTTAACAGTGATTATAAGGAGTTAGACATTGCTATAGTTAAGTCAACAAATCATGTTGAACGTCCTGCAAAGGAAAAACATATCCGAGCTATATTTGCAGCTA
TTTCAGCTACCAGGCCTAGAGCTGATGTTGCATATTGCATCCATGCTTTGGCAAGAAGATTATCTAAGACACATAATTGGGCAGTTGCACTAAAAACTTTGGTTGTTATC
CATCGTGCTCTGCGGGAAGTGGACCCCACATTTCATGAAGAACTGATTAACTATGGCAGGAGAAGAAACCACATGCTTAATTTATCTCATTTTAAAGATGATTCCAGTGC
TAATGCATGGGATTATTCTGCTTGGGTACGTTCATATGCCTTATTTTTGGAGGAGAGGTTGGAATGTTTCCGTGTGCTGAAGTATGATGTTGAGACAGATCGTGCGAGAA
CCAAAGATCTAGACACGGCTGAGTTGCTTGAGCAGTTGCCAGCATTACAAGAGCTTCTGTATCGGGTACTTGGTTGTCAGCCACAAGGAGCCGCAGTTCATAACTTTGTA
ATTCAGCTGGCCCTTTCATTGGTTGCTTCTGAGAGCGTCAAAATTTATCAGGCCATCAGTGACGGAACAATCAATTTAGTTGACAAGTTTTTTGAGATGCGACGCCAAGA
TGCATTGAAAGCCCTGGATATTTATAGGAGGGCTGGCCAGCAGGCGGAGAGGCTTTCTGAATTCTATGAAGTTTGTAAAAGTCTTGATATTGGGCGTGGCGAGAAGTTTA
TAAAGATTGAACAGCCCCCTGCATCATTTATACAAGCCATGGAAGAATATGTAAGAGAAGCTCCACAGGTTTCAACAGTTCGTAAGGAACAGGTTGCTGATAATAAACTG
GCTGCCCCTAAAGAAGTTTTGGCTATCGAGTACAAGAAGGAACCAGCAGCGCAAGTGGAACAAACGGTGGCACCTCCACCAGCCCCGTCTCCCCCACCACCTGAACCAGT
TAAAGTAGAAGCAGTTGTGACCGAACAACCTGACTTGTTGGGTTTGAACGATCCTGTACCTGAGGCTACCTCCAATTTAGATGAGAAGAATTCTCTGGCGTTGGCTATTG
TCCCAGTTGCCGATCAACAAACCAGTTCTGCTCCAAGCCAAGTTAATGGTACTACAACTACGGGCTGGGAATTGGCACTCGTTACAGCACCAAGCTCAAACGAAAGTGTA
GCTGCTACAAGCAAATTGGCCGGAGGGTTGGACTTGCTTACATTAGACAGCTTATATGATGATGCAATCAGAAGAAACAATCAGAACGTAAGTTACAATCCATGGGAGCC
AGTCCCAGTGCATGGTGCCATGATGCAACAGCCAATCCATGATCCCTTTTTCGCCTCGACTGCGGTGGCTGCACCTCATTCAGTACAAATGGCAGCTATGGCCAACCAGC
AGCAAGCTTACATGTTGCAGCAGCAGCAGCAACAGCAGATGATGATGATGGCCCCCCCACCGCAACAGTCAAATCCTTTTGGAAATCCTCATGGAACCAACGCCTACCAC
TACGGTCCGGGTATGCCTGTTCATGCTTCCAATCCATATGGTCTCATTTAA
mRNA sequenceShow/hide mRNA sequence
ACAAAATTCCCAAATATAATTTTTGTATGCCCTTTTTCTCTACGTTAAACCATTGATCAGCTTACATACAAGATAAAAATTCAGTCAAGCCCTAAGCCTAATTCAGTTTA
CTCCGTCGACACAAACTGTTAAACAGCTGCGAAAGTTGGTCATAGAAGAAACCAAATTAATTGAAAAACCGCGACCTGAAGAACCGACACCGTCGTCCCTACTCTATCTT
CCTCTTCCTTTTCTCCACGACGACTTGTCGAAGACCCACGCATCAAAGCTTCTGAACTCCAGCTGCCCCTCGATGGCCGATACAGAACCAGAAGAAGGAACTATGGCCAC
TCATGCACATCAAACTGCCTCTCCGATTCCTTCCGGCGAGCATACTGTTTGGGCCGATGTGTCGCCTCTTCTCGAAGCTGCCTGCGGAGATCTTCAAGATGGAGAACTTA
TTCATGGGGATAATTTCAATCTTTTTTCTGCTATGTCTGCTTTAGAGATAATGGATCCAAAAATGGATTCGGGCATGATATGTAAGTACTATTCTGTTGACGAAGCTATT
GAGAATGGTGCCGCTCCAATTCCTCTGAGCTTTGATAGAACAGTTGATGTTCAATGCACTATTGATATCATGGACCATCTTCTATCTTGTGAGGCAACATGGCACAAGGG
TCACTCATTGGCACAGACTGTCTTCTCGTGCATTTATCTTCTGAGGCCGGATAGAACAAGTTCACATGCATTATTGCATTCCTATTGCAGTGTCATACGTGCAACTTGCA
AGGCAGTTATTGCAGTTGTCTCAGATGCACGAACACATGAAGAAGAGGATCTTTTCATAATGGCATATGGTCTTCCTTTGAGTGGGGATGGAGACGACAAATGCTTATCA
ATGCTACATGCTGTTGAAGAAACAATTTGCCGCCAATTGCGGGCTTATGTAGAGCCATTACAAAGTACTTTCGATCTGGAAGAGCACTACTGTAGGGTTCTTTTATGTCG
CCTGCGCTTCCGTAAGCATTTCTACCATGTCCTTGCGTGTATGAGGCGGCCTCAGGGCAGAGGTTTAGACTTGGCAAGAAAACATATAGCTTCCTGTTTGTTGGAGCTAG
ACTTGATCCGCAATTCATCCACTTTTCTTAGCAATAATAGTTTTGGAATTAGCAAAGAGGATTTGGAGGATATGACAACTGCTTCAGGTCGTCAACCACTAGGATTTGAT
TCTAGTTTGAATTGTAGATTGTCTGCCCCTACTCCACCACGTGCAATCAAACTACTAAGTTGGAAAAAGGCATTAGATTATTTTGTGAAACTGCTTCATGACCTAGATAA
TATATGTTCGTATTCACTGGACCCATTTCTGGAAGGTGTTTTCCGCTTTGTGGTTCAGTTCCAGAAAACTCAGCCTGATTTGGTTGCTAGATCTCTCCTTCAGTTCCTTC
TGGTTCAAGATGGAAAGCTCTATGGGAAGGATCCTCTATATGCAGTGATTACCAAAGCTGCAGGATTGCCAGAATCTGCCAAAAACCATGAAAATCAGAAGAACCAATAC
ATTGTACAACTAGGACAGTTAGTGATAAATCTTCTCAAGGTACTCTGCACAAATTCTGCATGGCAAAGACGCAAGCTTGGTAAGATATTACAGGACTGGCGTGTGATCTA
CATGCAGATGGAGATGGCTTTCAAAAAGGATTTTGCTGAAATCTTGAGCATATCAAATGGAGAGAATGCATGGATGAAGATTTTCCAACACATCCTCGTTTGGGTAGAGG
AGCAAACTTATTGGATTTCTTCGCGGTTCCTTGTCTTAGGTTTTGAATTAGAGCTGTACTCCCCCAGCGAGTATTGCATGGTATATTGGTACCTTTCTGTTGTCTTAATC
AAGCTCGTGGAGAAAATACATCTGAGAGCATTGATGAGCAATGAAACAGGTGAGTGTGCAAACTTGGCTGCCATGTTAGGTAAACGAAAAGGAAAGAAAAAAGGAGCCTC
AAAAGATATTGGGAAAGATTTCCGAATTCCACCAGCAGTGTCATTTCTTCAGTGCCAAATATGTCTCGCTGAAGGGCTTGTAGTGATGCTTGCTGCTTTGAGGAATGAAC
ATATGATCGCACAGAGTCCAAGCCCCTTCAATAGCGAGTATGAGAACCTGGTTACATATAATTGCTTTAAGGATGCACAGAAGATTGCAAAGGAATTGAAGAGTAGCTTT
TCAAATGACCGTGAAAAACTTGTTGAACTCCGAAGGATCGAGCAAGTTGCAGAGCACAACAGTGTTGCCTTGAACCTGATCCACAAGTTAAAAGATCTTGAAGCTTTTTC
TCTATTGGTCTGTCCCGTTTTCGCACGTAGAGGTTGCTTAGTCGGAGGTTTTGCAGCACAGGATGTTTTAAAGGACAACATGGTGGATCTTACAACATTTGGATGTCAAG
AAAATTTCCAGGATATTAAATTTCAGGTAGGTCTGAAGCTTAGCAATAAAATCCAGGAAATCAAAGAGTCGTGGATGACAGATCATCATCTTTCTCAACACGTGGCAAGA
TCTGAAGACAGGATGCAAGAACCAAATGAGAACGCACTAACAACCGATCGCCCTGTCTTCTCTCTTTCTCGATTCAGTTTGAGGTTTATTAACAGCGAGGGAGAGAGAAT
GTCAGGTGGGGGTACACAGAACAGCTTTAGAAAAGCACTGGGAGCCCTGAAGGATACTACCACAGTTTCATTAGCTAAAGTTAACAGTGATTATAAGGAGTTAGACATTG
CTATAGTTAAGTCAACAAATCATGTTGAACGTCCTGCAAAGGAAAAACATATCCGAGCTATATTTGCAGCTATTTCAGCTACCAGGCCTAGAGCTGATGTTGCATATTGC
ATCCATGCTTTGGCAAGAAGATTATCTAAGACACATAATTGGGCAGTTGCACTAAAAACTTTGGTTGTTATCCATCGTGCTCTGCGGGAAGTGGACCCCACATTTCATGA
AGAACTGATTAACTATGGCAGGAGAAGAAACCACATGCTTAATTTATCTCATTTTAAAGATGATTCCAGTGCTAATGCATGGGATTATTCTGCTTGGGTACGTTCATATG
CCTTATTTTTGGAGGAGAGGTTGGAATGTTTCCGTGTGCTGAAGTATGATGTTGAGACAGATCGTGCGAGAACCAAAGATCTAGACACGGCTGAGTTGCTTGAGCAGTTG
CCAGCATTACAAGAGCTTCTGTATCGGGTACTTGGTTGTCAGCCACAAGGAGCCGCAGTTCATAACTTTGTAATTCAGCTGGCCCTTTCATTGGTTGCTTCTGAGAGCGT
CAAAATTTATCAGGCCATCAGTGACGGAACAATCAATTTAGTTGACAAGTTTTTTGAGATGCGACGCCAAGATGCATTGAAAGCCCTGGATATTTATAGGAGGGCTGGCC
AGCAGGCGGAGAGGCTTTCTGAATTCTATGAAGTTTGTAAAAGTCTTGATATTGGGCGTGGCGAGAAGTTTATAAAGATTGAACAGCCCCCTGCATCATTTATACAAGCC
ATGGAAGAATATGTAAGAGAAGCTCCACAGGTTTCAACAGTTCGTAAGGAACAGGTTGCTGATAATAAACTGGCTGCCCCTAAAGAAGTTTTGGCTATCGAGTACAAGAA
GGAACCAGCAGCGCAAGTGGAACAAACGGTGGCACCTCCACCAGCCCCGTCTCCCCCACCACCTGAACCAGTTAAAGTAGAAGCAGTTGTGACCGAACAACCTGACTTGT
TGGGTTTGAACGATCCTGTACCTGAGGCTACCTCCAATTTAGATGAGAAGAATTCTCTGGCGTTGGCTATTGTCCCAGTTGCCGATCAACAAACCAGTTCTGCTCCAAGC
CAAGTTAATGGTACTACAACTACGGGCTGGGAATTGGCACTCGTTACAGCACCAAGCTCAAACGAAAGTGTAGCTGCTACAAGCAAATTGGCCGGAGGGTTGGACTTGCT
TACATTAGACAGCTTATATGATGATGCAATCAGAAGAAACAATCAGAACGTAAGTTACAATCCATGGGAGCCAGTCCCAGTGCATGGTGCCATGATGCAACAGCCAATCC
ATGATCCCTTTTTCGCCTCGACTGCGGTGGCTGCACCTCATTCAGTACAAATGGCAGCTATGGCCAACCAGCAGCAAGCTTACATGTTGCAGCAGCAGCAGCAACAGCAG
ATGATGATGATGGCCCCCCCACCGCAACAGTCAAATCCTTTTGGAAATCCTCATGGAACCAACGCCTACCACTACGGTCCGGGTATGCCTGTTCATGCTTCCAATCCATA
TGGTCTCATTTAACATCATTCAAAGAATTTACTCTGATAGGAATGTTAATGATCAATCATTCTTGGAGATTGGGGACTACTAAATATGGAGTATACAGATGTACAATATG
TTGTGTATGTGCAGTAGTTAAGTTTAGGTAGGATGAGATTTTTTTGATAATAATTAGTGAGACAGGCTTTTGATTGATTATTTGTGTCATGAAAGATGGTAATAATGTTT
TAGATACTTGGCAAAATTGCTGGCAACCTGCTTCACTATTGTGAAGGATTCAGCAGGAAGTTCAAGTTTTCTTTTTTCTTTTCTTTTTTCCTTTTAAACAGGCCATGTAG
GCCTCTGTAAAATCAGTATGTAGCAAATTTATTTAAAACCCACTTTGATATTTATTTTCGAGGAGATAGTCTTTGAGGTTCTCATGAAAAAATCTCAATTATGTTCTTCC
GT
Protein sequenceShow/hide protein sequence
MADTEPEEGTMATHAHQTASPIPSGEHTVWADVSPLLEAACGDLQDGELIHGDNFNLFSAMSALEIMDPKMDSGMICKYYSVDEAIENGAAPIPLSFDRTVDVQCTIDIM
DHLLSCEATWHKGHSLAQTVFSCIYLLRPDRTSSHALLHSYCSVIRATCKAVIAVVSDARTHEEEDLFIMAYGLPLSGDGDDKCLSMLHAVEETICRQLRAYVEPLQSTF
DLEEHYCRVLLCRLRFRKHFYHVLACMRRPQGRGLDLARKHIASCLLELDLIRNSSTFLSNNSFGISKEDLEDMTTASGRQPLGFDSSLNCRLSAPTPPRAIKLLSWKKA
LDYFVKLLHDLDNICSYSLDPFLEGVFRFVVQFQKTQPDLVARSLLQFLLVQDGKLYGKDPLYAVITKAAGLPESAKNHENQKNQYIVQLGQLVINLLKVLCTNSAWQRR
KLGKILQDWRVIYMQMEMAFKKDFAEILSISNGENAWMKIFQHILVWVEEQTYWISSRFLVLGFELELYSPSEYCMVYWYLSVVLIKLVEKIHLRALMSNETGECANLAA
MLGKRKGKKKGASKDIGKDFRIPPAVSFLQCQICLAEGLVVMLAALRNEHMIAQSPSPFNSEYENLVTYNCFKDAQKIAKELKSSFSNDREKLVELRRIEQVAEHNSVAL
NLIHKLKDLEAFSLLVCPVFARRGCLVGGFAAQDVLKDNMVDLTTFGCQENFQDIKFQVGLKLSNKIQEIKESWMTDHHLSQHVARSEDRMQEPNENALTTDRPVFSLSR
FSLRFINSEGERMSGGGTQNSFRKALGALKDTTTVSLAKVNSDYKELDIAIVKSTNHVERPAKEKHIRAIFAAISATRPRADVAYCIHALARRLSKTHNWAVALKTLVVI
HRALREVDPTFHEELINYGRRRNHMLNLSHFKDDSSANAWDYSAWVRSYALFLEERLECFRVLKYDVETDRARTKDLDTAELLEQLPALQELLYRVLGCQPQGAAVHNFV
IQLALSLVASESVKIYQAISDGTINLVDKFFEMRRQDALKALDIYRRAGQQAERLSEFYEVCKSLDIGRGEKFIKIEQPPASFIQAMEEYVREAPQVSTVRKEQVADNKL
AAPKEVLAIEYKKEPAAQVEQTVAPPPAPSPPPPEPVKVEAVVTEQPDLLGLNDPVPEATSNLDEKNSLALAIVPVADQQTSSAPSQVNGTTTTGWELALVTAPSSNESV
AATSKLAGGLDLLTLDSLYDDAIRRNNQNVSYNPWEPVPVHGAMMQQPIHDPFFASTAVAAPHSVQMAAMANQQQAYMLQQQQQQQMMMMAPPPQQSNPFGNPHGTNAYH
YGPGMPVHASNPYGLI