; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr028028 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr028028
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionReplication factor C subunit
Genome locationtig00153056:2710850..2726238
RNA-Seq ExpressionSgr028028
SyntenySgr028028
Gene Ontology termsGO:0006261 - DNA-dependent DNA replication (biological process)
GO:0006281 - DNA repair (biological process)
GO:0005634 - nucleus (cellular component)
GO:0005663 - DNA replication factor C complex (cellular component)
GO:0009536 - plastid (cellular component)
GO:0016887 - ATPase activity (molecular function)
GO:0005524 - ATP binding (molecular function)
GO:0005515 - protein binding (molecular function)
GO:0003689 - DNA clamp loader activity (molecular function)
GO:0003677 - DNA binding (molecular function)
InterPro domainsIPR027417 - P-loop containing nucleoside triphosphate hydrolase
IPR013748 - Replication factor C, C-terminal
IPR011990 - Tetratricopeptide-like helical domain superfamily
IPR008921 - DNA polymerase III, clamp loader complex, gamma/delta/delta subunit, C-terminal
IPR003959 - ATPase, AAA-type, core
IPR003593 - AAA+ ATPase domain
IPR002885 - Pentatricopeptide repeat


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6608033.1 Replication factor C subunit 4, partial [Cucurbita argyrosperma subsp. sororia]5.4e-17095.05Show/hide
Query:  MAPLLQSSQPWVEKYRPKQVKDVAHQDEVVRVLTNTLETASCPHMLFYGPPGTGKTTTALAIAHQLFGPELYKSRVLELNASDDRGINVVRTKIKDFAGV
        MAPLLQSSQPWVEKYRPKQVKDVAHQDEVVRVLTNTLET+SCPHMLFYGPPGTGKTTTALAIAH+LFGPELYKSRVLELNASDDRGINVVRTKIKDFAGV
Subjt:  MAPLLQSSQPWVEKYRPKQVKDVAHQDEVVRVLTNTLETASCPHMLFYGPPGTGKTTTALAIAHQLFGPELYKSRVLELNASDDRGINVVRTKIKDFAGV

Query:  AVGSGQRQGSYPCPPFKIIILDEADSMTEDAQNALRRTMETHSKVTRFFFICNYISRIIEPLASRCAKFRFKPLSEEVMGKRILYICNEEGLSLDGEALS
        AV SGQRQG YPCPPFKIIILDEADSMTEDAQNALRRTMETHSKVTRFFFICNYISRIIEPLASRCAKFRFKPLSEEVMGKRIL+I NEEGLSLD EALS
Subjt:  AVGSGQRQGSYPCPPFKIIILDEADSMTEDAQNALRRTMETHSKVTRFFFICNYISRIIEPLASRCAKFRFKPLSEEVMGKRILYICNEEGLSLDGEALS

Query:  TLSSISQGDLRRAITYLQSAARLFGSSISSKDLISVSGVIPWDTVNALFAACKSGNFDLANKEVNNVIAEGYPVAQMLSQIFELVVEAHDLQDEQKARIC
        TLSSISQGDLRRAITYLQSAARLFGSSISSKDLISVSG+IP + V+ LFAACKSGNFDLANKEVNNVIAEGYPVAQMLSQIFE+V+EAHDLQDEQKA+IC
Subjt:  TLSSISQGDLRRAITYLQSAARLFGSSISSKDLISVSGVIPWDTVNALFAACKSGNFDLANKEVNNVIAEGYPVAQMLSQIFELVVEAHDLQDEQKARIC

Query:  KKLAEADKCLVDGADEYLQLLDV
        KKLAEADKCLVDGADEYLQLLDV
Subjt:  KKLAEADKCLVDGADEYLQLLDV

XP_022134048.1 replication factor C subunit 4 [Momordica charantia]3.2e-17095.98Show/hide
Query:  MAPLLQSSQPWVEKYRPKQVKDVAHQDEVVRVLTNTLETASCPHMLFYGPPGTGKTTTALAIAHQLFGPELYKSRVLELNASDDRGINVVRTKIKDFAGV
        MAPL QSSQPWVEKYRPKQVKDVAHQDEVVRVLTNTLET+SCPHMLFYGPPGTGKTTTALAIAHQLFGPELYKSRVLELNASDDRGINVVRTKIKDFAGV
Subjt:  MAPLLQSSQPWVEKYRPKQVKDVAHQDEVVRVLTNTLETASCPHMLFYGPPGTGKTTTALAIAHQLFGPELYKSRVLELNASDDRGINVVRTKIKDFAGV

Query:  AVGSGQRQGSYPCPPFKIIILDEADSMTEDAQNALRRTMETHSKVTRFFFICNYISRIIEPLASRCAKFRFKPLSEEVMGKRILYICNEEGLSLDGEALS
        AVGSGQRQG YPCPPFKIIILDEADSMTEDAQNALRRTMETHSKVTRFFFICNYISRIIEPLASRCAKFRFKPLSEEVM KRIL+ICNEE LSLDGEALS
Subjt:  AVGSGQRQGSYPCPPFKIIILDEADSMTEDAQNALRRTMETHSKVTRFFFICNYISRIIEPLASRCAKFRFKPLSEEVMGKRILYICNEEGLSLDGEALS

Query:  TLSSISQGDLRRAITYLQSAARLFGSSISSKDLISVSGVIPWDTVNALFAACKSGNFDLANKEVNNVIAEGYPVAQMLSQIFELVVEAHDLQDEQKARIC
        TLSSISQGDLRRAITYLQSAARLFGSSISSKDLISVSGVIP + VNALFAACKSGNFDLANKEVNNV AEGYPVAQMLSQIFE+VVEA+ LQDEQKARIC
Subjt:  TLSSISQGDLRRAITYLQSAARLFGSSISSKDLISVSGVIPWDTVNALFAACKSGNFDLANKEVNNVIAEGYPVAQMLSQIFELVVEAHDLQDEQKARIC

Query:  KKLAEADKCLVDGADEYLQLLDV
        KKLAEADKCLVDGADEYLQLLDV
Subjt:  KKLAEADKCLVDGADEYLQLLDV

XP_022981026.1 replication factor C subunit 4 [Cucurbita maxima]3.5e-16994.43Show/hide
Query:  MAPLLQSSQPWVEKYRPKQVKDVAHQDEVVRVLTNTLETASCPHMLFYGPPGTGKTTTALAIAHQLFGPELYKSRVLELNASDDRGINVVRTKIKDFAGV
        MAPLLQSSQPWVEKYRPKQVKDVAHQDEVVRVLTNTLET+SCPHMLFYGPPGTGKTTTALAIAH+LFGPELYKSRVLELNASDDRGINVVRTKIKDFAGV
Subjt:  MAPLLQSSQPWVEKYRPKQVKDVAHQDEVVRVLTNTLETASCPHMLFYGPPGTGKTTTALAIAHQLFGPELYKSRVLELNASDDRGINVVRTKIKDFAGV

Query:  AVGSGQRQGSYPCPPFKIIILDEADSMTEDAQNALRRTMETHSKVTRFFFICNYISRIIEPLASRCAKFRFKPLSEEVMGKRILYICNEEGLSLDGEALS
        AV SGQRQG YPCPPFKIIILDEADSMTEDAQNALRRTMETHS VTRFFFICNYISRIIEPLASRCAKFRFKPLSEEVM KRIL+I NEEGLSLDGEALS
Subjt:  AVGSGQRQGSYPCPPFKIIILDEADSMTEDAQNALRRTMETHSKVTRFFFICNYISRIIEPLASRCAKFRFKPLSEEVMGKRILYICNEEGLSLDGEALS

Query:  TLSSISQGDLRRAITYLQSAARLFGSSISSKDLISVSGVIPWDTVNALFAACKSGNFDLANKEVNNVIAEGYPVAQMLSQIFELVVEAHDLQDEQKARIC
        TL SISQGDLRRAITYLQSAARLFGSSISSKDLISVSG+IP + V+ LFAACKSGNFDLANKEVNNVIAEGYPVAQMLSQIFE+V+EAHDLQDEQKA+IC
Subjt:  TLSSISQGDLRRAITYLQSAARLFGSSISSKDLISVSGVIPWDTVNALFAACKSGNFDLANKEVNNVIAEGYPVAQMLSQIFELVVEAHDLQDEQKARIC

Query:  KKLAEADKCLVDGADEYLQLLDV
        KKLAEADKCLVDGADEYLQLLDV
Subjt:  KKLAEADKCLVDGADEYLQLLDV

XP_023523758.1 replication factor C subunit 4 [Cucurbita pepo subsp. pepo]4.1e-17095.05Show/hide
Query:  MAPLLQSSQPWVEKYRPKQVKDVAHQDEVVRVLTNTLETASCPHMLFYGPPGTGKTTTALAIAHQLFGPELYKSRVLELNASDDRGINVVRTKIKDFAGV
        MAPLLQSSQPWVEKYRPKQVKDVAHQDEVVRVLTNTLETASCPHMLFYGPPGTGKTTTALAIAH+LFGPELYKSRVLELNASDDRGINVVRTKIKDFAGV
Subjt:  MAPLLQSSQPWVEKYRPKQVKDVAHQDEVVRVLTNTLETASCPHMLFYGPPGTGKTTTALAIAHQLFGPELYKSRVLELNASDDRGINVVRTKIKDFAGV

Query:  AVGSGQRQGSYPCPPFKIIILDEADSMTEDAQNALRRTMETHSKVTRFFFICNYISRIIEPLASRCAKFRFKPLSEEVMGKRILYICNEEGLSLDGEALS
        AV SGQRQG YPCPPFKIIILDEADSMTEDAQNALRRTMETHSKVTRFFFICNYISRIIEPLASRCAKFRFKPLSE+VM KRIL+I NEEGLSLDGEALS
Subjt:  AVGSGQRQGSYPCPPFKIIILDEADSMTEDAQNALRRTMETHSKVTRFFFICNYISRIIEPLASRCAKFRFKPLSEEVMGKRILYICNEEGLSLDGEALS

Query:  TLSSISQGDLRRAITYLQSAARLFGSSISSKDLISVSGVIPWDTVNALFAACKSGNFDLANKEVNNVIAEGYPVAQMLSQIFELVVEAHDLQDEQKARIC
        TLSSISQGDLRRAITYLQSAARLFGSSISSKDLISVSG+IP + V+ LFAACKSGNFDLANKEVNNVIAEGYPVAQMLSQIFE+V+EAHDLQDEQKA+IC
Subjt:  TLSSISQGDLRRAITYLQSAARLFGSSISSKDLISVSGVIPWDTVNALFAACKSGNFDLANKEVNNVIAEGYPVAQMLSQIFELVVEAHDLQDEQKARIC

Query:  KKLAEADKCLVDGADEYLQLLDV
        KKLAEADKCLVDGADEYLQLLDV
Subjt:  KKLAEADKCLVDGADEYLQLLDV

XP_038897028.1 replication factor C subunit 4 [Benincasa hispida]3.5e-16994.43Show/hide
Query:  MAPLLQSSQPWVEKYRPKQVKDVAHQDEVVRVLTNTLETASCPHMLFYGPPGTGKTTTALAIAHQLFGPELYKSRVLELNASDDRGINVVRTKIKDFAGV
        MAPLLQSSQPWVEKYRPKQVKDVAHQDEVVRVLTNTLETASCPHMLFYGPPGTGKTTTALAI+HQLFGPELYKSRVLELNASDDRGINVVRTKIKDFAGV
Subjt:  MAPLLQSSQPWVEKYRPKQVKDVAHQDEVVRVLTNTLETASCPHMLFYGPPGTGKTTTALAIAHQLFGPELYKSRVLELNASDDRGINVVRTKIKDFAGV

Query:  AVGSGQRQGSYPCPPFKIIILDEADSMTEDAQNALRRTMETHSKVTRFFFICNYISRIIEPLASRCAKFRFKPLSEEVMGKRILYICNEEGLSLDGEALS
        AV SGQRQG YPCPPFKIIILDEADSMTEDAQNALRRTMETHSKVTRFFFICNYISRIIEPLASRCAKFRFKPLSEEVM KRIL+I NEEGLSLDGEALS
Subjt:  AVGSGQRQGSYPCPPFKIIILDEADSMTEDAQNALRRTMETHSKVTRFFFICNYISRIIEPLASRCAKFRFKPLSEEVMGKRILYICNEEGLSLDGEALS

Query:  TLSSISQGDLRRAITYLQSAARLFGSSISSKDLISVSGVIPWDTVNALFAACKSGNFDLANKEVNNVIAEGYPVAQMLSQIFELVVEAHDLQDEQKARIC
        TLSSISQGDLRRAITYLQSAARLFGSSISSKDL+SVSG+IP + V+ALF+ACKSGNFD+ANKEVNNV+AEGYPVAQMLSQIFE+VVE +DLQDEQKARIC
Subjt:  TLSSISQGDLRRAITYLQSAARLFGSSISSKDLISVSGVIPWDTVNALFAACKSGNFDLANKEVNNVIAEGYPVAQMLSQIFELVVEAHDLQDEQKARIC

Query:  KKLAEADKCLVDGADEYLQLLDV
        KKLAEADKCLVDGADEYLQLLDV
Subjt:  KKLAEADKCLVDGADEYLQLLDV

TrEMBL top hitse value%identityAlignment
A0A0A0KYI8 AAA domain-containing protein5.5e-16893.5Show/hide
Query:  MAPLLQSSQPWVEKYRPKQVKDVAHQDEVVRVLTNTLETASCPHMLFYGPPGTGKTTTALAIAHQLFGPELYKSRVLELNASDDRGINVVRTKIKDFAGV
        MAPLLQSSQPWVEKYRPKQVKDVAHQDEVVRVLTNTLETA+CPHMLFYGPPGTGKTTTALAIAHQLFGPELYKSRVLELNASDDRGINVVRTKIKDFAGV
Subjt:  MAPLLQSSQPWVEKYRPKQVKDVAHQDEVVRVLTNTLETASCPHMLFYGPPGTGKTTTALAIAHQLFGPELYKSRVLELNASDDRGINVVRTKIKDFAGV

Query:  AVGSGQRQGSYPCPPFKIIILDEADSMTEDAQNALRRTMETHSKVTRFFFICNYISRIIEPLASRCAKFRFKPLSEEVMGKRILYICNEEGLSLDGEALS
        AV SGQRQG YPCPPFKIIILDEADSMTEDAQNALRRTMETHSKVTRFFFICNYISRIIEPLASRCAKFRFKPLSEEVM KRIL+I NEEGLSLDGEALS
Subjt:  AVGSGQRQGSYPCPPFKIIILDEADSMTEDAQNALRRTMETHSKVTRFFFICNYISRIIEPLASRCAKFRFKPLSEEVMGKRILYICNEEGLSLDGEALS

Query:  TLSSISQGDLRRAITYLQSAARLFGSSISSKDLISVSGVIPWDTVNALFAACKSGNFDLANKEVNNVIAEGYPVAQMLSQIFELVVEAHDLQDEQKARIC
        TLSSISQGDLRRAITYLQSAARLFGSSISSKDL++VSG+IP + V+ALF ACKSGNFD ANK+VNNV+AEGYPVAQMLSQIFE+V+E +DLQDEQKARIC
Subjt:  TLSSISQGDLRRAITYLQSAARLFGSSISSKDLISVSGVIPWDTVNALFAACKSGNFDLANKEVNNVIAEGYPVAQMLSQIFELVVEAHDLQDEQKARIC

Query:  KKLAEADKCLVDGADEYLQLLDV
        KKLAEADKCLVDGADEYLQLLDV
Subjt:  KKLAEADKCLVDGADEYLQLLDV

A0A1S3B9T6 LOW QUALITY PROTEIN: replication factor C subunit 42.1e-16793.19Show/hide
Query:  MAPLLQSSQPWVEKYRPKQVKDVAHQDEVVRVLTNTLETASCPHMLFYGPPGTGKTTTALAIAHQLFGPELYKSRVLELNASDDRGINVVRTKIKDFAGV
        MAPLLQSSQPWVEKYRPKQVKDVAHQDEVVRVLTNTLETASCPHMLFYGPPGTGKTTTALAI+HQLFGPELYKSRVLELNASDDRGINVVRTKIKDFAGV
Subjt:  MAPLLQSSQPWVEKYRPKQVKDVAHQDEVVRVLTNTLETASCPHMLFYGPPGTGKTTTALAIAHQLFGPELYKSRVLELNASDDRGINVVRTKIKDFAGV

Query:  AVGSGQRQGSYPCPPFKIIILDEADSMTEDAQNALRRTMETHSKVTRFFFICNYISRIIEPLASRCAKFRFKPLSEEVMGKRILYICNEEGLSLDGEALS
        AV SGQRQG YPCPPFKIIILDEADSMTEDAQNALRRTMETHSKVTRF FICNYISRIIEPLASRCAKFRFKPLSEEVM KRIL+I NEEGLSLDGEALS
Subjt:  AVGSGQRQGSYPCPPFKIIILDEADSMTEDAQNALRRTMETHSKVTRFFFICNYISRIIEPLASRCAKFRFKPLSEEVMGKRILYICNEEGLSLDGEALS

Query:  TLSSISQGDLRRAITYLQSAARLFGSSISSKDLISVSGVIPWDTVNALFAACKSGNFDLANKEVNNVIAEGYPVAQMLSQIFELVVEAHDLQDEQKARIC
        TLSS+SQGDLRRAITYLQSAARLFGSSISSKDL++VSG+IP + V+ALF ACKSGNFD ANKEVNNV+AEGYPVAQMLSQIFE+V+E +DLQDEQKARIC
Subjt:  TLSSISQGDLRRAITYLQSAARLFGSSISSKDLISVSGVIPWDTVNALFAACKSGNFDLANKEVNNVIAEGYPVAQMLSQIFELVVEAHDLQDEQKARIC

Query:  KKLAEADKCLVDGADEYLQLLDV
        KKLAEADKCLVDGADEYLQLLDV
Subjt:  KKLAEADKCLVDGADEYLQLLDV

A0A6J1C0Y1 replication factor C subunit 41.5e-17095.98Show/hide
Query:  MAPLLQSSQPWVEKYRPKQVKDVAHQDEVVRVLTNTLETASCPHMLFYGPPGTGKTTTALAIAHQLFGPELYKSRVLELNASDDRGINVVRTKIKDFAGV
        MAPL QSSQPWVEKYRPKQVKDVAHQDEVVRVLTNTLET+SCPHMLFYGPPGTGKTTTALAIAHQLFGPELYKSRVLELNASDDRGINVVRTKIKDFAGV
Subjt:  MAPLLQSSQPWVEKYRPKQVKDVAHQDEVVRVLTNTLETASCPHMLFYGPPGTGKTTTALAIAHQLFGPELYKSRVLELNASDDRGINVVRTKIKDFAGV

Query:  AVGSGQRQGSYPCPPFKIIILDEADSMTEDAQNALRRTMETHSKVTRFFFICNYISRIIEPLASRCAKFRFKPLSEEVMGKRILYICNEEGLSLDGEALS
        AVGSGQRQG YPCPPFKIIILDEADSMTEDAQNALRRTMETHSKVTRFFFICNYISRIIEPLASRCAKFRFKPLSEEVM KRIL+ICNEE LSLDGEALS
Subjt:  AVGSGQRQGSYPCPPFKIIILDEADSMTEDAQNALRRTMETHSKVTRFFFICNYISRIIEPLASRCAKFRFKPLSEEVMGKRILYICNEEGLSLDGEALS

Query:  TLSSISQGDLRRAITYLQSAARLFGSSISSKDLISVSGVIPWDTVNALFAACKSGNFDLANKEVNNVIAEGYPVAQMLSQIFELVVEAHDLQDEQKARIC
        TLSSISQGDLRRAITYLQSAARLFGSSISSKDLISVSGVIP + VNALFAACKSGNFDLANKEVNNV AEGYPVAQMLSQIFE+VVEA+ LQDEQKARIC
Subjt:  TLSSISQGDLRRAITYLQSAARLFGSSISSKDLISVSGVIPWDTVNALFAACKSGNFDLANKEVNNVIAEGYPVAQMLSQIFELVVEAHDLQDEQKARIC

Query:  KKLAEADKCLVDGADEYLQLLDV
        KKLAEADKCLVDGADEYLQLLDV
Subjt:  KKLAEADKCLVDGADEYLQLLDV

A0A6J1FIS1 replication factor C subunit 42.9e-16994.43Show/hide
Query:  MAPLLQSSQPWVEKYRPKQVKDVAHQDEVVRVLTNTLETASCPHMLFYGPPGTGKTTTALAIAHQLFGPELYKSRVLELNASDDRGINVVRTKIKDFAGV
        MAPLLQSSQPWVEKYRPKQVKDVAHQDEVVRVLTNTLET+SCPHMLFYGPPGTGKTTTALAIAH+LFGPELYKSRVLELNASDDRGINVVRTKIKDFAGV
Subjt:  MAPLLQSSQPWVEKYRPKQVKDVAHQDEVVRVLTNTLETASCPHMLFYGPPGTGKTTTALAIAHQLFGPELYKSRVLELNASDDRGINVVRTKIKDFAGV

Query:  AVGSGQRQGSYPCPPFKIIILDEADSMTEDAQNALRRTMETHSKVTRFFFICNYISRIIEPLASRCAKFRFKPLSEEVMGKRILYICNEEGLSLDGEALS
        AV SGQRQG YPCPPFKIIILDEADSMTEDAQNALRRTMETHSKVTRFFFICNYISRIIEPLASRCAKFRFKPLSEEVMGKRIL+I NEEGLSLD EALS
Subjt:  AVGSGQRQGSYPCPPFKIIILDEADSMTEDAQNALRRTMETHSKVTRFFFICNYISRIIEPLASRCAKFRFKPLSEEVMGKRILYICNEEGLSLDGEALS

Query:  TLSSISQGDLRRAITYLQSAARLFGSSISSKDLISVSGVIPWDTVNALFAACKSGNFDLANKEVNNVIAEGYPVAQMLSQIFELVVEAHDLQDEQKARIC
        TLSSISQGDLRRAITYLQSAARLFGSSISSKDLISVSG+IP + V+ LFAAC+SGNFDLANKEVNNVIAEGYPVAQMLSQIFE+V+EAHDLQDEQ A+IC
Subjt:  TLSSISQGDLRRAITYLQSAARLFGSSISSKDLISVSGVIPWDTVNALFAACKSGNFDLANKEVNNVIAEGYPVAQMLSQIFELVVEAHDLQDEQKARIC

Query:  KKLAEADKCLVDGADEYLQLLDV
        KKLAEADKCLVDGADEYLQLLDV
Subjt:  KKLAEADKCLVDGADEYLQLLDV

A0A6J1IVB8 replication factor C subunit 41.7e-16994.43Show/hide
Query:  MAPLLQSSQPWVEKYRPKQVKDVAHQDEVVRVLTNTLETASCPHMLFYGPPGTGKTTTALAIAHQLFGPELYKSRVLELNASDDRGINVVRTKIKDFAGV
        MAPLLQSSQPWVEKYRPKQVKDVAHQDEVVRVLTNTLET+SCPHMLFYGPPGTGKTTTALAIAH+LFGPELYKSRVLELNASDDRGINVVRTKIKDFAGV
Subjt:  MAPLLQSSQPWVEKYRPKQVKDVAHQDEVVRVLTNTLETASCPHMLFYGPPGTGKTTTALAIAHQLFGPELYKSRVLELNASDDRGINVVRTKIKDFAGV

Query:  AVGSGQRQGSYPCPPFKIIILDEADSMTEDAQNALRRTMETHSKVTRFFFICNYISRIIEPLASRCAKFRFKPLSEEVMGKRILYICNEEGLSLDGEALS
        AV SGQRQG YPCPPFKIIILDEADSMTEDAQNALRRTMETHS VTRFFFICNYISRIIEPLASRCAKFRFKPLSEEVM KRIL+I NEEGLSLDGEALS
Subjt:  AVGSGQRQGSYPCPPFKIIILDEADSMTEDAQNALRRTMETHSKVTRFFFICNYISRIIEPLASRCAKFRFKPLSEEVMGKRILYICNEEGLSLDGEALS

Query:  TLSSISQGDLRRAITYLQSAARLFGSSISSKDLISVSGVIPWDTVNALFAACKSGNFDLANKEVNNVIAEGYPVAQMLSQIFELVVEAHDLQDEQKARIC
        TL SISQGDLRRAITYLQSAARLFGSSISSKDLISVSG+IP + V+ LFAACKSGNFDLANKEVNNVIAEGYPVAQMLSQIFE+V+EAHDLQDEQKA+IC
Subjt:  TLSSISQGDLRRAITYLQSAARLFGSSISSKDLISVSGVIPWDTVNALFAACKSGNFDLANKEVNNVIAEGYPVAQMLSQIFELVVEAHDLQDEQKARIC

Query:  KKLAEADKCLVDGADEYLQLLDV
        KKLAEADKCLVDGADEYLQLLDV
Subjt:  KKLAEADKCLVDGADEYLQLLDV

SwissProt top hitse value%identityAlignment
P35249 Replication factor C subunit 41.5e-9858.1Show/hide
Query:  PWVEKYRPKQVKDVAHQDEVVRVLTNTLETASCPHMLFYGPPGTGKTTTALAIAHQLFGPELYKSRVLELNASDDRGINVVRTKIKDFAGVAVGSGQRQG
        PWVEKYRPK V +VA Q+EVV VL  +LE A  P++LFYGPPGTGKT+T LA A +LFGPEL++ RVLELNASD+RGI VVR K+K+FA + V SG R  
Subjt:  PWVEKYRPKQVKDVAHQDEVVRVLTNTLETASCPHMLFYGPPGTGKTTTALAIAHQLFGPELYKSRVLELNASDDRGINVVRTKIKDFAGVAVGSGQRQG

Query:  SYPCPPFKIIILDEADSMTEDAQNALRRTMETHSKVTRFFFICNYISRIIEPLASRCAKFRFKPLSEEVMGKRILYICNEEGLSLDGEALSTLSSISQGD
          PCPPFKI+ILDEADSMT  AQ ALRRTME  SK TRF  ICNY+SRIIEPL SRC+KFRFKPLS+++  +R+L I  +E + +  E ++ L  +S+GD
Subjt:  SYPCPPFKIIILDEADSMTEDAQNALRRTMETHSKVTRFFFICNYISRIIEPLASRCAKFRFKPLSEEVMGKRILYICNEEGLSLDGEALSTLSSISQGD

Query:  LRRAITYLQSAARLF-GSSISSKDLISVSGVIPWDTVNALFAACKSGNFDLANKEVNNVIAEGYPVAQMLSQIFELVVEAHDLQDEQKARICKKLAEADK
        LR+AIT+LQSA RL  G  I+ K +  ++GVIP + ++ +FAAC+SG+FD     V ++I EG+   Q+++Q+ ++VVE ++L D+QK+ I +KLAE DK
Subjt:  LRRAITYLQSAARLF-GSSISSKDLISVSGVIPWDTVNALFAACKSGNFDLANKEVNNVIAEGYPVAQMLSQIFELVVEAHDLQDEQKARICKKLAEADK

Query:  CLVDGADEYLQLLDV
        CL DGADE+LQL+ +
Subjt:  CLVDGADEYLQLLDV

Q1PFC5 Pentatricopeptide repeat-containing protein At1g774052.4e-12053.43Show/hide
Query:  NQILVAMLQNRPFDTHIDRSASTSATHQLWSSESVSNVLRSVPRFFFQSTRSIGRQKGFRHRAPLKQRKLKEEAYKFRNNVLVLGPAAHRDPFKVELGLS
        +Q++ AM+QNRPFD  +    ++S   + W+ + VS+VL S+PRFFF S RSIGRQKGFRHR+PLKQR L +E+ + R+ VLVLGP A+ DP KV +GL 
Subjt:  NQILVAMLQNRPFDTHIDRSASTSATHQLWSSESVSNVLRSVPRFFFQSTRSIGRQKGFRHRAPLKQRKLKEEAYKFRNNVLVLGPAAHRDPFKVELGLS

Query:  KALEFFHWVETHFGFLHDEMTCREMACVLARGNRLRSLWDFLKEMSKRGNG-GLVTTATITCLIKVLGEQGLVNEALTAFYRMKQFHCKPDVFAYNTVIN
        KALEFF W+ETHFGF H+E+TCR+MAC+LA+GN  + LWDFL+++S+R NG  +VTTA+ITCL+K LGE+G V EAL  FYRMK++HCKPDV+AYNT+IN
Subjt:  KALEFFHWVETHFGFLHDEMTCREMACVLARGNRLRSLWDFLKEMSKRGNG-GLVTTATITCLIKVLGEQGLVNEALTAFYRMKQFHCKPDVFAYNTVIN

Query:  ALCRVGNFKKARFLLEQMELPGFRCPPDIFTYTILISSYCKYSLQTGCRKAIRRRLWEANHLFR------------------------------------
        ALCRVGNFKKARFLL+QM+LPGFR PPD +TYTILISSYC+Y +QTGCRKAIRRR+WEAN +FR                                    
Subjt:  ALCRVGNFKKARFLLEQMELPGFRCPPDIFTYTILISSYCKYSLQTGCRKAIRRRLWEANHLFR------------------------------------

Query:  ----------------IRYYSAVNEIDLAIKMLRMMQKMNHGSATSSSYTPIIHALCEAGKVIEACDFLLELLEGGSVPREY-LSVVMHAMNKDN-SCTF
                        IRYYS  NEI+ AI+M+R M+K+ HG   SS+YTP+IHAL E  +  EA D ++E++E G VPREY   +V  A++ +  + T 
Subjt:  ----------------IRYYSAVNEIDLAIKMLRMMQKMNHGSATSSSYTPIIHALCEAGKVIEACDFLLELLEGGSVPREY-LSVVMHAMNKDN-SCTF

Query:  HSEVVKKV
          E+ K++
Subjt:  HSEVVKKV

Q93ZX1 Replication factor C subunit 41.0e-15584.88Show/hide
Query:  MAPLLQSSQPWVEKYRPKQVKDVAHQDEVVRVLTNTLETASCPHMLFYGPPGTGKTTTALAIAHQLFGPELYKSRVLELNASDDRGINVVRTKIKDFAGV
        MAP+LQSSQPWVEKYRPKQVKDVAHQ+EVVRVLTNTL+TA CPHMLFYGPPGTGKTTTALAIAHQLFGPELYKSRVLELNASDDRGINVVRTKIKDFA V
Subjt:  MAPLLQSSQPWVEKYRPKQVKDVAHQDEVVRVLTNTLETASCPHMLFYGPPGTGKTTTALAIAHQLFGPELYKSRVLELNASDDRGINVVRTKIKDFAGV

Query:  AVGSGQRQGSYPCPPFKIIILDEADSMTEDAQNALRRTMETHSKVTRFFFICNYISRIIEPLASRCAKFRFKPLSEEVMGKRILYICNEEGLSLDGEALS
        AVGS  RQ  YPCP FKIIILDEADSMTEDAQNALRRTMET+SKVTRFFFICNYISRIIEPLASRCAKFRFKPLSEEVM  RIL+ICNEEGLSLDGEALS
Subjt:  AVGSGQRQGSYPCPPFKIIILDEADSMTEDAQNALRRTMETHSKVTRFFFICNYISRIIEPLASRCAKFRFKPLSEEVMGKRILYICNEEGLSLDGEALS

Query:  TLSSISQGDLRRAITYLQSAARLFGSSISSKDLISVSGVIPWDTVNALFAACKSGNFDLANKEVNNVIAEGYPVAQMLSQIFELVVEA-HDLQDEQKARI
        TLSSISQGDLRRAITYLQSA RLFGS+I+S DL++VSGV+P + VN LF ACKSG+FD+ANKEV+N++AEGYP +Q+++Q+F++V EA  D+ D QKA+I
Subjt:  TLSSISQGDLRRAITYLQSAARLFGSSISSKDLISVSGVIPWDTVNALFAACKSGNFDLANKEVNNVIAEGYPVAQMLSQIFELVVEA-HDLQDEQKARI

Query:  CKKLAEADKCLVDGADEYLQLLDV
        CK LAE DK LVDGADEYLQLLDV
Subjt:  CKKLAEADKCLVDGADEYLQLLDV

Q99J62 Replication factor C subunit 41.0e-9958.41Show/hide
Query:  PWVEKYRPKQVKDVAHQDEVVRVLTNTLETASCPHMLFYGPPGTGKTTTALAIAHQLFGPELYKSRVLELNASDDRGINVVRTKIKDFAGVAVGSGQRQG
        PWVEKYRPK V +VA QDEVV VL  +LE A  P++LFYGPPGTGKT+T LA A +LFGPEL++ RVLELNASD+RGI VVR K+K+FA + V SG R  
Subjt:  PWVEKYRPKQVKDVAHQDEVVRVLTNTLETASCPHMLFYGPPGTGKTTTALAIAHQLFGPELYKSRVLELNASDDRGINVVRTKIKDFAGVAVGSGQRQG

Query:  SYPCPPFKIIILDEADSMTEDAQNALRRTMETHSKVTRFFFICNYISRIIEPLASRCAKFRFKPLSEEVMGKRILYICNEEGLSLDGEALSTLSSISQGD
          PCPPFKI+ILDEADSMT  AQ ALRRTME  SK TRF  ICNY+SRIIEPL SRC+KFRFKPLS+++  +R+L I  +E + +  E ++ L  IS+GD
Subjt:  SYPCPPFKIIILDEADSMTEDAQNALRRTMETHSKVTRFFFICNYISRIIEPLASRCAKFRFKPLSEEVMGKRILYICNEEGLSLDGEALSTLSSISQGD

Query:  LRRAITYLQSAARLFGSSISSKDLIS-VSGVIPWDTVNALFAACKSGNFDLANKEVNNVIAEGYPVAQMLSQIFELVVEAHDLQDEQKARICKKLAEADK
        LR+AIT+LQSA RL G    S+D+I+ ++GVIP  T++ +F AC SG+FD     V N+I EG+   Q+++Q+ + ++E  +L D+ K+ I +KLAE DK
Subjt:  LRRAITYLQSAARLFGSSISSKDLIS-VSGVIPWDTVNALFAACKSGNFDLANKEVNNVIAEGYPVAQMLSQIFELVVEAHDLQDEQKARICKKLAEADK

Query:  CLVDGADEYLQLLDV
        CL DGADE+LQL+ +
Subjt:  CLVDGADEYLQLLDV

Q9FXM3 Replication factor C subunit 23.3e-15484.52Show/hide
Query:  MAPLLQSSQPWVEKYRPKQVKDVAHQDEVVRVLTNTLETASCPHMLFYGPPGTGKTTTALAIAHQLFGPELYKSRVLELNASDDRGINVVRTKIKDFAGV
        MAPL+ SSQPWVEKYRP+QVKDVAHQ+EVVRVLT TL+TA  PHMLFYGPPGTGKTTTALAIA+QL+GPELYKSRVLELNASDDRGINVVRTKIKDFA V
Subjt:  MAPLLQSSQPWVEKYRPKQVKDVAHQDEVVRVLTNTLETASCPHMLFYGPPGTGKTTTALAIAHQLFGPELYKSRVLELNASDDRGINVVRTKIKDFAGV

Query:  AVGSGQRQGSYPCPPFKIIILDEADSMTEDAQNALRRTMETHSKVTRFFFICNYISRIIEPLASRCAKFRFKPLSEEVMGKRILYICNEEGLSLDGEALS
        AVGS  R+G YPCPP+KIIILDEADSMTEDAQNALRRTMET+SKVTRFFFICNYISRIIEPLASRCAKFRFKPLSEEVM  RIL+ICNEEGLSLD +AL+
Subjt:  AVGSGQRQGSYPCPPFKIIILDEADSMTEDAQNALRRTMETHSKVTRFFFICNYISRIIEPLASRCAKFRFKPLSEEVMGKRILYICNEEGLSLDGEALS

Query:  TLSSISQGDLRRAITYLQSAARLFGSSISSKDLISVSGVIPWDTVNALFAACKSGNFDLANKEVNNVIAEGYPVAQMLSQIFELVVEAHDLQDEQKARIC
        TLS+IS GDLRRAITYLQSAARLFGSSISS DLISVSG IP D V +L A+CKSG FD+ANKEVNN+IA+GYPV+Q++SQ  +++V A D+ DEQKARIC
Subjt:  TLSSISQGDLRRAITYLQSAARLFGSSISSKDLISVSGVIPWDTVNALFAACKSGNFDLANKEVNNVIAEGYPVAQMLSQIFELVVEAHDLQDEQKARIC

Query:  KKLAEADKCLVDGADEYLQLLDV
        KKL EADKCLVDGADEYLQLLDV
Subjt:  KKLAEADKCLVDGADEYLQLLDV

Arabidopsis top hitse value%identityAlignment
AT1G21690.1 ATPase family associated with various cellular activities (AAA)7.3e-15784.88Show/hide
Query:  MAPLLQSSQPWVEKYRPKQVKDVAHQDEVVRVLTNTLETASCPHMLFYGPPGTGKTTTALAIAHQLFGPELYKSRVLELNASDDRGINVVRTKIKDFAGV
        MAP+LQSSQPWVEKYRPKQVKDVAHQ+EVVRVLTNTL+TA CPHMLFYGPPGTGKTTTALAIAHQLFGPELYKSRVLELNASDDRGINVVRTKIKDFA V
Subjt:  MAPLLQSSQPWVEKYRPKQVKDVAHQDEVVRVLTNTLETASCPHMLFYGPPGTGKTTTALAIAHQLFGPELYKSRVLELNASDDRGINVVRTKIKDFAGV

Query:  AVGSGQRQGSYPCPPFKIIILDEADSMTEDAQNALRRTMETHSKVTRFFFICNYISRIIEPLASRCAKFRFKPLSEEVMGKRILYICNEEGLSLDGEALS
        AVGS  RQ  YPCP FKIIILDEADSMTEDAQNALRRTMET+SKVTRFFFICNYISRIIEPLASRCAKFRFKPLSEEVM  RIL+ICNEEGLSLDGEALS
Subjt:  AVGSGQRQGSYPCPPFKIIILDEADSMTEDAQNALRRTMETHSKVTRFFFICNYISRIIEPLASRCAKFRFKPLSEEVMGKRILYICNEEGLSLDGEALS

Query:  TLSSISQGDLRRAITYLQSAARLFGSSISSKDLISVSGVIPWDTVNALFAACKSGNFDLANKEVNNVIAEGYPVAQMLSQIFELVVEA-HDLQDEQKARI
        TLSSISQGDLRRAITYLQSA RLFGS+I+S DL++VSGV+P + VN LF ACKSG+FD+ANKEV+N++AEGYP +Q+++Q+F++V EA  D+ D QKA+I
Subjt:  TLSSISQGDLRRAITYLQSAARLFGSSISSKDLISVSGVIPWDTVNALFAACKSGNFDLANKEVNNVIAEGYPVAQMLSQIFELVVEA-HDLQDEQKARI

Query:  CKKLAEADKCLVDGADEYLQLLDV
        CK LAE DK LVDGADEYLQLLDV
Subjt:  CKKLAEADKCLVDGADEYLQLLDV

AT1G21690.2 ATPase family associated with various cellular activities (AAA)1.6e-14881.79Show/hide
Query:  MAPLLQSSQPWVEKYRPKQVKDVAHQDEVVRVLTNTLETASCPHMLFYGPPGTGKTTTALAIAHQLFGPELYKSRVLELNASDDRGINVVRTKIKDFAGV
        MAP+LQSSQPWVEKYRPKQVKDVAHQ+EV            CPHMLFYGPPGTGKTTTALAIAHQLFGPELYKSRVLELNASDDRGINVVRTKIKDFA V
Subjt:  MAPLLQSSQPWVEKYRPKQVKDVAHQDEVVRVLTNTLETASCPHMLFYGPPGTGKTTTALAIAHQLFGPELYKSRVLELNASDDRGINVVRTKIKDFAGV

Query:  AVGSGQRQGSYPCPPFKIIILDEADSMTEDAQNALRRTMETHSKVTRFFFICNYISRIIEPLASRCAKFRFKPLSEEVMGKRILYICNEEGLSLDGEALS
        AVGS  RQ  YPCP FKIIILDEADSMTEDAQNALRRTMET+SKVTRFFFICNYISRIIEPLASRCAKFRFKPLSEEVM  RIL+ICNEEGLSLDGEALS
Subjt:  AVGSGQRQGSYPCPPFKIIILDEADSMTEDAQNALRRTMETHSKVTRFFFICNYISRIIEPLASRCAKFRFKPLSEEVMGKRILYICNEEGLSLDGEALS

Query:  TLSSISQGDLRRAITYLQSAARLFGSSISSKDLISVSGVIPWDTVNALFAACKSGNFDLANKEVNNVIAEGYPVAQMLSQIFELVVEA-HDLQDEQKARI
        TLSSISQGDLRRAITYLQSA RLFGS+I+S DL++VSGV+P + VN LF ACKSG+FD+ANKEV+N++AEGYP +Q+++Q+F++V EA  D+ D QKA+I
Subjt:  TLSSISQGDLRRAITYLQSAARLFGSSISSKDLISVSGVIPWDTVNALFAACKSGNFDLANKEVNNVIAEGYPVAQMLSQIFELVVEA-HDLQDEQKARI

Query:  CKKLAEADKCLVDGADEYLQLLDV
        CK LAE DK LVDGADEYLQLLDV
Subjt:  CKKLAEADKCLVDGADEYLQLLDV

AT1G21690.3 ATPase family associated with various cellular activities (AAA)3.3e-14984.47Show/hide
Query:  MAPLLQSSQPWVEKYRPKQVKDVAHQDEVVRVLTNTLETASCPHMLFYGPPGTGKTTTALAIAHQLFGPELYKSRVLELNASDDRGINVVRTKIKDFAGV
        MAP+LQSSQPWVEKYRPKQVKDVAHQ+EVVRVLTNTL+TA CPHMLFYGPPGTGKTTTALAIAHQLFGPELYKSRVLELNASDDRGINVVRTKIKDFA V
Subjt:  MAPLLQSSQPWVEKYRPKQVKDVAHQDEVVRVLTNTLETASCPHMLFYGPPGTGKTTTALAIAHQLFGPELYKSRVLELNASDDRGINVVRTKIKDFAGV

Query:  AVGSGQRQGSYPCPPFKIIILDEADSMTEDAQNALRRTMETHSKVTRFFFICNYISRIIEPLASRCAKFRFKPLSEEVMGKRILYICNEEGLSLDGEALS
        AVGS  RQ  YPCP FKIIILDEADSMTEDAQNALRRTMET+SKVTRFFFICNYISRIIEPLASRCAKFRFKPLSEEVM  RIL+ICNEEGLSLDGEALS
Subjt:  AVGSGQRQGSYPCPPFKIIILDEADSMTEDAQNALRRTMETHSKVTRFFFICNYISRIIEPLASRCAKFRFKPLSEEVMGKRILYICNEEGLSLDGEALS

Query:  TLSSISQGDLRRAITYLQSAARLFGSSISSKDLISVSGVIPWDTVNALFAACKSGNFDLANKEVNNVIAEGYPVAQMLSQIFELVVEA-HDLQDEQKARI
        TLSSISQGDLRRAITYLQSA RLFGS+I+S DL++VSGV+P + VN LF ACKSG+FD+ANKEV+N++AEGYP +Q+++Q+F++V EA  D+ D QKA+I
Subjt:  TLSSISQGDLRRAITYLQSAARLFGSSISSKDLISVSGVIPWDTVNALFAACKSGNFDLANKEVNNVIAEGYPVAQMLSQIFELVVEA-HDLQDEQKARI

Query:  CKKLAEADK
        CK LAE DK
Subjt:  CKKLAEADK

AT1G21690.4 ATPase family associated with various cellular activities (AAA)2.3e-15082.72Show/hide
Query:  MAPLLQSSQPWVEKYRPKQVKDVAHQDEVVRVLTNTLETASCPHMLFYGPPGTGKTTTALAIAHQLFGPELYKSRVLELNASDDRGINVVRTKIKDFAGV
        MAP+LQSSQPWVEKYRPKQVKDVAHQ+EVVRVLTNTL+TA CPHMLFYGPPGTGKTTTALAIAHQLFG       VLELNASDDRGINVVRTKIKDFA V
Subjt:  MAPLLQSSQPWVEKYRPKQVKDVAHQDEVVRVLTNTLETASCPHMLFYGPPGTGKTTTALAIAHQLFGPELYKSRVLELNASDDRGINVVRTKIKDFAGV

Query:  AVGSGQRQGSYPCPPFKIIILDEADSMTEDAQNALRRTMETHSKVTRFFFICNYISRIIEPLASRCAKFRFKPLSEEVMGKRILYICNEEGLSLDGEALS
        AVGS  RQ  YPCP FKIIILDEADSMTEDAQNALRRTMET+SKVTRFFFICNYISRIIEPLASRCAKFRFKPLSEEVM  RIL+ICNEEGLSLDGEALS
Subjt:  AVGSGQRQGSYPCPPFKIIILDEADSMTEDAQNALRRTMETHSKVTRFFFICNYISRIIEPLASRCAKFRFKPLSEEVMGKRILYICNEEGLSLDGEALS

Query:  TLSSISQGDLRRAITYLQSAARLFGSSISSKDLISVSGVIPWDTVNALFAACKSGNFDLANKEVNNVIAEGYPVAQMLSQIFELVVEA-HDLQDEQKARI
        TLSSISQGDLRRAITYLQSA RLFGS+I+S DL++VSGV+P + VN LF ACKSG+FD+ANKEV+N++AEGYP +Q+++Q+F++V EA  D+ D QKA+I
Subjt:  TLSSISQGDLRRAITYLQSAARLFGSSISSKDLISVSGVIPWDTVNALFAACKSGNFDLANKEVNNVIAEGYPVAQMLSQIFELVVEA-HDLQDEQKARI

Query:  CKKLAEADKCLVDGADEYLQLLDV
        CK LAE DK LVDGADEYLQLLDV
Subjt:  CKKLAEADKCLVDGADEYLQLLDV

AT1G77405.1 Pentatricopeptide repeat (PPR) superfamily protein1.7e-12153.43Show/hide
Query:  NQILVAMLQNRPFDTHIDRSASTSATHQLWSSESVSNVLRSVPRFFFQSTRSIGRQKGFRHRAPLKQRKLKEEAYKFRNNVLVLGPAAHRDPFKVELGLS
        +Q++ AM+QNRPFD  +    ++S   + W+ + VS+VL S+PRFFF S RSIGRQKGFRHR+PLKQR L +E+ + R+ VLVLGP A+ DP KV +GL 
Subjt:  NQILVAMLQNRPFDTHIDRSASTSATHQLWSSESVSNVLRSVPRFFFQSTRSIGRQKGFRHRAPLKQRKLKEEAYKFRNNVLVLGPAAHRDPFKVELGLS

Query:  KALEFFHWVETHFGFLHDEMTCREMACVLARGNRLRSLWDFLKEMSKRGNG-GLVTTATITCLIKVLGEQGLVNEALTAFYRMKQFHCKPDVFAYNTVIN
        KALEFF W+ETHFGF H+E+TCR+MAC+LA+GN  + LWDFL+++S+R NG  +VTTA+ITCL+K LGE+G V EAL  FYRMK++HCKPDV+AYNT+IN
Subjt:  KALEFFHWVETHFGFLHDEMTCREMACVLARGNRLRSLWDFLKEMSKRGNG-GLVTTATITCLIKVLGEQGLVNEALTAFYRMKQFHCKPDVFAYNTVIN

Query:  ALCRVGNFKKARFLLEQMELPGFRCPPDIFTYTILISSYCKYSLQTGCRKAIRRRLWEANHLFR------------------------------------
        ALCRVGNFKKARFLL+QM+LPGFR PPD +TYTILISSYC+Y +QTGCRKAIRRR+WEAN +FR                                    
Subjt:  ALCRVGNFKKARFLLEQMELPGFRCPPDIFTYTILISSYCKYSLQTGCRKAIRRRLWEANHLFR------------------------------------

Query:  ----------------IRYYSAVNEIDLAIKMLRMMQKMNHGSATSSSYTPIIHALCEAGKVIEACDFLLELLEGGSVPREY-LSVVMHAMNKDN-SCTF
                        IRYYS  NEI+ AI+M+R M+K+ HG   SS+YTP+IHAL E  +  EA D ++E++E G VPREY   +V  A++ +  + T 
Subjt:  ----------------IRYYSAVNEIDLAIKMLRMMQKMNHGSATSSSYTPIIHALCEAGKVIEACDFLLELLEGGSVPREY-LSVVMHAMNKDN-SCTF

Query:  HSEVVKKV
          E+ K++
Subjt:  HSEVVKKV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGCCATTGTTGCAGAGCTCTCAACCATGGGTGGAGAAATATCGACCCAAGCAAGTGAAAGATGTGGCGCATCAGGATGAAGTGGTCCGAGTCCTCACAAACACCCT
CGAGACTGCTAGTTGTCCTCACATGCTCTTTTATGGACCGCCTGGTACCGGAAAAACCACCACGGCTCTCGCGATTGCCCATCAACTGTTCGGTCCTGAACTTTACAAGT
CAAGAGTATTGGAGTTGAATGCAAGTGATGACCGTGGTATCAATGTTGTTAGGACTAAGATCAAAGATTTTGCTGGTGTCGCAGTAGGCAGTGGTCAGCGTCAAGGGAGT
TATCCTTGTCCACCATTCAAGATAATCATTCTCGACGAGGCTGATTCCATGACTGAAGATGCTCAGAATGCCCTGAGGCGCACCATGGAAACACACTCCAAAGTAACAAG
ATTCTTTTTTATATGCAACTATATCAGCAGGATTATCGAGCCCCTTGCTTCCAGGTGTGCAAAGTTCAGGTTTAAGCCACTTTCTGAGGAGGTTATGGGCAAACGTATAT
TGTACATTTGTAATGAAGAAGGTCTAAGTCTAGATGGAGAGGCTCTTTCAACCTTAAGTTCAATCTCTCAAGGTGACCTACGTCGAGCTATCACATACTTACAGTCAGCA
GCACGCTTATTTGGATCATCAATCTCTTCCAAGGATCTGATTAGTGTGTCTGGGGTTATCCCTTGGGATACTGTTAATGCGCTTTTTGCTGCTTGTAAAAGTGGTAACTT
TGATCTTGCAAACAAGGAAGTCAATAATGTGATTGCAGAGGGATATCCAGTGGCTCAGATGCTTTCACAGATATTTGAGTTGGTTGTTGAAGCCCATGATTTGCAAGATG
AACAGAAGGCCAGGATATGCAAGAAGTTGGCTGAAGCAGATAAGTGTCTGGTTGATGGTGCGGATGAATATTTGCAACTGCTGGATGTGGAGAAGTTCTACTCGATGGGT
GACACAATAAACGTTCAAGTTGACAGCAGGCTTAACTTACGTCCTGGCTTGGCAATATCTGTGAAGACAATGAAGAAGGGAGAGAATGTTGCTGATTGTCCTGAAAAAGA
CCCAGAGGGGGAGGAGAAGAATAATGATCAGCCAAATGGGGCACAATGGTTCAAGGCTGCGGACAGAGAAACTGTTTTTCCTGTATTTCATGATTTTGATCTACCATTTC
TCAATTTATCTATTCGACTCTTGGTGCAGACTTCTTGTCAAGAGATTGTGCAACTGAAAGGATTATGTGAAGTTTTTGTCAGAGTTTCTTTTTGCAACATTGATTTCATG
CAGCTGGCAAATGGTCAAAATTGGGGTAAAGTCAGGGCAAATTCTTTTACTGTTTTTTCCCCACCCATTGTAGCGCGGAAACCCATCAGACCCTGCGGATTGCGTGTCGA
GTTCGAAATGCGCTCCGTCTCGGAAAGAGAGAAAGAGAGAGGTGGATTAGTGCCGATGGAGGAAAGAAATGGCGCCATTGTTGTAGAGAGAGCTCTCGACCATGGCTGGA
GAAATAGTACGTCTCGTATTTTTGTCAAAGATGAAAGGATATCAACTTCCGACTTCCGAGTTCCGACGATCATATTCGTGCTTGTCCTTCTGCTTTGTCACAACCAAATC
TTGGTGGCGATGCTTCAAAATCGACCATTCGACACCCACATCGATCGTTCAGCCTCTACTTCCGCTACCCACCAATTATGGAGTTCGGAGTCTGTTTCCAACGTTTTAAG
ATCGGTACCTCGTTTCTTTTTTCAGTCAACCCGATCCATTGGCCGCCAAAAGGGTTTCCGGCATCGTGCTCCTCTTAAGCAGAGGAAACTGAAAGAGGAGGCATACAAGT
TTCGGAACAATGTGCTTGTTCTTGGCCCTGCTGCTCACAGAGATCCTTTCAAGGTCGAGCTTGGATTGAGTAAAGCGTTGGAGTTCTTTCATTGGGTTGAAACCCATTTT
GGATTTCTACACGACGAGATGACATGCAGAGAGATGGCTTGTGTATTGGCTCGAGGGAATAGATTGAGGAGTCTTTGGGATTTTCTGAAAGAAATGTCCAAGAGAGGGAA
TGGTGGGCTTGTTACTACTGCCACCATCACATGTTTGATAAAAGTTCTAGGAGAGCAGGGATTAGTAAATGAAGCGCTAACTGCATTCTACAGGATGAAGCAATTTCACT
GTAAGCCTGATGTTTTTGCTTATAATACTGTTATTAATGCTCTATGTAGGGTTGGGAATTTCAAGAAGGCCAGGTTTTTGTTGGAGCAGATGGAGTTGCCCGGATTTAGA
TGCCCACCAGACATTTTCACATATACAATTCTGATCAGTTCTTACTGTAAATACAGCTTGCAAACTGGGTGCAGAAAGGCCATCCGAAGGAGGTTGTGGGAAGCTAACCA
CTTGTTTAGAATAAGATACTATAGTGCTGTGAATGAGATTGACCTAGCAATCAAGATGCTAAGAATGATGCAGAAGATGAACCATGGAAGTGCCACATCAAGTTCTTATA
CTCCTATTATTCATGCACTATGTGAAGCAGGAAAGGTAATTGAGGCCTGTGATTTCCTCCTCGAGTTGCTTGAGGGAGGCTCTGTGCCTAGAGAATACCTATCAGTTGTG
ATGCATGCAATGAACAAAGACAATAGCTGCACTTTTCACAGTGAGGTAGTGAAGAAGGTTTGGATGAAAACAGGGTTTTATGGTTTGACTTTCTCCTCTTCTGCCCCTAA
CAGCCAACTGTGGCAAGTGCCATCTGATTTTCACCAATGTAGAAAACAAGCAGAAAATTCATTTAAAGAGATATGGTATGTGAACTGGCAATTTGGCATGTATTCAGGTG
CTCATACATCTGCAGACTCAACCATGCCAATGCCATTACCATTGCCATGCACGATAGAGCAGCAGCCGTTTCTGTTCTCCCTCCATCCATGGCCATCATTTCACTTTTTG
TTCGGATATAACACCGGCTTCCATTCCGATTGTAATAACTTCGACGGTGACGAAGAAGTGAGCGGAATGTGGAACGAATGGTTTGGAGTGTCTGGCCTTTTCATGGCGGC
GGTTCCGTGCAACTGCAGATTCTCCGTCTTCGATATGCTGATTACTCAGATAGCTTCAGCATTGACGGAACGTGGAACGTACGGTGGAGTCATTTTTCGTGGCGGCCGTT
CGGTGCAACTGCAGATTCTCCGTCTCCGATATGCTGATTACTCAGATAGCTTCGGCATTGAAACTTCAACGGCGACGAGAAGTGAGCGGAACGTGGAACGAATGGTGGAG
TCTGATCTTTTTGTGGCGGCTGTTCCGTGCAACTGCAGATTCTCCGTCTCCGATCTGTTAATTAGACAGATAGCTTCGGCATTAACTAAGAGAACTTTAACGGGAAGAAG
TGACGCCATGATCAAAACCTTACCGATTGACTTCGTCAAGCAAGTCGACGTTTGTGAATATCGGCAGATTAGCATCAATGACCATGATCACGATCTTGTCATAGTCTGTC
AGCAGCCACGACCTGAACTCCCTGTAGCTGCCACTCGTTGTAAGCATCCGTCTCTGCTTTTCAGTTTCCGATCCTTTGGATCGTCCTTATTTTCCATCCTGTTTCCTCCA
GTCCACTTGCTGCTAACATGCACAGCATGTAGAGGGCAGGTGGCTCTTCATCGGAGGTCGGTACAAAATCTTAAATGTGCAGGTACTTATCCAGAAGTTGATATTGTGAT
ATCCTTGAATTGA
mRNA sequenceShow/hide mRNA sequence
ATGGCGCCATTGTTGCAGAGCTCTCAACCATGGGTGGAGAAATATCGACCCAAGCAAGTGAAAGATGTGGCGCATCAGGATGAAGTGGTCCGAGTCCTCACAAACACCCT
CGAGACTGCTAGTTGTCCTCACATGCTCTTTTATGGACCGCCTGGTACCGGAAAAACCACCACGGCTCTCGCGATTGCCCATCAACTGTTCGGTCCTGAACTTTACAAGT
CAAGAGTATTGGAGTTGAATGCAAGTGATGACCGTGGTATCAATGTTGTTAGGACTAAGATCAAAGATTTTGCTGGTGTCGCAGTAGGCAGTGGTCAGCGTCAAGGGAGT
TATCCTTGTCCACCATTCAAGATAATCATTCTCGACGAGGCTGATTCCATGACTGAAGATGCTCAGAATGCCCTGAGGCGCACCATGGAAACACACTCCAAAGTAACAAG
ATTCTTTTTTATATGCAACTATATCAGCAGGATTATCGAGCCCCTTGCTTCCAGGTGTGCAAAGTTCAGGTTTAAGCCACTTTCTGAGGAGGTTATGGGCAAACGTATAT
TGTACATTTGTAATGAAGAAGGTCTAAGTCTAGATGGAGAGGCTCTTTCAACCTTAAGTTCAATCTCTCAAGGTGACCTACGTCGAGCTATCACATACTTACAGTCAGCA
GCACGCTTATTTGGATCATCAATCTCTTCCAAGGATCTGATTAGTGTGTCTGGGGTTATCCCTTGGGATACTGTTAATGCGCTTTTTGCTGCTTGTAAAAGTGGTAACTT
TGATCTTGCAAACAAGGAAGTCAATAATGTGATTGCAGAGGGATATCCAGTGGCTCAGATGCTTTCACAGATATTTGAGTTGGTTGTTGAAGCCCATGATTTGCAAGATG
AACAGAAGGCCAGGATATGCAAGAAGTTGGCTGAAGCAGATAAGTGTCTGGTTGATGGTGCGGATGAATATTTGCAACTGCTGGATGTGGAGAAGTTCTACTCGATGGGT
GACACAATAAACGTTCAAGTTGACAGCAGGCTTAACTTACGTCCTGGCTTGGCAATATCTGTGAAGACAATGAAGAAGGGAGAGAATGTTGCTGATTGTCCTGAAAAAGA
CCCAGAGGGGGAGGAGAAGAATAATGATCAGCCAAATGGGGCACAATGGTTCAAGGCTGCGGACAGAGAAACTGTTTTTCCTGTATTTCATGATTTTGATCTACCATTTC
TCAATTTATCTATTCGACTCTTGGTGCAGACTTCTTGTCAAGAGATTGTGCAACTGAAAGGATTATGTGAAGTTTTTGTCAGAGTTTCTTTTTGCAACATTGATTTCATG
CAGCTGGCAAATGGTCAAAATTGGGGTAAAGTCAGGGCAAATTCTTTTACTGTTTTTTCCCCACCCATTGTAGCGCGGAAACCCATCAGACCCTGCGGATTGCGTGTCGA
GTTCGAAATGCGCTCCGTCTCGGAAAGAGAGAAAGAGAGAGGTGGATTAGTGCCGATGGAGGAAAGAAATGGCGCCATTGTTGTAGAGAGAGCTCTCGACCATGGCTGGA
GAAATAGTACGTCTCGTATTTTTGTCAAAGATGAAAGGATATCAACTTCCGACTTCCGAGTTCCGACGATCATATTCGTGCTTGTCCTTCTGCTTTGTCACAACCAAATC
TTGGTGGCGATGCTTCAAAATCGACCATTCGACACCCACATCGATCGTTCAGCCTCTACTTCCGCTACCCACCAATTATGGAGTTCGGAGTCTGTTTCCAACGTTTTAAG
ATCGGTACCTCGTTTCTTTTTTCAGTCAACCCGATCCATTGGCCGCCAAAAGGGTTTCCGGCATCGTGCTCCTCTTAAGCAGAGGAAACTGAAAGAGGAGGCATACAAGT
TTCGGAACAATGTGCTTGTTCTTGGCCCTGCTGCTCACAGAGATCCTTTCAAGGTCGAGCTTGGATTGAGTAAAGCGTTGGAGTTCTTTCATTGGGTTGAAACCCATTTT
GGATTTCTACACGACGAGATGACATGCAGAGAGATGGCTTGTGTATTGGCTCGAGGGAATAGATTGAGGAGTCTTTGGGATTTTCTGAAAGAAATGTCCAAGAGAGGGAA
TGGTGGGCTTGTTACTACTGCCACCATCACATGTTTGATAAAAGTTCTAGGAGAGCAGGGATTAGTAAATGAAGCGCTAACTGCATTCTACAGGATGAAGCAATTTCACT
GTAAGCCTGATGTTTTTGCTTATAATACTGTTATTAATGCTCTATGTAGGGTTGGGAATTTCAAGAAGGCCAGGTTTTTGTTGGAGCAGATGGAGTTGCCCGGATTTAGA
TGCCCACCAGACATTTTCACATATACAATTCTGATCAGTTCTTACTGTAAATACAGCTTGCAAACTGGGTGCAGAAAGGCCATCCGAAGGAGGTTGTGGGAAGCTAACCA
CTTGTTTAGAATAAGATACTATAGTGCTGTGAATGAGATTGACCTAGCAATCAAGATGCTAAGAATGATGCAGAAGATGAACCATGGAAGTGCCACATCAAGTTCTTATA
CTCCTATTATTCATGCACTATGTGAAGCAGGAAAGGTAATTGAGGCCTGTGATTTCCTCCTCGAGTTGCTTGAGGGAGGCTCTGTGCCTAGAGAATACCTATCAGTTGTG
ATGCATGCAATGAACAAAGACAATAGCTGCACTTTTCACAGTGAGGTAGTGAAGAAGGTTTGGATGAAAACAGGGTTTTATGGTTTGACTTTCTCCTCTTCTGCCCCTAA
CAGCCAACTGTGGCAAGTGCCATCTGATTTTCACCAATGTAGAAAACAAGCAGAAAATTCATTTAAAGAGATATGGTATGTGAACTGGCAATTTGGCATGTATTCAGGTG
CTCATACATCTGCAGACTCAACCATGCCAATGCCATTACCATTGCCATGCACGATAGAGCAGCAGCCGTTTCTGTTCTCCCTCCATCCATGGCCATCATTTCACTTTTTG
TTCGGATATAACACCGGCTTCCATTCCGATTGTAATAACTTCGACGGTGACGAAGAAGTGAGCGGAATGTGGAACGAATGGTTTGGAGTGTCTGGCCTTTTCATGGCGGC
GGTTCCGTGCAACTGCAGATTCTCCGTCTTCGATATGCTGATTACTCAGATAGCTTCAGCATTGACGGAACGTGGAACGTACGGTGGAGTCATTTTTCGTGGCGGCCGTT
CGGTGCAACTGCAGATTCTCCGTCTCCGATATGCTGATTACTCAGATAGCTTCGGCATTGAAACTTCAACGGCGACGAGAAGTGAGCGGAACGTGGAACGAATGGTGGAG
TCTGATCTTTTTGTGGCGGCTGTTCCGTGCAACTGCAGATTCTCCGTCTCCGATCTGTTAATTAGACAGATAGCTTCGGCATTAACTAAGAGAACTTTAACGGGAAGAAG
TGACGCCATGATCAAAACCTTACCGATTGACTTCGTCAAGCAAGTCGACGTTTGTGAATATCGGCAGATTAGCATCAATGACCATGATCACGATCTTGTCATAGTCTGTC
AGCAGCCACGACCTGAACTCCCTGTAGCTGCCACTCGTTGTAAGCATCCGTCTCTGCTTTTCAGTTTCCGATCCTTTGGATCGTCCTTATTTTCCATCCTGTTTCCTCCA
GTCCACTTGCTGCTAACATGCACAGCATGTAGAGGGCAGGTGGCTCTTCATCGGAGGTCGGTACAAAATCTTAAATGTGCAGGTACTTATCCAGAAGTTGATATTGTGAT
ATCCTTGAATTGA
Protein sequenceShow/hide protein sequence
MAPLLQSSQPWVEKYRPKQVKDVAHQDEVVRVLTNTLETASCPHMLFYGPPGTGKTTTALAIAHQLFGPELYKSRVLELNASDDRGINVVRTKIKDFAGVAVGSGQRQGS
YPCPPFKIIILDEADSMTEDAQNALRRTMETHSKVTRFFFICNYISRIIEPLASRCAKFRFKPLSEEVMGKRILYICNEEGLSLDGEALSTLSSISQGDLRRAITYLQSA
ARLFGSSISSKDLISVSGVIPWDTVNALFAACKSGNFDLANKEVNNVIAEGYPVAQMLSQIFELVVEAHDLQDEQKARICKKLAEADKCLVDGADEYLQLLDVEKFYSMG
DTINVQVDSRLNLRPGLAISVKTMKKGENVADCPEKDPEGEEKNNDQPNGAQWFKAADRETVFPVFHDFDLPFLNLSIRLLVQTSCQEIVQLKGLCEVFVRVSFCNIDFM
QLANGQNWGKVRANSFTVFSPPIVARKPIRPCGLRVEFEMRSVSEREKERGGLVPMEERNGAIVVERALDHGWRNSTSRIFVKDERISTSDFRVPTIIFVLVLLLCHNQI
LVAMLQNRPFDTHIDRSASTSATHQLWSSESVSNVLRSVPRFFFQSTRSIGRQKGFRHRAPLKQRKLKEEAYKFRNNVLVLGPAAHRDPFKVELGLSKALEFFHWVETHF
GFLHDEMTCREMACVLARGNRLRSLWDFLKEMSKRGNGGLVTTATITCLIKVLGEQGLVNEALTAFYRMKQFHCKPDVFAYNTVINALCRVGNFKKARFLLEQMELPGFR
CPPDIFTYTILISSYCKYSLQTGCRKAIRRRLWEANHLFRIRYYSAVNEIDLAIKMLRMMQKMNHGSATSSSYTPIIHALCEAGKVIEACDFLLELLEGGSVPREYLSVV
MHAMNKDNSCTFHSEVVKKVWMKTGFYGLTFSSSAPNSQLWQVPSDFHQCRKQAENSFKEIWYVNWQFGMYSGAHTSADSTMPMPLPLPCTIEQQPFLFSLHPWPSFHFL
FGYNTGFHSDCNNFDGDEEVSGMWNEWFGVSGLFMAAVPCNCRFSVFDMLITQIASALTERGTYGGVIFRGGRSVQLQILRLRYADYSDSFGIETSTATRSERNVERMVE
SDLFVAAVPCNCRFSVSDLLIRQIASALTKRTLTGRSDAMIKTLPIDFVKQVDVCEYRQISINDHDHDLVIVCQQPRPELPVAATRCKHPSLLFSFRSFGSSLFSILFPP
VHLLLTCTACRGQVALHRRSVQNLKCAGTYPEVDIVISLN