; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lsi01G006670 (gene) of Bottle gourd (USVL1VR-Ls) v1 genome

Gene IDLsi01G006670
OrganismLagenaria siceraria USVL1VR-Ls (Bottle gourd (USVL1VR-Ls) v1)
DescriptionInorganic diphosphatase
Genome locationchr01:5360402..5370495
RNA-Seq ExpressionLsi01G006670
SyntenyLsi01G006670
Gene Ontology termsGO:0006261 - DNA-dependent DNA replication (biological process)
GO:0006281 - DNA repair (biological process)
GO:0006796 - phosphate-containing compound metabolic process (biological process)
GO:0005634 - nucleus (cellular component)
GO:0005663 - DNA replication factor C complex (cellular component)
GO:0005737 - cytoplasm (cellular component)
GO:0000287 - magnesium ion binding (molecular function)
GO:0003677 - DNA binding (molecular function)
GO:0003689 - DNA clamp loader activity (molecular function)
GO:0004427 - inorganic diphosphatase activity (molecular function)
InterPro domainsIPR008162 - Inorganic pyrophosphatase
IPR008921 - DNA polymerase III, clamp loader complex, gamma/delta/delta subunit, C-terminal
IPR027417 - P-loop containing nucleoside triphosphate hydrolase
IPR036649 - Inorganic pyrophosphatase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008440733.1 PREDICTED: uncharacterized protein LOC103485060 isoform X1 [Cucumis melo]0.0e+0093.42Show/hide
Query:  MCPALNLMKQRKDGYEPSDTETEWQESPWNDPKEKKLVLDYNNRRTDSAVSKKFSTVANVSPPGSRRNGGKTPHRPAKDDTVLVMLQRNISPLSRAERRR
        M PALNLMKQRKDGYEPSDTETEWQESPWNDP EKKLVLDYNNRRTDSAVSKKFS  ANVSPPG RRNGG+TP RPAKDD+VLVMLQRNISPLSRAERRR
Subjt:  MCPALNLMKQRKDGYEPSDTETEWQESPWNDPKEKKLVLDYNNRRTDSAVSKKFSTVANVSPPGSRRNGGKTPHRPAKDDTVLVMLQRNISPLSRAERRR

Query:  HESPFKAAGEEIGSSSMRSRKEEKLTSSHGSNRISQKPGYSRRSVTAPRLRMRDEHMNAVNDLSQRRERAAPTLKVGSILQQPKEVSQVNSLSIGEMNEM
        HESPFKA+GEEIGSSSMRSRKEEK T SHGSN+ SQKP +SRRSVTAPRLRMRDEHM A NDLSQRRERAAPTLKV SILQQPKE+S V S SIGEMNEM
Subjt:  HESPFKAAGEEIGSSSMRSRKEEKLTSSHGSNRISQKPGYSRRSVTAPRLRMRDEHMNAVNDLSQRRERAAPTLKVGSILQQPKEVSQVNSLSIGEMNEM

Query:  LADGRANRGLVFNDPVVESTGSISPGDIFFSRDGLPVGMNNNVTAKRNAFKNYISPKPTFVSKKNVDTYNQVEVNANGRGVSSAGAGLSTTTTSSAAVSR
        +ADGR NRGL FNDPVVESTGSISPGDIFFSRDGLPVGMNNNVTAKRNAFKNYISPKPTFV+KKN DTYNQV VNANGR VSS G GLSTTT SSAAVSR
Subjt:  LADGRANRGLVFNDPVVESTGSISPGDIFFSRDGLPVGMNNNVTAKRNAFKNYISPKPTFVSKKNVDTYNQVEVNANGRGVSSAGAGLSTTTTSSAAVSR

Query:  ENSSRISIENSKISDVSGRTSESTRRFVANRRKKKNDIWFSCMRNGTCRTTKSPEKRPFDEALYIEKANVVEYLKPFWADQHRPVSLNGFTFHKHEAQLL
        ENSSRIS+ENSKISDVSGRTSEST+RF+ANRRKKKNDIWFSCMRNGTCRTTKSPEKRPFDEA YIEKANVVEYLKPFWADQHRPVSLNGFTFHKHEAQLL
Subjt:  ENSSRISIENSKISDVSGRTSESTRRFVANRRKKKNDIWFSCMRNGTCRTTKSPEKRPFDEALYIEKANVVEYLKPFWADQHRPVSLNGFTFHKHEAQLL

Query:  KQLASQDSFPHILFKGPSGSGKRVLMMALLREIYGDSCWNVSHDLRRFQIQERKLTQVFVPLTSSAHHVELNLSSEANAKYALLGLAKEIGSEYSINMEA
        KQL SQDSFPHILFKGP GSGKRVLMMALLREIYGDSCWNVSHDLRRFQIQERKLTQVFVPLTSSAHHVELNLSSE+NAKYALLGLAKEIGSEYSIN+EA
Subjt:  KQLASQDSFPHILFKGPSGSGKRVLMMALLREIYGDSCWNVSHDLRRFQIQERKLTQVFVPLTSSAHHVELNLSSEANAKYALLGLAKEIGSEYSINMEA

Query:  RNVNPKANFKVVVLLDVDKAAEDIQHLLRWIMDGYKDACKVVLCCEDDSGILESVISRCKVIKINPPVTHEIMDVLIQIAEKEEFDLPMNFASKIATKAK
        RNVNPKA FKVVVL+DVDKAAEDIQHLLRWIMDGYKDACKVVLCCEDDSGILESVISRCKVIKINPPVTHEIMDVLI+IAEKEEFDLPMNFASKIATKAK
Subjt:  RNVNPKANFKVVVLLDVDKAAEDIQHLLRWIMDGYKDACKVVLCCEDDSGILESVISRCKVIKINPPVTHEIMDVLIQIAEKEEFDLPMNFASKIATKAK

Query:  QNLRKAIMALEACKAHNYPFSDDQPIPIGWEEAVVELAAHILEDPSNPKLHQVKEKIQKLLVDSVHPKLILQKLVEQFLKRIEMRSRRELYYWHAYYNKR
        QNLR+AIMALEACKAHNYPFSDDQPIPIGWE+AVVELAAHILEDPSNP+LHQVKEKIQKLLVDSVHPKLILQKLVEQFLKRIEMRSRRELYYWHAYYNKR
Subjt:  QNLRKAIMALEACKAHNYPFSDDQPIPIGWEEAVVELAAHILEDPSNPKLHQVKEKIQKLLVDSVHPKLILQKLVEQFLKRIEMRSRRELYYWHAYYNKR

Query:  LPIETGTGALPKLE
        LPIETG GALPKLE
Subjt:  LPIETGTGALPKLE

XP_011658016.1 uncharacterized protein LOC101218071 isoform X1 [Cucumis sativus]0.0e+0093.56Show/hide
Query:  MCPALNLMKQRKDGYEPSDTETEWQESPWNDPKEKKLVLDYNNRRTDSAVSKKFSTVANVSPPGSRRNGGKTPHRPAKDDTVLVMLQRNISPLSRAERRR
        M PALNLMKQRKDGYEPSDTETEWQESPWNDPKEKKLVLDYNNRRTDSAV KKFS  ANVSPPG RRNGGKTP RPAKDD+VLVMLQRNISPLSRAERRR
Subjt:  MCPALNLMKQRKDGYEPSDTETEWQESPWNDPKEKKLVLDYNNRRTDSAVSKKFSTVANVSPPGSRRNGGKTPHRPAKDDTVLVMLQRNISPLSRAERRR

Query:  HESPFKAAGEEIGSSSMRSRKEEKLTSSHGSNRISQKPGYSRRSVTAPRLRMRDEHMNAVNDLSQRRERAAPTLKVGSILQQPKEVSQVNSLSIGEMNEM
        HESPFKA+GEEIGSSSMRSRKEEK T SHGSN+ SQKP YSRRSVTAPRLRM+DEHM A NDLSQRRERAAPTLKV SILQQPKEVS   S SIGEMNE+
Subjt:  HESPFKAAGEEIGSSSMRSRKEEKLTSSHGSNRISQKPGYSRRSVTAPRLRMRDEHMNAVNDLSQRRERAAPTLKVGSILQQPKEVSQVNSLSIGEMNEM

Query:  LADGRANRGLVFNDPVVESTGSISPGDIFFSRDGLPVGMNNNVTAKRNAFKNYISPKPTFVSKKNVDTYNQVEVNANGRGVSSAGAGLSTTTTSSAAVSR
        +ADGR NRGL  NDPVVESTGSISPGDIFFSRDGLPVGMNNNVTAKRNAFKNYISPKPTFV+KKN DTYNQVEVNANGRGVSS G GLSTTT SSAAVSR
Subjt:  LADGRANRGLVFNDPVVESTGSISPGDIFFSRDGLPVGMNNNVTAKRNAFKNYISPKPTFVSKKNVDTYNQVEVNANGRGVSSAGAGLSTTTTSSAAVSR

Query:  ENSSRISIENSKISDVSGRTSESTRRFVANRRKKKNDIWFSCMRNGTCRTTKSPEKRPFDEALYIEKANVVEYLKPFWADQHRPVSLNGFTFHKHEAQLL
        ENSSRIS+ENSKISDVSGRTSESTRRF+ANRRKKKNDIWFSCMRNGTCRTTKSPEKRPFDEA YIEKANVVEYLKPFWADQHRPVSLNGFTFHKHEAQLL
Subjt:  ENSSRISIENSKISDVSGRTSESTRRFVANRRKKKNDIWFSCMRNGTCRTTKSPEKRPFDEALYIEKANVVEYLKPFWADQHRPVSLNGFTFHKHEAQLL

Query:  KQLASQDSFPHILFKGPSGSGKRVLMMALLREIYGDSCWNVSHDLRRFQIQERKLTQVFVPLTSSAHHVELNLSSEANAKYALLGLAKEIGSEYSINMEA
        KQL SQDSFPHILFKGP GSGKRVLMMALLREIYGDSCWNVSHDLRRFQIQERKLTQVFVPLTSSAHHVELNLSSE+NAKYALLGLAKEIGSEYSIN+EA
Subjt:  KQLASQDSFPHILFKGPSGSGKRVLMMALLREIYGDSCWNVSHDLRRFQIQERKLTQVFVPLTSSAHHVELNLSSEANAKYALLGLAKEIGSEYSINMEA

Query:  RNVNPKANFKVVVLLDVDKAAEDIQHLLRWIMDGYKDACKVVLCCEDDSGILESVISRCKVIKINPPVTHEIMDVLIQIAEKEEFDLPMNFASKIATKAK
        RNVNPKA FKVVVLLDVDKA EDIQHLLRWIMDGYKDACKVVLCCEDDSGILESVISRCKVIKINPPVTHEIMDVLI+IAEKEEFDLPMNFASKIATKAK
Subjt:  RNVNPKANFKVVVLLDVDKAAEDIQHLLRWIMDGYKDACKVVLCCEDDSGILESVISRCKVIKINPPVTHEIMDVLIQIAEKEEFDLPMNFASKIATKAK

Query:  QNLRKAIMALEACKAHNYPFSDDQPIPIGWEEAVVELAAHILEDPSNPKLHQVKEKIQKLLVDSVHPKLILQKLVEQFLKRIEMRSRRELYYWHAYYNKR
        QNLRKAIMALEACKAHNYPFSDDQPIPIGWE+A+VELA+HILEDPSNP+LHQVKEKIQKLLVDSVHPKLILQKLVEQFLKRIEMRSRRELYYWHAYYNKR
Subjt:  QNLRKAIMALEACKAHNYPFSDDQPIPIGWEEAVVELAAHILEDPSNPKLHQVKEKIQKLLVDSVHPKLILQKLVEQFLKRIEMRSRRELYYWHAYYNKR

Query:  LPIETGTGALPKLE
        LPIETG GALPKLE
Subjt:  LPIETGTGALPKLE

XP_011658017.1 uncharacterized protein LOC101218071 isoform X2 [Cucumis sativus]0.0e+0093.28Show/hide
Query:  MCPALNLMKQRKDGYEPSDTETEWQESPWNDPKEKKLVLDYNNRRTDSAVSKKFSTVANVSPPGSRRNGGKTPHRPAKDDTVLVMLQRNISPLSRAERRR
        M PALNLMKQRKDGYEPSDTETEWQESPWNDPKEKKLVLDYNNRRTDSAV KKFS  ANVSPPG RRNGGKTP RPAKDD+VLVMLQRNISPLSRAERRR
Subjt:  MCPALNLMKQRKDGYEPSDTETEWQESPWNDPKEKKLVLDYNNRRTDSAVSKKFSTVANVSPPGSRRNGGKTPHRPAKDDTVLVMLQRNISPLSRAERRR

Query:  HESPFKAAGEEIGSSSMRSRKEEKLTSSHGSNRISQKPGYSRRSVTAPRLRMRDEHMNAVNDLSQRRERAAPTLKVGSILQQPKEVSQVNSLSIGEMNEM
        HESPFKA+GEEIGSSSMRSRKEEK T SHGSN+ SQKP YSRRSVTAPRLRM+DEHM A NDLSQRRERAAPTLKV SILQQPKEVS   S SIGEMNE+
Subjt:  HESPFKAAGEEIGSSSMRSRKEEKLTSSHGSNRISQKPGYSRRSVTAPRLRMRDEHMNAVNDLSQRRERAAPTLKVGSILQQPKEVSQVNSLSIGEMNEM

Query:  LADGRANRGLVFNDPVVESTGSISPGDIFFSRDGLPVGMNNNVTAKRNAFKNYISPKPTFVSKKNVDTYNQVEVNANGRGVSSAGAGLSTTTTSSAAVSR
        +ADGR NRGL  NDPVVESTGSISPGDIFFSRDGLPVGMNNNVTAKRNAFKNYISPKPTFV+KKN DTYNQVEVNANGRGVSS G GLSTTT SSAAVSR
Subjt:  LADGRANRGLVFNDPVVESTGSISPGDIFFSRDGLPVGMNNNVTAKRNAFKNYISPKPTFVSKKNVDTYNQVEVNANGRGVSSAGAGLSTTTTSSAAVSR

Query:  ENSSRISIENSKISDVSGRTSESTRRFVANRRKKKNDIWFSCMRNGTCRTTKSPEKRPFDEALYIEKANVVEYLKPFWADQHRPVSLNGFTFHKHEAQLL
        ENSSRIS+ENSKISDVSGRTSESTRRF+ANRRKKKNDIWFSCMRNGTCRTTKSPEKRPFDEA YIEKANVVEYLKPFWADQHRPVSLNGFTFHKHEAQLL
Subjt:  ENSSRISIENSKISDVSGRTSESTRRFVANRRKKKNDIWFSCMRNGTCRTTKSPEKRPFDEALYIEKANVVEYLKPFWADQHRPVSLNGFTFHKHEAQLL

Query:  KQLASQDSFPHILFKGPSGSGKRVLMMALLREIYGDSCWNVSHDLRRFQIQERKLTQVFVPLTSSAHHVELNLSSEANAKYALLGLAKEIGSEYSINMEA
        KQL   DSFPHILFKGP GSGKRVLMMALLREIYGDSCWNVSHDLRRFQIQERKLTQVFVPLTSSAHHVELNLSSE+NAKYALLGLAKEIGSEYSIN+EA
Subjt:  KQLASQDSFPHILFKGPSGSGKRVLMMALLREIYGDSCWNVSHDLRRFQIQERKLTQVFVPLTSSAHHVELNLSSEANAKYALLGLAKEIGSEYSINMEA

Query:  RNVNPKANFKVVVLLDVDKAAEDIQHLLRWIMDGYKDACKVVLCCEDDSGILESVISRCKVIKINPPVTHEIMDVLIQIAEKEEFDLPMNFASKIATKAK
        RNVNPKA FKVVVLLDVDKA EDIQHLLRWIMDGYKDACKVVLCCEDDSGILESVISRCKVIKINPPVTHEIMDVLI+IAEKEEFDLPMNFASKIATKAK
Subjt:  RNVNPKANFKVVVLLDVDKAAEDIQHLLRWIMDGYKDACKVVLCCEDDSGILESVISRCKVIKINPPVTHEIMDVLIQIAEKEEFDLPMNFASKIATKAK

Query:  QNLRKAIMALEACKAHNYPFSDDQPIPIGWEEAVVELAAHILEDPSNPKLHQVKEKIQKLLVDSVHPKLILQKLVEQFLKRIEMRSRRELYYWHAYYNKR
        QNLRKAIMALEACKAHNYPFSDDQPIPIGWE+A+VELA+HILEDPSNP+LHQVKEKIQKLLVDSVHPKLILQKLVEQFLKRIEMRSRRELYYWHAYYNKR
Subjt:  QNLRKAIMALEACKAHNYPFSDDQPIPIGWEEAVVELAAHILEDPSNPKLHQVKEKIQKLLVDSVHPKLILQKLVEQFLKRIEMRSRRELYYWHAYYNKR

Query:  LPIETGTGALPKLE
        LPIETG GALPKLE
Subjt:  LPIETGTGALPKLE

XP_038883850.1 uncharacterized protein LOC120074706 isoform X1 [Benincasa hispida]0.0e+0094.68Show/hide
Query:  MCPALNLMKQRKDGYEPSDTETEWQESPWNDPKEKKLVLDYNNRRTDSAVSKKFSTVANVSPPGSRRNGGKTPHRPAKDDTVLVMLQRNISPLSRAERRR
        MCPALNLMKQRKDGYEPSDTETEWQESPWNDPKEKKLVLDYNNRRTDSAVSKKF+  ANVSPPG RRNGGKTP RPAKDD+VLVMLQRNISPLSRAERRR
Subjt:  MCPALNLMKQRKDGYEPSDTETEWQESPWNDPKEKKLVLDYNNRRTDSAVSKKFSTVANVSPPGSRRNGGKTPHRPAKDDTVLVMLQRNISPLSRAERRR

Query:  HESPFKAAGEEIGSSSMRSRKEEKLTSSHGSNRISQKPGYSRRSVTAPRLRMRDEHMNAVNDLSQRRERAAPTLKVGSILQQPKEVSQVNSLSIGEMNEM
        HESPFKAAGEEIGSSSMRSRKEE LTSSHGS+RISQKPGYSRRSVTAPRLRMRDEHM AVNDLSQRRER APTL+V SILQQPKEVSQVNSLSIGEMNE+
Subjt:  HESPFKAAGEEIGSSSMRSRKEEKLTSSHGSNRISQKPGYSRRSVTAPRLRMRDEHMNAVNDLSQRRERAAPTLKVGSILQQPKEVSQVNSLSIGEMNEM

Query:  LADGRANRGLVFNDPVVESTGSISPGDIFFSRDGLPVGMNNNVTAKRNAFKNYISPKPTFVSKKNVDTYNQVEVNANGRGVSSAGAGLSTTTTSSAAVSR
        +ADGR NRGL FNDPV ES GSISPGDIFFSRDGLP+GMNNNVT+KRNAFKNYISPKP FV+KKN DTYNQV VNANGRGVSSAGAGLS+TTTSSAAVSR
Subjt:  LADGRANRGLVFNDPVVESTGSISPGDIFFSRDGLPVGMNNNVTAKRNAFKNYISPKPTFVSKKNVDTYNQVEVNANGRGVSSAGAGLSTTTTSSAAVSR

Query:  ENSSRISIENSKISDVSGRTSESTRRFVANRRKKKNDIWFSCMRNGTCRTTKSPEKRPFDEALYIEKANVVEYLKPFWADQHRPVSLNGFTFHKHEAQLL
        ENSSRISIENSKISDVSGRTSEST+RF+ANRRKKKNDIWFSCMRNGTCRTTKSPEKRPFDEA YIEKANVVEYLKPFWADQHRPVSLNGFTFHKHEAQLL
Subjt:  ENSSRISIENSKISDVSGRTSESTRRFVANRRKKKNDIWFSCMRNGTCRTTKSPEKRPFDEALYIEKANVVEYLKPFWADQHRPVSLNGFTFHKHEAQLL

Query:  KQLASQDSFPHILFKGPSGSGKRVLMMALLREIYGDSCWNVSHDLRRFQIQERKLTQVFVPLTSSAHHVELNLSSEANAKYALLGLAKEIGSEYSINMEA
        KQL SQDSFPHILFKGP GSGKRVLMMALLREIYGDSCWNVSHDLRRFQIQERKLTQVFVPLTSSAHHVELNLSSE+NAKYALLGLAKEIGSEYSIN+EA
Subjt:  KQLASQDSFPHILFKGPSGSGKRVLMMALLREIYGDSCWNVSHDLRRFQIQERKLTQVFVPLTSSAHHVELNLSSEANAKYALLGLAKEIGSEYSINMEA

Query:  RNVNPKANFKVVVLLDVDKAAEDIQHLLRWIMDGYKDACKVVLCCEDDSGILESVISRCKVIKINPPVTHEIMDVLIQIAEKEEFDLPMNFASKIATKAK
        RNVNPKANFKVVVLLDVDKAAEDIQHLLRWIMDGYKDACKVVLCCEDD+GILESVI RCKVIKINPPVTHEIMDVLI+IAEKEEFDLPMNFASKIATKAK
Subjt:  RNVNPKANFKVVVLLDVDKAAEDIQHLLRWIMDGYKDACKVVLCCEDDSGILESVISRCKVIKINPPVTHEIMDVLIQIAEKEEFDLPMNFASKIATKAK

Query:  QNLRKAIMALEACKAHNYPFSDDQPIPIGWEEAVVELAAHILEDPSNPKLHQVKEKIQKLLVDSVHPKLILQKLVEQFLKRIEMRSRRELYYWHAYYNKR
        QNLRKAIMALEACKAHNYPFSDDQPIPIGWEEAVVELAAHILEDPSNP+LHQVKEKIQKLLVDSVHPKLILQKLVEQFLKRIE+RSRRELYYWHAYYNKR
Subjt:  QNLRKAIMALEACKAHNYPFSDDQPIPIGWEEAVVELAAHILEDPSNPKLHQVKEKIQKLLVDSVHPKLILQKLVEQFLKRIEMRSRRELYYWHAYYNKR

Query:  LPIETGTGALPKLE
        LPIETGT ALPKLE
Subjt:  LPIETGTGALPKLE

XP_038883853.1 uncharacterized protein LOC120074706 isoform X2 [Benincasa hispida]0.0e+0094.4Show/hide
Query:  MCPALNLMKQRKDGYEPSDTETEWQESPWNDPKEKKLVLDYNNRRTDSAVSKKFSTVANVSPPGSRRNGGKTPHRPAKDDTVLVMLQRNISPLSRAERRR
        MCPALNLMKQRKDGYEPSDTETEWQESPWNDPKEKKLVLDYNNRRTDSAVSKKF+  ANVSPPG RRNGGKTP RPAKDD+VLVMLQRNISPLSRAERRR
Subjt:  MCPALNLMKQRKDGYEPSDTETEWQESPWNDPKEKKLVLDYNNRRTDSAVSKKFSTVANVSPPGSRRNGGKTPHRPAKDDTVLVMLQRNISPLSRAERRR

Query:  HESPFKAAGEEIGSSSMRSRKEEKLTSSHGSNRISQKPGYSRRSVTAPRLRMRDEHMNAVNDLSQRRERAAPTLKVGSILQQPKEVSQVNSLSIGEMNEM
        HESPFKAAGEEIGSSSMRSRKEE LTSSHGS+RISQKPGYSRRSVTAPRLRMRDEHM AVNDLSQRRER APTL+V SILQQPKEVSQVNSLSIGEMNE+
Subjt:  HESPFKAAGEEIGSSSMRSRKEEKLTSSHGSNRISQKPGYSRRSVTAPRLRMRDEHMNAVNDLSQRRERAAPTLKVGSILQQPKEVSQVNSLSIGEMNEM

Query:  LADGRANRGLVFNDPVVESTGSISPGDIFFSRDGLPVGMNNNVTAKRNAFKNYISPKPTFVSKKNVDTYNQVEVNANGRGVSSAGAGLSTTTTSSAAVSR
        +ADGR NRGL FNDPV ES GSISPGDIFFSRDGLP+GMNNNVT+KRNAFKNYISPKP FV+KKN DTYNQV VNANGRGVSSAGAGLS+TTTSSAAVSR
Subjt:  LADGRANRGLVFNDPVVESTGSISPGDIFFSRDGLPVGMNNNVTAKRNAFKNYISPKPTFVSKKNVDTYNQVEVNANGRGVSSAGAGLSTTTTSSAAVSR

Query:  ENSSRISIENSKISDVSGRTSESTRRFVANRRKKKNDIWFSCMRNGTCRTTKSPEKRPFDEALYIEKANVVEYLKPFWADQHRPVSLNGFTFHKHEAQLL
        ENSSRISIENSKISDVSGRTSEST+RF+ANRRKKKNDIWFSCMRNGTCRTTKSPEKRPFDEA YIEKANVVEYLKPFWADQHRPVSLNGFTFHKHEAQLL
Subjt:  ENSSRISIENSKISDVSGRTSESTRRFVANRRKKKNDIWFSCMRNGTCRTTKSPEKRPFDEALYIEKANVVEYLKPFWADQHRPVSLNGFTFHKHEAQLL

Query:  KQLASQDSFPHILFKGPSGSGKRVLMMALLREIYGDSCWNVSHDLRRFQIQERKLTQVFVPLTSSAHHVELNLSSEANAKYALLGLAKEIGSEYSINMEA
        KQL   DSFPHILFKGP GSGKRVLMMALLREIYGDSCWNVSHDLRRFQIQERKLTQVFVPLTSSAHHVELNLSSE+NAKYALLGLAKEIGSEYSIN+EA
Subjt:  KQLASQDSFPHILFKGPSGSGKRVLMMALLREIYGDSCWNVSHDLRRFQIQERKLTQVFVPLTSSAHHVELNLSSEANAKYALLGLAKEIGSEYSINMEA

Query:  RNVNPKANFKVVVLLDVDKAAEDIQHLLRWIMDGYKDACKVVLCCEDDSGILESVISRCKVIKINPPVTHEIMDVLIQIAEKEEFDLPMNFASKIATKAK
        RNVNPKANFKVVVLLDVDKAAEDIQHLLRWIMDGYKDACKVVLCCEDD+GILESVI RCKVIKINPPVTHEIMDVLI+IAEKEEFDLPMNFASKIATKAK
Subjt:  RNVNPKANFKVVVLLDVDKAAEDIQHLLRWIMDGYKDACKVVLCCEDDSGILESVISRCKVIKINPPVTHEIMDVLIQIAEKEEFDLPMNFASKIATKAK

Query:  QNLRKAIMALEACKAHNYPFSDDQPIPIGWEEAVVELAAHILEDPSNPKLHQVKEKIQKLLVDSVHPKLILQKLVEQFLKRIEMRSRRELYYWHAYYNKR
        QNLRKAIMALEACKAHNYPFSDDQPIPIGWEEAVVELAAHILEDPSNP+LHQVKEKIQKLLVDSVHPKLILQKLVEQFLKRIE+RSRRELYYWHAYYNKR
Subjt:  QNLRKAIMALEACKAHNYPFSDDQPIPIGWEEAVVELAAHILEDPSNPKLHQVKEKIQKLLVDSVHPKLILQKLVEQFLKRIEMRSRRELYYWHAYYNKR

Query:  LPIETGTGALPKLE
        LPIETGT ALPKLE
Subjt:  LPIETGTGALPKLE

TrEMBL top hitse value%identityAlignment
A0A0A0KJ59 Uncharacterized protein0.0e+0093.56Show/hide
Query:  MCPALNLMKQRKDGYEPSDTETEWQESPWNDPKEKKLVLDYNNRRTDSAVSKKFSTVANVSPPGSRRNGGKTPHRPAKDDTVLVMLQRNISPLSRAERRR
        M PALNLMKQRKDGYEPSDTETEWQESPWNDPKEKKLVLDYNNRRTDSAV KKFS  ANVSPPG RRNGGKTP RPAKDD+VLVMLQRNISPLSRAERRR
Subjt:  MCPALNLMKQRKDGYEPSDTETEWQESPWNDPKEKKLVLDYNNRRTDSAVSKKFSTVANVSPPGSRRNGGKTPHRPAKDDTVLVMLQRNISPLSRAERRR

Query:  HESPFKAAGEEIGSSSMRSRKEEKLTSSHGSNRISQKPGYSRRSVTAPRLRMRDEHMNAVNDLSQRRERAAPTLKVGSILQQPKEVSQVNSLSIGEMNEM
        HESPFKA+GEEIGSSSMRSRKEEK T SHGSN+ SQKP YSRRSVTAPRLRM+DEHM A NDLSQRRERAAPTLKV SILQQPKEVS   S SIGEMNE+
Subjt:  HESPFKAAGEEIGSSSMRSRKEEKLTSSHGSNRISQKPGYSRRSVTAPRLRMRDEHMNAVNDLSQRRERAAPTLKVGSILQQPKEVSQVNSLSIGEMNEM

Query:  LADGRANRGLVFNDPVVESTGSISPGDIFFSRDGLPVGMNNNVTAKRNAFKNYISPKPTFVSKKNVDTYNQVEVNANGRGVSSAGAGLSTTTTSSAAVSR
        +ADGR NRGL  NDPVVESTGSISPGDIFFSRDGLPVGMNNNVTAKRNAFKNYISPKPTFV+KKN DTYNQVEVNANGRGVSS G GLSTTT SSAAVSR
Subjt:  LADGRANRGLVFNDPVVESTGSISPGDIFFSRDGLPVGMNNNVTAKRNAFKNYISPKPTFVSKKNVDTYNQVEVNANGRGVSSAGAGLSTTTTSSAAVSR

Query:  ENSSRISIENSKISDVSGRTSESTRRFVANRRKKKNDIWFSCMRNGTCRTTKSPEKRPFDEALYIEKANVVEYLKPFWADQHRPVSLNGFTFHKHEAQLL
        ENSSRIS+ENSKISDVSGRTSESTRRF+ANRRKKKNDIWFSCMRNGTCRTTKSPEKRPFDEA YIEKANVVEYLKPFWADQHRPVSLNGFTFHKHEAQLL
Subjt:  ENSSRISIENSKISDVSGRTSESTRRFVANRRKKKNDIWFSCMRNGTCRTTKSPEKRPFDEALYIEKANVVEYLKPFWADQHRPVSLNGFTFHKHEAQLL

Query:  KQLASQDSFPHILFKGPSGSGKRVLMMALLREIYGDSCWNVSHDLRRFQIQERKLTQVFVPLTSSAHHVELNLSSEANAKYALLGLAKEIGSEYSINMEA
        KQL SQDSFPHILFKGP GSGKRVLMMALLREIYGDSCWNVSHDLRRFQIQERKLTQVFVPLTSSAHHVELNLSSE+NAKYALLGLAKEIGSEYSIN+EA
Subjt:  KQLASQDSFPHILFKGPSGSGKRVLMMALLREIYGDSCWNVSHDLRRFQIQERKLTQVFVPLTSSAHHVELNLSSEANAKYALLGLAKEIGSEYSINMEA

Query:  RNVNPKANFKVVVLLDVDKAAEDIQHLLRWIMDGYKDACKVVLCCEDDSGILESVISRCKVIKINPPVTHEIMDVLIQIAEKEEFDLPMNFASKIATKAK
        RNVNPKA FKVVVLLDVDKA EDIQHLLRWIMDGYKDACKVVLCCEDDSGILESVISRCKVIKINPPVTHEIMDVLI+IAEKEEFDLPMNFASKIATKAK
Subjt:  RNVNPKANFKVVVLLDVDKAAEDIQHLLRWIMDGYKDACKVVLCCEDDSGILESVISRCKVIKINPPVTHEIMDVLIQIAEKEEFDLPMNFASKIATKAK

Query:  QNLRKAIMALEACKAHNYPFSDDQPIPIGWEEAVVELAAHILEDPSNPKLHQVKEKIQKLLVDSVHPKLILQKLVEQFLKRIEMRSRRELYYWHAYYNKR
        QNLRKAIMALEACKAHNYPFSDDQPIPIGWE+A+VELA+HILEDPSNP+LHQVKEKIQKLLVDSVHPKLILQKLVEQFLKRIEMRSRRELYYWHAYYNKR
Subjt:  QNLRKAIMALEACKAHNYPFSDDQPIPIGWEEAVVELAAHILEDPSNPKLHQVKEKIQKLLVDSVHPKLILQKLVEQFLKRIEMRSRRELYYWHAYYNKR

Query:  LPIETGTGALPKLE
        LPIETG GALPKLE
Subjt:  LPIETGTGALPKLE

A0A1S3B1D0 uncharacterized protein LOC103485060 isoform X20.0e+0093.14Show/hide
Query:  MCPALNLMKQRKDGYEPSDTETEWQESPWNDPKEKKLVLDYNNRRTDSAVSKKFSTVANVSPPGSRRNGGKTPHRPAKDDTVLVMLQRNISPLSRAERRR
        M PALNLMKQRKDGYEPSDTETEWQESPWNDP EKKLVLDYNNRRTDSAVSKKFS  ANVSPPG RRNGG+TP RPAKDD+VLVMLQRNISPLSRAERRR
Subjt:  MCPALNLMKQRKDGYEPSDTETEWQESPWNDPKEKKLVLDYNNRRTDSAVSKKFSTVANVSPPGSRRNGGKTPHRPAKDDTVLVMLQRNISPLSRAERRR

Query:  HESPFKAAGEEIGSSSMRSRKEEKLTSSHGSNRISQKPGYSRRSVTAPRLRMRDEHMNAVNDLSQRRERAAPTLKVGSILQQPKEVSQVNSLSIGEMNEM
        HESPFKA+GEEIGSSSMRSRKEEK T SHGSN+ SQKP +SRRSVTAPRLRMRDEHM A NDLSQRRERAAPTLKV SILQQPKE+S V S SIGEMNEM
Subjt:  HESPFKAAGEEIGSSSMRSRKEEKLTSSHGSNRISQKPGYSRRSVTAPRLRMRDEHMNAVNDLSQRRERAAPTLKVGSILQQPKEVSQVNSLSIGEMNEM

Query:  LADGRANRGLVFNDPVVESTGSISPGDIFFSRDGLPVGMNNNVTAKRNAFKNYISPKPTFVSKKNVDTYNQVEVNANGRGVSSAGAGLSTTTTSSAAVSR
        +ADGR NRGL FNDPVVESTGSISPGDIFFSRDGLPVGMNNNVTAKRNAFKNYISPKPTFV+KKN DTYNQV VNANGR VSS G GLSTTT SSAAVSR
Subjt:  LADGRANRGLVFNDPVVESTGSISPGDIFFSRDGLPVGMNNNVTAKRNAFKNYISPKPTFVSKKNVDTYNQVEVNANGRGVSSAGAGLSTTTTSSAAVSR

Query:  ENSSRISIENSKISDVSGRTSESTRRFVANRRKKKNDIWFSCMRNGTCRTTKSPEKRPFDEALYIEKANVVEYLKPFWADQHRPVSLNGFTFHKHEAQLL
        ENSSRIS+ENSKISDVSGRTSEST+RF+ANRRKKKNDIWFSCMRNGTCRTTKSPEKRPFDEA YIEKANVVEYLKPFWADQHRPVSLNGFTFHKHEAQLL
Subjt:  ENSSRISIENSKISDVSGRTSESTRRFVANRRKKKNDIWFSCMRNGTCRTTKSPEKRPFDEALYIEKANVVEYLKPFWADQHRPVSLNGFTFHKHEAQLL

Query:  KQLASQDSFPHILFKGPSGSGKRVLMMALLREIYGDSCWNVSHDLRRFQIQERKLTQVFVPLTSSAHHVELNLSSEANAKYALLGLAKEIGSEYSINMEA
        KQL   DSFPHILFKGP GSGKRVLMMALLREIYGDSCWNVSHDLRRFQIQERKLTQVFVPLTSSAHHVELNLSSE+NAKYALLGLAKEIGSEYSIN+EA
Subjt:  KQLASQDSFPHILFKGPSGSGKRVLMMALLREIYGDSCWNVSHDLRRFQIQERKLTQVFVPLTSSAHHVELNLSSEANAKYALLGLAKEIGSEYSINMEA

Query:  RNVNPKANFKVVVLLDVDKAAEDIQHLLRWIMDGYKDACKVVLCCEDDSGILESVISRCKVIKINPPVTHEIMDVLIQIAEKEEFDLPMNFASKIATKAK
        RNVNPKA FKVVVL+DVDKAAEDIQHLLRWIMDGYKDACKVVLCCEDDSGILESVISRCKVIKINPPVTHEIMDVLI+IAEKEEFDLPMNFASKIATKAK
Subjt:  RNVNPKANFKVVVLLDVDKAAEDIQHLLRWIMDGYKDACKVVLCCEDDSGILESVISRCKVIKINPPVTHEIMDVLIQIAEKEEFDLPMNFASKIATKAK

Query:  QNLRKAIMALEACKAHNYPFSDDQPIPIGWEEAVVELAAHILEDPSNPKLHQVKEKIQKLLVDSVHPKLILQKLVEQFLKRIEMRSRRELYYWHAYYNKR
        QNLR+AIMALEACKAHNYPFSDDQPIPIGWE+AVVELAAHILEDPSNP+LHQVKEKIQKLLVDSVHPKLILQKLVEQFLKRIEMRSRRELYYWHAYYNKR
Subjt:  QNLRKAIMALEACKAHNYPFSDDQPIPIGWEEAVVELAAHILEDPSNPKLHQVKEKIQKLLVDSVHPKLILQKLVEQFLKRIEMRSRRELYYWHAYYNKR

Query:  LPIETGTGALPKLE
        LPIETG GALPKLE
Subjt:  LPIETGTGALPKLE

A0A1S3B2J9 uncharacterized protein LOC103485060 isoform X10.0e+0093.42Show/hide
Query:  MCPALNLMKQRKDGYEPSDTETEWQESPWNDPKEKKLVLDYNNRRTDSAVSKKFSTVANVSPPGSRRNGGKTPHRPAKDDTVLVMLQRNISPLSRAERRR
        M PALNLMKQRKDGYEPSDTETEWQESPWNDP EKKLVLDYNNRRTDSAVSKKFS  ANVSPPG RRNGG+TP RPAKDD+VLVMLQRNISPLSRAERRR
Subjt:  MCPALNLMKQRKDGYEPSDTETEWQESPWNDPKEKKLVLDYNNRRTDSAVSKKFSTVANVSPPGSRRNGGKTPHRPAKDDTVLVMLQRNISPLSRAERRR

Query:  HESPFKAAGEEIGSSSMRSRKEEKLTSSHGSNRISQKPGYSRRSVTAPRLRMRDEHMNAVNDLSQRRERAAPTLKVGSILQQPKEVSQVNSLSIGEMNEM
        HESPFKA+GEEIGSSSMRSRKEEK T SHGSN+ SQKP +SRRSVTAPRLRMRDEHM A NDLSQRRERAAPTLKV SILQQPKE+S V S SIGEMNEM
Subjt:  HESPFKAAGEEIGSSSMRSRKEEKLTSSHGSNRISQKPGYSRRSVTAPRLRMRDEHMNAVNDLSQRRERAAPTLKVGSILQQPKEVSQVNSLSIGEMNEM

Query:  LADGRANRGLVFNDPVVESTGSISPGDIFFSRDGLPVGMNNNVTAKRNAFKNYISPKPTFVSKKNVDTYNQVEVNANGRGVSSAGAGLSTTTTSSAAVSR
        +ADGR NRGL FNDPVVESTGSISPGDIFFSRDGLPVGMNNNVTAKRNAFKNYISPKPTFV+KKN DTYNQV VNANGR VSS G GLSTTT SSAAVSR
Subjt:  LADGRANRGLVFNDPVVESTGSISPGDIFFSRDGLPVGMNNNVTAKRNAFKNYISPKPTFVSKKNVDTYNQVEVNANGRGVSSAGAGLSTTTTSSAAVSR

Query:  ENSSRISIENSKISDVSGRTSESTRRFVANRRKKKNDIWFSCMRNGTCRTTKSPEKRPFDEALYIEKANVVEYLKPFWADQHRPVSLNGFTFHKHEAQLL
        ENSSRIS+ENSKISDVSGRTSEST+RF+ANRRKKKNDIWFSCMRNGTCRTTKSPEKRPFDEA YIEKANVVEYLKPFWADQHRPVSLNGFTFHKHEAQLL
Subjt:  ENSSRISIENSKISDVSGRTSESTRRFVANRRKKKNDIWFSCMRNGTCRTTKSPEKRPFDEALYIEKANVVEYLKPFWADQHRPVSLNGFTFHKHEAQLL

Query:  KQLASQDSFPHILFKGPSGSGKRVLMMALLREIYGDSCWNVSHDLRRFQIQERKLTQVFVPLTSSAHHVELNLSSEANAKYALLGLAKEIGSEYSINMEA
        KQL SQDSFPHILFKGP GSGKRVLMMALLREIYGDSCWNVSHDLRRFQIQERKLTQVFVPLTSSAHHVELNLSSE+NAKYALLGLAKEIGSEYSIN+EA
Subjt:  KQLASQDSFPHILFKGPSGSGKRVLMMALLREIYGDSCWNVSHDLRRFQIQERKLTQVFVPLTSSAHHVELNLSSEANAKYALLGLAKEIGSEYSINMEA

Query:  RNVNPKANFKVVVLLDVDKAAEDIQHLLRWIMDGYKDACKVVLCCEDDSGILESVISRCKVIKINPPVTHEIMDVLIQIAEKEEFDLPMNFASKIATKAK
        RNVNPKA FKVVVL+DVDKAAEDIQHLLRWIMDGYKDACKVVLCCEDDSGILESVISRCKVIKINPPVTHEIMDVLI+IAEKEEFDLPMNFASKIATKAK
Subjt:  RNVNPKANFKVVVLLDVDKAAEDIQHLLRWIMDGYKDACKVVLCCEDDSGILESVISRCKVIKINPPVTHEIMDVLIQIAEKEEFDLPMNFASKIATKAK

Query:  QNLRKAIMALEACKAHNYPFSDDQPIPIGWEEAVVELAAHILEDPSNPKLHQVKEKIQKLLVDSVHPKLILQKLVEQFLKRIEMRSRRELYYWHAYYNKR
        QNLR+AIMALEACKAHNYPFSDDQPIPIGWE+AVVELAAHILEDPSNP+LHQVKEKIQKLLVDSVHPKLILQKLVEQFLKRIEMRSRRELYYWHAYYNKR
Subjt:  QNLRKAIMALEACKAHNYPFSDDQPIPIGWEEAVVELAAHILEDPSNPKLHQVKEKIQKLLVDSVHPKLILQKLVEQFLKRIEMRSRRELYYWHAYYNKR

Query:  LPIETGTGALPKLE
        LPIETG GALPKLE
Subjt:  LPIETGTGALPKLE

A0A5A7T3Q0 Putative ATPase family associated with various cellular activities0.0e+0093.42Show/hide
Query:  MCPALNLMKQRKDGYEPSDTETEWQESPWNDPKEKKLVLDYNNRRTDSAVSKKFSTVANVSPPGSRRNGGKTPHRPAKDDTVLVMLQRNISPLSRAERRR
        M PALNLMKQRKDGYEPSDTETEWQESPWNDP EKKLVLDYNNRRTDSAVSKKFS  ANVSPPG RRNGG+TP RPAKDD+VLVMLQRNISPLSRAERRR
Subjt:  MCPALNLMKQRKDGYEPSDTETEWQESPWNDPKEKKLVLDYNNRRTDSAVSKKFSTVANVSPPGSRRNGGKTPHRPAKDDTVLVMLQRNISPLSRAERRR

Query:  HESPFKAAGEEIGSSSMRSRKEEKLTSSHGSNRISQKPGYSRRSVTAPRLRMRDEHMNAVNDLSQRRERAAPTLKVGSILQQPKEVSQVNSLSIGEMNEM
        HESPFKA+GEEIGSSSMRSRKEEK T SHGSN+ SQKP +SRRSVTAPRLRMRDEHM A NDLSQRRERAAPTLKV SILQQPKE+S V S SIGEMNEM
Subjt:  HESPFKAAGEEIGSSSMRSRKEEKLTSSHGSNRISQKPGYSRRSVTAPRLRMRDEHMNAVNDLSQRRERAAPTLKVGSILQQPKEVSQVNSLSIGEMNEM

Query:  LADGRANRGLVFNDPVVESTGSISPGDIFFSRDGLPVGMNNNVTAKRNAFKNYISPKPTFVSKKNVDTYNQVEVNANGRGVSSAGAGLSTTTTSSAAVSR
        +ADGR NRGL FNDPVVESTGSISPGDIFFSRDGLPVGMNNNVTAKRNAFKNYISPKPTFV+KKN DTYNQV VNANGR VSS G GLSTTT SSAAVSR
Subjt:  LADGRANRGLVFNDPVVESTGSISPGDIFFSRDGLPVGMNNNVTAKRNAFKNYISPKPTFVSKKNVDTYNQVEVNANGRGVSSAGAGLSTTTTSSAAVSR

Query:  ENSSRISIENSKISDVSGRTSESTRRFVANRRKKKNDIWFSCMRNGTCRTTKSPEKRPFDEALYIEKANVVEYLKPFWADQHRPVSLNGFTFHKHEAQLL
        ENSSRIS+ENSKISDVSGRTSEST+RF+ANRRKKKNDIWFSCMRNGTCRTTKSPEKRPFDEA YIEKANVVEYLKPFWADQHRPVSLNGFTFHKHEAQLL
Subjt:  ENSSRISIENSKISDVSGRTSESTRRFVANRRKKKNDIWFSCMRNGTCRTTKSPEKRPFDEALYIEKANVVEYLKPFWADQHRPVSLNGFTFHKHEAQLL

Query:  KQLASQDSFPHILFKGPSGSGKRVLMMALLREIYGDSCWNVSHDLRRFQIQERKLTQVFVPLTSSAHHVELNLSSEANAKYALLGLAKEIGSEYSINMEA
        KQL SQDSFPHILFKGP GSGKRVLMMALLREIYGDSCWNVSHDLRRFQIQERKLTQVFVPLTSSAHHVELNLSSE+NAKYALLGLAKEIGSEYSIN+EA
Subjt:  KQLASQDSFPHILFKGPSGSGKRVLMMALLREIYGDSCWNVSHDLRRFQIQERKLTQVFVPLTSSAHHVELNLSSEANAKYALLGLAKEIGSEYSINMEA

Query:  RNVNPKANFKVVVLLDVDKAAEDIQHLLRWIMDGYKDACKVVLCCEDDSGILESVISRCKVIKINPPVTHEIMDVLIQIAEKEEFDLPMNFASKIATKAK
        RNVNPKA FKVVVL+DVDKAAEDIQHLLRWIMDGYKDACKVVLCCEDDSGILESVISRCKVIKINPPVTHEIMDVLI+IAEKEEFDLPMNFASKIATKAK
Subjt:  RNVNPKANFKVVVLLDVDKAAEDIQHLLRWIMDGYKDACKVVLCCEDDSGILESVISRCKVIKINPPVTHEIMDVLIQIAEKEEFDLPMNFASKIATKAK

Query:  QNLRKAIMALEACKAHNYPFSDDQPIPIGWEEAVVELAAHILEDPSNPKLHQVKEKIQKLLVDSVHPKLILQKLVEQFLKRIEMRSRRELYYWHAYYNKR
        QNLR+AIMALEACKAHNYPFSDDQPIPIGWE+AVVELAAHILEDPSNP+LHQVKEKIQKLLVDSVHPKLILQKLVEQFLKRIEMRSRRELYYWHAYYNKR
Subjt:  QNLRKAIMALEACKAHNYPFSDDQPIPIGWEEAVVELAAHILEDPSNPKLHQVKEKIQKLLVDSVHPKLILQKLVEQFLKRIEMRSRRELYYWHAYYNKR

Query:  LPIETGTGALPKLE
        LPIETG GALPKLE
Subjt:  LPIETGTGALPKLE

A0A6J1HGX7 uncharacterized protein LOC1114634510.0e+0088.66Show/hide
Query:  MCPALNLMKQRKDGYEPSDTETEWQESPWNDPKEKKLVLDYNNRRTDSAVSKKFSTVANVSPPGSRRNGGKTPHRPAKDDTVLVMLQRNISPLSRAERRR
        MCPALNLMKQRKDGYEPSDTETEWQESPWNDPK KKLVLDYNNRR DSA SKKFST AN+SPPGSRRN GKTPHRPAKDD+VLVM QRNISPLSRAERRR
Subjt:  MCPALNLMKQRKDGYEPSDTETEWQESPWNDPKEKKLVLDYNNRRTDSAVSKKFSTVANVSPPGSRRNGGKTPHRPAKDDTVLVMLQRNISPLSRAERRR

Query:  HESPFKAAGEEIGSSSMRSRKEEKLTSSHGSNRISQKPGYSRRSVTAPRLRMRDEHMNAVNDLSQRRERAAPTLKVGSILQQPKEVSQVNSLSIGEMNEM
        H+SPFKA  EEIGSSSMRSRKEEKLT SHGSNRISQKP ++RRSVTAPRLR RDEHM+AVNDLSQRR+RAAP+L+V SIL Q KEVSQVNSLS+GEMNEM
Subjt:  HESPFKAAGEEIGSSSMRSRKEEKLTSSHGSNRISQKPGYSRRSVTAPRLRMRDEHMNAVNDLSQRRERAAPTLKVGSILQQPKEVSQVNSLSIGEMNEM

Query:  LADGRANRGLVFNDPVVESTGSISPGDIFFSRDGLPVGMNNNVTAKRNAFKNYISPKPTFVSKKNVDTYNQVEVNANGRGVSSAGAGLSTTTTSSAAVSR
        +ADGR +RG +FN+P+VESTGSISPGDIFFSRDG+ +GMNNN T KRNAFKNYISP+P FVSKKN DTYNQVEVNANGRGV+SAG GLSTTTT+SAAVSR
Subjt:  LADGRANRGLVFNDPVVESTGSISPGDIFFSRDGLPVGMNNNVTAKRNAFKNYISPKPTFVSKKNVDTYNQVEVNANGRGVSSAGAGLSTTTTSSAAVSR

Query:  ENSSRISIENSKISDVSGRTSESTRRFVANRRKKKNDIWFSCMRNGTCRTTKSPEKRPFDEALYIEKANVVEYLKPFWADQHRPVSLNGFTFHKHEAQLL
        ENSSRIS E+SKISDVSGRTSESTRRF+A+RRKKKN++WFSCMRNG CRTTKSPEKR FDEA +IE+ANVVEYLKPFWAD+HRPVSL+GF FHK EAQ L
Subjt:  ENSSRISIENSKISDVSGRTSESTRRFVANRRKKKNDIWFSCMRNGTCRTTKSPEKRPFDEALYIEKANVVEYLKPFWADQHRPVSLNGFTFHKHEAQLL

Query:  KQLASQDSFPHILFKGPSGSGKRVLMMALLREIYGDSCWNVSHDLRRFQIQERKLTQVFVPLTSSAHHVELNLSSEANAKYALLGLAKEIGSEYSINMEA
        KQL  QDSFPHILFKGP GSGKRVL+MALLREIYGDSCWNVSHDLRRFQIQERKLTQVFVPLTSSAHHVELNLSSEANAKYALLGLAKEIGSEYSIN+EA
Subjt:  KQLASQDSFPHILFKGPSGSGKRVLMMALLREIYGDSCWNVSHDLRRFQIQERKLTQVFVPLTSSAHHVELNLSSEANAKYALLGLAKEIGSEYSINMEA

Query:  RNVNPKANFKVVVLLDVDKAAEDIQHLLRWIMDGYKDACKVVLCCEDDSGILESVISRCKVIKINPPVTHEIMDVLIQIAEKEEFDLPMNFASKIATKAK
        RN+NPKA+FKVV+LLDVDKA EDIQHLLRWIMDGYKDACKVVLCC++D+GIL+SVISRCKVIKINPPVTHEI+DVLIQIA+KEEFD+PMNFASKIATKAK
Subjt:  RNVNPKANFKVVVLLDVDKAAEDIQHLLRWIMDGYKDACKVVLCCEDDSGILESVISRCKVIKINPPVTHEIMDVLIQIAEKEEFDLPMNFASKIATKAK

Query:  QNLRKAIMALEACKAHNYPFSDDQPIPIGWEEAVVELAAHILEDPSNPKLHQVKEKIQKLLVDSVHPKLILQKLVEQFLKRIEMRSRRELYYWHAYYNKR
        QNLRKAIMALEACKAHNYPFSDDQPIPIGWEEAVVELAAHILEDP+NP+LH VKEKIQKLLV+SVHPKLILQKLVE+FLKRIE+RSRRELYYWHAYYNKR
Subjt:  QNLRKAIMALEACKAHNYPFSDDQPIPIGWEEAVVELAAHILEDPSNPKLHQVKEKIQKLLVDSVHPKLILQKLVEQFLKRIEMRSRRELYYWHAYYNKR

Query:  LPIETGTGALPKLE
        LP ETGTGALPKLE
Subjt:  LPIETGTGALPKLE

SwissProt top hitse value%identityAlignment
A2X8Q3 Soluble inorganic pyrophosphatase1.4e-9680.75Show/hide
Query:  MENSAGGGNSSNIGFPRIRLNERILSSMSRRSVAAHPWHDLEIGPGAPSVFNCVVEIGKGSKVKYELDKASGLIKVDRVLYSSVVYPHNYGFIPRTICED
        M   A G       +P   LNERILSSMS++ VAAHPWHDLEIGPGAP+VFNCVVEI +GSKVKYELDKA+GLIKVDRVLYSSVVYPHNYGFIPRT+CED
Subjt:  MENSAGGGNSSNIGFPRIRLNERILSSMSRRSVAAHPWHDLEIGPGAPSVFNCVVEIGKGSKVKYELDKASGLIKVDRVLYSSVVYPHNYGFIPRTICED

Query:  SDPMDVLVLMQEPVLPGSFLRARAIGLMPMIDQGERDDKIIAVCADDPEFRHYTDIKEIPPHRLAEIRRFFEDYKKNENKKVDVEDFLPAEAAMDAIKYS
         DPMDVLVLMQE V+PG FLRARAIGLMPMIDQGE+DDKIIAVCADDPE+RH+ DIKEIPPHRL EIRRFFEDYKKNENK+V V +FLPAE A++AIKYS
Subjt:  SDPMDVLVLMQEPVLPGSFLRARAIGLMPMIDQGERDDKIIAVCADDPEFRHYTDIKEIPPHRLAEIRRFFEDYKKNENKKVDVEDFLPAEAAMDAIKYS

Query:  MDLYAAYIVESLR
        MDLY AYI+ESLR
Subjt:  MDLYAAYIVESLR

P21216 Soluble inorganic pyrophosphatase 22.7e-9585.35Show/hide
Query:  PRIRLNERILSSMSRRSVAAHPWHDLEIGPGAPSVFNCVVEIGKGSKVKYELDKASGLIKVDRVLYSSVVYPHNYGFIPRTICEDSDPMDVLVLMQEPVL
        P + LNER  ++ + RS AAHPWHDLEIGP AP+VFNCVVEI KG KVKYELDK SGLIKVDRVLYSS+VYPHNYGFIPRTICEDSDPMDVLVLMQEPVL
Subjt:  PRIRLNERILSSMSRRSVAAHPWHDLEIGPGAPSVFNCVVEIGKGSKVKYELDKASGLIKVDRVLYSSVVYPHNYGFIPRTICEDSDPMDVLVLMQEPVL

Query:  PGSFLRARAIGLMPMIDQGERDDKIIAVCADDPEFRHYTDIKEIPPHRLAEIRRFFEDYKKNENKKVDVEDFLPAEAAMDAIKYSMDLYAAYIVESLR
         GSFLRARAIGLMPMIDQGE+DDKIIAVCADDPEFRHY DIKE+PPHRLAEIRRFFEDYKKNENKKVDVE FLPA+AA+DAIK SMDLYAAYI   L+
Subjt:  PGSFLRARAIGLMPMIDQGERDDKIIAVCADDPEFRHYTDIKEIPPHRLAEIRRFFEDYKKNENKKVDVEDFLPAEAAMDAIKYSMDLYAAYIVESLR

Q0DYB1 Soluble inorganic pyrophosphatase1.4e-9680.75Show/hide
Query:  MENSAGGGNSSNIGFPRIRLNERILSSMSRRSVAAHPWHDLEIGPGAPSVFNCVVEIGKGSKVKYELDKASGLIKVDRVLYSSVVYPHNYGFIPRTICED
        M   A G       +P   LNERILSSMS++ VAAHPWHDLEIGPGAP+VFNCVVEI +GSKVKYELDKA+GLIKVDRVLYSSVVYPHNYGFIPRT+CED
Subjt:  MENSAGGGNSSNIGFPRIRLNERILSSMSRRSVAAHPWHDLEIGPGAPSVFNCVVEIGKGSKVKYELDKASGLIKVDRVLYSSVVYPHNYGFIPRTICED

Query:  SDPMDVLVLMQEPVLPGSFLRARAIGLMPMIDQGERDDKIIAVCADDPEFRHYTDIKEIPPHRLAEIRRFFEDYKKNENKKVDVEDFLPAEAAMDAIKYS
         DPMDVLVLMQE V+PG FLRARAIGLMPMIDQGE+DDKIIAVCADDPE+RH+ DIKEIPPHRL EIRRFFEDYKKNENK+V V +FLPAE A++AIKYS
Subjt:  SDPMDVLVLMQEPVLPGSFLRARAIGLMPMIDQGERDDKIIAVCADDPEFRHYTDIKEIPPHRLAEIRRFFEDYKKNENKKVDVEDFLPAEAAMDAIKYS

Query:  MDLYAAYIVESLR
        MDLY AYI+ESLR
Subjt:  MDLYAAYIVESLR

Q93V56 Soluble inorganic pyrophosphatase 19.3e-9682.83Show/hide
Query:  PRIRLNERILSSMSRRSVAAHPWHDLEIGPGAPSVFNCVVEIGKGSKVKYELDKASGLIKVDRVLYSSVVYPHNYGFIPRTICEDSDPMDVLVLMQEPVL
        P  RLNERILSS+SRRSVAAHPWHDLEIGPGAP +FN VVEI KGSKVKYELDK +GLIKVDR+LYSSVVYPHNYGF+PRT+CED+DP+DVLV+MQEPVL
Subjt:  PRIRLNERILSSMSRRSVAAHPWHDLEIGPGAPSVFNCVVEIGKGSKVKYELDKASGLIKVDRVLYSSVVYPHNYGFIPRTICEDSDPMDVLVLMQEPVL

Query:  PGSFLRARAIGLMPMIDQGERDDKIIAVCADDPEFRHYTDIKEIPPHRLAEIRRFFEDYKKNENKKVDVEDFLPAEAAMDAIKYSMDLYAAYIVESLR
        PG FLRARAIGLMPMIDQGE+DDKIIAVC DDPE++HYTDIKE+PPHRL+EIRRFFEDYKKNENK+V V DFLP+E+A++AI+YSMDLYA YI+ +LR
Subjt:  PGSFLRARAIGLMPMIDQGERDDKIIAVCADDPEFRHYTDIKEIPPHRLAEIRRFFEDYKKNENKKVDVEDFLPAEAAMDAIKYSMDLYAAYIVESLR

Q9LFF9 Soluble inorganic pyrophosphatase 41.6e-9584.54Show/hide
Query:  LNERILSSMSRRSVAAHPWHDLEIGPGAPSVFNCVVEIGKGSKVKYELDKASGLIKVDRVLYSSVVYPHNYGFIPRTICEDSDPMDVLVLMQEPVLPGSF
        LNERILSSMS RSVAAHPWHDLEIGP AP +FNCVVEIGKGSKVKYELDK +GLIKVDR+LYSSVVYPHNYGFIPRT+CEDSDP+DVLV+MQEPV+PG F
Subjt:  LNERILSSMSRRSVAAHPWHDLEIGPGAPSVFNCVVEIGKGSKVKYELDKASGLIKVDRVLYSSVVYPHNYGFIPRTICEDSDPMDVLVLMQEPVLPGSF

Query:  LRARAIGLMPMIDQGERDDKIIAVCADDPEFRHYTDIKEIPPHRLAEIRRFFEDYKKNENKKVDVEDFLPAEAAMDAIKYSMDLYAAYIVESLR
        LRA+AIGLMPMIDQGE+DDKIIAVCADDPE+RHY DI E+PPHR+AEIRRFFEDYKKNENK+V V DFLPA AA DA+++SMDLYA Y+VE+LR
Subjt:  LRARAIGLMPMIDQGERDDKIIAVCADDPEFRHYTDIKEIPPHRLAEIRRFFEDYKKNENKKVDVEDFLPAEAAMDAIKYSMDLYAAYIVESLR

Arabidopsis top hitse value%identityAlignment
AT1G01050.1 pyrophosphorylase 16.6e-9782.83Show/hide
Query:  PRIRLNERILSSMSRRSVAAHPWHDLEIGPGAPSVFNCVVEIGKGSKVKYELDKASGLIKVDRVLYSSVVYPHNYGFIPRTICEDSDPMDVLVLMQEPVL
        P  RLNERILSS+SRRSVAAHPWHDLEIGPGAP +FN VVEI KGSKVKYELDK +GLIKVDR+LYSSVVYPHNYGF+PRT+CED+DP+DVLV+MQEPVL
Subjt:  PRIRLNERILSSMSRRSVAAHPWHDLEIGPGAPSVFNCVVEIGKGSKVKYELDKASGLIKVDRVLYSSVVYPHNYGFIPRTICEDSDPMDVLVLMQEPVL

Query:  PGSFLRARAIGLMPMIDQGERDDKIIAVCADDPEFRHYTDIKEIPPHRLAEIRRFFEDYKKNENKKVDVEDFLPAEAAMDAIKYSMDLYAAYIVESLR
        PG FLRARAIGLMPMIDQGE+DDKIIAVC DDPE++HYTDIKE+PPHRL+EIRRFFEDYKKNENK+V V DFLP+E+A++AI+YSMDLYA YI+ +LR
Subjt:  PGSFLRARAIGLMPMIDQGERDDKIIAVCADDPEFRHYTDIKEIPPHRLAEIRRFFEDYKKNENKKVDVEDFLPAEAAMDAIKYSMDLYAAYIVESLR

AT2G18230.1 pyrophosphorylase 21.9e-9685.35Show/hide
Query:  PRIRLNERILSSMSRRSVAAHPWHDLEIGPGAPSVFNCVVEIGKGSKVKYELDKASGLIKVDRVLYSSVVYPHNYGFIPRTICEDSDPMDVLVLMQEPVL
        P + LNER  ++ + RS AAHPWHDLEIGP AP+VFNCVVEI KG KVKYELDK SGLIKVDRVLYSS+VYPHNYGFIPRTICEDSDPMDVLVLMQEPVL
Subjt:  PRIRLNERILSSMSRRSVAAHPWHDLEIGPGAPSVFNCVVEIGKGSKVKYELDKASGLIKVDRVLYSSVVYPHNYGFIPRTICEDSDPMDVLVLMQEPVL

Query:  PGSFLRARAIGLMPMIDQGERDDKIIAVCADDPEFRHYTDIKEIPPHRLAEIRRFFEDYKKNENKKVDVEDFLPAEAAMDAIKYSMDLYAAYIVESLR
         GSFLRARAIGLMPMIDQGE+DDKIIAVCADDPEFRHY DIKE+PPHRLAEIRRFFEDYKKNENKKVDVE FLPA+AA+DAIK SMDLYAAYI   L+
Subjt:  PGSFLRARAIGLMPMIDQGERDDKIIAVCADDPEFRHYTDIKEIPPHRLAEIRRFFEDYKKNENKKVDVEDFLPAEAAMDAIKYSMDLYAAYIVESLR

AT2G46860.1 pyrophosphorylase 35.2e-9480.88Show/hide
Query:  SSNIGFPRIRLNERILSSMSRRSVAAHPWHDLEIGPGAPSVFNCVVEIGKGSKVKYELDKASGLIKVDRVLYSSVVYPHNYGFIPRTICEDSDPMDVLVL
        SS    P  +LNERILS++SRRSVAAHPWHDLEIGP AP VFN VVEI KGSKVKYELDK +GLIKVDR+LYSSVVYPHNYGFIPRT+CED+DP+DVLVL
Subjt:  SSNIGFPRIRLNERILSSMSRRSVAAHPWHDLEIGPGAPSVFNCVVEIGKGSKVKYELDKASGLIKVDRVLYSSVVYPHNYGFIPRTICEDSDPMDVLVL

Query:  MQEPVLPGSFLRARAIGLMPMIDQGERDDKIIAVCADDPEFRHYTDIKEIPPHRLAEIRRFFEDYKKNENKKVDVEDFLPAEAAMDAIKYSMDLYAAYIV
        MQEPVLPG FLRARAIGLMPMIDQGE+DDKIIAVCADDPE++H+TDIK++ PHRL EIRRFFEDYKKNENKKV V DFLP+E+A +AI+YSMDLYA YI+
Subjt:  MQEPVLPGSFLRARAIGLMPMIDQGERDDKIIAVCADDPEFRHYTDIKEIPPHRLAEIRRFFEDYKKNENKKVDVEDFLPAEAAMDAIKYSMDLYAAYIV

Query:  ESLR
         +LR
Subjt:  ESLR

AT3G53620.1 pyrophosphorylase 41.1e-9684.54Show/hide
Query:  LNERILSSMSRRSVAAHPWHDLEIGPGAPSVFNCVVEIGKGSKVKYELDKASGLIKVDRVLYSSVVYPHNYGFIPRTICEDSDPMDVLVLMQEPVLPGSF
        LNERILSSMS RSVAAHPWHDLEIGP AP +FNCVVEIGKGSKVKYELDK +GLIKVDR+LYSSVVYPHNYGFIPRT+CEDSDP+DVLV+MQEPV+PG F
Subjt:  LNERILSSMSRRSVAAHPWHDLEIGPGAPSVFNCVVEIGKGSKVKYELDKASGLIKVDRVLYSSVVYPHNYGFIPRTICEDSDPMDVLVLMQEPVLPGSF

Query:  LRARAIGLMPMIDQGERDDKIIAVCADDPEFRHYTDIKEIPPHRLAEIRRFFEDYKKNENKKVDVEDFLPAEAAMDAIKYSMDLYAAYIVESLR
        LRA+AIGLMPMIDQGE+DDKIIAVCADDPE+RHY DI E+PPHR+AEIRRFFEDYKKNENK+V V DFLPA AA DA+++SMDLYA Y+VE+LR
Subjt:  LRARAIGLMPMIDQGERDDKIIAVCADDPEFRHYTDIKEIPPHRLAEIRRFFEDYKKNENKKVDVEDFLPAEAAMDAIKYSMDLYAAYIVESLR

AT4G01480.1 pyrophosphorylase 53.7e-9279.29Show/hide
Query:  PRIRLNERILSSMSRRSVAAHPWHDLEIGPGAPSVFNCVVEIGKGSKVKYELDKASGLIKVDRVLYSSVVYPHNYGFIPRTICEDSDPMDVLVLMQEPVL
        P  RLNERILSS+S+RSVAAHPWHDLEIGPGAP +FN V+EI KGSKVKYELDK +GLIKVDR+LYSSVVYPHNYGF+PRT+CED+DP+DVLV+MQEPVL
Subjt:  PRIRLNERILSSMSRRSVAAHPWHDLEIGPGAPSVFNCVVEIGKGSKVKYELDKASGLIKVDRVLYSSVVYPHNYGFIPRTICEDSDPMDVLVLMQEPVL

Query:  PGSFLRARAIGLMPMIDQGERDDKIIAVCADDPEFRHYTDIKEIPPHRLAEIRRFFEDYKKNENKKVDVEDFLPAEAAMDAIKYSMDLYAAYIVESLR
        PG FLRARAIGLMPMIDQGE+DDKIIAVC DDPE++H T+I E+PPHRL+EIRRFFEDYKKNENK+V V DFL    A++AI+YSMDLYA YI+ +LR
Subjt:  PGSFLRARAIGLMPMIDQGERDDKIIAVCADDPEFRHYTDIKEIPPHRLAEIRRFFEDYKKNENKKVDVEDFLPAEAAMDAIKYSMDLYAAYIVESLR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTGTCCAGCTTTGAACTTAATGAAGCAGAGGAAGGATGGTTATGAACCATCTGATACTGAGACTGAATGGCAAGAAAGCCCTTGGAATGACCCTAAGGAGAAGAAACT
TGTACTGGATTACAATAATCGAAGAACTGATTCGGCAGTGTCCAAGAAGTTCAGTACAGTGGCAAATGTTTCTCCTCCTGGTTCGAGAAGAAACGGTGGGAAAACGCCTC
ACAGGCCAGCCAAAGACGACACTGTTCTTGTTATGCTTCAGAGAAACATCAGCCCCTTGTCAAGAGCAGAAAGAAGAAGACATGAGTCTCCTTTTAAAGCTGCAGGGGAG
GAAATTGGAAGCTCTAGCATGAGGTCAAGAAAAGAGGAGAAGTTGACTTCTTCTCATGGGAGTAATAGAATAAGTCAAAAACCGGGCTACAGTAGGAGATCAGTGACTGC
TCCAAGGTTGAGGATGAGAGATGAACATATGAATGCTGTTAATGATTTATCTCAGAGGAGAGAGAGAGCAGCTCCAACCTTGAAGGTCGGCTCCATCCTGCAACAGCCAA
AGGAGGTTTCCCAGGTGAATTCTCTATCTATTGGTGAAATGAATGAGATGCTTGCAGATGGAAGGGCTAATAGAGGTTTGGTTTTCAATGACCCGGTGGTTGAAAGCACG
GGGTCGATCTCGCCGGGGGATATATTCTTTTCGCGTGATGGCTTGCCTGTTGGGATGAATAACAATGTCACAGCAAAGAGAAATGCGTTCAAAAACTATATAAGTCCAAA
GCCCACATTTGTGTCTAAAAAGAATGTTGATACTTATAATCAAGTGGAAGTAAATGCTAATGGTAGAGGGGTTTCTTCTGCTGGAGCAGGTTTGTCAACGACCACAACTA
GTAGTGCTGCTGTAAGTAGAGAAAATAGTAGTAGAATTAGCATTGAAAATAGTAAGATCAGTGATGTGAGTGGTAGAACAAGTGAAAGTACTAGAAGGTTTGTTGCCAAT
AGACGAAAGAAGAAGAACGATATATGGTTTTCTTGTATGAGGAATGGGACTTGCAGGACAACGAAATCGCCTGAAAAGCGACCATTTGATGAAGCTTTATATATTGAAAA
GGCAAATGTTGTTGAATACTTGAAACCCTTCTGGGCGGATCAGCATCGGCCGGTTTCCTTAAATGGGTTCACTTTCCATAAGCATGAGGCCCAACTTCTCAAGCAATTAG
CTTCACAGGACAGTTTTCCCCACATTCTGTTCAAAGGTCCAAGCGGATCTGGTAAAAGAGTGCTGATGATGGCTCTTCTGCGTGAGATATATGGTGATTCATGTTGGAAT
GTTTCTCATGATTTGCGACGTTTCCAGATTCAGGAAAGAAAACTGACGCAAGTCTTCGTTCCATTGACATCAAGTGCTCACCATGTGGAACTAAATCTAAGCTCCGAAGC
AAATGCTAAGTATGCTTTGCTGGGATTGGCTAAAGAAATAGGCAGTGAATATTCCATTAATATGGAAGCAAGAAATGTCAATCCGAAGGCAAATTTCAAAGTGGTAGTCC
TTTTAGATGTAGACAAAGCCGCAGAAGATATTCAGCACTTGCTTAGGTGGATTATGGATGGCTATAAGGATGCCTGCAAAGTAGTACTCTGTTGTGAAGACGACTCAGGC
ATCCTTGAATCGGTGATAAGCCGCTGCAAAGTTATTAAAATTAACCCTCCTGTAACTCATGAAATCATGGATGTACTTATCCAAATAGCAGAGAAGGAGGAATTTGACTT
ACCCATGAACTTTGCTTCTAAGATTGCTACTAAAGCAAAGCAAAACCTGAGAAAAGCAATCATGGCCCTTGAAGCATGCAAGGCACACAATTATCCATTTTCTGATGACC
AGCCAATCCCTATTGGATGGGAAGAGGCCGTGGTAGAACTCGCAGCCCATATCCTCGAAGACCCATCCAATCCAAAATTACACCAAGTAAAAGAAAAAATTCAGAAGCTT
CTAGTTGATTCAGTTCATCCTAAACTAATTCTCCAGAAGCTTGTGGAACAATTTCTGAAAAGAATTGAGATGAGATCAAGGAGGGAACTTTATTATTGGCATGCTTATTA
TAACAAGAGACTCCCAATTGAAACTGGAACAGGTGCTTTACCCAAATTAGAAGCTATTTGTGATTGTTCTTTAAGCTGTAGCGATTCAGTGAATGACCTTTGGTTCAAAC
TGGAACCGACATTCTCTCTGTTTTTCTTAGTGGGTTTGGTGAATATGGAAAACAGTGCTGGAGGAGGAAATTCTTCGAATATAGGATTCCCTAGGATTAGGCTCAATGAA
AGAATTCTTTCTTCCATGTCTCGAAGATCTGTCGCTGCTCACCCTTGGCACGATTTGGAGATTGGACCCGGTGCACCTTCGGTTTTCAATTGTGTTGTCGAAATTGGCAA
AGGCAGCAAGGTTAAGTATGAGCTTGACAAGGCCAGTGGCCTTATAAAAGTCGACCGCGTACTTTACTCATCGGTTGTTTACCCACACAATTATGGTTTCATCCCACGGA
CAATTTGTGAAGATAGTGATCCTATGGATGTTCTGGTACTGATGCAGGAGCCTGTGCTACCTGGATCTTTCCTCCGAGCTCGTGCTATTGGATTGATGCCTATGATTGAT
CAAGGAGAAAGAGATGACAAAATCATAGCAGTATGTGCTGATGACCCTGAATTCCGCCACTATACGGACATCAAGGAGATTCCCCCACACCGGCTGGCCGAAATTCGCCG
ATTCTTCGAAGACTACAAGAAGAACGAGAACAAAAAAGTCGATGTGGAAGACTTCCTGCCAGCGGAGGCTGCCATGGATGCCATTAAGTACTCCATGGACTTGTATGCTG
CCTACATAGTTGAGAGCTTGAGGCATTCTGAAAAATGTATAAGTGGATGA
mRNA sequenceShow/hide mRNA sequence
CTTAAAGAATATTCCTCCGCGAAACATTAAGTTTAGTGTGATTCAATAATTTTTGCACTAAAAAACATATCAATAATTAGTTAATTGAATCAAGGCAAATGTATGAGAAA
AAAAGGAGTGGAGAAAAAATAGCAATAAAAATCAGCCGCCGGAGCGAACATCGAATGTGTCTGAGGTAGAAGAAGGTCTTAAAAGTATTCTCCGTTCATCAGCAATGGCC
GCCATTATTTCTGTTCATAATGCTTTGATCGAGCTCCAACTCTGCCATACTTCACATTCTGCAGCCGCAGCCACCATGGCCAAGGAGAATAAAGACAATATCTGGAGAGA
TCGGCAAAGACAAAGCCTAATTGCTACCTTTCTAAAAAACCAGCTTAGAAATTTCAGGAAAACAACACCAGATTCGGTATCTCACAGGTTTCTTATATCTCCCAATAGTT
CTGAAACTCTTCTCCTCCTCAATTGGTTGTTGCGATGAAGACCGTCGCTCTGAATTAGGGTTTCGGAATCGGTCATCGCAACTCGAAAGAGCCGAATGTGTCCAGCTTTG
AACTTAATGAAGCAGAGGAAGGATGGTTATGAACCATCTGATACTGAGACTGAATGGCAAGAAAGCCCTTGGAATGACCCTAAGGAGAAGAAACTTGTACTGGATTACAA
TAATCGAAGAACTGATTCGGCAGTGTCCAAGAAGTTCAGTACAGTGGCAAATGTTTCTCCTCCTGGTTCGAGAAGAAACGGTGGGAAAACGCCTCACAGGCCAGCCAAAG
ACGACACTGTTCTTGTTATGCTTCAGAGAAACATCAGCCCCTTGTCAAGAGCAGAAAGAAGAAGACATGAGTCTCCTTTTAAAGCTGCAGGGGAGGAAATTGGAAGCTCT
AGCATGAGGTCAAGAAAAGAGGAGAAGTTGACTTCTTCTCATGGGAGTAATAGAATAAGTCAAAAACCGGGCTACAGTAGGAGATCAGTGACTGCTCCAAGGTTGAGGAT
GAGAGATGAACATATGAATGCTGTTAATGATTTATCTCAGAGGAGAGAGAGAGCAGCTCCAACCTTGAAGGTCGGCTCCATCCTGCAACAGCCAAAGGAGGTTTCCCAGG
TGAATTCTCTATCTATTGGTGAAATGAATGAGATGCTTGCAGATGGAAGGGCTAATAGAGGTTTGGTTTTCAATGACCCGGTGGTTGAAAGCACGGGGTCGATCTCGCCG
GGGGATATATTCTTTTCGCGTGATGGCTTGCCTGTTGGGATGAATAACAATGTCACAGCAAAGAGAAATGCGTTCAAAAACTATATAAGTCCAAAGCCCACATTTGTGTC
TAAAAAGAATGTTGATACTTATAATCAAGTGGAAGTAAATGCTAATGGTAGAGGGGTTTCTTCTGCTGGAGCAGGTTTGTCAACGACCACAACTAGTAGTGCTGCTGTAA
GTAGAGAAAATAGTAGTAGAATTAGCATTGAAAATAGTAAGATCAGTGATGTGAGTGGTAGAACAAGTGAAAGTACTAGAAGGTTTGTTGCCAATAGACGAAAGAAGAAG
AACGATATATGGTTTTCTTGTATGAGGAATGGGACTTGCAGGACAACGAAATCGCCTGAAAAGCGACCATTTGATGAAGCTTTATATATTGAAAAGGCAAATGTTGTTGA
ATACTTGAAACCCTTCTGGGCGGATCAGCATCGGCCGGTTTCCTTAAATGGGTTCACTTTCCATAAGCATGAGGCCCAACTTCTCAAGCAATTAGCTTCACAGGACAGTT
TTCCCCACATTCTGTTCAAAGGTCCAAGCGGATCTGGTAAAAGAGTGCTGATGATGGCTCTTCTGCGTGAGATATATGGTGATTCATGTTGGAATGTTTCTCATGATTTG
CGACGTTTCCAGATTCAGGAAAGAAAACTGACGCAAGTCTTCGTTCCATTGACATCAAGTGCTCACCATGTGGAACTAAATCTAAGCTCCGAAGCAAATGCTAAGTATGC
TTTGCTGGGATTGGCTAAAGAAATAGGCAGTGAATATTCCATTAATATGGAAGCAAGAAATGTCAATCCGAAGGCAAATTTCAAAGTGGTAGTCCTTTTAGATGTAGACA
AAGCCGCAGAAGATATTCAGCACTTGCTTAGGTGGATTATGGATGGCTATAAGGATGCCTGCAAAGTAGTACTCTGTTGTGAAGACGACTCAGGCATCCTTGAATCGGTG
ATAAGCCGCTGCAAAGTTATTAAAATTAACCCTCCTGTAACTCATGAAATCATGGATGTACTTATCCAAATAGCAGAGAAGGAGGAATTTGACTTACCCATGAACTTTGC
TTCTAAGATTGCTACTAAAGCAAAGCAAAACCTGAGAAAAGCAATCATGGCCCTTGAAGCATGCAAGGCACACAATTATCCATTTTCTGATGACCAGCCAATCCCTATTG
GATGGGAAGAGGCCGTGGTAGAACTCGCAGCCCATATCCTCGAAGACCCATCCAATCCAAAATTACACCAAGTAAAAGAAAAAATTCAGAAGCTTCTAGTTGATTCAGTT
CATCCTAAACTAATTCTCCAGAAGCTTGTGGAACAATTTCTGAAAAGAATTGAGATGAGATCAAGGAGGGAACTTTATTATTGGCATGCTTATTATAACAAGAGACTCCC
AATTGAAACTGGAACAGGTGCTTTACCCAAATTAGAAGCTATTTGTGATTGTTCTTTAAGCTGTAGCGATTCAGTGAATGACCTTTGGTTCAAACTGGAACCGACATTCT
CTCTGTTTTTCTTAGTGGGTTTGGTGAATATGGAAAACAGTGCTGGAGGAGGAAATTCTTCGAATATAGGATTCCCTAGGATTAGGCTCAATGAAAGAATTCTTTCTTCC
ATGTCTCGAAGATCTGTCGCTGCTCACCCTTGGCACGATTTGGAGATTGGACCCGGTGCACCTTCGGTTTTCAATTGTGTTGTCGAAATTGGCAAAGGCAGCAAGGTTAA
GTATGAGCTTGACAAGGCCAGTGGCCTTATAAAAGTCGACCGCGTACTTTACTCATCGGTTGTTTACCCACACAATTATGGTTTCATCCCACGGACAATTTGTGAAGATA
GTGATCCTATGGATGTTCTGGTACTGATGCAGGAGCCTGTGCTACCTGGATCTTTCCTCCGAGCTCGTGCTATTGGATTGATGCCTATGATTGATCAAGGAGAAAGAGAT
GACAAAATCATAGCAGTATGTGCTGATGACCCTGAATTCCGCCACTATACGGACATCAAGGAGATTCCCCCACACCGGCTGGCCGAAATTCGCCGATTCTTCGAAGACTA
CAAGAAGAACGAGAACAAAAAAGTCGATGTGGAAGACTTCCTGCCAGCGGAGGCTGCCATGGATGCCATTAAGTACTCCATGGACTTGTATGCTGCCTACATAGTTGAGA
GCTTGAGGCATTCTGAAAAATGTATAAGTGGATGA
Protein sequenceShow/hide protein sequence
MCPALNLMKQRKDGYEPSDTETEWQESPWNDPKEKKLVLDYNNRRTDSAVSKKFSTVANVSPPGSRRNGGKTPHRPAKDDTVLVMLQRNISPLSRAERRRHESPFKAAGE
EIGSSSMRSRKEEKLTSSHGSNRISQKPGYSRRSVTAPRLRMRDEHMNAVNDLSQRRERAAPTLKVGSILQQPKEVSQVNSLSIGEMNEMLADGRANRGLVFNDPVVEST
GSISPGDIFFSRDGLPVGMNNNVTAKRNAFKNYISPKPTFVSKKNVDTYNQVEVNANGRGVSSAGAGLSTTTTSSAAVSRENSSRISIENSKISDVSGRTSESTRRFVAN
RRKKKNDIWFSCMRNGTCRTTKSPEKRPFDEALYIEKANVVEYLKPFWADQHRPVSLNGFTFHKHEAQLLKQLASQDSFPHILFKGPSGSGKRVLMMALLREIYGDSCWN
VSHDLRRFQIQERKLTQVFVPLTSSAHHVELNLSSEANAKYALLGLAKEIGSEYSINMEARNVNPKANFKVVVLLDVDKAAEDIQHLLRWIMDGYKDACKVVLCCEDDSG
ILESVISRCKVIKINPPVTHEIMDVLIQIAEKEEFDLPMNFASKIATKAKQNLRKAIMALEACKAHNYPFSDDQPIPIGWEEAVVELAAHILEDPSNPKLHQVKEKIQKL
LVDSVHPKLILQKLVEQFLKRIEMRSRRELYYWHAYYNKRLPIETGTGALPKLEAICDCSLSCSDSVNDLWFKLEPTFSLFFLVGLVNMENSAGGGNSSNIGFPRIRLNE
RILSSMSRRSVAAHPWHDLEIGPGAPSVFNCVVEIGKGSKVKYELDKASGLIKVDRVLYSSVVYPHNYGFIPRTICEDSDPMDVLVLMQEPVLPGSFLRARAIGLMPMID
QGERDDKIIAVCADDPEFRHYTDIKEIPPHRLAEIRRFFEDYKKNENKKVDVEDFLPAEAAMDAIKYSMDLYAAYIVESLRHSEKCISG