; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0019499 (gene) of Snake gourd v1 genome

Gene IDTan0019499
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionPeptidase S1, PA clan
Genome locationLG06:13265187..13269640
RNA-Seq ExpressionTan0019499
SyntenyTan0019499
Gene Ontology termsGO:0043231 - intracellular membrane-bounded organelle (cellular component)
InterPro domainsIPR009003 - Peptidase S1, PA clan


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0026137.1 uncharacterized protein E6C27_scaffold19G00880 [Cucumis melo var. makuwa]7.2e-27691.09Show/hide
Query:  MGVLGDSLCFCKGVGKSERTKAAIFSGKAPAMARISTPAASSATAFLIHRTLLLTTHVNLPSVSAAETCEIRLQNGVAATLVPHRFFITSSVLDLTIVGL
        MGVLGDSLCFCKGVGK+ERTKA IFSGK PAMARIS   A S TAFLIHR+LLLTTHVNLPSVSAAE+CEIRLQNGVAATLVPHRFF+TSSVLDLTIVGL
Subjt:  MGVLGDSLCFCKGVGKSERTKAAIFSGKAPAMARISTPAASSATAFLIHRTLLLTTHVNLPSVSAAETCEIRLQNGVAATLVPHRFFITSSVLDLTIVGL

Query:  DAVDGDSNSQQLQHLKICSKPNLDLGNAVYLLGYSEKDELIISEGKVVIATDNLIKLSTDGITGSPGSAGFDAQGNLAFMVCDPMKLATSPNTKSSSTSS
        DAV+ DSNSQQLQ LKICSKPNL+LGN VYLLGYSEKDELIISEGKVVIATDNLIKLSTDG+T SPGSAGFDAQGNLAFMVCDPMKLATSPNTKSSSTSS
Subjt:  DAVDGDSNSQQLQHLKICSKPNLDLGNAVYLLGYSEKDELIISEGKVVIATDNLIKLSTDGITGSPGSAGFDAQGNLAFMVCDPMKLATSPNTKSSSTSS

Query:  SSSSSWKKDLPMQFGIPLPIICDWLNQHWEGSLDELNKPKPQLIRLMSSGQKSEHSSSFTLRQVFKPMEANEEETPSPSNIVSKTRDLPGPSYSTTTNTI
        S+SSSWKKDLPMQFGIPLPIIC WLNQHWEGSLDELNKPK QLIRLMSSGQKSEHSSSFTLRQVFKPME N+EETPSPSN+VSKTRDLPGPSYSTTTNTI
Subjt:  SSSSSWKKDLPMQFGIPLPIICDWLNQHWEGSLDELNKPKPQLIRLMSSGQKSEHSSSFTLRQVFKPMEANEEETPSPSNIVSKTRDLPGPSYSTTTNTI

Query:  KEEAPMNNLHVNHVQGIPTPEIYESPKLIAVPLRKKETTPTQLLDINFPPRVSTAAITAHPTRQTPH-DSDENSIRDVSEQNQLRQDQSMERNLVDPIEN
        KEEAPMNNLHVNHVQGIPTPEIYESPKLI+VP+RK+ETTPTQLLDINFPPRVSTA I  HPTRQTP   SDENS +DVSE NQLRQ ++M+R +VDPIEN
Subjt:  KEEAPMNNLHVNHVQGIPTPEIYESPKLIAVPLRKKETTPTQLLDINFPPRVSTAAITAHPTRQTPH-DSDENSIRDVSEQNQLRQDQSMERNLVDPIEN

Query:  GEEVASTNSVNGALSEVQSCSSPVEVSAMQNGYSSEGETTMYSAETAESRNYTSPREGKFQQVGRSQSCVNYNRWGSVQRNPMAHQTMLENQRSFRNGRK
        GEEVASTNS N ALSEVQSCSSPVEVSAMQN YSSEGETTMYSAETAESRNYTSPREG FQQVGRSQSCVNYNRWGSVQ NPMA +TMLENQRSFRNGRK
Subjt:  GEEVASTNSVNGALSEVQSCSSPVEVSAMQNGYSSEGETTMYSAETAESRNYTSPREGKFQQVGRSQSCVNYNRWGSVQRNPMAHQTMLENQRSFRNGRK

Query:  MYSQGAGSYRSNDYYSPTVSSIMKKRNSSEQVNRPRQSTAAAHSSPRWMF
        MYSQGAGSYRSNDYYSPTVSSIMKKRNSSEQVNRPRQSTAAAHSSPRWMF
Subjt:  MYSQGAGSYRSNDYYSPTVSSIMKKRNSSEQVNRPRQSTAAAHSSPRWMF

KAG7014524.1 hypothetical protein SDJN02_24702, partial [Cucurbita argyrosperma subsp. argyrosperma]2.1e-27589.19Show/hide
Query:  MGVLGDSLCFCKGVGKSERTKAAIFSGKAPAMARISTPAAS---SATAFLIHRTLLLTTHVNLPSVSAAETCEIRLQNGVAATLVPHRFFITSSVLDLTI
        MGVLGDSLCFCKGVGKSER KAAIFSGKAPAMARIS PAA+   S TAFLIHR+LLLTTH+NLPSVSAAE CEIRLQNGVAA+LVPHRFFITSSVLDLTI
Subjt:  MGVLGDSLCFCKGVGKSERTKAAIFSGKAPAMARISTPAAS---SATAFLIHRTLLLTTHVNLPSVSAAETCEIRLQNGVAATLVPHRFFITSSVLDLTI

Query:  VGLDAVDGDSNSQQLQHLKICSKPNLDLGNAVYLLGYSEKDELIISEGKVVIATDNLIKLSTDGITGSPGSAGFDAQGNLAFMVCDPMKLATSPNTKSSS
        VGLDAVDGDSNSQQLQHLKICSKPNLDLGN+VYLLGYSEKDELIISEGKVVIATDNLIKLSTDG+T SPGSAGFDAQGNLAFMVCDPMKLATSPNTKSSS
Subjt:  VGLDAVDGDSNSQQLQHLKICSKPNLDLGNAVYLLGYSEKDELIISEGKVVIATDNLIKLSTDGITGSPGSAGFDAQGNLAFMVCDPMKLATSPNTKSSS

Query:  TSSSSSSSWKKDLPMQFGIPLPIICDWLNQHWEGSLDELNKPKPQLIRLMSSGQKSEHSSSFTLRQVFKPMEANEEETPSPSNIVSKTRDLPGPSYSTTT
        TSSS+SSSWKKDLPMQFGIPLPI+CDWLNQHWEGSLDELNKPKPQLIRLMSSGQKSEHSSSF+LRQVFKPME N+EETPSPSNIVSKTRD+ GPSYS T+
Subjt:  TSSSSSSSWKKDLPMQFGIPLPIICDWLNQHWEGSLDELNKPKPQLIRLMSSGQKSEHSSSFTLRQVFKPMEANEEETPSPSNIVSKTRDLPGPSYSTTT

Query:  NTIKEEAPMNNLHVNHVQGIPTPEIYESPKLIAVPLRKKETTPTQLLDINFPPRVSTAAITAHPTRQTPHDSDENSIRDVSEQNQLRQDQSMERNLVDPI
        NTIKEEAP+ NLHVNH QGIPTPEIYESPKLIAVP+RKKE TPTQLLDINFPPRVSTA IT HPTR T   SDENS +DVS+QNQLRQD++M R LV+P+
Subjt:  NTIKEEAPMNNLHVNHVQGIPTPEIYESPKLIAVPLRKKETTPTQLLDINFPPRVSTAAITAHPTRQTPHDSDENSIRDVSEQNQLRQDQSMERNLVDPI

Query:  ENG---EEVASTNSVNGALSEVQSCSSPVEVSAMQNGYSSEGETTMYSAETAESRNYTSPREGKFQQVGRSQSCVNYNRWGSVQRNPMAHQTMLENQRSF
        ENG   EEVASTNSVNGALSEVQSCSSP+E SA+Q+ YSSEGETTMYSAETAESRNYTSPREGKFQQVGRSQSCVNYNRWGSVQRNPMA QTM+ENQRSF
Subjt:  ENG---EEVASTNSVNGALSEVQSCSSPVEVSAMQNGYSSEGETTMYSAETAESRNYTSPREGKFQQVGRSQSCVNYNRWGSVQRNPMAHQTMLENQRSF

Query:  RNGRKMYSQGAGSYRSNDYYSPTVSSIMKKRNSSEQVNRPRQSTAAAHSSPRWMF
        ++GR M+SQGAGSYRSNDYY PTVSSIMKKRNSSEQVNRPRQS+AA HSSPRWMF
Subjt:  RNGRKMYSQGAGSYRSNDYYSPTVSSIMKKRNSSEQVNRPRQSTAAAHSSPRWMF

XP_004149782.1 uncharacterized protein LOC101211454 [Cucumis sativus]4.6e-27590Show/hide
Query:  MGVLGDSLCFCKGVGKSERTKAAIFSGKAPAMARISTPAASSATAFLIHRTLLLTTHVNLPSVSAAETCEIRLQNGVAATLVPHRFFITSSVLDLTIVGL
        MGVLGDSLCFCKGVGKSERTKA IFS K PAMARIS     S TAFLIHR+LLLTTHVNLPSVSAAE CEIRLQNGVAATLVPHRFF+TSSVLDLTIVGL
Subjt:  MGVLGDSLCFCKGVGKSERTKAAIFSGKAPAMARISTPAASSATAFLIHRTLLLTTHVNLPSVSAAETCEIRLQNGVAATLVPHRFFITSSVLDLTIVGL

Query:  DAVDGDSNSQQLQHLKICSKPNLDLGNAVYLLGYSEKDELIISEGKVVIATDNLIKLSTDGITGSPGSAGFDAQGNLAFMVCDPMKLATSPNTKSSSTSS
        DAVDGDSNSQQLQHLKICSKPNL+LG+ VYLLGYSEKDELIISEGKVVIATDNLIKLSTDG+T SPGSAGFDAQGNLAFMVCDPMKLATSPNTKSSSTSS
Subjt:  DAVDGDSNSQQLQHLKICSKPNLDLGNAVYLLGYSEKDELIISEGKVVIATDNLIKLSTDGITGSPGSAGFDAQGNLAFMVCDPMKLATSPNTKSSSTSS

Query:  SSSSSWKKDLPMQFGIPLPIICDWLNQHWEGSLDELNKPKPQLIRLMSSGQKSEHSSSFTLRQVFKPMEANEEETPSPSNIVSKTRDLPGPSYSTTTNTI
        S+SSSWKKD+PMQFGIPLPIIC WLNQHWEGSLDELNKPK QLIRLMSSGQKS+HSSSFTLRQVFKPME N+EETPSPSN+VSKTRDLPGPSYSTTTNTI
Subjt:  SSSSSWKKDLPMQFGIPLPIICDWLNQHWEGSLDELNKPKPQLIRLMSSGQKSEHSSSFTLRQVFKPMEANEEETPSPSNIVSKTRDLPGPSYSTTTNTI

Query:  KEEAPMNNLHVNHVQGIPTPEIYESPKLIAVPLRKKETTPTQLLDINFPPRVSTAAITAHPTRQTPH-DSDENSIRDVSEQNQLRQDQSMERNLVDPIEN
        KEEAPMNNLHVNHVQGIPTPEIYESPKLI+VP+RK+ETTPTQLL+INFPPR+STA I  HPTRQTP   SDENS +DVS+ NQLRQ ++M+R + DPIEN
Subjt:  KEEAPMNNLHVNHVQGIPTPEIYESPKLIAVPLRKKETTPTQLLDINFPPRVSTAAITAHPTRQTPH-DSDENSIRDVSEQNQLRQDQSMERNLVDPIEN

Query:  GEEVASTNSVNGALSEVQSCSSPVEVSAMQNGYSSEGETTMYSAETAESRNYTSPREGKFQQVGRSQSCVNYNRWGSVQRNPMAHQTMLENQRSFRNGRK
        GEEVASTNSVNGALSEVQSCSSPVEVS MQ+ YSSEGETTMYSAETAESRNYTSPREG FQQVGRSQSCVNYNRWGSVQ NPMA +TMLENQRSFRNGRK
Subjt:  GEEVASTNSVNGALSEVQSCSSPVEVSAMQNGYSSEGETTMYSAETAESRNYTSPREGKFQQVGRSQSCVNYNRWGSVQRNPMAHQTMLENQRSFRNGRK

Query:  MYSQGAGSYRSNDYYSPTVSSIMKKRNSSEQVNRPRQSTAAAHSSPRWMF
        MYSQGA SYRSNDYYSPTVSSIMKKRNSSEQVNRPRQSTAAAHSSPRWMF
Subjt:  MYSQGAGSYRSNDYYSPTVSSIMKKRNSSEQVNRPRQSTAAAHSSPRWMF

XP_008458055.1 PREDICTED: uncharacterized protein LOC103497595 [Cucumis melo]2.7e-27590.91Show/hide
Query:  MGVLGDSLCFCKGVGKSERTKAAIFSGKAPAMARISTPAASSATAFLIHRTLLLTTHVNLPSVSAAETCEIRLQNGVAATLVPHRFFITSSVLDLTIVGL
        MGVLGDSLCFCKGVGK+ERTKA IFSGK PAMARIS   A S TAFLIHR+LLLTTHVNLPSVSAAE+CEIRLQNGVAATLVPHRFF+TSSVLDLTIVGL
Subjt:  MGVLGDSLCFCKGVGKSERTKAAIFSGKAPAMARISTPAASSATAFLIHRTLLLTTHVNLPSVSAAETCEIRLQNGVAATLVPHRFFITSSVLDLTIVGL

Query:  DAVDGDSNSQQLQHLKICSKPNLDLGNAVYLLGYSEKDELIISEGKVVIATDNLIKLSTDGITGSPGSAGFDAQGNLAFMVCDPMKLATSPNTKSSSTSS
        DAV+ DSNSQQLQ LKICSKPNL+LGN VYLLGYSEKDELIISEGKVVIATDNLIKLSTDG+T SPGSAGFDAQGNLAFMVCDPMKLATSPNTKSSSTSS
Subjt:  DAVDGDSNSQQLQHLKICSKPNLDLGNAVYLLGYSEKDELIISEGKVVIATDNLIKLSTDGITGSPGSAGFDAQGNLAFMVCDPMKLATSPNTKSSSTSS

Query:  SSSSSWKKDLPMQFGIPLPIICDWLNQHWEGSLDELNKPKPQLIRLMSSGQKSEHSSSFTLRQVFKPMEANEEETPSPSNIVSKTRDLPGPSYSTTTNTI
        S+SSSWKKDLPMQFGIPLPIIC WLNQHWEGSLDELNKPK QLIRLMSSGQKSEHSSSFTLRQVFKPME N+EETPSPSN+VSKTRDLPGPSYSTTTNTI
Subjt:  SSSSSWKKDLPMQFGIPLPIICDWLNQHWEGSLDELNKPKPQLIRLMSSGQKSEHSSSFTLRQVFKPMEANEEETPSPSNIVSKTRDLPGPSYSTTTNTI

Query:  KEEAPMNNLHVNHVQGIPTPEIYESPKLIAVPLRKKETTPTQLLDINFPPRVSTAAITAHPTRQTPH-DSDENSIRDVSEQNQLRQDQSMERNLVDPIEN
        KEEAPMNNLHVNHVQGIPTPEIYESPKLI+VP+RK+ETTPTQLLDINFPPRVSTA I  HPTRQ P   SDENS +DVSE NQLRQ ++M+R +VDPIEN
Subjt:  KEEAPMNNLHVNHVQGIPTPEIYESPKLIAVPLRKKETTPTQLLDINFPPRVSTAAITAHPTRQTPH-DSDENSIRDVSEQNQLRQDQSMERNLVDPIEN

Query:  GEEVASTNSVNGALSEVQSCSSPVEVSAMQNGYSSEGETTMYSAETAESRNYTSPREGKFQQVGRSQSCVNYNRWGSVQRNPMAHQTMLENQRSFRNGRK
        GEEVASTNS N ALSEVQSCSSPVEVSAMQN YSSEGETTMYSAETAESRNYTSPREG FQQVGRSQSCVNYNRWGSVQ NPMA +TMLENQRSFRNGRK
Subjt:  GEEVASTNSVNGALSEVQSCSSPVEVSAMQNGYSSEGETTMYSAETAESRNYTSPREGKFQQVGRSQSCVNYNRWGSVQRNPMAHQTMLENQRSFRNGRK

Query:  MYSQGAGSYRSNDYYSPTVSSIMKKRNSSEQVNRPRQSTAAAHSSPRWMF
        MYSQGAGSYRSNDYYSPTVSSIMKKRNSSEQVNRPRQSTAAAHSSPRWMF
Subjt:  MYSQGAGSYRSNDYYSPTVSSIMKKRNSSEQVNRPRQSTAAAHSSPRWMF

XP_038900069.1 uncharacterized protein LOC120087228 [Benincasa hispida]1.2e-28392.91Show/hide
Query:  MGVLGDSLCFCKGVGKSERTKAAIFSGKAPAMARISTPAASSATAFLIHRTLLLTTHVNLPSVSAAETCEIRLQNGVAATLVPHRFFITSSVLDLTIVGL
        MGVLGDSLCFCKGVGKSERTKA IFSGK PAMARIS   A S TAFLIHR+LLLTTHVNLPSVSAAE+CEIRLQNGVAATLVPHRFFITSSVLDLTIVGL
Subjt:  MGVLGDSLCFCKGVGKSERTKAAIFSGKAPAMARISTPAASSATAFLIHRTLLLTTHVNLPSVSAAETCEIRLQNGVAATLVPHRFFITSSVLDLTIVGL

Query:  DAVDGDSNSQQLQHLKICSKPNLDLGNAVYLLGYSEKDELIISEGKVVIATDNLIKLSTDGITGSPGSAGFDAQGNLAFMVCDPMKLATSPNTKSSSTSS
        DAVDGDSNSQQLQHLKICSKPNLDLGNAVYLLGYSEKDE+IISEGKVVIATDNLIKLSTDG+T SPGSAGFDAQGNLAFMVCDPMKLATSPNTKSSSTSS
Subjt:  DAVDGDSNSQQLQHLKICSKPNLDLGNAVYLLGYSEKDELIISEGKVVIATDNLIKLSTDGITGSPGSAGFDAQGNLAFMVCDPMKLATSPNTKSSSTSS

Query:  SSSSSWKKDLPMQFGIPLPIICDWLNQHWEGSLDELNKPKPQLIRLMSSGQKSEHSSSFTLRQVFKPMEANEEETPSPSNIVSKTRDLPGPSYSTTTNTI
        S+SSSWKKDLPMQFGIPLPIICDWLNQHWEGSLDELNKPKPQLIRLMSSGQKSEHSSSFTLRQVFKPMEAN+EETPSPSNIVSKTRDLPGPSYS TTNTI
Subjt:  SSSSSWKKDLPMQFGIPLPIICDWLNQHWEGSLDELNKPKPQLIRLMSSGQKSEHSSSFTLRQVFKPMEANEEETPSPSNIVSKTRDLPGPSYSTTTNTI

Query:  KEEAPMNNLHVNHVQGIPTPEIYESPKLIAVPLRKKETTPTQLLDINFPPRVSTAAITAHPTRQTPH-DSDENSIRDVSEQNQLRQDQSMERNLVDPIEN
        KEEAPMNNLHVNHVQGIPTPEIYE+PKLIAVP+RK+ETTPT LLDINFPPRVST  ITAHPTRQTP   SDENS +DVS+QNQLRQ ++M+R LVDPIEN
Subjt:  KEEAPMNNLHVNHVQGIPTPEIYESPKLIAVPLRKKETTPTQLLDINFPPRVSTAAITAHPTRQTPH-DSDENSIRDVSEQNQLRQDQSMERNLVDPIEN

Query:  GEEVASTNSVNGALSEVQSCSSPVEVSAMQNGYSSEGETTMYSAETAESRNYTSPREGKFQQVGRSQSCVNYNRWGSVQRNPMAHQTMLENQRSFRNGRK
         EEVASTNSVNGALSEVQSCSSPVEVS MQNGYSSEGETTMYSAETAESRNYTSPREG FQQVGRSQSCVNYNRWGSVQ NPMA +TMLENQRSFRNGRK
Subjt:  GEEVASTNSVNGALSEVQSCSSPVEVSAMQNGYSSEGETTMYSAETAESRNYTSPREGKFQQVGRSQSCVNYNRWGSVQRNPMAHQTMLENQRSFRNGRK

Query:  MYSQGAGSYRSNDYYSPTVSSIMKKRNSSEQVNRPRQSTAAAHSSPRWMF
        MYSQGAGSYRSNDYYSPTVSSIMKKRNS EQVNRPRQSTAAAHSSPRWMF
Subjt:  MYSQGAGSYRSNDYYSPTVSSIMKKRNSSEQVNRPRQSTAAAHSSPRWMF

TrEMBL top hitse value%identityAlignment
A0A0A0K4L8 Uncharacterized protein2.3e-27590Show/hide
Query:  MGVLGDSLCFCKGVGKSERTKAAIFSGKAPAMARISTPAASSATAFLIHRTLLLTTHVNLPSVSAAETCEIRLQNGVAATLVPHRFFITSSVLDLTIVGL
        MGVLGDSLCFCKGVGKSERTKA IFS K PAMARIS     S TAFLIHR+LLLTTHVNLPSVSAAE CEIRLQNGVAATLVPHRFF+TSSVLDLTIVGL
Subjt:  MGVLGDSLCFCKGVGKSERTKAAIFSGKAPAMARISTPAASSATAFLIHRTLLLTTHVNLPSVSAAETCEIRLQNGVAATLVPHRFFITSSVLDLTIVGL

Query:  DAVDGDSNSQQLQHLKICSKPNLDLGNAVYLLGYSEKDELIISEGKVVIATDNLIKLSTDGITGSPGSAGFDAQGNLAFMVCDPMKLATSPNTKSSSTSS
        DAVDGDSNSQQLQHLKICSKPNL+LG+ VYLLGYSEKDELIISEGKVVIATDNLIKLSTDG+T SPGSAGFDAQGNLAFMVCDPMKLATSPNTKSSSTSS
Subjt:  DAVDGDSNSQQLQHLKICSKPNLDLGNAVYLLGYSEKDELIISEGKVVIATDNLIKLSTDGITGSPGSAGFDAQGNLAFMVCDPMKLATSPNTKSSSTSS

Query:  SSSSSWKKDLPMQFGIPLPIICDWLNQHWEGSLDELNKPKPQLIRLMSSGQKSEHSSSFTLRQVFKPMEANEEETPSPSNIVSKTRDLPGPSYSTTTNTI
        S+SSSWKKD+PMQFGIPLPIIC WLNQHWEGSLDELNKPK QLIRLMSSGQKS+HSSSFTLRQVFKPME N+EETPSPSN+VSKTRDLPGPSYSTTTNTI
Subjt:  SSSSSWKKDLPMQFGIPLPIICDWLNQHWEGSLDELNKPKPQLIRLMSSGQKSEHSSSFTLRQVFKPMEANEEETPSPSNIVSKTRDLPGPSYSTTTNTI

Query:  KEEAPMNNLHVNHVQGIPTPEIYESPKLIAVPLRKKETTPTQLLDINFPPRVSTAAITAHPTRQTPH-DSDENSIRDVSEQNQLRQDQSMERNLVDPIEN
        KEEAPMNNLHVNHVQGIPTPEIYESPKLI+VP+RK+ETTPTQLL+INFPPR+STA I  HPTRQTP   SDENS +DVS+ NQLRQ ++M+R + DPIEN
Subjt:  KEEAPMNNLHVNHVQGIPTPEIYESPKLIAVPLRKKETTPTQLLDINFPPRVSTAAITAHPTRQTPH-DSDENSIRDVSEQNQLRQDQSMERNLVDPIEN

Query:  GEEVASTNSVNGALSEVQSCSSPVEVSAMQNGYSSEGETTMYSAETAESRNYTSPREGKFQQVGRSQSCVNYNRWGSVQRNPMAHQTMLENQRSFRNGRK
        GEEVASTNSVNGALSEVQSCSSPVEVS MQ+ YSSEGETTMYSAETAESRNYTSPREG FQQVGRSQSCVNYNRWGSVQ NPMA +TMLENQRSFRNGRK
Subjt:  GEEVASTNSVNGALSEVQSCSSPVEVSAMQNGYSSEGETTMYSAETAESRNYTSPREGKFQQVGRSQSCVNYNRWGSVQRNPMAHQTMLENQRSFRNGRK

Query:  MYSQGAGSYRSNDYYSPTVSSIMKKRNSSEQVNRPRQSTAAAHSSPRWMF
        MYSQGA SYRSNDYYSPTVSSIMKKRNSSEQVNRPRQSTAAAHSSPRWMF
Subjt:  MYSQGAGSYRSNDYYSPTVSSIMKKRNSSEQVNRPRQSTAAAHSSPRWMF

A0A1S3C6X5 uncharacterized protein LOC1034975951.3e-27590.91Show/hide
Query:  MGVLGDSLCFCKGVGKSERTKAAIFSGKAPAMARISTPAASSATAFLIHRTLLLTTHVNLPSVSAAETCEIRLQNGVAATLVPHRFFITSSVLDLTIVGL
        MGVLGDSLCFCKGVGK+ERTKA IFSGK PAMARIS   A S TAFLIHR+LLLTTHVNLPSVSAAE+CEIRLQNGVAATLVPHRFF+TSSVLDLTIVGL
Subjt:  MGVLGDSLCFCKGVGKSERTKAAIFSGKAPAMARISTPAASSATAFLIHRTLLLTTHVNLPSVSAAETCEIRLQNGVAATLVPHRFFITSSVLDLTIVGL

Query:  DAVDGDSNSQQLQHLKICSKPNLDLGNAVYLLGYSEKDELIISEGKVVIATDNLIKLSTDGITGSPGSAGFDAQGNLAFMVCDPMKLATSPNTKSSSTSS
        DAV+ DSNSQQLQ LKICSKPNL+LGN VYLLGYSEKDELIISEGKVVIATDNLIKLSTDG+T SPGSAGFDAQGNLAFMVCDPMKLATSPNTKSSSTSS
Subjt:  DAVDGDSNSQQLQHLKICSKPNLDLGNAVYLLGYSEKDELIISEGKVVIATDNLIKLSTDGITGSPGSAGFDAQGNLAFMVCDPMKLATSPNTKSSSTSS

Query:  SSSSSWKKDLPMQFGIPLPIICDWLNQHWEGSLDELNKPKPQLIRLMSSGQKSEHSSSFTLRQVFKPMEANEEETPSPSNIVSKTRDLPGPSYSTTTNTI
        S+SSSWKKDLPMQFGIPLPIIC WLNQHWEGSLDELNKPK QLIRLMSSGQKSEHSSSFTLRQVFKPME N+EETPSPSN+VSKTRDLPGPSYSTTTNTI
Subjt:  SSSSSWKKDLPMQFGIPLPIICDWLNQHWEGSLDELNKPKPQLIRLMSSGQKSEHSSSFTLRQVFKPMEANEEETPSPSNIVSKTRDLPGPSYSTTTNTI

Query:  KEEAPMNNLHVNHVQGIPTPEIYESPKLIAVPLRKKETTPTQLLDINFPPRVSTAAITAHPTRQTPH-DSDENSIRDVSEQNQLRQDQSMERNLVDPIEN
        KEEAPMNNLHVNHVQGIPTPEIYESPKLI+VP+RK+ETTPTQLLDINFPPRVSTA I  HPTRQ P   SDENS +DVSE NQLRQ ++M+R +VDPIEN
Subjt:  KEEAPMNNLHVNHVQGIPTPEIYESPKLIAVPLRKKETTPTQLLDINFPPRVSTAAITAHPTRQTPH-DSDENSIRDVSEQNQLRQDQSMERNLVDPIEN

Query:  GEEVASTNSVNGALSEVQSCSSPVEVSAMQNGYSSEGETTMYSAETAESRNYTSPREGKFQQVGRSQSCVNYNRWGSVQRNPMAHQTMLENQRSFRNGRK
        GEEVASTNS N ALSEVQSCSSPVEVSAMQN YSSEGETTMYSAETAESRNYTSPREG FQQVGRSQSCVNYNRWGSVQ NPMA +TMLENQRSFRNGRK
Subjt:  GEEVASTNSVNGALSEVQSCSSPVEVSAMQNGYSSEGETTMYSAETAESRNYTSPREGKFQQVGRSQSCVNYNRWGSVQRNPMAHQTMLENQRSFRNGRK

Query:  MYSQGAGSYRSNDYYSPTVSSIMKKRNSSEQVNRPRQSTAAAHSSPRWMF
        MYSQGAGSYRSNDYYSPTVSSIMKKRNSSEQVNRPRQSTAAAHSSPRWMF
Subjt:  MYSQGAGSYRSNDYYSPTVSSIMKKRNSSEQVNRPRQSTAAAHSSPRWMF

A0A5A7SN28 Uncharacterized protein3.5e-27691.09Show/hide
Query:  MGVLGDSLCFCKGVGKSERTKAAIFSGKAPAMARISTPAASSATAFLIHRTLLLTTHVNLPSVSAAETCEIRLQNGVAATLVPHRFFITSSVLDLTIVGL
        MGVLGDSLCFCKGVGK+ERTKA IFSGK PAMARIS   A S TAFLIHR+LLLTTHVNLPSVSAAE+CEIRLQNGVAATLVPHRFF+TSSVLDLTIVGL
Subjt:  MGVLGDSLCFCKGVGKSERTKAAIFSGKAPAMARISTPAASSATAFLIHRTLLLTTHVNLPSVSAAETCEIRLQNGVAATLVPHRFFITSSVLDLTIVGL

Query:  DAVDGDSNSQQLQHLKICSKPNLDLGNAVYLLGYSEKDELIISEGKVVIATDNLIKLSTDGITGSPGSAGFDAQGNLAFMVCDPMKLATSPNTKSSSTSS
        DAV+ DSNSQQLQ LKICSKPNL+LGN VYLLGYSEKDELIISEGKVVIATDNLIKLSTDG+T SPGSAGFDAQGNLAFMVCDPMKLATSPNTKSSSTSS
Subjt:  DAVDGDSNSQQLQHLKICSKPNLDLGNAVYLLGYSEKDELIISEGKVVIATDNLIKLSTDGITGSPGSAGFDAQGNLAFMVCDPMKLATSPNTKSSSTSS

Query:  SSSSSWKKDLPMQFGIPLPIICDWLNQHWEGSLDELNKPKPQLIRLMSSGQKSEHSSSFTLRQVFKPMEANEEETPSPSNIVSKTRDLPGPSYSTTTNTI
        S+SSSWKKDLPMQFGIPLPIIC WLNQHWEGSLDELNKPK QLIRLMSSGQKSEHSSSFTLRQVFKPME N+EETPSPSN+VSKTRDLPGPSYSTTTNTI
Subjt:  SSSSSWKKDLPMQFGIPLPIICDWLNQHWEGSLDELNKPKPQLIRLMSSGQKSEHSSSFTLRQVFKPMEANEEETPSPSNIVSKTRDLPGPSYSTTTNTI

Query:  KEEAPMNNLHVNHVQGIPTPEIYESPKLIAVPLRKKETTPTQLLDINFPPRVSTAAITAHPTRQTPH-DSDENSIRDVSEQNQLRQDQSMERNLVDPIEN
        KEEAPMNNLHVNHVQGIPTPEIYESPKLI+VP+RK+ETTPTQLLDINFPPRVSTA I  HPTRQTP   SDENS +DVSE NQLRQ ++M+R +VDPIEN
Subjt:  KEEAPMNNLHVNHVQGIPTPEIYESPKLIAVPLRKKETTPTQLLDINFPPRVSTAAITAHPTRQTPH-DSDENSIRDVSEQNQLRQDQSMERNLVDPIEN

Query:  GEEVASTNSVNGALSEVQSCSSPVEVSAMQNGYSSEGETTMYSAETAESRNYTSPREGKFQQVGRSQSCVNYNRWGSVQRNPMAHQTMLENQRSFRNGRK
        GEEVASTNS N ALSEVQSCSSPVEVSAMQN YSSEGETTMYSAETAESRNYTSPREG FQQVGRSQSCVNYNRWGSVQ NPMA +TMLENQRSFRNGRK
Subjt:  GEEVASTNSVNGALSEVQSCSSPVEVSAMQNGYSSEGETTMYSAETAESRNYTSPREGKFQQVGRSQSCVNYNRWGSVQRNPMAHQTMLENQRSFRNGRK

Query:  MYSQGAGSYRSNDYYSPTVSSIMKKRNSSEQVNRPRQSTAAAHSSPRWMF
        MYSQGAGSYRSNDYYSPTVSSIMKKRNSSEQVNRPRQSTAAAHSSPRWMF
Subjt:  MYSQGAGSYRSNDYYSPTVSSIMKKRNSSEQVNRPRQSTAAAHSSPRWMF

A0A6J1GMW3 uncharacterized protein LOC1114558993.8e-27589.01Show/hide
Query:  MGVLGDSLCFCKGVGKSERTKAAIFSGKAPAMARISTPAAS---SATAFLIHRTLLLTTHVNLPSVSAAETCEIRLQNGVAATLVPHRFFITSSVLDLTI
        MGVLGDSLCFCKGVGKSER KAAIFSGKAPAMARIS PAA+   S TAFLIHR+LLLTTH+NLPSVSAAETCEIRLQNGVAA+LVPHRFFITSSVLDLTI
Subjt:  MGVLGDSLCFCKGVGKSERTKAAIFSGKAPAMARISTPAAS---SATAFLIHRTLLLTTHVNLPSVSAAETCEIRLQNGVAATLVPHRFFITSSVLDLTI

Query:  VGLDAVDGDSNSQQLQHLKICSKPNLDLGNAVYLLGYSEKDELIISEGKVVIATDNLIKLSTDGITGSPGSAGFDAQGNLAFMVCDPMKLATSPNTKSSS
        VGLDAVDGDSNSQQLQHLKICSKPNLDLGN+VYLLGYSEKDELIISEGKVVIATDNLIKLSTDG+T SPGSAGFDAQGNLAFMVCDPMKLATSPNTKSSS
Subjt:  VGLDAVDGDSNSQQLQHLKICSKPNLDLGNAVYLLGYSEKDELIISEGKVVIATDNLIKLSTDGITGSPGSAGFDAQGNLAFMVCDPMKLATSPNTKSSS

Query:  TSSSSSSSWKKDLPMQFGIPLPIICDWLNQHWEGSLDELNKPKPQLIRLMSSGQKSEHSSSFTLRQVFKPMEANEEETPSPSNIVSKTRDLPGPSYSTTT
        TSSS+SSSWKKDLPMQFGIPLPI+CDWLNQHWEGSLDELNKPKPQLIRLMSSGQKSEHSSSF+LRQVFKPME N+EETPSPSNIVSKTRD+ GPSYS T+
Subjt:  TSSSSSSSWKKDLPMQFGIPLPIICDWLNQHWEGSLDELNKPKPQLIRLMSSGQKSEHSSSFTLRQVFKPMEANEEETPSPSNIVSKTRDLPGPSYSTTT

Query:  NTIKEEAPMNNLHVNHVQGIPTPEIYESPKLIAVPLRKKETTPTQLLDINFPPRVSTAAITAHPTRQTPHDSDENSIRDVSEQNQLRQDQSMERNLVDPI
        NTIKEEAP+ NLHVNH QGIPTPEIYESPKLIAVP+RKKE TPTQLLDINFPPRVSTA IT HPTR T   SDENS +DVS+QNQLRQD++M R LV+P+
Subjt:  NTIKEEAPMNNLHVNHVQGIPTPEIYESPKLIAVPLRKKETTPTQLLDINFPPRVSTAAITAHPTRQTPHDSDENSIRDVSEQNQLRQDQSMERNLVDPI

Query:  ENG---EEVASTNSVNGALSEVQSCSSPVEVSAMQNGYSSEGETTMYSAETAESRNYTSPREGKFQQVGRSQSCVNYNRWGSVQRNPMAHQTMLENQRSF
        ENG   EEVASTNSVNGALSEVQSCSSP+E SA+Q+ YSSEGE TMYSAETAESRNYTSPREGKFQQVGRSQSCVNYNRWGSVQRNPMA QTMLENQ+S+
Subjt:  ENG---EEVASTNSVNGALSEVQSCSSPVEVSAMQNGYSSEGETTMYSAETAESRNYTSPREGKFQQVGRSQSCVNYNRWGSVQRNPMAHQTMLENQRSF

Query:  RNGRKMYSQGAGSYRSNDYYSPTVSSIMKKRNSSEQVNRPRQSTAAAHSSPRWMF
        ++GR M+SQGAGSYRSNDYY PTVSSIMKKRNSSEQVNRPRQS+AA HSSPRWMF
Subjt:  RNGRKMYSQGAGSYRSNDYYSPTVSSIMKKRNSSEQVNRPRQSTAAAHSSPRWMF

A0A6J1JRF6 uncharacterized protein LOC1114882231.2e-27388.65Show/hide
Query:  MGVLGDSLCFCKGVGKSERTKAAIFSGKAPAMARISTPAAS---SATAFLIHRTLLLTTHVNLPSVSAAETCEIRLQNGVAATLVPHRFFITSSVLDLTI
        MGVLGDSLCFCKGVGKSER KAAIFSGKAPAMARIS PAA+   S TAFLIHR+LLLTTH+NLPSVSAAETCEIRLQNGVAA+LVPHRFFITSSVLDLTI
Subjt:  MGVLGDSLCFCKGVGKSERTKAAIFSGKAPAMARISTPAAS---SATAFLIHRTLLLTTHVNLPSVSAAETCEIRLQNGVAATLVPHRFFITSSVLDLTI

Query:  VGLDAVDGDSNSQQLQHLKICSKPNLDLGNAVYLLGYSEKDELIISEGKVVIATDNLIKLSTDGITGSPGSAGFDAQGNLAFMVCDPMKLATSPNTKSSS
        VGLDAVDGDSNSQQLQHLKICSKPNLD GN+VYLLGYSEKDELIISEGKVVIATDNLIKLSTDG+T SPGSAGFDAQGNLAFMVCDPMKLATSPNTKSSS
Subjt:  VGLDAVDGDSNSQQLQHLKICSKPNLDLGNAVYLLGYSEKDELIISEGKVVIATDNLIKLSTDGITGSPGSAGFDAQGNLAFMVCDPMKLATSPNTKSSS

Query:  TSSSSSSSWKKDLPMQFGIPLPIICDWLNQHWEGSLDELNKPKPQLIRLMSSGQKSEHSSSFTLRQVFKPMEANEEETPSPSNIVSKTRDLPGPSYSTTT
        TSSS+SSSWKKDLPMQFGIPLPI+CDWLNQHWEGSLDELNKPKPQLIRLMSSGQKSEHSSSF+LRQVFKPME N+EETPSPSNIVSKTRD+ GPSYS T+
Subjt:  TSSSSSSSWKKDLPMQFGIPLPIICDWLNQHWEGSLDELNKPKPQLIRLMSSGQKSEHSSSFTLRQVFKPMEANEEETPSPSNIVSKTRDLPGPSYSTTT

Query:  NTIKEEAPMNNLHVNHVQGIPTPEIYESPKLIAVPLRKKETTPTQLLDINFPPRVSTAAITAHPTRQTPHDSDENSIRDVSEQNQLRQDQSMERNLVDPI
        NTIKEEA + NLHVNH QGIPTPEIYESPKLIAVP+RKKE TPTQLLDINFPPRVSTA IT HPTR T   SDENS +DVS+QNQLRQD++M R LV+P+
Subjt:  NTIKEEAPMNNLHVNHVQGIPTPEIYESPKLIAVPLRKKETTPTQLLDINFPPRVSTAAITAHPTRQTPHDSDENSIRDVSEQNQLRQDQSMERNLVDPI

Query:  ENG---EEVASTNSVNGALSEVQSCSSPVEVSAMQNGYSSEGETTMYSAETAESRNYTSPREGKFQQVGRSQSCVNYNRWGSVQRNPMAHQTMLENQRSF
        ENG   EEVASTNSVNGALSEVQSCSSP+E S +Q+ YSSEGETTMYSAETAESRNYTSPREGKFQQVGRSQSCVNYNRWGSVQRNPMA QTMLENQ+S+
Subjt:  ENG---EEVASTNSVNGALSEVQSCSSPVEVSAMQNGYSSEGETTMYSAETAESRNYTSPREGKFQQVGRSQSCVNYNRWGSVQRNPMAHQTMLENQRSF

Query:  RNGRKMYSQGAGSYRSNDYYSPTVSSIMKKRNSSEQVNRPRQSTAAAHSSPRWMF
        ++GR M+SQGAGSYRSNDYY PTVSSIMKKRNSSEQVNRPRQS+AA HSSPRWMF
Subjt:  RNGRKMYSQGAGSYRSNDYYSPTVSSIMKKRNSSEQVNRPRQSTAAAHSSPRWMF

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G07210.1 unknown protein1.6e-16160.95Show/hide
Query:  MGVLGDSLCFCKGVGKSERTKAAIFSGKAPAMARISTP---AASSATAFLIHRTLLLTTHVNLPSVSAAETCEIRLQNGVAATLVPHRFFITSSVLDLTI
        MGV+ DS CFCKGVGKSE+ K +IF+GKAPAMARIS       +S T FLIHR LLLTTH+NLPS+SA ET E+RLQNGVAA L PHRFFITSSV+DLTI
Subjt:  MGVLGDSLCFCKGVGKSERTKAAIFSGKAPAMARISTP---AASSATAFLIHRTLLLTTHVNLPSVSAAETCEIRLQNGVAATLVPHRFFITSSVLDLTI

Query:  VGLDAVDGDSNSQ-QLQ----HLKICSKPNLDLGNAVYLLGYSEKDELIISEGKVVIATDNLIKLSTDGITGSPGSAGFDAQGNLAFMVCDPMKLATSPN
        VGLD VDGDS+SQ QLQ    +LK CSKPNLDLG+ VYLLGY+ ++EL I EGK+V+ATDNLIKLSTD +  SPGSAGFD QGNLAFM+CDP KL+TSP 
Subjt:  VGLDAVDGDSNSQ-QLQ----HLKICSKPNLDLGNAVYLLGYSEKDELIISEGKVVIATDNLIKLSTDGITGSPGSAGFDAQGNLAFMVCDPMKLATSPN

Query:  TKSSSTSSSSSSSWKKDLPMQFGIPLPIICDWLNQHWEGSLDE-LNKPKPQLIRLMSSGQKSEHS-SSFTLRQVFKPMEANEEETPSPSNIVSKTRDLPG
        + SSS+SSS      K L MQFGIP+P+ICDWLNQHWEGSLDE   KPK  LIRLMSSGQKSE S +SFT+R+VFKP ++ +  TPS SN    TRD   
Subjt:  TKSSSTSSSSSSSWKKDLPMQFGIPLPIICDWLNQHWEGSLDE-LNKPKPQLIRLMSSGQKSEHS-SSFTLRQVFKPMEANEEETPSPSNIVSKTRDLPG

Query:  PSYSTTTNTIKEEA-----PMNNLHVNHVQGIPTPEIYESPKLIAVPLRKKETTPTQLLDINFPPRVSTAAITAHPTRQTPHDSDENSIRDVSEQNQLRQ
        PS S      KEE      P       H QGIPTPEIYESPKL + PLR  ET    LLDINFPPR+   AIT HP                 E N L+ 
Subjt:  PSYSTTTNTIKEEA-----PMNNLHVNHVQGIPTPEIYESPKLIAVPLRKKETTPTQLLDINFPPRVSTAAITAHPTRQTPHDSDENSIRDVSEQNQLRQ

Query:  DQSMERNLVDPIENGEEVASTNSVNGALSEVQSCSSPVEVSAMQN--GYSSEGETTMYSAETAESRNY-TSPREGKF--QQVGRSQSCVNYNRWGSVQRN
        +  +E   +    +  ++AST SVNGALSEV S S P     + N  GYSSE E TMYSAETAESRNY T PR+ +F  ++VGRSQSCV+ +RWG+ Q++
Subjt:  DQSMERNLVDPIENGEEVASTNSVNGALSEVQSCSSPVEVSAMQN--GYSSEGETTMYSAETAESRNY-TSPREGKF--QQVGRSQSCVNYNRWGSVQRN

Query:  PMAHQTMLENQRSFRNGRKMYSQGAGSYRSNDYYSPTVSSIMKKR-NSSE-QVNRPRQSTAAAHSSPRWMF
            + MLE QRSF +G+KM+SQGA S RSNDYYSPTVSSIMKKR NSSE Q+ +P     A  SSPRW F
Subjt:  PMAHQTMLENQRSFRNGRKMYSQGAGSYRSNDYYSPTVSSIMKKR-NSSE-QVNRPRQSTAAAHSSPRWMF


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGTGTTTTGGGCGATTCCTTGTGCTTCTGCAAAGGGGTCGGCAAGTCCGAGCGAACCAAGGCTGCCATTTTTTCCGGCAAAGCCCCCGCCATGGCTCGAATCTCCAC
CCCCGCCGCTTCTTCCGCCACCGCTTTTTTGATCCACCGGACCCTTCTTTTGACCACCCATGTTAATCTTCCCTCCGTTTCTGCTGCTGAGACTTGCGAGATCCGCCTCC
AAAACGGCGTCGCTGCCACTCTCGTTCCTCACAGGTTCTTCATTACAAGTTCTGTTTTGGATCTTACAATAGTGGGCTTAGATGCAGTGGATGGAGACTCCAACTCTCAG
CAACTTCAACACTTGAAGATATGTTCTAAACCGAATCTTGATCTCGGCAATGCTGTTTACCTCTTGGGATATTCTGAGAAAGACGAATTAATCATTAGTGAGGGCAAGGT
GGTTATTGCCACTGATAACCTAATAAAATTGTCGACCGATGGGATAACAGGGAGTCCCGGGTCTGCTGGATTTGATGCACAAGGTAACCTTGCATTCATGGTATGTGACC
CTATGAAACTAGCCACGTCTCCGAATACTAAATCATCGTCAACGTCTTCGTCATCTTCATCATCGTGGAAGAAGGATCTTCCTATGCAATTTGGGATTCCTCTGCCAATC
ATTTGTGATTGGTTAAACCAACATTGGGAAGGTAGTCTTGATGAACTTAACAAACCAAAGCCACAACTCATAAGATTGATGTCTTCGGGACAAAAGAGTGAGCATTCTTC
TTCCTTTACATTGAGGCAAGTTTTTAAGCCAATGGAAGCGAATGAAGAGGAAACACCATCACCATCAAATATAGTCTCGAAAACCAGGGATCTGCCCGGACCGAGCTATT
CCACTACAACAAACACCATCAAGGAAGAGGCTCCGATGAATAACCTACATGTGAACCATGTGCAAGGCATTCCTACACCTGAAATATATGAATCACCAAAGTTGATTGCA
GTTCCTCTTCGAAAAAAGGAAACCACGCCGACGCAGCTTCTGGATATCAACTTTCCTCCAAGAGTTTCCACGGCTGCGATTACGGCGCATCCTACCAGACAAACCCCACA
TGACTCTGATGAAAATTCCATAAGGGATGTTTCTGAACAGAACCAATTGAGACAAGACCAAAGCATGGAGAGAAATCTCGTCGATCCAATCGAAAATGGAGAAGAGGTTG
CCTCAACAAATTCTGTAAATGGGGCCTTAAGTGAAGTTCAGTCTTGTTCATCTCCCGTGGAAGTTTCAGCGATGCAGAACGGGTATAGCAGCGAGGGAGAGACGACAATG
TACTCTGCAGAAACTGCAGAGAGCCGAAACTATACAAGTCCTAGAGAAGGGAAGTTTCAGCAGGTTGGAAGGAGTCAGAGTTGCGTAAACTACAACAGATGGGGATCGGT
CCAAAGAAATCCGATGGCTCACCAAACAATGCTAGAGAATCAAAGAAGTTTCAGAAATGGAAGGAAGATGTATTCTCAAGGGGCTGGATCTTACAGGAGCAATGACTACT
ACAGCCCGACGGTCTCCTCGATCATGAAGAAGCGAAACAGCTCGGAACAAGTTAACCGACCAAGGCAAAGCACTGCTGCTGCTCATTCTTCCCCAAGATGGATGTTCTGA
mRNA sequenceShow/hide mRNA sequence
CTTACAAACACTAGACAAGACTTCCCATTCTCTTTCTCTTAATCTTTCTGTGTGTATATAATTATACGACACAAAAATGGCGACCCATTTCACATTTTAACACTTGACAT
TACCCCTTTCACTCAAATGCTGCAATTCCCACTCCCATTTCTTCTTCTAGATCCATAATTCCCCCTACCCATTTCCCAATTCACCAACAAGATCTCATTTTTCAATGGGT
GTTTTGGGCGATTCCTTGTGCTTCTGCAAAGGGGTCGGCAAGTCCGAGCGAACCAAGGCTGCCATTTTTTCCGGCAAAGCCCCCGCCATGGCTCGAATCTCCACCCCCGC
CGCTTCTTCCGCCACCGCTTTTTTGATCCACCGGACCCTTCTTTTGACCACCCATGTTAATCTTCCCTCCGTTTCTGCTGCTGAGACTTGCGAGATCCGCCTCCAAAACG
GCGTCGCTGCCACTCTCGTTCCTCACAGGTTCTTCATTACAAGTTCTGTTTTGGATCTTACAATAGTGGGCTTAGATGCAGTGGATGGAGACTCCAACTCTCAGCAACTT
CAACACTTGAAGATATGTTCTAAACCGAATCTTGATCTCGGCAATGCTGTTTACCTCTTGGGATATTCTGAGAAAGACGAATTAATCATTAGTGAGGGCAAGGTGGTTAT
TGCCACTGATAACCTAATAAAATTGTCGACCGATGGGATAACAGGGAGTCCCGGGTCTGCTGGATTTGATGCACAAGGTAACCTTGCATTCATGGTATGTGACCCTATGA
AACTAGCCACGTCTCCGAATACTAAATCATCGTCAACGTCTTCGTCATCTTCATCATCGTGGAAGAAGGATCTTCCTATGCAATTTGGGATTCCTCTGCCAATCATTTGT
GATTGGTTAAACCAACATTGGGAAGGTAGTCTTGATGAACTTAACAAACCAAAGCCACAACTCATAAGATTGATGTCTTCGGGACAAAAGAGTGAGCATTCTTCTTCCTT
TACATTGAGGCAAGTTTTTAAGCCAATGGAAGCGAATGAAGAGGAAACACCATCACCATCAAATATAGTCTCGAAAACCAGGGATCTGCCCGGACCGAGCTATTCCACTA
CAACAAACACCATCAAGGAAGAGGCTCCGATGAATAACCTACATGTGAACCATGTGCAAGGCATTCCTACACCTGAAATATATGAATCACCAAAGTTGATTGCAGTTCCT
CTTCGAAAAAAGGAAACCACGCCGACGCAGCTTCTGGATATCAACTTTCCTCCAAGAGTTTCCACGGCTGCGATTACGGCGCATCCTACCAGACAAACCCCACATGACTC
TGATGAAAATTCCATAAGGGATGTTTCTGAACAGAACCAATTGAGACAAGACCAAAGCATGGAGAGAAATCTCGTCGATCCAATCGAAAATGGAGAAGAGGTTGCCTCAA
CAAATTCTGTAAATGGGGCCTTAAGTGAAGTTCAGTCTTGTTCATCTCCCGTGGAAGTTTCAGCGATGCAGAACGGGTATAGCAGCGAGGGAGAGACGACAATGTACTCT
GCAGAAACTGCAGAGAGCCGAAACTATACAAGTCCTAGAGAAGGGAAGTTTCAGCAGGTTGGAAGGAGTCAGAGTTGCGTAAACTACAACAGATGGGGATCGGTCCAAAG
AAATCCGATGGCTCACCAAACAATGCTAGAGAATCAAAGAAGTTTCAGAAATGGAAGGAAGATGTATTCTCAAGGGGCTGGATCTTACAGGAGCAATGACTACTACAGCC
CGACGGTCTCCTCGATCATGAAGAAGCGAAACAGCTCGGAACAAGTTAACCGACCAAGGCAAAGCACTGCTGCTGCTCATTCTTCCCCAAGATGGATGTTCTGATGAATG
ATCATTCCACACAAGAAAAGGGAGAATGGGGAACAAAAATATGAAGGTTTGTTAGTGTATTTGGTTCCCATTTTATCTAAATTTAGATACATCTTACTCATTTCTTCACT
TGTGCAGAGACCAATTTCAGGTGAATAAGAAAATTTTCCTTCCACATTCTTCATTGTAATTGTAAAATTAGTGTGACTTAATTCTTGATTGAGAGGAAAAAAGAGTAAAG
AATAGAAGAACCAGATGATAAAATGTCTGGAATTAGTTAGTTTTTTTTTTTTTTTTTTTGGTTGTGCTTATGCTAAATGAAGCACCATTTTGTGATGCCCTTTTTGTAAA
CAGGAAGTCAATGCAATCCATTTGGCTTCACTGCAAGTAAGTGAACTAATTTGAAATTTTGAGTGCCTTAGAGCTACACGTTCGCCACTTCTATGTCATTGTTTTC
Protein sequenceShow/hide protein sequence
MGVLGDSLCFCKGVGKSERTKAAIFSGKAPAMARISTPAASSATAFLIHRTLLLTTHVNLPSVSAAETCEIRLQNGVAATLVPHRFFITSSVLDLTIVGLDAVDGDSNSQ
QLQHLKICSKPNLDLGNAVYLLGYSEKDELIISEGKVVIATDNLIKLSTDGITGSPGSAGFDAQGNLAFMVCDPMKLATSPNTKSSSTSSSSSSSWKKDLPMQFGIPLPI
ICDWLNQHWEGSLDELNKPKPQLIRLMSSGQKSEHSSSFTLRQVFKPMEANEEETPSPSNIVSKTRDLPGPSYSTTTNTIKEEAPMNNLHVNHVQGIPTPEIYESPKLIA
VPLRKKETTPTQLLDINFPPRVSTAAITAHPTRQTPHDSDENSIRDVSEQNQLRQDQSMERNLVDPIENGEEVASTNSVNGALSEVQSCSSPVEVSAMQNGYSSEGETTM
YSAETAESRNYTSPREGKFQQVGRSQSCVNYNRWGSVQRNPMAHQTMLENQRSFRNGRKMYSQGAGSYRSNDYYSPTVSSIMKKRNSSEQVNRPRQSTAAAHSSPRWMF