CuGenDBv2

Carg01854 (gene) of Silver-seed gourd (SMH-JMG-627) v2 genome

Gene ID: Carg01854
Organism: Cucurbita argyrosperma subsp. argyrosperma cv. SMH-JMG-627 (Silver-seed gourd (SMH-JMG-627) v2)
Description: Protein of unknown function (DUF789)
Genome location: Carg_Chr04:8984530..8988023
RNA-Seq Expression: Carg01854
Synteny: Carg01854
Gene Ontology terms: NA
InterPro domains: IPR008507 - Protein of unknown function DUF789
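The genome location field follows a seqid:start..end pattern. As a minimal sketch, the snippet below splits that string and computes the locus span. The helper names `parse_location` and `span_length` are hypothetical, and the span assumes 1-based, inclusive coordinates, which this page does not state.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'seqid:start..end' string into (seqid, start, end)."""
    match = re.fullmatch(r"(?P<seqid>[^:]+):(?P<start>\d+)\.\.(?P<end>\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location: {location!r}")
    return match["seqid"], int(match["start"]), int(match["end"])


def span_length(start: int, end: int) -> int:
    # Assumption: coordinates are 1-based and inclusive (not stated on the page).
    return end - start + 1


seqid, start, end = parse_location("Carg_Chr04:8984530..8988023")
print(seqid, start, end, span_length(start, end))
```

Under the inclusive-coordinate assumption the locus spans 3494 bp, which covers introns as well as the mRNA.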


Homology
GenBank top hits (e value, % identity, alignment)
KAG6601276.1 hypothetical protein SDJN03_06509, partial [Cucurbita argyrosperma subsp. sororia] | e value: 2.9e-229 | % identity: 95.47
Query:  MLGMGVRFGRGRGEDRFYDSSRARKGLLSRQNDRLRRPQQHASATTPSRAVSDVFVPSSITRRPGGRLESDEATEPVAISNPQPAVSLLSNLERFLQSTT
        MLGMGVRFGRGRGEDRFYDSSRARKGLLSRQNDRLRRPQQHASATTPSRAVSDVFVPSSIT RPGGRLES EATEPVAISNPQPAVSLLSNLERFLQSTT
Subjt:  MLGMGVRFGRGRGEDRFYDSSRARKGLLSRQNDRLRRPQQHASATTPSRAVSDVFVPSSITRRPGGRLESDEATEPVAISNPQPAVSLLSNLERFLQSTT

Query:  PSVPSQFLSKSAWRGCRMRDSETQPYFVLGDLWEAFKEWSAYGTGVPLLLNNTDGVVQYYVPYLSGIQLYAMESSTRPRRWCEESDSDSSSDGSSDSETK
        PSVPSQFLSKSAWRGCRMRDSETQPYFVLGDLWEAFKEWSAYGTGVPLLLNNTDGVVQYYVPYLSGIQLYAMESSTRPRRWCEESDSDSSSDGSSDSETK
Subjt:  PSVPSQFLSKSAWRGCRMRDSETQPYFVLGDLWEAFKEWSAYGTGVPLLLNNTDGVVQYYVPYLSGIQLYAMESSTRPRRWCEESDSDSSSDGSSDSETK

Query:  RRIKHSREPPHHSDPFITTPFRMDRLSLRDQHLGLHQDCSSDEAESFNSQGRLLFEYFERDLPYLREPLADKASGFLCFDTISDLASRFPQLKTMRSCDL
        RRIKHSREPPHHSDPFITTPFRMDRLSLRDQHLGLHQDCSSDEAESFNSQGRLLFEY ERDLPYLREPLADK         ISDLASRFPQLKTMRSCDL
Subjt:  RRIKHSREPPHHSDPFITTPFRMDRLSLRDQHLGLHQDCSSDEAESFNSQGRLLFEYFERDLPYLREPLADKASGFLCFDTISDLASRFPQLKTMRSCDL

Query:  LPHSWISVAW-----IPTGQTLKDLDACFLTYHYLHTPIRSPRSPQMPSVTYPCKADGAKKIPLRIFGLASYKFNGSSLWMRNGGVEHQLANVLSKAADE
        LPHSWISVAW     IPTGQTLKDLDACFLTYHYLHTPIRSPRSPQMPSVTYPC+ADGAKKIPLRIFGLASYKFNGSSLWMRNGGVEHQLANVLSKAADE
Subjt:  LPHSWISVAW-----IPTGQTLKDLDACFLTYHYLHTPIRSPRSPQMPSVTYPCKADGAKKIPLRIFGLASYKFNGSSLWMRNGGVEHQLANVLSKAADE

Query:  WLRSLQVNHPDFQFFSRQM
        WLRSLQVNHPDFQFFSRQ+
Subjt:  WLRSLQVNHPDFQFFSRQM

KAG7032064.1 hypothetical protein SDJN02_06107 [Cucurbita argyrosperma subsp. argyrosperma] | e value: 1.0e-242 | % identity: 100
Query:  MLGMGVRFGRGRGEDRFYDSSRARKGLLSRQNDRLRRPQQHASATTPSRAVSDVFVPSSITRRPGGRLESDEATEPVAISNPQPAVSLLSNLERFLQSTT
        MLGMGVRFGRGRGEDRFYDSSRARKGLLSRQNDRLRRPQQHASATTPSRAVSDVFVPSSITRRPGGRLESDEATEPVAISNPQPAVSLLSNLERFLQSTT
Subjt:  MLGMGVRFGRGRGEDRFYDSSRARKGLLSRQNDRLRRPQQHASATTPSRAVSDVFVPSSITRRPGGRLESDEATEPVAISNPQPAVSLLSNLERFLQSTT

Query:  PSVPSQFLSKSAWRGCRMRDSETQPYFVLGDLWEAFKEWSAYGTGVPLLLNNTDGVVQYYVPYLSGIQLYAMESSTRPRRWCEESDSDSSSDGSSDSETK
        PSVPSQFLSKSAWRGCRMRDSETQPYFVLGDLWEAFKEWSAYGTGVPLLLNNTDGVVQYYVPYLSGIQLYAMESSTRPRRWCEESDSDSSSDGSSDSETK
Subjt:  PSVPSQFLSKSAWRGCRMRDSETQPYFVLGDLWEAFKEWSAYGTGVPLLLNNTDGVVQYYVPYLSGIQLYAMESSTRPRRWCEESDSDSSSDGSSDSETK

Query:  RRIKHSREPPHHSDPFITTPFRMDRLSLRDQHLGLHQDCSSDEAESFNSQGRLLFEYFERDLPYLREPLADKASGFLCFDTISDLASRFPQLKTMRSCDL
        RRIKHSREPPHHSDPFITTPFRMDRLSLRDQHLGLHQDCSSDEAESFNSQGRLLFEYFERDLPYLREPLADKASGFLCFDTISDLASRFPQLKTMRSCDL
Subjt:  RRIKHSREPPHHSDPFITTPFRMDRLSLRDQHLGLHQDCSSDEAESFNSQGRLLFEYFERDLPYLREPLADKASGFLCFDTISDLASRFPQLKTMRSCDL

Query:  LPHSWISVAWIPTGQTLKDLDACFLTYHYLHTPIRSPRSPQMPSVTYPCKADGAKKIPLRIFGLASYKFNGSSLWMRNGGVEHQLANVLSKAADEWLRSL
        LPHSWISVAWIPTGQTLKDLDACFLTYHYLHTPIRSPRSPQMPSVTYPCKADGAKKIPLRIFGLASYKFNGSSLWMRNGGVEHQLANVLSKAADEWLRSL
Subjt:  LPHSWISVAWIPTGQTLKDLDACFLTYHYLHTPIRSPRSPQMPSVTYPCKADGAKKIPLRIFGLASYKFNGSSLWMRNGGVEHQLANVLSKAADEWLRSL

Query:  QVNHPDFQFFSRQM
        QVNHPDFQFFSRQM
Subjt:  QVNHPDFQFFSRQM

XP_022957386.1 uncharacterized protein LOC111458799 [Cucurbita moschata] | e value: 1.6e-232 | % identity: 96.66
Query:  MLGMGVRFGRGRGEDRFYDSSRARKGLLSRQNDRLRRPQQHASATTPSRAVSDVFVPSSITRRPGGRLESDEATEPVAISNPQPAVSLLSNLERFLQSTT
        MLGMGVRFGRGRGEDRFYDSSRARKGLLSRQNDRLRRPQQHASATTPSRAVSDVFVPSSITRRPGGRLESDEATEPVAISNPQPAVSLLSNLERFLQSTT
Subjt:  MLGMGVRFGRGRGEDRFYDSSRARKGLLSRQNDRLRRPQQHASATTPSRAVSDVFVPSSITRRPGGRLESDEATEPVAISNPQPAVSLLSNLERFLQSTT

Query:  PSVPSQFLSKSAWRGCRMRDSETQPYFVLGDLWEAFKEWSAYGTGVPLLLNNTDGVVQYYVPYLSGIQLYAMESSTRPRRWCEESDSDSSSDGSSDSETK
        PSVPSQFLSKSAWRGCRMRDSETQPYFVLGDLWEAFKEWSAYGTGVPLLLNNTDGVVQYYVPYLSGIQLYAMESSTRPRRWCEESDSDSSSDGSSDSETK
Subjt:  PSVPSQFLSKSAWRGCRMRDSETQPYFVLGDLWEAFKEWSAYGTGVPLLLNNTDGVVQYYVPYLSGIQLYAMESSTRPRRWCEESDSDSSSDGSSDSETK

Query:  RRIKHSREPPHHSDPFITTPFRMDRLSLRDQHLGLHQDCSSDEAESFNSQGRLLFEYFERDLPYLREPLADKASGFLCFDTISDLASRFPQLKTMRSCDL
        RRIKHSREPPHHSDPFITTPFRMDRLSLRDQHLGLHQDCSSDEAESFNSQGRLLFEYFERDLPYLREPLADK         ISDLASRFPQLKTMRSCDL
Subjt:  RRIKHSREPPHHSDPFITTPFRMDRLSLRDQHLGLHQDCSSDEAESFNSQGRLLFEYFERDLPYLREPLADKASGFLCFDTISDLASRFPQLKTMRSCDL

Query:  LPHSWISVAW-----IPTGQTLKDLDACFLTYHYLHTPIRSPRSPQMPSVTYPCKADGAKKIPLRIFGLASYKFNGSSLWMRNGGVEHQLANVLSKAADE
        LPHSWISVAW     IPTGQTLKDLDACFLTYHYLHTPIRSPRSPQMPSVTYPCKADGAKKIPLRIFGLASYKFNGSSLWMRNGGVEHQLANVLSKAADE
Subjt:  LPHSWISVAW-----IPTGQTLKDLDACFLTYHYLHTPIRSPRSPQMPSVTYPCKADGAKKIPLRIFGLASYKFNGSSLWMRNGGVEHQLANVLSKAADE

Query:  WLRSLQVNHPDFQFFSRQM
        WLRSLQVNHPDFQFFSRQM
Subjt:  WLRSLQVNHPDFQFFSRQM

XP_022980279.1 uncharacterized protein LOC111479669 [Cucurbita maxima] | e value: 1.6e-219 | % identity: 92.18
Query:  MLGMGVRFGRGRGEDRFYDSSRARKGLLSRQNDRLRRPQQHASATTPSRAVSDVFVPSSITRRPGGRLESDEATEPVAISNPQPAVSLLSNLERFLQSTT
        MLG GVRFGRGRGEDRFYDSSRARKGLLSRQNDRLRRPQQHASATTPSRAV+DVFVPSSIT RPG RLESDEATEPVAISNPQPAVS LSNLERFLQSTT
Subjt:  MLGMGVRFGRGRGEDRFYDSSRARKGLLSRQNDRLRRPQQHASATTPSRAVSDVFVPSSITRRPGGRLESDEATEPVAISNPQPAVSLLSNLERFLQSTT

Query:  PSVPSQFLSKSAWRGCRMRDSETQPYFVLGDLWEAFKEWSAYGTGVPLLLNNTDGVVQYYVPYLSGIQLYAMESSTRPRRWCEESDS---DSSSDGSSDS
        PSVP+QFLSKSA RGCRM DSETQPYFVL DLWEAFKEWSAYG GVPLLLNNTDGVVQYYVPYLSGIQLYAMESSTRPRRWCEESDS   DSSSDGSSDS
Subjt:  PSVPSQFLSKSAWRGCRMRDSETQPYFVLGDLWEAFKEWSAYGTGVPLLLNNTDGVVQYYVPYLSGIQLYAMESSTRPRRWCEESDS---DSSSDGSSDS

Query:  ETKRRIKHSREPPHHSDPFITTPFRMDRLSLRDQHLGLHQDCSSDEAESFNSQGRLLFEYFERDLPYLREPLADKASGFLCFDTISDLASRFPQLKTMRS
        E KRRIKHSREPPHHSDPFITTPFRMDRLSLRDQHLG HQDCSSDEAESFNSQGRLLFEY ERDLPYLREPLADK         ISDLASRFPQLKTMRS
Subjt:  ETKRRIKHSREPPHHSDPFITTPFRMDRLSLRDQHLGLHQDCSSDEAESFNSQGRLLFEYFERDLPYLREPLADKASGFLCFDTISDLASRFPQLKTMRS

Query:  CDLLPHSWISVAW-----IPTGQTLKDLDACFLTYHYLHTPIRSPRSPQMPSVTYPCKADGAKKIPLRIFGLASYKFNGSSLWMRNGGVEHQLANVLSKA
        CDLLPHSWISVAW     IPTGQTLKDLDACFLTYHYLHTPIRSPRSPQ+PSVTYPCK DGAKKIPLRIFGLASYKFNGSSLWMRNGGVEHQLAN LSKA
Subjt:  CDLLPHSWISVAW-----IPTGQTLKDLDACFLTYHYLHTPIRSPRSPQMPSVTYPCKADGAKKIPLRIFGLASYKFNGSSLWMRNGGVEHQLANVLSKA

Query:  ADEWLRSLQVNHPDFQFFSRQM
        ADEWLRSLQVNHPDFQFFSRQM
Subjt:  ADEWLRSLQVNHPDFQFFSRQM

XP_023514506.1 uncharacterized protein LOC111778762 isoform X1 [Cucurbita pepo subsp. pepo] | e value: 3.0e-226 | % identity: 94.75
Query:  MLGMGVRFGRGRGEDRFYDSSRARKGLLSRQNDRLRRPQQHASATTPSRAVSDVFVPSSITRRPGGRLESDEATEPVAISNPQPAVSLLSNLERFLQSTT
        MLGMG+RFGRGRGEDRFYDSSRARKGLLSRQNDRLRRPQQHASATTPSRAVSDVFVPSSIT RPGGRLESDEATEPVAISNPQPAVSLLSNLERFLQSTT
Subjt:  MLGMGVRFGRGRGEDRFYDSSRARKGLLSRQNDRLRRPQQHASATTPSRAVSDVFVPSSITRRPGGRLESDEATEPVAISNPQPAVSLLSNLERFLQSTT

Query:  PSVPSQFLSKSAWRGCRMRDSETQPYFVLGDLWEAFKEWSAYGTGVPLLLNNTDGVVQYYVPYLSGIQLYAMESSTRPRRWCEESDSDSSSDGSSDSETK
        PSVP+ FLSKSAWRGCRMRDSETQPYFVLGDLWEAFKE SAYGTGVPLLLNNTDGVVQYYVPYLSGIQLYAMESSTRPRRWCEESDSDSSSDGSSDSETK
Subjt:  PSVPSQFLSKSAWRGCRMRDSETQPYFVLGDLWEAFKEWSAYGTGVPLLLNNTDGVVQYYVPYLSGIQLYAMESSTRPRRWCEESDSDSSSDGSSDSETK

Query:  RRIKHSREPPHHSDPFITTPFRMDRLSLRDQHLGLHQDCSSDEAESFNSQGRLLFEYFERDLPYLREPLADKASGFLCFDTISDLASRFPQLKTMRSCDL
        RRIKHSREPPHHSDPFITTPFRMDRLSLRDQHLGLHQDCSSDEAE FNSQGRLLFEY ERDLPYLREPLADK         IS LASRFPQLKTMRSCDL
Subjt:  RRIKHSREPPHHSDPFITTPFRMDRLSLRDQHLGLHQDCSSDEAESFNSQGRLLFEYFERDLPYLREPLADKASGFLCFDTISDLASRFPQLKTMRSCDL

Query:  LPHSWISVAW-----IPTGQTLKDLDACFLTYHYLHTPIRSPRSPQMPSVTYPCKADGAKKIPLRIFGLASYKFNGSSLWMRNGGVEHQLANVLSKAADE
        LPHSWISVAW     IPTGQTLKDLDACFLTYHYLHTPIRSPRSPQMPSVTYPCKADGAKKIPLRIFGLASYKFNGSSLWMRNGGVEHQLANVLSKAADE
Subjt:  LPHSWISVAW-----IPTGQTLKDLDACFLTYHYLHTPIRSPRSPQMPSVTYPCKADGAKKIPLRIFGLASYKFNGSSLWMRNGGVEHQLANVLSKAADE

Query:  WLRSLQVNHPDFQFFSRQM
        WLRSLQVNHPDFQFFSRQM
Subjt:  WLRSLQVNHPDFQFFSRQM

TrEMBL top hits (e value, % identity, alignment)
A0A0A0KR63 Uncharacterized protein | e value: 5.6e-178 | % identity: 77.67
Query:  MLGMGVRFGRGRGEDRFYDSSRARKGLLSRQNDRLRRPQQHASATTPSRAVSDVFVPSSITRRPGGRLESDEATEPVAISNPQPAVSLLSNLERFLQSTT
        MLG GVRFGR +GEDRFYDSSRARKGLLSRQNDRL   QQ ASATTPS A  +V     ++ RP  RL SDEAT+PV        VS+LSNLERFLQS T
Subjt:  MLGMGVRFGRGRGEDRFYDSSRARKGLLSRQNDRLRRPQQHASATTPSRAVSDVFVPSSITRRPGGRLESDEATEPVAISNPQPAVSLLSNLERFLQSTT

Query:  PSVPSQFLSKSAWRGCRMRDSETQPYFVLGDLWEAFKEWSAYGTGVPLLLNNTDGVVQYYVPYLSGIQLYAMESSTRPRRWCEESDS---DSSSDGSSDS
        P VP+QFLSKSA RG R  D ET+PYF+LGDLWEAFKEWSAYG GVPLLLNN+DGVVQYYVPYLSGIQLY MESST+ RRW EESDS   DSSSDGSSDS
Subjt:  PSVPSQFLSKSAWRGCRMRDSETQPYFVLGDLWEAFKEWSAYGTGVPLLLNNTDGVVQYYVPYLSGIQLYAMESSTRPRRWCEESDS---DSSSDGSSDS

Query:  ETKRRIKHSREPPHHSDPFITTPFRMDRLSLRDQHLGLHQDCSSDEAESFNSQGRLLFEYFERDLPYLREPLADKASGFLCFDTISDLASRFPQLKTMRS
        ET RRIKHSREPPHH+DP IT P RMDRLSLRDQHLGLH+DCSSDEAESFNSQGRLLFEY ERDLPYLREPLADK         ISDLASRFPQLKTMRS
Subjt:  ETKRRIKHSREPPHHSDPFITTPFRMDRLSLRDQHLGLHQDCSSDEAESFNSQGRLLFEYFERDLPYLREPLADKASGFLCFDTISDLASRFPQLKTMRS

Query:  CDLLPHSWISVAW-----IPTGQTLKDLDACFLTYHYLHTPIRSPRSPQMPSVTYPCKADGAKKIPLRIFGLASYKFNGSSLWMRNGGVEHQLANVLSKA
        CDLLP+SWISVAW     IPTGQTLKDLDACFLTYH LHTP+R  +SP  P V YPCK +GA+K+PLRIFGLASYKFNGSSLWMRNGGVEHQLAN LS+A
Subjt:  CDLLPHSWISVAW-----IPTGQTLKDLDACFLTYHYLHTPIRSPRSPQMPSVTYPCKADGAKKIPLRIFGLASYKFNGSSLWMRNGGVEHQLANVLSKA

Query:  ADEWLRSLQVNHPDFQFFSRQ
        A+ WLR L VNHPDF FFSR+
Subjt:  ADEWLRSLQVNHPDFQFFSRQ

A0A1S3BG78 uncharacterized protein LOC103489306 | e value: 1.9e-178 | % identity: 77.91
Query:  MLGMGVRFGRGRGEDRFYDSSRARKGLLSRQNDRLRRPQQHASATTPSRAVSDVFVPSSITRRPGGRLESDEATEPVAISNPQPAVSLLSNLERFLQSTT
        MLG GVRFGR +GEDRFYDSSRARKGLLSRQNDRL   QQ ASATTPS A  +V     ++ RP  RL SDEAT+PV        VSLLSNLERFLQS T
Subjt:  MLGMGVRFGRGRGEDRFYDSSRARKGLLSRQNDRLRRPQQHASATTPSRAVSDVFVPSSITRRPGGRLESDEATEPVAISNPQPAVSLLSNLERFLQSTT

Query:  PSVPSQFLSKSAWRGCRMRDSETQPYFVLGDLWEAFKEWSAYGTGVPLLLNNTDGVVQYYVPYLSGIQLYAMESSTRPRRWCEESDS---DSSSDGSSDS
        P VP+QFLSKSA RG R  D ET+PYF+LGDLWEAFKEWSAYG GVPLLLNNTDGVVQYYVPYLSGIQLYAMESST+ RRW EESDS   DSSSDGSSDS
Subjt:  PSVPSQFLSKSAWRGCRMRDSETQPYFVLGDLWEAFKEWSAYGTGVPLLLNNTDGVVQYYVPYLSGIQLYAMESSTRPRRWCEESDS---DSSSDGSSDS

Query:  ETKRRIKHSREPPHHSDPFITTPFRMDRLSLRDQHLGLHQDCSSDEAESFNSQGRLLFEYFERDLPYLREPLADKASGFLCFDTISDLASRFPQLKTMRS
        ET +RIKH+REPPHH+DP IT P RMDRLSLR+QHLGLH+DCSSDEAESFNSQGRLLFEY ERDLPYLREPLADK         ISDLASRFPQLKTMRS
Subjt:  ETKRRIKHSREPPHHSDPFITTPFRMDRLSLRDQHLGLHQDCSSDEAESFNSQGRLLFEYFERDLPYLREPLADKASGFLCFDTISDLASRFPQLKTMRS

Query:  CDLLPHSWISVAW-----IPTGQTLKDLDACFLTYHYLHTPIRSPRSPQMPSVTYPCKADGAKKIPLRIFGLASYKFNGSSLWMRNGGVEHQLANVLSKA
        CDLLP+SWISVAW     IPTGQTLKDLDACFLTYH LHTP+R  +SP +P V YPCK +GA K+PLRIFGLASYKFNGSSLWMRNGGVEHQLAN LS+A
Subjt:  CDLLPHSWISVAW-----IPTGQTLKDLDACFLTYHYLHTPIRSPRSPQMPSVTYPCKADGAKKIPLRIFGLASYKFNGSSLWMRNGGVEHQLANVLSKA

Query:  ADEWLRSLQVNHPDFQFFSRQ
        AD WLR L VNHPDF FFSR+
Subjt:  ADEWLRSLQVNHPDFQFFSRQ

A0A6J1DC61 uncharacterized protein LOC111018737 | e value: 8.7e-179 | % identity: 78.25
Query:  MLGMGVRFGRGRGEDRFYDSSRARKGLLSRQNDRLRRPQQHASATTPSRAVSDVFVPSSITRRPGGRLESDEATEPVAI--SNPQPAVSLLSNLERFLQS
        MLG GVRFGRGRGEDRFYDSSRAR+GLLSRQNDRL RPQ+ ASA TPS  V D  + S IT     R+ SDEAT+PVA+   NPQP VS LSNLERFLQS
Subjt:  MLGMGVRFGRGRGEDRFYDSSRARKGLLSRQNDRLRRPQQHASATTPSRAVSDVFVPSSITRRPGGRLESDEATEPVAI--SNPQPAVSLLSNLERFLQS

Query:  TTPSVPSQFLSKSAWRGCRMRDSETQPYFVLGDLWEAFKEWSAYGTGVPLLLNNTDGVVQYYVPYLSGIQLYAMESSTRPRRWCEESDS---DSSSDGSS
         TPSVP+QF SKS+ RG R  DSETQPYFVLGDLWEAFKEWSAYG GVPLLLNNTDGVVQYYVPYLSGIQLY ME S +PRRW EESDS   DSSSDGSS
Subjt:  TTPSVPSQFLSKSAWRGCRMRDSETQPYFVLGDLWEAFKEWSAYGTGVPLLLNNTDGVVQYYVPYLSGIQLYAMESSTRPRRWCEESDS---DSSSDGSS

Query:  DSETKRRIKHSREPPHHSDPFITTPFRMDRLSLRDQHLGLHQDCSSDEAESFNSQGRLLFEYFERDLPYLREPLADKASGFLCFDTISDLASRFPQLKTM
        DSETKRRIKH+RE  HH+DP IT P R+DRLSLRDQH+GLH+DCSSDEAESFNS+GRLLFEY ERDLPY REPLADK         I DLASRFPQLKTM
Subjt:  DSETKRRIKHSREPPHHSDPFITTPFRMDRLSLRDQHLGLHQDCSSDEAESFNSQGRLLFEYFERDLPYLREPLADKASGFLCFDTISDLASRFPQLKTM

Query:  RSCDLLPHSWISVAW-----IPTGQTLKDLDACFLTYHYLHTPIRSPRSPQMPSVTYPCKADGAKKIPLRIFGLASYKFNGSSLWMRNGGVEHQLANVLS
        RSCDLLP+SWISVAW     IPTGQTLKDLDACFLTYH LHT IR P+S Q+P V YPCK D A+KIPLRIFGLASYKF GSSLWMRNGGVEHQLAN LS
Subjt:  RSCDLLPHSWISVAW-----IPTGQTLKDLDACFLTYHYLHTPIRSPRSPQMPSVTYPCKADGAKKIPLRIFGLASYKFNGSSLWMRNGGVEHQLANVLS

Query:  KAADEWLRSLQVNHPDFQFFSRQ
        +AAD WLR LQVNHPDF FFSR+
Subjt:  KAADEWLRSLQVNHPDFQFFSRQ

A0A6J1GZ27 uncharacterized protein LOC111458799 | e value: 8.0e-233 | % identity: 96.66
Query:  MLGMGVRFGRGRGEDRFYDSSRARKGLLSRQNDRLRRPQQHASATTPSRAVSDVFVPSSITRRPGGRLESDEATEPVAISNPQPAVSLLSNLERFLQSTT
        MLGMGVRFGRGRGEDRFYDSSRARKGLLSRQNDRLRRPQQHASATTPSRAVSDVFVPSSITRRPGGRLESDEATEPVAISNPQPAVSLLSNLERFLQSTT
Subjt:  MLGMGVRFGRGRGEDRFYDSSRARKGLLSRQNDRLRRPQQHASATTPSRAVSDVFVPSSITRRPGGRLESDEATEPVAISNPQPAVSLLSNLERFLQSTT

Query:  PSVPSQFLSKSAWRGCRMRDSETQPYFVLGDLWEAFKEWSAYGTGVPLLLNNTDGVVQYYVPYLSGIQLYAMESSTRPRRWCEESDSDSSSDGSSDSETK
        PSVPSQFLSKSAWRGCRMRDSETQPYFVLGDLWEAFKEWSAYGTGVPLLLNNTDGVVQYYVPYLSGIQLYAMESSTRPRRWCEESDSDSSSDGSSDSETK
Subjt:  PSVPSQFLSKSAWRGCRMRDSETQPYFVLGDLWEAFKEWSAYGTGVPLLLNNTDGVVQYYVPYLSGIQLYAMESSTRPRRWCEESDSDSSSDGSSDSETK

Query:  RRIKHSREPPHHSDPFITTPFRMDRLSLRDQHLGLHQDCSSDEAESFNSQGRLLFEYFERDLPYLREPLADKASGFLCFDTISDLASRFPQLKTMRSCDL
        RRIKHSREPPHHSDPFITTPFRMDRLSLRDQHLGLHQDCSSDEAESFNSQGRLLFEYFERDLPYLREPLADK         ISDLASRFPQLKTMRSCDL
Subjt:  RRIKHSREPPHHSDPFITTPFRMDRLSLRDQHLGLHQDCSSDEAESFNSQGRLLFEYFERDLPYLREPLADKASGFLCFDTISDLASRFPQLKTMRSCDL

Query:  LPHSWISVAW-----IPTGQTLKDLDACFLTYHYLHTPIRSPRSPQMPSVTYPCKADGAKKIPLRIFGLASYKFNGSSLWMRNGGVEHQLANVLSKAADE
        LPHSWISVAW     IPTGQTLKDLDACFLTYHYLHTPIRSPRSPQMPSVTYPCKADGAKKIPLRIFGLASYKFNGSSLWMRNGGVEHQLANVLSKAADE
Subjt:  LPHSWISVAW-----IPTGQTLKDLDACFLTYHYLHTPIRSPRSPQMPSVTYPCKADGAKKIPLRIFGLASYKFNGSSLWMRNGGVEHQLANVLSKAADE

Query:  WLRSLQVNHPDFQFFSRQM
        WLRSLQVNHPDFQFFSRQM
Subjt:  WLRSLQVNHPDFQFFSRQM

A0A6J1IYU8 uncharacterized protein LOC111479669 | e value: 7.7e-220 | % identity: 92.18
Query:  MLGMGVRFGRGRGEDRFYDSSRARKGLLSRQNDRLRRPQQHASATTPSRAVSDVFVPSSITRRPGGRLESDEATEPVAISNPQPAVSLLSNLERFLQSTT
        MLG GVRFGRGRGEDRFYDSSRARKGLLSRQNDRLRRPQQHASATTPSRAV+DVFVPSSIT RPG RLESDEATEPVAISNPQPAVS LSNLERFLQSTT
Subjt:  MLGMGVRFGRGRGEDRFYDSSRARKGLLSRQNDRLRRPQQHASATTPSRAVSDVFVPSSITRRPGGRLESDEATEPVAISNPQPAVSLLSNLERFLQSTT

Query:  PSVPSQFLSKSAWRGCRMRDSETQPYFVLGDLWEAFKEWSAYGTGVPLLLNNTDGVVQYYVPYLSGIQLYAMESSTRPRRWCEESDS---DSSSDGSSDS
        PSVP+QFLSKSA RGCRM DSETQPYFVL DLWEAFKEWSAYG GVPLLLNNTDGVVQYYVPYLSGIQLYAMESSTRPRRWCEESDS   DSSSDGSSDS
Subjt:  PSVPSQFLSKSAWRGCRMRDSETQPYFVLGDLWEAFKEWSAYGTGVPLLLNNTDGVVQYYVPYLSGIQLYAMESSTRPRRWCEESDS---DSSSDGSSDS

Query:  ETKRRIKHSREPPHHSDPFITTPFRMDRLSLRDQHLGLHQDCSSDEAESFNSQGRLLFEYFERDLPYLREPLADKASGFLCFDTISDLASRFPQLKTMRS
        E KRRIKHSREPPHHSDPFITTPFRMDRLSLRDQHLG HQDCSSDEAESFNSQGRLLFEY ERDLPYLREPLADK         ISDLASRFPQLKTMRS
Subjt:  ETKRRIKHSREPPHHSDPFITTPFRMDRLSLRDQHLGLHQDCSSDEAESFNSQGRLLFEYFERDLPYLREPLADKASGFLCFDTISDLASRFPQLKTMRS

Query:  CDLLPHSWISVAW-----IPTGQTLKDLDACFLTYHYLHTPIRSPRSPQMPSVTYPCKADGAKKIPLRIFGLASYKFNGSSLWMRNGGVEHQLANVLSKA
        CDLLPHSWISVAW     IPTGQTLKDLDACFLTYHYLHTPIRSPRSPQ+PSVTYPCK DGAKKIPLRIFGLASYKFNGSSLWMRNGGVEHQLAN LSKA
Subjt:  CDLLPHSWISVAW-----IPTGQTLKDLDACFLTYHYLHTPIRSPRSPQMPSVTYPCKADGAKKIPLRIFGLASYKFNGSSLWMRNGGVEHQLANVLSKA

Query:  ADEWLRSLQVNHPDFQFFSRQM
        ADEWLRSLQVNHPDFQFFSRQM
Subjt:  ADEWLRSLQVNHPDFQFFSRQM

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
AT1G15030.1 Protein of unknown function (DUF789) | e value: 5.7e-90 | % identity: 56.51
Query:  SNLERFLQSTTPSVPSQFLSKSAWRGCRMRDSETQ-PYFVLGDLWEAFKEWSAYGTGVPLLLNNT-DGVVQYYVPYLSGIQLY----AMESSTRPRRWCE
        SN+ERFL S TPSVP+ +LSK+  R     D E+Q PYF+LGD+WE+F EWSAYG GVPL LNN  D V QYYVP LSGIQ+Y    A+ SS + RR  E
Subjt:  SNLERFLQSTTPSVPSQFLSKSAWRGCRMRDSETQ-PYFVLGDLWEAFKEWSAYGTGVPLLLNNT-DGVVQYYVPYLSGIQLY----AMESSTRPRRWCE

Query:  ESDS---DSSSDGSSDSETKRRIKHSREPPHHSDPFITTPFRMDRLSLRDQHLGLHQDCSSDEAESFNSQGRLLFEYFERDLPYLREPLADKASGFLCFD
        ES+S   DSSS+GSS SE++R + +S+E             RMD+LSLR +H    +D SSD+ E  +SQGRL+FEY ERDLPY+REP ADK        
Subjt:  ESDS---DSSSDGSSDSETKRRIKHSREPPHHSDPFITTPFRMDRLSLRDQHLGLHQDCSSDEAESFNSQGRLLFEYFERDLPYLREPLADKASGFLCFD

Query:  TISDLASRFPQLKTMRSCDLLPHSWISVAW-----IPTGQTLKDLDACFLTYHYLHTPIRSPRSPQMPSVTYPCKADGAKKIPLRIFGLASYKFNGSSLW
         +SDLASRFP+LKT+RSCDLLP SW SVAW     IPTG TLKDLDACFLTYH LHTP + P      S+      +  +K+ L +FGLASYK  G S+W
Subjt:  TISDLASRFPQLKTMRSCDLLPHSWISVAW-----IPTGQTLKDLDACFLTYHYLHTPIRSPRSPQMPSVTYPCKADGAKKIPLRIFGLASYKFNGSSLW

Query:  MRNGGVEHQLANVLSKAADEWLRSLQVNHPDFQFFSRQ
           GG  HQLAN L +AAD WLR  QVNHPDF FF R+
Subjt:  MRNGGVEHQLANVLSKAADEWLRSLQVNHPDFQFFSRQ

AT2G01260.1 Protein of unknown function (DUF789) | e value: 5.5e-93 | % identity: 49.42
Query:  MLGMGVRFGRGR-GEDRFYDSSRARKGLLSRQNDRLRRPQQHASATTPSRAVSDVFVPSSITRRPGGRLESDEATEPVAISNPQPAVSLLSNLERFLQST
        MLG G +  RGR G+D FY S++ R+   +++ D+LRR Q   S   PS A S                   +  EP  +S+        SNL+RFL+S 
Subjt:  MLGMGVRFGRGR-GEDRFYDSSRARKGLLSRQNDRLRRPQQHASATTPSRAVSDVFVPSSITRRPGGRLESDEATEPVAISNPQPAVSLLSNLERFLQST

Query:  TPSVPSQFLSKSAWRGCRMRD--SETQPYFVLGDLWEAFKEWSAYGTGVPLLLNNT-DGVVQYYVPYLSGIQLY----AMESSTRPRRWCEESDS---DS
        TPSVP+QFLSK+  R  R  D  ++  PYFVLGD+W++F EWSAYGTGVPL+LNN  D V+QYYVP LS IQ+Y    A++SS + RR  + SDS   DS
Subjt:  TPSVPSQFLSKSAWRGCRMRD--SETQPYFVLGDLWEAFKEWSAYGTGVPLLLNNT-DGVVQYYVPYLSGIQLY----AMESSTRPRRWCEESDS---DS

Query:  SSDGSSDSETKRRIKHSREPPHHSDPFITTPFRMDRLSLRDQHLGLHQDCSSDEAESFNSQGRLLFEYFERDLPYLREPLADKASGFLCFDTISDLASRF
        SSD SSDS+++R                    R+D +SLRDQH    +D SSD+ E   SQGRL+FEY ERDLPY+REP ADK         + DLA++F
Subjt:  SSDGSSDSETKRRIKHSREPPHHSDPFITTPFRMDRLSLRDQHLGLHQDCSSDEAESFNSQGRLLFEYFERDLPYLREPLADKASGFLCFDTISDLASRF

Query:  PQLKTMRSCDLLPHSWISVAW-----IPTGQTLKDLDACFLTYHYLHTPIRSPRSPQMPSVTYPCKADGAKKIPLRIFGLASYKFNGSSLWMRNGGVEHQ
        P+L T+RSCDLL  SW SVAW     IPTG TLKDLDACFLTYH LHT      S Q  S+T P +++   K+ L +FGLASYKF G SLW   GG EHQ
Subjt:  PQLKTMRSCDLLPHSWISVAW-----IPTGQTLKDLDACFLTYHYLHTPIRSPRSPQMPSVTYPCKADGAKKIPLRIFGLASYKFNGSSLWMRNGGVEHQ

Query:  LANVLSKAADEWLRSLQVNHPDFQFFSRQ
        L N L +AAD+WL S  V+HPDF FF R+
Subjt:  LANVLSKAADEWLRSLQVNHPDFQFFSRQ

AT2G01260.2 Protein of unknown function (DUF789) | e value: 2.0e-71 | % identity: 49.14
Query:  MLGMGVRFGRGR-GEDRFYDSSRARKGLLSRQNDRLRRPQQHASATTPSRAVSDVFVPSSITRRPGGRLESDEATEPVAISNPQPAVSLLSNLERFLQST
        MLG G +  RGR G+D FY S++ R+   +++ D+LRR Q   S   PS A S                   +  EP  +S+        SNL+RFL+S 
Subjt:  MLGMGVRFGRGR-GEDRFYDSSRARKGLLSRQNDRLRRPQQHASATTPSRAVSDVFVPSSITRRPGGRLESDEATEPVAISNPQPAVSLLSNLERFLQST

Query:  TPSVPSQFLSKSAWRGCRMRD--SETQPYFVLGDLWEAFKEWSAYGTGVPLLLNNT-DGVVQYYVPYLSGIQLY----AMESSTRPRRWCEESDS---DS
        TPSVP+QFLSK+  R  R  D  ++  PYFVLGD+W++F EWSAYGTGVPL+LNN  D V+QYYVP LS IQ+Y    A++SS + RR  + SDS   DS
Subjt:  TPSVPSQFLSKSAWRGCRMRD--SETQPYFVLGDLWEAFKEWSAYGTGVPLLLNNT-DGVVQYYVPYLSGIQLY----AMESSTRPRRWCEESDS---DS

Query:  SSDGSSDSETKRRIKHSREPPHHSDPFITTPFRMDRLSLRDQHLGLHQDCSSDEAESFNSQGRLLFEYFERDLPYLREPLADKASGFLCFDTISDLASRF
        SSD SSDS+++R                    R+D +SLRDQH    +D SSD+ E   SQGRL+FEY ERDLPY+REP ADK         + DLA++F
Subjt:  SSDGSSDSETKRRIKHSREPPHHSDPFITTPFRMDRLSLRDQHLGLHQDCSSDEAESFNSQGRLLFEYFERDLPYLREPLADKASGFLCFDTISDLASRF

Query:  PQLKTMRSCDLLPHSWISVAW-----IPTGQTLKDLDACFLTYHYLHT
        P+L T+RSCDLL  SW SVAW     IPTG TLKDLDACFLTYH LHT
Subjt:  PQLKTMRSCDLLPHSWISVAW-----IPTGQTLKDLDACFLTYHYLHT

AT4G16100.1 Protein of unknown function (DUF789) | e value: 1.5e-69 | % identity: 42.2
Query:  RGRGEDRFYDSSRARKGLLSRQNDRLRRPQQHASATTPSRAVS-DVFVPSSITRRPGGRLESDEATEPVAISNPQPAVSLLSNLERFLQSTTPSVPSQFL
        R RGE+RFY+    RK    R+  RL   +           +   + V     ++P     SD +      S      +  SNL RFL  TTP V +Q L
Subjt:  RGRGEDRFYDSSRARKGLLSRQNDRLRRPQQHASATTPSRAVS-DVFVPSSITRRPGGRLESDEATEPVAISNPQPAVSLLSNLERFLQSTTPSVPSQFL

Query:  SKSAWRGCRMRDSETQPYFVLGDLWEAFKEWSAYGTGVPLLLNNTDGVVQYYVPYLSGIQLY--AMESSTRPRRWCEESDSDSSSDGSSDSETKRRIKHS
          ++ +G R R+ E +PYF+L DLW++F+EWSAYG GVPLLLN  D VVQYYVPYLSGIQLY     + T  RR  EESD DS  D SSD     R    
Subjt:  SKSAWRGCRMRDSETQPYFVLGDLWEAFKEWSAYGTGVPLLLNNTDGVVQYYVPYLSGIQLY--AMESSTRPRRWCEESDSDSSSDGSSDSETKRRIKHS

Query:  REPPHHSDPFITTPFRMDRLSLRDQHLGLHQDCSSDEAE-SFNSQGRLLFEYFERDLPYLREPLADKASGFLCFDTISDLASRFPQLKTMRSCDLLPHSW
                        + R SL ++        SSDE+E S NS G L+FEY E  +P+ REPL DK         IS+L+S+FP L+T RSCDL P SW
Subjt:  REPPHHSDPFITTPFRMDRLSLRDQHLGLHQDCSSDEAE-SFNSQGRLLFEYFERDLPYLREPLADKASGFLCFDTISDLASRFPQLKTMRSCDLLPHSW

Query:  ISVAW-----IPTGQTLKDLDACFLTYHYLHTPIRSPRSPQMPSVTYPCKADGAKKIPLRIFGLASYKFNGSSLWMRNGGVEHQLANVLSKAADEWLRSL
        +SVAW     IP GQ+L++LDACFLT+H L TP R   + +  S +   K+  + K+PL  FGLASYKF  S     +   E+Q    L + A+EWLR L
Subjt:  ISVAW-----IPTGQTLKDLDACFLTYHYLHTPIRSPRSPQMPSVTYPCKADGAKKIPLRIFGLASYKFNGSSLWMRNGGVEHQLANVLSKAADEWLRSL

Query:  QVNHPDFQFF
        +V  PDF+ F
Subjt:  QVNHPDFQFF

AT5G49220.1 Protein of unknown function (DUF789) | e value: 1.0e-67 | % identity: 41.78
Query:  RGEDRFYDSSRARK-----GLLSRQNDRLRRPQQHASATTPSRAVSDVFVPSSITRRPGGRLESDE----------ATEPVAISNPQPAVSLLSNLERFL
        RGE+RFY+    R+      L  +  ++ RR  +        R  +    P + TR+  G  ES            A    + S     +S  SNL+RFL
Subjt:  RGEDRFYDSSRARK-----GLLSRQNDRLRRPQQHASATTPSRAVSDVFVPSSITRRPGGRLESDE----------ATEPVAISNPQPAVSLLSNLERFL

Query:  QSTTPSVPSQ-FLSKSAWRGCRMRDSETQPYFVLGDLWEAFKEWSAYGTGV-----PLLLNNTDGVVQYYVPYLSGIQLYAMESSTRPRRWCEESDSDSS
        + TTP VP++ F  +S W   + R+S+   YFVL DLWE+F EWSAYG GV     PL ++  D  VQYYVPYLSGIQLY ++   +PR      D++ S
Subjt:  QSTTPSVPSQ-FLSKSAWRGCRMRDSETQPYFVLGDLWEAFKEWSAYGTGV-----PLLLNNTDGVVQYYVPYLSGIQLYAMESSTRPRRWCEESDSDSS

Query:  SDGSSDSETKRRIKHSREPPHHSDPFITTPFRMDRLSLRDQHLGLHQDCSSDEAESFNSQGRLLFEYFERDLPYLREPLADKASGFLCFDTISDLASRFP
        S+GSS+S T               P   +   ++R+SL+DQ   +    SS EAE  N QGRLLFEY E + P+ REPLA+K         ISDLASR P
Subjt:  SDGSSDSETKRRIKHSREPPHHSDPFITTPFRMDRLSLRDQHLGLHQDCSSDEAESFNSQGRLLFEYFERDLPYLREPLADKASGFLCFDTISDLASRFP

Query:  QLKTMRSCDLLPHSWISVAW-----IPTGQTLKDLDACFLTYHYLHTPIRSPRSPQMPSVTYPCKADGAKKIPLRIFGLASYKFNGSSLWMRNGGVEHQL
        +L T RSCDLLP SW+SV+W     IP G TL++LDACFLT+H L T    P+S    S + P     + K+PL  FGLASYK    S+W +N   E Q 
Subjt:  QLKTMRSCDLLPHSWISVAW-----IPTGQTLKDLDACFLTYHYLHTPIRSPRSPQMPSVTYPCKADGAKKIPLRIFGLASYKFNGSSLWMRNGGVEHQL

Query:  ANVLSKAADEWLRSLQVNHPDFQFFS
           L +AAD+WL+ LQV+HPD++FF+
Subjt:  ANVLSKAADEWLRSLQVNHPDFQFFS
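
In BLAST-style pairwise output, the unlabeled middle line of each alignment block marks identical residues with the residue letter, conservative substitutions with "+", and mismatches or gaps with a space. As a rough sketch under that assumption, the hypothetical helper `midline_stats` below counts these for a single block. It uses the final block of the first GenBank hit (KAG6601276.1) as input, and it assumes the midline has not lost trailing spaces.

```python
def midline_stats(midline: str) -> tuple[int, int, int]:
    """Return (identical, positive, length) for one alignment midline."""
    # Letters on the midline mark identical residues.
    identical = sum(ch.isalpha() for ch in midline)
    # '+' marks a conservative substitution; positives include identities.
    positive = identical + midline.count("+")
    # Assumption: the midline is padded to the full block width.
    return identical, positive, len(midline)


identical, positive, length = midline_stats("WLRSLQVNHPDFQFFSRQ+")
print(f"{identical}/{length} identical, {positive}/{length} positive")
```

The reported % identity for a hit is computed over the whole alignment, so the per-block counts would need to be summed across all blocks before dividing.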


Sequences
CDS sequence
ATGCTTGGAATGGGTGTACGCTTTGGTCGCGGTAGGGGAGAGGACCGGTTCTACGATTCATCGAGAGCGAGGAAGGGCCTTCTCAGCCGTCAAAATGATAGGCTCCGTAG
ACCTCAACAACACGCTTCCGCTACTACTCCATCCCGCGCGGTTAGTGATGTTTTCGTGCCTTCTTCTATTACTAGACGGCCTGGGGGCCGTCTGGAGTCTGATGAAGCTA
CTGAACCAGTTGCCATTTCTAATCCCCAGCCTGCTGTTTCACTGTTGAGTAATCTCGAGCGCTTCTTACAATCGACTACTCCGTCTGTGCCTTCTCAGTTTCTCTCTAAG
AGTGCGTGGAGAGGTTGTAGAATGCGTGATTCGGAGACGCAGCCTTACTTCGTGCTTGGGGATTTGTGGGAGGCTTTCAAGGAGTGGAGTGCTTATGGGACAGGAGTGCC
TCTTTTATTGAACAACACTGATGGTGTGGTTCAGTATTATGTCCCGTATTTATCTGGCATACAATTGTATGCCATGGAATCGTCTACAAGGCCAAGGCGATGGTGTGAGG
AAAGTGACAGTGATTCAAGTAGTGATGGTAGTAGTGATTCTGAAACAAAGAGAAGAATAAAACACAGTAGAGAACCACCCCACCATAGTGATCCGTTTATCACAACTCCT
TTTAGAATGGATAGATTGTCTTTGAGGGATCAGCACTTGGGACTTCATCAGGACTGCTCTAGTGATGAGGCTGAATCTTTCAATTCTCAAGGTCGCCTTCTATTCGAGTA
TTTTGAAAGAGACCTACCGTATTTACGTGAGCCTTTGGCTGATAAGGCAAGTGGCTTTCTGTGTTTTGATACTATATCGGACCTTGCTTCACGCTTCCCTCAGTTGAAAA
CAATGAGAAGCTGTGACCTGCTACCACATAGTTGGATATCTGTGGCATGGATACCAACGGGGCAAACATTAAAGGATCTTGATGCTTGCTTTCTCACATACCATTATCTA
CACACACCAATCAGAAGCCCTCGAAGCCCACAAATGCCATCTGTGACATATCCTTGTAAGGCGGATGGTGCCAAAAAGATTCCTTTACGAATTTTTGGACTTGCTTCATA
CAAGTTCAACGGGTCGTCATTGTGGATGCGCAATGGTGGAGTTGAGCATCAATTGGCAAACGTCCTTTCGAAGGCAGCTGATGAGTGGTTAAGATCTCTCCAGGTCAATC
ACCCAGACTTCCAATTCTTCAGCCGCCAGATGTAA
mRNA sequence
TCTCTCCTAACCCGAATCGTCTTTTTCGTCGAACCGTTTTGATCTTTTGAAACCCGCCTCTTCCTATTCGATTCGACTGTGTCTTCATTGCTACTGTTTCTTGGATTGCT
TCTCTTGTTTGTGGAATTTCTTCGTCGGCGATCTGATTTCCCTGGCAGATCCGACAAATTGCTGTTTCTGCTGTTTCGATTTCGGATACTGAGATCGGTCCTCACTTTCT
TCCTGTACCACGATTACTTCCACTTGCAGAGTAGTGTTCCTCACTTGGTGCTAGACTTTCGAGATGCTTGGAATGGGTGTACGCTTTGGTCGCGGTAGGGGAGAGGACCG
GTTCTACGATTCATCGAGAGCGAGGAAGGGCCTTCTCAGCCGTCAAAATGATAGGCTCCGTAGACCTCAACAACACGCTTCCGCTACTACTCCATCCCGCGCGGTTAGTG
ATGTTTTCGTGCCTTCTTCTATTACTAGACGGCCTGGGGGCCGTCTGGAGTCTGATGAAGCTACTGAACCAGTTGCCATTTCTAATCCCCAGCCTGCTGTTTCACTGTTG
AGTAATCTCGAGCGCTTCTTACAATCGACTACTCCGTCTGTGCCTTCTCAGTTTCTCTCTAAGAGTGCGTGGAGAGGTTGTAGAATGCGTGATTCGGAGACGCAGCCTTA
CTTCGTGCTTGGGGATTTGTGGGAGGCTTTCAAGGAGTGGAGTGCTTATGGGACAGGAGTGCCTCTTTTATTGAACAACACTGATGGTGTGGTTCAGTATTATGTCCCGT
ATTTATCTGGCATACAATTGTATGCCATGGAATCGTCTACAAGGCCAAGGCGATGGTGTGAGGAAAGTGACAGTGATTCAAGTAGTGATGGTAGTAGTGATTCTGAAACA
AAGAGAAGAATAAAACACAGTAGAGAACCACCCCACCATAGTGATCCGTTTATCACAACTCCTTTTAGAATGGATAGATTGTCTTTGAGGGATCAGCACTTGGGACTTCA
TCAGGACTGCTCTAGTGATGAGGCTGAATCTTTCAATTCTCAAGGTCGCCTTCTATTCGAGTATTTTGAAAGAGACCTACCGTATTTACGTGAGCCTTTGGCTGATAAGG
CAAGTGGCTTTCTGTGTTTTGATACTATATCGGACCTTGCTTCACGCTTCCCTCAGTTGAAAACAATGAGAAGCTGTGACCTGCTACCACATAGTTGGATATCTGTGGCA
TGGATACCAACGGGGCAAACATTAAAGGATCTTGATGCTTGCTTTCTCACATACCATTATCTACACACACCAATCAGAAGCCCTCGAAGCCCACAAATGCCATCTGTGAC
ATATCCTTGTAAGGCGGATGGTGCCAAAAAGATTCCTTTACGAATTTTTGGACTTGCTTCATACAAGTTCAACGGGTCGTCATTGTGGATGCGCAATGGTGGAGTTGAGC
ATCAATTGGCAAACGTCCTTTCGAAGGCAGCTGATGAGTGGTTAAGATCTCTCCAGGTCAATCACCCAGACTTCCAATTCTTCAGCCGCCAGATGTAACGCCTTACTGAT
GCAATATACTCCTACCATCTAGAAACGAAGATTGACATGCTGTGGCCCTGGAAAACAATCCCATAATTTGGTGGATTTCCTGTTTGCTTGGTGAAGAGATGATTCAGGAG
GACAGGAAGAGGAAGGCCTCTCGGCCTTAGAAGGTCGGGAAGCACATGCATGATGCAGTAACAAAGACCGATAGAAAGCAGAAAACATTGGTGGTAAGGCACGCTGGTTT
AGCATGGTGTAGGGATGCTATAGTTCGAATAAGCAGGCGCTTTTTTTCCTTATTTTTCCCGAAGGCCGGCCTTCATTTGAGAAATCAGGCCTGCCTCCCTTCCTTTAACC
TCTCTGACCGACAGTCATTTTGTAAAAATACAGTTTTGTTTAAAGGGGGGATTGGATCTGGGATGCATTTAGGCATTCAACTTGAACTGTACTGCTGAGCATCCGTGACA
ATATATTTAAACATCTTTTAATCAATTTGTTACTGTTAATAGAACGAAGCATATTCTTATTACTTCCTACAGCCGGTGAGACAACCGGAAAACAGGGACTTCAATATTCC
AAGCAGCGGTCCCACTTGCATAAACAAAAACAGCACTCGTGTAATAAACCGACCACCTTAACAACCCAATTAGATCTCGCCACGTCTACTACACCAATTTTAACGAAAGT
TATTGGTTATAATTATAGACCCCACAGTAAAGATAAAAGCAGAGAATCAAAACTCAGTAGTCAGAGTGGTCACTAAAT
Protein sequence
MLGMGVRFGRGRGEDRFYDSSRARKGLLSRQNDRLRRPQQHASATTPSRAVSDVFVPSSITRRPGGRLESDEATEPVAISNPQPAVSLLSNLERFLQSTTPSVPSQFLSK
SAWRGCRMRDSETQPYFVLGDLWEAFKEWSAYGTGVPLLLNNTDGVVQYYVPYLSGIQLYAMESSTRPRRWCEESDSDSSSDGSSDSETKRRIKHSREPPHHSDPFITTP
FRMDRLSLRDQHLGLHQDCSSDEAESFNSQGRLLFEYFERDLPYLREPLADKASGFLCFDTISDLASRFPQLKTMRSCDLLPHSWISVAWIPTGQTLKDLDACFLTYHYL
HTPIRSPRSPQMPSVTYPCKADGAKKIPLRIFGLASYKFNGSSLWMRNGGVEHQLANVLSKAADEWLRSLQVNHPDFQFFSRQM
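
The protein sequence should be the translation of the CDS sequence listed above it. A minimal sketch of that check follows, assuming the standard nuclear genetic code. The `translate` helper and the table construction are illustrative and are not part of the database. Only the first 36 and last 12 bases of the CDS are used here to keep the example short.

```python
from itertools import product

# Standard genetic code, codons ordered with bases T, C, A, G (first base slowest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks from wrapped sequence text
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # 'X' for codons with ambiguous bases
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


print(translate("ATGCTTGGAATGGGTGTACGCTTTGGTCGCGGTAGG"))  # start of the CDS
print(translate("CGCCAGATGTAA"))  # last four codons of the CDS
```

Passing the full CDS, with its line breaks left in, should give a string that can be compared directly with the protein sequence after joining its wrapped lines.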