CuGenDBv2

Pay0003221 (gene) of Melon (Payzawat) v1 genome

Gene ID: Pay0003221
Organism: Cucumis melo var. inodorus cv. Payzawat (Melon (Payzawat) v1)
Description: Protein of unknown function (DUF789)
Genome location: chr06:4856584..4861050
RNA-Seq Expression: Pay0003221
Synteny: Pay0003221
Gene Ontology terms: NA
InterPro domains: IPR008507 - Protein of unknown function DUF789


Homology
GenBank top hits (columns: hit, e value, % identity, alignment)
XP_004134231.3 uncharacterized protein LOC101208769 isoform X1 [Cucumis sativus] (e value: 6.6e-237; % identity: 97.66)
Query:  MSVSGGVSIARIRGENRFYHPPAMRRRL----QQQQQQQQQQQQQQQQQPKQSALDSKDVVAAATSTIDDLEKRSEFDECRSWSTRSDCSVSDRGLVDST
        MSVSGGVSIARIRGENRFYHPPAMRRRL    QQQQQQQQQQQQQQQQQPKQSALDSKDVVAAATSTIDDLEKRSEFDECRSWSTRSDCSVSDRGL DST
Subjt:  MSVSGGVSIARIRGENRFYHPPAMRRRL----QQQQQQQQQQQQQQQQQPKQSALDSKDVVAAATSTIDDLEKRSEFDECRSWSTRSDCSVSDRGLVDST

Query:  NLDRFLEHTTPLVPAHCIPKTSLRGWRNREVSEASPYFVLGDLWESFKEWSAYGAGIPLLLNGSDSVVQYYVPYLSGIQLYVDPSKSSALSRRRGADSDA
        NLDRFLEHTTPLVPAHCIPKTSLRGWRNREVSEASPYFVLGDLWESFKEWSAYGAGIPLLLNGSDSVVQYYVPYLSGIQLYVDPSKSSALSRRRGADSDA
Subjt:  NLDRFLEHTTPLVPAHCIPKTSLRGWRNREVSEASPYFVLGDLWESFKEWSAYGAGIPLLLNGSDSVVQYYVPYLSGIQLYVDPSKSSALSRRRGADSDA

Query:  ESSKETSSDGSSNSGAEKKTKTALQNEWIQDFNALGSQRALQMNVPSSESSSDESDSCYRHGQLVFEYLERDPPFCREPLTDKITVLASRFPELKTYRSF
        ESSKETSSDGSSNSGAEKKTKTALQNEWIQDFN  GSQRALQMNVPSSESSSDESDSCYRHGQLVFEYLERDPPFCREPLTDKITVLASRF ELKTYRS 
Subjt:  ESSKETSSDGSSNSGAEKKTKTALQNEWIQDFNALGSQRALQMNVPSSESSSDESDSCYRHGQLVFEYLERDPPFCREPLTDKITVLASRFPELKTYRSF

Query:  DLSPSSWISVAWYPIYRIPTGPTLQSLDACFLTFHNLSTAFQGNSTDGLQFHWPRVREVYTADCPLKLQLPIFGLASYKFKIPFWNSTGAEECSKAHSLW
        DLSPSSWISVAWYPIYRIPTGPTLQSLDACFLTFHNLSTAFQG STDGLQFHWPRVREVYTADCPLKLQLPIFGLASYKFKIPFWNSTGAEECSKAHSLW
Subjt:  DLSPSSWISVAWYPIYRIPTGPTLQSLDACFLTFHNLSTAFQGNSTDGLQFHWPRVREVYTADCPLKLQLPIFGLASYKFKIPFWNSTGAEECSKAHSLW

Query:  QDADSWLRLLNVNHPDYRFFASHNSFWR
        QDADSWLRLLNVNHPDYRFFASHNSFWR
Subjt:  QDADSWLRLLNVNHPDYRFFASHNSFWR

XP_008438916.1 PREDICTED: uncharacterized protein LOC103483873 isoform X1 [Cucumis melo] (e value: 9.2e-239; % identity: 98.82)
Query:  MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQQQQQQQQQQPKQSALDSKDVVAAATSTIDDLEKRSEFDECRSWSTRSDCSVSDRGLVDSTNLDR
        MSVSGGVSIARIRGENRFYHPPAMRRRL QQQQQQQQQQQQQQQQPKQSALDSKDVVAAATSTIDDLEKRSEFDECRSWSTRSDCSVSDRGLVDSTNLDR
Subjt:  MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQQQQQQQQQQPKQSALDSKDVVAAATSTIDDLEKRSEFDECRSWSTRSDCSVSDRGLVDSTNLDR

Query:  FLEHTTPLVPAHCIPKTSLRGWRNREVSEASPYFVLGDLWESFKEWSAYGAGIPLLLNGSDSVVQYYVPYLSGIQLYVDPSKSSALSRRRGADSDAESSK
        FLEHTTPLVPAHCIPKTSLRGWRNREVSEASPYFVLGDLWESFKEWSAYGAGIPLLLNGSDSVVQYYVPYLSGIQLYVDPSKS ALSRRRGADSDAESSK
Subjt:  FLEHTTPLVPAHCIPKTSLRGWRNREVSEASPYFVLGDLWESFKEWSAYGAGIPLLLNGSDSVVQYYVPYLSGIQLYVDPSKSSALSRRRGADSDAESSK

Query:  ETSSDGSSNSGAEKKTKTALQNEWIQDFNALGSQRALQMNVPSSESSSDESDSCYRHGQLVFEYLERDPPFCREPLTDKITVLASRFPELKTYRSFDLSP
        ETSSDGSSNSGAEKKTKTALQNEWIQDFNALGSQRALQMNVPSSESSSDESDSCYRHGQLVFEYLERDPPFCREPLTDKITVLASRFPELKTYRS DLSP
Subjt:  ETSSDGSSNSGAEKKTKTALQNEWIQDFNALGSQRALQMNVPSSESSSDESDSCYRHGQLVFEYLERDPPFCREPLTDKITVLASRFPELKTYRSFDLSP

Query:  SSWISVAWYPIYRIPTGPTLQSLDACFLTFHNLSTAFQGNSTDGLQFHWPRVREVYTADCPLKLQLPIFGLASYKFKIPFWNSTGAEECSKAHSLWQDAD
        SSWISVAWYPIYRIPTGPTLQSLDACFLTFHNLSTA QG STDGLQFHWPRVREVYTADCPLKLQLPIFGLASYKFKIPFWNSTGAEECSKAHSLWQDAD
Subjt:  SSWISVAWYPIYRIPTGPTLQSLDACFLTFHNLSTAFQGNSTDGLQFHWPRVREVYTADCPLKLQLPIFGLASYKFKIPFWNSTGAEECSKAHSLWQDAD

Query:  SWLRLLNVNHPDYRFFASHNSFWR
        SWLRLLNVNHPDYRFFASHNSFWR
Subjt:  SWLRLLNVNHPDYRFFASHNSFWR

XP_008438917.1 PREDICTED: uncharacterized protein LOC103483873 isoform X2 [Cucumis melo] (e value: 5.1e-237; % identity: 98.58)
Query:  MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQQQQQQQQQQPKQSALDSKDVVAAATSTIDDLEKRSEFDECRSWSTRSDCSVSDRGLVDSTNLDR
        MSVSGGVSIARIRGENRFYHPPAMRRRL QQQQQQQQQQQQQQQQPKQSALDSKDVVAAATSTIDDLEKRSEFDECRSWSTRSDCSVSDRGLVDSTNLDR
Subjt:  MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQQQQQQQQQQPKQSALDSKDVVAAATSTIDDLEKRSEFDECRSWSTRSDCSVSDRGLVDSTNLDR

Query:  FLEHTTPLVPAHCIPKTSLRGWRNREVSEASPYFVLGDLWESFKEWSAYGAGIPLLLNGSDSVVQYYVPYLSGIQLYVDPSKSSALSRRRGADSDAESSK
        FLEHTTPLVPAHCIPKTSLRGWRNREVSEASPYFVLGDLWESFKEWSAYGAGIPLLLNGSDSVVQYYVPYLSGIQLYVDPSKS AL RRRGADSDAESSK
Subjt:  FLEHTTPLVPAHCIPKTSLRGWRNREVSEASPYFVLGDLWESFKEWSAYGAGIPLLLNGSDSVVQYYVPYLSGIQLYVDPSKSSALSRRRGADSDAESSK

Query:  ETSSDGSSNSGAEKKTKTALQNEWIQDFNALGSQRALQMNVPSSESSSDESDSCYRHGQLVFEYLERDPPFCREPLTDKITVLASRFPELKTYRSFDLSP
        ETSSDGSSNSGAEKKTKTALQNEWIQDFNALGSQRALQMNVPSSESSSDESDSCYRHGQLVFEYLERDPPFCREPLTDKITVLASRFPELKTYRS DLSP
Subjt:  ETSSDGSSNSGAEKKTKTALQNEWIQDFNALGSQRALQMNVPSSESSSDESDSCYRHGQLVFEYLERDPPFCREPLTDKITVLASRFPELKTYRSFDLSP

Query:  SSWISVAWYPIYRIPTGPTLQSLDACFLTFHNLSTAFQGNSTDGLQFHWPRVREVYTADCPLKLQLPIFGLASYKFKIPFWNSTGAEECSKAHSLWQDAD
        SSWISVAWYPIYRIPTGPTLQSLDACFLTFHNLSTA QG STDGLQFHWPRVREVYTADCPLKLQLPIFGLASYKFKIPFWNSTGAEECSKAHSLWQDAD
Subjt:  SSWISVAWYPIYRIPTGPTLQSLDACFLTFHNLSTAFQGNSTDGLQFHWPRVREVYTADCPLKLQLPIFGLASYKFKIPFWNSTGAEECSKAHSLWQDAD

Query:  SWLRLLNVNHPDYRFFASHNSFWR
        SWLRLLNVNHPDYRFFASHNSFWR
Subjt:  SWLRLLNVNHPDYRFFASHNSFWR

XP_011651067.2 uncharacterized protein LOC101208769 isoform X2 [Cucumis sativus] (e value: 3.6e-235; % identity: 97.43)
Query:  MSVSGGVSIARIRGENRFYHPPAMRRRL----QQQQQQQQQQQQQQQQQPKQSALDSKDVVAAATSTIDDLEKRSEFDECRSWSTRSDCSVSDRGLVDST
        MSVSGGVSIARIRGENRFYHPPAMRRRL    QQQQQQQQQQQQQQQQQPKQSALDSKDVVAAATSTIDDLEKRSEFDECRSWSTRSDCSVSDRGL DST
Subjt:  MSVSGGVSIARIRGENRFYHPPAMRRRL----QQQQQQQQQQQQQQQQQPKQSALDSKDVVAAATSTIDDLEKRSEFDECRSWSTRSDCSVSDRGLVDST

Query:  NLDRFLEHTTPLVPAHCIPKTSLRGWRNREVSEASPYFVLGDLWESFKEWSAYGAGIPLLLNGSDSVVQYYVPYLSGIQLYVDPSKSSALSRRRGADSDA
        NLDRFLEHTTPLVPAHCIPKTSLRGWRNREVSEASPYFVLGDLWESFKEWSAYGAGIPLLLNGSDSVVQYYVPYLSGIQLYVDPSKSSAL RRRGADSDA
Subjt:  NLDRFLEHTTPLVPAHCIPKTSLRGWRNREVSEASPYFVLGDLWESFKEWSAYGAGIPLLLNGSDSVVQYYVPYLSGIQLYVDPSKSSALSRRRGADSDA

Query:  ESSKETSSDGSSNSGAEKKTKTALQNEWIQDFNALGSQRALQMNVPSSESSSDESDSCYRHGQLVFEYLERDPPFCREPLTDKITVLASRFPELKTYRSF
        ESSKETSSDGSSNSGAEKKTKTALQNEWIQDFN  GSQRALQMNVPSSESSSDESDSCYRHGQLVFEYLERDPPFCREPLTDKITVLASRF ELKTYRS 
Subjt:  ESSKETSSDGSSNSGAEKKTKTALQNEWIQDFNALGSQRALQMNVPSSESSSDESDSCYRHGQLVFEYLERDPPFCREPLTDKITVLASRFPELKTYRSF

Query:  DLSPSSWISVAWYPIYRIPTGPTLQSLDACFLTFHNLSTAFQGNSTDGLQFHWPRVREVYTADCPLKLQLPIFGLASYKFKIPFWNSTGAEECSKAHSLW
        DLSPSSWISVAWYPIYRIPTGPTLQSLDACFLTFHNLSTAFQG STDGLQFHWPRVREVYTADCPLKLQLPIFGLASYKFKIPFWNSTGAEECSKAHSLW
Subjt:  DLSPSSWISVAWYPIYRIPTGPTLQSLDACFLTFHNLSTAFQGNSTDGLQFHWPRVREVYTADCPLKLQLPIFGLASYKFKIPFWNSTGAEECSKAHSLW

Query:  QDADSWLRLLNVNHPDYRFFASHNSFWR
        QDADSWLRLLNVNHPDYRFFASHNSFWR
Subjt:  QDADSWLRLLNVNHPDYRFFASHNSFWR

XP_038877692.1 uncharacterized protein LOC120069924 [Benincasa hispida] (e value: 3.2e-231; % identity: 95.07)
Query:  MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQQQQQQQQQQP--KQSALDSKDVVAAATSTIDDLEKRSEFDECRSWSTRSDCSVSDRGLVDSTNL
        MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQQQQQQQQQQ   KQS LDSKDV+ A+T+TIDDLEKRSEFDECRSWSTRSDCSVSDRGL DSTNL
Subjt:  MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQQQQQQQQQQP--KQSALDSKDVVAAATSTIDDLEKRSEFDECRSWSTRSDCSVSDRGLVDSTNL

Query:  DRFLEHTTPLVPAHCIPKTSLRGWRNREVSEASPYFVLGDLWESFKEWSAYGAGIPLLLNGSDSVVQYYVPYLSGIQLYVDPSKSSALSRRRGADSDAES
        DRFLEHTTPLVPAHCIPKTSLRGWRNREV EASPYFVLGDLWESFKEWSAYGAGIPLLLNGSDSVVQYYVPYLSGIQLYVDPSKSS+LSRRRG DSDA S
Subjt:  DRFLEHTTPLVPAHCIPKTSLRGWRNREVSEASPYFVLGDLWESFKEWSAYGAGIPLLLNGSDSVVQYYVPYLSGIQLYVDPSKSSALSRRRGADSDAES

Query:  SKETSSDGSSNSGAEKKTKTALQNEWIQDFNALGSQRALQMNVPSSESSSDESDSCYRHGQLVFEYLERDPPFCREPLTDKITVLASRFPELKTYRSFDL
        SKETSSDGSSNSGAEKKTKTALQ+EWIQDF+  GSQRALQMNVPSSESSSDESDSCYRHGQLVFEYLERDPPFCREPLTDKIT+LASRFPELKTYRS DL
Subjt:  SKETSSDGSSNSGAEKKTKTALQNEWIQDFNALGSQRALQMNVPSSESSSDESDSCYRHGQLVFEYLERDPPFCREPLTDKITVLASRFPELKTYRSFDL

Query:  SPSSWISVAWYPIYRIPTGPTLQSLDACFLTFHNLSTAFQGNSTDGLQFHWPRVREVYTADCPLKLQLPIFGLASYKFKIPFWNSTGAEECSKAHSLWQD
        SPSSWISVAWYPIYRIPTGPTLQSLDACFLTFHNLSTAFQG STDGLQFHWPRVREVYTADCPLKLQLPIFGLASYKFKIPFWNSTGAEECSKAHSLWQD
Subjt:  SPSSWISVAWYPIYRIPTGPTLQSLDACFLTFHNLSTAFQGNSTDGLQFHWPRVREVYTADCPLKLQLPIFGLASYKFKIPFWNSTGAEECSKAHSLWQD

Query:  ADSWLRLLNVNHPDYRFFASHNSFWR
        AD+WLRLLNVNHPDYRFFASHNSFWR
Subjt:  ADSWLRLLNVNHPDYRFFASHNSFWR

TrEMBL top hits (columns: hit, e value, % identity, alignment)
A0A0A0L5V4 Uncharacterized protein (e value: 3.0e-235; % identity: 97.17)
Query:  MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQQQQQQQQQQPKQSALDSKDVVAAATSTIDDLEKRSEFDECRSWSTRSDCSVSDRGLVDSTNLDR
        MSVSGGVSIARIRGENRFYHPPAMRRRL      QQQQQQQQQQQPKQSALDSKDVVAAATSTIDDLEKRSEFDECRSWSTRSDCSVSDRGL DSTNLDR
Subjt:  MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQQQQQQQQQQPKQSALDSKDVVAAATSTIDDLEKRSEFDECRSWSTRSDCSVSDRGLVDSTNLDR

Query:  FLEHTTPLVPAHCIPKTSLRGWRNREVSEASPYFVLGDLWESFKEWSAYGAGIPLLLNGSDSVVQYYVPYLSGIQLYVDPSKSSALSRRRGADSDAESSK
        FLEHTTPLVPAHCIPKTSLRGWRNREVSEASPYFVLGDLWESFKEWSAYGAGIPLLLNGSDSVVQYYVPYLSGIQLYVDPSKSSALSRRRGADSDAESSK
Subjt:  FLEHTTPLVPAHCIPKTSLRGWRNREVSEASPYFVLGDLWESFKEWSAYGAGIPLLLNGSDSVVQYYVPYLSGIQLYVDPSKSSALSRRRGADSDAESSK

Query:  ETSSDGSSNSGAEKKTKTALQNEWIQDFNALGSQRALQMNVPSSESSSDESDSCYRHGQLVFEYLERDPPFCREPLTDKITVLASRFPELKTYRSFDLSP
        ETSSDGSSNSGAEKKTKTALQNEWIQDFN  GSQRALQMNVPSSESSSDESDSCYRHGQLVFEYLERDPPFCREPLTDKITVLASRF ELKTYRS DLSP
Subjt:  ETSSDGSSNSGAEKKTKTALQNEWIQDFNALGSQRALQMNVPSSESSSDESDSCYRHGQLVFEYLERDPPFCREPLTDKITVLASRFPELKTYRSFDLSP

Query:  SSWISVAWYPIYRIPTGPTLQSLDACFLTFHNLSTAFQGNSTDGLQFHWPRVREVYTADCPLKLQLPIFGLASYKFKIPFWNSTGAEECSKAHSLWQDAD
        SSWISVAWYPIYRIPTGPTLQSLDACFLTFHNLSTAFQG STDGLQFHWPRVREVYTADCPLKLQLPIFGLASYKFKIPFWNSTGAEECSKAHSLWQDAD
Subjt:  SSWISVAWYPIYRIPTGPTLQSLDACFLTFHNLSTAFQGNSTDGLQFHWPRVREVYTADCPLKLQLPIFGLASYKFKIPFWNSTGAEECSKAHSLWQDAD

Query:  SWLRLLNVNHPDYRFFASHNSFWR
        SWLRLLNVNHPDYRFFASHNSFWR
Subjt:  SWLRLLNVNHPDYRFFASHNSFWR

A0A1S3AY60 uncharacterized protein LOC103483873 isoform X1 (e value: 4.5e-239; % identity: 98.82)
Query:  MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQQQQQQQQQQPKQSALDSKDVVAAATSTIDDLEKRSEFDECRSWSTRSDCSVSDRGLVDSTNLDR
        MSVSGGVSIARIRGENRFYHPPAMRRRL QQQQQQQQQQQQQQQQPKQSALDSKDVVAAATSTIDDLEKRSEFDECRSWSTRSDCSVSDRGLVDSTNLDR
Subjt:  MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQQQQQQQQQQPKQSALDSKDVVAAATSTIDDLEKRSEFDECRSWSTRSDCSVSDRGLVDSTNLDR

Query:  FLEHTTPLVPAHCIPKTSLRGWRNREVSEASPYFVLGDLWESFKEWSAYGAGIPLLLNGSDSVVQYYVPYLSGIQLYVDPSKSSALSRRRGADSDAESSK
        FLEHTTPLVPAHCIPKTSLRGWRNREVSEASPYFVLGDLWESFKEWSAYGAGIPLLLNGSDSVVQYYVPYLSGIQLYVDPSKS ALSRRRGADSDAESSK
Subjt:  FLEHTTPLVPAHCIPKTSLRGWRNREVSEASPYFVLGDLWESFKEWSAYGAGIPLLLNGSDSVVQYYVPYLSGIQLYVDPSKSSALSRRRGADSDAESSK

Query:  ETSSDGSSNSGAEKKTKTALQNEWIQDFNALGSQRALQMNVPSSESSSDESDSCYRHGQLVFEYLERDPPFCREPLTDKITVLASRFPELKTYRSFDLSP
        ETSSDGSSNSGAEKKTKTALQNEWIQDFNALGSQRALQMNVPSSESSSDESDSCYRHGQLVFEYLERDPPFCREPLTDKITVLASRFPELKTYRS DLSP
Subjt:  ETSSDGSSNSGAEKKTKTALQNEWIQDFNALGSQRALQMNVPSSESSSDESDSCYRHGQLVFEYLERDPPFCREPLTDKITVLASRFPELKTYRSFDLSP

Query:  SSWISVAWYPIYRIPTGPTLQSLDACFLTFHNLSTAFQGNSTDGLQFHWPRVREVYTADCPLKLQLPIFGLASYKFKIPFWNSTGAEECSKAHSLWQDAD
        SSWISVAWYPIYRIPTGPTLQSLDACFLTFHNLSTA QG STDGLQFHWPRVREVYTADCPLKLQLPIFGLASYKFKIPFWNSTGAEECSKAHSLWQDAD
Subjt:  SSWISVAWYPIYRIPTGPTLQSLDACFLTFHNLSTAFQGNSTDGLQFHWPRVREVYTADCPLKLQLPIFGLASYKFKIPFWNSTGAEECSKAHSLWQDAD

Query:  SWLRLLNVNHPDYRFFASHNSFWR
        SWLRLLNVNHPDYRFFASHNSFWR
Subjt:  SWLRLLNVNHPDYRFFASHNSFWR

A0A1S3AY77 uncharacterized protein LOC103483873 isoform X2 (e value: 2.5e-237; % identity: 98.58)
Query:  MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQQQQQQQQQQPKQSALDSKDVVAAATSTIDDLEKRSEFDECRSWSTRSDCSVSDRGLVDSTNLDR
        MSVSGGVSIARIRGENRFYHPPAMRRRL QQQQQQQQQQQQQQQQPKQSALDSKDVVAAATSTIDDLEKRSEFDECRSWSTRSDCSVSDRGLVDSTNLDR
Subjt:  MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQQQQQQQQQQPKQSALDSKDVVAAATSTIDDLEKRSEFDECRSWSTRSDCSVSDRGLVDSTNLDR

Query:  FLEHTTPLVPAHCIPKTSLRGWRNREVSEASPYFVLGDLWESFKEWSAYGAGIPLLLNGSDSVVQYYVPYLSGIQLYVDPSKSSALSRRRGADSDAESSK
        FLEHTTPLVPAHCIPKTSLRGWRNREVSEASPYFVLGDLWESFKEWSAYGAGIPLLLNGSDSVVQYYVPYLSGIQLYVDPSKS AL RRRGADSDAESSK
Subjt:  FLEHTTPLVPAHCIPKTSLRGWRNREVSEASPYFVLGDLWESFKEWSAYGAGIPLLLNGSDSVVQYYVPYLSGIQLYVDPSKSSALSRRRGADSDAESSK

Query:  ETSSDGSSNSGAEKKTKTALQNEWIQDFNALGSQRALQMNVPSSESSSDESDSCYRHGQLVFEYLERDPPFCREPLTDKITVLASRFPELKTYRSFDLSP
        ETSSDGSSNSGAEKKTKTALQNEWIQDFNALGSQRALQMNVPSSESSSDESDSCYRHGQLVFEYLERDPPFCREPLTDKITVLASRFPELKTYRS DLSP
Subjt:  ETSSDGSSNSGAEKKTKTALQNEWIQDFNALGSQRALQMNVPSSESSSDESDSCYRHGQLVFEYLERDPPFCREPLTDKITVLASRFPELKTYRSFDLSP

Query:  SSWISVAWYPIYRIPTGPTLQSLDACFLTFHNLSTAFQGNSTDGLQFHWPRVREVYTADCPLKLQLPIFGLASYKFKIPFWNSTGAEECSKAHSLWQDAD
        SSWISVAWYPIYRIPTGPTLQSLDACFLTFHNLSTA QG STDGLQFHWPRVREVYTADCPLKLQLPIFGLASYKFKIPFWNSTGAEECSKAHSLWQDAD
Subjt:  SSWISVAWYPIYRIPTGPTLQSLDACFLTFHNLSTAFQGNSTDGLQFHWPRVREVYTADCPLKLQLPIFGLASYKFKIPFWNSTGAEECSKAHSLWQDAD

Query:  SWLRLLNVNHPDYRFFASHNSFWR
        SWLRLLNVNHPDYRFFASHNSFWR
Subjt:  SWLRLLNVNHPDYRFFASHNSFWR

A0A5A7U113 Uncharacterized protein (e value: 1.2e-223; % identity: 95.93)
Query:  MSVSGGVSIARIRGENRFYHPPAMRRRL--QQQQQQQQQQQQQQQQQPKQSALDSKDVVAAATSTIDDLEKRSEFDECRSWSTRSDCSVSDRGLVDSTNL
        MSVSGGVSIARIRGENRFYHPPAMRRRL  QQQQQQQQQQQQQQQQQPKQSALDSKDVVAAATSTIDDLEKRSEFDECRSWSTRSDCSVSDRGLVDSTNL
Subjt:  MSVSGGVSIARIRGENRFYHPPAMRRRL--QQQQQQQQQQQQQQQQQPKQSALDSKDVVAAATSTIDDLEKRSEFDECRSWSTRSDCSVSDRGLVDSTNL

Query:  DRFLEHTTPLVPAHCIPKTSLRGWRNREVSEASPYFVLGDLWESFKEWSAYGAGIPLLLNGSDSVVQYYVPYLSGIQLYVDPSKSSALSRRRGADSDAES
        DRFLEHTTPLVPAHCIPKTSLRGWRNREVSEASPYFVLGDLWESFKEWSAYGAGIPLLLNGSDSVVQYYVPYLSGIQLYVDPSKS ALSRRRGADSDAES
Subjt:  DRFLEHTTPLVPAHCIPKTSLRGWRNREVSEASPYFVLGDLWESFKEWSAYGAGIPLLLNGSDSVVQYYVPYLSGIQLYVDPSKSSALSRRRGADSDAES

Query:  SKETSSDGSSNSGAEKKTKTALQNEWIQDFNALGSQRALQMNVPSSESSSDESDSCYRHGQLVFEYLERDPPFCREPLTDKITVLASRFPELKTYRSFDL
        SKETSSDGSSNSGAEKKTKTALQNEWIQDFNALGSQRALQMNVPSSESSSDESDSCYRHGQLVFEYLERDPPFCREPLTDKITVLASRFPELKTYRS DL
Subjt:  SKETSSDGSSNSGAEKKTKTALQNEWIQDFNALGSQRALQMNVPSSESSSDESDSCYRHGQLVFEYLERDPPFCREPLTDKITVLASRFPELKTYRSFDL

Query:  SPSSWISVAWYPIYRIPTGPTLQSLDACFLTFHNLSTAFQ------------GNSTDGLQFHWPRVREVYTADCPLKLQLPIFGLASYKFKIPFWNSTGA
        SPSSWISVAWYPIYRIPTGPTLQSLDACFLTFHNLSTAFQ            G STDGLQFHWPRVREVYTADCPLKLQLPIFGLASYKFKIPFWNSTGA
Subjt:  SPSSWISVAWYPIYRIPTGPTLQSLDACFLTFHNLSTAFQ------------GNSTDGLQFHWPRVREVYTADCPLKLQLPIFGLASYKFKIPFWNSTGA

Query:  EECSKAHSLWQDADSWLR
        EECSKAHSLWQDADSWLR
Subjt:  EECSKAHSLWQDADSWLR

A0A5D3CXG0 Uncharacterized protein (e value: 6.5e-222; % identity: 95.69)
Query:  MSVSGGVSIARIRGENRFYHPPAMRRRL--QQQQQQQQQQQQQQQQQPKQSALDSKDVVAAATSTIDDLEKRSEFDECRSWSTRSDCSVSDRGLVDSTNL
        MSVSGGVSIARIRGENRFYHPPAMRRRL  QQQQQQQQQQQQQQQQQPKQSALDSKDVVAAATSTIDDLEKRSEFDECRSWSTRSDCSVSDRGLVDSTNL
Subjt:  MSVSGGVSIARIRGENRFYHPPAMRRRL--QQQQQQQQQQQQQQQQQPKQSALDSKDVVAAATSTIDDLEKRSEFDECRSWSTRSDCSVSDRGLVDSTNL

Query:  DRFLEHTTPLVPAHCIPKTSLRGWRNREVSEASPYFVLGDLWESFKEWSAYGAGIPLLLNGSDSVVQYYVPYLSGIQLYVDPSKSSALSRRRGADSDAES
        DRFLEHTTPLVPAHCIPKTSLRGWRNREVSEASPYFVLGDLWESFKEWSAYGAGIPLLLNGSDSVVQYYVPYLSGIQLYVDPSKS AL RRRGADSDAES
Subjt:  DRFLEHTTPLVPAHCIPKTSLRGWRNREVSEASPYFVLGDLWESFKEWSAYGAGIPLLLNGSDSVVQYYVPYLSGIQLYVDPSKSSALSRRRGADSDAES

Query:  SKETSSDGSSNSGAEKKTKTALQNEWIQDFNALGSQRALQMNVPSSESSSDESDSCYRHGQLVFEYLERDPPFCREPLTDKITVLASRFPELKTYRSFDL
        SKETSSDGSSNSGAEKKTKTALQNEWIQDFNALGSQRALQMNVPSSESSSDESDSCYRHGQLVFEYLERDPPFCREPLTDKITVLASRFPELKTYRS DL
Subjt:  SKETSSDGSSNSGAEKKTKTALQNEWIQDFNALGSQRALQMNVPSSESSSDESDSCYRHGQLVFEYLERDPPFCREPLTDKITVLASRFPELKTYRSFDL

Query:  SPSSWISVAWYPIYRIPTGPTLQSLDACFLTFHNLSTAFQ------------GNSTDGLQFHWPRVREVYTADCPLKLQLPIFGLASYKFKIPFWNSTGA
        SPSSWISVAWYPIYRIPTGPTLQSLDACFLTFHNLSTAFQ            G STDGLQFHWPRVREVYTADCPLKLQLPIFGLASYKFKIPFWNSTGA
Subjt:  SPSSWISVAWYPIYRIPTGPTLQSLDACFLTFHNLSTAFQ------------GNSTDGLQFHWPRVREVYTADCPLKLQLPIFGLASYKFKIPFWNSTGA

Query:  EECSKAHSLWQDADSWLR
        EECSKAHSLWQDADSWLR
Subjt:  EECSKAHSLWQDADSWLR

SwissProt top hits (columns: hit, e value, % identity, alignment)
No hits found
Arabidopsis top hits (columns: hit, e value, % identity, alignment)
AT1G15030.1 Protein of unknown function (DUF789) (e value: 3.8e-81; % identity: 50.15)
Query:  STNLDRFLEHTTPLVPAHCIPKTSLRGWRNREVSEASPYFVLGDLWESFKEWSAYGAGIPLLLNGS-DSVVQYYVPYLSGIQLY--VDPSKSSALSRRRG
        S+N++RFL+  TP VPAH + KT +R     +V    PYF+LGD+WESF EWSAYG G+PL LN + D V QYYVP LSGIQ+Y  VD   SS  +RR+G
Subjt:  STNLDRFLEHTTPLVPAHCIPKTSLRGWRNREVSEASPYFVLGDLWESFKEWSAYGAGIPLLLNGS-DSVVQYYVPYLSGIQLY--VDPSKSSALSRRRG

Query:  ADSDAESSKETSSDGSSNSGAEKKTKTALQNEWIQDFNALGSQRALQMNVPSSESSSDESDSCYRHGQLVFEYLERDPPFCREPLTDKITVLASRFPELK
         +S+++  +++SS+GSS+        +  Q     D  +L  +          +SSSD+ +     G+L+FEYLERD P+ REP  DK++ LASRFPELK
Subjt:  ADSDAESSKETSSDGSSNSGAEKKTKTALQNEWIQDFNALGSQRALQMNVPSSESSSDESDSCYRHGQLVFEYLERDPPFCREPLTDKITVLASRFPELK

Query:  TYRSFDLSPSSWISVAWYPIYRIPTGPTLQSLDACFLTFHNLSTAFQGNSTDGLQFHWPRVREVYTADCPLKLQLPIFGLASYKFKIPFWNSTGAEECSK
        T RS DL PSSW SVAWYPIY+IPTGPTL+ LDACFLT+H+L T FQG        H  + RE        K++LP+FGLASYK +   W S G      
Subjt:  TYRSFDLSPSSWISVAWYPIYRIPTGPTLQSLDACFLTFHNLSTAFQGNSTDGLQFHWPRVREVYTADCPLKLQLPIFGLASYKFKIPFWNSTGAEECSK

Query:  AHSLWQDADSWLRLLNVNHPDYRFF
        A+SL+Q AD+WLRL  VNHPD+ FF
Subjt:  AHSLWQDADSWLRLLNVNHPDYRFF

AT2G01260.1 Protein of unknown function (DUF789) (e value: 7.4e-77; % identity: 48.48)
Query:  VDSTNLDRFLEHTTPLVPAHCIPKTSLRGWR-NREVSEASPYFVLGDLWESFKEWSAYGAGIPLLLNGS-DSVVQYYVPYLSGIQLYV-DPSKSSALSRR
        + S+NLDRFLE  TP VPA  + KT LR  R + + ++  PYFVLGD+W+SF EWSAYG G+PL+LN + D V+QYYVP LS IQ+Y    +  S+L  R
Subjt:  VDSTNLDRFLEHTTPLVPAHCIPKTSLRGWR-NREVSEASPYFVLGDLWESFKEWSAYGAGIPLLLNGS-DSVVQYYVPYLSGIQLYV-DPSKSSALSRR

Query:  RGADSDAESSKETSSDGSSNSGAEKKTKTALQNEWIQDFNALGSQRALQMNVPSSESSSDESDSCYRHGQLVFEYLERDPPFCREPLTDKITVLASRFPE
        R  DS     +++SSD SS+S +E+ +          D  +L  Q          +SSSD+ +     G+L+FEYLERD P+ REP  DK+  LA++FPE
Subjt:  RGADSDAESSKETSSDGSSNSGAEKKTKTALQNEWIQDFNALGSQRALQMNVPSSESSSDESDSCYRHGQLVFEYLERDPPFCREPLTDKITVLASRFPE

Query:  LKTYRSFDLSPSSWISVAWYPIYRIPTGPTLQSLDACFLTFHNLSTAFQG-NSTDGLQFHWPRVREVYTADCPLKLQLPIFGLASYKFKIPFWNSTGAEE
        L T RS DL  SSW SVAWYPIYRIPTGPTL+ LDACFLT+H+L T+F G  S   +    PR  E        K+ LP+FGLASYKF+   W   G  E
Subjt:  LKTYRSFDLSPSSWISVAWYPIYRIPTGPTLQSLDACFLTFHNLSTAFQG-NSTDGLQFHWPRVREVYTADCPLKLQLPIFGLASYKFKIPFWNSTGAEE

Query:  CSKAHSLWQDADSWLRLLNVNHPDYRFF
            +SL+Q AD WL   +V+HPD+ FF
Subjt:  CSKAHSLWQDADSWLRLLNVNHPDYRFF

AT2G01260.2 Protein of unknown function (DUF789) (e value: 2.2e-60; % identity: 51.2)
Query:  VDSTNLDRFLEHTTPLVPAHCIPKTSLRGWR-NREVSEASPYFVLGDLWESFKEWSAYGAGIPLLLNGS-DSVVQYYVPYLSGIQLYV-DPSKSSALSRR
        + S+NLDRFLE  TP VPA  + KT LR  R + + ++  PYFVLGD+W+SF EWSAYG G+PL+LN + D V+QYYVP LS IQ+Y    +  S+L  R
Subjt:  VDSTNLDRFLEHTTPLVPAHCIPKTSLRGWR-NREVSEASPYFVLGDLWESFKEWSAYGAGIPLLLNGS-DSVVQYYVPYLSGIQLYV-DPSKSSALSRR

Query:  RGADSDAESSKETSSDGSSNSGAEKKTKTALQNEWIQDFNALGSQRALQMNVPSSESSSDESDSCYRHGQLVFEYLERDPPFCREPLTDKITVLASRFPE
        R  DS     +++SSD SS+S +E+ +          D  +L  Q          +SSSD+ +     G+L+FEYLERD P+ REP  DK+  LA++FPE
Subjt:  RGADSDAESSKETSSDGSSNSGAEKKTKTALQNEWIQDFNALGSQRALQMNVPSSESSSDESDSCYRHGQLVFEYLERDPPFCREPLTDKITVLASRFPE

Query:  LKTYRSFDLSPSSWISVAWYPIYRIPTGPTLQSLDACFLTFHNLSTAFQG
        L T RS DL  SSW SVAWYPIYRIPTGPTL+ LDACFLT+H+L T+F G
Subjt:  LKTYRSFDLSPSSWISVAWYPIYRIPTGPTLQSLDACFLTFHNLSTAFQG

AT4G16100.1 Protein of unknown function (DUF789) (e value: 3.4e-90; % identity: 48.82)
Query:  RIRGENRFYHPPAMRRRLQQQQQQQQQQQQQQQQQPKQSALDSKDVVAAATSTIDDLEKRSEFDECRSWSTRSDCSVSDRGL-------VDSTNLDRFLE
        RIRGENRFY+PP M R+LQQ++++++ + ++ +++ K+    +K+++       +   K+ E  EC    + SDCSV  R           S+NL RFL+
Subjt:  RIRGENRFYHPPAMRRRLQQQQQQQQQQQQQQQQQPKQSALDSKDVVAAATSTIDDLEKRSEFDECRSWSTRSDCSVSDRGL-------VDSTNLDRFLE

Query:  HTTPLVPAHCIPKTSLRGWRNREVSEASPYFVLGDLWESFKEWSAYGAGIPLLLNGSDSVVQYYVPYLSGIQLYVDPSKSSALSRRRGADSDAESSKETS
         TTP+V    +P TS +GWR RE  E  PYF+L DLW+SF+EWSAYG G+PLLLNG DSVVQYYVPYLSGIQLY DPS++    RR G +SD +S ++ S
Subjt:  HTTPLVPAHCIPKTSLRGWRNREVSEASPYFVLGDLWESFKEWSAYGAGIPLLLNGSDSVVQYYVPYLSGIQLYVDPSKSSALSRRRGADSDAESSKETS

Query:  SDGSSNSGAEKKTKTALQNEWIQDFNALGSQRALQMNVPSSESSSDESD-SCYRHGQLVFEYLERDPPFCREPLTDKITVLASRFPELKTYRSFDLSPSS
        SDGS++       +   QN +          RA     P   SSSDES+ S    G+LVFEYLE   PF REPLTDKI+ L+S+FP L+TYRS DLSPSS
Subjt:  SDGSSNSGAEKKTKTALQNEWIQDFNALGSQRALQMNVPSSESSSDESD-SCYRHGQLVFEYLERDPPFCREPLTDKITVLASRFPELKTYRSFDLSPSS

Query:  WISVAWYPIYRIPTGPTLQSLDACFLTFHNLSTAFQGNSTDGLQFHWPRVREVYTADCPLKLQLPIFGLASYKFKIPFWN-STGAEECSKAHSLWQDADS
        W+SVAWYPIYRIP G +LQ+LDACFLTFH+LST  +G S +  Q     V          KL LP FGLASYKFK+  W+  +  +E  +  +L + A+ 
Subjt:  WISVAWYPIYRIPTGPTLQSLDACFLTFHNLSTAFQGNSTDGLQFHWPRVREVYTADCPLKLQLPIFGLASYKFKIPFWN-STGAEECSKAHSLWQDADS

Query:  WLRLLNVNHPDYRFFASHN-SFWR
        WLR L V  PD+R F SH+ S WR
Subjt:  WLRLLNVNHPDYRFFASHN-SFWR

AT5G49220.1 Protein of unknown function (DUF789) (e value: 1.2e-90; % identity: 49.66)
Query:  MSVSGGVSIAR--IRGENRFYHPPAMRRRLQQQQQQQQQQQQQQQQQPKQSALDSKDVVAAATSTIDDLEKRSEFDECRSWSTRSDCSV----------S
        MS SGGVSIAR  IRGENRFY+PP M RR+QQ+ Q QQQ +++Q++  +   L  K+   AAT       K     E +S    S   V          S
Subjt:  MSVSGGVSIAR--IRGENRFYHPPAMRRRLQQQQQQQQQQQQQQQQQPKQSALDSKDVVAAATSTIDDLEKRSEFDECRSWSTRSDCSV----------S

Query:  DRGLVDSTNLDRFLEHTTPLVPAHCIPKTSLRGWRNREVSEASPYFVLGDLWESFKEWSAYGAGI-----PLLLNGSDSVVQYYVPYLSGIQLYVDPSKS
         R L D +NLDRFLEHTTP+VPA   P  S    + RE S+   YFVL DLWESF EWSAYGAG+     PL ++G+DS VQYYVPYLSGIQLYVDP   
Subjt:  DRGLVDSTNLDRFLEHTTPLVPAHCIPKTSLRGWRNREVSEASPYFVLGDLWESFKEWSAYGAGI-----PLLLNGSDSVVQYYVPYLSGIQLYVDPSKS

Query:  SALSRRRGADSDAESSKETSSDGSSNSGAEKKTKTALQNEWIQDFNALGSQRALQMNVPSSESSSDESDSCYRHGQLVFEYLERDPPFCREPLTDKITVL
          L + R    D     E SS+GSSNS      +T   +  + + N +    +L+    +   SS E++     G+L+FEYLE +PPF REPL +KI+ L
Subjt:  SALSRRRGADSDAESSKETSSDGSSNSGAEKKTKTALQNEWIQDFNALGSQRALQMNVPSSESSSDESDSCYRHGQLVFEYLERDPPFCREPLTDKITVL

Query:  ASRFPELKTYRSFDLSPSSWISVAWYPIYRIPTGPTLQSLDACFLTFHNLSTAFQGNSTDGLQFHWPRVREVYTADCPLKLQLPIFGLASYKFKIPFWNS
        ASR PEL TYRS DL PSSW+SV+WYPIYRIP GPTLQ+LDACFLTFH+LSTA    S  G     P            KL LP FGLASYK K+  WN 
Subjt:  ASRFPELKTYRSFDLSPSSWISVAWYPIYRIPTGPTLQSLDACFLTFHNLSTAFQGNSTDGLQFHWPRVREVYTADCPLKLQLPIFGLASYKFKIPFWNS

Query:  TGAEECSKAHSLWQDADSWLRLLNVNHPDYRFFASHN
           +E  K  SL Q AD WL+ L V+HPDYRFF S++
Subjt:  TGAEECSKAHSLWQDADSWLRLLNVNHPDYRFFASHN


Sequences
CDS sequence
ATGTCAGTCTCCGGTGGTGTTTCGATTGCCCGAATCCGTGGCGAGAATCGCTTCTACCATCCACCTGCGATGCGACGTCGTTTGCAGCAGCAGCAGCAGCAACAACAACA
ACAACAACAACAACAGCAACAGCAGCCGAAGCAAAGTGCTTTAGATTCTAAGGACGTTGTGGCGGCTGCTACTTCCACGATCGATGACTTGGAGAAGAGGAGTGAGTTCG
ATGAGTGTCGTTCTTGGTCCACTCGCTCTGATTGTTCTGTTTCCGATCGGGGACTTGTTGATTCTACTAATTTGGATCGCTTCTTGGAGCATACTACTCCTCTTGTTCCG
GCTCATTGTATTCCTAAGACGAGCCTGAGGGGGTGGAGAAACCGTGAAGTTTCAGAGGCATCTCCTTATTTTGTGCTCGGTGATCTCTGGGAATCTTTCAAGGAATGGAG
TGCATATGGAGCCGGTATCCCTCTATTGTTAAATGGCAGTGACTCTGTAGTACAGTACTATGTTCCATATCTGTCTGGCATTCAACTCTATGTTGATCCTTCAAAGTCCT
CTGCACTAAGTAGAAGGCGTGGTGCCGATAGTGATGCTGAGTCATCAAAGGAAACAAGCAGTGATGGAAGCAGTAATTCTGGGGCAGAAAAGAAAACAAAAACTGCCCTT
CAGAATGAGTGGATACAGGACTTTAATGCTCTGGGGTCTCAAAGAGCTCTTCAGATGAATGTACCTTCTTCCGAGTCATCGAGTGATGAAAGTGACTCTTGCTACCGGCA
TGGTCAGCTTGTTTTTGAATACTTGGAACGTGACCCACCATTTTGTCGTGAACCATTAACTGATAAGATCACTGTCCTTGCATCACGTTTTCCTGAATTAAAGACATACA
GGAGCTTTGATTTATCTCCTTCCAGTTGGATATCTGTGGCATGGTATCCAATTTATCGGATTCCGACGGGGCCAACTCTACAAAGTCTAGATGCTTGTTTCTTGACCTTC
CATAATCTGTCAACAGCATTTCAAGGCAATAGCACAGATGGACTACAATTCCATTGGCCAAGAGTTAGAGAGGTGTACACTGCGGATTGCCCTCTCAAACTGCAGTTGCC
TATATTTGGACTTGCTTCCTATAAGTTCAAAATTCCTTTTTGGAATTCGACTGGTGCAGAGGAATGTTCGAAGGCTCACTCTTTATGGCAAGATGCCGACAGCTGGCTTA
GGTTATTAAACGTAAACCATCCTGATTACAGATTTTTCGCATCTCATAACTCATTCTGGAGATGA
mRNA sequence
GAAAGACATTGGTTGAGTTAAATCTGGGAATATCCAGATTCGGGAGAGGATACGTCGGTCATAAGGAGAAAGAGGAAGGGGCAAAGAAGTAAATTTCGAGTCAAATACCC
CACAAGCATTACGCAAAGGATAGAGAGATAGGTGAAGAAGAAGAGGAATGGAAAGGGCATTGCAAAATTATGATTTCTGGAGTCTTCTGATCCTAAAATCTCCCCCTCCC
CCCTCTCTCTCTAATCCAATCCCCTCTTTCCCCCACCTTTCTATAATTTTCCAAAAAATAGGTCTAATTTTCTAATAAACCTCTCTGTTAGTTATCTATCTTCTTCCCCC
AATAGATTTTACAAAACCCTAGTTTCTCCTCCTCCTTCTCCTCCTTCTTCTTCTTCATCATCACTGTATACATATAAAAACACACACACACACACTCACACTCACACCTG
AATACCTACCCTTGCAATGTCAGTCTCCGGTGGTGTTTCGATTGCCCGAATCCGTGGCGAGAATCGCTTCTACCATCCACCTGCGATGCGACGTCGTTTGCAGCAGCAGC
AGCAGCAACAACAACAACAACAACAACAACAGCAACAGCAGCCGAAGCAAAGTGCTTTAGATTCTAAGGACGTTGTGGCGGCTGCTACTTCCACGATCGATGACTTGGAG
AAGAGGAGTGAGTTCGATGAGTGTCGTTCTTGGTCCACTCGCTCTGATTGTTCTGTTTCCGATCGGGGACTTGTTGATTCTACTAATTTGGATCGCTTCTTGGAGCATAC
TACTCCTCTTGTTCCGGCTCATTGTATTCCTAAGACGAGCCTGAGGGGGTGGAGAAACCGTGAAGTTTCAGAGGCATCTCCTTATTTTGTGCTCGGTGATCTCTGGGAAT
CTTTCAAGGAATGGAGTGCATATGGAGCCGGTATCCCTCTATTGTTAAATGGCAGTGACTCTGTAGTACAGTACTATGTTCCATATCTGTCTGGCATTCAACTCTATGTT
GATCCTTCAAAGTCCTCTGCACTAAGTAGAAGGCGTGGTGCCGATAGTGATGCTGAGTCATCAAAGGAAACAAGCAGTGATGGAAGCAGTAATTCTGGGGCAGAAAAGAA
AACAAAAACTGCCCTTCAGAATGAGTGGATACAGGACTTTAATGCTCTGGGGTCTCAAAGAGCTCTTCAGATGAATGTACCTTCTTCCGAGTCATCGAGTGATGAAAGTG
ACTCTTGCTACCGGCATGGTCAGCTTGTTTTTGAATACTTGGAACGTGACCCACCATTTTGTCGTGAACCATTAACTGATAAGATCACTGTCCTTGCATCACGTTTTCCT
GAATTAAAGACATACAGGAGCTTTGATTTATCTCCTTCCAGTTGGATATCTGTGGCATGGTATCCAATTTATCGGATTCCGACGGGGCCAACTCTACAAAGTCTAGATGC
TTGTTTCTTGACCTTCCATAATCTGTCAACAGCATTTCAAGGCAATAGCACAGATGGACTACAATTCCATTGGCCAAGAGTTAGAGAGGTGTACACTGCGGATTGCCCTC
TCAAACTGCAGTTGCCTATATTTGGACTTGCTTCCTATAAGTTCAAAATTCCTTTTTGGAATTCGACTGGTGCAGAGGAATGTTCGAAGGCTCACTCTTTATGGCAAGAT
GCCGACAGCTGGCTTAGGTTATTAAACGTAAACCATCCTGATTACAGATTTTTCGCATCTCATAACTCATTCTGGAGATGATAACGAAGGATATATCATGCATGATGCAT
AAATGTGGGATTACAGTTTTAAGTCCAAAGAAACTCGCTTCTCCTGAATGTCGTGAAAGTTTTAATAGCATCTTTGGCTTCCTTTTTTTCTCAAAAAAATTCTACTTTGA
CCACCGTATAAAGGGTGGTTCTGTACAGGGAGAATGATGTTGATAGTAACATTACGGTTGAAATGGGGGGATTAGCTTTAGGATAGTTGTTATTCTTGGAGCGTCAATCC
CCTTAGTATTCTATTAGTCTATTCTTCCATGTGTTGAGAAATGAGTGATGATGGGAATATGGTGGTTGTGGACAGCGGAGAAACTTCTGTCAAATACCAACACCCCCATT
CTTTTTTCTTTTTTCTTACTTTAAATTGTGGGTTAGATTTGTAACTTTCACTGGCACAACGATAATGATGAAAAGTATATCTAGTATGTTAACTCCAATCTTTGCTGTAT
TAGCAGAGTATTTATTAACAGTTCATTAGATAAAGGCAATTGCTGC
Protein sequence
MSVSGGVSIARIRGENRFYHPPAMRRRLQQQQQQQQQQQQQQQQQPKQSALDSKDVVAAATSTIDDLEKRSEFDECRSWSTRSDCSVSDRGLVDSTNLDRFLEHTTPLVP
AHCIPKTSLRGWRNREVSEASPYFVLGDLWESFKEWSAYGAGIPLLLNGSDSVVQYYVPYLSGIQLYVDPSKSSALSRRRGADSDAESSKETSSDGSSNSGAEKKTKTAL
QNEWIQDFNALGSQRALQMNVPSSESSSDESDSCYRHGQLVFEYLERDPPFCREPLTDKITVLASRFPELKTYRSFDLSPSSWISVAWYPIYRIPTGPTLQSLDACFLTF
HNLSTAFQGNSTDGLQFHWPRVREVYTADCPLKLQLPIFGLASYKFKIPFWNSTGAEECSKAHSLWQDADSWLRLLNVNHPDYRFFASHNSFWR
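
The CDS and protein sequences listed on this page should correspond to each other under the standard genetic code. The following is a minimal sketch of how that could be checked in Python. It assumes the standard (NCBI table 1) genetic code applies to this nuclear gene; the function name `translate` and the behavior of stopping at the first stop codon are illustrative choices, not part of CuGenDB.

```python
# Minimal sketch: translate a CDS with the standard genetic code.
# Assumption: NCBI translation table 1 applies; `translate` is a
# hypothetical helper written for this illustration only.

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a DNA coding sequence, stopping at the first stop codon.

    Whitespace and line breaks are removed, so a sequence pasted as
    several wrapped lines can be passed in directly. Codons containing
    characters other than A, C, G, T are reported as 'X'. A trailing
    incomplete codon is ignored.
    """
    seq = "".join(cds.split()).upper()
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        residue = CODON_TABLE.get(seq[pos:pos + 3], "X")
        if residue == "*":  # stop codon: end of the open reading frame
            break
        protein.append(residue)
    return "".join(protein)


if __name__ == "__main__":
    # First 30 nucleotides of the CDS listed above.
    print(translate("ATGTCAGTCTCCGGTGGTGTTTCGATTGCC"))  # MSVSGGVSIA
```

Passing the full CDS text from this page to `translate` and comparing the result with the protein sequence (with line breaks removed) would confirm whether the two records agree.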