CuGenDBv2

Tan0020195 (gene) of Snake gourd v1 genome

Gene ID: Tan0020195
Organism: Trichosanthes anguina (Snake gourd v1)
Description: Transducin/WD40 repeat-like superfamily protein
Genome location: LG03:62125448..62141320
RNA-Seq Expression: Tan0020195
Synteny: Tan0020195
Gene Ontology terms: GO:0005515 - protein binding (molecular function)
InterPro domains: IPR015943 - WD40/YVTN repeat-like-containing domain superfamily
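
The genome location field follows a `scaffold:start..end` pattern. As a minimal sketch, the span of the locus can be computed from that string. The helper names `parse_location` and `locus_length` are hypothetical and not part of CuGenDB, and the coordinates are assumed to be 1-based and inclusive.

```python
import re

def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'scaffold:start..end' string into its three parts."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognized location format: {location!r}")
    scaffold, start, end = match.group(1), int(match.group(2)), int(match.group(3))
    return scaffold, start, end

def locus_length(location: str) -> int:
    """Length of the locus in bp, assuming 1-based inclusive coordinates."""
    _, start, end = parse_location(location)
    return abs(end - start) + 1

if __name__ == "__main__":
    loc = "LG03:62125448..62141320"
    print(parse_location(loc))
    print(locus_length(loc), "bp")
```

Under the inclusive-coordinate assumption, the gene spans 15,873 bp on LG03, which is much longer than the mRNA listed further down, as expected for a locus that contains introns.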


Homology
GenBank top hits | e value | % identity
XP_022156222.1 uncharacterized protein LOC111023160 [Momordica charantia] | 2.3e-267 | 99.15
Query:  MEGRRISASPRPCSGRRILAKKRPRVDGFVNSVKKLQRREICSKRDRAFSMSNAQERFRNMRLMEEYDTHDPKGHCSPVLPFLMKRTKVIEIVAARDIVF
        MEGRRISASPRPCSGRRILAKKRPRVDGFVNSVKKLQRREICSKRDRAFSMSNAQERFRNMRLMEEYDTHDPKGHCSPVLPFLMKRTKVIEIVAARDIVF
Subjt:  MEGRRISASPRPCSGRRILAKKRPRVDGFVNSVKKLQRREICSKRDRAFSMSNAQERFRNMRLMEEYDTHDPKGHCSPVLPFLMKRTKVIEIVAARDIVF

Query:  ALAHSGVCAAFSRETNERICFLNVSPDEVIRSLFYNKNNDSLITVSVYASDNFSSLKCRSTRIEYIRRGKPDAGFALFESESLKWPGFVEFDDVNGKVLT
        ALAHSGVCAAFSRETNERICFLNVSPDEVIRSLFYNKNNDSLITVSVYASDNFSSLKCRSTRIEYIRRGKPDAGFALFESESLKWPGFVEFDDVNGKVLT
Subjt:  ALAHSGVCAAFSRETNERICFLNVSPDEVIRSLFYNKNNDSLITVSVYASDNFSSLKCRSTRIEYIRRGKPDAGFALFESESLKWPGFVEFDDVNGKVLT

Query:  YSAQDSIYKVFDLKNYTMLYSISDKHVQEIKISPGIMLLIFNRASSHVPLKILSIEDGTVLKAFNHLLHRNKKVDFIEQFNEKLLVKQENENLQILDVRN
        YSAQDSIYKVFDLKNYTMLYSISDKHVQEIKISPGIMLLIFNRASSHVPLKILSIEDGTVLKAFNHLLHRNKKVDFIEQFNEKLLVKQENENLQILDVRN
Subjt:  YSAQDSIYKVFDLKNYTMLYSISDKHVQEIKISPGIMLLIFNRASSHVPLKILSIEDGTVLKAFNHLLHRNKKVDFIEQFNEKLLVKQENENLQILDVRN

Query:  AELMEVSRTEFMTPSAFIFLYENQLFLTFRNRTVAVWNFRGELVTSFEDHLLWHPDCNTNNIYITSDQDLIISYCKADSDDHWMEGNAGSINISNILTGK
        AELMEVSRTEFMTPSAFIFLYENQLFLTFRNRTVAVWNFRGELVTSFEDHLLWHPDCNTNNIYITSDQDLIISYCKADSDDHWMEGNAGSINISNILTGK
Subjt:  AELMEVSRTEFMTPSAFIFLYENQLFLTFRNRTVAVWNFRGELVTSFEDHLLWHPDCNTNNIYITSDQDLIISYCKADSDDHWMEGNAGSINISNILTGK

Query:  CLAKINASNGNPKVEDCSCAG---SSKKQCNSSQMRNTVAEALEDITALFYDEERNEIYTGNKNGLVHVWSN
        CLAKINASNGNPKV+DCSCAG   SSKKQCNSSQMRNTVAEALEDITALFYDEERNEIYTGNKNGLVHVWSN
Subjt:  CLAKINASNGNPKVEDCSCAG---SSKKQCNSSQMRNTVAEALEDITALFYDEERNEIYTGNKNGLVHVWSN

XP_022925111.1 uncharacterized protein LOC111432451 [Cucurbita moschata] | 2.2e-265 | 98.72
Query:  MEGRRISASPRPCSGRRILAKKRPRVDGFVNSVKKLQRREICSKRDRAFSMSNAQERFRNMRLMEEYDTHDPKGHCSPVLPFLMKRTKVIEIVAARDIVF
        MEGRRISASPRPCSGRRILAKKR RVDGFVNSVKKLQRREICSKRDRAFSMSNAQERFRNMRLMEEYDTHDPKGHCSPVLPFLMKRTKVIEIVAARDIVF
Subjt:  MEGRRISASPRPCSGRRILAKKRPRVDGFVNSVKKLQRREICSKRDRAFSMSNAQERFRNMRLMEEYDTHDPKGHCSPVLPFLMKRTKVIEIVAARDIVF

Query:  ALAHSGVCAAFSRETNERICFLNVSPDEVIRSLFYNKNNDSLITVSVYASDNFSSLKCRSTRIEYIRRGKPDAGFALFESESLKWPGFVEFDDVNGKVLT
        ALAHSGVCAAFSRETNERICFLNVSPDEVIRSLFYNKNNDSLITVSVYASDNFSSLKCRSTRIEYIRRGKPDAGFALFESESLKWPGFVEFDDVNGKVLT
Subjt:  ALAHSGVCAAFSRETNERICFLNVSPDEVIRSLFYNKNNDSLITVSVYASDNFSSLKCRSTRIEYIRRGKPDAGFALFESESLKWPGFVEFDDVNGKVLT

Query:  YSAQDSIYKVFDLKNYTMLYSISDKHVQEIKISPGIMLLIFNRASSHVPLKILSIEDGTVLKAFNHLLHRNKKVDFIEQFNEKLLVKQENENLQILDVRN
        YSAQDSIYKVFDLKNYTMLYSISDKHVQEIKISPGIMLLIFNRASSHVPLKILSIEDGTVLKAFNHLLHRNKKVDFIEQFNEKLLVKQENENLQILDVRN
Subjt:  YSAQDSIYKVFDLKNYTMLYSISDKHVQEIKISPGIMLLIFNRASSHVPLKILSIEDGTVLKAFNHLLHRNKKVDFIEQFNEKLLVKQENENLQILDVRN

Query:  AELMEVSRTEFMTPSAFIFLYENQLFLTFRNRTVAVWNFRGELVTSFEDHLLWHPDCNTNNIYITSDQDLIISYCKADSDDHWMEGNAGSINISNILTGK
        AELMEVSR+EFMTPSAFIFLYENQLFLTFRNRTVAVWNFRGELVTSFEDHLLWHPDCNTNNIYITSDQDLIISYCKAD DDHWMEGNAGSINISNILTGK
Subjt:  AELMEVSRTEFMTPSAFIFLYENQLFLTFRNRTVAVWNFRGELVTSFEDHLLWHPDCNTNNIYITSDQDLIISYCKADSDDHWMEGNAGSINISNILTGK

Query:  CLAKINASNGNPKVEDCSCAGSSKKQCNSSQMRNTVAEALEDITALFYDEERNEIYTGNKNGLVHVWSN
        CLAKINASNGNPKVEDCSCA  SKKQC+SSQMRNTVAEALEDITALFYDEERNEIYTGNKNGLVHVWSN
Subjt:  CLAKINASNGNPKVEDCSCAGSSKKQCNSSQMRNTVAEALEDITALFYDEERNEIYTGNKNGLVHVWSN

XP_022950437.1 uncharacterized protein LOC111453540 [Cucurbita moschata] | 6.4e-265 | 98.29
Query:  MEGRRISASPRPCSGRRILAKKRPRVDGFVNSVKKLQRREICSKRDRAFSMSNAQERFRNMRLMEEYDTHDPKGHCSPVLPFLMKRTKVIEIVAARDIVF
        MEGRRISASPR CSGRRILAKKR RVDGFVNSVKKLQRREICSKRDRAFSMSNAQERFRNMRLMEEYDTHDPKGHCSPVLPFLMKRTKVIEIVAARDIVF
Subjt:  MEGRRISASPRPCSGRRILAKKRPRVDGFVNSVKKLQRREICSKRDRAFSMSNAQERFRNMRLMEEYDTHDPKGHCSPVLPFLMKRTKVIEIVAARDIVF

Query:  ALAHSGVCAAFSRETNERICFLNVSPDEVIRSLFYNKNNDSLITVSVYASDNFSSLKCRSTRIEYIRRGKPDAGFALFESESLKWPGFVEFDDVNGKVLT
        ALAHSGVCAAFSRETNERICFLNVSPDEVIRSLFYNKNNDSLITVSVYASDNFSSLKCRSTRIEYIRRGKPDAGFALFESESLKWPGFVEFDDVNGKVLT
Subjt:  ALAHSGVCAAFSRETNERICFLNVSPDEVIRSLFYNKNNDSLITVSVYASDNFSSLKCRSTRIEYIRRGKPDAGFALFESESLKWPGFVEFDDVNGKVLT

Query:  YSAQDSIYKVFDLKNYTMLYSISDKHVQEIKISPGIMLLIFNRASSHVPLKILSIEDGTVLKAFNHLLHRNKKVDFIEQFNEKLLVKQENENLQILDVRN
        YSAQDSIYKVFDLKNYTMLYSISDKHVQEIKISPGIMLLIFNRASSHVPLKILSIEDGTVLKAFNHLLHRNKKVDFIEQFNEKLLVKQENENLQILDVRN
Subjt:  YSAQDSIYKVFDLKNYTMLYSISDKHVQEIKISPGIMLLIFNRASSHVPLKILSIEDGTVLKAFNHLLHRNKKVDFIEQFNEKLLVKQENENLQILDVRN

Query:  AELMEVSRTEFMTPSAFIFLYENQLFLTFRNRTVAVWNFRGELVTSFEDHLLWHPDCNTNNIYITSDQDLIISYCKADSDDHWMEGNAGSINISNILTGK
        AELMEVSRTEF+TPSAFIFLYENQLFLTFRNRTVAVWNFRGELVTSF+DHLLWHPDCNTNNIYITSDQDLIISYCKADS+DHWMEGNAGSINISNILTGK
Subjt:  AELMEVSRTEFMTPSAFIFLYENQLFLTFRNRTVAVWNFRGELVTSFEDHLLWHPDCNTNNIYITSDQDLIISYCKADSDDHWMEGNAGSINISNILTGK

Query:  CLAKINASNGNPKVEDCSCAGSSKKQCNSSQMRNTVAEALEDITALFYDEERNEIYTGNKNGLVHVWSN
        CLAKINA+NGNPKV+DCSCAGSSK QCNSSQMRNTVAEALEDITALFYDEERNEIYTGNKNGLVHVWSN
Subjt:  CLAKINASNGNPKVEDCSCAGSSKKQCNSSQMRNTVAEALEDITALFYDEERNEIYTGNKNGLVHVWSN

XP_022978161.1 uncharacterized protein LOC111478225 [Cucurbita maxima] | 2.4e-264 | 98.08
Query:  MEGRRISASPRPCSGRRILAKKRPRVDGFVNSVKKLQRREICSKRDRAFSMSNAQERFRNMRLMEEYDTHDPKGHCSPVLPFLMKRTKVIEIVAARDIVF
        MEGRRISASPR CSGRRILAKKR RVDGFVNSVKKLQRREICSKRDRAFSMSNAQERFRNMRLMEEYDTHDPKGHCSPVLPFLMKRTKVIEIVAARDIVF
Subjt:  MEGRRISASPRPCSGRRILAKKRPRVDGFVNSVKKLQRREICSKRDRAFSMSNAQERFRNMRLMEEYDTHDPKGHCSPVLPFLMKRTKVIEIVAARDIVF

Query:  ALAHSGVCAAFSRETNERICFLNVSPDEVIRSLFYNKNNDSLITVSVYASDNFSSLKCRSTRIEYIRRGKPDAGFALFESESLKWPGFVEFDDVNGKVLT
        ALAHSGVCAAFSRETNERICFLNVSPDEVIRSLFYNKNNDSLITVSVYASDNFSSLKCRSTRIEYIRRGKPDAGFALFESESLKWPGFVEFDDVNGKVLT
Subjt:  ALAHSGVCAAFSRETNERICFLNVSPDEVIRSLFYNKNNDSLITVSVYASDNFSSLKCRSTRIEYIRRGKPDAGFALFESESLKWPGFVEFDDVNGKVLT

Query:  YSAQDSIYKVFDLKNYTMLYSISDKHVQEIKISPGIMLLIFNRASSHVPLKILSIEDGTVLKAFNHLLHRNKKVDFIEQFNEKLLVKQENENLQILDVRN
        YSAQDSIYKVFDLKNYTMLYSISDKHVQEIKISPGIMLLIFNRASSHVPLKILSIEDGTVLKAFNHLLHRNKKVDFIEQFNEKLLVKQENENLQILDVRN
Subjt:  YSAQDSIYKVFDLKNYTMLYSISDKHVQEIKISPGIMLLIFNRASSHVPLKILSIEDGTVLKAFNHLLHRNKKVDFIEQFNEKLLVKQENENLQILDVRN

Query:  AELMEVSRTEFMTPSAFIFLYENQLFLTFRNRTVAVWNFRGELVTSFEDHLLWHPDCNTNNIYITSDQDLIISYCKADSDDHWMEGNAGSINISNILTGK
        AELMEVSRTEF+TPSAFIFLYENQLFLTFRNRTVAVWNFRGELVTSF+DHLLWHPDCNTNNIYITSDQDLIISYCKADS+DHWMEGNAGSINISNILTGK
Subjt:  AELMEVSRTEFMTPSAFIFLYENQLFLTFRNRTVAVWNFRGELVTSFEDHLLWHPDCNTNNIYITSDQDLIISYCKADSDDHWMEGNAGSINISNILTGK

Query:  CLAKINASNGNPKVEDCSCAGSSKKQCNSSQMRNTVAEALEDITALFYDEERNEIYTGNKNGLVHVWSN
        CLAKINA+NGNPKV DCSCAGSSK QCNS+QMRNTVAEALEDITALFYDEERNEIYTGNKNGLVHVWSN
Subjt:  CLAKINASNGNPKVEDCSCAGSSKKQCNSSQMRNTVAEALEDITALFYDEERNEIYTGNKNGLVHVWSN

XP_038881895.1 uncharacterized protein LOC120073242 isoform X1 [Benincasa hispida] | 2.9e-265 | 98.72
Query:  MEGRRISASPRPCSGRRILAKKRPRVDGFVNSVKKLQRREICSKRDRAFSMSNAQERFRNMRLMEEYDTHDPKGHCSPVLPFLMKRTKVIEIVAARDIVF
        MEGRRISASPRPCSGRRILAKKRPRVDGFVNSVKKLQRREICSKRDRAFSMSNAQERFRNMRLMEEYDTHDPKGHCSPVLPFLMKRTKVIEIVAARDIVF
Subjt:  MEGRRISASPRPCSGRRILAKKRPRVDGFVNSVKKLQRREICSKRDRAFSMSNAQERFRNMRLMEEYDTHDPKGHCSPVLPFLMKRTKVIEIVAARDIVF

Query:  ALAHSGVCAAFSRETNERICFLNVSPDEVIRSLFYNKNNDSLITVSVYASDNFSSLKCRSTRIEYIRRGKPDAGFALFESESLKWPGFVEFDDVNGKVLT
        ALAHSGVCAAFSRETNERICFLNVSPDEVIRSLFYNKNNDSLITVSVYASDNFSSLKCRSTRIEYIRRGKPDAGFALFESESLKWPGFVEFDDVNGKVLT
Subjt:  ALAHSGVCAAFSRETNERICFLNVSPDEVIRSLFYNKNNDSLITVSVYASDNFSSLKCRSTRIEYIRRGKPDAGFALFESESLKWPGFVEFDDVNGKVLT

Query:  YSAQDSIYKVFDLKNYTMLYSISDKHVQEIKISPGIMLLIFNRASSHVPLKILSIEDGTVLKAFNHLLHRNKKVDFIEQFNEKLLVKQENENLQILDVRN
        YSAQDSIYKVFDLKNYTMLYSISDKHVQEIKISPGIMLLIFNRASSHVPLKILSIEDGTVLKAFNHLLHRNKKVDFIEQFNEKLLVKQENENLQILDVRN
Subjt:  YSAQDSIYKVFDLKNYTMLYSISDKHVQEIKISPGIMLLIFNRASSHVPLKILSIEDGTVLKAFNHLLHRNKKVDFIEQFNEKLLVKQENENLQILDVRN

Query:  AELMEVSRTEFMTPSAFIFLYENQLFLTFRNRTVAVWNFRGELVTSFEDHLLWHPDCNTNNIYITSDQDLIISYCKADSDDHWMEGNAGSINISNILTGK
        AELMEVSRTEFMTPSAFIFLYENQLFLTFRNRTVAVWNFRGELVTSFEDHLLWHPDCNTNNIYITSDQDLIISYCKADSDDHWMEGNAGSINISNILTGK
Subjt:  AELMEVSRTEFMTPSAFIFLYENQLFLTFRNRTVAVWNFRGELVTSFEDHLLWHPDCNTNNIYITSDQDLIISYCKADSDDHWMEGNAGSINISNILTGK

Query:  CLAKINASNGNPKVEDCSCAGSSKKQCNSSQMRNTVAEALEDITALFYDEERNEIYTGNKNGLVHVWSN
        CLAKINASNGNPKV+D S  G S+KQCNSSQMRNTVAEALEDITALFYDEERNEIYTGNKNGLVHVWSN
Subjt:  CLAKINASNGNPKVEDCSCAGSSKKQCNSSQMRNTVAEALEDITALFYDEERNEIYTGNKNGLVHVWSN

TrEMBL top hits | e value | % identity
A0A6J1DRG8 uncharacterized protein LOC111023160 | 1.1e-267 | 99.15
Query:  MEGRRISASPRPCSGRRILAKKRPRVDGFVNSVKKLQRREICSKRDRAFSMSNAQERFRNMRLMEEYDTHDPKGHCSPVLPFLMKRTKVIEIVAARDIVF
        MEGRRISASPRPCSGRRILAKKRPRVDGFVNSVKKLQRREICSKRDRAFSMSNAQERFRNMRLMEEYDTHDPKGHCSPVLPFLMKRTKVIEIVAARDIVF
Subjt:  MEGRRISASPRPCSGRRILAKKRPRVDGFVNSVKKLQRREICSKRDRAFSMSNAQERFRNMRLMEEYDTHDPKGHCSPVLPFLMKRTKVIEIVAARDIVF

Query:  ALAHSGVCAAFSRETNERICFLNVSPDEVIRSLFYNKNNDSLITVSVYASDNFSSLKCRSTRIEYIRRGKPDAGFALFESESLKWPGFVEFDDVNGKVLT
        ALAHSGVCAAFSRETNERICFLNVSPDEVIRSLFYNKNNDSLITVSVYASDNFSSLKCRSTRIEYIRRGKPDAGFALFESESLKWPGFVEFDDVNGKVLT
Subjt:  ALAHSGVCAAFSRETNERICFLNVSPDEVIRSLFYNKNNDSLITVSVYASDNFSSLKCRSTRIEYIRRGKPDAGFALFESESLKWPGFVEFDDVNGKVLT

Query:  YSAQDSIYKVFDLKNYTMLYSISDKHVQEIKISPGIMLLIFNRASSHVPLKILSIEDGTVLKAFNHLLHRNKKVDFIEQFNEKLLVKQENENLQILDVRN
        YSAQDSIYKVFDLKNYTMLYSISDKHVQEIKISPGIMLLIFNRASSHVPLKILSIEDGTVLKAFNHLLHRNKKVDFIEQFNEKLLVKQENENLQILDVRN
Subjt:  YSAQDSIYKVFDLKNYTMLYSISDKHVQEIKISPGIMLLIFNRASSHVPLKILSIEDGTVLKAFNHLLHRNKKVDFIEQFNEKLLVKQENENLQILDVRN

Query:  AELMEVSRTEFMTPSAFIFLYENQLFLTFRNRTVAVWNFRGELVTSFEDHLLWHPDCNTNNIYITSDQDLIISYCKADSDDHWMEGNAGSINISNILTGK
        AELMEVSRTEFMTPSAFIFLYENQLFLTFRNRTVAVWNFRGELVTSFEDHLLWHPDCNTNNIYITSDQDLIISYCKADSDDHWMEGNAGSINISNILTGK
Subjt:  AELMEVSRTEFMTPSAFIFLYENQLFLTFRNRTVAVWNFRGELVTSFEDHLLWHPDCNTNNIYITSDQDLIISYCKADSDDHWMEGNAGSINISNILTGK

Query:  CLAKINASNGNPKVEDCSCAG---SSKKQCNSSQMRNTVAEALEDITALFYDEERNEIYTGNKNGLVHVWSN
        CLAKINASNGNPKV+DCSCAG   SSKKQCNSSQMRNTVAEALEDITALFYDEERNEIYTGNKNGLVHVWSN
Subjt:  CLAKINASNGNPKVEDCSCAG---SSKKQCNSSQMRNTVAEALEDITALFYDEERNEIYTGNKNGLVHVWSN

A0A6J1EBA4 uncharacterized protein LOC111432451 | 1.1e-265 | 98.72
Query:  MEGRRISASPRPCSGRRILAKKRPRVDGFVNSVKKLQRREICSKRDRAFSMSNAQERFRNMRLMEEYDTHDPKGHCSPVLPFLMKRTKVIEIVAARDIVF
        MEGRRISASPRPCSGRRILAKKR RVDGFVNSVKKLQRREICSKRDRAFSMSNAQERFRNMRLMEEYDTHDPKGHCSPVLPFLMKRTKVIEIVAARDIVF
Subjt:  MEGRRISASPRPCSGRRILAKKRPRVDGFVNSVKKLQRREICSKRDRAFSMSNAQERFRNMRLMEEYDTHDPKGHCSPVLPFLMKRTKVIEIVAARDIVF

Query:  ALAHSGVCAAFSRETNERICFLNVSPDEVIRSLFYNKNNDSLITVSVYASDNFSSLKCRSTRIEYIRRGKPDAGFALFESESLKWPGFVEFDDVNGKVLT
        ALAHSGVCAAFSRETNERICFLNVSPDEVIRSLFYNKNNDSLITVSVYASDNFSSLKCRSTRIEYIRRGKPDAGFALFESESLKWPGFVEFDDVNGKVLT
Subjt:  ALAHSGVCAAFSRETNERICFLNVSPDEVIRSLFYNKNNDSLITVSVYASDNFSSLKCRSTRIEYIRRGKPDAGFALFESESLKWPGFVEFDDVNGKVLT

Query:  YSAQDSIYKVFDLKNYTMLYSISDKHVQEIKISPGIMLLIFNRASSHVPLKILSIEDGTVLKAFNHLLHRNKKVDFIEQFNEKLLVKQENENLQILDVRN
        YSAQDSIYKVFDLKNYTMLYSISDKHVQEIKISPGIMLLIFNRASSHVPLKILSIEDGTVLKAFNHLLHRNKKVDFIEQFNEKLLVKQENENLQILDVRN
Subjt:  YSAQDSIYKVFDLKNYTMLYSISDKHVQEIKISPGIMLLIFNRASSHVPLKILSIEDGTVLKAFNHLLHRNKKVDFIEQFNEKLLVKQENENLQILDVRN

Query:  AELMEVSRTEFMTPSAFIFLYENQLFLTFRNRTVAVWNFRGELVTSFEDHLLWHPDCNTNNIYITSDQDLIISYCKADSDDHWMEGNAGSINISNILTGK
        AELMEVSR+EFMTPSAFIFLYENQLFLTFRNRTVAVWNFRGELVTSFEDHLLWHPDCNTNNIYITSDQDLIISYCKAD DDHWMEGNAGSINISNILTGK
Subjt:  AELMEVSRTEFMTPSAFIFLYENQLFLTFRNRTVAVWNFRGELVTSFEDHLLWHPDCNTNNIYITSDQDLIISYCKADSDDHWMEGNAGSINISNILTGK

Query:  CLAKINASNGNPKVEDCSCAGSSKKQCNSSQMRNTVAEALEDITALFYDEERNEIYTGNKNGLVHVWSN
        CLAKINASNGNPKVEDCSCA  SKKQC+SSQMRNTVAEALEDITALFYDEERNEIYTGNKNGLVHVWSN
Subjt:  CLAKINASNGNPKVEDCSCAGSSKKQCNSSQMRNTVAEALEDITALFYDEERNEIYTGNKNGLVHVWSN

A0A6J1GET4 uncharacterized protein LOC111453540 | 3.1e-265 | 98.29
Query:  MEGRRISASPRPCSGRRILAKKRPRVDGFVNSVKKLQRREICSKRDRAFSMSNAQERFRNMRLMEEYDTHDPKGHCSPVLPFLMKRTKVIEIVAARDIVF
        MEGRRISASPR CSGRRILAKKR RVDGFVNSVKKLQRREICSKRDRAFSMSNAQERFRNMRLMEEYDTHDPKGHCSPVLPFLMKRTKVIEIVAARDIVF
Subjt:  MEGRRISASPRPCSGRRILAKKRPRVDGFVNSVKKLQRREICSKRDRAFSMSNAQERFRNMRLMEEYDTHDPKGHCSPVLPFLMKRTKVIEIVAARDIVF

Query:  ALAHSGVCAAFSRETNERICFLNVSPDEVIRSLFYNKNNDSLITVSVYASDNFSSLKCRSTRIEYIRRGKPDAGFALFESESLKWPGFVEFDDVNGKVLT
        ALAHSGVCAAFSRETNERICFLNVSPDEVIRSLFYNKNNDSLITVSVYASDNFSSLKCRSTRIEYIRRGKPDAGFALFESESLKWPGFVEFDDVNGKVLT
Subjt:  ALAHSGVCAAFSRETNERICFLNVSPDEVIRSLFYNKNNDSLITVSVYASDNFSSLKCRSTRIEYIRRGKPDAGFALFESESLKWPGFVEFDDVNGKVLT

Query:  YSAQDSIYKVFDLKNYTMLYSISDKHVQEIKISPGIMLLIFNRASSHVPLKILSIEDGTVLKAFNHLLHRNKKVDFIEQFNEKLLVKQENENLQILDVRN
        YSAQDSIYKVFDLKNYTMLYSISDKHVQEIKISPGIMLLIFNRASSHVPLKILSIEDGTVLKAFNHLLHRNKKVDFIEQFNEKLLVKQENENLQILDVRN
Subjt:  YSAQDSIYKVFDLKNYTMLYSISDKHVQEIKISPGIMLLIFNRASSHVPLKILSIEDGTVLKAFNHLLHRNKKVDFIEQFNEKLLVKQENENLQILDVRN

Query:  AELMEVSRTEFMTPSAFIFLYENQLFLTFRNRTVAVWNFRGELVTSFEDHLLWHPDCNTNNIYITSDQDLIISYCKADSDDHWMEGNAGSINISNILTGK
        AELMEVSRTEF+TPSAFIFLYENQLFLTFRNRTVAVWNFRGELVTSF+DHLLWHPDCNTNNIYITSDQDLIISYCKADS+DHWMEGNAGSINISNILTGK
Subjt:  AELMEVSRTEFMTPSAFIFLYENQLFLTFRNRTVAVWNFRGELVTSFEDHLLWHPDCNTNNIYITSDQDLIISYCKADSDDHWMEGNAGSINISNILTGK

Query:  CLAKINASNGNPKVEDCSCAGSSKKQCNSSQMRNTVAEALEDITALFYDEERNEIYTGNKNGLVHVWSN
        CLAKINA+NGNPKV+DCSCAGSSK QCNSSQMRNTVAEALEDITALFYDEERNEIYTGNKNGLVHVWSN
Subjt:  CLAKINASNGNPKVEDCSCAGSSKKQCNSSQMRNTVAEALEDITALFYDEERNEIYTGNKNGLVHVWSN

A0A6J1HNY3 uncharacterized protein LOC111465999 | 1.1e-265 | 98.72
Query:  MEGRRISASPRPCSGRRILAKKRPRVDGFVNSVKKLQRREICSKRDRAFSMSNAQERFRNMRLMEEYDTHDPKGHCSPVLPFLMKRTKVIEIVAARDIVF
        MEGRRISASPRPCSGRRILAKKR RVDGFVNSVKKLQRREICSKRDRAFSMSNAQERFRNMRLMEEYDTHDPKGHCSPVLPFLMKRTKVIEIVAARDIVF
Subjt:  MEGRRISASPRPCSGRRILAKKRPRVDGFVNSVKKLQRREICSKRDRAFSMSNAQERFRNMRLMEEYDTHDPKGHCSPVLPFLMKRTKVIEIVAARDIVF

Query:  ALAHSGVCAAFSRETNERICFLNVSPDEVIRSLFYNKNNDSLITVSVYASDNFSSLKCRSTRIEYIRRGKPDAGFALFESESLKWPGFVEFDDVNGKVLT
        ALAHSGVCAAFSRETNERICFLNVSPDEVIRSLFYNKNNDSLITVSVYASDNFSSLKCRSTRIEYIRRGKPDAGFALFESESLKWPGFVEFDDVNGKVLT
Subjt:  ALAHSGVCAAFSRETNERICFLNVSPDEVIRSLFYNKNNDSLITVSVYASDNFSSLKCRSTRIEYIRRGKPDAGFALFESESLKWPGFVEFDDVNGKVLT

Query:  YSAQDSIYKVFDLKNYTMLYSISDKHVQEIKISPGIMLLIFNRASSHVPLKILSIEDGTVLKAFNHLLHRNKKVDFIEQFNEKLLVKQENENLQILDVRN
        YSAQDSIYKVFDLKNYTMLYSISDKHVQEIKISPGIMLLIFNRASSHVPLKILSIEDGTVLKAFNHLLHRNKKVDFIEQFNEKLLVKQENENLQILDVRN
Subjt:  YSAQDSIYKVFDLKNYTMLYSISDKHVQEIKISPGIMLLIFNRASSHVPLKILSIEDGTVLKAFNHLLHRNKKVDFIEQFNEKLLVKQENENLQILDVRN

Query:  AELMEVSRTEFMTPSAFIFLYENQLFLTFRNRTVAVWNFRGELVTSFEDHLLWHPDCNTNNIYITSDQDLIISYCKADSDDHWMEGNAGSINISNILTGK
        AELMEVSR+EFMTPSAFIFLYENQLFLTFRNRTVAVWNFRGELVTSFEDHLLWHPDCNTNNIYITSDQDLIISYCKAD DDHWMEGNAGSINISNILTGK
Subjt:  AELMEVSRTEFMTPSAFIFLYENQLFLTFRNRTVAVWNFRGELVTSFEDHLLWHPDCNTNNIYITSDQDLIISYCKADSDDHWMEGNAGSINISNILTGK

Query:  CLAKINASNGNPKVEDCSCAGSSKKQCNSSQMRNTVAEALEDITALFYDEERNEIYTGNKNGLVHVWSN
        CLAKINASNGNPKVEDCSCA  SKKQC+SSQMRNTVAEALEDITALFYDEERNEIYTGNKNGLVHVWSN
Subjt:  CLAKINASNGNPKVEDCSCAGSSKKQCNSSQMRNTVAEALEDITALFYDEERNEIYTGNKNGLVHVWSN

A0A6J1ILZ5 uncharacterized protein LOC111478225 | 1.2e-264 | 98.08
Query:  MEGRRISASPRPCSGRRILAKKRPRVDGFVNSVKKLQRREICSKRDRAFSMSNAQERFRNMRLMEEYDTHDPKGHCSPVLPFLMKRTKVIEIVAARDIVF
        MEGRRISASPR CSGRRILAKKR RVDGFVNSVKKLQRREICSKRDRAFSMSNAQERFRNMRLMEEYDTHDPKGHCSPVLPFLMKRTKVIEIVAARDIVF
Subjt:  MEGRRISASPRPCSGRRILAKKRPRVDGFVNSVKKLQRREICSKRDRAFSMSNAQERFRNMRLMEEYDTHDPKGHCSPVLPFLMKRTKVIEIVAARDIVF

Query:  ALAHSGVCAAFSRETNERICFLNVSPDEVIRSLFYNKNNDSLITVSVYASDNFSSLKCRSTRIEYIRRGKPDAGFALFESESLKWPGFVEFDDVNGKVLT
        ALAHSGVCAAFSRETNERICFLNVSPDEVIRSLFYNKNNDSLITVSVYASDNFSSLKCRSTRIEYIRRGKPDAGFALFESESLKWPGFVEFDDVNGKVLT
Subjt:  ALAHSGVCAAFSRETNERICFLNVSPDEVIRSLFYNKNNDSLITVSVYASDNFSSLKCRSTRIEYIRRGKPDAGFALFESESLKWPGFVEFDDVNGKVLT

Query:  YSAQDSIYKVFDLKNYTMLYSISDKHVQEIKISPGIMLLIFNRASSHVPLKILSIEDGTVLKAFNHLLHRNKKVDFIEQFNEKLLVKQENENLQILDVRN
        YSAQDSIYKVFDLKNYTMLYSISDKHVQEIKISPGIMLLIFNRASSHVPLKILSIEDGTVLKAFNHLLHRNKKVDFIEQFNEKLLVKQENENLQILDVRN
Subjt:  YSAQDSIYKVFDLKNYTMLYSISDKHVQEIKISPGIMLLIFNRASSHVPLKILSIEDGTVLKAFNHLLHRNKKVDFIEQFNEKLLVKQENENLQILDVRN

Query:  AELMEVSRTEFMTPSAFIFLYENQLFLTFRNRTVAVWNFRGELVTSFEDHLLWHPDCNTNNIYITSDQDLIISYCKADSDDHWMEGNAGSINISNILTGK
        AELMEVSRTEF+TPSAFIFLYENQLFLTFRNRTVAVWNFRGELVTSF+DHLLWHPDCNTNNIYITSDQDLIISYCKADS+DHWMEGNAGSINISNILTGK
Subjt:  AELMEVSRTEFMTPSAFIFLYENQLFLTFRNRTVAVWNFRGELVTSFEDHLLWHPDCNTNNIYITSDQDLIISYCKADSDDHWMEGNAGSINISNILTGK

Query:  CLAKINASNGNPKVEDCSCAGSSKKQCNSSQMRNTVAEALEDITALFYDEERNEIYTGNKNGLVHVWSN
        CLAKINA+NGNPKV DCSCAGSSK QCNS+QMRNTVAEALEDITALFYDEERNEIYTGNKNGLVHVWSN
Subjt:  CLAKINASNGNPKVEDCSCAGSSKKQCNSSQMRNTVAEALEDITALFYDEERNEIYTGNKNGLVHVWSN

SwissProt top hits | e value | % identity
No hits found
Arabidopsis top hits | e value | % identity
AT2G38630.1 Transducin/WD40 repeat-like superfamily protein | 1.3e-231 | 85.53
Query:  MEGRRISASPRPCSG-RRILAKKRPRVDGFVNSVKKLQRREICSKRDRAFSMSNAQERFRNMRLMEEYDTHDPKGHCSPVLPFLMKRTKVIEIVAARDIV
        MEGRRI A+PRPCSG RR++AKKR R DGFVNSVKKLQRREI S+ DRAFS+S AQERFRNMRL+E+YDTHDPKG+C   LP L+KR+KVIEIVAARDIV
Subjt:  MEGRRISASPRPCSG-RRILAKKRPRVDGFVNSVKKLQRREICSKRDRAFSMSNAQERFRNMRLMEEYDTHDPKGHCSPVLPFLMKRTKVIEIVAARDIV

Query:  FALAHSGVCAAFSRETNERICFLNVSPDEVIRSLFYNKNNDSLITVSVYASDNFSSLKCRSTRIEYIRRGKPDAGFALFESESLKWPGFVEFDDVNGKVL
        FAL  SGVCA+FSRETN+++CFLNVSPDEVIRSLFYNKNNDSLITVSVYASDN+SSLKCRSTRIEYI RG+ DAGF LFESESLKWPGFVEFDDVNGKVL
Subjt:  FALAHSGVCAAFSRETNERICFLNVSPDEVIRSLFYNKNNDSLITVSVYASDNFSSLKCRSTRIEYIRRGKPDAGFALFESESLKWPGFVEFDDVNGKVL

Query:  TYSAQDSIYKVFDLKNYTMLYSISDKHVQEIKISPGIMLLIFNRASSHVPLKILSIEDGTVLKAFNHLLHRNKKVDFIEQFNEKLLVKQENENLQILDVR
        TYSAQDS+YKVFDLKNY +LYSISDK+VQEIKISPGIMLLIF RA+SHVPLKILSIEDGT+LK+F+HLLHRNKKVDFIEQFNEKLLVKQENENLQILDVR
Subjt:  TYSAQDSIYKVFDLKNYTMLYSISDKHVQEIKISPGIMLLIFNRASSHVPLKILSIEDGTVLKAFNHLLHRNKKVDFIEQFNEKLLVKQENENLQILDVR

Query:  NAELMEVSRTEFMTPSAFIFLYENQLFLTFRNRTVAVWNFRGELVTSFEDHLLWHPDCNTNNIYITSDQDLIISYCKADSDDHWMEGNAGSINISNILTG
        NAEL+EVSRT+FMTPSAFIFLYENQLFLTFRNR V+VWNFRGELVTSFEDHLLWHPDCNTNNIYITSDQDLIISYCKAD++D W+EGNAGSINISNILTG
Subjt:  NAELMEVSRTEFMTPSAFIFLYENQLFLTFRNRTVAVWNFRGELVTSFEDHLLWHPDCNTNNIYITSDQDLIISYCKADSDDHWMEGNAGSINISNILTG

Query:  KCLAKINASNGNPKVEDCSCAGSSKKQCNSSQMRNTVAEALEDITALFYDEERNEIYTGNKNGLVHVWSN
        KCLAKI A+NG PK EDC    SS    NSS+ R+ VAEALEDITALFYDEERNEIYTGN++GL+HVWSN
Subjt:  KCLAKINASNGNPKVEDCSCAGSSKKQCNSSQMRNTVAEALEDITALFYDEERNEIYTGNKNGLVHVWSN

AT3G54190.1 Transducin/WD40 repeat-like superfamily protein | 2.3e-241 | 89.13
Query:  MEGRRISASPRPCSGRRILAKKRPRVDGFVNSVKKLQRREICSKRDRAFSMSNAQERFRNMRLMEEYDTHDPKGHCSPVLPFLMKRTKVIEIVAARDIVF
        MEGRRI+ASPRPCSGRRI+AKKR R DGFVNSVKKLQRREI S++DRAFS+S AQERFRNMRL+E+YDTHDPKGHC   LPFLMKRTKVIEIVAARDIVF
Subjt:  MEGRRISASPRPCSGRRILAKKRPRVDGFVNSVKKLQRREICSKRDRAFSMSNAQERFRNMRLMEEYDTHDPKGHCSPVLPFLMKRTKVIEIVAARDIVF

Query:  ALAHSGVCAAFSRETNERICFLNVSPDEVIRSLFYNKNNDSLITVSVYASDNFSSLKCRSTRIEYIRRGKPDAGFALFESESLKWPGFVEFDDVNGKVLT
        ALAHSGVCAAFSRE+N+RICFLNVSPDEVIRSLFYNKNNDSLITVSVYASDNFSSLKCRSTRIEYI RG+PDAGFALFESESLKWPGFVEFDDVNGKVLT
Subjt:  ALAHSGVCAAFSRETNERICFLNVSPDEVIRSLFYNKNNDSLITVSVYASDNFSSLKCRSTRIEYIRRGKPDAGFALFESESLKWPGFVEFDDVNGKVLT

Query:  YSAQDSIYKVFDLKNYTMLYSISDKHVQEIKISPGIMLLIFNRASSHVPLKILSIEDGTVLKAFNHLLHRNKKVDFIEQFNEKLLVKQENENLQILDVRN
        YSAQDS+YKVFDLKNYTMLYSISDK+VQEIKISPGIMLLIF RA+SHVPLKILSIEDGTVLK+FNHLLHRNKKVDFIEQFNEKLLVKQENENLQILDVRN
Subjt:  YSAQDSIYKVFDLKNYTMLYSISDKHVQEIKISPGIMLLIFNRASSHVPLKILSIEDGTVLKAFNHLLHRNKKVDFIEQFNEKLLVKQENENLQILDVRN

Query:  AELMEVSRTEFMTPSAFIFLYENQLFLTFRNRTVAVWNFRGELVTSFEDHLLWHPDCNTNNIYITSDQDLIISYCKADSDDHWMEGNAGSINISNILTGK
        AELMEVSR EFMTPSAFIFLYENQLFLTFRNR V+VWNFRGELVTSFEDHLLWHPDCNTNNIYITSDQDLIISYCKAD++D W+EGNAGSINISNILTGK
Subjt:  AELMEVSRTEFMTPSAFIFLYENQLFLTFRNRTVAVWNFRGELVTSFEDHLLWHPDCNTNNIYITSDQDLIISYCKADSDDHWMEGNAGSINISNILTGK

Query:  CLAKINASNGNPKVEDCSCAGSSKKQCNSSQMRNTVAEALEDITALFYDEERNEIYTGNKNGLVHVWSN
        CLAKI  S+G PK ++ S +    K  NS Q RN VAEALEDITALFYDEERNEIYTGN++GLVHVWSN
Subjt:  CLAKINASNGNPKVEDCSCAGSSKKQCNSSQMRNTVAEALEDITALFYDEERNEIYTGNKNGLVHVWSN
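
The % identity values in the hit tables can be related to the alignment midlines. In BLAST-style output a letter on the middle line marks an identical column, `+` marks a conservative substitution, and a space marks a mismatch or gap. The sketch below is an illustration under that convention, not the database's own calculation, and the function name `percent_identity` is hypothetical. It uses the first GenBank hit (XP_022156222.1) and assumes that the first four 100-column blocks are fully identical as displayed, so they are represented by placeholder letters.

```python
def percent_identity(midline: str) -> float:
    """Percent identity from a BLAST-style midline.

    Letters count as identical columns; '+' and spaces do not.
    The alignment length is the full midline length, gaps included.
    """
    if not midline:
        raise ValueError("empty midline")
    identical = sum(1 for ch in midline if ch.isalpha())
    return 100.0 * identical / len(midline)

# Assumption: blocks 1-4 of the first GenBank hit are identical in every column,
# so 400 placeholder letters stand in for them.
full_blocks = "X" * 400

# Last block of the same hit: one '+' column and a three-column gap.
last_block = (
    "CLAKINASNGNPKV" + "+" + "DCSCAG" + " " * 3
    + "SSKKQCNSSQMRNTVAEALEDITALFYDEERNEIYTGNKNGLVHVWSN"
)

if __name__ == "__main__":
    print(f"{percent_identity(full_blocks + last_block):.2f}")
```

With 468 identical columns out of 472, the sketch reproduces the 99.15 reported for that hit.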


Sequences
CDS sequence
ATGGAAGGGAGGAGGATTTCGGCGAGTCCTAGGCCTTGTTCTGGTCGGCGTATTTTGGCTAAGAAACGCCCTCGAGTTGATGGGTTTGTTAATAGTGTGAAGAAGCTGCA
GCGACGGGAGATTTGCTCGAAGCGCGATCGTGCTTTTAGCATGAGTAATGCCCAAGAGAGATTTCGCAACATGCGTTTGATGGAGGAATATGACACTCATGATCCAAAGG
GGCATTGTTCACCAGTTTTACCTTTTCTAATGAAGAGAACAAAAGTAATAGAAATTGTTGCTGCACGTGATATTGTTTTTGCGCTGGCGCATTCTGGTGTTTGTGCAGCT
TTTAGTCGAGAGACAAATGAAAGGATATGCTTCTTGAACGTCAGCCCAGATGAGGTCATTCGGAGTTTGTTTTACAATAAGAACAATGATTCACTTATCACCGTTTCAGT
TTATGCTTCTGATAACTTCAGCTCTTTGAAATGCAGATCTACCAGGATTGAATATATTCGAAGGGGCAAGCCTGATGCTGGTTTTGCTCTTTTTGAGTCTGAATCATTGA
AGTGGCCTGGATTTGTTGAGTTTGATGATGTTAATGGGAAAGTTCTAACCTACTCTGCACAGGATAGTATATACAAGGTGTTTGACCTGAAAAACTATACCATGCTTTAC
TCTATATCAGATAAACATGTTCAAGAAATCAAGATCAGCCCTGGGATCATGCTATTGATTTTCAATAGAGCCAGTAGCCACGTTCCTCTTAAGATTCTATCAATAGAAGA
TGGCACAGTTCTCAAGGCTTTCAACCATTTGCTTCATCGGAACAAGAAAGTGGACTTCATAGAACAATTTAATGAAAAACTTCTTGTGAAGCAGGAAAATGAAAATCTTC
AAATACTTGATGTTCGTAATGCTGAATTGATGGAAGTTAGCAGAACTGAATTTATGACTCCATCAGCGTTTATCTTTCTGTATGAAAACCAATTGTTTCTAACATTTAGA
AACCGAACTGTGGCTGTCTGGAACTTTCGTGGGGAGCTTGTCACTTCATTTGAGGATCATCTTTTATGGCATCCAGACTGCAATACAAATAACATTTATATCACTAGTGA
TCAGGACCTCATTATATCCTACTGTAAAGCTGATTCTGATGATCATTGGATGGAAGGAAATGCTGGTTCGATCAATATCAGTAACATCTTAACTGGAAAATGTCTGGCCA
AGATTAATGCTAGCAACGGAAATCCGAAGGTGGAGGACTGCAGCTGCGCTGGCAGCTCGAAAAAGCAGTGCAACTCTTCACAAATGAGAAACACAGTTGCAGAGGCTTTG
GAAGATATTACTGCACTTTTCTATGATGAAGAGCGTAATGAGATATACACTGGTAACAAGAACGGTCTGGTACATGTGTGGTCTAACTAA
mRNA sequence
TTCTTTTTCCCCCCTTCTTCGTCGTCTTCTTCATCTTTCATCTCAAAGGTTTAGCGATGTCTGGTGTTCAATCCTATCTGACTTCACTTGGACTCCACTGACACACTCCA
ACTCCCTGCTCCGGCGGCCTCACTCCGACGCCTCACCGGAGACACACTCTTCTTCATCTTCCCCTTTTTACTTCCTGGTGAGTTTTTCTCTGCTCTAGGGTTTCCAGATT
TTCCAATTCCACCACCTATTTGCCTCGATCGCCCCGCGTTTTGGTTCCTCTCGCCGATTGACTTCCTTCCGGGGTTTAAACCTTCTGGTTTCCCCGATTTGTTCAAGTAA
TTGGGGGTTTTCTTTTGGAGGGTTATGTGAATTTCGCTCCTGCTTGGAACGACGAGAAGTAGGAGTGTTGTTGTTGGGGGGAAAACTCATGGAAGGGAGGAGGATTTCGG
CGAGTCCTAGGCCTTGTTCTGGTCGGCGTATTTTGGCTAAGAAACGCCCTCGAGTTGATGGGTTTGTTAATAGTGTGAAGAAGCTGCAGCGACGGGAGATTTGCTCGAAG
CGCGATCGTGCTTTTAGCATGAGTAATGCCCAAGAGAGATTTCGCAACATGCGTTTGATGGAGGAATATGACACTCATGATCCAAAGGGGCATTGTTCACCAGTTTTACC
TTTTCTAATGAAGAGAACAAAAGTAATAGAAATTGTTGCTGCACGTGATATTGTTTTTGCGCTGGCGCATTCTGGTGTTTGTGCAGCTTTTAGTCGAGAGACAAATGAAA
GGATATGCTTCTTGAACGTCAGCCCAGATGAGGTCATTCGGAGTTTGTTTTACAATAAGAACAATGATTCACTTATCACCGTTTCAGTTTATGCTTCTGATAACTTCAGC
TCTTTGAAATGCAGATCTACCAGGATTGAATATATTCGAAGGGGCAAGCCTGATGCTGGTTTTGCTCTTTTTGAGTCTGAATCATTGAAGTGGCCTGGATTTGTTGAGTT
TGATGATGTTAATGGGAAAGTTCTAACCTACTCTGCACAGGATAGTATATACAAGGTGTTTGACCTGAAAAACTATACCATGCTTTACTCTATATCAGATAAACATGTTC
AAGAAATCAAGATCAGCCCTGGGATCATGCTATTGATTTTCAATAGAGCCAGTAGCCACGTTCCTCTTAAGATTCTATCAATAGAAGATGGCACAGTTCTCAAGGCTTTC
AACCATTTGCTTCATCGGAACAAGAAAGTGGACTTCATAGAACAATTTAATGAAAAACTTCTTGTGAAGCAGGAAAATGAAAATCTTCAAATACTTGATGTTCGTAATGC
TGAATTGATGGAAGTTAGCAGAACTGAATTTATGACTCCATCAGCGTTTATCTTTCTGTATGAAAACCAATTGTTTCTAACATTTAGAAACCGAACTGTGGCTGTCTGGA
ACTTTCGTGGGGAGCTTGTCACTTCATTTGAGGATCATCTTTTATGGCATCCAGACTGCAATACAAATAACATTTATATCACTAGTGATCAGGACCTCATTATATCCTAC
TGTAAAGCTGATTCTGATGATCATTGGATGGAAGGAAATGCTGGTTCGATCAATATCAGTAACATCTTAACTGGAAAATGTCTGGCCAAGATTAATGCTAGCAACGGAAA
TCCGAAGGTGGAGGACTGCAGCTGCGCTGGCAGCTCGAAAAAGCAGTGCAACTCTTCACAAATGAGAAACACAGTTGCAGAGGCTTTGGAAGATATTACTGCACTTTTCT
ATGATGAAGAGCGTAATGAGATATACACTGGTAACAAGAACGGTCTGGTACATGTGTGGTCTAACTAAAGACCTCATTTTTTTTATTTGTTTTTCTTTCTTTCTTTCTTT
CTTTTTGTTTCCTGAAGTTCGTTTAGACATTGATCAAACTGATGATGATGATTTGATGATGATCCAAGGCGGTTCATATGGTCAAGCCCTAATGCAGATTTGATGGTAGA
AAATAATAGGTCGTCACTTGTAGTTGAATAGAAGGTGTGTTTGTAACATTAATAGATTTTAATGTGTTAATGTATTCTGGTGTAAAGATCTTCCTTGGCTTTACAAGCAT
AGTCAACAATTTAAGATTTTTTCTGTCACTGAAATTTGCTTGTGATTCTTGTAAAGTGCAGAACTTGAGTATAATTTAAATGATTAGTCAGTCCATGAATCAAAA
Protein sequence
MEGRRISASPRPCSGRRILAKKRPRVDGFVNSVKKLQRREICSKRDRAFSMSNAQERFRNMRLMEEYDTHDPKGHCSPVLPFLMKRTKVIEIVAARDIVFALAHSGVCAA
FSRETNERICFLNVSPDEVIRSLFYNKNNDSLITVSVYASDNFSSLKCRSTRIEYIRRGKPDAGFALFESESLKWPGFVEFDDVNGKVLTYSAQDSIYKVFDLKNYTMLY
SISDKHVQEIKISPGIMLLIFNRASSHVPLKILSIEDGTVLKAFNHLLHRNKKVDFIEQFNEKLLVKQENENLQILDVRNAELMEVSRTEFMTPSAFIFLYENQLFLTFR
NRTVAVWNFRGELVTSFEDHLLWHPDCNTNNIYITSDQDLIISYCKADSDDHWMEGNAGSINISNILTGKCLAKINASNGNPKVEDCSCAGSSKKQCNSSQMRNTVAEAL
EDITALFYDEERNEIYTGNKNGLVHVWSN
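
The CDS and protein sequences above can be cross-checked by translation. The following is a minimal sketch that assumes the standard nuclear genetic code. The `translate` helper is hypothetical, and only the first 21 and last 12 nucleotides of the CDS are used here for brevity.

```python
from itertools import product

# Standard genetic code, built in the conventional TCAG codon order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds: str) -> str:
    """Translate a coding sequence in frame 1; '*' marks a stop codon."""
    cds = cds.upper().replace("U", "T")
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Codons with unexpected characters fall back to 'X'.
    return "".join(CODON_TABLE.get(cds[i:i + 3], "X") for i in range(0, len(cds), 3))

if __name__ == "__main__":
    cds_start = "ATGGAAGGGAGGAGGATTTCG"   # first 21 nt of the CDS above
    cds_end = "TGGTCTAACTAA"              # last 12 nt of the CDS above
    print(translate(cds_start))
    print(translate(cds_end))
```

The two fragments translate to `MEGRRIS` and `WSN*`, which agree with the start and end of the protein sequence in this record. Running the full CDS through the same function would be the complete check.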