CuGenDBv2

Lsi06G013160 (gene) of Bottle gourd (USVL1VR-Ls) v1 genome

Gene ID: Lsi06G013160
Organism: Lagenaria siceraria USVL1VR-Ls (Bottle gourd (USVL1VR-Ls) v1)
Description: Protein of unknown function (DUF789)
Genome location: chr06:23739356..23743867
RNA-Seq Expression: Lsi06G013160
Synteny: Lsi06G013160
Gene Ontology terms: NA
InterPro domains: IPR008507 - Protein of unknown function DUF789
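The genome location field above uses a chrom:start..end layout. As a minimal sketch, it can be split into its parts and the span of the gene computed. The function names parse_location and span_length are hypothetical, and the length calculation assumes the coordinates are 1-based and inclusive, which the record itself does not state.

```python
import re


def parse_location(location):
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location string: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start, end):
    # Assumption: coordinates are 1-based and inclusive, so both ends count.
    return end - start + 1


chrom, start, end = parse_location("chr06:23739356..23743867")
print(chrom, start, end, span_length(start, end))
```

Under that assumption the gene spans 4512 bp on chr06; with 0-based or half-open coordinates the figure would differ by one.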


Homology
GenBank top hits (e value, % identity, alignment)
XP_008453426.1 PREDICTED: uncharacterized protein LOC103494138 isoform X1 [Cucumis melo] (e value: 4.2e-201, % identity: 87.66)
Query:  MLGTALQFGGIKGEDRFYIPVRARKNYNQQKQSRRPTKTDETESPSSKVVASTIKPSKQLTPQSKSNLERFLEATRPSVPAQYFSKTTMRDWRTCDIEFQ
        MLGTALQFGGIKGEDRFYIPVRARKNYNQQK SRRPTKTDETES SSKVV  T KP ++LTPQSKSNLERFLEATRPSVPAQYFSKTTMRDWRTCDIEFQ
Subjt:  MLGTALQFGGIKGEDRFYIPVRARKNYNQQKQSRRPTKTDETESPSSKVVASTIKPSKQLTPQSKSNLERFLEATRPSVPAQYFSKTTMRDWRTCDIEFQ

Query:  PYFILNDLWESFKEWSAYGAGVPLVLNGGDSVVQYYVPYLSGIQIYGEASALRPDSNSRLASEDSDLDSFRDTSSDGSIDYEFGKICNFSSEQWVHHHLA
        PYFILNDLWESFKEWSAYGAGVPLVL+GGDSVVQYYVPYLSGIQIYGEA+ALR DSN RLA EDSDLDS RDTSSDGSIDY+ GK  N S EQW H HLA
Subjt:  PYFILNDLWESFKEWSAYGAGVPLVLNGGDSVVQYYVPYLSGIQIYGEASALRPDSNSRLASEDSDLDSFRDTSSDGSIDYEFGKICNFSSEQWVHHHLA

Query:  CENTLKMRNTSLTDEHRTIQEGFSSDDADAGYHRSGLLFQFLEHDLPYQRVPLADKIFELAFQFPGLKTLRSCDILSASWVSVAWYPIYRIPTGPTLKDL
        CEN  KMR TSLTDE + +QEGF SDD DAGY RSGLLFQFLE DLPYQRVPLADKIFELA+QFPGLKTLRSCDIL ASWVSVAWYPIYRIPTGPTLKDL
Subjt:  CENTLKMRNTSLTDEHRTIQEGFSSDDADAGYHRSGLLFQFLEHDLPYQRVPLADKIFELAFQFPGLKTLRSCDILSASWVSVAWYPIYRIPTGPTLKDL

Query:  DACFLTYHSLSTP--GNGHGLPPVMIYPKDIDGIAKVSLPVFGLASYKLKGSIWGQNGVNEHQTANSLMQAADKWLRSLQVVQPDFQFFASHGTYWR
        DACFLTYHSLSTP  GN H LPPVM+YPKDID I K+SLPVFG+ASYKLKGSIWGQNG+N+HQ ANSLMQAADKWLRSLQV QPDFQFF+SHGTYWR
Subjt:  DACFLTYHSLSTP--GNGHGLPPVMIYPKDIDGIAKVSLPVFGLASYKLKGSIWGQNGVNEHQTANSLMQAADKWLRSLQVVQPDFQFFASHGTYWR

XP_022921943.1 uncharacterized protein LOC111430050 [Cucurbita moschata] (e value: 2.5e-198, % identity: 86.4)
Query:  MLGTALQFGGIKGEDRFYIPVRARKNYNQQKQSRRPTKTDETESPSSKVVASTIKPSKQLTPQSKSNLERFLEATRPSVPAQYFSKTTMRDWRTCDIEFQ
        MLGTALQFGGIKGEDRFYIPVRARK+YNQQK SRRPTKTDETE+PSS+VVAST  PSK LTPQSKSNLERFL+AT+PSVPAQYFSKTTMR WRTCDIEFQ
Subjt:  MLGTALQFGGIKGEDRFYIPVRARKNYNQQKQSRRPTKTDETESPSSKVVASTIKPSKQLTPQSKSNLERFLEATRPSVPAQYFSKTTMRDWRTCDIEFQ

Query:  PYFILNDLWESFKEWSAYGAGVPLVLNGGDSVVQYYVPYLSGIQIYGEASALRPDSNSRLASEDSDLDSFRDTSSDGSIDYEFGKICNFSSEQWVHHHLA
        PYF+LNDLWESFKEWSAYGAGVPLVLNGGDSVVQYYVPYLSGIQIYGE++ALR DS SRLA+EDSDLDS +DTSS+GSIDYEFGK CN S EQWVHHHLA
Subjt:  PYFILNDLWESFKEWSAYGAGVPLVLNGGDSVVQYYVPYLSGIQIYGEASALRPDSNSRLASEDSDLDSFRDTSSDGSIDYEFGKICNFSSEQWVHHHLA

Query:  CENTLKMRNTSLTDEHRTIQEGFSSDDADAGYHRSGLLFQFLEHDLPYQRVPLADKIFELAFQFPGLKTLRSCDILSASWVSVAWYPIYRIPTGPTLKDL
        CE+ + MR TSL DEH T QEGFSSDD DA Y RSGLLFQFLE DLPYQRVPLADKIF+LA+QFPGLKTLRSCDIL ASW+SVAWYPIYRIPTGPTLKDL
Subjt:  CENTLKMRNTSLTDEHRTIQEGFSSDDADAGYHRSGLLFQFLEHDLPYQRVPLADKIFELAFQFPGLKTLRSCDILSASWVSVAWYPIYRIPTGPTLKDL

Query:  DACFLTYHSLSTP--GNGHGLPPVMIYPKDIDGIAKVSLPVFGLASYKLKGSIWGQNGVNEHQTANSLMQAADKWLRSLQVVQPDFQFFASHGTYWR
        DACFLTYHSLSTP  GNGHG  P MIYP D DGI KVSLPVFGLASYKLKGSIW QN V EHQ ANSLMQAA+KWLR LQV QPDFQFFASH TYWR
Subjt:  DACFLTYHSLSTP--GNGHGLPPVMIYPKDIDGIAKVSLPVFGLASYKLKGSIWGQNGVNEHQTANSLMQAADKWLRSLQVVQPDFQFFASHGTYWR

XP_023516127.1 uncharacterized protein LOC111780081 [Cucurbita pepo subsp. pepo] (e value: 2.5e-198, % identity: 86.65)
Query:  MLGTALQFGGIKGEDRFYIPVRARKNYNQQKQSRRPTKTDETESPSSKVVASTIKPSKQLTPQSKSNLERFLEATRPSVPAQYFSKTTMRDWRTCDIEFQ
        MLGTALQFGGIKGEDRFYIPVRARK+YNQQK SRRPTKTDETE+PSSKVVAST  PSK LTPQSKSNLERFL+AT+PSVPAQYFSKTTMR WRTCDIEFQ
Subjt:  MLGTALQFGGIKGEDRFYIPVRARKNYNQQKQSRRPTKTDETESPSSKVVASTIKPSKQLTPQSKSNLERFLEATRPSVPAQYFSKTTMRDWRTCDIEFQ

Query:  PYFILNDLWESFKEWSAYGAGVPLVLNGGDSVVQYYVPYLSGIQIYGEASALRPDSNSRLASEDSDLDSFRDTSSDGSIDYEFGKICNFSSEQWVHHHLA
        PYF+LNDLWESFKEWSAYGAGVPLVLNGGDSVVQYYVPYLSGIQIYGE++ALR DS SRLA+EDSDLDS RDTSS+GSIDYEFGK CN S EQWVHHHLA
Subjt:  PYFILNDLWESFKEWSAYGAGVPLVLNGGDSVVQYYVPYLSGIQIYGEASALRPDSNSRLASEDSDLDSFRDTSSDGSIDYEFGKICNFSSEQWVHHHLA

Query:  CENTLKMRNTSLTDEHRTIQEGFSSDDADAGYHRSGLLFQFLEHDLPYQRVPLADKIFELAFQFPGLKTLRSCDILSASWVSVAWYPIYRIPTGPTLKDL
        CE+ + MR TSL DEH T QEGFSSDD DA Y RSGLLFQFLE DLPYQRVPLADKIF+LA+QFPGLKTLRSCDIL ASW+SVAWYPIYRIPTGPTLKDL
Subjt:  CENTLKMRNTSLTDEHRTIQEGFSSDDADAGYHRSGLLFQFLEHDLPYQRVPLADKIFELAFQFPGLKTLRSCDILSASWVSVAWYPIYRIPTGPTLKDL

Query:  DACFLTYHSLSTP--GNGHGLPPVMIYPKDIDGIAKVSLPVFGLASYKLKGSIWGQNGVNEHQTANSLMQAADKWLRSLQVVQPDFQFFASHGTYWR
        DACFLTYHSLSTP  GNGHG  P MIYP D DGI KVSLPVFGLASYKLKGSIW QN V EHQ ANSLMQAA+KWLR LQV QPDFQFFAS+ TYWR
Subjt:  DACFLTYHSLSTP--GNGHGLPPVMIYPKDIDGIAKVSLPVFGLASYKLKGSIWGQNGVNEHQTANSLMQAADKWLRSLQVVQPDFQFFASHGTYWR

XP_031736215.1 uncharacterized protein LOC101215266 [Cucumis sativus] (e value: 8.7e-199, % identity: 85.89)
Query:  MLGTALQFGGIKGEDRFYIPVRARKNYNQQKQSRRPTKTDETESPSSKVVASTIKPSKQLTPQSKSNLERFLEATRPSVPAQYFSKTTMRDWRTCDIEFQ
        MLGT LQFGGIKGEDRFY+PVRARKNYNQQK SR PTKTDETES SSKVV  T KP ++LTPQSKSNLERFLEATRPSVPAQYFSKTTMRDWRTCDIEFQ
Subjt:  MLGTALQFGGIKGEDRFYIPVRARKNYNQQKQSRRPTKTDETESPSSKVVASTIKPSKQLTPQSKSNLERFLEATRPSVPAQYFSKTTMRDWRTCDIEFQ

Query:  PYFILNDLWESFKEWSAYGAGVPLVLNGGDSVVQYYVPYLSGIQIYGEASALRPDSNSRLASEDSDLDSFRDTSSDGSIDYEFGKICNFSSEQWVHHHLA
        PYFILNDLWESFKEWSAYGAGVPLVL+GGDSVVQYYVPYLSGIQIYGEA+ALR DS+ RLA EDSDLDS RDTSSDGSID++ GK  NFS EQW H HLA
Subjt:  PYFILNDLWESFKEWSAYGAGVPLVLNGGDSVVQYYVPYLSGIQIYGEASALRPDSNSRLASEDSDLDSFRDTSSDGSIDYEFGKICNFSSEQWVHHHLA

Query:  CENTLKMRNTSLTDEHRTIQEGFSSDDADAGYHRSGLLFQFLEHDLPYQRVPLADKIFELAFQFPGLKTLRSCDILSASWVSVAWYPIYRIPTGPTLKDL
        CEN LKMR TSLTDEH+ +QEGF SDD DAGY RS LLFQFLE DLPYQRVPLADKIFELA+QFPGLKTL SCDIL ASWVSVAWYPIYRIPTGPTLKDL
Subjt:  CENTLKMRNTSLTDEHRTIQEGFSSDDADAGYHRSGLLFQFLEHDLPYQRVPLADKIFELAFQFPGLKTLRSCDILSASWVSVAWYPIYRIPTGPTLKDL

Query:  DACFLTYHSLSTP--GNGHGLPPVMIYPKDIDGIAKVSLPVFGLASYKLKGSIWGQNGVNEHQTANSLMQAADKWLRSLQVVQPDFQFFASHGTYWR
        DACFLTYHSLSTP  GN H LPP+M+YPKDID I K+SLPVFG+ASYK+KGSIWGQNG+++HQ ANSLMQAADKWLRSLQV QPDFQFF+SHGTYWR
Subjt:  DACFLTYHSLSTP--GNGHGLPPVMIYPKDIDGIAKVSLPVFGLASYKLKGSIWGQNGVNEHQTANSLMQAADKWLRSLQVVQPDFQFFASHGTYWR

XP_038897708.1 uncharacterized protein LOC120085653 [Benincasa hispida] (e value: 4.4e-219, % identity: 94.18)
Query:  MLGTALQFGGIKGEDRFYIPVRARKNYNQQKQSRRPTKTDETESPSSKVVASTIKPSKQLTPQSKSNLERFLEATRPSVPAQYFSKTTMRDWRTCDIEFQ
        MLGT LQFGGIKGEDRFYIP+RARKNYNQQK SRRPTKTDETESPS KV+ASTIKPSKQLTPQSKSNLERFLEATRPSVPAQYFSKTTMRDWRTCDIEFQ
Subjt:  MLGTALQFGGIKGEDRFYIPVRARKNYNQQKQSRRPTKTDETESPSSKVVASTIKPSKQLTPQSKSNLERFLEATRPSVPAQYFSKTTMRDWRTCDIEFQ

Query:  PYFILNDLWESFKEWSAYGAGVPLVLNGGDSVVQYYVPYLSGIQIYGEASALRPDSNSRLASEDSDLDSFRDTSSDGSIDYEFGKICNFSSEQWVHHHLA
        PYFILNDLWESFKEWSAYGAGVPLVLNGGDSV+QYYVPYLSGIQIYGEASA+R DSN RL SEDSDLDS RDTSS+GSIDYEFGKICNFSSEQW HHHLA
Subjt:  PYFILNDLWESFKEWSAYGAGVPLVLNGGDSVVQYYVPYLSGIQIYGEASALRPDSNSRLASEDSDLDSFRDTSSDGSIDYEFGKICNFSSEQWVHHHLA

Query:  CENTLKMRNTSLTDEHRTIQEGFSSDDADAGYHRSGLLFQFLEHDLPYQRVPLADKIFELAFQFPGLKTLRSCDILSASWVSVAWYPIYRIPTGPTLKDL
        CENTLKMR TSLTDEHRTIQ+GFSSDD DAGY RSGLLFQFLE DLPYQRVPLADKIFELA+QFPGLKTLRSCDILSASWVSVAWYPIYRIPTGPTLKDL
Subjt:  CENTLKMRNTSLTDEHRTIQEGFSSDDADAGYHRSGLLFQFLEHDLPYQRVPLADKIFELAFQFPGLKTLRSCDILSASWVSVAWYPIYRIPTGPTLKDL

Query:  DACFLTYHSLSTPGNGHGLPPVMIYPKDIDGIAKVSLPVFGLASYKLKGSIWGQNGVNEHQTANSLMQAADKWLRSLQVVQPDFQFFASHGTYWR
        DACFLTYHSLST GNGHGLPPVMIYPKDID IAKVSLPVFGLASYKLKGSIWGQNG+NEHQTANSLMQAADKWLRSLQV+QPDFQFFASHGTYWR
Subjt:  DACFLTYHSLSTPGNGHGLPPVMIYPKDIDGIAKVSLPVFGLASYKLKGSIWGQNGVNEHQTANSLMQAADKWLRSLQVVQPDFQFFASHGTYWR

TrEMBL top hits (e value, % identity, alignment)
A0A0A0LS49 Uncharacterized protein (e value: 4.2e-199, % identity: 85.89)
Query:  MLGTALQFGGIKGEDRFYIPVRARKNYNQQKQSRRPTKTDETESPSSKVVASTIKPSKQLTPQSKSNLERFLEATRPSVPAQYFSKTTMRDWRTCDIEFQ
        MLGT LQFGGIKGEDRFY+PVRARKNYNQQK SR PTKTDETES SSKVV  T KP ++LTPQSKSNLERFLEATRPSVPAQYFSKTTMRDWRTCDIEFQ
Subjt:  MLGTALQFGGIKGEDRFYIPVRARKNYNQQKQSRRPTKTDETESPSSKVVASTIKPSKQLTPQSKSNLERFLEATRPSVPAQYFSKTTMRDWRTCDIEFQ

Query:  PYFILNDLWESFKEWSAYGAGVPLVLNGGDSVVQYYVPYLSGIQIYGEASALRPDSNSRLASEDSDLDSFRDTSSDGSIDYEFGKICNFSSEQWVHHHLA
        PYFILNDLWESFKEWSAYGAGVPLVL+GGDSVVQYYVPYLSGIQIYGEA+ALR DS+ RLA EDSDLDS RDTSSDGSID++ GK  NFS EQW H HLA
Subjt:  PYFILNDLWESFKEWSAYGAGVPLVLNGGDSVVQYYVPYLSGIQIYGEASALRPDSNSRLASEDSDLDSFRDTSSDGSIDYEFGKICNFSSEQWVHHHLA

Query:  CENTLKMRNTSLTDEHRTIQEGFSSDDADAGYHRSGLLFQFLEHDLPYQRVPLADKIFELAFQFPGLKTLRSCDILSASWVSVAWYPIYRIPTGPTLKDL
        CEN LKMR TSLTDEH+ +QEGF SDD DAGY RS LLFQFLE DLPYQRVPLADKIFELA+QFPGLKTL SCDIL ASWVSVAWYPIYRIPTGPTLKDL
Subjt:  CENTLKMRNTSLTDEHRTIQEGFSSDDADAGYHRSGLLFQFLEHDLPYQRVPLADKIFELAFQFPGLKTLRSCDILSASWVSVAWYPIYRIPTGPTLKDL

Query:  DACFLTYHSLSTP--GNGHGLPPVMIYPKDIDGIAKVSLPVFGLASYKLKGSIWGQNGVNEHQTANSLMQAADKWLRSLQVVQPDFQFFASHGTYWR
        DACFLTYHSLSTP  GN H LPP+M+YPKDID I K+SLPVFG+ASYK+KGSIWGQNG+++HQ ANSLMQAADKWLRSLQV QPDFQFF+SHGTYWR
Subjt:  DACFLTYHSLSTP--GNGHGLPPVMIYPKDIDGIAKVSLPVFGLASYKLKGSIWGQNGVNEHQTANSLMQAADKWLRSLQVVQPDFQFFASHGTYWR

A0A1S3BX10 uncharacterized protein LOC103494138 isoform X1 (e value: 2.0e-201, % identity: 87.66)
Query:  MLGTALQFGGIKGEDRFYIPVRARKNYNQQKQSRRPTKTDETESPSSKVVASTIKPSKQLTPQSKSNLERFLEATRPSVPAQYFSKTTMRDWRTCDIEFQ
        MLGTALQFGGIKGEDRFYIPVRARKNYNQQK SRRPTKTDETES SSKVV  T KP ++LTPQSKSNLERFLEATRPSVPAQYFSKTTMRDWRTCDIEFQ
Subjt:  MLGTALQFGGIKGEDRFYIPVRARKNYNQQKQSRRPTKTDETESPSSKVVASTIKPSKQLTPQSKSNLERFLEATRPSVPAQYFSKTTMRDWRTCDIEFQ

Query:  PYFILNDLWESFKEWSAYGAGVPLVLNGGDSVVQYYVPYLSGIQIYGEASALRPDSNSRLASEDSDLDSFRDTSSDGSIDYEFGKICNFSSEQWVHHHLA
        PYFILNDLWESFKEWSAYGAGVPLVL+GGDSVVQYYVPYLSGIQIYGEA+ALR DSN RLA EDSDLDS RDTSSDGSIDY+ GK  N S EQW H HLA
Subjt:  PYFILNDLWESFKEWSAYGAGVPLVLNGGDSVVQYYVPYLSGIQIYGEASALRPDSNSRLASEDSDLDSFRDTSSDGSIDYEFGKICNFSSEQWVHHHLA

Query:  CENTLKMRNTSLTDEHRTIQEGFSSDDADAGYHRSGLLFQFLEHDLPYQRVPLADKIFELAFQFPGLKTLRSCDILSASWVSVAWYPIYRIPTGPTLKDL
        CEN  KMR TSLTDE + +QEGF SDD DAGY RSGLLFQFLE DLPYQRVPLADKIFELA+QFPGLKTLRSCDIL ASWVSVAWYPIYRIPTGPTLKDL
Subjt:  CENTLKMRNTSLTDEHRTIQEGFSSDDADAGYHRSGLLFQFLEHDLPYQRVPLADKIFELAFQFPGLKTLRSCDILSASWVSVAWYPIYRIPTGPTLKDL

Query:  DACFLTYHSLSTP--GNGHGLPPVMIYPKDIDGIAKVSLPVFGLASYKLKGSIWGQNGVNEHQTANSLMQAADKWLRSLQVVQPDFQFFASHGTYWR
        DACFLTYHSLSTP  GN H LPPVM+YPKDID I K+SLPVFG+ASYKLKGSIWGQNG+N+HQ ANSLMQAADKWLRSLQV QPDFQFF+SHGTYWR
Subjt:  DACFLTYHSLSTP--GNGHGLPPVMIYPKDIDGIAKVSLPVFGLASYKLKGSIWGQNGVNEHQTANSLMQAADKWLRSLQVVQPDFQFFASHGTYWR

A0A5A7USF1 Uncharacterized protein (e value: 2.0e-201, % identity: 87.66)
Query:  MLGTALQFGGIKGEDRFYIPVRARKNYNQQKQSRRPTKTDETESPSSKVVASTIKPSKQLTPQSKSNLERFLEATRPSVPAQYFSKTTMRDWRTCDIEFQ
        MLGTALQFGGIKGEDRFYIPVRARKNYNQQK SRRPTKTDETES SSKVV  T KP ++LTPQSKSNLERFLEATRPSVPAQYFSKTTMRDWRTCDIEFQ
Subjt:  MLGTALQFGGIKGEDRFYIPVRARKNYNQQKQSRRPTKTDETESPSSKVVASTIKPSKQLTPQSKSNLERFLEATRPSVPAQYFSKTTMRDWRTCDIEFQ

Query:  PYFILNDLWESFKEWSAYGAGVPLVLNGGDSVVQYYVPYLSGIQIYGEASALRPDSNSRLASEDSDLDSFRDTSSDGSIDYEFGKICNFSSEQWVHHHLA
        PYFILNDLWESFKEWSAYGAGVPLVL+GGDSVVQYYVPYLSGIQIYGEA+ALR DSN RLA EDSDLDS RDTSSDGSIDY+ GK  N S EQW H HLA
Subjt:  PYFILNDLWESFKEWSAYGAGVPLVLNGGDSVVQYYVPYLSGIQIYGEASALRPDSNSRLASEDSDLDSFRDTSSDGSIDYEFGKICNFSSEQWVHHHLA

Query:  CENTLKMRNTSLTDEHRTIQEGFSSDDADAGYHRSGLLFQFLEHDLPYQRVPLADKIFELAFQFPGLKTLRSCDILSASWVSVAWYPIYRIPTGPTLKDL
        CEN  KMR TSLTDE + +QEGF SDD DAGY RSGLLFQFLE DLPYQRVPLADKIFELA+QFPGLKTLRSCDIL ASWVSVAWYPIYRIPTGPTLKDL
Subjt:  CENTLKMRNTSLTDEHRTIQEGFSSDDADAGYHRSGLLFQFLEHDLPYQRVPLADKIFELAFQFPGLKTLRSCDILSASWVSVAWYPIYRIPTGPTLKDL

Query:  DACFLTYHSLSTP--GNGHGLPPVMIYPKDIDGIAKVSLPVFGLASYKLKGSIWGQNGVNEHQTANSLMQAADKWLRSLQVVQPDFQFFASHGTYWR
        DACFLTYHSLSTP  GN H LPPVM+YPKDID I K+SLPVFG+ASYKLKGSIWGQNG+N+HQ ANSLMQAADKWLRSLQV QPDFQFF+SHGTYWR
Subjt:  DACFLTYHSLSTP--GNGHGLPPVMIYPKDIDGIAKVSLPVFGLASYKLKGSIWGQNGVNEHQTANSLMQAADKWLRSLQVVQPDFQFFASHGTYWR

A0A6J1E577 uncharacterized protein LOC111430050 (e value: 1.2e-198, % identity: 86.4)
Query:  MLGTALQFGGIKGEDRFYIPVRARKNYNQQKQSRRPTKTDETESPSSKVVASTIKPSKQLTPQSKSNLERFLEATRPSVPAQYFSKTTMRDWRTCDIEFQ
        MLGTALQFGGIKGEDRFYIPVRARK+YNQQK SRRPTKTDETE+PSS+VVAST  PSK LTPQSKSNLERFL+AT+PSVPAQYFSKTTMR WRTCDIEFQ
Subjt:  MLGTALQFGGIKGEDRFYIPVRARKNYNQQKQSRRPTKTDETESPSSKVVASTIKPSKQLTPQSKSNLERFLEATRPSVPAQYFSKTTMRDWRTCDIEFQ

Query:  PYFILNDLWESFKEWSAYGAGVPLVLNGGDSVVQYYVPYLSGIQIYGEASALRPDSNSRLASEDSDLDSFRDTSSDGSIDYEFGKICNFSSEQWVHHHLA
        PYF+LNDLWESFKEWSAYGAGVPLVLNGGDSVVQYYVPYLSGIQIYGE++ALR DS SRLA+EDSDLDS +DTSS+GSIDYEFGK CN S EQWVHHHLA
Subjt:  PYFILNDLWESFKEWSAYGAGVPLVLNGGDSVVQYYVPYLSGIQIYGEASALRPDSNSRLASEDSDLDSFRDTSSDGSIDYEFGKICNFSSEQWVHHHLA

Query:  CENTLKMRNTSLTDEHRTIQEGFSSDDADAGYHRSGLLFQFLEHDLPYQRVPLADKIFELAFQFPGLKTLRSCDILSASWVSVAWYPIYRIPTGPTLKDL
        CE+ + MR TSL DEH T QEGFSSDD DA Y RSGLLFQFLE DLPYQRVPLADKIF+LA+QFPGLKTLRSCDIL ASW+SVAWYPIYRIPTGPTLKDL
Subjt:  CENTLKMRNTSLTDEHRTIQEGFSSDDADAGYHRSGLLFQFLEHDLPYQRVPLADKIFELAFQFPGLKTLRSCDILSASWVSVAWYPIYRIPTGPTLKDL

Query:  DACFLTYHSLSTP--GNGHGLPPVMIYPKDIDGIAKVSLPVFGLASYKLKGSIWGQNGVNEHQTANSLMQAADKWLRSLQVVQPDFQFFASHGTYWR
        DACFLTYHSLSTP  GNGHG  P MIYP D DGI KVSLPVFGLASYKLKGSIW QN V EHQ ANSLMQAA+KWLR LQV QPDFQFFASH TYWR
Subjt:  DACFLTYHSLSTP--GNGHGLPPVMIYPKDIDGIAKVSLPVFGLASYKLKGSIWGQNGVNEHQTANSLMQAADKWLRSLQVVQPDFQFFASHGTYWR

A0A6J1JE68 uncharacterized protein LOC111484983 (e value: 5.2e-197, % identity: 86.15)
Query:  MLGTALQFGGIKGEDRFYIPVRARKNYNQQKQSRRPTKTDETESPSSKVVASTIKPSKQLTPQSKSNLERFLEATRPSVPAQYFSKTTMRDWRTCDIEFQ
        MLGTALQFGGIKGEDRFYIPVRARK YNQQK SRRPTKTDETE+PSSKVVAST  PSK LTPQSKSNLERFL+AT+PSVPAQYFSKTTMR WRTCDIEFQ
Subjt:  MLGTALQFGGIKGEDRFYIPVRARKNYNQQKQSRRPTKTDETESPSSKVVASTIKPSKQLTPQSKSNLERFLEATRPSVPAQYFSKTTMRDWRTCDIEFQ

Query:  PYFILNDLWESFKEWSAYGAGVPLVLNGGDSVVQYYVPYLSGIQIYGEASALRPDSNSRLASEDSDLDSFRDTSSDGSIDYEFGKICNFSSEQWVHHHLA
        PYF+LNDLWESFKEWSAYGAGVPLVLNGGDSVVQYYVPYLSGIQIYGE++ALR DS SRLA+EDSDLDS RDTSS+GSIDYEFGK CN S EQWVHHHLA
Subjt:  PYFILNDLWESFKEWSAYGAGVPLVLNGGDSVVQYYVPYLSGIQIYGEASALRPDSNSRLASEDSDLDSFRDTSSDGSIDYEFGKICNFSSEQWVHHHLA

Query:  CENTLKMRNTSLTDEHRTIQEGFSSDDADAGYHRSGLLFQFLEHDLPYQRVPLADKIFELAFQFPGLKTLRSCDILSASWVSVAWYPIYRIPTGPTLKDL
        C++ L +R TSL DEH T QEGFSSDD DA Y RSGLLFQFLE DLPYQRVPLADKIF+LA+QFPGLKTLRSCDIL ASW+SVAWYPIYRIPTGPTLKDL
Subjt:  CENTLKMRNTSLTDEHRTIQEGFSSDDADAGYHRSGLLFQFLEHDLPYQRVPLADKIFELAFQFPGLKTLRSCDILSASWVSVAWYPIYRIPTGPTLKDL

Query:  DACFLTYHSLSTP--GNGHGLPPVMIYPKDIDGIAKVSLPVFGLASYKLKGSIWGQNGVNEHQTANSLMQAADKWLRSLQVVQPDFQFFASHGTYWR
        DACFLTYHSLSTP  GNGHG  P MIYP D DGI KVSLPVFGLASYKLKGSIW QN V E+Q  NSLMQAA+KWLR LQV QPDFQFFASH TYWR
Subjt:  DACFLTYHSLSTP--GNGHGLPPVMIYPKDIDGIAKVSLPVFGLASYKLKGSIWGQNGVNEHQTANSLMQAADKWLRSLQVVQPDFQFFASHGTYWR

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
AT1G15030.1 Protein of unknown function (DUF789) (e value: 1.6e-89, % identity: 54.74)
Query:  SKSNLERFLEATRPSVPAQYFSKTTMRDWRTCDIEFQ-PYFILNDLWESFKEWSAYGAGVPLVLNGG-DSVVQYYVPYLSGIQIYGEASALRPDSNSRLA
        S SN+ERFL++  PSVPA Y SKT +R+    D+E Q PYF+L D+WESF EWSAYG GVPL LN   D V QYYVP LSGIQ+Y +  AL     +R  
Subjt:  SKSNLERFLEATRPSVPAQYFSKTTMRDWRTCDIEFQ-PYFILNDLWESFKEWSAYGAGVPLVLNGG-DSVVQYYVPYLSGIQIYGEASALRPDSNSRLA

Query:  SEDSDLDSFRDTSSDGSIDYEFGKICNFSSEQWVHHHLACENTLKMRNTSLTDEHRTIQEGFSSDDADAGYHRSGLLFQFLEHDLPYQRVPLADKIFELA
         E+S+ D FRD+SS+GS       +C +S EQ          + +M   SL  EH   QE  SSDD +    +  L+F++LE DLPY R P ADK+ +LA
Subjt:  SEDSDLDSFRDTSSDGSIDYEFGKICNFSSEQWVHHHLACENTLKMRNTSLTDEHRTIQEGFSSDDADAGYHRSGLLFQFLEHDLPYQRVPLADKIFELA

Query:  FQFPGLKTLRSCDILSASWVSVAWYPIYRIPTGPTLKDLDACFLTYHSLSTPGNGHGLPP-VMIYPKDIDGIAKVSLPVFGLASYKLKGSIWGQNGVNEH
         +FP LKTLRSCD+L +SW SVAWYPIY+IPTGPTLKDLDACFLTYHSL TP  G G+    M   +  + + K+ LPVFGLASYKL+GS+W   G + H
Subjt:  FQFPGLKTLRSCDILSASWVSVAWYPIYRIPTGPTLKDLDACFLTYHSLSTPGNGHGLPP-VMIYPKDIDGIAKVSLPVFGLASYKLKGSIWGQNGVNEH

Query:  QTANSLMQAADKWLRSLQVVQPDFQFF
        Q ANSL QAAD WLR  QV  PDF FF
Subjt:  QTANSLMQAADKWLRSLQVVQPDFQFF

AT2G01260.1 Protein of unknown function (DUF789) (e value: 6.4e-91, % identity: 50.64)
Query:  MLGTALQF-GGIKGEDRFYIPVRARKNYNQQKQSRRPTKTDETESPSSKVVASTIKPSKQLTPQSKSNLERFLEATRPSVPAQYFSKTTMRDWRTCD--I
        MLG   Q   G  G+D FY   + R+  NQ+    R  ++D +  PSS    S  K   + +  S SNL+RFLE+  PSVPAQ+ SKT +R+ R  D   
Subjt:  MLGTALQF-GGIKGEDRFYIPVRARKNYNQQKQSRRPTKTDETESPSSKVVASTIKPSKQLTPQSKSNLERFLEATRPSVPAQYFSKTTMRDWRTCD--I

Query:  EFQPYFILNDLWESFKEWSAYGAGVPLVLNGG-DSVVQYYVPYLSGIQIYGEASALRPDSNSRLASEDSDLDSFRDTSSDGSIDYEFGKICNFSSEQWVH
        +  PYF+L D+W+SF EWSAYG GVPLVLN   D V+QYYVP LS IQIY  + AL     SR   + SD D FRD+SSD S D         S  + V 
Subjt:  EFQPYFILNDLWESFKEWSAYGAGVPLVLNGG-DSVVQYYVPYLSGIQIYGEASALRPDSNSRLASEDSDLDSFRDTSSDGSIDYEFGKICNFSSEQWVH

Query:  HHLACENTLKMRNTSLTDEHRTIQEGFSSDDADAGYHRSGLLFQFLEHDLPYQRVPLADKIFELAFQFPGLKTLRSCDILSASWVSVAWYPIYRIPTGPT
          + C         SL D+H   QE  SSDD +    +  L+F++LE DLPY R P ADK+ +LA QFP L TLRSCD+L +SW SVAWYPIYRIPTGPT
Subjt:  HHLACENTLKMRNTSLTDEHRTIQEGFSSDDADAGYHRSGLLFQFLEHDLPYQRVPLADKIFELAFQFPGLKTLRSCDILSASWVSVAWYPIYRIPTGPT

Query:  LKDLDACFLTYHSLSTPGNGHGLPPVMIYPKDIDGIAKVSLPVFGLASYKLKGSIWGQNGVNEHQTANSLMQAADKWLRSLQVVQPDFQFF
        LKDLDACFLTYHSL T   G G    M   +  +   K+SLPVFGLASYK +GS+W   G +EHQ  NSL QAADKWL S  V  PDF FF
Subjt:  LKDLDACFLTYHSLSTPGNGHGLPPVMIYPKDIDGIAKVSLPVFGLASYKLKGSIWGQNGVNEHQTANSLMQAADKWLRSLQVVQPDFQFF

AT2G01260.2 Protein of unknown function (DUF789) (e value: 3.7e-70, % identity: 50)
Query:  MLGTALQF-GGIKGEDRFYIPVRARKNYNQQKQSRRPTKTDETESPSSKVVASTIKPSKQLTPQSKSNLERFLEATRPSVPAQYFSKTTMRDWRTCD--I
        MLG   Q   G  G+D FY   + R+  NQ+    R  ++D +  PSS    S  K   + +  S SNL+RFLE+  PSVPAQ+ SKT +R+ R  D   
Subjt:  MLGTALQF-GGIKGEDRFYIPVRARKNYNQQKQSRRPTKTDETESPSSKVVASTIKPSKQLTPQSKSNLERFLEATRPSVPAQYFSKTTMRDWRTCD--I

Query:  EFQPYFILNDLWESFKEWSAYGAGVPLVLNGG-DSVVQYYVPYLSGIQIYGEASALRPDSNSRLASEDSDLDSFRDTSSDGSIDYEFGKICNFSSEQWVH
        +  PYF+L D+W+SF EWSAYG GVPLVLN   D V+QYYVP LS IQIY  + AL     SR   + SD D FRD+SSD S D         S  + V 
Subjt:  EFQPYFILNDLWESFKEWSAYGAGVPLVLNGG-DSVVQYYVPYLSGIQIYGEASALRPDSNSRLASEDSDLDSFRDTSSDGSIDYEFGKICNFSSEQWVH

Query:  HHLACENTLKMRNTSLTDEHRTIQEGFSSDDADAGYHRSGLLFQFLEHDLPYQRVPLADKIFELAFQFPGLKTLRSCDILSASWVSVAWYPIYRIPTGPT
          + C         SL D+H   QE  SSDD +    +  L+F++LE DLPY R P ADK+ +LA QFP L TLRSCD+L +SW SVAWYPIYRIPTGPT
Subjt:  HHLACENTLKMRNTSLTDEHRTIQEGFSSDDADAGYHRSGLLFQFLEHDLPYQRVPLADKIFELAFQFPGLKTLRSCDILSASWVSVAWYPIYRIPTGPT

Query:  LKDLDACFLTYHSLSTPGNG
        LKDLDACFLTYHSL T   G
Subjt:  LKDLDACFLTYHSLSTPGNG

AT4G16100.1 Protein of unknown function (DUF789) (e value: 1.4e-82, % identity: 44.31)
Query:  IKGEDRFYIPVRARKNYNQQKQSR------------------RPTKTDETE-------SPSSKVVASTIKPSKQLTPQSKSNLERFLEATRPSVPAQYFS
        I+GE+RFY P   RK   ++++ R                  R  K +E E       S S   V S +  +   T  + SNL RFL+ T P V  Q+  
Subjt:  IKGEDRFYIPVRARKNYNQQKQSR------------------RPTKTDETE-------SPSSKVVASTIKPSKQLTPQSKSNLERFLEATRPSVPAQYFS

Query:  KTTMRDWRTCDIEFQPYFILNDLWESFKEWSAYGAGVPLVLNGGDSVVQYYVPYLSGIQIYGEASALRPDSNSRLASEDSDLDSFRDTSSDGSIDYEFGK
         T+ + WRT + E++PYF+LNDLW+SF+EWSAYG GVPL+LNG DSVVQYYVPYLSGIQ+Y + S  R  +  R   E+SD DS RD SSDGS D     
Subjt:  KTTMRDWRTCDIEFQPYFILNDLWESFKEWSAYGAGVPLVLNGGDSVVQYYVPYLSGIQIYGEASALRPDSNSRLASEDSDLDSFRDTSSDGSIDYEFGK

Query:  ICNFSSEQWVHHHLACENTLKMRNTSLTDEHRTIQEGFSSDDADAGYHRSG-LLFQFLEHDLPYQRVPLADKIFELAFQFPGLKTLRSCDILSASWVSVA
                       C    +    +  +E   I  G SSD+++A  +  G L+F++LE  +P+ R PL DKI  L+ QFP L+T RSCD+  +SWVSVA
Subjt:  ICNFSSEQWVHHHLACENTLKMRNTSLTDEHRTIQEGFSSDDADAGYHRSG-LLFQFLEHDLPYQRVPLADKIFELAFQFPGLKTLRSCDILSASWVSVA

Query:  WYPIYRIPTGPTLKDLDACFLTYHSLSTPGNGHGLPPVMIYPKDIDGIAKVSLPVFGLASYKLKGSIWG-QNGVNEHQTANSLMQAADKWLRSLQVVQPD
        WYPIYRIP G +L++LDACFLT+HSLSTP  G          K +   AK+ LP FGLASYK K S W  ++ V+E+Q   +L++ A++WLR L+V+ PD
Subjt:  WYPIYRIPTGPTLKDLDACFLTYHSLSTPGNGHGLPPVMIYPKDIDGIAKVSLPVFGLASYKLKGSIWG-QNGVNEHQTANSLMQAADKWLRSLQVVQPD

Query:  FQFFASH-GTYWR
        F+ F SH G+ WR
Subjt:  FQFFASH-GTYWR

AT5G49220.1 Protein of unknown function (DUF789) (e value: 1.1e-71, % identity: 39.22)
Query:  GTALQFGGIKGEDRFYIPVRARKN------YNQQKQSRRPTKTDETESPSSKVVASTIKP-------------SKQLTPQSK------------------
        G ++    I+GE+RFY P   R+         Q ++ +R    DE      +  A+T+ P             S+ +   S+                  
Subjt:  GTALQFGGIKGEDRFYIPVRARKN------YNQQKQSRRPTKTDETESPSSKVVASTIKP-------------SKQLTPQSK------------------

Query:  -SNLERFLEATRPSVPAQYFSKTTMRDWRTCDIEFQPYFILNDLWESFKEWSAYGAGV-----PLVLNGGDSVVQYYVPYLSGIQIYGEASALRPDSNSR
         SNL+RFLE T P VPA+ F   +  + +T + +   YF+L DLWESF EWSAYGAGV     PL ++G DS VQYYVPYLSGIQ+Y     L+   N  
Subjt:  -SNLERFLEATRPSVPAQYFSKTTMRDWRTCDIEFQPYFILNDLWESFKEWSAYGAGV-----PLVLNGGDSVVQYYVPYLSGIQIYGEASALRPDSNSR

Query:  LASEDSDLDSFRDTSSDGSIDYEFGKICNFSSEQWVHHHLACENTLKMRNTSLTDEHRTIQEGFSSDDADAGYHRSGLLFQFLEHDLPYQRVPLADKIFE
          +E S   S    S    +D   G++                N + +++ S+T          SS +A+    +  LLF++LE++ P+ R PLA+KI +
Subjt:  LASEDSDLDSFRDTSSDGSIDYEFGKICNFSSEQWVHHHLACENTLKMRNTSLTDEHRTIQEGFSSDDADAGYHRSGLLFQFLEHDLPYQRVPLADKIFE

Query:  LAFQFPGLKTLRSCDILSASWVSVAWYPIYRIPTGPTLKDLDACFLTYHSLSTPGNGHGLPPVMIYPKDIDGIAKVSLPVFGLASYKLKGSIWGQNGVNE
        LA + P L T RSCD+L +SWVSV+WYPIYRIP GPTL++LDACFLT+HSLST       P   +   D     K+ LP FGLASYKLK S+W QN + E
Subjt:  LAFQFPGLKTLRSCDILSASWVSVAWYPIYRIPTGPTLKDLDACFLTYHSLSTPGNGHGLPPVMIYPKDIDGIAKVSLPVFGLASYKLKGSIWGQNGVNE

Query:  HQTANSLMQAADKWLRSLQVVQPDFQFFASHGTYWR
         Q   SL+QAADKWL+ LQV  PD++FF S+    R
Subjt:  HQTANSLMQAADKWLRSLQVVQPDFQFFASHGTYWR
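
In the alignment blocks above, the unlabeled middle line carries a letter where query and subject residues are identical, a "+" for a conservative substitution, and a space for a mismatch or gap. The sketch below estimates percent identity from one block, assuming identity is the number of identical columns divided by the alignment length including gap columns. The function name percent_identity is hypothetical, and the input is only the first ten columns of the Benincasa hispida alignment, so the result is not the whole-alignment figure reported for that hit.

```python
def percent_identity(query, midline):
    """Percent of alignment columns whose midline character is a letter.

    Assumption: the alignment length is the length of the query line,
    gap columns included. The midline is padded because trailing spaces
    are often lost when alignments are copied as text.
    """
    length = len(query)
    midline = midline.ljust(length)[:length]
    identical = sum(1 for ch in midline if ch.isalpha())
    return 100.0 * identical / length


# First ten columns of the Benincasa hispida hit (query line, then midline).
print(percent_identity("MLGTALQFGG", "MLGT LQFGG"))
```

For a full hit, the query and midline lines of every block would need to be concatenated, without their "Query:" prefixes, before calling the function.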


Sequences
CDS sequence
ATGTTGGGAACTGCGTTGCAGTTTGGGGGAATCAAAGGTGAGGATCGGTTTTATATTCCGGTAAGGGCACGAAAGAATTATAATCAGCAAAAGCAATCGAGGAGACCTAC
CAAGACCGATGAAACTGAGAGCCCATCGAGTAAAGTTGTGGCTTCTACTATAAAGCCTTCTAAGCAATTAACTCCTCAGTCTAAGAGCAACTTAGAGAGATTCTTGGAGG
CCACAAGGCCTTCAGTTCCAGCGCAGTATTTCTCTAAGACAACTATGAGGGATTGGAGGACTTGTGACATTGAGTTTCAACCTTATTTCATTCTGAATGATCTGTGGGAG
TCTTTCAAGGAGTGGAGTGCATATGGCGCTGGAGTTCCTTTAGTTCTTAATGGAGGTGATTCTGTTGTTCAATATTACGTTCCATATTTGTCTGGTATCCAAATATATGG
TGAAGCTTCTGCACTGAGACCAGATTCTAACTCCAGGCTGGCTAGTGAGGACAGTGATCTTGACTCTTTTAGAGATACAAGCAGCGATGGAAGCATTGACTATGAGTTTG
GAAAAATCTGTAACTTTTCTAGTGAACAGTGGGTTCATCACCATCTAGCTTGTGAAAACACACTGAAAATGAGAAATACGTCTTTAACTGATGAACATAGAACGATACAA
GAAGGTTTTTCGAGTGATGATGCGGATGCAGGATATCATCGAAGTGGTTTGCTCTTTCAGTTTCTTGAGCATGATCTTCCTTATCAACGTGTACCATTGGCTGATAAAAT
ATTTGAACTTGCTTTCCAATTTCCTGGTTTGAAAACGTTAAGAAGTTGTGATATCCTGTCAGCCAGCTGGGTCTCTGTAGCATGGTACCCTATTTACCGTATACCCACCG
GTCCGACATTAAAAGATTTGGATGCTTGCTTCTTAACATATCATTCCCTTTCCACACCAGGTAATGGACATGGTCTGCCACCAGTAATGATATATCCAAAGGACATTGAT
GGTATCGCAAAGGTCTCCTTGCCTGTTTTTGGATTGGCTTCTTATAAGCTGAAAGGATCGATATGGGGGCAAAATGGCGTCAACGAGCATCAAACGGCAAATTCTCTCAT
GCAGGCAGCAGATAAGTGGCTGAGGAGCCTTCAGGTCGTTCAACCTGATTTTCAGTTCTTTGCATCACATGGGACATACTGGAGATGA
mRNA sequence
TTTCTATAAATTTTGCTTTTCTCACCATTATATCATCTTCTTCTTTTATAACACACTCTCTGGTTCAAGCTTCTTCCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCT
CTCTCTCTGCACTTTCCTTCTCGTCTGGTTTCTTAATTCACACAAAACGCATTTGGGTCATTCATTGCGTGCATTCTTCAAATCTTTTCTACGAAGTTTTTGAATCGAGT
TTCACTTCTCTTCTTCGTTCAATCGATTTGAAATCCTCTGTTATTTTGGGTTCTTTCTGATCTGATATTGTATTGTCAACATCCTCCATTCAATTCATTGTTCTTGAGAC
AAGATTTCGAGTCTTGTTTTACAAGAAGAAAGCTGGGTTTAAAGAATTTGCAACCCTTTTCTTGTAAACGCCATTTTCATCGGCTTTGTTCTACTTGGGGGAGTTTAAAG
AAACCGATATAATCTTCTTTTGTTCTGGTGGGTTGAGTTCTTTTTTCCCATTTTAGAGAAATTTGGGGTTTCCCATAATAATCTGTGTAATTTCCTCCGTTTTCCATTGA
TACCGAACGGAGGATTCTTCTTTTGGGTTAGAAATGTTGGGAACTGCGTTGCAGTTTGGGGGAATCAAAGGTGAGGATCGGTTTTATATTCCGGTAAGGGCACGAAAGAA
TTATAATCAGCAAAAGCAATCGAGGAGACCTACCAAGACCGATGAAACTGAGAGCCCATCGAGTAAAGTTGTGGCTTCTACTATAAAGCCTTCTAAGCAATTAACTCCTC
AGTCTAAGAGCAACTTAGAGAGATTCTTGGAGGCCACAAGGCCTTCAGTTCCAGCGCAGTATTTCTCTAAGACAACTATGAGGGATTGGAGGACTTGTGACATTGAGTTT
CAACCTTATTTCATTCTGAATGATCTGTGGGAGTCTTTCAAGGAGTGGAGTGCATATGGCGCTGGAGTTCCTTTAGTTCTTAATGGAGGTGATTCTGTTGTTCAATATTA
CGTTCCATATTTGTCTGGTATCCAAATATATGGTGAAGCTTCTGCACTGAGACCAGATTCTAACTCCAGGCTGGCTAGTGAGGACAGTGATCTTGACTCTTTTAGAGATA
CAAGCAGCGATGGAAGCATTGACTATGAGTTTGGAAAAATCTGTAACTTTTCTAGTGAACAGTGGGTTCATCACCATCTAGCTTGTGAAAACACACTGAAAATGAGAAAT
ACGTCTTTAACTGATGAACATAGAACGATACAAGAAGGTTTTTCGAGTGATGATGCGGATGCAGGATATCATCGAAGTGGTTTGCTCTTTCAGTTTCTTGAGCATGATCT
TCCTTATCAACGTGTACCATTGGCTGATAAAATATTTGAACTTGCTTTCCAATTTCCTGGTTTGAAAACGTTAAGAAGTTGTGATATCCTGTCAGCCAGCTGGGTCTCTG
TAGCATGGTACCCTATTTACCGTATACCCACCGGTCCGACATTAAAAGATTTGGATGCTTGCTTCTTAACATATCATTCCCTTTCCACACCAGGTAATGGACATGGTCTG
CCACCAGTAATGATATATCCAAAGGACATTGATGGTATCGCAAAGGTCTCCTTGCCTGTTTTTGGATTGGCTTCTTATAAGCTGAAAGGATCGATATGGGGGCAAAATGG
CGTCAACGAGCATCAAACGGCAAATTCTCTCATGCAGGCAGCAGATAAGTGGCTGAGGAGCCTTCAGGTCGTTCAACCTGATTTTCAGTTCTTTGCATCACATGGGACAT
ACTGGAGATGACAGGAGATGACTCAAGATATAATCTACTTCTCTGCTGCCCATTCGAACTATCAAAATAACAAATTTGTGCCCTTATCAACTATTTGACCTGCCTTCAAT
ACTGGTAAGGATATATTTTGATTTTTGGTGGTTTCTTCTTTCTGGAAATACATTTCGTGGAGGCAGTTAAGAAAAATGTTCGCAGCAGCTGGTTAGAACCGTGTGTAAAA
TGGGGGAGGAGAGTAATAAAACTGTGAAATGGGAGTAGTAGCAGAACAGGGCTGATGTGAAGTGGAATTTTGAAGGGCAAAAGCAATATGGAGATTGGAAACTGATGGCA
ATCAAGCTGAAGGTTGTCTATCATTCTACACTGACCTTGGAAACTTGTGAGAACTCATTCTCTATTTCTACACTCCCATAAAATTTAGAGTCATTGTTTGTCTTGTTTTG
TGAACTTCAAGAAGTACAAATTTCCATGTATATTACATATGTTTTGATGTTGGGCATCTTTTTTAGATTCACACAAGTATAGATGAGAAATATTAGATGGAATTCTATAT
TGTAAGAAGAAATATTATATTTGTAGCATTTTGTAGTTTCGAATATGCATCATTCTGGATTGAACAGGAATCATATTCCCTCCCCATCCAACATTGGTAGAATTTGTACC
ATTTAAAATATTCAGAAGGGAATGTGGGAATGGAGAATCAGTGTTCTTTTCCCCCTTGATTGATGTGAAACCGTCTAGGAATAGTATTCTGTTTGTG
Protein sequence
MLGTALQFGGIKGEDRFYIPVRARKNYNQQKQSRRPTKTDETESPSSKVVASTIKPSKQLTPQSKSNLERFLEATRPSVPAQYFSKTTMRDWRTCDIEFQPYFILNDLWE
SFKEWSAYGAGVPLVLNGGDSVVQYYVPYLSGIQIYGEASALRPDSNSRLASEDSDLDSFRDTSSDGSIDYEFGKICNFSSEQWVHHHLACENTLKMRNTSLTDEHRTIQ
EGFSSDDADAGYHRSGLLFQFLEHDLPYQRVPLADKIFELAFQFPGLKTLRSCDILSASWVSVAWYPIYRIPTGPTLKDLDACFLTYHSLSTPGNGHGLPPVMIYPKDID
GIAKVSLPVFGLASYKLKGSIWGQNGVNEHQTANSLMQAADKWLRSLQVVQPDFQFFASHGTYWR
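
The CDS and protein sequences above can be cross-checked by translating the CDS with the standard genetic code. The sketch below builds the codon table from the conventional TCAG ordering and translates until the first stop codon. The name translate is hypothetical, and the standard nuclear code is an assumption, since the record does not name a translation table. Only the first 39 nucleotides of the CDS are used here for brevity.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code, amino acids listed in TCAG codon order.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 13 codons of the CDS sequence listed above.
print(translate("ATGTTGGGAACTGCGTTGCAGTTTGGGGGAATCAAAGGT"))
```

The output can be compared with the start of the protein sequence (MLGTALQFGGIKG). Passing the full CDS, line breaks included, should work the same way.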