; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MC11g0819 (gene) of Bitter gourd (Dali-11) v1 genome

Gene IDMC11g0819
OrganismMomordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
DescriptionNHL domain-containing protein
Genome locationMC11:6949698..6952144
RNA-Seq ExpressionMC11g0819
SyntenyMC11g0819
Gene Ontology termsGO:0005515 - protein binding (molecular function)
InterPro domainsIPR001258 - NHL repeat
IPR011042 - Six-bladed beta-propeller, TolB-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004146020.1 uncharacterized protein LOC101206392 isoform X1 [Cucumis sativus]2.87e-24178.31Show/hide
Query:  SGPLARHLSSLLKWTGSSSKTPQPDGNAIQFESGYLVETIVEGNEIGMVPHKIRVSEDGELFAVDLLSSNVVKVSPPLSRYSRARLVAGSFQGYKGHADG
        SGPLARHLSSLLKWTGSS KTPQPDGNAIQFESGYLVETIVEGNEIGMVP+KIRVSEDGELFAVD ++SNVVKVSPPLSRYSRARLVAGSFQGYKGH DG
Subjt:  SGPLARHLSSLLKWTGSSSKTPQPDGNAIQFESGYLVETIVEGNEIGMVPHKIRVSEDGELFAVDLLSSNVVKVSPPLSRYSRARLVAGSFQGYKGHADG

Query:  KPSDARFNHPKGVALDDNGNVYVADTSNLAIRKIVDSGVTTIAGGKTNVPGYSDGPGEEAKFSNDFDIIYVRRSCSLLVVDRGNAALRQISLNKEDCDNQ
        KPSDARFN PKG+ +DD GNVYVADT NLAIRKIVD+GVTTIAGGKTNVPGYSDGPGEEAKFSNDFD+IYVRR+CSLLVVDRGNAALRQISLNKEDCD Q
Subjt:  KPSDARFNHPKGVALDDNGNVYVADTSNLAIRKIVDSGVTTIAGGKTNVPGYSDGPGEEAKFSNDFDIIYVRRSCSLLVVDRGNAALRQISLNKEDCDNQ

Query:  YGSVSTSALFLSFLTDEPFADVAMFIGALFVGYVTYMLQHGFGLSFFTLM--SEHSDTETKEPSKGKHTQLVDHMKEETWWVSFGRAIMELSKQLIETLP
        YGSVSTS             DVAMFIGAL +GY TYMLQHGF LSFFT M  SEH +TETKE SKGK T+LV  +KEETWW SFG+ + EL KQ IE LP
Subjt:  YGSVSTSALFLSFLTDEPFADVAMFIGALFVGYVTYMLQHGFGLSFFTLM--SEHSDTETKEPSKGKHTQLVDHMKEETWWVSFGRAIMELSKQLIETLP

Query:  DNLKSFVPSYFRSDENHKKGLTPLKDALKMPKDEIKNNVSLKKRIITPTSETKHASFSNAHAKSDEMKPPKMKSSSSKTPPSLLNKHSHSGQQEYAEFYG
         NLKSF+  YFRS++N +KGLTPLKDALKMP+DEIK NVSLK++ +TP SETKHAS      K DE+KPPKMKSS     PSLLNKHSHSGQ EYAEFYG
Subjt:  DNLKSFVPSYFRSDENHKKGLTPLKDALKMPKDEIKNNVSLKKRIITPTSETKHASFSNAHAKSDEMKPPKMKSSSSKTPPSLLNKHSHSGQQEYAEFYG

Query:  SSLVSSSHSRSKGQKDRSRHRQKEKGSELLAAVLSAEPKPAEMKTDYSDAKFGQFNFRNKY
        + +VSSS SRSKGQKDRSRHRQKEKG ++L   L AEPK AEM+TDY++ KF Q+N RNKY
Subjt:  SSLVSSSHSRSKGQKDRSRHRQKEKGSELLAAVLSAEPKPAEMKTDYSDAKFGQFNFRNKY

XP_011654037.1 uncharacterized protein LOC101206392 isoform X2 [Cucumis sativus]2.81e-24378.65Show/hide
Query:  SGPLARHLSSLLKWTGSSSKTPQPDGNAIQFESGYLVETIVEGNEIGMVPHKIRVSEDGELFAVDLLSSNVVKVSPPLSRYSRARLVAGSFQGYKGHADG
        SGPLARHLSSLLKWTGSS KTPQPDGNAIQFESGYLVETIVEGNEIGMVP+KIRVSEDGELFAVD ++SNVVKVSPPLSRYSRARLVAGSFQGYKGH DG
Subjt:  SGPLARHLSSLLKWTGSSSKTPQPDGNAIQFESGYLVETIVEGNEIGMVPHKIRVSEDGELFAVDLLSSNVVKVSPPLSRYSRARLVAGSFQGYKGHADG

Query:  KPSDARFNHPKGVALDDNGNVYVADTSNLAIRKIVDSGVTTIAGGKTNVPGYSDGPGEEAKFSNDFDIIYVRRSCSLLVVDRGNAALRQISLNKEDCDNQ
        KPSDARFN PKG+ +DD GNVYVADT NLAIRKIVD+GVTTIAGGKTNVPGYSDGPGEEAKFSNDFD+IYVRR+CSLLVVDRGNAALRQISLNKEDCD Q
Subjt:  KPSDARFNHPKGVALDDNGNVYVADTSNLAIRKIVDSGVTTIAGGKTNVPGYSDGPGEEAKFSNDFDIIYVRRSCSLLVVDRGNAALRQISLNKEDCDNQ

Query:  YGSVSTSALFLSFLTDEPFADVAMFIGALFVGYVTYMLQHGFGLSFFTLMSEHSDTETKEPSKGKHTQLVDHMKEETWWVSFGRAIMELSKQLIETLPDN
        YGSVSTS             DVAMFIGAL +GY TYMLQHGF LSFFT MSEH +TETKE SKGK T+LV  +KEETWW SFG+ + EL KQ IE LP N
Subjt:  YGSVSTSALFLSFLTDEPFADVAMFIGALFVGYVTYMLQHGFGLSFFTLMSEHSDTETKEPSKGKHTQLVDHMKEETWWVSFGRAIMELSKQLIETLPDN

Query:  LKSFVPSYFRSDENHKKGLTPLKDALKMPKDEIKNNVSLKKRIITPTSETKHASFSNAHAKSDEMKPPKMKSSSSKTPPSLLNKHSHSGQQEYAEFYGSS
        LKSF+  YFRS++N +KGLTPLKDALKMP+DEIK NVSLK++ +TP SETKHAS      K DE+KPPKMKSS     PSLLNKHSHSGQ EYAEFYG+ 
Subjt:  LKSFVPSYFRSDENHKKGLTPLKDALKMPKDEIKNNVSLKKRIITPTSETKHASFSNAHAKSDEMKPPKMKSSSSKTPPSLLNKHSHSGQQEYAEFYGSS

Query:  LVSSSHSRSKGQKDRSRHRQKEKGSELLAAVLSAEPKPAEMKTDYSDAKFGQFNFRNKY
        +VSSS SRSKGQKDRSRHRQKEKG ++L   L AEPK AEM+TDY++ KF Q+N RNKY
Subjt:  LVSSSHSRSKGQKDRSRHRQKEKGSELLAAVLSAEPKPAEMKTDYSDAKFGQFNFRNKY

XP_022142855.1 uncharacterized protein LOC111012863 [Momordica charantia]2.94e-31196.53Show/hide
Query:  SGPLARHLSSLLKWTGSSSKTPQPDGNAIQFESGYLVETIVEGNEIGMVPHKIRVSEDGELFAVDLLSSNVVKVSPPLSRYSRARLVAGSFQGYKGHADG
        SGPLARHLSSLLKWTGSSSKTPQPDGNAIQFESGYLVETIVEGNEIGMVPHKIRVSEDGELFAVDLLSSNVVKVSPPLSRYSRARLVAGSFQGYKGH DG
Subjt:  SGPLARHLSSLLKWTGSSSKTPQPDGNAIQFESGYLVETIVEGNEIGMVPHKIRVSEDGELFAVDLLSSNVVKVSPPLSRYSRARLVAGSFQGYKGHADG

Query:  KPSDARFNHPKGVALDDNGNVYVADTSNLAIRKIVDSGVTTIAGGKTNVPGYSDGPGEEAKFSNDFDIIYVRRSCSLLVVDRGNAALRQISLNKEDCDNQ
        KPSDARFNHPKGVALDDNGNVYVADTSNLAIRKIVDSGVTTIAGGKTNVPGYSDGPGEEAKFSNDFDIIYVRRSCSLLVVDRGNAALRQISLNKEDCDNQ
Subjt:  KPSDARFNHPKGVALDDNGNVYVADTSNLAIRKIVDSGVTTIAGGKTNVPGYSDGPGEEAKFSNDFDIIYVRRSCSLLVVDRGNAALRQISLNKEDCDNQ

Query:  YGSVSTSALFLSFLTDEPFADVAMFIGALFVGYVTYMLQHGFGLSFFTLM--SEHSDTETKEPSKGKHTQLVDHMKEETWWVSFGRAIMELSKQLIETLP
        YGSVSTS             DVAMFIGALFVGYVTYMLQHGFGLSFFTLM  SEHSDTETKEPSKGKHTQLVDHMKEETWWVSFGRAIMELSKQLIETLP
Subjt:  YGSVSTSALFLSFLTDEPFADVAMFIGALFVGYVTYMLQHGFGLSFFTLM--SEHSDTETKEPSKGKHTQLVDHMKEETWWVSFGRAIMELSKQLIETLP

Query:  DNLKSFVPSYFRSDENHKKGLTPLKDALKMPKDEIKNNVSLKKRIITPTSETKHASFSNAHAKSDEMKPPKMKSSSSKTPPSLLNKHSHSGQQEYAEFYG
        DNLKSFVPSYFRSDENHKKGLTPLKDALKMPKDEIKNNVSLKKRIITPTSETKHASFSNAHAKSDEMKPPKMKSSSSKTPPSLLNKHSHSGQQEYAEFYG
Subjt:  DNLKSFVPSYFRSDENHKKGLTPLKDALKMPKDEIKNNVSLKKRIITPTSETKHASFSNAHAKSDEMKPPKMKSSSSKTPPSLLNKHSHSGQQEYAEFYG

Query:  SSLVSSSHSRSKGQKDRSRHRQKEKGSELLAAVLSAEPKPAEMKTDYSDAKFGQFNFRNKY
        SSLVSSSHSRSKGQKDRSRHRQKEKGSELLAAVLSAEPKPAEMKTDYSDAKFGQFNFRNKY
Subjt:  SSLVSSSHSRSKGQKDRSRHRQKEKGSELLAAVLSAEPKPAEMKTDYSDAKFGQFNFRNKY

XP_038900080.1 uncharacterized protein LOC120087236 isoform X1 [Benincasa hispida]3.19e-24477.87Show/hide
Query:  SGPLARHLSSLLKWTGSSSKTPQPDGNAIQFESGYLVETIVEGNEIGMVPHKIRVSEDGELFAVDLLSSNVVKVSPPLSRYSRARLVAGSFQGYKGHADG
        SGPLARHLSSLLKWTGSSSKTPQPDGNA+QFESGYLVETIVEGNEIGMVP+KIRVSEDGELFAVD ++SNVVKVSPPLSRYSRARLVAGSFQGYKGH DG
Subjt:  SGPLARHLSSLLKWTGSSSKTPQPDGNAIQFESGYLVETIVEGNEIGMVPHKIRVSEDGELFAVDLLSSNVVKVSPPLSRYSRARLVAGSFQGYKGHADG

Query:  KPSDARFNHPKGVALDDNGNVYVADTSNLAIRKIVDSGVTTIAGGKTNVPGYSDGPGEEAKFSNDFDIIYVRRSCSLLVVDRGNAALRQISLNKEDCDNQ
        KPSDARFN PKGV +DD GNVYVADT NL IRKIVD+GVTTIAGGKTN+PGYSDGPGEEAKFSNDFDIIYVRR+CSLLVVDRGNAALRQISLNKEDCD Q
Subjt:  KPSDARFNHPKGVALDDNGNVYVADTSNLAIRKIVDSGVTTIAGGKTNVPGYSDGPGEEAKFSNDFDIIYVRRSCSLLVVDRGNAALRQISLNKEDCDNQ

Query:  YGSVSTSALFLSFLTDEPFADVAMFIGALFVGYVTYMLQHGFGLSFFTLM--SEHSDTETKEPSKGKHTQLVDHMKEETWWVSFGRAIMELSKQLIETLP
        YGSVSTS             DVAMFIGAL +GY TYMLQHGF LSFF+ M  SEH +TETKE SKGK   L + +K+ETWW SFG+ + EL KQ IE LP
Subjt:  YGSVSTSALFLSFLTDEPFADVAMFIGALFVGYVTYMLQHGFGLSFFTLM--SEHSDTETKEPSKGKHTQLVDHMKEETWWVSFGRAIMELSKQLIETLP

Query:  DNLKSFVPSYFRSDENHKKGLTPLKDALKMPKDEIKNNVSLKKRIITPTSETKHASFSNAHAKSDEMKPPKMKSSSSKTPPSLLNKHSHSGQQEYAEFYG
         N K F+P YFRS++  +KGLTPLKDALKMP+DEIK NVSLK+R +TP SETKH S      K DE+KPPKMKSSS K P SLLNKH HSGQ EYAEFYG
Subjt:  DNLKSFVPSYFRSDENHKKGLTPLKDALKMPKDEIKNNVSLKKRIITPTSETKHASFSNAHAKSDEMKPPKMKSSSSKTPPSLLNKHSHSGQQEYAEFYG

Query:  SSLVSSSHSRSKGQKDRSRHRQKEKGSELLAAVLSAEPKPAEMKTDYSDAKFGQFNFRNKY
        + +VSSSHSRSKGQKDRSRHRQKEKGS++L +VL AE KPAEM+TDY++ KF Q+N RNKY
Subjt:  SSLVSSSHSRSKGQKDRSRHRQKEKGSELLAAVLSAEPKPAEMKTDYSDAKFGQFNFRNKY

XP_038900081.1 uncharacterized protein LOC120087236 isoform X2 [Benincasa hispida]3.12e-24678.21Show/hide
Query:  SGPLARHLSSLLKWTGSSSKTPQPDGNAIQFESGYLVETIVEGNEIGMVPHKIRVSEDGELFAVDLLSSNVVKVSPPLSRYSRARLVAGSFQGYKGHADG
        SGPLARHLSSLLKWTGSSSKTPQPDGNA+QFESGYLVETIVEGNEIGMVP+KIRVSEDGELFAVD ++SNVVKVSPPLSRYSRARLVAGSFQGYKGH DG
Subjt:  SGPLARHLSSLLKWTGSSSKTPQPDGNAIQFESGYLVETIVEGNEIGMVPHKIRVSEDGELFAVDLLSSNVVKVSPPLSRYSRARLVAGSFQGYKGHADG

Query:  KPSDARFNHPKGVALDDNGNVYVADTSNLAIRKIVDSGVTTIAGGKTNVPGYSDGPGEEAKFSNDFDIIYVRRSCSLLVVDRGNAALRQISLNKEDCDNQ
        KPSDARFN PKGV +DD GNVYVADT NL IRKIVD+GVTTIAGGKTN+PGYSDGPGEEAKFSNDFDIIYVRR+CSLLVVDRGNAALRQISLNKEDCD Q
Subjt:  KPSDARFNHPKGVALDDNGNVYVADTSNLAIRKIVDSGVTTIAGGKTNVPGYSDGPGEEAKFSNDFDIIYVRRSCSLLVVDRGNAALRQISLNKEDCDNQ

Query:  YGSVSTSALFLSFLTDEPFADVAMFIGALFVGYVTYMLQHGFGLSFFTLMSEHSDTETKEPSKGKHTQLVDHMKEETWWVSFGRAIMELSKQLIETLPDN
        YGSVSTS             DVAMFIGAL +GY TYMLQHGF LSFF+ MSEH +TETKE SKGK   L + +K+ETWW SFG+ + EL KQ IE LP N
Subjt:  YGSVSTSALFLSFLTDEPFADVAMFIGALFVGYVTYMLQHGFGLSFFTLMSEHSDTETKEPSKGKHTQLVDHMKEETWWVSFGRAIMELSKQLIETLPDN

Query:  LKSFVPSYFRSDENHKKGLTPLKDALKMPKDEIKNNVSLKKRIITPTSETKHASFSNAHAKSDEMKPPKMKSSSSKTPPSLLNKHSHSGQQEYAEFYGSS
         K F+P YFRS++  +KGLTPLKDALKMP+DEIK NVSLK+R +TP SETKH S      K DE+KPPKMKSSS K P SLLNKH HSGQ EYAEFYG+ 
Subjt:  LKSFVPSYFRSDENHKKGLTPLKDALKMPKDEIKNNVSLKKRIITPTSETKHASFSNAHAKSDEMKPPKMKSSSSKTPPSLLNKHSHSGQQEYAEFYGSS

Query:  LVSSSHSRSKGQKDRSRHRQKEKGSELLAAVLSAEPKPAEMKTDYSDAKFGQFNFRNKY
        +VSSSHSRSKGQKDRSRHRQKEKGS++L +VL AE KPAEM+TDY++ KF Q+N RNKY
Subjt:  LVSSSHSRSKGQKDRSRHRQKEKGSELLAAVLSAEPKPAEMKTDYSDAKFGQFNFRNKY

TrEMBL top hitse value%identityAlignment
A0A0A0L212 Uncharacterized protein1.39e-24178.31Show/hide
Query:  SGPLARHLSSLLKWTGSSSKTPQPDGNAIQFESGYLVETIVEGNEIGMVPHKIRVSEDGELFAVDLLSSNVVKVSPPLSRYSRARLVAGSFQGYKGHADG
        SGPLARHLSSLLKWTGSS KTPQPDGNAIQFESGYLVETIVEGNEIGMVP+KIRVSEDGELFAVD ++SNVVKVSPPLSRYSRARLVAGSFQGYKGH DG
Subjt:  SGPLARHLSSLLKWTGSSSKTPQPDGNAIQFESGYLVETIVEGNEIGMVPHKIRVSEDGELFAVDLLSSNVVKVSPPLSRYSRARLVAGSFQGYKGHADG

Query:  KPSDARFNHPKGVALDDNGNVYVADTSNLAIRKIVDSGVTTIAGGKTNVPGYSDGPGEEAKFSNDFDIIYVRRSCSLLVVDRGNAALRQISLNKEDCDNQ
        KPSDARFN PKG+ +DD GNVYVADT NLAIRKIVD+GVTTIAGGKTNVPGYSDGPGEEAKFSNDFD+IYVRR+CSLLVVDRGNAALRQISLNKEDCD Q
Subjt:  KPSDARFNHPKGVALDDNGNVYVADTSNLAIRKIVDSGVTTIAGGKTNVPGYSDGPGEEAKFSNDFDIIYVRRSCSLLVVDRGNAALRQISLNKEDCDNQ

Query:  YGSVSTSALFLSFLTDEPFADVAMFIGALFVGYVTYMLQHGFGLSFFTLM--SEHSDTETKEPSKGKHTQLVDHMKEETWWVSFGRAIMELSKQLIETLP
        YGSVSTS             DVAMFIGAL +GY TYMLQHGF LSFFT M  SEH +TETKE SKGK T+LV  +KEETWW SFG+ + EL KQ IE LP
Subjt:  YGSVSTSALFLSFLTDEPFADVAMFIGALFVGYVTYMLQHGFGLSFFTLM--SEHSDTETKEPSKGKHTQLVDHMKEETWWVSFGRAIMELSKQLIETLP

Query:  DNLKSFVPSYFRSDENHKKGLTPLKDALKMPKDEIKNNVSLKKRIITPTSETKHASFSNAHAKSDEMKPPKMKSSSSKTPPSLLNKHSHSGQQEYAEFYG
         NLKSF+  YFRS++N +KGLTPLKDALKMP+DEIK NVSLK++ +TP SETKHAS      K DE+KPPKMKSS     PSLLNKHSHSGQ EYAEFYG
Subjt:  DNLKSFVPSYFRSDENHKKGLTPLKDALKMPKDEIKNNVSLKKRIITPTSETKHASFSNAHAKSDEMKPPKMKSSSSKTPPSLLNKHSHSGQQEYAEFYG

Query:  SSLVSSSHSRSKGQKDRSRHRQKEKGSELLAAVLSAEPKPAEMKTDYSDAKFGQFNFRNKY
        + +VSSS SRSKGQKDRSRHRQKEKG ++L   L AEPK AEM+TDY++ KF Q+N RNKY
Subjt:  SSLVSSSHSRSKGQKDRSRHRQKEKGSELLAAVLSAEPKPAEMKTDYSDAKFGQFNFRNKY

A0A1S3CJZ4 uncharacterized protein LOC103501822 isoform X11.02e-23677.44Show/hide
Query:  SGPLARHLSSLLKWTGSSSKTPQPDGNAIQFESGYLVETIVEGNEIGMVPHKIRVSEDGELFAVDLLSSNVVKVSPPLSRYSRARLVAGSFQGYKGHADG
        SG LARHLSSLLKWTGSS KTPQPDGNAIQFESGYLVETIVEGNEIGMVP+KIRVSEDGELFAVD ++SNVVKVSPPLSRYSRARLVAGSFQGYKGH DG
Subjt:  SGPLARHLSSLLKWTGSSSKTPQPDGNAIQFESGYLVETIVEGNEIGMVPHKIRVSEDGELFAVDLLSSNVVKVSPPLSRYSRARLVAGSFQGYKGHADG

Query:  KPSDARFNHPKGVALDDNGNVYVADTSNLAIRKIVDSGVTTIAGGKTNVPGYSDGPGEEAKFSNDFDIIYVRRSCSLLVVDRGNAALRQISLNKEDCDNQ
        KPSDARFN PKGV +DD GNVYVADT NLAIRKIVD+GVTTIAGGKTNVPGYSDGPGEEAKFSNDFD+IYVRR+CSLLVVDRGNAALRQISLNKEDCD Q
Subjt:  KPSDARFNHPKGVALDDNGNVYVADTSNLAIRKIVDSGVTTIAGGKTNVPGYSDGPGEEAKFSNDFDIIYVRRSCSLLVVDRGNAALRQISLNKEDCDNQ

Query:  YGSVSTSALFLSFLTDEPFADVAMFIGALFVGYVTYMLQHGFGLSFFTLM--SEHSDTETKEPSKGKHTQLVDHMKEETWWVSFGRAIMELSKQLIETLP
        YGSVSTS             DVAMFIGAL +GY TYMLQHGF LSFF+ M  SEH +TETKE SKGK T+LV  +KEETWW SFG+ + EL KQ IE LP
Subjt:  YGSVSTSALFLSFLTDEPFADVAMFIGALFVGYVTYMLQHGFGLSFFTLM--SEHSDTETKEPSKGKHTQLVDHMKEETWWVSFGRAIMELSKQLIETLP

Query:  DNLKSFVPSYFRSDENHKKGLTPLKDALKMPKDEIKNNVSLKKRIITPTSETKHASFSNAHAKSDEMKPPKMKSSSSKTPPSLLNKHSHSGQQEYAEFYG
         NLKSF+  YFRS++N +KGLTPLKDALKMP+DEIK NVSLK++   P SETKH S      K DE+KPPKMKSS     PSLLNKHSHSGQ EYAEFY 
Subjt:  DNLKSFVPSYFRSDENHKKGLTPLKDALKMPKDEIKNNVSLKKRIITPTSETKHASFSNAHAKSDEMKPPKMKSSSSKTPPSLLNKHSHSGQQEYAEFYG

Query:  SSLVSSSHSRSKGQKDRSRHRQKEKGSELLAAVLSAEPKPAEMKTDYSDAKFGQFNFRNKY
        + +VSSS SRSKGQKDRSRHRQKEKGS++L   L AEPK AEM+TDY++ K+ Q+N RNKY
Subjt:  SSLVSSSHSRSKGQKDRSRHRQKEKGSELLAAVLSAEPKPAEMKTDYSDAKFGQFNFRNKY

A0A1S3CKG1 uncharacterized protein LOC103501822 isoform X21.00e-23877.78Show/hide
Query:  SGPLARHLSSLLKWTGSSSKTPQPDGNAIQFESGYLVETIVEGNEIGMVPHKIRVSEDGELFAVDLLSSNVVKVSPPLSRYSRARLVAGSFQGYKGHADG
        SG LARHLSSLLKWTGSS KTPQPDGNAIQFESGYLVETIVEGNEIGMVP+KIRVSEDGELFAVD ++SNVVKVSPPLSRYSRARLVAGSFQGYKGH DG
Subjt:  SGPLARHLSSLLKWTGSSSKTPQPDGNAIQFESGYLVETIVEGNEIGMVPHKIRVSEDGELFAVDLLSSNVVKVSPPLSRYSRARLVAGSFQGYKGHADG

Query:  KPSDARFNHPKGVALDDNGNVYVADTSNLAIRKIVDSGVTTIAGGKTNVPGYSDGPGEEAKFSNDFDIIYVRRSCSLLVVDRGNAALRQISLNKEDCDNQ
        KPSDARFN PKGV +DD GNVYVADT NLAIRKIVD+GVTTIAGGKTNVPGYSDGPGEEAKFSNDFD+IYVRR+CSLLVVDRGNAALRQISLNKEDCD Q
Subjt:  KPSDARFNHPKGVALDDNGNVYVADTSNLAIRKIVDSGVTTIAGGKTNVPGYSDGPGEEAKFSNDFDIIYVRRSCSLLVVDRGNAALRQISLNKEDCDNQ

Query:  YGSVSTSALFLSFLTDEPFADVAMFIGALFVGYVTYMLQHGFGLSFFTLMSEHSDTETKEPSKGKHTQLVDHMKEETWWVSFGRAIMELSKQLIETLPDN
        YGSVSTS             DVAMFIGAL +GY TYMLQHGF LSFF+ MSEH +TETKE SKGK T+LV  +KEETWW SFG+ + EL KQ IE LP N
Subjt:  YGSVSTSALFLSFLTDEPFADVAMFIGALFVGYVTYMLQHGFGLSFFTLMSEHSDTETKEPSKGKHTQLVDHMKEETWWVSFGRAIMELSKQLIETLPDN

Query:  LKSFVPSYFRSDENHKKGLTPLKDALKMPKDEIKNNVSLKKRIITPTSETKHASFSNAHAKSDEMKPPKMKSSSSKTPPSLLNKHSHSGQQEYAEFYGSS
        LKSF+  YFRS++N +KGLTPLKDALKMP+DEIK NVSLK++   P SETKH S      K DE+KPPKMKSS     PSLLNKHSHSGQ EYAEFY + 
Subjt:  LKSFVPSYFRSDENHKKGLTPLKDALKMPKDEIKNNVSLKKRIITPTSETKHASFSNAHAKSDEMKPPKMKSSSSKTPPSLLNKHSHSGQQEYAEFYGSS

Query:  LVSSSHSRSKGQKDRSRHRQKEKGSELLAAVLSAEPKPAEMKTDYSDAKFGQFNFRNKY
        +VSSS SRSKGQKDRSRHRQKEKGS++L   L AEPK AEM+TDY++ K+ Q+N RNKY
Subjt:  LVSSSHSRSKGQKDRSRHRQKEKGSELLAAVLSAEPKPAEMKTDYSDAKFGQFNFRNKY

A0A5A7VG70 NHL domain-containing protein1.00e-23877.78Show/hide
Query:  SGPLARHLSSLLKWTGSSSKTPQPDGNAIQFESGYLVETIVEGNEIGMVPHKIRVSEDGELFAVDLLSSNVVKVSPPLSRYSRARLVAGSFQGYKGHADG
        SG LARHLSSLLKWTGSS KTPQPDGNAIQFESGYLVETIVEGNEIGMVP+KIRVSEDGELFAVD ++SNVVKVSPPLSRYSRARLVAGSFQGYKGH DG
Subjt:  SGPLARHLSSLLKWTGSSSKTPQPDGNAIQFESGYLVETIVEGNEIGMVPHKIRVSEDGELFAVDLLSSNVVKVSPPLSRYSRARLVAGSFQGYKGHADG

Query:  KPSDARFNHPKGVALDDNGNVYVADTSNLAIRKIVDSGVTTIAGGKTNVPGYSDGPGEEAKFSNDFDIIYVRRSCSLLVVDRGNAALRQISLNKEDCDNQ
        KPSDARFN PKGV +DD GNVYVADT NLAIRKIVD+GVTTIAGGKTNVPGYSDGPGEEAKFSNDFD+IYVRR+CSLLVVDRGNAALRQISLNKEDCD Q
Subjt:  KPSDARFNHPKGVALDDNGNVYVADTSNLAIRKIVDSGVTTIAGGKTNVPGYSDGPGEEAKFSNDFDIIYVRRSCSLLVVDRGNAALRQISLNKEDCDNQ

Query:  YGSVSTSALFLSFLTDEPFADVAMFIGALFVGYVTYMLQHGFGLSFFTLMSEHSDTETKEPSKGKHTQLVDHMKEETWWVSFGRAIMELSKQLIETLPDN
        YGSVSTS             DVAMFIGAL +GY TYMLQHGF LSFF+ MSEH +TETKE SKGK T+LV  +KEETWW SFG+ + EL KQ IE LP N
Subjt:  YGSVSTSALFLSFLTDEPFADVAMFIGALFVGYVTYMLQHGFGLSFFTLMSEHSDTETKEPSKGKHTQLVDHMKEETWWVSFGRAIMELSKQLIETLPDN

Query:  LKSFVPSYFRSDENHKKGLTPLKDALKMPKDEIKNNVSLKKRIITPTSETKHASFSNAHAKSDEMKPPKMKSSSSKTPPSLLNKHSHSGQQEYAEFYGSS
        LKSF+  YFRS++N +KGLTPLKDALKMP+DEIK NVSLK++   P SETKH S      K DE+KPPKMKSS     PSLLNKHSHSGQ EYAEFY + 
Subjt:  LKSFVPSYFRSDENHKKGLTPLKDALKMPKDEIKNNVSLKKRIITPTSETKHASFSNAHAKSDEMKPPKMKSSSSKTPPSLLNKHSHSGQQEYAEFYGSS

Query:  LVSSSHSRSKGQKDRSRHRQKEKGSELLAAVLSAEPKPAEMKTDYSDAKFGQFNFRNKY
        +VSSS SRSKGQKDRSRHRQKEKGS++L   L AEPK AEM+TDY++ K+ Q+N RNKY
Subjt:  LVSSSHSRSKGQKDRSRHRQKEKGSELLAAVLSAEPKPAEMKTDYSDAKFGQFNFRNKY

A0A6J1CM34 uncharacterized protein LOC1110128631.42e-31196.53Show/hide
Query:  SGPLARHLSSLLKWTGSSSKTPQPDGNAIQFESGYLVETIVEGNEIGMVPHKIRVSEDGELFAVDLLSSNVVKVSPPLSRYSRARLVAGSFQGYKGHADG
        SGPLARHLSSLLKWTGSSSKTPQPDGNAIQFESGYLVETIVEGNEIGMVPHKIRVSEDGELFAVDLLSSNVVKVSPPLSRYSRARLVAGSFQGYKGH DG
Subjt:  SGPLARHLSSLLKWTGSSSKTPQPDGNAIQFESGYLVETIVEGNEIGMVPHKIRVSEDGELFAVDLLSSNVVKVSPPLSRYSRARLVAGSFQGYKGHADG

Query:  KPSDARFNHPKGVALDDNGNVYVADTSNLAIRKIVDSGVTTIAGGKTNVPGYSDGPGEEAKFSNDFDIIYVRRSCSLLVVDRGNAALRQISLNKEDCDNQ
        KPSDARFNHPKGVALDDNGNVYVADTSNLAIRKIVDSGVTTIAGGKTNVPGYSDGPGEEAKFSNDFDIIYVRRSCSLLVVDRGNAALRQISLNKEDCDNQ
Subjt:  KPSDARFNHPKGVALDDNGNVYVADTSNLAIRKIVDSGVTTIAGGKTNVPGYSDGPGEEAKFSNDFDIIYVRRSCSLLVVDRGNAALRQISLNKEDCDNQ

Query:  YGSVSTSALFLSFLTDEPFADVAMFIGALFVGYVTYMLQHGFGLSFFTLM--SEHSDTETKEPSKGKHTQLVDHMKEETWWVSFGRAIMELSKQLIETLP
        YGSVSTS             DVAMFIGALFVGYVTYMLQHGFGLSFFTLM  SEHSDTETKEPSKGKHTQLVDHMKEETWWVSFGRAIMELSKQLIETLP
Subjt:  YGSVSTSALFLSFLTDEPFADVAMFIGALFVGYVTYMLQHGFGLSFFTLM--SEHSDTETKEPSKGKHTQLVDHMKEETWWVSFGRAIMELSKQLIETLP

Query:  DNLKSFVPSYFRSDENHKKGLTPLKDALKMPKDEIKNNVSLKKRIITPTSETKHASFSNAHAKSDEMKPPKMKSSSSKTPPSLLNKHSHSGQQEYAEFYG
        DNLKSFVPSYFRSDENHKKGLTPLKDALKMPKDEIKNNVSLKKRIITPTSETKHASFSNAHAKSDEMKPPKMKSSSSKTPPSLLNKHSHSGQQEYAEFYG
Subjt:  DNLKSFVPSYFRSDENHKKGLTPLKDALKMPKDEIKNNVSLKKRIITPTSETKHASFSNAHAKSDEMKPPKMKSSSSKTPPSLLNKHSHSGQQEYAEFYG

Query:  SSLVSSSHSRSKGQKDRSRHRQKEKGSELLAAVLSAEPKPAEMKTDYSDAKFGQFNFRNKY
        SSLVSSSHSRSKGQKDRSRHRQKEKGSELLAAVLSAEPKPAEMKTDYSDAKFGQFNFRNKY
Subjt:  SSLVSSSHSRSKGQKDRSRHRQKEKGSELLAAVLSAEPKPAEMKTDYSDAKFGQFNFRNKY

SwissProt top hitse value%identityAlignment
Q5ZI67 NHL repeat-containing protein 27.2e-0634.13Show/hide
Query:  MVPHKIRVSEDGELFAVDLLSSNVVKVSPPLSRYSRARLVAGSFQGYKGHADGKPSDARFNHPKGVALDDNGNVYVADTSNLAIRKI--VDSGVTTIAG-
        + P K+ V + GE   +     + + V+  L        + G   G K   DG+ S+A FN P+GVA+ +N  +YVADT N  IRKI      VTT+AG 
Subjt:  MVPHKIRVSEDGELFAVDLLSSNVVKVSPPLSRYSRARLVAGSFQGYKGHADGKPSDARFNHPKGVALDDNGNVYVADTSNLAIRKI--VDSGVTTIAG-

Query:  GKTNVPGYSDGPGEEAKFSNDFDIIY
        G   V       GEE   S+ +D+++
Subjt:  GKTNVPGYSDGPGEEAKFSNDFDIIY

Q8VZ10 Protein SUPPRESSOR OF QUENCHING 1, chloroplastic4.7e-0532.67Show/hide
Query:  GHADGKPSDARFNHPKGVALDDNGNVYVADTSNLAIRKI--VDSGVTTIAGGKTNVPGYSDGPGEEAKFSNDFDIIYVRRSCSLLVVDRGNAALRQISLN
        G  DG  ++    HP GV   ++G +Y+ D+ N  I+K+  V   V T+AG  T   G+ DG  + A+ S    +  +  +  L V D  N+ +R I LN
Subjt:  GHADGKPSDARFNHPKGVALDDNGNVYVADTSNLAIRKI--VDSGVTTIAGGKTNVPGYSDGPGEEAKFSNDFDIIYVRRSCSLLVVDRGNAALRQISLN

Query:  K
        K
Subjt:  K

Arabidopsis top hitse value%identityAlignment
AT1G23880.1 NHL domain-containing protein1.8e-6046.39Show/hide
Query:  LARHLSSLLKWTGS-----SSKTPQPDGNAIQFESGYLVETIVEGNEIGMVPHKIRVSEDGELFAVDLLSSNVVKVSPPLSRYSRARLVAGSFQGYKGHA
        ++ H +SLLKW  S     ++KT  P  + ++FE+GY VET+++G+++G+ P+ I+V  +GEL  +D  +SN+ ++S  LS YSR RLV GS +GY GH 
Subjt:  LARHLSSLLKWTGS-----SSKTPQPDGNAIQFESGYLVETIVEGNEIGMVPHKIRVSEDGELFAVDLLSSNVVKVSPPLSRYSRARLVAGSFQGYKGHA

Query:  DGKPSDARFNHPKGVALDDNGNVYVADTSNLAIRKIVDSGVTTIAGGK-TNVPGYSDGPGEEAKFSNDFDIIYVRRSCSLLVVDRGNAALRQISLNKEDC
        DG+  DAR N+PKG+ +DD GN+YVADT N AIRKI ++GVTTIAGGK     G+ DGP E+AKFSNDFD++Y+  SCSLLV+DRGN A+R+I L+ +DC
Subjt:  DGKPSDARFNHPKGVALDDNGNVYVADTSNLAIRKIVDSGVTTIAGGK-TNVPGYSDGPGEEAKFSNDFDIIYVRRSCSLLVVDRGNAALRQISLNKEDC

Query:  DNQYGSVSTSALFLSFLTDEPFADVAMFIGALFVGYVTYMLQHGFGLSFFTLMSEHSDTETKE
         +QYGS                  +A+ + A+F GY+  +LQ        +++S H+D E  E
Subjt:  DNQYGSVSTSALFLSFLTDEPFADVAMFIGALFVGYVTYMLQHGFGLSFFTLMSEHSDTETKE

AT1G70280.1 NHL domain-containing protein2.0e-5944.68Show/hide
Query:  IQFESGYLVETIVEGNEIGMVPHKIRVSEDGELFAVDLLSSNVVKVSPPLSRYSRARLVAGSFQGYKGHADGKPSDARFNHPKGVALDDNGNVYVADTSN
        ++FE+GY VET+ +G+++G+ P+ I V  +GEL  +D  +SN+ K+S  LS YSR RLV GS +GY GH DG+  DA+ NHPKG+ +DD GN+YVADT N
Subjt:  IQFESGYLVETIVEGNEIGMVPHKIRVSEDGELFAVDLLSSNVVKVSPPLSRYSRARLVAGSFQGYKGHADGKPSDARFNHPKGVALDDNGNVYVADTSN

Query:  LAIRKIVDSGVTTIAGGKT-NVPGYSDGPGEEAKFSNDFDIIYVRRSCSLLVVDRGNAALRQISLNKEDCDNQYGSVSTSALFLSFLTDEPFADVAMFIG
         AIRKI + GVTTIAGGKT    G+ DGP E+AKFSNDFD++YV  SCSLLV+DRGN A+R+I L+ +DC  QYGS                  +A+ + 
Subjt:  LAIRKIVDSGVTTIAGGKT-NVPGYSDGPGEEAKFSNDFDIIYVRRSCSLLVVDRGNAALRQISLNKEDCDNQYGSVSTSALFLSFLTDEPFADVAMFIG

Query:  ALFVGYVTYMLQHGFGLSFFTLMSEHSDTE------TKEPSKGKHTQLVD------HMKEETWWVSFGRAIMELSKQLIETL
        A F GY+  +LQ   G    +++S H+D E       ++P K     L+         +EET+ VS G+ +    + ++E L
Subjt:  ALFVGYVTYMLQHGFGLSFFTLMSEHSDTE------TKEPSKGKHTQLVD------HMKEETWWVSFGRAIMELSKQLIETL

AT1G70280.2 NHL domain-containing protein5.0e-6343.45Show/hide
Query:  SGPLARHLSSLLKWTGS---SSKTPQPDGNAIQFESGYLVETIVEGNEIGMVPHKIRVSEDGELFAVDLLSSNVVKVSPPLSRYSRARLVAGSFQGYKGH
        +G ++ H SSL+KW  S   ++KT     + ++FE+GY VET+ +G+++G+ P+ I V  +GEL  +D  +SN+ K+S  LS YSR RLV GS +GY GH
Subjt:  SGPLARHLSSLLKWTGS---SSKTPQPDGNAIQFESGYLVETIVEGNEIGMVPHKIRVSEDGELFAVDLLSSNVVKVSPPLSRYSRARLVAGSFQGYKGH

Query:  ADGKPSDARFNHPKGVALDDNGNVYVADTSNLAIRKIVDSGVTTIAGGKT-NVPGYSDGPGEEAKFSNDFDIIYVRRSCSLLVVDRGNAALRQISLNKED
         DG+  DA+ NHPKG+ +DD GN+YVADT N AIRKI + GVTTIAGGKT    G+ DGP E+AKFSNDFD++YV  SCSLLV+DRGN A+R+I L+ +D
Subjt:  ADGKPSDARFNHPKGVALDDNGNVYVADTSNLAIRKIVDSGVTTIAGGKT-NVPGYSDGPGEEAKFSNDFDIIYVRRSCSLLVVDRGNAALRQISLNKED

Query:  CDNQYGSVSTSALFLSFLTDEPFADVAMFIGALFVGYVTYMLQHGFGLSFFTLMSEHSDTE------TKEPSKGKHTQLVD------HMKEETWWVSFGR
        C  QYGS                  +A+ + A F GY+  +LQ   G    +++S H+D E       ++P K     L+         +EET+ VS G+
Subjt:  CDNQYGSVSTSALFLSFLTDEPFADVAMFIGALFVGYVTYMLQHGFGLSFFTLMSEHSDTE------TKEPSKGKHTQLVD------HMKEETWWVSFGR

Query:  AIMELSKQLIETL
         +    + ++E L
Subjt:  AIMELSKQLIETL

AT3G14860.1 NHL domain-containing protein2.2e-11451.49Show/hide
Query:  SGPLARHLSSLLKW-TGSSSKTPQPDGNAIQFESGYLVETIVEGNEIGMVPHKIRVSEDGELFAVDLLSSNVVKVSPPLSRYSRARLVAGSFQGYKGHAD
        SG L +H+SS+LKW TGSSSK  Q D N +QFE+GYLVET+VEGN+IG+VP+KIRVS+DGEL+AVD L+SN++K++PPLS+YSR RLVAGSFQG  GHAD
Subjt:  SGPLARHLSSLLKW-TGSSSKTPQPDGNAIQFESGYLVETIVEGNEIGMVPHKIRVSEDGELFAVDLLSSNVVKVSPPLSRYSRARLVAGSFQGYKGHAD

Query:  GKPSDARFNHPKGVALDDNGNVYVADTSNLAIRKIVDSGVTTIAGGKTNVPGYSDGPGEEAKFSNDFDIIYVRRSCSLLVVDRGNAALRQISLNKEDCDN
        GKPS+ARFNHP+GV +DD GNVYVADT NLAIRKI DSGVTTIAGGK+N+ GY DGP E+AKFSNDFD++YVR +CSLLV+DRGNAALRQISL++EDCD 
Subjt:  GKPSDARFNHPKGVALDDNGNVYVADTSNLAIRKIVDSGVTTIAGGKTNVPGYSDGPGEEAKFSNDFDIIYVRRSCSLLVVDRGNAALRQISLNKEDCDN

Query:  QYGSVSTSALFLSFLTDEPFADVAMFIGALFVGYVTYMLQHGFGLSFFT-LMSEHSDTETKEPSKGKHTQLVDH---MKEETWWVSFGRAIMELSKQLIE
        Q      S++ L+        D+ + IGA+ +GY T MLQ GFG SFF+  +   +  E + P K K ++ V      KEE  W SFG+ + +L K  +E
Subjt:  QYGSVSTSALFLSFLTDEPFADVAMFIGALFVGYVTYMLQHGFGLSFFT-LMSEHSDTETKEPSKGKHTQLVDH---MKEETWWVSFGRAIMELSKQLIE

Query:  TLPDNLKSFVPSYFRSDENHKKGLTPLKDALKMPKDEIKNNVSLKKRIITPTSETKHASFSNAHAKSDEMKPPKMKSSSSKTPPSLLNKHSH---SGQQE
         +  +L   VP+ F+++ N    L PLKD L MP+DE +     +     P SE++HA    A     E K PK++SSS    P+L +   H   S +Q+
Subjt:  TLPDNLKSFVPSYFRSDENHKKGLTPLKDALKMPKDEIKNNVSLKKRIITPTSETKHASFSNAHAKSDEMKPPKMKSSSSKTPPSLLNKHSH---SGQQE

Query:  YAEFYGSSLVSSSHSRSKGQKDRSRHRQKEKGSELLAAVLSAEPKPAEMKT----DYSD-AKFGQFNFRN
        YA+FY S  V    ++ K  K+RSR R ++K +E        EPKP    T    +YS+ +KF  +N R+
Subjt:  YAEFYGSSLVSSSHSRSKGQKDRSRHRQKEKGSELLAAVLSAEPKPAEMKT----DYSD-AKFGQFNFRN

AT3G14860.2 NHL domain-containing protein7.4e-11551.8Show/hide
Query:  SGPLARHLSSLLKW-TGSSSKTPQPDGNAIQFESGYLVETIVEGNEIGMVPHKIRVSEDGELFAVDLLSSNVVKVSPPLSRYSRARLVAGSFQGYKGHAD
        SG L +H+SS+LKW TGSSSK  Q D N +QFE+GYLVET+VEGN+IG+VP+KIRVS+DGEL+AVD L+SN++K++PPLS+YSR RLVAGSFQG  GHAD
Subjt:  SGPLARHLSSLLKW-TGSSSKTPQPDGNAIQFESGYLVETIVEGNEIGMVPHKIRVSEDGELFAVDLLSSNVVKVSPPLSRYSRARLVAGSFQGYKGHAD

Query:  GKPSDARFNHPKGVALDDNGNVYVADTSNLAIRKIVDSGVTTIAGGKTNVPGYSDGPGEEAKFSNDFDIIYVRRSCSLLVVDRGNAALRQISLNKEDCDN
        GKPS+ARFNHP+GV +DD GNVYVADT NLAIRKI DSGVTTIAGGK+N+ GY DGP E+AKFSNDFD++YVR +CSLLV+DRGNAALRQISL++EDCD 
Subjt:  GKPSDARFNHPKGVALDDNGNVYVADTSNLAIRKIVDSGVTTIAGGKTNVPGYSDGPGEEAKFSNDFDIIYVRRSCSLLVVDRGNAALRQISLNKEDCDN

Query:  QYGSVSTSALFLSFLTDEPFADVAMFIGALFVGYVTYMLQHGFGLSFFTLMSEHSDT--ETKEPSKGKHTQLVDH---MKEETWWVSFGRAIMELSKQLI
        Q      S++ L+        D+ + IGA+ +GY T MLQ GFG SFF+     S+T  E + P K K ++ V      KEE  W SFG+ + +L K  +
Subjt:  QYGSVSTSALFLSFLTDEPFADVAMFIGALFVGYVTYMLQHGFGLSFFTLMSEHSDT--ETKEPSKGKHTQLVDH---MKEETWWVSFGRAIMELSKQLI

Query:  ETLPDNLKSFVPSYFRSDENHKKGLTPLKDALKMPKDEIKNNVSLKKRIITPTSETKHASFSNAHAKSDEMKPPKMKSSSSKTPPSLLNKHSH---SGQQ
        E +  +L   VP+ F+++ N    L PLKD L MP+DE +     +     P SE++HA    A     E K PK++SSS    P+L +   H   S +Q
Subjt:  ETLPDNLKSFVPSYFRSDENHKKGLTPLKDALKMPKDEIKNNVSLKKRIITPTSETKHASFSNAHAKSDEMKPPKMKSSSSKTPPSLLNKHSH---SGQQ

Query:  EYAEFYGSSLVSSSHSRSKGQKDRSRHRQKEKGSELLAAVLSAEPKPAEMKT----DYSD-AKFGQFNFRN
        +YA+FY S  V    ++ K  K+RSR R ++K +E        EPKP    T    +YS+ +KF  +N R+
Subjt:  EYAEFYGSSLVSSSHSRSKGQKDRSRHRQKEKGSELLAAVLSAEPKPAEMKT----DYSD-AKFGQFNFRN


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
TCAGGACCATTGGCAAGGCACTTGTCTTCTCTTCTTAAATGGACTGGGTCTTCTTCCAAAACTCCTCAACCAGATGGGAATGCTATTCAGTTTGAGAGTGGTTACTTAGT
TGAGACTATTGTAGAGGGAAATGAAATTGGAATGGTTCCTCATAAGATACGTGTCTCGGAGGACGGTGAACTCTTCGCTGTTGATTTGCTTAGTAGCAATGTTGTCAAGG
TTTCTCCGCCATTATCTCGATATAGTAGAGCAAGATTGGTTGCCGGGTCTTTCCAGGGCTACAAAGGGCATGCTGATGGGAAACCAAGTGATGCTCGTTTCAATCATCCG
AAAGGCGTAGCTTTGGATGATAACGGAAACGTGTATGTTGCTGATACCTCAAACCTTGCCATCAGAAAGATTGTTGATTCTGGTGTGACAACAATTGCAGGAGGCAAGAC
TAATGTTCCAGGCTATAGTGATGGGCCTGGGGAGGAAGCAAAGTTTTCGAACGATTTTGATATCATATATGTCCGGCGTAGCTGTTCGTTGTTGGTCGTTGATAGAGGAA
ACGCTGCACTCCGTCAAATATCTCTTAACAAGGAGGATTGTGATAATCAATATGGCTCAGTTTCTACCTCAGCACTGTTCCTTTCCTTCCTCACCGACGAACCTTTTGCG
GATGTCGCAATGTTCATCGGTGCTCTTTTCGTCGGATACGTTACGTATATGCTTCAACATGGATTTGGGCTGTCATTCTTCACTCTTATGAGTGAACACTCGGATACCGA
AACTAAAGAACCGAGCAAGGGAAAACACACTCAGCTTGTAGACCACATGAAAGAGGAAACATGGTGGGTGTCATTTGGACGGGCCATTATGGAACTTTCCAAGCAACTAA
TCGAAACATTGCCCGACAACCTCAAATCCTTTGTTCCCTCCTACTTTAGATCAGATGAGAACCACAAGAAAGGTCTTACCCCTCTGAAAGATGCTCTCAAGATGCCAAAA
GATGAAATAAAAAACAATGTGTCCTTGAAAAAGAGAATTATCACTCCTACCTCCGAAACCAAGCATGCCTCTTTTTCAAATGCTCATGCCAAATCTGATGAAATGAAGCC
ACCAAAAATGAAGTCAAGCAGTTCCAAGACCCCTCCCTCTCTCTTAAACAAGCACAGCCATTCAGGACAACAAGAGTACGCCGAGTTTTACGGGTCCAGCTTGGTCTCTT
CCTCGCATAGTAGGTCCAAAGGCCAGAAGGATAGATCAAGACATCGACAGAAAGAAAAAGGGTCGGAACTTTTAGCGGCCGTACTCAGTGCTGAACCGAAACCTGCAGAG
ATGAAGACGGATTATAGCGATGCGAAGTTTGGCCAGTTCAACTTCAGGAATAAGTAC
mRNA sequenceShow/hide mRNA sequence
TCAGGACCATTGGCAAGGCACTTGTCTTCTCTTCTTAAATGGACTGGGTCTTCTTCCAAAACTCCTCAACCAGATGGGAATGCTATTCAGTTTGAGAGTGGTTACTTAGT
TGAGACTATTGTAGAGGGAAATGAAATTGGAATGGTTCCTCATAAGATACGTGTCTCGGAGGACGGTGAACTCTTCGCTGTTGATTTGCTTAGTAGCAATGTTGTCAAGG
TTTCTCCGCCATTATCTCGATATAGTAGAGCAAGATTGGTTGCCGGGTCTTTCCAGGGCTACAAAGGGCATGCTGATGGGAAACCAAGTGATGCTCGTTTCAATCATCCG
AAAGGCGTAGCTTTGGATGATAACGGAAACGTGTATGTTGCTGATACCTCAAACCTTGCCATCAGAAAGATTGTTGATTCTGGTGTGACAACAATTGCAGGAGGCAAGAC
TAATGTTCCAGGCTATAGTGATGGGCCTGGGGAGGAAGCAAAGTTTTCGAACGATTTTGATATCATATATGTCCGGCGTAGCTGTTCGTTGTTGGTCGTTGATAGAGGAA
ACGCTGCACTCCGTCAAATATCTCTTAACAAGGAGGATTGTGATAATCAATATGGCTCAGTTTCTACCTCAGCACTGTTCCTTTCCTTCCTCACCGACGAACCTTTTGCG
GATGTCGCAATGTTCATCGGTGCTCTTTTCGTCGGATACGTTACGTATATGCTTCAACATGGATTTGGGCTGTCATTCTTCACTCTTATGAGTGAACACTCGGATACCGA
AACTAAAGAACCGAGCAAGGGAAAACACACTCAGCTTGTAGACCACATGAAAGAGGAAACATGGTGGGTGTCATTTGGACGGGCCATTATGGAACTTTCCAAGCAACTAA
TCGAAACATTGCCCGACAACCTCAAATCCTTTGTTCCCTCCTACTTTAGATCAGATGAGAACCACAAGAAAGGTCTTACCCCTCTGAAAGATGCTCTCAAGATGCCAAAA
GATGAAATAAAAAACAATGTGTCCTTGAAAAAGAGAATTATCACTCCTACCTCCGAAACCAAGCATGCCTCTTTTTCAAATGCTCATGCCAAATCTGATGAAATGAAGCC
ACCAAAAATGAAGTCAAGCAGTTCCAAGACCCCTCCCTCTCTCTTAAACAAGCACAGCCATTCAGGACAACAAGAGTACGCCGAGTTTTACGGGTCCAGCTTGGTCTCTT
CCTCGCATAGTAGGTCCAAAGGCCAGAAGGATAGATCAAGACATCGACAGAAAGAAAAAGGGTCGGAACTTTTAGCGGCCGTACTCAGTGCTGAACCGAAACCTGCAGAG
ATGAAGACGGATTATAGCGATGCGAAGTTTGGCCAGTTCAACTTCAGGAATAAGTAC
Protein sequenceShow/hide protein sequence
SGPLARHLSSLLKWTGSSSKTPQPDGNAIQFESGYLVETIVEGNEIGMVPHKIRVSEDGELFAVDLLSSNVVKVSPPLSRYSRARLVAGSFQGYKGHADGKPSDARFNHP
KGVALDDNGNVYVADTSNLAIRKIVDSGVTTIAGGKTNVPGYSDGPGEEAKFSNDFDIIYVRRSCSLLVVDRGNAALRQISLNKEDCDNQYGSVSTSALFLSFLTDEPFA
DVAMFIGALFVGYVTYMLQHGFGLSFFTLMSEHSDTETKEPSKGKHTQLVDHMKEETWWVSFGRAIMELSKQLIETLPDNLKSFVPSYFRSDENHKKGLTPLKDALKMPK
DEIKNNVSLKKRIITPTSETKHASFSNAHAKSDEMKPPKMKSSSSKTPPSLLNKHSHSGQQEYAEFYGSSLVSSSHSRSKGQKDRSRHRQKEKGSELLAAVLSAEPKPAE
MKTDYSDAKFGQFNFRNKY