; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0006641 (gene) of Snake gourd v1 genome

Gene IDTan0006641
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionProtein of Unknown Function (DUF239)
Genome locationLG09:41884196..41895420
RNA-Seq ExpressionTan0006641
SyntenyTan0006641
Gene Ontology termsNA
InterPro domainsIPR004314 - Neprosin
IPR025521 - Neprosin activation peptide


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008443756.1 PREDICTED: uncharacterized protein LOC103487273 [Cucumis melo]4.5e-22887.19Show/hide
Query:  MGTKRRVSISLP----LLFALLFCFVVLQRFTLVCGLNYTYKQVSSLRLDRIRRHLDNINKPPLLTIQSPDGDIIDCVHKRKQPALDHPLLKNHKIQRGP
        MGTK  VS S+     L F L+FCFVV QRFTLVCGLNYTY+++SSLRLDRI+RHLD+INKPPLLTIQSPDGDIIDCVHKRKQPALDHPLLKNHKIQRGP
Subjt:  MGTKRRVSISLP----LLFALLFCFVVLQRFTLVCGLNYTYKQVSSLRLDRIRRHLDNINKPPLLTIQSPDGDIIDCVHKRKQPALDHPLLKNHKIQRGP

Query:  TEWPKMKKVKENNEDGSEKRVGSGAGGAYQTWRVNGTRCPKGSIPVRRSTVNDVLRAKSLFDFGKKKRPILLDRRMDAPDVVSGNGHEHAIAYTGSSEEM
        TEWPK K VKEN E+  E+R GSGA  A+QTWRVNGTRCPKG+IPVRR+TV DVLR+KSLFDFGKK+RPILLDR++DAPDVVSGNGHEHAIAYTGSSEEM
Subjt:  TEWPKMKKVKENNEDGSEKRVGSGAGGAYQTWRVNGTRCPKGSIPVRRSTVNDVLRAKSLFDFGKKKRPILLDRRMDAPDVVSGNGHEHAIAYTGSSEEM

Query:  YGAKATINVWEPSIQVVNEFSLSQIWILSGSFDGSDLNSIEAGWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLCAGFVQTNSKIAIGAAISPISSLT
        YGAKATINVW+PSIQ+VNEFSLSQIWILSGSFDGSDLNSIEAGWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLCAGFVQTNSKIAIGAAISPISS++
Subjt:  YGAKATINVWEPSIQVVNEFSLSQIWILSGSFDGSDLNSIEAGWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLCAGFVQTNSKIAIGAAISPISSLT

Query:  GSQYDITILIWKDPKLGNWWMGFGDNTLVGYWPAELFTHLADHATMVEWGGEVVNSRANGEHTSTEMGSGRFADEGFGKASYFRNLEIVDSDNSLSAIQE
        GSQYDITILIWKDPKLGNWWMGFG+NTLVGYWPAELFTHLADHATMVEWGGEVVNSR NG+HTST+MGSG F D+GF KASYFRNLEIVDSDNSLS +Q+
Subjt:  GSQYDITILIWKDPKLGNWWMGFGDNTLVGYWPAELFTHLADHATMVEWGGEVVNSRANGEHTSTEMGSGRFADEGFGKASYFRNLEIVDSDNSLSAIQE

Query:  ISTLAENTHCYNILSSYNDQWGTHFYYGGPGRNPECQ
        IS +AENT+CYNI+SSYNDQWGTHFYYGGPGRNP+CQ
Subjt:  ISTLAENTHCYNILSSYNDQWGTHFYYGGPGRNPECQ

XP_011660253.1 uncharacterized protein LOC101208882 [Cucumis sativus]2.1e-22586.1Show/hide
Query:  MGTKR-----RVSISLPLLFALLFCFVVLQRFTLVCGLNYTY-KQVSSLRLDRIRRHLDNINKPPLLTIQSPDGDIIDCVHKRKQPALDHPLLKNHKIQR
        MGTK       +SIS  L F L+FCFV+ QRFTLVCGLNYTY K +SSLRLDRI+RHLD+INKPPLLTIQSPDGDIIDCVHKRKQPALDHPLLKNHKIQR
Subjt:  MGTKR-----RVSISLPLLFALLFCFVVLQRFTLVCGLNYTY-KQVSSLRLDRIRRHLDNINKPPLLTIQSPDGDIIDCVHKRKQPALDHPLLKNHKIQR

Query:  GPTEWPKMKKVKENNEDGSEKRVGSGAGGAYQTWRVNGTRCPKGSIPVRRSTVNDVLRAKSLFDFGKKKRPILLDRRMDAPDVVSGNGHEHAIAYTGSSE
        GPTEWPK K  KEN E+ SE+R GSGA  ++QTWRVNGTRCPKG++PVRR+TV DVLR+KSLFDFGKKKRPILLDR++DAPDVVSGNGHEHAIAYTGSSE
Subjt:  GPTEWPKMKKVKENNEDGSEKRVGSGAGGAYQTWRVNGTRCPKGSIPVRRSTVNDVLRAKSLFDFGKKKRPILLDRRMDAPDVVSGNGHEHAIAYTGSSE

Query:  EMYGAKATINVWEPSIQVVNEFSLSQIWILSGSFDGSDLNSIEAGWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLCAGFVQTNSKIAIGAAISPISS
        EMYGAKATINVW+PSI++VNEFSLSQIWILSGSFDGSDLNSIEAGWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLC+GFVQTNSKIAIGAAISPISS
Subjt:  EMYGAKATINVWEPSIQVVNEFSLSQIWILSGSFDGSDLNSIEAGWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLCAGFVQTNSKIAIGAAISPISS

Query:  LTGSQYDITILIWKDPKLGNWWMGFGDNTLVGYWPAELFTHLADHATMVEWGGEVVNSRANGEHTSTEMGSGRFADEGFGKASYFRNLEIVDSDNSLSAI
        + GSQYDITILIWKDPKLGNWWMGFG+NTLVGYWPAELFTHLADHATMVEWGGEVVNSR NG+HTST+MGSG F D+GF KASYFRNLEIVDSDNSLS++
Subjt:  LTGSQYDITILIWKDPKLGNWWMGFGDNTLVGYWPAELFTHLADHATMVEWGGEVVNSRANGEHTSTEMGSGRFADEGFGKASYFRNLEIVDSDNSLSAI

Query:  QEISTLAENTHCYNILSSYNDQWGTHFYYGGPGRNPECQ
        Q+IS +AENT+CYNI+SSYNDQWGTHFYYGGPGRNP+CQ
Subjt:  QEISTLAENTHCYNILSSYNDQWGTHFYYGGPGRNPECQ

XP_022151674.1 uncharacterized protein LOC111019590 [Momordica charantia]3.0e-23289.95Show/hide
Query:  MGTKRRVSIS-LPLLFALLFCFVVLQRFTLVCGLNYTYKQVSSLRLDRIRRHLDNINKPPLLTIQSPDGDIIDCVHKRKQPALDHPLLKNHKIQRGPTEW
        MG  +R  +S  PL  AL    VV +RF+LV GLNYTYKQVSSLRLDRI+RHLDNINKPPLLTIQSPDGDIIDCVHKRKQPALDHPLLKNHKIQRGPTEW
Subjt:  MGTKRRVSIS-LPLLFALLFCFVVLQRFTLVCGLNYTYKQVSSLRLDRIRRHLDNINKPPLLTIQSPDGDIIDCVHKRKQPALDHPLLKNHKIQRGPTEW

Query:  PKMKKVKENNE----DGSEKRVGSGAGGAYQTWRVNGTRCPKGSIPVRRSTVNDVLRAKSLFDFGKKKRPILLDRRMDAPDVVSGNGHEHAIAYTGSSEE
        P++KK+KE NE    DGSE R GSGAGGA+QTWRVNGTRCPKGSIPVRRSTVNDVLRAKSLFDFGKK+RPILLDR+MDAPDVVSGNGHEHAIAYTGSS+E
Subjt:  PKMKKVKENNE----DGSEKRVGSGAGGAYQTWRVNGTRCPKGSIPVRRSTVNDVLRAKSLFDFGKKKRPILLDRRMDAPDVVSGNGHEHAIAYTGSSEE

Query:  MYGAKATINVWEPSIQVVNEFSLSQIWILSGSFDGSDLNSIEAGWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLCAGFVQTNSKIAIGAAISPISSL
        MYGAKATINVW+PSIQVVNEFSLSQIWILSGSFDGSDLNSIEAGWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLCAGFVQTNSKIAIGAAISPISS 
Subjt:  MYGAKATINVWEPSIQVVNEFSLSQIWILSGSFDGSDLNSIEAGWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLCAGFVQTNSKIAIGAAISPISSL

Query:  TGSQYDITILIWKDPKLGNWWMGFGDNTLVGYWPAELFTHLADHATMVEWGGEVVNSRANGEHTSTEMGSGRFADEGFGKASYFRNLEIVDSDNSLSAIQ
        TGSQYD+TILIWKDPKLGNWWMGFGDNTLVGYWPAELFTHL DHATMVEWGGEVVNS+A GEHTST+MGSGRFA+EGFGKASYFRNLEIVDSDNSLSA+Q
Subjt:  TGSQYDITILIWKDPKLGNWWMGFGDNTLVGYWPAELFTHLADHATMVEWGGEVVNSRANGEHTSTEMGSGRFADEGFGKASYFRNLEIVDSDNSLSAIQ

Query:  EISTLAENTHCYNILSSYNDQWGTHFYYGGPGRNPECQ
        EISTLAEN HCYNI+SSYNDQWGTHFYYGGPGRNPECQ
Subjt:  EISTLAENTHCYNILSSYNDQWGTHFYYGGPGRNPECQ

XP_038878455.1 uncharacterized protein LOC120070684 isoform X1 [Benincasa hispida]4.9e-22786.34Show/hide
Query:  MGTKRRVSISLP------------LLFALLFCFVV--LQRFTLVCGLNYTYKQVSSLRLDRIRRHLDNINKPPLLT-------IQSPDGDIIDCVHKRKQ
        MG KR VSIS+             L FALLF  +V  LQRFTLVCGLNYTYKQVSSLRL+RI+RHLD+INKPPLLT       IQSPDGDIIDCVHKRKQ
Subjt:  MGTKRRVSISLP------------LLFALLFCFVV--LQRFTLVCGLNYTYKQVSSLRLDRIRRHLDNINKPPLLT-------IQSPDGDIIDCVHKRKQ

Query:  PALDHPLLKNHKIQRGPTEWPKMKKVKENNEDGSEKRVGSGAGGAYQTWRVNGTRCPKGSIPVRRSTVNDVLRAKSLFDFGKKKRPILLDRRMDAPDVVS
        PALDHPLLKNHKIQRGPTEWPK KKV EN E  S +  GSGAGGA QTWRVNGTRCPKGSIPVRRSTVNDVLR+KSLFDFGKKKRPILLDRR+DAPDVVS
Subjt:  PALDHPLLKNHKIQRGPTEWPKMKKVKENNEDGSEKRVGSGAGGAYQTWRVNGTRCPKGSIPVRRSTVNDVLRAKSLFDFGKKKRPILLDRRMDAPDVVS

Query:  GNGHEHAIAYTGSSEEMYGAKATINVWEPSIQVVNEFSLSQIWILSGSFDGSDLNSIEAGWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLCAGFVQT
        GNGHEHAIAYT SSEEMYGAKATINVW+PSIQ+VNEFSLSQIWILSGSFDGSDLNSIEAGWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLCAGFVQT
Subjt:  GNGHEHAIAYTGSSEEMYGAKATINVWEPSIQVVNEFSLSQIWILSGSFDGSDLNSIEAGWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLCAGFVQT

Query:  NSKIAIGAAISPISSLTGSQYDITILIWKDPKLGNWWMGFGDNTLVGYWPAELFTHLADHATMVEWGGEVVNSRANGEHTSTEMGSGRFADEGFGKASYF
        N+KIAIGAAISPISS +GSQYDITILIWKDPKLGNWWMGFGDNTLVGYWPAELFTHLADHATMVEWGGEVVNSRANG+HTST+MGSG F D+GFGKASYF
Subjt:  NSKIAIGAAISPISSLTGSQYDITILIWKDPKLGNWWMGFGDNTLVGYWPAELFTHLADHATMVEWGGEVVNSRANGEHTSTEMGSGRFADEGFGKASYF

Query:  RNLEIVDSDNSLSAIQEISTLAENTHCYNILSSYNDQWGTHFYYGGPGRNPECQ
        RNLEIVDSDNSLSA+Q+IS +AENT+CYNI+SSYNDQWGTHFYYGGPGRNP+CQ
Subjt:  RNLEIVDSDNSLSAIQEISTLAENTHCYNILSSYNDQWGTHFYYGGPGRNPECQ

XP_038878456.1 uncharacterized protein LOC120070684 isoform X2 [Benincasa hispida]4.0e-22987.7Show/hide
Query:  MGTKRRVSISLP------------LLFALLFCFVV--LQRFTLVCGLNYTYKQVSSLRLDRIRRHLDNINKPPLLTIQSPDGDIIDCVHKRKQPALDHPL
        MG KR VSIS+             L FALLF  +V  LQRFTLVCGLNYTYKQVSSLRL+RI+RHLD+INKPPLLTIQSPDGDIIDCVHKRKQPALDHPL
Subjt:  MGTKRRVSISLP------------LLFALLFCFVV--LQRFTLVCGLNYTYKQVSSLRLDRIRRHLDNINKPPLLTIQSPDGDIIDCVHKRKQPALDHPL

Query:  LKNHKIQRGPTEWPKMKKVKENNEDGSEKRVGSGAGGAYQTWRVNGTRCPKGSIPVRRSTVNDVLRAKSLFDFGKKKRPILLDRRMDAPDVVSGNGHEHA
        LKNHKIQRGPTEWPK KKV EN E  S +  GSGAGGA QTWRVNGTRCPKGSIPVRRSTVNDVLR+KSLFDFGKKKRPILLDRR+DAPDVVSGNGHEHA
Subjt:  LKNHKIQRGPTEWPKMKKVKENNEDGSEKRVGSGAGGAYQTWRVNGTRCPKGSIPVRRSTVNDVLRAKSLFDFGKKKRPILLDRRMDAPDVVSGNGHEHA

Query:  IAYTGSSEEMYGAKATINVWEPSIQVVNEFSLSQIWILSGSFDGSDLNSIEAGWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLCAGFVQTNSKIAIG
        IAYT SSEEMYGAKATINVW+PSIQ+VNEFSLSQIWILSGSFDGSDLNSIEAGWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLCAGFVQTN+KIAIG
Subjt:  IAYTGSSEEMYGAKATINVWEPSIQVVNEFSLSQIWILSGSFDGSDLNSIEAGWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLCAGFVQTNSKIAIG

Query:  AAISPISSLTGSQYDITILIWKDPKLGNWWMGFGDNTLVGYWPAELFTHLADHATMVEWGGEVVNSRANGEHTSTEMGSGRFADEGFGKASYFRNLEIVD
        AAISPISS +GSQYDITILIWKDPKLGNWWMGFGDNTLVGYWPAELFTHLADHATMVEWGGEVVNSRANG+HTST+MGSG F D+GFGKASYFRNLEIVD
Subjt:  AAISPISSLTGSQYDITILIWKDPKLGNWWMGFGDNTLVGYWPAELFTHLADHATMVEWGGEVVNSRANGEHTSTEMGSGRFADEGFGKASYFRNLEIVD

Query:  SDNSLSAIQEISTLAENTHCYNILSSYNDQWGTHFYYGGPGRNPECQ
        SDNSLSA+Q+IS +AENT+CYNI+SSYNDQWGTHFYYGGPGRNP+CQ
Subjt:  SDNSLSAIQEISTLAENTHCYNILSSYNDQWGTHFYYGGPGRNPECQ

TrEMBL top hitse value%identityAlignment
A0A0A0LXY9 Uncharacterized protein1.0e-22586.1Show/hide
Query:  MGTKR-----RVSISLPLLFALLFCFVVLQRFTLVCGLNYTY-KQVSSLRLDRIRRHLDNINKPPLLTIQSPDGDIIDCVHKRKQPALDHPLLKNHKIQR
        MGTK       +SIS  L F L+FCFV+ QRFTLVCGLNYTY K +SSLRLDRI+RHLD+INKPPLLTIQSPDGDIIDCVHKRKQPALDHPLLKNHKIQR
Subjt:  MGTKR-----RVSISLPLLFALLFCFVVLQRFTLVCGLNYTY-KQVSSLRLDRIRRHLDNINKPPLLTIQSPDGDIIDCVHKRKQPALDHPLLKNHKIQR

Query:  GPTEWPKMKKVKENNEDGSEKRVGSGAGGAYQTWRVNGTRCPKGSIPVRRSTVNDVLRAKSLFDFGKKKRPILLDRRMDAPDVVSGNGHEHAIAYTGSSE
        GPTEWPK K  KEN E+ SE+R GSGA  ++QTWRVNGTRCPKG++PVRR+TV DVLR+KSLFDFGKKKRPILLDR++DAPDVVSGNGHEHAIAYTGSSE
Subjt:  GPTEWPKMKKVKENNEDGSEKRVGSGAGGAYQTWRVNGTRCPKGSIPVRRSTVNDVLRAKSLFDFGKKKRPILLDRRMDAPDVVSGNGHEHAIAYTGSSE

Query:  EMYGAKATINVWEPSIQVVNEFSLSQIWILSGSFDGSDLNSIEAGWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLCAGFVQTNSKIAIGAAISPISS
        EMYGAKATINVW+PSI++VNEFSLSQIWILSGSFDGSDLNSIEAGWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLC+GFVQTNSKIAIGAAISPISS
Subjt:  EMYGAKATINVWEPSIQVVNEFSLSQIWILSGSFDGSDLNSIEAGWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLCAGFVQTNSKIAIGAAISPISS

Query:  LTGSQYDITILIWKDPKLGNWWMGFGDNTLVGYWPAELFTHLADHATMVEWGGEVVNSRANGEHTSTEMGSGRFADEGFGKASYFRNLEIVDSDNSLSAI
        + GSQYDITILIWKDPKLGNWWMGFG+NTLVGYWPAELFTHLADHATMVEWGGEVVNSR NG+HTST+MGSG F D+GF KASYFRNLEIVDSDNSLS++
Subjt:  LTGSQYDITILIWKDPKLGNWWMGFGDNTLVGYWPAELFTHLADHATMVEWGGEVVNSRANGEHTSTEMGSGRFADEGFGKASYFRNLEIVDSDNSLSAI

Query:  QEISTLAENTHCYNILSSYNDQWGTHFYYGGPGRNPECQ
        Q+IS +AENT+CYNI+SSYNDQWGTHFYYGGPGRNP+CQ
Subjt:  QEISTLAENTHCYNILSSYNDQWGTHFYYGGPGRNPECQ

A0A1S3B8R5 uncharacterized protein LOC1034872732.2e-22887.19Show/hide
Query:  MGTKRRVSISLP----LLFALLFCFVVLQRFTLVCGLNYTYKQVSSLRLDRIRRHLDNINKPPLLTIQSPDGDIIDCVHKRKQPALDHPLLKNHKIQRGP
        MGTK  VS S+     L F L+FCFVV QRFTLVCGLNYTY+++SSLRLDRI+RHLD+INKPPLLTIQSPDGDIIDCVHKRKQPALDHPLLKNHKIQRGP
Subjt:  MGTKRRVSISLP----LLFALLFCFVVLQRFTLVCGLNYTYKQVSSLRLDRIRRHLDNINKPPLLTIQSPDGDIIDCVHKRKQPALDHPLLKNHKIQRGP

Query:  TEWPKMKKVKENNEDGSEKRVGSGAGGAYQTWRVNGTRCPKGSIPVRRSTVNDVLRAKSLFDFGKKKRPILLDRRMDAPDVVSGNGHEHAIAYTGSSEEM
        TEWPK K VKEN E+  E+R GSGA  A+QTWRVNGTRCPKG+IPVRR+TV DVLR+KSLFDFGKK+RPILLDR++DAPDVVSGNGHEHAIAYTGSSEEM
Subjt:  TEWPKMKKVKENNEDGSEKRVGSGAGGAYQTWRVNGTRCPKGSIPVRRSTVNDVLRAKSLFDFGKKKRPILLDRRMDAPDVVSGNGHEHAIAYTGSSEEM

Query:  YGAKATINVWEPSIQVVNEFSLSQIWILSGSFDGSDLNSIEAGWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLCAGFVQTNSKIAIGAAISPISSLT
        YGAKATINVW+PSIQ+VNEFSLSQIWILSGSFDGSDLNSIEAGWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLCAGFVQTNSKIAIGAAISPISS++
Subjt:  YGAKATINVWEPSIQVVNEFSLSQIWILSGSFDGSDLNSIEAGWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLCAGFVQTNSKIAIGAAISPISSLT

Query:  GSQYDITILIWKDPKLGNWWMGFGDNTLVGYWPAELFTHLADHATMVEWGGEVVNSRANGEHTSTEMGSGRFADEGFGKASYFRNLEIVDSDNSLSAIQE
        GSQYDITILIWKDPKLGNWWMGFG+NTLVGYWPAELFTHLADHATMVEWGGEVVNSR NG+HTST+MGSG F D+GF KASYFRNLEIVDSDNSLS +Q+
Subjt:  GSQYDITILIWKDPKLGNWWMGFGDNTLVGYWPAELFTHLADHATMVEWGGEVVNSRANGEHTSTEMGSGRFADEGFGKASYFRNLEIVDSDNSLSAIQE

Query:  ISTLAENTHCYNILSSYNDQWGTHFYYGGPGRNPECQ
        IS +AENT+CYNI+SSYNDQWGTHFYYGGPGRNP+CQ
Subjt:  ISTLAENTHCYNILSSYNDQWGTHFYYGGPGRNPECQ

A0A6J1DDR6 uncharacterized protein LOC1110195901.4e-23289.95Show/hide
Query:  MGTKRRVSIS-LPLLFALLFCFVVLQRFTLVCGLNYTYKQVSSLRLDRIRRHLDNINKPPLLTIQSPDGDIIDCVHKRKQPALDHPLLKNHKIQRGPTEW
        MG  +R  +S  PL  AL    VV +RF+LV GLNYTYKQVSSLRLDRI+RHLDNINKPPLLTIQSPDGDIIDCVHKRKQPALDHPLLKNHKIQRGPTEW
Subjt:  MGTKRRVSIS-LPLLFALLFCFVVLQRFTLVCGLNYTYKQVSSLRLDRIRRHLDNINKPPLLTIQSPDGDIIDCVHKRKQPALDHPLLKNHKIQRGPTEW

Query:  PKMKKVKENNE----DGSEKRVGSGAGGAYQTWRVNGTRCPKGSIPVRRSTVNDVLRAKSLFDFGKKKRPILLDRRMDAPDVVSGNGHEHAIAYTGSSEE
        P++KK+KE NE    DGSE R GSGAGGA+QTWRVNGTRCPKGSIPVRRSTVNDVLRAKSLFDFGKK+RPILLDR+MDAPDVVSGNGHEHAIAYTGSS+E
Subjt:  PKMKKVKENNE----DGSEKRVGSGAGGAYQTWRVNGTRCPKGSIPVRRSTVNDVLRAKSLFDFGKKKRPILLDRRMDAPDVVSGNGHEHAIAYTGSSEE

Query:  MYGAKATINVWEPSIQVVNEFSLSQIWILSGSFDGSDLNSIEAGWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLCAGFVQTNSKIAIGAAISPISSL
        MYGAKATINVW+PSIQVVNEFSLSQIWILSGSFDGSDLNSIEAGWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLCAGFVQTNSKIAIGAAISPISS 
Subjt:  MYGAKATINVWEPSIQVVNEFSLSQIWILSGSFDGSDLNSIEAGWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLCAGFVQTNSKIAIGAAISPISSL

Query:  TGSQYDITILIWKDPKLGNWWMGFGDNTLVGYWPAELFTHLADHATMVEWGGEVVNSRANGEHTSTEMGSGRFADEGFGKASYFRNLEIVDSDNSLSAIQ
        TGSQYD+TILIWKDPKLGNWWMGFGDNTLVGYWPAELFTHL DHATMVEWGGEVVNS+A GEHTST+MGSGRFA+EGFGKASYFRNLEIVDSDNSLSA+Q
Subjt:  TGSQYDITILIWKDPKLGNWWMGFGDNTLVGYWPAELFTHLADHATMVEWGGEVVNSRANGEHTSTEMGSGRFADEGFGKASYFRNLEIVDSDNSLSAIQ

Query:  EISTLAENTHCYNILSSYNDQWGTHFYYGGPGRNPECQ
        EISTLAEN HCYNI+SSYNDQWGTHFYYGGPGRNPECQ
Subjt:  EISTLAENTHCYNILSSYNDQWGTHFYYGGPGRNPECQ

A0A6J1HAZ7 uncharacterized protein LOC1114616982.6e-22187Show/hide
Query:  LPLLFALLFCFVVLQRFTLVCGLNYTYKQVSSLRLDRIRRHLDNINKPPLLTIQSPDGDIIDCVHKRKQPALDHPLLKNHKIQRGPTEWPKMKKVKENNE
        LP    L    +++QRF LVCGLN++ KQVSSLRLDRI+RHLD INKPPLLTIQSPDGDIIDCVHKRKQPALDHPLLKNHKIQRGPTEWPK KKVKEN E
Subjt:  LPLLFALLFCFVVLQRFTLVCGLNYTYKQVSSLRLDRIRRHLDNINKPPLLTIQSPDGDIIDCVHKRKQPALDHPLLKNHKIQRGPTEWPKMKKVKENNE

Query:  DGSEKRVGSGAGGAYQTWRVNGTRCPKGSIPVRRSTVNDVLRAKSLFDFGKKKRPILLDRRMDAPDVVSGNGHEHAIAYTGSSEEMYGAKATINVWEPSI
        D  + R GSGAGG +QTWRVNGTRCPKGSIPVRRSTVNDVLR KS+FD+GKKKRPILLDR++DAPDVVSGNGHEHAIAYT SS EMYGAKATINVW+PSI
Subjt:  DGSEKRVGSGAGGAYQTWRVNGTRCPKGSIPVRRSTVNDVLRAKSLFDFGKKKRPILLDRRMDAPDVVSGNGHEHAIAYTGSSEEMYGAKATINVWEPSI

Query:  QVVNEFSLSQIWILSGSFDGSDLNSIEAGWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLCAGFVQTNSKIAIGAAISPISSLTGSQYDITILIWKDP
        QVVNEFSLSQ+WI+SGSFDGSDLNSIEAGWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLC+GFVQTN+KIAIGAAISP+SS +GSQYDITILIWKDP
Subjt:  QVVNEFSLSQIWILSGSFDGSDLNSIEAGWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLCAGFVQTNSKIAIGAAISPISSLTGSQYDITILIWKDP

Query:  KLGNWWMGFGDNTLVGYWPAELFTHLADHATMVEWGGEVVNSRANGEHTSTEMGSGRFADEGFGKASYFRNLEIVDSDNSLSAIQEISTLAENTHCYNIL
        KLG+WWMGFGDNTLVGYWPAELFTHL DHATMVEWGGEVVNSRANG+HTST+MGSG F D+GFGKASYFRNLEIVDSDNSLSA+Q+IS +AENT+CYNI+
Subjt:  KLGNWWMGFGDNTLVGYWPAELFTHLADHATMVEWGGEVVNSRANGEHTSTEMGSGRFADEGFGKASYFRNLEIVDSDNSLSAIQEISTLAENTHCYNIL

Query:  SSYNDQWGTHFYYGGPGRNPECQ
        SSYNDQWGTHFYYGGPGRNP+CQ
Subjt:  SSYNDQWGTHFYYGGPGRNPECQ

A0A6J1I9T9 uncharacterized protein LOC1114729041.7e-22085.68Show/hide
Query:  MGTKRRVSISLPLLFALLFCFVVLQRFTLVCGLNYTYKQVSSLRLDRIRRHLDNINKPPLLTIQSPDGDIIDCVHKRKQPALDHPLLKNHKIQRGPTEWP
        MG  + VS+S+ L   LL   +VLQ F+LVCGL Y+Y+ VSSLR DRI+ HLD+INKPPLLTIQSPDGDIIDCVHKRKQPALDHPLLK+HKIQR PT WP
Subjt:  MGTKRRVSISLPLLFALLFCFVVLQRFTLVCGLNYTYKQVSSLRLDRIRRHLDNINKPPLLTIQSPDGDIIDCVHKRKQPALDHPLLKNHKIQRGPTEWP

Query:  KMKKVKENNEDGSEKRVGSGAGGAYQTWRVNGTRCPKGSIPVRRSTVNDVLRAKSLFDFGKKKRPILLDRRMDAPDVVSGNGHEHAIAYTGSSEEMYGAK
        KMKK   NNEDGSE++ GSGAGG +QTW  N TRCPKG+IPVRRSTV DVLRAKSLFDFGKKKRPILLDR+MDAPDVVSGNGHEHAIAYT S  EMYGAK
Subjt:  KMKKVKENNEDGSEKRVGSGAGGAYQTWRVNGTRCPKGSIPVRRSTVNDVLRAKSLFDFGKKKRPILLDRRMDAPDVVSGNGHEHAIAYTGSSEEMYGAK

Query:  ATINVWEPSIQVVNEFSLSQIWILSGSFDGSDLNSIEAGWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLCAGFVQTNSKIAIGAAISPISSLTGSQY
        ATINVWEPSIQ+VNEFSLSQIWILSGSFDGSDLNSIEAGWQVSPE YGDSRPRLFTYWTSDAYQATGCYNLLCAGFVQTNS+IAIGAAISP+SSLTG+QY
Subjt:  ATINVWEPSIQVVNEFSLSQIWILSGSFDGSDLNSIEAGWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLCAGFVQTNSKIAIGAAISPISSLTGSQY

Query:  DITILIWKDPKLGNWWMGFGDNTLVGYWPAELFTHLADHATMVEWGGEVVNSRANGEHTSTEMGSGRFADEGFGKASYFRNLEIVDSDNSLSAIQEISTL
        DITILIWKDPKLGNWWMGFGDNTLVGYWPAELFTHLADHATMVEWGGEVVNSR+NG+HTST+MGSG FA++GFGKASYFRNLEIVD+DN+L  +QEISTL
Subjt:  DITILIWKDPKLGNWWMGFGDNTLVGYWPAELFTHLADHATMVEWGGEVVNSRANGEHTSTEMGSGRFADEGFGKASYFRNLEIVDSDNSLSAIQEISTL

Query:  AENTHCYNILSSYNDQWGTHFYYGGPGRNPECQ
        AENT+CYNI+SSYNDQWGTHFYYGGPGRNPECQ
Subjt:  AENTHCYNILSSYNDQWGTHFYYGGPGRNPECQ

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G55360.1 Protein of Unknown Function (DUF239)2.4e-13154.81Show/hide
Query:  LQRFTLVC-------GLNYTYKQVSSLRLDRIRRHLDNINKPPLLTIQSPDGDIIDCVHKRKQPALDHPLLKNHKIQRGPTEWPKMKKVKENNEDGSEKR
        L R  LVC        L+Y  +   S +   +++HL+ +NKP + +IQS DGD+IDCV   KQPA DHP LK+HKIQ  P   P  + + ++N+  + K 
Subjt:  LQRFTLVC-------GLNYTYKQVSSLRLDRIRRHLDNINKPPLLTIQSPDGDIIDCVHKRKQPALDHPLLKNHKIQRGPTEWPKMKKVKENNEDGSEKR

Query:  VGSGAGGAYQTWRVNGTRCPKGSIPVRRSTVNDVLRAKSLFDFGKKKRPILLDRRMDAPDVVSGNGHEHAIAYTGSSEEMYGAKATINVWEPSIQVVNEF
             G   Q W   G +C +G+IP+RR+  +DVLRA S+  +GKKKR  +   +   PD+++ +GH+HAIAY    ++ YGAKATINVWEP IQ  NEF
Subjt:  VGSGAGGAYQTWRVNGTRCPKGSIPVRRSTVNDVLRAKSLFDFGKKKRPILLDRRMDAPDVVSGNGHEHAIAYTGSSEEMYGAKATINVWEPSIQVVNEF

Query:  SLSQIWILSGSFDGSDLNSIEAGWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLCAGFVQTNSKIAIGAAISPISSLTGSQYDITILIWKDPKLGNWW
        SLSQIW+L GSF G DLNSIEAGWQVSP+LYGD+  RLFTYWTSDAYQATGCYNLLC+GF+Q NS IA+GA+ISP+S    SQYDI+ILIWKDPK G+WW
Subjt:  SLSQIWILSGSFDGSDLNSIEAGWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLCAGFVQTNSKIAIGAAISPISSLTGSQYDITILIWKDPKLGNWW

Query:  MGFGDNTLVGYWPAELFTHLADHATMVEWGGEVVNSRANGEHTSTEMGSGRFADEGFGKASYFRNLEIVDSDNSLSAIQEISTLAENTHCYNILSSYNDQ
        M FG+  ++GYWP+ LF++L + A+M+EWGGEVVNS+++G+HTST+MGSG+F +EGF KASYFRN+++VD  N+L A + + T  E ++CY++ +  ND 
Subjt:  MGFGDNTLVGYWPAELFTHLADHATMVEWGGEVVNSRANGEHTSTEMGSGRFADEGFGKASYFRNLEIVDSDNSLSAIQEISTLAENTHCYNILSSYNDQ

Query:  WGTHFYYGGPGRNPEC
        WG +FYYGGPG+N +C
Subjt:  WGTHFYYGGPGRNPEC

AT3G13510.1 Protein of Unknown Function (DUF239)2.4e-13156.63Show/hide
Query:  SSLRLDRIRRHLDNINKPPLLTIQSPDGDIIDCVHKRKQPALDHPLLKNHKIQRGPTEWPKMKKVKENNEDGSEKRVGSGAGGAYQTWRVNGTRCPKGSI
        SS +   +++HL+ +NKPP+ TIQSPDGDIIDC+   KQPA DHP LK+HKIQ  P+  P  + + ++N+  +E +         Q W   G +C +G+I
Subjt:  SSLRLDRIRRHLDNINKPPLLTIQSPDGDIIDCVHKRKQPALDHPLLKNHKIQRGPTEWPKMKKVKENNEDGSEKRVGSGAGGAYQTWRVNGTRCPKGSI

Query:  PVRRSTVNDVLRAKSLFDFGKKKRPILLDRRMDAPDVVSGNGHEHAIAYTGSSEEMYGAKATINVWEPSIQVVNEFSLSQIWILSGSFDGSDLNSIEAGW
        P+RR+  +DVLRA S+  +GKKK   +   +   PD+++ NGH+HAIAY    ++ YGAKAT+NVWEP IQ  NEFSLSQIW+L GSF G DLNSIEAGW
Subjt:  PVRRSTVNDVLRAKSLFDFGKKKRPILLDRRMDAPDVVSGNGHEHAIAYTGSSEEMYGAKATINVWEPSIQVVNEFSLSQIWILSGSFDGSDLNSIEAGW

Query:  QVSPELYGDSRPRLFTYWTSDAYQATGCYNLLCAGFVQTNSKIAIGAAISPISSLTGSQYDITILIWKDPKLGNWWMGFGDNTLVGYWPAELFTHLADHA
        QVSP+LYGD+  RLFTYWTSDAYQATGCYNLLC+GF+Q NS IA+GA+ISP+S    SQYDI+ILIWKDPK G+WWM FG+  ++GYWP+ LF++L + A
Subjt:  QVSPELYGDSRPRLFTYWTSDAYQATGCYNLLCAGFVQTNSKIAIGAAISPISSLTGSQYDITILIWKDPKLGNWWMGFGDNTLVGYWPAELFTHLADHA

Query:  TMVEWGGEVVNSRANGEHTSTEMGSGRFADEGFGKASYFRNLEIVDSDNSLSAIQEISTLAENTHCYNILSSYNDQWGTHFYYGGPGRNPEC
        +M+EWGGEVVNS++ G HT T+MGSG F +EGF KASYFRN+++VD  N+L A + + T  E ++CY++ +  ND WG +FYYGGPG+N  C
Subjt:  TMVEWGGEVVNSRANGEHTSTEMGSGRFADEGFGKASYFRNLEIVDSDNSLSAIQEISTLAENTHCYNILSSYNDQWGTHFYYGGPGRNPEC

AT5G18460.1 Protein of Unknown Function (DUF239)1.4e-18271.33Show/hide
Query:  LLFALLFCFVVLQRFTLVCGLNYT--YKQVSSLRLDRIRRHLDNINKPPLLTIQSPDGDIIDCVHKRKQPALDHPLLKNHKIQRGPTEWPKMKKVKENNE
        LLF L+    + Q+   +   N T  Y+QVSSLRL RI++HL+ INK P+ TIQSPDGD+IDCV KRKQPALDHPLLK+HKIQ+ P + PKMK      +
Subjt:  LLFALLFCFVVLQRFTLVCGLNYT--YKQVSSLRLDRIRRHLDNINKPPLLTIQSPDGDIIDCVHKRKQPALDHPLLKNHKIQRGPTEWPKMKKVKENNE

Query:  DGSEKRVGSGAGGAYQTWRVNGTRCPKGSIPVRRSTVNDVLRAKSLFDFGKKKRPILLDRRMDAPDVVSGNGHEHAIAYTGSSEEMYGAKATINVWEPSI
        D   K   +   GA+Q W VNGTRCPKG++P+RR+T+NDVLRAKSLFDFGKK+R I LD+R + PD +  NGHEHAIAYT SS E+YGAKATINVW+P I
Subjt:  DGSEKRVGSGAGGAYQTWRVNGTRCPKGSIPVRRSTVNDVLRAKSLFDFGKKKRPILLDRRMDAPDVVSGNGHEHAIAYTGSSEEMYGAKATINVWEPSI

Query:  QVVNEFSLSQIWILSGSFDGSDLNSIEAGWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLCAGFVQTNSKIAIGAAISPISSLTGSQYDITILIWKDP
        + VNEFSLSQIWILSGSF G DLNSIEAGWQVSPELYGD+RPRLFTYWTSD+YQATGCYNLLC+GF+QTN+KIAIGAAISP+S+  G+Q+DITILIWKDP
Subjt:  QVVNEFSLSQIWILSGSFDGSDLNSIEAGWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLCAGFVQTNSKIAIGAAISPISSLTGSQYDITILIWKDP

Query:  KLGNWWMGFGDNTLVGYWPAELFTHLADHATMVEWGGEVVNSRANGEHTSTEMGSGRFADEGFGKASYFRNLEIVDSDNSLSAIQEISTLAENTHCYNIL
        K+GNWWMG GD+TLVGYWPAELFTHLADHAT VEWGGEVVN+RA+G HT+T+MGSG F DEGFGKASYFRNLE+VDSDNSL  + ++  LAENT CY+I 
Subjt:  KLGNWWMGFGDNTLVGYWPAELFTHLADHATMVEWGGEVVNSRANGEHTSTEMGSGRFADEGFGKASYFRNLEIVDSDNSLSAIQEISTLAENTHCYNIL

Query:  SSYNDQWGTHFYYGGPGRNPEC
        SSY+++WGT+FYYGGPG NP C
Subjt:  SSYNDQWGTHFYYGGPGRNPEC

AT5G56530.1 Protein of Unknown Function (DUF239)6.8e-13455.56Show/hide
Query:  FALLFCFVVLQRFTLVCGLNYTYKQVSSLRLDRIRRHLDNINKPPLLTIQSPDGDIIDCVHKRKQPALDHPLLKNHKIQRGPTEWPKM----KKVKENNE
        F + FCF  L   T    L+ + +         + +HL+ +NKP + +IQSPDGDIIDCVH  KQPA DHP LK+HKIQ GP+  P+      KV E  +
Subjt:  FALLFCFVVLQRFTLVCGLNYTYKQVSSLRLDRIRRHLDNINKPPLLTIQSPDGDIIDCVHKRKQPALDHPLLKNHKIQRGPTEWPKM----KKVKENNE

Query:  DGSEKRVGSGAGGAYQTWRVNGTRCPKGSIPVRRSTVNDVLRAKSLFDFGKKKRPILLDRRMDAPDVVSGNGHEHAIAYTGSSEEMYGAKATINVWEPSI
        +              Q W  NG  C +G+IPVRR+   DVLRA S+  +GKKK   +   R   PD+++ +GH+HAIAY     + YGAKATINVWEP +
Subjt:  DGSEKRVGSGAGGAYQTWRVNGTRCPKGSIPVRRSTVNDVLRAKSLFDFGKKKRPILLDRRMDAPDVVSGNGHEHAIAYTGSSEEMYGAKATINVWEPSI

Query:  QVVNEFSLSQIWILSGSFDGSDLNSIEAGWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLCAGFVQTNSKIAIGAAISPISSLTGSQYDITILIWKDP
        Q  NEFSLSQ+WIL GSF G DLNSIEAGWQVSP+LYGD+  RLFTYWTSDAYQATGCYNLLC+GF+Q NS+IA+GA+ISP+S     QYDI+I IWKDP
Subjt:  QVVNEFSLSQIWILSGSFDGSDLNSIEAGWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLCAGFVQTNSKIAIGAAISPISSLTGSQYDITILIWKDP

Query:  KLGNWWMGFGDNTLVGYWPAELFTHLADHATMVEWGGEVVNSRANGEHTSTEMGSGRFADEGFGKASYFRNLEIVDSDNSLSAIQEISTLAENTHCYNIL
        K G+WWM FGD  ++GYWP+ LF++LAD A++VEWGGEVVN   +G HT+T+MGSG+F DEGF KASYFRN+++VDS N+L   + ++T  E ++CY++ 
Subjt:  KLGNWWMGFGDNTLVGYWPAELFTHLADHATMVEWGGEVVNSRANGEHTSTEMGSGRFADEGFGKASYFRNLEIVDSDNSLSAIQEISTLAENTHCYNIL

Query:  SSYNDQWGTHFYYGGPGRNPECQ
           ND WG +FYYGGPGRNP CQ
Subjt:  SSYNDQWGTHFYYGGPGRNPECQ

AT5G56530.2 Protein of Unknown Function (DUF239)6.8e-13455.56Show/hide
Query:  FALLFCFVVLQRFTLVCGLNYTYKQVSSLRLDRIRRHLDNINKPPLLTIQSPDGDIIDCVHKRKQPALDHPLLKNHKIQRGPTEWPKM----KKVKENNE
        F + FCF  L   T    L+ + +         + +HL+ +NKP + +IQSPDGDIIDCVH  KQPA DHP LK+HKIQ GP+  P+      KV E  +
Subjt:  FALLFCFVVLQRFTLVCGLNYTYKQVSSLRLDRIRRHLDNINKPPLLTIQSPDGDIIDCVHKRKQPALDHPLLKNHKIQRGPTEWPKM----KKVKENNE

Query:  DGSEKRVGSGAGGAYQTWRVNGTRCPKGSIPVRRSTVNDVLRAKSLFDFGKKKRPILLDRRMDAPDVVSGNGHEHAIAYTGSSEEMYGAKATINVWEPSI
        +              Q W  NG  C +G+IPVRR+   DVLRA S+  +GKKK   +   R   PD+++ +GH+HAIAY     + YGAKATINVWEP +
Subjt:  DGSEKRVGSGAGGAYQTWRVNGTRCPKGSIPVRRSTVNDVLRAKSLFDFGKKKRPILLDRRMDAPDVVSGNGHEHAIAYTGSSEEMYGAKATINVWEPSI

Query:  QVVNEFSLSQIWILSGSFDGSDLNSIEAGWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLCAGFVQTNSKIAIGAAISPISSLTGSQYDITILIWKDP
        Q  NEFSLSQ+WIL GSF G DLNSIEAGWQVSP+LYGD+  RLFTYWTSDAYQATGCYNLLC+GF+Q NS+IA+GA+ISP+S     QYDI+I IWKDP
Subjt:  QVVNEFSLSQIWILSGSFDGSDLNSIEAGWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLCAGFVQTNSKIAIGAAISPISSLTGSQYDITILIWKDP

Query:  KLGNWWMGFGDNTLVGYWPAELFTHLADHATMVEWGGEVVNSRANGEHTSTEMGSGRFADEGFGKASYFRNLEIVDSDNSLSAIQEISTLAENTHCYNIL
        K G+WWM FGD  ++GYWP+ LF++LAD A++VEWGGEVVN   +G HT+T+MGSG+F DEGF KASYFRN+++VDS N+L   + ++T  E ++CY++ 
Subjt:  KLGNWWMGFGDNTLVGYWPAELFTHLADHATMVEWGGEVVNSRANGEHTSTEMGSGRFADEGFGKASYFRNLEIVDSDNSLSAIQEISTLAENTHCYNIL

Query:  SSYNDQWGTHFYYGGPGRNPECQ
           ND WG +FYYGGPGRNP CQ
Subjt:  SSYNDQWGTHFYYGGPGRNPECQ


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCTGTTTGTGAGAGAAATTATGGGAACAAAAAGAAGGGTTTCAATATCATTACCTCTTCTTTTTGCTTTGCTTTTCTGTTTTGTTGTTCTTCAAAGATTTACTTTGGT
TTGTGGCCTCAATTATACTTATAAACAAGTTAGTAGCTTGAGATTGGACAGGATTCGAAGGCATTTGGACAACATTAACAAGCCTCCTCTTCTCACTATTCAGAGCCCAG
ATGGTGATATTATAGATTGTGTTCATAAAAGAAAACAGCCAGCTCTGGATCATCCCCTCTTGAAGAACCACAAGATTCAGAGAGGGCCAACAGAGTGGCCAAAAATGAAG
AAGGTGAAAGAGAATAATGAAGATGGAAGTGAGAAGAGGGTGGGATCAGGTGCGGGAGGTGCATATCAAACTTGGCGTGTGAACGGGACACGGTGTCCAAAGGGGAGTAT
TCCAGTGCGACGCAGCACAGTGAACGACGTGCTAAGAGCCAAGTCTTTGTTTGACTTTGGGAAGAAGAAACGACCGATTCTTCTTGATCGACGAATGGACGCTCCTGATG
TGGTCAGTGGGAATGGTCACGAGCATGCGATCGCATACACTGGATCATCGGAAGAAATGTACGGAGCGAAGGCGACAATAAACGTGTGGGAGCCGTCAATCCAAGTGGTC
AACGAGTTTAGCCTCTCTCAGATTTGGATCCTCTCGGGATCATTCGACGGCTCAGATCTCAACAGCATAGAAGCTGGTTGGCAGGTCAGTCCGGAGCTTTATGGTGACAG
CAGACCTAGATTATTCACATATTGGACGAGTGATGCGTATCAGGCAACTGGTTGCTATAATCTTTTATGTGCTGGATTTGTTCAAACAAACAGCAAAATCGCGATCGGAG
CCGCCATTTCTCCCATCTCTTCTCTTACCGGCAGCCAATACGACATTACCATTCTCATTTGGAAGGATCCAAAATTGGGAAACTGGTGGATGGGATTTGGGGATAACACA
CTGGTCGGGTACTGGCCGGCGGAGCTGTTCACTCACCTGGCCGACCATGCGACCATGGTGGAGTGGGGCGGCGAGGTGGTGAACTCAAGGGCAAATGGCGAGCACACTTC
CACCGAAATGGGCTCCGGCCGGTTCGCCGATGAGGGCTTTGGCAAAGCTAGCTACTTTCGAAACCTCGAGATCGTCGACTCCGACAACAGCCTCAGCGCCATTCAAGAGA
TCTCGACCTTGGCCGAGAACACCCATTGTTACAACATTTTGAGCTCCTACAATGACCAGTGGGGCACTCACTTCTACTACGGAGGCCCCGGTAGAAACCCAGAATGTCAA
TGA
mRNA sequenceShow/hide mRNA sequence
GTCCTTTAATGGATTTTCTTCTTCTCCTTCTTCTTCATTTTCAAATTAGCCCAAATACAATTTGATATTTCCTTTTTTTTCCCCTCTTTCTATGCTGTTTGTGAGAGAAA
TTATGGGAACAAAAAGAAGGGTTTCAATATCATTACCTCTTCTTTTTGCTTTGCTTTTCTGTTTTGTTGTTCTTCAAAGATTTACTTTGGTTTGTGGCCTCAATTATACT
TATAAACAAGTTAGTAGCTTGAGATTGGACAGGATTCGAAGGCATTTGGACAACATTAACAAGCCTCCTCTTCTCACTATTCAGAGCCCAGATGGTGATATTATAGATTG
TGTTCATAAAAGAAAACAGCCAGCTCTGGATCATCCCCTCTTGAAGAACCACAAGATTCAGAGAGGGCCAACAGAGTGGCCAAAAATGAAGAAGGTGAAAGAGAATAATG
AAGATGGAAGTGAGAAGAGGGTGGGATCAGGTGCGGGAGGTGCATATCAAACTTGGCGTGTGAACGGGACACGGTGTCCAAAGGGGAGTATTCCAGTGCGACGCAGCACA
GTGAACGACGTGCTAAGAGCCAAGTCTTTGTTTGACTTTGGGAAGAAGAAACGACCGATTCTTCTTGATCGACGAATGGACGCTCCTGATGTGGTCAGTGGGAATGGTCA
CGAGCATGCGATCGCATACACTGGATCATCGGAAGAAATGTACGGAGCGAAGGCGACAATAAACGTGTGGGAGCCGTCAATCCAAGTGGTCAACGAGTTTAGCCTCTCTC
AGATTTGGATCCTCTCGGGATCATTCGACGGCTCAGATCTCAACAGCATAGAAGCTGGTTGGCAGGTCAGTCCGGAGCTTTATGGTGACAGCAGACCTAGATTATTCACA
TATTGGACGAGTGATGCGTATCAGGCAACTGGTTGCTATAATCTTTTATGTGCTGGATTTGTTCAAACAAACAGCAAAATCGCGATCGGAGCCGCCATTTCTCCCATCTC
TTCTCTTACCGGCAGCCAATACGACATTACCATTCTCATTTGGAAGGATCCAAAATTGGGAAACTGGTGGATGGGATTTGGGGATAACACACTGGTCGGGTACTGGCCGG
CGGAGCTGTTCACTCACCTGGCCGACCATGCGACCATGGTGGAGTGGGGCGGCGAGGTGGTGAACTCAAGGGCAAATGGCGAGCACACTTCCACCGAAATGGGCTCCGGC
CGGTTCGCCGATGAGGGCTTTGGCAAAGCTAGCTACTTTCGAAACCTCGAGATCGTCGACTCCGACAACAGCCTCAGCGCCATTCAAGAGATCTCGACCTTGGCCGAGAA
CACCCATTGTTACAACATTTTGAGCTCCTACAATGACCAGTGGGGCACTCACTTCTACTACGGAGGCCCCGGTAGAAACCCAGAATGTCAATGATCAACATCATCTCTCA
ACCATGATAACCCCATGTTGGTCGTTCGGGGTATGTCATAGGAAAAATTCATTCTTATTTCTAAATTCGATAAAATGAATCAAATTTTCATCTTAAACTTTTAATTCTAT
CCCATCAAATTGGAATTGAAAGTTTAAGGTAAAAAATAATTCAAGTGGTAATTTAGGGATAGGAATTGAATTTTTATTTTTTCCCTTTCAATATTATAAGTCTTTAAGGC
TATTATTAATAGTAAGTGAAATTGATTCTTTAATTGTGTAATTAATGATCGATGTGGGTAGCTAGCTATTTCCTCTCTTGTTTTTGCTTCACTTTCAAAGTGTTCTTTTG
GTGCCTATTATTTCGAGAGAGAAATGAATTTGTACCCTTACAATAAAGACACACATACACTTGTTGTATCTGCTTAAACATGTCTGGAAATGGCACTATTATTTTG
Protein sequenceShow/hide protein sequence
MLFVREIMGTKRRVSISLPLLFALLFCFVVLQRFTLVCGLNYTYKQVSSLRLDRIRRHLDNINKPPLLTIQSPDGDIIDCVHKRKQPALDHPLLKNHKIQRGPTEWPKMK
KVKENNEDGSEKRVGSGAGGAYQTWRVNGTRCPKGSIPVRRSTVNDVLRAKSLFDFGKKKRPILLDRRMDAPDVVSGNGHEHAIAYTGSSEEMYGAKATINVWEPSIQVV
NEFSLSQIWILSGSFDGSDLNSIEAGWQVSPELYGDSRPRLFTYWTSDAYQATGCYNLLCAGFVQTNSKIAIGAAISPISSLTGSQYDITILIWKDPKLGNWWMGFGDNT
LVGYWPAELFTHLADHATMVEWGGEVVNSRANGEHTSTEMGSGRFADEGFGKASYFRNLEIVDSDNSLSAIQEISTLAENTHCYNILSSYNDQWGTHFYYGGPGRNPECQ