; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0001844 (gene) of Snake gourd v1 genome

Gene IDTan0001844
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionProtein of unknown function (DUF688)
Genome locationLG05:76350345..76353714
RNA-Seq ExpressionTan0001844
SyntenyTan0001844
Gene Ontology termsGO:0006413 - translational initiation (biological process)
GO:0003743 - translation initiation factor activity (molecular function)
InterPro domainsIPR007789 - Protein of unknown function DUF688


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAE8653215.1 hypothetical protein Csa_019700 [Cucumis sativus]2.5e-30281.42Show/hide
Query:  MEERKLNFNAPLMSVRRFSKASSSLAKVNEKKSENSQLSRRSTFPVSRPQFNLEQVTEPVAVPFHWEHIPGRAKNDSGSASPEVQLPQPPERTWSTPRLS
        MEERKLNFNAPLMSVRRFSKA+SS++K NEKKSENS  SRRSTFPVSRPQFNL+QVTEPVAVPF+WE IPGRAKNDSGSASPEV LP PPERT STPRLS
Subjt:  MEERKLNFNAPLMSVRRFSKASSSLAKVNEKKSENSQLSRRSTFPVSRPQFNLEQVTEPVAVPFHWEHIPGRAKNDSGSASPEVQLPQPPERTWSTPRLS

Query:  FGRALDVKKHSSEMEACDRNGCEANSSNAIVVRLEPTKASDARSLASEN-DDDDDDDFSDARETLSLTGSFSVNNCSVSGLSGYNGPLVKPSGTFRTDPQ
        FG ALD  K+SSEMEAC ++GCE++SSNAIVVRLE  KAS  RSLASEN DDDDDDDFSDARETLSLTGSFSVNNCSVSG+SGYNGP+VKPSGTFRTDPQ
Subjt:  FGRALDVKKHSSEMEACDRNGCEANSSNAIVVRLEPTKASDARSLASEN-DDDDDDDFSDARETLSLTGSFSVNNCSVSGLSGYNGPLVKPSGTFRTDPQ

Query:  TRDFMMSRFLPAAKAMVLEPAKYSLKKQLVAVEQSRQIKKAMSENRRMSPLKQLESTLLLQYGKNEVH---EIDEESDSVDDEYDNPGNISARGCGLIPN
        TRDFMMSRFLPAAKAMVLEPAKYSLKK+LVAVEQ RQ+KKA  ENRR+SP+K+LESTLLLQYGK+EVH   E+DEESDSVDDEYDN G+ISARGCGLIPN
Subjt:  TRDFMMSRFLPAAKAMVLEPAKYSLKKQLVAVEQSRQIKKAMSENRRMSPLKQLESTLLLQYGKNEVH---EIDEESDSVDDEYDNPGNISARGCGLIPN

Query:  ICFKNSLGLLHPVPGVRIRNEAPMSFTNKVGGSSRTMHQSHSQKINKHGWDAAYKQKSEAAVGSPKLQEVKDKWTGELKHFPSSTDLQMRGRSSPFRHSR
        ICFKNSLGLL+PVPG+RIR EAPMS T KVGGSSRT+H S+ QK+NKH WDA YKQKSEAAVGSPKL EVKDKWTGE KHF SSTDLQM+GRSSPFRHSR
Subjt:  ICFKNSLGLLHPVPGVRIRNEAPMSFTNKVGGSSRTMHQSHSQKINKHGWDAAYKQKSEAAVGSPKLQEVKDKWTGELKHFPSSTDLQMRGRSSPFRHSR

Query:  PASPFRNEASESSCRKQPLVVPKEVETFSNFKGDNDFRDRPSIRAT-KHGVDMASTLIEKTLYIDTASVAEITPPSNSSLLDTEKKVDHARRKNETAFET
         ASPFRNEAS S CR+QP VVPKEV+  S  KGD D  D PSI+AT K GVDMA+ L+EKTLYIDTASVA   PP NS++ D +KK +    KNETA E 
Subjt:  PASPFRNEASESSCRKQPLVVPKEVETFSNFKGDNDFRDRPSIRAT-KHGVDMASTLIEKTLYIDTASVAEITPPSNSSLLDTEKKVDHARRKNETAFET

Query:  RVIEETTTVEPSFLEVKCLTLVEEGKLEREAAELKRKDDINDDSKMGHELDKEELSEFSNLGAADEDEYSKANYQLVKVEDPASVKVTSMISSQPPPLPK
        RV+EE+TT EPSFLE+KCLT+VEEG+LEREAAE K KD I+D  K+GH L +E+ +E++NLG+ADE++YSKANYQLVKVEDPASVKVTS ISSQPPPLPK
Subjt:  RVIEETTTVEPSFLEVKCLTLVEEGKLEREAAELKRKDDINDDSKMGHELDKEELSEFSNLGAADEDEYSKANYQLVKVEDPASVKVTSMISSQPPPLPK

Query:  SPSESWLWRTLPSVSSKKLLAGSNLGNKLYHKPQSPRISASTKWETIVKSSNLRHDHVRYSEELIPRVSQHSTTENFK
        SPSESWLWRTLPSVSSKKLLAGSN GNKLY KPQSPR SASTKWETIVKSSNL HDHVRYSEEL+PRVSQHSTTENFK
Subjt:  SPSESWLWRTLPSVSSKKLLAGSNLGNKLYHKPQSPRISASTKWETIVKSSNLRHDHVRYSEELIPRVSQHSTTENFK

XP_008443314.1 PREDICTED: uncharacterized protein LOC103486924 [Cucumis melo]1.2e-30181.39Show/hide
Query:  MEERKLNFNAPLMSVRRFSKASSSLAKVNEKKSENSQLSRRSTFPVSRPQFNLEQVTEPVAVPFHWEHIPGRAKNDSGSASPEVQLPQPPERTWSTPRLS
        MEERKLNFNAPLMSVRRFSKA+SS+ K NEKKSENS  SRRSTFPVSRPQFNL+QVTEPVAVPF+WE IPGRAKNDSGSASPEVQLP PPERT STPRLS
Subjt:  MEERKLNFNAPLMSVRRFSKASSSLAKVNEKKSENSQLSRRSTFPVSRPQFNLEQVTEPVAVPFHWEHIPGRAKNDSGSASPEVQLPQPPERTWSTPRLS

Query:  FGRALDVKKHSSEMEACDRNGCEANSSNAIVVRLEPTKASDARSLASEN-DDDDDDDFSDARETLSLTGSFSVNNCSVSGLSGYNGPLVKPSGTFRTDPQ
        FG ALD  K+SSEMEAC ++GCE++SSNAIVVRLE  KAS ARSLASEN DDDDDDDFSDARETLSLTGSFSVNNCSVSG+SGYNGP+VKPSGTFRTDPQ
Subjt:  FGRALDVKKHSSEMEACDRNGCEANSSNAIVVRLEPTKASDARSLASEN-DDDDDDDFSDARETLSLTGSFSVNNCSVSGLSGYNGPLVKPSGTFRTDPQ

Query:  TRDFMMSRFLPAAKAMVLEPAKYSLKKQLVAVEQSRQIKKAMSENRRMSPLKQLESTLLLQYGKNEVH---EIDEESDSVDDEYDNPGNISARGCGLIPN
        TRDFMMSRFLPAAKAMVLEPAKYSLKK+LVAVEQ RQ+KK   ENRRMSP+K+LESTLLLQYGK+EVH   E+DEESDSVDDEYDN GNISARGCGLIPN
Subjt:  TRDFMMSRFLPAAKAMVLEPAKYSLKKQLVAVEQSRQIKKAMSENRRMSPLKQLESTLLLQYGKNEVH---EIDEESDSVDDEYDNPGNISARGCGLIPN

Query:  ICFKNSLGLLHPVPGVRIRNEAPMSFTNKVGGSSRTMHQSHSQKINKHGWDAAYKQKSEAAVGSPKLQEVKDKWTGELKHFPSSTDLQMRGRSSPFRHSR
        ICFKNSLGLL+PVPG+RIR EAPMS T KVG SSRT+H  + QK NKH WDA YKQKSEAAVGS KL EVKDKWTGE KHF  STDLQM+GRSSPFRHSR
Subjt:  ICFKNSLGLLHPVPGVRIRNEAPMSFTNKVGGSSRTMHQSHSQKINKHGWDAAYKQKSEAAVGSPKLQEVKDKWTGELKHFPSSTDLQMRGRSSPFRHSR

Query:  PASPFRNEASESSCRKQPLVVPKEVETFSNFKGDNDFRDRPSIRATKHGVDMASTLIEKTLYIDTASVAEITPPSNSSLLDTEKKVDHARRKNETAFETR
         ASPFRNEAS+S CR+QP VVPKEV+T S  KGD DF D PSI+A K GVDMAS L+EKTLYIDTASVAE  PP N ++ D +KK+++   K+ETA E R
Subjt:  PASPFRNEASESSCRKQPLVVPKEVETFSNFKGDNDFRDRPSIRATKHGVDMASTLIEKTLYIDTASVAEITPPSNSSLLDTEKKVDHARRKNETAFETR

Query:  VIEETTTVEPSFLEVKCLTLVEEGKLEREAAELKRKDDINDDSKMGHELDKEELSEFSNLGAADEDEYSKANYQLVKVEDPASVKVTSMISSQPPPLPKS
        V+EE+TT EPSFLE+KCLT+VEEG+LEREAAE K KD  +    +GH L +E+ +E++N G ADE++YSKANYQLVKVEDPA VKVTS ISSQPPPLPKS
Subjt:  VIEETTTVEPSFLEVKCLTLVEEGKLEREAAELKRKDDINDDSKMGHELDKEELSEFSNLGAADEDEYSKANYQLVKVEDPASVKVTSMISSQPPPLPKS

Query:  PSESWLWRTLPSVSSKKLLAGSNLGNKLYHKPQSPRISASTKWETIVKSSNLRHDHVRYSEELIPRVSQHSTTENFK
        PSESWLWRTLPSVSSKKLLAGSNLGNKLY KPQSPRISASTKWETIVKSSNLRHDHVRYSEELIPRVSQHSTTENFK
Subjt:  PSESWLWRTLPSVSSKKLLAGSNLGNKLYHKPQSPRISASTKWETIVKSSNLRHDHVRYSEELIPRVSQHSTTENFK

XP_011657681.2 uncharacterized protein LOC101207534 [Cucumis sativus]2.5e-30281.42Show/hide
Query:  MEERKLNFNAPLMSVRRFSKASSSLAKVNEKKSENSQLSRRSTFPVSRPQFNLEQVTEPVAVPFHWEHIPGRAKNDSGSASPEVQLPQPPERTWSTPRLS
        MEERKLNFNAPLMSVRRFSKA+SS++K NEKKSENS  SRRSTFPVSRPQFNL+QVTEPVAVPF+WE IPGRAKNDSGSASPEV LP PPERT STPRLS
Subjt:  MEERKLNFNAPLMSVRRFSKASSSLAKVNEKKSENSQLSRRSTFPVSRPQFNLEQVTEPVAVPFHWEHIPGRAKNDSGSASPEVQLPQPPERTWSTPRLS

Query:  FGRALDVKKHSSEMEACDRNGCEANSSNAIVVRLEPTKASDARSLASEN-DDDDDDDFSDARETLSLTGSFSVNNCSVSGLSGYNGPLVKPSGTFRTDPQ
        FG ALD  K+SSEMEAC ++GCE++SSNAIVVRLE  KAS  RSLASEN DDDDDDDFSDARETLSLTGSFSVNNCSVSG+SGYNGP+VKPSGTFRTDPQ
Subjt:  FGRALDVKKHSSEMEACDRNGCEANSSNAIVVRLEPTKASDARSLASEN-DDDDDDDFSDARETLSLTGSFSVNNCSVSGLSGYNGPLVKPSGTFRTDPQ

Query:  TRDFMMSRFLPAAKAMVLEPAKYSLKKQLVAVEQSRQIKKAMSENRRMSPLKQLESTLLLQYGKNEVH---EIDEESDSVDDEYDNPGNISARGCGLIPN
        TRDFMMSRFLPAAKAMVLEPAKYSLKK+LVAVEQ RQ+KKA  ENRR+SP+K+LESTLLLQYGK+EVH   E+DEESDSVDDEYDN G+ISARGCGLIPN
Subjt:  TRDFMMSRFLPAAKAMVLEPAKYSLKKQLVAVEQSRQIKKAMSENRRMSPLKQLESTLLLQYGKNEVH---EIDEESDSVDDEYDNPGNISARGCGLIPN

Query:  ICFKNSLGLLHPVPGVRIRNEAPMSFTNKVGGSSRTMHQSHSQKINKHGWDAAYKQKSEAAVGSPKLQEVKDKWTGELKHFPSSTDLQMRGRSSPFRHSR
        ICFKNSLGLL+PVPG+RIR EAPMS T KVGGSSRT+H S+ QK+NKH WDA YKQKSEAAVGSPKL EVKDKWTGE KHF SSTDLQM+GRSSPFRHSR
Subjt:  ICFKNSLGLLHPVPGVRIRNEAPMSFTNKVGGSSRTMHQSHSQKINKHGWDAAYKQKSEAAVGSPKLQEVKDKWTGELKHFPSSTDLQMRGRSSPFRHSR

Query:  PASPFRNEASESSCRKQPLVVPKEVETFSNFKGDNDFRDRPSIRAT-KHGVDMASTLIEKTLYIDTASVAEITPPSNSSLLDTEKKVDHARRKNETAFET
         ASPFRNEAS S CR+QP VVPKEV+  S  KGD D  D PSI+AT K GVDMA+ L+EKTLYIDTASVA   PP NS++ D +KK +    KNETA E 
Subjt:  PASPFRNEASESSCRKQPLVVPKEVETFSNFKGDNDFRDRPSIRAT-KHGVDMASTLIEKTLYIDTASVAEITPPSNSSLLDTEKKVDHARRKNETAFET

Query:  RVIEETTTVEPSFLEVKCLTLVEEGKLEREAAELKRKDDINDDSKMGHELDKEELSEFSNLGAADEDEYSKANYQLVKVEDPASVKVTSMISSQPPPLPK
        RV+EE+TT EPSFLE+KCLT+VEEG+LEREAAE K KD I+D  K+GH L +E+ +E++NLG+ADE++YSKANYQLVKVEDPASVKVTS ISSQPPPLPK
Subjt:  RVIEETTTVEPSFLEVKCLTLVEEGKLEREAAELKRKDDINDDSKMGHELDKEELSEFSNLGAADEDEYSKANYQLVKVEDPASVKVTSMISSQPPPLPK

Query:  SPSESWLWRTLPSVSSKKLLAGSNLGNKLYHKPQSPRISASTKWETIVKSSNLRHDHVRYSEELIPRVSQHSTTENFK
        SPSESWLWRTLPSVSSKKLLAGSN GNKLY KPQSPR SASTKWETIVKSSNL HDHVRYSEEL+PRVSQHSTTENFK
Subjt:  SPSESWLWRTLPSVSSKKLLAGSNLGNKLYHKPQSPRISASTKWETIVKSSNLRHDHVRYSEELIPRVSQHSTTENFK

XP_022140234.1 uncharacterized protein LOC111010950 isoform X1 [Momordica charantia]1.0e-29581.12Show/hide
Query:  MEERKLNFNAPLMSVRRFSKASSSLAKVNEKKSENSQLSRRSTFPVSRPQFNLEQVTEPVAVPFHWEHIPGRAKNDSGSASPEVQLPQPPERTWSTPRLS
        MEERKLNFNAPLMSVRR S ASSS AKVNEKKSEN QL+RR TFPV+R QFNL+QVTEPVAVPFHWE IPGRAKNDSGSASPE+QL QPPERT STPRLS
Subjt:  MEERKLNFNAPLMSVRRFSKASSSLAKVNEKKSENSQLSRRSTFPVSRPQFNLEQVTEPVAVPFHWEHIPGRAKNDSGSASPEVQLPQPPERTWSTPRLS

Query:  FGRALDVKKHSSEMEACDRNGCEANSSNAIVVRLEPTKA--SDARSLASENDDDDDDDFSDARETLSLTGSFSVNNCSVSGLSGYNGPLVKPSGTFRTDP
        FGR LDVKKH  E EAC  NGC+ANSSNAIVVRLE TKA   D R+LASE DDDDDDD+SDA +TL  + + SVNNCSVSGLSGYNGP+VKPSGTFRTDP
Subjt:  FGRALDVKKHSSEMEACDRNGCEANSSNAIVVRLEPTKA--SDARSLASENDDDDDDDFSDARETLSLTGSFSVNNCSVSGLSGYNGPLVKPSGTFRTDP

Query:  QTRDFMMSRFLPAAKAMVLEPAKYSLKKQLVAVEQSRQIKKAMSENRRMSPLKQLESTLLLQYGKNEVHEIDEESDSVDDEYDNPGNISARGCGLIPNIC
        QTRDFMMSRFLPAAKAMVLEPAKYSLKKQLVAVEQ RQ KK  SENRRMSP K+LEST+LLQYGK+EV   D+ESDS DDEYDN GNISARGCGLIPNIC
Subjt:  QTRDFMMSRFLPAAKAMVLEPAKYSLKKQLVAVEQSRQIKKAMSENRRMSPLKQLESTLLLQYGKNEVHEIDEESDSVDDEYDNPGNISARGCGLIPNIC

Query:  FKNSLGLLHPVPGVRIRNEAPMSFTNKVGGSSRTMHQSHSQKINKHGWDAAYKQKSEAAVGSPKLQEVKDKWTGELKHFPSSTDLQMRGRSSPFRHSRPA
        FKNSLGLL+PVPG+RIR E+P   TNKVGGSSRTMH SHSQKINKH WDAAYKQK EAAVGSP+LQEVKDKW GE K F +STDLQMRGRSSPFRHSR A
Subjt:  FKNSLGLLHPVPGVRIRNEAPMSFTNKVGGSSRTMHQSHSQKINKHGWDAAYKQKSEAAVGSPKLQEVKDKWTGELKHFPSSTDLQMRGRSSPFRHSRPA

Query:  SPFRNEASESSCRKQPLVVPKEVETFSNFKGDNDFRDRPSIRATKHGVDMASTLIEKTLYIDTASVAEITPPSNSSLLDTEKKVDHARRKNETAFETRVI
        SPFRNEA +S CR+Q ++VPKEVE  S  KGD DF D PSIRATK GVDM S +IEKTLYIDT SVAEIT P NS+LLD EK VD A  KNET   TR +
Subjt:  SPFRNEASESSCRKQPLVVPKEVETFSNFKGDNDFRDRPSIRATKHGVDMASTLIEKTLYIDTASVAEITPPSNSSLLDTEKKVDHARRKNETAFETRVI

Query:  EETTTVEPSFLEVKCLTLVEEGKLEREAAELKRKDDINDDSKMGHELDKEELSEFSNLGAADEDEYSKANYQLVKVEDPASVKVTSMISSQPPPLPKSPS
        EETTT EPSFLEVKCLTLVEEG+LEREAAE K K  I D SKM H LDKEE+S +SN+  ADEDEYSKANYQ+ KVEDPAS KVTS+ISSQPPPLPKSPS
Subjt:  EETTTVEPSFLEVKCLTLVEEGKLEREAAELKRKDDINDDSKMGHELDKEELSEFSNLGAADEDEYSKANYQLVKVEDPASVKVTSMISSQPPPLPKSPS

Query:  ESWLWRTLPSVSSKKLLAGSNLGNKLYHK--PQSPRISA-STKWETIVKSSNLRHDHVRYSEELIPRVSQHSTTENFK
        ESWLWRTLPSVSS+KLLAGSNLGNKLYHK   QSPR SA STKWETIVKSS LRHDHVRYSEELIPRVSQHSTTE+FK
Subjt:  ESWLWRTLPSVSSKKLLAGSNLGNKLYHK--PQSPRISA-STKWETIVKSSNLRHDHVRYSEELIPRVSQHSTTENFK

XP_038894811.1 uncharacterized protein LOC120083223 [Benincasa hispida]2.8e-30682.38Show/hide
Query:  MEERKLNFNAPLMSVRRFSKASSSLAKVNEKKSENSQLSRRSTFPVSRPQFNLEQVTEPVAVPFHWEHIPGRAKNDSGSASPEVQLPQPPERTWSTPRLS
        MEERKLNFNAPLMSVRRFSKA+SSLAKVNEKKSENS  SRRSTFPVSRPQFNL+QVTEPVAVPFHWE IPGRAKNDSGSASPEVQLP PPERT STPRLS
Subjt:  MEERKLNFNAPLMSVRRFSKASSSLAKVNEKKSENSQLSRRSTFPVSRPQFNLEQVTEPVAVPFHWEHIPGRAKNDSGSASPEVQLPQPPERTWSTPRLS

Query:  FGRALDVKKHSSEMEACDRNGCEANSSNAIVVRLEPTKASDARSLASEN-----DDDDDDDFSDARETLSLTGSFSVNNCSVSGLSGYNGPLVKPSGTFR
        F       K+S EMEAC ++ CEA+SS+AIVVRLE TKA DARSLASEN     DDDDDDDFSDARETLSLTGSFSVNNCSVSG+SG NGP+VKPSGTFR
Subjt:  FGRALDVKKHSSEMEACDRNGCEANSSNAIVVRLEPTKASDARSLASEN-----DDDDDDDFSDARETLSLTGSFSVNNCSVSGLSGYNGPLVKPSGTFR

Query:  TDPQTRDFMMSRFLPAAKAMVLEPAKYSLKKQLVAVEQSRQIKKAMSENRRMSPLKQLESTLLLQYGKNEVH---EIDEESDSVDDEYDNPGNISARGCG
        TDPQTRDFMMSRFLPAAKAMVLEPAKYSLKK+LVAVEQ R +KK MSENRR SP+KQLESTLLLQYG++EVH   E+DEESDSVDDEYDN GNISARGCG
Subjt:  TDPQTRDFMMSRFLPAAKAMVLEPAKYSLKKQLVAVEQSRQIKKAMSENRRMSPLKQLESTLLLQYGKNEVH---EIDEESDSVDDEYDNPGNISARGCG

Query:  LIPNICFKNSLGLLHPVPGVRIRNEAPMSFTNKVGGSSRTMHQSHSQKINKHGWDAAYKQKSEAAVGSPKLQEVKDKWTGELKHFPSSTDLQMRGRSSPF
        LIPNICFKNSLGLL+PVPG+RIR +A +S  NKVGGSSRTMH SHSQKINKH WDAAYKQKSEAAVGSPKL EVKDKWTGE KHFPSSTD+QMRGRSSPF
Subjt:  LIPNICFKNSLGLLHPVPGVRIRNEAPMSFTNKVGGSSRTMHQSHSQKINKHGWDAAYKQKSEAAVGSPKLQEVKDKWTGELKHFPSSTDLQMRGRSSPF

Query:  RHSRPASPFRNEASESSCRKQPLVVPKEVETFSNFKGDNDFRDRPSIRATKHGVDMASTLIEKTLYIDTASVAEITPPSNSSLLDTEKKVDHARRKNETA
        R+SR ASPFR+EAS  +CRKQP+VVPKEV+  S  KGD DF D PSIRATKHGVDMASTLIEK LYIDTASVAE   P NS+ LD EKK D +  KNETA
Subjt:  RHSRPASPFRNEASESSCRKQPLVVPKEVETFSNFKGDNDFRDRPSIRATKHGVDMASTLIEKTLYIDTASVAEITPPSNSSLLDTEKKVDHARRKNETA

Query:  FETRVIEETTTVEPSFLEVKCLTLVEEGKLEREAAELKRKDDINDDSKMGHELDKEELSEFSNLGAADEDEYSKANYQLVKVEDPASVKVTSMISSQPPP
        FE RV+EE+TTVEPSFLE+KCLTLVEEG+LEREAAE K KD I+D   +GHEL  E+ + ++NLG ADE++YSKANYQLVKVEDPA+  VTS+ISSQPPP
Subjt:  FETRVIEETTTVEPSFLEVKCLTLVEEGKLEREAAELKRKDDINDDSKMGHELDKEELSEFSNLGAADEDEYSKANYQLVKVEDPASVKVTSMISSQPPP

Query:  LPKSPSESWLWRTLPSVSSKKLLAGSNLGNKLYHKPQSPRISASTKWETIVKSSNLRHDHVRYSEELIPRVSQHSTTENFK
        LPKSPSESWLWRTLPSVSSKKLLAGSNLG+K Y KPQSPR SASTKWETIVKSSNL HDHVRYSEELIPRVSQHSTTENFK
Subjt:  LPKSPSESWLWRTLPSVSSKKLLAGSNLGNKLYHKPQSPRISASTKWETIVKSSNLRHDHVRYSEELIPRVSQHSTTENFK

TrEMBL top hitse value%identityAlignment
A0A0A0LX77 Uncharacterized protein2.6e-30281.27Show/hide
Query:  MEERKLNFNAPLMSVRRFSKASSSLAKVNEKKSENSQLSRRSTFPVSRPQFNLEQVTEPVAVPFHWEHIPGRAKNDSGSASPEVQLPQPPERTWSTPRLS
        MEERKLNFNAPLMSVRRFSKA+SS++K NEKKSENS  SRRSTFPVSRPQFNL+QVTEPVAVPF+WE IPGRAKNDSGSASPEV LP PPERT STPRLS
Subjt:  MEERKLNFNAPLMSVRRFSKASSSLAKVNEKKSENSQLSRRSTFPVSRPQFNLEQVTEPVAVPFHWEHIPGRAKNDSGSASPEVQLPQPPERTWSTPRLS

Query:  FGRALDVKKHSSEMEACDRNGCEANSSNAIVVRLEPTKASDARSLASEN-DDDDDDDFSDARETLSLTGSFSVNNCSVSGLSGYNGPLVKPSGTFRTDPQ
        FG ALD  K+SSEMEAC ++GCE++SSNAIVVRLE  KAS  RSLASEN DDDDDDDFSDARETLSLTGSFSVNNCSVSG+SGYNGP+VKPSGTFRTDPQ
Subjt:  FGRALDVKKHSSEMEACDRNGCEANSSNAIVVRLEPTKASDARSLASEN-DDDDDDDFSDARETLSLTGSFSVNNCSVSGLSGYNGPLVKPSGTFRTDPQ

Query:  TRDFMMSRFLPAAKAMVLEPAKYSLKKQLVAVEQSRQIKKAMSENRRMSPLKQLESTLLLQYGKNEVH---EIDEESDSVDDEYDNPGNISARGCGLIPN
        TRDFMMSRFLPAAKAMVLEPAKYSLKK+LVAVEQ RQ+KKA  ENRR+SP+K+LESTLLLQYGK+EVH   E+DEESDSVDDEYDN G+ISARGCGLIPN
Subjt:  TRDFMMSRFLPAAKAMVLEPAKYSLKKQLVAVEQSRQIKKAMSENRRMSPLKQLESTLLLQYGKNEVH---EIDEESDSVDDEYDNPGNISARGCGLIPN

Query:  ICFKNSLGLLHPVPGVRIRNEAPMSFTNKVGGSSRTMHQSHSQKINKHGWDAAYKQKSEAAVGSPKLQEVKDKWTGELKHFPSSTDLQMRGRSSPFRHSR
        ICFKNSLGLL+PVPG+RIR EAPMS T KVGGSSRT+H S+ QK+NKH WDA YKQKSEAAVGSP+L EVKDKWTGE KHF SSTDLQM+GRSSPFRHSR
Subjt:  ICFKNSLGLLHPVPGVRIRNEAPMSFTNKVGGSSRTMHQSHSQKINKHGWDAAYKQKSEAAVGSPKLQEVKDKWTGELKHFPSSTDLQMRGRSSPFRHSR

Query:  PASPFRNEASESSCRKQPLVVPKEVETFSNFKGDNDFRDRPSIRAT-KHGVDMASTLIEKTLYIDTASVAEITPPSNSSLLDTEKKVDHARRKNETAFET
         ASPFRNEAS S CR+QP VVPKEV+  S  KGD D  D PSI+AT K GVDMA+ L+EKTLYIDTASVA   PP NS++ D +KK +    KNETA E 
Subjt:  PASPFRNEASESSCRKQPLVVPKEVETFSNFKGDNDFRDRPSIRAT-KHGVDMASTLIEKTLYIDTASVAEITPPSNSSLLDTEKKVDHARRKNETAFET

Query:  RVIEETTTVEPSFLEVKCLTLVEEGKLEREAAELKRKDDINDDSKMGHELDKEELSEFSNLGAADEDEYSKANYQLVKVEDPASVKVTSMISSQPPPLPK
        RV+EE+TT EPSFLE+KCLT+VEEG+LEREAAE K KD I+D  K+GH L +E+ +E++NLG+ADE++YSKANYQLVKVEDPASVKVTS ISSQPPPLPK
Subjt:  RVIEETTTVEPSFLEVKCLTLVEEGKLEREAAELKRKDDINDDSKMGHELDKEELSEFSNLGAADEDEYSKANYQLVKVEDPASVKVTSMISSQPPPLPK

Query:  SPSESWLWRTLPSVSSKKLLAGSNLGNKLYHKPQSPRISASTKWETIVKSSNLRHDHVRYSEELIPRVSQHSTTENFK
        SPSESWLWRTLPSVSSKKLLAGSN GNKLY KPQSPR SASTKWETIVKSSNL HDHVRYSEEL+PRVSQHSTTENFK
Subjt:  SPSESWLWRTLPSVSSKKLLAGSNLGNKLYHKPQSPRISASTKWETIVKSSNLRHDHVRYSEELIPRVSQHSTTENFK

A0A1S3B7S2 uncharacterized protein LOC1034869245.9e-30281.39Show/hide
Query:  MEERKLNFNAPLMSVRRFSKASSSLAKVNEKKSENSQLSRRSTFPVSRPQFNLEQVTEPVAVPFHWEHIPGRAKNDSGSASPEVQLPQPPERTWSTPRLS
        MEERKLNFNAPLMSVRRFSKA+SS+ K NEKKSENS  SRRSTFPVSRPQFNL+QVTEPVAVPF+WE IPGRAKNDSGSASPEVQLP PPERT STPRLS
Subjt:  MEERKLNFNAPLMSVRRFSKASSSLAKVNEKKSENSQLSRRSTFPVSRPQFNLEQVTEPVAVPFHWEHIPGRAKNDSGSASPEVQLPQPPERTWSTPRLS

Query:  FGRALDVKKHSSEMEACDRNGCEANSSNAIVVRLEPTKASDARSLASEN-DDDDDDDFSDARETLSLTGSFSVNNCSVSGLSGYNGPLVKPSGTFRTDPQ
        FG ALD  K+SSEMEAC ++GCE++SSNAIVVRLE  KAS ARSLASEN DDDDDDDFSDARETLSLTGSFSVNNCSVSG+SGYNGP+VKPSGTFRTDPQ
Subjt:  FGRALDVKKHSSEMEACDRNGCEANSSNAIVVRLEPTKASDARSLASEN-DDDDDDDFSDARETLSLTGSFSVNNCSVSGLSGYNGPLVKPSGTFRTDPQ

Query:  TRDFMMSRFLPAAKAMVLEPAKYSLKKQLVAVEQSRQIKKAMSENRRMSPLKQLESTLLLQYGKNEVH---EIDEESDSVDDEYDNPGNISARGCGLIPN
        TRDFMMSRFLPAAKAMVLEPAKYSLKK+LVAVEQ RQ+KK   ENRRMSP+K+LESTLLLQYGK+EVH   E+DEESDSVDDEYDN GNISARGCGLIPN
Subjt:  TRDFMMSRFLPAAKAMVLEPAKYSLKKQLVAVEQSRQIKKAMSENRRMSPLKQLESTLLLQYGKNEVH---EIDEESDSVDDEYDNPGNISARGCGLIPN

Query:  ICFKNSLGLLHPVPGVRIRNEAPMSFTNKVGGSSRTMHQSHSQKINKHGWDAAYKQKSEAAVGSPKLQEVKDKWTGELKHFPSSTDLQMRGRSSPFRHSR
        ICFKNSLGLL+PVPG+RIR EAPMS T KVG SSRT+H  + QK NKH WDA YKQKSEAAVGS KL EVKDKWTGE KHF  STDLQM+GRSSPFRHSR
Subjt:  ICFKNSLGLLHPVPGVRIRNEAPMSFTNKVGGSSRTMHQSHSQKINKHGWDAAYKQKSEAAVGSPKLQEVKDKWTGELKHFPSSTDLQMRGRSSPFRHSR

Query:  PASPFRNEASESSCRKQPLVVPKEVETFSNFKGDNDFRDRPSIRATKHGVDMASTLIEKTLYIDTASVAEITPPSNSSLLDTEKKVDHARRKNETAFETR
         ASPFRNEAS+S CR+QP VVPKEV+T S  KGD DF D PSI+A K GVDMAS L+EKTLYIDTASVAE  PP N ++ D +KK+++   K+ETA E R
Subjt:  PASPFRNEASESSCRKQPLVVPKEVETFSNFKGDNDFRDRPSIRATKHGVDMASTLIEKTLYIDTASVAEITPPSNSSLLDTEKKVDHARRKNETAFETR

Query:  VIEETTTVEPSFLEVKCLTLVEEGKLEREAAELKRKDDINDDSKMGHELDKEELSEFSNLGAADEDEYSKANYQLVKVEDPASVKVTSMISSQPPPLPKS
        V+EE+TT EPSFLE+KCLT+VEEG+LEREAAE K KD  +    +GH L +E+ +E++N G ADE++YSKANYQLVKVEDPA VKVTS ISSQPPPLPKS
Subjt:  VIEETTTVEPSFLEVKCLTLVEEGKLEREAAELKRKDDINDDSKMGHELDKEELSEFSNLGAADEDEYSKANYQLVKVEDPASVKVTSMISSQPPPLPKS

Query:  PSESWLWRTLPSVSSKKLLAGSNLGNKLYHKPQSPRISASTKWETIVKSSNLRHDHVRYSEELIPRVSQHSTTENFK
        PSESWLWRTLPSVSSKKLLAGSNLGNKLY KPQSPRISASTKWETIVKSSNLRHDHVRYSEELIPRVSQHSTTENFK
Subjt:  PSESWLWRTLPSVSSKKLLAGSNLGNKLYHKPQSPRISASTKWETIVKSSNLRHDHVRYSEELIPRVSQHSTTENFK

A0A5A7UPB5 Putative Transcription initiation factor TFIID subunit 111.3e-28580.59Show/hide
Query:  MSVRRFSKASSSLAKVNEKKSENSQLSRRSTFPVSRPQFNLEQVTEPVAVPFHWEHIPGRAKNDSGSASPEVQLPQPPERTWSTPRLSFGRALDVKKHSS
        MSVRRFSKA+SS+ K NEKKSENS  SRRSTFPVSRPQFNL+QVTEPVAVPF+WE IPGRAKNDSGSASPEVQLP PPERT STPRLSFG ALD  K+SS
Subjt:  MSVRRFSKASSSLAKVNEKKSENSQLSRRSTFPVSRPQFNLEQVTEPVAVPFHWEHIPGRAKNDSGSASPEVQLPQPPERTWSTPRLSFGRALDVKKHSS

Query:  EMEACDRNGCEANSSNAIVVRLEPTKASDARSLASEN-DDDDDDDFSDARETLSLTGSFSVNNCSVSGLSGYNGPLVKPSGTFRTDPQTRDFMMSRFLPA
        EMEAC ++GCE++SSNAIVVRLE  KAS ARSLASEN DDDDDDDFSDARETLSLTGSFSVNNCSVSG+SGYNGP+VKPSGTFRTDPQTRDFMMSRFLPA
Subjt:  EMEACDRNGCEANSSNAIVVRLEPTKASDARSLASEN-DDDDDDDFSDARETLSLTGSFSVNNCSVSGLSGYNGPLVKPSGTFRTDPQTRDFMMSRFLPA

Query:  AKAMVLEPAKYSLKKQLVAVEQSRQIKKAMSENRRMSPLKQLESTLLLQYGKNEVH---EIDEESDSVDDEYDNPGNISARGCGLIPNICFKNSLGLLHP
        AKAMVLEPAKYSLKK+LVAVEQ RQ+KK   ENRRMSP+K+LESTLLLQYGK+EVH   E+DEESDSVDDEYDN GNISARGCGLIPNICFKNSLGLL+P
Subjt:  AKAMVLEPAKYSLKKQLVAVEQSRQIKKAMSENRRMSPLKQLESTLLLQYGKNEVH---EIDEESDSVDDEYDNPGNISARGCGLIPNICFKNSLGLLHP

Query:  VPGVRIRNEAPMSFTNKVGGSSRTMHQSHSQKINKHGWDAAYKQKSEAAVGSPKLQEVKDKWTGELKHFPSSTDLQMRGRSSPFRHSRPASPFRNEASES
        VPG+RIR EAPMS T KVG SSRT+H  + QK NKH WDA YKQKSEAAVGS KL EVKDKWTGE KHF  STDLQM+GRSSPFRHSR ASPFRNEAS+S
Subjt:  VPGVRIRNEAPMSFTNKVGGSSRTMHQSHSQKINKHGWDAAYKQKSEAAVGSPKLQEVKDKWTGELKHFPSSTDLQMRGRSSPFRHSRPASPFRNEASES

Query:  SCRKQPLVVPKEVETFSNFKGDNDFRDRPSIRATKHGVDMASTLIEKTLYIDTASVAEITPPSNSSLLDTEKKVDHARRKNETAFETRVIEETTTVEPSF
         CR+QP VVPKEV+T S  KGD DF D PSI+A K GVDMAS L+EKTLYIDTASVAE  PP N ++ D +KK+++   K+ETA E RV+EE+TT EPSF
Subjt:  SCRKQPLVVPKEVETFSNFKGDNDFRDRPSIRATKHGVDMASTLIEKTLYIDTASVAEITPPSNSSLLDTEKKVDHARRKNETAFETRVIEETTTVEPSF

Query:  LEVKCLTLVEEGKLEREAAELKRKDDINDDSKMGHELDKEELSEFSNLGAADEDEYSKANYQLVKVEDPASVKVTSMISSQPPPLPKSPSESWLWRTLPS
        LE+KCLT+VEEG+LEREAAE K KD  +    +GH L +E+ +E++N G ADE++YSKANYQLVKVEDPA VKVTS ISSQPPPLPKSPSESWLWRTLPS
Subjt:  LEVKCLTLVEEGKLEREAAELKRKDDINDDSKMGHELDKEELSEFSNLGAADEDEYSKANYQLVKVEDPASVKVTSMISSQPPPLPKSPSESWLWRTLPS

Query:  VSSKKLLAGSNLGNKLYHKPQSPRISASTKWETIVKSSNLRHDHVRYSE
        VSSKKLLAGSNLGNKLY KPQSPRISASTKWETIVKSSNLRHDHVRYSE
Subjt:  VSSKKLLAGSNLGNKLYHKPQSPRISASTKWETIVKSSNLRHDHVRYSE

A0A6J1CEJ0 uncharacterized protein LOC111010950 isoform X24.8e-29681.12Show/hide
Query:  MEERKLNFNAPLMSVRRFSKASSSLAKVNEKKSENSQLSRRSTFPVSRPQFNLEQVTEPVAVPFHWEHIPGRAKNDSGSASPEVQLPQPPERTWSTPRLS
        MEERKLNFNAPLMSVRR S ASSS AKVNEKKSEN QL+RR TFPV+R QFNL+QVTEPVAVPFHWE IPGRAKNDSGSASPE+QL QPPERT STPRLS
Subjt:  MEERKLNFNAPLMSVRRFSKASSSLAKVNEKKSENSQLSRRSTFPVSRPQFNLEQVTEPVAVPFHWEHIPGRAKNDSGSASPEVQLPQPPERTWSTPRLS

Query:  FGRALDVKKHSSEMEACDRNGCEANSSNAIVVRLEPTKA--SDARSLASENDDDDDDDFSDARETLSLTGSFSVNNCSVSGLSGYNGPLVKPSGTFRTDP
        FGR LDVKKH  E EAC  NGC+ANSSNAIVVRLE TKA   D R+LASE DDDDDDD+SDA +TL  + + SVNNCSVSGLSGYNGP+VKPSGTFRTDP
Subjt:  FGRALDVKKHSSEMEACDRNGCEANSSNAIVVRLEPTKA--SDARSLASENDDDDDDDFSDARETLSLTGSFSVNNCSVSGLSGYNGPLVKPSGTFRTDP

Query:  QTRDFMMSRFLPAAKAMVLEPAKYSLKKQLVAVEQSRQIKKAMSENRRMSPLKQLESTLLLQYGKNEVHEIDEESDSVDDEYDNPGNISARGCGLIPNIC
        QTRDFMMSRFLPAAKAMVLEPAKYSLKKQLVAVEQ RQ KK  SENRRMSP K+LEST+LLQYGK+EV   D+ESDS DDEYDN GNISARGCGLIPNIC
Subjt:  QTRDFMMSRFLPAAKAMVLEPAKYSLKKQLVAVEQSRQIKKAMSENRRMSPLKQLESTLLLQYGKNEVHEIDEESDSVDDEYDNPGNISARGCGLIPNIC

Query:  FKNSLGLLHPVPGVRIRNEAPMSFTNKVGGSSRTMHQSHSQKINKHGWDAAYKQKSEAAVGSPKLQEVKDKWTGELKHFPSSTDLQMRGRSSPFRHSRPA
        FKNSLGLL+PVPG+RIR E+P   TNKVGGSSRTMH SHSQKINKH WDAAYKQK EAAVGSP+LQEVKDKW GE K F +STDLQMRGRSSPFRHSR A
Subjt:  FKNSLGLLHPVPGVRIRNEAPMSFTNKVGGSSRTMHQSHSQKINKHGWDAAYKQKSEAAVGSPKLQEVKDKWTGELKHFPSSTDLQMRGRSSPFRHSRPA

Query:  SPFRNEASESSCRKQPLVVPKEVETFSNFKGDNDFRDRPSIRATKHGVDMASTLIEKTLYIDTASVAEITPPSNSSLLDTEKKVDHARRKNETAFETRVI
        SPFRNEA +S CR+Q ++VPKEVE  S  KGD DF D PSIRATK GVDM S +IEKTLYIDT SVAEIT P NS+LLD EK VD A  KNET   TR +
Subjt:  SPFRNEASESSCRKQPLVVPKEVETFSNFKGDNDFRDRPSIRATKHGVDMASTLIEKTLYIDTASVAEITPPSNSSLLDTEKKVDHARRKNETAFETRVI

Query:  EETTTVEPSFLEVKCLTLVEEGKLEREAAELKRKDDINDDSKMGHELDKEELSEFSNLGAADEDEYSKANYQLVKVEDPASVKVTSMISSQPPPLPKSPS
        EETTT EPSFLEVKCLTLVEEG+LEREAAE K K  I D SKM H LDKEE+S +SN+  ADEDEYSKANYQ+ KVEDPAS KVTS+ISSQPPPLPKSPS
Subjt:  EETTTVEPSFLEVKCLTLVEEGKLEREAAELKRKDDINDDSKMGHELDKEELSEFSNLGAADEDEYSKANYQLVKVEDPASVKVTSMISSQPPPLPKSPS

Query:  ESWLWRTLPSVSSKKLLAGSNLGNKLYHK--PQSPRISA-STKWETIVKSSNLRHDHVRYSEELIPRVSQHSTTENFK
        ESWLWRTLPSVSS+KLLAGSNLGNKLYHK   QSPR SA STKWETIVKSS LRHDHVRYSEELIPRVSQHSTTE+FK
Subjt:  ESWLWRTLPSVSSKKLLAGSNLGNKLYHK--PQSPRISA-STKWETIVKSSNLRHDHVRYSEELIPRVSQHSTTENFK

A0A6J1CF57 uncharacterized protein LOC111010950 isoform X14.8e-29681.12Show/hide
Query:  MEERKLNFNAPLMSVRRFSKASSSLAKVNEKKSENSQLSRRSTFPVSRPQFNLEQVTEPVAVPFHWEHIPGRAKNDSGSASPEVQLPQPPERTWSTPRLS
        MEERKLNFNAPLMSVRR S ASSS AKVNEKKSEN QL+RR TFPV+R QFNL+QVTEPVAVPFHWE IPGRAKNDSGSASPE+QL QPPERT STPRLS
Subjt:  MEERKLNFNAPLMSVRRFSKASSSLAKVNEKKSENSQLSRRSTFPVSRPQFNLEQVTEPVAVPFHWEHIPGRAKNDSGSASPEVQLPQPPERTWSTPRLS

Query:  FGRALDVKKHSSEMEACDRNGCEANSSNAIVVRLEPTKA--SDARSLASENDDDDDDDFSDARETLSLTGSFSVNNCSVSGLSGYNGPLVKPSGTFRTDP
        FGR LDVKKH  E EAC  NGC+ANSSNAIVVRLE TKA   D R+LASE DDDDDDD+SDA +TL  + + SVNNCSVSGLSGYNGP+VKPSGTFRTDP
Subjt:  FGRALDVKKHSSEMEACDRNGCEANSSNAIVVRLEPTKA--SDARSLASENDDDDDDDFSDARETLSLTGSFSVNNCSVSGLSGYNGPLVKPSGTFRTDP

Query:  QTRDFMMSRFLPAAKAMVLEPAKYSLKKQLVAVEQSRQIKKAMSENRRMSPLKQLESTLLLQYGKNEVHEIDEESDSVDDEYDNPGNISARGCGLIPNIC
        QTRDFMMSRFLPAAKAMVLEPAKYSLKKQLVAVEQ RQ KK  SENRRMSP K+LEST+LLQYGK+EV   D+ESDS DDEYDN GNISARGCGLIPNIC
Subjt:  QTRDFMMSRFLPAAKAMVLEPAKYSLKKQLVAVEQSRQIKKAMSENRRMSPLKQLESTLLLQYGKNEVHEIDEESDSVDDEYDNPGNISARGCGLIPNIC

Query:  FKNSLGLLHPVPGVRIRNEAPMSFTNKVGGSSRTMHQSHSQKINKHGWDAAYKQKSEAAVGSPKLQEVKDKWTGELKHFPSSTDLQMRGRSSPFRHSRPA
        FKNSLGLL+PVPG+RIR E+P   TNKVGGSSRTMH SHSQKINKH WDAAYKQK EAAVGSP+LQEVKDKW GE K F +STDLQMRGRSSPFRHSR A
Subjt:  FKNSLGLLHPVPGVRIRNEAPMSFTNKVGGSSRTMHQSHSQKINKHGWDAAYKQKSEAAVGSPKLQEVKDKWTGELKHFPSSTDLQMRGRSSPFRHSRPA

Query:  SPFRNEASESSCRKQPLVVPKEVETFSNFKGDNDFRDRPSIRATKHGVDMASTLIEKTLYIDTASVAEITPPSNSSLLDTEKKVDHARRKNETAFETRVI
        SPFRNEA +S CR+Q ++VPKEVE  S  KGD DF D PSIRATK GVDM S +IEKTLYIDT SVAEIT P NS+LLD EK VD A  KNET   TR +
Subjt:  SPFRNEASESSCRKQPLVVPKEVETFSNFKGDNDFRDRPSIRATKHGVDMASTLIEKTLYIDTASVAEITPPSNSSLLDTEKKVDHARRKNETAFETRVI

Query:  EETTTVEPSFLEVKCLTLVEEGKLEREAAELKRKDDINDDSKMGHELDKEELSEFSNLGAADEDEYSKANYQLVKVEDPASVKVTSMISSQPPPLPKSPS
        EETTT EPSFLEVKCLTLVEEG+LEREAAE K K  I D SKM H LDKEE+S +SN+  ADEDEYSKANYQ+ KVEDPAS KVTS+ISSQPPPLPKSPS
Subjt:  EETTTVEPSFLEVKCLTLVEEGKLEREAAELKRKDDINDDSKMGHELDKEELSEFSNLGAADEDEYSKANYQLVKVEDPASVKVTSMISSQPPPLPKSPS

Query:  ESWLWRTLPSVSSKKLLAGSNLGNKLYHK--PQSPRISA-STKWETIVKSSNLRHDHVRYSEELIPRVSQHSTTENFK
        ESWLWRTLPSVSS+KLLAGSNLGNKLYHK   QSPR SA STKWETIVKSS LRHDHVRYSEELIPRVSQHSTTE+FK
Subjt:  ESWLWRTLPSVSSKKLLAGSNLGNKLYHK--PQSPRISA-STKWETIVKSSNLRHDHVRYSEELIPRVSQHSTTENFK

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G29240.1 Protein of unknown function (DUF688)1.8e-5632.07Show/hide
Query:  MEERKLNFNAPLMSVRRFSKASSSLAKVNEKKSEN-----SQLSRRSTFPVSRPQFN--LEQVTEPVAVPFHWEHIPGRAKNDSGSASPEVQLPQPPERT
        MEERKLNF+ PL+S RR  K +     V   KS N     S+ S   + PV  P      ++VTEP +VPF WE  PGR K +     P+V +    E  
Subjt:  MEERKLNFNAPLMSVRRFSKASSSLAKVNEKKSEN-----SQLSRRSTFPVSRPQFN--LEQVTEPVAVPFHWEHIPGRAKNDSGSASPEVQLPQPPERT

Query:  WSTPRLSFGRALDVKKHSSEMEACDRNGCEANSSNAIVVRLEPTKASDARSLASENDDDDDDDFSDARETLSLTGSFSVNNCSVSGLSGYNGPLVKPSGT
          TP L  G+A+D                      A + RL+ +K         E+DDD+DD FSDA +TLS   SFS NN S+SG+S Y G   K    
Subjt:  WSTPRLSFGRALDVKKHSSEMEACDRNGCEANSSNAIVVRLEPTKASDARSLASENDDDDDDDFSDARETLSLTGSFSVNNCSVSGLSGYNGPLVKPSGT

Query:  FRTDPQTRDFMMSRFLPAAKAMVLEPAKY--SLKKQLVAVEQSRQIKKAMSENRRMSPLKQLESTLLLQYGKNEVHEIDEESDSVDDEYDNPGNISARGC
           D Q+RDFMMSRFLPAAKAM +E + Y  + K      E + QI++ +   ++ +P +   S +   Y   ++ + + +    DDE      +S RGC
Subjt:  FRTDPQTRDFMMSRFLPAAKAMVLEPAKY--SLKKQLVAVEQSRQIKKAMSENRRMSPLKQLESTLLLQYGKNEVHEIDEESDSVDDEYDNPGNISARGC

Query:  GLIPNICFKNSLGLLHPVPGVRIRNEAPMSFT--NKVGGSSRTMHQSHSQKINKHGWDAAYKQKSEAAVGSPKLQEVKDKWTGELKHFPSSTDLQMRGRS
        G++P +CFK+SLG+L+ VPG + ++ +P++    ++V  S     +   Q + K   D+  K K    V SP       K+  E     S+     +  S
Subjt:  GLIPNICFKNSLGLLHPVPGVRIRNEAPMSFT--NKVGGSSRTMHQSHSQKINKHGWDAAYKQKSEAAVGSPKLQEVKDKWTGELKHFPSSTDLQMRGRS

Query:  SPFRHSRPASPFRNEASESSCRKQPLVVPKEVETFSNFKGDNDFRDRPSIRATKHGVDMASTLIEKTLYIDTASVAEITPPSNSSLLDTEKKVDHARRKN
        SP+RHSR  SPFR+  + S     PL      ET          R+  ++RA +        L   T  I   S   + P SN S+L+    VD      
Subjt:  SPFRHSRPASPFRNEASESSCRKQPLVVPKEVETFSNFKGDNDFRDRPSIRATKHGVDMASTLIEKTLYIDTASVAEITPPSNSSLLDTEKKVDHARRKN

Query:  ETAFETRVIEETTTVEPSFLEVKCLTLVEEGKLEREAAELKRKDDINDDSKMGHELDKEELSEFSNLGAADEDEYSKANYQLVKVEDPASVKVTSMISSQ
             T     T     S L +             EA    RK D N            EL  F N+        S  + ++VK  +   +      S  
Subjt:  ETAFETRVIEETTTVEPSFLEVKCLTLVEEGKLEREAAELKRKDDINDDSKMGHELDKEELSEFSNLGAADEDEYSKANYQLVKVEDPASVKVTSMISSQ

Query:  PPPLPKSPSESWLWRTLPSVSSKKLLAGSNLGNKLYH--KPQSPRISAS----TKWETIVKSSNLRHDHVRYSEELIPRVSQHSTT
         PP PK PSESWL   LPSV+S+       + ++ YH   PQ   ++ +    TKWETIVK+S +  DH+RYSEEL+   S  S T
Subjt:  PPPLPKSPSESWLWRTLPSVSSKKLLAGSNLGNKLYH--KPQSPRISAS----TKWETIVKSSNLRHDHVRYSEELIPRVSQHSTT

AT2G30990.1 Protein of unknown function (DUF688)3.8e-3527.79Show/hide
Query:  MEERKLNFNAPLMSVRRFSKASSSLAKVNEKKSENSQLSRRSTFPVSRPQFNLEQVTEPVAVPFHWEHIPGRAKNDSGSASPEVQLPQPPERTWSTPRLS
        MEE++L+FN PL+S+RR ++ S S +K     S  + +    + PV +       V  P  VPF WEH PG+ K++     P +Q    P      P+L 
Subjt:  MEERKLNFNAPLMSVRRFSKASSSLAKVNEKKSENSQLSRRSTFPVSRPQFNLEQVTEPVAVPFHWEHIPGRAKNDSGSASPEVQLPQPPERTWSTPRLS

Query:  FGRALDVK-KHSSEMEACDRNGCEANSSNAIVVRLEPTKASDARSLASENDDDDDDD---FSDARETLSLTGSFSVNNCSVSGLSGYNGP--LVKPSGTF
         GR   V+     E    D      +SS+  +V        DA+S +S  DDDDDD    + DA +TLS T SF  N  +VSG SG +G   LV+P GT 
Subjt:  FGRALDVK-KHSSEMEACDRNGCEANSSNAIVVRLEPTKASDARSLASENDDDDDDD---FSDARETLSLTGSFSVNNCSVSGLSGYNGP--LVKPSGTF

Query:  RTDPQTRDFMMSRFLPAAKAMVLEPAKYSLKKQLVAVEQSRQIKKAMSENRRMSPLKQLESTLLLQYGKNEVHEIDEESDSVDDEYDNPGNISARG-CGL
         TD QT+D MM RFLPAAKA+  E   +  +K     E  +Q+ K        +P +               H  D+E     +E  N  ++ A G CGL
Subjt:  RTDPQTRDFMMSRFLPAAKAMVLEPAKYSLKKQLVAVEQSRQIKKAMSENRRMSPLKQLESTLLLQYGKNEVHEIDEESDSVDDEYDNPGNISARG-CGL

Query:  IPNICFKNSLGLLHPVPGVRIRNEAPMSFTNKVGGSSRTMHQSHSQKINKHGWDAAYKQKSEAAVGSPKLQEVKDKWTGELKHFPSSTDLQMRGRSSPFR
        +P +C ++SLGLL+PVP VR++ +  +S         R+ +Q  +     H                 K  E K K    LK   S      +G S    
Subjt:  IPNICFKNSLGLLHPVPGVRIRNEAPMSFTNKVGGSSRTMHQSHSQKINKHGWDAAYKQKSEAAVGSPKLQEVKDKWTGELKHFPSSTDLQMRGRSSPFR

Query:  HSRPASPFRNEASESSCRKQPLVVPKEVETFSNFKGDNDFRDRPSIRATKHGVDMASTLIEKTLYIDTASVAEITPPSNSSLLDTEKKVDHARRKNETAF
         S  + P   E  E+            V T S  K   +F +  +           + + EKTLY+D   +              +KKV    +K     
Subjt:  HSRPASPFRNEASESSCRKQPLVVPKEVETFSNFKGDNDFRDRPSIRATKHGVDMASTLIEKTLYIDTASVAEITPPSNSSLLDTEKKVDHARRKNETAF

Query:  ETRVIEETTTVEPSFLEVKCLTLVEEGKLEREAAELKRKDDINDDSKMGHELDKEELSEFSNLGAADEDEYSKANYQLVKVEDPASVKVTSMISSQ----
           +++E+ +++   ++ +   + +   +E+E      +D+  D +K   +  +E     + +   + D       + + +E    V  T++ SS+    
Subjt:  ETRVIEETTTVEPSFLEVKCLTLVEEGKLEREAAELKRKDDINDDSKMGHELDKEELSEFSNLGAADEDEYSKANYQLVKVEDPASVKVTSMISSQ----

Query:  -------PPPLPKSPSESWLWRTLPSVSSKKLLAG--SNLGNKLYHKPQSPRISASTKWETIVKSSNLRHDHVRYSEELI
               PPPLPK+PS+SWL RTLP++  K        +LG          +  A+ KWET+VK+SN +   V +S+E +
Subjt:  -------PPPLPKSPSESWLWRTLPSVSSKKLLAG--SNLGNKLYHKPQSPRISASTKWETIVKSSNLRHDHVRYSEELI

AT2G30990.2 Protein of unknown function (DUF688)3.8e-3527.79Show/hide
Query:  MEERKLNFNAPLMSVRRFSKASSSLAKVNEKKSENSQLSRRSTFPVSRPQFNLEQVTEPVAVPFHWEHIPGRAKNDSGSASPEVQLPQPPERTWSTPRLS
        MEE++L+FN PL+S+RR ++ S S +K     S  + +    + PV +       V  P  VPF WEH PG+ K++     P +Q    P      P+L 
Subjt:  MEERKLNFNAPLMSVRRFSKASSSLAKVNEKKSENSQLSRRSTFPVSRPQFNLEQVTEPVAVPFHWEHIPGRAKNDSGSASPEVQLPQPPERTWSTPRLS

Query:  FGRALDVK-KHSSEMEACDRNGCEANSSNAIVVRLEPTKASDARSLASENDDDDDDD---FSDARETLSLTGSFSVNNCSVSGLSGYNGP--LVKPSGTF
         GR   V+     E    D      +SS+  +V        DA+S +S  DDDDDD    + DA +TLS T SF  N  +VSG SG +G   LV+P GT 
Subjt:  FGRALDVK-KHSSEMEACDRNGCEANSSNAIVVRLEPTKASDARSLASENDDDDDDD---FSDARETLSLTGSFSVNNCSVSGLSGYNGP--LVKPSGTF

Query:  RTDPQTRDFMMSRFLPAAKAMVLEPAKYSLKKQLVAVEQSRQIKKAMSENRRMSPLKQLESTLLLQYGKNEVHEIDEESDSVDDEYDNPGNISARG-CGL
         TD QT+D MM RFLPAAKA+  E   +  +K     E  +Q+ K        +P +               H  D+E     +E  N  ++ A G CGL
Subjt:  RTDPQTRDFMMSRFLPAAKAMVLEPAKYSLKKQLVAVEQSRQIKKAMSENRRMSPLKQLESTLLLQYGKNEVHEIDEESDSVDDEYDNPGNISARG-CGL

Query:  IPNICFKNSLGLLHPVPGVRIRNEAPMSFTNKVGGSSRTMHQSHSQKINKHGWDAAYKQKSEAAVGSPKLQEVKDKWTGELKHFPSSTDLQMRGRSSPFR
        +P +C ++SLGLL+PVP VR++ +  +S         R+ +Q  +     H                 K  E K K    LK   S      +G S    
Subjt:  IPNICFKNSLGLLHPVPGVRIRNEAPMSFTNKVGGSSRTMHQSHSQKINKHGWDAAYKQKSEAAVGSPKLQEVKDKWTGELKHFPSSTDLQMRGRSSPFR

Query:  HSRPASPFRNEASESSCRKQPLVVPKEVETFSNFKGDNDFRDRPSIRATKHGVDMASTLIEKTLYIDTASVAEITPPSNSSLLDTEKKVDHARRKNETAF
         S  + P   E  E+            V T S  K   +F +  +           + + EKTLY+D   +              +KKV    +K     
Subjt:  HSRPASPFRNEASESSCRKQPLVVPKEVETFSNFKGDNDFRDRPSIRATKHGVDMASTLIEKTLYIDTASVAEITPPSNSSLLDTEKKVDHARRKNETAF

Query:  ETRVIEETTTVEPSFLEVKCLTLVEEGKLEREAAELKRKDDINDDSKMGHELDKEELSEFSNLGAADEDEYSKANYQLVKVEDPASVKVTSMISSQ----
           +++E+ +++   ++ +   + +   +E+E      +D+  D +K   +  +E     + +   + D       + + +E    V  T++ SS+    
Subjt:  ETRVIEETTTVEPSFLEVKCLTLVEEGKLEREAAELKRKDDINDDSKMGHELDKEELSEFSNLGAADEDEYSKANYQLVKVEDPASVKVTSMISSQ----

Query:  -------PPPLPKSPSESWLWRTLPSVSSKKLLAG--SNLGNKLYHKPQSPRISASTKWETIVKSSNLRHDHVRYSEELI
               PPPLPK+PS+SWL RTLP++  K        +LG          +  A+ KWET+VK+SN +   V +S+E +
Subjt:  -------PPPLPKSPSESWLWRTLPSVSSKKLLAG--SNLGNKLYHKPQSPRISASTKWETIVKSSNLRHDHVRYSEELI

AT2G34170.1 Protein of unknown function (DUF688)4.0e-3231.89Show/hide
Query:  PTKASDARSLASENDDDDDDD-FSDARETLSLTGSFSVNNCSVSGLSGYNGPLVKPSGTFRTDPQTRDFMMSRFLPAAKAMVLE-PAKYSLKKQLVAV--
        P    +++ +  E ++ +DDD FSDA +TLSL         S+SG  G     +KPS     DPQ   FM+ RFLPAAK++ LE P +YS K+Q + +  
Subjt:  PTKASDARSLASENDDDDDDD-FSDARETLSLTGSFSVNNCSVSGLSGYNGPLVKPSGTFRTDPQTRDFMMSRFLPAAKAMVLE-PAKYSLKKQLVAV--

Query:  EQSRQIKKAMSENRRMSPLKQLESTLLLQYGKNEVHEIDEESDSVDDEYDNPGNISARGCGLI-PNICFKNSLGLLHPVPGVRIRNEAPMSFT----NKV
        E  RQI+  +    R +P +  ES+    Y   ++ + + E DS DDE      +S RGCG++ P ICFKNSLG+L  V G++   E P S      ++V
Subjt:  EQSRQIKKAMSENRRMSPLKQLESTLLLQYGKNEVHEIDEESDSVDDEYDNPGNISARGCGLI-PNICFKNSLGLLHPVPGVRIRNEAPMSFT----NKV

Query:  GGSSRTMHQSHSQKINKHGWDAAYKQKSEAAVGSPKLQEVKDKWT-GELKHFPSSTDLQMRGRSSPFRHSRPASPFRNEASESSCRKQPLV-VPKEVETF
          S     +S  Q + K   D  YKQK  +   SP    V  K+  G  +H   S+  +    SSP+R +   SP+R+  + S           KE E  
Subjt:  GGSSRTMHQSHSQKINKHGWDAAYKQKSEAAVGSPKLQEVKDKWT-GELKHFPSSTDLQMRGRSSPFRHSRPASPFRNEASESSCRKQPLV-VPKEVETF

Query:  S----NFKGDNDFRDRPSI--RATKHGVDMASTLIEKTLYIDTASVAEITPPSNSSLLDTEKKVDHARRKNETAFETRVIEETTTVEPSFLEVKCLTLVE
             N    N  +   S+  ++TK     +S + EKTLY+D+          NS     E +  + ++          + ET + EP            
Subjt:  S----NFKGDNDFRDRPSI--RATKHGVDMASTLIEKTLYIDTASVAEITPPSNSSLLDTEKKVDHARRKNETAFETRVIEETTTVEPSFLEVKCLTLVE

Query:  EGKLEREAAELKRKDDINDDSKMGHELDKEELSEFSNLGAADEDEYSKANYQLVKVEDPASVKVTSMISSQPPPLPKSPSESWLWRTLPSVSSKKLLAGS
        EGK  +   ELK                 E LS  S +     DE  K N                 +S   PP PK PSESWL+  LPSVSSK      
Subjt:  EGKLEREAAELKRKDDINDDSKMGHELDKEELSEFSNLGAADEDEYSKANYQLVKVEDPASVKVTSMISSQPPPLPKSPSESWLWRTLPSVSSKKLLAGS

Query:  NLGNKLYHKPQSPRI----SASTKWETIVKSSNLRHDHVRYSEELIPRVSQHSTT
             L+H PQ   +    ++ TKWETIVK+S    DH+RYSEEL+   S  S T
Subjt:  NLGNKLYHKPQSPRI----SASTKWETIVKSSNLRHDHVRYSEELIPRVSQHSTT

AT2G34170.2 Protein of unknown function (DUF688)4.0e-3231.89Show/hide
Query:  PTKASDARSLASENDDDDDDD-FSDARETLSLTGSFSVNNCSVSGLSGYNGPLVKPSGTFRTDPQTRDFMMSRFLPAAKAMVLE-PAKYSLKKQLVAV--
        P    +++ +  E ++ +DDD FSDA +TLSL         S+SG  G     +KPS     DPQ   FM+ RFLPAAK++ LE P +YS K+Q + +  
Subjt:  PTKASDARSLASENDDDDDDD-FSDARETLSLTGSFSVNNCSVSGLSGYNGPLVKPSGTFRTDPQTRDFMMSRFLPAAKAMVLE-PAKYSLKKQLVAV--

Query:  EQSRQIKKAMSENRRMSPLKQLESTLLLQYGKNEVHEIDEESDSVDDEYDNPGNISARGCGLI-PNICFKNSLGLLHPVPGVRIRNEAPMSFT----NKV
        E  RQI+  +    R +P +  ES+    Y   ++ + + E DS DDE      +S RGCG++ P ICFKNSLG+L  V G++   E P S      ++V
Subjt:  EQSRQIKKAMSENRRMSPLKQLESTLLLQYGKNEVHEIDEESDSVDDEYDNPGNISARGCGLI-PNICFKNSLGLLHPVPGVRIRNEAPMSFT----NKV

Query:  GGSSRTMHQSHSQKINKHGWDAAYKQKSEAAVGSPKLQEVKDKWT-GELKHFPSSTDLQMRGRSSPFRHSRPASPFRNEASESSCRKQPLV-VPKEVETF
          S     +S  Q + K   D  YKQK  +   SP    V  K+  G  +H   S+  +    SSP+R +   SP+R+  + S           KE E  
Subjt:  GGSSRTMHQSHSQKINKHGWDAAYKQKSEAAVGSPKLQEVKDKWT-GELKHFPSSTDLQMRGRSSPFRHSRPASPFRNEASESSCRKQPLV-VPKEVETF

Query:  S----NFKGDNDFRDRPSI--RATKHGVDMASTLIEKTLYIDTASVAEITPPSNSSLLDTEKKVDHARRKNETAFETRVIEETTTVEPSFLEVKCLTLVE
             N    N  +   S+  ++TK     +S + EKTLY+D+          NS     E +  + ++          + ET + EP            
Subjt:  S----NFKGDNDFRDRPSI--RATKHGVDMASTLIEKTLYIDTASVAEITPPSNSSLLDTEKKVDHARRKNETAFETRVIEETTTVEPSFLEVKCLTLVE

Query:  EGKLEREAAELKRKDDINDDSKMGHELDKEELSEFSNLGAADEDEYSKANYQLVKVEDPASVKVTSMISSQPPPLPKSPSESWLWRTLPSVSSKKLLAGS
        EGK  +   ELK                 E LS  S +     DE  K N                 +S   PP PK PSESWL+  LPSVSSK      
Subjt:  EGKLEREAAELKRKDDINDDSKMGHELDKEELSEFSNLGAADEDEYSKANYQLVKVEDPASVKVTSMISSQPPPLPKSPSESWLWRTLPSVSSKKLLAGS

Query:  NLGNKLYHKPQSPRI----SASTKWETIVKSSNLRHDHVRYSEELIPRVSQHSTT
             L+H PQ   +    ++ TKWETIVK+S    DH+RYSEEL+   S  S T
Subjt:  NLGNKLYHKPQSPRI----SASTKWETIVKSSNLRHDHVRYSEELIPRVSQHSTT


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAGGAAAGAAAGCTCAATTTTAATGCACCTCTCATGTCTGTGAGGCGATTTTCCAAGGCATCAAGTTCTTTAGCTAAAGTGAATGAAAAGAAATCTGAAAAT
TCCCAACTTAGTAGACGGAGTACCTTTCCAGTTTCTAGACCACAATTCAATTTAGAGCAAGTAACAGAACCAGTTGCAGTTCCCTTCCACTGGGAGCATATTCCA
GGAAGAGCTAAGAATGATAGTGGTTCAGCCTCACCTGAGGTTCAGCTGCCTCAGCCTCCTGAGAGAACTTGGTCTACTCCTAGGCTTTCTTTTGGGAGGGCTTTG
GATGTTAAAAAACATAGTTCGGAAATGGAAGCTTGTGATCGAAATGGGTGTGAAGCAAATTCTTCCAATGCCATTGTTGTTAGATTAGAGCCAACGAAAGCTAGT
GATGCGAGGAGCTTGGCATCTGAGAATGATGATGATGATGATGATGATTTCTCCGATGCACGTGAGACGCTGTCCCTCACTGGTTCATTTTCTGTTAACAACTGT
AGTGTAAGTGGTCTAAGTGGATACAACGGTCCCTTGGTGAAACCATCGGGAACCTTCCGAACAGACCCTCAAACTCGAGATTTCATGATGAGCCGCTTCTTACCT
GCAGCCAAGGCAATGGTTTTGGAGCCTGCTAAGTATTCCTTAAAGAAGCAACTTGTAGCAGTTGAGCAATCTAGACAAATTAAGAAGGCGATGTCTGAGAATAGG
AGGATGTCTCCGCTTAAACAGCTTGAGTCTACCCTGTTACTACAGTATGGCAAAAATGAAGTACATGAAATAGATGAAGAAAGTGACTCTGTGGATGATGAATAT
GACAATCCAGGTAATATATCAGCTAGAGGTTGTGGTCTAATACCCAATATATGCTTCAAAAACTCTTTGGGTCTTCTTCATCCTGTGCCTGGGGTGAGAATCAGG
AACGAGGCACCCATGTCTTTCACTAATAAAGTTGGGGGATCAAGCAGAACAATGCACCAGTCACACAGCCAAAAGATCAACAAGCATGGTTGGGATGCTGCTTAC
AAGCAAAAATCTGAAGCTGCTGTTGGATCACCTAAGCTGCAGGAGGTAAAAGATAAGTGGACTGGTGAATTAAAACATTTTCCTTCCTCCACCGACCTGCAAATG
AGAGGTAGGTCTTCTCCATTCAGGCATTCGAGGCCTGCTTCTCCCTTCCGAAATGAAGCATCAGAGTCTTCTTGTAGAAAGCAGCCCCTTGTAGTTCCTAAAGAA
GTTGAGACTTTCTCCAACTTTAAAGGTGATAACGACTTTCGTGATAGACCGTCCATTCGAGCAACTAAACATGGAGTTGACATGGCAAGTACCCTGATTGAGAAG
ACACTCTATATCGATACTGCAAGTGTTGCTGAAATAACTCCCCCATCAAATTCAAGCCTTTTGGATACTGAGAAGAAAGTCGATCATGCTAGGAGGAAGAATGAA
ACAGCATTTGAGACTAGAGTGATAGAAGAAACTACCACTGTGGAACCTTCTTTCCTAGAAGTAAAGTGCTTAACTTTGGTTGAAGAAGGGAAGCTGGAGCGTGAA
GCTGCAGAATTGAAAAGGAAAGATGATATTAATGATGACTCCAAAATGGGGCATGAACTTGATAAAGAAGAGCTCTCTGAATTCTCTAATTTGGGCGCTGCTGAT
GAAGATGAATACTCCAAGGCCAATTATCAGCTAGTAAAAGTAGAAGATCCAGCAAGTGTCAAAGTAACTTCTATGATATCTTCTCAACCTCCACCTCTACCGAAG
TCTCCTTCCGAGTCTTGGCTTTGGCGTACCCTGCCTTCAGTTTCCTCGAAAAAGTTACTAGCAGGATCAAACCTTGGAAACAAGTTGTATCACAAGCCGCAGAGT
CCTAGAATATCAGCCAGCACCAAGTGGGAAACCATAGTAAAATCTTCAAATTTACGCCACGATCACGTTCGCTACTCCGAGGAATTAATTCCTCGTGTTTCTCAG
CACTCAACAACAGAGAATTTCAAGTAG
mRNA sequenceShow/hide mRNA sequence
GAAAAGTTGAAAACAGAGAGCGCGTGAAAAATTGAAACCACCATCGGCGTAAAGCAACAAATTAAAATCCCTCTCAGAATTTGGCGCACAAAAAGTCACTGATTT
CGTTTCGATTTACTTTCCCATTCCTCATTTTCCCCTAACTACACGCCATCATTTGTGGACTCTTTCTAACAGGGACTATCGTCAATTTTCTCTCTTATTCTCTTT
CTCTCTGCATCCCTCCAAGTTACTGTACTTAAATTTTCGTCCCATAGTTTTTGTTGGTTGTGGGCGCACGCAATGGAGGAAAGAAAGCTCAATTTTAATGCACCT
CTCATGTCTGTGAGGCGATTTTCCAAGGCATCAAGTTCTTTAGCTAAAGTGAATGAAAAGAAATCTGAAAATTCCCAACTTAGTAGACGGAGTACCTTTCCAGTT
TCTAGACCACAATTCAATTTAGAGCAAGTAACAGAACCAGTTGCAGTTCCCTTCCACTGGGAGCATATTCCAGGAAGAGCTAAGAATGATAGTGGTTCAGCCTCA
CCTGAGGTTCAGCTGCCTCAGCCTCCTGAGAGAACTTGGTCTACTCCTAGGCTTTCTTTTGGGAGGGCTTTGGATGTTAAAAAACATAGTTCGGAAATGGAAGCT
TGTGATCGAAATGGGTGTGAAGCAAATTCTTCCAATGCCATTGTTGTTAGATTAGAGCCAACGAAAGCTAGTGATGCGAGGAGCTTGGCATCTGAGAATGATGAT
GATGATGATGATGATTTCTCCGATGCACGTGAGACGCTGTCCCTCACTGGTTCATTTTCTGTTAACAACTGTAGTGTAAGTGGTCTAAGTGGATACAACGGTCCC
TTGGTGAAACCATCGGGAACCTTCCGAACAGACCCTCAAACTCGAGATTTCATGATGAGCCGCTTCTTACCTGCAGCCAAGGCAATGGTTTTGGAGCCTGCTAAG
TATTCCTTAAAGAAGCAACTTGTAGCAGTTGAGCAATCTAGACAAATTAAGAAGGCGATGTCTGAGAATAGGAGGATGTCTCCGCTTAAACAGCTTGAGTCTACC
CTGTTACTACAGTATGGCAAAAATGAAGTACATGAAATAGATGAAGAAAGTGACTCTGTGGATGATGAATATGACAATCCAGGTAATATATCAGCTAGAGGTTGT
GGTCTAATACCCAATATATGCTTCAAAAACTCTTTGGGTCTTCTTCATCCTGTGCCTGGGGTGAGAATCAGGAACGAGGCACCCATGTCTTTCACTAATAAAGTT
GGGGGATCAAGCAGAACAATGCACCAGTCACACAGCCAAAAGATCAACAAGCATGGTTGGGATGCTGCTTACAAGCAAAAATCTGAAGCTGCTGTTGGATCACCT
AAGCTGCAGGAGGTAAAAGATAAGTGGACTGGTGAATTAAAACATTTTCCTTCCTCCACCGACCTGCAAATGAGAGGTAGGTCTTCTCCATTCAGGCATTCGAGG
CCTGCTTCTCCCTTCCGAAATGAAGCATCAGAGTCTTCTTGTAGAAAGCAGCCCCTTGTAGTTCCTAAAGAAGTTGAGACTTTCTCCAACTTTAAAGGTGATAAC
GACTTTCGTGATAGACCGTCCATTCGAGCAACTAAACATGGAGTTGACATGGCAAGTACCCTGATTGAGAAGACACTCTATATCGATACTGCAAGTGTTGCTGAA
ATAACTCCCCCATCAAATTCAAGCCTTTTGGATACTGAGAAGAAAGTCGATCATGCTAGGAGGAAGAATGAAACAGCATTTGAGACTAGAGTGATAGAAGAAACT
ACCACTGTGGAACCTTCTTTCCTAGAAGTAAAGTGCTTAACTTTGGTTGAAGAAGGGAAGCTGGAGCGTGAAGCTGCAGAATTGAAAAGGAAAGATGATATTAAT
GATGACTCCAAAATGGGGCATGAACTTGATAAAGAAGAGCTCTCTGAATTCTCTAATTTGGGCGCTGCTGATGAAGATGAATACTCCAAGGCCAATTATCAGCTA
GTAAAAGTAGAAGATCCAGCAAGTGTCAAAGTAACTTCTATGATATCTTCTCAACCTCCACCTCTACCGAAGTCTCCTTCCGAGTCTTGGCTTTGGCGTACCCTG
CCTTCAGTTTCCTCGAAAAAGTTACTAGCAGGATCAAACCTTGGAAACAAGTTGTATCACAAGCCGCAGAGTCCTAGAATATCAGCCAGCACCAAGTGGGAAACC
ATAGTAAAATCTTCAAATTTACGCCACGATCACGTTCGCTACTCCGAGGAATTAATTCCTCGTGTTTCTCAGCACTCAACAACAGAGAATTTCAAGTAGTTCTTT
GTGACACTCAGGTTGATTGATTGGCTTTGTGTGAAAGTTTATACATTAGCATGGCTTGTATTTTCCTTATGAATTCCTTTTCCCCTTCCTTTCCTCAATTTTTTT
TGAATAAGTTAGGAGGATTTGAGCTGCCCTTTTCCTTTTATGTCTTCTGATATTTATGTGAAGACATGTACGGGCATGGCCGAGGAAATATTCATATTTATAGAA
TGAATTCTGTATAATGCAGCCTAGGAATCAATATAGTGGCGAGGTAATTGCATTGTATATTCGAAGTAAAAGGTTTATTAAATAGTTTATGTGAATGAAAGTTTG
TTCATAAATTGTTGTT
Protein sequenceShow/hide protein sequence
MEERKLNFNAPLMSVRRFSKASSSLAKVNEKKSENSQLSRRSTFPVSRPQFNLEQVTEPVAVPFHWEHIPGRAKNDSGSASPEVQLPQPPERTWSTPRLSFGRAL
DVKKHSSEMEACDRNGCEANSSNAIVVRLEPTKASDARSLASENDDDDDDDFSDARETLSLTGSFSVNNCSVSGLSGYNGPLVKPSGTFRTDPQTRDFMMSRFLP
AAKAMVLEPAKYSLKKQLVAVEQSRQIKKAMSENRRMSPLKQLESTLLLQYGKNEVHEIDEESDSVDDEYDNPGNISARGCGLIPNICFKNSLGLLHPVPGVRIR
NEAPMSFTNKVGGSSRTMHQSHSQKINKHGWDAAYKQKSEAAVGSPKLQEVKDKWTGELKHFPSSTDLQMRGRSSPFRHSRPASPFRNEASESSCRKQPLVVPKE
VETFSNFKGDNDFRDRPSIRATKHGVDMASTLIEKTLYIDTASVAEITPPSNSSLLDTEKKVDHARRKNETAFETRVIEETTTVEPSFLEVKCLTLVEEGKLERE
AAELKRKDDINDDSKMGHELDKEELSEFSNLGAADEDEYSKANYQLVKVEDPASVKVTSMISSQPPPLPKSPSESWLWRTLPSVSSKKLLAGSNLGNKLYHKPQS
PRISASTKWETIVKSSNLRHDHVRYSEELIPRVSQHSTTENFK