CuGenDBv2

PI0001660 (gene) of Melon (PI 482460) v1 genome

Gene ID: PI0001660
Organism: Cucumis metuliferus PI 482460 (Melon (PI 482460) v1)
Description: TMEM135_C_rich domain-containing protein
Genome location: chr03:2481484..2487043
RNA-Seq Expression: PI0001660
Synteny: PI0001660
Gene Ontology terms: GO:0016021 - integral component of membrane (cellular component)
InterPro domains: IPR026749 - Transmembrane protein 135
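
The genome location in the record above follows a `chromosome:start..end` pattern. As a minimal sketch, the string can be split into its parts and the span length computed. The helper name `parse_location` is hypothetical, and the code assumes 1-based, inclusive coordinates, which the page itself does not state.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location: {location!r}")
    chrom, start, end = match.groups()
    return chrom, int(start), int(end)


chrom, start, end = parse_location("chr03:2481484..2487043")

# Assumption: coordinates are 1-based and inclusive, so both ends count.
span = end - start + 1
print(chrom, start, end, span)  # chr03 2481484 2487043 5560
```

Under that assumption the gene spans 5560 bp on chr03; with 0-based, half-open coordinates the length would be one less.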


Homology
GenBank top hits: e value, % identity, alignment
XP_008444833.1 PREDICTED: uncharacterized protein LOC103488060 [Cucumis melo] (e value: 6.6e-284; identity: 98.02%)
Query:  MSPSAGGGFDADGGCTCLAQQNGDAEAAANCKSSDSYCEHCSCGSADSSSFPSFSCSSSSLWLDSTRLREYGKLWRILVASAKGFTIGAGLKGGLSLFSV
        MSPSAGG FDA+GGC CLAQQNGDAE AANCKS DSYCEHC  GSADSSSFPSFSCSSSSLWLDSTRLREYGKLWRILVASAKGFTIGAGLKGGLSLFSV
Subjt:  MSPSAGGGFDADGGCTCLAQQNGDAEAAANCKSSDSYCEHCSCGSADSSSFPSFSCSSSSLWLDSTRLREYGKLWRILVASAKGFTIGAGLKGGLSLFSV

Query:  LAGLKRRKALASLGKKGVITNRDAISMALKETLRYGLFLGTFAGTFVSIDEIIGNLAGHRRTARWRALLAGALAGPSMLLTGLNTQHKTLAIYIFMRAAV
        LAGLKRRKALASLGKKGVITNRDAISMALKETLRYGLFLGTFAGTFVSIDEIIGNLAGHRRTARWRALLAGALAGPSMLLTGLNTQHKTLAIYIFMRAAV
Subjt:  LAGLKRRKALASLGKKGVITNRDAISMALKETLRYGLFLGTFAGTFVSIDEIIGNLAGHRRTARWRALLAGALAGPSMLLTGLNTQHKTLAIYIFMRAAV

Query:  LASRCGIKSKRLGHICKPLTWSYGDIFLMCLSSSQILSAYVLKQDSLPPSFRSFLNTHGGKDTIILEGLKSFVSGMPSSNKFKAIEKYYSAMGAAVKLDP
        LASRCGIKSKRLGHICKPLTWSYGDIFLMCLSSSQILSAYVLKQDSLPPSFRSFLNTHGGKDT+ILEGLKSFVSGMPSS+KFKAIEKYYSAMGAAVKLDP
Subjt:  LASRCGIKSKRLGHICKPLTWSYGDIFLMCLSSSQILSAYVLKQDSLPPSFRSFLNTHGGKDTIILEGLKSFVSGMPSSNKFKAIEKYYSAMGAAVKLDP

Query:  QMKTPCTIIHGNQSCGGHFLSFLIQGYKRALPVYLPVYLIPALIVHREGLMNRPYEILARGLLGTARSSLFLSAYCASAWMWTCLTSRTFKKINIPLVAL
        QMKTPCTIIHGNQSCGGHFLSFLIQGYKRALPVYLPVYLIPALIVHREGLMNRPYEILARGLLGTARSSLFLSAYCASAWMWTCLTSRTFKKINIPLVAL
Subjt:  QMKTPCTIIHGNQSCGGHFLSFLIQGYKRALPVYLPVYLIPALIVHREGLMNRPYEILARGLLGTARSSLFLSAYCASAWMWTCLTSRTFKKINIPLVAL

Query:  ATFLTGLALAIEKKSRRIEISLYCLARGIESFFSCMTDLGYLPPSLNFKRADVIVFSISTSVIMHCYAQEREVFRSKYLNVLDWVFGVPPPPCETPRCKN
        ATFLTGLALAIEKKSRRIEISLYCLARGIESFFSCMTDLGYLPPSLNFKRADVIVFSISTS+IMHCYAQEREVFRSKYLNVLDWVFGVPPPPCETPRCKN
Subjt:  ATFLTGLALAIEKKSRRIEISLYCLARGIESFFSCMTDLGYLPPSLNFKRADVIVFSISTSVIMHCYAQEREVFRSKYLNVLDWVFGVPPPPCETPRCKN

Query:  GKKY
        GKKY
Subjt:  GKKY

XP_011649669.1 uncharacterized protein LOC101202879 isoform X1 [Cucumis sativus] (e value: 5.8e-280; identity: 97.02%)
Query:  MSPSAGGGFDADGGCTCLAQQNGDAEAAANCKSSDSYCEHCSCGSADSSSFPSFSCSSSSLWLDSTRLREYGKLWRILVASAKGFTIGAGLKGGLSLFSV
        MSPSAGG FDADGGC CLAQQNGDAE AANCKS DSYCEHCS GSADSSSFPSFSCSSSSLWLDSTRLREYGKL RILVASAKGFTIGAGLKGGLSLFSV
Subjt:  MSPSAGGGFDADGGCTCLAQQNGDAEAAANCKSSDSYCEHCSCGSADSSSFPSFSCSSSSLWLDSTRLREYGKLWRILVASAKGFTIGAGLKGGLSLFSV

Query:  LAGLKRRKALASLGKKGVITNRDAISMALKETLRYGLFLGTFAGTFVSIDEIIGNLAGHRRTARWRALLAGALAGPSMLLTGLNTQHKTLAIYIFMRAAV
        LAGLKRRKALASLGKKGVITNRDAISMALKETLRYGLFLGTFAGTFVSIDEIIGN+AGHRRTARWRALLAGALAGPSMLLTGLNTQHKTLAIYIFMRAAV
Subjt:  LAGLKRRKALASLGKKGVITNRDAISMALKETLRYGLFLGTFAGTFVSIDEIIGNLAGHRRTARWRALLAGALAGPSMLLTGLNTQHKTLAIYIFMRAAV

Query:  LASRCGIKSKRLGHICKPLTWSYGDIFLMCLSSSQILSAYVLKQDSLPPSFRSFLNTHGGKDTIILEGLKSFVSGMPSSNKFKAIEKYYSAMGAAVKLDP
        LASRCGIKSKRLGHICKPLTWS GDIFLMCLSSSQILSAYVLKQDSLPPSFRSFLNTHGGKDT+ILEGLKSFVSGMPSSNKFKA+EKYYSAMG+ VKLDP
Subjt:  LASRCGIKSKRLGHICKPLTWSYGDIFLMCLSSSQILSAYVLKQDSLPPSFRSFLNTHGGKDTIILEGLKSFVSGMPSSNKFKAIEKYYSAMGAAVKLDP

Query:  QMKTPCTIIHGNQSCGGHFLSFLIQGYKRALPVYLPVYLIPALIVHREGLMNRPYEILARGLLGTARSSLFLSAYCASAWMWTCLTSRTFKKINIPLVAL
        QMKTPCTIIHGNQSCGGHFLSFLIQGYKRALPVYLPVYLIPALIVHREGLMNRPYEILARGLLGTARSSLFLSAYCASAWMWTCLTSRTFKKINIPLVAL
Subjt:  QMKTPCTIIHGNQSCGGHFLSFLIQGYKRALPVYLPVYLIPALIVHREGLMNRPYEILARGLLGTARSSLFLSAYCASAWMWTCLTSRTFKKINIPLVAL

Query:  ATFLTGLALAIEKKSRRIEISLYCLARGIESFFSCMTDLGYLPPSLNFKRADVIVFSISTSVIMHCYAQEREVFRSKYLNVLDWVFGVPPPPCETPRCKN
        ATFLTGLALAIEKKSRRIEISLYCL+RGIESFFSCMTDLGYLPPSLNFKRADVIVFSISTS+IMHCYAQEREVFRSKYLNVLDWVFGVPPPPCETPRCKN
Subjt:  ATFLTGLALAIEKKSRRIEISLYCLARGIESFFSCMTDLGYLPPSLNFKRADVIVFSISTSVIMHCYAQEREVFRSKYLNVLDWVFGVPPPPCETPRCKN

Query:  GKK
        G K
Subjt:  GKK

XP_022131834.1 uncharacterized protein LOC111004892 isoform X1 [Momordica charantia] (e value: 2.6e-264; identity: 90.67%)
Query:  MSPSAGGGFDADGGCTCLAQQNGDAEAAANCKSSDSYCEHCSCGSADSSSFPSFSCSSSSLWLDSTRLREYGKLWRILVASAKGFTIGAGLKGGLSLFSV
        MSPSA  GFDADGGC C+A+QNGD E+A NCKS +SYC+HC CGSADSSS P+FSCSSSSLW DS RLRE GKLWRILVASAKGFTIGAGLKGGLSLFS+
Subjt:  MSPSAGGGFDADGGCTCLAQQNGDAEAAANCKSSDSYCEHCSCGSADSSSFPSFSCSSSSLWLDSTRLREYGKLWRILVASAKGFTIGAGLKGGLSLFSV

Query:  LAGLKRRKALASLGKKGVITNRDAISMALKETLRYGLFLGTFAGTFVSIDEIIGNLAGHRRTARWRALLAGALAGPSMLLTGLNTQHKTLAIYIFMRAAV
        LAGLKRRKALASL KKGVITNRDAISMALKETLRYGLFLGTFAGTFVSIDEIIGNLAGHRRTARWRALLAGALAGPSMLLTGLNTQHK+LAIYIFMRAAV
Subjt:  LAGLKRRKALASLGKKGVITNRDAISMALKETLRYGLFLGTFAGTFVSIDEIIGNLAGHRRTARWRALLAGALAGPSMLLTGLNTQHKTLAIYIFMRAAV

Query:  LASRCGIKSKRLGHICKPLTWSYGDIFLMCLSSSQILSAYVLKQDSLPPSFRSFLNTHGGKDTIILEGLKSFVSGMPSSNKFKAIEKYYSAMGAAVKLDP
        LASRCGIKSK+LGHICKPLTWSYGDIFLMCLSSSQILSAYVLKQDSLPPSFRSFLNTHGGKDT+ILEGLKSFVSGMPSSNKF+ IEKYY AMGA  KLD 
Subjt:  LASRCGIKSKRLGHICKPLTWSYGDIFLMCLSSSQILSAYVLKQDSLPPSFRSFLNTHGGKDTIILEGLKSFVSGMPSSNKFKAIEKYYSAMGAAVKLDP

Query:  QMKTPCTIIHGNQSCGGHFLSFLIQGYKRALPVYLPVYLIPALIVHREGLMNRPYEILARGLLGTARSSLFLSAYCASAWMWTCLTSRTFKKINIPLVAL
        QMKTPCTIIHGNQSCGGHF+SFLIQGYKRALPVYLPVYL+PALIVHR+ L+NRP EILARGLLGTARSSLFLS YC+SAWMWTCLTSRTFKKINIPLVAL
Subjt:  QMKTPCTIIHGNQSCGGHFLSFLIQGYKRALPVYLPVYLIPALIVHREGLMNRPYEILARGLLGTARSSLFLSAYCASAWMWTCLTSRTFKKINIPLVAL

Query:  ATFLTGLALAIEKKSRRIEISLYCLARGIESFFSCMTDLGYLPPSLNFKRADVIVFSISTSVIMHCYAQEREVFRSKYLNVLDWVFGVPPPPCETPRCKN
        ATFLTGLALAIEKKSRRIEISLYCLARGIESFF C TD GYLP SLNFKRADVIVFS+ST++IMHCYAQEREVFRSKYLNVLDWVFGVPPPPCETPRC N
Subjt:  ATFLTGLALAIEKKSRRIEISLYCLARGIESFFSCMTDLGYLPPSLNFKRADVIVFSISTSVIMHCYAQEREVFRSKYLNVLDWVFGVPPPPCETPRCKN

Query:  GKKY
         K++
Subjt:  GKKY

XP_023545566.1 uncharacterized protein LOC111804956 isoform X1 [Cucurbita pepo subsp. pepo] (e value: 4.3e-267; identity: 91.45%)
Query:  MSPSAGGGFDADGGCTCLAQQNGDAEAAANCKSSDSYCEHCSCGSADSSSFPSFSCSSSSLWLDSTRLREYGKLWRILVASAKGFTIGAGLKGGLSLFSV
        MSPSA G F ADGGC CLA++NGD  A  NCKS DSYC+HC CGSAD SS P+FSCSSSSLW DS RLRE GKLWRILVASAKGFTIGAGLKGGLSLFSV
Subjt:  MSPSAGGGFDADGGCTCLAQQNGDAEAAANCKSSDSYCEHCSCGSADSSSFPSFSCSSSSLWLDSTRLREYGKLWRILVASAKGFTIGAGLKGGLSLFSV

Query:  LAGLKRRKALASLGKKGVITNRDAISMALKETLRYGLFLGTFAGTFVSIDEIIGNLAGHRRTARWRALLAGALAGPSMLLTGLNTQHKTLAIYIFMRAAV
        LAGLKRRKALASLGKKGVITNRDA+SMALKETLRYGLFLGTFAGTFVSIDEIIGNLAGHRRTARWRAL+AGALAGPSMLLTGLNTQHKTLAIYIFMRAAV
Subjt:  LAGLKRRKALASLGKKGVITNRDAISMALKETLRYGLFLGTFAGTFVSIDEIIGNLAGHRRTARWRALLAGALAGPSMLLTGLNTQHKTLAIYIFMRAAV

Query:  LASRCGIKSKRLGHICKPLTWSYGDIFLMCLSSSQILSAYVLKQDSLPPSFRSFLNTHGGKDTIILEGLKSFVSGMPSSNKFKAIEKYYSAMGAAVKLDP
        LASRCGI+SKRLGHICKPLTWSYGDIFLMCLSSSQILSAYVLKQDSLPPSFRSFLNTHGGKDT+ILEGLKSF+ GMPS NKF AIEKYY   GA VKLDP
Subjt:  LASRCGIKSKRLGHICKPLTWSYGDIFLMCLSSSQILSAYVLKQDSLPPSFRSFLNTHGGKDTIILEGLKSFVSGMPSSNKFKAIEKYYSAMGAAVKLDP

Query:  QMKTPCTIIHGNQSCGGHFLSFLIQGYKRALPVYLPVYLIPALIVHREGLMNRPYEILARGLLGTARSSLFLSAYCASAWMWTCLTSRTFKKINIPLVAL
         MKTPCTIIHGNQSCGGHFLSF+IQGYKRALPVYLPVYLIPALIVHR+GLMNRPYEILARGLLGTARSSLFLS YCASAW+WTCLT+RTF+KIN+PLVA+
Subjt:  QMKTPCTIIHGNQSCGGHFLSFLIQGYKRALPVYLPVYLIPALIVHREGLMNRPYEILARGLLGTARSSLFLSAYCASAWMWTCLTSRTFKKINIPLVAL

Query:  ATFLTGLALAIEKKSRRIEISLYCLARGIESFFSCMTDLGYLPPSLNFKRADVIVFSISTSVIMHCYAQEREVFRSKYLNVLDWVFGVPPPPCETPRCKN
        ATFLTGLALAIEKKSRRIEISLYCLARGIESFFSCMTDLGYLPPSLNFKRADVIVFS+STS+IMHCYAQEREVFRSKYLNVLDWVFGVPPPPCETPRCKN
Subjt:  ATFLTGLALAIEKKSRRIEISLYCLARGIESFFSCMTDLGYLPPSLNFKRADVIVFSISTSVIMHCYAQEREVFRSKYLNVLDWVFGVPPPPCETPRCKN

Query:  GKK
        GK+
Subjt:  GKK

XP_038886157.1 uncharacterized protein LOC120076412 isoform X1 [Benincasa hispida] (e value: 4.0e-273; identity: 94.44%)
Query:  MSPSAGGGFDADGGCTCLAQQNGDAEAAANCKSSDSYCEHCSCGSADSSSFPSFSCSSSSLWLDSTRLREYGKLWRILVASAKGFTIGAGLKGGLSLFSV
        MSPSA G FDADGGC C A QNGDA+A AN KS DSYCEHC CGSADSSSFPSFSCSSSSLWLDSTRLRE GKLWRILVASAKGFTIGAGLKGGLSLFSV
Subjt:  MSPSAGGGFDADGGCTCLAQQNGDAEAAANCKSSDSYCEHCSCGSADSSSFPSFSCSSSSLWLDSTRLREYGKLWRILVASAKGFTIGAGLKGGLSLFSV

Query:  LAGLKRRKALASLGKKGVITNRDAISMALKETLRYGLFLGTFAGTFVSIDEIIGNLAGHRRTARWRALLAGALAGPSMLLTGLNTQHKTLAIYIFMRAAV
        LAGLKRRKALASLGKKGVITNRDAISMALKETLRYGLFLGTFAGTFVSIDEI+GNL GHRRTA WRALLAGALAGPSMLLTGLNTQHKTLAIYIFMRAAV
Subjt:  LAGLKRRKALASLGKKGVITNRDAISMALKETLRYGLFLGTFAGTFVSIDEIIGNLAGHRRTARWRALLAGALAGPSMLLTGLNTQHKTLAIYIFMRAAV

Query:  LASRCGIKSKRLGHICKPLTWSYGDIFLMCLSSSQILSAYVLKQDSLPPSFRSFLNTHGGKDTIILEGLKSFVSGMPSSNKFKAIEKYYSAMGAAVKLDP
        LASRCGIKSKR GHICKPLTWSYGDIFLMCLSSSQILSAYVLKQDSLPPSFRSFLNTHGGKDT+ILEGLKSFVSGMPSSNKFKAIEKYYSAMGA V+L+ 
Subjt:  LASRCGIKSKRLGHICKPLTWSYGDIFLMCLSSSQILSAYVLKQDSLPPSFRSFLNTHGGKDTIILEGLKSFVSGMPSSNKFKAIEKYYSAMGAAVKLDP

Query:  QMKTPCTIIHGNQSCGGHFLSFLIQGYKRALPVYLPVYLIPALIVHREGLMNRPYEILARGLLGTARSSLFLSAYCASAWMWTCLTSRTFKKINIPLVAL
        QMKTPC IIHGNQSCGGHFLSFLI+GYKRALPVYLPVYLIPALIVHREGLMNRPYEILARGLLGTARSSLFLSAYCASAWMWTCLT+R+FKKINIPLVA+
Subjt:  QMKTPCTIIHGNQSCGGHFLSFLIQGYKRALPVYLPVYLIPALIVHREGLMNRPYEILARGLLGTARSSLFLSAYCASAWMWTCLTSRTFKKINIPLVAL

Query:  ATFLTGLALAIEKKSRRIEISLYCLARGIESFFSCMTDLGYLPPSLNFKRADVIVFSISTSVIMHCYAQEREVFRSKYLNVLDWVFGVPPPPCETPRCKN
        ATFLTGLALAIEKKSRRIEISLYCLARGIESFFS MTDLGYLPPSLNFKRADVIVFSISTS+IMHCYAQEREVFRSKYLNVLDWVFGVPPPPCETPRCKN
Subjt:  ATFLTGLALAIEKKSRRIEISLYCLARGIESFFSCMTDLGYLPPSLNFKRADVIVFSISTSVIMHCYAQEREVFRSKYLNVLDWVFGVPPPPCETPRCKN

Query:  GKKY
        GK+Y
Subjt:  GKKY

TrEMBL top hits: e value, % identity, alignment
A0A0A0LRQ1 Uncharacterized protein (e value: 2.8e-280; identity: 97.02%)
Query:  MSPSAGGGFDADGGCTCLAQQNGDAEAAANCKSSDSYCEHCSCGSADSSSFPSFSCSSSSLWLDSTRLREYGKLWRILVASAKGFTIGAGLKGGLSLFSV
        MSPSAGG FDADGGC CLAQQNGDAE AANCKS DSYCEHCS GSADSSSFPSFSCSSSSLWLDSTRLREYGKL RILVASAKGFTIGAGLKGGLSLFSV
Subjt:  MSPSAGGGFDADGGCTCLAQQNGDAEAAANCKSSDSYCEHCSCGSADSSSFPSFSCSSSSLWLDSTRLREYGKLWRILVASAKGFTIGAGLKGGLSLFSV

Query:  LAGLKRRKALASLGKKGVITNRDAISMALKETLRYGLFLGTFAGTFVSIDEIIGNLAGHRRTARWRALLAGALAGPSMLLTGLNTQHKTLAIYIFMRAAV
        LAGLKRRKALASLGKKGVITNRDAISMALKETLRYGLFLGTFAGTFVSIDEIIGN+AGHRRTARWRALLAGALAGPSMLLTGLNTQHKTLAIYIFMRAAV
Subjt:  LAGLKRRKALASLGKKGVITNRDAISMALKETLRYGLFLGTFAGTFVSIDEIIGNLAGHRRTARWRALLAGALAGPSMLLTGLNTQHKTLAIYIFMRAAV

Query:  LASRCGIKSKRLGHICKPLTWSYGDIFLMCLSSSQILSAYVLKQDSLPPSFRSFLNTHGGKDTIILEGLKSFVSGMPSSNKFKAIEKYYSAMGAAVKLDP
        LASRCGIKSKRLGHICKPLTWS GDIFLMCLSSSQILSAYVLKQDSLPPSFRSFLNTHGGKDT+ILEGLKSFVSGMPSSNKFKA+EKYYSAMG+ VKLDP
Subjt:  LASRCGIKSKRLGHICKPLTWSYGDIFLMCLSSSQILSAYVLKQDSLPPSFRSFLNTHGGKDTIILEGLKSFVSGMPSSNKFKAIEKYYSAMGAAVKLDP

Query:  QMKTPCTIIHGNQSCGGHFLSFLIQGYKRALPVYLPVYLIPALIVHREGLMNRPYEILARGLLGTARSSLFLSAYCASAWMWTCLTSRTFKKINIPLVAL
        QMKTPCTIIHGNQSCGGHFLSFLIQGYKRALPVYLPVYLIPALIVHREGLMNRPYEILARGLLGTARSSLFLSAYCASAWMWTCLTSRTFKKINIPLVAL
Subjt:  QMKTPCTIIHGNQSCGGHFLSFLIQGYKRALPVYLPVYLIPALIVHREGLMNRPYEILARGLLGTARSSLFLSAYCASAWMWTCLTSRTFKKINIPLVAL

Query:  ATFLTGLALAIEKKSRRIEISLYCLARGIESFFSCMTDLGYLPPSLNFKRADVIVFSISTSVIMHCYAQEREVFRSKYLNVLDWVFGVPPPPCETPRCKN
        ATFLTGLALAIEKKSRRIEISLYCL+RGIESFFSCMTDLGYLPPSLNFKRADVIVFSISTS+IMHCYAQEREVFRSKYLNVLDWVFGVPPPPCETPRCKN
Subjt:  ATFLTGLALAIEKKSRRIEISLYCLARGIESFFSCMTDLGYLPPSLNFKRADVIVFSISTSVIMHCYAQEREVFRSKYLNVLDWVFGVPPPPCETPRCKN

Query:  GKK
        G K
Subjt:  GKK

A0A1S3BAT4 uncharacterized protein LOC103488060 (e value: 3.2e-284; identity: 98.02%)
Query:  MSPSAGGGFDADGGCTCLAQQNGDAEAAANCKSSDSYCEHCSCGSADSSSFPSFSCSSSSLWLDSTRLREYGKLWRILVASAKGFTIGAGLKGGLSLFSV
        MSPSAGG FDA+GGC CLAQQNGDAE AANCKS DSYCEHC  GSADSSSFPSFSCSSSSLWLDSTRLREYGKLWRILVASAKGFTIGAGLKGGLSLFSV
Subjt:  MSPSAGGGFDADGGCTCLAQQNGDAEAAANCKSSDSYCEHCSCGSADSSSFPSFSCSSSSLWLDSTRLREYGKLWRILVASAKGFTIGAGLKGGLSLFSV

Query:  LAGLKRRKALASLGKKGVITNRDAISMALKETLRYGLFLGTFAGTFVSIDEIIGNLAGHRRTARWRALLAGALAGPSMLLTGLNTQHKTLAIYIFMRAAV
        LAGLKRRKALASLGKKGVITNRDAISMALKETLRYGLFLGTFAGTFVSIDEIIGNLAGHRRTARWRALLAGALAGPSMLLTGLNTQHKTLAIYIFMRAAV
Subjt:  LAGLKRRKALASLGKKGVITNRDAISMALKETLRYGLFLGTFAGTFVSIDEIIGNLAGHRRTARWRALLAGALAGPSMLLTGLNTQHKTLAIYIFMRAAV

Query:  LASRCGIKSKRLGHICKPLTWSYGDIFLMCLSSSQILSAYVLKQDSLPPSFRSFLNTHGGKDTIILEGLKSFVSGMPSSNKFKAIEKYYSAMGAAVKLDP
        LASRCGIKSKRLGHICKPLTWSYGDIFLMCLSSSQILSAYVLKQDSLPPSFRSFLNTHGGKDT+ILEGLKSFVSGMPSS+KFKAIEKYYSAMGAAVKLDP
Subjt:  LASRCGIKSKRLGHICKPLTWSYGDIFLMCLSSSQILSAYVLKQDSLPPSFRSFLNTHGGKDTIILEGLKSFVSGMPSSNKFKAIEKYYSAMGAAVKLDP

Query:  QMKTPCTIIHGNQSCGGHFLSFLIQGYKRALPVYLPVYLIPALIVHREGLMNRPYEILARGLLGTARSSLFLSAYCASAWMWTCLTSRTFKKINIPLVAL
        QMKTPCTIIHGNQSCGGHFLSFLIQGYKRALPVYLPVYLIPALIVHREGLMNRPYEILARGLLGTARSSLFLSAYCASAWMWTCLTSRTFKKINIPLVAL
Subjt:  QMKTPCTIIHGNQSCGGHFLSFLIQGYKRALPVYLPVYLIPALIVHREGLMNRPYEILARGLLGTARSSLFLSAYCASAWMWTCLTSRTFKKINIPLVAL

Query:  ATFLTGLALAIEKKSRRIEISLYCLARGIESFFSCMTDLGYLPPSLNFKRADVIVFSISTSVIMHCYAQEREVFRSKYLNVLDWVFGVPPPPCETPRCKN
        ATFLTGLALAIEKKSRRIEISLYCLARGIESFFSCMTDLGYLPPSLNFKRADVIVFSISTS+IMHCYAQEREVFRSKYLNVLDWVFGVPPPPCETPRCKN
Subjt:  ATFLTGLALAIEKKSRRIEISLYCLARGIESFFSCMTDLGYLPPSLNFKRADVIVFSISTSVIMHCYAQEREVFRSKYLNVLDWVFGVPPPPCETPRCKN

Query:  GKKY
        GKKY
Subjt:  GKKY

A0A5A7VHD2 Uncharacterized protein (e value: 3.2e-284; identity: 98.02%)
Query:  MSPSAGGGFDADGGCTCLAQQNGDAEAAANCKSSDSYCEHCSCGSADSSSFPSFSCSSSSLWLDSTRLREYGKLWRILVASAKGFTIGAGLKGGLSLFSV
        MSPSAGG FDA+GGC CLAQQNGDAE AANCKS DSYCEHC  GSADSSSFPSFSCSSSSLWLDSTRLREYGKLWRILVASAKGFTIGAGLKGGLSLFSV
Subjt:  MSPSAGGGFDADGGCTCLAQQNGDAEAAANCKSSDSYCEHCSCGSADSSSFPSFSCSSSSLWLDSTRLREYGKLWRILVASAKGFTIGAGLKGGLSLFSV

Query:  LAGLKRRKALASLGKKGVITNRDAISMALKETLRYGLFLGTFAGTFVSIDEIIGNLAGHRRTARWRALLAGALAGPSMLLTGLNTQHKTLAIYIFMRAAV
        LAGLKRRKALASLGKKGVITNRDAISMALKETLRYGLFLGTFAGTFVSIDEIIGNLAGHRRTARWRALLAGALAGPSMLLTGLNTQHKTLAIYIFMRAAV
Subjt:  LAGLKRRKALASLGKKGVITNRDAISMALKETLRYGLFLGTFAGTFVSIDEIIGNLAGHRRTARWRALLAGALAGPSMLLTGLNTQHKTLAIYIFMRAAV

Query:  LASRCGIKSKRLGHICKPLTWSYGDIFLMCLSSSQILSAYVLKQDSLPPSFRSFLNTHGGKDTIILEGLKSFVSGMPSSNKFKAIEKYYSAMGAAVKLDP
        LASRCGIKSKRLGHICKPLTWSYGDIFLMCLSSSQILSAYVLKQDSLPPSFRSFLNTHGGKDT+ILEGLKSFVSGMPSS+KFKAIEKYYSAMGAAVKLDP
Subjt:  LASRCGIKSKRLGHICKPLTWSYGDIFLMCLSSSQILSAYVLKQDSLPPSFRSFLNTHGGKDTIILEGLKSFVSGMPSSNKFKAIEKYYSAMGAAVKLDP

Query:  QMKTPCTIIHGNQSCGGHFLSFLIQGYKRALPVYLPVYLIPALIVHREGLMNRPYEILARGLLGTARSSLFLSAYCASAWMWTCLTSRTFKKINIPLVAL
        QMKTPCTIIHGNQSCGGHFLSFLIQGYKRALPVYLPVYLIPALIVHREGLMNRPYEILARGLLGTARSSLFLSAYCASAWMWTCLTSRTFKKINIPLVAL
Subjt:  QMKTPCTIIHGNQSCGGHFLSFLIQGYKRALPVYLPVYLIPALIVHREGLMNRPYEILARGLLGTARSSLFLSAYCASAWMWTCLTSRTFKKINIPLVAL

Query:  ATFLTGLALAIEKKSRRIEISLYCLARGIESFFSCMTDLGYLPPSLNFKRADVIVFSISTSVIMHCYAQEREVFRSKYLNVLDWVFGVPPPPCETPRCKN
        ATFLTGLALAIEKKSRRIEISLYCLARGIESFFSCMTDLGYLPPSLNFKRADVIVFSISTS+IMHCYAQEREVFRSKYLNVLDWVFGVPPPPCETPRCKN
Subjt:  ATFLTGLALAIEKKSRRIEISLYCLARGIESFFSCMTDLGYLPPSLNFKRADVIVFSISTSVIMHCYAQEREVFRSKYLNVLDWVFGVPPPPCETPRCKN

Query:  GKKY
        GKKY
Subjt:  GKKY

A0A6J1BRD3 uncharacterized protein LOC111004892 isoform X1 (e value: 1.3e-264; identity: 90.67%)
Query:  MSPSAGGGFDADGGCTCLAQQNGDAEAAANCKSSDSYCEHCSCGSADSSSFPSFSCSSSSLWLDSTRLREYGKLWRILVASAKGFTIGAGLKGGLSLFSV
        MSPSA  GFDADGGC C+A+QNGD E+A NCKS +SYC+HC CGSADSSS P+FSCSSSSLW DS RLRE GKLWRILVASAKGFTIGAGLKGGLSLFS+
Subjt:  MSPSAGGGFDADGGCTCLAQQNGDAEAAANCKSSDSYCEHCSCGSADSSSFPSFSCSSSSLWLDSTRLREYGKLWRILVASAKGFTIGAGLKGGLSLFSV

Query:  LAGLKRRKALASLGKKGVITNRDAISMALKETLRYGLFLGTFAGTFVSIDEIIGNLAGHRRTARWRALLAGALAGPSMLLTGLNTQHKTLAIYIFMRAAV
        LAGLKRRKALASL KKGVITNRDAISMALKETLRYGLFLGTFAGTFVSIDEIIGNLAGHRRTARWRALLAGALAGPSMLLTGLNTQHK+LAIYIFMRAAV
Subjt:  LAGLKRRKALASLGKKGVITNRDAISMALKETLRYGLFLGTFAGTFVSIDEIIGNLAGHRRTARWRALLAGALAGPSMLLTGLNTQHKTLAIYIFMRAAV

Query:  LASRCGIKSKRLGHICKPLTWSYGDIFLMCLSSSQILSAYVLKQDSLPPSFRSFLNTHGGKDTIILEGLKSFVSGMPSSNKFKAIEKYYSAMGAAVKLDP
        LASRCGIKSK+LGHICKPLTWSYGDIFLMCLSSSQILSAYVLKQDSLPPSFRSFLNTHGGKDT+ILEGLKSFVSGMPSSNKF+ IEKYY AMGA  KLD 
Subjt:  LASRCGIKSKRLGHICKPLTWSYGDIFLMCLSSSQILSAYVLKQDSLPPSFRSFLNTHGGKDTIILEGLKSFVSGMPSSNKFKAIEKYYSAMGAAVKLDP

Query:  QMKTPCTIIHGNQSCGGHFLSFLIQGYKRALPVYLPVYLIPALIVHREGLMNRPYEILARGLLGTARSSLFLSAYCASAWMWTCLTSRTFKKINIPLVAL
        QMKTPCTIIHGNQSCGGHF+SFLIQGYKRALPVYLPVYL+PALIVHR+ L+NRP EILARGLLGTARSSLFLS YC+SAWMWTCLTSRTFKKINIPLVAL
Subjt:  QMKTPCTIIHGNQSCGGHFLSFLIQGYKRALPVYLPVYLIPALIVHREGLMNRPYEILARGLLGTARSSLFLSAYCASAWMWTCLTSRTFKKINIPLVAL

Query:  ATFLTGLALAIEKKSRRIEISLYCLARGIESFFSCMTDLGYLPPSLNFKRADVIVFSISTSVIMHCYAQEREVFRSKYLNVLDWVFGVPPPPCETPRCKN
        ATFLTGLALAIEKKSRRIEISLYCLARGIESFF C TD GYLP SLNFKRADVIVFS+ST++IMHCYAQEREVFRSKYLNVLDWVFGVPPPPCETPRC N
Subjt:  ATFLTGLALAIEKKSRRIEISLYCLARGIESFFSCMTDLGYLPPSLNFKRADVIVFSISTSVIMHCYAQEREVFRSKYLNVLDWVFGVPPPPCETPRCKN

Query:  GKKY
         K++
Subjt:  GKKY

A0A6J1HFI3 uncharacterized protein LOC111462580 isoform X1 (e value: 2.1e-264; identity: 91.25%)
Query:  MSPSAGGGFDADGGCTCLAQQNGDAEAAANCKSSDSYCEHCSCGSADSSSFPSFSCSSSSLWLDSTRLREYGKLWRILVASAKGFTIGAGLKGGLSLFSV
        MSPSA G F ADGGC CLA++NGD  A  NCKSSDSYC+HC CGSAD SS P FSCSSSSLW DSTRL E GKLWRILVASAKGFTIGAGLKGGLSLFSV
Subjt:  MSPSAGGGFDADGGCTCLAQQNGDAEAAANCKSSDSYCEHCSCGSADSSSFPSFSCSSSSLWLDSTRLREYGKLWRILVASAKGFTIGAGLKGGLSLFSV

Query:  LAGLKRRKALASLGKKGVITNRDAISMALKETLRYGLFLGTFAGTFVSIDEIIGNLAGHRRTARWRALLAGALAGPSMLLTGLNTQHKTLAIYIFMRAAV
        LAGLKRRKALASLGKKGVITNRDAISMALKETLRYGLFLGTFAGTFVSIDEIIGNLAGHRRTARWRAL+AGALAGPSMLLTGLNTQHKTLAIYIFMRAAV
Subjt:  LAGLKRRKALASLGKKGVITNRDAISMALKETLRYGLFLGTFAGTFVSIDEIIGNLAGHRRTARWRALLAGALAGPSMLLTGLNTQHKTLAIYIFMRAAV

Query:  LASRCGIKSKRLGHICKPLTWSYGDIFLMCLSSSQILSAYVLKQDSLPPSFRSFLNTHGGKDTIILEGLKSFVSGMPSSNKFKAIEKYYSAMGAAVKLDP
        LASRCGI+SKRLGHICKPLTWSYGDIFLMCLSSSQILSAYVLKQDSLPPSFRSFLNTHGGKDT+ILEGLKSF+ GMP SNKF AIEKYY   GA VKLDP
Subjt:  LASRCGIKSKRLGHICKPLTWSYGDIFLMCLSSSQILSAYVLKQDSLPPSFRSFLNTHGGKDTIILEGLKSFVSGMPSSNKFKAIEKYYSAMGAAVKLDP

Query:  QMKTPCTIIHGNQSCGGHFLSFLIQGYKRALPVYLPVYLIPALIVHREGLMNRPYEILARGLLGTARSSLFLSAYCASAWMWTCLTSRTFKKINIPLVAL
         MKTPCTIIHGNQSCGGHFLSF+I+GYKRALPVYLPVYLIPALIVHR+GLMNRPYEILARGLLGTARSSLFLS YCASAW+WTCLT+RTF+KIN+PLVA+
Subjt:  QMKTPCTIIHGNQSCGGHFLSFLIQGYKRALPVYLPVYLIPALIVHREGLMNRPYEILARGLLGTARSSLFLSAYCASAWMWTCLTSRTFKKINIPLVAL

Query:  ATFLTGLALAIEKKSRRIEISLYCLARGIESFFSCMTDLGYLPPSLNFKRADVIVFSISTSVIMHCYAQEREVFRSKYLNVLDWVFGVPPPPCETPRCKN
        ATFLTGLALAIEKKSRRIEISLYCLARGIESFFSCMTDLGYLP SLNFKRADVIVFS+STS+IMHCYAQER VFRSKYLNVLDWVFGVPPPPCETPRCKN
Subjt:  ATFLTGLALAIEKKSRRIEISLYCLARGIESFFSCMTDLGYLPPSLNFKRADVIVFSISTSVIMHCYAQEREVFRSKYLNVLDWVFGVPPPPCETPRCKN

Query:  GKK
        GK+
Subjt:  GKK

SwissProt top hits: e value, % identity, alignment
Q6GQ39 Transmembrane protein 135 (e value: 2.5e-04; identity: 25.15%)
Query:  NQSCGGHFLSFLIQGYKRALPVYLPVYLIPALIVHREGLMNRPYEILARGLLG----TARSSLFLSAYC------ASAWMWTCLTSRTFKKINIPLVALA
        N SC G +L       + +  +Y P+YL+ A I+ R+ L    +++L   L      TA  SL+++ +C         + WT            P    A
Subjt:  NQSCGGHFLSFLIQGYKRALPVYLPVYLIPALIVHREGLMNRPYEILARGLLG----TARSSLFLSAYC------ASAWMWTCLTSRTFKKINIPLVALA

Query:  TFLTGLALAIEKKSRRIEISLYCLARGIESFFSCMTDLGYLPPSLNFKRADVIVFSISTSVIM
           +  A+ IE+KSRR  +++Y   +  E+ F      GY+ P    +  +V++F I++++ M
Subjt:  TFLTGLALAIEKKSRRIEISLYCLARGIESFFSCMTDLGYLPPSLNFKRADVIVFSISTSVIM

Q95QD1 Transmembrane protein 135 homolog (e value: 3.9e-05; identity: 32.26%)
Query:  IPLVALATFLTGLALAIEKKSRRIEISLYCLARGIESFFSCMTDLGYLPPSLNFKRADVIVFSISTSVIMHCYAQEREVFRSKYLNVLDWVFG
        +P V    F  GLA         + I++YCL + IE+ +  + D GYLP    FK  +VI+++I+T  ++     E    R  YLN L  + G
Subjt:  IPLVALATFLTGLALAIEKKSRRIEISLYCLARGIESFFSCMTDLGYLPPSLNFKRADVIVFSISTSVIMHCYAQEREVFRSKYLNVLDWVFG

Arabidopsis top hits: e value, % identity, alignment
AT1G34630.1 BEST Arabidopsis thaliana protein match is: Mitochondrial import inner membrane translocase subunit Tim17/Tim22/Tim23 family protein (TAIR:AT5G51150.1) (e value: 1.8e-175; identity: 65.19%)
Query:  SSDSYCEHCSCGSAD------SSSFPSFSCSSSSLWLDSTRLREYGKLWRILVASAKGFTIGAGLKGGLSLFSVLAGLKRRKALASLGKK-GVITNRDAI
        ++ S C  C     D      S  F    C +S     S  + +  KL RI+VAS KGFTIG GLKGGL++FS++A   RR+  +   +K G  +N +AI
Subjt:  SSDSYCEHCSCGSAD------SSSFPSFSCSSSSLWLDSTRLREYGKLWRILVASAKGFTIGAGLKGGLSLFSVLAGLKRRKALASLGKK-GVITNRDAI

Query:  SMALKETLRYGLFLGTFAGTFVSIDEIIGNLAGHRRTARWRALLAGALAGPSMLLTGLNTQHKTLAIYIFMRAAVLASRCGIKSKRLGHICKPLTWSYGD
        +M +KETLRYGLFLGTFAGTFVS+DE I  LAG +RTA+WRAL AG +AGPSMLLTG NTQH +LA+YI MRAAVLASRCGIKSKR G ICKPLTW +GD
Subjt:  SMALKETLRYGLFLGTFAGTFVSIDEIIGNLAGHRRTARWRALLAGALAGPSMLLTGLNTQHKTLAIYIFMRAAVLASRCGIKSKRLGHICKPLTWSYGD

Query:  IFLMCLSSSQILSAYVLKQDSLPPSFRSFLNTHGGKDTIILEGLKSFVSGMPSSNKFKAIEKYYSAMGAAVKLDPQMKTPCTIIHGNQSCGGHFLSFLIQ
        +FLMCLSSSQILSAY+LKQ+SLP S++SFLN  GGKD  IL+G+K   +  P +N  +AIEKYY ++G  +KLDP MK PCTIIHGN+SC  H ++F +Q
Subjt:  IFLMCLSSSQILSAYVLKQDSLPPSFRSFLNTHGGKDTIILEGLKSFVSGMPSSNKFKAIEKYYSAMGAAVKLDPQMKTPCTIIHGNQSCGGHFLSFLIQ

Query:  GYKRALPVYLPVYLIPALIVHREGLMNRPYEILARGLLGTARSSLFLSAYCASAWMWTCLTSRTFKKINIPLVALATFLTGLALAIEKKSRRIEISLYCL
         YKRALPVY+PVYLIPALIVHR+ L+ + Y IL +GLLGTARSSLFL+ YC+SAW WTCL  RTF+  NIPLVA+ATF TGLALAIEKKSRRIEISLYCL
Subjt:  GYKRALPVYLPVYLIPALIVHREGLMNRPYEILARGLLGTARSSLFLSAYCASAWMWTCLTSRTFKKINIPLVALATFLTGLALAIEKKSRRIEISLYCL

Query:  ARGIESFFSCMTDLGYLPPSLNFKRADVIVFSISTSVIMHCYAQEREVFRSKYLNVLDWVFGVPPPP----CET
        AR IESFF+CMT+ GY+ P  + +RADV+VFS+ST++IMHCYAQER+VFRSKYLNVLDWVFGVPPPP    CET
Subjt:  ARGIESFFSCMTDLGYLPPSLNFKRADVIVFSISTSVIMHCYAQEREVFRSKYLNVLDWVFGVPPPP----CET

AT1G34630.2 FUNCTIONS IN: molecular_function unknown (e value: 3.8e-128; identity: 61.58%)
Query:  SSDSYCEHCSCGSAD------SSSFPSFSCSSSSLWLDSTRLREYGKLWRILVASAKGFTIGAGLKGGLSLFSVLAGLKRRKALASLGKK-GVITNRDAI
        ++ S C  C     D      S  F    C +S     S  + +  KL RI+VAS KGFTIG GLKGGL++FS++A   RR+  +   +K G  +N +AI
Subjt:  SSDSYCEHCSCGSAD------SSSFPSFSCSSSSLWLDSTRLREYGKLWRILVASAKGFTIGAGLKGGLSLFSVLAGLKRRKALASLGKK-GVITNRDAI

Query:  SMALKETLRYGLFLGTFAGTFVSIDEIIGNLAGHRRTARWRALLAGALAGPSMLLTGLNTQHKTLAIYIFMRAAVLASRCGIKSKRLGHICKPLTWSYGD
        +M +KETLRYGLFLGTFAGTFVS+DE I  LAG +RTA+WRAL AG +AGPSMLLTG NTQH +LA+YI MRAAVLASRCGIKSKR G ICKPLTW +GD
Subjt:  SMALKETLRYGLFLGTFAGTFVSIDEIIGNLAGHRRTARWRALLAGALAGPSMLLTGLNTQHKTLAIYIFMRAAVLASRCGIKSKRLGHICKPLTWSYGD

Query:  IFLMCLSSSQILSAYVLKQDSLPPSFRSFLNTHGGKDTIILEGLKSFVSGMPSSNKFKAIEKYYSAMGAAVKLDPQMKTPCTIIHGNQSCGGHFLSFLIQ
        +FLMCLSSSQILSAY+LKQ+SLP S++SFLN  GGKD  IL+G+K   +  P +N  +AIEKYY ++G  +KLDP MK PCTIIHGN+SC  H ++F +Q
Subjt:  IFLMCLSSSQILSAYVLKQDSLPPSFRSFLNTHGGKDTIILEGLKSFVSGMPSSNKFKAIEKYYSAMGAAVKLDPQMKTPCTIIHGNQSCGGHFLSFLIQ

Query:  GYKRALPVYLPVYLIPALIVHREGLMNRPYEILARGLLGTARSSLFLSAYCASAWMWTCLTSRTFKKINIPLVALATFLT
         YKRALPVY+PVYLIPALIVHR+ L+ + Y IL +GLLGTARSSLFL+ YC+SAW WTCL  RTF+  NIPLVA+AT  T
Subjt:  GYKRALPVYLPVYLIPALIVHREGLMNRPYEILARGLLGTARSSLFLSAYCASAWMWTCLTSRTFKKINIPLVALATFLT

AT5G51150.1 Mitochondrial import inner membrane translocase subunit Tim17/Tim22/Tim23 family protein (e value: 9.8e-28; identity: 25.26%)
Query:  KGFTIGAGLKGGLSLFSVLAGLKRRKALAS-LGKKGVITNRDAISMALKETLRYGLFLGTFAGTFVSIDEIIGNLAGHRRTARWRALLAGALAGPSMLLT
        + F +  G++ G+ +      L R ++ +S L  K +++ +D I    +E  R GL  G F G++ ++   +      ++     ++LAG++AG S+L  
Subjt:  KGFTIGAGLKGGLSLFSVLAGLKRRKALAS-LGKKGVITNRDAISMALKETLRYGLFLGTFAGTFVSIDEIIGNLAGHRRTARWRALLAGALAGPSMLLT

Query:  GLNTQHKTLAIYIFMRAAVLASRCGIKSKRLGHICKPLTWSYGDIFLMCLSSSQILSAYVLKQDSLPPSFRSFLNTHGGKDTIILEGLKSFVSGMPSSNK
          + Q +TLA+Y+  R    A     KSK   H+     W +GD  L  L+ +Q++ +++++ ++LP S+R F+   G     + + ++    G P    
Subjt:  GLNTQHKTLAIYIFMRAAVLASRCGIKSKRLGHICKPLTWSYGDIFLMCLSSSQILSAYVLKQDSLPPSFRSFLNTHGGKDTIILEGLKSFVSGMPSSNK

Query:  FKAIEKYYSAMGAA--VKLDPQMK-TPCTIIHGN-QSCGGHFLSFLIQGYKRALPVYLPVYLIPALIVHREGLMNRPYEILARGLLGTARSSLFLSAYCA
          ++  Y S+   A  VK++      PC  IH N  SC     + +   +K+  P+Y  +  +P +++H +  M  PY      +  + RS+ FLSA+  
Subjt:  FKAIEKYYSAMGAA--VKLDPQMK-TPCTIIHGN-QSCGGHFLSFLIQGYKRALPVYLPVYLIPALIVHREGLMNRPYEILARGLLGTARSSLFLSAYCA

Query:  SAWMWTCLTSRTFKKINIPLVALATFLTGLALAIEKKSRRIEISLYCLARGIESFFSCMTDLGYLPPSLNFKRADVIVFSISTSVIMH
            + C   +   K +  +   A     L++ +EKK RR E++LY L R  +S +  + +   LP   + K A+V +F      IM+
Subjt:  SAWMWTCLTSRTFKKINIPLVALATFLTGLALAIEKKSRRIEISLYCLARGIESFFSCMTDLGYLPPSLNFKRADVIVFSISTSVIMH
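
In the alignments above, the unlabelled middle line follows the usual BLAST convention: a letter marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. As a rough sketch, identity and positive counts can be recovered from that line alone. The function name `midline_stats` is hypothetical, and whether the database derives its % identity column in exactly this way is an assumption.

```python
def midline_stats(midline: str) -> tuple[int, int, int]:
    """Return (identities, positives, aligned_length) for a BLAST midline.

    Letters count as identities; '+' counts as a positive (similar) column;
    spaces are mismatches or gaps. The midline must keep its full width,
    including any trailing spaces.
    """
    identities = sum(ch.isalpha() for ch in midline)
    positives = identities + midline.count("+")
    return identities, positives, len(midline)


# First 20 columns of the first GenBank alignment on this page.
query = "MSPSAGGGFDADGGCTCLAQ"
midline = "MSPSAGG FDA+GGC CLAQ"

identities, positives, length = midline_stats(midline)
assert len(query) == length  # query and midline must cover the same columns

percent_identity = 100 * identities / length
print(identities, positives, length, percent_identity)  # 17 18 20 85.0
```

For a full hit, the midlines of all blocks would be concatenated before counting, so that the percentage is taken over the whole alignment rather than per block.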


Sequences
CDS sequence
ATGTCACCGTCTGCTGGCGGAGGCTTCGACGCCGACGGCGGATGTACATGCCTTGCTCAACAGAATGGGGATGCCGAAGCAGCTGCCAACTGTAAATCCAGTGATTCCTA
TTGCGAGCATTGTAGCTGTGGTTCAGCGGATTCATCATCTTTTCCGTCGTTTTCCTGCTCTTCCTCTTCTCTATGGCTTGATTCGACGCGGTTAAGAGAGTACGGGAAGC
TATGGCGGATCCTTGTTGCTTCTGCTAAAGGCTTCACAATTGGAGCTGGTCTCAAAGGTGGTCTCTCTCTCTTTTCTGTCCTCGCTGGATTGAAGCGGAGAAAAGCTTTG
GCTTCCCTCGGGAAGAAAGGAGTGATTACGAATCGAGATGCGATTTCCATGGCTTTGAAGGAGACTTTGAGATACGGCCTTTTTCTTGGAACCTTTGCTGGTACGTTCGT
TTCCATTGATGAGATAATTGGCAATCTGGCAGGTCACCGTAGGACTGCAAGATGGCGGGCTCTATTGGCGGGAGCATTAGCTGGGCCGTCGATGCTTCTGACTGGGTTAA
ATACGCAACATAAGACCTTGGCTATCTACATTTTCATGCGTGCTGCGGTCTTGGCATCGCGCTGTGGGATTAAAAGCAAGAGGCTCGGGCATATTTGTAAGCCTCTCACG
TGGTCATATGGCGACATCTTCCTTATGTGTCTCTCCTCTTCGCAGATCTTGTCTGCTTATGTGTTAAAGCAAGATAGCTTACCACCATCATTTAGGTCCTTTCTCAATAC
ACATGGTGGAAAGGATACCATAATCTTGGAAGGTTTAAAGAGTTTCGTATCAGGCATGCCTTCCTCCAATAAATTTAAAGCAATAGAGAAGTACTACAGTGCCATGGGTG
CGGCCGTCAAATTAGATCCGCAAATGAAGACTCCATGCACGATTATACATGGAAATCAATCATGTGGTGGCCATTTTCTTTCCTTTCTCATTCAAGGATATAAAAGAGCG
TTGCCAGTATACCTCCCTGTTTATCTTATCCCAGCTCTAATAGTTCACCGTGAAGGTCTCATGAATAGGCCATATGAAATTTTAGCTAGGGGACTTCTTGGAACTGCTAG
ATCAAGTCTGTTCCTTTCTGCATATTGTGCATCTGCTTGGATGTGGACATGCCTGACCTCAAGGACTTTCAAGAAAATAAATATTCCATTGGTTGCTCTAGCAACGTTCT
TAACAGGTTTGGCACTGGCCATTGAGAAGAAAAGCAGGAGGATAGAAATCTCACTCTATTGCCTTGCTAGAGGTATTGAGAGCTTCTTCAGCTGCATGACGGATTTGGGA
TACTTGCCACCATCGTTGAATTTCAAACGAGCGGATGTGATAGTTTTCAGCATATCAACTTCCGTTATAATGCATTGCTATGCTCAGGAAAGGGAGGTATTTCGATCCAA
GTATCTTAACGTTCTCGATTGGGTTTTTGGTGTGCCTCCTCCCCCATGTGAAACACCACGCTGCAAGAATGGCAAGAAGTATTGA
mRNA sequence
ATTTTTACCATTAAAAATTTAGTGGAATTATAATTGTGCCATTTTTTGTAGTCAATGTAAGACACTTAATTTAGTCATACTAAATAGAGCGAAAACGAAGAAGAAGAAAG
TATAAAGTTAAAAATGGATTAGGACTTAAGAGTGAGATTTCACGTTAATTAATTCACGCTTCACGCTTCACGCTTAACAAAATAATTAGCACAAACCTTCTCTGTTTTAT
CTCTCAATTCCTTTATTCTGCCATTTTCGGTTCTCTCACTTCCAATCCTACTCAGCTCACGTTTGTAAGGAGGAGCCCTAGCCTCTGCTTCCGGGACCATCATGTCACCG
TCTGCTGGCGGAGGCTTCGACGCCGACGGCGGATGTACATGCCTTGCTCAACAGAATGGGGATGCCGAAGCAGCTGCCAACTGTAAATCCAGTGATTCCTATTGCGAGCA
TTGTAGCTGTGGTTCAGCGGATTCATCATCTTTTCCGTCGTTTTCCTGCTCTTCCTCTTCTCTATGGCTTGATTCGACGCGGTTAAGAGAGTACGGGAAGCTATGGCGGA
TCCTTGTTGCTTCTGCTAAAGGCTTCACAATTGGAGCTGGTCTCAAAGGTGGTCTCTCTCTCTTTTCTGTCCTCGCTGGATTGAAGCGGAGAAAAGCTTTGGCTTCCCTC
GGGAAGAAAGGAGTGATTACGAATCGAGATGCGATTTCCATGGCTTTGAAGGAGACTTTGAGATACGGCCTTTTTCTTGGAACCTTTGCTGGTACGTTCGTTTCCATTGA
TGAGATAATTGGCAATCTGGCAGGTCACCGTAGGACTGCAAGATGGCGGGCTCTATTGGCGGGAGCATTAGCTGGGCCGTCGATGCTTCTGACTGGGTTAAATACGCAAC
ATAAGACCTTGGCTATCTACATTTTCATGCGTGCTGCGGTCTTGGCATCGCGCTGTGGGATTAAAAGCAAGAGGCTCGGGCATATTTGTAAGCCTCTCACGTGGTCATAT
GGCGACATCTTCCTTATGTGTCTCTCCTCTTCGCAGATCTTGTCTGCTTATGTGTTAAAGCAAGATAGCTTACCACCATCATTTAGGTCCTTTCTCAATACACATGGTGG
AAAGGATACCATAATCTTGGAAGGTTTAAAGAGTTTCGTATCAGGCATGCCTTCCTCCAATAAATTTAAAGCAATAGAGAAGTACTACAGTGCCATGGGTGCGGCCGTCA
AATTAGATCCGCAAATGAAGACTCCATGCACGATTATACATGGAAATCAATCATGTGGTGGCCATTTTCTTTCCTTTCTCATTCAAGGATATAAAAGAGCGTTGCCAGTA
TACCTCCCTGTTTATCTTATCCCAGCTCTAATAGTTCACCGTGAAGGTCTCATGAATAGGCCATATGAAATTTTAGCTAGGGGACTTCTTGGAACTGCTAGATCAAGTCT
GTTCCTTTCTGCATATTGTGCATCTGCTTGGATGTGGACATGCCTGACCTCAAGGACTTTCAAGAAAATAAATATTCCATTGGTTGCTCTAGCAACGTTCTTAACAGGTT
TGGCACTGGCCATTGAGAAGAAAAGCAGGAGGATAGAAATCTCACTCTATTGCCTTGCTAGAGGTATTGAGAGCTTCTTCAGCTGCATGACGGATTTGGGATACTTGCCA
CCATCGTTGAATTTCAAACGAGCGGATGTGATAGTTTTCAGCATATCAACTTCCGTTATAATGCATTGCTATGCTCAGGAAAGGGAGGTATTTCGATCCAAGTATCTTAA
CGTTCTCGATTGGGTTTTTGGTGTGCCTCCTCCCCCATGTGAAACACCACGCTGCAAGAATGGCAAGAAGTATTGAAAGAAGCCTTGGATCCTGGATCTCTGACGAACTA
ACACACTGGATTTGAATTGAAGGTACCAAATATTTGCCAGTAATGTTTAGTCAAGAATTTTCATACTACATAATATTTTAATGTGTAGCTACCATTACCTGCTATAGAAT
TAGGGTTTAATCAGACTCATTTGATCACATGATTCATACGGAGATGTTTCCGATATACACATTGGTTTAATGGACGATTTTTCGAATTTGTTAAATAACAGTCAACGGAT
AGCATCGTTGATGATTAAATTACGAAGTATAGTTTCTTAGATTTTAGACTTGCGTCTGTAATCTCCAATACTGCAGTATATTTGCAACATTAATAATTCATTAAAAGTAA
TAATTAACAGGGAATTCCAATA
Protein sequence
MSPSAGGGFDADGGCTCLAQQNGDAEAAANCKSSDSYCEHCSCGSADSSSFPSFSCSSSSLWLDSTRLREYGKLWRILVASAKGFTIGAGLKGGLSLFSVLAGLKRRKAL
ASLGKKGVITNRDAISMALKETLRYGLFLGTFAGTFVSIDEIIGNLAGHRRTARWRALLAGALAGPSMLLTGLNTQHKTLAIYIFMRAAVLASRCGIKSKRLGHICKPLT
WSYGDIFLMCLSSSQILSAYVLKQDSLPPSFRSFLNTHGGKDTIILEGLKSFVSGMPSSNKFKAIEKYYSAMGAAVKLDPQMKTPCTIIHGNQSCGGHFLSFLIQGYKRA
LPVYLPVYLIPALIVHREGLMNRPYEILARGLLGTARSSLFLSAYCASAWMWTCLTSRTFKKINIPLVALATFLTGLALAIEKKSRRIEISLYCLARGIESFFSCMTDLG
YLPPSLNFKRADVIVFSISTSVIMHCYAQEREVFRSKYLNVLDWVFGVPPPPCETPRCKNGKKY
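
The protein sequence above is the conceptual translation of the CDS. A minimal sketch of that translation, using only the Python standard library, is shown below. It assumes the standard nuclear genetic code; the names `CODON_TABLE` and `translate` are illustrative and not part of any database tooling.

```python
BASES = "TCAG"
# Standard genetic code, codons enumerated with bases in T, C, A, G order.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds: str, to_stop: bool = True) -> str:
    """Translate a coding sequence codon by codon.

    Unknown codons (for example those containing N) become 'X'.
    With to_stop=True, translation ends at the first stop codon.
    """
    cds = cds.upper().replace("U", "T")
    protein = []
    for pos in range(0, len(cds) - len(cds) % 3, 3):
        amino_acid = CODON_TABLE.get(cds[pos:pos + 3], "X")
        if amino_acid == "*" and to_stop:
            break
        protein.append(amino_acid)
    return "".join(protein)


# First and last codons of the CDS listed on this page.
print(translate("ATGTCACCGTCTGCTGGCGGAGGC"))  # MSPSAGGG
print(translate("AATGGCAAGAAGTATTGA"))        # NGKKY
```

Passing the full CDS, with its line breaks removed, should reproduce the protein sequence listed above if the standard code applies.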