; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CSPI02G21000 (gene) of Cucumber (PI 183967) v1 genome

Gene IDCSPI02G21000
OrganismCucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
DescriptionTMEM135_C_rich domain-containing protein
Genome locationChr2:18511138..18516442
RNA-Seq ExpressionCSPI02G21000
SyntenyCSPI02G21000
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR026749 - Transmembrane protein 135


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008444833.1 PREDICTED: uncharacterized protein LOC103488060 [Cucumis melo]3.3e-28398.01Show/hide
Query:  MSPSAGGCFDADGGCACLAQQNGDAETAANCKSGDSYCEHCSYGSADSSSFPSFSCSSSSLWLDSTRLREYGKLSRILVASAKGFTIGAGLKGGLSLFSV
        MSPSAGGCFDA+GGCACLAQQNGDAETAANCKSGDSYCEHC YGSADSSSFPSFSCSSSSLWLDSTRLREYGKL RILVASAKGFTIGAGLKGGLSLFSV
Subjt:  MSPSAGGCFDADGGCACLAQQNGDAETAANCKSGDSYCEHCSYGSADSSSFPSFSCSSSSLWLDSTRLREYGKLSRILVASAKGFTIGAGLKGGLSLFSV

Query:  LAGLKRRKALASLGKKGVITNRDAISMALKETLRYGLFLGTFAGTFVSIDEIIGNLAGHRRTARWRALLAGALAGPSMLLTGLNTQHKTLAIYIFMRAAV
        LAGLKRRKALASLGKKGVITNRDAISMALKETLRYGLFLGTFAGTFVSIDEIIGNLAGHRRTARWRALLAGALAGPSMLLTGLNTQHKTLAIYIFMRAAV
Subjt:  LAGLKRRKALASLGKKGVITNRDAISMALKETLRYGLFLGTFAGTFVSIDEIIGNLAGHRRTARWRALLAGALAGPSMLLTGLNTQHKTLAIYIFMRAAV

Query:  LASRCGIKSKRLGHICKPLTWSCGDIFLMCLSSSQILSAYVLKQDSLPPSFRSFLNTHGGKDTVILEGLKSFVSGMPSSNKFKAVEKYYSAMGSTVKLDP
        LASRCGIKSKRLGHICKPLTWS GDIFLMCLSSSQILSAYVLKQDSLPPSFRSFLNTHGGKDTVILEGLKSFVSGMPSS+KFKA+EKYYSAMG+ VKLDP
Subjt:  LASRCGIKSKRLGHICKPLTWSCGDIFLMCLSSSQILSAYVLKQDSLPPSFRSFLNTHGGKDTVILEGLKSFVSGMPSSNKFKAVEKYYSAMGSTVKLDP

Query:  QMKTPCTIIHGNQSCGGHFLSFLIQGYKRALPVYLPVYLIPALIVHREGLMNRPYEILARGLLGTARSSLFLSAYCASAWMWTCLTSRTFKKINIPLVAL
        QMKTPCTIIHGNQSCGGHFLSFLIQGYKRALPVYLPVYLIPALIVHREGLMNRPYEILARGLLGTARSSLFLSAYCASAWMWTCLTSRTFKKINIPLVAL
Subjt:  QMKTPCTIIHGNQSCGGHFLSFLIQGYKRALPVYLPVYLIPALIVHREGLMNRPYEILARGLLGTARSSLFLSAYCASAWMWTCLTSRTFKKINIPLVAL

Query:  ATFLTGLALAIEKKSRRIEISLYCLSRGIESFFSCMTDLGYLPPSLNFKRADVIVFSISTSIIMHCYAQEREVFRSKYLNVLDWVFGVPPPPCETPRCKN
        ATFLTGLALAIEKKSRRIEISLYCL+RGIESFFSCMTDLGYLPPSLNFKRADVIVFSISTSIIMHCYAQEREVFRSKYLNVLDWVFGVPPPPCETPRCKN
Subjt:  ATFLTGLALAIEKKSRRIEISLYCLSRGIESFFSCMTDLGYLPPSLNFKRADVIVFSISTSIIMHCYAQEREVFRSKYLNVLDWVFGVPPPPCETPRCKN

Query:  GNK
        G K
Subjt:  GNK

XP_011649669.1 uncharacterized protein LOC101202879 isoform X1 [Cucumis sativus]6.1e-29099.8Show/hide
Query:  MSPSAGGCFDADGGCACLAQQNGDAETAANCKSGDSYCEHCSYGSADSSSFPSFSCSSSSLWLDSTRLREYGKLSRILVASAKGFTIGAGLKGGLSLFSV
        MSPSAGGCFDADGGCACLAQQNGDAETAANCKSGDSYCEHCSYGSADSSSFPSFSCSSSSLWLDSTRLREYGKLSRILVASAKGFTIGAGLKGGLSLFSV
Subjt:  MSPSAGGCFDADGGCACLAQQNGDAETAANCKSGDSYCEHCSYGSADSSSFPSFSCSSSSLWLDSTRLREYGKLSRILVASAKGFTIGAGLKGGLSLFSV

Query:  LAGLKRRKALASLGKKGVITNRDAISMALKETLRYGLFLGTFAGTFVSIDEIIGNLAGHRRTARWRALLAGALAGPSMLLTGLNTQHKTLAIYIFMRAAV
        LAGLKRRKALASLGKKGVITNRDAISMALKETLRYGLFLGTFAGTFVSIDEIIGN+AGHRRTARWRALLAGALAGPSMLLTGLNTQHKTLAIYIFMRAAV
Subjt:  LAGLKRRKALASLGKKGVITNRDAISMALKETLRYGLFLGTFAGTFVSIDEIIGNLAGHRRTARWRALLAGALAGPSMLLTGLNTQHKTLAIYIFMRAAV

Query:  LASRCGIKSKRLGHICKPLTWSCGDIFLMCLSSSQILSAYVLKQDSLPPSFRSFLNTHGGKDTVILEGLKSFVSGMPSSNKFKAVEKYYSAMGSTVKLDP
        LASRCGIKSKRLGHICKPLTWSCGDIFLMCLSSSQILSAYVLKQDSLPPSFRSFLNTHGGKDTVILEGLKSFVSGMPSSNKFKAVEKYYSAMGSTVKLDP
Subjt:  LASRCGIKSKRLGHICKPLTWSCGDIFLMCLSSSQILSAYVLKQDSLPPSFRSFLNTHGGKDTVILEGLKSFVSGMPSSNKFKAVEKYYSAMGSTVKLDP

Query:  QMKTPCTIIHGNQSCGGHFLSFLIQGYKRALPVYLPVYLIPALIVHREGLMNRPYEILARGLLGTARSSLFLSAYCASAWMWTCLTSRTFKKINIPLVAL
        QMKTPCTIIHGNQSCGGHFLSFLIQGYKRALPVYLPVYLIPALIVHREGLMNRPYEILARGLLGTARSSLFLSAYCASAWMWTCLTSRTFKKINIPLVAL
Subjt:  QMKTPCTIIHGNQSCGGHFLSFLIQGYKRALPVYLPVYLIPALIVHREGLMNRPYEILARGLLGTARSSLFLSAYCASAWMWTCLTSRTFKKINIPLVAL

Query:  ATFLTGLALAIEKKSRRIEISLYCLSRGIESFFSCMTDLGYLPPSLNFKRADVIVFSISTSIIMHCYAQEREVFRSKYLNVLDWVFGVPPPPCETPRCKN
        ATFLTGLALAIEKKSRRIEISLYCLSRGIESFFSCMTDLGYLPPSLNFKRADVIVFSISTSIIMHCYAQEREVFRSKYLNVLDWVFGVPPPPCETPRCKN
Subjt:  ATFLTGLALAIEKKSRRIEISLYCLSRGIESFFSCMTDLGYLPPSLNFKRADVIVFSISTSIIMHCYAQEREVFRSKYLNVLDWVFGVPPPPCETPRCKN

Query:  GNKC
        GNKC
Subjt:  GNKC

XP_022961964.1 uncharacterized protein LOC111462580 isoform X1 [Cucurbita moschata]3.0e-26090.26Show/hide
Query:  MSPSAGGCFDADGGCACLAQQNGDAETAANCKSGDSYCEHCSYGSADSSSFPSFSCSSSSLWLDSTRLREYGKLSRILVASAKGFTIGAGLKGGLSLFSV
        MSPSA GCF ADGGCACLA++NGD     NCKS DSYC+HC  GSAD SS P FSCSSSSLW DSTRL E GKL RILVASAKGFTIGAGLKGGLSLFSV
Subjt:  MSPSAGGCFDADGGCACLAQQNGDAETAANCKSGDSYCEHCSYGSADSSSFPSFSCSSSSLWLDSTRLREYGKLSRILVASAKGFTIGAGLKGGLSLFSV

Query:  LAGLKRRKALASLGKKGVITNRDAISMALKETLRYGLFLGTFAGTFVSIDEIIGNLAGHRRTARWRALLAGALAGPSMLLTGLNTQHKTLAIYIFMRAAV
        LAGLKRRKALASLGKKGVITNRDAISMALKETLRYGLFLGTFAGTFVSIDEIIGNLAGHRRTARWRAL+AGALAGPSMLLTGLNTQHKTLAIYIFMRAAV
Subjt:  LAGLKRRKALASLGKKGVITNRDAISMALKETLRYGLFLGTFAGTFVSIDEIIGNLAGHRRTARWRALLAGALAGPSMLLTGLNTQHKTLAIYIFMRAAV

Query:  LASRCGIKSKRLGHICKPLTWSCGDIFLMCLSSSQILSAYVLKQDSLPPSFRSFLNTHGGKDTVILEGLKSFVSGMPSSNKFKAVEKYYSAMGSTVKLDP
        LASRCGI+SKRLGHICKPLTWS GDIFLMCLSSSQILSAYVLKQDSLPPSFRSFLNTHGGKDTVILEGLKSF+ GMP SNKF A+EKYY   G+ VKLDP
Subjt:  LASRCGIKSKRLGHICKPLTWSCGDIFLMCLSSSQILSAYVLKQDSLPPSFRSFLNTHGGKDTVILEGLKSFVSGMPSSNKFKAVEKYYSAMGSTVKLDP

Query:  QMKTPCTIIHGNQSCGGHFLSFLIQGYKRALPVYLPVYLIPALIVHREGLMNRPYEILARGLLGTARSSLFLSAYCASAWMWTCLTSRTFKKINIPLVAL
         MKTPCTIIHGNQSCGGHFLSF+I+GYKRALPVYLPVYLIPALIVHR+GLMNRPYEILARGLLGTARSSLFLS YCASAW+WTCLT+RTF+KIN+PLVA+
Subjt:  QMKTPCTIIHGNQSCGGHFLSFLIQGYKRALPVYLPVYLIPALIVHREGLMNRPYEILARGLLGTARSSLFLSAYCASAWMWTCLTSRTFKKINIPLVAL

Query:  ATFLTGLALAIEKKSRRIEISLYCLSRGIESFFSCMTDLGYLPPSLNFKRADVIVFSISTSIIMHCYAQEREVFRSKYLNVLDWVFGVPPPPCETPRCKN
        ATFLTGLALAIEKKSRRIEISLYCL+RGIESFFSCMTDLGYLP SLNFKRADVIVFS+STSIIMHCYAQER VFRSKYLNVLDWVFGVPPPPCETPRCKN
Subjt:  ATFLTGLALAIEKKSRRIEISLYCLSRGIESFFSCMTDLGYLPPSLNFKRADVIVFSISTSIIMHCYAQEREVFRSKYLNVLDWVFGVPPPPCETPRCKN

Query:  GNK
        G +
Subjt:  GNK

XP_023545566.1 uncharacterized protein LOC111804956 isoform X1 [Cucurbita pepo subsp. pepo]2.0e-26490.85Show/hide
Query:  MSPSAGGCFDADGGCACLAQQNGDAETAANCKSGDSYCEHCSYGSADSSSFPSFSCSSSSLWLDSTRLREYGKLSRILVASAKGFTIGAGLKGGLSLFSV
        MSPSA GCF ADGGCACLA++NGD     NCKSGDSYC+HC  GSAD SS P+FSCSSSSLW DS RLRE GKL RILVASAKGFTIGAGLKGGLSLFSV
Subjt:  MSPSAGGCFDADGGCACLAQQNGDAETAANCKSGDSYCEHCSYGSADSSSFPSFSCSSSSLWLDSTRLREYGKLSRILVASAKGFTIGAGLKGGLSLFSV

Query:  LAGLKRRKALASLGKKGVITNRDAISMALKETLRYGLFLGTFAGTFVSIDEIIGNLAGHRRTARWRALLAGALAGPSMLLTGLNTQHKTLAIYIFMRAAV
        LAGLKRRKALASLGKKGVITNRDA+SMALKETLRYGLFLGTFAGTFVSIDEIIGNLAGHRRTARWRAL+AGALAGPSMLLTGLNTQHKTLAIYIFMRAAV
Subjt:  LAGLKRRKALASLGKKGVITNRDAISMALKETLRYGLFLGTFAGTFVSIDEIIGNLAGHRRTARWRALLAGALAGPSMLLTGLNTQHKTLAIYIFMRAAV

Query:  LASRCGIKSKRLGHICKPLTWSCGDIFLMCLSSSQILSAYVLKQDSLPPSFRSFLNTHGGKDTVILEGLKSFVSGMPSSNKFKAVEKYYSAMGSTVKLDP
        LASRCGI+SKRLGHICKPLTWS GDIFLMCLSSSQILSAYVLKQDSLPPSFRSFLNTHGGKDTVILEGLKSF+ GMPS NKF A+EKYY   G+ VKLDP
Subjt:  LASRCGIKSKRLGHICKPLTWSCGDIFLMCLSSSQILSAYVLKQDSLPPSFRSFLNTHGGKDTVILEGLKSFVSGMPSSNKFKAVEKYYSAMGSTVKLDP

Query:  QMKTPCTIIHGNQSCGGHFLSFLIQGYKRALPVYLPVYLIPALIVHREGLMNRPYEILARGLLGTARSSLFLSAYCASAWMWTCLTSRTFKKINIPLVAL
         MKTPCTIIHGNQSCGGHFLSF+IQGYKRALPVYLPVYLIPALIVHR+GLMNRPYEILARGLLGTARSSLFLS YCASAW+WTCLT+RTF+KIN+PLVA+
Subjt:  QMKTPCTIIHGNQSCGGHFLSFLIQGYKRALPVYLPVYLIPALIVHREGLMNRPYEILARGLLGTARSSLFLSAYCASAWMWTCLTSRTFKKINIPLVAL

Query:  ATFLTGLALAIEKKSRRIEISLYCLSRGIESFFSCMTDLGYLPPSLNFKRADVIVFSISTSIIMHCYAQEREVFRSKYLNVLDWVFGVPPPPCETPRCKN
        ATFLTGLALAIEKKSRRIEISLYCL+RGIESFFSCMTDLGYLPPSLNFKRADVIVFS+STSIIMHCYAQEREVFRSKYLNVLDWVFGVPPPPCETPRCKN
Subjt:  ATFLTGLALAIEKKSRRIEISLYCLSRGIESFFSCMTDLGYLPPSLNFKRADVIVFSISTSIIMHCYAQEREVFRSKYLNVLDWVFGVPPPPCETPRCKN

Query:  GNK
        G +
Subjt:  GNK

XP_038886157.1 uncharacterized protein LOC120076412 isoform X1 [Benincasa hispida]1.1e-26793.44Show/hide
Query:  MSPSAGGCFDADGGCACLAQQNGDAETAANCKSGDSYCEHCSYGSADSSSFPSFSCSSSSLWLDSTRLREYGKLSRILVASAKGFTIGAGLKGGLSLFSV
        MSPSA G FDADGGC C A QNGDA+  AN KSGDSYCEHC  GSADSSSFPSFSCSSSSLWLDSTRLRE GKL RILVASAKGFTIGAGLKGGLSLFSV
Subjt:  MSPSAGGCFDADGGCACLAQQNGDAETAANCKSGDSYCEHCSYGSADSSSFPSFSCSSSSLWLDSTRLREYGKLSRILVASAKGFTIGAGLKGGLSLFSV

Query:  LAGLKRRKALASLGKKGVITNRDAISMALKETLRYGLFLGTFAGTFVSIDEIIGNLAGHRRTARWRALLAGALAGPSMLLTGLNTQHKTLAIYIFMRAAV
        LAGLKRRKALASLGKKGVITNRDAISMALKETLRYGLFLGTFAGTFVSIDEI+GNL GHRRTA WRALLAGALAGPSMLLTGLNTQHKTLAIYIFMRAAV
Subjt:  LAGLKRRKALASLGKKGVITNRDAISMALKETLRYGLFLGTFAGTFVSIDEIIGNLAGHRRTARWRALLAGALAGPSMLLTGLNTQHKTLAIYIFMRAAV

Query:  LASRCGIKSKRLGHICKPLTWSCGDIFLMCLSSSQILSAYVLKQDSLPPSFRSFLNTHGGKDTVILEGLKSFVSGMPSSNKFKAVEKYYSAMGSTVKLDP
        LASRCGIKSKR GHICKPLTWS GDIFLMCLSSSQILSAYVLKQDSLPPSFRSFLNTHGGKDTVILEGLKSFVSGMPSSNKFKA+EKYYSAMG+ V+L+ 
Subjt:  LASRCGIKSKRLGHICKPLTWSCGDIFLMCLSSSQILSAYVLKQDSLPPSFRSFLNTHGGKDTVILEGLKSFVSGMPSSNKFKAVEKYYSAMGSTVKLDP

Query:  QMKTPCTIIHGNQSCGGHFLSFLIQGYKRALPVYLPVYLIPALIVHREGLMNRPYEILARGLLGTARSSLFLSAYCASAWMWTCLTSRTFKKINIPLVAL
        QMKTPC IIHGNQSCGGHFLSFLI+GYKRALPVYLPVYLIPALIVHREGLMNRPYEILARGLLGTARSSLFLSAYCASAWMWTCLT+R+FKKINIPLVA+
Subjt:  QMKTPCTIIHGNQSCGGHFLSFLIQGYKRALPVYLPVYLIPALIVHREGLMNRPYEILARGLLGTARSSLFLSAYCASAWMWTCLTSRTFKKINIPLVAL

Query:  ATFLTGLALAIEKKSRRIEISLYCLSRGIESFFSCMTDLGYLPPSLNFKRADVIVFSISTSIIMHCYAQEREVFRSKYLNVLDWVFGVPPPPCETPRCKN
        ATFLTGLALAIEKKSRRIEISLYCL+RGIESFFS MTDLGYLPPSLNFKRADVIVFSISTSIIMHCYAQEREVFRSKYLNVLDWVFGVPPPPCETPRCKN
Subjt:  ATFLTGLALAIEKKSRRIEISLYCLSRGIESFFSCMTDLGYLPPSLNFKRADVIVFSISTSIIMHCYAQEREVFRSKYLNVLDWVFGVPPPPCETPRCKN

Query:  GNK
        G +
Subjt:  GNK

TrEMBL top hitse value%identityAlignment
A0A0A0LRQ1 Uncharacterized protein3.0e-29099.8Show/hide
Query:  MSPSAGGCFDADGGCACLAQQNGDAETAANCKSGDSYCEHCSYGSADSSSFPSFSCSSSSLWLDSTRLREYGKLSRILVASAKGFTIGAGLKGGLSLFSV
        MSPSAGGCFDADGGCACLAQQNGDAETAANCKSGDSYCEHCSYGSADSSSFPSFSCSSSSLWLDSTRLREYGKLSRILVASAKGFTIGAGLKGGLSLFSV
Subjt:  MSPSAGGCFDADGGCACLAQQNGDAETAANCKSGDSYCEHCSYGSADSSSFPSFSCSSSSLWLDSTRLREYGKLSRILVASAKGFTIGAGLKGGLSLFSV

Query:  LAGLKRRKALASLGKKGVITNRDAISMALKETLRYGLFLGTFAGTFVSIDEIIGNLAGHRRTARWRALLAGALAGPSMLLTGLNTQHKTLAIYIFMRAAV
        LAGLKRRKALASLGKKGVITNRDAISMALKETLRYGLFLGTFAGTFVSIDEIIGN+AGHRRTARWRALLAGALAGPSMLLTGLNTQHKTLAIYIFMRAAV
Subjt:  LAGLKRRKALASLGKKGVITNRDAISMALKETLRYGLFLGTFAGTFVSIDEIIGNLAGHRRTARWRALLAGALAGPSMLLTGLNTQHKTLAIYIFMRAAV

Query:  LASRCGIKSKRLGHICKPLTWSCGDIFLMCLSSSQILSAYVLKQDSLPPSFRSFLNTHGGKDTVILEGLKSFVSGMPSSNKFKAVEKYYSAMGSTVKLDP
        LASRCGIKSKRLGHICKPLTWSCGDIFLMCLSSSQILSAYVLKQDSLPPSFRSFLNTHGGKDTVILEGLKSFVSGMPSSNKFKAVEKYYSAMGSTVKLDP
Subjt:  LASRCGIKSKRLGHICKPLTWSCGDIFLMCLSSSQILSAYVLKQDSLPPSFRSFLNTHGGKDTVILEGLKSFVSGMPSSNKFKAVEKYYSAMGSTVKLDP

Query:  QMKTPCTIIHGNQSCGGHFLSFLIQGYKRALPVYLPVYLIPALIVHREGLMNRPYEILARGLLGTARSSLFLSAYCASAWMWTCLTSRTFKKINIPLVAL
        QMKTPCTIIHGNQSCGGHFLSFLIQGYKRALPVYLPVYLIPALIVHREGLMNRPYEILARGLLGTARSSLFLSAYCASAWMWTCLTSRTFKKINIPLVAL
Subjt:  QMKTPCTIIHGNQSCGGHFLSFLIQGYKRALPVYLPVYLIPALIVHREGLMNRPYEILARGLLGTARSSLFLSAYCASAWMWTCLTSRTFKKINIPLVAL

Query:  ATFLTGLALAIEKKSRRIEISLYCLSRGIESFFSCMTDLGYLPPSLNFKRADVIVFSISTSIIMHCYAQEREVFRSKYLNVLDWVFGVPPPPCETPRCKN
        ATFLTGLALAIEKKSRRIEISLYCLSRGIESFFSCMTDLGYLPPSLNFKRADVIVFSISTSIIMHCYAQEREVFRSKYLNVLDWVFGVPPPPCETPRCKN
Subjt:  ATFLTGLALAIEKKSRRIEISLYCLSRGIESFFSCMTDLGYLPPSLNFKRADVIVFSISTSIIMHCYAQEREVFRSKYLNVLDWVFGVPPPPCETPRCKN

Query:  GNKC
        GNKC
Subjt:  GNKC

A0A1S3BAT4 uncharacterized protein LOC1034880601.6e-28398.01Show/hide
Query:  MSPSAGGCFDADGGCACLAQQNGDAETAANCKSGDSYCEHCSYGSADSSSFPSFSCSSSSLWLDSTRLREYGKLSRILVASAKGFTIGAGLKGGLSLFSV
        MSPSAGGCFDA+GGCACLAQQNGDAETAANCKSGDSYCEHC YGSADSSSFPSFSCSSSSLWLDSTRLREYGKL RILVASAKGFTIGAGLKGGLSLFSV
Subjt:  MSPSAGGCFDADGGCACLAQQNGDAETAANCKSGDSYCEHCSYGSADSSSFPSFSCSSSSLWLDSTRLREYGKLSRILVASAKGFTIGAGLKGGLSLFSV

Query:  LAGLKRRKALASLGKKGVITNRDAISMALKETLRYGLFLGTFAGTFVSIDEIIGNLAGHRRTARWRALLAGALAGPSMLLTGLNTQHKTLAIYIFMRAAV
        LAGLKRRKALASLGKKGVITNRDAISMALKETLRYGLFLGTFAGTFVSIDEIIGNLAGHRRTARWRALLAGALAGPSMLLTGLNTQHKTLAIYIFMRAAV
Subjt:  LAGLKRRKALASLGKKGVITNRDAISMALKETLRYGLFLGTFAGTFVSIDEIIGNLAGHRRTARWRALLAGALAGPSMLLTGLNTQHKTLAIYIFMRAAV

Query:  LASRCGIKSKRLGHICKPLTWSCGDIFLMCLSSSQILSAYVLKQDSLPPSFRSFLNTHGGKDTVILEGLKSFVSGMPSSNKFKAVEKYYSAMGSTVKLDP
        LASRCGIKSKRLGHICKPLTWS GDIFLMCLSSSQILSAYVLKQDSLPPSFRSFLNTHGGKDTVILEGLKSFVSGMPSS+KFKA+EKYYSAMG+ VKLDP
Subjt:  LASRCGIKSKRLGHICKPLTWSCGDIFLMCLSSSQILSAYVLKQDSLPPSFRSFLNTHGGKDTVILEGLKSFVSGMPSSNKFKAVEKYYSAMGSTVKLDP

Query:  QMKTPCTIIHGNQSCGGHFLSFLIQGYKRALPVYLPVYLIPALIVHREGLMNRPYEILARGLLGTARSSLFLSAYCASAWMWTCLTSRTFKKINIPLVAL
        QMKTPCTIIHGNQSCGGHFLSFLIQGYKRALPVYLPVYLIPALIVHREGLMNRPYEILARGLLGTARSSLFLSAYCASAWMWTCLTSRTFKKINIPLVAL
Subjt:  QMKTPCTIIHGNQSCGGHFLSFLIQGYKRALPVYLPVYLIPALIVHREGLMNRPYEILARGLLGTARSSLFLSAYCASAWMWTCLTSRTFKKINIPLVAL

Query:  ATFLTGLALAIEKKSRRIEISLYCLSRGIESFFSCMTDLGYLPPSLNFKRADVIVFSISTSIIMHCYAQEREVFRSKYLNVLDWVFGVPPPPCETPRCKN
        ATFLTGLALAIEKKSRRIEISLYCL+RGIESFFSCMTDLGYLPPSLNFKRADVIVFSISTSIIMHCYAQEREVFRSKYLNVLDWVFGVPPPPCETPRCKN
Subjt:  ATFLTGLALAIEKKSRRIEISLYCLSRGIESFFSCMTDLGYLPPSLNFKRADVIVFSISTSIIMHCYAQEREVFRSKYLNVLDWVFGVPPPPCETPRCKN

Query:  GNK
        G K
Subjt:  GNK

A0A5A7VHD2 Uncharacterized protein1.6e-28398.01Show/hide
Query:  MSPSAGGCFDADGGCACLAQQNGDAETAANCKSGDSYCEHCSYGSADSSSFPSFSCSSSSLWLDSTRLREYGKLSRILVASAKGFTIGAGLKGGLSLFSV
        MSPSAGGCFDA+GGCACLAQQNGDAETAANCKSGDSYCEHC YGSADSSSFPSFSCSSSSLWLDSTRLREYGKL RILVASAKGFTIGAGLKGGLSLFSV
Subjt:  MSPSAGGCFDADGGCACLAQQNGDAETAANCKSGDSYCEHCSYGSADSSSFPSFSCSSSSLWLDSTRLREYGKLSRILVASAKGFTIGAGLKGGLSLFSV

Query:  LAGLKRRKALASLGKKGVITNRDAISMALKETLRYGLFLGTFAGTFVSIDEIIGNLAGHRRTARWRALLAGALAGPSMLLTGLNTQHKTLAIYIFMRAAV
        LAGLKRRKALASLGKKGVITNRDAISMALKETLRYGLFLGTFAGTFVSIDEIIGNLAGHRRTARWRALLAGALAGPSMLLTGLNTQHKTLAIYIFMRAAV
Subjt:  LAGLKRRKALASLGKKGVITNRDAISMALKETLRYGLFLGTFAGTFVSIDEIIGNLAGHRRTARWRALLAGALAGPSMLLTGLNTQHKTLAIYIFMRAAV

Query:  LASRCGIKSKRLGHICKPLTWSCGDIFLMCLSSSQILSAYVLKQDSLPPSFRSFLNTHGGKDTVILEGLKSFVSGMPSSNKFKAVEKYYSAMGSTVKLDP
        LASRCGIKSKRLGHICKPLTWS GDIFLMCLSSSQILSAYVLKQDSLPPSFRSFLNTHGGKDTVILEGLKSFVSGMPSS+KFKA+EKYYSAMG+ VKLDP
Subjt:  LASRCGIKSKRLGHICKPLTWSCGDIFLMCLSSSQILSAYVLKQDSLPPSFRSFLNTHGGKDTVILEGLKSFVSGMPSSNKFKAVEKYYSAMGSTVKLDP

Query:  QMKTPCTIIHGNQSCGGHFLSFLIQGYKRALPVYLPVYLIPALIVHREGLMNRPYEILARGLLGTARSSLFLSAYCASAWMWTCLTSRTFKKINIPLVAL
        QMKTPCTIIHGNQSCGGHFLSFLIQGYKRALPVYLPVYLIPALIVHREGLMNRPYEILARGLLGTARSSLFLSAYCASAWMWTCLTSRTFKKINIPLVAL
Subjt:  QMKTPCTIIHGNQSCGGHFLSFLIQGYKRALPVYLPVYLIPALIVHREGLMNRPYEILARGLLGTARSSLFLSAYCASAWMWTCLTSRTFKKINIPLVAL

Query:  ATFLTGLALAIEKKSRRIEISLYCLSRGIESFFSCMTDLGYLPPSLNFKRADVIVFSISTSIIMHCYAQEREVFRSKYLNVLDWVFGVPPPPCETPRCKN
        ATFLTGLALAIEKKSRRIEISLYCL+RGIESFFSCMTDLGYLPPSLNFKRADVIVFSISTSIIMHCYAQEREVFRSKYLNVLDWVFGVPPPPCETPRCKN
Subjt:  ATFLTGLALAIEKKSRRIEISLYCLSRGIESFFSCMTDLGYLPPSLNFKRADVIVFSISTSIIMHCYAQEREVFRSKYLNVLDWVFGVPPPPCETPRCKN

Query:  GNK
        G K
Subjt:  GNK

A0A6J1BRD3 uncharacterized protein LOC111004892 isoform X19.3e-26090.06Show/hide
Query:  MSPSAGGCFDADGGCACLAQQNGDAETAANCKSGDSYCEHCSYGSADSSSFPSFSCSSSSLWLDSTRLREYGKLSRILVASAKGFTIGAGLKGGLSLFSV
        MSPSA   FDADGGCAC+A+QNGD E+A NCKSG+SYC+HC  GSADSSS P+FSCSSSSLW DS RLRE GKL RILVASAKGFTIGAGLKGGLSLFS+
Subjt:  MSPSAGGCFDADGGCACLAQQNGDAETAANCKSGDSYCEHCSYGSADSSSFPSFSCSSSSLWLDSTRLREYGKLSRILVASAKGFTIGAGLKGGLSLFSV

Query:  LAGLKRRKALASLGKKGVITNRDAISMALKETLRYGLFLGTFAGTFVSIDEIIGNLAGHRRTARWRALLAGALAGPSMLLTGLNTQHKTLAIYIFMRAAV
        LAGLKRRKALASL KKGVITNRDAISMALKETLRYGLFLGTFAGTFVSIDEIIGNLAGHRRTARWRALLAGALAGPSMLLTGLNTQHK+LAIYIFMRAAV
Subjt:  LAGLKRRKALASLGKKGVITNRDAISMALKETLRYGLFLGTFAGTFVSIDEIIGNLAGHRRTARWRALLAGALAGPSMLLTGLNTQHKTLAIYIFMRAAV

Query:  LASRCGIKSKRLGHICKPLTWSCGDIFLMCLSSSQILSAYVLKQDSLPPSFRSFLNTHGGKDTVILEGLKSFVSGMPSSNKFKAVEKYYSAMGSTVKLDP
        LASRCGIKSK+LGHICKPLTWS GDIFLMCLSSSQILSAYVLKQDSLPPSFRSFLNTHGGKDTVILEGLKSFVSGMPSSNKF+ +EKYY AMG+  KLD 
Subjt:  LASRCGIKSKRLGHICKPLTWSCGDIFLMCLSSSQILSAYVLKQDSLPPSFRSFLNTHGGKDTVILEGLKSFVSGMPSSNKFKAVEKYYSAMGSTVKLDP

Query:  QMKTPCTIIHGNQSCGGHFLSFLIQGYKRALPVYLPVYLIPALIVHREGLMNRPYEILARGLLGTARSSLFLSAYCASAWMWTCLTSRTFKKINIPLVAL
        QMKTPCTIIHGNQSCGGHF+SFLIQGYKRALPVYLPVYL+PALIVHR+ L+NRP EILARGLLGTARSSLFLS YC+SAWMWTCLTSRTFKKINIPLVAL
Subjt:  QMKTPCTIIHGNQSCGGHFLSFLIQGYKRALPVYLPVYLIPALIVHREGLMNRPYEILARGLLGTARSSLFLSAYCASAWMWTCLTSRTFKKINIPLVAL

Query:  ATFLTGLALAIEKKSRRIEISLYCLSRGIESFFSCMTDLGYLPPSLNFKRADVIVFSISTSIIMHCYAQEREVFRSKYLNVLDWVFGVPPPPCETPRCKN
        ATFLTGLALAIEKKSRRIEISLYCL+RGIESFF C TD GYLP SLNFKRADVIVFS+ST+IIMHCYAQEREVFRSKYLNVLDWVFGVPPPPCETPRC N
Subjt:  ATFLTGLALAIEKKSRRIEISLYCLSRGIESFFSCMTDLGYLPPSLNFKRADVIVFSISTSIIMHCYAQEREVFRSKYLNVLDWVFGVPPPPCETPRCKN

Query:  GNK
          +
Subjt:  GNK

A0A6J1HFI3 uncharacterized protein LOC111462580 isoform X11.4e-26090.26Show/hide
Query:  MSPSAGGCFDADGGCACLAQQNGDAETAANCKSGDSYCEHCSYGSADSSSFPSFSCSSSSLWLDSTRLREYGKLSRILVASAKGFTIGAGLKGGLSLFSV
        MSPSA GCF ADGGCACLA++NGD     NCKS DSYC+HC  GSAD SS P FSCSSSSLW DSTRL E GKL RILVASAKGFTIGAGLKGGLSLFSV
Subjt:  MSPSAGGCFDADGGCACLAQQNGDAETAANCKSGDSYCEHCSYGSADSSSFPSFSCSSSSLWLDSTRLREYGKLSRILVASAKGFTIGAGLKGGLSLFSV

Query:  LAGLKRRKALASLGKKGVITNRDAISMALKETLRYGLFLGTFAGTFVSIDEIIGNLAGHRRTARWRALLAGALAGPSMLLTGLNTQHKTLAIYIFMRAAV
        LAGLKRRKALASLGKKGVITNRDAISMALKETLRYGLFLGTFAGTFVSIDEIIGNLAGHRRTARWRAL+AGALAGPSMLLTGLNTQHKTLAIYIFMRAAV
Subjt:  LAGLKRRKALASLGKKGVITNRDAISMALKETLRYGLFLGTFAGTFVSIDEIIGNLAGHRRTARWRALLAGALAGPSMLLTGLNTQHKTLAIYIFMRAAV

Query:  LASRCGIKSKRLGHICKPLTWSCGDIFLMCLSSSQILSAYVLKQDSLPPSFRSFLNTHGGKDTVILEGLKSFVSGMPSSNKFKAVEKYYSAMGSTVKLDP
        LASRCGI+SKRLGHICKPLTWS GDIFLMCLSSSQILSAYVLKQDSLPPSFRSFLNTHGGKDTVILEGLKSF+ GMP SNKF A+EKYY   G+ VKLDP
Subjt:  LASRCGIKSKRLGHICKPLTWSCGDIFLMCLSSSQILSAYVLKQDSLPPSFRSFLNTHGGKDTVILEGLKSFVSGMPSSNKFKAVEKYYSAMGSTVKLDP

Query:  QMKTPCTIIHGNQSCGGHFLSFLIQGYKRALPVYLPVYLIPALIVHREGLMNRPYEILARGLLGTARSSLFLSAYCASAWMWTCLTSRTFKKINIPLVAL
         MKTPCTIIHGNQSCGGHFLSF+I+GYKRALPVYLPVYLIPALIVHR+GLMNRPYEILARGLLGTARSSLFLS YCASAW+WTCLT+RTF+KIN+PLVA+
Subjt:  QMKTPCTIIHGNQSCGGHFLSFLIQGYKRALPVYLPVYLIPALIVHREGLMNRPYEILARGLLGTARSSLFLSAYCASAWMWTCLTSRTFKKINIPLVAL

Query:  ATFLTGLALAIEKKSRRIEISLYCLSRGIESFFSCMTDLGYLPPSLNFKRADVIVFSISTSIIMHCYAQEREVFRSKYLNVLDWVFGVPPPPCETPRCKN
        ATFLTGLALAIEKKSRRIEISLYCL+RGIESFFSCMTDLGYLP SLNFKRADVIVFS+STSIIMHCYAQER VFRSKYLNVLDWVFGVPPPPCETPRCKN
Subjt:  ATFLTGLALAIEKKSRRIEISLYCLSRGIESFFSCMTDLGYLPPSLNFKRADVIVFSISTSIIMHCYAQEREVFRSKYLNVLDWVFGVPPPPCETPRCKN

Query:  GNK
        G +
Subjt:  GNK

SwissProt top hitse value%identityAlignment
Q6GQ39 Transmembrane protein 1358.8e-0525.15Show/hide
Query:  NQSCGGHFLSFLIQGYKRALPVYLPVYLIPALIVHREGLMNRPYEILARGLLG----TARSSLFLSAYC------ASAWMWTCLTSRTFKKINIPLVALA
        N SC G +L       + +  +Y P+YL+ A I+ R+ L    +++L   L      TA  SL+++ +C         + WT            P    A
Subjt:  NQSCGGHFLSFLIQGYKRALPVYLPVYLIPALIVHREGLMNRPYEILARGLLG----TARSSLFLSAYC------ASAWMWTCLTSRTFKKINIPLVALA

Query:  TFLTGLALAIEKKSRRIEISLYCLSRGIESFFSCMTDLGYLPPSLNFKRADVIVFSISTSIIM
           +  A+ IE+KSRR  +++Y  ++  E+ F      GY+ P    +  +V++F I++++ M
Subjt:  TFLTGLALAIEKKSRRIEISLYCLSRGIESFFSCMTDLGYLPPSLNFKRADVIVFSISTSIIM

Q95QD1 Transmembrane protein 135 homolog3.0e-0532.26Show/hide
Query:  IPLVALATFLTGLALAIEKKSRRIEISLYCLSRGIESFFSCMTDLGYLPPSLNFKRADVIVFSISTSIIMHCYAQEREVFRSKYLNVLDWVFG
        +P V    F  GLA         + I++YCL + IE+ +  + D GYLP    FK  +VI+++I+T  ++     E    R  YLN L  + G
Subjt:  IPLVALATFLTGLALAIEKKSRRIEISLYCLSRGIESFFSCMTDLGYLPPSLNFKRADVIVFSISTSIIMHCYAQEREVFRSKYLNVLDWVFG

Arabidopsis top hitse value%identityAlignment
AT1G34630.1 BEST Arabidopsis thaliana protein match is: Mitochondrial import inner membrane translocase subunit Tim17/Tim22/Tim23 family protein (TAIR:AT5G51150.1)1.6e-17463.9Show/hide
Query:  AQQNGDAETAANCKSGDSYCEHCSYGSADSSSFPSFSCSSSSLWLDSTRLREYGKLSRILVASAKGFTIGAGLKGGLSLFSVLAGLKRRKALASLGKK-G
        + ++ +    ++C S D+  +   +G +D   F    C +S     S  + +  KL RI+VAS KGFTIG GLKGGL++FS++A   RR+  +   +K G
Subjt:  AQQNGDAETAANCKSGDSYCEHCSYGSADSSSFPSFSCSSSSLWLDSTRLREYGKLSRILVASAKGFTIGAGLKGGLSLFSVLAGLKRRKALASLGKK-G

Query:  VITNRDAISMALKETLRYGLFLGTFAGTFVSIDEIIGNLAGHRRTARWRALLAGALAGPSMLLTGLNTQHKTLAIYIFMRAAVLASRCGIKSKRLGHICK
          +N +AI+M +KETLRYGLFLGTFAGTFVS+DE I  LAG +RTA+WRAL AG +AGPSMLLTG NTQH +LA+YI MRAAVLASRCGIKSKR G ICK
Subjt:  VITNRDAISMALKETLRYGLFLGTFAGTFVSIDEIIGNLAGHRRTARWRALLAGALAGPSMLLTGLNTQHKTLAIYIFMRAAVLASRCGIKSKRLGHICK

Query:  PLTWSCGDIFLMCLSSSQILSAYVLKQDSLPPSFRSFLNTHGGKDTVILEGLKSFVSGMPSSNKFKAVEKYYSAMGSTVKLDPQMKTPCTIIHGNQSCGG
        PLTW  GD+FLMCLSSSQILSAY+LKQ+SLP S++SFLN  GGKD  IL+G+K   +  P +N  +A+EKYY ++G  +KLDP MK PCTIIHGN+SC  
Subjt:  PLTWSCGDIFLMCLSSSQILSAYVLKQDSLPPSFRSFLNTHGGKDTVILEGLKSFVSGMPSSNKFKAVEKYYSAMGSTVKLDPQMKTPCTIIHGNQSCGG

Query:  HFLSFLIQGYKRALPVYLPVYLIPALIVHREGLMNRPYEILARGLLGTARSSLFLSAYCASAWMWTCLTSRTFKKINIPLVALATFLTGLALAIEKKSRR
        H ++F +Q YKRALPVY+PVYLIPALIVHR+ L+ + Y IL +GLLGTARSSLFL+ YC+SAW WTCL  RTF+  NIPLVA+ATF TGLALAIEKKSRR
Subjt:  HFLSFLIQGYKRALPVYLPVYLIPALIVHREGLMNRPYEILARGLLGTARSSLFLSAYCASAWMWTCLTSRTFKKINIPLVALATFLTGLALAIEKKSRR

Query:  IEISLYCLSRGIESFFSCMTDLGYLPPSLNFKRADVIVFSISTSIIMHCYAQEREVFRSKYLNVLDWVFGVPPPP----CET
        IEISLYCL+R IESFF+CMT+ GY+ P  + +RADV+VFS+ST+IIMHCYAQER+VFRSKYLNVLDWVFGVPPPP    CET
Subjt:  IEISLYCLSRGIESFFSCMTDLGYLPPSLNFKRADVIVFSISTSIIMHCYAQEREVFRSKYLNVLDWVFGVPPPP----CET

AT1G34630.2 FUNCTIONS IN: molecular_function unknown2.4e-12760.05Show/hide
Query:  AQQNGDAETAANCKSGDSYCEHCSYGSADSSSFPSFSCSSSSLWLDSTRLREYGKLSRILVASAKGFTIGAGLKGGLSLFSVLAGLKRRKALASLGKK-G
        + ++ +    ++C S D+  +   +G +D   F    C +S     S  + +  KL RI+VAS KGFTIG GLKGGL++FS++A   RR+  +   +K G
Subjt:  AQQNGDAETAANCKSGDSYCEHCSYGSADSSSFPSFSCSSSSLWLDSTRLREYGKLSRILVASAKGFTIGAGLKGGLSLFSVLAGLKRRKALASLGKK-G

Query:  VITNRDAISMALKETLRYGLFLGTFAGTFVSIDEIIGNLAGHRRTARWRALLAGALAGPSMLLTGLNTQHKTLAIYIFMRAAVLASRCGIKSKRLGHICK
          +N +AI+M +KETLRYGLFLGTFAGTFVS+DE I  LAG +RTA+WRAL AG +AGPSMLLTG NTQH +LA+YI MRAAVLASRCGIKSKR G ICK
Subjt:  VITNRDAISMALKETLRYGLFLGTFAGTFVSIDEIIGNLAGHRRTARWRALLAGALAGPSMLLTGLNTQHKTLAIYIFMRAAVLASRCGIKSKRLGHICK

Query:  PLTWSCGDIFLMCLSSSQILSAYVLKQDSLPPSFRSFLNTHGGKDTVILEGLKSFVSGMPSSNKFKAVEKYYSAMGSTVKLDPQMKTPCTIIHGNQSCGG
        PLTW  GD+FLMCLSSSQILSAY+LKQ+SLP S++SFLN  GGKD  IL+G+K   +  P +N  +A+EKYY ++G  +KLDP MK PCTIIHGN+SC  
Subjt:  PLTWSCGDIFLMCLSSSQILSAYVLKQDSLPPSFRSFLNTHGGKDTVILEGLKSFVSGMPSSNKFKAVEKYYSAMGSTVKLDPQMKTPCTIIHGNQSCGG

Query:  HFLSFLIQGYKRALPVYLPVYLIPALIVHREGLMNRPYEILARGLLGTARSSLFLSAYCASAWMWTCLTSRTFKKINIPLVALATFLT
        H ++F +Q YKRALPVY+PVYLIPALIVHR+ L+ + Y IL +GLLGTARSSLFL+ YC+SAW WTCL  RTF+  NIPLVA+AT  T
Subjt:  HFLSFLIQGYKRALPVYLPVYLIPALIVHREGLMNRPYEILARGLLGTARSSLFLSAYCASAWMWTCLTSRTFKKINIPLVALATFLT

AT5G51150.1 Mitochondrial import inner membrane translocase subunit Tim17/Tim22/Tim23 family protein1.1e-2625.33Show/hide
Query:  KGFTIGAGLKGGLSLFSVLAGLKRRKALAS-LGKKGVITNRDAISMALKETLRYGLFLGTFAGTFVSIDEIIGNLAGHRRTARWRALLAGALAGPSMLLT
        + F +  G++ G+ +      L R ++ +S L  K +++ +D I    +E  R GL  G F G++ ++   +      ++     ++LAG++AG S+L  
Subjt:  KGFTIGAGLKGGLSLFSVLAGLKRRKALAS-LGKKGVITNRDAISMALKETLRYGLFLGTFAGTFVSIDEIIGNLAGHRRTARWRALLAGALAGPSMLLT

Query:  GLNTQHKTLAIYIFMRAAVLASRCGIKSKRLGHICKPLTWSCGDIFLMCLSSSQILSAYVLKQDSLPPSFRSFLNTHGGKDTVILEGLKSFVSGMPSSNK
          + Q +TLA+Y+  R    A     KSK   H+     W  GD  L  L+ +Q++ +++++ ++LP S+R F+   G     + + ++    G P    
Subjt:  GLNTQHKTLAIYIFMRAAVLASRCGIKSKRLGHICKPLTWSCGDIFLMCLSSSQILSAYVLKQDSLPPSFRSFLNTHGGKDTVILEGLKSFVSGMPSSNK

Query:  FKAVEKYYSAM--GSTVKLDPQMK-TPCTIIHGN-QSCGGHFLSFLIQGYKRALPVYLPVYLIPALIVHREGLMNRPYEILARGLLGTARSSLFLSAYCA
          ++  Y S+    S VK++      PC  IH N  SC     + +   +K+  P+Y  +  +P +++H +  M  PY      +  + RS+ FLSA+  
Subjt:  FKAVEKYYSAM--GSTVKLDPQMK-TPCTIIHGN-QSCGGHFLSFLIQGYKRALPVYLPVYLIPALIVHREGLMNRPYEILARGLLGTARSSLFLSAYCA

Query:  SAWMWTCLTSRTFKKINIPLVALATFLTGLALAIEKKSRRIEISLYCLSRGIESFFSCMTDLGYLPPSLNFKRADVIVF
            + C   +   K +  +   A     L++ +EKK RR E++LY L R  +S +  + +   LP   + K A+V +F
Subjt:  SAWMWTCLTSRTFKKINIPLVALATFLTGLALAIEKKSRRIEISLYCLSRGIESFFSCMTDLGYLPPSLNFKRADVIVF


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCGCCGTCTGCTGGCGGATGCTTCGACGCTGACGGCGGATGTGCATGCCTTGCTCAACAGAACGGGGATGCCGAAACAGCTGCCAACTGTAAATCCGGTGATTCCTA
TTGCGAGCATTGTAGCTATGGTTCAGCTGATTCATCATCTTTTCCCTCGTTTTCCTGCTCTTCCTCATCTCTGTGGCTTGATTCGACGCGGTTAAGAGAGTACGGGAAGC
TATCGCGGATCCTTGTTGCTTCTGCTAAAGGCTTCACAATTGGAGCTGGTCTCAAAGGTGGTCTCTCTCTCTTTTCTGTCCTCGCTGGATTGAAGCGGAGAAAGGCTTTG
GCCTCCCTCGGGAAGAAAGGAGTGATTACGAATCGAGATGCGATTTCCATGGCTTTGAAGGAGACTTTGAGATACGGCCTTTTTCTTGGAACCTTTGCCGGTACGTTCGT
TTCCATTGATGAGATAATTGGCAATCTGGCAGGTCACCGTAGGACTGCAAGATGGCGGGCTCTATTGGCGGGAGCATTAGCTGGGCCGTCGATGCTTCTGACTGGGTTAA
ATACGCAACATAAGACCTTGGCTATCTACATTTTCATGCGTGCTGCGGTCTTGGCATCGCGCTGTGGGATTAAAAGCAAGAGGCTCGGGCATATTTGTAAGCCTCTCACG
TGGTCATGTGGTGACATCTTCCTCATGTGTCTCTCCTCTTCGCAGATCTTGTCTGCTTATGTGTTAAAGCAAGATAGCTTACCACCATCGTTTAGGTCCTTTCTCAATAC
ACATGGTGGAAAGGATACTGTAATCTTGGAAGGCTTAAAGAGTTTCGTATCAGGCATGCCTTCCTCCAATAAATTTAAAGCAGTAGAGAAGTACTACAGTGCCATGGGTT
CAACTGTCAAATTAGATCCGCAAATGAAGACTCCATGCACGATCATACATGGAAATCAATCATGTGGTGGCCATTTTCTTTCCTTTCTCATTCAAGGATATAAAAGAGCG
TTGCCAGTATACCTCCCTGTTTATCTTATCCCAGCTCTAATAGTTCATCGTGAAGGTCTCATGAATAGGCCATACGAAATTTTAGCTAGGGGGCTTCTTGGAACTGCTAG
ATCAAGTCTGTTCCTCTCTGCATATTGTGCATCTGCTTGGATGTGGACATGCCTGACCTCAAGGACTTTCAAAAAAATAAATATTCCATTGGTTGCTCTAGCAACGTTCT
TAACAGGTTTAGCACTGGCCATTGAGAAGAAAAGCAGGAGGATAGAAATCTCACTCTATTGCCTCTCTAGAGGTATCGAGAGCTTCTTCAGCTGCATGACAGATTTGGGA
TACTTGCCACCATCATTGAATTTCAAACGAGCAGATGTAATAGTTTTCAGCATATCAACTTCCATTATCATGCATTGCTATGCTCAGGAAAGGGAGGTATTTCGATCCAA
GTATCTTAACGTTCTCGATTGGGTTTTTGGTGTGCCTCCTCCCCCTTGTGAAACACCACGCTGCAAGAATGGCAACAAGTGTTGA
mRNA sequenceShow/hide mRNA sequence
ATTAATTCACGCGTAGCAAAACCATTAGCACAAACCTTCTCTGTTTTATCTCTCGATTCCTTTATTCTGCCATTTTTGGTTCTCCCACTTGCAATCCCACTCAGCTCACG
TTTGTAAGGAGGAGCCCTAGCCTCTGCTTCCGGGACCATCATGTCGCCGTCTGCTGGCGGATGCTTCGACGCTGACGGCGGATGTGCATGCCTTGCTCAACAGAACGGGG
ATGCCGAAACAGCTGCCAACTGTAAATCCGGTGATTCCTATTGCGAGCATTGTAGCTATGGTTCAGCTGATTCATCATCTTTTCCCTCGTTTTCCTGCTCTTCCTCATCT
CTGTGGCTTGATTCGACGCGGTTAAGAGAGTACGGGAAGCTATCGCGGATCCTTGTTGCTTCTGCTAAAGGCTTCACAATTGGAGCTGGTCTCAAAGGTGGTCTCTCTCT
CTTTTCTGTCCTCGCTGGATTGAAGCGGAGAAAGGCTTTGGCCTCCCTCGGGAAGAAAGGAGTGATTACGAATCGAGATGCGATTTCCATGGCTTTGAAGGAGACTTTGA
GATACGGCCTTTTTCTTGGAACCTTTGCCGGTACGTTCGTTTCCATTGATGAGATAATTGGCAATCTGGCAGGTCACCGTAGGACTGCAAGATGGCGGGCTCTATTGGCG
GGAGCATTAGCTGGGCCGTCGATGCTTCTGACTGGGTTAAATACGCAACATAAGACCTTGGCTATCTACATTTTCATGCGTGCTGCGGTCTTGGCATCGCGCTGTGGGAT
TAAAAGCAAGAGGCTCGGGCATATTTGTAAGCCTCTCACGTGGTCATGTGGTGACATCTTCCTCATGTGTCTCTCCTCTTCGCAGATCTTGTCTGCTTATGTGTTAAAGC
AAGATAGCTTACCACCATCGTTTAGGTCCTTTCTCAATACACATGGTGGAAAGGATACTGTAATCTTGGAAGGCTTAAAGAGTTTCGTATCAGGCATGCCTTCCTCCAAT
AAATTTAAAGCAGTAGAGAAGTACTACAGTGCCATGGGTTCAACTGTCAAATTAGATCCGCAAATGAAGACTCCATGCACGATCATACATGGAAATCAATCATGTGGTGG
CCATTTTCTTTCCTTTCTCATTCAAGGATATAAAAGAGCGTTGCCAGTATACCTCCCTGTTTATCTTATCCCAGCTCTAATAGTTCATCGTGAAGGTCTCATGAATAGGC
CATACGAAATTTTAGCTAGGGGGCTTCTTGGAACTGCTAGATCAAGTCTGTTCCTCTCTGCATATTGTGCATCTGCTTGGATGTGGACATGCCTGACCTCAAGGACTTTC
AAAAAAATAAATATTCCATTGGTTGCTCTAGCAACGTTCTTAACAGGTTTAGCACTGGCCATTGAGAAGAAAAGCAGGAGGATAGAAATCTCACTCTATTGCCTCTCTAG
AGGTATCGAGAGCTTCTTCAGCTGCATGACAGATTTGGGATACTTGCCACCATCATTGAATTTCAAACGAGCAGATGTAATAGTTTTCAGCATATCAACTTCCATTATCA
TGCATTGCTATGCTCAGGAAAGGGAGGTATTTCGATCCAAGTATCTTAACGTTCTCGATTGGGTTTTTGGTGTGCCTCCTCCCCCTTGTGAAACACCACGCTGCAAGAAT
GGCAACAAGTGTTGAAAGAAGCCTTGAATCTCTGACGAACTAACACACTGGATTTAAATTGAAGGTACCAAATATTTGCCAGTACTGTTTAGTCAAGAATTTTCATACTA
CACAATATTTTAATGTTTAGTTACCATTACCTTCTATAGAATTAGACTCATTTGATTACATGATTCATATGAAGATGTAGTTTTCCATATACACATTGATTTAGTGGACG
ATTTTTCGAAATTGTTAAATAACAGTCGACGG
Protein sequenceShow/hide protein sequence
MSPSAGGCFDADGGCACLAQQNGDAETAANCKSGDSYCEHCSYGSADSSSFPSFSCSSSSLWLDSTRLREYGKLSRILVASAKGFTIGAGLKGGLSLFSVLAGLKRRKAL
ASLGKKGVITNRDAISMALKETLRYGLFLGTFAGTFVSIDEIIGNLAGHRRTARWRALLAGALAGPSMLLTGLNTQHKTLAIYIFMRAAVLASRCGIKSKRLGHICKPLT
WSCGDIFLMCLSSSQILSAYVLKQDSLPPSFRSFLNTHGGKDTVILEGLKSFVSGMPSSNKFKAVEKYYSAMGSTVKLDPQMKTPCTIIHGNQSCGGHFLSFLIQGYKRA
LPVYLPVYLIPALIVHREGLMNRPYEILARGLLGTARSSLFLSAYCASAWMWTCLTSRTFKKINIPLVALATFLTGLALAIEKKSRRIEISLYCLSRGIESFFSCMTDLG
YLPPSLNFKRADVIVFSISTSIIMHCYAQEREVFRSKYLNVLDWVFGVPPPPCETPRCKNGNKC