; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Chy3G063500 (gene) of Cucumber (hystrix) v1 genome

Gene IDChy3G063500
OrganismCucumis hystrix (Cucumber (hystrix) v1)
DescriptionTMEM135_C_rich domain-containing protein
Genome locationchrH03:16543985..16548861
RNA-Seq ExpressionChy3G063500
SyntenyChy3G063500
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR026749 - Transmembrane protein 135


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008444833.1 PREDICTED: uncharacterized protein LOC103488060 [Cucumis melo]0.094.23Show/hide
Query:  MSPSAGGCFDADGGCACLAQQNGDAETAANCKSGDSFCEHCSYGSTDSSSFPSFSCSSSSLWLDSTRLREYGKLSRILVASAKGFTIGAGLKGGLSLFSV
        MSPSAGGCFDA+GGCACLAQQNGDAETAANCKSGDS+CEHC YGS DSSSFPSFSCSSSSLWLDSTRLREYGKL RILVASAKGFTIGAGLKGGLSLFSV
Subjt:  MSPSAGGCFDADGGCACLAQQNGDAETAANCKSGDSFCEHCSYGSTDSSSFPSFSCSSSSLWLDSTRLREYGKLSRILVASAKGFTIGAGLKGGLSLFSV

Query:  LAGLKRRKTLASLGKKGVITNRDAISMALKETLRYGLFLGTFAGTFVSIDEIIGNLAGHRRTARWRALLAGALAGPSMLLTGLNTQHKTLAIYIFMRAAV
        LAGLKRRK LASLGKKGVITNRDAISMALKETLRYGLFLGTFAGTFVSIDEIIGNLAGHRRTARWRALLAGALAGPSMLLTGLNTQHKTLAIYIFMRAAV
Subjt:  LAGLKRRKTLASLGKKGVITNRDAISMALKETLRYGLFLGTFAGTFVSIDEIIGNLAGHRRTARWRALLAGALAGPSMLLTGLNTQHKTLAIYIFMRAAV

Query:  LASRCGIKSKRLGHICKPLTWSCGDTFLMCLSSSQILSAYVLKQDSLPPSFRSFLNTHGGKDTVILEGLKSFVSGMPSSNKFKAIEKYYSAMGSAVKLDP
        LASRCGIKSKRLGHICKPLTWS GD FLMCLSSSQILSAYVLKQDSLPPSFRSFLNTHGGKDTVILEGLKSFVSGMPSS+KFKAIEKYYSAMG+AVKLDP
Subjt:  LASRCGIKSKRLGHICKPLTWSCGDTFLMCLSSSQILSAYVLKQDSLPPSFRSFLNTHGGKDTVILEGLKSFVSGMPSSNKFKAIEKYYSAMGSAVKLDP

Query:  QMKTPCTIIHGNQSCGGHFLSFLIQGYKRALPVYLPVYLIPALIVHREGLMNRPYEILARGLLGTARSSLFLSAYCASAW------ARAFVPSLIKLRLF
        QMKTPCTIIHGNQSCGGHFLSFLIQGYKRALPVYLPVYLIPALIVHREGLMNRPYEILARGLLGTARSSLFLSAYCASAW      +R F    I L   
Subjt:  QMKTPCTIIHGNQSCGGHFLSFLIQGYKRALPVYLPVYLIPALIVHREGLMNRPYEILARGLLGTARSSLFLSAYCASAW------ARAFVPSLIKLRLF

Query:  EQFLTGLALAIEKKSRRIEISLYCLSRGIESFFSCMTDLGYLPPSLNFKRADVIVFSISTSIIMHCYAQEREVFRSKYLNVLDWVFGVPPPPCETPRCKN
          FLTGLALAIEKKSRRIEISLYCL+RGIESFFSCMTDLGYLPPSLNFKRADVIVFSISTSIIMHCYAQEREVFRSKYLNVLDWVFGVPPPPCETPRCKN
Subjt:  EQFLTGLALAIEKKSRRIEISLYCLSRGIESFFSCMTDLGYLPPSLNFKRADVIVFSISTSIIMHCYAQEREVFRSKYLNVLDWVFGVPPPPCETPRCKN

Query:  GKK
        GKK
Subjt:  GKK

XP_011649669.1 uncharacterized protein LOC101202879 isoform X1 [Cucumis sativus]0.094.84Show/hide
Query:  MSPSAGGCFDADGGCACLAQQNGDAETAANCKSGDSFCEHCSYGSTDSSSFPSFSCSSSSLWLDSTRLREYGKLSRILVASAKGFTIGAGLKGGLSLFSV
        MSPSAGGCFDADGGCACLAQQNGDAETAANCKSGDS+CEHCSYGS DSSSFPSFSCSSSSLWLDSTRLREYGKLSRILVASAKGFTIGAGLKGGLSLFSV
Subjt:  MSPSAGGCFDADGGCACLAQQNGDAETAANCKSGDSFCEHCSYGSTDSSSFPSFSCSSSSLWLDSTRLREYGKLSRILVASAKGFTIGAGLKGGLSLFSV

Query:  LAGLKRRKTLASLGKKGVITNRDAISMALKETLRYGLFLGTFAGTFVSIDEIIGNLAGHRRTARWRALLAGALAGPSMLLTGLNTQHKTLAIYIFMRAAV
        LAGLKRRK LASLGKKGVITNRDAISMALKETLRYGLFLGTFAGTFVSIDEIIGN+AGHRRTARWRALLAGALAGPSMLLTGLNTQHKTLAIYIFMRAAV
Subjt:  LAGLKRRKTLASLGKKGVITNRDAISMALKETLRYGLFLGTFAGTFVSIDEIIGNLAGHRRTARWRALLAGALAGPSMLLTGLNTQHKTLAIYIFMRAAV

Query:  LASRCGIKSKRLGHICKPLTWSCGDTFLMCLSSSQILSAYVLKQDSLPPSFRSFLNTHGGKDTVILEGLKSFVSGMPSSNKFKAIEKYYSAMGSAVKLDP
        LASRCGIKSKRLGHICKPLTWSCGD FLMCLSSSQILSAYVLKQDSLPPSFRSFLNTHGGKDTVILEGLKSFVSGMPSSNKFKA+EKYYSAMGS VKLDP
Subjt:  LASRCGIKSKRLGHICKPLTWSCGDTFLMCLSSSQILSAYVLKQDSLPPSFRSFLNTHGGKDTVILEGLKSFVSGMPSSNKFKAIEKYYSAMGSAVKLDP

Query:  QMKTPCTIIHGNQSCGGHFLSFLIQGYKRALPVYLPVYLIPALIVHREGLMNRPYEILARGLLGTARSSLFLSAYCASAW------ARAFVPSLIKLRLF
        QMKTPCTIIHGNQSCGGHFLSFLIQGYKRALPVYLPVYLIPALIVHREGLMNRPYEILARGLLGTARSSLFLSAYCASAW      +R F    I L   
Subjt:  QMKTPCTIIHGNQSCGGHFLSFLIQGYKRALPVYLPVYLIPALIVHREGLMNRPYEILARGLLGTARSSLFLSAYCASAW------ARAFVPSLIKLRLF

Query:  EQFLTGLALAIEKKSRRIEISLYCLSRGIESFFSCMTDLGYLPPSLNFKRADVIVFSISTSIIMHCYAQEREVFRSKYLNVLDWVFGVPPPPCETPRCKN
          FLTGLALAIEKKSRRIEISLYCLSRGIESFFSCMTDLGYLPPSLNFKRADVIVFSISTSIIMHCYAQEREVFRSKYLNVLDWVFGVPPPPCETPRCKN
Subjt:  EQFLTGLALAIEKKSRRIEISLYCLSRGIESFFSCMTDLGYLPPSLNFKRADVIVFSISTSIIMHCYAQEREVFRSKYLNVLDWVFGVPPPPCETPRCKN

Query:  GKKC
        G KC
Subjt:  GKKC

XP_022961964.1 uncharacterized protein LOC111462580 isoform X1 [Cucurbita moschata]6.11e-31587.28Show/hide
Query:  MSPSAGGCFDADGGCACLAQQNGDAETAANCKSGDSFCEHCSYGSTDSSSFPSFSCSSSSLWLDSTRLREYGKLSRILVASAKGFTIGAGLKGGLSLFSV
        MSPSA GCF ADGGCACLA++NGD     NCKS DS+C+HC  GS D SS P FSCSSSSLW DSTRL E GKL RILVASAKGFTIGAGLKGGLSLFSV
Subjt:  MSPSAGGCFDADGGCACLAQQNGDAETAANCKSGDSFCEHCSYGSTDSSSFPSFSCSSSSLWLDSTRLREYGKLSRILVASAKGFTIGAGLKGGLSLFSV

Query:  LAGLKRRKTLASLGKKGVITNRDAISMALKETLRYGLFLGTFAGTFVSIDEIIGNLAGHRRTARWRALLAGALAGPSMLLTGLNTQHKTLAIYIFMRAAV
        LAGLKRRK LASLGKKGVITNRDAISMALKETLRYGLFLGTFAGTFVSIDEIIGNLAGHRRTARWRAL+AGALAGPSMLLTGLNTQHKTLAIYIFMRAAV
Subjt:  LAGLKRRKTLASLGKKGVITNRDAISMALKETLRYGLFLGTFAGTFVSIDEIIGNLAGHRRTARWRALLAGALAGPSMLLTGLNTQHKTLAIYIFMRAAV

Query:  LASRCGIKSKRLGHICKPLTWSCGDTFLMCLSSSQILSAYVLKQDSLPPSFRSFLNTHGGKDTVILEGLKSFVSGMPSSNKFKAIEKYYSAMGSAVKLDP
        LASRCGI+SKRLGHICKPLTWS GD FLMCLSSSQILSAYVLKQDSLPPSFRSFLNTHGGKDTVILEGLKSF+ GMPS NKF AIEKYY   G+ VKLDP
Subjt:  LASRCGIKSKRLGHICKPLTWSCGDTFLMCLSSSQILSAYVLKQDSLPPSFRSFLNTHGGKDTVILEGLKSFVSGMPSSNKFKAIEKYYSAMGSAVKLDP

Query:  QMKTPCTIIHGNQSCGGHFLSFLIQGYKRALPVYLPVYLIPALIVHREGLMNRPYEILARGLLGTARSSLFLSAYCASAW------ARAFVPSLIKLRLF
         MKTPCTIIHGNQSCGGHFLSF+I+GYKRALPVYLPVYLIPALIVHR+GLMNRPYEILARGLLGTARSSLFLS YCASAW      AR F    + L   
Subjt:  QMKTPCTIIHGNQSCGGHFLSFLIQGYKRALPVYLPVYLIPALIVHREGLMNRPYEILARGLLGTARSSLFLSAYCASAW------ARAFVPSLIKLRLF

Query:  EQFLTGLALAIEKKSRRIEISLYCLSRGIESFFSCMTDLGYLPPSLNFKRADVIVFSISTSIIMHCYAQEREVFRSKYLNVLDWVFGVPPPPCETPRCKN
          FLTGLALAIEKKSRRIEISLYCL+RGIESFFSCMTDLGYLP SLNFKRADVIVFS+STSIIMHCYAQER VFRSKYLNVLDWVFGVPPPPCETPRCKN
Subjt:  EQFLTGLALAIEKKSRRIEISLYCLSRGIESFFSCMTDLGYLPPSLNFKRADVIVFSISTSIIMHCYAQEREVFRSKYLNVLDWVFGVPPPPCETPRCKN

Query:  GKK
        GK+
Subjt:  GKK

XP_023545566.1 uncharacterized protein LOC111804956 isoform X1 [Cucurbita pepo subsp. pepo]0.087.87Show/hide
Query:  MSPSAGGCFDADGGCACLAQQNGDAETAANCKSGDSFCEHCSYGSTDSSSFPSFSCSSSSLWLDSTRLREYGKLSRILVASAKGFTIGAGLKGGLSLFSV
        MSPSA GCF ADGGCACLA++NGD     NCKSGDS+C+HC  GS D SS P+FSCSSSSLW DS RLRE GKL RILVASAKGFTIGAGLKGGLSLFSV
Subjt:  MSPSAGGCFDADGGCACLAQQNGDAETAANCKSGDSFCEHCSYGSTDSSSFPSFSCSSSSLWLDSTRLREYGKLSRILVASAKGFTIGAGLKGGLSLFSV

Query:  LAGLKRRKTLASLGKKGVITNRDAISMALKETLRYGLFLGTFAGTFVSIDEIIGNLAGHRRTARWRALLAGALAGPSMLLTGLNTQHKTLAIYIFMRAAV
        LAGLKRRK LASLGKKGVITNRDA+SMALKETLRYGLFLGTFAGTFVSIDEIIGNLAGHRRTARWRAL+AGALAGPSMLLTGLNTQHKTLAIYIFMRAAV
Subjt:  LAGLKRRKTLASLGKKGVITNRDAISMALKETLRYGLFLGTFAGTFVSIDEIIGNLAGHRRTARWRALLAGALAGPSMLLTGLNTQHKTLAIYIFMRAAV

Query:  LASRCGIKSKRLGHICKPLTWSCGDTFLMCLSSSQILSAYVLKQDSLPPSFRSFLNTHGGKDTVILEGLKSFVSGMPSSNKFKAIEKYYSAMGSAVKLDP
        LASRCGI+SKRLGHICKPLTWS GD FLMCLSSSQILSAYVLKQDSLPPSFRSFLNTHGGKDTVILEGLKSF+ GMPS NKF AIEKYY   G+ VKLDP
Subjt:  LASRCGIKSKRLGHICKPLTWSCGDTFLMCLSSSQILSAYVLKQDSLPPSFRSFLNTHGGKDTVILEGLKSFVSGMPSSNKFKAIEKYYSAMGSAVKLDP

Query:  QMKTPCTIIHGNQSCGGHFLSFLIQGYKRALPVYLPVYLIPALIVHREGLMNRPYEILARGLLGTARSSLFLSAYCASAW------ARAFVPSLIKLRLF
         MKTPCTIIHGNQSCGGHFLSF+IQGYKRALPVYLPVYLIPALIVHR+GLMNRPYEILARGLLGTARSSLFLS YCASAW      AR F    + L   
Subjt:  QMKTPCTIIHGNQSCGGHFLSFLIQGYKRALPVYLPVYLIPALIVHREGLMNRPYEILARGLLGTARSSLFLSAYCASAW------ARAFVPSLIKLRLF

Query:  EQFLTGLALAIEKKSRRIEISLYCLSRGIESFFSCMTDLGYLPPSLNFKRADVIVFSISTSIIMHCYAQEREVFRSKYLNVLDWVFGVPPPPCETPRCKN
          FLTGLALAIEKKSRRIEISLYCL+RGIESFFSCMTDLGYLPPSLNFKRADVIVFS+STSIIMHCYAQEREVFRSKYLNVLDWVFGVPPPPCETPRCKN
Subjt:  EQFLTGLALAIEKKSRRIEISLYCLSRGIESFFSCMTDLGYLPPSLNFKRADVIVFSISTSIIMHCYAQEREVFRSKYLNVLDWVFGVPPPPCETPRCKN

Query:  GKK
        GK+
Subjt:  GKK

XP_038886157.1 uncharacterized protein LOC120076412 isoform X1 [Benincasa hispida]0.090.26Show/hide
Query:  MSPSAGGCFDADGGCACLAQQNGDAETAANCKSGDSFCEHCSYGSTDSSSFPSFSCSSSSLWLDSTRLREYGKLSRILVASAKGFTIGAGLKGGLSLFSV
        MSPSA G FDADGGC C A QNGDA+  AN KSGDS+CEHC  GS DSSSFPSFSCSSSSLWLDSTRLRE GKL RILVASAKGFTIGAGLKGGLSLFSV
Subjt:  MSPSAGGCFDADGGCACLAQQNGDAETAANCKSGDSFCEHCSYGSTDSSSFPSFSCSSSSLWLDSTRLREYGKLSRILVASAKGFTIGAGLKGGLSLFSV

Query:  LAGLKRRKTLASLGKKGVITNRDAISMALKETLRYGLFLGTFAGTFVSIDEIIGNLAGHRRTARWRALLAGALAGPSMLLTGLNTQHKTLAIYIFMRAAV
        LAGLKRRK LASLGKKGVITNRDAISMALKETLRYGLFLGTFAGTFVSIDEI+GNL GHRRTA WRALLAGALAGPSMLLTGLNTQHKTLAIYIFMRAAV
Subjt:  LAGLKRRKTLASLGKKGVITNRDAISMALKETLRYGLFLGTFAGTFVSIDEIIGNLAGHRRTARWRALLAGALAGPSMLLTGLNTQHKTLAIYIFMRAAV

Query:  LASRCGIKSKRLGHICKPLTWSCGDTFLMCLSSSQILSAYVLKQDSLPPSFRSFLNTHGGKDTVILEGLKSFVSGMPSSNKFKAIEKYYSAMGSAVKLDP
        LASRCGIKSKR GHICKPLTWS GD FLMCLSSSQILSAYVLKQDSLPPSFRSFLNTHGGKDTVILEGLKSFVSGMPSSNKFKAIEKYYSAMG+ V+L+ 
Subjt:  LASRCGIKSKRLGHICKPLTWSCGDTFLMCLSSSQILSAYVLKQDSLPPSFRSFLNTHGGKDTVILEGLKSFVSGMPSSNKFKAIEKYYSAMGSAVKLDP

Query:  QMKTPCTIIHGNQSCGGHFLSFLIQGYKRALPVYLPVYLIPALIVHREGLMNRPYEILARGLLGTARSSLFLSAYCASAW------ARAFVPSLIKLRLF
        QMKTPC IIHGNQSCGGHFLSFLI+GYKRALPVYLPVYLIPALIVHREGLMNRPYEILARGLLGTARSSLFLSAYCASAW      AR+F    I L   
Subjt:  QMKTPCTIIHGNQSCGGHFLSFLIQGYKRALPVYLPVYLIPALIVHREGLMNRPYEILARGLLGTARSSLFLSAYCASAW------ARAFVPSLIKLRLF

Query:  EQFLTGLALAIEKKSRRIEISLYCLSRGIESFFSCMTDLGYLPPSLNFKRADVIVFSISTSIIMHCYAQEREVFRSKYLNVLDWVFGVPPPPCETPRCKN
          FLTGLALAIEKKSRRIEISLYCL+RGIESFFS MTDLGYLPPSLNFKRADVIVFSISTSIIMHCYAQEREVFRSKYLNVLDWVFGVPPPPCETPRCKN
Subjt:  EQFLTGLALAIEKKSRRIEISLYCLSRGIESFFSCMTDLGYLPPSLNFKRADVIVFSISTSIIMHCYAQEREVFRSKYLNVLDWVFGVPPPPCETPRCKN

Query:  GKK
        GK+
Subjt:  GKK

TrEMBL top hitse value%identityAlignment
A0A0A0LRQ1 Uncharacterized protein2.1e-27294.84Show/hide
Query:  MSPSAGGCFDADGGCACLAQQNGDAETAANCKSGDSFCEHCSYGSTDSSSFPSFSCSSSSLWLDSTRLREYGKLSRILVASAKGFTIGAGLKGGLSLFSV
        MSPSAGGCFDADGGCACLAQQNGDAETAANCKSGDS+CEHCSYGS DSSSFPSFSCSSSSLWLDSTRLREYGKLSRILVASAKGFTIGAGLKGGLSLFSV
Subjt:  MSPSAGGCFDADGGCACLAQQNGDAETAANCKSGDSFCEHCSYGSTDSSSFPSFSCSSSSLWLDSTRLREYGKLSRILVASAKGFTIGAGLKGGLSLFSV

Query:  LAGLKRRKTLASLGKKGVITNRDAISMALKETLRYGLFLGTFAGTFVSIDEIIGNLAGHRRTARWRALLAGALAGPSMLLTGLNTQHKTLAIYIFMRAAV
        LAGLKRRK LASLGKKGVITNRDAISMALKETLRYGLFLGTFAGTFVSIDEIIGN+AGHRRTARWRALLAGALAGPSMLLTGLNTQHKTLAIYIFMRAAV
Subjt:  LAGLKRRKTLASLGKKGVITNRDAISMALKETLRYGLFLGTFAGTFVSIDEIIGNLAGHRRTARWRALLAGALAGPSMLLTGLNTQHKTLAIYIFMRAAV

Query:  LASRCGIKSKRLGHICKPLTWSCGDTFLMCLSSSQILSAYVLKQDSLPPSFRSFLNTHGGKDTVILEGLKSFVSGMPSSNKFKAIEKYYSAMGSAVKLDP
        LASRCGIKSKRLGHICKPLTWSCGD FLMCLSSSQILSAYVLKQDSLPPSFRSFLNTHGGKDTVILEGLKSFVSGMPSSNKFKA+EKYYSAMGS VKLDP
Subjt:  LASRCGIKSKRLGHICKPLTWSCGDTFLMCLSSSQILSAYVLKQDSLPPSFRSFLNTHGGKDTVILEGLKSFVSGMPSSNKFKAIEKYYSAMGSAVKLDP

Query:  QMKTPCTIIHGNQSCGGHFLSFLIQGYKRALPVYLPVYLIPALIVHREGLMNRPYEILARGLLGTARSSLFLSAYCASAW------ARAFVPSLIKLRLF
        QMKTPCTIIHGNQSCGGHFLSFLIQGYKRALPVYLPVYLIPALIVHREGLMNRPYEILARGLLGTARSSLFLSAYCASAW      +R F    I L   
Subjt:  QMKTPCTIIHGNQSCGGHFLSFLIQGYKRALPVYLPVYLIPALIVHREGLMNRPYEILARGLLGTARSSLFLSAYCASAW------ARAFVPSLIKLRLF

Query:  EQFLTGLALAIEKKSRRIEISLYCLSRGIESFFSCMTDLGYLPPSLNFKRADVIVFSISTSIIMHCYAQEREVFRSKYLNVLDWVFGVPPPPCETPRCKN
          FLTGLALAIEKKSRRIEISLYCLSRGIESFFSCMTDLGYLPPSLNFKRADVIVFSISTSIIMHCYAQEREVFRSKYLNVLDWVFGVPPPPCETPRCKN
Subjt:  EQFLTGLALAIEKKSRRIEISLYCLSRGIESFFSCMTDLGYLPPSLNFKRADVIVFSISTSIIMHCYAQEREVFRSKYLNVLDWVFGVPPPPCETPRCKN

Query:  GKKC
        G KC
Subjt:  GKKC

A0A1S3BAT4 uncharacterized protein LOC1034880603.2e-26894.23Show/hide
Query:  MSPSAGGCFDADGGCACLAQQNGDAETAANCKSGDSFCEHCSYGSTDSSSFPSFSCSSSSLWLDSTRLREYGKLSRILVASAKGFTIGAGLKGGLSLFSV
        MSPSAGGCFDA+GGCACLAQQNGDAETAANCKSGDS+CEHC YGS DSSSFPSFSCSSSSLWLDSTRLREYGKL RILVASAKGFTIGAGLKGGLSLFSV
Subjt:  MSPSAGGCFDADGGCACLAQQNGDAETAANCKSGDSFCEHCSYGSTDSSSFPSFSCSSSSLWLDSTRLREYGKLSRILVASAKGFTIGAGLKGGLSLFSV

Query:  LAGLKRRKTLASLGKKGVITNRDAISMALKETLRYGLFLGTFAGTFVSIDEIIGNLAGHRRTARWRALLAGALAGPSMLLTGLNTQHKTLAIYIFMRAAV
        LAGLKRRK LASLGKKGVITNRDAISMALKETLRYGLFLGTFAGTFVSIDEIIGNLAGHRRTARWRALLAGALAGPSMLLTGLNTQHKTLAIYIFMRAAV
Subjt:  LAGLKRRKTLASLGKKGVITNRDAISMALKETLRYGLFLGTFAGTFVSIDEIIGNLAGHRRTARWRALLAGALAGPSMLLTGLNTQHKTLAIYIFMRAAV

Query:  LASRCGIKSKRLGHICKPLTWSCGDTFLMCLSSSQILSAYVLKQDSLPPSFRSFLNTHGGKDTVILEGLKSFVSGMPSSNKFKAIEKYYSAMGSAVKLDP
        LASRCGIKSKRLGHICKPLTWS GD FLMCLSSSQILSAYVLKQDSLPPSFRSFLNTHGGKDTVILEGLKSFVSGMPSS+KFKAIEKYYSAMG+AVKLDP
Subjt:  LASRCGIKSKRLGHICKPLTWSCGDTFLMCLSSSQILSAYVLKQDSLPPSFRSFLNTHGGKDTVILEGLKSFVSGMPSSNKFKAIEKYYSAMGSAVKLDP

Query:  QMKTPCTIIHGNQSCGGHFLSFLIQGYKRALPVYLPVYLIPALIVHREGLMNRPYEILARGLLGTARSSLFLSAYCASAW------ARAFVPSLIKLRLF
        QMKTPCTIIHGNQSCGGHFLSFLIQGYKRALPVYLPVYLIPALIVHREGLMNRPYEILARGLLGTARSSLFLSAYCASAW      +R F    I L   
Subjt:  QMKTPCTIIHGNQSCGGHFLSFLIQGYKRALPVYLPVYLIPALIVHREGLMNRPYEILARGLLGTARSSLFLSAYCASAW------ARAFVPSLIKLRLF

Query:  EQFLTGLALAIEKKSRRIEISLYCLSRGIESFFSCMTDLGYLPPSLNFKRADVIVFSISTSIIMHCYAQEREVFRSKYLNVLDWVFGVPPPPCETPRCKN
          FLTGLALAIEKKSRRIEISLYCL+RGIESFFSCMTDLGYLPPSLNFKRADVIVFSISTSIIMHCYAQEREVFRSKYLNVLDWVFGVPPPPCETPRCKN
Subjt:  EQFLTGLALAIEKKSRRIEISLYCLSRGIESFFSCMTDLGYLPPSLNFKRADVIVFSISTSIIMHCYAQEREVFRSKYLNVLDWVFGVPPPPCETPRCKN

Query:  GKK
        GKK
Subjt:  GKK

A0A5A7VHD2 Uncharacterized protein3.2e-26894.23Show/hide
Query:  MSPSAGGCFDADGGCACLAQQNGDAETAANCKSGDSFCEHCSYGSTDSSSFPSFSCSSSSLWLDSTRLREYGKLSRILVASAKGFTIGAGLKGGLSLFSV
        MSPSAGGCFDA+GGCACLAQQNGDAETAANCKSGDS+CEHC YGS DSSSFPSFSCSSSSLWLDSTRLREYGKL RILVASAKGFTIGAGLKGGLSLFSV
Subjt:  MSPSAGGCFDADGGCACLAQQNGDAETAANCKSGDSFCEHCSYGSTDSSSFPSFSCSSSSLWLDSTRLREYGKLSRILVASAKGFTIGAGLKGGLSLFSV

Query:  LAGLKRRKTLASLGKKGVITNRDAISMALKETLRYGLFLGTFAGTFVSIDEIIGNLAGHRRTARWRALLAGALAGPSMLLTGLNTQHKTLAIYIFMRAAV
        LAGLKRRK LASLGKKGVITNRDAISMALKETLRYGLFLGTFAGTFVSIDEIIGNLAGHRRTARWRALLAGALAGPSMLLTGLNTQHKTLAIYIFMRAAV
Subjt:  LAGLKRRKTLASLGKKGVITNRDAISMALKETLRYGLFLGTFAGTFVSIDEIIGNLAGHRRTARWRALLAGALAGPSMLLTGLNTQHKTLAIYIFMRAAV

Query:  LASRCGIKSKRLGHICKPLTWSCGDTFLMCLSSSQILSAYVLKQDSLPPSFRSFLNTHGGKDTVILEGLKSFVSGMPSSNKFKAIEKYYSAMGSAVKLDP
        LASRCGIKSKRLGHICKPLTWS GD FLMCLSSSQILSAYVLKQDSLPPSFRSFLNTHGGKDTVILEGLKSFVSGMPSS+KFKAIEKYYSAMG+AVKLDP
Subjt:  LASRCGIKSKRLGHICKPLTWSCGDTFLMCLSSSQILSAYVLKQDSLPPSFRSFLNTHGGKDTVILEGLKSFVSGMPSSNKFKAIEKYYSAMGSAVKLDP

Query:  QMKTPCTIIHGNQSCGGHFLSFLIQGYKRALPVYLPVYLIPALIVHREGLMNRPYEILARGLLGTARSSLFLSAYCASAW------ARAFVPSLIKLRLF
        QMKTPCTIIHGNQSCGGHFLSFLIQGYKRALPVYLPVYLIPALIVHREGLMNRPYEILARGLLGTARSSLFLSAYCASAW      +R F    I L   
Subjt:  QMKTPCTIIHGNQSCGGHFLSFLIQGYKRALPVYLPVYLIPALIVHREGLMNRPYEILARGLLGTARSSLFLSAYCASAW------ARAFVPSLIKLRLF

Query:  EQFLTGLALAIEKKSRRIEISLYCLSRGIESFFSCMTDLGYLPPSLNFKRADVIVFSISTSIIMHCYAQEREVFRSKYLNVLDWVFGVPPPPCETPRCKN
          FLTGLALAIEKKSRRIEISLYCL+RGIESFFSCMTDLGYLPPSLNFKRADVIVFSISTSIIMHCYAQEREVFRSKYLNVLDWVFGVPPPPCETPRCKN
Subjt:  EQFLTGLALAIEKKSRRIEISLYCLSRGIESFFSCMTDLGYLPPSLNFKRADVIVFSISTSIIMHCYAQEREVFRSKYLNVLDWVFGVPPPPCETPRCKN

Query:  GKK
        GKK
Subjt:  GKK

A0A6J1BRD3 uncharacterized protein LOC111004892 isoform X17.1e-24486.08Show/hide
Query:  MSPSAGGCFDADGGCACLAQQNGDAETAANCKSGDSFCEHCSYGSTDSSSFPSFSCSSSSLWLDSTRLREYGKLSRILVASAKGFTIGAGLKGGLSLFSV
        MSPSA   FDADGGCAC+A+QNGD E+A NCKSG+S+C+HC  GS DSSS P+FSCSSSSLW DS RLRE GKL RILVASAKGFTIGAGLKGGLSLFS+
Subjt:  MSPSAGGCFDADGGCACLAQQNGDAETAANCKSGDSFCEHCSYGSTDSSSFPSFSCSSSSLWLDSTRLREYGKLSRILVASAKGFTIGAGLKGGLSLFSV

Query:  LAGLKRRKTLASLGKKGVITNRDAISMALKETLRYGLFLGTFAGTFVSIDEIIGNLAGHRRTARWRALLAGALAGPSMLLTGLNTQHKTLAIYIFMRAAV
        LAGLKRRK LASL KKGVITNRDAISMALKETLRYGLFLGTFAGTFVSIDEIIGNLAGHRRTARWRALLAGALAGPSMLLTGLNTQHK+LAIYIFMRAAV
Subjt:  LAGLKRRKTLASLGKKGVITNRDAISMALKETLRYGLFLGTFAGTFVSIDEIIGNLAGHRRTARWRALLAGALAGPSMLLTGLNTQHKTLAIYIFMRAAV

Query:  LASRCGIKSKRLGHICKPLTWSCGDTFLMCLSSSQILSAYVLKQDSLPPSFRSFLNTHGGKDTVILEGLKSFVSGMPSSNKFKAIEKYYSAMGSAVKLDP
        LASRCGIKSK+LGHICKPLTWS GD FLMCLSSSQILSAYVLKQDSLPPSFRSFLNTHGGKDTVILEGLKSFVSGMPSSNKF+ IEKYY AMG+  KLD 
Subjt:  LASRCGIKSKRLGHICKPLTWSCGDTFLMCLSSSQILSAYVLKQDSLPPSFRSFLNTHGGKDTVILEGLKSFVSGMPSSNKFKAIEKYYSAMGSAVKLDP

Query:  QMKTPCTIIHGNQSCGGHFLSFLIQGYKRALPVYLPVYLIPALIVHREGLMNRPYEILARGLLGTARSSLFLSAYCASAW------ARAFVPSLIKLRLF
        QMKTPCTIIHGNQSCGGHF+SFLIQGYKRALPVYLPVYL+PALIVHR+ L+NRP EILARGLLGTARSSLFLS YC+SAW      +R F    I L   
Subjt:  QMKTPCTIIHGNQSCGGHFLSFLIQGYKRALPVYLPVYLIPALIVHREGLMNRPYEILARGLLGTARSSLFLSAYCASAW------ARAFVPSLIKLRLF

Query:  EQFLTGLALAIEKKSRRIEISLYCLSRGIESFFSCMTDLGYLPPSLNFKRADVIVFSISTSIIMHCYAQEREVFRSKYLNVLDWVFGVPPPPCETPRCKN
          FLTGLALAIEKKSRRIEISLYCL+RGIESFF C TD GYLP SLNFKRADVIVFS+ST+IIMHCYAQEREVFRSKYLNVLDWVFGVPPPPCETPRC N
Subjt:  EQFLTGLALAIEKKSRRIEISLYCLSRGIESFFSCMTDLGYLPPSLNFKRADVIVFSISTSIIMHCYAQEREVFRSKYLNVLDWVFGVPPPPCETPRCKN

Query:  GKK
         K+
Subjt:  GKK

A0A6J1HFI3 uncharacterized protein LOC111462580 isoform X11.2e-24687.28Show/hide
Query:  MSPSAGGCFDADGGCACLAQQNGDAETAANCKSGDSFCEHCSYGSTDSSSFPSFSCSSSSLWLDSTRLREYGKLSRILVASAKGFTIGAGLKGGLSLFSV
        MSPSA GCF ADGGCACLA++NGD     NCKS DS+C+HC  GS D SS P FSCSSSSLW DSTRL E GKL RILVASAKGFTIGAGLKGGLSLFSV
Subjt:  MSPSAGGCFDADGGCACLAQQNGDAETAANCKSGDSFCEHCSYGSTDSSSFPSFSCSSSSLWLDSTRLREYGKLSRILVASAKGFTIGAGLKGGLSLFSV

Query:  LAGLKRRKTLASLGKKGVITNRDAISMALKETLRYGLFLGTFAGTFVSIDEIIGNLAGHRRTARWRALLAGALAGPSMLLTGLNTQHKTLAIYIFMRAAV
        LAGLKRRK LASLGKKGVITNRDAISMALKETLRYGLFLGTFAGTFVSIDEIIGNLAGHRRTARWRAL+AGALAGPSMLLTGLNTQHKTLAIYIFMRAAV
Subjt:  LAGLKRRKTLASLGKKGVITNRDAISMALKETLRYGLFLGTFAGTFVSIDEIIGNLAGHRRTARWRALLAGALAGPSMLLTGLNTQHKTLAIYIFMRAAV

Query:  LASRCGIKSKRLGHICKPLTWSCGDTFLMCLSSSQILSAYVLKQDSLPPSFRSFLNTHGGKDTVILEGLKSFVSGMPSSNKFKAIEKYYSAMGSAVKLDP
        LASRCGI+SKRLGHICKPLTWS GD FLMCLSSSQILSAYVLKQDSLPPSFRSFLNTHGGKDTVILEGLKSF+ GMP SNKF AIEKYY   G+ VKLDP
Subjt:  LASRCGIKSKRLGHICKPLTWSCGDTFLMCLSSSQILSAYVLKQDSLPPSFRSFLNTHGGKDTVILEGLKSFVSGMPSSNKFKAIEKYYSAMGSAVKLDP

Query:  QMKTPCTIIHGNQSCGGHFLSFLIQGYKRALPVYLPVYLIPALIVHREGLMNRPYEILARGLLGTARSSLFLSAYCASAW------ARAFVPSLIKLRLF
         MKTPCTIIHGNQSCGGHFLSF+I+GYKRALPVYLPVYLIPALIVHR+GLMNRPYEILARGLLGTARSSLFLS YCASAW      AR F    + L   
Subjt:  QMKTPCTIIHGNQSCGGHFLSFLIQGYKRALPVYLPVYLIPALIVHREGLMNRPYEILARGLLGTARSSLFLSAYCASAW------ARAFVPSLIKLRLF

Query:  EQFLTGLALAIEKKSRRIEISLYCLSRGIESFFSCMTDLGYLPPSLNFKRADVIVFSISTSIIMHCYAQEREVFRSKYLNVLDWVFGVPPPPCETPRCKN
          FLTGLALAIEKKSRRIEISLYCL+RGIESFFSCMTDLGYLP SLNFKRADVIVFS+STSIIMHCYAQER VFRSKYLNVLDWVFGVPPPPCETPRCKN
Subjt:  EQFLTGLALAIEKKSRRIEISLYCLSRGIESFFSCMTDLGYLPPSLNFKRADVIVFSISTSIIMHCYAQEREVFRSKYLNVLDWVFGVPPPPCETPRCKN

Query:  GKK
        GK+
Subjt:  GKK

SwissProt top hitse value%identityAlignment
Q6GQ39 Transmembrane protein 1351.9e-0425.16Show/hide
Query:  NQSCGGHFLSFLIQGYKRALPVYLPVYLIPALIVHREGLMNRPYEILARGLLG----TARSSLFLSAYCASA--------WARAFVPSLIKLRLFEQFLT
        N SC G +L       + +  +Y P+YL+ A I+ R+ L    +++L   L      TA  SL+++ +C           W   F  +L          +
Subjt:  NQSCGGHFLSFLIQGYKRALPVYLPVYLIPALIVHREGLMNRPYEILARGLLG----TARSSLFLSAYCASA--------WARAFVPSLIKLRLFEQFLT

Query:  GLALAIEKKSRRIEISLYCLSRGIESFFSCMTDLGYLPPSLNFKRADVIVFSISTSIIM
          A+ IE+KSRR  +++Y  ++  E+ F      GY+ P    +  +V++F I++++ M
Subjt:  GLALAIEKKSRRIEISLYCLSRGIESFFSCMTDLGYLPPSLNFKRADVIVFSISTSIIM

Q95QD1 Transmembrane protein 135 homolog8.7e-0532.94Show/hide
Query:  FLTGLALAIEKKSRRIEISLYCLSRGIESFFSCMTDLGYLPPSLNFKRADVIVFSISTSIIMHCYAQEREVFRSKYLNVLDWVFG
        F  GLA         + I++YCL + IE+ +  + D GYLP    FK  +VI+++I+T  ++     E    R  YLN L  + G
Subjt:  FLTGLALAIEKKSRRIEISLYCLSRGIESFFSCMTDLGYLPPSLNFKRADVIVFSISTSIIMHCYAQEREVFRSKYLNVLDWVFG

Arabidopsis top hitse value%identityAlignment
AT1G34630.1 BEST Arabidopsis thaliana protein match is: Mitochondrial import inner membrane translocase subunit Tim17/Tim22/Tim23 family protein (TAIR:AT5G51150.1)3.2e-16462.03Show/hide
Query:  AQQNGDAETAANCKSGDSFCEHCSYGSTDSSSFPSFSCSSSSLWLDSTRLREYGKLSRILVASAKGFTIGAGLKGGLSLFSVLAGLKRRKTLASLGKK-G
        + ++ +    ++C S D+  +   +G +D   F    C +S     S  + +  KL RI+VAS KGFTIG GLKGGL++FS++A   RR+  +   +K G
Subjt:  AQQNGDAETAANCKSGDSFCEHCSYGSTDSSSFPSFSCSSSSLWLDSTRLREYGKLSRILVASAKGFTIGAGLKGGLSLFSVLAGLKRRKTLASLGKK-G

Query:  VITNRDAISMALKETLRYGLFLGTFAGTFVSIDEIIGNLAGHRRTARWRALLAGALAGPSMLLTGLNTQHKTLAIYIFMRAAVLASRCGIKSKRLGHICK
          +N +AI+M +KETLRYGLFLGTFAGTFVS+DE I  LAG +RTA+WRAL AG +AGPSMLLTG NTQH +LA+YI MRAAVLASRCGIKSKR G ICK
Subjt:  VITNRDAISMALKETLRYGLFLGTFAGTFVSIDEIIGNLAGHRRTARWRALLAGALAGPSMLLTGLNTQHKTLAIYIFMRAAVLASRCGIKSKRLGHICK

Query:  PLTWSCGDTFLMCLSSSQILSAYVLKQDSLPPSFRSFLNTHGGKDTVILEGLKSFVSGMPSSNKFKAIEKYYSAMGSAVKLDPQMKTPCTIIHGNQSCGG
        PLTW  GD FLMCLSSSQILSAY+LKQ+SLP S++SFLN  GGKD  IL+G+K   +  P +N  +AIEKYY ++G  +KLDP MK PCTIIHGN+SC  
Subjt:  PLTWSCGDTFLMCLSSSQILSAYVLKQDSLPPSFRSFLNTHGGKDTVILEGLKSFVSGMPSSNKFKAIEKYYSAMGSAVKLDPQMKTPCTIIHGNQSCGG

Query:  HFLSFLIQGYKRALPVYLPVYLIPALIVHREGLMNRPYEILARGLLGTARSSLFLSAYCASAWA------RAFVPSLIKLRLFEQFLTGLALAIEKKSRR
        H ++F +Q YKRALPVY+PVYLIPALIVHR+ L+ + Y IL +GLLGTARSSLFL+ YC+SAWA      R F    I L     F TGLALAIEKKSRR
Subjt:  HFLSFLIQGYKRALPVYLPVYLIPALIVHREGLMNRPYEILARGLLGTARSSLFLSAYCASAWA------RAFVPSLIKLRLFEQFLTGLALAIEKKSRR

Query:  IEISLYCLSRGIESFFSCMTDLGYLPPSLNFKRADVIVFSISTSIIMHCYAQEREVFRSKYLNVLDWVFGVPPPP----CET
        IEISLYCL+R IESFF+CMT+ GY+ P  + +RADV+VFS+ST+IIMHCYAQER+VFRSKYLNVLDWVFGVPPPP    CET
Subjt:  IEISLYCLSRGIESFFSCMTDLGYLPPSLNFKRADVIVFSISTSIIMHCYAQEREVFRSKYLNVLDWVFGVPPPP----CET

AT1G34630.2 FUNCTIONS IN: molecular_function unknown7.0e-11956.36Show/hide
Query:  AQQNGDAETAANCKSGDSFCEHCSYGSTDSSSFPSFSCSSSSLWLDSTRLREYGKLSRILVASAKGFTIGAGLKGGLSLFSVLAGLKRRKTLASLGKK-G
        + ++ +    ++C S D+  +   +G +D   F    C +S     S  + +  KL RI+VAS KGFTIG GLKGGL++FS++A   RR+  +   +K G
Subjt:  AQQNGDAETAANCKSGDSFCEHCSYGSTDSSSFPSFSCSSSSLWLDSTRLREYGKLSRILVASAKGFTIGAGLKGGLSLFSVLAGLKRRKTLASLGKK-G

Query:  VITNRDAISMALKETLRYGLFLGTFAGTFVSIDEIIGNLAGHRRTARWRALLAGALAGPSMLLTGLNTQHKTLAIYIFMRAAVLASRCGIKSKRLGHICK
          +N +AI+M +KETLRYGLFLGTFAGTFVS+DE I  LAG +RTA+WRAL AG +AGPSMLLTG NTQH +LA+YI MRAAVLASRCGIKSKR G ICK
Subjt:  VITNRDAISMALKETLRYGLFLGTFAGTFVSIDEIIGNLAGHRRTARWRALLAGALAGPSMLLTGLNTQHKTLAIYIFMRAAVLASRCGIKSKRLGHICK

Query:  PLTWSCGDTFLMCLSSSQILSAYVLKQDSLPPSFRSFLNTHGGKDTVILEGLKSFVSGMPSSNKFKAIEKYYSAMGSAVKLDPQMKTPCTIIHGNQSCGG
        PLTW  GD FLMCLSSSQILSAY+LKQ+SLP S++SFLN  GGKD  IL+G+K   +  P +N  +AIEKYY ++G  +KLDP MK PCTIIHGN+SC  
Subjt:  PLTWSCGDTFLMCLSSSQILSAYVLKQDSLPPSFRSFLNTHGGKDTVILEGLKSFVSGMPSSNKFKAIEKYYSAMGSAVKLDPQMKTPCTIIHGNQSCGG

Query:  HFLSFLIQGYKRALPVYLPVYLIPALIVHREGLMNRPYEILARGLLGTARSSLFLSAYCASAWARAFVPSLIKLRLFEQFLTGLALAIEKKSRRIEISLY
        H ++F +Q YKRALPVY+PVYLIPALIVHR+ L+ + Y IL +GLLGTARSSLFL+ YC+SAWA     + +  R FE     L      +++ I I+ +
Subjt:  HFLSFLIQGYKRALPVYLPVYLIPALIVHREGLMNRPYEILARGLLGTARSSLFLSAYCASAWARAFVPSLIKLRLFEQFLTGLALAIEKKSRRIEISLY

Query:  C
        C
Subjt:  C

AT5G51150.1 Mitochondrial import inner membrane translocase subunit Tim17/Tim22/Tim23 family protein5.4e-2625.98Show/hide
Query:  KGFTIGAGLKGGLSLFSVLAGLKRRKTLAS-LGKKGVITNRDAISMALKETLRYGLFLGTFAGTFVSIDEIIGNLAGHRRTARWRALLAGALAGPSMLLT
        + F +  G++ G+ +      L R ++ +S L  K +++ +D I    +E  R GL  G F G++ ++   +      ++     ++LAG++AG S+L  
Subjt:  KGFTIGAGLKGGLSLFSVLAGLKRRKTLAS-LGKKGVITNRDAISMALKETLRYGLFLGTFAGTFVSIDEIIGNLAGHRRTARWRALLAGALAGPSMLLT

Query:  GLNTQHKTLAIYIFMRAAVLASRCGIKSKRLGHICKPLTWSCGDTFLMCLSSSQILSAYVLKQDSLPPSFRSFLNTHGGKDTVILEGLKSFVSGMPSSNK
          + Q +TLA+Y+  R    A     KSK   H+     W  GD+ L  L+ +Q++ +++++ ++LP S+R F+   G     + + ++    G P    
Subjt:  GLNTQHKTLAIYIFMRAAVLASRCGIKSKRLGHICKPLTWSCGDTFLMCLSSSQILSAYVLKQDSLPPSFRSFLNTHGGKDTVILEGLKSFVSGMPSSNK

Query:  FKAIEKYYSAMGSA--VKLDPQMK-TPCTIIHGN-QSCGGHFLSFLIQGYKRALPVYLPVYLIPALIVHREGLMNRPYEILARGLLGTARSSLFLSAYCA
          ++  Y S+   A  VK++      PC  IH N  SC     + +   +K+  P+Y  +  +P +++H +  M  PY      +  + RS+ FLSA+  
Subjt:  FKAIEKYYSAMGSA--VKLDPQMK-TPCTIIHGN-QSCGGHFLSFLIQGYKRALPVYLPVYLIPALIVHREGLMNRPYEILARGLLGTARSSLFLSAYCA

Query:  SAWARAFVPSLIKL-----RLFEQFLTG---LALAIEKKSRRIEISLYCLSRGIESFFSCMTDLGYLPPSLNFKRADVIVF
            +AF+ +  K+     +L   F  G   L++ +EKK RR E++LY L R  +S +  + +   LP   + K A+V +F
Subjt:  SAWARAFVPSLIKL-----RLFEQFLTG---LALAIEKKSRRIEISLYCLSRGIESFFSCMTDLGYLPPSLNFKRADVIVF


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCGCCGTCTGCTGGCGGATGCTTCGACGCTGACGGCGGATGTGCATGCCTTGCTCAACAGAATGGGGATGCCGAAACAGCTGCCAACTGTAAATCCGGTGAT
TCCTTTTGCGAGCATTGTAGCTATGGTTCAACTGATTCATCATCTTTTCCGTCGTTTTCCTGCTCTTCCTCATCTCTATGGCTTGATTCGACGCGGTTAAGAGAG
TACGGGAAGCTATCGCGGATCCTTGTTGCTTCTGCTAAAGGCTTCACAATTGGAGCTGGACTCAAAGGTGGTCTCTCTCTCTTTTCTGTCCTCGCTGGATTGAAG
CGGAGAAAGACTTTGGCCTCCCTCGGGAAGAAAGGAGTGATTACGAATCGAGATGCGATTTCCATGGCTTTGAAGGAGACTTTGAGATACGGCCTTTTTCTTGGA
ACCTTTGCCGGTACGTTCGTTTCCATTGATGAGATAATTGGCAATCTGGCAGGTCACCGTAGGACTGCAAGATGGCGGGCTCTATTGGCGGGAGCATTAGCTGGG
CCGTCGATGCTTCTGACTGGGTTAAATACGCAACATAAGACCTTGGCTATCTACATTTTCATGCGTGCTGCGGTCTTGGCATCGCGCTGTGGGATTAAAAGCAAG
AGGCTCGGGCATATTTGTAAGCCTCTCACGTGGTCATGTGGTGACACCTTCCTCATGTGTCTCTCCTCTTCGCAGATCTTGTCTGCTTATGTGTTAAAGCAAGAT
AGCTTACCACCATCGTTTAGATCCTTTCTCAATACACACGGTGGAAAGGATACTGTAATCTTGGAAGGCTTAAAGAGTTTCGTATCAGGCATGCCTTCCTCCAAT
AAATTTAAAGCAATAGAGAAGTACTACAGTGCCATGGGTTCGGCCGTCAAATTAGATCCGCAAATGAAGACTCCATGCACGATTATACATGGAAATCAATCATGC
GGTGGCCATTTTCTTTCCTTTCTCATTCAAGGATATAAAAGAGCGTTGCCAGTATACCTCCCTGTTTATCTTATCCCAGCTCTAATAGTTCATCGTGAAGGTCTC
ATGAATAGGCCATACGAAATTTTAGCTAGGGGGCTTCTTGGAACTGCTAGATCAAGTCTGTTCCTCTCTGCATATTGTGCATCTGCTTGGGCAAGAGCTTTCGTA
CCAAGCCTGATAAAACTACGTCTTTTTGAGCAGTTCTTAACAGGTTTGGCACTAGCCATTGAGAAGAAAAGCAGGAGGATAGAAATCTCACTCTATTGCCTCTCT
AGAGGTATCGAGAGCTTCTTCAGCTGCATGACAGATTTGGGATACTTGCCACCATCATTGAATTTCAAACGAGCAGATGTGATAGTTTTCAGCATATCAACTTCC
ATTATAATGCATTGCTATGCTCAGGAAAGGGAGGTATTTCGATCCAAGTATCTTAACGTTCTCGATTGGGTTTTTGGTGTGCCTCCTCCCCCTTGTGAAACACCA
CGCTGCAAGAATGGCAAGAAGTGTTGA
mRNA sequenceShow/hide mRNA sequence
ATGTCGCCGTCTGCTGGCGGATGCTTCGACGCTGACGGCGGATGTGCATGCCTTGCTCAACAGAATGGGGATGCCGAAACAGCTGCCAACTGTAAATCCGGTGAT
TCCTTTTGCGAGCATTGTAGCTATGGTTCAACTGATTCATCATCTTTTCCGTCGTTTTCCTGCTCTTCCTCATCTCTATGGCTTGATTCGACGCGGTTAAGAGAG
TACGGGAAGCTATCGCGGATCCTTGTTGCTTCTGCTAAAGGCTTCACAATTGGAGCTGGACTCAAAGGTGGTCTCTCTCTCTTTTCTGTCCTCGCTGGATTGAAG
CGGAGAAAGACTTTGGCCTCCCTCGGGAAGAAAGGAGTGATTACGAATCGAGATGCGATTTCCATGGCTTTGAAGGAGACTTTGAGATACGGCCTTTTTCTTGGA
ACCTTTGCCGGTACGTTCGTTTCCATTGATGAGATAATTGGCAATCTGGCAGGTCACCGTAGGACTGCAAGATGGCGGGCTCTATTGGCGGGAGCATTAGCTGGG
CCGTCGATGCTTCTGACTGGGTTAAATACGCAACATAAGACCTTGGCTATCTACATTTTCATGCGTGCTGCGGTCTTGGCATCGCGCTGTGGGATTAAAAGCAAG
AGGCTCGGGCATATTTGTAAGCCTCTCACGTGGTCATGTGGTGACACCTTCCTCATGTGTCTCTCCTCTTCGCAGATCTTGTCTGCTTATGTGTTAAAGCAAGAT
AGCTTACCACCATCGTTTAGATCCTTTCTCAATACACACGGTGGAAAGGATACTGTAATCTTGGAAGGCTTAAAGAGTTTCGTATCAGGCATGCCTTCCTCCAAT
AAATTTAAAGCAATAGAGAAGTACTACAGTGCCATGGGTTCGGCCGTCAAATTAGATCCGCAAATGAAGACTCCATGCACGATTATACATGGAAATCAATCATGC
GGTGGCCATTTTCTTTCCTTTCTCATTCAAGGATATAAAAGAGCGTTGCCAGTATACCTCCCTGTTTATCTTATCCCAGCTCTAATAGTTCATCGTGAAGGTCTC
ATGAATAGGCCATACGAAATTTTAGCTAGGGGGCTTCTTGGAACTGCTAGATCAAGTCTGTTCCTCTCTGCATATTGTGCATCTGCTTGGGCAAGAGCTTTCGTA
CCAAGCCTGATAAAACTACGTCTTTTTGAGCAGTTCTTAACAGGTTTGGCACTAGCCATTGAGAAGAAAAGCAGGAGGATAGAAATCTCACTCTATTGCCTCTCT
AGAGGTATCGAGAGCTTCTTCAGCTGCATGACAGATTTGGGATACTTGCCACCATCATTGAATTTCAAACGAGCAGATGTGATAGTTTTCAGCATATCAACTTCC
ATTATAATGCATTGCTATGCTCAGGAAAGGGAGGTATTTCGATCCAAGTATCTTAACGTTCTCGATTGGGTTTTTGGTGTGCCTCCTCCCCCTTGTGAAACACCA
CGCTGCAAGAATGGCAAGAAGTGTTGA
Protein sequenceShow/hide protein sequence
MSPSAGGCFDADGGCACLAQQNGDAETAANCKSGDSFCEHCSYGSTDSSSFPSFSCSSSSLWLDSTRLREYGKLSRILVASAKGFTIGAGLKGGLSLFSVLAGLK
RRKTLASLGKKGVITNRDAISMALKETLRYGLFLGTFAGTFVSIDEIIGNLAGHRRTARWRALLAGALAGPSMLLTGLNTQHKTLAIYIFMRAAVLASRCGIKSK
RLGHICKPLTWSCGDTFLMCLSSSQILSAYVLKQDSLPPSFRSFLNTHGGKDTVILEGLKSFVSGMPSSNKFKAIEKYYSAMGSAVKLDPQMKTPCTIIHGNQSC
GGHFLSFLIQGYKRALPVYLPVYLIPALIVHREGLMNRPYEILARGLLGTARSSLFLSAYCASAWARAFVPSLIKLRLFEQFLTGLALAIEKKSRRIEISLYCLS
RGIESFFSCMTDLGYLPPSLNFKRADVIVFSISTSIIMHCYAQEREVFRSKYLNVLDWVFGVPPPPCETPRCKNGKKC