; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10004443 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10004443
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionTMEM135_C_rich domain-containing protein
Genome locationChr08:17138997..17144333
RNA-Seq ExpressionHG10004443
SyntenyHG10004443
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR026749 - Transmembrane protein 135


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008444833.1 PREDICTED: uncharacterized protein LOC103488060 [Cucumis melo]1.3e-26092.46Show/hide
Query:  MSPSAGGGFDADGGCACLAQQNGDAEAAANSKSGDSYCEHC--GSADSSSLPAFSCSSSSLWLDSTRLRESGKLWRILVASAKGFTIGAGLKGGLSLFSV
        MSPSAGG FDA+GGCACLAQQNGDAE AAN KSGDSYCEHC  GSADSSS P+FSCSSSSLWLDSTRLRE GKLWRILVASAKGFTIGAGLKGGLSLFSV
Subjt:  MSPSAGGGFDADGGCACLAQQNGDAEAAANSKSGDSYCEHC--GSADSSSLPAFSCSSSSLWLDSTRLRESGKLWRILVASAKGFTIGAGLKGGLSLFSV

Query:  LAGLKRRKALASLGKKGVITNRDAISMALKETLRYGLFLGTFAGTFVSIDEIIGNLAGHRRTARWRALLAGALAGPSMLLTGLNTQHKTLAIYIFMRAAV
        LAGLKRRKALASLGKKGVITNRDAISMALKETLRYGLFLGTFAGTFVSIDEIIGNLAGHRRTARWRALLAGALAGPSMLLTGLNTQHKTLAIYIFMRAAV
Subjt:  LAGLKRRKALASLGKKGVITNRDAISMALKETLRYGLFLGTFAGTFVSIDEIIGNLAGHRRTARWRALLAGALAGPSMLLTGLNTQHKTLAIYIFMRAAV

Query:  LASRCGIKSKRLGHICKPLTWSYGDIFLMCVSSSQILSAYVLKQDSLPPSFRSFLNTHGGKDTVILEGLRSFVSGMPSSIKFKAIEKYYSAMGAGVKLDP
        LASRCGIKSKRLGHICKPLTWSYGDIFLMC+SSSQILSAYVLKQDSLPPSFRSFLNTHGGKDTVILEGL+SFVSGMPSS KFKAIEKYYSAMGA VKLDP
Subjt:  LASRCGIKSKRLGHICKPLTWSYGDIFLMCVSSSQILSAYVLKQDSLPPSFRSFLNTHGGKDTVILEGLRSFVSGMPSSIKFKAIEKYYSAMGAGVKLDP

Query:  QMKTPCT------------------GYKRALPVYLPVYLVPALIVHREGLMNGPYEILARGLLGTARSSLFLSAYCASAWMWTCLTARTFKKINIPLVAV
        QMKTPCT                  GYKRALPVYLPVYL+PALIVHREGLMN PYEILARGLLGTARSSLFLSAYCASAWMWTCLT+RTFKKINIPLVA+
Subjt:  QMKTPCT------------------GYKRALPVYLPVYLVPALIVHREGLMNGPYEILARGLLGTARSSLFLSAYCASAWMWTCLTARTFKKINIPLVAV

Query:  ATFLTGLALAIEKKSRRIEISLYCLARGIESFFSCMTDLGYLPQSLNFKRADVIVFSISTSIIMHCYAQEREVFRSKYLNVLDWVFGVPPPPCETPRCKN
        ATFLTGLALAIEKKSRRIEISLYCLARGIESFFSCMTDLGYLP SLNFKRADVIVFSISTSIIMHCYAQEREVFRSKYLNVLDWVFGVPPPPCETPRCKN
Subjt:  ATFLTGLALAIEKKSRRIEISLYCLARGIESFFSCMTDLGYLPQSLNFKRADVIVFSISTSIIMHCYAQEREVFRSKYLNVLDWVFGVPPPPCETPRCKN

Query:  SKQY
         K+Y
Subjt:  SKQY

XP_011649669.1 uncharacterized protein LOC101202879 isoform X1 [Cucumis sativus]9.5e-25691.25Show/hide
Query:  MSPSAGGGFDADGGCACLAQQNGDAEAAANSKSGDSYCEHC--GSADSSSLPAFSCSSSSLWLDSTRLRESGKLWRILVASAKGFTIGAGLKGGLSLFSV
        MSPSAGG FDADGGCACLAQQNGDAE AAN KSGDSYCEHC  GSADSSS P+FSCSSSSLWLDSTRLRE GKL RILVASAKGFTIGAGLKGGLSLFSV
Subjt:  MSPSAGGGFDADGGCACLAQQNGDAEAAANSKSGDSYCEHC--GSADSSSLPAFSCSSSSLWLDSTRLRESGKLWRILVASAKGFTIGAGLKGGLSLFSV

Query:  LAGLKRRKALASLGKKGVITNRDAISMALKETLRYGLFLGTFAGTFVSIDEIIGNLAGHRRTARWRALLAGALAGPSMLLTGLNTQHKTLAIYIFMRAAV
        LAGLKRRKALASLGKKGVITNRDAISMALKETLRYGLFLGTFAGTFVSIDEIIGN+AGHRRTARWRALLAGALAGPSMLLTGLNTQHKTLAIYIFMRAAV
Subjt:  LAGLKRRKALASLGKKGVITNRDAISMALKETLRYGLFLGTFAGTFVSIDEIIGNLAGHRRTARWRALLAGALAGPSMLLTGLNTQHKTLAIYIFMRAAV

Query:  LASRCGIKSKRLGHICKPLTWSYGDIFLMCVSSSQILSAYVLKQDSLPPSFRSFLNTHGGKDTVILEGLRSFVSGMPSSIKFKAIEKYYSAMGAGVKLDP
        LASRCGIKSKRLGHICKPLTWS GDIFLMC+SSSQILSAYVLKQDSLPPSFRSFLNTHGGKDTVILEGL+SFVSGMPSS KFKA+EKYYSAMG+ VKLDP
Subjt:  LASRCGIKSKRLGHICKPLTWSYGDIFLMCVSSSQILSAYVLKQDSLPPSFRSFLNTHGGKDTVILEGLRSFVSGMPSSIKFKAIEKYYSAMGAGVKLDP

Query:  QMKTPCT------------------GYKRALPVYLPVYLVPALIVHREGLMNGPYEILARGLLGTARSSLFLSAYCASAWMWTCLTARTFKKINIPLVAV
        QMKTPCT                  GYKRALPVYLPVYL+PALIVHREGLMN PYEILARGLLGTARSSLFLSAYCASAWMWTCLT+RTFKKINIPLVA+
Subjt:  QMKTPCT------------------GYKRALPVYLPVYLVPALIVHREGLMNGPYEILARGLLGTARSSLFLSAYCASAWMWTCLTARTFKKINIPLVAV

Query:  ATFLTGLALAIEKKSRRIEISLYCLARGIESFFSCMTDLGYLPQSLNFKRADVIVFSISTSIIMHCYAQEREVFRSKYLNVLDWVFGVPPPPCETPRCKN
        ATFLTGLALAIEKKSRRIEISLYCL+RGIESFFSCMTDLGYLP SLNFKRADVIVFSISTSIIMHCYAQEREVFRSKYLNVLDWVFGVPPPPCETPRCKN
Subjt:  ATFLTGLALAIEKKSRRIEISLYCLARGIESFFSCMTDLGYLPQSLNFKRADVIVFSISTSIIMHCYAQEREVFRSKYLNVLDWVFGVPPPPCETPRCKN

Query:  SKQ
          +
Subjt:  SKQ

XP_022131834.1 uncharacterized protein LOC111004892 isoform X1 [Momordica charantia]2.1e-24787.9Show/hide
Query:  MSPSAGGGFDADGGCACLAQQNGDAEAAANSKSGDSYCEH--CGSADSSSLPAFSCSSSSLWLDSTRLRESGKLWRILVASAKGFTIGAGLKGGLSLFSV
        MSPSA  GFDADGGCAC+A+QNGD E+A N KSG+SYC+H  CGSADSSSLPAFSCSSSSLW DS RLRESGKLWRILVASAKGFTIGAGLKGGLSLFS+
Subjt:  MSPSAGGGFDADGGCACLAQQNGDAEAAANSKSGDSYCEH--CGSADSSSLPAFSCSSSSLWLDSTRLRESGKLWRILVASAKGFTIGAGLKGGLSLFSV

Query:  LAGLKRRKALASLGKKGVITNRDAISMALKETLRYGLFLGTFAGTFVSIDEIIGNLAGHRRTARWRALLAGALAGPSMLLTGLNTQHKTLAIYIFMRAAV
        LAGLKRRKALASL KKGVITNRDAISMALKETLRYGLFLGTFAGTFVSIDEIIGNLAGHRRTARWRALLAGALAGPSMLLTGLNTQHK+LAIYIFMRAAV
Subjt:  LAGLKRRKALASLGKKGVITNRDAISMALKETLRYGLFLGTFAGTFVSIDEIIGNLAGHRRTARWRALLAGALAGPSMLLTGLNTQHKTLAIYIFMRAAV

Query:  LASRCGIKSKRLGHICKPLTWSYGDIFLMCVSSSQILSAYVLKQDSLPPSFRSFLNTHGGKDTVILEGLRSFVSGMPSSIKFKAIEKYYSAMGAGVKLDP
        LASRCGIKSK+LGHICKPLTWSYGDIFLMC+SSSQILSAYVLKQDSLPPSFRSFLNTHGGKDTVILEGL+SFVSGMPSS KF+ IEKYY AMGA  KLD 
Subjt:  LASRCGIKSKRLGHICKPLTWSYGDIFLMCVSSSQILSAYVLKQDSLPPSFRSFLNTHGGKDTVILEGLRSFVSGMPSSIKFKAIEKYYSAMGAGVKLDP

Query:  QMKTPCT------------------GYKRALPVYLPVYLVPALIVHREGLMNGPYEILARGLLGTARSSLFLSAYCASAWMWTCLTARTFKKINIPLVAV
        QMKTPCT                  GYKRALPVYLPVYLVPALIVHR+ L+N P EILARGLLGTARSSLFLS YC+SAWMWTCLT+RTFKKINIPLVA+
Subjt:  QMKTPCT------------------GYKRALPVYLPVYLVPALIVHREGLMNGPYEILARGLLGTARSSLFLSAYCASAWMWTCLTARTFKKINIPLVAV

Query:  ATFLTGLALAIEKKSRRIEISLYCLARGIESFFSCMTDLGYLPQSLNFKRADVIVFSISTSIIMHCYAQEREVFRSKYLNVLDWVFGVPPPPCETPRCKN
        ATFLTGLALAIEKKSRRIEISLYCLARGIESFF C TD GYLPQSLNFKRADVIVFS+ST+IIMHCYAQEREVFRSKYLNVLDWVFGVPPPPCETPRC N
Subjt:  ATFLTGLALAIEKKSRRIEISLYCLARGIESFFSCMTDLGYLPQSLNFKRADVIVFSISTSIIMHCYAQEREVFRSKYLNVLDWVFGVPPPPCETPRCKN

Query:  SKQY
        SKQ+
Subjt:  SKQY

XP_023545566.1 uncharacterized protein LOC111804956 isoform X1 [Cucurbita pepo subsp. pepo]1.9e-24888.27Show/hide
Query:  MSPSAGGGFDADGGCACLAQQNGDAEAAANSKSGDSYCEH--CGSADSSSLPAFSCSSSSLWLDSTRLRESGKLWRILVASAKGFTIGAGLKGGLSLFSV
        MSPSA G F ADGGCACLA++NGD  A  N KSGDSYC+H  CGSAD SSLPAFSCSSSSLW DS RLRESGKLWRILVASAKGFTIGAGLKGGLSLFSV
Subjt:  MSPSAGGGFDADGGCACLAQQNGDAEAAANSKSGDSYCEH--CGSADSSSLPAFSCSSSSLWLDSTRLRESGKLWRILVASAKGFTIGAGLKGGLSLFSV

Query:  LAGLKRRKALASLGKKGVITNRDAISMALKETLRYGLFLGTFAGTFVSIDEIIGNLAGHRRTARWRALLAGALAGPSMLLTGLNTQHKTLAIYIFMRAAV
        LAGLKRRKALASLGKKGVITNRDA+SMALKETLRYGLFLGTFAGTFVSIDEIIGNLAGHRRTARWRAL+AGALAGPSMLLTGLNTQHKTLAIYIFMRAAV
Subjt:  LAGLKRRKALASLGKKGVITNRDAISMALKETLRYGLFLGTFAGTFVSIDEIIGNLAGHRRTARWRALLAGALAGPSMLLTGLNTQHKTLAIYIFMRAAV

Query:  LASRCGIKSKRLGHICKPLTWSYGDIFLMCVSSSQILSAYVLKQDSLPPSFRSFLNTHGGKDTVILEGLRSFVSGMPSSIKFKAIEKYYSAMGAGVKLDP
        LASRCGI+SKRLGHICKPLTWSYGDIFLMC+SSSQILSAYVLKQDSLPPSFRSFLNTHGGKDTVILEGL+SF+ GMPS  KF AIEKYY   GA VKLDP
Subjt:  LASRCGIKSKRLGHICKPLTWSYGDIFLMCVSSSQILSAYVLKQDSLPPSFRSFLNTHGGKDTVILEGLRSFVSGMPSSIKFKAIEKYYSAMGAGVKLDP

Query:  QMKTPCT------------------GYKRALPVYLPVYLVPALIVHREGLMNGPYEILARGLLGTARSSLFLSAYCASAWMWTCLTARTFKKINIPLVAV
         MKTPCT                  GYKRALPVYLPVYL+PALIVHR+GLMN PYEILARGLLGTARSSLFLS YCASAW+WTCLTARTF+KIN+PLVAV
Subjt:  QMKTPCT------------------GYKRALPVYLPVYLVPALIVHREGLMNGPYEILARGLLGTARSSLFLSAYCASAWMWTCLTARTFKKINIPLVAV

Query:  ATFLTGLALAIEKKSRRIEISLYCLARGIESFFSCMTDLGYLPQSLNFKRADVIVFSISTSIIMHCYAQEREVFRSKYLNVLDWVFGVPPPPCETPRCKN
        ATFLTGLALAIEKKSRRIEISLYCLARGIESFFSCMTDLGYLP SLNFKRADVIVFS+STSIIMHCYAQEREVFRSKYLNVLDWVFGVPPPPCETPRCKN
Subjt:  ATFLTGLALAIEKKSRRIEISLYCLARGIESFFSCMTDLGYLPQSLNFKRADVIVFSISTSIIMHCYAQEREVFRSKYLNVLDWVFGVPPPPCETPRCKN

Query:  SKQ
         KQ
Subjt:  SKQ

XP_038886157.1 uncharacterized protein LOC120076412 isoform X1 [Benincasa hispida]3.1e-25490.67Show/hide
Query:  MSPSAGGGFDADGGCACLAQQNGDAEAAANSKSGDSYCEH--CGSADSSSLPAFSCSSSSLWLDSTRLRESGKLWRILVASAKGFTIGAGLKGGLSLFSV
        MSPSA G FDADGGC C A QNGDA+A ANSKSGDSYCEH  CGSADSSS P+FSCSSSSLWLDSTRLRESGKLWRILVASAKGFTIGAGLKGGLSLFSV
Subjt:  MSPSAGGGFDADGGCACLAQQNGDAEAAANSKSGDSYCEH--CGSADSSSLPAFSCSSSSLWLDSTRLRESGKLWRILVASAKGFTIGAGLKGGLSLFSV

Query:  LAGLKRRKALASLGKKGVITNRDAISMALKETLRYGLFLGTFAGTFVSIDEIIGNLAGHRRTARWRALLAGALAGPSMLLTGLNTQHKTLAIYIFMRAAV
        LAGLKRRKALASLGKKGVITNRDAISMALKETLRYGLFLGTFAGTFVSIDEI+GNL GHRRTA WRALLAGALAGPSMLLTGLNTQHKTLAIYIFMRAAV
Subjt:  LAGLKRRKALASLGKKGVITNRDAISMALKETLRYGLFLGTFAGTFVSIDEIIGNLAGHRRTARWRALLAGALAGPSMLLTGLNTQHKTLAIYIFMRAAV

Query:  LASRCGIKSKRLGHICKPLTWSYGDIFLMCVSSSQILSAYVLKQDSLPPSFRSFLNTHGGKDTVILEGLRSFVSGMPSSIKFKAIEKYYSAMGAGVKLDP
        LASRCGIKSKR GHICKPLTWSYGDIFLMC+SSSQILSAYVLKQDSLPPSFRSFLNTHGGKDTVILEGL+SFVSGMPSS KFKAIEKYYSAMGA V+L+ 
Subjt:  LASRCGIKSKRLGHICKPLTWSYGDIFLMCVSSSQILSAYVLKQDSLPPSFRSFLNTHGGKDTVILEGLRSFVSGMPSSIKFKAIEKYYSAMGAGVKLDP

Query:  QMKTPC------------------TGYKRALPVYLPVYLVPALIVHREGLMNGPYEILARGLLGTARSSLFLSAYCASAWMWTCLTARTFKKINIPLVAV
        QMKTPC                   GYKRALPVYLPVYL+PALIVHREGLMN PYEILARGLLGTARSSLFLSAYCASAWMWTCLTAR+FKKINIPLVAV
Subjt:  QMKTPC------------------TGYKRALPVYLPVYLVPALIVHREGLMNGPYEILARGLLGTARSSLFLSAYCASAWMWTCLTARTFKKINIPLVAV

Query:  ATFLTGLALAIEKKSRRIEISLYCLARGIESFFSCMTDLGYLPQSLNFKRADVIVFSISTSIIMHCYAQEREVFRSKYLNVLDWVFGVPPPPCETPRCKN
        ATFLTGLALAIEKKSRRIEISLYCLARGIESFFS MTDLGYLP SLNFKRADVIVFSISTSIIMHCYAQEREVFRSKYLNVLDWVFGVPPPPCETPRCKN
Subjt:  ATFLTGLALAIEKKSRRIEISLYCLARGIESFFSCMTDLGYLPQSLNFKRADVIVFSISTSIIMHCYAQEREVFRSKYLNVLDWVFGVPPPPCETPRCKN

Query:  SKQY
         KQY
Subjt:  SKQY

TrEMBL top hitse value%identityAlignment
A0A0A0LRQ1 Uncharacterized protein4.6e-25691.25Show/hide
Query:  MSPSAGGGFDADGGCACLAQQNGDAEAAANSKSGDSYCEHC--GSADSSSLPAFSCSSSSLWLDSTRLRESGKLWRILVASAKGFTIGAGLKGGLSLFSV
        MSPSAGG FDADGGCACLAQQNGDAE AAN KSGDSYCEHC  GSADSSS P+FSCSSSSLWLDSTRLRE GKL RILVASAKGFTIGAGLKGGLSLFSV
Subjt:  MSPSAGGGFDADGGCACLAQQNGDAEAAANSKSGDSYCEHC--GSADSSSLPAFSCSSSSLWLDSTRLRESGKLWRILVASAKGFTIGAGLKGGLSLFSV

Query:  LAGLKRRKALASLGKKGVITNRDAISMALKETLRYGLFLGTFAGTFVSIDEIIGNLAGHRRTARWRALLAGALAGPSMLLTGLNTQHKTLAIYIFMRAAV
        LAGLKRRKALASLGKKGVITNRDAISMALKETLRYGLFLGTFAGTFVSIDEIIGN+AGHRRTARWRALLAGALAGPSMLLTGLNTQHKTLAIYIFMRAAV
Subjt:  LAGLKRRKALASLGKKGVITNRDAISMALKETLRYGLFLGTFAGTFVSIDEIIGNLAGHRRTARWRALLAGALAGPSMLLTGLNTQHKTLAIYIFMRAAV

Query:  LASRCGIKSKRLGHICKPLTWSYGDIFLMCVSSSQILSAYVLKQDSLPPSFRSFLNTHGGKDTVILEGLRSFVSGMPSSIKFKAIEKYYSAMGAGVKLDP
        LASRCGIKSKRLGHICKPLTWS GDIFLMC+SSSQILSAYVLKQDSLPPSFRSFLNTHGGKDTVILEGL+SFVSGMPSS KFKA+EKYYSAMG+ VKLDP
Subjt:  LASRCGIKSKRLGHICKPLTWSYGDIFLMCVSSSQILSAYVLKQDSLPPSFRSFLNTHGGKDTVILEGLRSFVSGMPSSIKFKAIEKYYSAMGAGVKLDP

Query:  QMKTPCT------------------GYKRALPVYLPVYLVPALIVHREGLMNGPYEILARGLLGTARSSLFLSAYCASAWMWTCLTARTFKKINIPLVAV
        QMKTPCT                  GYKRALPVYLPVYL+PALIVHREGLMN PYEILARGLLGTARSSLFLSAYCASAWMWTCLT+RTFKKINIPLVA+
Subjt:  QMKTPCT------------------GYKRALPVYLPVYLVPALIVHREGLMNGPYEILARGLLGTARSSLFLSAYCASAWMWTCLTARTFKKINIPLVAV

Query:  ATFLTGLALAIEKKSRRIEISLYCLARGIESFFSCMTDLGYLPQSLNFKRADVIVFSISTSIIMHCYAQEREVFRSKYLNVLDWVFGVPPPPCETPRCKN
        ATFLTGLALAIEKKSRRIEISLYCL+RGIESFFSCMTDLGYLP SLNFKRADVIVFSISTSIIMHCYAQEREVFRSKYLNVLDWVFGVPPPPCETPRCKN
Subjt:  ATFLTGLALAIEKKSRRIEISLYCLARGIESFFSCMTDLGYLPQSLNFKRADVIVFSISTSIIMHCYAQEREVFRSKYLNVLDWVFGVPPPPCETPRCKN

Query:  SKQ
          +
Subjt:  SKQ

A0A1S3BAT4 uncharacterized protein LOC1034880606.2e-26192.46Show/hide
Query:  MSPSAGGGFDADGGCACLAQQNGDAEAAANSKSGDSYCEHC--GSADSSSLPAFSCSSSSLWLDSTRLRESGKLWRILVASAKGFTIGAGLKGGLSLFSV
        MSPSAGG FDA+GGCACLAQQNGDAE AAN KSGDSYCEHC  GSADSSS P+FSCSSSSLWLDSTRLRE GKLWRILVASAKGFTIGAGLKGGLSLFSV
Subjt:  MSPSAGGGFDADGGCACLAQQNGDAEAAANSKSGDSYCEHC--GSADSSSLPAFSCSSSSLWLDSTRLRESGKLWRILVASAKGFTIGAGLKGGLSLFSV

Query:  LAGLKRRKALASLGKKGVITNRDAISMALKETLRYGLFLGTFAGTFVSIDEIIGNLAGHRRTARWRALLAGALAGPSMLLTGLNTQHKTLAIYIFMRAAV
        LAGLKRRKALASLGKKGVITNRDAISMALKETLRYGLFLGTFAGTFVSIDEIIGNLAGHRRTARWRALLAGALAGPSMLLTGLNTQHKTLAIYIFMRAAV
Subjt:  LAGLKRRKALASLGKKGVITNRDAISMALKETLRYGLFLGTFAGTFVSIDEIIGNLAGHRRTARWRALLAGALAGPSMLLTGLNTQHKTLAIYIFMRAAV

Query:  LASRCGIKSKRLGHICKPLTWSYGDIFLMCVSSSQILSAYVLKQDSLPPSFRSFLNTHGGKDTVILEGLRSFVSGMPSSIKFKAIEKYYSAMGAGVKLDP
        LASRCGIKSKRLGHICKPLTWSYGDIFLMC+SSSQILSAYVLKQDSLPPSFRSFLNTHGGKDTVILEGL+SFVSGMPSS KFKAIEKYYSAMGA VKLDP
Subjt:  LASRCGIKSKRLGHICKPLTWSYGDIFLMCVSSSQILSAYVLKQDSLPPSFRSFLNTHGGKDTVILEGLRSFVSGMPSSIKFKAIEKYYSAMGAGVKLDP

Query:  QMKTPCT------------------GYKRALPVYLPVYLVPALIVHREGLMNGPYEILARGLLGTARSSLFLSAYCASAWMWTCLTARTFKKINIPLVAV
        QMKTPCT                  GYKRALPVYLPVYL+PALIVHREGLMN PYEILARGLLGTARSSLFLSAYCASAWMWTCLT+RTFKKINIPLVA+
Subjt:  QMKTPCT------------------GYKRALPVYLPVYLVPALIVHREGLMNGPYEILARGLLGTARSSLFLSAYCASAWMWTCLTARTFKKINIPLVAV

Query:  ATFLTGLALAIEKKSRRIEISLYCLARGIESFFSCMTDLGYLPQSLNFKRADVIVFSISTSIIMHCYAQEREVFRSKYLNVLDWVFGVPPPPCETPRCKN
        ATFLTGLALAIEKKSRRIEISLYCLARGIESFFSCMTDLGYLP SLNFKRADVIVFSISTSIIMHCYAQEREVFRSKYLNVLDWVFGVPPPPCETPRCKN
Subjt:  ATFLTGLALAIEKKSRRIEISLYCLARGIESFFSCMTDLGYLPQSLNFKRADVIVFSISTSIIMHCYAQEREVFRSKYLNVLDWVFGVPPPPCETPRCKN

Query:  SKQY
         K+Y
Subjt:  SKQY

A0A5A7VHD2 Uncharacterized protein6.2e-26192.46Show/hide
Query:  MSPSAGGGFDADGGCACLAQQNGDAEAAANSKSGDSYCEHC--GSADSSSLPAFSCSSSSLWLDSTRLRESGKLWRILVASAKGFTIGAGLKGGLSLFSV
        MSPSAGG FDA+GGCACLAQQNGDAE AAN KSGDSYCEHC  GSADSSS P+FSCSSSSLWLDSTRLRE GKLWRILVASAKGFTIGAGLKGGLSLFSV
Subjt:  MSPSAGGGFDADGGCACLAQQNGDAEAAANSKSGDSYCEHC--GSADSSSLPAFSCSSSSLWLDSTRLRESGKLWRILVASAKGFTIGAGLKGGLSLFSV

Query:  LAGLKRRKALASLGKKGVITNRDAISMALKETLRYGLFLGTFAGTFVSIDEIIGNLAGHRRTARWRALLAGALAGPSMLLTGLNTQHKTLAIYIFMRAAV
        LAGLKRRKALASLGKKGVITNRDAISMALKETLRYGLFLGTFAGTFVSIDEIIGNLAGHRRTARWRALLAGALAGPSMLLTGLNTQHKTLAIYIFMRAAV
Subjt:  LAGLKRRKALASLGKKGVITNRDAISMALKETLRYGLFLGTFAGTFVSIDEIIGNLAGHRRTARWRALLAGALAGPSMLLTGLNTQHKTLAIYIFMRAAV

Query:  LASRCGIKSKRLGHICKPLTWSYGDIFLMCVSSSQILSAYVLKQDSLPPSFRSFLNTHGGKDTVILEGLRSFVSGMPSSIKFKAIEKYYSAMGAGVKLDP
        LASRCGIKSKRLGHICKPLTWSYGDIFLMC+SSSQILSAYVLKQDSLPPSFRSFLNTHGGKDTVILEGL+SFVSGMPSS KFKAIEKYYSAMGA VKLDP
Subjt:  LASRCGIKSKRLGHICKPLTWSYGDIFLMCVSSSQILSAYVLKQDSLPPSFRSFLNTHGGKDTVILEGLRSFVSGMPSSIKFKAIEKYYSAMGAGVKLDP

Query:  QMKTPCT------------------GYKRALPVYLPVYLVPALIVHREGLMNGPYEILARGLLGTARSSLFLSAYCASAWMWTCLTARTFKKINIPLVAV
        QMKTPCT                  GYKRALPVYLPVYL+PALIVHREGLMN PYEILARGLLGTARSSLFLSAYCASAWMWTCLT+RTFKKINIPLVA+
Subjt:  QMKTPCT------------------GYKRALPVYLPVYLVPALIVHREGLMNGPYEILARGLLGTARSSLFLSAYCASAWMWTCLTARTFKKINIPLVAV

Query:  ATFLTGLALAIEKKSRRIEISLYCLARGIESFFSCMTDLGYLPQSLNFKRADVIVFSISTSIIMHCYAQEREVFRSKYLNVLDWVFGVPPPPCETPRCKN
        ATFLTGLALAIEKKSRRIEISLYCLARGIESFFSCMTDLGYLP SLNFKRADVIVFSISTSIIMHCYAQEREVFRSKYLNVLDWVFGVPPPPCETPRCKN
Subjt:  ATFLTGLALAIEKKSRRIEISLYCLARGIESFFSCMTDLGYLPQSLNFKRADVIVFSISTSIIMHCYAQEREVFRSKYLNVLDWVFGVPPPPCETPRCKN

Query:  SKQY
         K+Y
Subjt:  SKQY

A0A6J1BRD3 uncharacterized protein LOC111004892 isoform X11.0e-24787.9Show/hide
Query:  MSPSAGGGFDADGGCACLAQQNGDAEAAANSKSGDSYCEH--CGSADSSSLPAFSCSSSSLWLDSTRLRESGKLWRILVASAKGFTIGAGLKGGLSLFSV
        MSPSA  GFDADGGCAC+A+QNGD E+A N KSG+SYC+H  CGSADSSSLPAFSCSSSSLW DS RLRESGKLWRILVASAKGFTIGAGLKGGLSLFS+
Subjt:  MSPSAGGGFDADGGCACLAQQNGDAEAAANSKSGDSYCEH--CGSADSSSLPAFSCSSSSLWLDSTRLRESGKLWRILVASAKGFTIGAGLKGGLSLFSV

Query:  LAGLKRRKALASLGKKGVITNRDAISMALKETLRYGLFLGTFAGTFVSIDEIIGNLAGHRRTARWRALLAGALAGPSMLLTGLNTQHKTLAIYIFMRAAV
        LAGLKRRKALASL KKGVITNRDAISMALKETLRYGLFLGTFAGTFVSIDEIIGNLAGHRRTARWRALLAGALAGPSMLLTGLNTQHK+LAIYIFMRAAV
Subjt:  LAGLKRRKALASLGKKGVITNRDAISMALKETLRYGLFLGTFAGTFVSIDEIIGNLAGHRRTARWRALLAGALAGPSMLLTGLNTQHKTLAIYIFMRAAV

Query:  LASRCGIKSKRLGHICKPLTWSYGDIFLMCVSSSQILSAYVLKQDSLPPSFRSFLNTHGGKDTVILEGLRSFVSGMPSSIKFKAIEKYYSAMGAGVKLDP
        LASRCGIKSK+LGHICKPLTWSYGDIFLMC+SSSQILSAYVLKQDSLPPSFRSFLNTHGGKDTVILEGL+SFVSGMPSS KF+ IEKYY AMGA  KLD 
Subjt:  LASRCGIKSKRLGHICKPLTWSYGDIFLMCVSSSQILSAYVLKQDSLPPSFRSFLNTHGGKDTVILEGLRSFVSGMPSSIKFKAIEKYYSAMGAGVKLDP

Query:  QMKTPCT------------------GYKRALPVYLPVYLVPALIVHREGLMNGPYEILARGLLGTARSSLFLSAYCASAWMWTCLTARTFKKINIPLVAV
        QMKTPCT                  GYKRALPVYLPVYLVPALIVHR+ L+N P EILARGLLGTARSSLFLS YC+SAWMWTCLT+RTFKKINIPLVA+
Subjt:  QMKTPCT------------------GYKRALPVYLPVYLVPALIVHREGLMNGPYEILARGLLGTARSSLFLSAYCASAWMWTCLTARTFKKINIPLVAV

Query:  ATFLTGLALAIEKKSRRIEISLYCLARGIESFFSCMTDLGYLPQSLNFKRADVIVFSISTSIIMHCYAQEREVFRSKYLNVLDWVFGVPPPPCETPRCKN
        ATFLTGLALAIEKKSRRIEISLYCLARGIESFF C TD GYLPQSLNFKRADVIVFS+ST+IIMHCYAQEREVFRSKYLNVLDWVFGVPPPPCETPRC N
Subjt:  ATFLTGLALAIEKKSRRIEISLYCLARGIESFFSCMTDLGYLPQSLNFKRADVIVFSISTSIIMHCYAQEREVFRSKYLNVLDWVFGVPPPPCETPRCKN

Query:  SKQY
        SKQ+
Subjt:  SKQY

A0A6J1HFI3 uncharacterized protein LOC111462580 isoform X16.7e-24788.07Show/hide
Query:  MSPSAGGGFDADGGCACLAQQNGDAEAAANSKSGDSYCEH--CGSADSSSLPAFSCSSSSLWLDSTRLRESGKLWRILVASAKGFTIGAGLKGGLSLFSV
        MSPSA G F ADGGCACLA++NGD  A  N KS DSYC+H  CGSAD SSLP FSCSSSSLW DSTRL ESGKLWRILVASAKGFTIGAGLKGGLSLFSV
Subjt:  MSPSAGGGFDADGGCACLAQQNGDAEAAANSKSGDSYCEH--CGSADSSSLPAFSCSSSSLWLDSTRLRESGKLWRILVASAKGFTIGAGLKGGLSLFSV

Query:  LAGLKRRKALASLGKKGVITNRDAISMALKETLRYGLFLGTFAGTFVSIDEIIGNLAGHRRTARWRALLAGALAGPSMLLTGLNTQHKTLAIYIFMRAAV
        LAGLKRRKALASLGKKGVITNRDAISMALKETLRYGLFLGTFAGTFVSIDEIIGNLAGHRRTARWRAL+AGALAGPSMLLTGLNTQHKTLAIYIFMRAAV
Subjt:  LAGLKRRKALASLGKKGVITNRDAISMALKETLRYGLFLGTFAGTFVSIDEIIGNLAGHRRTARWRALLAGALAGPSMLLTGLNTQHKTLAIYIFMRAAV

Query:  LASRCGIKSKRLGHICKPLTWSYGDIFLMCVSSSQILSAYVLKQDSLPPSFRSFLNTHGGKDTVILEGLRSFVSGMPSSIKFKAIEKYYSAMGAGVKLDP
        LASRCGI+SKRLGHICKPLTWSYGDIFLMC+SSSQILSAYVLKQDSLPPSFRSFLNTHGGKDTVILEGL+SF+ GMPS+ KF AIEKYY   GA VKLDP
Subjt:  LASRCGIKSKRLGHICKPLTWSYGDIFLMCVSSSQILSAYVLKQDSLPPSFRSFLNTHGGKDTVILEGLRSFVSGMPSSIKFKAIEKYYSAMGAGVKLDP

Query:  QMKTPCT------------------GYKRALPVYLPVYLVPALIVHREGLMNGPYEILARGLLGTARSSLFLSAYCASAWMWTCLTARTFKKINIPLVAV
         MKTPCT                  GYKRALPVYLPVYL+PALIVHR+GLMN PYEILARGLLGTARSSLFLS YCASAW+WTCLTARTF+KIN+PLVAV
Subjt:  QMKTPCT------------------GYKRALPVYLPVYLVPALIVHREGLMNGPYEILARGLLGTARSSLFLSAYCASAWMWTCLTARTFKKINIPLVAV

Query:  ATFLTGLALAIEKKSRRIEISLYCLARGIESFFSCMTDLGYLPQSLNFKRADVIVFSISTSIIMHCYAQEREVFRSKYLNVLDWVFGVPPPPCETPRCKN
        ATFLTGLALAIEKKSRRIEISLYCLARGIESFFSCMTDLGYLPQSLNFKRADVIVFS+STSIIMHCYAQER VFRSKYLNVLDWVFGVPPPPCETPRCKN
Subjt:  ATFLTGLALAIEKKSRRIEISLYCLARGIESFFSCMTDLGYLPQSLNFKRADVIVFSISTSIIMHCYAQEREVFRSKYLNVLDWVFGVPPPPCETPRCKN

Query:  SKQ
         KQ
Subjt:  SKQ

SwissProt top hitse value%identityAlignment
Q95QD1 Transmembrane protein 135 homolog2.9e-0532.26Show/hide
Query:  IPLVAVATFLTGLALAIEKKSRRIEISLYCLARGIESFFSCMTDLGYLPQSLNFKRADVIVFSISTSIIMHCYAQEREVFRSKYLNVLDWVFG
        +P V    F  GLA         + I++YCL + IE+ +  + D GYLP+   FK  +VI+++I+T  ++     E    R  YLN L  + G
Subjt:  IPLVAVATFLTGLALAIEKKSRRIEISLYCLARGIESFFSCMTDLGYLPQSLNFKRADVIVFSISTSIIMHCYAQEREVFRSKYLNVLDWVFG

Arabidopsis top hitse value%identityAlignment
AT1G34630.1 BEST Arabidopsis thaliana protein match is: Mitochondrial import inner membrane translocase subunit Tim17/Tim22/Tim23 family protein (TAIR:AT5G51150.1)9.1e-16460.64Show/hide
Query:  MSPSAGGGFDADGGCACLAQQNGDAEAAANSKSGDSYCEHCGSADSSSLPAFSCSSSSLWLDSTRLRESGKLWRILVASAKGFTIGAGLKGGLSLFSVLA
        MSPS     ++  G  C +  + D +     KS +  C+ C ++DS S              S  + +S KL RI+VAS KGFTIG GLKGGL++FS++A
Subjt:  MSPSAGGGFDADGGCACLAQQNGDAEAAANSKSGDSYCEHCGSADSSSLPAFSCSSSSLWLDSTRLRESGKLWRILVASAKGFTIGAGLKGGLSLFSVLA

Query:  GLKRRKALASLGKK-GVITNRDAISMALKETLRYGLFLGTFAGTFVSIDEIIGNLAGHRRTARWRALLAGALAGPSMLLTGLNTQHKTLAIYIFMRAAVL
           RR+  +   +K G  +N +AI+M +KETLRYGLFLGTFAGTFVS+DE I  LAG +RTA+WRAL AG +AGPSMLLTG NTQH +LA+YI MRAAVL
Subjt:  GLKRRKALASLGKK-GVITNRDAISMALKETLRYGLFLGTFAGTFVSIDEIIGNLAGHRRTARWRALLAGALAGPSMLLTGLNTQHKTLAIYIFMRAAVL

Query:  ASRCGIKSKRLGHICKPLTWSYGDIFLMCVSSSQILSAYVLKQDSLPPSFRSFLNTHGGKDTVILEGLRSFVSGMPSSIKFKAIEKYYSAMGAGVKLDPQ
        ASRCGIKSKR G ICKPLTW +GD+FLMC+SSSQILSAY+LKQ+SLP S++SFLN  GGKD  IL+G++   +  P +   +AIEKYY ++G  +KLDP 
Subjt:  ASRCGIKSKRLGHICKPLTWSYGDIFLMCVSSSQILSAYVLKQDSLPPSFRSFLNTHGGKDTVILEGLRSFVSGMPSSIKFKAIEKYYSAMGAGVKLDPQ

Query:  MKTPCT------------------GYKRALPVYLPVYLVPALIVHREGLMNGPYEILARGLLGTARSSLFLSAYCASAWMWTCLTARTFKKINIPLVAVA
        MK PCT                   YKRALPVY+PVYL+PALIVHR+ L+   Y IL +GLLGTARSSLFL+ YC+SAW WTCL  RTF+  NIPLVA+A
Subjt:  MKTPCT------------------GYKRALPVYLPVYLVPALIVHREGLMNGPYEILARGLLGTARSSLFLSAYCASAWMWTCLTARTFKKINIPLVAVA

Query:  TFLTGLALAIEKKSRRIEISLYCLARGIESFFSCMTDLGYLPQSLNFKRADVIVFSISTSIIMHCYAQEREVFRSKYLNVLDWVFGVPPPP----CET
        TF TGLALAIEKKSRRIEISLYCLAR IESFF+CMT+ GY+    + +RADV+VFS+ST+IIMHCYAQER+VFRSKYLNVLDWVFGVPPPP    CET
Subjt:  TFLTGLALAIEKKSRRIEISLYCLARGIESFFSCMTDLGYLPQSLNFKRADVIVFSISTSIIMHCYAQEREVFRSKYLNVLDWVFGVPPPP----CET

AT1G34630.2 FUNCTIONS IN: molecular_function unknown1.7e-11756.19Show/hide
Query:  MSPSAGGGFDADGGCACLAQQNGDAEAAANSKSGDSYCEHCGSADSSSLPAFSCSSSSLWLDSTRLRESGKLWRILVASAKGFTIGAGLKGGLSLFSVLA
        MSPS     ++  G  C +  + D +     KS +  C+ C ++DS S              S  + +S KL RI+VAS KGFTIG GLKGGL++FS++A
Subjt:  MSPSAGGGFDADGGCACLAQQNGDAEAAANSKSGDSYCEHCGSADSSSLPAFSCSSSSLWLDSTRLRESGKLWRILVASAKGFTIGAGLKGGLSLFSVLA

Query:  GLKRRKALASLGKK-GVITNRDAISMALKETLRYGLFLGTFAGTFVSIDEIIGNLAGHRRTARWRALLAGALAGPSMLLTGLNTQHKTLAIYIFMRAAVL
           RR+  +   +K G  +N +AI+M +KETLRYGLFLGTFAGTFVS+DE I  LAG +RTA+WRAL AG +AGPSMLLTG NTQH +LA+YI MRAAVL
Subjt:  GLKRRKALASLGKK-GVITNRDAISMALKETLRYGLFLGTFAGTFVSIDEIIGNLAGHRRTARWRALLAGALAGPSMLLTGLNTQHKTLAIYIFMRAAVL

Query:  ASRCGIKSKRLGHICKPLTWSYGDIFLMCVSSSQILSAYVLKQDSLPPSFRSFLNTHGGKDTVILEGLRSFVSGMPSSIKFKAIEKYYSAMGAGVKLDPQ
        ASRCGIKSKR G ICKPLTW +GD+FLMC+SSSQILSAY+LKQ+SLP S++SFLN  GGKD  IL+G++   +  P +   +AIEKYY ++G  +KLDP 
Subjt:  ASRCGIKSKRLGHICKPLTWSYGDIFLMCVSSSQILSAYVLKQDSLPPSFRSFLNTHGGKDTVILEGLRSFVSGMPSSIKFKAIEKYYSAMGAGVKLDPQ

Query:  MKTPCT------------------GYKRALPVYLPVYLVPALIVHREGLMNGPYEILARGLLGTARSSLFLSAYCASAWMWTCLTARTFKKINIPLVAVA
        MK PCT                   YKRALPVY+PVYL+PALIVHR+ L+   Y IL +GLLGTARSSLFL+ YC+SAW WTCL  RTF+  NIPLVA+A
Subjt:  MKTPCT------------------GYKRALPVYLPVYLVPALIVHREGLMNGPYEILARGLLGTARSSLFLSAYCASAWMWTCLTARTFKKINIPLVAVA

Query:  TFLT
        T  T
Subjt:  TFLT

AT5G51150.1 Mitochondrial import inner membrane translocase subunit Tim17/Tim22/Tim23 family protein8.3e-2424.27Show/hide
Query:  KGFTIGAGLKGGLSLFSVLAGLKRRKALAS-LGKKGVITNRDAISMALKETLRYGLFLGTFAGTFVSIDEIIGNLAGHRRTARWRALLAGALAGPSMLLT
        + F +  G++ G+ +      L R ++ +S L  K +++ +D I    +E  R GL  G F G++ ++   +      ++     ++LAG++AG S+L  
Subjt:  KGFTIGAGLKGGLSLFSVLAGLKRRKALAS-LGKKGVITNRDAISMALKETLRYGLFLGTFAGTFVSIDEIIGNLAGHRRTARWRALLAGALAGPSMLLT

Query:  GLNTQHKTLAIYIFMRAAVLASRCGIKSKRLGHICKPLTWSYGDIFLMCVSSSQILSAYVLKQDSLPPSFRSFLNTHGGKDTVILEGLRSFVSGMPSSIK
          + Q +TLA+Y+  R    A     KSK   H+     W +GD  L  ++ +Q++ +++++ ++LP S+R F+   G     + + +R    G P  I 
Subjt:  GLNTQHKTLAIYIFMRAAVLASRCGIKSKRLGHICKPLTWSYGDIFLMCVSSSQILSAYVLKQDSLPPSFRSFLNTHGGKDTVILEGLRSFVSGMPSSIK

Query:  FKAIEKYYSAM--GAGVKLDPQMK-TPCTG-------------------YKRALPVYLPVYLVPALIVHREGLMNGPYEILARGLLGTARSSLFLSAYCA
          ++  Y S+    + VK++      PC                     +K+  P+Y  +  VP +++H +  M  PY      +  + RS+ FLSA+  
Subjt:  FKAIEKYYSAM--GAGVKLDPQMK-TPCTG-------------------YKRALPVYLPVYLVPALIVHREGLMNGPYEILARGLLGTARSSLFLSAYCA

Query:  SAWMWTCLTARTFKKINIPLVAVATFLTGLALAIEKKSRRIEISLYCLARGIESFFSCMTDLGYLPQSLNFKRADVIVF
            + C   +   K +  +   A     L++ +EKK RR E++LY L R  +S +  + +   LP   + K A+V +F
Subjt:  SAWMWTCLTARTFKKINIPLVAVATFLTGLALAIEKKSRRIEISLYCLARGIESFFSCMTDLGYLPQSLNFKRADVIVF


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCGCCGTCTGCTGGGGGCGGCTTCGACGCCGACGGCGGATGTGCATGCCTTGCTCAACAGAATGGGGATGCCGAAGCAGCTGCTAACTCTAAATCCGGTGATTCCTA
TTGCGAGCATTGTGGTTCAGCGGATTCATCTTCTTTGCCGGCGTTTTCCTGCTCTTCCTCTTCTCTATGGCTTGATTCGACGCGGTTGAGAGAGTCCGGGAAGCTATGGC
GGATCCTTGTTGCTTCTGCTAAAGGCTTCACAATTGGAGCTGGTCTCAAAGGTGGCCTCTCTCTCTTTTCGGTCCTCGCTGGATTGAAGCGGAGAAAGGCTTTGGCCTCC
CTCGGGAAGAAAGGAGTGATTACGAATCGAGATGCGATTTCCATGGCGTTGAAGGAGACTTTGAGATACGGTCTCTTTCTTGGAACCTTTGCAGGTACATTCGTTTCAAT
CGATGAGATAATTGGCAATCTAGCAGGCCACCGTAGGACTGCAAGATGGAGGGCACTATTAGCGGGAGCATTAGCTGGGCCGTCGATGCTTCTGACTGGGTTAAATACTC
AACATAAGACCTTGGCTATCTACATTTTTATGCGTGCTGCGGTCTTGGCTTCGCGCTGTGGGATTAAAAGCAAGCGGCTCGGGCATATTTGTAAGCCTCTCACGTGGTCA
TATGGTGATATCTTCCTTATGTGTGTCTCCTCTTCGCAGATCTTGTCTGCTTATGTGTTAAAGCAAGATAGCTTACCACCATCGTTTAGGTCCTTTCTCAATACACATGG
TGGAAAGGATACTGTAATCTTGGAAGGCTTAAGGAGTTTCGTATCAGGCATGCCTTCTTCTATTAAATTTAAAGCAATAGAGAAGTACTACAGTGCCATGGGTGCGGGCG
TCAAATTAGATCCGCAAATGAAGACTCCATGCACGGGATATAAAAGAGCGTTGCCAGTTTACCTCCCCGTTTATCTTGTCCCAGCCCTAATAGTTCATCGTGAAGGTCTC
ATGAATGGGCCATACGAAATTTTAGCGAGGGGGCTTCTTGGAACTGCTAGATCAAGTCTGTTCCTTTCTGCATATTGTGCATCTGCTTGGATGTGGACATGTCTGACCGC
AAGGACTTTCAAGAAAATAAACATACCGTTGGTTGCTGTGGCAACGTTCTTAACAGGTTTGGCACTGGCCATCGAGAAGAAAAGCAGGAGGATAGAAATCTCACTCTATT
GCCTTGCCCGAGGTATCGAGAGCTTCTTCAGCTGCATGACGGATTTAGGATACTTACCACAATCGTTAAATTTCAAGAGAGCGGATGTGATAGTTTTCAGCATATCAACT
TCCATTATAATGCATTGCTATGCTCAGGAAAGGGAGGTATTTCGATCTAAGTATTTGAACGTTCTCGATTGGGTTTTCGGTGTGCCTCCTCCCCCTTGTGAAACACCACG
CTGCAAGAATAGCAAGCAATATTGA
mRNA sequenceShow/hide mRNA sequence
ATGTCGCCGTCTGCTGGGGGCGGCTTCGACGCCGACGGCGGATGTGCATGCCTTGCTCAACAGAATGGGGATGCCGAAGCAGCTGCTAACTCTAAATCCGGTGATTCCTA
TTGCGAGCATTGTGGTTCAGCGGATTCATCTTCTTTGCCGGCGTTTTCCTGCTCTTCCTCTTCTCTATGGCTTGATTCGACGCGGTTGAGAGAGTCCGGGAAGCTATGGC
GGATCCTTGTTGCTTCTGCTAAAGGCTTCACAATTGGAGCTGGTCTCAAAGGTGGCCTCTCTCTCTTTTCGGTCCTCGCTGGATTGAAGCGGAGAAAGGCTTTGGCCTCC
CTCGGGAAGAAAGGAGTGATTACGAATCGAGATGCGATTTCCATGGCGTTGAAGGAGACTTTGAGATACGGTCTCTTTCTTGGAACCTTTGCAGGTACATTCGTTTCAAT
CGATGAGATAATTGGCAATCTAGCAGGCCACCGTAGGACTGCAAGATGGAGGGCACTATTAGCGGGAGCATTAGCTGGGCCGTCGATGCTTCTGACTGGGTTAAATACTC
AACATAAGACCTTGGCTATCTACATTTTTATGCGTGCTGCGGTCTTGGCTTCGCGCTGTGGGATTAAAAGCAAGCGGCTCGGGCATATTTGTAAGCCTCTCACGTGGTCA
TATGGTGATATCTTCCTTATGTGTGTCTCCTCTTCGCAGATCTTGTCTGCTTATGTGTTAAAGCAAGATAGCTTACCACCATCGTTTAGGTCCTTTCTCAATACACATGG
TGGAAAGGATACTGTAATCTTGGAAGGCTTAAGGAGTTTCGTATCAGGCATGCCTTCTTCTATTAAATTTAAAGCAATAGAGAAGTACTACAGTGCCATGGGTGCGGGCG
TCAAATTAGATCCGCAAATGAAGACTCCATGCACGGGATATAAAAGAGCGTTGCCAGTTTACCTCCCCGTTTATCTTGTCCCAGCCCTAATAGTTCATCGTGAAGGTCTC
ATGAATGGGCCATACGAAATTTTAGCGAGGGGGCTTCTTGGAACTGCTAGATCAAGTCTGTTCCTTTCTGCATATTGTGCATCTGCTTGGATGTGGACATGTCTGACCGC
AAGGACTTTCAAGAAAATAAACATACCGTTGGTTGCTGTGGCAACGTTCTTAACAGGTTTGGCACTGGCCATCGAGAAGAAAAGCAGGAGGATAGAAATCTCACTCTATT
GCCTTGCCCGAGGTATCGAGAGCTTCTTCAGCTGCATGACGGATTTAGGATACTTACCACAATCGTTAAATTTCAAGAGAGCGGATGTGATAGTTTTCAGCATATCAACT
TCCATTATAATGCATTGCTATGCTCAGGAAAGGGAGGTATTTCGATCTAAGTATTTGAACGTTCTCGATTGGGTTTTCGGTGTGCCTCCTCCCCCTTGTGAAACACCACG
CTGCAAGAATAGCAAGCAATATTGA
Protein sequenceShow/hide protein sequence
MSPSAGGGFDADGGCACLAQQNGDAEAAANSKSGDSYCEHCGSADSSSLPAFSCSSSSLWLDSTRLRESGKLWRILVASAKGFTIGAGLKGGLSLFSVLAGLKRRKALAS
LGKKGVITNRDAISMALKETLRYGLFLGTFAGTFVSIDEIIGNLAGHRRTARWRALLAGALAGPSMLLTGLNTQHKTLAIYIFMRAAVLASRCGIKSKRLGHICKPLTWS
YGDIFLMCVSSSQILSAYVLKQDSLPPSFRSFLNTHGGKDTVILEGLRSFVSGMPSSIKFKAIEKYYSAMGAGVKLDPQMKTPCTGYKRALPVYLPVYLVPALIVHREGL
MNGPYEILARGLLGTARSSLFLSAYCASAWMWTCLTARTFKKINIPLVAVATFLTGLALAIEKKSRRIEISLYCLARGIESFFSCMTDLGYLPQSLNFKRADVIVFSIST
SIIMHCYAQEREVFRSKYLNVLDWVFGVPPPPCETPRCKNSKQY