; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc08G11330 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc08G11330
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
DescriptionTMEM135_C_rich domain-containing protein
Genome locationClcChr08:22622869..22628036
RNA-Seq ExpressionClc08G11330
SyntenyClc08G11330
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR026749 - Transmembrane protein 135


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008444833.1 PREDICTED: uncharacterized protein LOC103488060 [Cucumis melo]6.4e-25385.82Show/hide
Query:  MSPSA--C----GGCACLAQQNGDAHVAANSKSGDSYCEHC--GSADSSSLPAFSCSSSSLWLDSTRLRESGKLWRILVASAKGFTIGAGLKGGLSLFSV
        MSPSA  C    GGCACLAQQNGDA  AAN KSGDSYCEHC  GSADSSS P+FSCSSSSLWLDSTRLRE GKLWRILVASAKGFTIGAGLKGGLSLFSV
Subjt:  MSPSA--C----GGCACLAQQNGDAHVAANSKSGDSYCEHC--GSADSSSLPAFSCSSSSLWLDSTRLRESGKLWRILVASAKGFTIGAGLKGGLSLFSV

Query:  LAGLKRRKALASLGKKGVITNRDAISMALKETLRYGLFLGTFAGTFVSIDEIIGNLAGHRRTARWRALLAGALAGPSMLLTGLNTQHKTLAIYIFMRAAV
        LAGLKRRKALASLGKKGVITNRDAISMALKETLRYGLFLGTFAGTFVSIDEIIGNLAGHRRTARWRALLAGALAGPSMLLTGLNTQHKTLAIYIFMRAAV
Subjt:  LAGLKRRKALASLGKKGVITNRDAISMALKETLRYGLFLGTFAGTFVSIDEIIGNLAGHRRTARWRALLAGALAGPSMLLTGLNTQHKTLAIYIFMRAAV

Query:  LASRCGIKSKRLGHICKPLTWSYGDIFLMCLSSSQILSAYVLKQDSLPPSFRSFLNTHGGKDTVILEGLKSFVSGMPSSNKFKAIEKYYSAMGADVKLDP
        LASRCGIKSKRLGHICKPLTWSYGDIFLMCLSSSQILSAYVLKQDSLPPSFRSFLNTHGGKDTVILEGLKSFVSGMPSS+KFKAIEKYYSAMGA VKLDP
Subjt:  LASRCGIKSKRLGHICKPLTWSYGDIFLMCLSSSQILSAYVLKQDSLPPSFRSFLNTHGGKDTVILEGLKSFVSGMPSSNKFKAIEKYYSAMGADVKLDP

Query:  QMKTPCTIIHGNQSCGGHFLSFLIQGYKRALPVYLPVYLIPALIVHREGLMNRPYEILARGLLGTARSSLFLSAYCASAWPLTSQKAFQSTSKLQFTFRS
        QMKTPCTIIHGNQSCGGHFLSFLIQGYKRALPVYLPVYLIPALIVHREGLMNRPYEILARGLLGTARSSLFLSAYCASAW  T       TS+       
Subjt:  QMKTPCTIIHGNQSCGGHFLSFLIQGYKRALPVYLPVYLIPALIVHREGLMNRPYEILARGLLGTARSSLFLSAYCASAWPLTSQKAFQSTSKLQFTFRS

Query:  SYQFRKKILASGSNRSSFPPTFQDMDMPDRKDFQENKHSIGCSSNGLALAIEKKSRRIEISLYCLARGIESFFSCMTDLGYLPPSLNFKRADVIVFSIST
                            TF+ +++P          ++     GLALAIEKKSRRIEISLYCLARGIESFFSCMTDLGYLPPSLNFKRADVIVFSIST
Subjt:  SYQFRKKILASGSNRSSFPPTFQDMDMPDRKDFQENKHSIGCSSNGLALAIEKKSRRIEISLYCLARGIESFFSCMTDLGYLPPSLNFKRADVIVFSIST

Query:  SIIMHCYAQEREVFRSKYLNVLDWVFGVPPPTCETPRCKNGKQ
        SIIMHCYAQEREVFRSKYLNVLDWVFGVPPP CETPRCKNGK+
Subjt:  SIIMHCYAQEREVFRSKYLNVLDWVFGVPPPTCETPRCKNGKQ

XP_011649669.1 uncharacterized protein LOC101202879 isoform X1 [Cucumis sativus]3.0e-25084.74Show/hide
Query:  MSPSA--C----GGCACLAQQNGDAHVAANSKSGDSYCEHC--GSADSSSLPAFSCSSSSLWLDSTRLRESGKLWRILVASAKGFTIGAGLKGGLSLFSV
        MSPSA  C    GGCACLAQQNGDA  AAN KSGDSYCEHC  GSADSSS P+FSCSSSSLWLDSTRLRE GKL RILVASAKGFTIGAGLKGGLSLFSV
Subjt:  MSPSA--C----GGCACLAQQNGDAHVAANSKSGDSYCEHC--GSADSSSLPAFSCSSSSLWLDSTRLRESGKLWRILVASAKGFTIGAGLKGGLSLFSV

Query:  LAGLKRRKALASLGKKGVITNRDAISMALKETLRYGLFLGTFAGTFVSIDEIIGNLAGHRRTARWRALLAGALAGPSMLLTGLNTQHKTLAIYIFMRAAV
        LAGLKRRKALASLGKKGVITNRDAISMALKETLRYGLFLGTFAGTFVSIDEIIGN+AGHRRTARWRALLAGALAGPSMLLTGLNTQHKTLAIYIFMRAAV
Subjt:  LAGLKRRKALASLGKKGVITNRDAISMALKETLRYGLFLGTFAGTFVSIDEIIGNLAGHRRTARWRALLAGALAGPSMLLTGLNTQHKTLAIYIFMRAAV

Query:  LASRCGIKSKRLGHICKPLTWSYGDIFLMCLSSSQILSAYVLKQDSLPPSFRSFLNTHGGKDTVILEGLKSFVSGMPSSNKFKAIEKYYSAMGADVKLDP
        LASRCGIKSKRLGHICKPLTWS GDIFLMCLSSSQILSAYVLKQDSLPPSFRSFLNTHGGKDTVILEGLKSFVSGMPSSNKFKA+EKYYSAMG+ VKLDP
Subjt:  LASRCGIKSKRLGHICKPLTWSYGDIFLMCLSSSQILSAYVLKQDSLPPSFRSFLNTHGGKDTVILEGLKSFVSGMPSSNKFKAIEKYYSAMGADVKLDP

Query:  QMKTPCTIIHGNQSCGGHFLSFLIQGYKRALPVYLPVYLIPALIVHREGLMNRPYEILARGLLGTARSSLFLSAYCASAWPLTSQKAFQSTSKLQFTFRS
        QMKTPCTIIHGNQSCGGHFLSFLIQGYKRALPVYLPVYLIPALIVHREGLMNRPYEILARGLLGTARSSLFLSAYCASAW  T       TS+       
Subjt:  QMKTPCTIIHGNQSCGGHFLSFLIQGYKRALPVYLPVYLIPALIVHREGLMNRPYEILARGLLGTARSSLFLSAYCASAWPLTSQKAFQSTSKLQFTFRS

Query:  SYQFRKKILASGSNRSSFPPTFQDMDMPDRKDFQENKHSIGCSSNGLALAIEKKSRRIEISLYCLARGIESFFSCMTDLGYLPPSLNFKRADVIVFSIST
                            TF+ +++P          ++     GLALAIEKKSRRIEISLYCL+RGIESFFSCMTDLGYLPPSLNFKRADVIVFSIST
Subjt:  SYQFRKKILASGSNRSSFPPTFQDMDMPDRKDFQENKHSIGCSSNGLALAIEKKSRRIEISLYCLARGIESFFSCMTDLGYLPPSLNFKRADVIVFSIST

Query:  SIIMHCYAQEREVFRSKYLNVLDWVFGVPPPTCETPRCKNGKQC
        SIIMHCYAQEREVFRSKYLNVLDWVFGVPPP CETPRCKNG +C
Subjt:  SIIMHCYAQEREVFRSKYLNVLDWVFGVPPPTCETPRCKNGKQC

XP_022961964.1 uncharacterized protein LOC111462580 isoform X1 [Cucurbita moschata]9.6e-24181.4Show/hide
Query:  MSPSAC------GGCACLAQQNGDAHVAANSKSGDSYCEH--CGSADSSSLPAFSCSSSSLWLDSTRLRESGKLWRILVASAKGFTIGAGLKGGLSLFSV
        MSPSA       GGCACLA++NGD     N KS DSYC+H  CGSAD SSLP FSCSSSSLW DSTRL ESGKLWRILVASAKGFTIGAGLKGGLSLFSV
Subjt:  MSPSAC------GGCACLAQQNGDAHVAANSKSGDSYCEH--CGSADSSSLPAFSCSSSSLWLDSTRLRESGKLWRILVASAKGFTIGAGLKGGLSLFSV

Query:  LAGLKRRKALASLGKKGVITNRDAISMALKETLRYGLFLGTFAGTFVSIDEIIGNLAGHRRTARWRALLAGALAGPSMLLTGLNTQHKTLAIYIFMRAAV
        LAGLKRRKALASLGKKGVITNRDAISMALKETLRYGLFLGTFAGTFVSIDEIIGNLAGHRRTARWRAL+AGALAGPSMLLTGLNTQHKTLAIYIFMRAAV
Subjt:  LAGLKRRKALASLGKKGVITNRDAISMALKETLRYGLFLGTFAGTFVSIDEIIGNLAGHRRTARWRALLAGALAGPSMLLTGLNTQHKTLAIYIFMRAAV

Query:  LASRCGIKSKRLGHICKPLTWSYGDIFLMCLSSSQILSAYVLKQDSLPPSFRSFLNTHGGKDTVILEGLKSFVSGMPSSNKFKAIEKYYSAMGADVKLDP
        LASRCGI+SKRLGHICKPLTWSYGDIFLMCLSSSQILSAYVLKQDSLPPSFRSFLNTHGGKDTVILEGLKSF+ GMP SNKF AIEKYY   GA VKLDP
Subjt:  LASRCGIKSKRLGHICKPLTWSYGDIFLMCLSSSQILSAYVLKQDSLPPSFRSFLNTHGGKDTVILEGLKSFVSGMPSSNKFKAIEKYYSAMGADVKLDP

Query:  QMKTPCTIIHGNQSCGGHFLSFLIQGYKRALPVYLPVYLIPALIVHREGLMNRPYEILARGLLGTARSSLFLSAYCASAWPLTSQKAFQSTSKLQFTFRS
         MKTPCTIIHGNQSCGGHFLSF+I+GYKRALPVYLPVYLIPALIVHR+GLMNRPYEILARGLLGTARSSLFLS YCASAW  T   A             
Subjt:  QMKTPCTIIHGNQSCGGHFLSFLIQGYKRALPVYLPVYLIPALIVHREGLMNRPYEILARGLLGTARSSLFLSAYCASAWPLTSQKAFQSTSKLQFTFRS

Query:  SYQFRKKILASGSNRSSFPPTFQDMDMPDRKDFQENKHSIGCSSNGLALAIEKKSRRIEISLYCLARGIESFFSCMTDLGYLPPSLNFKRADVIVFSIST
                            TFQ +++P          ++     GLALAIEKKSRRIEISLYCLARGIESFFSCMTDLGYLP SLNFKRADVIVFS+ST
Subjt:  SYQFRKKILASGSNRSSFPPTFQDMDMPDRKDFQENKHSIGCSSNGLALAIEKKSRRIEISLYCLARGIESFFSCMTDLGYLPPSLNFKRADVIVFSIST

Query:  SIIMHCYAQEREVFRSKYLNVLDWVFGVPPPTCETPRCKNGKQ
        SIIMHCYAQER VFRSKYLNVLDWVFGVPPP CETPRCKNGKQ
Subjt:  SIIMHCYAQEREVFRSKYLNVLDWVFGVPPPTCETPRCKNGKQ

XP_023545566.1 uncharacterized protein LOC111804956 isoform X1 [Cucurbita pepo subsp. pepo]4.9e-24582.14Show/hide
Query:  MSPSAC------GGCACLAQQNGDAHVAANSKSGDSYCEH--CGSADSSSLPAFSCSSSSLWLDSTRLRESGKLWRILVASAKGFTIGAGLKGGLSLFSV
        MSPSA       GGCACLA++NGD     N KSGDSYC+H  CGSAD SSLPAFSCSSSSLW DS RLRESGKLWRILVASAKGFTIGAGLKGGLSLFSV
Subjt:  MSPSAC------GGCACLAQQNGDAHVAANSKSGDSYCEH--CGSADSSSLPAFSCSSSSLWLDSTRLRESGKLWRILVASAKGFTIGAGLKGGLSLFSV

Query:  LAGLKRRKALASLGKKGVITNRDAISMALKETLRYGLFLGTFAGTFVSIDEIIGNLAGHRRTARWRALLAGALAGPSMLLTGLNTQHKTLAIYIFMRAAV
        LAGLKRRKALASLGKKGVITNRDA+SMALKETLRYGLFLGTFAGTFVSIDEIIGNLAGHRRTARWRAL+AGALAGPSMLLTGLNTQHKTLAIYIFMRAAV
Subjt:  LAGLKRRKALASLGKKGVITNRDAISMALKETLRYGLFLGTFAGTFVSIDEIIGNLAGHRRTARWRALLAGALAGPSMLLTGLNTQHKTLAIYIFMRAAV

Query:  LASRCGIKSKRLGHICKPLTWSYGDIFLMCLSSSQILSAYVLKQDSLPPSFRSFLNTHGGKDTVILEGLKSFVSGMPSSNKFKAIEKYYSAMGADVKLDP
        LASRCGI+SKRLGHICKPLTWSYGDIFLMCLSSSQILSAYVLKQDSLPPSFRSFLNTHGGKDTVILEGLKSF+ GMPS NKF AIEKYY   GA VKLDP
Subjt:  LASRCGIKSKRLGHICKPLTWSYGDIFLMCLSSSQILSAYVLKQDSLPPSFRSFLNTHGGKDTVILEGLKSFVSGMPSSNKFKAIEKYYSAMGADVKLDP

Query:  QMKTPCTIIHGNQSCGGHFLSFLIQGYKRALPVYLPVYLIPALIVHREGLMNRPYEILARGLLGTARSSLFLSAYCASAWPLTSQKAFQSTSKLQFTFRS
         MKTPCTIIHGNQSCGGHFLSF+IQGYKRALPVYLPVYLIPALIVHR+GLMNRPYEILARGLLGTARSSLFLS YCASAW  T   A             
Subjt:  QMKTPCTIIHGNQSCGGHFLSFLIQGYKRALPVYLPVYLIPALIVHREGLMNRPYEILARGLLGTARSSLFLSAYCASAWPLTSQKAFQSTSKLQFTFRS

Query:  SYQFRKKILASGSNRSSFPPTFQDMDMPDRKDFQENKHSIGCSSNGLALAIEKKSRRIEISLYCLARGIESFFSCMTDLGYLPPSLNFKRADVIVFSIST
                            TFQ +++P          ++     GLALAIEKKSRRIEISLYCLARGIESFFSCMTDLGYLPPSLNFKRADVIVFS+ST
Subjt:  SYQFRKKILASGSNRSSFPPTFQDMDMPDRKDFQENKHSIGCSSNGLALAIEKKSRRIEISLYCLARGIESFFSCMTDLGYLPPSLNFKRADVIVFSIST

Query:  SIIMHCYAQEREVFRSKYLNVLDWVFGVPPPTCETPRCKNGKQ
        SIIMHCYAQEREVFRSKYLNVLDWVFGVPPP CETPRCKNGKQ
Subjt:  SIIMHCYAQEREVFRSKYLNVLDWVFGVPPPTCETPRCKNGKQ

XP_038886157.1 uncharacterized protein LOC120076412 isoform X1 [Benincasa hispida]1.2e-24683.79Show/hide
Query:  MSPSAC------GGCACLAQQNGDAHVAANSKSGDSYCEH--CGSADSSSLPAFSCSSSSLWLDSTRLRESGKLWRILVASAKGFTIGAGLKGGLSLFSV
        MSPSA       GGC C A QNGDA   ANSKSGDSYCEH  CGSADSSS P+FSCSSSSLWLDSTRLRESGKLWRILVASAKGFTIGAGLKGGLSLFSV
Subjt:  MSPSAC------GGCACLAQQNGDAHVAANSKSGDSYCEH--CGSADSSSLPAFSCSSSSLWLDSTRLRESGKLWRILVASAKGFTIGAGLKGGLSLFSV

Query:  LAGLKRRKALASLGKKGVITNRDAISMALKETLRYGLFLGTFAGTFVSIDEIIGNLAGHRRTARWRALLAGALAGPSMLLTGLNTQHKTLAIYIFMRAAV
        LAGLKRRKALASLGKKGVITNRDAISMALKETLRYGLFLGTFAGTFVSIDEI+GNL GHRRTA WRALLAGALAGPSMLLTGLNTQHKTLAIYIFMRAAV
Subjt:  LAGLKRRKALASLGKKGVITNRDAISMALKETLRYGLFLGTFAGTFVSIDEIIGNLAGHRRTARWRALLAGALAGPSMLLTGLNTQHKTLAIYIFMRAAV

Query:  LASRCGIKSKRLGHICKPLTWSYGDIFLMCLSSSQILSAYVLKQDSLPPSFRSFLNTHGGKDTVILEGLKSFVSGMPSSNKFKAIEKYYSAMGADVKLDP
        LASRCGIKSKR GHICKPLTWSYGDIFLMCLSSSQILSAYVLKQDSLPPSFRSFLNTHGGKDTVILEGLKSFVSGMPSSNKFKAIEKYYSAMGA+V+L+ 
Subjt:  LASRCGIKSKRLGHICKPLTWSYGDIFLMCLSSSQILSAYVLKQDSLPPSFRSFLNTHGGKDTVILEGLKSFVSGMPSSNKFKAIEKYYSAMGADVKLDP

Query:  QMKTPCTIIHGNQSCGGHFLSFLIQGYKRALPVYLPVYLIPALIVHREGLMNRPYEILARGLLGTARSSLFLSAYCASAWPLTSQKAFQSTSKLQFTFRS
        QMKTPC IIHGNQSCGGHFLSFLI+GYKRALPVYLPVYLIPALIVHREGLMNRPYEILARGLLGTARSSLFLSAYCASAW  T             T RS
Subjt:  QMKTPCTIIHGNQSCGGHFLSFLIQGYKRALPVYLPVYLIPALIVHREGLMNRPYEILARGLLGTARSSLFLSAYCASAWPLTSQKAFQSTSKLQFTFRS

Query:  SYQFRKKILASGSNRSSFPPTFQDMDMPDRKDFQENKHSIGCSSNGLALAIEKKSRRIEISLYCLARGIESFFSCMTDLGYLPPSLNFKRADVIVFSIST
                             F+ +++P          ++     GLALAIEKKSRRIEISLYCLARGIESFFS MTDLGYLPPSLNFKRADVIVFSIST
Subjt:  SYQFRKKILASGSNRSSFPPTFQDMDMPDRKDFQENKHSIGCSSNGLALAIEKKSRRIEISLYCLARGIESFFSCMTDLGYLPPSLNFKRADVIVFSIST

Query:  SIIMHCYAQEREVFRSKYLNVLDWVFGVPPPTCETPRCKNGKQ
        SIIMHCYAQEREVFRSKYLNVLDWVFGVPPP CETPRCKNGKQ
Subjt:  SIIMHCYAQEREVFRSKYLNVLDWVFGVPPPTCETPRCKNGKQ

TrEMBL top hitse value%identityAlignment
A0A0A0LRQ1 Uncharacterized protein1.4e-25084.74Show/hide
Query:  MSPSA--C----GGCACLAQQNGDAHVAANSKSGDSYCEHC--GSADSSSLPAFSCSSSSLWLDSTRLRESGKLWRILVASAKGFTIGAGLKGGLSLFSV
        MSPSA  C    GGCACLAQQNGDA  AAN KSGDSYCEHC  GSADSSS P+FSCSSSSLWLDSTRLRE GKL RILVASAKGFTIGAGLKGGLSLFSV
Subjt:  MSPSA--C----GGCACLAQQNGDAHVAANSKSGDSYCEHC--GSADSSSLPAFSCSSSSLWLDSTRLRESGKLWRILVASAKGFTIGAGLKGGLSLFSV

Query:  LAGLKRRKALASLGKKGVITNRDAISMALKETLRYGLFLGTFAGTFVSIDEIIGNLAGHRRTARWRALLAGALAGPSMLLTGLNTQHKTLAIYIFMRAAV
        LAGLKRRKALASLGKKGVITNRDAISMALKETLRYGLFLGTFAGTFVSIDEIIGN+AGHRRTARWRALLAGALAGPSMLLTGLNTQHKTLAIYIFMRAAV
Subjt:  LAGLKRRKALASLGKKGVITNRDAISMALKETLRYGLFLGTFAGTFVSIDEIIGNLAGHRRTARWRALLAGALAGPSMLLTGLNTQHKTLAIYIFMRAAV

Query:  LASRCGIKSKRLGHICKPLTWSYGDIFLMCLSSSQILSAYVLKQDSLPPSFRSFLNTHGGKDTVILEGLKSFVSGMPSSNKFKAIEKYYSAMGADVKLDP
        LASRCGIKSKRLGHICKPLTWS GDIFLMCLSSSQILSAYVLKQDSLPPSFRSFLNTHGGKDTVILEGLKSFVSGMPSSNKFKA+EKYYSAMG+ VKLDP
Subjt:  LASRCGIKSKRLGHICKPLTWSYGDIFLMCLSSSQILSAYVLKQDSLPPSFRSFLNTHGGKDTVILEGLKSFVSGMPSSNKFKAIEKYYSAMGADVKLDP

Query:  QMKTPCTIIHGNQSCGGHFLSFLIQGYKRALPVYLPVYLIPALIVHREGLMNRPYEILARGLLGTARSSLFLSAYCASAWPLTSQKAFQSTSKLQFTFRS
        QMKTPCTIIHGNQSCGGHFLSFLIQGYKRALPVYLPVYLIPALIVHREGLMNRPYEILARGLLGTARSSLFLSAYCASAW  T       TS+       
Subjt:  QMKTPCTIIHGNQSCGGHFLSFLIQGYKRALPVYLPVYLIPALIVHREGLMNRPYEILARGLLGTARSSLFLSAYCASAWPLTSQKAFQSTSKLQFTFRS

Query:  SYQFRKKILASGSNRSSFPPTFQDMDMPDRKDFQENKHSIGCSSNGLALAIEKKSRRIEISLYCLARGIESFFSCMTDLGYLPPSLNFKRADVIVFSIST
                            TF+ +++P          ++     GLALAIEKKSRRIEISLYCL+RGIESFFSCMTDLGYLPPSLNFKRADVIVFSIST
Subjt:  SYQFRKKILASGSNRSSFPPTFQDMDMPDRKDFQENKHSIGCSSNGLALAIEKKSRRIEISLYCLARGIESFFSCMTDLGYLPPSLNFKRADVIVFSIST

Query:  SIIMHCYAQEREVFRSKYLNVLDWVFGVPPPTCETPRCKNGKQC
        SIIMHCYAQEREVFRSKYLNVLDWVFGVPPP CETPRCKNG +C
Subjt:  SIIMHCYAQEREVFRSKYLNVLDWVFGVPPPTCETPRCKNGKQC

A0A1S3BAT4 uncharacterized protein LOC1034880603.1e-25385.82Show/hide
Query:  MSPSA--C----GGCACLAQQNGDAHVAANSKSGDSYCEHC--GSADSSSLPAFSCSSSSLWLDSTRLRESGKLWRILVASAKGFTIGAGLKGGLSLFSV
        MSPSA  C    GGCACLAQQNGDA  AAN KSGDSYCEHC  GSADSSS P+FSCSSSSLWLDSTRLRE GKLWRILVASAKGFTIGAGLKGGLSLFSV
Subjt:  MSPSA--C----GGCACLAQQNGDAHVAANSKSGDSYCEHC--GSADSSSLPAFSCSSSSLWLDSTRLRESGKLWRILVASAKGFTIGAGLKGGLSLFSV

Query:  LAGLKRRKALASLGKKGVITNRDAISMALKETLRYGLFLGTFAGTFVSIDEIIGNLAGHRRTARWRALLAGALAGPSMLLTGLNTQHKTLAIYIFMRAAV
        LAGLKRRKALASLGKKGVITNRDAISMALKETLRYGLFLGTFAGTFVSIDEIIGNLAGHRRTARWRALLAGALAGPSMLLTGLNTQHKTLAIYIFMRAAV
Subjt:  LAGLKRRKALASLGKKGVITNRDAISMALKETLRYGLFLGTFAGTFVSIDEIIGNLAGHRRTARWRALLAGALAGPSMLLTGLNTQHKTLAIYIFMRAAV

Query:  LASRCGIKSKRLGHICKPLTWSYGDIFLMCLSSSQILSAYVLKQDSLPPSFRSFLNTHGGKDTVILEGLKSFVSGMPSSNKFKAIEKYYSAMGADVKLDP
        LASRCGIKSKRLGHICKPLTWSYGDIFLMCLSSSQILSAYVLKQDSLPPSFRSFLNTHGGKDTVILEGLKSFVSGMPSS+KFKAIEKYYSAMGA VKLDP
Subjt:  LASRCGIKSKRLGHICKPLTWSYGDIFLMCLSSSQILSAYVLKQDSLPPSFRSFLNTHGGKDTVILEGLKSFVSGMPSSNKFKAIEKYYSAMGADVKLDP

Query:  QMKTPCTIIHGNQSCGGHFLSFLIQGYKRALPVYLPVYLIPALIVHREGLMNRPYEILARGLLGTARSSLFLSAYCASAWPLTSQKAFQSTSKLQFTFRS
        QMKTPCTIIHGNQSCGGHFLSFLIQGYKRALPVYLPVYLIPALIVHREGLMNRPYEILARGLLGTARSSLFLSAYCASAW  T       TS+       
Subjt:  QMKTPCTIIHGNQSCGGHFLSFLIQGYKRALPVYLPVYLIPALIVHREGLMNRPYEILARGLLGTARSSLFLSAYCASAWPLTSQKAFQSTSKLQFTFRS

Query:  SYQFRKKILASGSNRSSFPPTFQDMDMPDRKDFQENKHSIGCSSNGLALAIEKKSRRIEISLYCLARGIESFFSCMTDLGYLPPSLNFKRADVIVFSIST
                            TF+ +++P          ++     GLALAIEKKSRRIEISLYCLARGIESFFSCMTDLGYLPPSLNFKRADVIVFSIST
Subjt:  SYQFRKKILASGSNRSSFPPTFQDMDMPDRKDFQENKHSIGCSSNGLALAIEKKSRRIEISLYCLARGIESFFSCMTDLGYLPPSLNFKRADVIVFSIST

Query:  SIIMHCYAQEREVFRSKYLNVLDWVFGVPPPTCETPRCKNGKQ
        SIIMHCYAQEREVFRSKYLNVLDWVFGVPPP CETPRCKNGK+
Subjt:  SIIMHCYAQEREVFRSKYLNVLDWVFGVPPPTCETPRCKNGKQ

A0A5A7VHD2 Uncharacterized protein3.1e-25385.82Show/hide
Query:  MSPSA--C----GGCACLAQQNGDAHVAANSKSGDSYCEHC--GSADSSSLPAFSCSSSSLWLDSTRLRESGKLWRILVASAKGFTIGAGLKGGLSLFSV
        MSPSA  C    GGCACLAQQNGDA  AAN KSGDSYCEHC  GSADSSS P+FSCSSSSLWLDSTRLRE GKLWRILVASAKGFTIGAGLKGGLSLFSV
Subjt:  MSPSA--C----GGCACLAQQNGDAHVAANSKSGDSYCEHC--GSADSSSLPAFSCSSSSLWLDSTRLRESGKLWRILVASAKGFTIGAGLKGGLSLFSV

Query:  LAGLKRRKALASLGKKGVITNRDAISMALKETLRYGLFLGTFAGTFVSIDEIIGNLAGHRRTARWRALLAGALAGPSMLLTGLNTQHKTLAIYIFMRAAV
        LAGLKRRKALASLGKKGVITNRDAISMALKETLRYGLFLGTFAGTFVSIDEIIGNLAGHRRTARWRALLAGALAGPSMLLTGLNTQHKTLAIYIFMRAAV
Subjt:  LAGLKRRKALASLGKKGVITNRDAISMALKETLRYGLFLGTFAGTFVSIDEIIGNLAGHRRTARWRALLAGALAGPSMLLTGLNTQHKTLAIYIFMRAAV

Query:  LASRCGIKSKRLGHICKPLTWSYGDIFLMCLSSSQILSAYVLKQDSLPPSFRSFLNTHGGKDTVILEGLKSFVSGMPSSNKFKAIEKYYSAMGADVKLDP
        LASRCGIKSKRLGHICKPLTWSYGDIFLMCLSSSQILSAYVLKQDSLPPSFRSFLNTHGGKDTVILEGLKSFVSGMPSS+KFKAIEKYYSAMGA VKLDP
Subjt:  LASRCGIKSKRLGHICKPLTWSYGDIFLMCLSSSQILSAYVLKQDSLPPSFRSFLNTHGGKDTVILEGLKSFVSGMPSSNKFKAIEKYYSAMGADVKLDP

Query:  QMKTPCTIIHGNQSCGGHFLSFLIQGYKRALPVYLPVYLIPALIVHREGLMNRPYEILARGLLGTARSSLFLSAYCASAWPLTSQKAFQSTSKLQFTFRS
        QMKTPCTIIHGNQSCGGHFLSFLIQGYKRALPVYLPVYLIPALIVHREGLMNRPYEILARGLLGTARSSLFLSAYCASAW  T       TS+       
Subjt:  QMKTPCTIIHGNQSCGGHFLSFLIQGYKRALPVYLPVYLIPALIVHREGLMNRPYEILARGLLGTARSSLFLSAYCASAWPLTSQKAFQSTSKLQFTFRS

Query:  SYQFRKKILASGSNRSSFPPTFQDMDMPDRKDFQENKHSIGCSSNGLALAIEKKSRRIEISLYCLARGIESFFSCMTDLGYLPPSLNFKRADVIVFSIST
                            TF+ +++P          ++     GLALAIEKKSRRIEISLYCLARGIESFFSCMTDLGYLPPSLNFKRADVIVFSIST
Subjt:  SYQFRKKILASGSNRSSFPPTFQDMDMPDRKDFQENKHSIGCSSNGLALAIEKKSRRIEISLYCLARGIESFFSCMTDLGYLPPSLNFKRADVIVFSIST

Query:  SIIMHCYAQEREVFRSKYLNVLDWVFGVPPPTCETPRCKNGKQ
        SIIMHCYAQEREVFRSKYLNVLDWVFGVPPP CETPRCKNGK+
Subjt:  SIIMHCYAQEREVFRSKYLNVLDWVFGVPPPTCETPRCKNGKQ

A0A6J1BRD3 uncharacterized protein LOC111004892 isoform X14.3e-23980.66Show/hide
Query:  MSPSAC------GGCACLAQQNGDAHVAANSKSGDSYCEH--CGSADSSSLPAFSCSSSSLWLDSTRLRESGKLWRILVASAKGFTIGAGLKGGLSLFSV
        MSPSA       GGCAC+A+QNGD   A N KSG+SYC+H  CGSADSSSLPAFSCSSSSLW DS RLRESGKLWRILVASAKGFTIGAGLKGGLSLFS+
Subjt:  MSPSAC------GGCACLAQQNGDAHVAANSKSGDSYCEH--CGSADSSSLPAFSCSSSSLWLDSTRLRESGKLWRILVASAKGFTIGAGLKGGLSLFSV

Query:  LAGLKRRKALASLGKKGVITNRDAISMALKETLRYGLFLGTFAGTFVSIDEIIGNLAGHRRTARWRALLAGALAGPSMLLTGLNTQHKTLAIYIFMRAAV
        LAGLKRRKALASL KKGVITNRDAISMALKETLRYGLFLGTFAGTFVSIDEIIGNLAGHRRTARWRALLAGALAGPSMLLTGLNTQHK+LAIYIFMRAAV
Subjt:  LAGLKRRKALASLGKKGVITNRDAISMALKETLRYGLFLGTFAGTFVSIDEIIGNLAGHRRTARWRALLAGALAGPSMLLTGLNTQHKTLAIYIFMRAAV

Query:  LASRCGIKSKRLGHICKPLTWSYGDIFLMCLSSSQILSAYVLKQDSLPPSFRSFLNTHGGKDTVILEGLKSFVSGMPSSNKFKAIEKYYSAMGADVKLDP
        LASRCGIKSK+LGHICKPLTWSYGDIFLMCLSSSQILSAYVLKQDSLPPSFRSFLNTHGGKDTVILEGLKSFVSGMPSSNKF+ IEKYY AMGAD KLD 
Subjt:  LASRCGIKSKRLGHICKPLTWSYGDIFLMCLSSSQILSAYVLKQDSLPPSFRSFLNTHGGKDTVILEGLKSFVSGMPSSNKFKAIEKYYSAMGADVKLDP

Query:  QMKTPCTIIHGNQSCGGHFLSFLIQGYKRALPVYLPVYLIPALIVHREGLMNRPYEILARGLLGTARSSLFLSAYCASAWPLTSQKAFQSTSKLQFTFRS
        QMKTPCTIIHGNQSCGGHF+SFLIQGYKRALPVYLPVYL+PALIVHR+ L+NRP EILARGLLGTARSSLFLS YC+SAW  T       TS+       
Subjt:  QMKTPCTIIHGNQSCGGHFLSFLIQGYKRALPVYLPVYLIPALIVHREGLMNRPYEILARGLLGTARSSLFLSAYCASAWPLTSQKAFQSTSKLQFTFRS

Query:  SYQFRKKILASGSNRSSFPPTFQDMDMPDRKDFQENKHSIGCSSNGLALAIEKKSRRIEISLYCLARGIESFFSCMTDLGYLPPSLNFKRADVIVFSIST
                            TF+ +++P          ++     GLALAIEKKSRRIEISLYCLARGIESFF C TD GYLP SLNFKRADVIVFS+ST
Subjt:  SYQFRKKILASGSNRSSFPPTFQDMDMPDRKDFQENKHSIGCSSNGLALAIEKKSRRIEISLYCLARGIESFFSCMTDLGYLPPSLNFKRADVIVFSIST

Query:  SIIMHCYAQEREVFRSKYLNVLDWVFGVPPPTCETPRCKNGKQ
        +IIMHCYAQEREVFRSKYLNVLDWVFGVPPP CETPRC N KQ
Subjt:  SIIMHCYAQEREVFRSKYLNVLDWVFGVPPPTCETPRCKNGKQ

A0A6J1HFI3 uncharacterized protein LOC111462580 isoform X14.6e-24181.4Show/hide
Query:  MSPSAC------GGCACLAQQNGDAHVAANSKSGDSYCEH--CGSADSSSLPAFSCSSSSLWLDSTRLRESGKLWRILVASAKGFTIGAGLKGGLSLFSV
        MSPSA       GGCACLA++NGD     N KS DSYC+H  CGSAD SSLP FSCSSSSLW DSTRL ESGKLWRILVASAKGFTIGAGLKGGLSLFSV
Subjt:  MSPSAC------GGCACLAQQNGDAHVAANSKSGDSYCEH--CGSADSSSLPAFSCSSSSLWLDSTRLRESGKLWRILVASAKGFTIGAGLKGGLSLFSV

Query:  LAGLKRRKALASLGKKGVITNRDAISMALKETLRYGLFLGTFAGTFVSIDEIIGNLAGHRRTARWRALLAGALAGPSMLLTGLNTQHKTLAIYIFMRAAV
        LAGLKRRKALASLGKKGVITNRDAISMALKETLRYGLFLGTFAGTFVSIDEIIGNLAGHRRTARWRAL+AGALAGPSMLLTGLNTQHKTLAIYIFMRAAV
Subjt:  LAGLKRRKALASLGKKGVITNRDAISMALKETLRYGLFLGTFAGTFVSIDEIIGNLAGHRRTARWRALLAGALAGPSMLLTGLNTQHKTLAIYIFMRAAV

Query:  LASRCGIKSKRLGHICKPLTWSYGDIFLMCLSSSQILSAYVLKQDSLPPSFRSFLNTHGGKDTVILEGLKSFVSGMPSSNKFKAIEKYYSAMGADVKLDP
        LASRCGI+SKRLGHICKPLTWSYGDIFLMCLSSSQILSAYVLKQDSLPPSFRSFLNTHGGKDTVILEGLKSF+ GMP SNKF AIEKYY   GA VKLDP
Subjt:  LASRCGIKSKRLGHICKPLTWSYGDIFLMCLSSSQILSAYVLKQDSLPPSFRSFLNTHGGKDTVILEGLKSFVSGMPSSNKFKAIEKYYSAMGADVKLDP

Query:  QMKTPCTIIHGNQSCGGHFLSFLIQGYKRALPVYLPVYLIPALIVHREGLMNRPYEILARGLLGTARSSLFLSAYCASAWPLTSQKAFQSTSKLQFTFRS
         MKTPCTIIHGNQSCGGHFLSF+I+GYKRALPVYLPVYLIPALIVHR+GLMNRPYEILARGLLGTARSSLFLS YCASAW  T   A             
Subjt:  QMKTPCTIIHGNQSCGGHFLSFLIQGYKRALPVYLPVYLIPALIVHREGLMNRPYEILARGLLGTARSSLFLSAYCASAWPLTSQKAFQSTSKLQFTFRS

Query:  SYQFRKKILASGSNRSSFPPTFQDMDMPDRKDFQENKHSIGCSSNGLALAIEKKSRRIEISLYCLARGIESFFSCMTDLGYLPPSLNFKRADVIVFSIST
                            TFQ +++P          ++     GLALAIEKKSRRIEISLYCLARGIESFFSCMTDLGYLP SLNFKRADVIVFS+ST
Subjt:  SYQFRKKILASGSNRSSFPPTFQDMDMPDRKDFQENKHSIGCSSNGLALAIEKKSRRIEISLYCLARGIESFFSCMTDLGYLPPSLNFKRADVIVFSIST

Query:  SIIMHCYAQEREVFRSKYLNVLDWVFGVPPPTCETPRCKNGKQ
        SIIMHCYAQER VFRSKYLNVLDWVFGVPPP CETPRCKNGKQ
Subjt:  SIIMHCYAQEREVFRSKYLNVLDWVFGVPPPTCETPRCKNGKQ

SwissProt top hitse value%identityAlignment
Q95QD1 Transmembrane protein 135 homolog9.3e-0533.33Show/hide
Query:  SNGLALAIEKKSRRIEISLYCLARGIESFFSCMTDLGYLPPSLNFKRADVIVFSISTSIIMHCYAQEREVFRSKYLNVLDWVFG
        S GLA         + I++YCL + IE+ +  + D GYLP    FK  +VI+++I+T  ++     E    R  YLN L  + G
Subjt:  SNGLALAIEKKSRRIEISLYCLARGIESFFSCMTDLGYLPPSLNFKRADVIVFSISTSIIMHCYAQEREVFRSKYLNVLDWVFG

Arabidopsis top hitse value%identityAlignment
AT1G34630.1 BEST Arabidopsis thaliana protein match is: Mitochondrial import inner membrane translocase subunit Tim17/Tim22/Tim23 family protein (TAIR:AT5G51150.1)4.2e-16257.49Show/hide
Query:  MSPS--ACGGCACLAQQNGDAHVAANSKSGDSYCEHCGSADSSSLPAFSCSSSSLWLDSTRLRESGKLWRILVASAKGFTIGAGLKGGLSLFSVLAGLKR
        MSPS  +  G  C +  + D       KS +  C+ C ++DS S              S  + +S KL RI+VAS KGFTIG GLKGGL++FS++A   R
Subjt:  MSPS--ACGGCACLAQQNGDAHVAANSKSGDSYCEHCGSADSSSLPAFSCSSSSLWLDSTRLRESGKLWRILVASAKGFTIGAGLKGGLSLFSVLAGLKR

Query:  RKALASLGKK-GVITNRDAISMALKETLRYGLFLGTFAGTFVSIDEIIGNLAGHRRTARWRALLAGALAGPSMLLTGLNTQHKTLAIYIFMRAAVLASRC
        R+  +   +K G  +N +AI+M +KETLRYGLFLGTFAGTFVS+DE I  LAG +RTA+WRAL AG +AGPSMLLTG NTQH +LA+YI MRAAVLASRC
Subjt:  RKALASLGKK-GVITNRDAISMALKETLRYGLFLGTFAGTFVSIDEIIGNLAGHRRTARWRALLAGALAGPSMLLTGLNTQHKTLAIYIFMRAAVLASRC

Query:  GIKSKRLGHICKPLTWSYGDIFLMCLSSSQILSAYVLKQDSLPPSFRSFLNTHGGKDTVILEGLKSFVSGMPSSNKFKAIEKYYSAMGADVKLDPQMKTP
        GIKSKR G ICKPLTW +GD+FLMCLSSSQILSAY+LKQ+SLP S++SFLN  GGKD  IL+G+K   +  P +N  +AIEKYY ++G D+KLDP MK P
Subjt:  GIKSKRLGHICKPLTWSYGDIFLMCLSSSQILSAYVLKQDSLPPSFRSFLNTHGGKDTVILEGLKSFVSGMPSSNKFKAIEKYYSAMGADVKLDPQMKTP

Query:  CTIIHGNQSCGGHFLSFLIQGYKRALPVYLPVYLIPALIVHREGLMNRPYEILARGLLGTARSSLFLSAYCASAWPLTSQKAFQSTSKLQFTFRSSYQFR
        CTIIHGN+SC  H ++F +Q YKRALPVY+PVYLIPALIVHR+ L+ + Y IL +GLLGTARSSLFL+ YC+SAW  T              FR      
Subjt:  CTIIHGNQSCGGHFLSFLIQGYKRALPVYLPVYLIPALIVHREGLMNRPYEILARGLLGTARSSLFLSAYCASAWPLTSQKAFQSTSKLQFTFRSSYQFR

Query:  KKILASGSNRSSFPPTFQDMDMPDRKDFQENKHSIGCSSNGLALAIEKKSRRIEISLYCLARGIESFFSCMTDLGYLPPSLNFKRADVIVFSISTSIIMH
                       TF+  ++P          +I     GLALAIEKKSRRIEISLYCLAR IESFF+CMT+ GY+ P  + +RADV+VFS+ST+IIMH
Subjt:  KKILASGSNRSSFPPTFQDMDMPDRKDFQENKHSIGCSSNGLALAIEKKSRRIEISLYCLARGIESFFSCMTDLGYLPPSLNFKRADVIVFSISTSIIMH

Query:  CYAQEREVFRSKYLNVLDWVFGVPP----PTCET
        CYAQER+VFRSKYLNVLDWVFGVPP    P+CET
Subjt:  CYAQEREVFRSKYLNVLDWVFGVPP----PTCET

AT1G34630.2 FUNCTIONS IN: molecular_function unknown1.2e-12160.05Show/hide
Query:  MSPS--ACGGCACLAQQNGDAHVAANSKSGDSYCEHCGSADSSSLPAFSCSSSSLWLDSTRLRESGKLWRILVASAKGFTIGAGLKGGLSLFSVLAGLKR
        MSPS  +  G  C +  + D       KS +  C+ C ++DS S              S  + +S KL RI+VAS KGFTIG GLKGGL++FS++A   R
Subjt:  MSPS--ACGGCACLAQQNGDAHVAANSKSGDSYCEHCGSADSSSLPAFSCSSSSLWLDSTRLRESGKLWRILVASAKGFTIGAGLKGGLSLFSVLAGLKR

Query:  RKALASLGKK-GVITNRDAISMALKETLRYGLFLGTFAGTFVSIDEIIGNLAGHRRTARWRALLAGALAGPSMLLTGLNTQHKTLAIYIFMRAAVLASRC
        R+  +   +K G  +N +AI+M +KETLRYGLFLGTFAGTFVS+DE I  LAG +RTA+WRAL AG +AGPSMLLTG NTQH +LA+YI MRAAVLASRC
Subjt:  RKALASLGKK-GVITNRDAISMALKETLRYGLFLGTFAGTFVSIDEIIGNLAGHRRTARWRALLAGALAGPSMLLTGLNTQHKTLAIYIFMRAAVLASRC

Query:  GIKSKRLGHICKPLTWSYGDIFLMCLSSSQILSAYVLKQDSLPPSFRSFLNTHGGKDTVILEGLKSFVSGMPSSNKFKAIEKYYSAMGADVKLDPQMKTP
        GIKSKR G ICKPLTW +GD+FLMCLSSSQILSAY+LKQ+SLP S++SFLN  GGKD  IL+G+K   +  P +N  +AIEKYY ++G D+KLDP MK P
Subjt:  GIKSKRLGHICKPLTWSYGDIFLMCLSSSQILSAYVLKQDSLPPSFRSFLNTHGGKDTVILEGLKSFVSGMPSSNKFKAIEKYYSAMGADVKLDPQMKTP

Query:  CTIIHGNQSCGGHFLSFLIQGYKRALPVYLPVYLIPALIVHREGLMNRPYEILARGLLGTARSSLFLSAYCASAWPLT
        CTIIHGN+SC  H ++F +Q YKRALPVY+PVYLIPALIVHR+ L+ + Y IL +GLLGTARSSLFL+ YC+SAW  T
Subjt:  CTIIHGNQSCGGHFLSFLIQGYKRALPVYLPVYLIPALIVHREGLMNRPYEILARGLLGTARSSLFLSAYCASAWPLT

AT5G51150.1 Mitochondrial import inner membrane translocase subunit Tim17/Tim22/Tim23 family protein2.7e-2323.28Show/hide
Query:  KGFTIGAGLKGGLSLFSVLAGLKRRKALAS-LGKKGVITNRDAISMALKETLRYGLFLGTFAGTFVSIDEIIGNLAGHRRTARWRALLAGALAGPSMLLT
        + F +  G++ G+ +      L R ++ +S L  K +++ +D I    +E  R GL  G F G++ ++   +      ++     ++LAG++AG S+L  
Subjt:  KGFTIGAGLKGGLSLFSVLAGLKRRKALAS-LGKKGVITNRDAISMALKETLRYGLFLGTFAGTFVSIDEIIGNLAGHRRTARWRALLAGALAGPSMLLT

Query:  GLNTQHKTLAIYIFMRAAVLASRCGIKSKRLGHICKPLTWSYGDIFLMCLSSSQILSAYVLKQDSLPPSFRSFLNTHGGKDTVILEGLKSFVSGMPSSNK
          + Q +TLA+Y+  R    A     KSK   H+     W +GD  L  L+ +Q++ +++++ ++LP S+R F+   G     + + ++    G P    
Subjt:  GLNTQHKTLAIYIFMRAAVLASRCGIKSKRLGHICKPLTWSYGDIFLMCLSSSQILSAYVLKQDSLPPSFRSFLNTHGGKDTVILEGLKSFVSGMPSSNK

Query:  FKAIEKYYSAM--GADVKLDPQMK-TPCTIIHGN-QSCGGHFLSFLIQGYKRALPVYLPVYLIPALIVHREGLMNRPYEILARGLLGTARSSLFLSAYCA
          ++  Y S+    +DVK++      PC  IH N  SC     + +   +K+  P+Y  +  +P +++H +  M  PY      +  + RS+ FLSA+  
Subjt:  FKAIEKYYSAM--GADVKLDPQMK-TPCTIIHGN-QSCGGHFLSFLIQGYKRALPVYLPVYLIPALIVHREGLMNRPYEILARGLLGTARSSLFLSAYCA

Query:  --SAWPLTSQKAFQSTSKLQFTFRSSYQFRKKILASGSNRSSFPPTFQDMDMPDRKDFQENKHSIGCSSNGLALAIEKKSRRIEISLYCLARGIESFFSC
           A+    +K      KL + F                                             +  L++ +EKK RR E++LY L R  +S +  
Subjt:  --SAWPLTSQKAFQSTSKLQFTFRSSYQFRKKILASGSNRSSFPPTFQDMDMPDRKDFQENKHSIGCSSNGLALAIEKKSRRIEISLYCLARGIESFFSC

Query:  MTDLGYLPPSLNFKRADVIVF
        + +   LP   + K A+V +F
Subjt:  MTDLGYLPPSLNFKRADVIVF


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCGCCGTCTGCTTGCGGCGGATGTGCATGCCTTGCTCAACAGAATGGGGATGCCCATGTAGCTGCTAACTCTAAATCCGGTGATTCCTATTGCGAGCATTGTGGTTC
AGCGGATTCATCTTCCTTGCCCGCGTTTTCCTGCTCTTCCTCTTCTCTATGGCTTGATTCGACGCGGTTGAGAGAGTCCGGGAAGCTATGGCGGATCCTTGTTGCTTCTG
CTAAAGGCTTCACAATTGGAGCTGGTCTCAAAGGTGGCCTCTCCCTCTTCTCAGTCCTCGCTGGACTGAAGCGGAGAAAGGCTTTGGCCTCCCTCGGGAAGAAAGGAGTG
ATTACGAATCGAGATGCGATTTCCATGGCTTTGAAGGAGACTTTGAGATACGGTCTCTTTCTTGGAACCTTTGCTGGTACATTCGTTTCAATCGACGAGATAATTGGCAA
TCTAGCAGGCCATCGTAGGACCGCAAGATGGAGGGCACTATTAGCGGGAGCATTAGCTGGGCCGTCGATGCTTCTGACTGGGTTAAATACGCAACATAAGACCTTGGCTA
TCTACATTTTTATGCGTGCTGCGGTCTTGGCTTCGCGCTGTGGGATTAAAAGCAAGCGGCTCGGGCATATTTGTAAGCCTCTCACGTGGTCATATGGTGACATCTTCCTT
ATGTGTCTCTCCTCTTCGCAGATCTTGTCTGCTTATGTGTTAAAGCAAGATAGCTTACCACCATCATTTAGGTCCTTTCTCAATACACATGGTGGAAAGGATACCGTAAT
CTTGGAAGGTTTAAAGAGTTTCGTATCAGGCATGCCTTCTTCTAATAAATTTAAAGCAATAGAGAAGTACTACAGTGCCATGGGTGCGGACGTCAAATTAGATCCGCAAA
TGAAGACTCCATGCACGATTATACATGGAAATCAATCATGTGGTGGCCATTTTCTTTCCTTTCTCATTCAAGGATATAAAAGAGCGTTGCCAGTTTACCTCCCCGTATAT
CTTATTCCAGCCCTAATAGTTCATCGTGAAGGTCTCATGAATAGGCCATACGAAATCTTAGCAAGGGGGCTTCTTGGAACTGCTAGATCAAGTCTGTTCCTTTCTGCATA
TTGTGCATCGGCTTGGCCTTTGACCTCTCAAAAAGCTTTTCAATCCACTTCAAAGCTTCAATTCACCTTTAGGAGTTCTTATCAGTTCAGAAAAAAAATATTAGCCTCTG
GCTCTAATAGATCAAGTTTTCCACCTACCTTTCAGGATATGGACATGCCTGACCGCAAGGACTTTCAAGAAAATAAACATTCCATTGGTTGCTCTAGCAACGGTTTGGCA
CTGGCCATTGAGAAGAAAAGCAGAAGGATAGAAATCTCACTCTATTGCCTCGCGAGAGGTATCGAGAGCTTCTTCAGCTGCATGACGGATTTGGGATATTTGCCGCCATC
GTTGAATTTCAAAAGAGCGGATGTGATAGTTTTCAGCATATCAACTTCCATTATAATGCATTGCTATGCTCAGGAAAGGGAGGTATTTCGATCGAAGTATTTGAATGTTC
TCGATTGGGTTTTCGGTGTGCCTCCTCCCACTTGTGAAACACCACGCTGCAAGAATGGCAAGCAGTGTTGA
mRNA sequenceShow/hide mRNA sequence
AGGAACTCAAAAATTATCCCAAAGCTCCTCTCTTTTATCTCTCAATTCCTTTCTGCTAAGTGATTCCATTTGGGATCTCTCGCTCACTTCCTCTTCCACTCAACTCACGT
TCGTAAGGGGGGAGCCCTAGCCTCCGCTTCCGGCACCATCATGTCGCCGTCTGCTTGCGGCGGATGTGCATGCCTTGCTCAACAGAATGGGGATGCCCATGTAGCTGCTA
ACTCTAAATCCGGTGATTCCTATTGCGAGCATTGTGGTTCAGCGGATTCATCTTCCTTGCCCGCGTTTTCCTGCTCTTCCTCTTCTCTATGGCTTGATTCGACGCGGTTG
AGAGAGTCCGGGAAGCTATGGCGGATCCTTGTTGCTTCTGCTAAAGGCTTCACAATTGGAGCTGGTCTCAAAGGTGGCCTCTCCCTCTTCTCAGTCCTCGCTGGACTGAA
GCGGAGAAAGGCTTTGGCCTCCCTCGGGAAGAAAGGAGTGATTACGAATCGAGATGCGATTTCCATGGCTTTGAAGGAGACTTTGAGATACGGTCTCTTTCTTGGAACCT
TTGCTGGTACATTCGTTTCAATCGACGAGATAATTGGCAATCTAGCAGGCCATCGTAGGACCGCAAGATGGAGGGCACTATTAGCGGGAGCATTAGCTGGGCCGTCGATG
CTTCTGACTGGGTTAAATACGCAACATAAGACCTTGGCTATCTACATTTTTATGCGTGCTGCGGTCTTGGCTTCGCGCTGTGGGATTAAAAGCAAGCGGCTCGGGCATAT
TTGTAAGCCTCTCACGTGGTCATATGGTGACATCTTCCTTATGTGTCTCTCCTCTTCGCAGATCTTGTCTGCTTATGTGTTAAAGCAAGATAGCTTACCACCATCATTTA
GGTCCTTTCTCAATACACATGGTGGAAAGGATACCGTAATCTTGGAAGGTTTAAAGAGTTTCGTATCAGGCATGCCTTCTTCTAATAAATTTAAAGCAATAGAGAAGTAC
TACAGTGCCATGGGTGCGGACGTCAAATTAGATCCGCAAATGAAGACTCCATGCACGATTATACATGGAAATCAATCATGTGGTGGCCATTTTCTTTCCTTTCTCATTCA
AGGATATAAAAGAGCGTTGCCAGTTTACCTCCCCGTATATCTTATTCCAGCCCTAATAGTTCATCGTGAAGGTCTCATGAATAGGCCATACGAAATCTTAGCAAGGGGGC
TTCTTGGAACTGCTAGATCAAGTCTGTTCCTTTCTGCATATTGTGCATCGGCTTGGCCTTTGACCTCTCAAAAAGCTTTTCAATCCACTTCAAAGCTTCAATTCACCTTT
AGGAGTTCTTATCAGTTCAGAAAAAAAATATTAGCCTCTGGCTCTAATAGATCAAGTTTTCCACCTACCTTTCAGGATATGGACATGCCTGACCGCAAGGACTTTCAAGA
AAATAAACATTCCATTGGTTGCTCTAGCAACGGTTTGGCACTGGCCATTGAGAAGAAAAGCAGAAGGATAGAAATCTCACTCTATTGCCTCGCGAGAGGTATCGAGAGCT
TCTTCAGCTGCATGACGGATTTGGGATATTTGCCGCCATCGTTGAATTTCAAAAGAGCGGATGTGATAGTTTTCAGCATATCAACTTCCATTATAATGCATTGCTATGCT
CAGGAAAGGGAGGTATTTCGATCGAAGTATTTGAATGTTCTCGATTGGGTTTTCGGTGTGCCTCCTCCCACTTGTGAAACACCACGCTGCAAGAATGGCAAGCAGTGTTG
A
Protein sequenceShow/hide protein sequence
MSPSACGGCACLAQQNGDAHVAANSKSGDSYCEHCGSADSSSLPAFSCSSSSLWLDSTRLRESGKLWRILVASAKGFTIGAGLKGGLSLFSVLAGLKRRKALASLGKKGV
ITNRDAISMALKETLRYGLFLGTFAGTFVSIDEIIGNLAGHRRTARWRALLAGALAGPSMLLTGLNTQHKTLAIYIFMRAAVLASRCGIKSKRLGHICKPLTWSYGDIFL
MCLSSSQILSAYVLKQDSLPPSFRSFLNTHGGKDTVILEGLKSFVSGMPSSNKFKAIEKYYSAMGADVKLDPQMKTPCTIIHGNQSCGGHFLSFLIQGYKRALPVYLPVY
LIPALIVHREGLMNRPYEILARGLLGTARSSLFLSAYCASAWPLTSQKAFQSTSKLQFTFRSSYQFRKKILASGSNRSSFPPTFQDMDMPDRKDFQENKHSIGCSSNGLA
LAIEKKSRRIEISLYCLARGIESFFSCMTDLGYLPPSLNFKRADVIVFSISTSIIMHCYAQEREVFRSKYLNVLDWVFGVPPPTCETPRCKNGKQC