; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0004859 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0004859
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionTMEM135_C_rich domain-containing protein
Genome locationchr6:7851052..7856109
RNA-Seq ExpressionLag0004859
SyntenyLag0004859
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR026749 - Transmembrane protein 135


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008444833.1 PREDICTED: uncharacterized protein LOC103488060 [Cucumis melo]5.3e-26592.35Show/hide
Query:  MSPSAVGGFDADAGGGCACLAKQNGDAEPDDANCKSGDSYCEHCGCGRGSGSADSSSPPAFSCSSSSSLWLDSTRLSESGEVWRILVASAKGFTIGAGLK
        MSPSA G FDA+  GGCACLA+QNGDAE   ANCKSGDSYCEHC      GSADSSS P+FSC SSSSLWLDSTRL E G++WRILVASAKGFTIGAGLK
Subjt:  MSPSAVGGFDADAGGGCACLAKQNGDAEPDDANCKSGDSYCEHCGCGRGSGSADSSSPPAFSCSSSSSLWLDSTRLSESGEVWRILVASAKGFTIGAGLK

Query:  GGLSLFSILAGLKRRKALASLGKKGVITNRDAISMALKETLRYGLFLGTFAGTFVSIDEIIGNLAGHRRTARWRALLAGGLAGPSMLLTGLNTQHKTLAI
        GGLSLFS+LAGLKRRKALASLGKKGVITNRDAISMALKETLRYGLFLGTFAGTFVSIDEIIGNLAGHRRTARWRALLAG LAGPSMLLTGLNTQHKTLAI
Subjt:  GGLSLFSILAGLKRRKALASLGKKGVITNRDAISMALKETLRYGLFLGTFAGTFVSIDEIIGNLAGHRRTARWRALLAGGLAGPSMLLTGLNTQHKTLAI

Query:  YIFMRAAVLASRCGIKSKRLGHICKPLTWSYGDIFLMCLSSSQILSAYVLKQDSLPPSFRSFLNTHGGKDTVILEGLKSFLSGMPSSNKFRALEKYYRAM
        YIFMRAAVLASRCGIKSKRLGHICKPLTWSYGDIFLMCLSSSQILSAYVLKQDSLPPSFRSFLNTHGGKDTVILEGLKSF+SGMPSS+KF+A+EKYY AM
Subjt:  YIFMRAAVLASRCGIKSKRLGHICKPLTWSYGDIFLMCLSSSQILSAYVLKQDSLPPSFRSFLNTHGGKDTVILEGLKSFLSGMPSSNKFRALEKYYRAM

Query:  GADVKLDPRMKTPCTIIHGNQSCGGHFFSFLIQGYKRALPVYLPVYLIPALIVHREGLMNRPYEILARGLLGTARSSLFLSVYCSSAWMWTCLIARTFKK
        GA VKLDP+MKTPCTIIHGNQSCGGHF SFLIQGYKRALPVYLPVYLIPALIVHREGLMNRPYEILARGLLGTARSSLFLS YC+SAWMWTCL +RTFKK
Subjt:  GADVKLDPRMKTPCTIIHGNQSCGGHFFSFLIQGYKRALPVYLPVYLIPALIVHREGLMNRPYEILARGLLGTARSSLFLSVYCSSAWMWTCLIARTFKK

Query:  INIPLVAVATFLTGLALAIEKKSRRIEISLYCLARGIESFFSCMTDLGYLPQSLNFKRADVIVFSISTAIIMHCYAQEREVFRSKYLNVLDWVFGVPPPP
        INIPLVA+ATFLTGLALAIEKKSRRIEISLYCLARGIESFFSCMTDLGYLP SLNFKRADVIVFSIST+IIMHCYAQEREVFRSKYLNVLDWVFGVPPPP
Subjt:  INIPLVAVATFLTGLALAIEKKSRRIEISLYCLARGIESFFSCMTDLGYLPQSLNFKRADVIVFSISTAIIMHCYAQEREVFRSKYLNVLDWVFGVPPPP

Query:  CETPRCKNGK
        CETPRCKNGK
Subjt:  CETPRCKNGK

XP_011649669.1 uncharacterized protein LOC101202879 isoform X1 [Cucumis sativus]3.2e-26291.75Show/hide
Query:  MSPSAVGGFDADAGGGCACLAKQNGDAEPDDANCKSGDSYCEHCGCGRGSGSADSSSPPAFSCSSSSSLWLDSTRLSESGEVWRILVASAKGFTIGAGLK
        MSPSA G FDAD  GGCACLA+QNGDAE   ANCKSGDSYCEHC      GSADSSS P+FSC SSSSLWLDSTRL E G++ RILVASAKGFTIGAGLK
Subjt:  MSPSAVGGFDADAGGGCACLAKQNGDAEPDDANCKSGDSYCEHCGCGRGSGSADSSSPPAFSCSSSSSLWLDSTRLSESGEVWRILVASAKGFTIGAGLK

Query:  GGLSLFSILAGLKRRKALASLGKKGVITNRDAISMALKETLRYGLFLGTFAGTFVSIDEIIGNLAGHRRTARWRALLAGGLAGPSMLLTGLNTQHKTLAI
        GGLSLFS+LAGLKRRKALASLGKKGVITNRDAISMALKETLRYGLFLGTFAGTFVSIDEIIGN+AGHRRTARWRALLAG LAGPSMLLTGLNTQHKTLAI
Subjt:  GGLSLFSILAGLKRRKALASLGKKGVITNRDAISMALKETLRYGLFLGTFAGTFVSIDEIIGNLAGHRRTARWRALLAGGLAGPSMLLTGLNTQHKTLAI

Query:  YIFMRAAVLASRCGIKSKRLGHICKPLTWSYGDIFLMCLSSSQILSAYVLKQDSLPPSFRSFLNTHGGKDTVILEGLKSFLSGMPSSNKFRALEKYYRAM
        YIFMRAAVLASRCGIKSKRLGHICKPLTWS GDIFLMCLSSSQILSAYVLKQDSLPPSFRSFLNTHGGKDTVILEGLKSF+SGMPSSNKF+A+EKYY AM
Subjt:  YIFMRAAVLASRCGIKSKRLGHICKPLTWSYGDIFLMCLSSSQILSAYVLKQDSLPPSFRSFLNTHGGKDTVILEGLKSFLSGMPSSNKFRALEKYYRAM

Query:  GADVKLDPRMKTPCTIIHGNQSCGGHFFSFLIQGYKRALPVYLPVYLIPALIVHREGLMNRPYEILARGLLGTARSSLFLSVYCSSAWMWTCLIARTFKK
        G+ VKLDP+MKTPCTIIHGNQSCGGHF SFLIQGYKRALPVYLPVYLIPALIVHREGLMNRPYEILARGLLGTARSSLFLS YC+SAWMWTCL +RTFKK
Subjt:  GADVKLDPRMKTPCTIIHGNQSCGGHFFSFLIQGYKRALPVYLPVYLIPALIVHREGLMNRPYEILARGLLGTARSSLFLSVYCSSAWMWTCLIARTFKK

Query:  INIPLVAVATFLTGLALAIEKKSRRIEISLYCLARGIESFFSCMTDLGYLPQSLNFKRADVIVFSISTAIIMHCYAQEREVFRSKYLNVLDWVFGVPPPP
        INIPLVA+ATFLTGLALAIEKKSRRIEISLYCL+RGIESFFSCMTDLGYLP SLNFKRADVIVFSIST+IIMHCYAQEREVFRSKYLNVLDWVFGVPPPP
Subjt:  INIPLVAVATFLTGLALAIEKKSRRIEISLYCLARGIESFFSCMTDLGYLPQSLNFKRADVIVFSISTAIIMHCYAQEREVFRSKYLNVLDWVFGVPPPP

Query:  CETPRCKNG
        CETPRCKNG
Subjt:  CETPRCKNG

XP_022131834.1 uncharacterized protein LOC111004892 isoform X1 [Momordica charantia]3.0e-26090.2Show/hide
Query:  MSPSAVGGFDADAGGGCACLAKQNGDAEPDDANCKSGDSYCEHCGCGRGSGSADSSSPPAFSCSSSSSLWLDSTRLSESGEVWRILVASAKGFTIGAGLK
        MSPSAV GFDAD  GGCAC+A+QNGD E    NCKSG+SYC+HCGC    GSADSSS PAFSC SSSSLW DS RL ESG++WRILVASAKGFTIGAGLK
Subjt:  MSPSAVGGFDADAGGGCACLAKQNGDAEPDDANCKSGDSYCEHCGCGRGSGSADSSSPPAFSCSSSSSLWLDSTRLSESGEVWRILVASAKGFTIGAGLK

Query:  GGLSLFSILAGLKRRKALASLGKKGVITNRDAISMALKETLRYGLFLGTFAGTFVSIDEIIGNLAGHRRTARWRALLAGGLAGPSMLLTGLNTQHKTLAI
        GGLSLFSILAGLKRRKALASL KKGVITNRDAISMALKETLRYGLFLGTFAGTFVSIDEIIGNLAGHRRTARWRALLAG LAGPSMLLTGLNTQHK+LAI
Subjt:  GGLSLFSILAGLKRRKALASLGKKGVITNRDAISMALKETLRYGLFLGTFAGTFVSIDEIIGNLAGHRRTARWRALLAGGLAGPSMLLTGLNTQHKTLAI

Query:  YIFMRAAVLASRCGIKSKRLGHICKPLTWSYGDIFLMCLSSSQILSAYVLKQDSLPPSFRSFLNTHGGKDTVILEGLKSFLSGMPSSNKFRALEKYYRAM
        YIFMRAAVLASRCGIKSK+LGHICKPLTWSYGDIFLMCLSSSQILSAYVLKQDSLPPSFRSFLNTHGGKDTVILEGLKSF+SGMPSSNKF  +EKYYRAM
Subjt:  YIFMRAAVLASRCGIKSKRLGHICKPLTWSYGDIFLMCLSSSQILSAYVLKQDSLPPSFRSFLNTHGGKDTVILEGLKSFLSGMPSSNKFRALEKYYRAM

Query:  GADVKLDPRMKTPCTIIHGNQSCGGHFFSFLIQGYKRALPVYLPVYLIPALIVHREGLMNRPYEILARGLLGTARSSLFLSVYCSSAWMWTCLIARTFKK
        GAD KLD +MKTPCTIIHGNQSCGGHF SFLIQGYKRALPVYLPVYL+PALIVHR+ L+NRP EILARGLLGTARSSLFLS+YCSSAWMWTCL +RTFKK
Subjt:  GADVKLDPRMKTPCTIIHGNQSCGGHFFSFLIQGYKRALPVYLPVYLIPALIVHREGLMNRPYEILARGLLGTARSSLFLSVYCSSAWMWTCLIARTFKK

Query:  INIPLVAVATFLTGLALAIEKKSRRIEISLYCLARGIESFFSCMTDLGYLPQSLNFKRADVIVFSISTAIIMHCYAQEREVFRSKYLNVLDWVFGVPPPP
        INIPLVA+ATFLTGLALAIEKKSRRIEISLYCLARGIESFF C TD GYLPQSLNFKRADVIVFS+STAIIMHCYAQEREVFRSKYLNVLDWVFGVPPPP
Subjt:  INIPLVAVATFLTGLALAIEKKSRRIEISLYCLARGIESFFSCMTDLGYLPQSLNFKRADVIVFSISTAIIMHCYAQEREVFRSKYLNVLDWVFGVPPPP

Query:  CETPRCKNGK
        CETPRC N K
Subjt:  CETPRCKNGK

XP_023545566.1 uncharacterized protein LOC111804956 isoform X1 [Cucurbita pepo subsp. pepo]2.7e-26190.2Show/hide
Query:  MSPSAVGGFDADAGGGCACLAKQNGDAEPDDANCKSGDSYCEHCGCGRGSGSADSSSPPAFSCSSSSSLWLDSTRLSESGEVWRILVASAKGFTIGAGLK
        MSPSAVG F AD  GGCACLAK+NGD      NCKSGDSYC+HCGC    GSAD SS PAFSC SSSSLW DS RL ESG++WRILVASAKGFTIGAGLK
Subjt:  MSPSAVGGFDADAGGGCACLAKQNGDAEPDDANCKSGDSYCEHCGCGRGSGSADSSSPPAFSCSSSSSLWLDSTRLSESGEVWRILVASAKGFTIGAGLK

Query:  GGLSLFSILAGLKRRKALASLGKKGVITNRDAISMALKETLRYGLFLGTFAGTFVSIDEIIGNLAGHRRTARWRALLAGGLAGPSMLLTGLNTQHKTLAI
        GGLSLFS+LAGLKRRKALASLGKKGVITNRDA+SMALKETLRYGLFLGTFAGTFVSIDEIIGNLAGHRRTARWRAL+AG LAGPSMLLTGLNTQHKTLAI
Subjt:  GGLSLFSILAGLKRRKALASLGKKGVITNRDAISMALKETLRYGLFLGTFAGTFVSIDEIIGNLAGHRRTARWRALLAGGLAGPSMLLTGLNTQHKTLAI

Query:  YIFMRAAVLASRCGIKSKRLGHICKPLTWSYGDIFLMCLSSSQILSAYVLKQDSLPPSFRSFLNTHGGKDTVILEGLKSFLSGMPSSNKFRALEKYYRAM
        YIFMRAAVLASRCGI+SKRLGHICKPLTWSYGDIFLMCLSSSQILSAYVLKQDSLPPSFRSFLNTHGGKDTVILEGLKSF+ GMPS NKF A+EKYY+  
Subjt:  YIFMRAAVLASRCGIKSKRLGHICKPLTWSYGDIFLMCLSSSQILSAYVLKQDSLPPSFRSFLNTHGGKDTVILEGLKSFLSGMPSSNKFRALEKYYRAM

Query:  GADVKLDPRMKTPCTIIHGNQSCGGHFFSFLIQGYKRALPVYLPVYLIPALIVHREGLMNRPYEILARGLLGTARSSLFLSVYCSSAWMWTCLIARTFKK
        GA VKLDP MKTPCTIIHGNQSCGGHF SF+IQGYKRALPVYLPVYLIPALIVHR+GLMNRPYEILARGLLGTARSSLFLSVYC+SAW+WTCL ARTF+K
Subjt:  GADVKLDPRMKTPCTIIHGNQSCGGHFFSFLIQGYKRALPVYLPVYLIPALIVHREGLMNRPYEILARGLLGTARSSLFLSVYCSSAWMWTCLIARTFKK

Query:  INIPLVAVATFLTGLALAIEKKSRRIEISLYCLARGIESFFSCMTDLGYLPQSLNFKRADVIVFSISTAIIMHCYAQEREVFRSKYLNVLDWVFGVPPPP
        IN+PLVAVATFLTGLALAIEKKSRRIEISLYCLARGIESFFSCMTDLGYLP SLNFKRADVIVFS+ST+IIMHCYAQEREVFRSKYLNVLDWVFGVPPPP
Subjt:  INIPLVAVATFLTGLALAIEKKSRRIEISLYCLARGIESFFSCMTDLGYLPQSLNFKRADVIVFSISTAIIMHCYAQEREVFRSKYLNVLDWVFGVPPPP

Query:  CETPRCKNGK
        CETPRCKNGK
Subjt:  CETPRCKNGK

XP_038886157.1 uncharacterized protein LOC120076412 isoform X1 [Benincasa hispida]1.2e-26190.98Show/hide
Query:  MSPSAVGGFDADAGGGCACLAKQNGDAEPDDANCKSGDSYCEHCGCGRGSGSADSSSPPAFSCSSSSSLWLDSTRLSESGEVWRILVASAKGFTIGAGLK
        MSPSAVG FDAD  GGC C A QNGDA+   AN KSGDSYCEHCGC    GSADSSS P+FSC SSSSLWLDSTRL ESG++WRILVASAKGFTIGAGLK
Subjt:  MSPSAVGGFDADAGGGCACLAKQNGDAEPDDANCKSGDSYCEHCGCGRGSGSADSSSPPAFSCSSSSSLWLDSTRLSESGEVWRILVASAKGFTIGAGLK

Query:  GGLSLFSILAGLKRRKALASLGKKGVITNRDAISMALKETLRYGLFLGTFAGTFVSIDEIIGNLAGHRRTARWRALLAGGLAGPSMLLTGLNTQHKTLAI
        GGLSLFS+LAGLKRRKALASLGKKGVITNRDAISMALKETLRYGLFLGTFAGTFVSIDEI+GNL GHRRTA WRALLAG LAGPSMLLTGLNTQHKTLAI
Subjt:  GGLSLFSILAGLKRRKALASLGKKGVITNRDAISMALKETLRYGLFLGTFAGTFVSIDEIIGNLAGHRRTARWRALLAGGLAGPSMLLTGLNTQHKTLAI

Query:  YIFMRAAVLASRCGIKSKRLGHICKPLTWSYGDIFLMCLSSSQILSAYVLKQDSLPPSFRSFLNTHGGKDTVILEGLKSFLSGMPSSNKFRALEKYYRAM
        YIFMRAAVLASRCGIKSKR GHICKPLTWSYGDIFLMCLSSSQILSAYVLKQDSLPPSFRSFLNTHGGKDTVILEGLKSF+SGMPSSNKF+A+EKYY AM
Subjt:  YIFMRAAVLASRCGIKSKRLGHICKPLTWSYGDIFLMCLSSSQILSAYVLKQDSLPPSFRSFLNTHGGKDTVILEGLKSFLSGMPSSNKFRALEKYYRAM

Query:  GADVKLDPRMKTPCTIIHGNQSCGGHFFSFLIQGYKRALPVYLPVYLIPALIVHREGLMNRPYEILARGLLGTARSSLFLSVYCSSAWMWTCLIARTFKK
        GA+V+L+ +MKTPC IIHGNQSCGGHF SFLI+GYKRALPVYLPVYLIPALIVHREGLMNRPYEILARGLLGTARSSLFLS YC+SAWMWTCL AR+FKK
Subjt:  GADVKLDPRMKTPCTIIHGNQSCGGHFFSFLIQGYKRALPVYLPVYLIPALIVHREGLMNRPYEILARGLLGTARSSLFLSVYCSSAWMWTCLIARTFKK

Query:  INIPLVAVATFLTGLALAIEKKSRRIEISLYCLARGIESFFSCMTDLGYLPQSLNFKRADVIVFSISTAIIMHCYAQEREVFRSKYLNVLDWVFGVPPPP
        INIPLVAVATFLTGLALAIEKKSRRIEISLYCLARGIESFFS MTDLGYLP SLNFKRADVIVFSIST+IIMHCYAQEREVFRSKYLNVLDWVFGVPPPP
Subjt:  INIPLVAVATFLTGLALAIEKKSRRIEISLYCLARGIESFFSCMTDLGYLPQSLNFKRADVIVFSISTAIIMHCYAQEREVFRSKYLNVLDWVFGVPPPP

Query:  CETPRCKNGK
        CETPRCKNGK
Subjt:  CETPRCKNGK

TrEMBL top hitse value%identityAlignment
A0A0A0LRQ1 Uncharacterized protein1.6e-26291.75Show/hide
Query:  MSPSAVGGFDADAGGGCACLAKQNGDAEPDDANCKSGDSYCEHCGCGRGSGSADSSSPPAFSCSSSSSLWLDSTRLSESGEVWRILVASAKGFTIGAGLK
        MSPSA G FDAD  GGCACLA+QNGDAE   ANCKSGDSYCEHC      GSADSSS P+FSC SSSSLWLDSTRL E G++ RILVASAKGFTIGAGLK
Subjt:  MSPSAVGGFDADAGGGCACLAKQNGDAEPDDANCKSGDSYCEHCGCGRGSGSADSSSPPAFSCSSSSSLWLDSTRLSESGEVWRILVASAKGFTIGAGLK

Query:  GGLSLFSILAGLKRRKALASLGKKGVITNRDAISMALKETLRYGLFLGTFAGTFVSIDEIIGNLAGHRRTARWRALLAGGLAGPSMLLTGLNTQHKTLAI
        GGLSLFS+LAGLKRRKALASLGKKGVITNRDAISMALKETLRYGLFLGTFAGTFVSIDEIIGN+AGHRRTARWRALLAG LAGPSMLLTGLNTQHKTLAI
Subjt:  GGLSLFSILAGLKRRKALASLGKKGVITNRDAISMALKETLRYGLFLGTFAGTFVSIDEIIGNLAGHRRTARWRALLAGGLAGPSMLLTGLNTQHKTLAI

Query:  YIFMRAAVLASRCGIKSKRLGHICKPLTWSYGDIFLMCLSSSQILSAYVLKQDSLPPSFRSFLNTHGGKDTVILEGLKSFLSGMPSSNKFRALEKYYRAM
        YIFMRAAVLASRCGIKSKRLGHICKPLTWS GDIFLMCLSSSQILSAYVLKQDSLPPSFRSFLNTHGGKDTVILEGLKSF+SGMPSSNKF+A+EKYY AM
Subjt:  YIFMRAAVLASRCGIKSKRLGHICKPLTWSYGDIFLMCLSSSQILSAYVLKQDSLPPSFRSFLNTHGGKDTVILEGLKSFLSGMPSSNKFRALEKYYRAM

Query:  GADVKLDPRMKTPCTIIHGNQSCGGHFFSFLIQGYKRALPVYLPVYLIPALIVHREGLMNRPYEILARGLLGTARSSLFLSVYCSSAWMWTCLIARTFKK
        G+ VKLDP+MKTPCTIIHGNQSCGGHF SFLIQGYKRALPVYLPVYLIPALIVHREGLMNRPYEILARGLLGTARSSLFLS YC+SAWMWTCL +RTFKK
Subjt:  GADVKLDPRMKTPCTIIHGNQSCGGHFFSFLIQGYKRALPVYLPVYLIPALIVHREGLMNRPYEILARGLLGTARSSLFLSVYCSSAWMWTCLIARTFKK

Query:  INIPLVAVATFLTGLALAIEKKSRRIEISLYCLARGIESFFSCMTDLGYLPQSLNFKRADVIVFSISTAIIMHCYAQEREVFRSKYLNVLDWVFGVPPPP
        INIPLVA+ATFLTGLALAIEKKSRRIEISLYCL+RGIESFFSCMTDLGYLP SLNFKRADVIVFSIST+IIMHCYAQEREVFRSKYLNVLDWVFGVPPPP
Subjt:  INIPLVAVATFLTGLALAIEKKSRRIEISLYCLARGIESFFSCMTDLGYLPQSLNFKRADVIVFSISTAIIMHCYAQEREVFRSKYLNVLDWVFGVPPPP

Query:  CETPRCKNG
        CETPRCKNG
Subjt:  CETPRCKNG

A0A1S3BAT4 uncharacterized protein LOC1034880602.6e-26592.35Show/hide
Query:  MSPSAVGGFDADAGGGCACLAKQNGDAEPDDANCKSGDSYCEHCGCGRGSGSADSSSPPAFSCSSSSSLWLDSTRLSESGEVWRILVASAKGFTIGAGLK
        MSPSA G FDA+  GGCACLA+QNGDAE   ANCKSGDSYCEHC      GSADSSS P+FSC SSSSLWLDSTRL E G++WRILVASAKGFTIGAGLK
Subjt:  MSPSAVGGFDADAGGGCACLAKQNGDAEPDDANCKSGDSYCEHCGCGRGSGSADSSSPPAFSCSSSSSLWLDSTRLSESGEVWRILVASAKGFTIGAGLK

Query:  GGLSLFSILAGLKRRKALASLGKKGVITNRDAISMALKETLRYGLFLGTFAGTFVSIDEIIGNLAGHRRTARWRALLAGGLAGPSMLLTGLNTQHKTLAI
        GGLSLFS+LAGLKRRKALASLGKKGVITNRDAISMALKETLRYGLFLGTFAGTFVSIDEIIGNLAGHRRTARWRALLAG LAGPSMLLTGLNTQHKTLAI
Subjt:  GGLSLFSILAGLKRRKALASLGKKGVITNRDAISMALKETLRYGLFLGTFAGTFVSIDEIIGNLAGHRRTARWRALLAGGLAGPSMLLTGLNTQHKTLAI

Query:  YIFMRAAVLASRCGIKSKRLGHICKPLTWSYGDIFLMCLSSSQILSAYVLKQDSLPPSFRSFLNTHGGKDTVILEGLKSFLSGMPSSNKFRALEKYYRAM
        YIFMRAAVLASRCGIKSKRLGHICKPLTWSYGDIFLMCLSSSQILSAYVLKQDSLPPSFRSFLNTHGGKDTVILEGLKSF+SGMPSS+KF+A+EKYY AM
Subjt:  YIFMRAAVLASRCGIKSKRLGHICKPLTWSYGDIFLMCLSSSQILSAYVLKQDSLPPSFRSFLNTHGGKDTVILEGLKSFLSGMPSSNKFRALEKYYRAM

Query:  GADVKLDPRMKTPCTIIHGNQSCGGHFFSFLIQGYKRALPVYLPVYLIPALIVHREGLMNRPYEILARGLLGTARSSLFLSVYCSSAWMWTCLIARTFKK
        GA VKLDP+MKTPCTIIHGNQSCGGHF SFLIQGYKRALPVYLPVYLIPALIVHREGLMNRPYEILARGLLGTARSSLFLS YC+SAWMWTCL +RTFKK
Subjt:  GADVKLDPRMKTPCTIIHGNQSCGGHFFSFLIQGYKRALPVYLPVYLIPALIVHREGLMNRPYEILARGLLGTARSSLFLSVYCSSAWMWTCLIARTFKK

Query:  INIPLVAVATFLTGLALAIEKKSRRIEISLYCLARGIESFFSCMTDLGYLPQSLNFKRADVIVFSISTAIIMHCYAQEREVFRSKYLNVLDWVFGVPPPP
        INIPLVA+ATFLTGLALAIEKKSRRIEISLYCLARGIESFFSCMTDLGYLP SLNFKRADVIVFSIST+IIMHCYAQEREVFRSKYLNVLDWVFGVPPPP
Subjt:  INIPLVAVATFLTGLALAIEKKSRRIEISLYCLARGIESFFSCMTDLGYLPQSLNFKRADVIVFSISTAIIMHCYAQEREVFRSKYLNVLDWVFGVPPPP

Query:  CETPRCKNGK
        CETPRCKNGK
Subjt:  CETPRCKNGK

A0A5A7VHD2 Uncharacterized protein2.6e-26592.35Show/hide
Query:  MSPSAVGGFDADAGGGCACLAKQNGDAEPDDANCKSGDSYCEHCGCGRGSGSADSSSPPAFSCSSSSSLWLDSTRLSESGEVWRILVASAKGFTIGAGLK
        MSPSA G FDA+  GGCACLA+QNGDAE   ANCKSGDSYCEHC      GSADSSS P+FSC SSSSLWLDSTRL E G++WRILVASAKGFTIGAGLK
Subjt:  MSPSAVGGFDADAGGGCACLAKQNGDAEPDDANCKSGDSYCEHCGCGRGSGSADSSSPPAFSCSSSSSLWLDSTRLSESGEVWRILVASAKGFTIGAGLK

Query:  GGLSLFSILAGLKRRKALASLGKKGVITNRDAISMALKETLRYGLFLGTFAGTFVSIDEIIGNLAGHRRTARWRALLAGGLAGPSMLLTGLNTQHKTLAI
        GGLSLFS+LAGLKRRKALASLGKKGVITNRDAISMALKETLRYGLFLGTFAGTFVSIDEIIGNLAGHRRTARWRALLAG LAGPSMLLTGLNTQHKTLAI
Subjt:  GGLSLFSILAGLKRRKALASLGKKGVITNRDAISMALKETLRYGLFLGTFAGTFVSIDEIIGNLAGHRRTARWRALLAGGLAGPSMLLTGLNTQHKTLAI

Query:  YIFMRAAVLASRCGIKSKRLGHICKPLTWSYGDIFLMCLSSSQILSAYVLKQDSLPPSFRSFLNTHGGKDTVILEGLKSFLSGMPSSNKFRALEKYYRAM
        YIFMRAAVLASRCGIKSKRLGHICKPLTWSYGDIFLMCLSSSQILSAYVLKQDSLPPSFRSFLNTHGGKDTVILEGLKSF+SGMPSS+KF+A+EKYY AM
Subjt:  YIFMRAAVLASRCGIKSKRLGHICKPLTWSYGDIFLMCLSSSQILSAYVLKQDSLPPSFRSFLNTHGGKDTVILEGLKSFLSGMPSSNKFRALEKYYRAM

Query:  GADVKLDPRMKTPCTIIHGNQSCGGHFFSFLIQGYKRALPVYLPVYLIPALIVHREGLMNRPYEILARGLLGTARSSLFLSVYCSSAWMWTCLIARTFKK
        GA VKLDP+MKTPCTIIHGNQSCGGHF SFLIQGYKRALPVYLPVYLIPALIVHREGLMNRPYEILARGLLGTARSSLFLS YC+SAWMWTCL +RTFKK
Subjt:  GADVKLDPRMKTPCTIIHGNQSCGGHFFSFLIQGYKRALPVYLPVYLIPALIVHREGLMNRPYEILARGLLGTARSSLFLSVYCSSAWMWTCLIARTFKK

Query:  INIPLVAVATFLTGLALAIEKKSRRIEISLYCLARGIESFFSCMTDLGYLPQSLNFKRADVIVFSISTAIIMHCYAQEREVFRSKYLNVLDWVFGVPPPP
        INIPLVA+ATFLTGLALAIEKKSRRIEISLYCLARGIESFFSCMTDLGYLP SLNFKRADVIVFSIST+IIMHCYAQEREVFRSKYLNVLDWVFGVPPPP
Subjt:  INIPLVAVATFLTGLALAIEKKSRRIEISLYCLARGIESFFSCMTDLGYLPQSLNFKRADVIVFSISTAIIMHCYAQEREVFRSKYLNVLDWVFGVPPPP

Query:  CETPRCKNGK
        CETPRCKNGK
Subjt:  CETPRCKNGK

A0A6J1BRD3 uncharacterized protein LOC111004892 isoform X11.5e-26090.2Show/hide
Query:  MSPSAVGGFDADAGGGCACLAKQNGDAEPDDANCKSGDSYCEHCGCGRGSGSADSSSPPAFSCSSSSSLWLDSTRLSESGEVWRILVASAKGFTIGAGLK
        MSPSAV GFDAD  GGCAC+A+QNGD E    NCKSG+SYC+HCGC    GSADSSS PAFSC SSSSLW DS RL ESG++WRILVASAKGFTIGAGLK
Subjt:  MSPSAVGGFDADAGGGCACLAKQNGDAEPDDANCKSGDSYCEHCGCGRGSGSADSSSPPAFSCSSSSSLWLDSTRLSESGEVWRILVASAKGFTIGAGLK

Query:  GGLSLFSILAGLKRRKALASLGKKGVITNRDAISMALKETLRYGLFLGTFAGTFVSIDEIIGNLAGHRRTARWRALLAGGLAGPSMLLTGLNTQHKTLAI
        GGLSLFSILAGLKRRKALASL KKGVITNRDAISMALKETLRYGLFLGTFAGTFVSIDEIIGNLAGHRRTARWRALLAG LAGPSMLLTGLNTQHK+LAI
Subjt:  GGLSLFSILAGLKRRKALASLGKKGVITNRDAISMALKETLRYGLFLGTFAGTFVSIDEIIGNLAGHRRTARWRALLAGGLAGPSMLLTGLNTQHKTLAI

Query:  YIFMRAAVLASRCGIKSKRLGHICKPLTWSYGDIFLMCLSSSQILSAYVLKQDSLPPSFRSFLNTHGGKDTVILEGLKSFLSGMPSSNKFRALEKYYRAM
        YIFMRAAVLASRCGIKSK+LGHICKPLTWSYGDIFLMCLSSSQILSAYVLKQDSLPPSFRSFLNTHGGKDTVILEGLKSF+SGMPSSNKF  +EKYYRAM
Subjt:  YIFMRAAVLASRCGIKSKRLGHICKPLTWSYGDIFLMCLSSSQILSAYVLKQDSLPPSFRSFLNTHGGKDTVILEGLKSFLSGMPSSNKFRALEKYYRAM

Query:  GADVKLDPRMKTPCTIIHGNQSCGGHFFSFLIQGYKRALPVYLPVYLIPALIVHREGLMNRPYEILARGLLGTARSSLFLSVYCSSAWMWTCLIARTFKK
        GAD KLD +MKTPCTIIHGNQSCGGHF SFLIQGYKRALPVYLPVYL+PALIVHR+ L+NRP EILARGLLGTARSSLFLS+YCSSAWMWTCL +RTFKK
Subjt:  GADVKLDPRMKTPCTIIHGNQSCGGHFFSFLIQGYKRALPVYLPVYLIPALIVHREGLMNRPYEILARGLLGTARSSLFLSVYCSSAWMWTCLIARTFKK

Query:  INIPLVAVATFLTGLALAIEKKSRRIEISLYCLARGIESFFSCMTDLGYLPQSLNFKRADVIVFSISTAIIMHCYAQEREVFRSKYLNVLDWVFGVPPPP
        INIPLVA+ATFLTGLALAIEKKSRRIEISLYCLARGIESFF C TD GYLPQSLNFKRADVIVFS+STAIIMHCYAQEREVFRSKYLNVLDWVFGVPPPP
Subjt:  INIPLVAVATFLTGLALAIEKKSRRIEISLYCLARGIESFFSCMTDLGYLPQSLNFKRADVIVFSISTAIIMHCYAQEREVFRSKYLNVLDWVFGVPPPP

Query:  CETPRCKNGK
        CETPRC N K
Subjt:  CETPRCKNGK

A0A6J1HFI3 uncharacterized protein LOC111462580 isoform X13.3e-26090.2Show/hide
Query:  MSPSAVGGFDADAGGGCACLAKQNGDAEPDDANCKSGDSYCEHCGCGRGSGSADSSSPPAFSCSSSSSLWLDSTRLSESGEVWRILVASAKGFTIGAGLK
        MSPSAVG F AD  GGCACLAK+NGD      NCKS DSYC+HCGC    GSAD SS P FSC SSSSLW DSTRLSESG++WRILVASAKGFTIGAGLK
Subjt:  MSPSAVGGFDADAGGGCACLAKQNGDAEPDDANCKSGDSYCEHCGCGRGSGSADSSSPPAFSCSSSSSLWLDSTRLSESGEVWRILVASAKGFTIGAGLK

Query:  GGLSLFSILAGLKRRKALASLGKKGVITNRDAISMALKETLRYGLFLGTFAGTFVSIDEIIGNLAGHRRTARWRALLAGGLAGPSMLLTGLNTQHKTLAI
        GGLSLFS+LAGLKRRKALASLGKKGVITNRDAISMALKETLRYGLFLGTFAGTFVSIDEIIGNLAGHRRTARWRAL+AG LAGPSMLLTGLNTQHKTLAI
Subjt:  GGLSLFSILAGLKRRKALASLGKKGVITNRDAISMALKETLRYGLFLGTFAGTFVSIDEIIGNLAGHRRTARWRALLAGGLAGPSMLLTGLNTQHKTLAI

Query:  YIFMRAAVLASRCGIKSKRLGHICKPLTWSYGDIFLMCLSSSQILSAYVLKQDSLPPSFRSFLNTHGGKDTVILEGLKSFLSGMPSSNKFRALEKYYRAM
        YIFMRAAVLASRCGI+SKRLGHICKPLTWSYGDIFLMCLSSSQILSAYVLKQDSLPPSFRSFLNTHGGKDTVILEGLKSF+ GMP SNKF A+EKYY+  
Subjt:  YIFMRAAVLASRCGIKSKRLGHICKPLTWSYGDIFLMCLSSSQILSAYVLKQDSLPPSFRSFLNTHGGKDTVILEGLKSFLSGMPSSNKFRALEKYYRAM

Query:  GADVKLDPRMKTPCTIIHGNQSCGGHFFSFLIQGYKRALPVYLPVYLIPALIVHREGLMNRPYEILARGLLGTARSSLFLSVYCSSAWMWTCLIARTFKK
        GA VKLDP MKTPCTIIHGNQSCGGHF SF+I+GYKRALPVYLPVYLIPALIVHR+GLMNRPYEILARGLLGTARSSLFLSVYC+SAW+WTCL ARTF+K
Subjt:  GADVKLDPRMKTPCTIIHGNQSCGGHFFSFLIQGYKRALPVYLPVYLIPALIVHREGLMNRPYEILARGLLGTARSSLFLSVYCSSAWMWTCLIARTFKK

Query:  INIPLVAVATFLTGLALAIEKKSRRIEISLYCLARGIESFFSCMTDLGYLPQSLNFKRADVIVFSISTAIIMHCYAQEREVFRSKYLNVLDWVFGVPPPP
        IN+PLVAVATFLTGLALAIEKKSRRIEISLYCLARGIESFFSCMTDLGYLPQSLNFKRADVIVFS+ST+IIMHCYAQER VFRSKYLNVLDWVFGVPPPP
Subjt:  INIPLVAVATFLTGLALAIEKKSRRIEISLYCLARGIESFFSCMTDLGYLPQSLNFKRADVIVFSISTAIIMHCYAQEREVFRSKYLNVLDWVFGVPPPP

Query:  CETPRCKNGK
        CETPRCKNGK
Subjt:  CETPRCKNGK

SwissProt top hitse value%identityAlignment
Q95QD1 Transmembrane protein 135 homolog3.1e-0532.26Show/hide
Query:  IPLVAVATFLTGLALAIEKKSRRIEISLYCLARGIESFFSCMTDLGYLPQSLNFKRADVIVFSISTAIIMHCYAQEREVFRSKYLNVLDWVFG
        +P V    F  GLA         + I++YCL + IE+ +  + D GYLP+   FK  +VI+++I+T  ++     E    R  YLN L  + G
Subjt:  IPLVAVATFLTGLALAIEKKSRRIEISLYCLARGIESFFSCMTDLGYLPQSLNFKRADVIVFSISTAIIMHCYAQEREVFRSKYLNVLDWVFG

Arabidopsis top hitse value%identityAlignment
AT1G34630.1 BEST Arabidopsis thaliana protein match is: Mitochondrial import inner membrane translocase subunit Tim17/Tim22/Tim23 family protein (TAIR:AT5G51150.1)3.8e-17666.1Show/hide
Query:  SYCEHCGCGRGSGSADSSSPPAFSCS--SSSSLWLDSTRLSESGEVWRILVASAKGFTIGAGLKGGLSLFSILAGLKRRKALASLGKK-GVITNRDAISM
        S C  C      G+ D      F+C    +S     S  +++S ++ RI+VAS KGFTIG GLKGGL++FSI+A   RR+  +   +K G  +N +AI+M
Subjt:  SYCEHCGCGRGSGSADSSSPPAFSCS--SSSSLWLDSTRLSESGEVWRILVASAKGFTIGAGLKGGLSLFSILAGLKRRKALASLGKK-GVITNRDAISM

Query:  ALKETLRYGLFLGTFAGTFVSIDEIIGNLAGHRRTARWRALLAGGLAGPSMLLTGLNTQHKTLAIYIFMRAAVLASRCGIKSKRLGHICKPLTWSYGDIF
         +KETLRYGLFLGTFAGTFVS+DE I  LAG +RTA+WRAL AG +AGPSMLLTG NTQH +LA+YI MRAAVLASRCGIKSKR G ICKPLTW +GD+F
Subjt:  ALKETLRYGLFLGTFAGTFVSIDEIIGNLAGHRRTARWRALLAGGLAGPSMLLTGLNTQHKTLAIYIFMRAAVLASRCGIKSKRLGHICKPLTWSYGDIF

Query:  LMCLSSSQILSAYVLKQDSLPPSFRSFLNTHGGKDTVILEGLKSFLSGMPSSNKFRALEKYYRAMGADVKLDPRMKTPCTIIHGNQSCGGHFFSFLIQGY
        LMCLSSSQILSAY+LKQ+SLP S++SFLN  GGKD  IL+G+K   +  P +N  RA+EKYY+++G D+KLDP MK PCTIIHGN+SC  H  +F +Q Y
Subjt:  LMCLSSSQILSAYVLKQDSLPPSFRSFLNTHGGKDTVILEGLKSFLSGMPSSNKFRALEKYYRAMGADVKLDPRMKTPCTIIHGNQSCGGHFFSFLIQGY

Query:  KRALPVYLPVYLIPALIVHREGLMNRPYEILARGLLGTARSSLFLSVYCSSAWMWTCLIARTFKKINIPLVAVATFLTGLALAIEKKSRRIEISLYCLAR
        KRALPVY+PVYLIPALIVHR+ L+ + Y IL +GLLGTARSSLFL+ YCSSAW WTCL+ RTF+  NIPLVA+ATF TGLALAIEKKSRRIEISLYCLAR
Subjt:  KRALPVYLPVYLIPALIVHREGLMNRPYEILARGLLGTARSSLFLSVYCSSAWMWTCLIARTFKKINIPLVAVATFLTGLALAIEKKSRRIEISLYCLAR

Query:  GIESFFSCMTDLGYLPQSLNFKRADVIVFSISTAIIMHCYAQEREVFRSKYLNVLDWVFGVPPPP----CET
         IESFF+CMT+ GY+    + +RADV+VFS+STAIIMHCYAQER+VFRSKYLNVLDWVFGVPPPP    CET
Subjt:  GIESFFSCMTDLGYLPQSLNFKRADVIVFSISTAIIMHCYAQEREVFRSKYLNVLDWVFGVPPPP----CET

AT1G34630.2 FUNCTIONS IN: molecular_function unknown2.0e-12962.43Show/hide
Query:  SYCEHCGCGRGSGSADSSSPPAFSCS--SSSSLWLDSTRLSESGEVWRILVASAKGFTIGAGLKGGLSLFSILAGLKRRKALASLGKK-GVITNRDAISM
        S C  C      G+ D      F+C    +S     S  +++S ++ RI+VAS KGFTIG GLKGGL++FSI+A   RR+  +   +K G  +N +AI+M
Subjt:  SYCEHCGCGRGSGSADSSSPPAFSCS--SSSSLWLDSTRLSESGEVWRILVASAKGFTIGAGLKGGLSLFSILAGLKRRKALASLGKK-GVITNRDAISM

Query:  ALKETLRYGLFLGTFAGTFVSIDEIIGNLAGHRRTARWRALLAGGLAGPSMLLTGLNTQHKTLAIYIFMRAAVLASRCGIKSKRLGHICKPLTWSYGDIF
         +KETLRYGLFLGTFAGTFVS+DE I  LAG +RTA+WRAL AG +AGPSMLLTG NTQH +LA+YI MRAAVLASRCGIKSKR G ICKPLTW +GD+F
Subjt:  ALKETLRYGLFLGTFAGTFVSIDEIIGNLAGHRRTARWRALLAGGLAGPSMLLTGLNTQHKTLAIYIFMRAAVLASRCGIKSKRLGHICKPLTWSYGDIF

Query:  LMCLSSSQILSAYVLKQDSLPPSFRSFLNTHGGKDTVILEGLKSFLSGMPSSNKFRALEKYYRAMGADVKLDPRMKTPCTIIHGNQSCGGHFFSFLIQGY
        LMCLSSSQILSAY+LKQ+SLP S++SFLN  GGKD  IL+G+K   +  P +N  RA+EKYY+++G D+KLDP MK PCTIIHGN+SC  H  +F +Q Y
Subjt:  LMCLSSSQILSAYVLKQDSLPPSFRSFLNTHGGKDTVILEGLKSFLSGMPSSNKFRALEKYYRAMGADVKLDPRMKTPCTIIHGNQSCGGHFFSFLIQGY

Query:  KRALPVYLPVYLIPALIVHREGLMNRPYEILARGLLGTARSSLFLSVYCSSAWMWTCLIARTFKKINIPLVAVATFLT
        KRALPVY+PVYLIPALIVHR+ L+ + Y IL +GLLGTARSSLFL+ YCSSAW WTCL+ RTF+  NIPLVA+AT  T
Subjt:  KRALPVYLPVYLIPALIVHREGLMNRPYEILARGLLGTARSSLFLSVYCSSAWMWTCLIARTFKKINIPLVAVATFLT

AT5G51150.1 Mitochondrial import inner membrane translocase subunit Tim17/Tim22/Tim23 family protein2.9e-2724.67Show/hide
Query:  KGFTIGAGLKGGLSLFSILAGLKRRKALAS-LGKKGVITNRDAISMALKETLRYGLFLGTFAGTFVSIDEIIGNLAGHRRTARWRALLAGGLAGPSMLLT
        + F +  G++ G+ +      L R ++ +S L  K +++ +D I    +E  R GL  G F G++ ++   +      ++     ++LAG +AG S+L  
Subjt:  KGFTIGAGLKGGLSLFSILAGLKRRKALAS-LGKKGVITNRDAISMALKETLRYGLFLGTFAGTFVSIDEIIGNLAGHRRTARWRALLAGGLAGPSMLLT

Query:  GLNTQHKTLAIYIFMRAAVLASRCGIKSKRLGHICKPLTWSYGDIFLMCLSSSQILSAYVLKQDSLPPSFRSFLNTHGGKDTVILEGLKSFLSGMPSSNK
          + Q +TLA+Y+  R    A     KSK   H+     W +GD  L  L+ +Q++ +++++ ++LP S+R F+   G     + + ++    G P    
Subjt:  GLNTQHKTLAIYIFMRAAVLASRCGIKSKRLGHICKPLTWSYGDIFLMCLSSSQILSAYVLKQDSLPPSFRSFLNTHGGKDTVILEGLKSFLSGMPSSNK

Query:  FRALEKYYRAMGADVKLDPRMK-TPCTIIHGN-QSCGGHFFSFLIQGYKRALPVYLPVYLIPALIVHREGLMNRPYEILARGLLGTARSSLFLSVYCSSA
          +     +   +DVK++      PC  IH N  SC     + +   +K+  P+Y  +  +P +++H +  M  PY      +  + RS+ FLS +    
Subjt:  FRALEKYYRAMGADVKLDPRMK-TPCTIIHGN-QSCGGHFFSFLIQGYKRALPVYLPVYLIPALIVHREGLMNRPYEILARGLLGTARSSLFLSVYCSSA

Query:  WMWTCLIARTFKKINIPLVAVATFLTGLALAIEKKSRRIEISLYCLARGIESFFSCMTDLGYLPQSLNFKRADVIVF
          + C   +   K +  +   A     L++ +EKK RR E++LY L R  +S +  + +   LP   + K A+V +F
Subjt:  WMWTCLIARTFKKINIPLVAVATFLTGLALAIEKKSRRIEISLYCLARGIESFFSCMTDLGYLPQSLNFKRADVIVF


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCGCCCTCTGCCGTCGGAGGCTTCGACGCCGACGCCGGCGGCGGATGTGCATGCCTTGCTAAACAGAATGGGGATGCCGAACCTGATGATGCCAACTGTAAATCCGG
TGATTCCTATTGCGAGCATTGTGGCTGTGGCCGTGGCAGTGGTTCGGCGGATTCCTCTTCTCCGCCGGCGTTTTCGTGCTCTTCCTCCTCTTCTCTATGGCTTGATTCGA
CGCGGTTGAGCGAGTCGGGGGAGGTTTGGCGGATCCTTGTTGCTTCAGCTAAAGGCTTCACAATTGGAGCTGGTCTCAAAGGTGGACTCTCCCTCTTTTCGATCCTTGCT
GGATTGAAGCGGAGAAAGGCTTTGGCCTCCCTCGGGAAGAAAGGAGTGATTACGAATCGAGATGCAATTTCCATGGCTTTGAAGGAAACTTTGAGATATGGTCTCTTTCT
TGGAACCTTTGCTGGTACATTCGTTTCAATCGATGAGATAATAGGCAATCTGGCAGGCCACCGTAGGACTGCAAGATGGAGGGCTCTATTAGCGGGAGGATTAGCTGGGC
CGTCAATGCTTCTGACTGGGTTAAATACGCAACATAAGACCTTGGCTATCTATATTTTTATGCGTGCTGCGGTCTTGGCATCGCGCTGTGGGATTAAAAGCAAGCGGCTC
GGGCATATTTGTAAGCCTCTCACGTGGTCATATGGTGACATCTTCCTTATGTGTCTCTCCTCTTCGCAAATCTTGTCTGCTTATGTGTTAAAGCAAGACAGCTTACCGCC
GTCGTTTAGGTCCTTTCTCAATACACATGGTGGAAAGGATACTGTAATCTTGGAAGGTTTAAAGAGTTTTTTATCAGGCATGCCTTCTTCTAATAAATTTAGAGCACTAG
AAAAGTACTACAGGGCAATGGGTGCAGATGTCAAATTAGATCCGCGAATGAAGACTCCATGCACGATTATACATGGAAACCAATCATGTGGTGGCCATTTTTTTTCCTTT
CTCATTCAAGGATATAAAAGAGCATTGCCAGTTTATCTCCCTGTTTATCTGATCCCAGCCTTGATAGTTCATCGTGAAGGTCTCATGAATAGGCCTTACGAAATTTTAGC
AAGGGGGCTTCTTGGAACTGCTAGATCAAGTCTGTTCCTTTCTGTATATTGTTCATCTGCTTGGATGTGGACATGCCTGATTGCAAGGACTTTCAAGAAAATAAACATTC
CCTTGGTTGCTGTGGCAACGTTCTTAACCGGTTTGGCTCTAGCCATTGAGAAGAAGAGCAGGAGGATTGAAATCTCACTCTATTGCCTTGCAAGAGGCATCGAGAGCTTC
TTCAGCTGCATGACGGATTTGGGATACTTGCCACAATCGTTGAATTTTAAGAGAGCAGATGTGATAGTTTTCAGCATATCAACTGCCATTATAATGCACTGCTATGCACA
GGAAAGAGAGGTATTCCGATCCAAGTACTTGAACGTCCTTGATTGGGTTTTTGGTGTGCCTCCTCCCCCCTGTGAAACACCACGCTGCAAAAATGGCAAGGATCTGTGA
mRNA sequenceShow/hide mRNA sequence
ATGTCGCCCTCTGCCGTCGGAGGCTTCGACGCCGACGCCGGCGGCGGATGTGCATGCCTTGCTAAACAGAATGGGGATGCCGAACCTGATGATGCCAACTGTAAATCCGG
TGATTCCTATTGCGAGCATTGTGGCTGTGGCCGTGGCAGTGGTTCGGCGGATTCCTCTTCTCCGCCGGCGTTTTCGTGCTCTTCCTCCTCTTCTCTATGGCTTGATTCGA
CGCGGTTGAGCGAGTCGGGGGAGGTTTGGCGGATCCTTGTTGCTTCAGCTAAAGGCTTCACAATTGGAGCTGGTCTCAAAGGTGGACTCTCCCTCTTTTCGATCCTTGCT
GGATTGAAGCGGAGAAAGGCTTTGGCCTCCCTCGGGAAGAAAGGAGTGATTACGAATCGAGATGCAATTTCCATGGCTTTGAAGGAAACTTTGAGATATGGTCTCTTTCT
TGGAACCTTTGCTGGTACATTCGTTTCAATCGATGAGATAATAGGCAATCTGGCAGGCCACCGTAGGACTGCAAGATGGAGGGCTCTATTAGCGGGAGGATTAGCTGGGC
CGTCAATGCTTCTGACTGGGTTAAATACGCAACATAAGACCTTGGCTATCTATATTTTTATGCGTGCTGCGGTCTTGGCATCGCGCTGTGGGATTAAAAGCAAGCGGCTC
GGGCATATTTGTAAGCCTCTCACGTGGTCATATGGTGACATCTTCCTTATGTGTCTCTCCTCTTCGCAAATCTTGTCTGCTTATGTGTTAAAGCAAGACAGCTTACCGCC
GTCGTTTAGGTCCTTTCTCAATACACATGGTGGAAAGGATACTGTAATCTTGGAAGGTTTAAAGAGTTTTTTATCAGGCATGCCTTCTTCTAATAAATTTAGAGCACTAG
AAAAGTACTACAGGGCAATGGGTGCAGATGTCAAATTAGATCCGCGAATGAAGACTCCATGCACGATTATACATGGAAACCAATCATGTGGTGGCCATTTTTTTTCCTTT
CTCATTCAAGGATATAAAAGAGCATTGCCAGTTTATCTCCCTGTTTATCTGATCCCAGCCTTGATAGTTCATCGTGAAGGTCTCATGAATAGGCCTTACGAAATTTTAGC
AAGGGGGCTTCTTGGAACTGCTAGATCAAGTCTGTTCCTTTCTGTATATTGTTCATCTGCTTGGATGTGGACATGCCTGATTGCAAGGACTTTCAAGAAAATAAACATTC
CCTTGGTTGCTGTGGCAACGTTCTTAACCGGTTTGGCTCTAGCCATTGAGAAGAAGAGCAGGAGGATTGAAATCTCACTCTATTGCCTTGCAAGAGGCATCGAGAGCTTC
TTCAGCTGCATGACGGATTTGGGATACTTGCCACAATCGTTGAATTTTAAGAGAGCAGATGTGATAGTTTTCAGCATATCAACTGCCATTATAATGCACTGCTATGCACA
GGAAAGAGAGGTATTCCGATCCAAGTACTTGAACGTCCTTGATTGGGTTTTTGGTGTGCCTCCTCCCCCCTGTGAAACACCACGCTGCAAAAATGGCAAGGATCTGTGA
Protein sequenceShow/hide protein sequence
MSPSAVGGFDADAGGGCACLAKQNGDAEPDDANCKSGDSYCEHCGCGRGSGSADSSSPPAFSCSSSSSLWLDSTRLSESGEVWRILVASAKGFTIGAGLKGGLSLFSILA
GLKRRKALASLGKKGVITNRDAISMALKETLRYGLFLGTFAGTFVSIDEIIGNLAGHRRTARWRALLAGGLAGPSMLLTGLNTQHKTLAIYIFMRAAVLASRCGIKSKRL
GHICKPLTWSYGDIFLMCLSSSQILSAYVLKQDSLPPSFRSFLNTHGGKDTVILEGLKSFLSGMPSSNKFRALEKYYRAMGADVKLDPRMKTPCTIIHGNQSCGGHFFSF
LIQGYKRALPVYLPVYLIPALIVHREGLMNRPYEILARGLLGTARSSLFLSVYCSSAWMWTCLIARTFKKINIPLVAVATFLTGLALAIEKKSRRIEISLYCLARGIESF
FSCMTDLGYLPQSLNFKRADVIVFSISTAIIMHCYAQEREVFRSKYLNVLDWVFGVPPPPCETPRCKNGKDL