; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MS021909 (gene) of Bitter gourd (TR) v1 genome

Gene IDMS021909
OrganismMomordica charantia cv. TR (Bitter gourd (TR) v1)
DescriptionTransmembrane protein
Genome locationscaffold110:36734..45369
RNA-Seq ExpressionMS021909
SyntenyMS021909
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022144497.1 uncharacterized protein LOC111014173 isoform X1 [Momordica charantia]0.0e+0098.55Show/hide
Query:  MSCPSNSFRYNGTLCACPPGHLFDLTTNSCGLFSSPSAIVMGRVESSAVSFPGTMFSFDSLRKLTQSQVVFLQATLVMLLSWLLFCLFLRFMKLGDGRSI
        MSCPSNSFRYNGTLCACPPGHLFDLTTNSCGLFSSPSAIVMGRVESSAVS+PGTMFSFDSLRKLTQSQVVFLQATLVMLLSWLLFCLFLRFMKLGDGRSI
Subjt:  MSCPSNSFRYNGTLCACPPGHLFDLTTNSCGLFSSPSAIVMGRVESSAVSFPGTMFSFDSLRKLTQSQVVFLQATLVMLLSWLLFCLFLRFMKLGDGRSI

Query:  WFRMRWWVTRLDLCFATTHWLDDQKVVIKRKTELGGTFSIASWILFIGLFAVLLYQIISKRSIEVHNIKAANAQDMVSFVCDMEFNITTVSTMSCENIRD
        WFRMRWWVTRLDLCFATTHWLDDQKVVIKRKTELGGTFSIASWILFIGLFA LLYQIISKRSIEVHNIKAANAQDMVSFVCDMEFNITTVSTMSCENIRD
Subjt:  WFRMRWWVTRLDLCFATTHWLDDQKVVIKRKTELGGTFSIASWILFIGLFAVLLYQIISKRSIEVHNIKAANAQDMVSFVCDMEFNITTVSTMSCENIRD

Query:  LGSIVFGNPGFLKQKVMPLSNFANYSCHNKSQGPTISFRCARCSFSQDNIYISWQFVDLPNSPASAVGFQFNLSSMNHAKNNHASFISGTLKNGSNFNDT
        LGSIVFGNPGFLKQKVMPLSNFANYSCHNKSQGPTISFRCARC FSQDNIYISWQFVDLPNSPASAVGFQFNLSSMNHAKNNHASFISGTLKNGSNFNDT
Subjt:  LGSIVFGNPGFLKQKVMPLSNFANYSCHNKSQGPTISFRCARCSFSQDNIYISWQFVDLPNSPASAVGFQFNLSSMNHAKNNHASFISGTLKNGSNFNDT

Query:  PVTFRGKTANIVQFNLFPRIYGNQQDSELIQPLFHEFLPGSSFQEISKLQSSLENSEDGLLNITMYINLLSSYIVEIEKQNILGPVSFLANIGGLYCISV
        PVTFRGK ANIVQFNLFPRIYGNQQDSELIQPLFHEFLPGSSFQEIS+LQSSLENSEDGLLNITMYINLLSSYIVEIEKQNILGPVSFLAN+GGLYCISV
Subjt:  PVTFRGKTANIVQFNLFPRIYGNQQDSELIQPLFHEFLPGSSFQEISKLQSSLENSEDGLLNITMYINLLSSYIVEIEKQNILGPVSFLANIGGLYCISV

Query:  AIFSYLLVQFEYRIKKLRNEDRVMRNIRNRRKAQEHWNKLRKYVMYTWGCITLDGYHNDLSTTPSCADCMVQSSRKRASSGKQRPKRGYTTFSFNRETNH
        AIFSYLLVQFEYRIKKLRNEDRVMRNIRNRRKAQEHWNKLRKYVMYTWGCITLDGY+NDLSTTPSCADCMVQSSRKRASSGKQRPKRGYTTFSFNRETNH
Subjt:  AIFSYLLVQFEYRIKKLRNEDRVMRNIRNRRKAQEHWNKLRKYVMYTWGCITLDGYHNDLSTTPSCADCMVQSSRKRASSGKQRPKRGYTTFSFNRETNH

Query:  LQTANQDTKLSKARASDQEMRAIATKQELSPKHHVLGFIDGGKQSLTTGPCEGDSSRLGDFFHSEDVIPPPPTIEFKDGSDIDMSDILKNIKSLYKYNLI
        LQTANQDTKLSKARASDQEMRAIATKQELSPKHHVLGFIDGGKQSL TGPC GDSSRLGDFFHSEDVIPPPPTIEFKDGSDIDMSDILKNIKSLYKYNLI
Subjt:  LQTANQDTKLSKARASDQEMRAIATKQELSPKHHVLGFIDGGKQSLTTGPCEGDSSRLGDFFHSEDVIPPPPTIEFKDGSDIDMSDILKNIKSLYKYNLI

Query:  LREKLLTTESEVRALATKSSR
        LREKLLTTESEVRALATKSSR
Subjt:  LREKLLTTESEVRALATKSSR

XP_022144498.1 uncharacterized protein LOC111014173 isoform X2 [Momordica charantia]0.0e+0097.75Show/hide
Query:  MSCPSNSFRYNGTLCACPPGHLFDLTTNSCGLFSSPSAIVMGRVESSAVSFPGTMFSFDSLRKLTQSQVVFLQATLVMLLSWLLFCLFLRFMKLGDGRSI
        MSCPSNSFRYNGTLCACPPGHLFDLTTNSCGLFSSPSAIVMGRVESSAVS+PGTMFSFDSLRKLTQSQVVFLQATLVMLLSWLLFCLFLRFMKLGDGRSI
Subjt:  MSCPSNSFRYNGTLCACPPGHLFDLTTNSCGLFSSPSAIVMGRVESSAVSFPGTMFSFDSLRKLTQSQVVFLQATLVMLLSWLLFCLFLRFMKLGDGRSI

Query:  WFRMRWWVTRLDLCFATTHWLDDQKVVIKRKTELGGTFSIASWILFIGLFAVLLYQIISKRSIEVHNIKAANAQDMVSFVCDMEFNITTVSTMSCENIRD
        WFRMRWWVTRLDLCFATTHWLDDQKVVIKRKTELGGTFSIASWILFIGLFA LLYQIISKRSIEVHNIKAANAQDMVSFVCDMEFNITTVSTMSCENIRD
Subjt:  WFRMRWWVTRLDLCFATTHWLDDQKVVIKRKTELGGTFSIASWILFIGLFAVLLYQIISKRSIEVHNIKAANAQDMVSFVCDMEFNITTVSTMSCENIRD

Query:  LGSIVFGNPGFLKQKVMPLSNFANYSCHNKSQGPTISFRCARCSFSQDNIYISWQFVDLPNSPASAVGFQFNLSSMNHAKNNHASFISGTLKNGSNFNDT
        LGSIVFGNPGFLKQKVMPLSNFANYSCHNKSQGPTISFRCARC FSQDNIYISWQFVDLPNSPASAVGFQFNLSSMNHAKNNHASFISGTLKNGSNFNDT
Subjt:  LGSIVFGNPGFLKQKVMPLSNFANYSCHNKSQGPTISFRCARCSFSQDNIYISWQFVDLPNSPASAVGFQFNLSSMNHAKNNHASFISGTLKNGSNFNDT

Query:  PVTFRGKTANIVQFNLFPRIYGNQQDSELIQPLFHEFLPGSSFQEISKLQSSLENSEDGLLNITMYINLLSSYIVEIEKQNILGPVSFLANIGGLYCISV
        PVTFRGK ANIVQFNLFPRIYGNQQDSELIQPLFHEFLPGSSFQEIS+LQSSLENSEDGLLNITMYINLLSSYIVEIEKQNILGPVSFLAN+GGLYCISV
Subjt:  PVTFRGKTANIVQFNLFPRIYGNQQDSELIQPLFHEFLPGSSFQEISKLQSSLENSEDGLLNITMYINLLSSYIVEIEKQNILGPVSFLANIGGLYCISV

Query:  AIFSYLLVQFEYRIKKLRNEDRVMRNIRNRRKAQEHWNKLRKYVMYTWGCITLDGYHNDLSTTPSCADCMVQSSRKRASSGKQRPKRGYTTFSFNRETNH
        AIFSYLLVQFEYRIKKLRNEDRVMRNIRNRRKAQEHWNKLRKYVMYTWGCITLDGY+NDLSTTPSCADCMVQSSRKRASSGKQRPKRGYTTFSFNRE   
Subjt:  AIFSYLLVQFEYRIKKLRNEDRVMRNIRNRRKAQEHWNKLRKYVMYTWGCITLDGYHNDLSTTPSCADCMVQSSRKRASSGKQRPKRGYTTFSFNRETNH

Query:  LQTANQDTKLSKARASDQEMRAIATKQELSPKHHVLGFIDGGKQSLTTGPCEGDSSRLGDFFHSEDVIPPPPTIEFKDGSDIDMSDILKNIKSLYKYNLI
          TANQDTKLSKARASDQEMRAIATKQELSPKHHVLGFIDGGKQSL TGPC GDSSRLGDFFHSEDVIPPPPTIEFKDGSDIDMSDILKNIKSLYKYNLI
Subjt:  LQTANQDTKLSKARASDQEMRAIATKQELSPKHHVLGFIDGGKQSLTTGPCEGDSSRLGDFFHSEDVIPPPPTIEFKDGSDIDMSDILKNIKSLYKYNLI

Query:  LREKLLTTESEVRALATKSSR
        LREKLLTTESEVRALATKSSR
Subjt:  LREKLLTTESEVRALATKSSR

XP_022144499.1 uncharacterized protein LOC111014173 isoform X3 [Momordica charantia]0.0e+0097.1Show/hide
Query:  MSCPSNSFRYNGTLCACPPGHLFDLTTNSCGLFSSPSAIVMGRVESSAVSFPGTMFSFDSLRKLTQSQVVFLQATLVMLLSWLLFCLFLRFMKLGDGRSI
        MSCPSNSFRYNGTLCACPPGHLFDLTTNSCGLFSSPSAIVMGRVESSAVS+PGTMFSFDSLRKLTQSQVVFLQATLVMLLSWLLFCLFLRFMKLGDGRSI
Subjt:  MSCPSNSFRYNGTLCACPPGHLFDLTTNSCGLFSSPSAIVMGRVESSAVSFPGTMFSFDSLRKLTQSQVVFLQATLVMLLSWLLFCLFLRFMKLGDGRSI

Query:  WFRMRWWVTRLDLCFATTHWLDDQKVVIKRKTELGGTFSIASWILFIGLFAVLLYQIISKRSIEVHNIKAANAQDMVSFVCDMEFNITTVSTMSCENIRD
        WFRMRWWVTRLDLCFATTHWLDDQKVVIKRKTELGGTFSIASWILFIGLFA LLYQIISKRSIEVHNIKAANAQDMVSFVCDMEFNITTVSTMSCENIRD
Subjt:  WFRMRWWVTRLDLCFATTHWLDDQKVVIKRKTELGGTFSIASWILFIGLFAVLLYQIISKRSIEVHNIKAANAQDMVSFVCDMEFNITTVSTMSCENIRD

Query:  LGSIVFGNPGFLKQKVMPLSNFANYSCHNKSQGPTISFRCARCSFSQDNIYISWQFVDLPNSPASAVGFQFNLSSMNHAKNNHASFISGTLKNGSNFNDT
        LGSIVFGNPGFLKQKVMPLSNFANYSCHNKSQGPTISFRCARC FSQDNIYISWQFVDLPNSPASAVGFQFNLSSMNHAKNNHASFISGTLKNGSNFNDT
Subjt:  LGSIVFGNPGFLKQKVMPLSNFANYSCHNKSQGPTISFRCARCSFSQDNIYISWQFVDLPNSPASAVGFQFNLSSMNHAKNNHASFISGTLKNGSNFNDT

Query:  PVTFRGKTANIVQFNLFPRIYGNQQDSELIQPLFHEFLPGSSFQEISKLQSSLENSEDGLLNITMYINLLSSYIVEIEKQNILGPVSFLANIGGLYCISV
        PVTFRGK ANIVQFNLFPRIYGNQQDSELIQPLFHEFLPGSSFQEIS+LQSSLENSEDGLLNITMYINLLSSYIVEIEKQNILGPVSFLAN+GGLYCISV
Subjt:  PVTFRGKTANIVQFNLFPRIYGNQQDSELIQPLFHEFLPGSSFQEISKLQSSLENSEDGLLNITMYINLLSSYIVEIEKQNILGPVSFLANIGGLYCISV

Query:  AIFSYLLVQFEYRIKKLRNEDRVMRNIRNRRKAQEHWNKLRKYVMYTWGCITLDGYHNDLSTTPSCADCMVQSSRKRASSGKQRPKRGYTTFSFNRETNH
        AIFSYLLVQFEYRIKKLRNEDRVMRNIRNRRKAQEHWNKLRKYVMYTWGCITLDGY+NDLSTTPSCADCMVQSSRKRASSGKQRPKRGYTTFSFNRE   
Subjt:  AIFSYLLVQFEYRIKKLRNEDRVMRNIRNRRKAQEHWNKLRKYVMYTWGCITLDGYHNDLSTTPSCADCMVQSSRKRASSGKQRPKRGYTTFSFNRETNH

Query:  LQTANQDTKLSKARASDQEMRAIATKQELSPKHHVLGFIDGGKQSLTTGPCEGDSSRLGDFFHSEDVIPPPPTIEFKDGSDIDMSDILKNIKSLYKYNLI
              DTKLSKARASDQEMRAIATKQELSPKHHVLGFIDGGKQSL TGPC GDSSRLGDFFHSEDVIPPPPTIEFKDGSDIDMSDILKNIKSLYKYNLI
Subjt:  LQTANQDTKLSKARASDQEMRAIATKQELSPKHHVLGFIDGGKQSLTTGPCEGDSSRLGDFFHSEDVIPPPPTIEFKDGSDIDMSDILKNIKSLYKYNLI

Query:  LREKLLTTESEVRALATKSSR
        LREKLLTTESEVRALATKSSR
Subjt:  LREKLLTTESEVRALATKSSR

XP_022144500.1 uncharacterized protein LOC111014173 isoform X4 [Momordica charantia]0.0e+0094.69Show/hide
Query:  MSCPSNSFRYNGTLCACPPGHLFDLTTNSCGLFSSPSAIVMGRVESSAVSFPGTMFSFDSLRKLTQSQVVFLQATLVMLLSWLLFCLFLRFMKLGDGRSI
        MSCPSNSFRYNGTLCACPPGHLFDLTTNSCGLFSSPSAIVMGRVESSA                         ATLVMLLSWLLFCLFLRFMKLGDGRSI
Subjt:  MSCPSNSFRYNGTLCACPPGHLFDLTTNSCGLFSSPSAIVMGRVESSAVSFPGTMFSFDSLRKLTQSQVVFLQATLVMLLSWLLFCLFLRFMKLGDGRSI

Query:  WFRMRWWVTRLDLCFATTHWLDDQKVVIKRKTELGGTFSIASWILFIGLFAVLLYQIISKRSIEVHNIKAANAQDMVSFVCDMEFNITTVSTMSCENIRD
        WFRMRWWVTRLDLCFATTHWLDDQKVVIKRKTELGGTFSIASWILFIGLFA LLYQIISKRSIEVHNIKAANAQDMVSFVCDMEFNITTVSTMSCENIRD
Subjt:  WFRMRWWVTRLDLCFATTHWLDDQKVVIKRKTELGGTFSIASWILFIGLFAVLLYQIISKRSIEVHNIKAANAQDMVSFVCDMEFNITTVSTMSCENIRD

Query:  LGSIVFGNPGFLKQKVMPLSNFANYSCHNKSQGPTISFRCARCSFSQDNIYISWQFVDLPNSPASAVGFQFNLSSMNHAKNNHASFISGTLKNGSNFNDT
        LGSIVFGNPGFLKQKVMPLSNFANYSCHNKSQGPTISFRCARC FSQDNIYISWQFVDLPNSPASAVGFQFNLSSMNHAKNNHASFISGTLKNGSNFNDT
Subjt:  LGSIVFGNPGFLKQKVMPLSNFANYSCHNKSQGPTISFRCARCSFSQDNIYISWQFVDLPNSPASAVGFQFNLSSMNHAKNNHASFISGTLKNGSNFNDT

Query:  PVTFRGKTANIVQFNLFPRIYGNQQDSELIQPLFHEFLPGSSFQEISKLQSSLENSEDGLLNITMYINLLSSYIVEIEKQNILGPVSFLANIGGLYCISV
        PVTFRGK ANIVQFNLFPRIYGNQQDSELIQPLFHEFLPGSSFQEIS+LQSSLENSEDGLLNITMYINLLSSYIVEIEKQNILGPVSFLAN+GGLYCISV
Subjt:  PVTFRGKTANIVQFNLFPRIYGNQQDSELIQPLFHEFLPGSSFQEISKLQSSLENSEDGLLNITMYINLLSSYIVEIEKQNILGPVSFLANIGGLYCISV

Query:  AIFSYLLVQFEYRIKKLRNEDRVMRNIRNRRKAQEHWNKLRKYVMYTWGCITLDGYHNDLSTTPSCADCMVQSSRKRASSGKQRPKRGYTTFSFNRETNH
        AIFSYLLVQFEYRIKKLRNEDRVMRNIRNRRKAQEHWNKLRKYVMYTWGCITLDGY+NDLSTTPSCADCMVQSSRKRASSGKQRPKRGYTTFSFNRETNH
Subjt:  AIFSYLLVQFEYRIKKLRNEDRVMRNIRNRRKAQEHWNKLRKYVMYTWGCITLDGYHNDLSTTPSCADCMVQSSRKRASSGKQRPKRGYTTFSFNRETNH

Query:  LQTANQDTKLSKARASDQEMRAIATKQELSPKHHVLGFIDGGKQSLTTGPCEGDSSRLGDFFHSEDVIPPPPTIEFKDGSDIDMSDILKNIKSLYKYNLI
        LQTANQDTKLSKARASDQEMRAIATKQELSPKHHVLGFIDGGKQSL TGPC GDSSRLGDFFHSEDVIPPPPTIEFKDGSDIDMSDILKNIKSLYKYNLI
Subjt:  LQTANQDTKLSKARASDQEMRAIATKQELSPKHHVLGFIDGGKQSLTTGPCEGDSSRLGDFFHSEDVIPPPPTIEFKDGSDIDMSDILKNIKSLYKYNLI

Query:  LREKLLTTESEVRALATKSSR
        LREKLLTTESEVRALATKSSR
Subjt:  LREKLLTTESEVRALATKSSR

XP_022144501.1 uncharacterized protein LOC111014173 isoform X5 [Momordica charantia]8.4e-28198.2Show/hide
Query:  LDDQKVVIKRKTELGGTFSIASWILFIGLFAVLLYQIISKRSIEVHNIKAANAQDMVSFVCDMEFNITTVSTMSCENIRDLGSIVFGNPGFLKQKVMPLS
        +DDQKVVIKRKTELGGTFSIASWILFIGLFA LLYQIISKRSIEVHNIKAANAQDMVSFVCDMEFNITTVSTMSCENIRDLGSIVFGNPGFLKQKVMPLS
Subjt:  LDDQKVVIKRKTELGGTFSIASWILFIGLFAVLLYQIISKRSIEVHNIKAANAQDMVSFVCDMEFNITTVSTMSCENIRDLGSIVFGNPGFLKQKVMPLS

Query:  NFANYSCHNKSQGPTISFRCARCSFSQDNIYISWQFVDLPNSPASAVGFQFNLSSMNHAKNNHASFISGTLKNGSNFNDTPVTFRGKTANIVQFNLFPRI
        NFANYSCHNKSQGPTISFRCARC FSQDNIYISWQFVDLPNSPASAVGFQFNLSSMNHAKNNHASFISGTLKNGSNFNDTPVTFRGK ANIVQFNLFPRI
Subjt:  NFANYSCHNKSQGPTISFRCARCSFSQDNIYISWQFVDLPNSPASAVGFQFNLSSMNHAKNNHASFISGTLKNGSNFNDTPVTFRGKTANIVQFNLFPRI

Query:  YGNQQDSELIQPLFHEFLPGSSFQEISKLQSSLENSEDGLLNITMYINLLSSYIVEIEKQNILGPVSFLANIGGLYCISVAIFSYLLVQFEYRIKKLRNE
        YGNQQDSELIQPLFHEFLPGSSFQEIS+LQSSLENSEDGLLNITMYINLLSSYIVEIEKQNILGPVSFLAN+GGLYCISVAIFSYLLVQFEYRIKKLRNE
Subjt:  YGNQQDSELIQPLFHEFLPGSSFQEISKLQSSLENSEDGLLNITMYINLLSSYIVEIEKQNILGPVSFLANIGGLYCISVAIFSYLLVQFEYRIKKLRNE

Query:  DRVMRNIRNRRKAQEHWNKLRKYVMYTWGCITLDGYHNDLSTTPSCADCMVQSSRKRASSGKQRPKRGYTTFSFNRETNHLQTANQDTKLSKARASDQEM
        DRVMRNIRNRRKAQEHWNKLRKYVMYTWGCITLDGY+NDLSTTPSCADCMVQSSRKRASSGKQRPKRGYTTFSFNRETNHLQTANQDTKLSKARASDQEM
Subjt:  DRVMRNIRNRRKAQEHWNKLRKYVMYTWGCITLDGYHNDLSTTPSCADCMVQSSRKRASSGKQRPKRGYTTFSFNRETNHLQTANQDTKLSKARASDQEM

Query:  RAIATKQELSPKHHVLGFIDGGKQSLTTGPCEGDSSRLGDFFHSEDVIPPPPTIEFKDGSDIDMSDILKNIKSLYKYNLILREKLLTTESEVRALATKSS
        RAIATKQELSPKHHVLGFIDGGKQSL TGPC GDSSRLGDFFHSEDVIPPPPTIEFKDGSDIDMSDILKNIKSLYKYNLILREKLLTTESEVRALATKSS
Subjt:  RAIATKQELSPKHHVLGFIDGGKQSLTTGPCEGDSSRLGDFFHSEDVIPPPPTIEFKDGSDIDMSDILKNIKSLYKYNLILREKLLTTESEVRALATKSS

Query:  R
        R
Subjt:  R

TrEMBL top hitse value%identityAlignment
A0A6J1CRS9 uncharacterized protein LOC111014173 isoform X40.0e+0094.69Show/hide
Query:  MSCPSNSFRYNGTLCACPPGHLFDLTTNSCGLFSSPSAIVMGRVESSAVSFPGTMFSFDSLRKLTQSQVVFLQATLVMLLSWLLFCLFLRFMKLGDGRSI
        MSCPSNSFRYNGTLCACPPGHLFDLTTNSCGLFSSPSAIVMGRVESSA                         ATLVMLLSWLLFCLFLRFMKLGDGRSI
Subjt:  MSCPSNSFRYNGTLCACPPGHLFDLTTNSCGLFSSPSAIVMGRVESSAVSFPGTMFSFDSLRKLTQSQVVFLQATLVMLLSWLLFCLFLRFMKLGDGRSI

Query:  WFRMRWWVTRLDLCFATTHWLDDQKVVIKRKTELGGTFSIASWILFIGLFAVLLYQIISKRSIEVHNIKAANAQDMVSFVCDMEFNITTVSTMSCENIRD
        WFRMRWWVTRLDLCFATTHWLDDQKVVIKRKTELGGTFSIASWILFIGLFA LLYQIISKRSIEVHNIKAANAQDMVSFVCDMEFNITTVSTMSCENIRD
Subjt:  WFRMRWWVTRLDLCFATTHWLDDQKVVIKRKTELGGTFSIASWILFIGLFAVLLYQIISKRSIEVHNIKAANAQDMVSFVCDMEFNITTVSTMSCENIRD

Query:  LGSIVFGNPGFLKQKVMPLSNFANYSCHNKSQGPTISFRCARCSFSQDNIYISWQFVDLPNSPASAVGFQFNLSSMNHAKNNHASFISGTLKNGSNFNDT
        LGSIVFGNPGFLKQKVMPLSNFANYSCHNKSQGPTISFRCARC FSQDNIYISWQFVDLPNSPASAVGFQFNLSSMNHAKNNHASFISGTLKNGSNFNDT
Subjt:  LGSIVFGNPGFLKQKVMPLSNFANYSCHNKSQGPTISFRCARCSFSQDNIYISWQFVDLPNSPASAVGFQFNLSSMNHAKNNHASFISGTLKNGSNFNDT

Query:  PVTFRGKTANIVQFNLFPRIYGNQQDSELIQPLFHEFLPGSSFQEISKLQSSLENSEDGLLNITMYINLLSSYIVEIEKQNILGPVSFLANIGGLYCISV
        PVTFRGK ANIVQFNLFPRIYGNQQDSELIQPLFHEFLPGSSFQEIS+LQSSLENSEDGLLNITMYINLLSSYIVEIEKQNILGPVSFLAN+GGLYCISV
Subjt:  PVTFRGKTANIVQFNLFPRIYGNQQDSELIQPLFHEFLPGSSFQEISKLQSSLENSEDGLLNITMYINLLSSYIVEIEKQNILGPVSFLANIGGLYCISV

Query:  AIFSYLLVQFEYRIKKLRNEDRVMRNIRNRRKAQEHWNKLRKYVMYTWGCITLDGYHNDLSTTPSCADCMVQSSRKRASSGKQRPKRGYTTFSFNRETNH
        AIFSYLLVQFEYRIKKLRNEDRVMRNIRNRRKAQEHWNKLRKYVMYTWGCITLDGY+NDLSTTPSCADCMVQSSRKRASSGKQRPKRGYTTFSFNRETNH
Subjt:  AIFSYLLVQFEYRIKKLRNEDRVMRNIRNRRKAQEHWNKLRKYVMYTWGCITLDGYHNDLSTTPSCADCMVQSSRKRASSGKQRPKRGYTTFSFNRETNH

Query:  LQTANQDTKLSKARASDQEMRAIATKQELSPKHHVLGFIDGGKQSLTTGPCEGDSSRLGDFFHSEDVIPPPPTIEFKDGSDIDMSDILKNIKSLYKYNLI
        LQTANQDTKLSKARASDQEMRAIATKQELSPKHHVLGFIDGGKQSL TGPC GDSSRLGDFFHSEDVIPPPPTIEFKDGSDIDMSDILKNIKSLYKYNLI
Subjt:  LQTANQDTKLSKARASDQEMRAIATKQELSPKHHVLGFIDGGKQSLTTGPCEGDSSRLGDFFHSEDVIPPPPTIEFKDGSDIDMSDILKNIKSLYKYNLI

Query:  LREKLLTTESEVRALATKSSR
        LREKLLTTESEVRALATKSSR
Subjt:  LREKLLTTESEVRALATKSSR

A0A6J1CSG8 uncharacterized protein LOC111014173 isoform X30.0e+0097.1Show/hide
Query:  MSCPSNSFRYNGTLCACPPGHLFDLTTNSCGLFSSPSAIVMGRVESSAVSFPGTMFSFDSLRKLTQSQVVFLQATLVMLLSWLLFCLFLRFMKLGDGRSI
        MSCPSNSFRYNGTLCACPPGHLFDLTTNSCGLFSSPSAIVMGRVESSAVS+PGTMFSFDSLRKLTQSQVVFLQATLVMLLSWLLFCLFLRFMKLGDGRSI
Subjt:  MSCPSNSFRYNGTLCACPPGHLFDLTTNSCGLFSSPSAIVMGRVESSAVSFPGTMFSFDSLRKLTQSQVVFLQATLVMLLSWLLFCLFLRFMKLGDGRSI

Query:  WFRMRWWVTRLDLCFATTHWLDDQKVVIKRKTELGGTFSIASWILFIGLFAVLLYQIISKRSIEVHNIKAANAQDMVSFVCDMEFNITTVSTMSCENIRD
        WFRMRWWVTRLDLCFATTHWLDDQKVVIKRKTELGGTFSIASWILFIGLFA LLYQIISKRSIEVHNIKAANAQDMVSFVCDMEFNITTVSTMSCENIRD
Subjt:  WFRMRWWVTRLDLCFATTHWLDDQKVVIKRKTELGGTFSIASWILFIGLFAVLLYQIISKRSIEVHNIKAANAQDMVSFVCDMEFNITTVSTMSCENIRD

Query:  LGSIVFGNPGFLKQKVMPLSNFANYSCHNKSQGPTISFRCARCSFSQDNIYISWQFVDLPNSPASAVGFQFNLSSMNHAKNNHASFISGTLKNGSNFNDT
        LGSIVFGNPGFLKQKVMPLSNFANYSCHNKSQGPTISFRCARC FSQDNIYISWQFVDLPNSPASAVGFQFNLSSMNHAKNNHASFISGTLKNGSNFNDT
Subjt:  LGSIVFGNPGFLKQKVMPLSNFANYSCHNKSQGPTISFRCARCSFSQDNIYISWQFVDLPNSPASAVGFQFNLSSMNHAKNNHASFISGTLKNGSNFNDT

Query:  PVTFRGKTANIVQFNLFPRIYGNQQDSELIQPLFHEFLPGSSFQEISKLQSSLENSEDGLLNITMYINLLSSYIVEIEKQNILGPVSFLANIGGLYCISV
        PVTFRGK ANIVQFNLFPRIYGNQQDSELIQPLFHEFLPGSSFQEIS+LQSSLENSEDGLLNITMYINLLSSYIVEIEKQNILGPVSFLAN+GGLYCISV
Subjt:  PVTFRGKTANIVQFNLFPRIYGNQQDSELIQPLFHEFLPGSSFQEISKLQSSLENSEDGLLNITMYINLLSSYIVEIEKQNILGPVSFLANIGGLYCISV

Query:  AIFSYLLVQFEYRIKKLRNEDRVMRNIRNRRKAQEHWNKLRKYVMYTWGCITLDGYHNDLSTTPSCADCMVQSSRKRASSGKQRPKRGYTTFSFNRETNH
        AIFSYLLVQFEYRIKKLRNEDRVMRNIRNRRKAQEHWNKLRKYVMYTWGCITLDGY+NDLSTTPSCADCMVQSSRKRASSGKQRPKRGYTTFSFNRE   
Subjt:  AIFSYLLVQFEYRIKKLRNEDRVMRNIRNRRKAQEHWNKLRKYVMYTWGCITLDGYHNDLSTTPSCADCMVQSSRKRASSGKQRPKRGYTTFSFNRETNH

Query:  LQTANQDTKLSKARASDQEMRAIATKQELSPKHHVLGFIDGGKQSLTTGPCEGDSSRLGDFFHSEDVIPPPPTIEFKDGSDIDMSDILKNIKSLYKYNLI
              DTKLSKARASDQEMRAIATKQELSPKHHVLGFIDGGKQSL TGPC GDSSRLGDFFHSEDVIPPPPTIEFKDGSDIDMSDILKNIKSLYKYNLI
Subjt:  LQTANQDTKLSKARASDQEMRAIATKQELSPKHHVLGFIDGGKQSLTTGPCEGDSSRLGDFFHSEDVIPPPPTIEFKDGSDIDMSDILKNIKSLYKYNLI

Query:  LREKLLTTESEVRALATKSSR
        LREKLLTTESEVRALATKSSR
Subjt:  LREKLLTTESEVRALATKSSR

A0A6J1CTF3 uncharacterized protein LOC111014173 isoform X20.0e+0097.75Show/hide
Query:  MSCPSNSFRYNGTLCACPPGHLFDLTTNSCGLFSSPSAIVMGRVESSAVSFPGTMFSFDSLRKLTQSQVVFLQATLVMLLSWLLFCLFLRFMKLGDGRSI
        MSCPSNSFRYNGTLCACPPGHLFDLTTNSCGLFSSPSAIVMGRVESSAVS+PGTMFSFDSLRKLTQSQVVFLQATLVMLLSWLLFCLFLRFMKLGDGRSI
Subjt:  MSCPSNSFRYNGTLCACPPGHLFDLTTNSCGLFSSPSAIVMGRVESSAVSFPGTMFSFDSLRKLTQSQVVFLQATLVMLLSWLLFCLFLRFMKLGDGRSI

Query:  WFRMRWWVTRLDLCFATTHWLDDQKVVIKRKTELGGTFSIASWILFIGLFAVLLYQIISKRSIEVHNIKAANAQDMVSFVCDMEFNITTVSTMSCENIRD
        WFRMRWWVTRLDLCFATTHWLDDQKVVIKRKTELGGTFSIASWILFIGLFA LLYQIISKRSIEVHNIKAANAQDMVSFVCDMEFNITTVSTMSCENIRD
Subjt:  WFRMRWWVTRLDLCFATTHWLDDQKVVIKRKTELGGTFSIASWILFIGLFAVLLYQIISKRSIEVHNIKAANAQDMVSFVCDMEFNITTVSTMSCENIRD

Query:  LGSIVFGNPGFLKQKVMPLSNFANYSCHNKSQGPTISFRCARCSFSQDNIYISWQFVDLPNSPASAVGFQFNLSSMNHAKNNHASFISGTLKNGSNFNDT
        LGSIVFGNPGFLKQKVMPLSNFANYSCHNKSQGPTISFRCARC FSQDNIYISWQFVDLPNSPASAVGFQFNLSSMNHAKNNHASFISGTLKNGSNFNDT
Subjt:  LGSIVFGNPGFLKQKVMPLSNFANYSCHNKSQGPTISFRCARCSFSQDNIYISWQFVDLPNSPASAVGFQFNLSSMNHAKNNHASFISGTLKNGSNFNDT

Query:  PVTFRGKTANIVQFNLFPRIYGNQQDSELIQPLFHEFLPGSSFQEISKLQSSLENSEDGLLNITMYINLLSSYIVEIEKQNILGPVSFLANIGGLYCISV
        PVTFRGK ANIVQFNLFPRIYGNQQDSELIQPLFHEFLPGSSFQEIS+LQSSLENSEDGLLNITMYINLLSSYIVEIEKQNILGPVSFLAN+GGLYCISV
Subjt:  PVTFRGKTANIVQFNLFPRIYGNQQDSELIQPLFHEFLPGSSFQEISKLQSSLENSEDGLLNITMYINLLSSYIVEIEKQNILGPVSFLANIGGLYCISV

Query:  AIFSYLLVQFEYRIKKLRNEDRVMRNIRNRRKAQEHWNKLRKYVMYTWGCITLDGYHNDLSTTPSCADCMVQSSRKRASSGKQRPKRGYTTFSFNRETNH
        AIFSYLLVQFEYRIKKLRNEDRVMRNIRNRRKAQEHWNKLRKYVMYTWGCITLDGY+NDLSTTPSCADCMVQSSRKRASSGKQRPKRGYTTFSFNRE   
Subjt:  AIFSYLLVQFEYRIKKLRNEDRVMRNIRNRRKAQEHWNKLRKYVMYTWGCITLDGYHNDLSTTPSCADCMVQSSRKRASSGKQRPKRGYTTFSFNRETNH

Query:  LQTANQDTKLSKARASDQEMRAIATKQELSPKHHVLGFIDGGKQSLTTGPCEGDSSRLGDFFHSEDVIPPPPTIEFKDGSDIDMSDILKNIKSLYKYNLI
          TANQDTKLSKARASDQEMRAIATKQELSPKHHVLGFIDGGKQSL TGPC GDSSRLGDFFHSEDVIPPPPTIEFKDGSDIDMSDILKNIKSLYKYNLI
Subjt:  LQTANQDTKLSKARASDQEMRAIATKQELSPKHHVLGFIDGGKQSLTTGPCEGDSSRLGDFFHSEDVIPPPPTIEFKDGSDIDMSDILKNIKSLYKYNLI

Query:  LREKLLTTESEVRALATKSSR
        LREKLLTTESEVRALATKSSR
Subjt:  LREKLLTTESEVRALATKSSR

A0A6J1CTK9 uncharacterized protein LOC111014173 isoform X54.1e-28198.2Show/hide
Query:  LDDQKVVIKRKTELGGTFSIASWILFIGLFAVLLYQIISKRSIEVHNIKAANAQDMVSFVCDMEFNITTVSTMSCENIRDLGSIVFGNPGFLKQKVMPLS
        +DDQKVVIKRKTELGGTFSIASWILFIGLFA LLYQIISKRSIEVHNIKAANAQDMVSFVCDMEFNITTVSTMSCENIRDLGSIVFGNPGFLKQKVMPLS
Subjt:  LDDQKVVIKRKTELGGTFSIASWILFIGLFAVLLYQIISKRSIEVHNIKAANAQDMVSFVCDMEFNITTVSTMSCENIRDLGSIVFGNPGFLKQKVMPLS

Query:  NFANYSCHNKSQGPTISFRCARCSFSQDNIYISWQFVDLPNSPASAVGFQFNLSSMNHAKNNHASFISGTLKNGSNFNDTPVTFRGKTANIVQFNLFPRI
        NFANYSCHNKSQGPTISFRCARC FSQDNIYISWQFVDLPNSPASAVGFQFNLSSMNHAKNNHASFISGTLKNGSNFNDTPVTFRGK ANIVQFNLFPRI
Subjt:  NFANYSCHNKSQGPTISFRCARCSFSQDNIYISWQFVDLPNSPASAVGFQFNLSSMNHAKNNHASFISGTLKNGSNFNDTPVTFRGKTANIVQFNLFPRI

Query:  YGNQQDSELIQPLFHEFLPGSSFQEISKLQSSLENSEDGLLNITMYINLLSSYIVEIEKQNILGPVSFLANIGGLYCISVAIFSYLLVQFEYRIKKLRNE
        YGNQQDSELIQPLFHEFLPGSSFQEIS+LQSSLENSEDGLLNITMYINLLSSYIVEIEKQNILGPVSFLAN+GGLYCISVAIFSYLLVQFEYRIKKLRNE
Subjt:  YGNQQDSELIQPLFHEFLPGSSFQEISKLQSSLENSEDGLLNITMYINLLSSYIVEIEKQNILGPVSFLANIGGLYCISVAIFSYLLVQFEYRIKKLRNE

Query:  DRVMRNIRNRRKAQEHWNKLRKYVMYTWGCITLDGYHNDLSTTPSCADCMVQSSRKRASSGKQRPKRGYTTFSFNRETNHLQTANQDTKLSKARASDQEM
        DRVMRNIRNRRKAQEHWNKLRKYVMYTWGCITLDGY+NDLSTTPSCADCMVQSSRKRASSGKQRPKRGYTTFSFNRETNHLQTANQDTKLSKARASDQEM
Subjt:  DRVMRNIRNRRKAQEHWNKLRKYVMYTWGCITLDGYHNDLSTTPSCADCMVQSSRKRASSGKQRPKRGYTTFSFNRETNHLQTANQDTKLSKARASDQEM

Query:  RAIATKQELSPKHHVLGFIDGGKQSLTTGPCEGDSSRLGDFFHSEDVIPPPPTIEFKDGSDIDMSDILKNIKSLYKYNLILREKLLTTESEVRALATKSS
        RAIATKQELSPKHHVLGFIDGGKQSL TGPC GDSSRLGDFFHSEDVIPPPPTIEFKDGSDIDMSDILKNIKSLYKYNLILREKLLTTESEVRALATKSS
Subjt:  RAIATKQELSPKHHVLGFIDGGKQSLTTGPCEGDSSRLGDFFHSEDVIPPPPTIEFKDGSDIDMSDILKNIKSLYKYNLILREKLLTTESEVRALATKSS

Query:  R
        R
Subjt:  R

A0A6J1CTV4 uncharacterized protein LOC111014173 isoform X10.0e+0098.55Show/hide
Query:  MSCPSNSFRYNGTLCACPPGHLFDLTTNSCGLFSSPSAIVMGRVESSAVSFPGTMFSFDSLRKLTQSQVVFLQATLVMLLSWLLFCLFLRFMKLGDGRSI
        MSCPSNSFRYNGTLCACPPGHLFDLTTNSCGLFSSPSAIVMGRVESSAVS+PGTMFSFDSLRKLTQSQVVFLQATLVMLLSWLLFCLFLRFMKLGDGRSI
Subjt:  MSCPSNSFRYNGTLCACPPGHLFDLTTNSCGLFSSPSAIVMGRVESSAVSFPGTMFSFDSLRKLTQSQVVFLQATLVMLLSWLLFCLFLRFMKLGDGRSI

Query:  WFRMRWWVTRLDLCFATTHWLDDQKVVIKRKTELGGTFSIASWILFIGLFAVLLYQIISKRSIEVHNIKAANAQDMVSFVCDMEFNITTVSTMSCENIRD
        WFRMRWWVTRLDLCFATTHWLDDQKVVIKRKTELGGTFSIASWILFIGLFA LLYQIISKRSIEVHNIKAANAQDMVSFVCDMEFNITTVSTMSCENIRD
Subjt:  WFRMRWWVTRLDLCFATTHWLDDQKVVIKRKTELGGTFSIASWILFIGLFAVLLYQIISKRSIEVHNIKAANAQDMVSFVCDMEFNITTVSTMSCENIRD

Query:  LGSIVFGNPGFLKQKVMPLSNFANYSCHNKSQGPTISFRCARCSFSQDNIYISWQFVDLPNSPASAVGFQFNLSSMNHAKNNHASFISGTLKNGSNFNDT
        LGSIVFGNPGFLKQKVMPLSNFANYSCHNKSQGPTISFRCARC FSQDNIYISWQFVDLPNSPASAVGFQFNLSSMNHAKNNHASFISGTLKNGSNFNDT
Subjt:  LGSIVFGNPGFLKQKVMPLSNFANYSCHNKSQGPTISFRCARCSFSQDNIYISWQFVDLPNSPASAVGFQFNLSSMNHAKNNHASFISGTLKNGSNFNDT

Query:  PVTFRGKTANIVQFNLFPRIYGNQQDSELIQPLFHEFLPGSSFQEISKLQSSLENSEDGLLNITMYINLLSSYIVEIEKQNILGPVSFLANIGGLYCISV
        PVTFRGK ANIVQFNLFPRIYGNQQDSELIQPLFHEFLPGSSFQEIS+LQSSLENSEDGLLNITMYINLLSSYIVEIEKQNILGPVSFLAN+GGLYCISV
Subjt:  PVTFRGKTANIVQFNLFPRIYGNQQDSELIQPLFHEFLPGSSFQEISKLQSSLENSEDGLLNITMYINLLSSYIVEIEKQNILGPVSFLANIGGLYCISV

Query:  AIFSYLLVQFEYRIKKLRNEDRVMRNIRNRRKAQEHWNKLRKYVMYTWGCITLDGYHNDLSTTPSCADCMVQSSRKRASSGKQRPKRGYTTFSFNRETNH
        AIFSYLLVQFEYRIKKLRNEDRVMRNIRNRRKAQEHWNKLRKYVMYTWGCITLDGY+NDLSTTPSCADCMVQSSRKRASSGKQRPKRGYTTFSFNRETNH
Subjt:  AIFSYLLVQFEYRIKKLRNEDRVMRNIRNRRKAQEHWNKLRKYVMYTWGCITLDGYHNDLSTTPSCADCMVQSSRKRASSGKQRPKRGYTTFSFNRETNH

Query:  LQTANQDTKLSKARASDQEMRAIATKQELSPKHHVLGFIDGGKQSLTTGPCEGDSSRLGDFFHSEDVIPPPPTIEFKDGSDIDMSDILKNIKSLYKYNLI
        LQTANQDTKLSKARASDQEMRAIATKQELSPKHHVLGFIDGGKQSL TGPC GDSSRLGDFFHSEDVIPPPPTIEFKDGSDIDMSDILKNIKSLYKYNLI
Subjt:  LQTANQDTKLSKARASDQEMRAIATKQELSPKHHVLGFIDGGKQSLTTGPCEGDSSRLGDFFHSEDVIPPPPTIEFKDGSDIDMSDILKNIKSLYKYNLI

Query:  LREKLLTTESEVRALATKSSR
        LREKLLTTESEVRALATKSSR
Subjt:  LREKLLTTESEVRALATKSSR

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT5G16520.1 unknown protein1.8e-18052.97Show/hide
Query:  MSCPSNSFRYNGTLCACPPGHLFDLTTNSCGLFSSPSAIVMGR-VESSAVSFPGTMFSFDSLRKLTQSQVVFLQATLVMLLSWLLFCLFLRFMKLGDGRS
        M+CP NS  YN T CAC  G L + ++ SC +F  PS I   + V  S +SF  T+F+FD +RK TQSQ +FL+ATLVMLLSWL+FC FLRF KLGDGR+
Subjt:  MSCPSNSFRYNGTLCACPPGHLFDLTTNSCGLFSSPSAIVMGR-VESSAVSFPGTMFSFDSLRKLTQSQVVFLQATLVMLLSWLLFCLFLRFMKLGDGRS

Query:  IWFRMRWWVTRLDLCFATTHWLDDQKVVIKRKTELGGTFSIASWILFIGLFAVLLYQIISKRSIEVHNIKAANAQDMVSFVCDMEFNITTVSTMSCENIR
        +WF +RWW+TRLD+ F+T HWLDDQ++V KRKTELGGTFS+ASWI+FIGLFA LLYQII+KR+IEVHN++A  + D++SF  D+EFNIT VS MSC N+R
Subjt:  IWFRMRWWVTRLDLCFATTHWLDDQKVVIKRKTELGGTFSIASWILFIGLFAVLLYQIISKRSIEVHNIKAANAQDMVSFVCDMEFNITTVSTMSCENIR

Query:  DLGSIVFGNPGFLKQKVMPLSNFANYSCHNKSQGPTISFRCARCSFSQDNIYISWQFVDLPNSPASAVGFQFNLSSMNHAKNNHASFISGTLKNGSNFND
         +G++V GNPGF + KV  LS+  +Y+C N + GPT++F+C +C  + D IYISW FVDLP+SPA+AVGFQFN +S N     H SF+SGTL+NGS  ++
Subjt:  DLGSIVFGNPGFLKQKVMPLSNFANYSCHNKSQGPTISFRCARCSFSQDNIYISWQFVDLPNSPASAVGFQFNLSSMNHAKNNHASFISGTLKNGSNFND

Query:  TPVTFRGKTANIVQFNLFPRIYGNQQDSELIQPLFHEFLPGSSFQEISKLQSSLENSEDGLLNITMYINLLSSYIVEIEKQNILGPVSFLANIGGLYCIS
        +PVTFRG   NI++FNLFPRIY +  D +LIQPLFHEF+PGS +++ ++LQ+S+  S DG+LN T++IN LS+YIVEI+ +NILGPVSFLA++GGLYCIS
Subjt:  TPVTFRGKTANIVQFNLFPRIYGNQQDSELIQPLFHEFLPGSSFQEISKLQSSLENSEDGLLNITMYINLLSSYIVEIEKQNILGPVSFLANIGGLYCIS

Query:  VAIFSYLLVQFEYRIKKLRNEDRVMRNIRNRRKAQEHWNKLRKYVMYTWGCITLDGYHNDLSTTPSCADCMVQSSRKRASSGKQRPKRGYTTFSFNRETN
        + IF YLLVQ EYRIKKLRNED V R IRNRRKA +HW+KLR+YV YTW C  L              D  +++++     G  RP     T S + E  
Subjt:  VAIFSYLLVQFEYRIKKLRNEDRVMRNIRNRRKAQEHWNKLRKYVMYTWGCITLDGYHNDLSTTPSCADCMVQSSRKRASSGKQRPKRGYTTFSFNRETN

Query:  HLQTANQDTKLSKARASDQEMRAIATKQELSPKHHVLGFIDGGKQSLTTGPCEGDSSRLGDFFHSEDV-IPPPPTIEFKD---GSDIDMSDILKNIKSLY
            AN+   L   +    +  ++      S      G     K+S+T               HSEDV IPPPP +EF D   GS++D  DI    + LY
Subjt:  HLQTANQDTKLSKARASDQEMRAIATKQELSPKHHVLGFIDGGKQSLTTGPCEGDSSRLGDFFHSEDV-IPPPPTIEFKD---GSDIDMSDILKNIKSLY

Query:  KYNLILREKLLTTESEVRALATK
         YN++LREKLL T+S +  LA K
Subjt:  KYNLILREKLLTTESEVRALATK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCATGCCCGAGCAACAGTTTCCGCTACAATGGCACCCTCTGCGCATGCCCACCGGGCCATCTTTTCGATCTGACCACCAACAGCTGTGGCCTCTTCAGCAGCCCTTC
GGCCATTGTCATGGGCCGCGTCGAGAGCTCTGCCGTTAGTTTCCCTGGGACCATGTTCTCTTTTGATTCCCTCAGGAAGTTAACGCAGTCTCAGGTTGTGTTTCTTCAGG
CTACTCTTGTCATGCTGCTTTCCTGGCTGCTCTTCTGCTTGTTTTTGAGATTCATGAAGCTTGGGGATGGCAGAAGCATCTGGTTCAGGATGAGATGGTGGGTTACCAGA
CTGGACCTCTGCTTTGCCACAACTCATTGGCTGGATGACCAGAAAGTAGTCATAAAACGGAAAACTGAGCTTGGTGGAACATTCTCAATAGCAAGTTGGATACTTTTCAT
TGGCTTGTTTGCTGTGTTGCTTTACCAAATCATATCCAAGAGAAGCATTGAAGTGCATAATATCAAAGCAGCAAATGCACAGGACATGGTTTCTTTCGTGTGTGATATGG
AATTTAATATAACCACAGTCTCTACTATGAGCTGTGAAAATATTCGTGATCTTGGTTCTATTGTGTTTGGAAATCCTGGTTTTCTGAAACAGAAAGTAATGCCTCTGTCA
AATTTTGCAAACTACTCCTGTCATAACAAGAGTCAAGGGCCAACCATCAGTTTTCGGTGTGCAAGATGCAGTTTCAGTCAGGACAATATTTATATCTCATGGCAGTTTGT
TGATCTTCCAAATAGTCCTGCTAGTGCTGTTGGATTTCAGTTTAACCTCTCTTCAATGAATCATGCCAAAAATAATCATGCAAGTTTTATTAGTGGAACATTAAAAAATG
GGAGCAATTTTAATGATACACCAGTTACGTTCAGGGGGAAGACTGCCAATATAGTGCAATTCAATCTATTTCCAAGAATATACGGTAACCAACAGGATTCTGAGCTCATA
CAGCCTTTATTTCATGAGTTTCTCCCTGGTTCATCCTTTCAAGAGATAAGCAAGCTCCAATCATCCCTTGAGAATTCCGAAGATGGACTACTCAACATCACCATGTATAT
CAATCTGCTCTCTTCCTACATTGTTGAGATAGAGAAGCAAAATATTTTGGGCCCTGTTAGCTTTCTGGCCAATATTGGTGGCCTATATTGCATTAGTGTTGCCATTTTTT
CTTACCTCCTGGTGCAGTTTGAATACAGGATTAAAAAGCTCCGTAACGAAGATAGAGTTATGCGTAATATTAGAAATCGAAGAAAAGCACAAGAACACTGGAATAAGCTG
AGGAAATATGTAATGTATACATGGGGCTGCATAACACTCGATGGTTATCACAATGATCTGTCAACAACACCAAGTTGCGCTGACTGCATGGTTCAATCAAGTCGTAAGCG
TGCATCGTCAGGCAAGCAAAGACCAAAGAGGGGATATACTACTTTCAGTTTTAACAGAGAAACTAATCACCTGCAGACTGCCAATCAGGATACGAAATTATCTAAGGCAA
GAGCTAGTGACCAAGAAATGAGAGCGATAGCAACCAAACAAGAACTATCTCCAAAACACCATGTACTCGGTTTTATCGATGGGGGGAAGCAAAGCTTAACGACGGGTCCA
TGTGAGGGAGATTCCTCACGACTTGGAGACTTTTTTCATTCTGAAGATGTTATTCCCCCACCACCCACAATAGAGTTTAAGGACGGTTCTGATATTGACATGTCTGATAT
CTTGAAGAATATCAAAAGTTTGTACAAATATAATTTAATTCTTAGAGAAAAGCTATTGACTACTGAATCAGAGGTTCGTGCTTTAGCAACCAAGTCCTCTCGA
mRNA sequenceShow/hide mRNA sequence
ATGTCATGCCCGAGCAACAGTTTCCGCTACAATGGCACCCTCTGCGCATGCCCACCGGGCCATCTTTTCGATCTGACCACCAACAGCTGTGGCCTCTTCAGCAGCCCTTC
GGCCATTGTCATGGGCCGCGTCGAGAGCTCTGCCGTTAGTTTCCCTGGGACCATGTTCTCTTTTGATTCCCTCAGGAAGTTAACGCAGTCTCAGGTTGTGTTTCTTCAGG
CTACTCTTGTCATGCTGCTTTCCTGGCTGCTCTTCTGCTTGTTTTTGAGATTCATGAAGCTTGGGGATGGCAGAAGCATCTGGTTCAGGATGAGATGGTGGGTTACCAGA
CTGGACCTCTGCTTTGCCACAACTCATTGGCTGGATGACCAGAAAGTAGTCATAAAACGGAAAACTGAGCTTGGTGGAACATTCTCAATAGCAAGTTGGATACTTTTCAT
TGGCTTGTTTGCTGTGTTGCTTTACCAAATCATATCCAAGAGAAGCATTGAAGTGCATAATATCAAAGCAGCAAATGCACAGGACATGGTTTCTTTCGTGTGTGATATGG
AATTTAATATAACCACAGTCTCTACTATGAGCTGTGAAAATATTCGTGATCTTGGTTCTATTGTGTTTGGAAATCCTGGTTTTCTGAAACAGAAAGTAATGCCTCTGTCA
AATTTTGCAAACTACTCCTGTCATAACAAGAGTCAAGGGCCAACCATCAGTTTTCGGTGTGCAAGATGCAGTTTCAGTCAGGACAATATTTATATCTCATGGCAGTTTGT
TGATCTTCCAAATAGTCCTGCTAGTGCTGTTGGATTTCAGTTTAACCTCTCTTCAATGAATCATGCCAAAAATAATCATGCAAGTTTTATTAGTGGAACATTAAAAAATG
GGAGCAATTTTAATGATACACCAGTTACGTTCAGGGGGAAGACTGCCAATATAGTGCAATTCAATCTATTTCCAAGAATATACGGTAACCAACAGGATTCTGAGCTCATA
CAGCCTTTATTTCATGAGTTTCTCCCTGGTTCATCCTTTCAAGAGATAAGCAAGCTCCAATCATCCCTTGAGAATTCCGAAGATGGACTACTCAACATCACCATGTATAT
CAATCTGCTCTCTTCCTACATTGTTGAGATAGAGAAGCAAAATATTTTGGGCCCTGTTAGCTTTCTGGCCAATATTGGTGGCCTATATTGCATTAGTGTTGCCATTTTTT
CTTACCTCCTGGTGCAGTTTGAATACAGGATTAAAAAGCTCCGTAACGAAGATAGAGTTATGCGTAATATTAGAAATCGAAGAAAAGCACAAGAACACTGGAATAAGCTG
AGGAAATATGTAATGTATACATGGGGCTGCATAACACTCGATGGTTATCACAATGATCTGTCAACAACACCAAGTTGCGCTGACTGCATGGTTCAATCAAGTCGTAAGCG
TGCATCGTCAGGCAAGCAAAGACCAAAGAGGGGATATACTACTTTCAGTTTTAACAGAGAAACTAATCACCTGCAGACTGCCAATCAGGATACGAAATTATCTAAGGCAA
GAGCTAGTGACCAAGAAATGAGAGCGATAGCAACCAAACAAGAACTATCTCCAAAACACCATGTACTCGGTTTTATCGATGGGGGGAAGCAAAGCTTAACGACGGGTCCA
TGTGAGGGAGATTCCTCACGACTTGGAGACTTTTTTCATTCTGAAGATGTTATTCCCCCACCACCCACAATAGAGTTTAAGGACGGTTCTGATATTGACATGTCTGATAT
CTTGAAGAATATCAAAAGTTTGTACAAATATAATTTAATTCTTAGAGAAAAGCTATTGACTACTGAATCAGAGGTTCGTGCTTTAGCAACCAAGTCCTCTCGA
Protein sequenceShow/hide protein sequence
MSCPSNSFRYNGTLCACPPGHLFDLTTNSCGLFSSPSAIVMGRVESSAVSFPGTMFSFDSLRKLTQSQVVFLQATLVMLLSWLLFCLFLRFMKLGDGRSIWFRMRWWVTR
LDLCFATTHWLDDQKVVIKRKTELGGTFSIASWILFIGLFAVLLYQIISKRSIEVHNIKAANAQDMVSFVCDMEFNITTVSTMSCENIRDLGSIVFGNPGFLKQKVMPLS
NFANYSCHNKSQGPTISFRCARCSFSQDNIYISWQFVDLPNSPASAVGFQFNLSSMNHAKNNHASFISGTLKNGSNFNDTPVTFRGKTANIVQFNLFPRIYGNQQDSELI
QPLFHEFLPGSSFQEISKLQSSLENSEDGLLNITMYINLLSSYIVEIEKQNILGPVSFLANIGGLYCISVAIFSYLLVQFEYRIKKLRNEDRVMRNIRNRRKAQEHWNKL
RKYVMYTWGCITLDGYHNDLSTTPSCADCMVQSSRKRASSGKQRPKRGYTTFSFNRETNHLQTANQDTKLSKARASDQEMRAIATKQELSPKHHVLGFIDGGKQSLTTGP
CEGDSSRLGDFFHSEDVIPPPPTIEFKDGSDIDMSDILKNIKSLYKYNLILREKLLTTESEVRALATKSSR