CuGenDBv2

HG10009409 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene ID: HG10009409
Organism: Lagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Description: Unknown protein
Genome location: Chr06:5597098..5598651
RNA-Seq Expression: HG10009409
Synteny: HG10009409
Gene Ontology terms: NA
InterPro domains: NA
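
The genome location in the record above follows a chromosome:start..end pattern. As a minimal sketch, the span can be parsed and measured as shown here. It assumes the coordinates are 1-based and inclusive, which the record itself does not state. The function names `parse_location` and `span_length` are hypothetical helpers for illustration, not part of any CuGenDB tool.

```python
import re

def parse_location(location):
    """Split a string such as 'Chr06:5597098..5598651' into (chromosome, start, end)."""
    match = re.fullmatch(r"([^:\s]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))

def span_length(start, end):
    # Assumption: coordinates are 1-based and inclusive, so both ends count.
    return end - start + 1

chrom, start, end = parse_location("Chr06:5597098..5598651")
print(chrom, start, end, span_length(start, end))  # Chr06 5597098 5598651 1554
```

The resulting span can be compared against the lengths of the sequences listed further down the page.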


Homology
GenBank top hits | e value | % identity | Alignment
KAA0055126.1 uncharacterized protein E6C27_scaffold231G00550 [Cucumis melo var. makuwa] | e value: 1.9e-286 | % identity: 94.97
Query:  MASTNSPPNIDASALTEDLATKALNKRYECLVTVRTKAIRGKGAWYWAHLEPVLIRNPTNSLPKAVKLKCSLCDSVFSASNPSRTDSEHLKRGTCPNLSS
        MASTNSPPNID+S LTEDLATKALNKRYECLVTVRTKAI+GKGAWYWAHLEPVLIRNPTNSLPKAVKLKCSLCDSVFSASNPSRT SEHLKRGTCPNLSS
Subjt:  MASTNSPPNIDASALTEDLATKALNKRYECLVTVRTKAIRGKGAWYWAHLEPVLIRNPTNSLPKAVKLKCSLCDSVFSASNPSRTDSEHLKRGTCPNLSS

Query:  ISRSNASAASPLPISSIPSPTLHNHKKRSSQMNSPILTASYQVHSLAMIEPTRSYAPLISSPPTPVAQNSLGMAGKMGFNQHQLVLSGGKDDLGALEMLE
        ISRSNASAASPLPISSIPSPTLHNHKKRSSQMN+PILTASYQVHSLAMIEPTRSYAPLISSPPTPVAQNS+GM  KMGFNQHQLVLSGGKDDLGALEMLE
Subjt:  ISRSNASAASPLPISSIPSPTLHNHKKRSSQMNSPILTASYQVHSLAMIEPTRSYAPLISSPPTPVAQNSLGMAGKMGFNQHQLVLSGGKDDLGALEMLE

Query:  NSVKKLKSPHASPGPRLSKEQIDSAIELLTDWFNESCGSVSLSCLEHPKFKALLTQLGLPSLPRTDILGARLDSKFEEAKADSEARIRDAAFFQIASDGW
        NSVKKLKSPHASPGPRLSKEQIDSAIELLTDWF ESCGSVSLSC +HPKFKALL+QLGLPSLP+TDILGARLDSKFEEAKADSEARIRDAAFFQIASDGW
Subjt:  NSVKKLKSPHASPGPRLSKEQIDSAIELLTDWFNESCGSVSLSCLEHPKFKALLTQLGLPSLPRTDILGARLDSKFEEAKADSEARIRDAAFFQIASDGW

Query:  KNKNCCGEESVVKFIVNLPNGTTVFQKALFTGGLVSSKYAEEVILDTVNEICGSGLQRCVGIIADKYKAKALRNLEIKNHWMVNLSCQLQGFISLIKDFN
        KNKNCC EESVVKF+VNLPNGTTVFQKALFTGGLVSSKYAEEVILDTVNEICGSGLQ+CVGIIAD+YKAKALRNLEIKNHWMVNLSCQLQGFISLIKDFN
Subjt:  KNKNCCGEESVVKFIVNLPNGTTVFQKALFTGGLVSSKYAEEVILDTVNEICGSGLQRCVGIIADKYKAKALRNLEIKNHWMVNLSCQLQGFISLIKDFN

Query:  KELPLFRAVTENCLKVANFVNTNAHIRNCINKYKVQELEGHWLLHVPSPNCDTSKNFSPVYAMLDDMLNCAHVLQMVVLDESYKLACMEDSLATEVSSLI
        KELPLFRAVTENCLKVANFVNT + +RNCINKYKVQELEGHWLLHVPSPNCDTSKNFSPVY+MLDDMLNC HVLQMVVLDESYK+ACMEDSLATEVSSLI
Subjt:  KELPLFRAVTENCLKVANFVNTNAHIRNCINKYKVQELEGHWLLHVPSPNCDTSKNFSPVYAMLDDMLNCAHVLQMVVLDESYKLACMEDSLATEVSSLI

Query:  QNERFWDEMEAVHSFVR
        QNERFWDE+EAVHSFV+
Subjt:  QNERFWDEMEAVHSFVR

XP_038907184.1 uncharacterized protein LOC120092979 isoform X1 [Benincasa hispida] | e value: 5.0e-287 | % identity: 95.94
Query:  MASTNSPPNIDASALTEDLATKALNKRYECLVTVRTKAIRGKGAWYWAHLEPVLIRNPTNSLPKAVKLKCSLCDSVFSASNPSRTDSEHLKRGTCPNLSS
        MASTNSPPNIDASALTEDLATKALNKRYECLVTVRTKAI+GKGAWYWAHLEPVLIRNPTNSLPKAVKLKCSLCDSVFSASNPSRT SEHLKRGTCPNLSS
Subjt:  MASTNSPPNIDASALTEDLATKALNKRYECLVTVRTKAIRGKGAWYWAHLEPVLIRNPTNSLPKAVKLKCSLCDSVFSASNPSRTDSEHLKRGTCPNLSS

Query:  ISRSNASAASPLPISSIPSPTLHNHKKRSSQMNSPILTASYQVHSLAMIEPTRSYAPLISSPPTPVAQNSLGMAGKMGFNQHQLVLSGGKDDLGALEMLE
        ISRSNASAASPLPISSIPSPTLHNHKKRSSQMN+ ILTASYQVHSLAMIEPTRSYAPLISSPPTPVAQNSLGM  KMGFNQHQ VLSGGKDDLGALEMLE
Subjt:  ISRSNASAASPLPISSIPSPTLHNHKKRSSQMNSPILTASYQVHSLAMIEPTRSYAPLISSPPTPVAQNSLGMAGKMGFNQHQLVLSGGKDDLGALEMLE

Query:  NSVKKLKSPHASPGPRLSKEQIDSAIELLTDWFNESCGSVSLSCLEHPKFKALLTQLGLPSLPRTDILGARLDSKFEEAKADSEARIRDAAFFQIASDGW
        NSVKKLKSPHASP PRLSKEQIDSAIELLTDWF ESCGSVSLSCLEHPKFKALLTQLGLPSLPRTDILGARLDSKFEEAKADSEARIRDAAFFQIASDGW
Subjt:  NSVKKLKSPHASPGPRLSKEQIDSAIELLTDWFNESCGSVSLSCLEHPKFKALLTQLGLPSLPRTDILGARLDSKFEEAKADSEARIRDAAFFQIASDGW

Query:  KNKNCCGEESVVKFIVNLPNGTTVFQKALFTGGLVSSKYAEEVILDTVNEICGSGLQRCVGIIADKYKAKALRNLEIKNHWMVNLSCQLQGFISLIKDFN
        KNKNCCGEESVVKF+VNLPNGTTVFQKALFTGGLVSSKYAEEVILDTVNEICGSGLQ+CVGIIAD+YKAKALRNLEIKNHWMVNLSCQLQGFISLIKDFN
Subjt:  KNKNCCGEESVVKFIVNLPNGTTVFQKALFTGGLVSSKYAEEVILDTVNEICGSGLQRCVGIIADKYKAKALRNLEIKNHWMVNLSCQLQGFISLIKDFN

Query:  KELPLFRAVTENCLKVANFVNTNAHIRNCINKYKVQELEGHWLLHVPSPNCDTSKNFSPVYAMLDDMLNCAHVLQMVVLDESYKLACMEDSLATEVSSLI
        KELPLFRAVTENCLKVANFVNT + +RNCINKYKVQELEGHWLLHVPSPNCDTSKNFSPVYAMLDDML+C HVLQMVVLDES+KLACMEDSLATEVSSLI
Subjt:  KELPLFRAVTENCLKVANFVNTNAHIRNCINKYKVQELEGHWLLHVPSPNCDTSKNFSPVYAMLDDMLNCAHVLQMVVLDESYKLACMEDSLATEVSSLI

Query:  QNERFWDEMEAVHSFVR
        QNERFWDEMEAVHS V+
Subjt:  QNERFWDEMEAVHSFVR

XP_038907185.1 uncharacterized protein LOC120092979 isoform X2 [Benincasa hispida] | e value: 5.0e-287 | % identity: 95.94
Query:  MASTNSPPNIDASALTEDLATKALNKRYECLVTVRTKAIRGKGAWYWAHLEPVLIRNPTNSLPKAVKLKCSLCDSVFSASNPSRTDSEHLKRGTCPNLSS
        MASTNSPPNIDASALTEDLATKALNKRYECLVTVRTKAI+GKGAWYWAHLEPVLIRNPTNSLPKAVKLKCSLCDSVFSASNPSRT SEHLKRGTCPNLSS
Subjt:  MASTNSPPNIDASALTEDLATKALNKRYECLVTVRTKAIRGKGAWYWAHLEPVLIRNPTNSLPKAVKLKCSLCDSVFSASNPSRTDSEHLKRGTCPNLSS

Query:  ISRSNASAASPLPISSIPSPTLHNHKKRSSQMNSPILTASYQVHSLAMIEPTRSYAPLISSPPTPVAQNSLGMAGKMGFNQHQLVLSGGKDDLGALEMLE
        ISRSNASAASPLPISSIPSPTLHNHKKRSSQMN+ ILTASYQVHSLAMIEPTRSYAPLISSPPTPVAQNSLGM  KMGFNQHQ VLSGGKDDLGALEMLE
Subjt:  ISRSNASAASPLPISSIPSPTLHNHKKRSSQMNSPILTASYQVHSLAMIEPTRSYAPLISSPPTPVAQNSLGMAGKMGFNQHQLVLSGGKDDLGALEMLE

Query:  NSVKKLKSPHASPGPRLSKEQIDSAIELLTDWFNESCGSVSLSCLEHPKFKALLTQLGLPSLPRTDILGARLDSKFEEAKADSEARIRDAAFFQIASDGW
        NSVKKLKSPHASP PRLSKEQIDSAIELLTDWF ESCGSVSLSCLEHPKFKALLTQLGLPSLPRTDILGARLDSKFEEAKADSEARIRDAAFFQIASDGW
Subjt:  NSVKKLKSPHASPGPRLSKEQIDSAIELLTDWFNESCGSVSLSCLEHPKFKALLTQLGLPSLPRTDILGARLDSKFEEAKADSEARIRDAAFFQIASDGW

Query:  KNKNCCGEESVVKFIVNLPNGTTVFQKALFTGGLVSSKYAEEVILDTVNEICGSGLQRCVGIIADKYKAKALRNLEIKNHWMVNLSCQLQGFISLIKDFN
        KNKNCCGEESVVKF+VNLPNGTTVFQKALFTGGLVSSKYAEEVILDTVNEICGSGLQ+CVGIIAD+YKAKALRNLEIKNHWMVNLSCQLQGFISLIKDFN
Subjt:  KNKNCCGEESVVKFIVNLPNGTTVFQKALFTGGLVSSKYAEEVILDTVNEICGSGLQRCVGIIADKYKAKALRNLEIKNHWMVNLSCQLQGFISLIKDFN

Query:  KELPLFRAVTENCLKVANFVNTNAHIRNCINKYKVQELEGHWLLHVPSPNCDTSKNFSPVYAMLDDMLNCAHVLQMVVLDESYKLACMEDSLATEVSSLI
        KELPLFRAVTENCLKVANFVNT + +RNCINKYKVQELEGHWLLHVPSPNCDTSKNFSPVYAMLDDML+C HVLQMVVLDES+KLACMEDSLATEVSSLI
Subjt:  KELPLFRAVTENCLKVANFVNTNAHIRNCINKYKVQELEGHWLLHVPSPNCDTSKNFSPVYAMLDDMLNCAHVLQMVVLDESYKLACMEDSLATEVSSLI

Query:  QNERFWDEMEAVHSFVR
        QNERFWDEMEAVHS V+
Subjt:  QNERFWDEMEAVHSFVR

XP_038907186.1 uncharacterized protein LOC120092979 isoform X3 [Benincasa hispida] | e value: 5.0e-287 | % identity: 95.94
Query:  MASTNSPPNIDASALTEDLATKALNKRYECLVTVRTKAIRGKGAWYWAHLEPVLIRNPTNSLPKAVKLKCSLCDSVFSASNPSRTDSEHLKRGTCPNLSS
        MASTNSPPNIDASALTEDLATKALNKRYECLVTVRTKAI+GKGAWYWAHLEPVLIRNPTNSLPKAVKLKCSLCDSVFSASNPSRT SEHLKRGTCPNLSS
Subjt:  MASTNSPPNIDASALTEDLATKALNKRYECLVTVRTKAIRGKGAWYWAHLEPVLIRNPTNSLPKAVKLKCSLCDSVFSASNPSRTDSEHLKRGTCPNLSS

Query:  ISRSNASAASPLPISSIPSPTLHNHKKRSSQMNSPILTASYQVHSLAMIEPTRSYAPLISSPPTPVAQNSLGMAGKMGFNQHQLVLSGGKDDLGALEMLE
        ISRSNASAASPLPISSIPSPTLHNHKKRSSQMN+ ILTASYQVHSLAMIEPTRSYAPLISSPPTPVAQNSLGM  KMGFNQHQ VLSGGKDDLGALEMLE
Subjt:  ISRSNASAASPLPISSIPSPTLHNHKKRSSQMNSPILTASYQVHSLAMIEPTRSYAPLISSPPTPVAQNSLGMAGKMGFNQHQLVLSGGKDDLGALEMLE

Query:  NSVKKLKSPHASPGPRLSKEQIDSAIELLTDWFNESCGSVSLSCLEHPKFKALLTQLGLPSLPRTDILGARLDSKFEEAKADSEARIRDAAFFQIASDGW
        NSVKKLKSPHASP PRLSKEQIDSAIELLTDWF ESCGSVSLSCLEHPKFKALLTQLGLPSLPRTDILGARLDSKFEEAKADSEARIRDAAFFQIASDGW
Subjt:  NSVKKLKSPHASPGPRLSKEQIDSAIELLTDWFNESCGSVSLSCLEHPKFKALLTQLGLPSLPRTDILGARLDSKFEEAKADSEARIRDAAFFQIASDGW

Query:  KNKNCCGEESVVKFIVNLPNGTTVFQKALFTGGLVSSKYAEEVILDTVNEICGSGLQRCVGIIADKYKAKALRNLEIKNHWMVNLSCQLQGFISLIKDFN
        KNKNCCGEESVVKF+VNLPNGTTVFQKALFTGGLVSSKYAEEVILDTVNEICGSGLQ+CVGIIAD+YKAKALRNLEIKNHWMVNLSCQLQGFISLIKDFN
Subjt:  KNKNCCGEESVVKFIVNLPNGTTVFQKALFTGGLVSSKYAEEVILDTVNEICGSGLQRCVGIIADKYKAKALRNLEIKNHWMVNLSCQLQGFISLIKDFN

Query:  KELPLFRAVTENCLKVANFVNTNAHIRNCINKYKVQELEGHWLLHVPSPNCDTSKNFSPVYAMLDDMLNCAHVLQMVVLDESYKLACMEDSLATEVSSLI
        KELPLFRAVTENCLKVANFVNT + +RNCINKYKVQELEGHWLLHVPSPNCDTSKNFSPVYAMLDDML+C HVLQMVVLDES+KLACMEDSLATEVSSLI
Subjt:  KELPLFRAVTENCLKVANFVNTNAHIRNCINKYKVQELEGHWLLHVPSPNCDTSKNFSPVYAMLDDMLNCAHVLQMVVLDESYKLACMEDSLATEVSSLI

Query:  QNERFWDEMEAVHSFVR
        QNERFWDEMEAVHS V+
Subjt:  QNERFWDEMEAVHSFVR

XP_038907187.1 uncharacterized protein LOC120092979 isoform X4 [Benincasa hispida] | e value: 5.0e-287 | % identity: 95.94
Query:  MASTNSPPNIDASALTEDLATKALNKRYECLVTVRTKAIRGKGAWYWAHLEPVLIRNPTNSLPKAVKLKCSLCDSVFSASNPSRTDSEHLKRGTCPNLSS
        MASTNSPPNIDASALTEDLATKALNKRYECLVTVRTKAI+GKGAWYWAHLEPVLIRNPTNSLPKAVKLKCSLCDSVFSASNPSRT SEHLKRGTCPNLSS
Subjt:  MASTNSPPNIDASALTEDLATKALNKRYECLVTVRTKAIRGKGAWYWAHLEPVLIRNPTNSLPKAVKLKCSLCDSVFSASNPSRTDSEHLKRGTCPNLSS

Query:  ISRSNASAASPLPISSIPSPTLHNHKKRSSQMNSPILTASYQVHSLAMIEPTRSYAPLISSPPTPVAQNSLGMAGKMGFNQHQLVLSGGKDDLGALEMLE
        ISRSNASAASPLPISSIPSPTLHNHKKRSSQMN+ ILTASYQVHSLAMIEPTRSYAPLISSPPTPVAQNSLGM  KMGFNQHQ VLSGGKDDLGALEMLE
Subjt:  ISRSNASAASPLPISSIPSPTLHNHKKRSSQMNSPILTASYQVHSLAMIEPTRSYAPLISSPPTPVAQNSLGMAGKMGFNQHQLVLSGGKDDLGALEMLE

Query:  NSVKKLKSPHASPGPRLSKEQIDSAIELLTDWFNESCGSVSLSCLEHPKFKALLTQLGLPSLPRTDILGARLDSKFEEAKADSEARIRDAAFFQIASDGW
        NSVKKLKSPHASP PRLSKEQIDSAIELLTDWF ESCGSVSLSCLEHPKFKALLTQLGLPSLPRTDILGARLDSKFEEAKADSEARIRDAAFFQIASDGW
Subjt:  NSVKKLKSPHASPGPRLSKEQIDSAIELLTDWFNESCGSVSLSCLEHPKFKALLTQLGLPSLPRTDILGARLDSKFEEAKADSEARIRDAAFFQIASDGW

Query:  KNKNCCGEESVVKFIVNLPNGTTVFQKALFTGGLVSSKYAEEVILDTVNEICGSGLQRCVGIIADKYKAKALRNLEIKNHWMVNLSCQLQGFISLIKDFN
        KNKNCCGEESVVKF+VNLPNGTTVFQKALFTGGLVSSKYAEEVILDTVNEICGSGLQ+CVGIIAD+YKAKALRNLEIKNHWMVNLSCQLQGFISLIKDFN
Subjt:  KNKNCCGEESVVKFIVNLPNGTTVFQKALFTGGLVSSKYAEEVILDTVNEICGSGLQRCVGIIADKYKAKALRNLEIKNHWMVNLSCQLQGFISLIKDFN

Query:  KELPLFRAVTENCLKVANFVNTNAHIRNCINKYKVQELEGHWLLHVPSPNCDTSKNFSPVYAMLDDMLNCAHVLQMVVLDESYKLACMEDSLATEVSSLI
        KELPLFRAVTENCLKVANFVNT + +RNCINKYKVQELEGHWLLHVPSPNCDTSKNFSPVYAMLDDML+C HVLQMVVLDES+KLACMEDSLATEVSSLI
Subjt:  KELPLFRAVTENCLKVANFVNTNAHIRNCINKYKVQELEGHWLLHVPSPNCDTSKNFSPVYAMLDDMLNCAHVLQMVVLDESYKLACMEDSLATEVSSLI

Query:  QNERFWDEMEAVHSFVR
        QNERFWDEMEAVHS V+
Subjt:  QNERFWDEMEAVHSFVR

TrEMBL top hits | e value | % identity | Alignment
A0A1S3CT68 uncharacterized protein LOC103504669 isoform X4 | e value: 9.2e-287 | % identity: 94.97
Query:  MASTNSPPNIDASALTEDLATKALNKRYECLVTVRTKAIRGKGAWYWAHLEPVLIRNPTNSLPKAVKLKCSLCDSVFSASNPSRTDSEHLKRGTCPNLSS
        MASTNSPPNID+S LTEDLATKALNKRYECLVTVRTKAI+GKGAWYWAHLEPVLIRNPTNSLPKAVKLKCSLCDSVFSASNPSRT SEHLKRGTCPNLSS
Subjt:  MASTNSPPNIDASALTEDLATKALNKRYECLVTVRTKAIRGKGAWYWAHLEPVLIRNPTNSLPKAVKLKCSLCDSVFSASNPSRTDSEHLKRGTCPNLSS

Query:  ISRSNASAASPLPISSIPSPTLHNHKKRSSQMNSPILTASYQVHSLAMIEPTRSYAPLISSPPTPVAQNSLGMAGKMGFNQHQLVLSGGKDDLGALEMLE
        ISRSNASAASPLPISSIPSPTLHNHKKRSSQMN+PILTASYQVHSLAMIEPTRSYAPLISSPPTPVAQNS+GM  KMGFNQHQLVLSGGKDDLGALEMLE
Subjt:  ISRSNASAASPLPISSIPSPTLHNHKKRSSQMNSPILTASYQVHSLAMIEPTRSYAPLISSPPTPVAQNSLGMAGKMGFNQHQLVLSGGKDDLGALEMLE

Query:  NSVKKLKSPHASPGPRLSKEQIDSAIELLTDWFNESCGSVSLSCLEHPKFKALLTQLGLPSLPRTDILGARLDSKFEEAKADSEARIRDAAFFQIASDGW
        NSVKKLKSPHASPGPRLSKEQIDSAIELLTDWF ESCGSVSLSC +HPKFKALL+QLGLPSLP+TDILGARLDSKFEEAKADSEARIRDAAFFQIASDGW
Subjt:  NSVKKLKSPHASPGPRLSKEQIDSAIELLTDWFNESCGSVSLSCLEHPKFKALLTQLGLPSLPRTDILGARLDSKFEEAKADSEARIRDAAFFQIASDGW

Query:  KNKNCCGEESVVKFIVNLPNGTTVFQKALFTGGLVSSKYAEEVILDTVNEICGSGLQRCVGIIADKYKAKALRNLEIKNHWMVNLSCQLQGFISLIKDFN
        KNKNCC EESVVKF+VNLPNGTTVFQKALFTGGLVSSKYAEEVILDTVNEICGSGLQ+CVGIIAD+YKAKALRNLEIKNHWMVNLSCQLQGFISLIKDFN
Subjt:  KNKNCCGEESVVKFIVNLPNGTTVFQKALFTGGLVSSKYAEEVILDTVNEICGSGLQRCVGIIADKYKAKALRNLEIKNHWMVNLSCQLQGFISLIKDFN

Query:  KELPLFRAVTENCLKVANFVNTNAHIRNCINKYKVQELEGHWLLHVPSPNCDTSKNFSPVYAMLDDMLNCAHVLQMVVLDESYKLACMEDSLATEVSSLI
        KELPLFRAVTENCLKVANFVNT + +RNCINKYKVQELEGHWLLHVPSPNCDTSKNFSPVY+MLDDMLNC HVLQMVVLDESYK+ACMEDSLATEVSSLI
Subjt:  KELPLFRAVTENCLKVANFVNTNAHIRNCINKYKVQELEGHWLLHVPSPNCDTSKNFSPVYAMLDDMLNCAHVLQMVVLDESYKLACMEDSLATEVSSLI

Query:  QNERFWDEMEAVHSFVR
        QNERFWDE+EAVHSFV+
Subjt:  QNERFWDEMEAVHSFVR

A0A1S3CT77 uncharacterized protein LOC103504669 isoform X3 | e value: 9.2e-287 | % identity: 94.97
Query:  MASTNSPPNIDASALTEDLATKALNKRYECLVTVRTKAIRGKGAWYWAHLEPVLIRNPTNSLPKAVKLKCSLCDSVFSASNPSRTDSEHLKRGTCPNLSS
        MASTNSPPNID+S LTEDLATKALNKRYECLVTVRTKAI+GKGAWYWAHLEPVLIRNPTNSLPKAVKLKCSLCDSVFSASNPSRT SEHLKRGTCPNLSS
Subjt:  MASTNSPPNIDASALTEDLATKALNKRYECLVTVRTKAIRGKGAWYWAHLEPVLIRNPTNSLPKAVKLKCSLCDSVFSASNPSRTDSEHLKRGTCPNLSS

Query:  ISRSNASAASPLPISSIPSPTLHNHKKRSSQMNSPILTASYQVHSLAMIEPTRSYAPLISSPPTPVAQNSLGMAGKMGFNQHQLVLSGGKDDLGALEMLE
        ISRSNASAASPLPISSIPSPTLHNHKKRSSQMN+PILTASYQVHSLAMIEPTRSYAPLISSPPTPVAQNS+GM  KMGFNQHQLVLSGGKDDLGALEMLE
Subjt:  ISRSNASAASPLPISSIPSPTLHNHKKRSSQMNSPILTASYQVHSLAMIEPTRSYAPLISSPPTPVAQNSLGMAGKMGFNQHQLVLSGGKDDLGALEMLE

Query:  NSVKKLKSPHASPGPRLSKEQIDSAIELLTDWFNESCGSVSLSCLEHPKFKALLTQLGLPSLPRTDILGARLDSKFEEAKADSEARIRDAAFFQIASDGW
        NSVKKLKSPHASPGPRLSKEQIDSAIELLTDWF ESCGSVSLSC +HPKFKALL+QLGLPSLP+TDILGARLDSKFEEAKADSEARIRDAAFFQIASDGW
Subjt:  NSVKKLKSPHASPGPRLSKEQIDSAIELLTDWFNESCGSVSLSCLEHPKFKALLTQLGLPSLPRTDILGARLDSKFEEAKADSEARIRDAAFFQIASDGW

Query:  KNKNCCGEESVVKFIVNLPNGTTVFQKALFTGGLVSSKYAEEVILDTVNEICGSGLQRCVGIIADKYKAKALRNLEIKNHWMVNLSCQLQGFISLIKDFN
        KNKNCC EESVVKF+VNLPNGTTVFQKALFTGGLVSSKYAEEVILDTVNEICGSGLQ+CVGIIAD+YKAKALRNLEIKNHWMVNLSCQLQGFISLIKDFN
Subjt:  KNKNCCGEESVVKFIVNLPNGTTVFQKALFTGGLVSSKYAEEVILDTVNEICGSGLQRCVGIIADKYKAKALRNLEIKNHWMVNLSCQLQGFISLIKDFN

Query:  KELPLFRAVTENCLKVANFVNTNAHIRNCINKYKVQELEGHWLLHVPSPNCDTSKNFSPVYAMLDDMLNCAHVLQMVVLDESYKLACMEDSLATEVSSLI
        KELPLFRAVTENCLKVANFVNT + +RNCINKYKVQELEGHWLLHVPSPNCDTSKNFSPVY+MLDDMLNC HVLQMVVLDESYK+ACMEDSLATEVSSLI
Subjt:  KELPLFRAVTENCLKVANFVNTNAHIRNCINKYKVQELEGHWLLHVPSPNCDTSKNFSPVYAMLDDMLNCAHVLQMVVLDESYKLACMEDSLATEVSSLI

Query:  QNERFWDEMEAVHSFVR
        QNERFWDE+EAVHSFV+
Subjt:  QNERFWDEMEAVHSFVR

A0A1S3CTD6 uncharacterized protein LOC103504669 isoform X2 | e value: 9.2e-287 | % identity: 94.97
Query:  MASTNSPPNIDASALTEDLATKALNKRYECLVTVRTKAIRGKGAWYWAHLEPVLIRNPTNSLPKAVKLKCSLCDSVFSASNPSRTDSEHLKRGTCPNLSS
        MASTNSPPNID+S LTEDLATKALNKRYECLVTVRTKAI+GKGAWYWAHLEPVLIRNPTNSLPKAVKLKCSLCDSVFSASNPSRT SEHLKRGTCPNLSS
Subjt:  MASTNSPPNIDASALTEDLATKALNKRYECLVTVRTKAIRGKGAWYWAHLEPVLIRNPTNSLPKAVKLKCSLCDSVFSASNPSRTDSEHLKRGTCPNLSS

Query:  ISRSNASAASPLPISSIPSPTLHNHKKRSSQMNSPILTASYQVHSLAMIEPTRSYAPLISSPPTPVAQNSLGMAGKMGFNQHQLVLSGGKDDLGALEMLE
        ISRSNASAASPLPISSIPSPTLHNHKKRSSQMN+PILTASYQVHSLAMIEPTRSYAPLISSPPTPVAQNS+GM  KMGFNQHQLVLSGGKDDLGALEMLE
Subjt:  ISRSNASAASPLPISSIPSPTLHNHKKRSSQMNSPILTASYQVHSLAMIEPTRSYAPLISSPPTPVAQNSLGMAGKMGFNQHQLVLSGGKDDLGALEMLE

Query:  NSVKKLKSPHASPGPRLSKEQIDSAIELLTDWFNESCGSVSLSCLEHPKFKALLTQLGLPSLPRTDILGARLDSKFEEAKADSEARIRDAAFFQIASDGW
        NSVKKLKSPHASPGPRLSKEQIDSAIELLTDWF ESCGSVSLSC +HPKFKALL+QLGLPSLP+TDILGARLDSKFEEAKADSEARIRDAAFFQIASDGW
Subjt:  NSVKKLKSPHASPGPRLSKEQIDSAIELLTDWFNESCGSVSLSCLEHPKFKALLTQLGLPSLPRTDILGARLDSKFEEAKADSEARIRDAAFFQIASDGW

Query:  KNKNCCGEESVVKFIVNLPNGTTVFQKALFTGGLVSSKYAEEVILDTVNEICGSGLQRCVGIIADKYKAKALRNLEIKNHWMVNLSCQLQGFISLIKDFN
        KNKNCC EESVVKF+VNLPNGTTVFQKALFTGGLVSSKYAEEVILDTVNEICGSGLQ+CVGIIAD+YKAKALRNLEIKNHWMVNLSCQLQGFISLIKDFN
Subjt:  KNKNCCGEESVVKFIVNLPNGTTVFQKALFTGGLVSSKYAEEVILDTVNEICGSGLQRCVGIIADKYKAKALRNLEIKNHWMVNLSCQLQGFISLIKDFN

Query:  KELPLFRAVTENCLKVANFVNTNAHIRNCINKYKVQELEGHWLLHVPSPNCDTSKNFSPVYAMLDDMLNCAHVLQMVVLDESYKLACMEDSLATEVSSLI
        KELPLFRAVTENCLKVANFVNT + +RNCINKYKVQELEGHWLLHVPSPNCDTSKNFSPVY+MLDDMLNC HVLQMVVLDESYK+ACMEDSLATEVSSLI
Subjt:  KELPLFRAVTENCLKVANFVNTNAHIRNCINKYKVQELEGHWLLHVPSPNCDTSKNFSPVYAMLDDMLNCAHVLQMVVLDESYKLACMEDSLATEVSSLI

Query:  QNERFWDEMEAVHSFVR
        QNERFWDE+EAVHSFV+
Subjt:  QNERFWDEMEAVHSFVR

A0A5A7ULG2 Uncharacterized protein | e value: 9.2e-287 | % identity: 94.97
Query:  MASTNSPPNIDASALTEDLATKALNKRYECLVTVRTKAIRGKGAWYWAHLEPVLIRNPTNSLPKAVKLKCSLCDSVFSASNPSRTDSEHLKRGTCPNLSS
        MASTNSPPNID+S LTEDLATKALNKRYECLVTVRTKAI+GKGAWYWAHLEPVLIRNPTNSLPKAVKLKCSLCDSVFSASNPSRT SEHLKRGTCPNLSS
Subjt:  MASTNSPPNIDASALTEDLATKALNKRYECLVTVRTKAIRGKGAWYWAHLEPVLIRNPTNSLPKAVKLKCSLCDSVFSASNPSRTDSEHLKRGTCPNLSS

Query:  ISRSNASAASPLPISSIPSPTLHNHKKRSSQMNSPILTASYQVHSLAMIEPTRSYAPLISSPPTPVAQNSLGMAGKMGFNQHQLVLSGGKDDLGALEMLE
        ISRSNASAASPLPISSIPSPTLHNHKKRSSQMN+PILTASYQVHSLAMIEPTRSYAPLISSPPTPVAQNS+GM  KMGFNQHQLVLSGGKDDLGALEMLE
Subjt:  ISRSNASAASPLPISSIPSPTLHNHKKRSSQMNSPILTASYQVHSLAMIEPTRSYAPLISSPPTPVAQNSLGMAGKMGFNQHQLVLSGGKDDLGALEMLE

Query:  NSVKKLKSPHASPGPRLSKEQIDSAIELLTDWFNESCGSVSLSCLEHPKFKALLTQLGLPSLPRTDILGARLDSKFEEAKADSEARIRDAAFFQIASDGW
        NSVKKLKSPHASPGPRLSKEQIDSAIELLTDWF ESCGSVSLSC +HPKFKALL+QLGLPSLP+TDILGARLDSKFEEAKADSEARIRDAAFFQIASDGW
Subjt:  NSVKKLKSPHASPGPRLSKEQIDSAIELLTDWFNESCGSVSLSCLEHPKFKALLTQLGLPSLPRTDILGARLDSKFEEAKADSEARIRDAAFFQIASDGW

Query:  KNKNCCGEESVVKFIVNLPNGTTVFQKALFTGGLVSSKYAEEVILDTVNEICGSGLQRCVGIIADKYKAKALRNLEIKNHWMVNLSCQLQGFISLIKDFN
        KNKNCC EESVVKF+VNLPNGTTVFQKALFTGGLVSSKYAEEVILDTVNEICGSGLQ+CVGIIAD+YKAKALRNLEIKNHWMVNLSCQLQGFISLIKDFN
Subjt:  KNKNCCGEESVVKFIVNLPNGTTVFQKALFTGGLVSSKYAEEVILDTVNEICGSGLQRCVGIIADKYKAKALRNLEIKNHWMVNLSCQLQGFISLIKDFN

Query:  KELPLFRAVTENCLKVANFVNTNAHIRNCINKYKVQELEGHWLLHVPSPNCDTSKNFSPVYAMLDDMLNCAHVLQMVVLDESYKLACMEDSLATEVSSLI
        KELPLFRAVTENCLKVANFVNT + +RNCINKYKVQELEGHWLLHVPSPNCDTSKNFSPVY+MLDDMLNC HVLQMVVLDESYK+ACMEDSLATEVSSLI
Subjt:  KELPLFRAVTENCLKVANFVNTNAHIRNCINKYKVQELEGHWLLHVPSPNCDTSKNFSPVYAMLDDMLNCAHVLQMVVLDESYKLACMEDSLATEVSSLI

Query:  QNERFWDEMEAVHSFVR
        QNERFWDE+EAVHSFV+
Subjt:  QNERFWDEMEAVHSFVR

A0A5D3DB26 Uncharacterized protein | e value: 9.2e-287 | % identity: 94.97
Query:  MASTNSPPNIDASALTEDLATKALNKRYECLVTVRTKAIRGKGAWYWAHLEPVLIRNPTNSLPKAVKLKCSLCDSVFSASNPSRTDSEHLKRGTCPNLSS
        MASTNSPPNID+S LTEDLATKALNKRYECLVTVRTKAI+GKGAWYWAHLEPVLIRNPTNSLPKAVKLKCSLCDSVFSASNPSRT SEHLKRGTCPNLSS
Subjt:  MASTNSPPNIDASALTEDLATKALNKRYECLVTVRTKAIRGKGAWYWAHLEPVLIRNPTNSLPKAVKLKCSLCDSVFSASNPSRTDSEHLKRGTCPNLSS

Query:  ISRSNASAASPLPISSIPSPTLHNHKKRSSQMNSPILTASYQVHSLAMIEPTRSYAPLISSPPTPVAQNSLGMAGKMGFNQHQLVLSGGKDDLGALEMLE
        ISRSNASAASPLPISSIPSPTLHNHKKRSSQMN+PILTASYQVHSLAMIEPTRSYAPLISSPPTPVAQNS+GM  KMGFNQHQLVLSGGKDDLGALEMLE
Subjt:  ISRSNASAASPLPISSIPSPTLHNHKKRSSQMNSPILTASYQVHSLAMIEPTRSYAPLISSPPTPVAQNSLGMAGKMGFNQHQLVLSGGKDDLGALEMLE

Query:  NSVKKLKSPHASPGPRLSKEQIDSAIELLTDWFNESCGSVSLSCLEHPKFKALLTQLGLPSLPRTDILGARLDSKFEEAKADSEARIRDAAFFQIASDGW
        NSVKKLKSPHASPGPRLSKEQIDSAIELLTDWF ESCGSVSLSC +HPKFKALL+QLGLPSLP+TDILGARLDSKFEEAKADSEARIRDAAFFQIASDGW
Subjt:  NSVKKLKSPHASPGPRLSKEQIDSAIELLTDWFNESCGSVSLSCLEHPKFKALLTQLGLPSLPRTDILGARLDSKFEEAKADSEARIRDAAFFQIASDGW

Query:  KNKNCCGEESVVKFIVNLPNGTTVFQKALFTGGLVSSKYAEEVILDTVNEICGSGLQRCVGIIADKYKAKALRNLEIKNHWMVNLSCQLQGFISLIKDFN
        KNKNCC EESVVKF+VNLPNGTTVFQKALFTGGLVSSKYAEEVILDTVNEICGSGLQ+CVGIIAD+YKAKALRNLEIKNHWMVNLSCQLQGFISLIKDFN
Subjt:  KNKNCCGEESVVKFIVNLPNGTTVFQKALFTGGLVSSKYAEEVILDTVNEICGSGLQRCVGIIADKYKAKALRNLEIKNHWMVNLSCQLQGFISLIKDFN

Query:  KELPLFRAVTENCLKVANFVNTNAHIRNCINKYKVQELEGHWLLHVPSPNCDTSKNFSPVYAMLDDMLNCAHVLQMVVLDESYKLACMEDSLATEVSSLI
        KELPLFRAVTENCLKVANFVNT + +RNCINKYKVQELEGHWLLHVPSPNCDTSKNFSPVY+MLDDMLNC HVLQMVVLDESYK+ACMEDSLATEVSSLI
Subjt:  KELPLFRAVTENCLKVANFVNTNAHIRNCINKYKVQELEGHWLLHVPSPNCDTSKNFSPVYAMLDDMLNCAHVLQMVVLDESYKLACMEDSLATEVSSLI

Query:  QNERFWDEMEAVHSFVR
        QNERFWDE+EAVHSFV+
Subjt:  QNERFWDEMEAVHSFVR

SwissProt top hits | e value | % identity | Alignment
No hits found
Arabidopsis top hits | e value | % identity | Alignment
AT1G12380.1 unknown protein | e value: 8.3e-147 | % identity: 51.12
Query:  SPPNIDASALTEDLATKALNKRYECLVTVRTKAIRGKGAWYWAHLEPVLIRNPTNSLPKAVKLKCSLCDSVFSASNPSRTDSEHLKRGTCPNLSSISRSN
        +PP +D    T++L  KALNKRYE L+TVRTKA++GKGAWYW HLEP+L+RN    LPKAVKL+CSLCD+VFSASNPSRT SEHLKRGTCPN +S+  + 
Subjt:  SPPNIDASALTEDLATKALNKRYECLVTVRTKAIRGKGAWYWAHLEPVLIRNPTNSLPKAVKLKCSLCDSVFSASNPSRTDSEHLKRGTCPNLSSISRSN

Query:  ASAASPLPISSIPSPTLHNHKKRS---------SQMNSPILTASYQVHSLAMIEPTRSYAPLI--SSPPTPVAQNSLGMAGKMGFNQHQLVLSGGKDDLG
         S  +P P SS  SP  H+ K+ S         S++N P +  SY V  + +++P+R     +  S+PP P               QH L+LSGGKDDLG
Subjt:  ASAASPLPISSIPSPTLHNHKKRS---------SQMNSPILTASYQVHSLAMIEPTRSYAPLI--SSPPTPVAQNSLGMAGKMGFNQHQLVLSGGKDDLG

Query:  ALEMLENSVKKLKSPHASPGPRLSKEQIDSAIELLTDWFNESCGSVSLSCLEHPKFKALLTQLGLPSLPRTDILGARLDSKFEEAKADSEARIRDAAFFQ
         L MLE+SVKKLKSP  S    L++ QI+SA++ L+DW  ESCGSVSLS LEHPKF+A LTQ+GLP + + D    RLD K EEA+A++E+RIRDA FFQ
Subjt:  ALEMLENSVKKLKSPHASPGPRLSKEQIDSAIELLTDWFNESCGSVSLSCLEHPKFKALLTQLGLPSLPRTDILGARLDSKFEEAKADSEARIRDAAFFQ

Query:  IASDGWKNKNCCGEESVVKFIVNLPNGTTVFQKALFTGGLVSSKYAEEVILDTVNEICGSGLQRCVGIIADKYKAKALRNLEIKNHWMVNLSCQLQGFIS
        I+SDGWK       ES+V  IVNLPNGT+++++A+   G V S YAEEV+L+TV  ICG+  QRCVGI++DK+K KALRNLE ++ WMVNLSCQ QG  S
Subjt:  IASDGWKNKNCCGEESVVKFIVNLPNGTTVFQKALFTGGLVSSKYAEEVILDTVNEICGSGLQRCVGIIADKYKAKALRNLEIKNHWMVNLSCQLQGFIS

Query:  LIKDFNKELPLFRAVTENCLKVANFVNTNAHIRNCINKYKVQELEGHWLLHVP--------SPNCDTSKN-------FSPVYAMLDDMLNCAHVLQMVVL
        LIKDF KELPLF++V++NC+++A F+N  A IRN   KY++QE     +L +P          +C +S +       + P++ +L+D+L+ A  +Q+VV 
Subjt:  LIKDFNKELPLFRAVTENCLKVANFVNTNAHIRNCINKYKVQELEGHWLLHVP--------SPNCDTSKN-------FSPVYAMLDDMLNCAHVLQMVVL

Query:  DESYKLACMEDSLATEVSSLIQNERFWDEMEAVHSFVR
        D++ K+  MED +A EV  ++ +E FW+E+EAVH+ ++
Subjt:  DESYKLACMEDSLATEVSSLIQNERFWDEMEAVHSFVR

AT1G62870.1 unknown protein | e value: 5.7e-140 | % identity: 51.27
Query:  EDLATKALNKRYECLVTVRTKAIRGKGAWYWAHLEPVLIRNPTNSLPKAVKLKCSLCDSVFSASNPSRTDSEHLKRGTCPNLSSISRSNASAASPLPISS
        E+LATKAL KRYE L+ VRTKA++GKGAWYW+HLEP+L+ N     PKAVKL+CSLCD+VFSASNPSRT SEHLKRGTCPN +S+ +   S  SP P   
Subjt:  EDLATKALNKRYECLVTVRTKAIRGKGAWYWAHLEPVLIRNPTNSLPKAVKLKCSLCDSVFSASNPSRTDSEHLKRGTCPNLSSISRSNASAASPLPISS

Query:  IPSPTLHNHKKRSSQMNSPI----------LTASYQVHSLAMIEPTRSYAPLISSPPTPVAQNSLGMAGKMGFNQHQLVLSGGKDDLGALEMLENSVKKL
         P P   +H+KR+S     +             SY V  L++++P+R           PV Q            Q  L+LSGGKDDLG L MLE+SVKKL
Subjt:  IPSPTLHNHKKRSSQMNSPI----------LTASYQVHSLAMIEPTRSYAPLISSPPTPVAQNSLGMAGKMGFNQHQLVLSGGKDDLGALEMLENSVKKL

Query:  KSPHASPGPRLSKEQIDSAIELLTDWFNESCGSVSLSCLEHPKFKALLTQLGLPSLPRTDILGARLDSKFEEAKADSEARIRDAAFFQIASDGWKNKNCC
        KSP  S    L+K QIDSA++ L+DW  ESCGSVSLS LEHPK +A LTQ+GLP + R D +  RLD K+E+++A++E+RI DA FFQIASDGWK  +  
Subjt:  KSPHASPGPRLSKEQIDSAIELLTDWFNESCGSVSLSCLEHPKFKALLTQLGLPSLPRTDILGARLDSKFEEAKADSEARIRDAAFFQIASDGWKNKNCC

Query:  GEESVVKFIVNLPNGTTVFQKALFTGGLVSSKYAEEVILDTVNEICGSGLQRCVGIIADKYKAKALRNLEIKNHWMVNLSCQLQGFISLIKDFNKELPLF
          E++V  IVNLPNGT+++++A+F  G V S YAEEV+ +TV  ICG+  QRCVGI++D++ +KALRNLE ++ WMVNLSCQ QGF SLI+DF KELPLF
Subjt:  GEESVVKFIVNLPNGTTVFQKALFTGGLVSSKYAEEVILDTVNEICGSGLQRCVGIIADKYKAKALRNLEIKNHWMVNLSCQLQGFISLIKDFNKELPLF

Query:  RAVTENCLKVANFVNTNAHIRNCINKYKVQELEGHWLLHVPSPNCDTSKNFSPVYAMLDDMLNCAHVLQMVVLDESYKLACMEDSLATEVSSLIQNERFW
        ++V+++C ++ NFVN+ A IRN + KY++QE     +LH+P      S  F P+Y +L+D+L+ A  +Q+V+ D+  K   MED +A EV  ++ +  FW
Subjt:  RAVTENCLKVANFVNTNAHIRNCINKYKVQELEGHWLLHVPSPNCDTSKNFSPVYAMLDDMLNCAHVLQMVVLDESYKLACMEDSLATEVSSLIQNERFW

Query:  DEMEAVHSFVR
        +E+EAV+  ++
Subjt:  DEMEAVHSFVR
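
In the alignments above, the unlabeled middle line of each block marks identical residues with the residue letter, conservative substitutions with `+`, and other mismatches with a space. As a rough sketch of how a per-block identity could be tallied from that midline, the snippet below counts each class for the final block of the first GenBank hit. The function name `midline_stats` is hypothetical, and the result covers one block only, so it is not expected to reproduce the whole-alignment % identity reported by the database.

```python
def midline_stats(midline):
    """Return (identical, positive, aligned) counts for a BLAST-style midline."""
    identical = sum(1 for ch in midline if ch.isalpha())  # letters mark exact matches
    positive = sum(1 for ch in midline if ch == "+")      # '+' marks conservative substitutions
    aligned = len(midline)                                # assumes the midline spans the whole block
    return identical, positive, aligned

# Final block of the first GenBank hit (query on top, midline below).
query = "QNERFWDEMEAVHSFVR"
midline = "QNERFWDE+EAVHSFV+"
assert len(query) == len(midline)

identical, positive, aligned = midline_stats(midline)
print(identical, positive, aligned, round(100.0 * identical / aligned, 2))  # 15 2 17 88.24
```

The midline must keep its original width for the counts to be meaningful; trailing spaces that were trimmed during copying would shorten the aligned length.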


Sequences
CDS sequence
ATGGCTTCCACGAACTCACCGCCCAACATTGATGCTTCGGCGTTGACGGAGGATTTAGCGACCAAGGCTTTGAATAAACGGTATGAATGCCTTGTAACTGTTCGAACAAA
GGCTATTAGGGGGAAAGGGGCTTGGTATTGGGCTCATTTGGAGCCTGTTCTTATACGAAATCCTACTAATAGTCTTCCCAAAGCGGTGAAACTCAAGTGTTCTTTGTGTG
ATTCTGTTTTTTCGGCTTCGAATCCTTCGCGAACTGATTCTGAGCATTTGAAACGAGGCACTTGTCCTAATTTGAGTTCCATTTCTCGGTCTAATGCTTCGGCAGCGTCG
CCGTTGCCGATATCGTCCATTCCTTCTCCGACATTGCACAACCACAAGAAGCGAAGCTCTCAAATGAATTCTCCAATTCTCACTGCTTCTTATCAAGTTCATTCTCTTGC
TATGATTGAGCCGACGCGTTCCTATGCTCCGCTAATTTCCTCGCCGCCGACACCAGTGGCTCAAAATTCGCTTGGGATGGCGGGTAAGATGGGGTTTAATCAGCATCAGT
TGGTGTTATCAGGTGGGAAAGATGATTTGGGTGCGCTAGAAATGCTGGAAAACAGTGTCAAGAAACTGAAGAGTCCACATGCCTCACCTGGACCAAGGCTAAGTAAGGAA
CAAATCGATTCTGCAATTGAATTACTGACTGACTGGTTTAATGAGTCATGTGGGTCAGTTTCATTATCGTGCCTTGAGCACCCCAAGTTTAAGGCCTTGCTTACTCAGTT
GGGTTTGCCTTCATTACCTCGAACCGACATTTTAGGTGCTCGGCTTGACTCCAAGTTTGAGGAGGCCAAAGCTGATTCAGAAGCCAGGATTAGAGATGCAGCGTTTTTTC
AAATTGCTTCAGATGGGTGGAAGAATAAGAATTGCTGTGGCGAAGAGAGTGTAGTTAAATTTATTGTCAATCTTCCAAATGGTACTACTGTATTTCAAAAAGCACTATTT
ACAGGGGGATTGGTGTCATCCAAGTATGCAGAAGAGGTTATTTTGGATACGGTCAACGAGATTTGTGGGAGTGGTCTGCAGAGATGTGTGGGGATAATTGCAGATAAGTA
TAAGGCCAAGGCGTTGAGGAATTTGGAGATAAAGAATCATTGGATGGTAAATCTCTCTTGCCAGCTTCAGGGTTTTATTAGTTTGATAAAGGATTTTAACAAAGAGCTTC
CACTTTTCAGGGCAGTCACTGAAAATTGCTTGAAGGTTGCAAACTTTGTAAACACCAACGCTCATATTAGAAATTGTATAAACAAGTACAAGGTGCAGGAGCTAGAGGGC
CATTGGTTGCTTCATGTTCCTTCTCCAAACTGTGACACATCCAAAAACTTTTCACCTGTTTATGCTATGCTTGATGATATGCTTAACTGCGCTCATGTGCTTCAAATGGT
TGTGTTAGACGAGTCATATAAGCTGGCATGCATGGAGGATTCACTTGCGACTGAGGTTTCTAGTCTGATACAAAATGAACGATTTTGGGATGAAATGGAAGCAGTTCATT
CATTTGTGAGATGA
mRNA sequence
ATGGCTTCCACGAACTCACCGCCCAACATTGATGCTTCGGCGTTGACGGAGGATTTAGCGACCAAGGCTTTGAATAAACGGTATGAATGCCTTGTAACTGTTCGAACAAA
GGCTATTAGGGGGAAAGGGGCTTGGTATTGGGCTCATTTGGAGCCTGTTCTTATACGAAATCCTACTAATAGTCTTCCCAAAGCGGTGAAACTCAAGTGTTCTTTGTGTG
ATTCTGTTTTTTCGGCTTCGAATCCTTCGCGAACTGATTCTGAGCATTTGAAACGAGGCACTTGTCCTAATTTGAGTTCCATTTCTCGGTCTAATGCTTCGGCAGCGTCG
CCGTTGCCGATATCGTCCATTCCTTCTCCGACATTGCACAACCACAAGAAGCGAAGCTCTCAAATGAATTCTCCAATTCTCACTGCTTCTTATCAAGTTCATTCTCTTGC
TATGATTGAGCCGACGCGTTCCTATGCTCCGCTAATTTCCTCGCCGCCGACACCAGTGGCTCAAAATTCGCTTGGGATGGCGGGTAAGATGGGGTTTAATCAGCATCAGT
TGGTGTTATCAGGTGGGAAAGATGATTTGGGTGCGCTAGAAATGCTGGAAAACAGTGTCAAGAAACTGAAGAGTCCACATGCCTCACCTGGACCAAGGCTAAGTAAGGAA
CAAATCGATTCTGCAATTGAATTACTGACTGACTGGTTTAATGAGTCATGTGGGTCAGTTTCATTATCGTGCCTTGAGCACCCCAAGTTTAAGGCCTTGCTTACTCAGTT
GGGTTTGCCTTCATTACCTCGAACCGACATTTTAGGTGCTCGGCTTGACTCCAAGTTTGAGGAGGCCAAAGCTGATTCAGAAGCCAGGATTAGAGATGCAGCGTTTTTTC
AAATTGCTTCAGATGGGTGGAAGAATAAGAATTGCTGTGGCGAAGAGAGTGTAGTTAAATTTATTGTCAATCTTCCAAATGGTACTACTGTATTTCAAAAAGCACTATTT
ACAGGGGGATTGGTGTCATCCAAGTATGCAGAAGAGGTTATTTTGGATACGGTCAACGAGATTTGTGGGAGTGGTCTGCAGAGATGTGTGGGGATAATTGCAGATAAGTA
TAAGGCCAAGGCGTTGAGGAATTTGGAGATAAAGAATCATTGGATGGTAAATCTCTCTTGCCAGCTTCAGGGTTTTATTAGTTTGATAAAGGATTTTAACAAAGAGCTTC
CACTTTTCAGGGCAGTCACTGAAAATTGCTTGAAGGTTGCAAACTTTGTAAACACCAACGCTCATATTAGAAATTGTATAAACAAGTACAAGGTGCAGGAGCTAGAGGGC
CATTGGTTGCTTCATGTTCCTTCTCCAAACTGTGACACATCCAAAAACTTTTCACCTGTTTATGCTATGCTTGATGATATGCTTAACTGCGCTCATGTGCTTCAAATGGT
TGTGTTAGACGAGTCATATAAGCTGGCATGCATGGAGGATTCACTTGCGACTGAGGTTTCTAGTCTGATACAAAATGAACGATTTTGGGATGAAATGGAAGCAGTTCATT
CATTTGTGAGATGA
Protein sequence
MASTNSPPNIDASALTEDLATKALNKRYECLVTVRTKAIRGKGAWYWAHLEPVLIRNPTNSLPKAVKLKCSLCDSVFSASNPSRTDSEHLKRGTCPNLSSISRSNASAAS
PLPISSIPSPTLHNHKKRSSQMNSPILTASYQVHSLAMIEPTRSYAPLISSPPTPVAQNSLGMAGKMGFNQHQLVLSGGKDDLGALEMLENSVKKLKSPHASPGPRLSKE
QIDSAIELLTDWFNESCGSVSLSCLEHPKFKALLTQLGLPSLPRTDILGARLDSKFEEAKADSEARIRDAAFFQIASDGWKNKNCCGEESVVKFIVNLPNGTTVFQKALF
TGGLVSSKYAEEVILDTVNEICGSGLQRCVGIIADKYKAKALRNLEIKNHWMVNLSCQLQGFISLIKDFNKELPLFRAVTENCLKVANFVNTNAHIRNCINKYKVQELEG
HWLLHVPSPNCDTSKNFSPVYAMLDDMLNCAHVLQMVVLDESYKLACMEDSLATEVSSLIQNERFWDEMEAVHSFVR
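
The relationship between the CDS and protein sequences above can be checked with a short translation sketch. It assumes the standard nuclear genetic code, which the record does not name explicitly, and the helper `translate` is a hypothetical name for illustration. Only the first and last stretches of the CDS are used here to keep the example short.

```python
BASES = "TCAG"
# Standard genetic code, listed in TCAG order for the first, second and third base.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

def translate(cds):
    """Translate a coding sequence codon by codon, stopping at the first stop codon."""
    protein = []
    usable = len(cds) - len(cds) % 3  # ignore any incomplete trailing codon
    for pos in range(0, usable, 3):
        residue = CODON_TABLE[cds[pos:pos + 3].upper()]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)

# First 36 nt and last 27 nt of the CDS sequence listed above.
print(translate("ATGGCTTCCACGAACTCACCGCCCAACATTGATGCT"))  # MASTNSPPNIDA
print(translate("GAAGCAGTTCATTCATTTGTGAGATGA"))           # EAVHSFVR
```

Both fragments agree with the start and end of the protein sequence on the page. Translating the full CDS would require joining its lines into one string first.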