CuGenDBv2

Lag0003152 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0003152
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: inter-alpha-trypsin inhibitor heavy chain-related
Genome location: chr4:48497423..48503913
RNA-Seq Expression: Lag0003152
Synteny: Lag0003152
Gene Ontology terms:
GO:0006165 - nucleoside diphosphate phosphorylation (biological process)
GO:0006183 - GTP biosynthetic process (biological process)
GO:0006228 - UTP biosynthetic process (biological process)
GO:0006241 - CTP biosynthetic process (biological process)
GO:0016021 - integral component of membrane (cellular component)
GO:0004550 - nucleoside diphosphate kinase activity (molecular function)
InterPro domains:
IPR002035 - von Willebrand factor, type A
IPR036465 - von Willebrand factor A-like domain superfamily


Homology
GenBank top hits (e-value, % identity, alignment)
KAA0059388.1 von Willebrand factor A domain-containing protein 5A isoform X2 [Cucumis melo var. makuwa] | e-value: 0.0e+00 | % identity: 91.9
Query:  MAAEFSNCVEYGLHLSKRIYYGKGSAPAALARQMSRVSEDYLPTAPMVYAVIPEPTIVDNPDVPSYQPYVHGRCLPPALIPLHMNGVAMEIDCCFDTAFI
        MA EFSN VEYGLHLSKRIYYGKGSAPAALARQMSRVSEDYLP APMVYAVIPEPTIVDNPDVPSYQPYVHGRC+PPALIPLHMNGV+MEI+CCFDTAFI
Subjt:  MAAEFSNCVEYGLHLSKRIYYGKGSAPAALARQMSRVSEDYLPTAPMVYAVIPEPTIVDNPDVPSYQPYVHGRCLPPALIPLHMNGVAMEIDCCFDTAFI

Query:  GVSGTWRVHCVMAGKGCECLIAVPMGEQGSLLGVEVDVAGSSHRTELVSMEDAEAIEKLAKSEDGKFLKGRRIYTLKIPKVDGGCTLSVRINWSQRIPYL
        GV+GTWRVHCVMAGK CECLIAVPMGEQGSLLGVEVDV G+ HRTELVSMEDAEAIEKLAKSEDGKFLKGRRIYTLKIPKV+GGCTLSVRINWSQRIPY+
Subjt:  GVSGTWRVHCVMAGKGCECLIAVPMGEQGSLLGVEVDVAGSSHRTELVSMEDAEAIEKLAKSEDGKFLKGRRIYTLKIPKVDGGCTLSVRINWSQRIPYL

Query:  DDLFCLSVPFNFPAYLVPPGKKVKNSQKILLHINAGISSEVVCKYTSHPMKILRREVGNLSFSNEAEVSAWSNMDFDLSYSISPSDLFGGVLLQSPSLHD
        DDLFCLSVPF+FPAYLVPPGKK KNSQKILLHIN+GISSEVVCK+TSHPMKILRREVGN+SFSNEAEVSAWSNMDFDLSYSISP+DLFGGVLLQ+PSLHD
Subjt:  DDLFCLSVPFNFPAYLVPPGKKVKNSQKILLHINAGISSEVVCKYTSHPMKILRREVGNLSFSNEAEVSAWSNMDFDLSYSISPSDLFGGVLLQSPSLHD

Query:  FDQREMFCLYIFPGQNQNRKVFRKEVVFIIDISGSMKGGPLESSKRAVLASLSKLSPEDTFNIIGFNGDTKLFSLSMEQATNEAITRATEWIDANLVANG
        FDQREMFCLYIFPGQN NR+VFRKEVVFIIDISGSMK GPLES+KRAVLASLSKL+PED FNIIGFNGDTKLFSLSMEQAT EAITRAT+WI+ANLVANG
Subjt:  FDQREMFCLYIFPGQNQNRKVFRKEVVFIIDISGSMKGGPLESSKRAVLASLSKLSPEDTFNIIGFNGDTKLFSLSMEQATNEAITRATEWIDANLVANG

Query:  GTNILLPMEQAIKMLAETGNSIPLIFLVTDGSVDNEREICNLVKASLKSENTISPRLCTFGVGTFCNHYFLQMLSEIGRGIYDAAYDVDLIDSRFQLLFA
        GTNILLP+EQAIKMLAETGNSIPLIFL+TDGSVDNEREICN+VKASLKS NTISPRLCTFG+GTFCNHYFLQMLSEIGRGIYDAAYDVDLIDSRFQ LF 
Subjt:  GTNILLPMEQAIKMLAETGNSIPLIFLVTDGSVDNEREICNLVKASLKSENTISPRLCTFGVGTFCNHYFLQMLSEIGRGIYDAAYDVDLIDSRFQLLFA

Query:  KASSLFLANITVDAFKHLDSLELFPTQIPDLACGSPLIISGRYNGSFPESFIVSGTSADMSNSTIHLKPQRAKELLLDRVLARRQIDIMTSHAWLLESKD
        KASS+FLANITVDAFKHLDS ELFPTQIPDLACGSPLIISGRYNG FPESF VSGTSADMSNSTIHL+PQRAKELLLDRVLARRQIDIMTSHAWLLESKD
Subjt:  KASSLFLANITVDAFKHLDSLELFPTQIPDLACGSPLIISGRYNGSFPESFIVSGTSADMSNSTIHLKPQRAKELLLDRVLARRQIDIMTSHAWLLESKD

Query:  LENKVAKMSKQTGFPSEYTRLILMLTDEGKKAPPSIILQEMRKRFDMSKSNKVEWNGQKIILLGNQGVGFGNLAATAENMQPGKEIKPTQATDLLVKAAT
        L++K+AK+SKQ+GFPSEYTRLIL+L  EGKKAPPSII QEMRKRFD++KSNKVEW GQKIILLGNQGVGFGNL ATAEN+QPGKEIK TQATDLLVKAAT
Subjt:  LENKVAKMSKQTGFPSEYTRLILMLTDEGKKAPPSIILQEMRKRFDMSKSNKVEWNGQKIILLGNQGVGFGNLAATAENMQPGKEIKPTQATDLLVKAAT

Query:  NCCA
        NCC+
Subjt:  NCCA

KAG6575435.1 putative nucleoside diphosphate kinase 5, partial [Cucurbita argyrosperma subsp. sororia] | e-value: 0.0e+00 | % identity: 91.89
Query:  MAAEFSNCVEYGLHLSKRIYYGKGSAPAALARQMSRVSEDYLPTAPMVYAVIPEPTIVDNPDVPSYQPYVHGRCLPPALIPLHMNGVAMEIDCCFDTAFI
        MA EFSNCVEYGLHLSKRIYYGKGSAPAAL RQMSRVSEDYLPTAPMVYAVIP+PTIVDNPDVPSYQPYVHGRC+PP LIPLHMNGVAMEIDCCFDTAFI
Subjt:  MAAEFSNCVEYGLHLSKRIYYGKGSAPAALARQMSRVSEDYLPTAPMVYAVIPEPTIVDNPDVPSYQPYVHGRCLPPALIPLHMNGVAMEIDCCFDTAFI

Query:  GVSGTWRVHCVMAGKGCECLIAVPMGEQGSLLGVEVDVAGSSHRTELVSMEDAEAIEKLAKSEDGKFLKGRRIYTLKIPKVDGGCTLSVRINWSQRIPYL
        GVSGTWRVHCVMAGKGCECLIAVPMGEQGSLLGVEVDV G+S+RTELVSMEDAEAI+KLAKSEDGKFLKGRRIYTLKIPKVDGGCTLSVRINWSQRIPYL
Subjt:  GVSGTWRVHCVMAGKGCECLIAVPMGEQGSLLGVEVDVAGSSHRTELVSMEDAEAIEKLAKSEDGKFLKGRRIYTLKIPKVDGGCTLSVRINWSQRIPYL

Query:  DDLFCLSVPFNFPAYLVPPGKKVKNSQKILLHINAGISSEVVCKYTSHPMKILRREVGNLSFSNEAEVSAWSNMDFDLSYSISPSDLFGGVLLQSPSLHD
        DDLFCL+VPFNFPAYLVPPGKKVKN QKILLHIN+GISSE+VCK+TSHPMKIL REVG+LSFSNE EVSAWSN+DFDLSYSISPSDLFGGVLLQSPSLHD
Subjt:  DDLFCLSVPFNFPAYLVPPGKKVKNSQKILLHINAGISSEVVCKYTSHPMKILRREVGNLSFSNEAEVSAWSNMDFDLSYSISPSDLFGGVLLQSPSLHD

Query:  FDQREMFCLYIFPGQNQNRKVFRKEVVFIIDISGSMKGGPLESSKRAVLASLSKLSPEDTFNIIGFNGDTKLFSLSMEQATNEAITRATEWIDANLVANG
        FDQREMFCLYIFPGQNQNRKVFRKEVVFIIDISGSMKGGPLES+KRAVLAS+SKL+ EDTFNII FNGDTKLFS SMEQATNEAITRATEWIDANLVANG
Subjt:  FDQREMFCLYIFPGQNQNRKVFRKEVVFIIDISGSMKGGPLESSKRAVLASLSKLSPEDTFNIIGFNGDTKLFSLSMEQATNEAITRATEWIDANLVANG

Query:  GTNILLPMEQAIKMLAETGNSIPLIFLVTDGSVDNEREICNLVKASLKSENTISPRLCTFGVGTFCNHYFLQMLSEIGRGIYDAAYDVDLIDSRFQLLFA
        GTNILLP+EQAIKMLAETGNSIPLIFL+TDGSV+NEREICNLVKASLKS  TISPRLCTFG+GT+CNHYFLQMLSEIGRG+YDAAYDVDLIDSRFQ+LF 
Subjt:  GTNILLPMEQAIKMLAETGNSIPLIFLVTDGSVDNEREICNLVKASLKSENTISPRLCTFGVGTFCNHYFLQMLSEIGRGIYDAAYDVDLIDSRFQLLFA

Query:  KASSLFLANITVDAFKHLDSLELFPTQIPDLACGSPLIISGRYNGSFPESFIVSGTSADMSNSTIHLKPQRAKELLLDRVLARRQIDIMTSHAWLLESKD
        KASSLFLANITVD FKHL SLELFPTQIPDLACGSPLI+SGRYNGSFPESF VSG  ADMS+S IHLK QRAKELLL+RVLARRQIDIMTSHAWLLESKD
Subjt:  KASSLFLANITVDAFKHLDSLELFPTQIPDLACGSPLIISGRYNGSFPESFIVSGTSADMSNSTIHLKPQRAKELLLDRVLARRQIDIMTSHAWLLESKD

Query:  LENKVAKMSKQTGFPSEYTRLILMLTDEGKKAPPSIILQEMRKRFDMSKSNKVEWNGQKIILLGNQGVGFGNLAATAENMQPGKEIKPTQATDLLVKAAT
        LE+K+A MSKQTGFPSEYTRLILMLT EGKKAPPSII+QEMRKR DM KSNKVEW GQ+I+LLGNQGVGFGNL ATAEN+QPGKEIK TQATDLLVKAAT
Subjt:  LENKVAKMSKQTGFPSEYTRLILMLTDEGKKAPPSIILQEMRKRFDMSKSNKVEWNGQKIILLGNQGVGFGNLAATAENMQPGKEIKPTQATDLLVKAAT

Query:  NCC
        NCC
Subjt:  NCC

TYK03939.1 von Willebrand factor A domain-containing protein 5A isoform X2 [Cucumis melo var. makuwa] | e-value: 0.0e+00 | % identity: 91.9
Query:  MAAEFSNCVEYGLHLSKRIYYGKGSAPAALARQMSRVSEDYLPTAPMVYAVIPEPTIVDNPDVPSYQPYVHGRCLPPALIPLHMNGVAMEIDCCFDTAFI
        MA EFSN VEYGLHLSKRIYYGKGSAPAALARQMSRVSEDYLP APMVYAVIPEPTIVDNPDVPSYQPYVHGRC+PPALIPLHMNGV+MEI+CCFDTAFI
Subjt:  MAAEFSNCVEYGLHLSKRIYYGKGSAPAALARQMSRVSEDYLPTAPMVYAVIPEPTIVDNPDVPSYQPYVHGRCLPPALIPLHMNGVAMEIDCCFDTAFI

Query:  GVSGTWRVHCVMAGKGCECLIAVPMGEQGSLLGVEVDVAGSSHRTELVSMEDAEAIEKLAKSEDGKFLKGRRIYTLKIPKVDGGCTLSVRINWSQRIPYL
        GV+GTWRVHCVMAGK CECLIAVPMGEQGSLLGVEVDV G+ HRTELVSMEDAEAIEKLAKSEDGKFLKGRRIYTLKIPKV+GGCTLSVRINWSQRIPY+
Subjt:  GVSGTWRVHCVMAGKGCECLIAVPMGEQGSLLGVEVDVAGSSHRTELVSMEDAEAIEKLAKSEDGKFLKGRRIYTLKIPKVDGGCTLSVRINWSQRIPYL

Query:  DDLFCLSVPFNFPAYLVPPGKKVKNSQKILLHINAGISSEVVCKYTSHPMKILRREVGNLSFSNEAEVSAWSNMDFDLSYSISPSDLFGGVLLQSPSLHD
        DDLFCLSVPF+FPAYLVPPGKK KNSQKILLHIN+GISSEVVCK+TSHPMKILRREVGN+SFSNEAEVSAWSNMDFDLSYSISP+DLFGGVLLQ+PSLHD
Subjt:  DDLFCLSVPFNFPAYLVPPGKKVKNSQKILLHINAGISSEVVCKYTSHPMKILRREVGNLSFSNEAEVSAWSNMDFDLSYSISPSDLFGGVLLQSPSLHD

Query:  FDQREMFCLYIFPGQNQNRKVFRKEVVFIIDISGSMKGGPLESSKRAVLASLSKLSPEDTFNIIGFNGDTKLFSLSMEQATNEAITRATEWIDANLVANG
        FDQREMFCLYIFPGQN NR+VFRKEVVFIIDISGSMK GPLES+KRAVLASLSKL+PED FNIIGFNGDTKLFSLSMEQAT EAITRAT+WI+ANLVANG
Subjt:  FDQREMFCLYIFPGQNQNRKVFRKEVVFIIDISGSMKGGPLESSKRAVLASLSKLSPEDTFNIIGFNGDTKLFSLSMEQATNEAITRATEWIDANLVANG

Query:  GTNILLPMEQAIKMLAETGNSIPLIFLVTDGSVDNEREICNLVKASLKSENTISPRLCTFGVGTFCNHYFLQMLSEIGRGIYDAAYDVDLIDSRFQLLFA
        GTNILLP+EQAIKMLAETGNSIPLIFL+TDGSVDNEREICN+VKASLKS NTISPRLCTFG+GTFCNHYFLQMLSEIGRGIYDAAYDVDLIDSRFQ LF 
Subjt:  GTNILLPMEQAIKMLAETGNSIPLIFLVTDGSVDNEREICNLVKASLKSENTISPRLCTFGVGTFCNHYFLQMLSEIGRGIYDAAYDVDLIDSRFQLLFA

Query:  KASSLFLANITVDAFKHLDSLELFPTQIPDLACGSPLIISGRYNGSFPESFIVSGTSADMSNSTIHLKPQRAKELLLDRVLARRQIDIMTSHAWLLESKD
        KASS+FLANITVDAFKHLDS ELFPTQIPDLACGSPLIISGRYNG FPESF VSGTSADMSNSTIHL+PQRAKELLLDRVLARRQIDIMTSHAWLLESKD
Subjt:  KASSLFLANITVDAFKHLDSLELFPTQIPDLACGSPLIISGRYNGSFPESFIVSGTSADMSNSTIHLKPQRAKELLLDRVLARRQIDIMTSHAWLLESKD

Query:  LENKVAKMSKQTGFPSEYTRLILMLTDEGKKAPPSIILQEMRKRFDMSKSNKVEWNGQKIILLGNQGVGFGNLAATAENMQPGKEIKPTQATDLLVKAAT
        L++K+AK+SKQ+GFPSEYTRLIL+L  EGKKAPPSII QEMRKRFD++KSNKVEW GQKIILLGNQGVGFGNL ATAEN+QPGKEIK TQATDLLVKAAT
Subjt:  LENKVAKMSKQTGFPSEYTRLILMLTDEGKKAPPSIILQEMRKRFDMSKSNKVEWNGQKIILLGNQGVGFGNLAATAENMQPGKEIKPTQATDLLVKAAT

Query:  NCCA
        NCC+
Subjt:  NCCA

XP_008462286.1 PREDICTED: von Willebrand factor A domain-containing protein 5A isoform X2 [Cucumis melo] | e-value: 0.0e+00 | % identity: 91.76
Query:  MAAEFSNCVEYGLHLSKRIYYGKGSAPAALARQMSRVSEDYLPTAPMVYAVIPEPTIVDNPDVPSYQPYVHGRCLPPALIPLHMNGVAMEIDCCFDTAFI
        MA EFSN VEYGLHLSKRIYYGKGSAPAALARQMSRVSEDYLP APMVYAVIPEPTIVDNPDVPSYQPYVHGRC+PPALIPLHMNGV+MEI+CCFDTAFI
Subjt:  MAAEFSNCVEYGLHLSKRIYYGKGSAPAALARQMSRVSEDYLPTAPMVYAVIPEPTIVDNPDVPSYQPYVHGRCLPPALIPLHMNGVAMEIDCCFDTAFI

Query:  GVSGTWRVHCVMAGKGCECLIAVPMGEQGSLLGVEVDVAGSSHRTELVSMEDAEAIEKLAKSEDGKFLKGRRIYTLKIPKVDGGCTLSVRINWSQRIPYL
        GV+GTWRVHCVMAGK CECLIAVPMGEQGSLLGVEVDV G+ HRTELVSMEDAEAIEKLAKSEDGKFLKGRRIYTLKIPKV+GGCTLSVRINWSQRIPY+
Subjt:  GVSGTWRVHCVMAGKGCECLIAVPMGEQGSLLGVEVDVAGSSHRTELVSMEDAEAIEKLAKSEDGKFLKGRRIYTLKIPKVDGGCTLSVRINWSQRIPYL

Query:  DDLFCLSVPFNFPAYLVPPGKKVKNSQKILLHINAGISSEVVCKYTSHPMKILRREVGNLSFSNEAEVSAWSNMDFDLSYSISPSDLFGGVLLQSPSLHD
        DDLFCLSVPF+FPAYLVPPGKK KNSQKILLHIN+GISSEVVCK+TSHPMKILRREVGN+SFSNEAEVSAWSNMDFDLSYSISP+DLFGGVLLQ+PSLHD
Subjt:  DDLFCLSVPFNFPAYLVPPGKKVKNSQKILLHINAGISSEVVCKYTSHPMKILRREVGNLSFSNEAEVSAWSNMDFDLSYSISPSDLFGGVLLQSPSLHD

Query:  FDQREMFCLYIFPGQNQNRKVFRKEVVFIIDISGSMKGGPLESSKRAVLASLSKLSPEDTFNIIGFNGDTKLFSLSMEQATNEAITRATEWIDANLVANG
        FDQR+MFCLYIFPGQN NR+VFRKEVVFIIDISGSMK GPLES+KRAVLASLSKL+PED FNIIGFNGDTKLFSLSMEQAT EAITRAT+WI+ANLVANG
Subjt:  FDQREMFCLYIFPGQNQNRKVFRKEVVFIIDISGSMKGGPLESSKRAVLASLSKLSPEDTFNIIGFNGDTKLFSLSMEQATNEAITRATEWIDANLVANG

Query:  GTNILLPMEQAIKMLAETGNSIPLIFLVTDGSVDNEREICNLVKASLKSENTISPRLCTFGVGTFCNHYFLQMLSEIGRGIYDAAYDVDLIDSRFQLLFA
        GTNILLP+EQAIKMLAETGNSIPLIFL+TDGSVDNEREICNLVKASLKS NTISPRLCTFG+GTFCNHYFLQMLSEIGRGIYDAAYDVDLIDSRFQ LF 
Subjt:  GTNILLPMEQAIKMLAETGNSIPLIFLVTDGSVDNEREICNLVKASLKSENTISPRLCTFGVGTFCNHYFLQMLSEIGRGIYDAAYDVDLIDSRFQLLFA

Query:  KASSLFLANITVDAFKHLDSLELFPTQIPDLACGSPLIISGRYNGSFPESFIVSGTSADMSNSTIHLKPQRAKELLLDRVLARRQIDIMTSHAWLLESKD
        KASS+FLANITVDA KHLDS ELFPTQIPDLACGSPLIISGRYNG FPESF VSGTSADMSNSTIHL+PQRAKELLLDRVLARRQIDIMTSHAWLLESKD
Subjt:  KASSLFLANITVDAFKHLDSLELFPTQIPDLACGSPLIISGRYNGSFPESFIVSGTSADMSNSTIHLKPQRAKELLLDRVLARRQIDIMTSHAWLLESKD

Query:  LENKVAKMSKQTGFPSEYTRLILMLTDEGKKAPPSIILQEMRKRFDMSKSNKVEWNGQKIILLGNQGVGFGNLAATAENMQPGKEIKPTQATDLLVKAAT
        L++K+AK+SKQ+GFPSEYTRLIL+L  EGKKAPPSII QEMRKRFD++KSNKVEW GQKIILLGNQGVGFGNL ATAEN+QPGKEIK TQATDLLVKAAT
Subjt:  LENKVAKMSKQTGFPSEYTRLILMLTDEGKKAPPSIILQEMRKRFDMSKSNKVEWNGQKIILLGNQGVGFGNLAATAENMQPGKEIKPTQATDLLVKAAT

Query:  NCCA
        NCC+
Subjt:  NCCA

XP_038898481.1 von Willebrand factor A domain-containing protein 5A isoform X1 [Benincasa hispida] | e-value: 0.0e+00 | % identity: 93.46
Query:  MAAEFSNCVEYGLHLSKRIYYGKGSAPAALARQMSRVSEDYLPTAPMVYAVIPEPTIVDNPDVPSYQPYVHGRCLPPALIPLHMNGVAMEIDCCFDTAFI
        MA EFSNCVEYGLHLSKRIYYGKGSAPAALARQMSRVSEDYLPTAPMVYAVIPEPTIVDNPDVPSYQPYVHGRC+PPALIPLHMNGV+MEIDCCFDTAFI
Subjt:  MAAEFSNCVEYGLHLSKRIYYGKGSAPAALARQMSRVSEDYLPTAPMVYAVIPEPTIVDNPDVPSYQPYVHGRCLPPALIPLHMNGVAMEIDCCFDTAFI

Query:  GVSGTWRVHCVMAGKGCECLIAVPMGEQGSLLGVEVDVAGSSHRTELVSMEDAEAIEKLAKSEDGKFLKGRRIYTLKIPKVDGGCTLSVRINWSQRIPYL
        GVSGTWRVHCVMAGK CECLIAVPMGEQGSLLGVEVDV G+SHRTELVSMEDAEAIEKLAKSEDGKFLKGRRIYTLKIPKVDGG TLSVRINWSQRIPY 
Subjt:  GVSGTWRVHCVMAGKGCECLIAVPMGEQGSLLGVEVDVAGSSHRTELVSMEDAEAIEKLAKSEDGKFLKGRRIYTLKIPKVDGGCTLSVRINWSQRIPYL

Query:  DDLFCLSVPFNFPAYLVPPGKKVKNSQKILLHINAGISSEVVCKYTSHPMKILRREVGNLSFSNEAEVSAWSNMDFDLSYSISPSDLFGGVLLQSPSLHD
        DDLFCLSVPF+FPAYLVPPGKKVKNSQKILLHIN+GISSEVVCK+TSHPMKILRREVGNLSFSNEAEVSAWSNMDFDLSYSISPSDLFGGVLLQSPSLHD
Subjt:  DDLFCLSVPFNFPAYLVPPGKKVKNSQKILLHINAGISSEVVCKYTSHPMKILRREVGNLSFSNEAEVSAWSNMDFDLSYSISPSDLFGGVLLQSPSLHD

Query:  FDQREMFCLYIFPGQNQNRKVFRKEVVFIIDISGSMKGGPLESSKRAVLASLSKLSPEDTFNIIGFNGDTKLFSLSMEQATNEAITRATEWIDANLVANG
        FDQREMFCLYIFPGQN NR+VFRK VVFIIDISGSMK GP+ES+KRAVLASLSKL+PED FNIIGFNGDTKLFSLSMEQAT EAITRATEWIDANLV NG
Subjt:  FDQREMFCLYIFPGQNQNRKVFRKEVVFIIDISGSMKGGPLESSKRAVLASLSKLSPEDTFNIIGFNGDTKLFSLSMEQATNEAITRATEWIDANLVANG

Query:  GTNILLPMEQAIKMLAETGNSIPLIFLVTDGSVDNEREICNLVKASLKSENTISPRLCTFGVGTFCNHYFLQMLSEIGRGIYDAAYDVDLIDSRFQLLFA
        GTNILLP+EQAIKMLAETGNSIPLIFL+TDGSV NEREICNLVKASLK  NTISPRLCTFG+GTFCNHYFLQMLSEIGRGIYDAAYDVDLIDSRFQ LF 
Subjt:  GTNILLPMEQAIKMLAETGNSIPLIFLVTDGSVDNEREICNLVKASLKSENTISPRLCTFGVGTFCNHYFLQMLSEIGRGIYDAAYDVDLIDSRFQLLFA

Query:  KASSLFLANITVDAFKHLDSLELFPTQIPDLACGSPLIISGRYNGSFPESFIVSGTSADMSNSTIHLKPQRAKELLLDRVLARRQIDIMTSHAWLLESKD
        KASSLFLANITVDAFKHLDS ELFPTQIPDLA GSPLIISGRYNGSFP+SF VSGTSADMSNSTIHLK QRAKELLLDRVLARRQID++TSHAWLLESKD
Subjt:  KASSLFLANITVDAFKHLDSLELFPTQIPDLACGSPLIISGRYNGSFPESFIVSGTSADMSNSTIHLKPQRAKELLLDRVLARRQIDIMTSHAWLLESKD

Query:  LENKVAKMSKQTGFPSEYTRLILMLTDEGKKAPPSIILQEMRKRFDMSKSNKVEWNGQKIILLGNQGVGFGNLAATAENMQPGKEIKPTQATDLLVKAAT
        L++KVAKMSKQTGFPSEYTRLILML  EGKKAP SII QEMRKRFD+SKSNKVEW GQKIILLGNQGVGFGNLAATAEN+QPGKEIKP+QATDLLVKAAT
Subjt:  LENKVAKMSKQTGFPSEYTRLILMLTDEGKKAPPSIILQEMRKRFDMSKSNKVEWNGQKIILLGNQGVGFGNLAATAENMQPGKEIKPTQATDLLVKAAT

Query:  NCC
        NCC
Subjt:  NCC

TrEMBL top hits (e-value, % identity, alignment)
A0A1S3CGJ5 von Willebrand factor A domain-containing protein 5A isoform X1 | e-value: 0.0e+00 | % identity: 91.37
Query:  MAAEFSNCVEYGLHLSKRIYYGKGSAPAALARQMSRVSEDYLPTAPMVYAVIPEPTIVDNPDVPSYQPYVHGRCLPPALIPLHMNGVAMEIDCCFDTAFI
        MA EFSN VEYGLHLSKRIYYGKGSAPAALARQMSRVSEDYLP APMVYAVIPEPTIVDNPDVPSYQPYVHGRC+PPALIPLHMNGV+MEI+CCFDTAFI
Subjt:  MAAEFSNCVEYGLHLSKRIYYGKGSAPAALARQMSRVSEDYLPTAPMVYAVIPEPTIVDNPDVPSYQPYVHGRCLPPALIPLHMNGVAMEIDCCFDTAFI

Query:  GVSGTWRVHCVMAGKGCECLIAVPMGEQGSLLGVEVDVAGSSHRTELVSMEDAEAIEKLAKSEDGKFLKGRRIYTLKIPKVDGGCTLSVRINWSQRIPYL
        GV+GTWRVHCVMAGK CECLIAVPMGEQGSLLGVEVDV G+ HRTELVSMEDAEAIEKLAKSEDGKFLKGRRIYTLKIPKV+GGCTLSVRINWSQRIPY+
Subjt:  GVSGTWRVHCVMAGKGCECLIAVPMGEQGSLLGVEVDVAGSSHRTELVSMEDAEAIEKLAKSEDGKFLKGRRIYTLKIPKVDGGCTLSVRINWSQRIPYL

Query:  DDLFCLSVPFNFPAYLVPPGKKVKNSQKILLHINAGISSEVVCKYTSHPMKILRREVGNLSFSNEAEVSAWSNMDFDLSYSISPSDLFGGVLLQSPSLHD
        DDLFCLSVPF+FPAYLVPPGKK KNSQKILLHIN+GISSEVVCK+TSHPMKILRREVGN+SFSNEAEVSAWSNMDFDLSYSISP+DLFGGVLLQ+PSLHD
Subjt:  DDLFCLSVPFNFPAYLVPPGKKVKNSQKILLHINAGISSEVVCKYTSHPMKILRREVGNLSFSNEAEVSAWSNMDFDLSYSISPSDLFGGVLLQSPSLHD

Query:  FDQREMFCLYIFPGQNQNRKVFRKEVVFIIDISGSMKGGPLESSKRAVLASLSKLSPEDTFNIIGFNGDTKLFSLSMEQATNEAITRATEWIDANLVANG
        FDQR+MFCLYIFPGQN NR+VFRKEVVFIIDISGSMK GPLES+KRAVLASLSKL+PED FNIIGFNGDTKLFSLSMEQAT EAITRAT+WI+ANLVANG
Subjt:  FDQREMFCLYIFPGQNQNRKVFRKEVVFIIDISGSMKGGPLESSKRAVLASLSKLSPEDTFNIIGFNGDTKLFSLSMEQATNEAITRATEWIDANLVANG

Query:  GTNILLPMEQAIKMLAETGNSIPLIFLVTDGSVDNEREICNLVKASLKSENTISPRLCTFGV---GTFCNHYFLQMLSEIGRGIYDAAYDVDLIDSRFQL
        GTNILLP+EQAIKMLAETGNSIPLIFL+TDGSVDNEREICNLVKASLKS NTISPRLCTFG+   GTFCNHYFLQMLSEIGRGIYDAAYDVDLIDSRFQ 
Subjt:  GTNILLPMEQAIKMLAETGNSIPLIFLVTDGSVDNEREICNLVKASLKSENTISPRLCTFGV---GTFCNHYFLQMLSEIGRGIYDAAYDVDLIDSRFQL

Query:  LFAKASSLFLANITVDAFKHLDSLELFPTQIPDLACGSPLIISGRYNGSFPESFIVSGTSADMSNSTIHLKPQRAKELLLDRVLARRQIDIMTSHAWLLE
        LF KASS+FLANITVDA KHLDS ELFPTQIPDLACGSPLIISGRYNG FPESF VSGTSADMSNSTIHL+PQRAKELLLDRVLARRQIDIMTSHAWLLE
Subjt:  LFAKASSLFLANITVDAFKHLDSLELFPTQIPDLACGSPLIISGRYNGSFPESFIVSGTSADMSNSTIHLKPQRAKELLLDRVLARRQIDIMTSHAWLLE

Query:  SKDLENKVAKMSKQTGFPSEYTRLILMLTDEGKKAPPSIILQEMRKRFDMSKSNKVEWNGQKIILLGNQGVGFGNLAATAENMQPGKEIKPTQATDLLVK
        SKDL++K+AK+SKQ+GFPSEYTRLIL+L  EGKKAPPSII QEMRKRFD++KSNKVEW GQKIILLGNQGVGFGNL ATAEN+QPGKEIK TQATDLLVK
Subjt:  SKDLENKVAKMSKQTGFPSEYTRLILMLTDEGKKAPPSIILQEMRKRFDMSKSNKVEWNGQKIILLGNQGVGFGNLAATAENMQPGKEIKPTQATDLLVK

Query:  AATNCCA
        AATNCC+
Subjt:  AATNCCA

A0A1S3CH51 von Willebrand factor A domain-containing protein 5A isoform X2 | e-value: 0.0e+00 | % identity: 91.76
Query:  MAAEFSNCVEYGLHLSKRIYYGKGSAPAALARQMSRVSEDYLPTAPMVYAVIPEPTIVDNPDVPSYQPYVHGRCLPPALIPLHMNGVAMEIDCCFDTAFI
        MA EFSN VEYGLHLSKRIYYGKGSAPAALARQMSRVSEDYLP APMVYAVIPEPTIVDNPDVPSYQPYVHGRC+PPALIPLHMNGV+MEI+CCFDTAFI
Subjt:  MAAEFSNCVEYGLHLSKRIYYGKGSAPAALARQMSRVSEDYLPTAPMVYAVIPEPTIVDNPDVPSYQPYVHGRCLPPALIPLHMNGVAMEIDCCFDTAFI

Query:  GVSGTWRVHCVMAGKGCECLIAVPMGEQGSLLGVEVDVAGSSHRTELVSMEDAEAIEKLAKSEDGKFLKGRRIYTLKIPKVDGGCTLSVRINWSQRIPYL
        GV+GTWRVHCVMAGK CECLIAVPMGEQGSLLGVEVDV G+ HRTELVSMEDAEAIEKLAKSEDGKFLKGRRIYTLKIPKV+GGCTLSVRINWSQRIPY+
Subjt:  GVSGTWRVHCVMAGKGCECLIAVPMGEQGSLLGVEVDVAGSSHRTELVSMEDAEAIEKLAKSEDGKFLKGRRIYTLKIPKVDGGCTLSVRINWSQRIPYL

Query:  DDLFCLSVPFNFPAYLVPPGKKVKNSQKILLHINAGISSEVVCKYTSHPMKILRREVGNLSFSNEAEVSAWSNMDFDLSYSISPSDLFGGVLLQSPSLHD
        DDLFCLSVPF+FPAYLVPPGKK KNSQKILLHIN+GISSEVVCK+TSHPMKILRREVGN+SFSNEAEVSAWSNMDFDLSYSISP+DLFGGVLLQ+PSLHD
Subjt:  DDLFCLSVPFNFPAYLVPPGKKVKNSQKILLHINAGISSEVVCKYTSHPMKILRREVGNLSFSNEAEVSAWSNMDFDLSYSISPSDLFGGVLLQSPSLHD

Query:  FDQREMFCLYIFPGQNQNRKVFRKEVVFIIDISGSMKGGPLESSKRAVLASLSKLSPEDTFNIIGFNGDTKLFSLSMEQATNEAITRATEWIDANLVANG
        FDQR+MFCLYIFPGQN NR+VFRKEVVFIIDISGSMK GPLES+KRAVLASLSKL+PED FNIIGFNGDTKLFSLSMEQAT EAITRAT+WI+ANLVANG
Subjt:  FDQREMFCLYIFPGQNQNRKVFRKEVVFIIDISGSMKGGPLESSKRAVLASLSKLSPEDTFNIIGFNGDTKLFSLSMEQATNEAITRATEWIDANLVANG

Query:  GTNILLPMEQAIKMLAETGNSIPLIFLVTDGSVDNEREICNLVKASLKSENTISPRLCTFGVGTFCNHYFLQMLSEIGRGIYDAAYDVDLIDSRFQLLFA
        GTNILLP+EQAIKMLAETGNSIPLIFL+TDGSVDNEREICNLVKASLKS NTISPRLCTFG+GTFCNHYFLQMLSEIGRGIYDAAYDVDLIDSRFQ LF 
Subjt:  GTNILLPMEQAIKMLAETGNSIPLIFLVTDGSVDNEREICNLVKASLKSENTISPRLCTFGVGTFCNHYFLQMLSEIGRGIYDAAYDVDLIDSRFQLLFA

Query:  KASSLFLANITVDAFKHLDSLELFPTQIPDLACGSPLIISGRYNGSFPESFIVSGTSADMSNSTIHLKPQRAKELLLDRVLARRQIDIMTSHAWLLESKD
        KASS+FLANITVDA KHLDS ELFPTQIPDLACGSPLIISGRYNG FPESF VSGTSADMSNSTIHL+PQRAKELLLDRVLARRQIDIMTSHAWLLESKD
Subjt:  KASSLFLANITVDAFKHLDSLELFPTQIPDLACGSPLIISGRYNGSFPESFIVSGTSADMSNSTIHLKPQRAKELLLDRVLARRQIDIMTSHAWLLESKD

Query:  LENKVAKMSKQTGFPSEYTRLILMLTDEGKKAPPSIILQEMRKRFDMSKSNKVEWNGQKIILLGNQGVGFGNLAATAENMQPGKEIKPTQATDLLVKAAT
        L++K+AK+SKQ+GFPSEYTRLIL+L  EGKKAPPSII QEMRKRFD++KSNKVEW GQKIILLGNQGVGFGNL ATAEN+QPGKEIK TQATDLLVKAAT
Subjt:  LENKVAKMSKQTGFPSEYTRLILMLTDEGKKAPPSIILQEMRKRFDMSKSNKVEWNGQKIILLGNQGVGFGNLAATAENMQPGKEIKPTQATDLLVKAAT

Query:  NCCA
        NCC+
Subjt:  NCCA

A0A5A7UW49 von Willebrand factor A domain-containing protein 5A isoform X2 | e-value: 0.0e+00 | % identity: 91.9
Query:  MAAEFSNCVEYGLHLSKRIYYGKGSAPAALARQMSRVSEDYLPTAPMVYAVIPEPTIVDNPDVPSYQPYVHGRCLPPALIPLHMNGVAMEIDCCFDTAFI
        MA EFSN VEYGLHLSKRIYYGKGSAPAALARQMSRVSEDYLP APMVYAVIPEPTIVDNPDVPSYQPYVHGRC+PPALIPLHMNGV+MEI+CCFDTAFI
Subjt:  MAAEFSNCVEYGLHLSKRIYYGKGSAPAALARQMSRVSEDYLPTAPMVYAVIPEPTIVDNPDVPSYQPYVHGRCLPPALIPLHMNGVAMEIDCCFDTAFI

Query:  GVSGTWRVHCVMAGKGCECLIAVPMGEQGSLLGVEVDVAGSSHRTELVSMEDAEAIEKLAKSEDGKFLKGRRIYTLKIPKVDGGCTLSVRINWSQRIPYL
        GV+GTWRVHCVMAGK CECLIAVPMGEQGSLLGVEVDV G+ HRTELVSMEDAEAIEKLAKSEDGKFLKGRRIYTLKIPKV+GGCTLSVRINWSQRIPY+
Subjt:  GVSGTWRVHCVMAGKGCECLIAVPMGEQGSLLGVEVDVAGSSHRTELVSMEDAEAIEKLAKSEDGKFLKGRRIYTLKIPKVDGGCTLSVRINWSQRIPYL

Query:  DDLFCLSVPFNFPAYLVPPGKKVKNSQKILLHINAGISSEVVCKYTSHPMKILRREVGNLSFSNEAEVSAWSNMDFDLSYSISPSDLFGGVLLQSPSLHD
        DDLFCLSVPF+FPAYLVPPGKK KNSQKILLHIN+GISSEVVCK+TSHPMKILRREVGN+SFSNEAEVSAWSNMDFDLSYSISP+DLFGGVLLQ+PSLHD
Subjt:  DDLFCLSVPFNFPAYLVPPGKKVKNSQKILLHINAGISSEVVCKYTSHPMKILRREVGNLSFSNEAEVSAWSNMDFDLSYSISPSDLFGGVLLQSPSLHD

Query:  FDQREMFCLYIFPGQNQNRKVFRKEVVFIIDISGSMKGGPLESSKRAVLASLSKLSPEDTFNIIGFNGDTKLFSLSMEQATNEAITRATEWIDANLVANG
        FDQREMFCLYIFPGQN NR+VFRKEVVFIIDISGSMK GPLES+KRAVLASLSKL+PED FNIIGFNGDTKLFSLSMEQAT EAITRAT+WI+ANLVANG
Subjt:  FDQREMFCLYIFPGQNQNRKVFRKEVVFIIDISGSMKGGPLESSKRAVLASLSKLSPEDTFNIIGFNGDTKLFSLSMEQATNEAITRATEWIDANLVANG

Query:  GTNILLPMEQAIKMLAETGNSIPLIFLVTDGSVDNEREICNLVKASLKSENTISPRLCTFGVGTFCNHYFLQMLSEIGRGIYDAAYDVDLIDSRFQLLFA
        GTNILLP+EQAIKMLAETGNSIPLIFL+TDGSVDNEREICN+VKASLKS NTISPRLCTFG+GTFCNHYFLQMLSEIGRGIYDAAYDVDLIDSRFQ LF 
Subjt:  GTNILLPMEQAIKMLAETGNSIPLIFLVTDGSVDNEREICNLVKASLKSENTISPRLCTFGVGTFCNHYFLQMLSEIGRGIYDAAYDVDLIDSRFQLLFA

Query:  KASSLFLANITVDAFKHLDSLELFPTQIPDLACGSPLIISGRYNGSFPESFIVSGTSADMSNSTIHLKPQRAKELLLDRVLARRQIDIMTSHAWLLESKD
        KASS+FLANITVDAFKHLDS ELFPTQIPDLACGSPLIISGRYNG FPESF VSGTSADMSNSTIHL+PQRAKELLLDRVLARRQIDIMTSHAWLLESKD
Subjt:  KASSLFLANITVDAFKHLDSLELFPTQIPDLACGSPLIISGRYNGSFPESFIVSGTSADMSNSTIHLKPQRAKELLLDRVLARRQIDIMTSHAWLLESKD

Query:  LENKVAKMSKQTGFPSEYTRLILMLTDEGKKAPPSIILQEMRKRFDMSKSNKVEWNGQKIILLGNQGVGFGNLAATAENMQPGKEIKPTQATDLLVKAAT
        L++K+AK+SKQ+GFPSEYTRLIL+L  EGKKAPPSII QEMRKRFD++KSNKVEW GQKIILLGNQGVGFGNL ATAEN+QPGKEIK TQATDLLVKAAT
Subjt:  LENKVAKMSKQTGFPSEYTRLILMLTDEGKKAPPSIILQEMRKRFDMSKSNKVEWNGQKIILLGNQGVGFGNLAATAENMQPGKEIKPTQATDLLVKAAT

Query:  NCCA
        NCC+
Subjt:  NCCA

A0A5D3C0Q7 Nucleoside-diphosphate kinase | e-value: 0.0e+00 | % identity: 91.9
Query:  MAAEFSNCVEYGLHLSKRIYYGKGSAPAALARQMSRVSEDYLPTAPMVYAVIPEPTIVDNPDVPSYQPYVHGRCLPPALIPLHMNGVAMEIDCCFDTAFI
        MA EFSN VEYGLHLSKRIYYGKGSAPAALARQMSRVSEDYLP APMVYAVIPEPTIVDNPDVPSYQPYVHGRC+PPALIPLHMNGV+MEI+CCFDTAFI
Subjt:  MAAEFSNCVEYGLHLSKRIYYGKGSAPAALARQMSRVSEDYLPTAPMVYAVIPEPTIVDNPDVPSYQPYVHGRCLPPALIPLHMNGVAMEIDCCFDTAFI

Query:  GVSGTWRVHCVMAGKGCECLIAVPMGEQGSLLGVEVDVAGSSHRTELVSMEDAEAIEKLAKSEDGKFLKGRRIYTLKIPKVDGGCTLSVRINWSQRIPYL
        GV+GTWRVHCVMAGK CECLIAVPMGEQGSLLGVEVDV G+ HRTELVSMEDAEAIEKLAKSEDGKFLKGRRIYTLKIPKV+GGCTLSVRINWSQRIPY+
Subjt:  GVSGTWRVHCVMAGKGCECLIAVPMGEQGSLLGVEVDVAGSSHRTELVSMEDAEAIEKLAKSEDGKFLKGRRIYTLKIPKVDGGCTLSVRINWSQRIPYL

Query:  DDLFCLSVPFNFPAYLVPPGKKVKNSQKILLHINAGISSEVVCKYTSHPMKILRREVGNLSFSNEAEVSAWSNMDFDLSYSISPSDLFGGVLLQSPSLHD
        DDLFCLSVPF+FPAYLVPPGKK KNSQKILLHIN+GISSEVVCK+TSHPMKILRREVGN+SFSNEAEVSAWSNMDFDLSYSISP+DLFGGVLLQ+PSLHD
Subjt:  DDLFCLSVPFNFPAYLVPPGKKVKNSQKILLHINAGISSEVVCKYTSHPMKILRREVGNLSFSNEAEVSAWSNMDFDLSYSISPSDLFGGVLLQSPSLHD

Query:  FDQREMFCLYIFPGQNQNRKVFRKEVVFIIDISGSMKGGPLESSKRAVLASLSKLSPEDTFNIIGFNGDTKLFSLSMEQATNEAITRATEWIDANLVANG
        FDQREMFCLYIFPGQN NR+VFRKEVVFIIDISGSMK GPLES+KRAVLASLSKL+PED FNIIGFNGDTKLFSLSMEQAT EAITRAT+WI+ANLVANG
Subjt:  FDQREMFCLYIFPGQNQNRKVFRKEVVFIIDISGSMKGGPLESSKRAVLASLSKLSPEDTFNIIGFNGDTKLFSLSMEQATNEAITRATEWIDANLVANG

Query:  GTNILLPMEQAIKMLAETGNSIPLIFLVTDGSVDNEREICNLVKASLKSENTISPRLCTFGVGTFCNHYFLQMLSEIGRGIYDAAYDVDLIDSRFQLLFA
        GTNILLP+EQAIKMLAETGNSIPLIFL+TDGSVDNEREICN+VKASLKS NTISPRLCTFG+GTFCNHYFLQMLSEIGRGIYDAAYDVDLIDSRFQ LF 
Subjt:  GTNILLPMEQAIKMLAETGNSIPLIFLVTDGSVDNEREICNLVKASLKSENTISPRLCTFGVGTFCNHYFLQMLSEIGRGIYDAAYDVDLIDSRFQLLFA

Query:  KASSLFLANITVDAFKHLDSLELFPTQIPDLACGSPLIISGRYNGSFPESFIVSGTSADMSNSTIHLKPQRAKELLLDRVLARRQIDIMTSHAWLLESKD
        KASS+FLANITVDAFKHLDS ELFPTQIPDLACGSPLIISGRYNG FPESF VSGTSADMSNSTIHL+PQRAKELLLDRVLARRQIDIMTSHAWLLESKD
Subjt:  KASSLFLANITVDAFKHLDSLELFPTQIPDLACGSPLIISGRYNGSFPESFIVSGTSADMSNSTIHLKPQRAKELLLDRVLARRQIDIMTSHAWLLESKD

Query:  LENKVAKMSKQTGFPSEYTRLILMLTDEGKKAPPSIILQEMRKRFDMSKSNKVEWNGQKIILLGNQGVGFGNLAATAENMQPGKEIKPTQATDLLVKAAT
        L++K+AK+SKQ+GFPSEYTRLIL+L  EGKKAPPSII QEMRKRFD++KSNKVEW GQKIILLGNQGVGFGNL ATAEN+QPGKEIK TQATDLLVKAAT
Subjt:  LENKVAKMSKQTGFPSEYTRLILMLTDEGKKAPPSIILQEMRKRFDMSKSNKVEWNGQKIILLGNQGVGFGNLAATAENMQPGKEIKPTQATDLLVKAAT

Query:  NCCA
        NCC+
Subjt:  NCCA

A0A6J1GQ62 uncharacterized protein LOC111456095 isoform X1 | e-value: 0.0e+00 | % identity: 91.47
Query:  MAAEFSNCVEYGLHLSKRIYYGKGSAPAALARQMSRVSEDYLPTAPMVYAVIPEPTIVDNPDVPSYQPYVHGRCLPPALIPLHMNGVAMEIDCCFDTAFI
        MA EFSNCVEYGLHLSKRIYYGKGSAPAAL RQMSRVSEDYLPTAPMVYAVIP+PTIVDNPDVPSYQPYVHGRC+PP LIPLHMNGVAMEIDCCFDTAFI
Subjt:  MAAEFSNCVEYGLHLSKRIYYGKGSAPAALARQMSRVSEDYLPTAPMVYAVIPEPTIVDNPDVPSYQPYVHGRCLPPALIPLHMNGVAMEIDCCFDTAFI

Query:  GVSGTWRVHCVMAGKGCECLIAVPMGEQGSLLGVEVDVAGSSHRTELVSMEDAEAIEKLAKSEDGKFLKGRRIYTLKIPKVDGGCTLSVRINWSQRIPYL
        GVSGTWRVHCVMAGKGCECLIAVPMGEQGSLLGVEVDV+G+S+RTELVSMEDAEAI+KLAKSEDGKFLKGRRIYTLKIPKVDGGCTLSVRINWSQRIPYL
Subjt:  GVSGTWRVHCVMAGKGCECLIAVPMGEQGSLLGVEVDVAGSSHRTELVSMEDAEAIEKLAKSEDGKFLKGRRIYTLKIPKVDGGCTLSVRINWSQRIPYL

Query:  DDLFCLSVPFNFPAYLVPPGKKVKNSQKILLHINAGISSEVVCKYTSHPMKILRREVGNLSFSNEAEVSAWSNMDFDLSYSISPSDLFGGVLLQSPSLHD
        DDLFCL+VPFNFPAYLVPPGKKVKN QKILLHIN+GISSE+VCK+TSHPMKIL REVG+LSFSNE EVSAWSN+DF+LSYSISPSDLFGGVLLQSPSLHD
Subjt:  DDLFCLSVPFNFPAYLVPPGKKVKNSQKILLHINAGISSEVVCKYTSHPMKILRREVGNLSFSNEAEVSAWSNMDFDLSYSISPSDLFGGVLLQSPSLHD

Query:  FDQREMFCLYIFPGQNQNRKVFRKEVVFIIDISGSMKGGPLESSKRAVLASLSKLSPEDTFNIIGFNGDTKLFSLSMEQATNEAITRATEWIDANLVANG
        FDQREMFCLYIFPGQNQNRKVFRKEVVFIIDISGSMKGGPLES+KRAVLAS+SKL+ EDTFNII FNGDTKLFS SMEQATNEAITRATEWIDANLVANG
Subjt:  FDQREMFCLYIFPGQNQNRKVFRKEVVFIIDISGSMKGGPLESSKRAVLASLSKLSPEDTFNIIGFNGDTKLFSLSMEQATNEAITRATEWIDANLVANG

Query:  GTNILLPMEQAIKMLAETGNSIPLIFLVTDGSVDNEREICNLVKASLKSENTISPRLCTFGVGTFCNHYFLQMLSEIGRGIYDAAYDVDLIDSRFQLLFA
        GTNILLP+EQAIKMLAETGNSIPLIFL+TDGSV+NEREICNLVKASLKS   I+PRLCTFG+GT+CNHYFLQMLSEIGRG+YDAAYDVDLIDSRFQ+LF 
Subjt:  GTNILLPMEQAIKMLAETGNSIPLIFLVTDGSVDNEREICNLVKASLKSENTISPRLCTFGVGTFCNHYFLQMLSEIGRGIYDAAYDVDLIDSRFQLLFA

Query:  KASSLFLANITVDAFKHLDSLELFPTQIPDLACGSPLIISGRYNGSFPESFIVSGTSADMSNSTIHLKPQRAKELLLDRVLARRQIDIMTSHAWLLESKD
        KASSLFLANITVDAFKHL SLELFPTQIPDLACGSPLI+SGRYNG FPESF VSG  ADMS+S IHLK QRAKELLL+RVLARRQIDIMTSHAWLLESKD
Subjt:  KASSLFLANITVDAFKHLDSLELFPTQIPDLACGSPLIISGRYNGSFPESFIVSGTSADMSNSTIHLKPQRAKELLLDRVLARRQIDIMTSHAWLLESKD

Query:  LENKVAKMSKQTGFPSEYTRLILMLTDEGKKAPPSIILQEMRKRFDMSKSNKVEWNGQKIILLGNQGVGFGNLAATAENMQPGKEIKPTQATDLLVKAAT
        LE+K+A MSKQTGFPSEYTRLILMLT EGKKAPPSII+QEMRKR DM KSNKVEW GQ+I+LLGNQGVGFGNL ATAEN QPGKEIK TQATDLLVKAAT
Subjt:  LENKVAKMSKQTGFPSEYTRLILMLTDEGKKAPPSIILQEMRKRFDMSKSNKVEWNGQKIILLGNQGVGFGNLAATAENMQPGKEIKPTQATDLLVKAAT

Query:  NCC
        NCC
Subjt:  NCC

SwissProt top hits (e-value, % identity, alignment)
A6X935 Inter alpha-trypsin inhibitor, heavy chain 4 | e-value: 5.6e-16 | % identity: 23.91
Query:  NEAEVSAWSNMDFDLSYSISPSDLFGGVLLQSPSLHDFDQREMFCLYIFPGQNQNRKVFRKEVVFIIDISGSMKGGPLESSKRAVLASLSKLSPEDTFNI
        +++E     N DF + Y ++ SD  G + ++          E + ++ F    +N     K V+F+ID SGSM G  ++ ++ A++  L  LSP+D FN+
Subjt:  NEAEVSAWSNMDFDLSYSISPSDLFGGVLLQSPSLHDFDQREMFCLYIFPGQNQNRKVFRKEVVFIIDISGSMKGGPLESSKRAVLASLSKLSPEDTFNI

Query:  IGFNGDTKLFSLSMEQATNEAITRATEWIDANLVANGGTNILLPMEQAIKMLAET-------GNSIPLIFLVTDGSVDNEREICNLVKASLKSENTISPR
        I F+G+   +  S+ QAT E + +A  +  + + A+GGTNI   +  A+++L  +         S+ LI L+TDG          +++ +++        
Subjt:  IGFNGDTKLFSLSMEQATNEAITRATEWIDANLVANGGTNILLPMEQAIKMLAET-------GNSIPLIFLVTDGSVDNEREICNLVKASLKSENTISPR

Query:  LCTFGVGTFCNHYFLQMLSEIGRGIYDAAYDVDLIDSRFQLLFAKASSLFLANITVDAFKHLDSL--ELFPTQIPDLACGSPLIISGRYNGSFPESFI--
        L   G G   N+ FL+ ++    G+    Y+      + Q  + + ++  L+++   AF++      E+   +      GS ++++G+     P+  +  
Subjt:  LCTFGVGTFCNHYFLQMLSEIGRGIYDAAYDVDLIDSRFQLLFAKASSLFLANITVDAFKHLDSL--ELFPTQIPDLACGSPLIISGRYNGSFPESFI--

Query:  VSGTSADMSNSTIHLKPQRAKE
        VSG    M N T   +   A++
Subjt:  VSGTSADMSNSTIHLKPQRAKE

O02668 Inter-alpha-trypsin inhibitor heavy chain H2 | e-value: 5.6e-16 | % identity: 25.61
Query:  KEVVFIIDISGSMKGGPLESSKRAVLASLSKLSPEDTFNIIGFNGDTKLFSLSMEQATNEAITRATEWIDANLVANGGTNILLPMEQAIKMLAETG----
        K ++F+ID+SGSM G  ++ +  A+   L  L  ED F+++ FN + + +   +  AT   +  A  +I+  +  +GGTNI   + +AI +L E      
Subjt:  KEVVFIIDISGSMKGGPLESSKRAVLASLSKLSPEDTFNIIGFNGDTKLFSLSMEQATNEAITRATEWIDANLVANGGTNILLPMEQAIKMLAETG----

Query:  ---NSIPLIFLVTDGSVDNEREICNLVKASLKSENTISPRLCTFGVGTFCNHYFLQMLSEIGRGIYDAAYDVDLIDSRFQLLFAKASSLFLANITVDAFK
           NS+ LI LV+DG         + ++ ++K     +  L + G+G   ++ FL+ LS   RG+    Y      S+ +  + + S+  L N+  + + 
Subjt:  ---NSIPLIFLVTDGSVDNEREICNLVKASLKSENTISPRLCTFGVGTFCNHYFLQMLSEIGRGIYDAAYDVDLIDSRFQLLFAKASSLFLANITVDAFK

Query:  HLDSLELFPTQIPDLACGSPLIISGRYNGSFPESF--IVSGTSADM
             ++     P+   GS ++++G++N    E    I++ TSA++
Subjt:  HLDSLELFPTQIPDLACGSPLIISGRYNGSFPESF--IVSGTSADM

P97279 Inter-alpha-trypsin inhibitor heavy chain H2 | e-value: 1.6e-15 | % identity: 26.16
Query:  QNRKVFRKEVVFIIDISGSMKGGPLESSKRAVLASLSKLSPEDTFNIIGFNGDTKLFSLSMEQATNEAITRATEWIDANLVANGGTNILLPMEQAIKMLA
        +N     K ++F+ID+SGSM G  ++ +  A+   L  L  ED F+++ FN + + +   +  AT   IT A  +I+  +  +GGTNI   + +AI +L 
Subjt:  QNRKVFRKEVVFIIDISGSMKGGPLESSKRAVLASLSKLSPEDTFNIIGFNGDTKLFSLSMEQATNEAITRATEWIDANLVANGGTNILLPMEQAIKMLA

Query:  ETGN-------SIPLIFLVTDGSVDNEREICNLVKASLKSENTISPRLCTFGVGTFCNHYFLQMLSEIGRGIYDAAYDVDLIDSRFQLLFAKASSLFLAN
        E  N       S+ LI LV+DG         + ++ ++K     +  L + G+G   ++ FL+ LS   RGI    Y      S+ +  + + S+  L N
Subjt:  ETGN-------SIPLIFLVTDGSVDNEREICNLVKASLKSENTISPRLCTFGVGTFCNHYFLQMLSEIGRGIYDAAYDVDLIDSRFQLLFAKASSLFLAN

Query:  ITVDAFKHLDSLELFPTQIPDLACGSPLIISGRYNGS
        +  + +      ++      +   GS ++++G+Y+ S
Subjt:  ITVDAFKHLDSLELFPTQIPDLACGSPLIISGRYNGS

Q54DU5 von Willebrand factor A domain-containing protein DDB_G0292028 | e-value: 5.9e-18 | % identity: 28
Query:  RKEVVFIIDISGSMKGGPLESSKRAVLASLSKLSPEDTFNIIGFNGD-TKLFSLSMEQATNEAITRATEW---IDANLVANGGTNILLPMEQAIKMLAET
        + E +F++D SGSM G P+E SK A+   +  L+    FNI+ F  +  KLF  S +   +E + +A+E+   IDANL   GGT +L P+   +    E+
Subjt:  RKEVVFIIDISGSMKGGPLESSKRAVLASLSKLSPEDTFNIIGFNGD-TKLFSLSMEQATNEAITRATEW---IDANLVANGGTNILLPMEQAIKMLAET

Query:  GNSIP-LIFLVTDGSVDNEREICNLVKASLKSENTISPRLCTFGVGTFCNHYFLQMLSEIGRGIYDAAYDVDLIDSRFQLLFAKASSLFLANITVDAFKH
            P  +F++TDG + N  ++ + V    K  NT   R+ T+G+G++ +   +  +S+  +G Y+   D   ++ +   L + A    L+NI VD +  
Subjt:  GNSIP-LIFLVTDGSVDNEREICNLVKASLKSENTISPRLCTFGVGTFCNHYFLQMLSEIGRGIYDAAYDVDLIDSRFQLLFAKASSLFLANITVDAFKH

Query:  LDSLELFPTQIPDLACGS-----------------PLIISGRYNGSFPE--SFIVSGTSADMSNSTIHLKPQRAKELLLDRVLARRQIDIMTSHAWLLES
        L ++   P QI  L                      +I+SG  NG   E  SF V    +  + ST H+    A + + D   + R+           E 
Subjt:  LDSLELFPTQIPDLACGS-----------------PLIISGRYNGSFPE--SFIVSGTSADMSNSTIHLKPQRAKELLLDRVLARRQIDIMTSHAWLLES

Query:  KDLENKVAKMSKQTGFPSEYTRLIL
        KD ++K+ K+ K+ G  S++T  I+
Subjt:  KDLENKVAKMSKQTGFPSEYTRLIL

Q5TIE3 von Willebrand factor A domain-containing protein 5B1 | e-value: 3.3e-16 | % identity: 26.75
Query:  DFDQREMFCLYIFPGQNQNRKVFRK---EVVFIIDISGSMKGGPLESSKRAVLASLSKLSPEDTFNIIGFNGDTKLFSLSMEQATNEAITRATEWIDANL
        D     +  L   P     +   RK   E +F+ID S SM G  +   K A+L +L  L P   FNIIGF    K    S +  + +++  A + I    
Subjt:  DFDQREMFCLYIFPGQNQNRKVFRK---EVVFIIDISGSMKGGPLESSKRAVLASLSKLSPEDTFNIIGFNGDTKLFSLSMEQATNEAITRATEWIDANL

Query:  VANGGTNILLPMEQAIKMLAETGNSIPLIFLVTDGSVDNEREICNLVKASLKSENTISPRLCTFGVGTFCNHYFLQMLSEIGRGIYDAAYDVDLIDSRFQ
           GGTNIL P++  I+     G+   L+F++TDG+V+N  ++  LV+      +  S R  +FG+G    H  ++ L+ +  G  +   + + +  +  
Subjt:  VANGGTNILLPMEQAIKMLAETGNSIPLIFLVTDGSVDNEREICNLVKASLKSENTISPRLCTFGVGTFCNHYFLQMLSEIGRGIYDAAYDVDLIDSRFQ

Query:  LLFAKASSLFLANITVD-AFKHLDSLELFPTQIPDLACGSPLI
            KA +  L+++TV+  F     + + P     L  G  L+
Subjt:  LLFAKASSLFLANITVD-AFKHLDSLELFPTQIPDLACGSPLI

Arabidopsis top hits (e-value, % identity, alignment)
AT1G19110.1 inter-alpha-trypsin inhibitor heavy chain-related | e-value: 2.2e-201 | % identity: 49.44
Query:  MAAEFSNCVEYGLHLSKRIYYGKGSAPAA----LARQMSRVSEDYLPTAPMVYAVIPEPTIVDNPDVPSYQPYVHGRCLPPALIPLHMNGVAMEIDCCFD
        MA +F+  V+ GL L+KRIY+GK  A AA         S  ++ YLPTAPMVYAVIP+P IVDNPD+PSYQP+VHGRC PPALIPL MN + +++DC  D
Subjt:  MAAEFSNCVEYGLHLSKRIYYGKGSAPAA----LARQMSRVSEDYLPTAPMVYAVIPEPTIVDNPDVPSYQPYVHGRCLPPALIPLHMNGVAMEIDCCFD

Query:  TAFIGVSGTWRVHCVMAGKGCECLIAVPMGEQGSLLGVEVDVAGSSHRTELVSMEDAEAIEKLAKSEDGKFLKGRRIYTLKIPKVDGGCTLSVRINWSQR
        TA + V+G+WRVHCVM  K C+C IA+PMGEQGS+LGVEV++   S+ T+L++ ED    EK A  E G FLK   I+TL IP+VDGG  LS+++ WSQ+
Subjt:  TAFIGVSGTWRVHCVMAGKGCECLIAVPMGEQGSLLGVEVDVAGSSHRTELVSMEDAEAIEKLAKSEDGKFLKGRRIYTLKIPKVDGGCTLSVRINWSQR

Query:  IPYLDDLFCLSVPFNFPAYLVPPGKKVKNSQKILLHINAGISSEVVCKYTSHPMKILRREVGNLSFSNEAEVSAWSNMDFDLSYSISPSDLFGGVLLQSP
        + Y    F L +PFNFP Y+ P  KK+   +KI L +NAG  +EV+CK  SH +K   R  G L F+ EA+V  WSN DF  SY+ S S++ GG+ LQS 
Subjt:  IPYLDDLFCLSVPFNFPAYLVPPGKKVKNSQKILLHINAGISSEVVCKYTSHPMKILRREVGNLSFSNEAEVSAWSNMDFDLSYSISPSDLFGGVLLQSP

Query:  SLHDFDQREMFCLYIFPGQNQNRKVFRKEVVFIIDISGSMKGGPLESSKRAVLASLSKLSPEDTFNIIGFNGDTKLFSLSMEQATNEAITRATEWIDANL
         +HD DQR++F  Y+FPG+ Q  K F++EVVF++DIS SM G PLE  K A+  +LSKL P D+FNII F+ DT LFS SME  T++A+ R  EW++ N 
Subjt:  SLHDFDQREMFCLYIFPGQNQNRKVFRKEVVFIIDISGSMKGGPLESSKRAVLASLSKLSPEDTFNIIGFNGDTKLFSLSMEQATNEAITRATEWIDANL

Query:  VANGGTNILLPMEQAIKMLAETGNSIPLIFLVTDGSVDNEREICNLVKASLKSENTISPRLCTFGVGTFCNHYFLQMLSEIGRGIYDAAYDVDLIDSRFQ
        V   GTN+L P+E+A++ML+ T  SIP+IF VTDGSV++ER IC+++K  L S  ++ PR+ TFG+G FCNHYFLQML+ I  G +++ Y+ D I+ R  
Subjt:  VANGGTNILLPMEQAIKMLAETGNSIPLIFLVTDGSVDNEREICNLVKASLKSENTISPRLCTFGVGTFCNHYFLQMLSEIGRGIYDAAYDVDLIDSRFQ

Query:  LLFAKASSLFLANITVDAFKHLDSLELFPTQIPDLACGSPLIISGRYNGSFPESFIVSGTSADMSNSTIHLKPQRAKELLLDRVLARRQIDIMTSHAWLL
         LF KA S  L NI ++  + LD +E++P+ IPDL   SPL+I GRY G FPE+ I  G   D+S+ +  L  Q AK++ LD+V A+  ID++T+ AW  
Subjt:  LLFAKASSLFLANITVDAFKHLDSLELFPTQIPDLACGSPLIISGRYNGSFPESFIVSGTSADMSNSTIHLKPQRAKELLLDRVLARRQIDIMTSHAWLL

Query:  ESKDLENKVAKMSKQTGFPSEYTRLILMLTDEGKKAPPSIILQEMRKRFDMSKSNKVEWNGQK------IILLGNQGVGFGNLAATAENMQPG-KEIKPT
        E K L+ K+AK+S QTG  SEYTR+I +   E  K  PS          +     K   NG+K       I L + G+GFG+  AT EN+ PG  E K  
Subjt:  ESKDLENKVAKMSKQTGFPSEYTRLILMLTDEGKKAPPSIILQEMRKRFDMSKSNKVEWNGQK------IILLGNQGVGFGNLAATAENMQPG-KEIKPT

Query:  QATDLLVKAATNCC
         A +  VKAA++CC
Subjt:  QATDLLVKAATNCC

AT1G72500.1 LOCATED IN: plasma membrane    6.0e-199    50
Query:  MAAEFSNCVEYGLHLSKRIYYGKGSAPAALARQMSRVSEDYLPTAPMVYAVIPEPTIVDNPDVPSYQPYVHGRCLPPALIPLHMNGVAMEIDCCFDTAFI
        M+ EF+  VE GL L++RIYYGKG AP  +    S   E++LPTA   YA I +P  VDNPDVPSYQPYVH RC P AL+PL M G+ M IDC  DTAF+
Subjt:  MAAEFSNCVEYGLHLSKRIYYGKGSAPAALARQMSRVSEDYLPTAPMVYAVIPEPTIVDNPDVPSYQPYVHGRCLPPALIPLHMNGVAMEIDCCFDTAFI

Query:  GVSGTWRVHCVMAGKGCECLIAVPMGEQGSLLGVEVDVAGS--SHRTELVSMEDAEAIEKLAKSEDGKFLKGRRIYTLKIPKVDGGCTLSVRINWSQRIP
         V+G WRVHCV   K  +C + VPMGE+GS LG E+DV  +  S++T+LV+ ++    + + K +D +F K   IYT KIP V GG   SV + WSQ++ 
Subjt:  GVSGTWRVHCVMAGKGCECLIAVPMGEQGSLLGVEVDVAGS--SHRTELVSMEDAEAIEKLAKSEDGKFLKGRRIYTLKIPKVDGGCTLSVRINWSQRIP

Query:  YLDDLFCLSVPFNFPAYLVPPGKKVKNSQKILLHINAGIS-SEVVCKYTSHPMKILRREVGNLSFSNEAEVSAWSNMDFDLSYSISPSDLFGGVLLQSPS
        Y D  F L+VPF FP+Y+ P GK++   +KI+L++N+ +S  E+   +TSHP+KI+ R  G LS   EAEV +WS +DF +S+++S  DL G VL++SPS
Subjt:  YLDDLFCLSVPFNFPAYLVPPGKKVKNSQKILLHINAGIS-SEVVCKYTSHPMKILRREVGNLSFSNEAEVSAWSNMDFDLSYSISPSDLFGGVLLQSPS

Query:  LHDFDQREMFCLYIFPGQNQNRKVFRKEVVFIIDISGSMKGGPLESSKRAVLASLSKLSPEDTFNIIGFNGDTKLFSLSMEQATNEAITRATEWIDANLV
          D D R +FCLY+FPG  ++ K+F++ VVF+IDIS SMK  PLE  K+A+L  L+KL  ED FNII FN +   FS SME AT+E I+  TEW+D+NL+
Subjt:  LHDFDQREMFCLYIFPGQNQNRKVFRKEVVFIIDISGSMKGGPLESSKRAVLASLSKLSPEDTFNIIGFNGDTKLFSLSMEQATNEAITRATEWIDANLV

Query:  ANGGTNILLPMEQAIKMLAETGNSIPLIFLVTDGSVDNEREICNLVKASL-KSENTISPRLCTFGVGTFCNHYFLQMLSEIGRGIYDAAYDVDLIDSRFQ
        ANGGTN+LLP++QA+K+L  +   +PL++LVTDGSV+NEREIC+ +K S  ++  +ISPR+ TFG+G+FCNHYFLQML+ IG G YD   + D  + +  
Subjt:  ANGGTNILLPMEQAIKMLAETGNSIPLIFLVTDGSVDNEREICNLVKASL-KSENTISPRLCTFGVGTFCNHYFLQMLSEIGRGIYDAAYDVDLIDSRFQ

Query:  LLFAKASSLFLANITVDAFKHLDSLELFPTQIPDLACGSPLIISGRYNGSFPESFIVSGTSADMSNSTIHLKPQRAKELLLDRVLARRQIDIMTSHAWLL
         LF  ASS  +AN T DA K L S+ELFP Q+PD+  G PLI+SGRY G FP+   + GT ADMS  TI L  Q+AK++ LD+VLARRQI+ +T+ AW  
Subjt:  LLFAKASSLFLANITVDAFKHLDSLELFPTQIPDLACGSPLIISGRYNGSFPESFIVSGTSADMSNSTIHLKPQRAKELLLDRVLARRQIDIMTSHAWLL

Query:  ESKDLENKVAKMSKQTGFPSEYTRLILMLT-DEGKK--APPSIILQEMRKRFDMSKSNKVEWNGQKIILLGNQGVGFGNLAATAENMQPG-KEIKPTQAT
        + K+L+ KV ++S QTGFPSEYT+++L +  DE +K  A P  I + +R             N  +  LLG QG GFGN+AAT +N+ P  +E K  + T
Subjt:  ESKDLENKVAKMSKQTGFPSEYTRLILMLT-DEGKK--APPSIILQEMRKRFDMSKSNKVEWNGQKIILLGNQGVGFGNLAATAENMQPG-KEIKPTQAT

Query:  DLLVKAATN-----CC
        +LL++AA+      CC
Subjt:  DLLVKAATN-----CC

AT3G54780.1 Zinc finger (C3HC4-type RING finger) family protein    1.1e-06    34.23
Query:  EVVFIIDISGSMKGGPLESSKRAVLASLSKLSPEDTFNIIGFNGDT-KLFSLS-MEQATNEAITRATEWIDANLVANGGTNILLPMEQAIKMLAE--TGN
        ++V ++DISGSM G  L   KRA+   +  L   D  ++I F+    +LF L+ M  A  +   +A      +LVANGGTNI+  + +  K++ +    N
Subjt:  EVVFIIDISGSMKGGPLESSKRAVLASLSKLSPEDTFNIIGFNGDT-KLFSLS-MEQATNEAITRATEWIDANLVANGGTNILLPMEQAIKMLAE--TGN

Query:  SIPLIFLVTDG
        S+  I L++DG
Subjt:  SIPLIFLVTDG

AT3G54780.2 Zinc finger (C3HC4-type RING finger) family protein    1.1e-06    34.23
Query:  EVVFIIDISGSMKGGPLESSKRAVLASLSKLSPEDTFNIIGFNGDT-KLFSLS-MEQATNEAITRATEWIDANLVANGGTNILLPMEQAIKMLAE--TGN
        ++V ++DISGSM G  L   KRA+   +  L   D  ++I F+    +LF L+ M  A  +   +A      +LVANGGTNI+  + +  K++ +    N
Subjt:  EVVFIIDISGSMKGGPLESSKRAVLASLSKLSPEDTFNIIGFNGDT-KLFSLS-MEQATNEAITRATEWIDANLVANGGTNILLPMEQAIKMLAE--TGN

Query:  SIPLIFLVTDG
        S+  I L++DG
Subjt:  SIPLIFLVTDG

AT3G54780.3 Zinc finger (C3HC4-type RING finger) family protein    1.1e-06    34.23
Query:  EVVFIIDISGSMKGGPLESSKRAVLASLSKLSPEDTFNIIGFNGDT-KLFSLS-MEQATNEAITRATEWIDANLVANGGTNILLPMEQAIKMLAE--TGN
        ++V ++DISGSM G  L   KRA+   +  L   D  ++I F+    +LF L+ M  A  +   +A      +LVANGGTNI+  + +  K++ +    N
Subjt:  EVVFIIDISGSMKGGPLESSKRAVLASLSKLSPEDTFNIIGFNGDT-KLFSLS-MEQATNEAITRATEWIDANLVANGGTNILLPMEQAIKMLAE--TGN

Query:  SIPLIFLVTDG
        S+  I L++DG
Subjt:  SIPLIFLVTDG


Sequences
CDS sequence
ATGGCTGCCGAGTTTTCCAACTGCGTTGAGTACGGTCTCCATTTGTCGAAGCGAATTTACTATGGAAAAGGATCGGCGCCGGCGGCTCTGGCGAGGCAGATGTCGAGGGT
GTCGGAGGACTACCTTCCGACGGCTCCAATGGTGTACGCCGTCATACCGGAGCCGACAATTGTGGACAATCCGGATGTTCCGAGCTACCAGCCTTACGTGCACGGGCGTT
GCCTACCGCCGGCGTTGATTCCGCTGCATATGAATGGAGTTGCCATGGAAATCGATTGCTGTTTCGATACTGCGTTCATTGGCGTTAGCGGAACGTGGCGCGTGCATTGC
GTGATGGCCGGGAAAGGATGTGAGTGCCTTATCGCGGTGCCGATGGGAGAGCAGGGTTCACTTCTAGGTGTTGAAGTTGATGTTGCTGGATCGTCACATCGCACTGAATT
AGTTTCCATGGAAGACGCGGAAGCTATAGAGAAACTGGCAAAATCTGAAGATGGAAAGTTTCTGAAAGGGCGCCGGATATACACTTTAAAGATCCCCAAGGTCGATGGTG
GCTGTACTCTTTCAGTTCGGATCAATTGGTCCCAGAGAATACCATACCTTGATGATCTCTTCTGTCTCAGTGTACCTTTCAATTTTCCAGCATATCTAGTTCCTCCTGGG
AAAAAGGTCAAGAATAGTCAAAAGATCTTGCTGCATATAAATGCGGGTATTTCGTCAGAAGTTGTATGTAAATATACTAGCCATCCGATGAAGATACTGAGACGCGAAGT
TGGCAACTTAAGTTTCTCGAACGAAGCAGAGGTCTCAGCATGGTCAAATATGGATTTTGATCTTTCGTACTCGATTTCTCCCAGTGACTTGTTTGGTGGTGTGCTGCTGC
AATCGCCGTCTCTTCATGATTTTGATCAAAGAGAGATGTTTTGTCTGTACATTTTTCCCGGCCAGAACCAGAACCGGAAGGTTTTCAGAAAGGAAGTAGTATTTATTATT
GACATAAGTGGAAGCATGAAGGGTGGTCCTCTCGAGAGTTCAAAGCGTGCAGTCCTTGCTTCACTTTCAAAGTTGAGCCCTGAAGATACATTTAACATAATAGGGTTCAA
TGGAGACACTAAATTATTTTCTTTATCAATGGAGCAAGCAACCAACGAAGCTATTACAAGGGCAACAGAGTGGATTGACGCTAATCTTGTAGCTAATGGTGGTACTAACA
TCCTGCTTCCCATGGAACAGGCTATAAAGATGTTGGCTGAAACGGGCAATTCAATTCCCCTCATTTTTCTCGTTACCGATGGTTCTGTTGATAATGAGAGAGAAATATGC
AATCTTGTGAAAGCTTCTTTGAAGAGTGAAAATACAATCTCTCCTCGTTTGTGTACTTTTGGTGTTGGTACATTTTGTAACCATTACTTCCTACAAATGCTATCAGAAAT
TGGGAGGGGGATTTATGATGCTGCTTATGATGTAGATTTGATTGATTCTCGGTTCCAGTTATTATTCGCAAAAGCCTCATCGCTCTTTCTCGCCAACATCACCGTGGATG
CTTTCAAGCATCTTGATTCCCTTGAGTTGTTTCCAACTCAAATTCCAGACCTTGCATGTGGAAGCCCATTGATAATATCAGGCCGATACAATGGAAGCTTTCCGGAATCT
TTTATAGTTAGTGGCACCTCAGCCGATATGAGCAATTCCACAATCCATCTCAAACCTCAAAGAGCGAAAGAACTTCTCCTTGACAGGGTGCTTGCGAGAAGACAAATAGA
TATAATGACTTCACATGCATGGTTACTAGAGAGCAAGGACTTAGAAAATAAGGTTGCAAAAATGAGCAAACAAACTGGGTTTCCGTCCGAGTATACACGTTTAATTTTAA
TGCTGACCGACGAAGGGAAGAAAGCTCCCCCATCCATTATCCTACAAGAGATGCGCAAGAGATTTGACATGTCGAAGTCGAATAAGGTAGAATGGAACGGGCAAAAGATC
ATCTTATTAGGAAACCAAGGCGTTGGATTTGGGAACTTAGCCGCAACCGCAGAGAACATGCAGCCAGGCAAGGAAATAAAGCCAACTCAAGCTACAGATCTCCTGGTGAA
AGCCGCTACAAATTGCTGTGCGTTGTGGCCTTTACACAACTCTCTGCCGCTCTTGCCTGCTGTGAGATCTTTAATTGTTGCTTTGAATTATGTGAATGTGAATGTTTCTG
ATGTTAATAATATTATACCATAA
mRNA sequence
ATGGCTGCCGAGTTTTCCAACTGCGTTGAGTACGGTCTCCATTTGTCGAAGCGAATTTACTATGGAAAAGGATCGGCGCCGGCGGCTCTGGCGAGGCAGATGTCGAGGGT
GTCGGAGGACTACCTTCCGACGGCTCCAATGGTGTACGCCGTCATACCGGAGCCGACAATTGTGGACAATCCGGATGTTCCGAGCTACCAGCCTTACGTGCACGGGCGTT
GCCTACCGCCGGCGTTGATTCCGCTGCATATGAATGGAGTTGCCATGGAAATCGATTGCTGTTTCGATACTGCGTTCATTGGCGTTAGCGGAACGTGGCGCGTGCATTGC
GTGATGGCCGGGAAAGGATGTGAGTGCCTTATCGCGGTGCCGATGGGAGAGCAGGGTTCACTTCTAGGTGTTGAAGTTGATGTTGCTGGATCGTCACATCGCACTGAATT
AGTTTCCATGGAAGACGCGGAAGCTATAGAGAAACTGGCAAAATCTGAAGATGGAAAGTTTCTGAAAGGGCGCCGGATATACACTTTAAAGATCCCCAAGGTCGATGGTG
GCTGTACTCTTTCAGTTCGGATCAATTGGTCCCAGAGAATACCATACCTTGATGATCTCTTCTGTCTCAGTGTACCTTTCAATTTTCCAGCATATCTAGTTCCTCCTGGG
AAAAAGGTCAAGAATAGTCAAAAGATCTTGCTGCATATAAATGCGGGTATTTCGTCAGAAGTTGTATGTAAATATACTAGCCATCCGATGAAGATACTGAGACGCGAAGT
TGGCAACTTAAGTTTCTCGAACGAAGCAGAGGTCTCAGCATGGTCAAATATGGATTTTGATCTTTCGTACTCGATTTCTCCCAGTGACTTGTTTGGTGGTGTGCTGCTGC
AATCGCCGTCTCTTCATGATTTTGATCAAAGAGAGATGTTTTGTCTGTACATTTTTCCCGGCCAGAACCAGAACCGGAAGGTTTTCAGAAAGGAAGTAGTATTTATTATT
GACATAAGTGGAAGCATGAAGGGTGGTCCTCTCGAGAGTTCAAAGCGTGCAGTCCTTGCTTCACTTTCAAAGTTGAGCCCTGAAGATACATTTAACATAATAGGGTTCAA
TGGAGACACTAAATTATTTTCTTTATCAATGGAGCAAGCAACCAACGAAGCTATTACAAGGGCAACAGAGTGGATTGACGCTAATCTTGTAGCTAATGGTGGTACTAACA
TCCTGCTTCCCATGGAACAGGCTATAAAGATGTTGGCTGAAACGGGCAATTCAATTCCCCTCATTTTTCTCGTTACCGATGGTTCTGTTGATAATGAGAGAGAAATATGC
AATCTTGTGAAAGCTTCTTTGAAGAGTGAAAATACAATCTCTCCTCGTTTGTGTACTTTTGGTGTTGGTACATTTTGTAACCATTACTTCCTACAAATGCTATCAGAAAT
TGGGAGGGGGATTTATGATGCTGCTTATGATGTAGATTTGATTGATTCTCGGTTCCAGTTATTATTCGCAAAAGCCTCATCGCTCTTTCTCGCCAACATCACCGTGGATG
CTTTCAAGCATCTTGATTCCCTTGAGTTGTTTCCAACTCAAATTCCAGACCTTGCATGTGGAAGCCCATTGATAATATCAGGCCGATACAATGGAAGCTTTCCGGAATCT
TTTATAGTTAGTGGCACCTCAGCCGATATGAGCAATTCCACAATCCATCTCAAACCTCAAAGAGCGAAAGAACTTCTCCTTGACAGGGTGCTTGCGAGAAGACAAATAGA
TATAATGACTTCACATGCATGGTTACTAGAGAGCAAGGACTTAGAAAATAAGGTTGCAAAAATGAGCAAACAAACTGGGTTTCCGTCCGAGTATACACGTTTAATTTTAA
TGCTGACCGACGAAGGGAAGAAAGCTCCCCCATCCATTATCCTACAAGAGATGCGCAAGAGATTTGACATGTCGAAGTCGAATAAGGTAGAATGGAACGGGCAAAAGATC
ATCTTATTAGGAAACCAAGGCGTTGGATTTGGGAACTTAGCCGCAACCGCAGAGAACATGCAGCCAGGCAAGGAAATAAAGCCAACTCAAGCTACAGATCTCCTGGTGAA
AGCCGCTACAAATTGCTGTGCGTTGTGGCCTTTACACAACTCTCTGCCGCTCTTGCCTGCTGTGAGATCTTTAATTGTTGCTTTGAATTATGTGAATGTGAATGTTTCTG
ATGTTAATAATATTATACCATAA
Protein sequence
MAAEFSNCVEYGLHLSKRIYYGKGSAPAALARQMSRVSEDYLPTAPMVYAVIPEPTIVDNPDVPSYQPYVHGRCLPPALIPLHMNGVAMEIDCCFDTAFIGVSGTWRVHC
VMAGKGCECLIAVPMGEQGSLLGVEVDVAGSSHRTELVSMEDAEAIEKLAKSEDGKFLKGRRIYTLKIPKVDGGCTLSVRINWSQRIPYLDDLFCLSVPFNFPAYLVPPG
KKVKNSQKILLHINAGISSEVVCKYTSHPMKILRREVGNLSFSNEAEVSAWSNMDFDLSYSISPSDLFGGVLLQSPSLHDFDQREMFCLYIFPGQNQNRKVFRKEVVFII
DISGSMKGGPLESSKRAVLASLSKLSPEDTFNIIGFNGDTKLFSLSMEQATNEAITRATEWIDANLVANGGTNILLPMEQAIKMLAETGNSIPLIFLVTDGSVDNEREIC
NLVKASLKSENTISPRLCTFGVGTFCNHYFLQMLSEIGRGIYDAAYDVDLIDSRFQLLFAKASSLFLANITVDAFKHLDSLELFPTQIPDLACGSPLIISGRYNGSFPES
FIVSGTSADMSNSTIHLKPQRAKELLLDRVLARRQIDIMTSHAWLLESKDLENKVAKMSKQTGFPSEYTRLILMLTDEGKKAPPSIILQEMRKRFDMSKSNKVEWNGQKI
ILLGNQGVGFGNLAATAENMQPGKEIKPTQATDLLVKAATNCCALWPLHNSLPLLPAVRSLIVALNYVNVNVSDVNNIIP