CuGenDBv2

Lag0017710 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0017710
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: DUF1308 domain-containing protein
Genome location: chr5:7444261..7455043
RNA-Seq Expression: Lag0017710
Synteny: Lag0017710
Gene Ontology terms: NA
InterPro domains: IPR010733 - Domain of unknown function DUF1308
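
The genome location above uses a `seqid:start..end` string. As a minimal sketch, assuming the coordinates are 1-based and inclusive (the page does not state this), the string can be split into its parts and the length of the gene span computed. The names `Location` and `parse_location` are hypothetical helpers introduced only for this illustration.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """A parsed genome location (hypothetical helper type)."""
    seqid: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumption: coordinates are 1-based and inclusive.
        return self.end - self.start + 1


# Matches strings such as "chr5:7444261..7455043".
_LOCATION_RE = re.compile(r"^(?P<seqid>[^:]+):(?P<start>\d+)\.\.(?P<end>\d+)$")


def parse_location(text: str) -> Location:
    """Parse a 'seqid:start..end' string into a Location."""
    match = _LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognized location string: {text!r}")
    start, end = int(match["start"]), int(match["end"])
    if start > end:
        raise ValueError(f"start is greater than end in {text!r}")
    return Location(match["seqid"], start, end)


loc = parse_location("chr5:7444261..7455043")
print(loc.seqid, loc.start, loc.end, loc.length)
```

Under the inclusive-coordinate assumption, the span for Lag0017710 is 10783 bp on chr5.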


Homology
GenBank top hits (e value, % identity, alignment)
XP_004147991.1 uncharacterized protein LOC101214095 isoform X1 [Cucumis sativus] | e value: 1.5e-224 | % identity: 87.28
Query:  MEEPDGVELAKQRCRAIIDRIEALPSSTKIAVSSTQTLHKLALRELNFLSRCSSSSSIPLSLNIGHLEATVHILQQPSVTGISRVCKPIP-SSCSKAVYV
        M EP+ VELAKQRC+AI+D I+ LPSST I+VS TQTLHKLALRELNFLSRCSSSSS PLSLNIGHLEA VHILQ PSVTGISRVCKPIP SS S+AVYV
Subjt:  MEEPDGVELAKQRCRAIIDRIEALPSSTKIAVSSTQTLHKLALRELNFLSRCSSSSSIPLSLNIGHLEATVHILQQPSVTGISRVCKPIP-SSCSKAVYV

Query:  DIICTLNRNPVWVIVSDRKPRYISWYKGHRSKGLKARLVEVVDAARSLQALEPCSIILFFSHGLDQFILERLRDEFRATEYNFSFSDFDFGFSEIDGDWV
        DIICTLNRNPVWVIVSDRKPRYISWYKGHRSKGLK+RL EV+DAARSL ALEPCSIILFFSHGLDQFILERLRDEF+ATE++F+FSDFDF FSEIDGDW+
Subjt:  DIICTLNRNPVWVIVSDRKPRYISWYKGHRSKGLKARLVEVVDAARSLQALEPCSIILFFSHGLDQFILERLRDEFRATEYNFSFSDFDFGFSEIDGDWV

Query:  NVLPRSYKEACVLEIKVNDRNCGV---NCNSKVGSTGVDEPEILDKYVKRDLGDPFCSIVMAMKPNPMMGTEDMGSANLEHLLGGDSDLINFDTTALIAL
        NVLPRSY+EACVLEIKVNDRNCGV   N NSKV S+GVDEPEIL+   + D GD FCS+VMAMKPNPM G EDM SAN E LLGGDSDLINFDTTALIAL
Subjt:  NVLPRSYKEACVLEIKVNDRNCGV---NCNSKVGSTGVDEPEILDKYVKRDLGDPFCSIVMAMKPNPMMGTEDMGSANLEHLLGGDSDLINFDTTALIAL

Query:  VSGISNGCVAKLLATPESELRQKYKSNYDFVIGQAMSEIQKPILVELSSLLSGKRGIICQSVHSEFKELVAMCGGPNEKSRANHLLKHIMVVPDMASKRM
        VSGISNGC AKLL+ PE+ELRQKYKSNYDFVIGQAMSEI+KPILVELSSLLSGKRGIICQS HSEFKEL+ MCGGPNEKSRANHLLKHIMVV DM SKRM
Subjt:  VSGISNGCVAKLLATPESELRQKYKSNYDFVIGQAMSEIQKPILVELSSLLSGKRGIICQSVHSEFKELVAMCGGPNEKSRANHLLKHIMVVPDMASKRM

Query:  TCLPTTRKLALKNKVVFGTGDYWNAPTLTANMSFVRAVSQTGMSLFTFEHRPRALT
        TCLPTTRKLALKNKVVFGTGDYWNAPTLTANMSFVRAVSQTGMSLFTFEHRPRALT
Subjt:  TCLPTTRKLALKNKVVFGTGDYWNAPTLTANMSFVRAVSQTGMSLFTFEHRPRALT

XP_022923545.1 uncharacterized protein LOC111431203 isoform X1 [Cucurbita moschata] | e value: 1.0e-225 | % identity: 86.84
Query:  MEEPDGVELAKQRCRAIIDRIEALPSSTKIAVSSTQTLHKLALRELNFLSRCSSSSSIPLSLNIGHLEATVHILQQPSVTGISRVCKPIPSSCSKAVYVD
        M EPD VELAKQRCRA++D IEALPSST I +SS++TLHKLALRELNFLSRCSSSSS PLSLNIGHLEA VHILQ PSV GISRVCKPIPS CSKAVYVD
Subjt:  MEEPDGVELAKQRCRAIIDRIEALPSSTKIAVSSTQTLHKLALRELNFLSRCSSSSSIPLSLNIGHLEATVHILQQPSVTGISRVCKPIPSSCSKAVYVD

Query:  IICTLNRNPVWVIVSDRKPRYISWYKGHRSKGLKARLVEVVDAARSLQALEPCSIILFFSHGLDQFILERLRDEFRATEYNFSFSDFDFGFSEIDGDWVN
        IICTLNRNPVW+IVSDRKPRYISW++GHRSKGLK+R+ EVVDAARSLQALEPCSIILFFSHGLDQFILERLRDEFRATE+NF+FSD DF FSEID DWVN
Subjt:  IICTLNRNPVWVIVSDRKPRYISWYKGHRSKGLKARLVEVVDAARSLQALEPCSIILFFSHGLDQFILERLRDEFRATEYNFSFSDFDFGFSEIDGDWVN

Query:  VLPRSYKEACVLEIKVNDRNCGV---NCNSKVGSTGVDEPEILDKYVKRDLGDPFCSIVMAMKPNPMMGTEDMGSANLEHLLGGDSDLINFDTTALIALV
        VLPR YKEACVLEIKVNDRNCG+   NC SK+ STGV+EPEILDKYV+RDLG PFCS+V AMKPNPM+G ED+ S +LEHLL GD+DLINFDTTALIALV
Subjt:  VLPRSYKEACVLEIKVNDRNCGV---NCNSKVGSTGVDEPEILDKYVKRDLGDPFCSIVMAMKPNPMMGTEDMGSANLEHLLGGDSDLINFDTTALIALV

Query:  SGISNGCVAKLLATPESELRQKYKSNYDFVIGQAMSEIQKPILVELSSLLSGKRGIICQSVHSEFKELVAMCGGPNEKSRANHLLKH-IMVVPDMASKRM
        SGISNGCVAKLLATPE EL+QKYKSNYDFVI Q MSEIQKPILVELSS LSGKRGIICQSVHSEFKELV MCGGP EKSR+N+LLKH IMVVPDMASKRM
Subjt:  SGISNGCVAKLLATPESELRQKYKSNYDFVIGQAMSEIQKPILVELSSLLSGKRGIICQSVHSEFKELVAMCGGPNEKSRANHLLKH-IMVVPDMASKRM

Query:  TCLPTTRKLALKNKVVFGTGDYWNAPTLTANMSFVRAVSQTGMSLFTFEHRPRALT
        TCLPTTRKLALKNK+VFGTGDYWNAPTLTANMSFVRAVSQTGMSL T EHRPRALT
Subjt:  TCLPTTRKLALKNKVVFGTGDYWNAPTLTANMSFVRAVSQTGMSLFTFEHRPRALT

XP_022923546.1 uncharacterized protein LOC111431203 isoform X2 [Cucurbita moschata] | e value: 4.2e-227 | % identity: 87.03
Query:  MEEPDGVELAKQRCRAIIDRIEALPSSTKIAVSSTQTLHKLALRELNFLSRCSSSSSIPLSLNIGHLEATVHILQQPSVTGISRVCKPIPSSCSKAVYVD
        M EPD VELAKQRCRA++D IEALPSST I +SS++TLHKLALRELNFLSRCSSSSS PLSLNIGHLEA VHILQ PSV GISRVCKPIPS CSKAVYVD
Subjt:  MEEPDGVELAKQRCRAIIDRIEALPSSTKIAVSSTQTLHKLALRELNFLSRCSSSSSIPLSLNIGHLEATVHILQQPSVTGISRVCKPIPSSCSKAVYVD

Query:  IICTLNRNPVWVIVSDRKPRYISWYKGHRSKGLKARLVEVVDAARSLQALEPCSIILFFSHGLDQFILERLRDEFRATEYNFSFSDFDFGFSEIDGDWVN
        IICTLNRNPVW+IVSDRKPRYISW++GHRSKGLK+R+ EVVDAARSLQALEPCSIILFFSHGLDQFILERLRDEFRATE+NF+FSD DF FSEID DWVN
Subjt:  IICTLNRNPVWVIVSDRKPRYISWYKGHRSKGLKARLVEVVDAARSLQALEPCSIILFFSHGLDQFILERLRDEFRATEYNFSFSDFDFGFSEIDGDWVN

Query:  VLPRSYKEACVLEIKVNDRNCGV---NCNSKVGSTGVDEPEILDKYVKRDLGDPFCSIVMAMKPNPMMGTEDMGSANLEHLLGGDSDLINFDTTALIALV
        VLPR YKEACVLEIKVNDRNCG+   NC SK+ STGV+EPEILDKYV+RDLG PFCS+V AMKPNPM+G ED+ S +LEHLL GD+DLINFDTTALIALV
Subjt:  VLPRSYKEACVLEIKVNDRNCGV---NCNSKVGSTGVDEPEILDKYVKRDLGDPFCSIVMAMKPNPMMGTEDMGSANLEHLLGGDSDLINFDTTALIALV

Query:  SGISNGCVAKLLATPESELRQKYKSNYDFVIGQAMSEIQKPILVELSSLLSGKRGIICQSVHSEFKELVAMCGGPNEKSRANHLLKHIMVVPDMASKRMT
        SGISNGCVAKLLATPE EL+QKYKSNYDFVI Q MSEIQKPILVELSS LSGKRGIICQSVHSEFKELV MCGGP EKSR+N+LLKHIMVVPDMASKRMT
Subjt:  SGISNGCVAKLLATPESELRQKYKSNYDFVIGQAMSEIQKPILVELSSLLSGKRGIICQSVHSEFKELVAMCGGPNEKSRANHLLKHIMVVPDMASKRMT

Query:  CLPTTRKLALKNKVVFGTGDYWNAPTLTANMSFVRAVSQTGMSLFTFEHRPRALT
        CLPTTRKLALKNK+VFGTGDYWNAPTLTANMSFVRAVSQTGMSL T EHRPRALT
Subjt:  CLPTTRKLALKNKVVFGTGDYWNAPTLTANMSFVRAVSQTGMSLFTFEHRPRALT

XP_023553240.1 uncharacterized protein LOC111810716 [Cucurbita pepo subsp. pepo] | e value: 1.4e-227 | % identity: 87.69
Query:  MEEPDGVELAKQRCRAIIDRIEALPSSTKIAVSSTQTLHKLALRELNFLSRCSSSSSIPLSLNIGHLEATVHILQQPSVTGISRVCKPIPSSCSKAVYVD
        M EPD VEL KQRCRA++D IEALPSST I+VSS++TLHKLALRELNFLSRCSSSSS PLSLNIGHLEA VHILQ PSV GISRVCKPIPS CSKAVYVD
Subjt:  MEEPDGVELAKQRCRAIIDRIEALPSSTKIAVSSTQTLHKLALRELNFLSRCSSSSSIPLSLNIGHLEATVHILQQPSVTGISRVCKPIPSSCSKAVYVD

Query:  IICTLNRNPVWVIVSDRKPRYISWYKGHRSKGLKARLVEVVDAARSLQALEPCSIILFFSHGLDQFILERLRDEFRATEYNFSFSDFDFGFSEIDGDWVN
        IICTLNRNPVW+IVSDRKPRYISW++GHRSKGLK+RL EVVDAARSLQALEPCSIILFFSHGLDQFILERLRDEFRATE+NFSFSD DF FSEID DWVN
Subjt:  IICTLNRNPVWVIVSDRKPRYISWYKGHRSKGLKARLVEVVDAARSLQALEPCSIILFFSHGLDQFILERLRDEFRATEYNFSFSDFDFGFSEIDGDWVN

Query:  VLPRSYKEACVLEIKVNDRNCGV---NCNSKVGSTGVDEPEILDKYVKRDLGDPFCSIVMAMKPNPMMGTEDMGSANLEHLLGGDSDLINFDTTALIALV
        VLPR YKEACVLEIKVNDRNCG+   NC SK+ STGVDEPEILDKYV+RDLG PFCS+V AMKPNPM+G ED+ S +LEHLL GD+DLINFDTTALIALV
Subjt:  VLPRSYKEACVLEIKVNDRNCGV---NCNSKVGSTGVDEPEILDKYVKRDLGDPFCSIVMAMKPNPMMGTEDMGSANLEHLLGGDSDLINFDTTALIALV

Query:  SGISNGCVAKLLATPESELRQKYKSNYDFVIGQAMSEIQKPILVELSSLLSGKRGIICQSVHSEFKELVAMCGGPNEKSRANHLLKHIMVVPDMASKRMT
        SGISNGCVAKLLATPE EL+QKYKSNYDFVI Q MSEIQKPILVELSS LSGKRGIICQSVHSEFKELV MCGGP EKSRAN+LLKHIMVVPDMASKRMT
Subjt:  SGISNGCVAKLLATPESELRQKYKSNYDFVIGQAMSEIQKPILVELSSLLSGKRGIICQSVHSEFKELVAMCGGPNEKSRANHLLKHIMVVPDMASKRMT

Query:  CLPTTRKLALKNKVVFGTGDYWNAPTLTANMSFVRAVSQTGMSLFTFEHRPRALT
        CLPTTRKLALKNK+VFGTGDYWNA TLTANMSFVRAVSQTGMSL T EHRPRALT
Subjt:  CLPTTRKLALKNKVVFGTGDYWNAPTLTANMSFVRAVSQTGMSLFTFEHRPRALT

XP_038906087.1 UPF0415 protein C7orf25 homolog [Benincasa hispida] | e value: 1.3e-231 | % identity: 88.79
Query:  MEEPDGVELAKQRCRAIIDRIEALPSSTKIAVSSTQTLHKLALRELNFLSRCSSSSSIPLSLNIGHLEATVHILQQPSVTGISRVCKPIPSSCSKAVYVD
        M EP+ +ELAKQRCRA+ID IE LPSST I VSS++TLHKLALRELNFLSRCSSSSS PLSLNIGHLEA VHILQ PSVTGISRVCKPIPSSCSK VYVD
Subjt:  MEEPDGVELAKQRCRAIIDRIEALPSSTKIAVSSTQTLHKLALRELNFLSRCSSSSSIPLSLNIGHLEATVHILQQPSVTGISRVCKPIPSSCSKAVYVD

Query:  IICTLNRNPVWVIVSDRKPRYISWYKGHRSKGLKARLVEVVDAARSLQALEPCSIILFFSHGLDQFILERLRDEFRATEYNFSFSDFDFGFSEIDGDWVN
        IICTL++NPVWVIVSDRKPRYISWYKGHRSKGLK+RL EV+DAARSLQALEPCSIILFFSHGLDQFILE+LRDEF+A E+NF+FSDFDFGFSEIDGDWVN
Subjt:  IICTLNRNPVWVIVSDRKPRYISWYKGHRSKGLKARLVEVVDAARSLQALEPCSIILFFSHGLDQFILERLRDEFRATEYNFSFSDFDFGFSEIDGDWVN

Query:  VLPRSYKEACVLEIKVNDRNCGV---NCNSKVGSTGVDEPEILDKYVKRDLGDPFCSIVMAMKPNPMMGTEDMGSANLEHLLGGDSDLINFDTTALIALV
        VLPRSY+EA VLEIKVNDR CGV   N NS   STGVD+PEILD YV+RD+ DPFCS+VMAMKPNPM+G EDM SA+LEH LGGD+DLINFDTTALIALV
Subjt:  VLPRSYKEACVLEIKVNDRNCGV---NCNSKVGSTGVDEPEILDKYVKRDLGDPFCSIVMAMKPNPMMGTEDMGSANLEHLLGGDSDLINFDTTALIALV

Query:  SGISNGCVAKLLATPESELRQKYKSNYDFVIGQAMSEIQKPILVELSSLLSGKRGIICQSVHSEFKELVAMCGGPNEKSRANHLLKHIMVVPDMASKRMT
        SGISNGCVAKLLATPESELRQKYKSNYDFVIGQAMSEI+KPILVELSSLL+GKRGIICQSVHSEFKELV MCGGPNEKSRANHLLKHI+VVPDMASKRMT
Subjt:  SGISNGCVAKLLATPESELRQKYKSNYDFVIGQAMSEIQKPILVELSSLLSGKRGIICQSVHSEFKELVAMCGGPNEKSRANHLLKHIMVVPDMASKRMT

Query:  CLPTTRKLALKNKVVFGTGDYWNAPTLTANMSFVRAVSQTGMSLFTFEHRPRALT
        CLPTTRKLALKNKVVFGTGDYWNAPTLTANMSFVRAVSQTGMSLFTFEHRPRALT
Subjt:  CLPTTRKLALKNKVVFGTGDYWNAPTLTANMSFVRAVSQTGMSLFTFEHRPRALT

TrEMBL top hits (e value, % identity, alignment)
A0A0A0L776 DUF1308 domain-containing protein | e value: 7.2e-225 | % identity: 87.28
Query:  MEEPDGVELAKQRCRAIIDRIEALPSSTKIAVSSTQTLHKLALRELNFLSRCSSSSSIPLSLNIGHLEATVHILQQPSVTGISRVCKPIP-SSCSKAVYV
        M EP+ VELAKQRC+AI+D I+ LPSST I+VS TQTLHKLALRELNFLSRCSSSSS PLSLNIGHLEA VHILQ PSVTGISRVCKPIP SS S+AVYV
Subjt:  MEEPDGVELAKQRCRAIIDRIEALPSSTKIAVSSTQTLHKLALRELNFLSRCSSSSSIPLSLNIGHLEATVHILQQPSVTGISRVCKPIP-SSCSKAVYV

Query:  DIICTLNRNPVWVIVSDRKPRYISWYKGHRSKGLKARLVEVVDAARSLQALEPCSIILFFSHGLDQFILERLRDEFRATEYNFSFSDFDFGFSEIDGDWV
        DIICTLNRNPVWVIVSDRKPRYISWYKGHRSKGLK+RL EV+DAARSL ALEPCSIILFFSHGLDQFILERLRDEF+ATE++F+FSDFDF FSEIDGDW+
Subjt:  DIICTLNRNPVWVIVSDRKPRYISWYKGHRSKGLKARLVEVVDAARSLQALEPCSIILFFSHGLDQFILERLRDEFRATEYNFSFSDFDFGFSEIDGDWV

Query:  NVLPRSYKEACVLEIKVNDRNCGV---NCNSKVGSTGVDEPEILDKYVKRDLGDPFCSIVMAMKPNPMMGTEDMGSANLEHLLGGDSDLINFDTTALIAL
        NVLPRSY+EACVLEIKVNDRNCGV   N NSKV S+GVDEPEIL+   + D GD FCS+VMAMKPNPM G EDM SAN E LLGGDSDLINFDTTALIAL
Subjt:  NVLPRSYKEACVLEIKVNDRNCGV---NCNSKVGSTGVDEPEILDKYVKRDLGDPFCSIVMAMKPNPMMGTEDMGSANLEHLLGGDSDLINFDTTALIAL

Query:  VSGISNGCVAKLLATPESELRQKYKSNYDFVIGQAMSEIQKPILVELSSLLSGKRGIICQSVHSEFKELVAMCGGPNEKSRANHLLKHIMVVPDMASKRM
        VSGISNGC AKLL+ PE+ELRQKYKSNYDFVIGQAMSEI+KPILVELSSLLSGKRGIICQS HSEFKEL+ MCGGPNEKSRANHLLKHIMVV DM SKRM
Subjt:  VSGISNGCVAKLLATPESELRQKYKSNYDFVIGQAMSEIQKPILVELSSLLSGKRGIICQSVHSEFKELVAMCGGPNEKSRANHLLKHIMVVPDMASKRM

Query:  TCLPTTRKLALKNKVVFGTGDYWNAPTLTANMSFVRAVSQTGMSLFTFEHRPRALT
        TCLPTTRKLALKNKVVFGTGDYWNAPTLTANMSFVRAVSQTGMSLFTFEHRPRALT
Subjt:  TCLPTTRKLALKNKVVFGTGDYWNAPTLTANMSFVRAVSQTGMSLFTFEHRPRALT

A0A5D3D7K2 UPF0415 protein C7orf25-like protein | e value: 2.1e-224 | % identity: 87.5
Query:  MEEPDGVELAKQRCRAIIDRIEALPSSTKIAVSSTQTLHKLALRELNFLSRCSSSSSIPLSLNIGHLEATVHILQQPSVTGISRVCKPIP-SSCSKAVYV
        M EP+ VELAKQRC+AI+D IE LPSST I+VS TQTL KLALRELNFLSRCS SSS PLSLNIGHLEA VHILQ PSVTGISRVCKPIP SS SKAVYV
Subjt:  MEEPDGVELAKQRCRAIIDRIEALPSSTKIAVSSTQTLHKLALRELNFLSRCSSSSSIPLSLNIGHLEATVHILQQPSVTGISRVCKPIP-SSCSKAVYV

Query:  DIICTLNRNPVWVIVSDRKPRYISWYKGHRSKGLKARLVEVVDAARSLQALEPCSIILFFSHGLDQFILERLRDEFRATEYNFSFSDFDFGFSEIDGDWV
        DIICTLNRNPVWVIVSDRKPRYISWYKGHRSKGLK+RL EV+DAA SLQALEPCSIILFFSHGLDQFILERLRDEF+ATE++F+FSDFDFGFSEIDGDW+
Subjt:  DIICTLNRNPVWVIVSDRKPRYISWYKGHRSKGLKARLVEVVDAARSLQALEPCSIILFFSHGLDQFILERLRDEFRATEYNFSFSDFDFGFSEIDGDWV

Query:  NVLPRSYKEACVLEIKVNDRNCGV---NCNSKVGSTGVDEPEILDKYVKRDLGDPFCSIVMAMKPNPMMGTEDMGSANLEHLLGGDSDLINFDTTALIAL
        NVL RSYKEACVLEIKV+DRNCG    N NSKV S+GVDEP+IL+   + DLGD FCS+VMAMKPNPM G EDM SANLE LLGGDSDLINFDTTALIAL
Subjt:  NVLPRSYKEACVLEIKVNDRNCGV---NCNSKVGSTGVDEPEILDKYVKRDLGDPFCSIVMAMKPNPMMGTEDMGSANLEHLLGGDSDLINFDTTALIAL

Query:  VSGISNGCVAKLLATPESELRQKYKSNYDFVIGQAMSEIQKPILVELSSLLSGKRGIICQSVHSEFKELVAMCGGPNEKSRANHLLKHIMVVPDMASKRM
        VSGISNGC AKLLATPE+EL+QKYKSNYDFVIGQAMSEI+KPILVEL SLLSGKRGIICQSVHSEFKEL+ MCGGPNEKSRANHLLKHIMVV DM SKRM
Subjt:  VSGISNGCVAKLLATPESELRQKYKSNYDFVIGQAMSEIQKPILVELSSLLSGKRGIICQSVHSEFKELVAMCGGPNEKSRANHLLKHIMVVPDMASKRM

Query:  TCLPTTRKLALKNKVVFGTGDYWNAPTLTANMSFVRAVSQTGMSLFTFEHRPRALT
        TCLPTTRKLALKNKVVFGTGDYWNAPTLTANMSFVRAVSQTGMSLFTFEHRPRALT
Subjt:  TCLPTTRKLALKNKVVFGTGDYWNAPTLTANMSFVRAVSQTGMSLFTFEHRPRALT

A0A6J1E731 uncharacterized protein LOC111431203 isoform X2 | e value: 2.0e-227 | % identity: 87.03
Query:  MEEPDGVELAKQRCRAIIDRIEALPSSTKIAVSSTQTLHKLALRELNFLSRCSSSSSIPLSLNIGHLEATVHILQQPSVTGISRVCKPIPSSCSKAVYVD
        M EPD VELAKQRCRA++D IEALPSST I +SS++TLHKLALRELNFLSRCSSSSS PLSLNIGHLEA VHILQ PSV GISRVCKPIPS CSKAVYVD
Subjt:  MEEPDGVELAKQRCRAIIDRIEALPSSTKIAVSSTQTLHKLALRELNFLSRCSSSSSIPLSLNIGHLEATVHILQQPSVTGISRVCKPIPSSCSKAVYVD

Query:  IICTLNRNPVWVIVSDRKPRYISWYKGHRSKGLKARLVEVVDAARSLQALEPCSIILFFSHGLDQFILERLRDEFRATEYNFSFSDFDFGFSEIDGDWVN
        IICTLNRNPVW+IVSDRKPRYISW++GHRSKGLK+R+ EVVDAARSLQALEPCSIILFFSHGLDQFILERLRDEFRATE+NF+FSD DF FSEID DWVN
Subjt:  IICTLNRNPVWVIVSDRKPRYISWYKGHRSKGLKARLVEVVDAARSLQALEPCSIILFFSHGLDQFILERLRDEFRATEYNFSFSDFDFGFSEIDGDWVN

Query:  VLPRSYKEACVLEIKVNDRNCGV---NCNSKVGSTGVDEPEILDKYVKRDLGDPFCSIVMAMKPNPMMGTEDMGSANLEHLLGGDSDLINFDTTALIALV
        VLPR YKEACVLEIKVNDRNCG+   NC SK+ STGV+EPEILDKYV+RDLG PFCS+V AMKPNPM+G ED+ S +LEHLL GD+DLINFDTTALIALV
Subjt:  VLPRSYKEACVLEIKVNDRNCGV---NCNSKVGSTGVDEPEILDKYVKRDLGDPFCSIVMAMKPNPMMGTEDMGSANLEHLLGGDSDLINFDTTALIALV

Query:  SGISNGCVAKLLATPESELRQKYKSNYDFVIGQAMSEIQKPILVELSSLLSGKRGIICQSVHSEFKELVAMCGGPNEKSRANHLLKHIMVVPDMASKRMT
        SGISNGCVAKLLATPE EL+QKYKSNYDFVI Q MSEIQKPILVELSS LSGKRGIICQSVHSEFKELV MCGGP EKSR+N+LLKHIMVVPDMASKRMT
Subjt:  SGISNGCVAKLLATPESELRQKYKSNYDFVIGQAMSEIQKPILVELSSLLSGKRGIICQSVHSEFKELVAMCGGPNEKSRANHLLKHIMVVPDMASKRMT

Query:  CLPTTRKLALKNKVVFGTGDYWNAPTLTANMSFVRAVSQTGMSLFTFEHRPRALT
        CLPTTRKLALKNK+VFGTGDYWNAPTLTANMSFVRAVSQTGMSL T EHRPRALT
Subjt:  CLPTTRKLALKNKVVFGTGDYWNAPTLTANMSFVRAVSQTGMSLFTFEHRPRALT

A0A6J1EC49 uncharacterized protein LOC111431203 isoform X1 | e value: 5.0e-226 | % identity: 86.84
Query:  MEEPDGVELAKQRCRAIIDRIEALPSSTKIAVSSTQTLHKLALRELNFLSRCSSSSSIPLSLNIGHLEATVHILQQPSVTGISRVCKPIPSSCSKAVYVD
        M EPD VELAKQRCRA++D IEALPSST I +SS++TLHKLALRELNFLSRCSSSSS PLSLNIGHLEA VHILQ PSV GISRVCKPIPS CSKAVYVD
Subjt:  MEEPDGVELAKQRCRAIIDRIEALPSSTKIAVSSTQTLHKLALRELNFLSRCSSSSSIPLSLNIGHLEATVHILQQPSVTGISRVCKPIPSSCSKAVYVD

Query:  IICTLNRNPVWVIVSDRKPRYISWYKGHRSKGLKARLVEVVDAARSLQALEPCSIILFFSHGLDQFILERLRDEFRATEYNFSFSDFDFGFSEIDGDWVN
        IICTLNRNPVW+IVSDRKPRYISW++GHRSKGLK+R+ EVVDAARSLQALEPCSIILFFSHGLDQFILERLRDEFRATE+NF+FSD DF FSEID DWVN
Subjt:  IICTLNRNPVWVIVSDRKPRYISWYKGHRSKGLKARLVEVVDAARSLQALEPCSIILFFSHGLDQFILERLRDEFRATEYNFSFSDFDFGFSEIDGDWVN

Query:  VLPRSYKEACVLEIKVNDRNCGV---NCNSKVGSTGVDEPEILDKYVKRDLGDPFCSIVMAMKPNPMMGTEDMGSANLEHLLGGDSDLINFDTTALIALV
        VLPR YKEACVLEIKVNDRNCG+   NC SK+ STGV+EPEILDKYV+RDLG PFCS+V AMKPNPM+G ED+ S +LEHLL GD+DLINFDTTALIALV
Subjt:  VLPRSYKEACVLEIKVNDRNCGV---NCNSKVGSTGVDEPEILDKYVKRDLGDPFCSIVMAMKPNPMMGTEDMGSANLEHLLGGDSDLINFDTTALIALV

Query:  SGISNGCVAKLLATPESELRQKYKSNYDFVIGQAMSEIQKPILVELSSLLSGKRGIICQSVHSEFKELVAMCGGPNEKSRANHLLKH-IMVVPDMASKRM
        SGISNGCVAKLLATPE EL+QKYKSNYDFVI Q MSEIQKPILVELSS LSGKRGIICQSVHSEFKELV MCGGP EKSR+N+LLKH IMVVPDMASKRM
Subjt:  SGISNGCVAKLLATPESELRQKYKSNYDFVIGQAMSEIQKPILVELSSLLSGKRGIICQSVHSEFKELVAMCGGPNEKSRANHLLKH-IMVVPDMASKRM

Query:  TCLPTTRKLALKNKVVFGTGDYWNAPTLTANMSFVRAVSQTGMSLFTFEHRPRALT
        TCLPTTRKLALKNK+VFGTGDYWNAPTLTANMSFVRAVSQTGMSL T EHRPRALT
Subjt:  TCLPTTRKLALKNKVVFGTGDYWNAPTLTANMSFVRAVSQTGMSLFTFEHRPRALT

A0A6J1HML8 uncharacterized protein LOC111465028 isoform X2 | e value: 2.8e-224 | % identity: 87.03
Query:  MEEPDGVELAKQRCRAIIDRIEALPSSTKIAVSSTQTLHKLALRELNFLSRCSSSSSIPLSLNIGHLEATVHILQQPSVTGISRVCKPIPSSCSKAVYVD
        M EPD VELAKQRCRA++D IEALP+ST I VSS++TLHKLALRELNFLSRCSSSSS PLSLNIGHLEA VHILQ PSV GISRVCKPIPSSC KAVYVD
Subjt:  MEEPDGVELAKQRCRAIIDRIEALPSSTKIAVSSTQTLHKLALRELNFLSRCSSSSSIPLSLNIGHLEATVHILQQPSVTGISRVCKPIPSSCSKAVYVD

Query:  IICTLNRNPVWVIVSDRKPRYISWYKGHRSKGLKARLVEVVDAARSLQALEPCSIILFFSHGLDQFILERLRDEFRATEYNFSFSDFDFGFSEIDGDWVN
        IICTLNRNPVW+IVSDRKPRYISW++GHRSKGLK+RL EVVDAARSLQALEPCSIILFFSHGLDQFILERLRDEFRATE+NF+FSD DF FSEID DWVN
Subjt:  IICTLNRNPVWVIVSDRKPRYISWYKGHRSKGLKARLVEVVDAARSLQALEPCSIILFFSHGLDQFILERLRDEFRATEYNFSFSDFDFGFSEIDGDWVN

Query:  VLPRSYKEACVLEIKVNDRNCGV---NCNSKVGSTGVDEPEILDKYVKRDLGDPFCSIVMAMKPNPMMGTEDMGSANLEHLLGGDSDLINFDTTALIALV
        VLPR YKEACVLEIKVNDRNCG+   N NSK+ STGVDE EILDKYV+RDLG PFCS+V AMKPNPM+G ED+ S +LEHLL  D+DLINFDTTALIALV
Subjt:  VLPRSYKEACVLEIKVNDRNCGV---NCNSKVGSTGVDEPEILDKYVKRDLGDPFCSIVMAMKPNPMMGTEDMGSANLEHLLGGDSDLINFDTTALIALV

Query:  SGISNGCVAKLLATPESELRQKYKSNYDFVIGQAMSEIQKPILVELSSLLSGKRGIICQSVHSEFKELVAMCGGPNEKSRANHLLKHIMVVPDMASKRMT
        SGISNGCVAKLLATPE EL+QKYKSNYDFVI Q MSEIQKPILVELSS LSGKRGIICQSVHSEFKELV MCGGP EKSRAN+LLKHIMVVPDMASKRM 
Subjt:  SGISNGCVAKLLATPESELRQKYKSNYDFVIGQAMSEIQKPILVELSSLLSGKRGIICQSVHSEFKELVAMCGGPNEKSRANHLLKHIMVVPDMASKRMT

Query:  CLPTTRKLALKNKVVFGTGDYWNAPTLTANMSFVRAVSQTGMSLFTFEHRPRALT
        CLPTTRKLALKNK+VFGTGDYWNAPTLTANMSFVRAVSQTGMSL T EHRPRALT
Subjt:  CLPTTRKLALKNKVVFGTGDYWNAPTLTANMSFVRAVSQTGMSLFTFEHRPRALT

SwissProt top hits (e value, % identity, alignment)
Q08AW5 UPF0415 protein C7orf25 homolog | e value: 2.5e-20 | % identity: 35.33
Query:  INFDTTALIALVSGISNGCVAKLLATPESELRQKYKSNYDFVIGQAMSEIQKPILVELSSLLSGKRGIICQSVHSEFKELVAMCGGPNEKSRANHLLKHI
        +N D T LI  VS +S+G    L        ++K  +       QA  E Q+ +L  L S +  K    C+S   +F+ ++   GGP EK RA  L+K I
Subjt:  INFDTTALIALVSGISNGCVAKLLATPESELRQKYKSNYDFVIGQAMSEIQKPILVELSSLLSGKRGIICQSVHSEFKELVAMCGGPNEKSRANHLLKHI

Query:  MVVPDMASKRMTCLPTTRKLALKNKVVFGTGDYWNAPTLTANMSFVRAVSQTGMSLFTFEHRPRALTDKVKEDMRKVP--YASA
         VVPD  S+R + L  + K+  ++  +FGTG+   A T+TAN  FVRA +  G+    F H+PRALT+  +     +P  YAS+
Subjt:  MVVPDMASKRMTCLPTTRKLALKNKVVFGTGDYWNAPTLTANMSFVRAVSQTGMSLFTFEHRPRALTDKVKEDMRKVP--YASA

Q5BKL1 UPF0415 protein C7orf25 homolog | e value: 7.2e-20 | % identity: 34.27
Query:  INFDTTALIALVSGISNGCVAKLLATPESELRQKYKSNYDFVIGQAMSEIQKPILVELSSLLSGKRGIICQSVHSEFKELVAMCGGPNEKSRANHLLKHI
        +N D T LI  VS +S+G         E   ++K  +       QA  E Q+ +L  L+S +  K    C+    +F+ ++   GGP EK RA  L+K I
Subjt:  INFDTTALIALVSGISNGCVAKLLATPESELRQKYKSNYDFVIGQAMSEIQKPILVELSSLLSGKRGIICQSVHSEFKELVAMCGGPNEKSRANHLLKHI

Query:  MVVPDMASKRMTCLPTTRKLALKNKVVFGTGDYWNAPTLTANMSFVRAVSQTGMSLFTFEHRPRALTDKVKEDMRKVP
         VVPD  S+R   L ++ K+  ++  +FGTG+   A T+TAN  FVRA +  G+    F H+PRALT+  +     +P
Subjt:  MVVPDMASKRMTCLPTTRKLALKNKVVFGTGDYWNAPTLTANMSFVRAVSQTGMSLFTFEHRPRALTDKVKEDMRKVP

Q803H0 UPF0415 protein C7orf25 homolog | e value: 5.0e-21 | % identity: 36.31
Query:  INFDTTALIALVSGISNGCVAKLLATPESELRQKYKSNYDFVIGQAMSEIQKPILVELSSLLSGKRGIICQSVHSEFKELVAMCGGPNEKSRANHLLKHI
        +N D T LI  VS +S+G                +      +  QA  E Q+ +L  L   + GK    CQS   +F+ ++   GGP EKSRA  LL  +
Subjt:  INFDTTALIALVSGISNGCVAKLLATPESELRQKYKSNYDFVIGQAMSEIQKPILVELSSLLSGKRGIICQSVHSEFKELVAMCGGPNEKSRANHLLKHI

Query:  MVVPDMASKRMTCLPTTRKLALKNKVVFGTGDYWNAPTLTANMSFVRAVSQTGMSLFTFEHRPRALTD
         VVPD  S+R   L  + K+  ++ ++FGTGD   A T+TAN  FVRA +  G+    F H+PRALT+
Subjt:  MVVPDMASKRMTCLPTTRKLALKNKVVFGTGDYWNAPTLTANMSFVRAVSQTGMSLFTFEHRPRALTD

Q91WD4 UPF0415 protein C7orf25 homolog | e value: 2.7e-19 | % identity: 34.32
Query:  INFDTTALIALVSGIS-NGCVAKLLATPESELRQKYKSNYDFVIGQAMSEIQKPILVELSSLLSGKRGIICQSVHSEFKELVAMCGGPNEKSRANHLLKH
        +N D T LI  VS +S  GC               +      +  QA  E ++ +L +L + +  K    C+S   +F+ ++   GGP E+ RA+ L+K 
Subjt:  INFDTTALIALVSGIS-NGCVAKLLATPESELRQKYKSNYDFVIGQAMSEIQKPILVELSSLLSGKRGIICQSVHSEFKELVAMCGGPNEKSRANHLLKH

Query:  IMVVPDMASKRMTCLPTTRKLALKNKVVFGTGDYWNAPTLTANMSFVRAVSQTGMSLFTFEHRPRALTD
        I VVPD  S+R   L  + K+  ++  +FGTGD   A T+TAN  FVRA +  G+    F H+PRALT+
Subjt:  IMVVPDMASKRMTCLPTTRKLALKNKVVFGTGDYWNAPTLTANMSFVRAVSQTGMSLFTFEHRPRALTD

Q9BPX7 UPF0415 protein C7orf25 | e value: 3.6e-19 | % identity: 34.32
Query:  INFDTTALIALVSGIS-NGCVAKLLATPESELRQKYKSNYDFVIGQAMSEIQKPILVELSSLLSGKRGIICQSVHSEFKELVAMCGGPNEKSRANHLLKH
        +N D T LI  VS +S  GC               +      +  QA  E ++ +L +L + +  K    C+S   +F+ ++   GGP E+ RA  L+K 
Subjt:  INFDTTALIALVSGIS-NGCVAKLLATPESELRQKYKSNYDFVIGQAMSEIQKPILVELSSLLSGKRGIICQSVHSEFKELVAMCGGPNEKSRANHLLKH

Query:  IMVVPDMASKRMTCLPTTRKLALKNKVVFGTGDYWNAPTLTANMSFVRAVSQTGMSLFTFEHRPRALTD
        I VVPD  S+R   L  + K+  ++  +FGTGD   A T+TAN  FVRA +  G+    F H+PRALT+
Subjt:  IMVVPDMASKRMTCLPTTRKLALKNKVVFGTGDYWNAPTLTANMSFVRAVSQTGMSLFTFEHRPRALTD

Arabidopsis top hits (e value, % identity, alignment)
AT1G73380.1 unknown protein | e value: 8.2e-120 | % identity: 54.07
Query:  VELAKQRCRAIIDRIEALPSSTKIAVSSTQTLHKLALRELNFLSRCSSS-SSIPLSLNIGHLEATVHILQQPSVTGISRVCKPIPSSCSKAVYVDIICTL
        +E+AKQRC ++I  IE LP ST I  S  +TL KLA  EL+FLS  SS  S  PLS+NIGH+E+ V ILQ PS+TG+SRVCKPIP      V+VD++CTL
Subjt:  VELAKQRCRAIIDRIEALPSSTKIAVSSTQTLHKLALRELNFLSRCSSS-SSIPLSLNIGHLEATVHILQQPSVTGISRVCKPIPSSCSKAVYVDIICTL

Query:  NRNPVWVIVSDRKPRYISWY-KGHRSKGLKARLVEVVDAARSLQALEPCSIILFFSHGLDQFILERLRDEFRATEYNFSF-SDFDFGFS---EIDGDWVN
         + PVW+IVSDR PRYISW    H SKGL++R+ +++ AA S   L+P S+ILFF++GL   + E+L+DEF A  ++F F SD D   S   + D +WVN
Subjt:  NRNPVWVIVSDRKPRYISWY-KGHRSKGLKARLVEVVDAARSLQALEPCSIILFFSHGLDQFILERLRDEFRATEYNFSF-SDFDFGFS---EIDGDWVN

Query:  VL-PRSYKEACVLEIKVNDRNCGVNCNSKVGSTGVDEPEILDKYVKRDLG--DPFCSIVMAMKPNPMMGTEDMGSANLEHLLGGDSDLINFDTTALIALV
        V+  RSYKEA  +EIK+ D+     C+S        E E+L +    +L   D F +++ +M+                 LLG D  LINFDTTAL+ALV
Subjt:  VL-PRSYKEACVLEIKVNDRNCGVNCNSKVGSTGVDEPEILDKYVKRDLG--DPFCSIVMAMKPNPMMGTEDMGSANLEHLLGGDSDLINFDTTALIALV

Query:  SGISNGCVAKLLATPESELRQKYKSNYDFVIGQAMSEIQKPILVELSSLLSGKRGIICQSVHSEFKELVAMCGGPNEKSRANHLLKHIMVVPDMASKRMT
        SGISNGC  +L+  PE EL +K+K N  FVI QA SEI+KP LV++ ++LSGKRGI+C+SV SEFKELV+M  GPNEK RA  LLK +MVV D  S+R+ 
Subjt:  SGISNGCVAKLLATPESELRQKYKSNYDFVIGQAMSEIQKPILVELSSLLSGKRGIICQSVHSEFKELVAMCGGPNEKSRANHLLKHIMVVPDMASKRMT

Query:  CLPTTRKLALKNKVVFGTGDYWNAPTLTANMSFVRAVSQTGMSLFTFEHRPRALT
         LPTTRKLA+KNK VFGTGD W APTLTANM+FVRAV+Q+GMSL T +H PRALT
Subjt:  CLPTTRKLALKNKVVFGTGDYWNAPTLTANMSFVRAVSQTGMSLFTFEHRPRALT


Sequences
CDS sequence
ATGTGGTCCCGCCGCCACCGGAAGCCCCTCTGGTGGTATTCATTAACCTACGCAGCGGCTGCCGCCATGGCCAGAAGCTCAAGCCCCTCTGTCGACCAAAAGACCGATGG
AAATGACCGGAAAAGTAGAGAAGAAATGGCAGCTGATAGAAATGGCATTTTACCCATAAAAATGGAAGAACCAGATGGAGTTGAATTGGCAAAGCAAAGATGCAGAGCGA
TTATCGACAGAATCGAAGCGCTGCCTTCTTCCACCAAAATCGCCGTTTCAAGTACCCAAACTCTTCACAAATTGGCTCTTCGCGAGCTCAATTTCCTCTCTCGCTGCTCC
TCTTCGTCCTCCATCCCGCTCAGCTTGAACATTGGGCACCTCGAGGCCACTGTTCACATTCTTCAACAGCCTTCCGTCACTGGAATTTCACGTGTCTGTAAGCCGATTCC
ATCTTCCTGTTCCAAAGCTGTTTATGTTGATATAATCTGCACTTTGAATAGGAATCCAGTGTGGGTTATTGTATCAGATAGAAAACCTAGGTATATTTCTTGGTATAAGG
GCCATAGAAGTAAGGGCTTGAAAGCTCGACTTGTGGAAGTGGTCGATGCCGCTCGCTCTTTGCAGGCCTTAGAACCTTGCTCGATCATATTGTTCTTTTCGCATGGACTT
GATCAGTTTATTCTGGAAAGACTTCGCGATGAATTTAGGGCTACTGAGTATAATTTCAGTTTCTCGGACTTTGATTTTGGTTTCTCTGAGATTGATGGAGATTGGGTTAA
TGTGCTTCCGAGAAGCTATAAAGAAGCCTGTGTTCTTGAAATTAAAGTTAATGATAGGAATTGTGGGGTTAATTGCAACAGTAAAGTAGGTTCTACTGGTGTGGATGAGC
CAGAGATTTTGGACAAGTATGTCAAGAGAGATCTGGGGGATCCTTTCTGCTCTATTGTTATGGCAATGAAACCTAATCCTATGATGGGTACGGAAGACATGGGATCTGCA
AATCTGGAACATTTATTGGGTGGTGATAGTGATCTAATAAATTTTGATACCACGGCGTTGATTGCATTAGTATCTGGCATTAGTAATGGTTGTGTTGCTAAATTATTGGC
TACCCCAGAGAGTGAATTGAGACAGAAGTACAAGAGTAACTATGATTTTGTTATTGGTCAGGCAATGTCAGAAATTCAGAAGCCTATACTTGTAGAGCTGAGTTCTCTTT
TATCTGGAAAAAGAGGCATAATATGCCAAAGTGTTCACTCTGAGTTCAAGGAACTAGTTGCAATGTGTGGAGGGCCTAATGAGAAGTCCAGAGCAAACCACTTACTAAAA
CACATTATGGTTGTACCGGACATGGCATCGAAACGTATGACGTGTCTCCCTACTACAAGAAAGTTGGCTTTGAAGAACAAGGTTGTGTTTGGCACTGGTGACTATTGGAA
TGCCCCAACCTTGACTGCTAACATGTCATTTGTCCGTGCAGTGTCCCAGACTGGGATGTCCCTTTTTACCTTTGAGCATAGGCCACGAGCTCTAACTGATAAAGTGAAGG
AAGACATGAGAAAAGTTCCATATGCTTCAGCAGTTGGGAGCTTGACGTATGTCATGGTAGTGCAGTGTCTTGGTAGGGGAATCCAAAATTTGGAGCTTTGGGGATCAACC
GTGAAACTGAGTGTCGCACAACAACGTTTGGGTGTTTTTTCGGCCAAGATTCAAGCACGAACAACAGCGTTTTTGGGTGAGATCTACTTTGGGAGCGATAGGATTGAATT
CTCGATTCATACTAAGGAGTTAAAAAGCTCAAAAAAACACCCACGGCTGAAACGTTGTTGTGCGATGCTGGAACTCAATAAAAACCATCGCTACACGTCGGAAATGTCAC
CACACCGCCGCACAAACGTCGTCGCCGTCGTCGGAGTACGCCGGAGACGCTAG
mRNA sequence
ATGTGGTCCCGCCGCCACCGGAAGCCCCTCTGGTGGTATTCATTAACCTACGCAGCGGCTGCCGCCATGGCCAGAAGCTCAAGCCCCTCTGTCGACCAAAAGACCGATGG
AAATGACCGGAAAAGTAGAGAAGAAATGGCAGCTGATAGAAATGGCATTTTACCCATAAAAATGGAAGAACCAGATGGAGTTGAATTGGCAAAGCAAAGATGCAGAGCGA
TTATCGACAGAATCGAAGCGCTGCCTTCTTCCACCAAAATCGCCGTTTCAAGTACCCAAACTCTTCACAAATTGGCTCTTCGCGAGCTCAATTTCCTCTCTCGCTGCTCC
TCTTCGTCCTCCATCCCGCTCAGCTTGAACATTGGGCACCTCGAGGCCACTGTTCACATTCTTCAACAGCCTTCCGTCACTGGAATTTCACGTGTCTGTAAGCCGATTCC
ATCTTCCTGTTCCAAAGCTGTTTATGTTGATATAATCTGCACTTTGAATAGGAATCCAGTGTGGGTTATTGTATCAGATAGAAAACCTAGGTATATTTCTTGGTATAAGG
GCCATAGAAGTAAGGGCTTGAAAGCTCGACTTGTGGAAGTGGTCGATGCCGCTCGCTCTTTGCAGGCCTTAGAACCTTGCTCGATCATATTGTTCTTTTCGCATGGACTT
GATCAGTTTATTCTGGAAAGACTTCGCGATGAATTTAGGGCTACTGAGTATAATTTCAGTTTCTCGGACTTTGATTTTGGTTTCTCTGAGATTGATGGAGATTGGGTTAA
TGTGCTTCCGAGAAGCTATAAAGAAGCCTGTGTTCTTGAAATTAAAGTTAATGATAGGAATTGTGGGGTTAATTGCAACAGTAAAGTAGGTTCTACTGGTGTGGATGAGC
CAGAGATTTTGGACAAGTATGTCAAGAGAGATCTGGGGGATCCTTTCTGCTCTATTGTTATGGCAATGAAACCTAATCCTATGATGGGTACGGAAGACATGGGATCTGCA
AATCTGGAACATTTATTGGGTGGTGATAGTGATCTAATAAATTTTGATACCACGGCGTTGATTGCATTAGTATCTGGCATTAGTAATGGTTGTGTTGCTAAATTATTGGC
TACCCCAGAGAGTGAATTGAGACAGAAGTACAAGAGTAACTATGATTTTGTTATTGGTCAGGCAATGTCAGAAATTCAGAAGCCTATACTTGTAGAGCTGAGTTCTCTTT
TATCTGGAAAAAGAGGCATAATATGCCAAAGTGTTCACTCTGAGTTCAAGGAACTAGTTGCAATGTGTGGAGGGCCTAATGAGAAGTCCAGAGCAAACCACTTACTAAAA
CACATTATGGTTGTACCGGACATGGCATCGAAACGTATGACGTGTCTCCCTACTACAAGAAAGTTGGCTTTGAAGAACAAGGTTGTGTTTGGCACTGGTGACTATTGGAA
TGCCCCAACCTTGACTGCTAACATGTCATTTGTCCGTGCAGTGTCCCAGACTGGGATGTCCCTTTTTACCTTTGAGCATAGGCCACGAGCTCTAACTGATAAAGTGAAGG
AAGACATGAGAAAAGTTCCATATGCTTCAGCAGTTGGGAGCTTGACGTATGTCATGGTAGTGCAGTGTCTTGGTAGGGGAATCCAAAATTTGGAGCTTTGGGGATCAACC
GTGAAACTGAGTGTCGCACAACAACGTTTGGGTGTTTTTTCGGCCAAGATTCAAGCACGAACAACAGCGTTTTTGGGTGAGATCTACTTTGGGAGCGATAGGATTGAATT
CTCGATTCATACTAAGGAGTTAAAAAGCTCAAAAAAACACCCACGGCTGAAACGTTGTTGTGCGATGCTGGAACTCAATAAAAACCATCGCTACACGTCGGAAATGTCAC
CACACCGCCGCACAAACGTCGTCGCCGTCGTCGGAGTACGCCGGAGACGCTAG
Protein sequence
MWSRRHRKPLWWYSLTYAAAAAMARSSSPSVDQKTDGNDRKSREEMAADRNGILPIKMEEPDGVELAKQRCRAIIDRIEALPSSTKIAVSSTQTLHKLALRELNFLSRCS
SSSSIPLSLNIGHLEATVHILQQPSVTGISRVCKPIPSSCSKAVYVDIICTLNRNPVWVIVSDRKPRYISWYKGHRSKGLKARLVEVVDAARSLQALEPCSIILFFSHGL
DQFILERLRDEFRATEYNFSFSDFDFGFSEIDGDWVNVLPRSYKEACVLEIKVNDRNCGVNCNSKVGSTGVDEPEILDKYVKRDLGDPFCSIVMAMKPNPMMGTEDMGSA
NLEHLLGGDSDLINFDTTALIALVSGISNGCVAKLLATPESELRQKYKSNYDFVIGQAMSEIQKPILVELSSLLSGKRGIICQSVHSEFKELVAMCGGPNEKSRANHLLK
HIMVVPDMASKRMTCLPTTRKLALKNKVVFGTGDYWNAPTLTANMSFVRAVSQTGMSLFTFEHRPRALTDKVKEDMRKVPYASAVGSLTYVMVVQCLGRGIQNLELWGST
VKLSVAQQRLGVFSAKIQARTTAFLGEIYFGSDRIEFSIHTKELKSSKKHPRLKRCCAMLELNKNHRYTSEMSPHRRTNVVAVVGVRRRR
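
The relationship between the CDS and protein sequences above can be checked with a short translation routine. This is a minimal sketch, assuming the standard nuclear genetic code (NCBI translation table 1) and a CDS that starts in frame; the function name `translate` is a hypothetical helper, not part of any database tooling.

```python
# Standard genetic code, with codons enumerated in T, C, A, G order.
_BASES = "TCAG"
_AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
_CODON_TABLE = {
    a + b + c: aa
    for (a, b, c), aa in zip(
        ((x, y, z) for x in _BASES for y in _BASES for z in _BASES),
        _AMINO_ACIDS,
    )
}


def translate(cds: str) -> str:
    """Translate an in-frame CDS, stopping at the first stop codon.

    Whitespace and line breaks are ignored; unknown codons become 'X'.
    """
    seq = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        residue = _CODON_TABLE.get(seq[i:i + 3], "X")
        if residue == "*":  # stop codon ends translation
            break
        protein.append(residue)
    return "".join(protein)


# First seven codons of the CDS sequence listed above.
print(translate("ATGTGGTCCCGCCGCCACCGG"))
```

For the first seven codons of the CDS, this yields `MWSRRHR`, which matches the start of the listed protein sequence. Passing the full CDS text (line breaks included) should allow the same comparison over the whole protein.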