; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0040997 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0040997
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionNeuronal PAS domain-containing protein 4
Genome locationchr13:10701635..10703316
RNA-Seq ExpressionLag0040997
SyntenyLag0040997
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6599962.1 hypothetical protein SDJN03_05195, partial [Cucurbita argyrosperma subsp. sororia]8.5e-26891.15Show/hide
Query:  MASCSFPDVYSWIQNLPPLSQWKTTFISTSICSSSSTSSSLKIVAAKSLHSPTITFSVIADYSFPISLWTSKPLKTSTKSSNLLDEESMSSLLVNCVHDV
        MASCSFPDVYSWIQ LPPLSQWKT+ ISTSIC+S+S SSSLKIVAAK+LHSPTIT S+IAD+SFPISLWTSKPLKTST SSNL DEE+MS+LL+NCVHDV
Subjt:  MASCSFPDVYSWIQNLPPLSQWKTTFISTSICSSSSTSSSLKIVAAKSLHSPTITFSVIADYSFPISLWTSKPLKTSTKSSNLLDEESMSSLLVNCVHDV

Query:  LYYGSNQRNNSSHNFLKLDITFNSKDIFNLAFLTLIFLICIYEAPIDLRSNCLMTLKHHLANSTSRQTSKVLMKLLGSNLEEQWMRSINLAITNWILELK
        LYYGSNQR NSSH  LKLDIT +S++IFNLAFLTLIFLICIYEAP DLRSNCLMTLKHHLANSTSRQ SKVLMKLLGSNLEEQWMRS+NLAITNW+LELK
Subjt:  LYYGSNQRNNSSHNFLKLDITFNSKDIFNLAFLTLIFLICIYEAPIDLRSNCLMTLKHHLANSTSRQTSKVLMKLLGSNLEEQWMRSINLAITNWILELK

Query:  ANGRTLKTPSPLYSYSFSTYGLWKVQLYCPIIAMDNIENSSNPSTDERLQFSLNYHQLEGVLQFNYKLVVREKWIDLRVHVDNIRCDIIRLVNETLLSER
        ANGRTLKTPSPLYSYSFST+GLWKVQLYCPIIAMDNIENSSNPSTDERLQFSLNYHQLEGVLQFNY+LVVR+KWID+RVHVDNIRCDI+RLVNETLLSER
Subjt:  ANGRTLKTPSPLYSYSFSTYGLWKVQLYCPIIAMDNIENSSNPSTDERLQFSLNYHQLEGVLQFNYKLVVREKWIDLRVHVDNIRCDIIRLVNETLLSER

Query:  GVGGSEKHFPSRISLQLTPTSHTNIMSVSVSKSSSNPKIEVGSEKTFEAGFEPATPYPGLKVSVGETVMVSMKPWKFEQYVHGNAATLNWYLHDSSDGKE
        GVGGSEKHFPSRISLQLTPTSHTNIMSVSVSKSS+NPKIEVG+E+TFEAGFEP+TPYPGLK+SVGET MVS+KPWKFEQ+VHGNAA LNWYLHDSSDGKE
Subjt:  GVGGSEKHFPSRISLQLTPTSHTNIMSVSVSKSSSNPKIEVGSEKTFEAGFEPATPYPGLKVSVGETVMVSMKPWKFEQYVHGNAATLNWYLHDSSDGKE

Query:  VASTKPSKLALINPKAWFRDRYSSAYRPFNKQGGVIFAGDEYGESVWWKIDGKARGKTMEWEIRGWIWLTYWPNKHKTFYTETKRLEFKEILHLSIP
        VASTKPSKL LINPKAWFRDRYSSA+RPFNKQGG+IFAGDEYGE+VWWKIDGKARGKTMEWEIRGWIWLTYWPNKHKTFYTETKRLEFKEILHLSIP
Subjt:  VASTKPSKLALINPKAWFRDRYSSAYRPFNKQGGVIFAGDEYGESVWWKIDGKARGKTMEWEIRGWIWLTYWPNKHKTFYTETKRLEFKEILHLSIP

KAG6600900.1 hypothetical protein SDJN03_06133, partial [Cucurbita argyrosperma subsp. sororia]5.7e-26490.54Show/hide
Query:  MASCSFPDVYSWIQNLPPLSQWKTTFISTSICSSSSTSSSLKIVAAKSLHSPTITFSVIADYSFPISLWTSKPLKTSTKSSNLLDEESMSSLLVNCVHDV
        MASCSFPDVYSWIQNLPPLSQWKTT ISTSICSSSST+SSLKIVAAK+LHSPTITFSV AD+SF ISLWTS+PLKTSTK+SNLL++ESMS+LL+NCV DV
Subjt:  MASCSFPDVYSWIQNLPPLSQWKTTFISTSICSSSSTSSSLKIVAAKSLHSPTITFSVIADYSFPISLWTSKPLKTSTKSSNLLDEESMSSLLVNCVHDV

Query:  LYYGSNQRNNSSHNFLKLDITFNSKDIFNLAFLTLIFLICIYEAPIDLRSNCLMTLKHHLANSTSRQTSKVLMKLLGSNLEEQWMRSINLAITNWILELK
        LYYGSN + NSSHN LKLDIT + K+IFN  FLTLIFLICIYEAPIDLRSNCL TLKHHLAN TSRQTSKVLMKLLGSNLE+QWMRSINLAITNWILELK
Subjt:  LYYGSNQRNNSSHNFLKLDITFNSKDIFNLAFLTLIFLICIYEAPIDLRSNCLMTLKHHLANSTSRQTSKVLMKLLGSNLEEQWMRSINLAITNWILELK

Query:  ANGRTLKTPSPLYSYSFSTYGLWKVQLYCPIIAMDNIENSSNPSTDERLQFSLNYHQLEGVLQFNYKLVVREKWIDLRVHVDNIRCDIIRLVNETLLSER
        A GRTLKTPSPL+SYSFSTYGLWKVQLYCPIIAMDNIENSSNPSTDERLQFSLNYHQLEGVLQFNY+ V REKWIDLRVHVDNIRCDIIRLV+ETLLSER
Subjt:  ANGRTLKTPSPLYSYSFSTYGLWKVQLYCPIIAMDNIENSSNPSTDERLQFSLNYHQLEGVLQFNYKLVVREKWIDLRVHVDNIRCDIIRLVNETLLSER

Query:  GVGGSEKHFPSRISLQLTPTSHTNIMSVSVSKSSSNPKIEVGSEKTFEAGFEPATPYPGLKVSVGETVMVSMKPWKFEQYVHGNAATLNWYLHDSSDGKE
        GVGGSEKHFPSRISLQLTPTSHTNIMSVSVSKSS+NPK+E+G+EKTFEAGFE ATP+PGLK++VGETV+VS+KPWKFEQ+VHGNAATLNWYLHDSSDGKE
Subjt:  GVGGSEKHFPSRISLQLTPTSHTNIMSVSVSKSSSNPKIEVGSEKTFEAGFEPATPYPGLKVSVGETVMVSMKPWKFEQYVHGNAATLNWYLHDSSDGKE

Query:  VASTKPSKLALINPKAWFRDRYSSAYRPFNKQGGVIFAGDEYGESVWWKIDGKARGKTMEWEIRGWIWLTYWPNKHKTFYTETKRLEFKEILHLSIP
        VASTKPSKL LINPKAWFRDRYSSA RPFNKQGGV+FAGDEYGESVWWKIDGKARGKTMEWEIRGWIWLTYWPNKHKTFYTET+RLEFKEILHLSIP
Subjt:  VASTKPSKLALINPKAWFRDRYSSAYRPFNKQGGVIFAGDEYGESVWWKIDGKARGKTMEWEIRGWIWLTYWPNKHKTFYTETKRLEFKEILHLSIP

XP_022941840.1 uncharacterized protein LOC111447081 [Cucurbita moschata]2.2e-26891.35Show/hide
Query:  MASCSFPDVYSWIQNLPPLSQWKTTFISTSICSSSSTSSSLKIVAAKSLHSPTITFSVIADYSFPISLWTSKPLKTSTKSSNLLDEESMSSLLVNCVHDV
        MASCSFPDVYSWIQ LPPLSQWKT+ ISTSIC+S+S SSSLKIVAAK+LHSPTIT S+IAD+SFPISLWTSKPLKTST SSNL DEE+MS+LL+NCVHDV
Subjt:  MASCSFPDVYSWIQNLPPLSQWKTTFISTSICSSSSTSSSLKIVAAKSLHSPTITFSVIADYSFPISLWTSKPLKTSTKSSNLLDEESMSSLLVNCVHDV

Query:  LYYGSNQRNNSSHNFLKLDITFNSKDIFNLAFLTLIFLICIYEAPIDLRSNCLMTLKHHLANSTSRQTSKVLMKLLGSNLEEQWMRSINLAITNWILELK
        LYYGSNQR NSSH  LKLDIT +S++IFNLAFLTLIFLICIYEAP DLRSNCLMTLKHHLANSTSRQ SKVLMKLLGSNLEEQWMRS+NLAITNW+LELK
Subjt:  LYYGSNQRNNSSHNFLKLDITFNSKDIFNLAFLTLIFLICIYEAPIDLRSNCLMTLKHHLANSTSRQTSKVLMKLLGSNLEEQWMRSINLAITNWILELK

Query:  ANGRTLKTPSPLYSYSFSTYGLWKVQLYCPIIAMDNIENSSNPSTDERLQFSLNYHQLEGVLQFNYKLVVREKWIDLRVHVDNIRCDIIRLVNETLLSER
        ANGRTLKTPSPLYSYSFST+GLWKVQLYCPIIAMDNIENSSNPSTDERLQFSLNYHQLEGVLQFNY+LVVR+KWID+RVHVDNIRCDI+RLVNETLLSER
Subjt:  ANGRTLKTPSPLYSYSFSTYGLWKVQLYCPIIAMDNIENSSNPSTDERLQFSLNYHQLEGVLQFNYKLVVREKWIDLRVHVDNIRCDIIRLVNETLLSER

Query:  GVGGSEKHFPSRISLQLTPTSHTNIMSVSVSKSSSNPKIEVGSEKTFEAGFEPATPYPGLKVSVGETVMVSMKPWKFEQYVHGNAATLNWYLHDSSDGKE
        GVGGSEKHFPSRISLQLTPTSHTNIMSVSVSKSS+NPKIE+G+E+TFEAGFEP+TPYPGLK+SVGET MVS+KPWKFEQ+VHGNAATLNWYLHDSSDGKE
Subjt:  GVGGSEKHFPSRISLQLTPTSHTNIMSVSVSKSSSNPKIEVGSEKTFEAGFEPATPYPGLKVSVGETVMVSMKPWKFEQYVHGNAATLNWYLHDSSDGKE

Query:  VASTKPSKLALINPKAWFRDRYSSAYRPFNKQGGVIFAGDEYGESVWWKIDGKARGKTMEWEIRGWIWLTYWPNKHKTFYTETKRLEFKEILHLSIP
        VASTKPSKL LINPKAWFRDRYSSA+RPFNKQGGVIFAGDEYGE+VWWKIDGKARGKTMEWEIRGWIWLTYWPNKHKTFYTETKRLEFKEILHLSIP
Subjt:  VASTKPSKLALINPKAWFRDRYSSAYRPFNKQGGVIFAGDEYGESVWWKIDGKARGKTMEWEIRGWIWLTYWPNKHKTFYTETKRLEFKEILHLSIP

XP_022995522.1 uncharacterized protein LOC111491028 [Cucurbita maxima]5.9e-26991.75Show/hide
Query:  MASCSFPDVYSWIQNLPPLSQWKTTFISTSICSSSSTSSSLKIVAAKSLHSPTITFSVIADYSFPISLWTSKPLKTSTKSSNLLDEESMSSLLVNCVHDV
        MASCSFPDVYSWIQ LPPLSQWKT+ ISTSIC+S+S SSSLKIVAAK+LHSPTIT S+IAD+SFPISLWTSKPLKTST SSNL DEE+MS+LL+NCVHDV
Subjt:  MASCSFPDVYSWIQNLPPLSQWKTTFISTSICSSSSTSSSLKIVAAKSLHSPTITFSVIADYSFPISLWTSKPLKTSTKSSNLLDEESMSSLLVNCVHDV

Query:  LYYGSNQRNNSSHNFLKLDITFNSKDIFNLAFLTLIFLICIYEAPIDLRSNCLMTLKHHLANSTSRQTSKVLMKLLGSNLEEQWMRSINLAITNWILELK
        LYYGSNQR NSSH  LKLDIT +SKDIFNLAFLTLIFLICIYEAP DLRSNCLMTLKHHLANSTSRQ SKVLMKLLGSNLEEQWMRS+NLAITNW+LELK
Subjt:  LYYGSNQRNNSSHNFLKLDITFNSKDIFNLAFLTLIFLICIYEAPIDLRSNCLMTLKHHLANSTSRQTSKVLMKLLGSNLEEQWMRSINLAITNWILELK

Query:  ANGRTLKTPSPLYSYSFSTYGLWKVQLYCPIIAMDNIENSSNPSTDERLQFSLNYHQLEGVLQFNYKLVVREKWIDLRVHVDNIRCDIIRLVNETLLSER
        ANGRTLKTPSPLYSYSFST+GLWKVQLYCPIIAMDNIENSSNPSTDERLQFSLNYHQLEGVLQFNY+LVVR+KWID+RVHVDNIRCDIIRLVNETLLSER
Subjt:  ANGRTLKTPSPLYSYSFSTYGLWKVQLYCPIIAMDNIENSSNPSTDERLQFSLNYHQLEGVLQFNYKLVVREKWIDLRVHVDNIRCDIIRLVNETLLSER

Query:  GVGGSEKHFPSRISLQLTPTSHTNIMSVSVSKSSSNPKIEVGSEKTFEAGFEPATPYPGLKVSVGETVMVSMKPWKFEQYVHGNAATLNWYLHDSSDGKE
        GVGGSEKHFPSRISLQLTPTSHTNIMSVSVSKSS+NP+IEVG+E+TFEAGFEP+TPYPGLK+SVGET MVS+KPWKFEQ+VHGNAATLNWYLHDSSDGKE
Subjt:  GVGGSEKHFPSRISLQLTPTSHTNIMSVSVSKSSSNPKIEVGSEKTFEAGFEPATPYPGLKVSVGETVMVSMKPWKFEQYVHGNAATLNWYLHDSSDGKE

Query:  VASTKPSKLALINPKAWFRDRYSSAYRPFNKQGGVIFAGDEYGESVWWKIDGKARGKTMEWEIRGWIWLTYWPNKHKTFYTETKRLEFKEILHLSIP
        VASTKPSKLALINPKAWFRDRYSSA+RPFNKQGGVIFAGDEYGE+VWWKIDGKARGKTMEWEI+GWIWLTYWPNKHKTFYTETKRLEFKEI+HLSIP
Subjt:  VASTKPSKLALINPKAWFRDRYSSAYRPFNKQGGVIFAGDEYGESVWWKIDGKARGKTMEWEIRGWIWLTYWPNKHKTFYTETKRLEFKEILHLSIP

XP_023544514.1 uncharacterized protein LOC111804061 [Cucurbita pepo subsp. pepo]5.9e-26991.75Show/hide
Query:  MASCSFPDVYSWIQNLPPLSQWKTTFISTSICSSSSTSSSLKIVAAKSLHSPTITFSVIADYSFPISLWTSKPLKTSTKSSNLLDEESMSSLLVNCVHDV
        MASCSFPDVYSWIQ LPPLSQWKTT ISTSIC+S+S SSSLKIVAAK+LHSPTIT S+IAD+SFPISLWTSKPLKTST SSNL DEE+MS+LL+NCVHDV
Subjt:  MASCSFPDVYSWIQNLPPLSQWKTTFISTSICSSSSTSSSLKIVAAKSLHSPTITFSVIADYSFPISLWTSKPLKTSTKSSNLLDEESMSSLLVNCVHDV

Query:  LYYGSNQRNNSSHNFLKLDITFNSKDIFNLAFLTLIFLICIYEAPIDLRSNCLMTLKHHLANSTSRQTSKVLMKLLGSNLEEQWMRSINLAITNWILELK
        LYYGSNQR NSSH  LKLDIT +S++IFNLAFLTLIFLICIYEAP DLRSNCLMTLKHHLANSTSRQ SKVLMKLLGSNLE+QWMRS+NLAITNW+LELK
Subjt:  LYYGSNQRNNSSHNFLKLDITFNSKDIFNLAFLTLIFLICIYEAPIDLRSNCLMTLKHHLANSTSRQTSKVLMKLLGSNLEEQWMRSINLAITNWILELK

Query:  ANGRTLKTPSPLYSYSFSTYGLWKVQLYCPIIAMDNIENSSNPSTDERLQFSLNYHQLEGVLQFNYKLVVREKWIDLRVHVDNIRCDIIRLVNETLLSER
        ANGRTLKTPSPLYSYSFS++GLWKVQLYCPIIAMDNIENSSNPSTDERLQFSLNYHQLEGVLQFNY+LVVR+KWID+RVHVDNIRCDIIRLVNETLLSER
Subjt:  ANGRTLKTPSPLYSYSFSTYGLWKVQLYCPIIAMDNIENSSNPSTDERLQFSLNYHQLEGVLQFNYKLVVREKWIDLRVHVDNIRCDIIRLVNETLLSER

Query:  GVGGSEKHFPSRISLQLTPTSHTNIMSVSVSKSSSNPKIEVGSEKTFEAGFEPATPYPGLKVSVGETVMVSMKPWKFEQYVHGNAATLNWYLHDSSDGKE
        GVGGSEKHFPSRISLQLTPTSHTNIMSVSVSKSS+NPKIEVG+E+TFEAGFEP+TPYPGLK+SVGET MVS+KPWKFEQ+VHGNAATLNWYLHDSSDGKE
Subjt:  GVGGSEKHFPSRISLQLTPTSHTNIMSVSVSKSSSNPKIEVGSEKTFEAGFEPATPYPGLKVSVGETVMVSMKPWKFEQYVHGNAATLNWYLHDSSDGKE

Query:  VASTKPSKLALINPKAWFRDRYSSAYRPFNKQGGVIFAGDEYGESVWWKIDGKARGKTMEWEIRGWIWLTYWPNKHKTFYTETKRLEFKEILHLSIP
        VASTKPSKLALINPKAWFRDRYSSA+RPFNKQGGVIFAGDEYGE+VWWKIDGKARGKTMEWEIRGWIWLTYWPNKHKTFYTETKRLEFKEILHLSIP
Subjt:  VASTKPSKLALINPKAWFRDRYSSAYRPFNKQGGVIFAGDEYGESVWWKIDGKARGKTMEWEIRGWIWLTYWPNKHKTFYTETKRLEFKEILHLSIP

TrEMBL top hitse value%identityAlignment
A0A5D3DQB5 Neuronal PAS domain-containing protein 47.3e-25788.73Show/hide
Query:  MASCSFPDVYSWIQNLPPLSQWKTTFISTSICSSSSTSSSLKIVAAKSLHSPTITFSVIADYSFPISLWTSKPLKTSTKSSNLLDEESMSSLLVNCVHDV
        MASCSFPDVYSWIQNLPPLSQWKTT ISTSICSSS+T+SSL IVAAK++HSPTITFSV AD +FPISLWTSKPL   T SS+LL++E MSSLL+NCVHDV
Subjt:  MASCSFPDVYSWIQNLPPLSQWKTTFISTSICSSSSTSSSLKIVAAKSLHSPTITFSVIADYSFPISLWTSKPLKTSTKSSNLLDEESMSSLLVNCVHDV

Query:  LYYGSNQRNNSSHNFLKLDITFNSKDIFNLAFLTLIFLICIYEAPIDLRSNCLMTLKHHLANSTSRQTSKVLMKLLGSNLEEQWMRSINLAITNWILELK
        LYYGSNQR NSSHNF KL+IT NSK+IFNLAFLTLIFLICIYEAPIDLRSNCLM LK+HLANSTSRQ SKVLMKLLGS LE+QW+RS+NLA+TNWI ELK
Subjt:  LYYGSNQRNNSSHNFLKLDITFNSKDIFNLAFLTLIFLICIYEAPIDLRSNCLMTLKHHLANSTSRQTSKVLMKLLGSNLEEQWMRSINLAITNWILELK

Query:  ANGRTLKTPSPLYSYSFSTYGLWKVQLYCPIIAMDNIENSSNPSTDERLQFSLNYHQLEGVLQFNYKLVVREKWIDLRVHVDNIRCDIIRLVNETLLSER
        A+GRTLKTPSP++SYSFSTYGLWKVQLYCPIIAMDNIENSSNPS+DERLQFSLNYHQLEGVLQFNYK VVREKWID+RVHVDNIRCDI+RLVNETLLSER
Subjt:  ANGRTLKTPSPLYSYSFSTYGLWKVQLYCPIIAMDNIENSSNPSTDERLQFSLNYHQLEGVLQFNYKLVVREKWIDLRVHVDNIRCDIIRLVNETLLSER

Query:  GVGGSEKHFPSRISLQLTPTSHTNIMSVSVSKSSSNPKIEVGSEKTFEAGFEPATPYPGLKVSVGETVMVSMKPWKFEQYVHGNAATLNWYLHDSSDGKE
        GVGGSEKHFPSRISLQLTPT HTNIMSVSVSKSSSNP IE+G+EKTFEAGFE A PYPGLK++VGETV VS+KPWKFEQ VHGNAATLNWYLHDSSDGKE
Subjt:  GVGGSEKHFPSRISLQLTPTSHTNIMSVSVSKSSSNPKIEVGSEKTFEAGFEPATPYPGLKVSVGETVMVSMKPWKFEQYVHGNAATLNWYLHDSSDGKE

Query:  VASTKPSKLALINPKAWFRDRYSSAYRPFNKQGGVIFAGDEYGESVWWKIDGKARGKTMEWEIRGWIWLTYWPNKHKTFYTETKRLEFKEILHLSIP
        VASTKPSKLALINPKAWFRDRYSSA RPFN+QGGVIFAGDEYG+SVWWKIDG ARGKTMEWEIRGWIW+TYWPNKHKTFYTETKRLEFKEIL LSIP
Subjt:  VASTKPSKLALINPKAWFRDRYSSAYRPFNKQGGVIFAGDEYGESVWWKIDGKARGKTMEWEIRGWIWLTYWPNKHKTFYTETKRLEFKEILHLSIP

A0A6J1FUY0 uncharacterized protein LOC1114470811.1e-26891.35Show/hide
Query:  MASCSFPDVYSWIQNLPPLSQWKTTFISTSICSSSSTSSSLKIVAAKSLHSPTITFSVIADYSFPISLWTSKPLKTSTKSSNLLDEESMSSLLVNCVHDV
        MASCSFPDVYSWIQ LPPLSQWKT+ ISTSIC+S+S SSSLKIVAAK+LHSPTIT S+IAD+SFPISLWTSKPLKTST SSNL DEE+MS+LL+NCVHDV
Subjt:  MASCSFPDVYSWIQNLPPLSQWKTTFISTSICSSSSTSSSLKIVAAKSLHSPTITFSVIADYSFPISLWTSKPLKTSTKSSNLLDEESMSSLLVNCVHDV

Query:  LYYGSNQRNNSSHNFLKLDITFNSKDIFNLAFLTLIFLICIYEAPIDLRSNCLMTLKHHLANSTSRQTSKVLMKLLGSNLEEQWMRSINLAITNWILELK
        LYYGSNQR NSSH  LKLDIT +S++IFNLAFLTLIFLICIYEAP DLRSNCLMTLKHHLANSTSRQ SKVLMKLLGSNLEEQWMRS+NLAITNW+LELK
Subjt:  LYYGSNQRNNSSHNFLKLDITFNSKDIFNLAFLTLIFLICIYEAPIDLRSNCLMTLKHHLANSTSRQTSKVLMKLLGSNLEEQWMRSINLAITNWILELK

Query:  ANGRTLKTPSPLYSYSFSTYGLWKVQLYCPIIAMDNIENSSNPSTDERLQFSLNYHQLEGVLQFNYKLVVREKWIDLRVHVDNIRCDIIRLVNETLLSER
        ANGRTLKTPSPLYSYSFST+GLWKVQLYCPIIAMDNIENSSNPSTDERLQFSLNYHQLEGVLQFNY+LVVR+KWID+RVHVDNIRCDI+RLVNETLLSER
Subjt:  ANGRTLKTPSPLYSYSFSTYGLWKVQLYCPIIAMDNIENSSNPSTDERLQFSLNYHQLEGVLQFNYKLVVREKWIDLRVHVDNIRCDIIRLVNETLLSER

Query:  GVGGSEKHFPSRISLQLTPTSHTNIMSVSVSKSSSNPKIEVGSEKTFEAGFEPATPYPGLKVSVGETVMVSMKPWKFEQYVHGNAATLNWYLHDSSDGKE
        GVGGSEKHFPSRISLQLTPTSHTNIMSVSVSKSS+NPKIE+G+E+TFEAGFEP+TPYPGLK+SVGET MVS+KPWKFEQ+VHGNAATLNWYLHDSSDGKE
Subjt:  GVGGSEKHFPSRISLQLTPTSHTNIMSVSVSKSSSNPKIEVGSEKTFEAGFEPATPYPGLKVSVGETVMVSMKPWKFEQYVHGNAATLNWYLHDSSDGKE

Query:  VASTKPSKLALINPKAWFRDRYSSAYRPFNKQGGVIFAGDEYGESVWWKIDGKARGKTMEWEIRGWIWLTYWPNKHKTFYTETKRLEFKEILHLSIP
        VASTKPSKL LINPKAWFRDRYSSA+RPFNKQGGVIFAGDEYGE+VWWKIDGKARGKTMEWEIRGWIWLTYWPNKHKTFYTETKRLEFKEILHLSIP
Subjt:  VASTKPSKLALINPKAWFRDRYSSAYRPFNKQGGVIFAGDEYGESVWWKIDGKARGKTMEWEIRGWIWLTYWPNKHKTFYTETKRLEFKEILHLSIP

A0A6J1GXT2 uncharacterized protein LOC1114581751.2e-26289.94Show/hide
Query:  MASCSFPDVYSWIQNLPPLSQWKTTFISTSICSSSSTSSSLKIVAAKSLHSPTITFSVIADYSFPISLWTSKPLKTSTKSSNLLDEESMSSLLVNCVHDV
        MASCSFPDVYSWIQNLPPLSQWKTT ISTSICSSSST+SSLKIVAAK+LHSPTITFSV AD+SF ISLWTS+PLKTSTK+SNLL++ESMS+LL+NCV DV
Subjt:  MASCSFPDVYSWIQNLPPLSQWKTTFISTSICSSSSTSSSLKIVAAKSLHSPTITFSVIADYSFPISLWTSKPLKTSTKSSNLLDEESMSSLLVNCVHDV

Query:  LYYGSNQRNNSSHNFLKLDITFNSKDIFNLAFLTLIFLICIYEAPIDLRSNCLMTLKHHLANSTSRQTSKVLMKLLGSNLEEQWMRSINLAITNWILELK
        LYYGSN + NSSHN LKLDIT + K+IFN  FLTLIFLIC YEAPIDLRSNCL+TLKHHLAN TSRQTSKVLMKLLGSNLE+QWMRSINLAITNWILELK
Subjt:  LYYGSNQRNNSSHNFLKLDITFNSKDIFNLAFLTLIFLICIYEAPIDLRSNCLMTLKHHLANSTSRQTSKVLMKLLGSNLEEQWMRSINLAITNWILELK

Query:  ANGRTLKTPSPLYSYSFSTYGLWKVQLYCPIIAMDNIENSSNPSTDERLQFSLNYHQLEGVLQFNYKLVVREKWIDLRVHVDNIRCDIIRLVNETLLSER
        A GRTLKTPSPL+SYSFSTYGLWKVQLYCPIIAMDNIENSSNPSTDERLQFSLNYHQLEGVLQFNY+ V REKWIDLRVHVDNIRCDIIRLV+ETLLSER
Subjt:  ANGRTLKTPSPLYSYSFSTYGLWKVQLYCPIIAMDNIENSSNPSTDERLQFSLNYHQLEGVLQFNYKLVVREKWIDLRVHVDNIRCDIIRLVNETLLSER

Query:  GVGGSEKHFPSRISLQLTPTSHTNIMSVSVSKSSSNPKIEVGSEKTFEAGFEPATPYPGLKVSVGETVMVSMKPWKFEQYVHGNAATLNWYLHDSSDGKE
        GVGGSEKHFPSRISLQLTPT HTNIMSVSVSKSS+NPK+E+G+EKTFEAGFE ATP+PGLK++VGETV+VS+KPWKFEQ+VHGNAATLNWYLHDSSDGKE
Subjt:  GVGGSEKHFPSRISLQLTPTSHTNIMSVSVSKSSSNPKIEVGSEKTFEAGFEPATPYPGLKVSVGETVMVSMKPWKFEQYVHGNAATLNWYLHDSSDGKE

Query:  VASTKPSKLALINPKAWFRDRYSSAYRPFNKQGGVIFAGDEYGESVWWKIDGKARGKTMEWEIRGWIWLTYWPNKHKTFYTETKRLEFKEILHLSIP
        VASTKPSKL LINPKAWFRDRYSSA RPFNKQGGV+FAGDEYGESVWWKIDGKARGKTMEWEIRGWIWLTYWPNKHKTFYTET+RLEFKEIL+LSIP
Subjt:  VASTKPSKLALINPKAWFRDRYSSAYRPFNKQGGVIFAGDEYGESVWWKIDGKARGKTMEWEIRGWIWLTYWPNKHKTFYTETKRLEFKEILHLSIP

A0A6J1JLL6 uncharacterized protein LOC1114869874.4e-26290.14Show/hide
Query:  MASCSFPDVYSWIQNLPPLSQWKTTFISTSICSSSSTSSSLKIVAAKSLHSPTITFSVIADYSFPISLWTSKPLKTSTKSSNLLDEESMSSLLVNCVHDV
        MASCSFPDVY WIQNLPPLSQWKTT ISTSICSSSST+SSLKIVAAK+LHSPTITFSV AD+SF ISLWTS+PLKTSTK+SNLL++ESMS+LL+NCV DV
Subjt:  MASCSFPDVYSWIQNLPPLSQWKTTFISTSICSSSSTSSSLKIVAAKSLHSPTITFSVIADYSFPISLWTSKPLKTSTKSSNLLDEESMSSLLVNCVHDV

Query:  LYYGSNQRNNSSHNFLKLDITFNSKDIFNLAFLTLIFLICIYEAPIDLRSNCLMTLKHHLANSTSRQTSKVLMKLLGSNLEEQWMRSINLAITNWILELK
        LYYGSN + NSSHN LKLDIT + K+IFN  FLTLIFLICIYEAPIDLRSNCL TLKHHLAN TSRQTSKVLMKLLGSNLE+QWMRSINLAITNWILELK
Subjt:  LYYGSNQRNNSSHNFLKLDITFNSKDIFNLAFLTLIFLICIYEAPIDLRSNCLMTLKHHLANSTSRQTSKVLMKLLGSNLEEQWMRSINLAITNWILELK

Query:  ANGRTLKTPSPLYSYSFSTYGLWKVQLYCPIIAMDNIENSSNPSTDERLQFSLNYHQLEGVLQFNYKLVVREKWIDLRVHVDNIRCDIIRLVNETLLSER
        A GRTLKTPSPL+SYSFSTYGLWKVQLYCPIIAMDNIENSSNPSTDERLQFSLNYHQLEGVLQFNY+ V REKWIDLRVHVDNIRCDIIRLV+ETLLSER
Subjt:  ANGRTLKTPSPLYSYSFSTYGLWKVQLYCPIIAMDNIENSSNPSTDERLQFSLNYHQLEGVLQFNYKLVVREKWIDLRVHVDNIRCDIIRLVNETLLSER

Query:  GVGGSEKHFPSRISLQLTPTSHTNIMSVSVSKSSSNPKIEVGSEKTFEAGFEPATPYPGLKVSVGETVMVSMKPWKFEQYVHGNAATLNWYLHDSSDGKE
        GVGGSEKHFPSRISLQLTPT HTNIMSVSVSKSSSNPK+++G+EKTFEAGFE ATP+PGLK++VGETV+VS+KPWKFEQ+VHGNAATLNWYLHDSSDGKE
Subjt:  GVGGSEKHFPSRISLQLTPTSHTNIMSVSVSKSSSNPKIEVGSEKTFEAGFEPATPYPGLKVSVGETVMVSMKPWKFEQYVHGNAATLNWYLHDSSDGKE

Query:  VASTKPSKLALINPKAWFRDRYSSAYRPFNKQGGVIFAGDEYGESVWWKIDGKARGKTMEWEIRGWIWLTYWPNKHKTFYTETKRLEFKEILHLSIP
        VASTKPSKL LINPKAWFRDRYSSA RPFNKQGGVIFAGDEYGESVWWKIDGKARGKTMEWEIRGWIWLTYWPNKHKTFYTET+RLEFKEIL+LSIP
Subjt:  VASTKPSKLALINPKAWFRDRYSSAYRPFNKQGGVIFAGDEYGESVWWKIDGKARGKTMEWEIRGWIWLTYWPNKHKTFYTETKRLEFKEILHLSIP

A0A6J1JZ56 uncharacterized protein LOC1114910282.8e-26991.75Show/hide
Query:  MASCSFPDVYSWIQNLPPLSQWKTTFISTSICSSSSTSSSLKIVAAKSLHSPTITFSVIADYSFPISLWTSKPLKTSTKSSNLLDEESMSSLLVNCVHDV
        MASCSFPDVYSWIQ LPPLSQWKT+ ISTSIC+S+S SSSLKIVAAK+LHSPTIT S+IAD+SFPISLWTSKPLKTST SSNL DEE+MS+LL+NCVHDV
Subjt:  MASCSFPDVYSWIQNLPPLSQWKTTFISTSICSSSSTSSSLKIVAAKSLHSPTITFSVIADYSFPISLWTSKPLKTSTKSSNLLDEESMSSLLVNCVHDV

Query:  LYYGSNQRNNSSHNFLKLDITFNSKDIFNLAFLTLIFLICIYEAPIDLRSNCLMTLKHHLANSTSRQTSKVLMKLLGSNLEEQWMRSINLAITNWILELK
        LYYGSNQR NSSH  LKLDIT +SKDIFNLAFLTLIFLICIYEAP DLRSNCLMTLKHHLANSTSRQ SKVLMKLLGSNLEEQWMRS+NLAITNW+LELK
Subjt:  LYYGSNQRNNSSHNFLKLDITFNSKDIFNLAFLTLIFLICIYEAPIDLRSNCLMTLKHHLANSTSRQTSKVLMKLLGSNLEEQWMRSINLAITNWILELK

Query:  ANGRTLKTPSPLYSYSFSTYGLWKVQLYCPIIAMDNIENSSNPSTDERLQFSLNYHQLEGVLQFNYKLVVREKWIDLRVHVDNIRCDIIRLVNETLLSER
        ANGRTLKTPSPLYSYSFST+GLWKVQLYCPIIAMDNIENSSNPSTDERLQFSLNYHQLEGVLQFNY+LVVR+KWID+RVHVDNIRCDIIRLVNETLLSER
Subjt:  ANGRTLKTPSPLYSYSFSTYGLWKVQLYCPIIAMDNIENSSNPSTDERLQFSLNYHQLEGVLQFNYKLVVREKWIDLRVHVDNIRCDIIRLVNETLLSER

Query:  GVGGSEKHFPSRISLQLTPTSHTNIMSVSVSKSSSNPKIEVGSEKTFEAGFEPATPYPGLKVSVGETVMVSMKPWKFEQYVHGNAATLNWYLHDSSDGKE
        GVGGSEKHFPSRISLQLTPTSHTNIMSVSVSKSS+NP+IEVG+E+TFEAGFEP+TPYPGLK+SVGET MVS+KPWKFEQ+VHGNAATLNWYLHDSSDGKE
Subjt:  GVGGSEKHFPSRISLQLTPTSHTNIMSVSVSKSSSNPKIEVGSEKTFEAGFEPATPYPGLKVSVGETVMVSMKPWKFEQYVHGNAATLNWYLHDSSDGKE

Query:  VASTKPSKLALINPKAWFRDRYSSAYRPFNKQGGVIFAGDEYGESVWWKIDGKARGKTMEWEIRGWIWLTYWPNKHKTFYTETKRLEFKEILHLSIP
        VASTKPSKLALINPKAWFRDRYSSA+RPFNKQGGVIFAGDEYGE+VWWKIDGKARGKTMEWEI+GWIWLTYWPNKHKTFYTETKRLEFKEI+HLSIP
Subjt:  VASTKPSKLALINPKAWFRDRYSSAYRPFNKQGGVIFAGDEYGESVWWKIDGKARGKTMEWEIRGWIWLTYWPNKHKTFYTETKRLEFKEILHLSIP

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G15020.1 unknown protein2.0e-4125.78Show/hide
Query:  DVYSWIQNLPPLSQWKTTFISTSICSSSSTSSSLKIVAAKSL----HSPTITFSVIADYSFPISLWTSKPLKTSTKSSNLLDEESMSSLLVNCVHDVLYY
        D +SWI  LP   ++  +        +   + S+K+ A ++L     S ++TF+V+A+     +L  S  +  S  +  L  E+    L++  + +++  
Subjt:  DVYSWIQNLPPLSQWKTTFISTSICSSSSTSSSLKIVAAKSL----HSPTITFSVIADYSFPISLWTSKPLKTSTKSSNLLDEESMSSLLVNCVHDVLYY

Query:  GSNQRNNSSHNFLKLDI---------------TFNSKDIFNLAFLTLIFLICIYEAPIDLRSNCLMTLKHHLANSTSRQTSKVL---MKLLGSNLEEQWM
             + +   F +L+I               +F+S  +FNL  LT +F +C+++AP ++ S     L     N+ + Q + VL   +  LG + E   +
Subjt:  GSNQRNNSSHNFLKLDI---------------TFNSKDIFNLAFLTLIFLICIYEAPIDLRSNCLMTLKHHLANSTSRQTSKVL---MKLLGSNLEEQWM

Query:  RSINLAITNWI---------LELKANGRTLKTPSPL-YSYSFSTYGLWKVQLYCPIIAMDNIENSSN---------PSTDER---LQFSLNYHQLEGVLQ
        R+ + A++ W+         L LK    +L     L +SY+   +GLW ++ Y PI++M+   NSSN         P  + +   L+++L++ Q E ++Q
Subjt:  RSINLAITNWI---------LELKANGRTLKTPSPL-YSYSFSTYGLWKVQLYCPIIAMDNIENSSN---------PSTDER---LQFSLNYHQLEGVLQ

Query:  FNYKLVVREKWIDLRVHVDNIRCDIIRLVNETLLSERGVG---------GSEKHFPSRISLQLTP-TSHTNIMSVSVSKSSSNPKIEVGSEKTFEAGFEP
        F Y +   E +I +   VDNIR  + +L       + GVG           E++FPSR+ + L P    +++  +S+ +S+ N + ++   +  +  F  
Subjt:  FNYKLVVREKWIDLRVHVDNIRCDIIRLVNETLLSERGVG---------GSEKHFPSRISLQLTP-TSHTNIMSVSVSKSSSNPKIEVGSEKTFEAGFEP

Query:  ATPYPGLKVSVGETVMVSMKPWKFEQYVHGNAATLNWYLHDSSDGKEVASTKPSKLALINPKAWFRDRYSSAYRPFNKQGGVIFAGDEYGESVWWKIDGK
            P +K          +K W+ EQ   GNAA  +  L+D   G+EV + KP            +         F K GG++F  DEYG+ V W++  +
Subjt:  ATPYPGLKVSVGETVMVSMKPWKFEQYVHGNAATLNWYLHDSSDGKEVASTKPSKLALINPKAWFRDRYSSAYRPFNKQGGVIFAGDEYGESVWWKIDGK

Query:  ARGKTMEWEIRGWIWLTYWPNKHKTFYTETKRLEFKEILHLSI
          G  ++W + G IWLTYWPNK  T + ET+ +E+ + + L +
Subjt:  ARGKTMEWEIRGWIWLTYWPNKHKTFYTETKRLEFKEILHLSI

AT2G40390.1 unknown protein1.5e-16658.12Show/hide
Query:  MASCSFPDVYSWIQNLPPLSQWKTTFISTSICSSSSTSSSLKIVAAKSLHSPT-ITFSVIADYSFPISLWTSKPLKT-STKSSNLLDEESMSSLLVNCVH
        MASC  PD ++W+Q LPPLS WK   +S  ICS +S+  SL     ++  SP   TFS++A++  PI+L+ SK  +T ST S+  L+E  +S+LL+  V 
Subjt:  MASCSFPDVYSWIQNLPPLSQWKTTFISTSICSSSSTSSSLKIVAAKSLHSPT-ITFSVIADYSFPISLWTSKPLKT-STKSSNLLDEESMSSLLVNCVH

Query:  DVLYYGSNQRNNSSHNFLKLDITFNSKDIFNLAFLTLIFLICIYEAPIDLRSNCLMTLKHHLANSTSRQTSKVLMKLLGSNLEEQWMRSINLAITNWILE
         VL Y + +R   S     L  T N KD+FNLAF T +FLICIYEAP  LR+ CL T+K  L    SRQ SK+LM  LGSNLEEQWMRS+NLAITNWI+E
Subjt:  DVLYYGSNQRNNSSHNFLKLDITFNSKDIFNLAFLTLIFLICIYEAPIDLRSNCLMTLKHHLANSTSRQTSKVLMKLLGSNLEEQWMRSINLAITNWILE

Query:  LKANGRTLKTPSPLYSYSFSTYGLWKVQLYCPIIAMDNIENSSNPSTDERLQFSLNYHQLEGVLQFNYKLVVREKWIDLRVHVDNIRCDIIRLVNETLLS
        +KA  + LK+PSPL+SY+FST GLWKV +YCP++AM+ +E+ ++   DERL FSLNYHQLEGV+Q N+++ VREKW ++ V++DN+RCDIIRLVNE LLS
Subjt:  LKANGRTLKTPSPLYSYSFSTYGLWKVQLYCPIIAMDNIENSSNPSTDERLQFSLNYHQLEGVLQFNYKLVVREKWIDLRVHVDNIRCDIIRLVNETLLS

Query:  ERGVGGSEKHFPSRISLQLTPTSHTNIMSVSVSKSSSNPKIEVGSEKTFEAGFEPATPYPGLKVSVGETVMVSMKPWKFEQYVHGNAATLNWYLHDSSDG
        ERG+G  EKHFPSRISLQLTPT+ +NI+ VSV KSS NP  E   EK  EA  +P   + GLKVS  ET   SMKPWKFE++VHG +A L W+LHD  DG
Subjt:  ERGVGGSEKHFPSRISLQLTPTSHTNIMSVSVSKSSSNPKIEVGSEKTFEAGFEPATPYPGLKVSVGETVMVSMKPWKFEQYVHGNAATLNWYLHDSSDG

Query:  KEVASTKPSKLALINPKAWFRDRYSSAYRPFNKQGGVIFAGDEYGESVWWKIDGKARGKTMEWEIRGWIWLTYWPNKHKTFYTETKRLEFKEILHLSIP
        +EV+S+KPSK++++NP+AWF++RYSSA+RPF KQGGV+FAGD YG+SV WK+D  A GK ME+E++G +WLTYWPNKH TFY++T++LEFKE+L+L++P
Subjt:  KEVASTKPSKLALINPKAWFRDRYSSAYRPFNKQGGVIFAGDEYGESVWWKIDGKARGKTMEWEIRGWIWLTYWPNKHKTFYTETKRLEFKEILHLSIP

AT5G64190.1 unknown protein2.1e-14754.71Show/hide
Query:  FPDVYSWIQNLPPLSQWKTTFISTSICSSSS--TSSSLKIVAAKSLHSPTITFSVI--ADYSFPISLWTSK-PLKTSTKSSNLLDEESMSSLLVNCVHDV
        FPDV++WIQN+P +++W+TT +   IC S+S   +S+L + A KS     +TFS+I  ++   P+ LWT+K  L  +  S N  DE ++ SLL N V  +
Subjt:  FPDVYSWIQNLPPLSQWKTTFISTSICSSSS--TSSSLKIVAAKSLHSPTITFSVI--ADYSFPISLWTSK-PLKTSTKSSNLLDEESMSSLLVNCVHDV

Query:  LYYGSNQRNNSSHNFLKLDIT--FNSKDIFNLAFLTLIFLICIYEAPIDLRSNCLMTLKHHLANSTSRQTSKVLMKLLGSNLEEQWMRSINLAITNWILE
        L Y SN  N S+      D +     KDI N   LTL F++C+YEAP+ LR NCL TLK+HL    +R+ +  LMKLLGSNLEEQWMR++NLA TNWI+E
Subjt:  LYYGSNQRNNSSHNFLKLDIT--FNSKDIFNLAFLTLIFLICIYEAPIDLRSNCLMTLKHHLANSTSRQTSKVLMKLLGSNLEEQWMRSINLAITNWILE

Query:  LKANGRTLKTPSPLYSYSFSTYGLWKVQLYCPIIAMDNIENSSNPSTDERLQFSLNYHQLEGVLQFNYKLVVREKWIDLRVHVDNIRCDIIRLVNETLLS
         + +  T  T +PL+SY+ S YGLWKVQLYCP+ AM+ +E SSNP+ D RL FSL ++QLEGV+QFN+K+VVR+ WID+ V +DNIR D+I+LVNE L+S
Subjt:  LKANGRTLKTPSPLYSYSFSTYGLWKVQLYCPIIAMDNIENSSNPSTDERLQFSLNYHQLEGVLQFNYKLVVREKWIDLRVHVDNIRCDIIRLVNETLLS

Query:  ERGVGGSEKHFPSRISLQLTPTSHTNIMSVSVSKSSSNPKIEVGSEKTFEAGFEPATPYPGLKVSVGETVMVSMKPWKFEQYVHGNAATLNWYLHDSS-D
         RG G  EKHFPSRISLQLTPT  T+ +SVSVSKSS+NP  E   E++ E  F+P     GL+V+  E   ++M PWK EQ V G  A LNW L+DSS  
Subjt:  ERGVGGSEKHFPSRISLQLTPTSHTNIMSVSVSKSSSNPKIEVGSEKTFEAGFEPATPYPGLKVSVGETVMVSMKPWKFEQYVHGNAATLNWYLHDSS-D

Query:  GKEVASTKPSKLALINPKAWFRDRYSSAYRPFNKQGGVIFAGDEYGESVWWKIDGKARGKTMEWEIRGWIWLTYWPNKHKTFYTETKRLEFKEILHLSI
        G+EV STKPS+ ++++P++WF+DRY+ AYR F ++GGVIFAGDEYGESV WKI   A G TMEWEI+G+IWLTYWPNK+KTFY ET+RLEF ++L+L+I
Subjt:  GKEVASTKPSKLALINPKAWFRDRYSSAYRPFNKQGGVIFAGDEYGESVWWKIDGKARGKTMEWEIRGWIWLTYWPNKHKTFYTETKRLEFKEILHLSI


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTTCTTGTAGCTTTCCTGATGTTTATTCCTGGATACAGAACCTACCACCACTTTCTCAATGGAAAACTACTTTCATTTCTACATCCATTTGCTCTTCAAGCTCAAC
CAGCTCCTCTCTCAAAATTGTTGCTGCAAAAAGCCTTCATTCCCCAACCATTACTTTCTCTGTTATTGCAGATTACAGCTTTCCTATCTCCCTTTGGACATCAAAACCCT
TGAAAACCAGCACAAAATCTTCAAATTTATTAGATGAGGAAAGCATGTCCAGTCTCTTGGTTAATTGTGTTCATGATGTTCTCTATTACGGCTCAAACCAAAGAAATAAT
TCTAGCCATAACTTTCTCAAACTCGACATCACTTTCAACTCGAAGGATATCTTCAATCTCGCATTTCTTACCCTCATATTCCTAATCTGCATCTATGAAGCTCCGATTGA
TCTCCGTTCGAATTGTCTTATGACTCTCAAGCATCATTTGGCAAATAGTACGTCAAGGCAGACGTCGAAGGTGCTAATGAAACTGTTGGGGTCTAATCTAGAAGAGCAAT
GGATGAGGTCGATAAACCTTGCAATCACCAACTGGATATTGGAGCTCAAGGCCAACGGCCGTACTTTGAAAACACCCTCACCTTTGTACTCGTATTCATTTTCAACGTAT
GGGCTGTGGAAAGTTCAACTCTATTGCCCTATCATTGCGATGGATAATATTGAGAACTCCAGCAATCCTTCAACCGATGAAAGATTGCAATTCTCTTTAAATTATCACCA
ACTTGAAGGGGTTCTGCAGTTCAATTACAAGCTTGTGGTTCGAGAAAAGTGGATTGATCTGAGGGTGCACGTTGATAACATAAGGTGTGACATCATCCGGCTTGTGAACG
AGACTCTCTTATCCGAACGAGGAGTTGGTGGATCAGAAAAGCATTTTCCATCACGAATTTCACTGCAACTCACTCCAACTTCCCACACAAACATAATGAGTGTCTCAGTA
AGCAAATCCTCAAGTAACCCCAAAATCGAGGTTGGATCTGAAAAAACCTTTGAAGCTGGATTTGAACCTGCGACACCTTACCCAGGCCTCAAAGTATCAGTTGGGGAGAC
TGTAATGGTGAGCATGAAGCCATGGAAATTTGAGCAGTACGTCCATGGCAATGCTGCAACTCTTAATTGGTACCTTCATGACAGTTCGGATGGGAAAGAGGTGGCCTCCA
CCAAGCCATCAAAACTTGCACTAATAAACCCTAAAGCTTGGTTTCGGGACCGTTACTCGAGTGCTTACAGACCTTTTAACAAACAGGGAGGGGTCATATTTGCAGGAGAC
GAGTATGGAGAGAGTGTGTGGTGGAAGATTGATGGAAAGGCAAGAGGGAAAACTATGGAGTGGGAAATTAGAGGCTGGATCTGGTTAACTTATTGGCCAAACAAACACAA
GACATTTTACACCGAAACCAAGAGGCTGGAATTCAAAGAGATTCTCCATCTTTCAATTCCTTAG
mRNA sequenceShow/hide mRNA sequence
ATGGCTTCTTGTAGCTTTCCTGATGTTTATTCCTGGATACAGAACCTACCACCACTTTCTCAATGGAAAACTACTTTCATTTCTACATCCATTTGCTCTTCAAGCTCAAC
CAGCTCCTCTCTCAAAATTGTTGCTGCAAAAAGCCTTCATTCCCCAACCATTACTTTCTCTGTTATTGCAGATTACAGCTTTCCTATCTCCCTTTGGACATCAAAACCCT
TGAAAACCAGCACAAAATCTTCAAATTTATTAGATGAGGAAAGCATGTCCAGTCTCTTGGTTAATTGTGTTCATGATGTTCTCTATTACGGCTCAAACCAAAGAAATAAT
TCTAGCCATAACTTTCTCAAACTCGACATCACTTTCAACTCGAAGGATATCTTCAATCTCGCATTTCTTACCCTCATATTCCTAATCTGCATCTATGAAGCTCCGATTGA
TCTCCGTTCGAATTGTCTTATGACTCTCAAGCATCATTTGGCAAATAGTACGTCAAGGCAGACGTCGAAGGTGCTAATGAAACTGTTGGGGTCTAATCTAGAAGAGCAAT
GGATGAGGTCGATAAACCTTGCAATCACCAACTGGATATTGGAGCTCAAGGCCAACGGCCGTACTTTGAAAACACCCTCACCTTTGTACTCGTATTCATTTTCAACGTAT
GGGCTGTGGAAAGTTCAACTCTATTGCCCTATCATTGCGATGGATAATATTGAGAACTCCAGCAATCCTTCAACCGATGAAAGATTGCAATTCTCTTTAAATTATCACCA
ACTTGAAGGGGTTCTGCAGTTCAATTACAAGCTTGTGGTTCGAGAAAAGTGGATTGATCTGAGGGTGCACGTTGATAACATAAGGTGTGACATCATCCGGCTTGTGAACG
AGACTCTCTTATCCGAACGAGGAGTTGGTGGATCAGAAAAGCATTTTCCATCACGAATTTCACTGCAACTCACTCCAACTTCCCACACAAACATAATGAGTGTCTCAGTA
AGCAAATCCTCAAGTAACCCCAAAATCGAGGTTGGATCTGAAAAAACCTTTGAAGCTGGATTTGAACCTGCGACACCTTACCCAGGCCTCAAAGTATCAGTTGGGGAGAC
TGTAATGGTGAGCATGAAGCCATGGAAATTTGAGCAGTACGTCCATGGCAATGCTGCAACTCTTAATTGGTACCTTCATGACAGTTCGGATGGGAAAGAGGTGGCCTCCA
CCAAGCCATCAAAACTTGCACTAATAAACCCTAAAGCTTGGTTTCGGGACCGTTACTCGAGTGCTTACAGACCTTTTAACAAACAGGGAGGGGTCATATTTGCAGGAGAC
GAGTATGGAGAGAGTGTGTGGTGGAAGATTGATGGAAAGGCAAGAGGGAAAACTATGGAGTGGGAAATTAGAGGCTGGATCTGGTTAACTTATTGGCCAAACAAACACAA
GACATTTTACACCGAAACCAAGAGGCTGGAATTCAAAGAGATTCTCCATCTTTCAATTCCTTAG
Protein sequenceShow/hide protein sequence
MASCSFPDVYSWIQNLPPLSQWKTTFISTSICSSSSTSSSLKIVAAKSLHSPTITFSVIADYSFPISLWTSKPLKTSTKSSNLLDEESMSSLLVNCVHDVLYYGSNQRNN
SSHNFLKLDITFNSKDIFNLAFLTLIFLICIYEAPIDLRSNCLMTLKHHLANSTSRQTSKVLMKLLGSNLEEQWMRSINLAITNWILELKANGRTLKTPSPLYSYSFSTY
GLWKVQLYCPIIAMDNIENSSNPSTDERLQFSLNYHQLEGVLQFNYKLVVREKWIDLRVHVDNIRCDIIRLVNETLLSERGVGGSEKHFPSRISLQLTPTSHTNIMSVSV
SKSSSNPKIEVGSEKTFEAGFEPATPYPGLKVSVGETVMVSMKPWKFEQYVHGNAATLNWYLHDSSDGKEVASTKPSKLALINPKAWFRDRYSSAYRPFNKQGGVIFAGD
EYGESVWWKIDGKARGKTMEWEIRGWIWLTYWPNKHKTFYTETKRLEFKEILHLSIP