; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0012065 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0012065
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionZinc finger family protein, putative isoform 1
Genome locationchr1:37011478..37015725
RNA-Seq ExpressionLag0012065
SyntenyLag0012065
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0016021 - integral component of membrane (cellular component)
GO:0004175 - endopeptidase activity (molecular function)
GO:0008236 - serine-type peptidase activity (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KGN54878.2 hypothetical protein Csa_012907 [Cucumis sativus]1.0e-22582.39Show/hide
Query:  MGKNDGEQPPPSAVGSRPSGQAADGRCCCGCLSIRRLIGFRCIFILLLSVALFVSAVFWLPPFLHYADQKDLGLNPSYRGHDIVATFDVERAISLLEDNI
        MGKNDGEQP PSA+ SRPSG  ADGRCCCGC+SIRRLIGFRCIFILLLSVALFVSAVFWLPPFLHYADQKDL LNPSYRGHDIVATF+VER++SLLEDN 
Subjt:  MGKNDGEQPPPSAVGSRPSGQAADGRCCCGCLSIRRLIGFRCIFILLLSVALFVSAVFWLPPFLHYADQKDLGLNPSYRGHDIVATFDVERAISLLEDNI

Query:  EQLRTDIYEEFLIPSIKVDILSLESLSGSNRTKVVFGVDPDADDSEIPSTSLSLIRSIFASIVTNQSFLRITKSTFGEAFSFEVLKFPGGITIIPPQSAF
        +QLRTDI+EEF IPSIKV+ILSLE LSGSNRTKVVF +DPD DDSEI ST LSLIRSI  S+VTNQ FL ITKSTFGEA+SFEVLKFPGGITIIPPQSAF
Subjt:  EQLRTDIYEEFLIPSIKVDILSLESLSGSNRTKVVFGVDPDADDSEIPSTSLSLIRSIFASIVTNQSFLRITKSTFGEAFSFEVLKFPGGITIIPPQSAF

Query:  LLQKVQILFNFTLNFSIHQIQVHFSELTSQLEAGLRLAPYEILYIKLWNAEGSTMTAPTIVQSSVLLEVGNTPSMRRLKQLAQTISGSNSSNLGLNNTEF
        LLQKVQILFNFTLNFSIHQIQVHFSELTSQLEAGLRLAPYEILYIKLWNAEGST+T PTIVQ+SVLLEVGNTPSMRRLKQLAQTISGSNSSNLGLNNTEF
Subjt:  LLQKVQILFNFTLNFSIHQIQVHFSELTSQLEAGLRLAPYEILYIKLWNAEGSTMTAPTIVQSSVLLEVGNTPSMRRLKQLAQTISGSNSSNLGLNNTEF

Query:  GKVS------------------------FSCPTPQPHNYHRPPTHHHHHHHTPLTPAISPAPATKKGALKYGSPAPE-SVASPKKSHEAKPPGCQYRYKR
        GKV                            PTPQPHN H PPTHHHHHHHTPLTPAISPAPAT+KGA +YGSPAPE + ASPK+S+ AKPPGCQYRYKR
Subjt:  GKVS------------------------FSCPTPQPHNYHRPPTHHHHHHHTPLTPAISPAPATKKGALKYGSPAPE-SVASPKKSHEAKPPGCQYRYKR

Query:  KSGRKEGKQSHLTPLASPNISPVHSAASPSPQHQVKPPATPVSPTPALTPLPNVIYAHVQPPSKSDPNHPEKSTTNPSATPSPSPSPSGADHCHMITRWG
        KSGRKEGKQSHLTPLASPNISP HSAASPSPQHQ+ PPA PVSP PALTPLPNVIYAHVQPPSKSD NHP        A PS +PSPSGAD CHMIT+WG
Subjt:  KSGRKEGKQSHLTPLASPNISPVHSAASPSPQHQVKPPATPVSPTPALTPLPNVIYAHVQPPSKSDPNHPEKSTTNPSATPSPSPSPSGADHCHMITRWG

Query:  FALFLILAFHM
        F LFLILA HM
Subjt:  FALFLILAFHM

XP_004144318.1 uncharacterized protein LOC101216010 isoform X1 [Cucumis sativus]1.0e-22582.39Show/hide
Query:  MGKNDGEQPPPSAVGSRPSGQAADGRCCCGCLSIRRLIGFRCIFILLLSVALFVSAVFWLPPFLHYADQKDLGLNPSYRGHDIVATFDVERAISLLEDNI
        MGKNDGEQP PSA+ SRPSG  ADGRCCCGC+SIRRLIGFRCIFILLLSVALFVSAVFWLPPFLHYADQKDL LNPSYRGHDIVATF+VER++SLLEDN 
Subjt:  MGKNDGEQPPPSAVGSRPSGQAADGRCCCGCLSIRRLIGFRCIFILLLSVALFVSAVFWLPPFLHYADQKDLGLNPSYRGHDIVATFDVERAISLLEDNI

Query:  EQLRTDIYEEFLIPSIKVDILSLESLSGSNRTKVVFGVDPDADDSEIPSTSLSLIRSIFASIVTNQSFLRITKSTFGEAFSFEVLKFPGGITIIPPQSAF
        +QLRTDI+EEF IPSIKV+ILSLE LSGSNRTKVVF +DPD DDSEI ST LSLIRSI  S+VTNQ FL ITKSTFGEA+SFEVLKFPGGITIIPPQSAF
Subjt:  EQLRTDIYEEFLIPSIKVDILSLESLSGSNRTKVVFGVDPDADDSEIPSTSLSLIRSIFASIVTNQSFLRITKSTFGEAFSFEVLKFPGGITIIPPQSAF

Query:  LLQKVQILFNFTLNFSIHQIQVHFSELTSQLEAGLRLAPYEILYIKLWNAEGSTMTAPTIVQSSVLLEVGNTPSMRRLKQLAQTISGSNSSNLGLNNTEF
        LLQKVQILFNFTLNFSIHQIQVHFSELTSQLEAGLRLAPYEILYIKLWNAEGST+T PTIVQ+SVLLEVGNTPSMRRLKQLAQTISGSNSSNLGLNNTEF
Subjt:  LLQKVQILFNFTLNFSIHQIQVHFSELTSQLEAGLRLAPYEILYIKLWNAEGSTMTAPTIVQSSVLLEVGNTPSMRRLKQLAQTISGSNSSNLGLNNTEF

Query:  GKVS------------------------FSCPTPQPHNYHRPPTHHHHHHHTPLTPAISPAPATKKGALKYGSPAPE-SVASPKKSHEAKPPGCQYRYKR
        GKV                            PTPQPHN H PPTHHHHHHHTPLTPAISPAPAT+KGA +YGSPAPE + ASPK+S+ AKPPGCQYRYKR
Subjt:  GKVS------------------------FSCPTPQPHNYHRPPTHHHHHHHTPLTPAISPAPATKKGALKYGSPAPE-SVASPKKSHEAKPPGCQYRYKR

Query:  KSGRKEGKQSHLTPLASPNISPVHSAASPSPQHQVKPPATPVSPTPALTPLPNVIYAHVQPPSKSDPNHPEKSTTNPSATPSPSPSPSGADHCHMITRWG
        KSGRKEGKQSHLTPLASPNISP HSAASPSPQHQ+ PPA PVSP PALTPLPNVIYAHVQPPSKSD NHP        A PS +PSPSGAD CHMIT+WG
Subjt:  KSGRKEGKQSHLTPLASPNISPVHSAASPSPQHQVKPPATPVSPTPALTPLPNVIYAHVQPPSKSDPNHPEKSTTNPSATPSPSPSPSGADHCHMITRWG

Query:  FALFLILAFHM
        F LFLILA HM
Subjt:  FALFLILAFHM

XP_008455751.1 PREDICTED: uncharacterized protein LOC103495852 [Cucumis melo]4.5e-22181.02Show/hide
Query:  MGKNDGEQPPPSAVGSRPSGQAADGRCCCGCLSIRRLIGFRCIFILLLSVALFVSAVFWLPPFLHYADQKDLGLNPSYRGHDIVATFDVERAISLLEDNI
        MGKNDGEQP PSA+ SRPSG  ADGRCC GC+SIRRLIGFRCIFILLLSVALFVSAV WLPPF+HYADQKDLGLNPSYRGHDIVATF+VER++SLLEDN 
Subjt:  MGKNDGEQPPPSAVGSRPSGQAADGRCCCGCLSIRRLIGFRCIFILLLSVALFVSAVFWLPPFLHYADQKDLGLNPSYRGHDIVATFDVERAISLLEDNI

Query:  EQLRTDIYEEFLIPSIKVDILSLESLSGSNRTKVVFGVDPDADDSEIPSTSLSLIRSIFASIVTNQSFLRITKSTFGEAFSFEVLKFPGGITIIPPQSAF
        +QLRTDI+EEF IPSIKV+ILSLE LSGSNRTKVVF +DPD DDSEI ST LSLIRSI  S+VTNQ FL ITKSTFGEA+SFEVLKFPGGITIIPPQSAF
Subjt:  EQLRTDIYEEFLIPSIKVDILSLESLSGSNRTKVVFGVDPDADDSEIPSTSLSLIRSIFASIVTNQSFLRITKSTFGEAFSFEVLKFPGGITIIPPQSAF

Query:  LLQKVQILFNFTLNFSIHQIQVHFSELTSQLEAGLRLAPYEILYIKLWNAEGSTMTAPTIVQSSVLLEVGNTPSMRRLKQLAQTISGSNSSNLGLNNTEF
        LLQKVQILFNFTLNFSIHQIQVHFSELTSQL+AGLRLAPYEILYIKLWNAEGST+TAPTIVQ+SVLLEVGNTPSMRRLKQLAQTISGSNSSNLGLNN EF
Subjt:  LLQKVQILFNFTLNFSIHQIQVHFSELTSQLEAGLRLAPYEILYIKLWNAEGSTMTAPTIVQSSVLLEVGNTPSMRRLKQLAQTISGSNSSNLGLNNTEF

Query:  GKVS------------------------FSCPTPQPHNYHRPPTHHHHHHHTPLTPAISPAPATKKGALKYGSPAPE-SVASPKKSHEAKPPGCQYRYKR
        GKV                            PTPQPHN+H PPTHHHHHHHTPL  AISPAPAT+KGA +YGSPAPE S ASP++S+ A+PPGCQYRYKR
Subjt:  GKVS------------------------FSCPTPQPHNYHRPPTHHHHHHHTPLTPAISPAPATKKGALKYGSPAPE-SVASPKKSHEAKPPGCQYRYKR

Query:  KSGRKEGKQSHLTPLASPNISPVHSAASPSPQHQVKPPATPVSPTPALTPLPNVIYAHVQPPSKSDPNHPEKSTTNPSATPSPSPSPSGADHCHMITRWG
        KSGRKEGKQSHLTPLASPNISP HSAASPSPQHQ+ PPA PVSP PALTPLPNVIYAHVQPPSKSD N P        A PS +PSPSGAD CHMIT+WG
Subjt:  KSGRKEGKQSHLTPLASPNISPVHSAASPSPQHQVKPPATPVSPTPALTPLPNVIYAHVQPPSKSDPNHPEKSTTNPSATPSPSPSPSGADHCHMITRWG

Query:  FALFLILAFHM
        F LFLILA HM
Subjt:  FALFLILAFHM

XP_022925202.1 uncharacterized protein LOC111432513 isoform X3 [Cucurbita moschata]1.8e-21479.8Show/hide
Query:  MGKNDGEQPPPSAVGSRPSGQAADGRCCCGCLSIRRLIGFRCIFILLLSVALFVSAVFWLPPFLHYADQKDLGLNPSYRGHDIVATFDVERAISLLEDNI
        MGKNDGE PPPSAVGS PS     GRCC GC+SIRRLIGFRCIFILLLSVALFVSAVFWLPPFLHY+DQKDLGLNPSYRGHDIVATF VER +SLL+DNI
Subjt:  MGKNDGEQPPPSAVGSRPSGQAADGRCCCGCLSIRRLIGFRCIFILLLSVALFVSAVFWLPPFLHYADQKDLGLNPSYRGHDIVATFDVERAISLLEDNI

Query:  EQLRTDIYEEFLIPSIKVDILSLESLSGSNRTKVVFGVDPDADDSEIPSTSLSLIRSIFASIVTNQSFLRITKSTFGEAFSFEVLKFPGGITIIPPQSAF
        E+LRTDI+EEF IPSIKVDILSL SLSGSNRTKVVFG+DPD DD EIPST LSLIRS  AS+VTNQSFLRITKS FGEAFSFEVLKFPGGITIIPPQSAF
Subjt:  EQLRTDIYEEFLIPSIKVDILSLESLSGSNRTKVVFGVDPDADDSEIPSTSLSLIRSIFASIVTNQSFLRITKSTFGEAFSFEVLKFPGGITIIPPQSAF

Query:  LLQKVQILFNFTLNFSIHQIQVHFSELTSQLEAGLRLAPYEILYIKLWNAEGSTMTAPTIVQSSVLLEVGNTPSMRRLKQLAQTISGSNSSNLGLNNTEF
        LLQKVQILFNFTLNFSIHQIQVHFSELTSQL+AGLRLAPYEILYIKLWNAEGST+TAPTIVQSSVLLEVGNTPSM+RLKQLAQTIS SNSSNLGLNNTEF
Subjt:  LLQKVQILFNFTLNFSIHQIQVHFSELTSQLEAGLRLAPYEILYIKLWNAEGSTMTAPTIVQSSVLLEVGNTPSMRRLKQLAQTISGSNSSNLGLNNTEF

Query:  GKVS------------------------FSCPTPQPHNYHRPPTHHHHHHHTPLTPAISPAPATKKGALKYGSPAPESVASPKKSHEAKPPGCQYRYKRK
        GKV                            PTPQPHN+H PP+HHHHHHH PLTP ISPAPA + GA +YG  AP+S ASPK+S+EAKPPGCQ  YKRK
Subjt:  GKVS------------------------FSCPTPQPHNYHRPPTHHHHHHHTPLTPAISPAPATKKGALKYGSPAPESVASPKKSHEAKPPGCQYRYKRK

Query:  SGRKEGKQSHLTPLASPNISPVHSAASPSPQHQVKPPATPVSPTPALTPLPNVIYAHVQPPSKSDPNHPEKSTTNPSATPSPSPSPSGADHCHMITRWGF
        SGRKEGKQ HL+PLASP+ISPVHSAASPS QH        VSPT A TPLP+VIYAHVQPPSKSD NHPEKSTT+PS  PSPSPSPS A H  MITRWGF
Subjt:  SGRKEGKQSHLTPLASPNISPVHSAASPSPQHQVKPPATPVSPTPALTPLPNVIYAHVQPPSKSDPNHPEKSTTNPSATPSPSPSPSGADHCHMITRWGF

Query:  ALFLILAFHM
         L LI+AF+M
Subjt:  ALFLILAFHM

XP_038882638.1 uncharacterized protein LOC120073837 [Benincasa hispida]1.7e-22082.46Show/hide
Query:  MGKNDGEQPPPSAVGSRPSGQAADGRCCCGCLSIRRLIGFRCIFILLLSVALFVSAVFWLPPFLHYADQKDLGLNPSYRGHDIVATFDVERAISLLEDNI
        MGKNDGEQP PSA+ SRPSGQ ADGRCCCGC+SIRRLIGFRCIFILLLSVALFVSAVFWLPPFLHYADQKDLGLNPSYRGHDIVATF+VER +SLLEDNI
Subjt:  MGKNDGEQPPPSAVGSRPSGQAADGRCCCGCLSIRRLIGFRCIFILLLSVALFVSAVFWLPPFLHYADQKDLGLNPSYRGHDIVATFDVERAISLLEDNI

Query:  EQLRTDIYEEFLIPSIKVDILSLESLSGSNRTKVVFGVDPDADDSEIPSTSLSLIRSIFASIVTNQSFLRITKSTFGEAFSFEVLKFPGGITIIPPQSAF
        EQLRTDI+EEF IPSIKVDILSLESL GSNRTKVVF +DPD D+SEI ST LSLIRS   S+VTNQ FLRITKS FGEAFSFEVLKFPGGITIIPPQSAF
Subjt:  EQLRTDIYEEFLIPSIKVDILSLESLSGSNRTKVVFGVDPDADDSEIPSTSLSLIRSIFASIVTNQSFLRITKSTFGEAFSFEVLKFPGGITIIPPQSAF

Query:  LLQKVQILFNFTLNFSIHQIQVHFSELTSQLEAGLRLAPYEILYIKLWNAEGSTMTAPTIVQSSVLLEVGNTPSMRRLKQLAQTISGSNSSNLGLNNTEF
        LLQKVQILFNFTLNFSIHQIQVHFSELTSQLEAGLRLAPYEILY+KLWNAEGST+TAPTIVQSSVLLEVGNTPSMRRLKQLAQTISGSNSSNLGLNNTEF
Subjt:  LLQKVQILFNFTLNFSIHQIQVHFSELTSQLEAGLRLAPYEILYIKLWNAEGSTMTAPTIVQSSVLLEVGNTPSMRRLKQLAQTISGSNSSNLGLNNTEF

Query:  GKVS------------------------FSCPTPQPHNYHRPPTHHHHHHHTPLTPAISPAPATKKGALKYGSPAPE-SVASPKKSHEAKPPGCQYRYKR
        GKV                            P PQPHN   PPT HHHHHHT LTPAISPAPAT+KGA +YGSPAPE S ASPK+S+ AKPPGCQY  KR
Subjt:  GKVS------------------------FSCPTPQPHNYHRPPTHHHHHHHTPLTPAISPAPATKKGALKYGSPAPE-SVASPKKSHEAKPPGCQYRYKR

Query:  KSGRKEGKQSHLTPLASPNISPVHSAASPS--PQHQVKPPATPVSPTPALTPLPNVIYAHVQPPSKSDPNHPEKSTTNPSATPSPSPSPSGADHCHMITR
        KSGRKEGKQSHLTPLASPN+SP HSAASPS  PQH+V PPA P+ P PALTPLPNVIYAHVQPPSKS+ NHPEKSTTNPS   +PSPSPSGAD C MIT+
Subjt:  KSGRKEGKQSHLTPLASPNISPVHSAASPS--PQHQVKPPATPVSPTPALTPLPNVIYAHVQPPSKSDPNHPEKSTTNPSATPSPSPSPSGADHCHMITR

Query:  WGFALFLILAFHM
        WGF LFLILA HM
Subjt:  WGFALFLILAFHM

TrEMBL top hitse value%identityAlignment
A0A0A0KYS3 Uncharacterized protein5.0e-22682.39Show/hide
Query:  MGKNDGEQPPPSAVGSRPSGQAADGRCCCGCLSIRRLIGFRCIFILLLSVALFVSAVFWLPPFLHYADQKDLGLNPSYRGHDIVATFDVERAISLLEDNI
        MGKNDGEQP PSA+ SRPSG  ADGRCCCGC+SIRRLIGFRCIFILLLSVALFVSAVFWLPPFLHYADQKDL LNPSYRGHDIVATF+VER++SLLEDN 
Subjt:  MGKNDGEQPPPSAVGSRPSGQAADGRCCCGCLSIRRLIGFRCIFILLLSVALFVSAVFWLPPFLHYADQKDLGLNPSYRGHDIVATFDVERAISLLEDNI

Query:  EQLRTDIYEEFLIPSIKVDILSLESLSGSNRTKVVFGVDPDADDSEIPSTSLSLIRSIFASIVTNQSFLRITKSTFGEAFSFEVLKFPGGITIIPPQSAF
        +QLRTDI+EEF IPSIKV+ILSLE LSGSNRTKVVF +DPD DDSEI ST LSLIRSI  S+VTNQ FL ITKSTFGEA+SFEVLKFPGGITIIPPQSAF
Subjt:  EQLRTDIYEEFLIPSIKVDILSLESLSGSNRTKVVFGVDPDADDSEIPSTSLSLIRSIFASIVTNQSFLRITKSTFGEAFSFEVLKFPGGITIIPPQSAF

Query:  LLQKVQILFNFTLNFSIHQIQVHFSELTSQLEAGLRLAPYEILYIKLWNAEGSTMTAPTIVQSSVLLEVGNTPSMRRLKQLAQTISGSNSSNLGLNNTEF
        LLQKVQILFNFTLNFSIHQIQVHFSELTSQLEAGLRLAPYEILYIKLWNAEGST+T PTIVQ+SVLLEVGNTPSMRRLKQLAQTISGSNSSNLGLNNTEF
Subjt:  LLQKVQILFNFTLNFSIHQIQVHFSELTSQLEAGLRLAPYEILYIKLWNAEGSTMTAPTIVQSSVLLEVGNTPSMRRLKQLAQTISGSNSSNLGLNNTEF

Query:  GKVS------------------------FSCPTPQPHNYHRPPTHHHHHHHTPLTPAISPAPATKKGALKYGSPAPE-SVASPKKSHEAKPPGCQYRYKR
        GKV                            PTPQPHN H PPTHHHHHHHTPLTPAISPAPAT+KGA +YGSPAPE + ASPK+S+ AKPPGCQYRYKR
Subjt:  GKVS------------------------FSCPTPQPHNYHRPPTHHHHHHHTPLTPAISPAPATKKGALKYGSPAPE-SVASPKKSHEAKPPGCQYRYKR

Query:  KSGRKEGKQSHLTPLASPNISPVHSAASPSPQHQVKPPATPVSPTPALTPLPNVIYAHVQPPSKSDPNHPEKSTTNPSATPSPSPSPSGADHCHMITRWG
        KSGRKEGKQSHLTPLASPNISP HSAASPSPQHQ+ PPA PVSP PALTPLPNVIYAHVQPPSKSD NHP        A PS +PSPSGAD CHMIT+WG
Subjt:  KSGRKEGKQSHLTPLASPNISPVHSAASPSPQHQVKPPATPVSPTPALTPLPNVIYAHVQPPSKSDPNHPEKSTTNPSATPSPSPSPSGADHCHMITRWG

Query:  FALFLILAFHM
        F LFLILA HM
Subjt:  FALFLILAFHM

A0A1S3C173 uncharacterized protein LOC1034958522.2e-22181.02Show/hide
Query:  MGKNDGEQPPPSAVGSRPSGQAADGRCCCGCLSIRRLIGFRCIFILLLSVALFVSAVFWLPPFLHYADQKDLGLNPSYRGHDIVATFDVERAISLLEDNI
        MGKNDGEQP PSA+ SRPSG  ADGRCC GC+SIRRLIGFRCIFILLLSVALFVSAV WLPPF+HYADQKDLGLNPSYRGHDIVATF+VER++SLLEDN 
Subjt:  MGKNDGEQPPPSAVGSRPSGQAADGRCCCGCLSIRRLIGFRCIFILLLSVALFVSAVFWLPPFLHYADQKDLGLNPSYRGHDIVATFDVERAISLLEDNI

Query:  EQLRTDIYEEFLIPSIKVDILSLESLSGSNRTKVVFGVDPDADDSEIPSTSLSLIRSIFASIVTNQSFLRITKSTFGEAFSFEVLKFPGGITIIPPQSAF
        +QLRTDI+EEF IPSIKV+ILSLE LSGSNRTKVVF +DPD DDSEI ST LSLIRSI  S+VTNQ FL ITKSTFGEA+SFEVLKFPGGITIIPPQSAF
Subjt:  EQLRTDIYEEFLIPSIKVDILSLESLSGSNRTKVVFGVDPDADDSEIPSTSLSLIRSIFASIVTNQSFLRITKSTFGEAFSFEVLKFPGGITIIPPQSAF

Query:  LLQKVQILFNFTLNFSIHQIQVHFSELTSQLEAGLRLAPYEILYIKLWNAEGSTMTAPTIVQSSVLLEVGNTPSMRRLKQLAQTISGSNSSNLGLNNTEF
        LLQKVQILFNFTLNFSIHQIQVHFSELTSQL+AGLRLAPYEILYIKLWNAEGST+TAPTIVQ+SVLLEVGNTPSMRRLKQLAQTISGSNSSNLGLNN EF
Subjt:  LLQKVQILFNFTLNFSIHQIQVHFSELTSQLEAGLRLAPYEILYIKLWNAEGSTMTAPTIVQSSVLLEVGNTPSMRRLKQLAQTISGSNSSNLGLNNTEF

Query:  GKVS------------------------FSCPTPQPHNYHRPPTHHHHHHHTPLTPAISPAPATKKGALKYGSPAPE-SVASPKKSHEAKPPGCQYRYKR
        GKV                            PTPQPHN+H PPTHHHHHHHTPL  AISPAPAT+KGA +YGSPAPE S ASP++S+ A+PPGCQYRYKR
Subjt:  GKVS------------------------FSCPTPQPHNYHRPPTHHHHHHHTPLTPAISPAPATKKGALKYGSPAPE-SVASPKKSHEAKPPGCQYRYKR

Query:  KSGRKEGKQSHLTPLASPNISPVHSAASPSPQHQVKPPATPVSPTPALTPLPNVIYAHVQPPSKSDPNHPEKSTTNPSATPSPSPSPSGADHCHMITRWG
        KSGRKEGKQSHLTPLASPNISP HSAASPSPQHQ+ PPA PVSP PALTPLPNVIYAHVQPPSKSD N P        A PS +PSPSGAD CHMIT+WG
Subjt:  KSGRKEGKQSHLTPLASPNISPVHSAASPSPQHQVKPPATPVSPTPALTPLPNVIYAHVQPPSKSDPNHPEKSTTNPSATPSPSPSPSGADHCHMITRWG

Query:  FALFLILAFHM
        F LFLILA HM
Subjt:  FALFLILAFHM

A0A6J1EB56 uncharacterized protein LOC111432513 isoform X38.8e-21579.8Show/hide
Query:  MGKNDGEQPPPSAVGSRPSGQAADGRCCCGCLSIRRLIGFRCIFILLLSVALFVSAVFWLPPFLHYADQKDLGLNPSYRGHDIVATFDVERAISLLEDNI
        MGKNDGE PPPSAVGS PS     GRCC GC+SIRRLIGFRCIFILLLSVALFVSAVFWLPPFLHY+DQKDLGLNPSYRGHDIVATF VER +SLL+DNI
Subjt:  MGKNDGEQPPPSAVGSRPSGQAADGRCCCGCLSIRRLIGFRCIFILLLSVALFVSAVFWLPPFLHYADQKDLGLNPSYRGHDIVATFDVERAISLLEDNI

Query:  EQLRTDIYEEFLIPSIKVDILSLESLSGSNRTKVVFGVDPDADDSEIPSTSLSLIRSIFASIVTNQSFLRITKSTFGEAFSFEVLKFPGGITIIPPQSAF
        E+LRTDI+EEF IPSIKVDILSL SLSGSNRTKVVFG+DPD DD EIPST LSLIRS  AS+VTNQSFLRITKS FGEAFSFEVLKFPGGITIIPPQSAF
Subjt:  EQLRTDIYEEFLIPSIKVDILSLESLSGSNRTKVVFGVDPDADDSEIPSTSLSLIRSIFASIVTNQSFLRITKSTFGEAFSFEVLKFPGGITIIPPQSAF

Query:  LLQKVQILFNFTLNFSIHQIQVHFSELTSQLEAGLRLAPYEILYIKLWNAEGSTMTAPTIVQSSVLLEVGNTPSMRRLKQLAQTISGSNSSNLGLNNTEF
        LLQKVQILFNFTLNFSIHQIQVHFSELTSQL+AGLRLAPYEILYIKLWNAEGST+TAPTIVQSSVLLEVGNTPSM+RLKQLAQTIS SNSSNLGLNNTEF
Subjt:  LLQKVQILFNFTLNFSIHQIQVHFSELTSQLEAGLRLAPYEILYIKLWNAEGSTMTAPTIVQSSVLLEVGNTPSMRRLKQLAQTISGSNSSNLGLNNTEF

Query:  GKVS------------------------FSCPTPQPHNYHRPPTHHHHHHHTPLTPAISPAPATKKGALKYGSPAPESVASPKKSHEAKPPGCQYRYKRK
        GKV                            PTPQPHN+H PP+HHHHHHH PLTP ISPAPA + GA +YG  AP+S ASPK+S+EAKPPGCQ  YKRK
Subjt:  GKVS------------------------FSCPTPQPHNYHRPPTHHHHHHHTPLTPAISPAPATKKGALKYGSPAPESVASPKKSHEAKPPGCQYRYKRK

Query:  SGRKEGKQSHLTPLASPNISPVHSAASPSPQHQVKPPATPVSPTPALTPLPNVIYAHVQPPSKSDPNHPEKSTTNPSATPSPSPSPSGADHCHMITRWGF
        SGRKEGKQ HL+PLASP+ISPVHSAASPS QH        VSPT A TPLP+VIYAHVQPPSKSD NHPEKSTT+PS  PSPSPSPS A H  MITRWGF
Subjt:  SGRKEGKQSHLTPLASPNISPVHSAASPSPQHQVKPPATPVSPTPALTPLPNVIYAHVQPPSKSDPNHPEKSTTNPSATPSPSPSPSGADHCHMITRWGF

Query:  ALFLILAFHM
         L LI+AF+M
Subjt:  ALFLILAFHM

A0A6J1EEJ8 uncharacterized protein LOC111432513 isoform X43.1e-21279.41Show/hide
Query:  MGKNDGEQPPPSAVGSRPSGQAADGRCCCGCLSIRRLIGFRCIFILLLSVALFVSAVFWLPPFLHYADQKDLGLNPSYRGHDIVATFDVERAISLLEDNI
        MGKNDGE PPPSAVGS PS     GRCC GC+SIRRLIGFRCIFILLLSVALFVSAVFWLPPFLHY+DQKDLGLNPSYRGHDIVATF VER +SLL+DNI
Subjt:  MGKNDGEQPPPSAVGSRPSGQAADGRCCCGCLSIRRLIGFRCIFILLLSVALFVSAVFWLPPFLHYADQKDLGLNPSYRGHDIVATFDVERAISLLEDNI

Query:  EQLRTDIYEEFLIPSIKVDILSLESLSGSNRTKVVFGVDPDADDSEIPSTSLSLIRSIFASIVTNQSFLRITKSTFGEAFSFEVLKFPGGITIIPPQSAF
        E+LRTDI+EEF IPSIKVDILSL SLSGSNRTKVVFG+DPD DD EIPST LSLIRS  AS+VTNQSFLRITKS FGEAFSFEVLKFPGGITIIPPQSAF
Subjt:  EQLRTDIYEEFLIPSIKVDILSLESLSGSNRTKVVFGVDPDADDSEIPSTSLSLIRSIFASIVTNQSFLRITKSTFGEAFSFEVLKFPGGITIIPPQSAF

Query:  LLQKVQILFNFTLNFSIHQIQVHFSELTSQLEAGLRLAPYEILYIKLWNAEGSTMTAPTIVQSSVLLEVGNTPSMRRLKQLAQTISGSNSSNLGLNNTEF
        LLQKVQILFNFTLNFSIHQIQVHFSELTSQL+AGLRLAPYEILYIKLWNAEGST+TAPTIVQSSVLLEVGNTPSM+RLKQLAQTIS SNSSNLGLNNTEF
Subjt:  LLQKVQILFNFTLNFSIHQIQVHFSELTSQLEAGLRLAPYEILYIKLWNAEGSTMTAPTIVQSSVLLEVGNTPSMRRLKQLAQTISGSNSSNLGLNNTEF

Query:  GKVS------------------------FSCPTPQPHNYHRPPTHHHHHHHTPLTPAISPAPATKKGALKYGSPAPESVASPKKSHEAKPPGCQYRYKRK
        GKV                            PTPQPHN+H PP+HHHHHHH PLTP ISPAPA + GA +YG  AP+S ASPK+S+EAKPPGCQ  YKRK
Subjt:  GKVS------------------------FSCPTPQPHNYHRPPTHHHHHHHTPLTPAISPAPATKKGALKYGSPAPESVASPKKSHEAKPPGCQYRYKRK

Query:  SGRKEGKQSHLTPLASPNISPVHSAASPSPQHQVKPPATPVSPTPALTPLPNVIYAHVQPPSKSDPNHPEKSTTNPSATPSPSPSPSGADHCHMITRWGF
        SGRKEGKQ HL+PLASP+ISPVHSAASPS QH        VSPT A TPLP+VIYAHVQPPSKSD NHPEKSTT+PS    PSPSPS A H  MITRWGF
Subjt:  SGRKEGKQSHLTPLASPNISPVHSAASPSPQHQVKPPATPVSPTPALTPLPNVIYAHVQPPSKSDPNHPEKSTTNPSATPSPSPSPSGADHCHMITRWGF

Query:  ALFLILAFHM
         L LI+AF+M
Subjt:  ALFLILAFHM

A0A6J1EH92 uncharacterized protein LOC111432513 isoform X13.7e-21379.18Show/hide
Query:  MGKNDGEQPPPSAVGSRPSGQAADGRCCCGCLSIRRLIGFRCIFILLLSVALFVSAVFWLPPFLHYADQKDLGLNPSYRGHDIVATFDVERAISLLEDNI
        MGKNDGE PPPSAVGS PS     GRCC GC+SIRRLIGFRCIFILLLSVALFVSAVFWLPPFLHY+DQKDLGLNPSYRGHDIVATF VER +SLL+DNI
Subjt:  MGKNDGEQPPPSAVGSRPSGQAADGRCCCGCLSIRRLIGFRCIFILLLSVALFVSAVFWLPPFLHYADQKDLGLNPSYRGHDIVATFDVERAISLLEDNI

Query:  EQLRTDIYEEFLIPSIKVDILSLESLSGSNRTKVVFGVDPDADDSEIPSTSLSLIRSIFASIVTNQSFLRITKSTFGEAFSFEVLKFPGGITIIPPQSAF
        E+LRTDI+EEF IPSIKVDILSL SLSGSNRTKVVFG+DPD DD EIPST LSLIRS  AS+VTNQSFLRITKS FGEAFSFEVLKFPGGITIIPPQSAF
Subjt:  EQLRTDIYEEFLIPSIKVDILSLESLSGSNRTKVVFGVDPDADDSEIPSTSLSLIRSIFASIVTNQSFLRITKSTFGEAFSFEVLKFPGGITIIPPQSAF

Query:  LLQKVQILFNFTLNFSIHQIQVHFSELTSQLEAGLRLAPYEILYIKLWNAEGSTMTAPTIVQSSVLLEVGNTPSMRRLKQLAQTISGSNSSNLGLNNTEF
        LLQKVQILFNFTLNFSIHQIQVHFSELTSQL+AGLRLAPYEILYIKLWNAEGST+TAPTIVQSSVLLEVGNTPSM+RLKQLAQTIS SNSSNLGLNNTEF
Subjt:  LLQKVQILFNFTLNFSIHQIQVHFSELTSQLEAGLRLAPYEILYIKLWNAEGSTMTAPTIVQSSVLLEVGNTPSMRRLKQLAQTISGSNSSNLGLNNTEF

Query:  GKVS------------------------FSCPTPQPHNYHRPPTHHHHHHHTPLTPAISPAPATKKGALKYGSPAPESVASPKKSHEAKPPGCQYRYKRK
        GKV                            PTPQPHN+H PP+HHHHHHH PLTP ISPAPA + GA +YG  AP+S ASPK+S+EAKPPGCQ  YKRK
Subjt:  GKVS------------------------FSCPTPQPHNYHRPPTHHHHHHHTPLTPAISPAPATKKGALKYGSPAPESVASPKKSHEAKPPGCQYRYKRK

Query:  SGRKEGKQSHLTPLASPNISPVHSAASPSPQHQVKPPATPVSPTPALTPLPNVIYAHVQPPSKSDPNHPEKSTTN----PSATPSPSPSPSGADHCHMIT
        SGRKEGKQ HL+PLASP+ISPVHSAASPS QH        VSPT A TPLP+VIYAHVQPPSKSD NHPEKSTT+    PS +PSPSPSPS A H  MIT
Subjt:  SGRKEGKQSHLTPLASPNISPVHSAASPSPQHQVKPPATPVSPTPALTPLPNVIYAHVQPPSKSDPNHPEKSTTN----PSATPSPSPSPSGADHCHMIT

Query:  RWGFALFLILAFHM
        RWGF L LI+AF+M
Subjt:  RWGFALFLILAFHM

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G10790.1 BEST Arabidopsis thaliana protein match is: hydroxyproline-rich glycoprotein family protein (TAIR:AT3G56590.2)2.3e-3436.33Show/hide
Query:  SGQAADGRCCCGCLSIRRLIGFRCIFILLLSVALFVSAVFWLPPFLHYADQKDLG---LNPSYRGHDIVATFDVERAISLLEDNIEQLRTDIYEEF-LIP
        S   + GR C    S  RL+G RC+ +L+LS A+ +SA+FWL P    ++ K  G   LN S     + A+F +++ +S +  +  ++  DI     L  
Subjt:  SGQAADGRCCCGCLSIRRLIGFRCIFILLLSVALFVSAVFWLPPFLHYADQKDLG---LNPSYRGHDIVATFDVERAISLLEDNIEQLRTDIYEEF-LIP

Query:  SIKVDILSLESLSGSNRTKVVFGVDPDADDSEIPSTSLSLIRSIFASIVTNQSFLRITKSTFGEAFSFEVLKFPGGITIIPPQSAFLLQKVQILFNFTLN
        + KV +LSL     SN T V F V P   D EI   SLSL+RS F  +   +S L++T S FG+  SF+VLKFPGGIT+ P + A +     +LF+ T+ 
Subjt:  SIKVDILSLESLSGSNRTKVVFGVDPDADDSEIPSTSLSLIRSIFASIVTNQSFLRITKSTFGEAFSFEVLKFPGGITIIPPQSAFLLQKVQILFNFTLN

Query:  FSIHQIQVHFSELTSQLEAGLRLAPYEILYIKLWNAEGSTMTAPTIVQSSVLLEVGNTPSMRRLKQLAQTISGSNSSNLGLNNTEFGKV
         SI  +Q     L    E  L L PYE ++ +L N +GST++ P   Q  V   +      +RL    Q I  S + NLGL+   FG+V
Subjt:  FSIHQIQVHFSELTSQLEAGLRLAPYEILYIKLWNAEGSTMTAPTIVQSSVLLEVGNTPSMRRLKQLAQTISGSNSSNLGLNNTEFGKV

AT3G10810.1 zinc finger (C3HC4-type RING finger) family protein1.3e-9044.02Show/hide
Query:  MGKNDGEQPPPSAVGSRPSGQAADGRCCCGCLS-IRRLIGFRCIFILLLSVALFVSAVFWLPPFLHYADQKDLGLNPSYRGHDIVATFDVERAISLLEDN
        MGK + +       G   +G +      CGC   I   +GF+C+F+LLLSVALF+SA+F L PF    D++D  L+P +RGH IVA+F + R+ S L +N
Subjt:  MGKNDGEQPPPSAVGSRPSGQAADGRCCCGCLS-IRRLIGFRCIFILLLSVALFVSAVFWLPPFLHYADQKDLGLNPSYRGHDIVATFDVERAISLLEDN

Query:  IEQLRTDIYEEFLIPSIKVDILSLESLSGSNRTKVVFGVDPDADDSEIPSTSLSLIRSIFASIVTNQSFLRITKSTFGEAFSFEVLKFPGGITIIPPQSA
          QL+ DI++E    SIKV IL++E     N TKVVFG+DPD    EI   SLS I+ +F S++ NQS L++TKS FGE F FEVLKFPGGIT+IPPQSA
Subjt:  IEQLRTDIYEEFLIPSIKVDILSLESLSGSNRTKVVFGVDPDADDSEIPSTSLSLIRSIFASIVTNQSFLRITKSTFGEAFSFEVLKFPGGITIIPPQSA

Query:  FLLQKVQILFNFTLNFSIHQIQVHFSELTSQLEAGLRLAPYEILYIKLWNAEGSTMTAPTIVQSSVLLEVGNTPSMRRLKQLAQTISGSNSSNLGLNNTE
        F LQK +I+FNFTLN+SIHQIQ++F+ L SQL+ GL LAPYE LY+ L N+EGST++ PT V SSVLL VG + S  RLKQL  TI+GS S NLGLNNT 
Subjt:  FLLQKVQILFNFTLNFSIHQIQVHFSELTSQLEAGLRLAPYEILYIKLWNAEGSTMTAPTIVQSSVLLEVGNTPSMRRLKQLAQTISGSNSSNLGLNNTE

Query:  FGKV---------------SFSCPTPQP--------------HNYHRPPTHHHHHHHTPLTPAISPAPATKKGALKYGSPAPESVASPKKSHEAKP---P
        FGKV               S   P+P P              H++H    +HHHHHH  L+P ++P  +         SPAP    S K++  A P   P
Subjt:  FGKV---------------SFSCPTPQP--------------HNYHRPPTHHHHHHHTPLTPAISPAPATKKGALKYGSPAPESVASPKKSHEAKP---P

Query:  GCQYRYKRKSGRKEGKQSHLTPLASPNI-SPVHSAASPSPQHQVKPPATPVSPTPALTPLPNVIYAHVQPPSKSDPNHPEKSTTNPSATPSPSPSPSGAD
        G +  +K K       Q   TP  +P+  +P H   SP+P    K    P+S      PLP+V++AH   P  ++P  P     N  A P P  S S  +
Subjt:  GCQYRYKRKSGRKEGKQSHLTPLASPNI-SPVHSAASPSPQHQVKPPATPVSPTPALTPLPNVIYAHVQPPSKSDPNHPEKSTTNPSATPSPSPSPSGAD

Query:  HCHMITRWGFALFLILAF
            +  W   L LI+A+
Subjt:  HCHMITRWGFALFLILAF

AT3G56590.1 hydroxyproline-rich glycoprotein family protein3.4e-9445.98Show/hide
Query:  MGKNDGEQP----PPSAVGSRPSGQAADGRCCCGCLSIRRLIGFRCIFILLLSVALFVSAVFWLPPFLHYADQKDLGLNPSYRGHDIVATFDVERAISLL
        MGKN  E+        A  +R +G      CCC C  I      RC+ IL  S A+F+SA+FWLPPFL +AD  DL L+P ++ H IVA+FDV + IS +
Subjt:  MGKNDGEQP----PPSAVGSRPSGQAADGRCCCGCLSIRRLIGFRCIFILLLSVALFVSAVFWLPPFLHYADQKDLGLNPSYRGHDIVATFDVERAISLL

Query:  EDNIEQLRTDIYEEFLIPSIKVDILSLESLSGSNRTKVVFGVDPDADDSEIPSTSLSLIRSIFASIVTNQSFLRITKSTFGEAFSFEVLKFPGGITIIPP
        EDN+ QL  DI +E   P  KV +L+LE L   NRT V+F +DP+ ++S+IP+   SLI++ F ++V  Q   R+T+S FGE F FEVLKFPGGIT+IPP
Subjt:  EDNIEQLRTDIYEEFLIPSIKVDILSLESLSGSNRTKVVFGVDPDADDSEIPSTSLSLIRSIFASIVTNQSFLRITKSTFGEAFSFEVLKFPGGITIIPP

Query:  QSAFLLQKVQILFNFTLNFSIHQIQVHFSELTSQLEAGLRLAPYEILYIKLWNAEGSTMTAPTIVQSSVLLEVGNTPSMRRLKQLAQTISGSNSSNLGLN
        Q  F LQK Q+LFNFTLNFSI+QIQ +F EL SQL+ G+ LA YE LYI L N+ GST+  PTIV SSVLL  G   S  RLKQLAQTI+ S+S NLGLN
Subjt:  QSAFLLQKVQILFNFTLNFSIHQIQVHFSELTSQLEAGLRLAPYEILYIKLWNAEGSTMTAPTIVQSSVLLEVGNTPSMRRLKQLAQTISGSNSSNLGLN

Query:  NTEFGKV----------------SFSCPTPQPHNYHRPPTH-HHHHHHTPLTPAISPAPATKKGALKYGSPAPESVASPKKSHEAKPPGCQYRYKRKSGR
        +T FGKV                S   P+PQP  +  P  H HHHHHH  L P  S +P T KG     +P   S   P+       P C Y  +R  G 
Subjt:  NTEFGKV----------------SFSCPTPQPHNYHRPPTH-HHHHHHTPLTPAISPAPATKKGALKYGSPAPESVASPKKSHEAKPPGCQYRYKRKSGR

Query:  KEGKQSHLTPLASPNISPVH-SAASPSPQHQVKPPATPVSPTPALTPLPNVIYAHVQPPSKSDPNHPEKSTTNPSATPSPSPSPS
                 P  +P+ S  H  A +P+P    +  A PVS     +PLP+V++AH+ PPSKS P        +PS  P+P  S S
Subjt:  KEGKQSHLTPLASPNISPVH-SAASPSPQHQVKPPATPVSPTPALTPLPNVIYAHVQPPSKSDPNHPEKSTTNPSATPSPSPSPS

AT3G56590.2 hydroxyproline-rich glycoprotein family protein6.9e-9546.2Show/hide
Query:  MGKNDGEQP----PPSAVGSRPSGQAADGRCCCGCLSIRRLIGFRCIFILLLSVALFVSAVFWLPPFLHYADQKDLGLNPSYRGHDIVATFDVERAISLL
        MGKN  E+        A  +R +G      CCC C  I      RC+ IL  S A+F+SA+FWLPPFL +AD  DL L+P ++ H IVA+FDV + IS +
Subjt:  MGKNDGEQP----PPSAVGSRPSGQAADGRCCCGCLSIRRLIGFRCIFILLLSVALFVSAVFWLPPFLHYADQKDLGLNPSYRGHDIVATFDVERAISLL

Query:  EDNIEQLRTDIYEEFLIPSIKVDILSLESLSGSNRTKVVFGVDPDADDSEIPSTSLSLIRSIFASIVTNQSFLRITKSTFGEAFSFEVLKFPGGITIIPP
        EDN+ QL  DI +E   P  KV +L+LE L   NRT V+F +DP+ ++S+IP+   SLI++ F ++V  Q   R+T+S FGE F FEVLKFPGGIT+IPP
Subjt:  EDNIEQLRTDIYEEFLIPSIKVDILSLESLSGSNRTKVVFGVDPDADDSEIPSTSLSLIRSIFASIVTNQSFLRITKSTFGEAFSFEVLKFPGGITIIPP

Query:  QSAFLLQKVQILFNFTLNFSIHQIQVHFSELTSQLEAGLRLAPYEILYIKLWNAEGSTMTAPTIVQSSVLLEVGNTPSMRRLKQLAQTISGSNSSNLGLN
        Q  F LQK Q+LFNFTLNFSI+QIQ +F EL SQL+ G+ LA YE LYI L N+ GST+  PTIV SSVLL  G   S  RLKQLAQTI+ S+S NLGLN
Subjt:  QSAFLLQKVQILFNFTLNFSIHQIQVHFSELTSQLEAGLRLAPYEILYIKLWNAEGSTMTAPTIVQSSVLLEVGNTPSMRRLKQLAQTISGSNSSNLGLN

Query:  NTEFGKV----------------SFSCPTPQPHNYHRPPTH-HHHHHHTPLTPAISPAPATKKGALKYGSPAPESVASPKKSHEAKPPGCQYRYKRKSGR
        +T FGKV                S   P+PQP  +  P  H HHHHHH  L P  S +P T KG     +P   S   P+       P C Y  +R  G 
Subjt:  NTEFGKV----------------SFSCPTPQPHNYHRPPTH-HHHHHHTPLTPAISPAPATKKGALKYGSPAPESVASPKKSHEAKPPGCQYRYKRKSGR

Query:  KEGKQSHLTPLASPNISPVH-SAASPSPQHQVKPPATPVSPTPALTPLPNVIYAHVQPPSKSDPNHPEKSTTNPSATPSPSPSPSGA
                 P  +P+ S  H  A +P+P    +  A PVS     +PLP+V++AH+ PPSKS P    +S      +PSP+P+PS A
Subjt:  KEGKQSHLTPLASPNISPVH-SAASPSPQHQVKPPATPVSPTPALTPLPNVIYAHVQPPSKSDPNHPEKSTTNPSATPSPSPSPSGA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGGAAAAACGACGGAGAACAGCCACCGCCGTCCGCCGTCGGCTCGAGGCCGTCAGGCCAGGCTGCCGATGGCCGATGCTGTTGTGGGTGTCTTTCGATTCGAAGGCT
CATTGGCTTCAGATGCATCTTCATTCTGCTATTGTCCGTTGCCTTGTTCGTTTCTGCTGTTTTTTGGCTGCCCCCTTTTCTCCATTATGCAGATCAAAAGGATCTGGGTC
TTAATCCCTCGTATCGAGGTCATGATATAGTAGCAACATTCGATGTTGAGAGAGCGATTTCTTTGCTGGAAGACAATATCGAGCAACTCCGGACCGACATTTATGAAGAG
TTCCTTATACCTTCTATCAAAGTGGATATACTATCTCTAGAATCGTTATCAGGATCCAACCGAACAAAAGTTGTGTTCGGTGTCGATCCAGATGCTGATGATTCAGAAAT
CCCATCAACTTCTCTAAGTTTAATAAGGTCGATCTTTGCAAGTATAGTAACAAATCAGTCATTCCTCCGTATTACTAAATCCACGTTTGGGGAGGCCTTTTCGTTTGAAG
TACTGAAATTCCCCGGAGGAATAACGATAATCCCGCCACAGAGTGCATTTCTTTTGCAGAAAGTGCAAATTCTTTTCAACTTTACATTAAACTTCTCCATTCATCAGATT
CAAGTACATTTCAGTGAACTGACCAGCCAACTGGAGGCGGGATTACGACTAGCTCCATATGAGATTTTATATATTAAACTATGGAATGCGGAAGGTTCGACCATGACTGC
CCCTACGATTGTCCAGTCATCTGTTCTTCTGGAAGTCGGAAATACTCCATCAATGCGACGGCTGAAGCAGCTAGCTCAGACAATCTCAGGTTCTAATTCTAGCAACCTCG
GCCTGAATAATACTGAGTTTGGAAAAGTCTCCTTCTCCTGCCCGACACCGCAGCCCCATAACTACCATCGCCCCCCAACTCACCACCATCACCACCATCACACCCCTCTA
ACACCTGCAATTTCACCTGCCCCTGCTACCAAGAAGGGTGCACTGAAATATGGTTCGCCTGCCCCCGAAAGTGTGGCATCACCTAAGAAAAGTCATGAAGCAAAGCCGCC
CGGTTGTCAATATAGATACAAGAGGAAGTCTGGTAGGAAAGAGGGAAAGCAATCTCATTTAACCCCGCTTGCTTCACCCAATATATCTCCTGTTCATTCTGCTGCATCAC
CATCGCCACAACATCAAGTTAAACCACCAGCAACACCCGTCTCTCCAACTCCGGCATTAACTCCGTTGCCAAACGTCATTTATGCTCATGTTCAACCACCTTCGAAAAGC
GACCCCAATCACCCCGAAAAATCCACGACGAATCCATCAGCCACGCCGTCGCCATCTCCATCTCCATCTGGCGCAGATCATTGCCATATGATCACTCGATGGGGATTCGC
ACTGTTTCTAATTCTCGCATTCCACATGTAA
mRNA sequenceShow/hide mRNA sequence
ATGGGGAAAAACGACGGAGAACAGCCACCGCCGTCCGCCGTCGGCTCGAGGCCGTCAGGCCAGGCTGCCGATGGCCGATGCTGTTGTGGGTGTCTTTCGATTCGAAGGCT
CATTGGCTTCAGATGCATCTTCATTCTGCTATTGTCCGTTGCCTTGTTCGTTTCTGCTGTTTTTTGGCTGCCCCCTTTTCTCCATTATGCAGATCAAAAGGATCTGGGTC
TTAATCCCTCGTATCGAGGTCATGATATAGTAGCAACATTCGATGTTGAGAGAGCGATTTCTTTGCTGGAAGACAATATCGAGCAACTCCGGACCGACATTTATGAAGAG
TTCCTTATACCTTCTATCAAAGTGGATATACTATCTCTAGAATCGTTATCAGGATCCAACCGAACAAAAGTTGTGTTCGGTGTCGATCCAGATGCTGATGATTCAGAAAT
CCCATCAACTTCTCTAAGTTTAATAAGGTCGATCTTTGCAAGTATAGTAACAAATCAGTCATTCCTCCGTATTACTAAATCCACGTTTGGGGAGGCCTTTTCGTTTGAAG
TACTGAAATTCCCCGGAGGAATAACGATAATCCCGCCACAGAGTGCATTTCTTTTGCAGAAAGTGCAAATTCTTTTCAACTTTACATTAAACTTCTCCATTCATCAGATT
CAAGTACATTTCAGTGAACTGACCAGCCAACTGGAGGCGGGATTACGACTAGCTCCATATGAGATTTTATATATTAAACTATGGAATGCGGAAGGTTCGACCATGACTGC
CCCTACGATTGTCCAGTCATCTGTTCTTCTGGAAGTCGGAAATACTCCATCAATGCGACGGCTGAAGCAGCTAGCTCAGACAATCTCAGGTTCTAATTCTAGCAACCTCG
GCCTGAATAATACTGAGTTTGGAAAAGTCTCCTTCTCCTGCCCGACACCGCAGCCCCATAACTACCATCGCCCCCCAACTCACCACCATCACCACCATCACACCCCTCTA
ACACCTGCAATTTCACCTGCCCCTGCTACCAAGAAGGGTGCACTGAAATATGGTTCGCCTGCCCCCGAAAGTGTGGCATCACCTAAGAAAAGTCATGAAGCAAAGCCGCC
CGGTTGTCAATATAGATACAAGAGGAAGTCTGGTAGGAAAGAGGGAAAGCAATCTCATTTAACCCCGCTTGCTTCACCCAATATATCTCCTGTTCATTCTGCTGCATCAC
CATCGCCACAACATCAAGTTAAACCACCAGCAACACCCGTCTCTCCAACTCCGGCATTAACTCCGTTGCCAAACGTCATTTATGCTCATGTTCAACCACCTTCGAAAAGC
GACCCCAATCACCCCGAAAAATCCACGACGAATCCATCAGCCACGCCGTCGCCATCTCCATCTCCATCTGGCGCAGATCATTGCCATATGATCACTCGATGGGGATTCGC
ACTGTTTCTAATTCTCGCATTCCACATGTAA
Protein sequenceShow/hide protein sequence
MGKNDGEQPPPSAVGSRPSGQAADGRCCCGCLSIRRLIGFRCIFILLLSVALFVSAVFWLPPFLHYADQKDLGLNPSYRGHDIVATFDVERAISLLEDNIEQLRTDIYEE
FLIPSIKVDILSLESLSGSNRTKVVFGVDPDADDSEIPSTSLSLIRSIFASIVTNQSFLRITKSTFGEAFSFEVLKFPGGITIIPPQSAFLLQKVQILFNFTLNFSIHQI
QVHFSELTSQLEAGLRLAPYEILYIKLWNAEGSTMTAPTIVQSSVLLEVGNTPSMRRLKQLAQTISGSNSSNLGLNNTEFGKVSFSCPTPQPHNYHRPPTHHHHHHHTPL
TPAISPAPATKKGALKYGSPAPESVASPKKSHEAKPPGCQYRYKRKSGRKEGKQSHLTPLASPNISPVHSAASPSPQHQVKPPATPVSPTPALTPLPNVIYAHVQPPSKS
DPNHPEKSTTNPSATPSPSPSPSGADHCHMITRWGFALFLILAFHM