; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0007648 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0007648
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Descriptionprotein FAR1-RELATED SEQUENCE 4-like
Genome locationchr9:2365558..2367423
RNA-Seq ExpressionLag0007648
SyntenyLag0007648
Gene Ontology termsGO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR006564 - Zinc finger, PMZ-type
IPR007527 - Zinc finger, SWIM-type
IPR018289 - MULE transposase domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022131652.1 protein FAR1-RELATED SEQUENCE 4-like [Momordica charantia]5.1e-5141.22Show/hide
Query:  IDDNNQIYPVGYGIGSGETDELWTYFFRHMGCAIGQLYPLVIVSDRHKSICKVVRTVFPNAAHCYCMKHLEENPKTKFKDANFIPLYLKATKAFRETEFQ
        +D NNQIYPV + +   E+   W +F   +   +  +  LV +SDRH +ICK +  VFP A HC+C+ HL+ N   KFK +    L+ KA KAFRE+ F 
Subjt:  IDDNNQIYPVGYGIGSGETDELWTYFFRHMGCAIGQLYPLVIVSDRHKSICKVVRTVFPNAAHCYCMKHLEENPKTKFKDANFIPLYLKATKAFRETEFQ

Query:  KYWGKIPI--SAQTYLEEVRLERWSRVFQCGMRYNQMTTNIAEC---------------------GWLQANYYERRTNAVNWDASISKYGEEIVTEAEKY
        + W ++      + YLE +  ERW+R FQ  +RY+QMTTNIAE                      G LQ  +YE RT A +  +++S Y EE++ EA   
Subjt:  KYWGKIPI--SAQTYLEEVRLERWSRVFQCGMRYNQMTTNIAEC---------------------GWLQANYYERRTNAVNWDASISKYGEEIVTEAEKY

Query:  ARRHVVRPIDRYEFQVDDGHLGGRVNIHTRMCTCREFDCFELPCSHAIATCIYRNVSYLDLC
        ARRH+V  ID++ F+V DG+L G V++ ++ CTCREFD F++PCSHAIA    R+++   LC
Subjt:  ARRHVVRPIDRYEFQVDDGHLGGRVNIHTRMCTCREFDCFELPCSHAIATCIYRNVSYLDLC

XP_022147768.1 uncharacterized protein LOC111016623 [Momordica charantia]6.9e-5642.97Show/hide
Query:  IDDNNQIYPVGYGIGSGETDELWTYFFRHMGCAIGQLYPLVIVSDRHKSICKVVRTVFPNAAHCYCMKHLEENPKTKFKDANFIPLYLKATKAFRETEFQ
        +D NNQIYPV +GI   E+D  WT+F   +  AIG++  LV VSDRH++I   V  VFPNA H  CM H++ N   KFK+     LYL+A +AF+++ F+
Subjt:  IDDNNQIYPVGYGIGSGETDELWTYFFRHMGCAIGQLYPLVIVSDRHKSICKVVRTVFPNAAHCYCMKHLEENPKTKFKDANFIPLYLKATKAFRETEFQ

Query:  KYWGKIP--ISAQTYLEEVRLERWSRVFQCGMRYNQMTTNIAEC---------------------GWLQANYYERRTNAVNWDASISKYGEEIVTEAEKY
         YW ++      Q YL+EV  ++WSR +Q  +RYNQMTTNIAE                        LQ  +YERRT+A    + +++Y E+I+ ++ + 
Subjt:  KYWGKIP--ISAQTYLEEVRLERWSRVFQCGMRYNQMTTNIAEC---------------------GWLQANYYERRTNAVNWDASISKYGEEIVTEAEKY

Query:  ARRHVVRPIDRYEFQVDDGHLGGRVNIHTRMCTCREFDCFELPCSHAIATCIYRNVSYLDLCS
        ARRH +RPID YEF+V DG    RVN++++ C+C++FD +++PCSHAIA  + +NV+   LCS
Subjt:  ARRHVVRPIDRYEFQVDDGHLGGRVNIHTRMCTCREFDCFELPCSHAIATCIYRNVSYLDLCS

XP_022154803.1 uncharacterized protein LOC111021969 [Momordica charantia]6.0e-5241.06Show/hide
Query:  IDDNNQIYPVGYGIGSGETDELWTYFFRHMGCAIGQLYPLVIVSDRHKSICKVVRTVFPNAAHCYCMKHLEENPKTKFKDANFIPLYLKATKAFRETEFQ
        +D NNQIYP+ +G+   E+DE WT+F   +   IG +  LV VSDRH++I   V T+F +AAH  CM H+      KF++     ++ KA KAF+ ++F+
Subjt:  IDDNNQIYPVGYGIGSGETDELWTYFFRHMGCAIGQLYPLVIVSDRHKSICKVVRTVFPNAAHCYCMKHLEENPKTKFKDANFIPLYLKATKAFRETEFQ

Query:  KYWGKIP--ISAQTYLEEVRLERWSRVFQCGMRYNQMTTNIAEC---------------------GWLQANYYERRTNAVNWDASISKYGEEIVTEAEKY
         YWG++        YLE++ L++W+R +Q GMRYNQMT+N+AE                        LQ  +Y+RRT   +    +++Y E I+ E  + 
Subjt:  KYWGKIP--ISAQTYLEEVRLERWSRVFQCGMRYNQMTTNIAEC---------------------GWLQANYYERRTNAVNWDASISKYGEEIVTEAEKY

Query:  ARRHVVRPIDRYEFQVDDGHLGGRVNIHTRMCTCREFDCFELPCSHAIATCIYRNVSYLDLCS
        AR H VRPIDR+EF+V DG    RVNI+++ CTC++F  +E+PCSHAIA  + RN+S   LCS
Subjt:  ARRHVVRPIDRYEFQVDDGHLGGRVNIHTRMCTCREFDCFELPCSHAIATCIYRNVSYLDLCS

XP_022157237.1 protein FAR-RED ELONGATED HYPOCOTYL 3-like [Momordica charantia]4.6e-5241.44Show/hide
Query:  IDDNNQIYPVGYGIGSGETDELWTYFFRHMGCAIGQLYPLVIVSDRHKSICKVVRTVFPNAAHCYCMKHLEENPKTKFKDANFIPLYLKATKAFRETEFQ
        ID NNQIY + +G+     D+ WT+F   +   IG +  LV +SDRH+SI   V TVFP+AAH  CM HL      KF++     ++ KA KAF+ ++F+
Subjt:  IDDNNQIYPVGYGIGSGETDELWTYFFRHMGCAIGQLYPLVIVSDRHKSICKVVRTVFPNAAHCYCMKHLEENPKTKFKDANFIPLYLKATKAFRETEFQ

Query:  KYWGKIP--ISAQTYLEEVRLERWSRVFQCGMRYNQMTTNIAEC---------------------GWLQANYYERRTNAVNWDASISKYGEEIVTEAEKY
         YWG++      Q YLE++  ++W+R +Q GMRYNQMT+N+AE                        LQ  +YERRT A +    +++Y E I+ +  + 
Subjt:  KYWGKIP--ISAQTYLEEVRLERWSRVFQCGMRYNQMTTNIAEC---------------------GWLQANYYERRTNAVNWDASISKYGEEIVTEAEKY

Query:  ARRHVVRPIDRYEFQVDDGHLGGRVNIHTRMCTCREFDCFELPCSHAIATCIYRNVSYLDLCS
        AR H VRPIDR+EF+V DG     VN++++ CTC++FD FE+ CSHAIA  ++RN+S   LCS
Subjt:  ARRHVVRPIDRYEFQVDDGHLGGRVNIHTRMCTCREFDCFELPCSHAIATCIYRNVSYLDLCS

XP_022158655.1 uncharacterized protein LOC111025117 [Momordica charantia]7.3e-5042.45Show/hide
Query:  IDDNNQIYPVGYGIGSGETDELWTYFFRHMGCAIGQLYPLVIVSDRHKSICKVVRTVFPNAAHCYCMKHLEENPKTKFKDANFIPLYLKATKAFRETEFQ
        +D NNQIYPV + I   E+   W +F   +   +  +  LV +SDRH SICK +  VFP A HC+C+ HL+ N   KFK      L+ KA KAF E  F 
Subjt:  IDDNNQIYPVGYGIGSGETDELWTYFFRHMGCAIGQLYPLVIVSDRHKSICKVVRTVFPNAAHCYCMKHLEENPKTKFKDANFIPLYLKATKAFRETEFQ

Query:  KYWGKIPI--SAQTYLEEVRLERWSRVFQCGMRYNQMTTNIAE----CGWLQANYYERRTNAVNWDASISKYGEEIVTEAEKYARRHVVRPIDRYEFQVD
        + W ++      + YLE +  ERW+R FQ  +RY+QMTTNIAE       +   +YER+T A +  +++S Y EE++ EA   +RRH+V  ID++ F+V 
Subjt:  KYWGKIPI--SAQTYLEEVRLERWSRVFQCGMRYNQMTTNIAE----CGWLQANYYERRTNAVNWDASISKYGEEIVTEAEKYARRHVVRPIDRYEFQVD

Query:  DGHLGGRVNIHTRMCTCREFDCFELPCSHAIATCIYRNVSYLDLC
        DG+L G V++ ++ CTCREFD F++ CSHAIA    R+++   LC
Subjt:  DGHLGGRVNIHTRMCTCREFDCFELPCSHAIATCIYRNVSYLDLC

TrEMBL top hitse value%identityAlignment
A0A6J1BRM2 protein FAR1-RELATED SEQUENCE 4-like2.5e-5141.22Show/hide
Query:  IDDNNQIYPVGYGIGSGETDELWTYFFRHMGCAIGQLYPLVIVSDRHKSICKVVRTVFPNAAHCYCMKHLEENPKTKFKDANFIPLYLKATKAFRETEFQ
        +D NNQIYPV + +   E+   W +F   +   +  +  LV +SDRH +ICK +  VFP A HC+C+ HL+ N   KFK +    L+ KA KAFRE+ F 
Subjt:  IDDNNQIYPVGYGIGSGETDELWTYFFRHMGCAIGQLYPLVIVSDRHKSICKVVRTVFPNAAHCYCMKHLEENPKTKFKDANFIPLYLKATKAFRETEFQ

Query:  KYWGKIPI--SAQTYLEEVRLERWSRVFQCGMRYNQMTTNIAEC---------------------GWLQANYYERRTNAVNWDASISKYGEEIVTEAEKY
        + W ++      + YLE +  ERW+R FQ  +RY+QMTTNIAE                      G LQ  +YE RT A +  +++S Y EE++ EA   
Subjt:  KYWGKIPI--SAQTYLEEVRLERWSRVFQCGMRYNQMTTNIAEC---------------------GWLQANYYERRTNAVNWDASISKYGEEIVTEAEKY

Query:  ARRHVVRPIDRYEFQVDDGHLGGRVNIHTRMCTCREFDCFELPCSHAIATCIYRNVSYLDLC
        ARRH+V  ID++ F+V DG+L G V++ ++ CTCREFD F++PCSHAIA    R+++   LC
Subjt:  ARRHVVRPIDRYEFQVDDGHLGGRVNIHTRMCTCREFDCFELPCSHAIATCIYRNVSYLDLC

A0A6J1D278 uncharacterized protein LOC1110166233.3e-5642.97Show/hide
Query:  IDDNNQIYPVGYGIGSGETDELWTYFFRHMGCAIGQLYPLVIVSDRHKSICKVVRTVFPNAAHCYCMKHLEENPKTKFKDANFIPLYLKATKAFRETEFQ
        +D NNQIYPV +GI   E+D  WT+F   +  AIG++  LV VSDRH++I   V  VFPNA H  CM H++ N   KFK+     LYL+A +AF+++ F+
Subjt:  IDDNNQIYPVGYGIGSGETDELWTYFFRHMGCAIGQLYPLVIVSDRHKSICKVVRTVFPNAAHCYCMKHLEENPKTKFKDANFIPLYLKATKAFRETEFQ

Query:  KYWGKIP--ISAQTYLEEVRLERWSRVFQCGMRYNQMTTNIAEC---------------------GWLQANYYERRTNAVNWDASISKYGEEIVTEAEKY
         YW ++      Q YL+EV  ++WSR +Q  +RYNQMTTNIAE                        LQ  +YERRT+A    + +++Y E+I+ ++ + 
Subjt:  KYWGKIP--ISAQTYLEEVRLERWSRVFQCGMRYNQMTTNIAEC---------------------GWLQANYYERRTNAVNWDASISKYGEEIVTEAEKY

Query:  ARRHVVRPIDRYEFQVDDGHLGGRVNIHTRMCTCREFDCFELPCSHAIATCIYRNVSYLDLCS
        ARRH +RPID YEF+V DG    RVN++++ C+C++FD +++PCSHAIA  + +NV+   LCS
Subjt:  ARRHVVRPIDRYEFQVDDGHLGGRVNIHTRMCTCREFDCFELPCSHAIATCIYRNVSYLDLCS

A0A6J1DLB0 uncharacterized protein LOC1110219692.9e-5241.06Show/hide
Query:  IDDNNQIYPVGYGIGSGETDELWTYFFRHMGCAIGQLYPLVIVSDRHKSICKVVRTVFPNAAHCYCMKHLEENPKTKFKDANFIPLYLKATKAFRETEFQ
        +D NNQIYP+ +G+   E+DE WT+F   +   IG +  LV VSDRH++I   V T+F +AAH  CM H+      KF++     ++ KA KAF+ ++F+
Subjt:  IDDNNQIYPVGYGIGSGETDELWTYFFRHMGCAIGQLYPLVIVSDRHKSICKVVRTVFPNAAHCYCMKHLEENPKTKFKDANFIPLYLKATKAFRETEFQ

Query:  KYWGKIP--ISAQTYLEEVRLERWSRVFQCGMRYNQMTTNIAEC---------------------GWLQANYYERRTNAVNWDASISKYGEEIVTEAEKY
         YWG++        YLE++ L++W+R +Q GMRYNQMT+N+AE                        LQ  +Y+RRT   +    +++Y E I+ E  + 
Subjt:  KYWGKIP--ISAQTYLEEVRLERWSRVFQCGMRYNQMTTNIAEC---------------------GWLQANYYERRTNAVNWDASISKYGEEIVTEAEKY

Query:  ARRHVVRPIDRYEFQVDDGHLGGRVNIHTRMCTCREFDCFELPCSHAIATCIYRNVSYLDLCS
        AR H VRPIDR+EF+V DG    RVNI+++ CTC++F  +E+PCSHAIA  + RN+S   LCS
Subjt:  ARRHVVRPIDRYEFQVDDGHLGGRVNIHTRMCTCREFDCFELPCSHAIATCIYRNVSYLDLCS

A0A6J1DU12 protein FAR-RED ELONGATED HYPOCOTYL 3-like2.2e-5241.44Show/hide
Query:  IDDNNQIYPVGYGIGSGETDELWTYFFRHMGCAIGQLYPLVIVSDRHKSICKVVRTVFPNAAHCYCMKHLEENPKTKFKDANFIPLYLKATKAFRETEFQ
        ID NNQIY + +G+     D+ WT+F   +   IG +  LV +SDRH+SI   V TVFP+AAH  CM HL      KF++     ++ KA KAF+ ++F+
Subjt:  IDDNNQIYPVGYGIGSGETDELWTYFFRHMGCAIGQLYPLVIVSDRHKSICKVVRTVFPNAAHCYCMKHLEENPKTKFKDANFIPLYLKATKAFRETEFQ

Query:  KYWGKIP--ISAQTYLEEVRLERWSRVFQCGMRYNQMTTNIAEC---------------------GWLQANYYERRTNAVNWDASISKYGEEIVTEAEKY
         YWG++      Q YLE++  ++W+R +Q GMRYNQMT+N+AE                        LQ  +YERRT A +    +++Y E I+ +  + 
Subjt:  KYWGKIP--ISAQTYLEEVRLERWSRVFQCGMRYNQMTTNIAEC---------------------GWLQANYYERRTNAVNWDASISKYGEEIVTEAEKY

Query:  ARRHVVRPIDRYEFQVDDGHLGGRVNIHTRMCTCREFDCFELPCSHAIATCIYRNVSYLDLCS
        AR H VRPIDR+EF+V DG     VN++++ CTC++FD FE+ CSHAIA  ++RN+S   LCS
Subjt:  ARRHVVRPIDRYEFQVDDGHLGGRVNIHTRMCTCREFDCFELPCSHAIATCIYRNVSYLDLCS

A0A6J1DWF8 uncharacterized protein LOC1110251173.6e-5042.45Show/hide
Query:  IDDNNQIYPVGYGIGSGETDELWTYFFRHMGCAIGQLYPLVIVSDRHKSICKVVRTVFPNAAHCYCMKHLEENPKTKFKDANFIPLYLKATKAFRETEFQ
        +D NNQIYPV + I   E+   W +F   +   +  +  LV +SDRH SICK +  VFP A HC+C+ HL+ N   KFK      L+ KA KAF E  F 
Subjt:  IDDNNQIYPVGYGIGSGETDELWTYFFRHMGCAIGQLYPLVIVSDRHKSICKVVRTVFPNAAHCYCMKHLEENPKTKFKDANFIPLYLKATKAFRETEFQ

Query:  KYWGKIPI--SAQTYLEEVRLERWSRVFQCGMRYNQMTTNIAE----CGWLQANYYERRTNAVNWDASISKYGEEIVTEAEKYARRHVVRPIDRYEFQVD
        + W ++      + YLE +  ERW+R FQ  +RY+QMTTNIAE       +   +YER+T A +  +++S Y EE++ EA   +RRH+V  ID++ F+V 
Subjt:  KYWGKIPI--SAQTYLEEVRLERWSRVFQCGMRYNQMTTNIAE----CGWLQANYYERRTNAVNWDASISKYGEEIVTEAEKYARRHVVRPIDRYEFQVD

Query:  DGHLGGRVNIHTRMCTCREFDCFELPCSHAIATCIYRNVSYLDLC
        DG+L G V++ ++ CTCREFD F++ CSHAIA    R+++   LC
Subjt:  DGHLGGRVNIHTRMCTCREFDCFELPCSHAIATCIYRNVSYLDLC

SwissProt top hitse value%identityAlignment
P19775 Transposase for insertion sequence element IS256 in transposon Tn40013.7e-0427.59Show/hide
Query:  RVKKSNKKIVKVWCVVEELDIDDNNQIYPVGYGIGSGETDELWTYFFRHMGCAIGQLYPLVIVSDRHKSICKVVRTVFPNAAHCYCMKHLEENPKTKFKD
        +V++ N+ + K   +   +  D + +I  +G+ I SGE++E WT FF ++    G     +++SD HK +   +R  F N +   C  H   N       
Subjt:  RVKKSNKKIVKVWCVVEELDIDDNNQIYPVGYGIGSGETDELWTYFFRHMGCAIGQLYPLVIVSDRHKSICKVVRTVFPNAAHCYCMKHLEENPKTKFKD

Query:  ANFIPLYLKATKAFRE
          F  +  K +K+FRE
Subjt:  ANFIPLYLKATKAFRE

P59787 Transposase for insertion sequence element IS256 in transposon Tn40013.7e-0427.59Show/hide
Query:  RVKKSNKKIVKVWCVVEELDIDDNNQIYPVGYGIGSGETDELWTYFFRHMGCAIGQLYPLVIVSDRHKSICKVVRTVFPNAAHCYCMKHLEENPKTKFKD
        +V++ N+ + K   +   +  D + +I  +G+ I SGE++E WT FF ++    G     +++SD HK +   +R  F N +   C  H   N       
Subjt:  RVKKSNKKIVKVWCVVEELDIDDNNQIYPVGYGIGSGETDELWTYFFRHMGCAIGQLYPLVIVSDRHKSICKVVRTVFPNAAHCYCMKHLEENPKTKFKD

Query:  ANFIPLYLKATKAFRE
          F  +  K +K+FRE
Subjt:  ANFIPLYLKATKAFRE

Arabidopsis top hitse value%identityAlignment
AT1G64255.1 MuDR family transposase4.6e-1021.79Show/hide
Query:  IDDNNQIYPVGYGIGSGETDELWTYFFRHMGCAIGQLYPLVIVSDRHKSICKVV-----RTVFPNAAHCYCMKHLEENPKTKFKDANFIPLYLKATKAFR
        +D  N+ +P+ + +    + ++W +F   +   + Q   L ++S  H  I  VV     +   P A H + + H        F          +A    +
Subjt:  IDDNNQIYPVGYGIGSGETDELWTYFFRHMGCAIGQLYPLVIVSDRHKSICKVV-----RTVFPNAAHCYCMKHLEENPKTKFKDANFIPLYLKATKAFR

Query:  ETEFQKYWGKIP---ISAQTYLEEVRLERWSRVFQCGMRYNQMTTNIAECGWLQANYYERRTNAV-------------NWDASIS----------KYGEE
        + EF  Y   I      A+ +L++    RW+     G RY  M  N  +  +   N +E+  + V              +D S S           Y E 
Subjt:  ETEFQKYWGKIP---ISAQTYLEEVRLERWSRVFQCGMRYNQMTTNIAECGWLQANYYERRTNAV-------------NWDASIS----------KYGEE

Query:  IVTEAEKY-----ARRHVVRPIDRYEFQVDDGHLGGR--VNIHTRMCTCREFDCFELPCSHAIATC---IYRNVSYLDLC
        ++ + E++        ++V P+D   FQV      G   V +    CTC +F  ++ PC HA+A C    +  + Y+D C
Subjt:  IVTEAEKY-----ARRHVVRPIDRYEFQVDDGHLGGR--VNIHTRMCTCREFDCFELPCSHAIATC---IYRNVSYLDLC

AT1G64260.1 MuDR family transposase1.3e-1221.79Show/hide
Query:  IDDNNQIYPVGYGIGSGETDELWTYFFRHMGCAIGQLYPLVIVSDRHKSICKVVRT-----VFPNAAHCYCMKHLEENPKTKFKDANFIPLYLKATKAFR
        +D  N+ +P+ + +    + + W +FF  +   + Q   L ++S   + I  VV         P A H +C+ HL       F+D N   L  +A    +
Subjt:  IDDNNQIYPVGYGIGSGETDELWTYFFRHMGCAIGQLYPLVIVSDRHKSICKVVRT-----VFPNAAHCYCMKHLEENPKTKFKDANFIPLYLKATKAFR

Query:  ETEFQKYWGKIP---ISAQTYLEEVRLERWSRVFQCGMRYNQMTTN---------------IAECGWLQANYYERRTNAVNWDASISK----------YG
        + EF  Y   I      A  +L+++   +W+     G+RY  +  +               +A  G +   + E R+   ++D S+S           Y 
Subjt:  ETEFQKYWGKIP---ISAQTYLEEVRLERWSRVFQCGMRYNQMTTN---------------IAECGWLQANYYERRTNAVNWDASISK----------YG

Query:  EEIVTEAEKY---ARRHVVRPIDRYEFQVDDGHLGGR--VNIHTRMCTCREFDCFELPCSHAIATCIYRNVS---YLDLC
        E  + + E++   +  +V+  ++R  F+V +        V ++   CTCR+F  ++ PC HA+A      ++   Y+D C
Subjt:  EEIVTEAEKY---ARRHVVRPIDRYEFQVDDGHLGGR--VNIHTRMCTCREFDCFELPCSHAIATCIYRNVS---YLDLC

AT4G15090.1 FRS (FAR1 Related Sequences) transcription factor family6.5e-0422.15Show/hide
Query:  LDIDDNNQIYPVGYGIGSGETDELWTYFFRHMGCAIGQLYPLVIVSDRHKSICKVVRTVFPNAAHCYCMKHLEENPKTKF-----KDANFIPLYLKAT-K
        + ++ ++Q   +G  + + E+ E + +  +    A+G   P VI++D+ K +   V  + PN  HC+ + H+ E     F     +  NF+  + K   +
Subjt:  LDIDDNNQIYPVGYGIGSGETDELWTYFFRHMGCAIGQLYPLVIVSDRHKSICKVVRTVFPNAAHCYCMKHLEENPKTKF-----KDANFIPLYLKAT-K

Query:  AFRETEFQKYWGKIPISAQTYLEEVRL------ERW-----SRVFQCGMRYNQMTTNI
        ++ + EF   W K+        +E  L      ++W     S VF  GM  +Q + ++
Subjt:  AFRETEFQKYWGKIPISAQTYLEEVRL------ERW-----SRVFQCGMRYNQMTTNI


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCCATTTACACCCATTTATAACGAGCCAACATCACTGAGTCCATTATATGACAACGAGGGGGCATTTAACCTTGAGGATGATGTTGACTACGTTACCCTGCAAAATGA
TTTCCATGATTGGGGAGAGTATGAAGACGATGTTAACCCGGACAATGATTTCCATGATTGGGGAGAATACGAAGAAGAGGAGTCGGACACATATATAGATGGAGCACATG
AGAGTGGGGAGAAAGATTTGTATGAAGTTGATGTAACTGCCCAGGAGATGGAGGAAAACATCGGGGTTCAACGAGTGGTACCTGAAACTACACAAAACCTGGATCCTCTT
TCTATGCCGGTTAGTATGGCATCAGACCCTTCTACAACGACTCGTGCATCTAATTCAACAATCATGCCAGGTCAATACTCTAATTACAAGGATATAGAAGTGGGGGATAT
ATTCTTATCTAAGAAGGACTTGCAGATGAGACTGTCTGTTTTAGGGATGAGACAGAACTTTGAATATAGGGTTAAGAAATCTAACAAGAAAATAGTCAAAGTTTGGTGTG
TTGTCGAGGAATTAGACATCGATGACAACAACCAAATATACCCCGTAGGGTATGGCATTGGAAGTGGAGAAACAGATGAATTGTGGACCTACTTTTTCCGGCATATGGGG
TGTGCTATTGGTCAACTTTATCCTCTAGTGATTGTATCTGATAGACACAAGAGCATATGCAAAGTAGTACGAACTGTGTTTCCTAATGCAGCTCACTGCTACTGTATGAA
ACATCTAGAAGAGAACCCGAAAACAAAGTTCAAGGATGCCAACTTTATTCCACTATACCTTAAAGCAACAAAGGCATTTCGCGAAACCGAGTTTCAGAAATATTGGGGCA
AAATTCCAATAAGTGCCCAAACATACTTAGAAGAAGTCAGACTTGAACGGTGGTCTCGGGTTTTTCAGTGTGGAATGAGATACAATCAGATGACCACAAACATTGCGGAA
TGTGGGTGGCTGCAAGCCAATTACTACGAACGTCGTACCAATGCAGTGAATTGGGATGCATCGATATCAAAATATGGTGAGGAAATTGTTACAGAAGCAGAGAAGTATGC
AAGAAGGCACGTGGTTAGACCAATTGACCGGTACGAATTCCAAGTCGACGATGGACACTTGGGTGGTCGTGTCAACATCCACACGAGGATGTGTACTTGTCGTGAATTCG
ATTGTTTCGAACTTCCGTGCTCCCATGCAATCGCAACCTGCATCTATCGTAATGTATCCTACTTGGATCTCTGTTCTCCATACTAA
mRNA sequenceShow/hide mRNA sequence
ATGCCATTTACACCCATTTATAACGAGCCAACATCACTGAGTCCATTATATGACAACGAGGGGGCATTTAACCTTGAGGATGATGTTGACTACGTTACCCTGCAAAATGA
TTTCCATGATTGGGGAGAGTATGAAGACGATGTTAACCCGGACAATGATTTCCATGATTGGGGAGAATACGAAGAAGAGGAGTCGGACACATATATAGATGGAGCACATG
AGAGTGGGGAGAAAGATTTGTATGAAGTTGATGTAACTGCCCAGGAGATGGAGGAAAACATCGGGGTTCAACGAGTGGTACCTGAAACTACACAAAACCTGGATCCTCTT
TCTATGCCGGTTAGTATGGCATCAGACCCTTCTACAACGACTCGTGCATCTAATTCAACAATCATGCCAGGTCAATACTCTAATTACAAGGATATAGAAGTGGGGGATAT
ATTCTTATCTAAGAAGGACTTGCAGATGAGACTGTCTGTTTTAGGGATGAGACAGAACTTTGAATATAGGGTTAAGAAATCTAACAAGAAAATAGTCAAAGTTTGGTGTG
TTGTCGAGGAATTAGACATCGATGACAACAACCAAATATACCCCGTAGGGTATGGCATTGGAAGTGGAGAAACAGATGAATTGTGGACCTACTTTTTCCGGCATATGGGG
TGTGCTATTGGTCAACTTTATCCTCTAGTGATTGTATCTGATAGACACAAGAGCATATGCAAAGTAGTACGAACTGTGTTTCCTAATGCAGCTCACTGCTACTGTATGAA
ACATCTAGAAGAGAACCCGAAAACAAAGTTCAAGGATGCCAACTTTATTCCACTATACCTTAAAGCAACAAAGGCATTTCGCGAAACCGAGTTTCAGAAATATTGGGGCA
AAATTCCAATAAGTGCCCAAACATACTTAGAAGAAGTCAGACTTGAACGGTGGTCTCGGGTTTTTCAGTGTGGAATGAGATACAATCAGATGACCACAAACATTGCGGAA
TGTGGGTGGCTGCAAGCCAATTACTACGAACGTCGTACCAATGCAGTGAATTGGGATGCATCGATATCAAAATATGGTGAGGAAATTGTTACAGAAGCAGAGAAGTATGC
AAGAAGGCACGTGGTTAGACCAATTGACCGGTACGAATTCCAAGTCGACGATGGACACTTGGGTGGTCGTGTCAACATCCACACGAGGATGTGTACTTGTCGTGAATTCG
ATTGTTTCGAACTTCCGTGCTCCCATGCAATCGCAACCTGCATCTATCGTAATGTATCCTACTTGGATCTCTGTTCTCCATACTAA
Protein sequenceShow/hide protein sequence
MPFTPIYNEPTSLSPLYDNEGAFNLEDDVDYVTLQNDFHDWGEYEDDVNPDNDFHDWGEYEEEESDTYIDGAHESGEKDLYEVDVTAQEMEENIGVQRVVPETTQNLDPL
SMPVSMASDPSTTTRASNSTIMPGQYSNYKDIEVGDIFLSKKDLQMRLSVLGMRQNFEYRVKKSNKKIVKVWCVVEELDIDDNNQIYPVGYGIGSGETDELWTYFFRHMG
CAIGQLYPLVIVSDRHKSICKVVRTVFPNAAHCYCMKHLEENPKTKFKDANFIPLYLKATKAFRETEFQKYWGKIPISAQTYLEEVRLERWSRVFQCGMRYNQMTTNIAE
CGWLQANYYERRTNAVNWDASISKYGEEIVTEAEKYARRHVVRPIDRYEFQVDDGHLGGRVNIHTRMCTCREFDCFELPCSHAIATCIYRNVSYLDLCSPY