; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lsi09G001450 (gene) of Bottle gourd (USVL1VR-Ls) v1 genome

Gene IDLsi09G001450
OrganismLagenaria siceraria USVL1VR-Ls (Bottle gourd (USVL1VR-Ls) v1)
DescriptionUPF0160 protein-like
Genome locationchr09:1474523..1481693
RNA-Seq ExpressionLsi09G001450
SyntenyLsi09G001450
Gene Ontology termsGO:0005634 - nucleus (cellular component)
GO:0005737 - cytoplasm (cellular component)
GO:0016020 - membrane (cellular component)
InterPro domainsIPR003226 - Metal-dependent protein hydrolase


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0042656.1 UPF0160 protein-like [Cucumis melo var. makuwa]4.0e-20394.51Show/hide
Query:  GLGFNHKQFLSFPNFFFLRTFMATSPLASLSPASPSDSISVKRVGTHHGSFHCDEALGCFMIRLTDKFSNAQIVRTRDPQVLDGLDAVLDVGGVYDPSRD
        GLGFN  QFL FP FFFLRTFMA+SPLASLSPASPSDSI VKRVGTHHGSFHCDEALGCFMIRLTDKFSNAQIVRTRDPQVL GLDAVLDVGGVYDPS D
Subjt:  GLGFNHKQFLSFPNFFFLRTFMATSPLASLSPASPSDSISVKRVGTHHGSFHCDEALGCFMIRLTDKFSNAQIVRTRDPQVLDGLDAVLDVGGVYDPSRD

Query:  RYDHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKEIIAKELQVDEGHPDVHRLFLAVYKSFMEAIDAVDNGINQYDTDKPPKYVNNTHLSSRVGRLNLDWI
        RYDHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKEIIAKELQVDEGHPDVHRLFLA+YKSFMEAIDAVDNGINQYDTDKPPKYVNNTHLSSRVGRLNLDWI
Subjt:  RYDHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKEIIAKELQVDEGHPDVHRLFLAVYKSFMEAIDAVDNGINQYDTDKPPKYVNNTHLSSRVGRLNLDWI

Query:  EPDQSSENENKAFEKAMALAGSEFLDSVRFHAKSWLPARSIVMGCLAARHEIDPSGEIMILTTFCPWKLHLFELEQELKIENSIKYVLYEDDRSKHWRVQ
        +PDQSSENENKAFEKAMALAGSEFLDSVRFHAKSWLPARSIVMG LA RH+IDPSGEIM++TTFCPWKLHLFELE ELKIENSIKYVLY+DDRSKHWRVQ
Subjt:  EPDQSSENENKAFEKAMALAGSEFLDSVRFHAKSWLPARSIVMGCLAARHEIDPSGEIMILTTFCPWKLHLFELEQELKIENSIKYVLYEDDRSKHWRVQ

Query:  AVAVSPDRFESRRPLPAQWRGLRDEELSKESGIPGCVFVHMSGFIGGNQTYEGALTMAKHALKL
        AVAVSPDRFESR+PLPAQWRGLRDEELSKESGIPGCVFVHMSGFIGGNQTY+GALTMAK+ALKL
Subjt:  AVAVSPDRFESRRPLPAQWRGLRDEELSKESGIPGCVFVHMSGFIGGNQTYEGALTMAKHALKL

XP_004143846.2 UPF0160 protein [Cucumis sativus]5.8e-20294.54Show/hide
Query:  GLGFNHKQFLSFPNFFFLRTFMATSPLASLSPA--SPSDSISVKRVGTHHGSFHCDEALGCFMIRLTDKFSNAQIVRTRDPQVLDGLDAVLDVGGVYDPS
        GLGFN  QFLSFPNFFFLRTFMA+SPLASLSPA  SPSDSI +KRVGTHHGSFHCDEALGCFMIRLTDKFSNAQIVRTRDPQVL GLDAVLDVGGVYDPS
Subjt:  GLGFNHKQFLSFPNFFFLRTFMATSPLASLSPA--SPSDSISVKRVGTHHGSFHCDEALGCFMIRLTDKFSNAQIVRTRDPQVLDGLDAVLDVGGVYDPS

Query:  RDRYDHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKEIIAKELQVDEGHPDVHRLFLAVYKSFMEAIDAVDNGINQYDTDKPPKYVNNTHLSSRVGRLNLD
         DRYDHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKEIIAKELQVDEGHPDVHRLFLAVYKSFMEAIDAVDNGINQYDTDKPPKYVNNTHLSSRVGRLNLD
Subjt:  RDRYDHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKEIIAKELQVDEGHPDVHRLFLAVYKSFMEAIDAVDNGINQYDTDKPPKYVNNTHLSSRVGRLNLD

Query:  WIEPDQSSENENKAFEKAMALAGSEFLDSVRFHAKSWLPARSIVMGCLAARHEIDPSGEIMILTTFCPWKLHLFELEQELKIENSIKYVLYEDDRSKHWR
        WI+PDQS ENENKAFEKAMALAG+EFLDSVRFHAKSWLPARSIVMG LAARH IDPSGEIM++TTFCPWKLHLFELE ELKIENSIKYVLY+DDRSKHWR
Subjt:  WIEPDQSSENENKAFEKAMALAGSEFLDSVRFHAKSWLPARSIVMGCLAARHEIDPSGEIMILTTFCPWKLHLFELEQELKIENSIKYVLYEDDRSKHWR

Query:  VQAVAVSPDRFESRRPLPAQWRGLRDEELSKESGIPGCVFVHMSGFIGGNQTYEGALTMAKHALKL
        VQAVAVSPDRFESRRPLPAQWRGLRDEELSKESGIPGCVFVHMSGFIGGNQTY+GALTMAK+ALKL
Subjt:  VQAVAVSPDRFESRRPLPAQWRGLRDEELSKESGIPGCVFVHMSGFIGGNQTYEGALTMAKHALKL

XP_008437439.1 PREDICTED: UPF0160 protein-like [Cucumis melo]1.5e-20294.23Show/hide
Query:  GLGFNHKQFLSFPNFFFLRTFMATSPLASLSPASPSDSISVKRVGTHHGSFHCDEALGCFMIRLTDKFSNAQIVRTRDPQVLDGLDAVLDVGGVYDPSRD
        GLGFN  QFL FP FFFLRTFMA+ PLASLSPASPSDSI VKRVGTHHGSFHCDEALGCFMIRLTDKFSNAQIVRTRDPQVL GLDAVLDVGGVYDPS D
Subjt:  GLGFNHKQFLSFPNFFFLRTFMATSPLASLSPASPSDSISVKRVGTHHGSFHCDEALGCFMIRLTDKFSNAQIVRTRDPQVLDGLDAVLDVGGVYDPSRD

Query:  RYDHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKEIIAKELQVDEGHPDVHRLFLAVYKSFMEAIDAVDNGINQYDTDKPPKYVNNTHLSSRVGRLNLDWI
        RYDHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKEIIAKELQVDEGHPDVHRLFLA+YKSFMEAIDAVDNGINQYDTDKPPKYVNNTHLSSRVGRLNLDWI
Subjt:  RYDHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKEIIAKELQVDEGHPDVHRLFLAVYKSFMEAIDAVDNGINQYDTDKPPKYVNNTHLSSRVGRLNLDWI

Query:  EPDQSSENENKAFEKAMALAGSEFLDSVRFHAKSWLPARSIVMGCLAARHEIDPSGEIMILTTFCPWKLHLFELEQELKIENSIKYVLYEDDRSKHWRVQ
        +PDQSSENENKAFEKAMALAGSEFLDSVRFHAKSWLPARSIVMG LA RH+IDPSGEIM++TTFCPWKLHLFELE ELKIENSIKYVLY+DDRSKHWRVQ
Subjt:  EPDQSSENENKAFEKAMALAGSEFLDSVRFHAKSWLPARSIVMGCLAARHEIDPSGEIMILTTFCPWKLHLFELEQELKIENSIKYVLYEDDRSKHWRVQ

Query:  AVAVSPDRFESRRPLPAQWRGLRDEELSKESGIPGCVFVHMSGFIGGNQTYEGALTMAKHALKL
        AVAVSPDRFESR+PLPAQWRGLRDEELSKESGIPGCVFVHMSGFIGGNQTY+GALTMAK+ALKL
Subjt:  AVAVSPDRFESRRPLPAQWRGLRDEELSKESGIPGCVFVHMSGFIGGNQTYEGALTMAKHALKL

XP_022131236.1 UPF0160 protein [Momordica charantia]1.4e-19590.93Show/hide
Query:  GLGFNHKQFLSFPNFFFLRTFMATSPLASLSPASPSDSISVKRVGTHHGSFHCDEALGCFMIRLTDKFSNAQIVRTRDPQVLDGLDAVLDVGGVYDPSRD
        GLGFN KQ   FP FFFLR FMA+SP+AS+S  S  D ISVKRVGTHHGSFHCDEALGCFMIRLTDKFSNAQIVRTRDPQVL GLDAVLDVGGVYDPS D
Subjt:  GLGFNHKQFLSFPNFFFLRTFMATSPLASLSPASPSDSISVKRVGTHHGSFHCDEALGCFMIRLTDKFSNAQIVRTRDPQVLDGLDAVLDVGGVYDPSRD

Query:  RYDHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKEIIAKELQVDEGHPDVHRLFLAVYKSFMEAIDAVDNGINQYDTDKPPKYVNNTHLSSRVGRLNLDWI
        RYDHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKEIIAKELQVDEGHPDVHRLFLAVYKSFMEAIDAVDNGINQYD D+PPKYVNNTHLSSRVG+LNLDW 
Subjt:  RYDHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKEIIAKELQVDEGHPDVHRLFLAVYKSFMEAIDAVDNGINQYDTDKPPKYVNNTHLSSRVGRLNLDWI

Query:  EPDQSSENENKAFEKAMALAGSEFLDSVRFHAKSWLPARSIVMGCLAARHEIDPSGEIMILTTFCPWKLHLFELEQELKIENSIKYVLYEDDRSKHWRVQ
        +PDQS ENENKAFEKAM LAG EFLDSVRFHAKSWLPARSIVMGCLAAR+EIDPSGEIM+LTTFCPWKLHLFELE+E+K +N IKYVLY+DDRSKHWRVQ
Subjt:  EPDQSSENENKAFEKAMALAGSEFLDSVRFHAKSWLPARSIVMGCLAARHEIDPSGEIMILTTFCPWKLHLFELEQELKIENSIKYVLYEDDRSKHWRVQ

Query:  AVAVSPDRFESRRPLPAQWRGLRDEELSKESGIPGCVFVHMSGFIGGNQTYEGALTMAKHALKL
        AVAVSPDRFESR+PLPAQWRGLRDEELSKESGIPGCVFVHMSGFIGGNQTYEGALTMAKHALKL
Subjt:  AVAVSPDRFESRRPLPAQWRGLRDEELSKESGIPGCVFVHMSGFIGGNQTYEGALTMAKHALKL

XP_038907236.1 MYG1 protein [Benincasa hispida]2.3e-20695.88Show/hide
Query:  GLGFNHKQFLSFPNFFFLRTFMATSPLASLSPASPSDSISVKRVGTHHGSFHCDEALGCFMIRLTDKFSNAQIVRTRDPQVLDGLDAVLDVGGVYDPSRD
        GLGFNHKQFLSFPNFFFLRTFMATSPLASLSPASP+DSI VKRVGTHHGSFHCDEALGCFMIRLTDKFSNAQIVRTRDPQVL GLDAVLDVGGVYDPS D
Subjt:  GLGFNHKQFLSFPNFFFLRTFMATSPLASLSPASPSDSISVKRVGTHHGSFHCDEALGCFMIRLTDKFSNAQIVRTRDPQVLDGLDAVLDVGGVYDPSRD

Query:  RYDHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKEIIAKELQVDEGHPDVHRLFLAVYKSFMEAIDAVDNGINQYDTDKPPKYVNNTHLSSRVGRLNLDWI
        RYDHHQKGFEEVFGHGF+TKLSSAGLVYKHFGKEIIAKELQVDEGHPDVHRLFLAVYKSFMEAIDAVDNGINQYDTD+PPKYVNNTHLSSRVGRLNLDWI
Subjt:  RYDHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKEIIAKELQVDEGHPDVHRLFLAVYKSFMEAIDAVDNGINQYDTDKPPKYVNNTHLSSRVGRLNLDWI

Query:  EPDQSSENENKAFEKAMALAGSEFLDSVRFHAKSWLPARSIVMGCLAARHEIDPSGEIMILTTFCPWKLHLFELEQELKIENSIKYVLYEDDRSKHWRVQ
        +PDQS ENENKAFEKAMALAGSEFLDSVRFHAKSWLPARSIV+GCLA RH+IDPSGEIM+L TFCPWKLHLFELEQELKIENSIKYVLY+DDRSKHWRVQ
Subjt:  EPDQSSENENKAFEKAMALAGSEFLDSVRFHAKSWLPARSIVMGCLAARHEIDPSGEIMILTTFCPWKLHLFELEQELKIENSIKYVLYEDDRSKHWRVQ

Query:  AVAVSPDRFESRRPLPAQWRGLRDEELSKESGIPGCVFVHMSGFIGGNQTYEGALTMAKHALKL
        AVAVSPDRFESRRPLPAQWRGLRDEELSKESGIPGCVFVHMSGFIGGNQTYEGALTMAK ALKL
Subjt:  AVAVSPDRFESRRPLPAQWRGLRDEELSKESGIPGCVFVHMSGFIGGNQTYEGALTMAKHALKL

TrEMBL top hitse value%identityAlignment
A0A0A0KMA5 Protein MYG12.8e-20294.54Show/hide
Query:  GLGFNHKQFLSFPNFFFLRTFMATSPLASLSPA--SPSDSISVKRVGTHHGSFHCDEALGCFMIRLTDKFSNAQIVRTRDPQVLDGLDAVLDVGGVYDPS
        GLGFN  QFLSFPNFFFLRTFMA+SPLASLSPA  SPSDSI +KRVGTHHGSFHCDEALGCFMIRLTDKFSNAQIVRTRDPQVL GLDAVLDVGGVYDPS
Subjt:  GLGFNHKQFLSFPNFFFLRTFMATSPLASLSPA--SPSDSISVKRVGTHHGSFHCDEALGCFMIRLTDKFSNAQIVRTRDPQVLDGLDAVLDVGGVYDPS

Query:  RDRYDHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKEIIAKELQVDEGHPDVHRLFLAVYKSFMEAIDAVDNGINQYDTDKPPKYVNNTHLSSRVGRLNLD
         DRYDHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKEIIAKELQVDEGHPDVHRLFLAVYKSFMEAIDAVDNGINQYDTDKPPKYVNNTHLSSRVGRLNLD
Subjt:  RDRYDHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKEIIAKELQVDEGHPDVHRLFLAVYKSFMEAIDAVDNGINQYDTDKPPKYVNNTHLSSRVGRLNLD

Query:  WIEPDQSSENENKAFEKAMALAGSEFLDSVRFHAKSWLPARSIVMGCLAARHEIDPSGEIMILTTFCPWKLHLFELEQELKIENSIKYVLYEDDRSKHWR
        WI+PDQS ENENKAFEKAMALAG+EFLDSVRFHAKSWLPARSIVMG LAARH IDPSGEIM++TTFCPWKLHLFELE ELKIENSIKYVLY+DDRSKHWR
Subjt:  WIEPDQSSENENKAFEKAMALAGSEFLDSVRFHAKSWLPARSIVMGCLAARHEIDPSGEIMILTTFCPWKLHLFELEQELKIENSIKYVLYEDDRSKHWR

Query:  VQAVAVSPDRFESRRPLPAQWRGLRDEELSKESGIPGCVFVHMSGFIGGNQTYEGALTMAKHALKL
        VQAVAVSPDRFESRRPLPAQWRGLRDEELSKESGIPGCVFVHMSGFIGGNQTY+GALTMAK+ALKL
Subjt:  VQAVAVSPDRFESRRPLPAQWRGLRDEELSKESGIPGCVFVHMSGFIGGNQTYEGALTMAKHALKL

A0A1S3ATP0 UPF0160 protein-like7.4e-20394.23Show/hide
Query:  GLGFNHKQFLSFPNFFFLRTFMATSPLASLSPASPSDSISVKRVGTHHGSFHCDEALGCFMIRLTDKFSNAQIVRTRDPQVLDGLDAVLDVGGVYDPSRD
        GLGFN  QFL FP FFFLRTFMA+ PLASLSPASPSDSI VKRVGTHHGSFHCDEALGCFMIRLTDKFSNAQIVRTRDPQVL GLDAVLDVGGVYDPS D
Subjt:  GLGFNHKQFLSFPNFFFLRTFMATSPLASLSPASPSDSISVKRVGTHHGSFHCDEALGCFMIRLTDKFSNAQIVRTRDPQVLDGLDAVLDVGGVYDPSRD

Query:  RYDHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKEIIAKELQVDEGHPDVHRLFLAVYKSFMEAIDAVDNGINQYDTDKPPKYVNNTHLSSRVGRLNLDWI
        RYDHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKEIIAKELQVDEGHPDVHRLFLA+YKSFMEAIDAVDNGINQYDTDKPPKYVNNTHLSSRVGRLNLDWI
Subjt:  RYDHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKEIIAKELQVDEGHPDVHRLFLAVYKSFMEAIDAVDNGINQYDTDKPPKYVNNTHLSSRVGRLNLDWI

Query:  EPDQSSENENKAFEKAMALAGSEFLDSVRFHAKSWLPARSIVMGCLAARHEIDPSGEIMILTTFCPWKLHLFELEQELKIENSIKYVLYEDDRSKHWRVQ
        +PDQSSENENKAFEKAMALAGSEFLDSVRFHAKSWLPARSIVMG LA RH+IDPSGEIM++TTFCPWKLHLFELE ELKIENSIKYVLY+DDRSKHWRVQ
Subjt:  EPDQSSENENKAFEKAMALAGSEFLDSVRFHAKSWLPARSIVMGCLAARHEIDPSGEIMILTTFCPWKLHLFELEQELKIENSIKYVLYEDDRSKHWRVQ

Query:  AVAVSPDRFESRRPLPAQWRGLRDEELSKESGIPGCVFVHMSGFIGGNQTYEGALTMAKHALKL
        AVAVSPDRFESR+PLPAQWRGLRDEELSKESGIPGCVFVHMSGFIGGNQTY+GALTMAK+ALKL
Subjt:  AVAVSPDRFESRRPLPAQWRGLRDEELSKESGIPGCVFVHMSGFIGGNQTYEGALTMAKHALKL

A0A5A7TL15 UPF0160 protein-like2.0e-20394.51Show/hide
Query:  GLGFNHKQFLSFPNFFFLRTFMATSPLASLSPASPSDSISVKRVGTHHGSFHCDEALGCFMIRLTDKFSNAQIVRTRDPQVLDGLDAVLDVGGVYDPSRD
        GLGFN  QFL FP FFFLRTFMA+SPLASLSPASPSDSI VKRVGTHHGSFHCDEALGCFMIRLTDKFSNAQIVRTRDPQVL GLDAVLDVGGVYDPS D
Subjt:  GLGFNHKQFLSFPNFFFLRTFMATSPLASLSPASPSDSISVKRVGTHHGSFHCDEALGCFMIRLTDKFSNAQIVRTRDPQVLDGLDAVLDVGGVYDPSRD

Query:  RYDHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKEIIAKELQVDEGHPDVHRLFLAVYKSFMEAIDAVDNGINQYDTDKPPKYVNNTHLSSRVGRLNLDWI
        RYDHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKEIIAKELQVDEGHPDVHRLFLA+YKSFMEAIDAVDNGINQYDTDKPPKYVNNTHLSSRVGRLNLDWI
Subjt:  RYDHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKEIIAKELQVDEGHPDVHRLFLAVYKSFMEAIDAVDNGINQYDTDKPPKYVNNTHLSSRVGRLNLDWI

Query:  EPDQSSENENKAFEKAMALAGSEFLDSVRFHAKSWLPARSIVMGCLAARHEIDPSGEIMILTTFCPWKLHLFELEQELKIENSIKYVLYEDDRSKHWRVQ
        +PDQSSENENKAFEKAMALAGSEFLDSVRFHAKSWLPARSIVMG LA RH+IDPSGEIM++TTFCPWKLHLFELE ELKIENSIKYVLY+DDRSKHWRVQ
Subjt:  EPDQSSENENKAFEKAMALAGSEFLDSVRFHAKSWLPARSIVMGCLAARHEIDPSGEIMILTTFCPWKLHLFELEQELKIENSIKYVLYEDDRSKHWRVQ

Query:  AVAVSPDRFESRRPLPAQWRGLRDEELSKESGIPGCVFVHMSGFIGGNQTYEGALTMAKHALKL
        AVAVSPDRFESR+PLPAQWRGLRDEELSKESGIPGCVFVHMSGFIGGNQTY+GALTMAK+ALKL
Subjt:  AVAVSPDRFESRRPLPAQWRGLRDEELSKESGIPGCVFVHMSGFIGGNQTYEGALTMAKHALKL

A0A6J1BNZ6 UPF0160 protein6.7e-19690.93Show/hide
Query:  GLGFNHKQFLSFPNFFFLRTFMATSPLASLSPASPSDSISVKRVGTHHGSFHCDEALGCFMIRLTDKFSNAQIVRTRDPQVLDGLDAVLDVGGVYDPSRD
        GLGFN KQ   FP FFFLR FMA+SP+AS+S  S  D ISVKRVGTHHGSFHCDEALGCFMIRLTDKFSNAQIVRTRDPQVL GLDAVLDVGGVYDPS D
Subjt:  GLGFNHKQFLSFPNFFFLRTFMATSPLASLSPASPSDSISVKRVGTHHGSFHCDEALGCFMIRLTDKFSNAQIVRTRDPQVLDGLDAVLDVGGVYDPSRD

Query:  RYDHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKEIIAKELQVDEGHPDVHRLFLAVYKSFMEAIDAVDNGINQYDTDKPPKYVNNTHLSSRVGRLNLDWI
        RYDHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKEIIAKELQVDEGHPDVHRLFLAVYKSFMEAIDAVDNGINQYD D+PPKYVNNTHLSSRVG+LNLDW 
Subjt:  RYDHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKEIIAKELQVDEGHPDVHRLFLAVYKSFMEAIDAVDNGINQYDTDKPPKYVNNTHLSSRVGRLNLDWI

Query:  EPDQSSENENKAFEKAMALAGSEFLDSVRFHAKSWLPARSIVMGCLAARHEIDPSGEIMILTTFCPWKLHLFELEQELKIENSIKYVLYEDDRSKHWRVQ
        +PDQS ENENKAFEKAM LAG EFLDSVRFHAKSWLPARSIVMGCLAAR+EIDPSGEIM+LTTFCPWKLHLFELE+E+K +N IKYVLY+DDRSKHWRVQ
Subjt:  EPDQSSENENKAFEKAMALAGSEFLDSVRFHAKSWLPARSIVMGCLAARHEIDPSGEIMILTTFCPWKLHLFELEQELKIENSIKYVLYEDDRSKHWRVQ

Query:  AVAVSPDRFESRRPLPAQWRGLRDEELSKESGIPGCVFVHMSGFIGGNQTYEGALTMAKHALKL
        AVAVSPDRFESR+PLPAQWRGLRDEELSKESGIPGCVFVHMSGFIGGNQTYEGALTMAKHALKL
Subjt:  AVAVSPDRFESRRPLPAQWRGLRDEELSKESGIPGCVFVHMSGFIGGNQTYEGALTMAKHALKL

A0A6J1H185 UPF0160 protein1.5e-19590.38Show/hide
Query:  GLGFNHKQFLSFPNFFFLRTFMATSPLASLSPASPSDSISVKRVGTHHGSFHCDEALGCFMIRLTDKFSNAQIVRTRDPQVLDGLDAVLDVGGVYDPSRD
        GLGFNHKQF SFP FFFLR FMATSP+AS S  SP  ++SVKRVGTHHGSFHCDEALGCFMIRLT KFSNAQIVRTRDPQVL+GLDAVLDVGGVYDPS D
Subjt:  GLGFNHKQFLSFPNFFFLRTFMATSPLASLSPASPSDSISVKRVGTHHGSFHCDEALGCFMIRLTDKFSNAQIVRTRDPQVLDGLDAVLDVGGVYDPSRD

Query:  RYDHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKEIIAKELQVDEGHPDVHRLFLAVYKSFMEAIDAVDNGINQYDTDKPPKYVNNTHLSSRVGRLNLDWI
        RYDHHQKGFEEVFGHGF+TKLSSAGLVYKHFGKEIIAKELQVDEGHPDV RLFLAVYKSFME IDA+DNGINQYDTD+PPKYVNNTHLSSRVGRLNLDWI
Subjt:  RYDHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKEIIAKELQVDEGHPDVHRLFLAVYKSFMEAIDAVDNGINQYDTDKPPKYVNNTHLSSRVGRLNLDWI

Query:  EPDQSSENENKAFEKAMALAGSEFLDSVRFHAKSWLPARSIVMGCLAARHEIDPSGEIMILTTFCPWKLHLFELEQELKIENSIKYVLYEDDRSKHWRVQ
        +PDQSSENENKAFEKAMALAGSEFLDSVRFHAKSWLPARSIVM CL ARH+IDPSGEIM+LTTFCPWKLHLFELE ELK +N IKYVLY+DDRSK WRVQ
Subjt:  EPDQSSENENKAFEKAMALAGSEFLDSVRFHAKSWLPARSIVMGCLAARHEIDPSGEIMILTTFCPWKLHLFELEQELKIENSIKYVLYEDDRSKHWRVQ

Query:  AVAVSPDRFESRRPLPAQWRGLRDEELSKESGIPGCVFVHMSGFIGGNQTYEGALTMAKHALKL
        AVA++PDRFESR+PLPAQWRGLRDEELSKESGIPGCVFVHMSGFIGGNQTYEGAL MAK ALKL
Subjt:  AVAVSPDRFESRRPLPAQWRGLRDEELSKESGIPGCVFVHMSGFIGGNQTYEGALTMAKHALKL

SwissProt top hitse value%identityAlignment
Q55G91 MYG1 protein6.5e-8750.48Show/hide
Query:  THHGSFHCDEALGCFMIRLTDKFSNAQIVRTRDPQVLDGLDAVLDVGGVYDPSRDRYDHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKEIIAKELQVDEG
        TH GSFH DEAL C++++L   + +++I+R+RD  V++     +DVG VY+  + R+DHHQ GF E F      KLSSAGL+YKH+GK+II + L  ++ 
Subjt:  THHGSFHCDEALGCFMIRLTDKFSNAQIVRTRDPQVLDGLDAVLDVGGVYDPSRDRYDHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKEIIAKELQVDEG

Query:  HPDVHRLFLAVYKSFMEAIDAVDNGINQYDTDKPPKYVNNTHLSSRVGRLNLDWIEPDQSSENENKAFEKAMALAGSEFLDSVRFHAKSWLPARSIVMGC
          ++  L+  +Y S ++ +D VDNG+ +Y +D  P+Y + + +S+RVG LN  W EP Q  E  NK FEKAM L G  FLD + ++ KSWLP RSIV   
Subjt:  HPDVHRLFLAVYKSFMEAIDAVDNGINQYDTDKPPKYVNNTHLSSRVGRLNLDWIEPDQSSENENKAFEKAMALAGSEFLDSVRFHAKSWLPARSIVMGC

Query:  LAARHEIDPSGEIMILTTFCPWKLHLFELEQELKIENSIKYVLYEDDRSKHWRVQAVAVSPDRFESRRPLPAQWRGLRDEELSKESGIPGCVFVHMSGFI
        L  R +   SGEI+IL  FCPWK HLF LEQE  I+  IK+VL+E D S  WRV AV ++   F  R PLP +WRG RDEELS+ SGI GCVF H +GFI
Subjt:  LAARHEIDPSGEIMILTTFCPWKLHLFELEQELKIENSIKYVLYEDDRSKHWRVQAVAVSPDRFESRRPLPAQWRGLRDEELSKESGIPGCVFVHMSGFI

Query:  GGNQTYEGALTMA
        GGN+T EGAL MA
Subjt:  GGNQTYEGALTMA

Q58DG1 MYG1 exonuclease3.1e-8948.21Show/hide
Query:  FLRTFMATSPL------ASLSPASPSDS-----ISVKRVGTHHGSFHCDEALGCFMIRLTDKFSNAQIVRTRDPQVLDGLDAVLDVGGVYDPSRDRYDHH
        FLR  +   PL       SL P +PS       ++  R+GTH+G+FHCDEAL C ++RL  ++  A+IVRTRDP+ L   D V+DVGG YDP R RYDHH
Subjt:  FLRTFMATSPL------ASLSPASPSDS-----ISVKRVGTHHGSFHCDEALGCFMIRLTDKFSNAQIVRTRDPQVLDGLDAVLDVGGVYDPSRDRYDHH

Query:  QKGFEEVF-----GHGFSTKLSSAGLVYKHFGKEIIAKELQVDEGHPDVHRLFLAVYKSFMEAIDAVDNGINQYDTDKPPKYVNNTHLSSRVGRLNLDWI
        Q+ F E       G  + TKLSSAGL+Y HFG +++A+ L   E    V  L+  +Y++F+E +DAVDNGI+Q++ +  P+Y+  T LS+RV RLN  W 
Subjt:  QKGFEEVF-----GHGFSTKLSSAGLVYKHFGKEIIAKELQVDEGHPDVHRLFLAVYKSFMEAIDAVDNGINQYDTDKPPKYVNNTHLSSRVGRLNLDWI

Query:  EPDQSSENENKAFEKAMALAGSEFLDSVRFHAKSWLPARSIVMGCLAARHEIDPSGEIMILTT-FCPWKLHLFELEQELKIENSIKYVLYEDDRSKHWRV
        +P+Q +E     F++AM L   EFL  + F+  SWLPAR++V   LA R ++DPSGEI+ L    CPWK HL++LE  L    +I +V+Y  D++  WRV
Subjt:  EPDQSSENENKAFEKAMALAGSEFLDSVRFHAKSWLPARSIVMGCLAARHEIDPSGEIMILTT-FCPWKLHLFELEQELKIENSIKYVLYEDDRSKHWRV

Query:  QAVAVSPDRFESRRPLPAQWRGLRDEELSKESGIPGCVFVHMSGFIGGNQTYEGALTMAKHAL
        Q V   P  F+SR PL   WRGLRDE L + SGIPGC+FVH SGFIGG++T EGAL+MA+  L
Subjt:  QAVAVSPDRFESRRPLPAQWRGLRDEELSKESGIPGCVFVHMSGFIGGNQTYEGALTMAKHAL

Q641W2 MYG1 exonuclease1.2e-8850Show/hide
Query:  LSPASPSDSI-SVKRVGTHHGSFHCDEALGCFMIRLTDKFSNAQIVRTRDPQVLDGLDAVLDVGGVYDPSRDRYDHHQKGFEEVF-----GHGFSTKLSS
        L P  P +++ +  R+GTH+G+FHCDEAL C ++RL  ++ NA+IVRTRDP+ L   D V+DVGG Y+P R RYDHHQ+ F E       G  + TKLSS
Subjt:  LSPASPSDSI-SVKRVGTHHGSFHCDEALGCFMIRLTDKFSNAQIVRTRDPQVLDGLDAVLDVGGVYDPSRDRYDHHQKGFEEVF-----GHGFSTKLSS

Query:  AGLVYKHFGKEIIAKELQVDEGHPDVHRLFLAVYKSFMEAIDAVDNGINQYDTDKPPKYVNNTHLSSRVGRLNLDWIEPDQSSENENKAFEKAMALAGSE
        AGLVY HFG +++A+ L   E    V  ++  +Y++F+E +DAVDNGI+Q+  +  P+Y   T LS+RV RLN  W +PDQ +E     F +AM L   E
Subjt:  AGLVYKHFGKEIIAKELQVDEGHPDVHRLFLAVYKSFMEAIDAVDNGINQYDTDKPPKYVNNTHLSSRVGRLNLDWIEPDQSSENENKAFEKAMALAGSE

Query:  FLDSVRFHAKSWLPARSIVMGCLAARHEIDPSGEIMILTT-FCPWKLHLFELEQELKIENSIKYVLYEDDRSKHWRVQAVAVSPDRFESRRPLPAQWRGL
        FL  + F+  SWLPAR++V   LA R ++D SGEI+ L    CPWK HL+ LE EL    +I +V+Y  D++  WRVQ V   P  F+SR PLP  WRGL
Subjt:  FLDSVRFHAKSWLPARSIVMGCLAARHEIDPSGEIMILTT-FCPWKLHLFELEQELKIENSIKYVLYEDDRSKHWRVQAVAVSPDRFESRRPLPAQWRGL

Query:  RDEELSKESGIPGCVFVHMSGFIGGNQTYEGALTMAKHAL
        RDE L + SGIPGC+FVH SGFIGG+ T EGAL MA+  L
Subjt:  RDEELSKESGIPGCVFVHMSGFIGGNQTYEGALTMAKHAL

Q9HB07 MYG1 exonuclease2.6e-8849.85Show/hide
Query:  SPSDSISVKRVGTHHGSFHCDEALGCFMIRLTDKFSNAQIVRTRDPQVLDGLDAVLDVGGVYDPSRDRYDHHQKGFEEVF-----GHGFSTKLSSAGLVY
        S S  ++  R+GTH+G+FHCDEAL C ++RL  ++ +A+IVRTRDP+ L   D V+DVGG YDP R RYDHHQ+ F E       G  + TKLSSAGL+Y
Subjt:  SPSDSISVKRVGTHHGSFHCDEALGCFMIRLTDKFSNAQIVRTRDPQVLDGLDAVLDVGGVYDPSRDRYDHHQKGFEEVF-----GHGFSTKLSSAGLVY

Query:  KHFGKEIIAKELQVDEGHPDVHRLFLAVYKSFMEAIDAVDNGINQYDTDKPPKYVNNTHLSSRVGRLNLDWIEPDQSSENENKAFEKAMALAGSEFLDSV
         HFG +++A+ L   E    V  L+  +Y++F+E +DAVDNGI+Q+  +  P+Y   T LS+RV RLN  W  PDQ +E     F++AM L   EFL  +
Subjt:  KHFGKEIIAKELQVDEGHPDVHRLFLAVYKSFMEAIDAVDNGINQYDTDKPPKYVNNTHLSSRVGRLNLDWIEPDQSSENENKAFEKAMALAGSEFLDSV

Query:  RFHAKSWLPARSIVMGCLAARHEIDPSGEIMILTT-FCPWKLHLFELEQELKIENSIKYVLYEDDRSKHWRVQAVAVSPDRFESRRPLPAQWRGLRDEEL
         F+  SWLPAR++V   LA R ++DPSGEI+ L    CPWK HL+ LE  L    +I +V+Y  D++  WR+Q V   P  F+SR PLP  WRGLRDE L
Subjt:  RFHAKSWLPARSIVMGCLAARHEIDPSGEIMILTT-FCPWKLHLFELEQELKIENSIKYVLYEDDRSKHWRVQAVAVSPDRFESRRPLPAQWRGLRDEEL

Query:  SKESGIPGCVFVHMSGFIGGNQTYEGALTMAKHAL
         + SGIPGC+FVH SGF GG+ T EGAL+MA+  L
Subjt:  SKESGIPGCVFVHMSGFIGGNQTYEGALTMAKHAL

Q9JK81 MYG1 exonuclease5.8e-8850.31Show/hide
Query:  RVGTHHGSFHCDEALGCFMIRLTDKFSNAQIVRTRDPQVLDGLDAVLDVGGVYDPSRDRYDHHQKGFEEVF-----GHGFSTKLSSAGLVYKHFGKEIIA
        R+GTH+G+FHCDEAL C ++RL  +++NA+IVRTRDP+ L   D V+DVGG Y+P   RYDHHQ+ F E       G  + TKLSSAGLVY HFG++++A
Subjt:  RVGTHHGSFHCDEALGCFMIRLTDKFSNAQIVRTRDPQVLDGLDAVLDVGGVYDPSRDRYDHHQKGFEEVF-----GHGFSTKLSSAGLVYKHFGKEIIA

Query:  KELQVDEGHPDVHRLFLAVYKSFMEAIDAVDNGINQYDTDKPPKYVNNTHLSSRVGRLNLDWIEPDQSSENENKAFEKAMALAGSEFLDSVRFHAKSWLP
        + L   E    V  ++  +Y++F+E +DAVDNGI+Q+  +  P+Y   T LS+RV RLN  W +P+Q +E     F +AM L   EFL  + F+  SWLP
Subjt:  KELQVDEGHPDVHRLFLAVYKSFMEAIDAVDNGINQYDTDKPPKYVNNTHLSSRVGRLNLDWIEPDQSSENENKAFEKAMALAGSEFLDSVRFHAKSWLP

Query:  ARSIVMGCLAARHEIDPSGEIMILTT-FCPWKLHLFELEQELKIENSIKYVLYEDDRSKHWRVQAVAVSPDRFESRRPLPAQWRGLRDEELSKESGIPGC
        AR++V   LA R ++D SGEI+ L    CPWK HL+ LE EL  + +I +V+Y  D++  WRVQ V   P  F+SR PLP  WRGLRD+ L + SGIPGC
Subjt:  ARSIVMGCLAARHEIDPSGEIMILTT-FCPWKLHLFELEQELKIENSIKYVLYEDDRSKHWRVQAVAVSPDRFESRRPLPAQWRGLRDEELSKESGIPGC

Query:  VFVHMSGFIGGNQTYEGALTMAKHAL
        +FVH SGFIGG+ T EGAL MA+  L
Subjt:  VFVHMSGFIGGNQTYEGALTMAKHAL

Arabidopsis top hitse value%identityAlignment
AT3G49320.1 Metal-dependent protein hydrolase3.1e-14573.23Show/hide
Query:  SISVKRVGTHHGSFHCDEALGCFMIRLTDKFSNAQIVRTRDPQVLDGLDAVLDVGGVYDPSRDRYDHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKEIIA
        S S KRVGTH+G+FHCDEAL CF++R +++FS+AQIVRTRD QVL+ LDA LDVGGVYDP  +RYDHHQKGF EVFG GF+TKLSSAGLVYKH+G EII+
Subjt:  SISVKRVGTHHGSFHCDEALGCFMIRLTDKFSNAQIVRTRDPQVLDGLDAVLDVGGVYDPSRDRYDHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKEIIA

Query:  KELQVDEGHPDVHRLFLAVYKSFMEAIDAVDNGINQYDTDKPPKYVNNTHLSSRVGRLNLDWIEPDQSSENENKAFEKAMALAGSEFLDSVRFHAKSWLP
        KELQ+++ HPDV RLFLAVYK+F+EA+DA+DNGI+QYDTD+PP+YVNNT L  R+GRLNLDWIEPDQSS  E++AF +AM LAGSEFL+ V FHAKSWLP
Subjt:  KELQVDEGHPDVHRLFLAVYKSFMEAIDAVDNGINQYDTDKPPKYVNNTHLSSRVGRLNLDWIEPDQSSENENKAFEKAMALAGSEFLDSVRFHAKSWLP

Query:  ARSIVMGCLAARHEIDPSGEIMILTTFCPWKLHLFELEQELKIENSIKYVLYEDDRSKHWRVQAVAVSPDRFESRRPLPAQWRGLRDEELSKESGIPGCV
        ARSIVM CLA R++ID SGEIM L+  CPWKLH+FELE+E+KI+  IKYVLY+DDRS++WR+QAV+VSP+RFESR+ LP  WRGL  E+LS+ES IP CV
Subjt:  ARSIVMGCLAARHEIDPSGEIMILTTFCPWKLHLFELEQELKIENSIKYVLYEDDRSKHWRVQAVAVSPDRFESRRPLPAQWRGLRDEELSKESGIPGCV

Query:  FVHMSGFIGGNQTYEGALTMAKHAL
        FVHMSGFIG NQTYEGAL MA+ +L
Subjt:  FVHMSGFIGGNQTYEGALTMAKHAL

AT5G41960.1 unknown protein2.0e-3043.3Show/hide
Query:  MITLASAYLSSSPSNLSSLKNLRLFKPSSTFSPSLSNLKPLNPFLKPPSNQSRIGNGICRAELGNDAPFAIAIGACILSSLVFPAADGASDDESD---AV
        M +L+   +    S+ S     RL   SS+   S S L    P   P     +I   ICRAE   DAP   AIGACILSS VFP A   +D+E +   + 
Subjt:  MITLASAYLSSSPSNLSSLKNLRLFKPSSTFSPSLSNLKPLNPFLKPPSNQSRIGNGICRAELGNDAPFAIAIGACILSSLVFPAADGASDDESD---AV

Query:  IDSTDTRLAVMSIISFIPYFNWLFNPHSAIVKVEELLFYEVDL----YFGVGVGGIVVAWRSNLSLSPEESWLPIVSILLCIIHIQLEVSITNGDIQPLQ
        I STD RLA M IISFIPYFNWL             +F  +D     Y    +  +V    SNLS+SPEESWLPI SI+L IIH+QLE SI NGD++ L 
Subjt:  IDSTDTRLAVMSIISFIPYFNWLFNPHSAIVKVEELLFYEVDL----YFGVGVGGIVVAWRSNLSLSPEESWLPIVSILLCIIHIQLEVSITNGDIQPLQ

Query:  LFGKASKQISSTKKG---RDHFKG
         F   S    S+KK    + HFKG
Subjt:  LFGKASKQISSTKKG---RDHFKG

AT5G41970.1 Metal-dependent protein hydrolase1.8e-16179.24Show/hide
Query:  ATSPLASLSPASPSDSISVKRVGTHHGSFHCDEALGCFMIRLTDKFSNAQIVRTRDPQVLDGLDAVLDVGGVYDPSRDRYDHHQKGFEEVFGHGFSTKLS
        ATSP       SPS+ ISVK+VGTH+GSFHCDEALGCFMIRL DKFS A IVR+RDP++L  LDAVLDVGGVYDP  DRYDHHQKGFEEVFGHGF+TKLS
Subjt:  ATSPLASLSPASPSDSISVKRVGTHHGSFHCDEALGCFMIRLTDKFSNAQIVRTRDPQVLDGLDAVLDVGGVYDPSRDRYDHHQKGFEEVFGHGFSTKLS

Query:  SAGLVYKHFGKEIIAKELQVDEGHPDVHRLFLAVYKSFMEAIDAVDNGINQYDTDKPPKYVNNTHLSSRVGRLNLDWIEPDQSSENENKAFEKAMALAGS
        SAGLVYKHFGKEIIAKEL V++ HPDV RLFLAVYKSFMEAIDAVDNGIN+YDTD+PP+YVNNTHLS RVGRLNLDWI+PDQS E EN+AF++AMALAG 
Subjt:  SAGLVYKHFGKEIIAKELQVDEGHPDVHRLFLAVYKSFMEAIDAVDNGINQYDTDKPPKYVNNTHLSSRVGRLNLDWIEPDQSSENENKAFEKAMALAGS

Query:  EFLDSVRFHAKSWLPARSIVMGCLAARHEIDPSGEIMILTTFCPWKLHLFELEQELKIENSIKYVLYEDDRSKHWRVQAVAVSPDRFESRRPLPAQWRGL
        EFL+SV+FHA+SWLPARSIVM CL  R + DPSGEIMIL  FCPWKLHLFELEQE+KIE  IKYV+Y+D+R+K WRVQAVAV+PDRFE+R+PLP +WRGL
Subjt:  EFLDSVRFHAKSWLPARSIVMGCLAARHEIDPSGEIMILTTFCPWKLHLFELEQELKIENSIKYVLYEDDRSKHWRVQAVAVSPDRFESRRPLPAQWRGL

Query:  RDEELSKESGIPGCVFVHMSGFIGGNQTYEGALTMAKHALKL
        RDEELSK + IPGCVFVHMSGFIGGNQ+Y+GAL+MA+ AL L
Subjt:  RDEELSKESGIPGCVFVHMSGFIGGNQTYEGALTMAKHALKL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATCACTCTCGCTTCTGCTTATTTATCATCTTCCCCTTCCAATCTCTCTTCTCTCAAGAATCTTCGTCTCTTCAAACCCTCTTCCACATTCTCACCATCACTCTCTAA
TCTCAAACCCTTAAATCCTTTCCTCAAACCACCTTCCAATCAGAGCAGGATCGGGAATGGGATTTGTAGGGCCGAATTGGGTAACGACGCACCCTTCGCTATTGCGATCG
GTGCTTGTATTCTCAGTTCTCTTGTTTTTCCGGCTGCCGACGGTGCTTCCGATGATGAGAGCGATGCCGTCATTGATTCCACCGATACCAGGCTCGCCGTTATGAGCATC
ATTAGCTTTATCCCCTACTTCAATTGGCTGTTCAATCCTCACTCTGCAATTGTCAAAGTCGAAGAACTTTTGTTTTATGAAGTTGATTTGTATTTTGGGGTGGGGGTGGG
TGGCATTGTGGTTGCTTGGAGGTCAAACTTATCGTTGTCGCCCGAAGAGAGCTGGCTTCCTATTGTTAGTATACTTCTCTGCATTATTCACATTCAGCTTGAAGTGAGCA
TTACAAATGGAGATATTCAACCTCTCCAACTATTTGGGAAAGCTTCAAAGCAAATTTCTTCAACCAAGAAAGGGAGAGACCATTTCAAGGGGTCCCAAGGACCATACAAA
GAGGACAGGGAGCTACCGTCGTCGGAGGAACAATTTCGAGATAAGATCAGAAGATGGGGAGATTCTAAAGAGACAGGTTTAGGGTTTAACCACAAGCAATTCCTCTCCTT
CCCTAACTTTTTCTTTCTACGCACTTTCATGGCTACTTCTCCCCTCGCTTCCCTTTCCCCTGCTTCTCCTTCTGATTCCATTTCCGTTAAGCGAGTCGGGACTCACCATG
GGAGCTTCCATTGCGATGAAGCGCTTGGTTGCTTCATGATTCGCTTGACGGATAAGTTCTCTAATGCTCAAATTGTTCGAACCCGTGATCCCCAGGTACTAGATGGTCTT
GATGCAGTCCTTGATGTTGGGGGCGTATATGATCCAAGTCGTGATCGATATGATCATCATCAAAAGGGGTTTGAAGAGGTTTTTGGCCATGGTTTCTCCACTAAGCTTAG
CAGCGCTGGTCTTGTTTATAAGCATTTTGGGAAGGAGATTATTGCGAAGGAACTTCAAGTTGATGAAGGGCATCCAGATGTGCACAGGCTGTTTTTGGCTGTTTACAAAA
GTTTCATGGAGGCAATTGATGCTGTAGATAACGGCATTAATCAGTATGATACGGATAAGCCACCAAAATATGTGAATAACACACACCTATCTTCGAGGGTGGGGAGATTG
AATCTGGACTGGATAGAACCTGATCAATCATCCGAGAATGAGAATAAGGCCTTCGAGAAAGCAATGGCCTTGGCTGGCAGCGAGTTCTTAGATAGTGTTAGATTTCATGC
AAAGTCATGGCTACCAGCAAGGTCAATTGTGATGGGATGTCTTGCAGCAAGACATGAGATTGACCCTAGTGGAGAAATAATGATTTTGACAACATTTTGCCCTTGGAAAC
TTCATCTATTTGAGCTCGAACAGGAGTTGAAGATTGAAAATTCGATCAAATATGTGCTCTATGAAGATGATAGAAGCAAACATTGGCGAGTGCAGGCAGTGGCAGTATCT
CCGGACAGATTTGAGAGTCGTAGGCCTCTGCCTGCCCAATGGCGAGGTTTAAGGGATGAGGAACTCTCAAAAGAGTCTGGGATACCTGGTTGTGTGTTTGTTCATATGAG
TGGCTTTATTGGTGGAAATCAAACTTACGAAGGGGCTCTTACTATGGCGAAACATGCATTGAAGCTGTAG
mRNA sequenceShow/hide mRNA sequence
AGGAATTTAATTTAGATAAAAAAAGAAGAAAAGCCAGACAGCAATGCCGAAAGAGGATATTGGCACAGTTGCCAAGAACAGTTCCAATGGCGATCTTCGTTTTCCAGTGA
TGAATTTTCGTTCAAACAATTTCTGAATTCTCTCTTTCTTCCCATTCTCTTCTCACTGATTCAATTAGCAATTTGAATTGAACGCCAATGATCACTCTCGCTTCTGCTTA
TTTATCATCTTCCCCTTCCAATCTCTCTTCTCTCAAGAATCTTCGTCTCTTCAAACCCTCTTCCACATTCTCACCATCACTCTCTAATCTCAAACCCTTAAATCCTTTCC
TCAAACCACCTTCCAATCAGAGCAGGATCGGGAATGGGATTTGTAGGGCCGAATTGGGTAACGACGCACCCTTCGCTATTGCGATCGGTGCTTGTATTCTCAGTTCTCTT
GTTTTTCCGGCTGCCGACGGTGCTTCCGATGATGAGAGCGATGCCGTCATTGATTCCACCGATACCAGGCTCGCCGTTATGAGCATCATTAGCTTTATCCCCTACTTCAA
TTGGCTGTTCAATCCTCACTCTGCAATTGTCAAAGTCGAAGAACTTTTGTTTTATGAAGTTGATTTGTATTTTGGGGTGGGGGTGGGTGGCATTGTGGTTGCTTGGAGGT
CAAACTTATCGTTGTCGCCCGAAGAGAGCTGGCTTCCTATTGTTAGTATACTTCTCTGCATTATTCACATTCAGCTTGAAGTGAGCATTACAAATGGAGATATTCAACCT
CTCCAACTATTTGGGAAAGCTTCAAAGCAAATTTCTTCAACCAAGAAAGGGAGAGACCATTTCAAGGGGTCCCAAGGACCATACAAAGAGGACAGGGAGCTACCGTCGTC
GGAGGAACAATTTCGAGATAAGATCAGAAGATGGGGAGATTCTAAAGAGACAGGTTTAGGGTTTAACCACAAGCAATTCCTCTCCTTCCCTAACTTTTTCTTTCTACGCA
CTTTCATGGCTACTTCTCCCCTCGCTTCCCTTTCCCCTGCTTCTCCTTCTGATTCCATTTCCGTTAAGCGAGTCGGGACTCACCATGGGAGCTTCCATTGCGATGAAGCG
CTTGGTTGCTTCATGATTCGCTTGACGGATAAGTTCTCTAATGCTCAAATTGTTCGAACCCGTGATCCCCAGGTACTAGATGGTCTTGATGCAGTCCTTGATGTTGGGGG
CGTATATGATCCAAGTCGTGATCGATATGATCATCATCAAAAGGGGTTTGAAGAGGTTTTTGGCCATGGTTTCTCCACTAAGCTTAGCAGCGCTGGTCTTGTTTATAAGC
ATTTTGGGAAGGAGATTATTGCGAAGGAACTTCAAGTTGATGAAGGGCATCCAGATGTGCACAGGCTGTTTTTGGCTGTTTACAAAAGTTTCATGGAGGCAATTGATGCT
GTAGATAACGGCATTAATCAGTATGATACGGATAAGCCACCAAAATATGTGAATAACACACACCTATCTTCGAGGGTGGGGAGATTGAATCTGGACTGGATAGAACCTGA
TCAATCATCCGAGAATGAGAATAAGGCCTTCGAGAAAGCAATGGCCTTGGCTGGCAGCGAGTTCTTAGATAGTGTTAGATTTCATGCAAAGTCATGGCTACCAGCAAGGT
CAATTGTGATGGGATGTCTTGCAGCAAGACATGAGATTGACCCTAGTGGAGAAATAATGATTTTGACAACATTTTGCCCTTGGAAACTTCATCTATTTGAGCTCGAACAG
GAGTTGAAGATTGAAAATTCGATCAAATATGTGCTCTATGAAGATGATAGAAGCAAACATTGGCGAGTGCAGGCAGTGGCAGTATCTCCGGACAGATTTGAGAGTCGTAG
GCCTCTGCCTGCCCAATGGCGAGGTTTAAGGGATGAGGAACTCTCAAAAGAGTCTGGGATACCTGGTTGTGTGTTTGTTCATATGAGTGGCTTTATTGGTGGAAATCAAA
CTTACGAAGGGGCTCTTACTATGGCGAAACATGCATTGAAGCTGTAGAAACCGAGATCTATTAATATATTTATGTAGCTCGTAGCATCGGTTCGGAGAATACTGTGTTTA
TTCGAGTTCAAGGCTTACTGTTTTTTATGACTTTGTGAGTCCTCCCTCATTCTTCCAATTTGAAAACATTTGCTGTAGAAACAATGTTATTCTTTGGGAAGGGAAGTCAT
TAGATATCTTGGATTTTGATTTTGTTATTTGGAAATGGAAAAATCTGTAATACCATACAGAACAAGATAATCAACTTTTCACAGGAAGATCTTGCTTTATCAATGTGTCT
AA
Protein sequenceShow/hide protein sequence
MITLASAYLSSSPSNLSSLKNLRLFKPSSTFSPSLSNLKPLNPFLKPPSNQSRIGNGICRAELGNDAPFAIAIGACILSSLVFPAADGASDDESDAVIDSTDTRLAVMSI
ISFIPYFNWLFNPHSAIVKVEELLFYEVDLYFGVGVGGIVVAWRSNLSLSPEESWLPIVSILLCIIHIQLEVSITNGDIQPLQLFGKASKQISSTKKGRDHFKGSQGPYK
EDRELPSSEEQFRDKIRRWGDSKETGLGFNHKQFLSFPNFFFLRTFMATSPLASLSPASPSDSISVKRVGTHHGSFHCDEALGCFMIRLTDKFSNAQIVRTRDPQVLDGL
DAVLDVGGVYDPSRDRYDHHQKGFEEVFGHGFSTKLSSAGLVYKHFGKEIIAKELQVDEGHPDVHRLFLAVYKSFMEAIDAVDNGINQYDTDKPPKYVNNTHLSSRVGRL
NLDWIEPDQSSENENKAFEKAMALAGSEFLDSVRFHAKSWLPARSIVMGCLAARHEIDPSGEIMILTTFCPWKLHLFELEQELKIENSIKYVLYEDDRSKHWRVQAVAVS
PDRFESRRPLPAQWRGLRDEELSKESGIPGCVFVHMSGFIGGNQTYEGALTMAKHALKL