; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MC01g0435 (gene) of Bitter gourd (Dali-11) v1 genome

Gene IDMC01g0435
OrganismMomordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
Descriptionglucose-induced degradation protein 4 homolog
Genome locationMC01:10992342..10996625
RNA-Seq ExpressionMC01g0435
SyntenyMC01g0435
Gene Ontology termsGO:0006623 - protein targeting to vacuole (biological process)
GO:0007039 - protein catabolic process in the vacuole (biological process)
GO:0043161 - proteasome-mediated ubiquitin-dependent protein catabolic process (biological process)
GO:0045721 - negative regulation of gluconeogenesis (biological process)
GO:0005773 - vacuole (cellular component)
GO:0019898 - extrinsic component of membrane (cellular component)
GO:0034657 - GID complex (cellular component)
InterPro domainsIPR018618 - Vacuolar import/degradation protein Vid24


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004142059.1 glucose-induced degradation protein 4 homolog [Cucumis sativus]3.02e-15094.84Show/hide
Query:  MPVRVESSAPSPISSADARQTSPQACTLLSVGQAFSGTQNVSNNQKDEAWRVNVRIQGLDLEHGYLCGTMEALNVPMADTPVVTFWEGEIVDGKNYNFFT
        MPVRVESS PS IS AD +QT+PQACTLLSVGQAFSGTQNVSNNQKDEAWRVNVRIQGLDLEHGYLCGTMEALNVPMADTPVVTFWEGEIVDGKNYNFFT
Subjt:  MPVRVESSAPSPISSADARQTSPQACTLLSVGQAFSGTQNVSNNQKDEAWRVNVRIQGLDLEHGYLCGTMEALNVPMADTPVVTFWEGEIVDGKNYNFFT

Query:  GKWQAAPEDDIRHWTKFPSFSPLMSQVEVDGGKSLDLSNYPCIFMRWKEQYFVNVGTDCGLTIAGFYYVCFSCSDGSISGFYYDPNSSPFQKLELKSTNE
        GKWQAAPE+DIRHWTKFPSF+PLM+QVEVDGGKSLDLSNYPCIFMRWKEQYFVNVGTDCGLTIAGFYYVCFSCSDGSI+GFYYDPNSSPFQKLELKSTNE
Subjt:  GKWQAAPEDDIRHWTKFPSFSPLMSQVEVDGGKSLDLSNYPCIFMRWKEQYFVNVGTDCGLTIAGFYYVCFSCSDGSISGFYYDPNSSPFQKLELKSTNE

Query:  GRSGFSFSSYELR
        GRSGFSFSSYEL+
Subjt:  GRSGFSFSSYELR

XP_008448124.1 PREDICTED: glucose-induced degradation protein 4 homolog isoform X1 [Cucumis melo]1.50e-15094.84Show/hide
Query:  MPVRVESSAPSPISSADARQTSPQACTLLSVGQAFSGTQNVSNNQKDEAWRVNVRIQGLDLEHGYLCGTMEALNVPMADTPVVTFWEGEIVDGKNYNFFT
        MPVRVESS PS IS AD +QT+PQACTLLSVGQAFSGTQNVSNNQKDEAWRVNVRIQGLDLEHGYLCGTMEALNVPMADTPVVTFWEGEIVDGKNYNFFT
Subjt:  MPVRVESSAPSPISSADARQTSPQACTLLSVGQAFSGTQNVSNNQKDEAWRVNVRIQGLDLEHGYLCGTMEALNVPMADTPVVTFWEGEIVDGKNYNFFT

Query:  GKWQAAPEDDIRHWTKFPSFSPLMSQVEVDGGKSLDLSNYPCIFMRWKEQYFVNVGTDCGLTIAGFYYVCFSCSDGSISGFYYDPNSSPFQKLELKSTNE
        GKWQAAPE+DIRHWTKFPSF+PLM+QVEVDGGKSLDLSNYPCIFMRWKEQYFVNVGTDCGLTIAGFYYVCFSCSDGSI+GFYYDPNSSPFQKLELKSTNE
Subjt:  GKWQAAPEDDIRHWTKFPSFSPLMSQVEVDGGKSLDLSNYPCIFMRWKEQYFVNVGTDCGLTIAGFYYVCFSCSDGSISGFYYDPNSSPFQKLELKSTNE

Query:  GRSGFSFSSYELR
        GRSGFSFSSYEL+
Subjt:  GRSGFSFSSYELR

XP_022143888.1 glucose-induced degradation protein 4 homolog isoform X2 [Momordica charantia]2.92e-157100Show/hide
Query:  MPVRVESSAPSPISSADARQTSPQACTLLSVGQAFSGTQNVSNNQKDEAWRVNVRIQGLDLEHGYLCGTMEALNVPMADTPVVTFWEGEIVDGKNYNFFT
        MPVRVESSAPSPISSADARQTSPQACTLLSVGQAFSGTQNVSNNQKDEAWRVNVRIQGLDLEHGYLCGTMEALNVPMADTPVVTFWEGEIVDGKNYNFFT
Subjt:  MPVRVESSAPSPISSADARQTSPQACTLLSVGQAFSGTQNVSNNQKDEAWRVNVRIQGLDLEHGYLCGTMEALNVPMADTPVVTFWEGEIVDGKNYNFFT

Query:  GKWQAAPEDDIRHWTKFPSFSPLMSQVEVDGGKSLDLSNYPCIFMRWKEQYFVNVGTDCGLTIAGFYYVCFSCSDGSISGFYYDPNSSPFQKLELKSTNE
        GKWQAAPEDDIRHWTKFPSFSPLMSQVEVDGGKSLDLSNYPCIFMRWKEQYFVNVGTDCGLTIAGFYYVCFSCSDGSISGFYYDPNSSPFQKLELKSTNE
Subjt:  GKWQAAPEDDIRHWTKFPSFSPLMSQVEVDGGKSLDLSNYPCIFMRWKEQYFVNVGTDCGLTIAGFYYVCFSCSDGSISGFYYDPNSSPFQKLELKSTNE

Query:  GRSGFSFSSYELR
        GRSGFSFSSYELR
Subjt:  GRSGFSFSSYELR

XP_023517177.1 glucose-induced degradation protein 4 homolog [Cucurbita pepo subsp. pepo]1.28e-15195.77Show/hide
Query:  MPVRVESSAPSPISSADARQTSPQACTLLSVGQAFSGTQNVSNNQKDEAWRVNVRIQGLDLEHGYLCGTMEALNVPMADTPVVTFWEGEIVDGKNYNFFT
        MPVRVESSAPS IS AD+RQ++PQACTLLSVGQAFSGTQNVSNNQKDEAWRVNVRIQGLDL+HGYLCGTMEALNVPMADTPVVTFWEGEIVDGKNYNFFT
Subjt:  MPVRVESSAPSPISSADARQTSPQACTLLSVGQAFSGTQNVSNNQKDEAWRVNVRIQGLDLEHGYLCGTMEALNVPMADTPVVTFWEGEIVDGKNYNFFT

Query:  GKWQAAPEDDIRHWTKFPSFSPLMSQVEVDGGKSLDLSNYPCIFMRWKEQYFVNVGTDCGLTIAGFYYVCFSCSDGSISGFYYDPNSSPFQKLELKSTNE
        GKWQAAPEDDIRHWTKFPSF PLM+QVEVDGGKSLDLSNYPCIFMRWKEQYFVNVGTDCGLTIAGFYYVCFSCSDGSISGFYYDPNSSPFQKLELKSTNE
Subjt:  GKWQAAPEDDIRHWTKFPSFSPLMSQVEVDGGKSLDLSNYPCIFMRWKEQYFVNVGTDCGLTIAGFYYVCFSCSDGSISGFYYDPNSSPFQKLELKSTNE

Query:  GRSGFSFSSYELR
        GRSGFSFSSYEL+
Subjt:  GRSGFSFSSYELR

XP_038880933.1 glucose-induced degradation protein 4 homolog [Benincasa hispida]1.50e-15095.31Show/hide
Query:  MPVRVESSAPSPISSADARQTSPQACTLLSVGQAFSGTQNVSNNQKDEAWRVNVRIQGLDLEHGYLCGTMEALNVPMADTPVVTFWEGEIVDGKNYNFFT
        MPVRVESS PS IS AD RQT+PQACTLLSVGQAFSGTQNVSNNQKDEAWRVNVRIQGLDLEHGYLCGTMEALNVPMADTPVVTFWEGEIVDGKNYNFFT
Subjt:  MPVRVESSAPSPISSADARQTSPQACTLLSVGQAFSGTQNVSNNQKDEAWRVNVRIQGLDLEHGYLCGTMEALNVPMADTPVVTFWEGEIVDGKNYNFFT

Query:  GKWQAAPEDDIRHWTKFPSFSPLMSQVEVDGGKSLDLSNYPCIFMRWKEQYFVNVGTDCGLTIAGFYYVCFSCSDGSISGFYYDPNSSPFQKLELKSTNE
        GKWQAAPEDDIRHWTKFPSF+PLM+QVEVDGGKSLDLSNYPCIFMRWKEQYFVNVGTDCGLTIAGFYYVCFSCSDGSI+GFYYDPNSSPFQKLELKSTNE
Subjt:  GKWQAAPEDDIRHWTKFPSFSPLMSQVEVDGGKSLDLSNYPCIFMRWKEQYFVNVGTDCGLTIAGFYYVCFSCSDGSISGFYYDPNSSPFQKLELKSTNE

Query:  GRSGFSFSSYELR
        GRSGFSFSSY L+
Subjt:  GRSGFSFSSYELR

TrEMBL top hitse value%identityAlignment
A0A0A0KXJ4 Uncharacterized protein1.46e-15094.84Show/hide
Query:  MPVRVESSAPSPISSADARQTSPQACTLLSVGQAFSGTQNVSNNQKDEAWRVNVRIQGLDLEHGYLCGTMEALNVPMADTPVVTFWEGEIVDGKNYNFFT
        MPVRVESS PS IS AD +QT+PQACTLLSVGQAFSGTQNVSNNQKDEAWRVNVRIQGLDLEHGYLCGTMEALNVPMADTPVVTFWEGEIVDGKNYNFFT
Subjt:  MPVRVESSAPSPISSADARQTSPQACTLLSVGQAFSGTQNVSNNQKDEAWRVNVRIQGLDLEHGYLCGTMEALNVPMADTPVVTFWEGEIVDGKNYNFFT

Query:  GKWQAAPEDDIRHWTKFPSFSPLMSQVEVDGGKSLDLSNYPCIFMRWKEQYFVNVGTDCGLTIAGFYYVCFSCSDGSISGFYYDPNSSPFQKLELKSTNE
        GKWQAAPE+DIRHWTKFPSF+PLM+QVEVDGGKSLDLSNYPCIFMRWKEQYFVNVGTDCGLTIAGFYYVCFSCSDGSI+GFYYDPNSSPFQKLELKSTNE
Subjt:  GKWQAAPEDDIRHWTKFPSFSPLMSQVEVDGGKSLDLSNYPCIFMRWKEQYFVNVGTDCGLTIAGFYYVCFSCSDGSISGFYYDPNSSPFQKLELKSTNE

Query:  GRSGFSFSSYELR
        GRSGFSFSSYEL+
Subjt:  GRSGFSFSSYELR

A0A1S3BJV3 glucose-induced degradation protein 4 homolog isoform X17.25e-15194.84Show/hide
Query:  MPVRVESSAPSPISSADARQTSPQACTLLSVGQAFSGTQNVSNNQKDEAWRVNVRIQGLDLEHGYLCGTMEALNVPMADTPVVTFWEGEIVDGKNYNFFT
        MPVRVESS PS IS AD +QT+PQACTLLSVGQAFSGTQNVSNNQKDEAWRVNVRIQGLDLEHGYLCGTMEALNVPMADTPVVTFWEGEIVDGKNYNFFT
Subjt:  MPVRVESSAPSPISSADARQTSPQACTLLSVGQAFSGTQNVSNNQKDEAWRVNVRIQGLDLEHGYLCGTMEALNVPMADTPVVTFWEGEIVDGKNYNFFT

Query:  GKWQAAPEDDIRHWTKFPSFSPLMSQVEVDGGKSLDLSNYPCIFMRWKEQYFVNVGTDCGLTIAGFYYVCFSCSDGSISGFYYDPNSSPFQKLELKSTNE
        GKWQAAPE+DIRHWTKFPSF+PLM+QVEVDGGKSLDLSNYPCIFMRWKEQYFVNVGTDCGLTIAGFYYVCFSCSDGSI+GFYYDPNSSPFQKLELKSTNE
Subjt:  GKWQAAPEDDIRHWTKFPSFSPLMSQVEVDGGKSLDLSNYPCIFMRWKEQYFVNVGTDCGLTIAGFYYVCFSCSDGSISGFYYDPNSSPFQKLELKSTNE

Query:  GRSGFSFSSYELR
        GRSGFSFSSYEL+
Subjt:  GRSGFSFSSYELR

A0A5A7SQ56 Glucose-induced degradation protein 4-like protein isoform X17.25e-15194.84Show/hide
Query:  MPVRVESSAPSPISSADARQTSPQACTLLSVGQAFSGTQNVSNNQKDEAWRVNVRIQGLDLEHGYLCGTMEALNVPMADTPVVTFWEGEIVDGKNYNFFT
        MPVRVESS PS IS AD +QT+PQACTLLSVGQAFSGTQNVSNNQKDEAWRVNVRIQGLDLEHGYLCGTMEALNVPMADTPVVTFWEGEIVDGKNYNFFT
Subjt:  MPVRVESSAPSPISSADARQTSPQACTLLSVGQAFSGTQNVSNNQKDEAWRVNVRIQGLDLEHGYLCGTMEALNVPMADTPVVTFWEGEIVDGKNYNFFT

Query:  GKWQAAPEDDIRHWTKFPSFSPLMSQVEVDGGKSLDLSNYPCIFMRWKEQYFVNVGTDCGLTIAGFYYVCFSCSDGSISGFYYDPNSSPFQKLELKSTNE
        GKWQAAPE+DIRHWTKFPSF+PLM+QVEVDGGKSLDLSNYPCIFMRWKEQYFVNVGTDCGLTIAGFYYVCFSCSDGSI+GFYYDPNSSPFQKLELKSTNE
Subjt:  GKWQAAPEDDIRHWTKFPSFSPLMSQVEVDGGKSLDLSNYPCIFMRWKEQYFVNVGTDCGLTIAGFYYVCFSCSDGSISGFYYDPNSSPFQKLELKSTNE

Query:  GRSGFSFSSYELR
        GRSGFSFSSYEL+
Subjt:  GRSGFSFSSYELR

A0A6J1CRU3 glucose-induced degradation protein 4 homolog isoform X21.41e-157100Show/hide
Query:  MPVRVESSAPSPISSADARQTSPQACTLLSVGQAFSGTQNVSNNQKDEAWRVNVRIQGLDLEHGYLCGTMEALNVPMADTPVVTFWEGEIVDGKNYNFFT
        MPVRVESSAPSPISSADARQTSPQACTLLSVGQAFSGTQNVSNNQKDEAWRVNVRIQGLDLEHGYLCGTMEALNVPMADTPVVTFWEGEIVDGKNYNFFT
Subjt:  MPVRVESSAPSPISSADARQTSPQACTLLSVGQAFSGTQNVSNNQKDEAWRVNVRIQGLDLEHGYLCGTMEALNVPMADTPVVTFWEGEIVDGKNYNFFT

Query:  GKWQAAPEDDIRHWTKFPSFSPLMSQVEVDGGKSLDLSNYPCIFMRWKEQYFVNVGTDCGLTIAGFYYVCFSCSDGSISGFYYDPNSSPFQKLELKSTNE
        GKWQAAPEDDIRHWTKFPSFSPLMSQVEVDGGKSLDLSNYPCIFMRWKEQYFVNVGTDCGLTIAGFYYVCFSCSDGSISGFYYDPNSSPFQKLELKSTNE
Subjt:  GKWQAAPEDDIRHWTKFPSFSPLMSQVEVDGGKSLDLSNYPCIFMRWKEQYFVNVGTDCGLTIAGFYYVCFSCSDGSISGFYYDPNSSPFQKLELKSTNE

Query:  GRSGFSFSSYELR
        GRSGFSFSSYELR
Subjt:  GRSGFSFSSYELR

A0A6J1HUE2 glucose-induced degradation protein 4 homolog2.95e-15094.84Show/hide
Query:  MPVRVESSAPSPISSADARQTSPQACTLLSVGQAFSGTQNVSNNQKDEAWRVNVRIQGLDLEHGYLCGTMEALNVPMADTPVVTFWEGEIVDGKNYNFFT
        MPVRVESSAP+ IS AD+ Q++PQACTLLSVGQAFSGTQNVSNNQKDEAWRVNVRIQGLDL+HGYLCGTMEALNVPMADTPVVTFWEGEIVDGKNYNFFT
Subjt:  MPVRVESSAPSPISSADARQTSPQACTLLSVGQAFSGTQNVSNNQKDEAWRVNVRIQGLDLEHGYLCGTMEALNVPMADTPVVTFWEGEIVDGKNYNFFT

Query:  GKWQAAPEDDIRHWTKFPSFSPLMSQVEVDGGKSLDLSNYPCIFMRWKEQYFVNVGTDCGLTIAGFYYVCFSCSDGSISGFYYDPNSSPFQKLELKSTNE
        GKWQAAPEDDIRHWTKFPSF PLM+QVEVDGGKSLDLSNYPCIFMRWKEQYFVNVGTDCGLTIAGFYYVCFSCSDGSISGFYYDPNSSPFQKLELKSTNE
Subjt:  GKWQAAPEDDIRHWTKFPSFSPLMSQVEVDGGKSLDLSNYPCIFMRWKEQYFVNVGTDCGLTIAGFYYVCFSCSDGSISGFYYDPNSSPFQKLELKSTNE

Query:  GRSGFSFSSYELR
        GRSGFSFSSYEL+
Subjt:  GRSGFSFSSYELR

SwissProt top hitse value%identityAlignment
P53242 Uncharacterized protein YGR066C2.4e-1229.95Show/hide
Query:  LSVGQAFSGTQNVSNNQKDEAWRVNVRIQGLDLE--------HGYLCGTMEALNVPMADTPVVTFWEGEIVDGKNYNFF------------TGKWQAAPE
        L  G  F G+Q    + K   + V V+I  ++L           ++ GT    N+      VVT +EG ++   NYN F               + A  E
Subjt:  LSVGQAFSGTQNVSNNQKDEAWRVNVRIQGLDLE--------HGYLCGTMEALNVPMADTPVVTFWEGEIVDGKNYNFF------------TGKWQAAPE

Query:  DDIRHWTKFPSF-SPLMSQVEVDGG----KSLDLSNYPCIFMRWKEQYFV------NVGTDC-----GLTIAGFYYVCFSCSDGSISGFYYDPNSSPFQK
         D  HW +F  F S   S  E + G    +S +  N   I+++WKE++ +      N+  D      G +  GFYYVC     GS+ G+YY P    FQK
Subjt:  DDIRHWTKFPSF-SPLMSQVEVDGG----KSLDLSNYPCIFMRWKEQYFV------NVGTDC-----GLTIAGFYYVCFSCSDGSISGFYYDPNSSPFQK

Query:  LELKSTN
        LEL  TN
Subjt:  LELKSTN

Q10079 Uncharacterized protein C3H1.144.8e-1328.8Show/hide
Query:  QACTLLSVGQAFSGTQNVSNNQKDEAWRVNVRIQGLDLEHGYLCGTMEALNVPMADTPVVTFWEGEIVDGKNYNFFT--GKWQAAPEDDIRHWTKFPSFS
        + C+ L  G  F G Q   ++   E   V+V I  ++L    LCG +        +T + T++E EI+ G  + F T   +W A+ E D RHW +  +  
Subjt:  QACTLLSVGQAFSGTQNVSNNQKDEAWRVNVRIQGLDLEHGYLCGTMEALNVPMADTPVVTFWEGEIVDGKNYNFFT--GKWQAAPEDDIRHWTKFPSFS

Query:  PLMSQVEVDGGKSLDLSNYPCIFMRWKEQYFVNVGTD---------CGLTIAGFYYVCFSCSDGSISGFYYDPNSSPFQKLELK
             + +   +  D  +   ++MRWKE   ++   D          G++  GFYY+ FS S G I G+YY  +S P + L L+
Subjt:  PLMSQVEVDGGKSLDLSNYPCIFMRWKEQYFVNVGTD---------CGLTIAGFYYVCFSCSDGSISGFYYDPNSSPFQKLELK

Q8IVV7 Glucose-induced degradation protein 4 homolog1.9e-2535.21Show/hide
Query:  MPVRVESSAPSPISSADARQTSPQ----------ACTLLSVGQAFSGTQNVSNNQKDEAWRVNVRIQGLDLEHGYLCGTMEALNVPMADTPVVTFWEGEI
        MPVR E   P+  S+A A    P           A +LL  G  F G Q    N  D    V V +Q +D  + YLCG ++   +      + TF+EGEI
Subjt:  MPVRVESSAPSPISSADARQTSPQ----------ACTLLSVGQAFSGTQNVSNNQKDEAWRVNVRIQGLDLEHGYLCGTMEALNVPMADTPVVTFWEGEI

Query:  VDGKNYNFFTGKWQAAPEDDIRHWTKFPSFSPLMSQVEVDGGKSLDLSNYPCIFMRWKEQYFV---NVGTDCGLTIAGFYYVCFSCSDGSISGFYYDPNS
        +  K + F T KW A  + D +HW KF +F         D     +L N   +FMRWKEQ+ V    +    G + AGFYY+CF  S  SI G+YY  +S
Subjt:  VDGKNYNFFTGKWQAAPEDDIRHWTKFPSFSPLMSQVEVDGGKSLDLSNYPCIFMRWKEQYFV---NVGTDCGLTIAGFYYVCFSCSDGSISGFYYDPNS

Query:  SPFQKLELKSTNE
          +Q L L    E
Subjt:  SPFQKLELKSTNE

Q9CPY6 Glucose-induced degradation protein 4 homolog5.5e-2534.74Show/hide
Query:  MPVRVESSAPSPISSADARQTSPQ----------ACTLLSVGQAFSGTQNVSNNQKDEAWRVNVRIQGLDLEHGYLCGTMEALNVPMADTPVVTFWEGEI
        MPVR E   P+  S+  A    P           A +LL  G  F G Q    N  D    V V +Q +D  + YLCG ++   +      + TF+EGEI
Subjt:  MPVRVESSAPSPISSADARQTSPQ----------ACTLLSVGQAFSGTQNVSNNQKDEAWRVNVRIQGLDLEHGYLCGTMEALNVPMADTPVVTFWEGEI

Query:  VDGKNYNFFTGKWQAAPEDDIRHWTKFPSFSPLMSQVEVDGGKSLDLSNYPCIFMRWKEQYFV---NVGTDCGLTIAGFYYVCFSCSDGSISGFYYDPNS
        +  K + F T KW A  + D +HW KF +F         D     +L N   +FMRWKEQ+ V    +    G + AGFYY+CF  S  SI G+YY  +S
Subjt:  VDGKNYNFFTGKWQAAPEDDIRHWTKFPSFSPLMSQVEVDGGKSLDLSNYPCIFMRWKEQYFV---NVGTDCGLTIAGFYYVCFSCSDGSISGFYYDPNS

Query:  SPFQKLELKSTNE
          +Q L L    E
Subjt:  SPFQKLELKSTNE

Arabidopsis top hitse value%identityAlignment
AT2G37680.1 CONTAINS InterPro DOMAIN/s: Vacuolar import/degradation protein Vid24 (InterPro:IPR018618); Has 318 Blast hits to 317 proteins in 131 species: Archae - 0; Bacteria - 0; Metazoa - 80; Fungi - 184; Plants - 51; Viruses - 0; Other Eukaryotes - 3 (source: NCBI BLink).2.0e-10280.93Show/hide
Query:  MPVR-VESSAPSPISSADARQTSP-QACTLLSVGQAFSGTQNVSNNQKDEAWRVNVRIQGLDLEHGYLCGTMEALNVPMADTPVVTFWEGEIVDGKNYNF
        MPVR VES+ P+ +S  D    SP    +LL  GQAFSGTQNVSN QK+EAWRVNV+IQG+DLEHGYLCGTMEALNVPMADTPV+TFWEGEIVDGKNY F
Subjt:  MPVR-VESSAPSPISSADARQTSP-QACTLLSVGQAFSGTQNVSNNQKDEAWRVNVRIQGLDLEHGYLCGTMEALNVPMADTPVVTFWEGEIVDGKNYNF

Query:  FTGKWQAAPEDDIRHWTKFPSFSPLMSQVEVDGGKSLDLSNYPCIFMRWKEQYFVNVGTDCGLTIAGFYYVCFSCSDGSISGFYYDPNSSPFQKLELKST
        +TGKW+A  EDD+RHW+KFPSFSPL  QVE DGG+ LDL+NYP IFMRWKEQYFVNVGTDCGLTIAGFYYVCFSCSDGSISGFYYDPNSSPFQKLELK+ 
Subjt:  FTGKWQAAPEDDIRHWTKFPSFSPLMSQVEVDGGKSLDLSNYPCIFMRWKEQYFVNVGTDCGLTIAGFYYVCFSCSDGSISGFYYDPNSSPFQKLELKST

Query:  NEGRSGFSFSSYELR
        NEGRSGFSFSSYEL+
Subjt:  NEGRSGFSFSSYELR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCCGGTCCGAGTAGAGAGCTCAGCGCCTTCTCCGATTTCAAGTGCTGATGCTAGGCAGACCTCTCCTCAAGCTTGTACGCTGTTGAGTGTGGGGCAGGCGTTTTCCGG
TACTCAGAACGTGTCTAATAATCAAAAGGATGAGGCATGGAGGGTGAATGTACGAATACAGGGGTTGGACCTTGAACATGGGTATCTCTGTGGCACGATGGAGGCTCTTA
ATGTTCCCATGGCGGATACACCGGTAGTAACCTTTTGGGAAGGAGAGATTGTGGATGGCAAGAATTATAATTTCTTCACTGGAAAATGGCAAGCAGCACCAGAAGATGAT
ATAAGGCACTGGACCAAATTTCCGTCATTTTCGCCCCTGATGAGCCAGGTGGAAGTTGATGGTGGAAAATCTTTGGATCTTAGTAATTATCCATGCATATTTATGAGATG
GAAAGAGCAATATTTCGTGAATGTTGGAACCGATTGTGGGTTAACCATAGCTGGCTTCTATTATGTTTGCTTCTCTTGTAGTGATGGTTCCATCAGCGGCTTCTACTATG
ACCCTAATAGTAGCCCATTTCAGAAGCTTGAGCTCAAATCCACAAATGAGGGAAGATCGGGTTTCAGCTTCTCATCGTACGAGTTGCGATGA
mRNA sequenceShow/hide mRNA sequence
CTGGGAAGAAAATATATTGAAGAATAAATAATAATAATAATAATAAATTAAATTAAATTTGAAAAAGAGTAAAAATCTAGTCCCAGAAACTTATGGGGTAAAAATGTCAA
TTTTCCAAAATCTTCTAAATGAAATGAAACTGTTATCATATTTCTAATAACAAAAATCGGAGGTTACTTTTCGACTTTAGAGTCTCATTGGTTTGGCGTTTGGTCCCTCC
GGCCACCCGAAAAGCTAGTTCCTTGCTGCTTAGTTTTACATTTACCATTAAACTTCTCGCAAATTGCGATCGCCGGAAATTTAGCGGGAAAGAGATGCCGGTCCGAGTAG
AGAGCTCAGCGCCTTCTCCGATTTCAAGTGCTGATGCTAGGCAGACCTCTCCTCAAGCTTGTACGCTGTTGAGTGTGGGGCAGGCGTTTTCCGGTACTCAGAACGTGTCT
AATAATCAAAAGGATGAGGCATGGAGGGTGAATGTACGAATACAGGGGTTGGACCTTGAACATGGGTATCTCTGTGGCACGATGGAGGCTCTTAATGTTCCCATGGCGGA
TACACCGGTAGTAACCTTTTGGGAAGGAGAGATTGTGGATGGCAAGAATTATAATTTCTTCACTGGAAAATGGCAAGCAGCACCAGAAGATGATATAAGGCACTGGACCA
AATTTCCGTCATTTTCGCCCCTGATGAGCCAGGTGGAAGTTGATGGTGGAAAATCTTTGGATCTTAGTAATTATCCATGCATATTTATGAGATGGAAAGAGCAATATTTC
GTGAATGTTGGAACCGATTGTGGGTTAACCATAGCTGGCTTCTATTATGTTTGCTTCTCTTGTAGTGATGGTTCCATCAGCGGCTTCTACTATGACCCTAATAGTAGCCC
ATTTCAGAAGCTTGAGCTCAAATCCACAAATGAGGGAAGATCGGGTTTCAGCTTCTCATCGTACGAGTTGCGATGACCTAAATGGAAAATTCGGTATTCCATTAAGGTTG
GTTTCACTTTTAGCTATATGTGAATTTCCATGGTTACCTCTCAAAGTTAAGGTGCGATATCGAGATTTTGCTTCAACACTTGGAGATCTACAGAGAAATTTAATGGGAAA
GGCCCCCGTGTACCTAGTTTTGTTTATGAAAATTCTGGTTCCCCAAAGTTACAGGATTGAAGACGCAAGCCACAGCAGTTGTATCTAAAGTTGATCTTAACCCTTTCCCA
TTTTGTAATTTACTGGAACTCAAGTTGTTTCCCACTGCATTTAGAACTTTGTTCCCTGTGCCTCTCTGAATGTATGTATTGGGATTGGGATTTCCTTTTTGTTGCTACCA
TATTCAGTAAGAACTTTTGAAGATTGGATCTGTGGGAACAATAATGACAAATTGCAAAAACGAACAACCATTTATGAATAACTGTTGGGTCACACTGCACCTACTCTCAA
A
Protein sequenceShow/hide protein sequence
MPVRVESSAPSPISSADARQTSPQACTLLSVGQAFSGTQNVSNNQKDEAWRVNVRIQGLDLEHGYLCGTMEALNVPMADTPVVTFWEGEIVDGKNYNFFTGKWQAAPEDD
IRHWTKFPSFSPLMSQVEVDGGKSLDLSNYPCIFMRWKEQYFVNVGTDCGLTIAGFYYVCFSCSDGSISGFYYDPNSSPFQKLELKSTNEGRSGFSFSSYELR