CuGenDBv2

MC03g1285 (gene) of Bitter gourd (Dali-11) v1 genome

Gene ID: MC03g1285
Organism: Momordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
Description: Plant protein of unknown function (DUF863)
Genome location: MC03:18700480..18701184
RNA-Seq Expression: MC03g1285
Synteny: MC03g1285
Gene Ontology terms: NA
InterPro domains: IPR008581 - Protein of unknown function DUF863, plant
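
As a quick consistency check, the genome location can be split into a chromosome name and coordinates. The sketch below is illustrative only: `parse_location` is a hypothetical helper, not part of CuGenDB, and it assumes the `chrom:start..end` form shown above with 1-based, inclusive coordinates.

```python
import re

def parse_location(location):
    """Split 'chrom:start..end' into (chrom, start, end, length)."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    chrom = match.group(1)
    start, end = int(match.group(2)), int(match.group(3))
    # Assumption: coordinates are 1-based and inclusive.
    return chrom, start, end, end - start + 1

chrom, start, end, length = parse_location("MC03:18700480..18701184")
print(chrom, length)  # MC03 705
```

Under that assumption the locus spans 705 bp, which is three times the 235 residues of the protein sequence listed on this page.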


Homology
GenBank top hits (e value, % identity, alignment)
XP_022137621.1 uncharacterized protein LOC111009019 [Momordica charantia] (e value: 3.77e-152; % identity: 100)
Query:  EDLITIVAEALVSISSSVAQNRQKTTASGQSAQAPCESLCWFAEIVSSMADDPETAEIALKSKDVGDSGELLPNYIDDFEIMTLKLKETKVEGCSLTVSN
        EDLITIVAEALVSISSSVAQNRQKTTASGQSAQAPCESLCWFAEIVSSMADDPETAEIALKSKDVGDSGELLPNYIDDFEIMTLKLKETKVEGCSLTVSN
Subjt:  EDLITIVAEALVSISSSVAQNRQKTTASGQSAQAPCESLCWFAEIVSSMADDPETAEIALKSKDVGDSGELLPNYIDDFEIMTLKLKETKVEGCSLTVSN

Query:  HQEEAVKNVSSPSCQLGKGRVRRGRGKNFQTEILPSLATLSRYEVTEDIQTIGGLMEVASSQSITGVVKTSSGSRASWTRGKRRSCNSSKPTETVIRSIM
        HQEEAVKNVSSPSCQLGKGRVRRGRGKNFQTEILPSLATLSRYEVTEDIQTIGGLMEVASSQSITGVVKTSSGSRASWTRGKRRSCNSSKPTETVIRSIM
Subjt:  HQEEAVKNVSSPSCQLGKGRVRRGRGKNFQTEILPSLATLSRYEVTEDIQTIGGLMEVASSQSITGVVKTSSGSRASWTRGKRRSCNSSKPTETVIRSIM

Query:  DQVSSDNELENKERNVIVWENITRRRRGQRYPACN
        DQVSSDNELENKERNVIVWENITRRRRGQRYPACN
Subjt:  DQVSSDNELENKERNVIVWENITRRRRGQRYPACN

XP_023519938.1 uncharacterized protein LOC111783256 isoform X1 [Cucurbita pepo subsp. pepo] (e value: 6.65e-100; % identity: 74.47)
Query:  EDLITIVAEALVSISSSVAQNRQKTTASGQSAQAPCESLCWFAEIVSSMADDPETAEIALKSKDVGDSGELLPNYIDDFEIMTLKLKETKVEGCSLTVSN
        EDL  I AEALVSISSSVAQN +K T   QS Q   ESLCW AEIVSSM  +PE AE+A+K KD  DS ELL N +DDFE+MTLKLKET  E CSLT SN
Subjt:  EDLITIVAEALVSISSSVAQNRQKTTASGQSAQAPCESLCWFAEIVSSMADDPETAEIALKSKDVGDSGELLPNYIDDFEIMTLKLKETKVEGCSLTVSN

Query:  HQEEAVKNVSSPSCQLGKGRVRRGRGKNFQTEILPSLATLSRYEVTEDIQTIGGLMEVASSQSITGVVKTSSGSRASWTRGKRRSCNSS-KPTETVIRSI
        HQEEA KNVSSPSCQLGKGR RRG+ KNFQTEILPSL TLSRYEVTEDIQTIGGLMEV SS SI G  KT S    +WTRGKRR C+SS K TE V+ SI
Subjt:  HQEEAVKNVSSPSCQLGKGRVRRGRGKNFQTEILPSLATLSRYEVTEDIQTIGGLMEVASSQSITGVVKTSSGSRASWTRGKRRSCNSS-KPTETVIRSI

Query:  MDQVSSDNELENKERNVIVWENITRRRRGQRYPAC
        MDQVSSDNE+ENKER V+VW NITRRRRG+RYPAC
Subjt:  MDQVSSDNELENKERNVIVWENITRRRRGQRYPAC

XP_023519940.1 uncharacterized protein LOC111783256 isoform X2 [Cucurbita pepo subsp. pepo] (e value: 6.49e-100; % identity: 74.47)
Query:  EDLITIVAEALVSISSSVAQNRQKTTASGQSAQAPCESLCWFAEIVSSMADDPETAEIALKSKDVGDSGELLPNYIDDFEIMTLKLKETKVEGCSLTVSN
        EDL  I AEALVSISSSVAQN +K T   QS Q   ESLCW AEIVSSM  +PE AE+A+K KD  DS ELL N +DDFE+MTLKLKET  E CSLT SN
Subjt:  EDLITIVAEALVSISSSVAQNRQKTTASGQSAQAPCESLCWFAEIVSSMADDPETAEIALKSKDVGDSGELLPNYIDDFEIMTLKLKETKVEGCSLTVSN

Query:  HQEEAVKNVSSPSCQLGKGRVRRGRGKNFQTEILPSLATLSRYEVTEDIQTIGGLMEVASSQSITGVVKTSSGSRASWTRGKRRSCNSS-KPTETVIRSI
        HQEEA KNVSSPSCQLGKGR RRG+ KNFQTEILPSL TLSRYEVTEDIQTIGGLMEV SS SI G  KT S    +WTRGKRR C+SS K TE V+ SI
Subjt:  HQEEAVKNVSSPSCQLGKGRVRRGRGKNFQTEILPSLATLSRYEVTEDIQTIGGLMEVASSQSITGVVKTSSGSRASWTRGKRRSCNSS-KPTETVIRSI

Query:  MDQVSSDNELENKERNVIVWENITRRRRGQRYPAC
        MDQVSSDNE+ENKER V+VW NITRRRRG+RYPAC
Subjt:  MDQVSSDNELENKERNVIVWENITRRRRGQRYPAC

XP_038894350.1 uncharacterized protein LOC120082969 isoform X1 [Benincasa hispida] (e value: 1.39e-103; % identity: 77.12)
Query:  EDLITIVAEALVSISSSVAQNRQKTTASGQSAQAPCESLCWFAEIVSSMADDPETAEIALKSKDVGDSGELLPNYIDDFEIMTLKLKETKVEGCSLTVSN
        EDL +I AE LVSISS VAQN  K T    S Q   ESLCWFAEIVSSM  DPE  E+ALK KD  DS ELL + +DDFE+MTLKLKET+ EGCSLT SN
Subjt:  EDLITIVAEALVSISSSVAQNRQKTTASGQSAQAPCESLCWFAEIVSSMADDPETAEIALKSKDVGDSGELLPNYIDDFEIMTLKLKETKVEGCSLTVSN

Query:  HQEEAVKNVSSPSCQLGKGRVRRGRGKNFQTEILPSLATLSRYEVTEDIQTIGGLMEVASSQSITGVVKTSSGSRASWTRGKRRSCNSS-KPTETVIRSI
        HQEEAVKNV   SCQLGKGRVRRG+  NFQTEILPSLATLSRYEVTEDIQTIGGLMEVASS SI GVVKT+S  R +WTRGKRR C+SS K TETVI SI
Subjt:  HQEEAVKNVSSPSCQLGKGRVRRGRGKNFQTEILPSLATLSRYEVTEDIQTIGGLMEVASSQSITGVVKTSSGSRASWTRGKRRSCNSS-KPTETVIRSI

Query:  MDQVSSDNELENKERNVIVWENITRRRRGQRYPACN
        MDQVSS NELENKER VIVW NITRRRRGQRYPACN
Subjt:  MDQVSSDNELENKERNVIVWENITRRRRGQRYPACN

XP_038894352.1 uncharacterized protein LOC120082969 isoform X2 [Benincasa hispida] (e value: 1.13e-104; % identity: 77.12)
Query:  EDLITIVAEALVSISSSVAQNRQKTTASGQSAQAPCESLCWFAEIVSSMADDPETAEIALKSKDVGDSGELLPNYIDDFEIMTLKLKETKVEGCSLTVSN
        EDL +I AE LVSISS VAQN  K T    S Q   ESLCWFAEIVSSM  DPE  E+ALK KD  DS ELL + +DDFE+MTLKLKET+ EGCSLT SN
Subjt:  EDLITIVAEALVSISSSVAQNRQKTTASGQSAQAPCESLCWFAEIVSSMADDPETAEIALKSKDVGDSGELLPNYIDDFEIMTLKLKETKVEGCSLTVSN

Query:  HQEEAVKNVSSPSCQLGKGRVRRGRGKNFQTEILPSLATLSRYEVTEDIQTIGGLMEVASSQSITGVVKTSSGSRASWTRGKRRSCNSS-KPTETVIRSI
        HQEEAVKNV   SCQLGKGRVRRG+  NFQTEILPSLATLSRYEVTEDIQTIGGLMEVASS SI GVVKT+S  R +WTRGKRR C+SS K TETVI SI
Subjt:  HQEEAVKNVSSPSCQLGKGRVRRGRGKNFQTEILPSLATLSRYEVTEDIQTIGGLMEVASSQSITGVVKTSSGSRASWTRGKRRSCNSS-KPTETVIRSI

Query:  MDQVSSDNELENKERNVIVWENITRRRRGQRYPACN
        MDQVSS NELENKER VIVW NITRRRRGQRYPACN
Subjt:  MDQVSSDNELENKERNVIVWENITRRRRGQRYPACN
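
The middle line of each alignment block can be used to count identical positions. The following is a minimal sketch, assuming BLAST-style midlines in which a letter marks an identical residue, `+` a conservative substitution, and a space a mismatch or gap; `identity_from_midline` is a hypothetical helper written for illustration.

```python
def identity_from_midline(midline):
    """Return (identical, aligned_length, percent) for one alignment midline."""
    if not midline:
        raise ValueError("empty midline")
    # Assumption: only identical residues are shown as letters.
    identical = sum(1 for ch in midline if ch.isalpha())
    aligned = len(midline)
    return identical, aligned, 100.0 * identical / aligned

# Midline of the last alignment block for XP_023519938.1 shown above.
midline = "MDQVSSDNE+ENKER V+VW NITRRRRG+RYPAC"
identical, aligned, percent = identity_from_midline(midline)
print(identical, aligned, round(percent, 2))  # 30 35 85.71
```

The % identity reported for a hit refers to the whole alignment, so the value for a single block will generally differ from it.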

TrEMBL top hits (e value, % identity, alignment)
A0A1S3BE34 uncharacterized protein LOC103488795 isoform X1 (e value: 1.75e-96; % identity: 74.58)
Query:  EDLITIVAEALVSISSSVAQNRQKTTASGQSAQAPCESLCWFAEIVSSMADDPETAEIALKSKDVGDSGELLPNYIDDFEIMTLKLKETKVEGCSLTVSN
        EDL TI AEALVSISSSVAQN  K T S  S Q   ESLCW AEIVSSM  DP+ AE+ALK KD  DS ELL +++D+FE+MTLKLKE + +GCSLT SN
Subjt:  EDLITIVAEALVSISSSVAQNRQKTTASGQSAQAPCESLCWFAEIVSSMADDPETAEIALKSKDVGDSGELLPNYIDDFEIMTLKLKETKVEGCSLTVSN

Query:  HQEEAVKNVSSPSCQLGKGRVRRGRGKNFQTEILPSLATLSRYEVTEDIQTIGGLMEVASSQSITGVVKTSSGSRASWTRGKRRSCNSS-KPTETVIRSI
        HQEEAVKNVSSPSCQ GK R RRG+ KNFQTEILPSLATLSRYEVTEDIQTIGGLMEVASS SI  V K +S  R + TRGKRR C+SS K TETVIRS 
Subjt:  HQEEAVKNVSSPSCQLGKGRVRRGRGKNFQTEILPSLATLSRYEVTEDIQTIGGLMEVASSQSITGVVKTSSGSRASWTRGKRRSCNSS-KPTETVIRSI

Query:  MDQVSSDNELENKERNVIVWENITRRRRGQRYPACN
        MD+VSSDNE +NKER VIVW NITRRRRGQRYPA N
Subjt:  MDQVSSDNELENKERNVIVWENITRRRRGQRYPACN

A0A1S3BEN4 uncharacterized protein LOC103488795 isoform X2 (e value: 1.59e-97; % identity: 74.58)
Query:  EDLITIVAEALVSISSSVAQNRQKTTASGQSAQAPCESLCWFAEIVSSMADDPETAEIALKSKDVGDSGELLPNYIDDFEIMTLKLKETKVEGCSLTVSN
        EDL TI AEALVSISSSVAQN  K T S  S Q   ESLCW AEIVSSM  DP+ AE+ALK KD  DS ELL +++D+FE+MTLKLKE + +GCSLT SN
Subjt:  EDLITIVAEALVSISSSVAQNRQKTTASGQSAQAPCESLCWFAEIVSSMADDPETAEIALKSKDVGDSGELLPNYIDDFEIMTLKLKETKVEGCSLTVSN

Query:  HQEEAVKNVSSPSCQLGKGRVRRGRGKNFQTEILPSLATLSRYEVTEDIQTIGGLMEVASSQSITGVVKTSSGSRASWTRGKRRSCNSS-KPTETVIRSI
        HQEEAVKNVSSPSCQ GK R RRG+ KNFQTEILPSLATLSRYEVTEDIQTIGGLMEVASS SI  V K +S  R + TRGKRR C+SS K TETVIRS 
Subjt:  HQEEAVKNVSSPSCQLGKGRVRRGRGKNFQTEILPSLATLSRYEVTEDIQTIGGLMEVASSQSITGVVKTSSGSRASWTRGKRRSCNSS-KPTETVIRSI

Query:  MDQVSSDNELENKERNVIVWENITRRRRGQRYPACN
        MD+VSSDNE +NKER VIVW NITRRRRGQRYPA N
Subjt:  MDQVSSDNELENKERNVIVWENITRRRRGQRYPACN

A0A5D3BIN1 Uncharacterized protein (e value: 1.75e-96; % identity: 74.58)
Query:  EDLITIVAEALVSISSSVAQNRQKTTASGQSAQAPCESLCWFAEIVSSMADDPETAEIALKSKDVGDSGELLPNYIDDFEIMTLKLKETKVEGCSLTVSN
        EDL TI AEALVSISSSVAQN  K T S  S Q   ESLCW AEIVSSM  DP+ AE+ALK KD  DS ELL +++D+FE+MTLKLKE + +GCSLT SN
Subjt:  EDLITIVAEALVSISSSVAQNRQKTTASGQSAQAPCESLCWFAEIVSSMADDPETAEIALKSKDVGDSGELLPNYIDDFEIMTLKLKETKVEGCSLTVSN

Query:  HQEEAVKNVSSPSCQLGKGRVRRGRGKNFQTEILPSLATLSRYEVTEDIQTIGGLMEVASSQSITGVVKTSSGSRASWTRGKRRSCNSS-KPTETVIRSI
        HQEEAVKNVSSPSCQ GK R RRG+ KNFQTEILPSLATLSRYEVTEDIQTIGGLMEVASS SI  V K +S  R + TRGKRR C+SS K TETVIRS 
Subjt:  HQEEAVKNVSSPSCQLGKGRVRRGRGKNFQTEILPSLATLSRYEVTEDIQTIGGLMEVASSQSITGVVKTSSGSRASWTRGKRRSCNSS-KPTETVIRSI

Query:  MDQVSSDNELENKERNVIVWENITRRRRGQRYPACN
        MD+VSSDNE +NKER VIVW NITRRRRGQRYPA N
Subjt:  MDQVSSDNELENKERNVIVWENITRRRRGQRYPACN

A0A6J1C758 uncharacterized protein LOC111009019 (e value: 1.82e-152; % identity: 100)
Query:  EDLITIVAEALVSISSSVAQNRQKTTASGQSAQAPCESLCWFAEIVSSMADDPETAEIALKSKDVGDSGELLPNYIDDFEIMTLKLKETKVEGCSLTVSN
        EDLITIVAEALVSISSSVAQNRQKTTASGQSAQAPCESLCWFAEIVSSMADDPETAEIALKSKDVGDSGELLPNYIDDFEIMTLKLKETKVEGCSLTVSN
Subjt:  EDLITIVAEALVSISSSVAQNRQKTTASGQSAQAPCESLCWFAEIVSSMADDPETAEIALKSKDVGDSGELLPNYIDDFEIMTLKLKETKVEGCSLTVSN

Query:  HQEEAVKNVSSPSCQLGKGRVRRGRGKNFQTEILPSLATLSRYEVTEDIQTIGGLMEVASSQSITGVVKTSSGSRASWTRGKRRSCNSSKPTETVIRSIM
        HQEEAVKNVSSPSCQLGKGRVRRGRGKNFQTEILPSLATLSRYEVTEDIQTIGGLMEVASSQSITGVVKTSSGSRASWTRGKRRSCNSSKPTETVIRSIM
Subjt:  HQEEAVKNVSSPSCQLGKGRVRRGRGKNFQTEILPSLATLSRYEVTEDIQTIGGLMEVASSQSITGVVKTSSGSRASWTRGKRRSCNSSKPTETVIRSIM

Query:  DQVSSDNELENKERNVIVWENITRRRRGQRYPACN
        DQVSSDNELENKERNVIVWENITRRRRGQRYPACN
Subjt:  DQVSSDNELENKERNVIVWENITRRRRGQRYPACN

A0A6J1KHA2 uncharacterized protein LOC111495741 (e value: 6.25e-99; % identity: 74.89)
Query:  EDLITIVAEALVSISSSVAQNRQKTTASGQSAQAPCESLCWFAEIVSSMADDPETAEIALKSKDVGDSGELLPNYIDDFEIMTLKLKETKVEGCSLTVSN
        EDL  I A+ALVSISSSVAQN +K T S QS Q   ESLCW AEIVSSM  DPE AE+ALK KD  DS ELL N +DDFE+MTLKLKET  E CSLT SN
Subjt:  EDLITIVAEALVSISSSVAQNRQKTTASGQSAQAPCESLCWFAEIVSSMADDPETAEIALKSKDVGDSGELLPNYIDDFEIMTLKLKETKVEGCSLTVSN

Query:  HQEEAVKNVSSPSCQLGKGRVRRGRGKNFQTEILPSLATLSRYEVTEDIQTIGGLMEVASSQSITGVVKTSSGSRASWTRGKRRSCNSS-KPTETVIRSI
        HQEEA KNVSSPSCQLGKGR RRG+ KNFQTEILPSL TLSRYEVTEDIQTIGGLME+ SS SI GV KT S    +WTRGKRR C+SS K TE VI SI
Subjt:  HQEEAVKNVSSPSCQLGKGRVRRGRGKNFQTEILPSLATLSRYEVTEDIQTIGGLMEVASSQSITGVVKTSSGSRASWTRGKRRSCNSS-KPTETVIRSI

Query:  MDQVSSDNELENKERNVIVWENITRRRRGQRYPAC
        MDQVSS NE+ENKER V VW NITRRRRG+RY AC
Subjt:  MDQVSSDNELENKERNVIVWENITRRRRGQRYPAC

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
AT1G12120.1 Plant protein of unknown function (DUF863) (e value: 2.5e-05; % identity: 29.14)
Query:  SGQSAQAPCESLCWFAEIVSSMADDPETAEIALKSK-------DVGDSGELLPN-YIDDFEIMTLKLKETKVEGCSLTVSNHQEEAVKNVSSPSCQLGKG
        S +  Q   ESL   +EI     D      +   S        D  + G+  P    D +E  TL + ET  E     VS+   + + N++  + +    
Subjt:  SGQSAQAPCESLCWFAEIVSSMADDPETAEIALKSK-------DVGDSGELLPN-YIDDFEIMTLKLKETKVEGCSLTVSNHQEEAVKNVSSPSCQLGKG

Query:  RVRRGRG-KNFQTEILPSLATLSRYEVTEDIQTIGGLMEVASSQSITGVVK
        ++RRGR  KNFQ EILPSL +LSR+E+ ED+  +  ++     + + G  K
Subjt:  RVRRGRG-KNFQTEILPSLATLSRYEVTEDIQTIGGLMEVASSQSITGVVK

AT1G13940.1 Plant protein of unknown function (DUF863) (e value: 6.2e-09; % identity: 29.03)
Query:  IVAEALVSISSSVAQNRQKTTASGQSAQAPCESLCWFAEIVSSMADDPETAEIALKSKDVGDSGELLPNYIDDFEIMTLKL---KETKVEGCSLTVSNHQ
        + AE +V+I S+      +  AS + ++     L WFAE V++  ++ +  ++   S++   S E     ID FE MTL+L    E +     L   N +
Subjt:  IVAEALVSISSSVAQNRQKTTASGQSAQAPCESLCWFAEIVSSMADDPETAEIALKSKDVGDSGELLPNYIDDFEIMTLKL---KETKVEGCSLTVSNHQ

Query:  -EEAVKNVSSPSCQLGKGRVRRGR-GKNFQTEILPSLATLSRYEVTEDIQTIGGLMEVAS-SQSITGVVKTSSGSRASWTRGKRRSCNSSKPTETVI---
         EE        S +  +G  R+G+  ++FQ +ILP L +LS++EVTEDIQ   G M     S + TG+ +  +GSR    R +R     ++P    +   
Subjt:  -EEAVKNVSSPSCQLGKGRVRRGR-GKNFQTEILPSLATLSRYEVTEDIQTIGGLMEVAS-SQSITGVVKTSSGSRASWTRGKRRSCNSSKPTETVI---

Query:  ----RSIMDQVSS-----DNELENKERNVIVWENITRRRRGQRYPACN
             S+   VS+     + E+  ++R+   W  +TRR R QR P+ +
Subjt:  ----RSIMDQVSS-----DNELENKERNVIVWENITRRRRGQRYPACN

AT1G26620.1 Plant protein of unknown function (DUF863) (e value: 1.5e-15; % identity: 34.02)
Query:  DLITIVAEALVSISSSVAQNRQKTTASGQSAQAPCESLCWFAEIVSSMADD--------PETAEIALKSKDVGDSGELLPNYIDDFEIMTLKLKETKVEG
        +LI + AEA+V+IS +  Q      AS  +  A    L WFAEI++S  D+        PE  +     +D   SGE     ID FE MTL ++ETK E 
Subjt:  DLITIVAEALVSISSSVAQNRQKTTASGQSAQAPCESLCWFAEIVSSMADD--------PETAEIALKSKDVGDSGELLPNYIDDFEIMTLKLKETKVEG

Query:  CSLTVSNHQEEAVKNVSSPSCQLGKGRVRRGRGK-NFQTEILPSLATLSRYEVTEDIQTIGGLMEVASSQSITGVVKTSSGSRASWTRGKRRSCN-SSKP
                  E +K   +   +  +G+ RRGR K +FQ + LP L++LSR+EVTEDIQ  GGLM+       +G+    +  R      KR   N +  P
Subjt:  CSLTVSNHQEEAVKNVSSPSCQLGKGRVRRGRGK-NFQTEILPSLATLSRYEVTEDIQTIGGLMEVASSQSITGVVKTSSGSRASWTRGKRRSCN-SSKP

Query:  TETVIRSIMDQVSSDNELENKERNVIVWENITRRRRGQRYP
            +   M++  S   LE+ +  +  W   TRR R QR P
Subjt:  TETVIRSIMDQVSSDNELENKERNVIVWENITRRRRGQRYP

AT1G62530.1 Plant protein of unknown function (DUF863) (e value: 5.5e-05; % identity: 36.17)
Query:  DDFEIMTLKLKETKVEGCSLTVSNHQEEAVKNVSSPSCQLGKGRVRRGRG-KNFQTEILPSLATLSRYEVTEDIQTIGGLMEVASSQSITGVVK
        D FE+ TL+++ET  E      S    +A+ + S    + G  ++RRGR  KNFQ EILP L +LSR+E+ EDI  +  +      + + G  K
Subjt:  DDFEIMTLKLKETKVEGCSLTVSNHQEEAVKNVSSPSCQLGKGRVRRGRG-KNFQTEILPSLATLSRYEVTEDIQTIGGLMEVASSQSITGVVK

AT1G69360.1 Plant protein of unknown function (DUF863) (e value: 4.8e-17; % identity: 34.02)
Query:  EDLITIVAEALVSISSSVAQNRQKTTASGQSAQAPCESLCWFAEIVSSMADDPET-AEIALKSKDV-GDSGELLPNYIDDFEIMTLKLKETKVEGC---S
        ++LI   AEA+V+IS S         AS  +     E L WF   ++S  +D E+  +  L+++D  G   E      D FE MTL L +TK E      
Subjt:  EDLITIVAEALVSISSSVAQNRQKTTASGQSAQAPCESLCWFAEIVSSMADDPET-AEIALKSKDV-GDSGELLPNYIDDFEIMTLKLKETKVEGC---S

Query:  LTVSNHQEEAVKNVSSPSCQLGKGRVRRGRGK-NFQTEILPSLATLSRYEVTEDIQTIGGLMEVASSQSITGVVKTSSGSRASWTRGKRRSCNS--SKPT
        L     + +   ++   S +  +G+ RRGR K +FQ +ILP LA+LSR EVTED+Q  GGLM+       TG    S  +R S  RG++R  ++    P 
Subjt:  LTVSNHQEEAVKNVSSPSCQLGKGRVRRGRGK-NFQTEILPSLATLSRYEVTEDIQTIGGLMEVASSQSITGVVKTSSGSRASWTRGKRRSCNS--SKPT

Query:  ETVIRSIMDQVSSDNELENKERNVIVWENITRRRRGQRYPA
         + +   M+  SS   +  ++R++  W N TRR R  R PA
Subjt:  ETVIRSIMDQVSSDNELENKERNVIVWENITRRRRGQRYPA


Sequences
CDS sequence
GAGGATCTCATTACGATTGTTGCAGAAGCTTTGGTTTCTATTTCATCATCTGTGGCTCAAAATCGTCAAAAAACCACTGCCAGCGGTCAGTCTGCTCAAGCTCCATGTGA
ATCCTTATGCTGGTTTGCTGAAATAGTTTCTTCAATGGCAGATGATCCAGAGACGGCTGAAATAGCCTTGAAAAGTAAGGATGTTGGTGACTCAGGGGAGCTTTTGCCAA
ATTACATAGATGACTTTGAGATCATGACATTGAAGTTGAAAGAAACAAAAGTGGAAGGGTGTTCCCTGACTGTAAGTAACCACCAAGAGGAAGCAGTAAAAAATGTTTCT
TCGCCTTCATGTCAACTAGGGAAGGGCCGTGTGAGGCGGGGTCGGGGCAAGAATTTTCAGACAGAAATTCTTCCAAGTCTTGCAACTTTGTCAAGGTATGAGGTCACTGA
AGATATTCAGACAATTGGAGGCCTGATGGAAGTTGCAAGTTCTCAATCGATAACTGGTGTAGTGAAAACTTCAAGTGGAAGTAGAGCGTCATGGACAAGAGGGAAGAGAC
GGTCCTGTAATTCATCAAAACCTACAGAGACTGTGATAAGGTCAATCATGGATCAAGTAAGCAGTGATAATGAACTGGAGAATAAAGAAAGGAATGTTATAGTTTGGGAA
AATATTACACGGCGACGGAGGGGACAGAGATATCCAGCTTGTAAC
mRNA sequence
GAGGATCTCATTACGATTGTTGCAGAAGCTTTGGTTTCTATTTCATCATCTGTGGCTCAAAATCGTCAAAAAACCACTGCCAGCGGTCAGTCTGCTCAAGCTCCATGTGA
ATCCTTATGCTGGTTTGCTGAAATAGTTTCTTCAATGGCAGATGATCCAGAGACGGCTGAAATAGCCTTGAAAAGTAAGGATGTTGGTGACTCAGGGGAGCTTTTGCCAA
ATTACATAGATGACTTTGAGATCATGACATTGAAGTTGAAAGAAACAAAAGTGGAAGGGTGTTCCCTGACTGTAAGTAACCACCAAGAGGAAGCAGTAAAAAATGTTTCT
TCGCCTTCATGTCAACTAGGGAAGGGCCGTGTGAGGCGGGGTCGGGGCAAGAATTTTCAGACAGAAATTCTTCCAAGTCTTGCAACTTTGTCAAGGTATGAGGTCACTGA
AGATATTCAGACAATTGGAGGCCTGATGGAAGTTGCAAGTTCTCAATCGATAACTGGTGTAGTGAAAACTTCAAGTGGAAGTAGAGCGTCATGGACAAGAGGGAAGAGAC
GGTCCTGTAATTCATCAAAACCTACAGAGACTGTGATAAGGTCAATCATGGATCAAGTAAGCAGTGATAATGAACTGGAGAATAAAGAAAGGAATGTTATAGTTTGGGAA
AATATTACACGGCGACGGAGGGGACAGAGATATCCAGCTTGTAAC
Protein sequence
EDLITIVAEALVSISSSVAQNRQKTTASGQSAQAPCESLCWFAEIVSSMADDPETAEIALKSKDVGDSGELLPNYIDDFEIMTLKLKETKVEGCSLTVSNHQEEAVKNVS
SPSCQLGKGRVRRGRGKNFQTEILPSLATLSRYEVTEDIQTIGGLMEVASSQSITGVVKTSSGSRASWTRGKRRSCNSSKPTETVIRSIMDQVSSDNELENKERNVIVWE
NITRRRRGQRYPACN
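
The relationship between the CDS and the protein sequence above can be checked by translating the CDS with the standard genetic code. This is a minimal sketch rather than the database's own procedure; `translate` and `CODON_TABLE` are names introduced here for illustration, and only unambiguous A/C/G/T codons are handled.

```python
from itertools import product

BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
# Standard genetic code, built by enumerating codons in TCAG order.
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a coding sequence in frame 1; '*' marks a stop codon."""
    cds = cds.upper()
    if len(cds) % 3:
        raise ValueError("CDS length is not a multiple of three")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))

# First 30 nucleotides of the CDS listed above.
print(translate("GAGGATCTCATTACGATTGTTGCAGAAGCT"))  # EDLITIVAEA
```

The translated prefix matches the first ten residues of the protein sequence; the same function can be applied to the full CDS after joining its lines.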