CuGenDBv2

Moc06g28700 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc06g28700
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: Unknown protein
Genome location: chr6:21629471..21637985
RNA-Seq Expression: Moc06g28700
Synteny: Moc06g28700
Gene Ontology terms: NA
InterPro domains: NA
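
The genome location above is written as `chromosome:start..end`. As a minimal sketch, assuming the coordinates are 1-based and inclusive (an assumption; the page does not state its convention), the string can be parsed and the span length computed as follows. The function name `parse_location` is hypothetical and not part of CuGenDB.

```python
import re

def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    chrom, start, end = match.groups()
    return chrom, int(start), int(end)

chrom, start, end = parse_location("chr6:21629471..21637985")
# Assumes 1-based inclusive coordinates, so both endpoints are counted.
span_length = end - start + 1
print(chrom, start, end, span_length)
```

Under that assumption the gene spans 8515 bp on chr6. With a 0-based half-open convention the length would be `end - start` instead.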


Homology
GenBank top hits (hit | e value | % identity | alignment)
XP_022138041.1 uncharacterized protein LOC111009298 [Momordica charantia] | e value: 1.3e-94 | % identity: 87.73
Query:  MCARKGAGGIVKGPTSIKRWVRKWFYASGEWLAKDESGRSFFDVTTRFGNLVPELTQASFDTLKYYKEHFSRGRKVGTLVIDKLLLESELLDYNPAVRPI
        MCARKGA GIVKGPTSIK WVRKWFYASGEWLAKDES      V  R    VPELTQASFDTLKYYKEHF RGRKVGTLV DKLLLES LLDYNPAVRPI
Subjt:  MCARKGAGGIVKGPTSIKRWVRKWFYASGEWLAKDESGRSFFDVTTRFGNLVPELTQASFDTLKYYKEHFSRGRKVGTLVIDKLLLESELLDYNPAVRPI

Query:  ESSRPNSELAMVCGFASNVKRKSKGRAHALEAAQSSVPTTPAVAEPASEDPTPVIELESSGGPSREKRPRGQTEAVDVSSLGEEVREEVPLKRRRKKKKT
        ESSRPNSELAMVCGFASNVKRKSKG+AHALEAAQSS P TPAV  PASEDP PVIELESS GPSREKRPR QTEAVDVS LGEEVREEVPLKRRRKKKKT
Subjt:  ESSRPNSELAMVCGFASNVKRKSKGRAHALEAAQSSVPTTPAVAEPASEDPTPVIELESSGGPSREKRPRGQTEAVDVSSLGEEVREEVPLKRRRKKKKT

Query:  TSPLEVGARGALPASFADRV
        TSPLEVGARG LPASFADRV
Subjt:  TSPLEVGARGALPASFADRV

XP_022142326.1 uncharacterized protein LOC111012467 [Momordica charantia] | e value: 1.7e-97 | % identity: 50.8
Query:  VPELTQASFDTLKYYKEHFSRGRKVGTLVIDKLLLESELLDYNPAVRPIESSRPNSELAMVCGFASNVKRKSKGRAHALEAAQSSVPTTPAV--------
        +PEL QA+FDTLK+YK++F RGRK+GTLV DKLLLES LLDYNP VRPIE+SRPNSELAMVCGF S+VKRKSKGRAHAL+  QSS P TPAV        
Subjt:  VPELTQASFDTLKYYKEHFSRGRKVGTLVIDKLLLESELLDYNPAVRPIESSRPNSELAMVCGFASNVKRKSKGRAHALEAAQSSVPTTPAV--------

Query:  AEPASEDPTPVIELESSGGPSREKRPRGQTEAVDVSSLGEEVREEVPLKRRRKKKKTTSPLEVGARGALPASFADRVSRISAASLDRCLRRASKFVSDPG
        A P+S  PTPVIEL+S+G  SREKR R ++EA+DVS L  EVR                                                         
Subjt:  AEPASEDPTPVIELESSGGPSREKRPRGQTEAVDVSSLGEEVREEVPLKRRRKKKKTTSPLEVGARGALPASFADRVSRISAASLDRCLRRASKFVSDPG

Query:  SVLQRTIDYAAEAFVASIQSALTVKAELDGKEALAAREKEKFSAALEAASSTMKDKLLKAHSEVEILKAEVETKAELLKKEEDRRKAQLRAAHAITKGLE
                                                                               E KAELLK+E++R KA LRAAHAITKGLE
Subjt:  SVLQRTIDYAAEAFVASIQSALTVKAELDGKEALAAREKEKFSAALEAASSTMKDKLLKAHSEVEILKAEVETKAELLKKEEDRRKAQLRAAHAITKGLE

Query:  KEKFQLLKEKDDMLQALEAKEEELKHATAELETVKERLSNGALLEESFRQHPDFDGFAKDFSDAGFKFLMKGIASDMPDLQIDLGGLKKRYAEQWASGPS
        KEKFQLLKEKDDMLQALE K+  +    AEL+  KERL+NGALLE +FRQHPDFDGFAKDFSDAGFKFLMKGIA+D+P L++DLG LKKRYAE+WASGP+
Subjt:  KEKFQLLKEKDDMLQALEAKEEELKHATAELETVKERLSNGALLEESFRQHPDFDGFAKDFSDAGFKFLMKGIASDMPDLQIDLGGLKKRYAEQWASGPS

Query:  GTPGPQALVDKYVRDLDSDYSDFEEDQVQRLDHSKLFTS
        GT GP +LVDKYVRDLDSDYSD +ED+V   + +++ T+
Subjt:  GTPGPQALVDKYVRDLDSDYSDFEEDQVQRLDHSKLFTS

XP_022150343.1 uncharacterized protein LOC111018538 [Momordica charantia] | e value: 2.3e-115 | % identity: 89.76
Query:  DRVSRISAASLDRCLRRASKFVSDPGSVLQRTIDYAAEAFVASIQSALTVKAELDGKEALAAREKEKFSAALEAASSTMKDKLLKAHSEVEILKAEVETK
        D+VSRISAASLDRCLRRASKFVS PGSVLQRTIDYAAEAFVASIQSAL VKAELDG+E LAAREKE+FSAALE ASSTMKD+LLKAHSEVE LKAEVE++
Subjt:  DRVSRISAASLDRCLRRASKFVSDPGSVLQRTIDYAAEAFVASIQSALTVKAELDGKEALAAREKEKFSAALEAASSTMKDKLLKAHSEVEILKAEVETK

Query:  AELLKKEEDRRKAQLRAAHAITKGLEKEKFQLLKEKDDMLQALEAKEEELKHATAELETVKERLSNGALLEESFRQHPDFDGFAKDFSDAGFKFLMKGIA
        AELLKKEEDRR+AQLRAAHAIT+GLE+EKFQLLKEKDDMLQALEAK++EL+HATAELET KERLSNG LLEE+FRQHPDFDGFAKDFSDAGFKFLMKGIA
Subjt:  AELLKKEEDRRKAQLRAAHAITKGLEKEKFQLLKEKDDMLQALEAKEEELKHATAELETVKERLSNGALLEESFRQHPDFDGFAKDFSDAGFKFLMKGIA

Query:  SDMPDLQIDLGGLKKRYAEQWASGPSGTPGPQALVDKYVRDLDSDYSDFEEDQV
        SDMPDLQIDL GLK+RYAE+WASGP GTPGPQALVD+YVRDLDSDYSD EEDQV
Subjt:  SDMPDLQIDLGGLKKRYAEQWASGPSGTPGPQALVDKYVRDLDSDYSDFEEDQV

XP_022159063.1 uncharacterized protein LOC111025502, partial [Momordica charantia] | e value: 1.2e-140 | % identity: 77.46
Query:  MSSSFSSDLGSDEDLARRLESELEEIENFRFSDDGEDSDASTSGQGLEYPSRIPEHYLGSLRRGFAIPENILLRIPEEWERADNPPEGWVTLYFKMFEYG
        MSSS SS+L  + DLARRLES+LEEIEN R SDDGEDSDASTSGQGLEYPSRIPEHYLGSLRRGFAIPENILLR+PEE ERADNPPEGWVTLYFKMFEYG
Subjt:  MSSSFSSDLGSDEDLARRLESELEEIENFRFSDDGEDSDASTSGQGLEYPSRIPEHYLGSLRRGFAIPENILLRIPEEWERADNPPEGWVTLYFKMFEYG

Query:  HKLPLHPFVQEFLFRTG-----------------------------------------------IAKKPGRFYMCARKGAGGIVKGPTSIKRWVRKWFYA
         +LPLHPFVQEFLFRTG                                               IAKKPGRFYMCARKGAGGIVKGPTSIK WVRKWFYA
Subjt:  HKLPLHPFVQEFLFRTG-----------------------------------------------IAKKPGRFYMCARKGAGGIVKGPTSIKRWVRKWFYA

Query:  SGEWLAKDESGRSFFDVTTRFGNL-----VPELTQASFDTLKYYKEHFSRGRKVGTLVIDKLLLESELLDYNPAVRPIESSRPNSELAMVCGFASNVKRK
        SGEWLAKDESGRSFFDV TRFGNL     VPELTQASFDTLKYYKE F RGRKVGTLV D+LLLES LLDYNPAVRPIESSRPNSELAMVCGFAS VKRK
Subjt:  SGEWLAKDESGRSFFDVTTRFGNL-----VPELTQASFDTLKYYKEHFSRGRKVGTLVIDKLLLESELLDYNPAVRPIESSRPNSELAMVCGFASNVKRK

Query:  SKGRAHALEAAQSSVPTTPAVAEPASEDPTPVIELESSGGPSREKRPRGQTEAVD
        SKGRAHALEAAQSS P TPAV  PASEDP  VIELESSGGPSREKRPR QTEAVD
Subjt:  SKGRAHALEAAQSSVPTTPAVAEPASEDPTPVIELESSGGPSREKRPRGQTEAVD

XP_022159252.1 uncharacterized protein LOC111025665 [Momordica charantia] | e value: 2.5e-173 | % identity: 67.77
Query:  MCARKGAGGIVKGPTSIKRWVRKWFYASGEWLAKDESGRSFFDVTTRFGNLV-----PELTQASFDTLKYYKEHFSRGRKVGTLVIDKLLLESELLDYNP
        MCARKG GGIVKGPTSIK WV KWF+ASGEWLAKDESGR+FFDV TRFGNLV     PEL QA+FDTLK+YK+HF R RK+ TLV DKLLLES LLDYNP
Subjt:  MCARKGAGGIVKGPTSIKRWVRKWFYASGEWLAKDESGRSFFDVTTRFGNLV-----PELTQASFDTLKYYKEHFSRGRKVGTLVIDKLLLESELLDYNP

Query:  AVRPIESSRPNSELAMVCGFASNVKRKSKGRAHALEAAQSSVPTTPAV--------AEPASEDPTPVIELESSGGPSREKRPRGQTEAVDVSSLGEEVRE
         VR IE+SRPNSELAMVCGF  +VKRKSKGRAHAL+    + P TP V        + P+S  PTPVIEL+ SGG S EKR R ++EA+DVS L  EVR 
Subjt:  AVRPIESSRPNSELAMVCGFASNVKRKSKGRAHALEAAQSSVPTTPAV--------AEPASEDPTPVIELESSGGPSREKRPRGQTEAVDVSSLGEEVRE

Query:  EVPLKRRRKKKKTTSPLEVGARGALPASFA------------------------------DRVSRISAASLDRCLRRASKFVSDPGSVLQRTIDYAAEAF
        E PL+RRRKKKKT+S  E GARG LP S A                              D+VSRISA  LDR LRRASKFVSDPGSVLQRTID  AEAF
Subjt:  EVPLKRRRKKKKTTSPLEVGARGALPASFA------------------------------DRVSRISAASLDRCLRRASKFVSDPGSVLQRTIDYAAEAF

Query:  VASIQSALTVKAELDGKEALAAREKEKFSAALEAASSTMKDKLLKAHSEVEILKAEVETKAELLKKEEDRRKAQLRAAHAITKGLEKEKFQLLKEKDDML
        +ASI  A+ VKAELDG+EALAA+E+E   AALEAA +T+K +LLKA  EV+IL+AEV+ K +LLKKE ++ KA LRAAHAITKGLEKEKFQLLKEKDD+ 
Subjt:  VASIQSALTVKAELDGKEALAAREKEKFSAALEAASSTMKDKLLKAHSEVEILKAEVETKAELLKKEEDRRKAQLRAAHAITKGLEKEKFQLLKEKDDML

Query:  QALEAKEEELKHATAELETVKERLSNGALLEESFRQHPDFDGFAKDFSDAGFKFLMKGIASDMPDLQIDLGGLKKRYAEQWASGPSGTPGPQALVDKYVR
        Q LE K+  +   T EL+ +KERL+NG LLEESFRQHPDFDGFAKDFSDAGFKFLMKGIA+DMP LQIDL GLKK+Y+E+WASGP+GTP PQ+LVDKYVR
Subjt:  QALEAKEEELKHATAELETVKERLSNGALLEESFRQHPDFDGFAKDFSDAGFKFLMKGIASDMPDLQIDLGGLKKRYAEQWASGPSGTPGPQALVDKYVR

Query:  DLDSDYSDFEED
        +LDSDYSD EE+
Subjt:  DLDSDYSDFEED

TrEMBL top hits (hit | e value | % identity | alignment)
A0A6J1C8K9 uncharacterized protein LOC111009298 | e value: 6.5e-95 | % identity: 87.73
Query:  MCARKGAGGIVKGPTSIKRWVRKWFYASGEWLAKDESGRSFFDVTTRFGNLVPELTQASFDTLKYYKEHFSRGRKVGTLVIDKLLLESELLDYNPAVRPI
        MCARKGA GIVKGPTSIK WVRKWFYASGEWLAKDES      V  R    VPELTQASFDTLKYYKEHF RGRKVGTLV DKLLLES LLDYNPAVRPI
Subjt:  MCARKGAGGIVKGPTSIKRWVRKWFYASGEWLAKDESGRSFFDVTTRFGNLVPELTQASFDTLKYYKEHFSRGRKVGTLVIDKLLLESELLDYNPAVRPI

Query:  ESSRPNSELAMVCGFASNVKRKSKGRAHALEAAQSSVPTTPAVAEPASEDPTPVIELESSGGPSREKRPRGQTEAVDVSSLGEEVREEVPLKRRRKKKKT
        ESSRPNSELAMVCGFASNVKRKSKG+AHALEAAQSS P TPAV  PASEDP PVIELESS GPSREKRPR QTEAVDVS LGEEVREEVPLKRRRKKKKT
Subjt:  ESSRPNSELAMVCGFASNVKRKSKGRAHALEAAQSSVPTTPAVAEPASEDPTPVIELESSGGPSREKRPRGQTEAVDVSSLGEEVREEVPLKRRRKKKKT

Query:  TSPLEVGARGALPASFADRV
        TSPLEVGARG LPASFADRV
Subjt:  TSPLEVGARGALPASFADRV

A0A6J1CLV1 uncharacterized protein LOC111012467 | e value: 8.2e-98 | % identity: 50.8
Query:  VPELTQASFDTLKYYKEHFSRGRKVGTLVIDKLLLESELLDYNPAVRPIESSRPNSELAMVCGFASNVKRKSKGRAHALEAAQSSVPTTPAV--------
        +PEL QA+FDTLK+YK++F RGRK+GTLV DKLLLES LLDYNP VRPIE+SRPNSELAMVCGF S+VKRKSKGRAHAL+  QSS P TPAV        
Subjt:  VPELTQASFDTLKYYKEHFSRGRKVGTLVIDKLLLESELLDYNPAVRPIESSRPNSELAMVCGFASNVKRKSKGRAHALEAAQSSVPTTPAV--------

Query:  AEPASEDPTPVIELESSGGPSREKRPRGQTEAVDVSSLGEEVREEVPLKRRRKKKKTTSPLEVGARGALPASFADRVSRISAASLDRCLRRASKFVSDPG
        A P+S  PTPVIEL+S+G  SREKR R ++EA+DVS L  EVR                                                         
Subjt:  AEPASEDPTPVIELESSGGPSREKRPRGQTEAVDVSSLGEEVREEVPLKRRRKKKKTTSPLEVGARGALPASFADRVSRISAASLDRCLRRASKFVSDPG

Query:  SVLQRTIDYAAEAFVASIQSALTVKAELDGKEALAAREKEKFSAALEAASSTMKDKLLKAHSEVEILKAEVETKAELLKKEEDRRKAQLRAAHAITKGLE
                                                                               E KAELLK+E++R KA LRAAHAITKGLE
Subjt:  SVLQRTIDYAAEAFVASIQSALTVKAELDGKEALAAREKEKFSAALEAASSTMKDKLLKAHSEVEILKAEVETKAELLKKEEDRRKAQLRAAHAITKGLE

Query:  KEKFQLLKEKDDMLQALEAKEEELKHATAELETVKERLSNGALLEESFRQHPDFDGFAKDFSDAGFKFLMKGIASDMPDLQIDLGGLKKRYAEQWASGPS
        KEKFQLLKEKDDMLQALE K+  +    AEL+  KERL+NGALLE +FRQHPDFDGFAKDFSDAGFKFLMKGIA+D+P L++DLG LKKRYAE+WASGP+
Subjt:  KEKFQLLKEKDDMLQALEAKEEELKHATAELETVKERLSNGALLEESFRQHPDFDGFAKDFSDAGFKFLMKGIASDMPDLQIDLGGLKKRYAEQWASGPS

Query:  GTPGPQALVDKYVRDLDSDYSDFEEDQVQRLDHSKLFTS
        GT GP +LVDKYVRDLDSDYSD +ED+V   + +++ T+
Subjt:  GTPGPQALVDKYVRDLDSDYSDFEEDQVQRLDHSKLFTS

A0A6J1D971 uncharacterized protein LOC111018538 | e value: 1.1e-115 | % identity: 89.76
Query:  DRVSRISAASLDRCLRRASKFVSDPGSVLQRTIDYAAEAFVASIQSALTVKAELDGKEALAAREKEKFSAALEAASSTMKDKLLKAHSEVEILKAEVETK
        D+VSRISAASLDRCLRRASKFVS PGSVLQRTIDYAAEAFVASIQSAL VKAELDG+E LAAREKE+FSAALE ASSTMKD+LLKAHSEVE LKAEVE++
Subjt:  DRVSRISAASLDRCLRRASKFVSDPGSVLQRTIDYAAEAFVASIQSALTVKAELDGKEALAAREKEKFSAALEAASSTMKDKLLKAHSEVEILKAEVETK

Query:  AELLKKEEDRRKAQLRAAHAITKGLEKEKFQLLKEKDDMLQALEAKEEELKHATAELETVKERLSNGALLEESFRQHPDFDGFAKDFSDAGFKFLMKGIA
        AELLKKEEDRR+AQLRAAHAIT+GLE+EKFQLLKEKDDMLQALEAK++EL+HATAELET KERLSNG LLEE+FRQHPDFDGFAKDFSDAGFKFLMKGIA
Subjt:  AELLKKEEDRRKAQLRAAHAITKGLEKEKFQLLKEKDDMLQALEAKEEELKHATAELETVKERLSNGALLEESFRQHPDFDGFAKDFSDAGFKFLMKGIA

Query:  SDMPDLQIDLGGLKKRYAEQWASGPSGTPGPQALVDKYVRDLDSDYSDFEEDQV
        SDMPDLQIDL GLK+RYAE+WASGP GTPGPQALVD+YVRDLDSDYSD EEDQV
Subjt:  SDMPDLQIDLGGLKKRYAEQWASGPSGTPGPQALVDKYVRDLDSDYSDFEEDQV

A0A6J1DXS5 uncharacterized protein LOC111025502 | e value: 6.0e-141 | % identity: 77.46
Query:  MSSSFSSDLGSDEDLARRLESELEEIENFRFSDDGEDSDASTSGQGLEYPSRIPEHYLGSLRRGFAIPENILLRIPEEWERADNPPEGWVTLYFKMFEYG
        MSSS SS+L  + DLARRLES+LEEIEN R SDDGEDSDASTSGQGLEYPSRIPEHYLGSLRRGFAIPENILLR+PEE ERADNPPEGWVTLYFKMFEYG
Subjt:  MSSSFSSDLGSDEDLARRLESELEEIENFRFSDDGEDSDASTSGQGLEYPSRIPEHYLGSLRRGFAIPENILLRIPEEWERADNPPEGWVTLYFKMFEYG

Query:  HKLPLHPFVQEFLFRTG-----------------------------------------------IAKKPGRFYMCARKGAGGIVKGPTSIKRWVRKWFYA
         +LPLHPFVQEFLFRTG                                               IAKKPGRFYMCARKGAGGIVKGPTSIK WVRKWFYA
Subjt:  HKLPLHPFVQEFLFRTG-----------------------------------------------IAKKPGRFYMCARKGAGGIVKGPTSIKRWVRKWFYA

Query:  SGEWLAKDESGRSFFDVTTRFGNL-----VPELTQASFDTLKYYKEHFSRGRKVGTLVIDKLLLESELLDYNPAVRPIESSRPNSELAMVCGFASNVKRK
        SGEWLAKDESGRSFFDV TRFGNL     VPELTQASFDTLKYYKE F RGRKVGTLV D+LLLES LLDYNPAVRPIESSRPNSELAMVCGFAS VKRK
Subjt:  SGEWLAKDESGRSFFDVTTRFGNL-----VPELTQASFDTLKYYKEHFSRGRKVGTLVIDKLLLESELLDYNPAVRPIESSRPNSELAMVCGFASNVKRK

Query:  SKGRAHALEAAQSSVPTTPAVAEPASEDPTPVIELESSGGPSREKRPRGQTEAVD
        SKGRAHALEAAQSS P TPAV  PASEDP  VIELESSGGPSREKRPR QTEAVD
Subjt:  SKGRAHALEAAQSSVPTTPAVAEPASEDPTPVIELESSGGPSREKRPRGQTEAVD

A0A6J1DZB3 uncharacterized protein LOC111025665 | e value: 1.2e-173 | % identity: 67.77
Query:  MCARKGAGGIVKGPTSIKRWVRKWFYASGEWLAKDESGRSFFDVTTRFGNLV-----PELTQASFDTLKYYKEHFSRGRKVGTLVIDKLLLESELLDYNP
        MCARKG GGIVKGPTSIK WV KWF+ASGEWLAKDESGR+FFDV TRFGNLV     PEL QA+FDTLK+YK+HF R RK+ TLV DKLLLES LLDYNP
Subjt:  MCARKGAGGIVKGPTSIKRWVRKWFYASGEWLAKDESGRSFFDVTTRFGNLV-----PELTQASFDTLKYYKEHFSRGRKVGTLVIDKLLLESELLDYNP

Query:  AVRPIESSRPNSELAMVCGFASNVKRKSKGRAHALEAAQSSVPTTPAV--------AEPASEDPTPVIELESSGGPSREKRPRGQTEAVDVSSLGEEVRE
         VR IE+SRPNSELAMVCGF  +VKRKSKGRAHAL+    + P TP V        + P+S  PTPVIEL+ SGG S EKR R ++EA+DVS L  EVR 
Subjt:  AVRPIESSRPNSELAMVCGFASNVKRKSKGRAHALEAAQSSVPTTPAV--------AEPASEDPTPVIELESSGGPSREKRPRGQTEAVDVSSLGEEVRE

Query:  EVPLKRRRKKKKTTSPLEVGARGALPASFA------------------------------DRVSRISAASLDRCLRRASKFVSDPGSVLQRTIDYAAEAF
        E PL+RRRKKKKT+S  E GARG LP S A                              D+VSRISA  LDR LRRASKFVSDPGSVLQRTID  AEAF
Subjt:  EVPLKRRRKKKKTTSPLEVGARGALPASFA------------------------------DRVSRISAASLDRCLRRASKFVSDPGSVLQRTIDYAAEAF

Query:  VASIQSALTVKAELDGKEALAAREKEKFSAALEAASSTMKDKLLKAHSEVEILKAEVETKAELLKKEEDRRKAQLRAAHAITKGLEKEKFQLLKEKDDML
        +ASI  A+ VKAELDG+EALAA+E+E   AALEAA +T+K +LLKA  EV+IL+AEV+ K +LLKKE ++ KA LRAAHAITKGLEKEKFQLLKEKDD+ 
Subjt:  VASIQSALTVKAELDGKEALAAREKEKFSAALEAASSTMKDKLLKAHSEVEILKAEVETKAELLKKEEDRRKAQLRAAHAITKGLEKEKFQLLKEKDDML

Query:  QALEAKEEELKHATAELETVKERLSNGALLEESFRQHPDFDGFAKDFSDAGFKFLMKGIASDMPDLQIDLGGLKKRYAEQWASGPSGTPGPQALVDKYVR
        Q LE K+  +   T EL+ +KERL+NG LLEESFRQHPDFDGFAKDFSDAGFKFLMKGIA+DMP LQIDL GLKK+Y+E+WASGP+GTP PQ+LVDKYVR
Subjt:  QALEAKEEELKHATAELETVKERLSNGALLEESFRQHPDFDGFAKDFSDAGFKFLMKGIASDMPDLQIDLGGLKKRYAEQWASGPSGTPGPQALVDKYVR

Query:  DLDSDYSDFEED
        +LDSDYSD EE+
Subjt:  DLDSDYSDFEED

SwissProt top hits: No hits found
Arabidopsis top hits: No hits found
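
In the alignment blocks above, the unlabeled middle line follows the usual BLAST convention: a letter marks an identical residue, `+` marks a conservative substitution, and a blank marks a mismatch or gap. Because the Subjt lines on this page repeat the Query text, the middle line is the only place where the differences are visible. The following is a minimal sketch of counting identities and positives from one such line. The helper name `midline_stats` is hypothetical, and the result for a single segment is not expected to reproduce the % identity values reported for the full hits.

```python
def midline_stats(midline: str) -> tuple[int, int]:
    """Return (identities, positives) for one BLAST-style middle line."""
    # Letters are identical residues; '+' marks a conservative substitution.
    identities = sum(1 for ch in midline if ch.isalpha())
    positives = identities + midline.count("+")
    return identities, positives

# Last segment of the first GenBank alignment shown on this page.
query = "TSPLEVGARGALPASFADRV"
midline = "TSPLEVGARG LPASFADRV"
identities, positives = midline_stats(midline)
print(f"{identities}/{len(query)} identical, {positives}/{len(query)} positive")
```

For this 20-residue segment the sketch counts 19 identities; summing over every segment of a hit would be needed for a whole-alignment figure.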

Sequences
CDS sequence
ATGTCATCCTCTTTTAGCAGCGACTTAGGATCCGATGAGGATTTAGCTCGTAGGTTAGAGTCCGAGCTCGAGGAGATAGAAAACTTTAGGTTCTCCGATGACGGGGAGGA
TAGTGATGCCTCCACCTCGGGTCAGGGTTTGGAATACCCTTCTAGGATACCTGAGCACTACCTCGGATCCCTTCGTAGGGGGTTCGCTATCCCTGAGAACATCCTCCTTA
GGATTCCGGAGGAGTGGGAGAGAGCTGACAATCCTCCAGAGGGATGGGTCACTCTCTACTTCAAAATGTTTGAGTACGGCCACAAGCTTCCCCTTCACCCTTTTGTCCAA
GAATTTCTCTTCCGAACTGGGATAGCTAAGAAGCCTGGTCGGTTCTATATGTGCGCAAGGAAAGGCGCAGGCGGTATAGTTAAGGGGCCGACCTCCATCAAAAGATGGGT
GAGGAAGTGGTTCTACGCTTCCGGGGAATGGCTTGCAAAGGACGAGTCGGGTCGTTCCTTCTTTGATGTTACCACTAGGTTTGGGAACCTAGTCCCCGAGCTTACACAAG
CCTCTTTCGACACGCTGAAATATTACAAGGAGCATTTTTCGAGGGGTAGGAAGGTCGGAACCTTGGTGATCGACAAGCTGCTGCTTGAGTCCGAGCTGCTAGATTACAAC
CCTGCAGTTCGTCCCATTGAATCTTCAAGGCCGAACTCCGAACTTGCCATGGTTTGCGGATTTGCAAGCAACGTGAAACGCAAGTCCAAGGGCCGAGCCCATGCTCTTGA
GGCCGCCCAGAGTTCGGTACCTACCACTCCTGCTGTGGCAGAGCCAGCCTCGGAAGATCCAACCCCAGTGATCGAGCTGGAGTCTTCTGGGGGTCCTTCGAGGGAGAAGC
GCCCCAGAGGTCAGACAGAGGCGGTGGACGTCTCCTCCTTGGGCGAGGAGGTGAGGGAGGAGGTCCCTCTGAAGCGAAGGAGGAAGAAGAAGAAGACCACCTCCCCCTTG
GAGGTCGGAGCTCGTGGGGCCCTGCCTGCGAGCTTCGCAGATCGGGTGTCCCGCATCTCGGCTGCAAGTTTGGACCGCTGCCTCAGAAGAGCGTCCAAATTTGTAAGTGA
CCCGGGGTCCGTCCTGCAGAGGACCATCGACTACGCCGCTGAGGCGTTTGTTGCTTCCATTCAATCGGCTCTGACCGTGAAGGCCGAGCTGGATGGGAAGGAAGCTCTGG
CAGCGAGGGAGAAAGAGAAGTTCTCTGCTGCCTTGGAGGCTGCTTCTTCCACCATGAAGGATAAGCTGCTGAAAGCTCACTCTGAGGTGGAAATTTTGAAGGCCGAGGTG
GAGACCAAGGCCGAGCTGCTGAAAAAAGAAGAGGACAGACGCAAGGCCCAGCTCCGAGCTGCCCATGCTATCACCAAGGGCTTGGAGAAGGAGAAGTTCCAACTCCTCAA
GGAGAAGGACGACATGCTCCAGGCGCTTGAAGCGAAGGAGGAGGAGCTGAAGCATGCGACTGCCGAGCTGGAGACGGTGAAGGAGCGTCTCAGCAATGGAGCCCTATTGG
AGGAATCGTTCAGGCAACATCCTGACTTCGATGGATTTGCCAAAGACTTCTCTGACGCGGGCTTCAAGTTTCTCATGAAGGGCATTGCTTCCGACATGCCTGACCTTCAG
ATCGATCTCGGTGGTCTGAAGAAGAGGTATGCTGAGCAGTGGGCGTCTGGGCCTAGCGGCACCCCTGGCCCCCAAGCGTTGGTGGATAAGTATGTCAGAGATCTGGACTC
TGACTACTCCGACTTCGAAGAGGATCAGGTTCAAAGGCTTGATCATTCTAAACTTTTCACATCGCCCCCTTGCCTTGAAGGGGTTAGGCATTTCAATAGAGGCAGGGAAA
AGCGGCGTCGGTACAACGCTCCACCTCGGACCACGAACCGAGCTGCTTGCCTTGCCAACTTTCTGCGCTCCTTGGGGTCTTGTGGTGAATTGCCCCTAATGAAGTCCACA
ATCGGGTCCATCCATGAGGGCTCTGGAGCGTCGATCTCCATCAGATCTGGCTCCGAGATCGAGGGATTATCTAAGATCTCGACGGGGACCAACCTGGCCAGGTCGGTCTC
AAGGGTGTTCTGGGAAGGTGTCGGGCAATTGGCCGAAGCTCGTTCCATACTATCCGAGCTACGTCGTATCCGAGCTCAGCAATTTCGAGCTCAACAGTATAGGTTCAAGC
GAGTTGCTAGCGAGCAAGCTCAAGAGGGAAAGTCTCTAGGAGTTGATATAGAGCTAGTTGGAGAAAGAAAACCTGCAAAAACAGGTAGAGCACTCCGACAATCAAGTTAG
mRNA sequence
ATGTCATCCTCTTTTAGCAGCGACTTAGGATCCGATGAGGATTTAGCTCGTAGGTTAGAGTCCGAGCTCGAGGAGATAGAAAACTTTAGGTTCTCCGATGACGGGGAGGA
TAGTGATGCCTCCACCTCGGGTCAGGGTTTGGAATACCCTTCTAGGATACCTGAGCACTACCTCGGATCCCTTCGTAGGGGGTTCGCTATCCCTGAGAACATCCTCCTTA
GGATTCCGGAGGAGTGGGAGAGAGCTGACAATCCTCCAGAGGGATGGGTCACTCTCTACTTCAAAATGTTTGAGTACGGCCACAAGCTTCCCCTTCACCCTTTTGTCCAA
GAATTTCTCTTCCGAACTGGGATAGCTAAGAAGCCTGGTCGGTTCTATATGTGCGCAAGGAAAGGCGCAGGCGGTATAGTTAAGGGGCCGACCTCCATCAAAAGATGGGT
GAGGAAGTGGTTCTACGCTTCCGGGGAATGGCTTGCAAAGGACGAGTCGGGTCGTTCCTTCTTTGATGTTACCACTAGGTTTGGGAACCTAGTCCCCGAGCTTACACAAG
CCTCTTTCGACACGCTGAAATATTACAAGGAGCATTTTTCGAGGGGTAGGAAGGTCGGAACCTTGGTGATCGACAAGCTGCTGCTTGAGTCCGAGCTGCTAGATTACAAC
CCTGCAGTTCGTCCCATTGAATCTTCAAGGCCGAACTCCGAACTTGCCATGGTTTGCGGATTTGCAAGCAACGTGAAACGCAAGTCCAAGGGCCGAGCCCATGCTCTTGA
GGCCGCCCAGAGTTCGGTACCTACCACTCCTGCTGTGGCAGAGCCAGCCTCGGAAGATCCAACCCCAGTGATCGAGCTGGAGTCTTCTGGGGGTCCTTCGAGGGAGAAGC
GCCCCAGAGGTCAGACAGAGGCGGTGGACGTCTCCTCCTTGGGCGAGGAGGTGAGGGAGGAGGTCCCTCTGAAGCGAAGGAGGAAGAAGAAGAAGACCACCTCCCCCTTG
GAGGTCGGAGCTCGTGGGGCCCTGCCTGCGAGCTTCGCAGATCGGGTGTCCCGCATCTCGGCTGCAAGTTTGGACCGCTGCCTCAGAAGAGCGTCCAAATTTGTAAGTGA
CCCGGGGTCCGTCCTGCAGAGGACCATCGACTACGCCGCTGAGGCGTTTGTTGCTTCCATTCAATCGGCTCTGACCGTGAAGGCCGAGCTGGATGGGAAGGAAGCTCTGG
CAGCGAGGGAGAAAGAGAAGTTCTCTGCTGCCTTGGAGGCTGCTTCTTCCACCATGAAGGATAAGCTGCTGAAAGCTCACTCTGAGGTGGAAATTTTGAAGGCCGAGGTG
GAGACCAAGGCCGAGCTGCTGAAAAAAGAAGAGGACAGACGCAAGGCCCAGCTCCGAGCTGCCCATGCTATCACCAAGGGCTTGGAGAAGGAGAAGTTCCAACTCCTCAA
GGAGAAGGACGACATGCTCCAGGCGCTTGAAGCGAAGGAGGAGGAGCTGAAGCATGCGACTGCCGAGCTGGAGACGGTGAAGGAGCGTCTCAGCAATGGAGCCCTATTGG
AGGAATCGTTCAGGCAACATCCTGACTTCGATGGATTTGCCAAAGACTTCTCTGACGCGGGCTTCAAGTTTCTCATGAAGGGCATTGCTTCCGACATGCCTGACCTTCAG
ATCGATCTCGGTGGTCTGAAGAAGAGGTATGCTGAGCAGTGGGCGTCTGGGCCTAGCGGCACCCCTGGCCCCCAAGCGTTGGTGGATAAGTATGTCAGAGATCTGGACTC
TGACTACTCCGACTTCGAAGAGGATCAGGTTCAAAGGCTTGATCATTCTAAACTTTTCACATCGCCCCCTTGCCTTGAAGGGGTTAGGCATTTCAATAGAGGCAGGGAAA
AGCGGCGTCGGTACAACGCTCCACCTCGGACCACGAACCGAGCTGCTTGCCTTGCCAACTTTCTGCGCTCCTTGGGGTCTTGTGGTGAATTGCCCCTAATGAAGTCCACA
ATCGGGTCCATCCATGAGGGCTCTGGAGCGTCGATCTCCATCAGATCTGGCTCCGAGATCGAGGGATTATCTAAGATCTCGACGGGGACCAACCTGGCCAGGTCGGTCTC
AAGGGTGTTCTGGGAAGGTGTCGGGCAATTGGCCGAAGCTCGTTCCATACTATCCGAGCTACGTCGTATCCGAGCTCAGCAATTTCGAGCTCAACAGTATAGGTTCAAGC
GAGTTGCTAGCGAGCAAGCTCAAGAGGGAAAGTCTCTAGGAGTTGATATAGAGCTAGTTGGAGAAAGAAAACCTGCAAAAACAGGTAGAGCACTCCGACAATCAAGTTAG
Protein sequence
MSSSFSSDLGSDEDLARRLESELEEIENFRFSDDGEDSDASTSGQGLEYPSRIPEHYLGSLRRGFAIPENILLRIPEEWERADNPPEGWVTLYFKMFEYGHKLPLHPFVQ
EFLFRTGIAKKPGRFYMCARKGAGGIVKGPTSIKRWVRKWFYASGEWLAKDESGRSFFDVTTRFGNLVPELTQASFDTLKYYKEHFSRGRKVGTLVIDKLLLESELLDYN
PAVRPIESSRPNSELAMVCGFASNVKRKSKGRAHALEAAQSSVPTTPAVAEPASEDPTPVIELESSGGPSREKRPRGQTEAVDVSSLGEEVREEVPLKRRRKKKKTTSPL
EVGARGALPASFADRVSRISAASLDRCLRRASKFVSDPGSVLQRTIDYAAEAFVASIQSALTVKAELDGKEALAAREKEKFSAALEAASSTMKDKLLKAHSEVEILKAEV
ETKAELLKKEEDRRKAQLRAAHAITKGLEKEKFQLLKEKDDMLQALEAKEEELKHATAELETVKERLSNGALLEESFRQHPDFDGFAKDFSDAGFKFLMKGIASDMPDLQ
IDLGGLKKRYAEQWASGPSGTPGPQALVDKYVRDLDSDYSDFEEDQVQRLDHSKLFTSPPCLEGVRHFNRGREKRRRYNAPPRTTNRAACLANFLRSLGSCGELPLMKST
IGSIHEGSGASISIRSGSEIEGLSKISTGTNLARSVSRVFWEGVGQLAEARSILSELRRIRAQQFRAQQYRFKRVASEQAQEGKSLGVDIELVGERKPAKTGRALRQSS
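
The CDS and protein sequences above can be cross-checked by translating the CDS. The sketch below assumes the standard nuclear genetic code (NCBI translation table 1), which is an assumption about how the protein was derived. The function name `translate` is hypothetical, and only the first 30 nucleotides of the CDS are used as input to keep the example short.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code (NCBI translation table 1), codons in TCAG order.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)

# First 30 nucleotides of the CDS sequence listed above.
cds_start = "ATGTCATCCTCTTTTAGCAGCGACTTAGGA"
print(translate(cds_start))
```

The translated prefix can be compared with the first ten residues of the protein sequence (MSSSFSSDLG). Passing the full CDS, with its line breaks removed, would extend the same check to the whole protein.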