CuGenDBv2

Moc03g25670 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc03g25670
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: Chloroplast, nucleus, chloroplast envelope, putative
Genome location: chr3:18467750..18470396
RNA-Seq Expression: Moc03g25670
Synteny: Moc03g25670
Gene Ontology terms: GO:0016021 - integral component of membrane (cellular component)
InterPro domains: NA
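
The genome location field packs the chromosome and both coordinates into one string. A minimal sketch of splitting it apart is shown here; the helper name `parse_location` is hypothetical, and the span calculation assumes 1-based, inclusive coordinates, which this record does not state.

```python
import re
from typing import Tuple


def parse_location(loc: str) -> Tuple[str, int, int]:
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    m = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", loc)
    if m is None:
        raise ValueError(f"unrecognized location: {loc!r}")
    return m.group(1), int(m.group(2)), int(m.group(3))


chrom, start, end = parse_location("chr3:18467750..18470396")
# Assumption: coordinates are 1-based and inclusive, so both ends count.
span = end - start + 1
print(chrom, start, end, span)  # chr3 18467750 18470396 2647
```

Under that assumption the locus spans 2647 bp on chr3.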


Homology
GenBank top hits (e value, % identity, alignment)
KGN65868.1 hypothetical protein Csa_023343 [Cucumis sativus] (e value: 2.6e-102; identity: 72.34%)
Query:  MSSMQITATQNSICSSRSICIASKSIYPSFRATRSRSALVNLSANASYFKQGLPVLKYKHRRAGLNHQHTPIVSLFGSKGKESGDGGSPWKTFDKVVENF
        MSSMQITATQNSIC+++SIC+ SKSIYPSF A +SR A+VNLSANASYFKQGLPVLKY+HRR GL +QHTPIVSL+GSKGK S DGGSPWK  DKVVE+F
Subjt:  MSSMQITATQNSICSSRSICIASKSIYPSFRATRSRSALVNLSANASYFKQGLPVLKYKHRRAGLNHQHTPIVSLFGSKGKESGDGGSPWKTFDKVVENF

Query:  KKGRSVEDVLRQQIEKKEFYDGGDGGKRPPSGGGGSG---------DSSSGSEDDSLGGIIDETLQVILATIGFIFLYIYIISGEELTRLAKDYIKFVFG
         KGRSVEDVLRQQIEKKEFYDGGDGGKRPP GGGGSG         DSSSGSED SL GI+DE LQVILAT+G +F+YIYI+SGEEL+RLAKDYIK++FG
Subjt:  KKGRSVEDVLRQQIEKKEFYDGGDGGKRPPSGGGGSG---------DSSSGSEDDSLGGIIDETLQVILATIGFIFLYIYIISGEELTRLAKDYIKFVFG

Query:  GSKSVRLKRAMYKWGRFYQKLTEKKQYDEYWLEKAIINTPTWWDHPDKYRRAVMDYMESQYENQHSASNVNDDAEMDVSNSD
        GSKSVRLKRAMY WG+FYQ L +KK+YD+YWLEKAI++TPTWWD+PDK       YM  + +NQ      +D  E D  +SD
Subjt:  GSKSVRLKRAMYKWGRFYQKLTEKKQYDEYWLEKAIINTPTWWDHPDKYRRAVMDYMESQYENQHSASNVNDDAEMDVSNSD

XP_008444591.1 PREDICTED: uncharacterized protein LOC103487859 [Cucumis melo] (e value: 1.0e-106; identity: 72.03%)
Query:  MSSMQITATQNSICSSRSICIASKSIYPSFRATRSRSALVNLSANASYFKQGLPVLKYKHRRAGLNHQHTPIVSLFGSKGKESGDGGSPWKTFDKVVENF
        MSSMQITATQNSIC+++SIC+ SKSIYPSF A +S  A+VNLSANASYFKQGLP+LKYKHRR GL HQHTPIVSLFGSKGK S DGGSPWK FDKVVE+F
Subjt:  MSSMQITATQNSICSSRSICIASKSIYPSFRATRSRSALVNLSANASYFKQGLPVLKYKHRRAGLNHQHTPIVSLFGSKGKESGDGGSPWKTFDKVVENF

Query:  KKGRSVEDVLRQQIEKKEFYDGGDGGKRPPSGGGGSG-----------DSSSGSEDDSLGGIIDETLQVILATIGFIFLYIYIISGEELTRLAKDYIKFV
        KKG SVEDVLR+QIEKKEFYDGGDGG+RPPSGGGG G           DSSSG++D SL   +DETLQV+LAT+GFIF+Y Y+++GEE+TRL KDYIK+ 
Subjt:  KKGRSVEDVLRQQIEKKEFYDGGDGGKRPPSGGGGSG-----------DSSSGSEDDSLGGIIDETLQVILATIGFIFLYIYIISGEELTRLAKDYIKFV

Query:  FGGSKSVRLKRAMYKWGRFYQKLTEKKQYDEYWLEKAIINTPTWWDHPDKYRRAVMDYMESQYENQHSASNVNDDAEMDVSNSDDE
        FGGSKSVRL+RAMY+WGRFYQ+LT KK+YDE+WLEKAIINTPTWWDHPD YR A M Y +++ + ++ AS  +DD E D    DDE
Subjt:  FGGSKSVRLKRAMYKWGRFYQKLTEKKQYDEYWLEKAIINTPTWWDHPDKYRRAVMDYMESQYENQHSASNVNDDAEMDVSNSDDE

XP_022140099.1 uncharacterized protein LOC111010834 [Momordica charantia] (e value: 8.3e-149; identity: 100%)
Query:  MSSMQITATQNSICSSRSICIASKSIYPSFRATRSRSALVNLSANASYFKQGLPVLKYKHRRAGLNHQHTPIVSLFGSKGKESGDGGSPWKTFDKVVENF
        MSSMQITATQNSICSSRSICIASKSIYPSFRATRSRSALVNLSANASYFKQGLPVLKYKHRRAGLNHQHTPIVSLFGSKGKESGDGGSPWKTFDKVVENF
Subjt:  MSSMQITATQNSICSSRSICIASKSIYPSFRATRSRSALVNLSANASYFKQGLPVLKYKHRRAGLNHQHTPIVSLFGSKGKESGDGGSPWKTFDKVVENF

Query:  KKGRSVEDVLRQQIEKKEFYDGGDGGKRPPSGGGGSGDSSSGSEDDSLGGIIDETLQVILATIGFIFLYIYIISGEELTRLAKDYIKFVFGGSKSVRLKR
        KKGRSVEDVLRQQIEKKEFYDGGDGGKRPPSGGGGSGDSSSGSEDDSLGGIIDETLQVILATIGFIFLYIYIISGEELTRLAKDYIKFVFGGSKSVRLKR
Subjt:  KKGRSVEDVLRQQIEKKEFYDGGDGGKRPPSGGGGSGDSSSGSEDDSLGGIIDETLQVILATIGFIFLYIYIISGEELTRLAKDYIKFVFGGSKSVRLKR

Query:  AMYKWGRFYQKLTEKKQYDEYWLEKAIINTPTWWDHPDKYRRAVMDYMESQYENQHSASNVNDDAEMDVSNSDDET
        AMYKWGRFYQKLTEKKQYDEYWLEKAIINTPTWWDHPDKYRRAVMDYMESQYENQHSASNVNDDAEMDVSNSDDET
Subjt:  AMYKWGRFYQKLTEKKQYDEYWLEKAIINTPTWWDHPDKYRRAVMDYMESQYENQHSASNVNDDAEMDVSNSDDET

XP_022994855.1 uncharacterized protein LOC111490456 [Cucurbita maxima] (e value: 2.1e-99; identity: 66.78%)
Query:  SSMQITATQNSICSSRSICIASKSIYPSFRATRSRSALVNLSANASYFKQGLPVLKYKHRRAGLNHQHTPIVSLFGSKGKESGDGGSPWKTFDKVVENFK
        S MQITATQNS+C ++S+C+ SKS YPSF A+++RSA VN SAN SY K+GLPVLKY HRR GL H++TPI SLFGSKGK++GDGGSPWK FDKVVENFK
Subjt:  SSMQITATQNSICSSRSICIASKSIYPSFRATRSRSALVNLSANASYFKQGLPVLKYKHRRAGLNHQHTPIVSLFGSKGKESGDGGSPWKTFDKVVENFK

Query:  KGRSVEDVLRQQIEKKEFYDGGDGGKRPPSGGGGS--GDSSSGSEDDSLGGIIDETLQVILATIGFIFLYIYIISGEELTRLAKDYIKFVFGGSKSVRLK
        KGRSVED+LRQQIE K+FYDGGDGG+ PP GGGGS  GDSSS SED ++ GI++ET+ V+LATIG + +YIYII G+EL  LAKDYIK++FG  +S RLK
Subjt:  KGRSVEDVLRQQIEKKEFYDGGDGGKRPPSGGGGS--GDSSSGSEDDSLGGIIDETLQVILATIGFIFLYIYIISGEELTRLAKDYIKFVFGGSKSVRLK

Query:  RAMYKWGRFYQKLTEKKQY-DEYWLEKAIINTPTWWDHPDKYRRAVMDYMESQYENQ---------HSASNVNDDAEMDVSNSDDE
         AMY WG+FY++ T KKQ  DEYWLEKAI+NTPTWWDHPDKYR A+M+Y+ESQ + +          S+S+  DD E + SNSDDE
Subjt:  RAMYKWGRFYQKLTEKKQY-DEYWLEKAIINTPTWWDHPDKYRRAVMDYMESQYENQ---------HSASNVNDDAEMDVSNSDDE

XP_038895689.1 uncharacterized protein LOC120083861 [Benincasa hispida] (e value: 4.3e-121; identity: 81.95%)
Query:  MSSMQITATQNSICSSRSICIASKSIYPSFRATRSRSALVNLSANASYFKQGLPVLKYKHRRAGLNHQHTPIVSLFGSKGKESGDGGSPWKTFDKVVENF
        MSSMQITATQNSICS++SIC+ SKSIYPSF A++SRS LVNLSAN S FKQGLPVLKYKHRR GL HQHTPIVSLFGSKGK++GDGGSPWK FD+VVENF
Subjt:  MSSMQITATQNSICSSRSICIASKSIYPSFRATRSRSALVNLSANASYFKQGLPVLKYKHRRAGLNHQHTPIVSLFGSKGKESGDGGSPWKTFDKVVENF

Query:  KKGRSVEDVLRQQIEKKEFYDGGDGGKRPPSGGG--GSGDSSSGSEDDSLGGIIDETLQVILATIGFIFLYIYIISGEELTRLAKDYIKFVFGGSKSVRL
        KKGRSVEDVLRQQIEKKEFYDGG+GGKRPPSGGG  GSGDSSSGSEDDSL GI+DETLQV+LAT+GFIFLYIYII+GEEL RLAKDYIK++FGGSKSVRL
Subjt:  KKGRSVEDVLRQQIEKKEFYDGGDGGKRPPSGGG--GSGDSSSGSEDDSLGGIIDETLQVILATIGFIFLYIYIISGEELTRLAKDYIKFVFGGSKSVRL

Query:  KRAMYKWGRFYQKLTEKKQYDEYWLEKAIINTPTWWDHPDKYRRAVMDYMESQYENQHSASNVNDDAEMDVSNSDDE
        +R+MY+WGRFYQKLTEKKQYDEYWLEKAI+NTPTWWDHPD YRR VM ++ESQ++ ++ AS  +D  E+D  NSDDE
Subjt:  KRAMYKWGRFYQKLTEKKQYDEYWLEKAIINTPTWWDHPDKYRRAVMDYMESQYENQHSASNVNDDAEMDVSNSDDE

TrEMBL top hits (e value, % identity, alignment)
A0A0A0LVP5 Uncharacterized protein (e value: 1.3e-102; identity: 72.34%)
Query:  MSSMQITATQNSICSSRSICIASKSIYPSFRATRSRSALVNLSANASYFKQGLPVLKYKHRRAGLNHQHTPIVSLFGSKGKESGDGGSPWKTFDKVVENF
        MSSMQITATQNSIC+++SIC+ SKSIYPSF A +SR A+VNLSANASYFKQGLPVLKY+HRR GL +QHTPIVSL+GSKGK S DGGSPWK  DKVVE+F
Subjt:  MSSMQITATQNSICSSRSICIASKSIYPSFRATRSRSALVNLSANASYFKQGLPVLKYKHRRAGLNHQHTPIVSLFGSKGKESGDGGSPWKTFDKVVENF

Query:  KKGRSVEDVLRQQIEKKEFYDGGDGGKRPPSGGGGSG---------DSSSGSEDDSLGGIIDETLQVILATIGFIFLYIYIISGEELTRLAKDYIKFVFG
         KGRSVEDVLRQQIEKKEFYDGGDGGKRPP GGGGSG         DSSSGSED SL GI+DE LQVILAT+G +F+YIYI+SGEEL+RLAKDYIK++FG
Subjt:  KKGRSVEDVLRQQIEKKEFYDGGDGGKRPPSGGGGSG---------DSSSGSEDDSLGGIIDETLQVILATIGFIFLYIYIISGEELTRLAKDYIKFVFG

Query:  GSKSVRLKRAMYKWGRFYQKLTEKKQYDEYWLEKAIINTPTWWDHPDKYRRAVMDYMESQYENQHSASNVNDDAEMDVSNSD
        GSKSVRLKRAMY WG+FYQ L +KK+YD+YWLEKAI++TPTWWD+PDK       YM  + +NQ      +D  E D  +SD
Subjt:  GSKSVRLKRAMYKWGRFYQKLTEKKQYDEYWLEKAIINTPTWWDHPDKYRRAVMDYMESQYENQHSASNVNDDAEMDVSNSD

A0A1S3BA69 uncharacterized protein LOC103487859 (e value: 5.0e-107; identity: 72.03%)
Query:  MSSMQITATQNSICSSRSICIASKSIYPSFRATRSRSALVNLSANASYFKQGLPVLKYKHRRAGLNHQHTPIVSLFGSKGKESGDGGSPWKTFDKVVENF
        MSSMQITATQNSIC+++SIC+ SKSIYPSF A +S  A+VNLSANASYFKQGLP+LKYKHRR GL HQHTPIVSLFGSKGK S DGGSPWK FDKVVE+F
Subjt:  MSSMQITATQNSICSSRSICIASKSIYPSFRATRSRSALVNLSANASYFKQGLPVLKYKHRRAGLNHQHTPIVSLFGSKGKESGDGGSPWKTFDKVVENF

Query:  KKGRSVEDVLRQQIEKKEFYDGGDGGKRPPSGGGGSG-----------DSSSGSEDDSLGGIIDETLQVILATIGFIFLYIYIISGEELTRLAKDYIKFV
        KKG SVEDVLR+QIEKKEFYDGGDGG+RPPSGGGG G           DSSSG++D SL   +DETLQV+LAT+GFIF+Y Y+++GEE+TRL KDYIK+ 
Subjt:  KKGRSVEDVLRQQIEKKEFYDGGDGGKRPPSGGGGSG-----------DSSSGSEDDSLGGIIDETLQVILATIGFIFLYIYIISGEELTRLAKDYIKFV

Query:  FGGSKSVRLKRAMYKWGRFYQKLTEKKQYDEYWLEKAIINTPTWWDHPDKYRRAVMDYMESQYENQHSASNVNDDAEMDVSNSDDE
        FGGSKSVRL+RAMY+WGRFYQ+LT KK+YDE+WLEKAIINTPTWWDHPD YR A M Y +++ + ++ AS  +DD E D    DDE
Subjt:  FGGSKSVRLKRAMYKWGRFYQKLTEKKQYDEYWLEKAIINTPTWWDHPDKYRRAVMDYMESQYENQHSASNVNDDAEMDVSNSDDE

A0A6J1CES8 uncharacterized protein LOC111010834 (e value: 4.0e-149; identity: 100%)
Query:  MSSMQITATQNSICSSRSICIASKSIYPSFRATRSRSALVNLSANASYFKQGLPVLKYKHRRAGLNHQHTPIVSLFGSKGKESGDGGSPWKTFDKVVENF
        MSSMQITATQNSICSSRSICIASKSIYPSFRATRSRSALVNLSANASYFKQGLPVLKYKHRRAGLNHQHTPIVSLFGSKGKESGDGGSPWKTFDKVVENF
Subjt:  MSSMQITATQNSICSSRSICIASKSIYPSFRATRSRSALVNLSANASYFKQGLPVLKYKHRRAGLNHQHTPIVSLFGSKGKESGDGGSPWKTFDKVVENF

Query:  KKGRSVEDVLRQQIEKKEFYDGGDGGKRPPSGGGGSGDSSSGSEDDSLGGIIDETLQVILATIGFIFLYIYIISGEELTRLAKDYIKFVFGGSKSVRLKR
        KKGRSVEDVLRQQIEKKEFYDGGDGGKRPPSGGGGSGDSSSGSEDDSLGGIIDETLQVILATIGFIFLYIYIISGEELTRLAKDYIKFVFGGSKSVRLKR
Subjt:  KKGRSVEDVLRQQIEKKEFYDGGDGGKRPPSGGGGSGDSSSGSEDDSLGGIIDETLQVILATIGFIFLYIYIISGEELTRLAKDYIKFVFGGSKSVRLKR

Query:  AMYKWGRFYQKLTEKKQYDEYWLEKAIINTPTWWDHPDKYRRAVMDYMESQYENQHSASNVNDDAEMDVSNSDDET
        AMYKWGRFYQKLTEKKQYDEYWLEKAIINTPTWWDHPDKYRRAVMDYMESQYENQHSASNVNDDAEMDVSNSDDET
Subjt:  AMYKWGRFYQKLTEKKQYDEYWLEKAIINTPTWWDHPDKYRRAVMDYMESQYENQHSASNVNDDAEMDVSNSDDET

A0A6J1GSZ4 uncharacterized protein LOC111457207 (e value: 7.2e-98; identity: 64.53%)
Query:  SSMQITATQNSICSSRSICIASKSIYPSFRATRSRSALVNLSANASYFKQGLPVLKYKHRRAGLNHQHTPIVSLFGSKGKESGDGGSPWKTFDKVVENFK
        S MQITATQNS+C ++SIC+ SKS YPSF A+++RSA VN SANASY K+GLPVLKY HRR GL H++TPI SLFGSKGK++ DGGSPWK FDKVVENFK
Subjt:  SSMQITATQNSICSSRSICIASKSIYPSFRATRSRSALVNLSANASYFKQGLPVLKYKHRRAGLNHQHTPIVSLFGSKGKESGDGGSPWKTFDKVVENFK

Query:  KGRSVEDVLRQQIEKKEFYDGGDGGKRPPSGGGG----SGDSSSGSEDDSLGGIIDETLQVILATIGFIFLYIYIISGEELTRLAKDYIKFVFGGSKSVR
        KGRSVED+LRQQIE K+FYDGGDGG+ PP GGGG     GDSSS SED S+ GI++ET+ V+LATIG + +YIYII G+EL  LAKDYIK++FG  +S R
Subjt:  KGRSVEDVLRQQIEKKEFYDGGDGGKRPPSGGGG----SGDSSSGSEDDSLGGIIDETLQVILATIGFIFLYIYIISGEELTRLAKDYIKFVFGGSKSVR

Query:  LKRAMYKWGRFYQKLTEKK-QYDEYWLEKAIINTPTWWDHPDKYRRAVMDYMESQYENQHSASNVN-----------------DDAEMDVSNSDDE
        LK AMY WG+FY++ T+KK + DEYWLEKAI+NTPTWWDHPDKYR A+M+Y+ESQ + +  AS+ +                 DD E + SNSDDE
Subjt:  LKRAMYKWGRFYQKLTEKK-QYDEYWLEKAIINTPTWWDHPDKYRRAVMDYMESQYENQHSASNVN-----------------DDAEMDVSNSDDE

A0A6J1K2H4 uncharacterized protein LOC111490456 (e value: 1.0e-99; identity: 66.78%)
Query:  SSMQITATQNSICSSRSICIASKSIYPSFRATRSRSALVNLSANASYFKQGLPVLKYKHRRAGLNHQHTPIVSLFGSKGKESGDGGSPWKTFDKVVENFK
        S MQITATQNS+C ++S+C+ SKS YPSF A+++RSA VN SAN SY K+GLPVLKY HRR GL H++TPI SLFGSKGK++GDGGSPWK FDKVVENFK
Subjt:  SSMQITATQNSICSSRSICIASKSIYPSFRATRSRSALVNLSANASYFKQGLPVLKYKHRRAGLNHQHTPIVSLFGSKGKESGDGGSPWKTFDKVVENFK

Query:  KGRSVEDVLRQQIEKKEFYDGGDGGKRPPSGGGGS--GDSSSGSEDDSLGGIIDETLQVILATIGFIFLYIYIISGEELTRLAKDYIKFVFGGSKSVRLK
        KGRSVED+LRQQIE K+FYDGGDGG+ PP GGGGS  GDSSS SED ++ GI++ET+ V+LATIG + +YIYII G+EL  LAKDYIK++FG  +S RLK
Subjt:  KGRSVEDVLRQQIEKKEFYDGGDGGKRPPSGGGGS--GDSSSGSEDDSLGGIIDETLQVILATIGFIFLYIYIISGEELTRLAKDYIKFVFGGSKSVRLK

Query:  RAMYKWGRFYQKLTEKKQY-DEYWLEKAIINTPTWWDHPDKYRRAVMDYMESQYENQ---------HSASNVNDDAEMDVSNSDDE
         AMY WG+FY++ T KKQ  DEYWLEKAI+NTPTWWDHPDKYR A+M+Y+ESQ + +          S+S+  DD E + SNSDDE
Subjt:  RAMYKWGRFYQKLTEKKQY-DEYWLEKAIINTPTWWDHPDKYRRAVMDYMESQYENQ---------HSASNVNDDAEMDVSNSDDE

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
AT2G43630.1 FUNCTIONS IN: molecular_function unknown (e value: 5.4e-53; identity: 46.69%)
Query:  SRSICIASKSIYPSFRATRSRSALVNLSANASYFKQGLPVLKYKHRRAGLNHQHTPIVSLFGSKGKESG-DGGSPWKTFDKVVENFKKGRSVEDVLRQQI
        +R  CI+S  I  S R          L A A+   Q  P+L ++ R      + +  V LFG K K  G D  SPWK  +K +      +SVED+LR+QI
Subjt:  SRSICIASKSIYPSFRATRSRSALVNLSANASYFKQGLPVLKYKHRRAGLNHQHTPIVSLFGSKGKESG-DGGSPWKTFDKVVENFKKGRSVEDVLRQQI

Query:  EKKEFYDGGDGGKRPPSGGGGSG--------DSSSGSEDDSLGGIIDETLQVILATIGFIFLYIYIISGEELTRLAKDYIKFVFGGSKSVRLKRAMYKWG
        +KK+FYD   GG  PP GGG  G           SG ED  L GI DETLQV+LAT+GFIFLY YII+GEEL +LA+DYI+F+ G  K+VRL RAM  W 
Subjt:  EKKEFYDGGDGGKRPPSGGGGSG--------DSSSGSEDDSLGGIIDETLQVILATIGFIFLYIYIISGEELTRLAKDYIKFVFGGSKSVRLKRAMYKWG

Query:  RFYQKLTEKKQYDEYWLEKAIINTPTWWDHPDKYRRAVMDYMESQYENQHSASNVND
         F +K++ ++ YDEYWLEKAIINTPTW+D P+KYRR +  Y++S  +  +  SN ++
Subjt:  RFYQKLTEKKQYDEYWLEKAIINTPTWWDHPDKYRRAVMDYMESQYENQHSASNVND

AT3G59640.1 glycine-rich protein (e value: 5.2e-32; identity: 40.08%)
Query:  MSSMQITATQNSICSSRSICIASKSIYPSFRATR-----SRSALVNLSANASYFKQGLPVLKYKHRRAGLNHQHTPIVSLFGSKGKESGDG--GSPWKTF
        MSS Q    + S+  +R+      S  P   + R          +  SA++S   Q  P+  ++ R    N +  P+V L G K K +G     S W+  
Subjt:  MSSMQITATQNSICSSRSICIASKSIYPSFRATR-----SRSALVNLSANASYFKQGLPVLKYKHRRAGLNHQHTPIVSLFGSKGKESGDG--GSPWKTF

Query:  DKVVENFKKGRSVEDVLRQQIEKKEFYD------GGDGGKRPPSGGGGSGDSSSGSEDDSLGGIIDETLQVILATIGFIFLYIYIISGEELTRLAKDYIK
        +K +      +SVED+LR+QI+KK+         GG GG R    GG +G   S  ED  L    DETLQV+LAT+GFIFLY YII+GEEL RLA+DYI+
Subjt:  DKVVENFKKGRSVEDVLRQQIEKKEFYD------GGDGGKRPPSGGGGSGDSSSGSEDDSLGGIIDETLQVILATIGFIFLYIYIISGEELTRLAKDYIK

Query:  FVFGGSKSVRLKRAMYKWGRFYQKLTEKKQYDEYWLE
        ++ G  KSVRL R M  W RF++K++ KK Y+EYWL+
Subjt:  FVFGGSKSVRLKRAMYKWGRFYQKLTEKKQYDEYWLE

AT3G59640.2 glycine-rich protein (e value: 5.2e-32; identity: 40.08%)
Query:  MSSMQITATQNSICSSRSICIASKSIYPSFRATR-----SRSALVNLSANASYFKQGLPVLKYKHRRAGLNHQHTPIVSLFGSKGKESGDG--GSPWKTF
        MSS Q    + S+  +R+      S  P   + R          +  SA++S   Q  P+  ++ R    N +  P+V L G K K +G     S W+  
Subjt:  MSSMQITATQNSICSSRSICIASKSIYPSFRATR-----SRSALVNLSANASYFKQGLPVLKYKHRRAGLNHQHTPIVSLFGSKGKESGDG--GSPWKTF

Query:  DKVVENFKKGRSVEDVLRQQIEKKEFYD------GGDGGKRPPSGGGGSGDSSSGSEDDSLGGIIDETLQVILATIGFIFLYIYIISGEELTRLAKDYIK
        +K +      +SVED+LR+QI+KK+         GG GG R    GG +G   S  ED  L    DETLQV+LAT+GFIFLY YII+GEEL RLA+DYI+
Subjt:  DKVVENFKKGRSVEDVLRQQIEKKEFYD------GGDGGKRPPSGGGGSGDSSSGSEDDSLGGIIDETLQVILATIGFIFLYIYIISGEELTRLAKDYIK

Query:  FVFGGSKSVRLKRAMYKWGRFYQKLTEKKQYDEYWLE
        ++ G  KSVRL R M  W RF++K++ KK Y+EYWL+
Subjt:  FVFGGSKSVRLKRAMYKWGRFYQKLTEKKQYDEYWLE
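
The % identity values listed for the hits relate to the alignment midlines: a letter in the midline marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. The sketch below counts identical columns for a short fragment taken from the Cucumis sativus alignment. The function name `midline_identity` is hypothetical, and the result covers only this fragment; the reported values are computed over the full alignment.

```python
from typing import Tuple


def midline_identity(query: str, midline: str) -> Tuple[int, int, float]:
    """Return (identical columns, alignment length, percent identity)."""
    # Trailing spaces of a midline are often stripped; pad to the query length.
    midline = midline.ljust(len(query))
    identical = sum(ch.isalpha() for ch in midline)
    length = len(query)
    return identical, length, 100.0 * identical / length


# Fragment of a Query line and its midline (first column is a mismatch).
query = "KKGRSVEDVLRQQIEKKEFY"
midline = " KGRSVEDVLRQQIEKKEFY"
print(midline_identity(query, midline))  # (19, 20, 95.0)
```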


Sequences
CDS sequence
ATGATTCATAGTTCATACTATGATATTGAAGCTTGGACATCAGGGTGCTGGAGCTTGGCATGCGATTATAGAGGTGTTTATCAAAGGATGAGCAGCATGCAGATA
ACTGCTACACAGAATTCTATTTGTTCTAGTAGATCAATATGCATTGCTTCTAAGTCAATATATCCATCATTTCGAGCTACTCGCTCCCGTAGTGCTCTTGTGAAC
CTAAGTGCCAATGCATCTTATTTCAAGCAAGGTCTACCAGTATTGAAGTATAAACATCGGAGGGCTGGATTAAATCACCAGCATACCCCAATTGTTTCCTTATTT
GGTAGCAAGGGAAAGGAAAGTGGTGATGGGGGTTCTCCGTGGAAAACTTTTGACAAAGTTGTTGAAAATTTTAAGAAGGGGCGATCAGTAGAAGATGTATTGCGA
CAGCAAATCGAAAAAAAAGAGTTCTATGATGGTGGAGATGGTGGCAAAAGACCTCCAAGTGGTGGCGGTGGCAGCGGGGATAGCTCTAGCGGATCTGAGGATGAT
AGCCTTGGAGGAATTATTGATGAAACACTGCAAGTGATTTTGGCGACCATCGGCTTTATTTTCTTGTACATTTACATCATTAGCGGGGAAGAGCTGACCCGATTA
GCGAAGGATTACATAAAGTTTGTATTCGGAGGAAGCAAGAGCGTCCGATTGAAGCGAGCCATGTACAAATGGGGAAGGTTTTACCAGAAACTGACTGAGAAGAAG
CAGTATGATGAATACTGGCTGGAGAAGGCTATTATCAACACTCCAACTTGGTGGGATCATCCTGACAAGTACAGGCGTGCTGTAATGGATTATATGGAGTCCCAG
TATGAGAATCAGCATTCTGCATCAAATGTAAATGATGATGCAGAAATGGATGTCTCAAATTCTGACGATGAAACATAG
mRNA sequence
ATGATTCATAGTTCATACTATGATATTGAAGCTTGGACATCAGGGTGCTGGAGCTTGGCATGCGATTATAGAGGTGTTTATCAAAGGATGAGCAGCATGCAGATA
ACTGCTACACAGAATTCTATTTGTTCTAGTAGATCAATATGCATTGCTTCTAAGTCAATATATCCATCATTTCGAGCTACTCGCTCCCGTAGTGCTCTTGTGAAC
CTAAGTGCCAATGCATCTTATTTCAAGCAAGGTCTACCAGTATTGAAGTATAAACATCGGAGGGCTGGATTAAATCACCAGCATACCCCAATTGTTTCCTTATTT
GGTAGCAAGGGAAAGGAAAGTGGTGATGGGGGTTCTCCGTGGAAAACTTTTGACAAAGTTGTTGAAAATTTTAAGAAGGGGCGATCAGTAGAAGATGTATTGCGA
CAGCAAATCGAAAAAAAAGAGTTCTATGATGGTGGAGATGGTGGCAAAAGACCTCCAAGTGGTGGCGGTGGCAGCGGGGATAGCTCTAGCGGATCTGAGGATGAT
AGCCTTGGAGGAATTATTGATGAAACACTGCAAGTGATTTTGGCGACCATCGGCTTTATTTTCTTGTACATTTACATCATTAGCGGGGAAGAGCTGACCCGATTA
GCGAAGGATTACATAAAGTTTGTATTCGGAGGAAGCAAGAGCGTCCGATTGAAGCGAGCCATGTACAAATGGGGAAGGTTTTACCAGAAACTGACTGAGAAGAAG
CAGTATGATGAATACTGGCTGGAGAAGGCTATTATCAACACTCCAACTTGGTGGGATCATCCTGACAAGTACAGGCGTGCTGTAATGGATTATATGGAGTCCCAG
TATGAGAATCAGCATTCTGCATCAAATGTAAATGATGATGCAGAAATGGATGTCTCAAATTCTGACGATGAAACATAG
Protein sequence
MIHSSYYDIEAWTSGCWSLACDYRGVYQRMSSMQITATQNSICSSRSICIASKSIYPSFRATRSRSALVNLSANASYFKQGLPVLKYKHRRAGLNHQHTPIVSLF
GSKGKESGDGGSPWKTFDKVVENFKKGRSVEDVLRQQIEKKEFYDGGDGGKRPPSGGGGSGDSSSGSEDDSLGGIIDETLQVILATIGFIFLYIYIISGEELTRL
AKDYIKFVFGGSKSVRLKRAMYKWGRFYQKLTEKKQYDEYWLEKAIINTPTWWDHPDKYRRAVMDYMESQYENQHSASNVNDDAEMDVSNSDDET
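
The protein sequence corresponds to a codon-by-codon translation of the CDS. The following is a minimal sketch of that translation, assuming the standard genetic code; the `translate` helper and the way the codon table is built are illustrative, not the database's own pipeline. It is applied only to the first 30 nucleotides and the last 15 nucleotides of the CDS shown above.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (assumption:
# the standard nuclear code applies to this gene).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


print(translate("ATGATTCATAGTTCATACTATGATATTGAA"))  # MIHSSYYDIE
print(translate("GACGATGAAACATAG"))                 # DDET
```

Both fragments agree with the start (MIHSSYYDIE) and end (DDET) of the protein sequence above.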