CuGenDBv2

Lag0025147 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0025147
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: glycine-rich protein
Genome location: chr10:9079585..9084283
RNA-Seq Expression: Lag0025147
Synteny: Lag0025147
Gene Ontology terms: GO:0016021 - integral component of membrane (cellular component)
InterPro domains: NA


Homology
GenBank top hits (e value, % identity, alignment)
KGN65868.1 hypothetical protein Csa_023343 [Cucumis sativus]; e value: 1.2e-102; % identity: 73.68
Query:  MSSMQITATQNSICFNKSICLVSKSI----HASQSRSVLVNLSANASRFKQGLPVLKYKHRRVGLKHQHTPIVSLFGSKGKDSGDGGSPWKAFDKVVENF
        MSSMQITATQNSIC NKSICLVSKSI    HA+QSR  +VNLSANAS FKQGLPVLKY+HRRVGLK+QHTPIVSL+GSKGK S DGGSPWK  DKVVE+F
Subjt:  MSSMQITATQNSICFNKSICLVSKSI----HASQSRSVLVNLSANASRFKQGLPVLKYKHRRVGLKHQHTPIVSLFGSKGKDSGDGGSPWKAFDKVVENF

Query:  KGRSVEDVLRQQIEKKEFYDGGDGGKKPPSGGGSGGG--------DSSSGSEDDSLAGIMEETLQVVLATIGFIFLYIYIISGEELTRLAKDYIKYLFGG
        KGRSVEDVLRQQIEKKEFYDGGDGGK+PP GGG  GG        DSSSGSED SL GIM+E LQV+LAT+G +F+YIYI+SGEEL+RLAKDYIKYLFGG
Subjt:  KGRSVEDVLRQQIEKKEFYDGGDGGKKPPSGGGSGGG--------DSSSGSEDDSLAGIMEETLQVVLATIGFIFLYIYIISGEELTRLAKDYIKYLFGG

Query:  SKSVRLKRSMYQWGRFYQKLTEKKQYDEYWLEKAILNTPTWWDHPDKYRRTVMAYIESQYQKERSASASYDDLESDESISDDEEI
        SKSVRLKR+MY WG+FYQ L +KK+YD+YWLEKAIL+TPTWWD+PDKY        ++Q QK+  AS  YD  E+D   SD  EI
Subjt:  SKSVRLKRSMYQWGRFYQKLTEKKQYDEYWLEKAILNTPTWWDHPDKYRRTVMAYIESQYQKERSASASYDDLESDESISDDEEI

XP_008444591.1 PREDICTED: uncharacterized protein LOC103487859 [Cucumis melo]; e value: 3.0e-104; % identity: 73.17
Query:  MSSMQITATQNSICFNKSICLVSKSI----HASQSRSVLVNLSANASRFKQGLPVLKYKHRRVGLKHQHTPIVSLFGSKGKDSGDGGSPWKAFDKVVENF
        MSSMQITATQNSIC NKSICLVSKSI    HA+QS   +VNLSANAS FKQGLP+LKYKHRRVGLKHQHTPIVSLFGSKGK S DGGSPWKAFDKVVE+F
Subjt:  MSSMQITATQNSICFNKSICLVSKSI----HASQSRSVLVNLSANASRFKQGLPVLKYKHRRVGLKHQHTPIVSLFGSKGKDSGDGGSPWKAFDKVVENF

Query:  -KGRSVEDVLRQQIEKKEFYDGGDGGKKPPSGGGSGGG----------DSSSGSEDDSLAGIMEETLQVVLATIGFIFLYIYIISGEELTRLAKDYIKYL
         KG SVEDVLR+QIEKKEFYDGGDGG++PPSGGG GGG          DSSSG++D SLA  ++ETLQVVLAT+GFIF+Y Y+++GEE+TRL KDYIKY 
Subjt:  -KGRSVEDVLRQQIEKKEFYDGGDGGKKPPSGGGSGGG----------DSSSGSEDDSLAGIMEETLQVVLATIGFIFLYIYIISGEELTRLAKDYIKYL

Query:  FGGSKSVRLKRSMYQWGRFYQKLTEKKQYDEYWLEKAILNTPTWWDHPDKYRRTVMAYIESQYQKERSASASYDDLESDESISDDEE
        FGGSKSVRL+R+MY+WGRFYQ+LT KK+YDE+WLEKAI+NTPTWWDHPD YR   MAY +++ Q++  AS   DD E+D    DDEE
Subjt:  FGGSKSVRLKRSMYQWGRFYQKLTEKKQYDEYWLEKAILNTPTWWDHPDKYRRTVMAYIESQYQKERSASASYDDLESDESISDDEE

XP_022140099.1 uncharacterized protein LOC111010834 [Momordica charantia]; e value: 1.5e-119; % identity: 83.7
Query:  MSSMQITATQNSICFNKSICLVSKSIH----ASQSRSVLVNLSANASRFKQGLPVLKYKHRRVGLKHQHTPIVSLFGSKGKDSGDGGSPWKAFDKVVENF
        MSSMQITATQNSIC ++SIC+ SKSI+    A++SRS LVNLSANAS FKQGLPVLKYKHRR GL HQHTPIVSLFGSKGK+SGDGGSPWK FDKVVENF
Subjt:  MSSMQITATQNSICFNKSICLVSKSIH----ASQSRSVLVNLSANASRFKQGLPVLKYKHRRVGLKHQHTPIVSLFGSKGKDSGDGGSPWKAFDKVVENF

Query:  -KGRSVEDVLRQQIEKKEFYDGGDGGKKPPSGGGSGGGDSSSGSEDDSLAGIMEETLQVVLATIGFIFLYIYIISGEELTRLAKDYIKYLFGGSKSVRLK
         KGRSVEDVLRQQIEKKEFYDGGDGGK+PPSGGG G GDSSSGSEDDSL GI++ETLQV+LATIGFIFLYIYIISGEELTRLAKDYIK++FGGSKSVRLK
Subjt:  -KGRSVEDVLRQQIEKKEFYDGGDGGKKPPSGGGSGGGDSSSGSEDDSLAGIMEETLQVVLATIGFIFLYIYIISGEELTRLAKDYIKYLFGGSKSVRLK

Query:  RSMYQWGRFYQKLTEKKQYDEYWLEKAILNTPTWWDHPDKYRRTVMAYIESQYQKERSASASYDDLESDESISDDE
        R+MY+WGRFYQKLTEKKQYDEYWLEKAI+NTPTWWDHPDKYRR VM Y+ESQY+ + SAS   DD E D S SDDE
Subjt:  RSMYQWGRFYQKLTEKKQYDEYWLEKAILNTPTWWDHPDKYRRTVMAYIESQYQKERSASASYDDLESDESISDDE

XP_022994855.1 uncharacterized protein LOC111490456 [Cucurbita maxima]; e value: 8.3e-99; % identity: 70.03
Query:  SSMQITATQNSICFNKSICLVSKSIH----ASQSRSVLVNLSANASRFKQGLPVLKYKHRRVGLKHQHTPIVSLFGSKGKDSGDGGSPWKAFDKVVENF-
        S MQITATQNS+C NKS+CLVSKS +    ASQ+RS  VN SAN S  K+GLPVLKY HRRVGLKH++TPI SLFGSKGKD+GDGGSPWKAFDKVVENF 
Subjt:  SSMQITATQNSICFNKSICLVSKSIH----ASQSRSVLVNLSANASRFKQGLPVLKYKHRRVGLKHQHTPIVSLFGSKGKDSGDGGSPWKAFDKVVENF-

Query:  KGRSVEDVLRQQIEKKEFYDGGDGGKKPP-SGGGSGGGDSSSGSEDDSLAGIMEETLQVVLATIGFIFLYIYIISGEELTRLAKDYIKYLFGGSKSVRLK
        KGRSVED+LRQQIE K+FYDGGDGG+ PP  GGGS GGDSSS SED ++ GI+EET+ VVLATIG + +YIYII G+EL  LAKDYIKYLFG  +S RLK
Subjt:  KGRSVEDVLRQQIEKKEFYDGGDGGKKPP-SGGGSGGGDSSSGSEDDSLAGIMEETLQVVLATIGFIFLYIYIISGEELTRLAKDYIKYLFGGSKSVRLK

Query:  RSMYQWGRFYQKLTEKKQY-DEYWLEKAILNTPTWWDHPDKYRRTVMAYIESQYQKE---------RSASASYDDLESDESISDDEE
         +MY WG+FY++ T KKQ  DEYWLEKAILNTPTWWDHPDKYR  +M Y+ESQ Q+E          S+S+SYDD E +ES SDDE+
Subjt:  RSMYQWGRFYQKLTEKKQY-DEYWLEKAILNTPTWWDHPDKYRRTVMAYIESQYQKE---------RSASASYDDLESDESISDDEE

XP_038895689.1 uncharacterized protein LOC120083861 [Benincasa hispida]; e value: 2.6e-124; % identity: 87.81
Query:  MSSMQITATQNSICFNKSICLVSKSI----HASQSRSVLVNLSANASRFKQGLPVLKYKHRRVGLKHQHTPIVSLFGSKGKDSGDGGSPWKAFDKVVENF
        MSSMQITATQNSIC NKSICLVSKSI    HASQSRSVLVNLSAN S FKQGLPVLKYKHRRVGLKHQHTPIVSLFGSKGKD+GDGGSPWKAFD+VVENF
Subjt:  MSSMQITATQNSICFNKSICLVSKSI----HASQSRSVLVNLSANASRFKQGLPVLKYKHRRVGLKHQHTPIVSLFGSKGKDSGDGGSPWKAFDKVVENF

Query:  -KGRSVEDVLRQQIEKKEFYDGGDGGKKPPS-GGGSGGGDSSSGSEDDSLAGIMEETLQVVLATIGFIFLYIYIISGEELTRLAKDYIKYLFGGSKSVRL
         KGRSVEDVLRQQIEKKEFYDGG+GGK+PPS GGGSG GDSSSGSEDDSLAGI++ETLQVVLAT+GFIFLYIYII+GEEL RLAKDYIKYLFGGSKSVRL
Subjt:  -KGRSVEDVLRQQIEKKEFYDGGDGGKKPPS-GGGSGGGDSSSGSEDDSLAGIMEETLQVVLATIGFIFLYIYIISGEELTRLAKDYIKYLFGGSKSVRL

Query:  KRSMYQWGRFYQKLTEKKQYDEYWLEKAILNTPTWWDHPDKYRRTVMAYIESQYQKERSASASYDDLESDESISDDEEI
        +RSMYQWGRFYQKLTEKKQYDEYWLEKAILNTPTWWDHPD YRRTVMA+IESQ+QKE  AS  Y   E D+  SDDEEI
Subjt:  KRSMYQWGRFYQKLTEKKQYDEYWLEKAILNTPTWWDHPDKYRRTVMAYIESQYQKERSASASYDDLESDESISDDEEI

TrEMBL top hits (e value, % identity, alignment)
A0A0A0LVP5 Uncharacterized protein; e value: 6.0e-103; % identity: 73.68
Query:  MSSMQITATQNSICFNKSICLVSKSI----HASQSRSVLVNLSANASRFKQGLPVLKYKHRRVGLKHQHTPIVSLFGSKGKDSGDGGSPWKAFDKVVENF
        MSSMQITATQNSIC NKSICLVSKSI    HA+QSR  +VNLSANAS FKQGLPVLKY+HRRVGLK+QHTPIVSL+GSKGK S DGGSPWK  DKVVE+F
Subjt:  MSSMQITATQNSICFNKSICLVSKSI----HASQSRSVLVNLSANASRFKQGLPVLKYKHRRVGLKHQHTPIVSLFGSKGKDSGDGGSPWKAFDKVVENF

Query:  KGRSVEDVLRQQIEKKEFYDGGDGGKKPPSGGGSGGG--------DSSSGSEDDSLAGIMEETLQVVLATIGFIFLYIYIISGEELTRLAKDYIKYLFGG
        KGRSVEDVLRQQIEKKEFYDGGDGGK+PP GGG  GG        DSSSGSED SL GIM+E LQV+LAT+G +F+YIYI+SGEEL+RLAKDYIKYLFGG
Subjt:  KGRSVEDVLRQQIEKKEFYDGGDGGKKPPSGGGSGGG--------DSSSGSEDDSLAGIMEETLQVVLATIGFIFLYIYIISGEELTRLAKDYIKYLFGG

Query:  SKSVRLKRSMYQWGRFYQKLTEKKQYDEYWLEKAILNTPTWWDHPDKYRRTVMAYIESQYQKERSASASYDDLESDESISDDEEI
        SKSVRLKR+MY WG+FYQ L +KK+YD+YWLEKAIL+TPTWWD+PDKY        ++Q QK+  AS  YD  E+D   SD  EI
Subjt:  SKSVRLKRSMYQWGRFYQKLTEKKQYDEYWLEKAILNTPTWWDHPDKYRRTVMAYIESQYQKERSASASYDDLESDESISDDEEI

A0A1S3BA69 uncharacterized protein LOC103487859; e value: 1.4e-104; % identity: 73.17
Query:  MSSMQITATQNSICFNKSICLVSKSI----HASQSRSVLVNLSANASRFKQGLPVLKYKHRRVGLKHQHTPIVSLFGSKGKDSGDGGSPWKAFDKVVENF
        MSSMQITATQNSIC NKSICLVSKSI    HA+QS   +VNLSANAS FKQGLP+LKYKHRRVGLKHQHTPIVSLFGSKGK S DGGSPWKAFDKVVE+F
Subjt:  MSSMQITATQNSICFNKSICLVSKSI----HASQSRSVLVNLSANASRFKQGLPVLKYKHRRVGLKHQHTPIVSLFGSKGKDSGDGGSPWKAFDKVVENF

Query:  -KGRSVEDVLRQQIEKKEFYDGGDGGKKPPSGGGSGGG----------DSSSGSEDDSLAGIMEETLQVVLATIGFIFLYIYIISGEELTRLAKDYIKYL
         KG SVEDVLR+QIEKKEFYDGGDGG++PPSGGG GGG          DSSSG++D SLA  ++ETLQVVLAT+GFIF+Y Y+++GEE+TRL KDYIKY 
Subjt:  -KGRSVEDVLRQQIEKKEFYDGGDGGKKPPSGGGSGGG----------DSSSGSEDDSLAGIMEETLQVVLATIGFIFLYIYIISGEELTRLAKDYIKYL

Query:  FGGSKSVRLKRSMYQWGRFYQKLTEKKQYDEYWLEKAILNTPTWWDHPDKYRRTVMAYIESQYQKERSASASYDDLESDESISDDEE
        FGGSKSVRL+R+MY+WGRFYQ+LT KK+YDE+WLEKAI+NTPTWWDHPD YR   MAY +++ Q++  AS   DD E+D    DDEE
Subjt:  FGGSKSVRLKRSMYQWGRFYQKLTEKKQYDEYWLEKAILNTPTWWDHPDKYRRTVMAYIESQYQKERSASASYDDLESDESISDDEE

A0A6J1CES8 uncharacterized protein LOC111010834; e value: 7.1e-120; % identity: 83.7
Query:  MSSMQITATQNSICFNKSICLVSKSIH----ASQSRSVLVNLSANASRFKQGLPVLKYKHRRVGLKHQHTPIVSLFGSKGKDSGDGGSPWKAFDKVVENF
        MSSMQITATQNSIC ++SIC+ SKSI+    A++SRS LVNLSANAS FKQGLPVLKYKHRR GL HQHTPIVSLFGSKGK+SGDGGSPWK FDKVVENF
Subjt:  MSSMQITATQNSICFNKSICLVSKSIH----ASQSRSVLVNLSANASRFKQGLPVLKYKHRRVGLKHQHTPIVSLFGSKGKDSGDGGSPWKAFDKVVENF

Query:  -KGRSVEDVLRQQIEKKEFYDGGDGGKKPPSGGGSGGGDSSSGSEDDSLAGIMEETLQVVLATIGFIFLYIYIISGEELTRLAKDYIKYLFGGSKSVRLK
         KGRSVEDVLRQQIEKKEFYDGGDGGK+PPSGGG G GDSSSGSEDDSL GI++ETLQV+LATIGFIFLYIYIISGEELTRLAKDYIK++FGGSKSVRLK
Subjt:  -KGRSVEDVLRQQIEKKEFYDGGDGGKKPPSGGGSGGGDSSSGSEDDSLAGIMEETLQVVLATIGFIFLYIYIISGEELTRLAKDYIKYLFGGSKSVRLK

Query:  RSMYQWGRFYQKLTEKKQYDEYWLEKAILNTPTWWDHPDKYRRTVMAYIESQYQKERSASASYDDLESDESISDDE
        R+MY+WGRFYQKLTEKKQYDEYWLEKAI+NTPTWWDHPDKYRR VM Y+ESQY+ + SAS   DD E D S SDDE
Subjt:  RSMYQWGRFYQKLTEKKQYDEYWLEKAILNTPTWWDHPDKYRRTVMAYIESQYQKERSASASYDDLESDESISDDE

A0A6J1GSZ4 uncharacterized protein LOC111457207; e value: 7.6e-98; % identity: 68.01
Query:  SSMQITATQNSICFNKSICLVSKSIH----ASQSRSVLVNLSANASRFKQGLPVLKYKHRRVGLKHQHTPIVSLFGSKGKDSGDGGSPWKAFDKVVENF-
        S MQITATQNS+C NKSICLVSKS +    ASQ+RS  VN SANAS  K+GLPVLKY HRRVGLKH++TPI SLFGSKGKD+ DGGSPWKAFDKVVENF 
Subjt:  SSMQITATQNSICFNKSICLVSKSIH----ASQSRSVLVNLSANASRFKQGLPVLKYKHRRVGLKHQHTPIVSLFGSKGKDSGDGGSPWKAFDKVVENF-

Query:  KGRSVEDVLRQQIEKKEFYDGGDGGKKPPSGGGSG---GGDSSSGSEDDSLAGIMEETLQVVLATIGFIFLYIYIISGEELTRLAKDYIKYLFGGSKSVR
        KGRSVED+LRQQIE K+FYDGGDGG+ PP GGG G   GGDSSS SED S+ GI+EET+ VVLATIG + +YIYII G+EL  LAKDYIKYLFG  +S R
Subjt:  KGRSVEDVLRQQIEKKEFYDGGDGGKKPPSGGGSG---GGDSSSGSEDDSLAGIMEETLQVVLATIGFIFLYIYIISGEELTRLAKDYIKYLFGGSKSVR

Query:  LKRSMYQWGRFYQKLTEKK-QYDEYWLEKAILNTPTWWDHPDKYRRTVMAYIESQYQKE-----------------RSASASYDDLESDESISDDEE
        LK +MY WG+FY++ T+KK + DEYWLEKAILNTPTWWDHPDKYR  +M Y+ESQ Q E                  S+S+SYDD E +ES SDDE+
Subjt:  LKRSMYQWGRFYQKLTEKK-QYDEYWLEKAILNTPTWWDHPDKYRRTVMAYIESQYQKE-----------------RSASASYDDLESDESISDDEE

A0A6J1K2H4 uncharacterized protein LOC111490456; e value: 4.0e-99; % identity: 70.03
Query:  SSMQITATQNSICFNKSICLVSKSIH----ASQSRSVLVNLSANASRFKQGLPVLKYKHRRVGLKHQHTPIVSLFGSKGKDSGDGGSPWKAFDKVVENF-
        S MQITATQNS+C NKS+CLVSKS +    ASQ+RS  VN SAN S  K+GLPVLKY HRRVGLKH++TPI SLFGSKGKD+GDGGSPWKAFDKVVENF 
Subjt:  SSMQITATQNSICFNKSICLVSKSIH----ASQSRSVLVNLSANASRFKQGLPVLKYKHRRVGLKHQHTPIVSLFGSKGKDSGDGGSPWKAFDKVVENF-

Query:  KGRSVEDVLRQQIEKKEFYDGGDGGKKPP-SGGGSGGGDSSSGSEDDSLAGIMEETLQVVLATIGFIFLYIYIISGEELTRLAKDYIKYLFGGSKSVRLK
        KGRSVED+LRQQIE K+FYDGGDGG+ PP  GGGS GGDSSS SED ++ GI+EET+ VVLATIG + +YIYII G+EL  LAKDYIKYLFG  +S RLK
Subjt:  KGRSVEDVLRQQIEKKEFYDGGDGGKKPP-SGGGSGGGDSSSGSEDDSLAGIMEETLQVVLATIGFIFLYIYIISGEELTRLAKDYIKYLFGGSKSVRLK

Query:  RSMYQWGRFYQKLTEKKQY-DEYWLEKAILNTPTWWDHPDKYRRTVMAYIESQYQKE---------RSASASYDDLESDESISDDEE
         +MY WG+FY++ T KKQ  DEYWLEKAILNTPTWWDHPDKYR  +M Y+ESQ Q+E          S+S+SYDD E +ES SDDE+
Subjt:  RSMYQWGRFYQKLTEKKQY-DEYWLEKAILNTPTWWDHPDKYRRTVMAYIESQYQKE---------RSASASYDDLESDESISDDEE

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
AT2G43630.1 FUNCTIONS IN: molecular_function unknown; e value: 3.0e-54; % identity: 50.85
Query:  LSANASRFKQGLPVLKYKHRRVGLKHQHTPIVSLFGSKGK-DSGDGGSPWKAFDKVVENFKGRSVEDVLRQQIEKKEFYDGGDGGKKPPSGGGSGGGDSS
        L A A+   Q  P+L ++ R    K + +  V LFG K K D  D  SPWKA +K +     +SVED+LR+QI+KK+FYD   GG  PP GGGSGGG  +
Subjt:  LSANASRFKQGLPVLKYKHRRVGLKHQHTPIVSLFGSKGK-DSGDGGSPWKAFDKVVENFKGRSVEDVLRQQIEKKEFYDGGDGGKKPPSGGGSGGGDSS

Query:  -------SGSEDDSLAGIMEETLQVVLATIGFIFLYIYIISGEELTRLAKDYIKYLFGGSKSVRLKRSMYQWGRFYQKLTEKKQYDEYWLEKAILNTPTW
               SG ED  LAGI +ETLQVVLAT+GFIFLY YII+GEEL +LA+DYI++L G  K+VRL R+M  W  F +K++ ++ YDEYWLEKAI+NTPTW
Subjt:  -------SGSEDDSLAGIMEETLQVVLATIGFIFLYIYIISGEELTRLAKDYIKYLFGGSKSVRLKRSMYQWGRFYQKLTEKKQYDEYWLEKAILNTPTW

Query:  WDHPDKYRRTVMAYIESQYQKERSASASYDDLESDE
        +D P+KYRR + AY++S      ++  +Y +  SDE
Subjt:  WDHPDKYRRTVMAYIESQYQKERSASASYDDLESDE

AT3G59640.1 glycine-rich protein; e value: 1.3e-33; % identity: 46.15
Query:  VNLSANASRFKQGLPVLKYKHRRVGLKHQHTPIVSLFGSKGKDSGDG--GSPWKAFDKVVENFKGRSVEDVLRQQIEKKEFYDGGDGGKKPPSGGGSGGG
        +  SA++S   Q  P+  ++ R      +  P+V L G K K +G     S W+A +K +     +SVED+LR+QI+KK      D G  PP G G GGG
Subjt:  VNLSANASRFKQGLPVLKYKHRRVGLKHQHTPIVSLFGSKGKDSGDG--GSPWKAFDKVVENFKGRSVEDVLRQQIEKKEFYDGGDGGKKPPSGGGSGGG

Query:  DSSSGS--------EDDSLAGIMEETLQVVLATIGFIFLYIYIISGEELTRLAKDYIKYLFGGSKSVRLKRSMYQWGRFYQKLTEKKQYDEYWLE
          + G+        ED  LA   +ETLQVVLAT+GFIFLY YII+GEEL RLA+DYI+YL G  KSVRL R M  W RF++K++ KK Y+EYWL+
Subjt:  DSSSGS--------EDDSLAGIMEETLQVVLATIGFIFLYIYIISGEELTRLAKDYIKYLFGGSKSVRLKRSMYQWGRFYQKLTEKKQYDEYWLE

AT3G59640.2 glycine-rich protein; e value: 1.3e-33; % identity: 46.15
Query:  VNLSANASRFKQGLPVLKYKHRRVGLKHQHTPIVSLFGSKGKDSGDG--GSPWKAFDKVVENFKGRSVEDVLRQQIEKKEFYDGGDGGKKPPSGGGSGGG
        +  SA++S   Q  P+  ++ R      +  P+V L G K K +G     S W+A +K +     +SVED+LR+QI+KK      D G  PP G G GGG
Subjt:  VNLSANASRFKQGLPVLKYKHRRVGLKHQHTPIVSLFGSKGKDSGDG--GSPWKAFDKVVENFKGRSVEDVLRQQIEKKEFYDGGDGGKKPPSGGGSGGG

Query:  DSSSGS--------EDDSLAGIMEETLQVVLATIGFIFLYIYIISGEELTRLAKDYIKYLFGGSKSVRLKRSMYQWGRFYQKLTEKKQYDEYWLE
          + G+        ED  LA   +ETLQVVLAT+GFIFLY YII+GEEL RLA+DYI+YL G  KSVRL R M  W RF++K++ KK Y+EYWL+
Subjt:  DSSSGS--------EDDSLAGIMEETLQVVLATIGFIFLYIYIISGEELTRLAKDYIKYLFGGSKSVRLKRSMYQWGRFYQKLTEKKQYDEYWLE


Sequences
CDS sequence
ATGGGAGGGGGAAGGAAGAAGATGGAGGGAGAAAAAAGCACACAGAAAACCCCCAAGCGGATTATCGTCGATTTCTCCCCCGCCACCGCTCCACCGTCGACGCCGGCGGC
AGAAGCTGCCGGCCGTTTGCTGTCTTTCGAGGCGACTCGAAGTTTGGCTCCTCTCCTAGCCCATTTGCGCGACAAAAACGGCGGAGCCGCAACTGTTTCGAGCCGAAGCC
GCTGTCTCCTTCCTCTGCCAGCCCATTCGCGCGACTGGACGTTGATATCACTTGCACCTCTCTCTCAAATCCCATCTATTTTTTCTTCGCCTCAGACTCATAAAGCCCGC
ATCTTGCTTCCAACCCGCCATGAGCCATCGAAGGATTCTACTGGGAACTCTTGGCCGGCTGCTTTAGGGTGCTGGAGCTTGGCATGTGATTACAAAGGTGTTTGTCAAAG
TATGAGCAGCATGCAGATAACTGCTACACAGAATTCTATTTGTTTCAATAAATCAATATGCCTTGTTTCTAAGTCAATACATGCTAGTCAGTCACGTAGTGTTCTTGTGA
ACCTGAGTGCCAATGCATCTCGTTTCAAGCAAGGTCTACCTGTGTTGAAGTATAAACATCGGAGGGTTGGATTGAAACATCAGCATACCCCAATTGTTTCCTTATTTGGT
AGCAAGGGAAAGGACAGTGGCGATGGGGGTTCTCCATGGAAAGCTTTCGACAAAGTTGTTGAAAATTTTAAGGGACGGTCAGTAGAAGATGTATTGCGACAGCAAATTGA
GAAAAAAGAGTTCTATGATGGTGGAGATGGTGGCAAAAAACCCCCAAGTGGTGGCGGCAGTGGTGGCGGGGATAGCTCTAGCGGATCAGAGGATGATAGCCTTGCAGGAA
TAATGGAAGAAACACTGCAAGTGGTTTTGGCGACCATTGGCTTTATTTTCTTGTACATTTACATCATTAGTGGGGAAGAACTGACCCGATTAGCGAAGGATTACATAAAG
TATCTATTTGGAGGAAGCAAGAGTGTTCGTTTGAAGCGATCGATGTACCAATGGGGAAGGTTTTACCAAAAACTCACTGAAAAGAAGCAATATGATGAATACTGGCTGGA
GAAAGCTATTCTAAACACCCCAACTTGGTGGGATCATCCTGATAAGTACAGGCGTACTGTAATGGCTTATATTGAGTCCCAGTATCAGAAAGAGCGTTCTGCATCTGCAT
CATATGATGATTTGGAAAGTGATGAGTCGATTTCTGATGATGAGGAAATCTGA
mRNA sequence
ATGGGAGGGGGAAGGAAGAAGATGGAGGGAGAAAAAAGCACACAGAAAACCCCCAAGCGGATTATCGTCGATTTCTCCCCCGCCACCGCTCCACCGTCGACGCCGGCGGC
AGAAGCTGCCGGCCGTTTGCTGTCTTTCGAGGCGACTCGAAGTTTGGCTCCTCTCCTAGCCCATTTGCGCGACAAAAACGGCGGAGCCGCAACTGTTTCGAGCCGAAGCC
GCTGTCTCCTTCCTCTGCCAGCCCATTCGCGCGACTGGACGTTGATATCACTTGCACCTCTCTCTCAAATCCCATCTATTTTTTCTTCGCCTCAGACTCATAAAGCCCGC
ATCTTGCTTCCAACCCGCCATGAGCCATCGAAGGATTCTACTGGGAACTCTTGGCCGGCTGCTTTAGGGTGCTGGAGCTTGGCATGTGATTACAAAGGTGTTTGTCAAAG
TATGAGCAGCATGCAGATAACTGCTACACAGAATTCTATTTGTTTCAATAAATCAATATGCCTTGTTTCTAAGTCAATACATGCTAGTCAGTCACGTAGTGTTCTTGTGA
ACCTGAGTGCCAATGCATCTCGTTTCAAGCAAGGTCTACCTGTGTTGAAGTATAAACATCGGAGGGTTGGATTGAAACATCAGCATACCCCAATTGTTTCCTTATTTGGT
AGCAAGGGAAAGGACAGTGGCGATGGGGGTTCTCCATGGAAAGCTTTCGACAAAGTTGTTGAAAATTTTAAGGGACGGTCAGTAGAAGATGTATTGCGACAGCAAATTGA
GAAAAAAGAGTTCTATGATGGTGGAGATGGTGGCAAAAAACCCCCAAGTGGTGGCGGCAGTGGTGGCGGGGATAGCTCTAGCGGATCAGAGGATGATAGCCTTGCAGGAA
TAATGGAAGAAACACTGCAAGTGGTTTTGGCGACCATTGGCTTTATTTTCTTGTACATTTACATCATTAGTGGGGAAGAACTGACCCGATTAGCGAAGGATTACATAAAG
TATCTATTTGGAGGAAGCAAGAGTGTTCGTTTGAAGCGATCGATGTACCAATGGGGAAGGTTTTACCAAAAACTCACTGAAAAGAAGCAATATGATGAATACTGGCTGGA
GAAAGCTATTCTAAACACCCCAACTTGGTGGGATCATCCTGATAAGTACAGGCGTACTGTAATGGCTTATATTGAGTCCCAGTATCAGAAAGAGCGTTCTGCATCTGCAT
CATATGATGATTTGGAAAGTGATGAGTCGATTTCTGATGATGAGGAAATCTGA
Protein sequence
MGGGRKKMEGEKSTQKTPKRIIVDFSPATAPPSTPAAEAAGRLLSFEATRSLAPLLAHLRDKNGGAATVSSRSRCLLPLPAHSRDWTLISLAPLSQIPSIFSSPQTHKAR
ILLPTRHEPSKDSTGNSWPAALGCWSLACDYKGVCQSMSSMQITATQNSICFNKSICLVSKSIHASQSRSVLVNLSANASRFKQGLPVLKYKHRRVGLKHQHTPIVSLFG
SKGKDSGDGGSPWKAFDKVVENFKGRSVEDVLRQQIEKKEFYDGGDGGKKPPSGGGSGGGDSSSGSEDDSLAGIMEETLQVVLATIGFIFLYIYIISGEELTRLAKDYIK
YLFGGSKSVRLKRSMYQWGRFYQKLTEKKQYDEYWLEKAILNTPTWWDHPDKYRRTVMAYIESQYQKERSASASYDDLESDESISDDEEI
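
The CDS and protein sequences listed above can be checked against each other by translating the CDS. The following is a minimal sketch, not part of the database record: it assumes the standard genetic code (NCBI translation table 1) and reading frame 0, and the names `translate`, `cds_start`, and `cds_end` are hypothetical. Only the first and last few codons of the CDS are used here to keep the example short.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
# Assumption: this record uses the standard nuclear code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and whitespace
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # X for unknown codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First 13 codons and last 6 codons of the CDS sequence listed above.
    cds_start = "ATGGGAGGGGGAAGGAAGAAGATGGAGGGAGAAAAAAGC"
    cds_end = "GATGATGAGGAAATCTGA"
    print(translate(cds_start))  # MGGGRKKMEGEKS
    print(translate(cds_end))    # DDEEI
```

Both fragments agree with the start (`MGGGRKKMEGEKS...`) and end (`...DDEEI`) of the protein sequence above. To check the whole record, the full CDS can be passed to `translate` as a single string; line breaks are ignored.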