; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MC10g0278 (gene) of Bitter gourd (Dali-11) v1 genome

Gene IDMC10g0278
OrganismMomordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
DescriptionIndole-3-glycerol phosphate synthase
Genome locationMC10:2282584..2292196
RNA-Seq ExpressionMC10g0278
SyntenyMC10g0278
Gene Ontology termsGO:0016567 - protein ubiquitination (biological process)
GO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR038920 - BTB/POZ domain-containing protein


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004137277.1 uncharacterized protein LOC101222931 [Cucumis sativus]5.95e-19175.32Show/hide
Query:  MTMAEPSPATMAPLLLRNLITSLFIFADKSLLNLAKKYKLLDLIHCLAVSSFLFFLRLLPSLFPSIDSVSDDRSPLKPPKNATYAAPYSGSRDLGISRAL
        M +AE SP T+APLLLRNL TSLF+FADKSL+NL+KKYKLL LIH L +SSFLFFLRLLPSLFPSI +VSDD  PLK PK+ +Y     GS DLG+SRAL
Subjt:  MTMAEPSPATMAPLLLRNLITSLFIFADKSLLNLAKKYKLLDLIHCLAVSSFLFFLRLLPSLFPSIDSVSDDRSPLKPPKNATYAAPYSGSRDLGISRAL

Query:  TQLLSIINHVPVSSRKYEVVRSLAEKLIDENHREGIEELREVNRAVLSAAFDGTINQIEAAMVDRGFRQVVDDDDDGGGGGGGGGSGPVDFRLSRVVRAV
        TQLLSII+H+PVSSRKYEVVRSLAEKLIDENH EGIEELREVNR VLS AFD +I  IEA M++RGF Q  +D ++GGGGG  GG  PV+F L RVVRAV
Subjt:  TQLLSIINHVPVSSRKYEVVRSLAEKLIDENHREGIEELREVNRAVLSAAFDGTINQIEAAMVDRGFRQVVDDDDDGGGGGGGGGSGPVDFRLSRVVRAV

Query:  RSFGDSARSRFGRTKEGGNHGGGSAEKLAAEALWLAQKMASCGCGNDACTRLASAAQLGRLALSAEPRLQGSLIKVAAFLLKQSSEMEKDEDGEESENQQ
        R  G+SA SRFGR +E GN  G S EKLAAE LWLAQKM SCG GN+ C R ASA QLGRL+LSAEPRLQ SL+KVA FL KQ  EM KDED EES  QQ
Subjt:  RSFGDSARSRFGRTKEGGNHGGGSAEKLAAEALWLAQKMASCGCGNDACTRLASAAQLGRLALSAEPRLQGSLIKVAAFLLKQSSEMEKDEDGEESENQQ

Query:  HKQTKLRMLISWLPLLCRGSNGTDAPVLSLGERCEVELVLEEMIGTLQQDEQEQVLALWLHHFTYSSSSDWPNLHASYARWYGASRKLLIHQD
          Q KL+MLISWLPLLCRGS+GTDAP+LS+GER E+EL LEEMIGTLQQDEQEQVLALWLH+FTY SSSDWPNLHASYARWY ASRKLLIHQD
Subjt:  HKQTKLRMLISWLPLLCRGSNGTDAPVLSLGERCEVELVLEEMIGTLQQDEQEQVLALWLHHFTYSSSSDWPNLHASYARWYGASRKLLIHQD

XP_008462861.1 PREDICTED: uncharacterized protein LOC103501143 [Cucumis melo]1.98e-19777.72Show/hide
Query:  MTMAEPSPATMAPLLLRNLITSLFIFADKSLLNLAKKYKLLDLIHCLAVSSFLFFLRLLPSLFPSIDSVSDDRSPLKPPKNATYAAPY--SGSRDLGISR
        M +AE SP T+APLLLRNL TSLF+FADKSL+NLAKKYK+L +IH L +SSFLFFLRLLPSLFPSI SVSDDR PLKPPK  +Y      SGS DLG+SR
Subjt:  MTMAEPSPATMAPLLLRNLITSLFIFADKSLLNLAKKYKLLDLIHCLAVSSFLFFLRLLPSLFPSIDSVSDDRSPLKPPKNATYAAPY--SGSRDLGISR

Query:  ALTQLLSIINHVPVSSRKYEVVRSLAEKLIDENHREGIEELREVNRAVLSAAFDGTINQIEAAMVDRGFRQVVDDDDDGGGGGGGGGSGPVDFRLSRVVR
        ALTQLLSII+HVPVSSRKYEVVRSLAEKLIDENH EGIEELREVNR VLSAAFD TI  IEA M++RGF Q  +D+D G GGGGG   GPV+F L RVVR
Subjt:  ALTQLLSIINHVPVSSRKYEVVRSLAEKLIDENHREGIEELREVNRAVLSAAFDGTINQIEAAMVDRGFRQVVDDDDDGGGGGGGGGSGPVDFRLSRVVR

Query:  AVRSFGDSARSRFGRTKEGGNHGGGSAEKLAAEALWLAQKMASCGCGNDACTRLASAAQLGRLALSAEPRLQGSLIKVAAFLLKQSSEMEKDEDGEESEN
        AVR  G+SA SRFGR KE GN  G S EKLAAE LWLAQKMASCG  N+ C R ASAAQLGRL+LSAEPRLQ SL+KVA FL KQ  EM KDEDGEESE 
Subjt:  AVRSFGDSARSRFGRTKEGGNHGGGSAEKLAAEALWLAQKMASCGCGNDACTRLASAAQLGRLALSAEPRLQGSLIKVAAFLLKQSSEMEKDEDGEESEN

Query:  QQHKQTKLRMLISWLPLLCRGSNGTDAPVLSLGERCEVELVLEEMIGTLQQDEQEQVLALWLHHFTYSSSSDWPNLHASYARWYGASRKLLIHQD
        QQ  QTKL+MLISWLPLLCRGSNGTDAP+LS+GER E+EL LEEMIGTLQQDEQEQVLALWLH+FTYSS SDWPNLHASYARWY ASRKLLI +D
Subjt:  QQHKQTKLRMLISWLPLLCRGSNGTDAPVLSLGERCEVELVLEEMIGTLQQDEQEQVLALWLHHFTYSSSSDWPNLHASYARWYGASRKLLIHQD

XP_022942592.1 uncharacterized protein LOC111447583 [Cucurbita moschata]7.39e-18573.86Show/hide
Query:  MTMAEPSPATMAPLLLRNLITSLFIFADKSLLNLAKKYKLLDLIHCLAVSSFLFFLRLLPSLFPSIDSVSDDRSPLKPPKNATYAAPYSGSRDLGISRAL
        M +AE SP TMAPLLLRNL+TSLF FADK L+NL+KK+KLL++IHCL+VS F FFLR LPSLFPSI  VSDDR  LKPPK  +Y    SGS DLG+SRAL
Subjt:  MTMAEPSPATMAPLLLRNLITSLFIFADKSLLNLAKKYKLLDLIHCLAVSSFLFFLRLLPSLFPSIDSVSDDRSPLKPPKNATYAAPYSGSRDLGISRAL

Query:  TQLLSIINHVPVSSRKYEVVRSLAEKLIDENHREGIEELREVNRAVLSAAFDGTINQIEAAMVDRGFRQVVDDDDDGGGGGGGGGSGPVDFRLSRVVRAV
        TQLLSII+HV VSSRKYEVVRSLAEKLIDENHREGIEEL EVNRAVLS AFD TI QIEAAM+ +GF    D+D+DG        SGPV+F L+RVVRAV
Subjt:  TQLLSIINHVPVSSRKYEVVRSLAEKLIDENHREGIEELREVNRAVLSAAFDGTINQIEAAMVDRGFRQVVDDDDDGGGGGGGGGSGPVDFRLSRVVRAV

Query:  RSFGDSARSRFGRTKEGGNHGGGSAEKLAAEALWLAQKMASCGCGNDACTRLASAAQLGRLALSAEPRLQGSLIKVAAFLLKQSSEMEKDEDGEESENQQ
         S       R G  K+G N  G SAEKLAAE LWLA KMASCGCG +AC R ASAAQLGRL+LSAEPRLQGSL++VAAF+ KQS EM KDE+ EE E+++
Subjt:  RSFGDSARSRFGRTKEGGNHGGGSAEKLAAEALWLAQKMASCGCGNDACTRLASAAQLGRLALSAEPRLQGSLIKVAAFLLKQSSEMEKDEDGEESENQQ

Query:  HKQTKLRMLISWLPLLCRGSNGTDAPVLSLGERCEVELVLEEMIGTLQQDEQEQVLALWLHHFTYSSSSDWPNLHASYARWYGASRKLLIHQDH
        H QTKL+MLISWLPLLCRGSNGTDAPVLS+GER EVELVL EMIGTLQ DEQEQVLA+WLHHFTYS+SSDWPNLHASYA WY ASR L+IHQ H
Subjt:  HKQTKLRMLISWLPLLCRGSNGTDAPVLSLGERCEVELVLEEMIGTLQQDEQEQVLALWLHHFTYSSSSDWPNLHASYARWYGASRKLLIHQDH

XP_023546996.1 uncharacterized protein LOC111805927 [Cucurbita pepo subsp. pepo]7.94e-18573.74Show/hide
Query:  MTMAEPSPATMAPLLLRNLITSLFIFADKSLLNLAKKYKLLDLIHCLAVSSFLFFLRLLPSLFPSIDSVSDDRSPLKPPKNATY--AAPYSGSRDLGISR
        M +AE SP TMAPLLLRNL+TSLF FADK L+NL+KK+KLL++IHCL+VS F FFLR LPSLFPSI  VSDDR PLKPPK  +Y  +   SGS DLG+SR
Subjt:  MTMAEPSPATMAPLLLRNLITSLFIFADKSLLNLAKKYKLLDLIHCLAVSSFLFFLRLLPSLFPSIDSVSDDRSPLKPPKNATY--AAPYSGSRDLGISR

Query:  ALTQLLSIINHVPVSSRKYEVVRSLAEKLIDENHREGIEELREVNRAVLSAAFDGTINQIEAAMVDRGFRQVVDDDDDGGGGGGGGGSGPVDFRLSRVVR
        ALTQLLSII+HV VSSRKYEVVRSLAEKLIDEN REGIEELREVNRAVLS AFD TI+QIEAAM+ +GF    D+D+DG        SGPV+F L+RVVR
Subjt:  ALTQLLSIINHVPVSSRKYEVVRSLAEKLIDENHREGIEELREVNRAVLSAAFDGTINQIEAAMVDRGFRQVVDDDDDGGGGGGGGGSGPVDFRLSRVVR

Query:  AVRSFGDSARSRFGRTKEGGNHGGGSAEKLAAEALWLAQKMASCGCGNDACTRLASAAQLGRLALSAEPRLQGSLIKVAAFLLKQSSEMEKDEDGEESEN
        AV S       R G  K+G N  G SAEKLAAE LWLA+KMASCGCG +AC R ASAAQLGRL+LSAEPRLQGSL++VAAF+ KQS EM KDE+ EE E+
Subjt:  AVRSFGDSARSRFGRTKEGGNHGGGSAEKLAAEALWLAQKMASCGCGNDACTRLASAAQLGRLALSAEPRLQGSLIKVAAFLLKQSSEMEKDEDGEESEN

Query:  QQHKQTKLRMLISWLPLLCRGSNGTDAPVLSLGERCEVELVLEEMIGTLQQDEQEQVLALWLHHFTYSSSSDWPNLHASYARWYGASRKLLIHQDH
        ++H QTKL+MLISWLPLLCRGSNGTDAPVLS+GER EVELVL EMIGTLQ DEQEQVLA+WLHHFTYS+SSDWPNLHASYA WY ASR L+IHQ H
Subjt:  QQHKQTKLRMLISWLPLLCRGSNGTDAPVLSLGERCEVELVLEEMIGTLQQDEQEQVLALWLHHFTYSSSSDWPNLHASYARWYGASRKLLIHQDH

XP_038876704.1 uncharacterized protein LOC120069090 [Benincasa hispida]2.48e-20378.7Show/hide
Query:  MTMAEPSPATMAPLLLRNLITSLFIFADKSLLNLAKKYKLLDLIHCLAVSSFLFFLRLLPSLFPSIDSVSDDRSPLKPPKNATYAAPY------SGSRDL
        M +AE SP TMAPLLLRNL TSLF+FADK L+NL+KKYKLL++IH L +SSFLFFLRLLPSLFPSI  VSDDR PLKPPK  +Y +        SGS DL
Subjt:  MTMAEPSPATMAPLLLRNLITSLFIFADKSLLNLAKKYKLLDLIHCLAVSSFLFFLRLLPSLFPSIDSVSDDRSPLKPPKNATYAAPY------SGSRDL

Query:  GISRALTQLLSIINHVPVSSRKYEVVRSLAEKLIDENHREGIEELREVNRAVLSAAFDGTINQIEAAMVDRGFRQVVDDDDDGGGGGGGGGSGPVDFRLS
        GISRALTQLLSII+HVPVSSRKYEVVRSLAEKLIDENHREGIEELREVNR VLSAAF  TI QIEA M++RGF Q   DD+DGGGGGG  G GPV+F L 
Subjt:  GISRALTQLLSIINHVPVSSRKYEVVRSLAEKLIDENHREGIEELREVNRAVLSAAFDGTINQIEAAMVDRGFRQVVDDDDDGGGGGGGGGSGPVDFRLS

Query:  RVVRAVRSFGDSARSRFGRTKEGGNHGGGSAEKLAAEALWLAQKMASCGCGNDACTRLASAAQLGRLALSAEPRLQGSLIKVAAFLLKQSSEMEKDEDGE
        +VVRAVR  G+SA SRFGR KEG N  G S EKLAAE LWLAQKMASCGC N+ C R ASAAQLGRL+LSAEPRLQ SL+KVAAFL KQ  EM KDEDGE
Subjt:  RVVRAVRSFGDSARSRFGRTKEGGNHGGGSAEKLAAEALWLAQKMASCGCGNDACTRLASAAQLGRLALSAEPRLQGSLIKVAAFLLKQSSEMEKDEDGE

Query:  ESENQQHKQTKLRMLISWLPLLCRGSNGTDAPVLSLGERCEVELVLEEMIGTLQQDEQEQVLALWLHHFTYSSSSDWPNLHASYARWYGASRKLLIHQD
        ESE QQ  QTKL+MLISWLPLLCRGSNGTD P+LS+GER E+ELVLEEMIGTLQQD+QEQVLALWLHHFTYSSSSDWPNLHASYARWY ASRKLLIHQD
Subjt:  ESENQQHKQTKLRMLISWLPLLCRGSNGTDAPVLSLGERCEVELVLEEMIGTLQQDEQEQVLALWLHHFTYSSSSDWPNLHASYARWYGASRKLLIHQD

TrEMBL top hitse value%identityAlignment
A0A0A0KXY7 Uncharacterized protein2.88e-19175.32Show/hide
Query:  MTMAEPSPATMAPLLLRNLITSLFIFADKSLLNLAKKYKLLDLIHCLAVSSFLFFLRLLPSLFPSIDSVSDDRSPLKPPKNATYAAPYSGSRDLGISRAL
        M +AE SP T+APLLLRNL TSLF+FADKSL+NL+KKYKLL LIH L +SSFLFFLRLLPSLFPSI +VSDD  PLK PK+ +Y     GS DLG+SRAL
Subjt:  MTMAEPSPATMAPLLLRNLITSLFIFADKSLLNLAKKYKLLDLIHCLAVSSFLFFLRLLPSLFPSIDSVSDDRSPLKPPKNATYAAPYSGSRDLGISRAL

Query:  TQLLSIINHVPVSSRKYEVVRSLAEKLIDENHREGIEELREVNRAVLSAAFDGTINQIEAAMVDRGFRQVVDDDDDGGGGGGGGGSGPVDFRLSRVVRAV
        TQLLSII+H+PVSSRKYEVVRSLAEKLIDENH EGIEELREVNR VLS AFD +I  IEA M++RGF Q  +D ++GGGGG  GG  PV+F L RVVRAV
Subjt:  TQLLSIINHVPVSSRKYEVVRSLAEKLIDENHREGIEELREVNRAVLSAAFDGTINQIEAAMVDRGFRQVVDDDDDGGGGGGGGGSGPVDFRLSRVVRAV

Query:  RSFGDSARSRFGRTKEGGNHGGGSAEKLAAEALWLAQKMASCGCGNDACTRLASAAQLGRLALSAEPRLQGSLIKVAAFLLKQSSEMEKDEDGEESENQQ
        R  G+SA SRFGR +E GN  G S EKLAAE LWLAQKM SCG GN+ C R ASA QLGRL+LSAEPRLQ SL+KVA FL KQ  EM KDED EES  QQ
Subjt:  RSFGDSARSRFGRTKEGGNHGGGSAEKLAAEALWLAQKMASCGCGNDACTRLASAAQLGRLALSAEPRLQGSLIKVAAFLLKQSSEMEKDEDGEESENQQ

Query:  HKQTKLRMLISWLPLLCRGSNGTDAPVLSLGERCEVELVLEEMIGTLQQDEQEQVLALWLHHFTYSSSSDWPNLHASYARWYGASRKLLIHQD
          Q KL+MLISWLPLLCRGS+GTDAP+LS+GER E+EL LEEMIGTLQQDEQEQVLALWLH+FTY SSSDWPNLHASYARWY ASRKLLIHQD
Subjt:  HKQTKLRMLISWLPLLCRGSNGTDAPVLSLGERCEVELVLEEMIGTLQQDEQEQVLALWLHHFTYSSSSDWPNLHASYARWYGASRKLLIHQD

A0A1S3CHW5 uncharacterized protein LOC1035011439.58e-19877.72Show/hide
Query:  MTMAEPSPATMAPLLLRNLITSLFIFADKSLLNLAKKYKLLDLIHCLAVSSFLFFLRLLPSLFPSIDSVSDDRSPLKPPKNATYAAPY--SGSRDLGISR
        M +AE SP T+APLLLRNL TSLF+FADKSL+NLAKKYK+L +IH L +SSFLFFLRLLPSLFPSI SVSDDR PLKPPK  +Y      SGS DLG+SR
Subjt:  MTMAEPSPATMAPLLLRNLITSLFIFADKSLLNLAKKYKLLDLIHCLAVSSFLFFLRLLPSLFPSIDSVSDDRSPLKPPKNATYAAPY--SGSRDLGISR

Query:  ALTQLLSIINHVPVSSRKYEVVRSLAEKLIDENHREGIEELREVNRAVLSAAFDGTINQIEAAMVDRGFRQVVDDDDDGGGGGGGGGSGPVDFRLSRVVR
        ALTQLLSII+HVPVSSRKYEVVRSLAEKLIDENH EGIEELREVNR VLSAAFD TI  IEA M++RGF Q  +D+D G GGGGG   GPV+F L RVVR
Subjt:  ALTQLLSIINHVPVSSRKYEVVRSLAEKLIDENHREGIEELREVNRAVLSAAFDGTINQIEAAMVDRGFRQVVDDDDDGGGGGGGGGSGPVDFRLSRVVR

Query:  AVRSFGDSARSRFGRTKEGGNHGGGSAEKLAAEALWLAQKMASCGCGNDACTRLASAAQLGRLALSAEPRLQGSLIKVAAFLLKQSSEMEKDEDGEESEN
        AVR  G+SA SRFGR KE GN  G S EKLAAE LWLAQKMASCG  N+ C R ASAAQLGRL+LSAEPRLQ SL+KVA FL KQ  EM KDEDGEESE 
Subjt:  AVRSFGDSARSRFGRTKEGGNHGGGSAEKLAAEALWLAQKMASCGCGNDACTRLASAAQLGRLALSAEPRLQGSLIKVAAFLLKQSSEMEKDEDGEESEN

Query:  QQHKQTKLRMLISWLPLLCRGSNGTDAPVLSLGERCEVELVLEEMIGTLQQDEQEQVLALWLHHFTYSSSSDWPNLHASYARWYGASRKLLIHQD
        QQ  QTKL+MLISWLPLLCRGSNGTDAP+LS+GER E+EL LEEMIGTLQQDEQEQVLALWLH+FTYSS SDWPNLHASYARWY ASRKLLI +D
Subjt:  QQHKQTKLRMLISWLPLLCRGSNGTDAPVLSLGERCEVELVLEEMIGTLQQDEQEQVLALWLHHFTYSSSSDWPNLHASYARWYGASRKLLIHQD

A0A4U5QCH6 Uncharacterized protein2.25e-14460.9Show/hide
Query:  MTMAEPSPATMAPLLLRNLITSLFIFADKSLLNLAKKYKLLDLIHCLAVSSFLFFLRLLPSLFPSIDSVSDDRS--------PLKPPKNATYAAPYSGSR
        M M E SP T+APLL+RN+ T++FIFADKSL+ LA+KYKLL+ I  L V+SFLFFLRLLPSLFPS++    D +        PLKPPK+  Y  P S   
Subjt:  MTMAEPSPATMAPLLLRNLITSLFIFADKSLLNLAKKYKLLDLIHCLAVSSFLFFLRLLPSLFPSIDSVSDDRS--------PLKPPKNATYAAPYSGSR

Query:  DLGISRALTQLLSIINHVPVSSRKYEVVRSLAEKLIDENHREGIEELREVNRAVLSAAFDGTINQIEAAMVDRGFRQVVDDDDDGGGGGGGGGSGPVDFR
        D GI+RALTQLLSI+N +PVSSRKYE+VRSLAEKLID+NH+E  E LREVNR VLSAAF  T++Q+EAAM++ G           GG  GG  +GPV+ R
Subjt:  DLGISRALTQLLSIINHVPVSSRKYEVVRSLAEKLIDENHREGIEELREVNRAVLSAAFDGTINQIEAAMVDRGFRQVVDDDDDGGGGGGGGGSGPVDFR

Query:  LSRVVRAVRSFGDSARSRFGRTKEGGNHGGGSAEKLAAEALWLAQKMASCGCGNDACTRLASAAQLGRLALSAEPRLQGSLIKVAAFLLKQSSEMEKDED
        L+RV++AVR+  D + +RFGR  EG +    SAEKLAAE LWL QK+A+CGCG +A  R ASA+ +  LALSAE RLQGSL+KV+AFL KQ+ E+  +E 
Subjt:  LSRVVRAVRSFGDSARSRFGRTKEGGNHGGGSAEKLAAEALWLAQKMASCGCGNDACTRLASAAQLGRLALSAEPRLQGSLIKVAAFLLKQSSEMEKDED

Query:  GEESENQQHKQTKLRMLISWLPLLCRGSNGTDAPVLSLGERCEVELVLEEMIGTLQ-QDEQEQVLALWLHHFTYSSSSDWPNLHASYARWYGASRKLLI
         EE + +Q +QT ++M++SWLPLLCR SNG+DAPVLS+ ER E+E  LEEMI  L+ ++EQE+VL+LWLHHFTYS SSDWPNLHASYARW  ASRKLLI
Subjt:  GEESENQQHKQTKLRMLISWLPLLCRGSNGTDAPVLSLGERCEVELVLEEMIGTLQ-QDEQEQVLALWLHHFTYSSSSDWPNLHASYARWYGASRKLLI

A0A6J1FQP8 uncharacterized protein LOC1114475833.58e-18573.86Show/hide
Query:  MTMAEPSPATMAPLLLRNLITSLFIFADKSLLNLAKKYKLLDLIHCLAVSSFLFFLRLLPSLFPSIDSVSDDRSPLKPPKNATYAAPYSGSRDLGISRAL
        M +AE SP TMAPLLLRNL+TSLF FADK L+NL+KK+KLL++IHCL+VS F FFLR LPSLFPSI  VSDDR  LKPPK  +Y    SGS DLG+SRAL
Subjt:  MTMAEPSPATMAPLLLRNLITSLFIFADKSLLNLAKKYKLLDLIHCLAVSSFLFFLRLLPSLFPSIDSVSDDRSPLKPPKNATYAAPYSGSRDLGISRAL

Query:  TQLLSIINHVPVSSRKYEVVRSLAEKLIDENHREGIEELREVNRAVLSAAFDGTINQIEAAMVDRGFRQVVDDDDDGGGGGGGGGSGPVDFRLSRVVRAV
        TQLLSII+HV VSSRKYEVVRSLAEKLIDENHREGIEEL EVNRAVLS AFD TI QIEAAM+ +GF    D+D+DG        SGPV+F L+RVVRAV
Subjt:  TQLLSIINHVPVSSRKYEVVRSLAEKLIDENHREGIEELREVNRAVLSAAFDGTINQIEAAMVDRGFRQVVDDDDDGGGGGGGGGSGPVDFRLSRVVRAV

Query:  RSFGDSARSRFGRTKEGGNHGGGSAEKLAAEALWLAQKMASCGCGNDACTRLASAAQLGRLALSAEPRLQGSLIKVAAFLLKQSSEMEKDEDGEESENQQ
         S       R G  K+G N  G SAEKLAAE LWLA KMASCGCG +AC R ASAAQLGRL+LSAEPRLQGSL++VAAF+ KQS EM KDE+ EE E+++
Subjt:  RSFGDSARSRFGRTKEGGNHGGGSAEKLAAEALWLAQKMASCGCGNDACTRLASAAQLGRLALSAEPRLQGSLIKVAAFLLKQSSEMEKDEDGEESENQQ

Query:  HKQTKLRMLISWLPLLCRGSNGTDAPVLSLGERCEVELVLEEMIGTLQQDEQEQVLALWLHHFTYSSSSDWPNLHASYARWYGASRKLLIHQDH
        H QTKL+MLISWLPLLCRGSNGTDAPVLS+GER EVELVL EMIGTLQ DEQEQVLA+WLHHFTYS+SSDWPNLHASYA WY ASR L+IHQ H
Subjt:  HKQTKLRMLISWLPLLCRGSNGTDAPVLSLGERCEVELVLEEMIGTLQQDEQEQVLALWLHHFTYSSSSDWPNLHASYARWYGASRKLLIHQDH

A0A6J1ISQ7 uncharacterized protein LOC1114780719.23e-18072.73Show/hide
Query:  MTMAEPSPATMAPLLLRNLITSLFIFADKSLLNLAKKYKLLDLIHCLAVSSFLFFLRLLPSLFPSIDSVSDDRSPLKPPKNATY--AAPYSGSRDLGISR
        M +AE SP TMAPLLLRNL+TSLF FADK L++L+KK+KLL++IHCL+VS FLFFLR LP  FP+I  VSDDR PLK PK  +Y  +   SGS DLGISR
Subjt:  MTMAEPSPATMAPLLLRNLITSLFIFADKSLLNLAKKYKLLDLIHCLAVSSFLFFLRLLPSLFPSIDSVSDDRSPLKPPKNATY--AAPYSGSRDLGISR

Query:  ALTQLLSIINHVPVSSRKYEVVRSLAEKLIDENHREGIEELREVNRAVLSAAFDGTINQIEAAMVDRGFRQVVDDDDDGGGGGGGGGSGPVDFRLSRVVR
        ALTQLLSII+HV +SSRKYEVVRSLAEKLIDENHREGIEELREVNRAVLS AFD TI QIEAAM+ +GFR   D+D+DG        SGPV+F L+RVVR
Subjt:  ALTQLLSIINHVPVSSRKYEVVRSLAEKLIDENHREGIEELREVNRAVLSAAFDGTINQIEAAMVDRGFRQVVDDDDDGGGGGGGGGSGPVDFRLSRVVR

Query:  AVRSFGDSARSRFGRTKEGGNHGGGSAEKLAAEALWLAQKMASCGCGNDACTRLASAAQLGRLALSAEPRLQGSLIKVAAFLLKQSSEMEKDEDGEESEN
        AV S       R G  K+G N  G SAEKLAAE LWLA KMASCGCG +AC R ASAAQLGRL+LSAEPRLQGSL++VAAF+ KQS EM     G+ESE 
Subjt:  AVRSFGDSARSRFGRTKEGGNHGGGSAEKLAAEALWLAQKMASCGCGNDACTRLASAAQLGRLALSAEPRLQGSLIKVAAFLLKQSSEMEKDEDGEESEN

Query:  QQHKQTKLRMLISWLPLLCRGSNGTDAPVLSLGERCEVELVLEEMIGTLQQDEQEQVLALWLHHFTYSSSSDWPNLHASYARWYGASRKLLIHQDH
          H QTKL+MLISWLPLLCRGSNGTDAPVLS+GER EVELVL EMIGTLQ+DEQEQVLA+WLHHFTYS+SSDWPNLHASYA WY ASR L+IHQ H
Subjt:  QQHKQTKLRMLISWLPLLCRGSNGTDAPVLSLGERCEVELVLEEMIGTLQQDEQEQVLALWLHHFTYSSSSDWPNLHASYARWYGASRKLLIHQDH

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G19920.1 unknown protein5.5e-3432.16Show/hide
Query:  NATYAAPYSGSRDLG--------------ISRALTQLLSIINHVPVSSRKYEVVRSLAEKLIDENHREGIEELREVNRAVLSAAFDGTINQIEAAMVDRG
        ++T    YSGS DLG              + RAL   L+++N +PV+SRKY+    +AEK++++N + G  +L +VNRA L+++F  T  +++  +    
Subjt:  NATYAAPYSGSRDLG--------------ISRALTQLLSIINHVPVSSRKYEVVRSLAEKLIDENHREGIEELREVNRAVLSAAFDGTINQIEAAMVDRG

Query:  FRQVVDDDDDGG-------GGGGGGGSGPVDFRLSRVVRAVRSFGDSARSRFGRTKE--------GG---NHGGGSAEKLAAEALWLAQKMASCGCGNDA
         R    D+  GG           GG        LS  +  VRS  D   +   +T+         GG   N    + EKLA E LW+ +K+   G   + 
Subjt:  FRQVVDDDDDGG-------GGGGGGGSGPVDFRLSRVVRAVRSFGDSARSRFGRTKE--------GG---NHGGGSAEKLAAEALWLAQKMASCGCGNDA

Query:  CTRLASAAQLGRLALSAEPRLQGSLIKVAAFLLKQSSEMEKDEDGEESENQQHKQTKLRMLISWLPLLCRGSNGTDAPVLSLGERCEVELVLEEMIGTLQ
          R + A+ L  L+L+A PR+QG ++K++A L+    E+ +D        Q   Q   R+L +WLPL     NG   PVL+  ER EVE  +++ I TL 
Subjt:  CTRLASAAQLGRLALSAEPRLQGSLIKVAAFLLKQSSEMEKDEDGEESENQQHKQTKLRMLISWLPLLCRGSNGTDAPVLSLGERCEVELVLEEMIGTLQ

Query:  QDEQEQVLALWLHHFTYSSSSDWPNLHASYARWYGASRKLLI
          +QE +L  WL  F+  S+S+WPNL  +Y RW  ++R+L +
Subjt:  QDEQEQVLALWLHHFTYSSSSDWPNLHASYARWYGASRKLLI

AT5G64230.1 unknown protein6.9e-7744.53Show/hide
Query:  MAEPSPATMAPLLLRNLITSLFIFADKSLLNLAKKYKLLDLIHCLAVSSFLFFLRLLPSLF----PSIDSVSDDRSPLKPPKNATYAAPYSGSRDLGISR
        M + S A + P LLRN+I ++ +FAD+SLL ++   KLL+ +    V+ FLFFLR LPS+     P+   VS         K        +   + GI R
Subjt:  MAEPSPATMAPLLLRNLITSLFIFADKSLLNLAKKYKLLDLIHCLAVSSFLFFLRLLPSLF----PSIDSVSDDRSPLKPPKNATYAAPYSGSRDLGISR

Query:  ALTQLLSIINHVPVSSRKYEVVRSLAEKLIDENHREGIEELREVNRAVLSAAFDGTINQIEAAMVDRGFRQVVDDDDDGGGGGGGGGSGPVDFRLSRVVR
        A+ QLLS +N +PVSSRKY+VVRSLAE+LI++N  E    L ++NR VL+A+F  T++++E A+     R+ +D+              PV   L+RVVR
Subjt:  ALTQLLSIINHVPVSSRKYEVVRSLAEKLIDENHREGIEELREVNRAVLSAAFDGTINQIEAAMVDRGFRQVVDDDDDGGGGGGGGGSGPVDFRLSRVVR

Query:  A-VRSFGDSARSRFGRTKEGGNHGGGSAEKLAAEALWLAQKMASCGCGNDACTRLASAAQLGRLALSAEPRLQGSLIKVAAFLLKQSSEMEKDEDGEESE
        A VR+ GD      G  +E  +    ++EKLAAE LWLA+KMA  G  ++A  + ASA+ L  LALS EPRLQ SLI+++A L K++ +++K  + EE E
Subjt:  A-VRSFGDSARSRFGRTKEGGNHGGGSAEKLAAEALWLAQKMASCGCGNDACTRLASAAQLGRLALSAEPRLQGSLIKVAAFLLKQSSEMEKDEDGEESE

Query:  NQQHKQTKLRMLISWLPLLCRGSNGTDAPVLSLGERCEVELVLEEMIGTLQQDEQEQVLALWLHHFTYSSSSDWPNLHASYARWYGASRKLLI
          + ++ K +MLISW+PLLCR SNG D PVL   ER  +E VLE+MI  L+++EQE+VL+LWLHH+T+ +SSDWP+L+ SY RW  +SR+LL+
Subjt:  NQQHKQTKLRMLISWLPLLCRGSNGTDAPVLSLGERCEVELVLEEMIGTLQQDEQEQVLALWLHHFTYSSSSDWPNLHASYARWYGASRKLLI


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGACTATGGCCGAACCCTCTCCGGCCACCATGGCTCCGCTGCTTCTGCGGAACCTCATCACCTCCCTCTTCATCTTCGCCGACAAATCCCTCCTCAATCTGGCC
AAGAAATACAAGCTCCTCGATCTCATCCACTGCCTCGCCGTCTCCTCCTTCCTATTCTTCCTGCGTTTGCTTCCTTCTCTTTTCCCTTCCATCGATTCGGTTTCC
GATGATCGTTCCCCTCTAAAACCCCCAAAAAATGCGACCTACGCTGCGCCGTACAGCGGGAGCCGCGATTTAGGTATTTCTCGTGCCCTAACGCAGTTGCTCTCG
ATTATTAATCATGTTCCCGTCAGTTCTCGCAAGTACGAAGTTGTGCGATCGTTGGCGGAGAAGCTGATCGACGAAAATCACAGGGAAGGAATTGAGGAGTTGCGC
GAGGTGAACCGTGCGGTTCTCTCCGCGGCTTTCGATGGGACTATTAACCAAATCGAAGCTGCGATGGTCGACCGAGGGTTTCGTCAAGTCGTCGACGACGACGAC
GACGGCGGCGGCGGCGGCGGCGGCGGGGGGTCCGGACCGGTCGATTTCCGGTTGTCTAGGGTTGTGAGGGCGGTTCGGTCGTTTGGAGACTCGGCGCGAAGCCGG
TTCGGGAGGACGAAGGAGGGCGGGAATCACGGCGGAGGCTCGGCGGAGAAGCTGGCGGCGGAGGCGCTGTGGTTGGCTCAGAAAATGGCGAGTTGTGGTTGCGGC
AATGACGCGTGTACGCGGTTGGCCTCGGCGGCTCAGTTGGGCAGGCTCGCTCTCTCGGCGGAGCCGCGGCTGCAGGGCTCTTTGATTAAAGTCGCAGCATTCTTG
TTGAAGCAATCGAGCGAAATGGAAAAGGATGAAGATGGAGAAGAAAGTGAGAACCAGCAGCATAAGCAAACCAAGTTGAGGATGCTCATTTCATGGCTTCCATTG
CTATGCAGAGGCAGCAACGGCACCGATGCTCCGGTCCTGAGCCTCGGGGAACGGTGTGAGGTGGAGTTGGTGTTGGAGGAGATGATAGGGACACTGCAACAAGAC
GAACAAGAGCAGGTTTTGGCTCTGTGGCTCCATCATTTCACTTACTCCTCCTCCTCCGACTGGCCGAACCTCCACGCCTCGTATGCGCGGTGGTATGGCGCCTCC
CGTAAGCTCTTGATCCACCAGGATCATTAA
mRNA sequenceShow/hide mRNA sequence
ATGGGATCAAAATAAAAACAAATATTTTTTTTAACAACAAATGGAGGTGGAAAAATTCCAACTTACGAACTATTATAGAAAAAAACATGTATCAATTGCTGCGTA
AAACTACTTATTTGAGAACAAATTAAATTGCTGTTTAATAAAATTTCTTTATATTAATATTAAATGATGAAACTTTTAAAAAAGGCGTTACGAAAAAACAGAGGA
AGTCTCTGTTCTATTTCTATGGTTCCCCTCTCGTCTTCCGCTCCATCAGAATATATATAGAAGCAATCCCGTCCCCTCGCCGGACCTTCGCCGGTACTTCGCCGG
ACTGCCATGACTATGGCCGAACCCTCTCCGGCCACCATGGCTCCGCTGCTTCTGCGGAACCTCATCACCTCCCTCTTCATCTTCGCCGACAAATCCCTCCTCAAT
CTGGCCAAGAAATACAAGCTCCTCGATCTCATCCACTGCCTCGCCGTCTCCTCCTTCCTATTCTTCCTGCGTTTGCTTCCTTCTCTTTTCCCTTCCATCGATTCG
GTTTCCGATGATCGTTCCCCTCTAAAACCCCCAAAAAATGCGACCTACGCTGCGCCGTACAGCGGGAGCCGCGATTTAGGTATTTCTCGTGCCCTAACGCAGTTG
CTCTCGATTATTAATCATGTTCCCGTCAGTTCTCGCAAGTACGAAGTTGTGCGATCGTTGGCGGAGAAGCTGATCGACGAAAATCACAGGGAAGGAATTGAGGAG
TTGCGCGAGGTGAACCGTGCGGTTCTCTCCGCGGCTTTCGATGGGACTATTAACCAAATCGAAGCTGCGATGGTCGACCGAGGGTTTCGTCAAGTCGTCGACGAC
GACGACGACGGCGGCGGCGGCGGCGGCGGCGGGGGGTCCGGACCGGTCGATTTCCGGTTGTCTAGGGTTGTGAGGGCGGTTCGGTCGTTTGGAGACTCGGCGCGA
AGCCGGTTCGGGAGGACGAAGGAGGGCGGGAATCACGGCGGAGGCTCGGCGGAGAAGCTGGCGGCGGAGGCGCTGTGGTTGGCTCAGAAAATGGCGAGTTGTGGT
TGCGGCAATGACGCGTGTACGCGGTTGGCCTCGGCGGCTCAGTTGGGCAGGCTCGCTCTCTCGGCGGAGCCGCGGCTGCAGGGCTCTTTGATTAAAGTCGCAGCA
TTCTTGTTGAAGCAATCGAGCGAAATGGAAAAGGATGAAGATGGAGAAGAAAGTGAGAACCAGCAGCATAAGCAAACCAAGTTGAGGATGCTCATTTCATGGCTT
CCATTGCTATGCAGAGGCAGCAACGGCACCGATGCTCCGGTCCTGAGCCTCGGGGAACGGTGTGAGGTGGAGTTGGTGTTGGAGGAGATGATAGGGACACTGCAA
CAAGACGAACAAGAGCAGGTTTTGGCTCTGTGGCTCCATCATTTCACTTACTCCTCCTCCTCCGACTGGCCGAACCTCCACGCCTCGTATGCGCGGTGGTATGGC
GCCTCCCGTAAGCTCTTGATCCACCAGGATCATTAATAGTGTAACTTAAAAACAATAACCTTGTTAGAAATATGCTGTTAATTAACACCATACACTATTAGAAGT
TCAAAATATTGTAATCATTTTTCTGTTGCTATTTGAAAGCCCTATTGTATTCTACTTGTTGGTAAATGTGGGAAATGTGAAGGTTGGAAATAGTGAGTTTTATTG
TATAAAAGAATTCTCACTCAATGATTATTAGTCAAATTGATTTTAGCTGTTCTTGTGGCTTTTATATATATATATATATATATATATATATTATTTTATTAGAGA
GAATGCATTTGTTATCTTTTTTTTAAGTAGCGGTAGGAAATTCAAATTCATAATTTCTTGGTTAGAGATAACTAGAGCTAAGCTAACGTTGACAAACCACATTTC
AAAACACAACATACACGATCTAGTTTGTATATATCTTCTCTCCTCTTTTTTATTTTCTTTCTTTTGCTGCATCTGTAAAGAAGAGCACCATGACCATCTATAGTT
AGTTATCTATACACAGTCGAGAAAGCTTTGAATTTAAAAAAAAAACCTTGAATTTTGCTAATATGCAAGAAATTAAAGAAAGATATTTGATTAAACCAACCTTTA
TTTATTATACCACCACTCCTTTTTACTTAGAAAGTATCAAATTTTCATGGTGGGAATTACAAAGCACTTGAGAGAGGGAAATTAATAAAGGATAATTTCCATTAA
CTTGTTGGTCCATAAAAAGTTTAGATAACTACTTGAGCAATTTTAGCCATCCTTGGAATTGCTCCCTCACCCACAGGCTCTGCATTATAACATATCACACAGTTA
TATAATGTCAAAACTATATATTAATCACACAAGATTAAACATACCTTATATTAACCATTAATTTCTCGAGTGATTCGAAGAAGAAGAAAAAGGAACTTACCGAAA
TCGGGCACGTCTGTTGGGAGCAGCCTAACAAATGTGGGGCATGATAGTGAAAACGCGCAATTACGTGGCTGCAACGAGTAAAGATTTACGACGTTTGTCAATTGG
TCACATGCATTATATATTAATTATGAATATTTTTGATATAACCATACATTAAAATCTTTGTCAAAGCATAGCCAGATTTCCTTGACAACTCCATCTTCAGCACAT
GTAATCTTTGGCATTGCATGGAAAGCCCTTCGAAGTTCATCCAAGATGTACATAATAGGATATGGAGCAACAGAAGCTACAATTCCAGCATCATTCAAAACTTTC
TGATGGGTAACAAATAACAAAGCTTTTATAAATGTTATGCACCAAAGTTTCAACCTATCTCAAATAGAGAGGAAAAGAAAAAGGAAAAAATTACGTGTTCATATG
TACATGCTTAATTGTTAGCGCCAGCCACCGGCCACCGTTACATTTTTTATTTTTGACCAGTAGGAAAGAAAAATACTTACAAGACCATAACAAAAAAAAAAAAAA
CTGTAAAATTTTTAGTAGTGAATAGCAAGAAGGACGAATACTGATAATATTATTGAGAAAAAACATCTATTAAATAATAGAGGAGAAGGTATCTTAGGTGTTTAT
AAGTAGGCGTCTCCCTACCTACCTGTACTGAAAAAATGGCAACTATTTAATCAATTTTATATACTTCCCTTTATGCATTTGTGAGATAATAAATAATTAAATTGG
CAATATAATAAATTCTTTTTTAATTTATAAAATTTATTTTAATATTTATTCAAATTAAGTTTTTGTTTTAGTATTAGAATTTTTAAAGATTATAAATTTTGTTGT
ATCATCTTATGTTTTAAAAGAATATATTGTATTGGGTTACTTTTTGAAAATTACATTTATATTTCTTTACTATATTAATATAAAGAAAAAGAAAATTCATCAGAA
AAATAGAAATGACAATAAATAACCCTACTAAAAACTAATCAATTATATATAATTATATGTATTTATGTATGTATATGATTATTCAATATATATTATTATGTTTAT
TATATTGGATCTTGTCTTTATGCAAGTTCAATAGACCAATCAAAATAGGATCTGGATCGATCCATTTTAAAAAACAAAATTCATATTCAATCCAATTTATAGAAG
GATCGGTCTGGATGGGTGTGATGATCGAGGATGAAGCGACACACCAAATAGTCTATTGAATAGAGAGCGGTTTTATAAAAGCTTACGGTGACATCGTATTTGGTG
TAAAGGGTGGTGGCTTTGGTGAAGTACTCATATTGACCCTTGATTATCGGAGCTGCGCAGGTTCCATGCTTCTCTGCACAAACAATTGGAAAGCAAATTTTGAAA
ACCTTTTTTTCAAAATATAAACAATAATATATAAAATAAAAGAAAATACAATGATAGTAATAATAATAAATTTATTTTTATTATTAGAAGCTGTGATGGAAATAA
ATAATGATAATATTAGGTAGAAAAGAGAAAAGAGATGATTAATTGTTATAAAAAGGAAAAATATATGTCCCAAGATTTGAATCGAGTATGGAAAGGTTAGCAAAG
TAGTAATACACCCACTATACCACTAAGAGAATTTTTACATTTAAACTACTGAATATATATATATTCAGAAATCAACATCGGACCGTCCGGATATTGTCTCTTGAA
GAGACGACCATGTGAGATGGTCGTCTCTTTCTCTTCAAAAGACGACCACACTCCCGTTAAAAAAAATTGACGTGGCACCATCTAGCGCCACGTTAGATGGTGACG
ACGATCATGTAGGCAGGAAAACATGCAAATCCCATTTTTTTTAAAACATCAATAGTTTTGGAATAAGTTTCCAAAGTTGCATTTTTTACATTTTTTTTCTGATAA
ACTGCATTATTTTAAAAATAAATTTAATTAAAAACAATATTAAAAAAAATCCATAAAAAAGGAAGTGGAATTTGAAAAGACATTATAGGTAGAGATGAAAGGAGA
AAGAATAATTACCATACTGCAGCCAATGGTAAGATGCCAAGTGGTCACATTACATCATCAAATAACGCAAAAAAACACAGAAAATAAATTTGATTAGTGTCAAAT
ATAAATCTAAAATTTTGATAGATCATCAGCTATAAATAAAATTAAATAAGGAATAATAATAGGAAGAAGAATAACCTGTGCAGCCAAAAATGATGATTTTTCACC
ATTGCAGGTTGTTTCAGAATCACATCGATAACTTGGCCAAACCTCTGAAAGAATAATCCAATTCAAGGGAGAAATCTACAAGTGCAGTGTGAAGGTTAATCAATA
TCAGTGAATACTATTGGTTTAAGATGATCACTAAACATATATATAAGGGTTGTGTTTAATAAGTTCCAATGAATCCGCTACAAGAAAACGGAGTTTTAGCGACGT
TTTTCTTAACGAGAACATAAAAAAAAAAAAAAAAGACAAGTTAAAGAGAAGATTCTGATAAGAAGTTTGGTAGGAGAATGAGAGAGATACCGCTTTTGGATCGAA
TCTTTCGTCACTACAACATGAAGGCCATTTTGAGTCTTCGTCGTCGTACTCTGGCCACAGCCCATCTGTAGTGCATGCATACGCCATTGTATTTTATTATAGGTT
AAAAGGTTAAAATAGATATTTGGTCCTGTACTTTCGATTTTGGTGAAATAATAATTTAGTCTTTAAACTTTTGTTTATAACGATTGAGTTTCTTATTTTCTAATT
AGTATCAATAATTTAGTTCTTTGAGTTTTAAAACGTAATAATTTTAGTTTGTATATAGTTTGAAATTTGTAATAATTTAGTCTCTACTATGAAAAATCTCATCAA
AATTAAGTGTCTATTTTTATCATATATAACGACTTAATATTTACAATTTATAAATATATTTGATCCATCATCGGATTATCCATCTATATATATATAATAAACTTA
ACAAAAATTTTTATGATAGAGAAATTAAATCGTTACAAACTTTTTAAAGTACAAAGACTAAACTGTTACATACTAAATCTCATGAATTACATTGTTACCAATTTA
AATTTCAAGGACTAAATTGTAACGCAAACGAAATTTTAGGAACTAAATCGTTACTCCGCTGAAAGTATAGGGACCAAAAGTGTCTTTTAACCTAGGTTAAAATAC
AATTTAGGTTCTTACACTATAGCTTTGATTTATTTTTGTCCAGGTACTCTTTAAAGGGTTCATTTTGATCATGTATTTTCAGCTTTATTGGTGACTATATTTTTA
AAAAGTGACCATTTGATCCTTTAATTTTATTTTTAATCTCATTATTTCTATCCCAATTTCAACGAGAGTTTATATTAAAGTATTTTCATTGTGTGTAGATCAATT
TGAGTCAAGATCTTAAAAATATATATAGATCAAAATTGTCACTTTTCTAAAGTAGATGAACTAAAACCAAGATACGAGATGACCTACGGATAGTAAATTGATTGG
GAGCATGTGTTCTTCTGCAAACACAGGAAAAAAGGATATGGTTAAATTATATATTAACTCAAAAAAGAGAGAAGTAAATAAGTAATTCACCCATGGCTGCCAACA
TGAGCTCAACTCAGAAATAATTGACACCTATCTATGATTTAGAGGTCATAAGTTCAAATCCCCAACCTTATTTGTTATATTAAAAAAGAGAAAACTAACCCATAT
CCAGGACAGCAACCGTTGTCGGGGCAACAAGAACTTATGTTTCTGCACGAATTGCCTGGCCATTGTAGAGCCAGCACAAAGTAGTCGAACTCCGCTGCATATGCC
GCTGTCTGCAGCTGTGCAACCGCTAAAACGAACACCGCTGCCGCTGCCACCACCGCTGCTTTACCTTTCATGGACAAATAACCCATTTTTTCTTCTCTTTTCTTT
TTTCTCTGAGATCTGCCCA
Protein sequenceShow/hide protein sequence
MTMAEPSPATMAPLLLRNLITSLFIFADKSLLNLAKKYKLLDLIHCLAVSSFLFFLRLLPSLFPSIDSVSDDRSPLKPPKNATYAAPYSGSRDLGISRALTQLLS
IINHVPVSSRKYEVVRSLAEKLIDENHREGIEELREVNRAVLSAAFDGTINQIEAAMVDRGFRQVVDDDDDGGGGGGGGGSGPVDFRLSRVVRAVRSFGDSARSR
FGRTKEGGNHGGGSAEKLAAEALWLAQKMASCGCGNDACTRLASAAQLGRLALSAEPRLQGSLIKVAAFLLKQSSEMEKDEDGEESENQQHKQTKLRMLISWLPL
LCRGSNGTDAPVLSLGERCEVELVLEEMIGTLQQDEQEQVLALWLHHFTYSSSSDWPNLHASYARWYGASRKLLIHQDH