CuGenDBv2

MC06g1438 (gene) of Bitter gourd (Dali-11) v1 genome

Gene ID: MC06g1438
Organism: Momordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
Description: Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family
Genome location: MC06:21660341..21661373
RNA-Seq Expression: MC06g1438
Synteny: MC06g1438
Gene Ontology terms: GO:0016021 - integral component of membrane (cellular component)
InterPro domains: IPR004864 - Late embryogenesis abundant protein, LEA_2 subgroup


Homology
GenBank top hits (e value, % identity, alignment)
XP_008437350.1 PREDICTED: uncharacterized protein LOC103482794 [Cucumis melo]; e value: 5.90e-88; % identity: 66.67
Query:  MAEKEQVKPLAAFAPGQSRSDDDDRFLLSPAKLRLQTHKYIKCCGCFAALLLILAVVGIVLSFTVLHIKNPNIEIDTLSFSNS--GSDGEMIIVASVSVK
        MA+KEQVKPLA+ A  + RSDD   FL  P KL L  +KYIKCCGCF+ALLLILAV+GIVL FTV HIK P+I+ID+LSF N    S+  +I+VASVSV+
Subjt:  MAEKEQVKPLAAFAPGQSRSDDDDRFLLSPAKLRLQTHKYIKCCGCFAALLLILAVVGIVLSFTVLHIKNPNIEIDTLSFSNS--GSDGEMIIVASVSVK

Query:  NPNVASFKYSKTTTKIYYGGKVIGEGETPAGEAKAKETMKMNVTVAIGLEKIDDVSSLMKDLNSGSSLNINSYTKIPGRVKILGFIKKNLLVRMNCSMTY
        NPNVASFKYSK +TKIYY  KVIGEGETP GE KAK+T+KMNVTV I   KIDD SSL+KD NSG+ L+I+SYT+IPGRVK+LG IKKN LV+++CS+TY
Subjt:  NPNVASFKYSKTTTKIYYGGKVIGEGETPAGEAKAKETMKMNVTVAIGLEKIDDVSSLMKDLNSGSSLNINSYTKIPGRVKILGFIKKNLLVRMNCSMTY

Query:  NTRSQTIQGENCNQPVEVS
        N++S+TIQ ++C+Q V +S
Subjt:  NTRSQTIQGENCNQPVEVS

XP_022146047.1 uncharacterized protein LOC111015350 [Momordica charantia]; e value: 2.11e-148; % identity: 99.54
Query:  MAEKEQVKPLAAFAPGQSRSDDDDRFLLSPAKLRLQTHKYIKCCGCFAALLLILAVVGIVLSFTVLHIKNPNIEIDTLSFSNSGSDGEMIIVASVSVKNP
        MAEKEQVKPLAAF PGQSRSDDDDRFLLSPAKLRLQTHKYIKCCGCFAALLLILAVVGIVLSFTVLHIKNPNIEIDTLSFSNSGSDGEMIIVASVSVKNP
Subjt:  MAEKEQVKPLAAFAPGQSRSDDDDRFLLSPAKLRLQTHKYIKCCGCFAALLLILAVVGIVLSFTVLHIKNPNIEIDTLSFSNSGSDGEMIIVASVSVKNP

Query:  NVASFKYSKTTTKIYYGGKVIGEGETPAGEAKAKETMKMNVTVAIGLEKIDDVSSLMKDLNSGSSLNINSYTKIPGRVKILGFIKKNLLVRMNCSMTYNT
        NVASFKYSKTTTKIYYGGKVIGEGETPAGEAKAKETMKMNVTVAIGLEKIDDVSSLMKDLNSGSSLNINSYTKIPGRVKILGFIKKNLLVRMNCSMTYNT
Subjt:  NVASFKYSKTTTKIYYGGKVIGEGETPAGEAKAKETMKMNVTVAIGLEKIDDVSSLMKDLNSGSSLNINSYTKIPGRVKILGFIKKNLLVRMNCSMTYNT

Query:  RSQTIQGENCNQPVEVSD
        RSQTIQGENCNQPVEVSD
Subjt:  RSQTIQGENCNQPVEVSD

XP_022995815.1 uncharacterized protein LOC111491239 [Cucurbita maxima]; e value: 1.11e-87; % identity: 65.14
Query:  MAEKEQVKPLAAFAPGQSRSDDDDRFLLSPAKLRLQTHKYIKCCGCFAALLLILAVVGIVLSFTVLHIKNPNIEIDTLSFSNSGSDGEMIIVASVSVKNP
        M EK+QVKPLA+  P      DDD+FL  PAKLRL  +KYI C GCFAALLLILAV+GIVL FTVLHIK P+++ID LSFSN+ S+G +IIVASV V+NP
Subjt:  MAEKEQVKPLAAFAPGQSRSDDDDRFLLSPAKLRLQTHKYIKCCGCFAALLLILAVVGIVLSFTVLHIKNPNIEIDTLSFSNSGSDGEMIIVASVSVKNP

Query:  NVASFKYSKTTTKIYYGGKVIGEGETPAGEAKAKETMKMNVTVAIGLEKIDDVSSLMKDLNSGSSLNINSYTKIPGRVKILGFIKKNLLVRMNCSMTYNT
        NVASFKYSK T  IYY G +IGEGETP GEAKAK+TM MNVTV I  E++D+  SLM+DL SG  LNI+SY +IPGRVKI+GFIKK   V++ CS TYN 
Subjt:  NVASFKYSKTTTKIYYGGKVIGEGETPAGEAKAKETMKMNVTVAIGLEKIDDVSSLMKDLNSGSSLNINSYTKIPGRVKILGFIKKNLLVRMNCSMTYNT

Query:  RSQTIQGENCNQPVEVSD
        ++QTI+ E+C++ V++SD
Subjt:  RSQTIQGENCNQPVEVSD

XP_023532981.1 uncharacterized protein LOC111794996 [Cucurbita pepo subsp. pepo]; e value: 2.11e-88; % identity: 67.43
Query:  MAEKEQVKPLAAFAPGQSRSDDDDRFLLSPAKLRLQTHKYIKCCGCFAALLLILAVVGIVLSFTVLHIKNPNIEIDTLSFSNSGSDGEMIIVASVSVKNP
        MAEK+QVKPLA+  P      DDD FL  PAKLRL  +KYI C GCFAALLLILAV+GIVL FTVLHIK P+++ID LSFSN+ S+G +IIVASV V+NP
Subjt:  MAEKEQVKPLAAFAPGQSRSDDDDRFLLSPAKLRLQTHKYIKCCGCFAALLLILAVVGIVLSFTVLHIKNPNIEIDTLSFSNSGSDGEMIIVASVSVKNP

Query:  NVASFKYSKTTTKIYYGGKVIGEGETPAGEAKAKETMKMNVTVAIGLEKIDDVSSLMKDLNSGSSLNINSYTKIPGRVKILGFIKKNLLVRMNCSMTYNT
        N ASFKYSK TT IYY G+VIGEGETP GEAKAK+TM MNVTV I  E++D+  SLM+DL SG  LNI+SYT+IPGRVKI+GFIKK   V+M CS TYN 
Subjt:  NVASFKYSKTTTKIYYGGKVIGEGETPAGEAKAKETMKMNVTVAIGLEKIDDVSSLMKDLNSGSSLNINSYTKIPGRVKILGFIKKNLLVRMNCSMTYNT

Query:  RSQTIQGENCNQPVEVSD
        ++QTI+ E+C+Q V++SD
Subjt:  RSQTIQGENCNQPVEVSD

XP_038875804.1 uncharacterized protein LOC120068170 [Benincasa hispida]; e value: 2.13e-88; % identity: 66.82
Query:  MAEKEQVKPLAAFAPGQSRSDDDDRFLLSPAKLRLQTHKYIKCCGCFAALLLILAVVGIVLSFTVLHIKNPNIEIDTLSFSNSGSDGE---MIIVASVSV
        MA+KEQVKPLA+ A  + RSDDD  FL  PAKL L  +KYIK CGCFAALL+ILAV+GIVL FTVLHI+ PNI+ID+LSF NS S      + +VASVSV
Subjt:  MAEKEQVKPLAAFAPGQSRSDDDDRFLLSPAKLRLQTHKYIKCCGCFAALLLILAVVGIVLSFTVLHIKNPNIEIDTLSFSNSGSDGE---MIIVASVSV

Query:  KNPNVASFKYSKTTTKIYYGGKVIGEGETPAGEAKAKETMKMNVTVAIGLEKIDDVSSLMKDLNSGSSLNINSYTKIPGRVKILGFIKKNLLVRMNCSMT
        +NPNVASFKYSK +T+IYY G VIGEGETP GE KAK+T+KMN+TV I  EKIDD SSL+KD N G+ LNI+SYT+IPGRVKILG IKK+ LV++ CS+T
Subjt:  KNPNVASFKYSKTTTKIYYGGKVIGEGETPAGEAKAKETMKMNVTVAIGLEKIDDVSSLMKDLNSGSSLNINSYTKIPGRVKILGFIKKNLLVRMNCSMT

Query:  YNTRSQTIQGENCNQPVEVS
        +N+RS+ IQG++C+Q V +S
Subjt:  YNTRSQTIQGENCNQPVEVS

TrEMBL top hits (e value, % identity, alignment)
A0A1S3ATT7 uncharacterized protein LOC103482794; e value: 2.85e-88; % identity: 66.67
Query:  MAEKEQVKPLAAFAPGQSRSDDDDRFLLSPAKLRLQTHKYIKCCGCFAALLLILAVVGIVLSFTVLHIKNPNIEIDTLSFSNS--GSDGEMIIVASVSVK
        MA+KEQVKPLA+ A  + RSDD   FL  P KL L  +KYIKCCGCF+ALLLILAV+GIVL FTV HIK P+I+ID+LSF N    S+  +I+VASVSV+
Subjt:  MAEKEQVKPLAAFAPGQSRSDDDDRFLLSPAKLRLQTHKYIKCCGCFAALLLILAVVGIVLSFTVLHIKNPNIEIDTLSFSNS--GSDGEMIIVASVSVK

Query:  NPNVASFKYSKTTTKIYYGGKVIGEGETPAGEAKAKETMKMNVTVAIGLEKIDDVSSLMKDLNSGSSLNINSYTKIPGRVKILGFIKKNLLVRMNCSMTY
        NPNVASFKYSK +TKIYY  KVIGEGETP GE KAK+T+KMNVTV I   KIDD SSL+KD NSG+ L+I+SYT+IPGRVK+LG IKKN LV+++CS+TY
Subjt:  NPNVASFKYSKTTTKIYYGGKVIGEGETPAGEAKAKETMKMNVTVAIGLEKIDDVSSLMKDLNSGSSLNINSYTKIPGRVKILGFIKKNLLVRMNCSMTY

Query:  NTRSQTIQGENCNQPVEVS
        N++S+TIQ ++C+Q V +S
Subjt:  NTRSQTIQGENCNQPVEVS

A0A5A7THA2 Late embryogenesis abundant protein-like protein; e value: 2.85e-88; % identity: 66.67
Query:  MAEKEQVKPLAAFAPGQSRSDDDDRFLLSPAKLRLQTHKYIKCCGCFAALLLILAVVGIVLSFTVLHIKNPNIEIDTLSFSNS--GSDGEMIIVASVSVK
        MA+KEQVKPLA+ A  + RSDD   FL  P KL L  +KYIKCCGCF+ALLLILAV+GIVL FTV HIK P+I+ID+LSF N    S+  +I+VASVSV+
Subjt:  MAEKEQVKPLAAFAPGQSRSDDDDRFLLSPAKLRLQTHKYIKCCGCFAALLLILAVVGIVLSFTVLHIKNPNIEIDTLSFSNS--GSDGEMIIVASVSVK

Query:  NPNVASFKYSKTTTKIYYGGKVIGEGETPAGEAKAKETMKMNVTVAIGLEKIDDVSSLMKDLNSGSSLNINSYTKIPGRVKILGFIKKNLLVRMNCSMTY
        NPNVASFKYSK +TKIYY  KVIGEGETP GE KAK+T+KMNVTV I   KIDD SSL+KD NSG+ L+I+SYT+IPGRVK+LG IKKN LV+++CS+TY
Subjt:  NPNVASFKYSKTTTKIYYGGKVIGEGETPAGEAKAKETMKMNVTVAIGLEKIDDVSSLMKDLNSGSSLNINSYTKIPGRVKILGFIKKNLLVRMNCSMTY

Query:  NTRSQTIQGENCNQPVEVS
        N++S+TIQ ++C+Q V +S
Subjt:  NTRSQTIQGENCNQPVEVS

A0A6J1CYE1 uncharacterized protein LOC111015350; e value: 1.02e-148; % identity: 99.54
Query:  MAEKEQVKPLAAFAPGQSRSDDDDRFLLSPAKLRLQTHKYIKCCGCFAALLLILAVVGIVLSFTVLHIKNPNIEIDTLSFSNSGSDGEMIIVASVSVKNP
        MAEKEQVKPLAAF PGQSRSDDDDRFLLSPAKLRLQTHKYIKCCGCFAALLLILAVVGIVLSFTVLHIKNPNIEIDTLSFSNSGSDGEMIIVASVSVKNP
Subjt:  MAEKEQVKPLAAFAPGQSRSDDDDRFLLSPAKLRLQTHKYIKCCGCFAALLLILAVVGIVLSFTVLHIKNPNIEIDTLSFSNSGSDGEMIIVASVSVKNP

Query:  NVASFKYSKTTTKIYYGGKVIGEGETPAGEAKAKETMKMNVTVAIGLEKIDDVSSLMKDLNSGSSLNINSYTKIPGRVKILGFIKKNLLVRMNCSMTYNT
        NVASFKYSKTTTKIYYGGKVIGEGETPAGEAKAKETMKMNVTVAIGLEKIDDVSSLMKDLNSGSSLNINSYTKIPGRVKILGFIKKNLLVRMNCSMTYNT
Subjt:  NVASFKYSKTTTKIYYGGKVIGEGETPAGEAKAKETMKMNVTVAIGLEKIDDVSSLMKDLNSGSSLNINSYTKIPGRVKILGFIKKNLLVRMNCSMTYNT

Query:  RSQTIQGENCNQPVEVSD
        RSQTIQGENCNQPVEVSD
Subjt:  RSQTIQGENCNQPVEVSD

A0A6J1H2C0 uncharacterized protein LOC111459739; e value: 7.75e-86; % identity: 65.6
Query:  MAEKEQVKPLAAFAPGQSRSDDDDRFLLSPAKLRLQTHKYIKCCGCFAALLLILAVVGIVLSFTVLHIKNPNIEIDTLSFSNSGSDGEMIIVASVSVKNP
        MAEK+QVKPLA+  P      DDD+FL  PAKLRL  +KYI C GCFAALLLILAV+GIVL FTVLHIK P+++ID LSFSN+ S+G +IIV SV V+NP
Subjt:  MAEKEQVKPLAAFAPGQSRSDDDDRFLLSPAKLRLQTHKYIKCCGCFAALLLILAVVGIVLSFTVLHIKNPNIEIDTLSFSNSGSDGEMIIVASVSVKNP

Query:  NVASFKYSKTTTKIYYGGKVIGEGETPAGEAKAKETMKMNVTVAIGLEKIDDVSSLMKDLNSGSSLNINSYTKIPGRVKILGFIKKNLLVRMNCSMTYNT
        NVASFKY K TT IYY  K+IGEGETP GEAKAK+TM MNVT+ I  E++D+  SLM+DL SG  LNI+SYT+IPGRVKI+GFI+K   V+M CS TYN 
Subjt:  NVASFKYSKTTTKIYYGGKVIGEGETPAGEAKAKETMKMNVTVAIGLEKIDDVSSLMKDLNSGSSLNINSYTKIPGRVKILGFIKKNLLVRMNCSMTYNT

Query:  RSQTIQGENCNQPVEVSD
        ++QTI+ E+C+Q V++SD
Subjt:  RSQTIQGENCNQPVEVSD

A0A6J1K4Z4 uncharacterized protein LOC111491239; e value: 5.37e-88; % identity: 65.14
Query:  MAEKEQVKPLAAFAPGQSRSDDDDRFLLSPAKLRLQTHKYIKCCGCFAALLLILAVVGIVLSFTVLHIKNPNIEIDTLSFSNSGSDGEMIIVASVSVKNP
        M EK+QVKPLA+  P      DDD+FL  PAKLRL  +KYI C GCFAALLLILAV+GIVL FTVLHIK P+++ID LSFSN+ S+G +IIVASV V+NP
Subjt:  MAEKEQVKPLAAFAPGQSRSDDDDRFLLSPAKLRLQTHKYIKCCGCFAALLLILAVVGIVLSFTVLHIKNPNIEIDTLSFSNSGSDGEMIIVASVSVKNP

Query:  NVASFKYSKTTTKIYYGGKVIGEGETPAGEAKAKETMKMNVTVAIGLEKIDDVSSLMKDLNSGSSLNINSYTKIPGRVKILGFIKKNLLVRMNCSMTYNT
        NVASFKYSK T  IYY G +IGEGETP GEAKAK+TM MNVTV I  E++D+  SLM+DL SG  LNI+SY +IPGRVKI+GFIKK   V++ CS TYN 
Subjt:  NVASFKYSKTTTKIYYGGKVIGEGETPAGEAKAKETMKMNVTVAIGLEKIDDVSSLMKDLNSGSSLNINSYTKIPGRVKILGFIKKNLLVRMNCSMTYNT

Query:  RSQTIQGENCNQPVEVSD
        ++QTI+ E+C++ V++SD
Subjt:  RSQTIQGENCNQPVEVSD

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
AT2G46150.1 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family; e value: 1.4e-31; % identity: 37.72
Query:  MAEKEQVKPLAAFAPGQSRSDDDDRFLLSPAKLRLQTHKYIKCCGCFAALLLILAVVGIVLSFTVLHIKNPNIE--------IDTLSFSNS----GSDGE
        MA+ E V+PL   AP       D+    S  K   ++   IKC  C  A  LIL  + + L FTV  +K+P I+        +D+++ +N     G++  
Subjt:  MAEKEQVKPLAAFAPGQSRSDDDDRFLLSPAKLRLQTHKYIKCCGCFAALLLILAVVGIVLSFTVLHIKNPNIE--------IDTLSFSNS----GSDGE

Query:  MIIVASVSVKNPNVASFKYSKTTTKIYYGGKVIGEGETPAGEAKAKETMKMNVTVAIGLEKIDDVSSLMKDLNSGSSLNINSYTKIPGRVKILGFIKKNL
        MI+   VSVKNPN ASFKYS TTT IYY G ++GE     G+A+   T +MNVTV I L++I     L ++++    +N+ SYT++ G+VKI+G +KK++
Subjt:  MIIVASVSVKNPNVASFKYSKTTTKIYYGGKVIGEGETPAGEAKAKETMKMNVTVAIGLEKIDDVSSLMKDLNSGSSLNINSYTKIPGRVKILGFIKKNL

Query:  LVRMNCSMTYNTRSQTIQGENCNQPVEV
         V+MNC+M  N   Q IQ  +C + +++
Subjt:  LVRMNCSMTYNTRSQTIQGENCNQPVEV

AT3G54200.1 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family; e value: 2.6e-17; % identity: 24.55
Query:  EKEQVKPLAAF--APGQSRSDDDDRFLLSPAKLRLQTHKYIKCCGCFAALL-LILAVVGIVLSFTVLHIKNPNIEIDTLSFSNSGSDGEMIIV-------
        +KE+ KP  A    P  + S  + +   +    +L+  +  K C CF  LL L++A+V ++L+FT+   K P   ID+++     +    +++       
Subjt:  EKEQVKPLAAF--APGQSRSDDDDRFLLSPAKLRLQTHKYIKCCGCFAALL-LILAVVGIVLSFTVLHIKNPNIEIDTLSFSNSGSDGEMIIV-------

Query:  --ASVSVKNPNVASFKYSKTTTKIYYGGKVIGEGETPAGEAKAKETMKMNVTVAIGLEKIDDVSSLMKDLNSGSSLNINSYTKIPGRVKILGFIKKNLLV
            +S+KNPN   F Y  ++  + Y G+VIGE   PA    A++T+ +N+T+ +  +++   + L+ D+ +G  + +N++ K+ G+V +L   K  +  
Subjt:  --ASVSVKNPNVASFKYSKTTTKIYYGGKVIGEGETPAGEAKAKETMKMNVTVAIGLEKIDDVSSLMKDLNSGSSLNINSYTKIPGRVKILGFIKKNLLV

Query:  RMNCSMTYNTRSQTIQGENC
          +C ++ +   + +  ++C
Subjt:  RMNCSMTYNTRSQTIQGENC

AT4G23610.1 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family; e value: 6.2e-19; % identity: 30.33
Query:  KEQVKPLAA--FAPGQSRSDDDDRFLLSPAKLRLQTHKYIKCCGCFAALLLILAVVGIVLSFTVLHIKNPNIEIDTLSFSNS--------GSDGEMIIVA
        ++Q KPLA         + D++D++     K      K I CCG  A+L +++AV  IVLS TV H+ +PN+ +D++SF+           ++    +  
Subjt:  KEQVKPLAA--FAPGQSRSDDDDRFLLSPAKLRLQTHKYIKCCGCFAALLLILAVVGIVLSFTVLHIKNPNIEIDTLSFSNS--------GSDGEMIIVA

Query:  SVSVKNPNVASFKYSKTTTKIYYGG-KVIGEGETPAGEAKAKETMKMNVTVAIGLEK-IDDVSSLMKDLNSGSSLNINSYTKIPGRVKILGFIKKNLLVR
         +S+ NPN A F         Y+G   V+GE    +    AK T+KMN+T  I   K +  +  LM+DLN G  +++ S  ++ GRVK +   +K + ++
Subjt:  SVSVKNPNVASFKYSKTTTKIYYGG-KVIGEGETPAGEAKAKETMKMNVTVAIGLEK-IDDVSSLMKDLNSGSSLNINSYTKIPGRVKILGFIKKNLLVR

Query:  MNCSMTYNTRS
         +C M   T +
Subjt:  MNCSMTYNTRS

AT4G23930.1 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family; e value: 1.2e-09; % identity: 24.29
Query:  CGCFAALLLILAVVGIVLSFTVLHIKNPNIEIDTL---SFSNSGSDGEMIIVASVSVKNPNVASFKYSKTTTKIYYGGKVIGEGETPAGEAKAKETMKMN
        C      ++ L +  + +  TV   ++P I + ++   SFS + S          +V+NPN A+F +     +++Y G  IG    PAGE ++  T +M 
Subjt:  CGCFAALLLILAVVGIVLSFTVLHIKNPNIEIDTL---SFSNSGSDGEMIIVASVSVKNPNVASFKYSKTTTKIYYGGKVIGEGETPAGEAKAKETMKMN

Query:  VTVAIGLEKIDDVSS-------LMKDLNSGSSLNINSYTKIPGRVKILGFIKKNLLVRMNCSMTYNTRSQTIQGENC
         T ++    +   SS             SGS++ I S  ++ GRV++LG     +  + NC +  ++   +I    C
Subjt:  VTVAIGLEKIDDVSS-------LMKDLNSGSSLNINSYTKIPGRVKILGFIKKNLLVRMNCSMTYNTRSQTIQGENC

AT4G23930.2 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family; e value: 1.9e-04; % identity: 21.84
Query:  CGCFAALLLILAVVGIVLSFTVLHIKNPNIEIDTLSFSNSGSDGEMIIVASVSVKNPNVASFKYSKTTTKIYYGGKVIGEGETPAGEAKAKETMKMNVTV
        C      ++ L +  + +  TV   ++P I +                    SVK P+     +S   + ++Y G  IG    PAGE ++  T +M  T 
Subjt:  CGCFAALLLILAVVGIVLSFTVLHIKNPNIEIDTLSFSNSGSDGEMIIVASVSVKNPNVASFKYSKTTTKIYYGGKVIGEGETPAGEAKAKETMKMNVTV

Query:  AIGLEKIDDVSS-------LMKDLNSGSSLNINSYTKIPGRVKILGFIKKNLLVRMNCSMTYNTRSQTIQGENC
        ++    +   SS             SGS++ I S  ++ GRV++LG     +  + NC +  ++   +I    C
Subjt:  AIGLEKIDDVSS-------LMKDLNSGSSLNINSYTKIPGRVKILGFIKKNLLVRMNCSMTYNTRSQTIQGENC
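In the alignments above, the unlabeled line between each Query and Subjt row follows the usual BLAST match-line convention: a letter marks an identical residue, `+` a conservative substitution, and a space a mismatch or gap. As a rough sketch of how a percent identity can be derived from those lines, the Python below counts the letters. The function names are hypothetical, and it assumes each match line has been cut to exactly the width of its alignment block (trailing spaces may have been lost when the page was copied, which would skew the result).

```python
def midline_identity(midline):
    """Return (identical positions, block length) for one BLAST match line.

    Assumes letters mark identities, '+' marks positives and spaces mark
    mismatches or gaps, and that the string spans the full block width.
    """
    identities = sum(1 for ch in midline if ch.isalpha())
    return identities, len(midline)


def percent_identity(midlines):
    """Combine several match-line blocks into a single percent identity."""
    total_identities = 0
    total_length = 0
    for midline in midlines:
        identities, length = midline_identity(midline)
        total_identities += identities
        total_length += length
    return 100.0 * total_identities / total_length


# Last block of the Cucumis melo hit, copied from the alignment above.
last_block = "N++S+TIQ ++C+Q V +S"
print(midline_identity(last_block))
```

Applying `percent_identity` to all blocks of one hit should approximate the value in the % identity column, provided the match lines were copied with their original spacing intact.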


Sequences
CDS sequence
ATGGCAGAGAAAGAGCAAGTCAAGCCCTTGGCAGCTTTCGCCCCCGGCCAATCCCGCAGCGACGACGATGATCGTTTCCTTCTTTCTCCGGCCAAGCTCCGCCTCCAAAC
GCACAAATACATCAAGTGCTGCGGTTGCTTCGCCGCTCTTCTCTTAATTCTCGCCGTTGTCGGCATCGTCCTCAGCTTCACTGTCCTCCACATCAAAAATCCAAATATCG
AAATTGATACCCTATCGTTCTCAAACAGTGGTTCAGACGGTGAGATGATCATTGTGGCAAGCGTCTCCGTGAAAAACCCTAATGTTGCATCCTTCAAATACTCAAAAACC
ACGACAAAAATTTATTACGGTGGCAAGGTCATCGGAGAGGGCGAGACCCCGGCCGGGGAGGCGAAGGCGAAGGAGACGATGAAGATGAATGTGACGGTGGCGATTGGGCT
TGAGAAGATCGATGACGTTTCGAGTTTGATGAAGGATTTGAACTCGGGATCATCTTTGAATATCAATAGCTACACGAAGATTCCGGGGAGGGTCAAAATACTTGGCTTCA
TCAAGAAAAACTTGCTTGTGAGGATGAACTGTTCCATGACTTACAACACTAGAAGTCAGACCATTCAAGGGGAAAATTGCAATCAACCAGTAGAAGTCTCTGATTAA
mRNA sequence
CAACAAAATCGTGAGATTATTATGTAAGAAAGGGACAATGGATCAAAACATTTCACTAGTCAACCTTTTGCCCAATCAACACGTTTTTCAAAAAGAAAAAAAAAGGGAAG
ATGAAAACTCCTAAAGCAAAACACCAGAGACTGCAGACTGACTTGACGATGGAGTCTTTATTCCAAGATCAAAACCAACAATTAAATACCCAAAATTTCCCTTCTTTCCC
CCTCATTTCTCCATAGCCATGGCAGAGAAAGAGCAAGTCAAGCCCTTGGCAGCTTTCGCCCCCGGCCAATCCCGCAGCGACGACGATGATCGTTTCCTTCTTTCTCCGGC
CAAGCTCCGCCTCCAAACGCACAAATACATCAAGTGCTGCGGTTGCTTCGCCGCTCTTCTCTTAATTCTCGCCGTTGTCGGCATCGTCCTCAGCTTCACTGTCCTCCACA
TCAAAAATCCAAATATCGAAATTGATACCCTATCGTTCTCAAACAGTGGTTCAGACGGTGAGATGATCATTGTGGCAAGCGTCTCCGTGAAAAACCCTAATGTTGCATCC
TTCAAATACTCAAAAACCACGACAAAAATTTATTACGGTGGCAAGGTCATCGGAGAGGGCGAGACCCCGGCCGGGGAGGCGAAGGCGAAGGAGACGATGAAGATGAATGT
GACGGTGGCGATTGGGCTTGAGAAGATCGATGACGTTTCGAGTTTGATGAAGGATTTGAACTCGGGATCATCTTTGAATATCAATAGCTACACGAAGATTCCGGGGAGGG
TCAAAATACTTGGCTTCATCAAGAAAAACTTGCTTGTGAGGATGAACTGTTCCATGACTTACAACACTAGAAGTCAGACCATTCAAGGGGAAAATTGCAATCAACCAGTA
GAAGTCTCTGATTAATGAAGAGAGAAACTTATGCAATTGTATTATAGAATGATGA
Protein sequence
MAEKEQVKPLAAFAPGQSRSDDDDRFLLSPAKLRLQTHKYIKCCGCFAALLLILAVVGIVLSFTVLHIKNPNIEIDTLSFSNSGSDGEMIIVASVSVKNPNVASFKYSKT
TTKIYYGGKVIGEGETPAGEAKAKETMKMNVTVAIGLEKIDDVSSLMKDLNSGSSLNINSYTKIPGRVKILGFIKKNLLVRMNCSMTYNTRSQTIQGENCNQPVEVSD
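The CDS and protein sequences above can be cross-checked by translating the CDS and comparing the result with the listed protein. The following Python is a minimal sketch of that check, not the database's own pipeline: it assumes the standard nuclear genetic code (NCBI translation table 1), and the helper name `translate` is hypothetical. Codons containing ambiguous bases such as `N` would raise a `KeyError` in this version.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a CDS from its first base, stopping at the first stop codon."""
    # Drop line breaks and spaces left over from wrapped sequence text.
    cds = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First eight codons of the CDS above, followed by its TAA stop codon.
print(translate("ATGGCAGAGAAAGAGCAAGTCAAGTAA"))
```

Passing the full CDS (line breaks included) to `translate` and comparing the output with the protein sequence joined into one string gives a quick consistency check between the two records.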