; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc05g23010 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc05g23010
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionRetrotrans_gag domain-containing protein
Genome locationchr5:16486700..16493152
RNA-Seq ExpressionMoc05g23010
SyntenyMoc05g23010
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022150863.1 uncharacterized protein LOC111018910 [Momordica charantia]1.4e-7140.87Show/hide
Query:  MKEIVDGVPVSADPEVAVPPLNVVLLADDIDREIRAYAAPTFYNFNQVITGLKIAAPKFELNP-------------------------------------
        ++EIVDGVPV+ + EV VP LNVVLLA  IDREIRAYAAPTFYNFN VIT  +I APKFEL                                       
Subjt:  MKEIVDGVPVSADPEVAVPPLNVVLLADDIDREIRAYAAPTFYNFNQVITGLKIAAPKFELNP-------------------------------------

Query:  -----------------------------------------------------------IETYYNGLDDATRLVIDVSTNGALLAKPYAEAFNILERISS
                                                                   IE YYNGLDDATRLV  VS N ALLAKPYAEAFNILERISS
Subjt:  -----------------------------------------------------------IETYYNGLDDATRLVIDVSTNGALLAKPYAEAFNILERISS

Query:  NNHSWSDLRAIHGRGSKGLNKSESYSALNSKTENLKDLVMKR-----------------------------DHYYNNCPGNQESVYYLGNPQNSRNNSYS
        N HS SD RAI GRG+K LN+S+SYS  NSK EN+ DLV +                               H+YNNCPGN ESVY LGN  NSRNNSYS
Subjt:  NNHSWSDLRAIHGRGSKGLNKSESYSALNSKTENLKDLVMKR-----------------------------DHYYNNCPGNQESVYYLGNPQNSRNNSYS

Query:  KMYNPDWRSHPNFSWSGNQGGNNAGQMVAQKPSEGLFALLEKLMKQYMADNEATRDGKEQCKALTLQRGKSITSGISKCSKAVNEVEQGEFQPAKDSEPA
          YNP  R+HPN                          + E++M  +      +R G                                           
Subjt:  KMYNPDWRSHPNFSWSGNQGGNNAGQMVAQKPSEGLFALLEKLMKQYMADNEATRDGKEQCKALTLQRGKSITSGISKCSKAVNEVEQGEFQPAKDSEPA

Query:  EIVSPTPPVQIVEQQREIENSSSEEVNPVNIKASNVGSTQTKVPKKRKQARYEDAPAEYRPTPPYPMRLQKKEHNILLMKFLGLLKQLHVN-TLVEALEQ
         ++     ++IVEQQRE +NSS+EEVNPVN  AS  GSTQ +V KKRKQ  +EDA AEY+  PPYP R QKKE N+  MKFL +LKQLHVN  LVEALE+
Subjt:  EIVSPTPPVQIVEQQREIENSSSEEVNPVNIKASNVGSTQTKVPKKRKQARYEDAPAEYRPTPPYPMRLQKKEHNILLMKFLGLLKQLHVN-TLVEALEQ

Query:  IPNY
        + NY
Subjt:  IPNY

XP_022156835.1 uncharacterized protein LOC111023669 [Momordica charantia]9.8e-5457.87Show/hide
Query:  IETYYNGLDDATRLVIDVSTNGALLAKPYAEAFNILERISSNNHSWSDLRAIHGRGSKGLNKSESYSALNSKTENLKDLVMKR-----------------
        IE YY GLDDATRLVID STNGALL KPYAEAFNILERISSNNHSWSD RAI GRG KGLN+SESY ALNSK ENL +LVM+                  
Subjt:  IETYYNGLDDATRLVIDVSTNGALLAKPYAEAFNILERISSNNHSWSDLRAIHGRGSKGLNKSESYSALNSKTENLKDLVMKR-----------------

Query:  ------------DHYYNNCPGNQESVYYLGNPQNSRNNSYSKMYNPDWRSHPNFSWSGNQGGNNA---------------------GQMVAQKPSEGLFA
                    +H+YNN P N ESVYYLGN QN+  NSYS  YNP WR+HPNFSWSGNQGGNNA                     GQ+  Q  SEG FA
Subjt:  ------------DHYYNNCPGNQESVYYLGNPQNSRNNSYSKMYNPDWRSHPNFSWSGNQGGNNA---------------------GQMVAQKPSEGLFA

Query:  LLEKLMKQYMADNEAT
         LE LMK+ M  N+ T
Subjt:  LLEKLMKQYMADNEAT

XP_022158598.1 uncharacterized protein LOC111025053 [Momordica charantia]1.1e-5243.96Show/hide
Query:  IETYYNGLDDATRLVIDVSTNGALLAKPYAEAFNILERISSNNHSWSDLRAIHGRGSKGLNKSESYSALNSKTENLKDLVMKRDHYYNNCPGNQESVYYL
        IETYY GLD+ATRLVID S NGALL KPYA+A NILERISS+NHSWSD RAI G+ SK L +SESY+ LNSK E L DL                     
Subjt:  IETYYNGLDDATRLVIDVSTNGALLAKPYAEAFNILERISSNNHSWSDLRAIHGRGSKGLNKSESYSALNSKTENLKDLVMKRDHYYNNCPGNQESVYYL

Query:  GNPQNSRNNSYSKMYNPDWRSHPNFSWSGNQGGNNA---------------------GQMVAQKPSEGLFALLEKLMKQYMADNEAT-------------
            N+RN SYS  YNP  R+HPNF WSGNQGG+N                      GQMV    S+G    LE +MKQYMA+N+AT             
Subjt:  GNPQNSRNNSYSKMYNPDWRSHPNFSWSGNQGGNNA---------------------GQMVAQKPSEGLFALLEKLMKQYMADNEAT-------------

Query:  -------------------------RDGKEQCKALTLQRGKSIT---SGISKCSKAVNEVEQGEFQPAKDSEPAEIVSPTPPVQIVEQQREIENSSSEEV
                                 RD KEQC ALTL+ GK++          +K   ++ QGE Q  +DSEPAE+V P PP QI EQ +E +N+S + V
Subjt:  -------------------------RDGKEQCKALTLQRGKSIT---SGISKCSKAVNEVEQGEFQPAKDSEPAEIVSPTPPVQIVEQQREIENSSSEEV

Query:  NPVNIKASNVGSTQTKVPKKRKQ
        NPV  +A   GS+Q  +P+K  +
Subjt:  NPVNIKASNVGSTQTKVPKKRKQ

XP_022158611.1 uncharacterized protein LOC111025065 [Momordica charantia]7.8e-5140.71Show/hide
Query:  IETYYNGLDDATRLVIDVSTNGALLAKPYAEAFNILERISSNNHSWSDLRAIHGRGSKGLNKSESYSALNSKTENLKDLVMKR-----------------
        IETYY  L+DATRL                                 D RA+ G+ SKGL +SESY+ LNS  ENL  LVM+                  
Subjt:  IETYYNGLDDATRLVIDVSTNGALLAKPYAEAFNILERISSNNHSWSDLRAIHGRGSKGLNKSESYSALNSKTENLKDLVMKR-----------------

Query:  ------------DHYYNNCPGNQESVYYLGNPQNSRNNSYSKMYNPDWRSHPNFSWSGNQGGNNA---------------------GQMVAQKPSEGLFA
                    DH+YNNCPGN ESVYYLGNPQN+RNN YS  YNP WR+HPNFSWSG+QGG+NA                     GQMVA++ SEG  A
Subjt:  ------------DHYYNNCPGNQESVYYLGNPQNSRNNSYSKMYNPDWRSHPNFSWSGNQGGNNA---------------------GQMVAQKPSEGLFA

Query:  LLEKLMKQYMADNEATRDGKEQCKALTLQRGKSITSGISKCSKAVNEVEQGEFQPAKDSEPAEIVSPTPPVQIVEQQREIENSSSEEVNPVNIKASNVGS
         LEKLMKQYMA+N+AT     Q +A +L+  K               ++ G+                                          A+++ S
Subjt:  LLEKLMKQYMADNEATRDGKEQCKALTLQRGKSITSGISKCSKAVNEVEQGEFQPAKDSEPAEIVSPTPPVQIVEQQREIENSSSEEVNPVNIKASNVGS

Query:  TQTKVPKKRKQARYEDAPAEYRPTPPYPMRLQKKEHNILLMKFLGLLKQLHVN-TLVEALEQIPNY
           +VP+KRKQA +E+APAEY P PPYP RLQKKE N+   KFL +LKQLHVN  LVEALEQ+PNY
Subjt:  TQTKVPKKRKQARYEDAPAEYRPTPPYPMRLQKKEHNILLMKFLGLLKQLHVN-TLVEALEQIPNY

XP_022158740.1 uncharacterized protein LOC111025203 [Momordica charantia]1.1e-3942.52Show/hide
Query:  DHYYNNCPGNQESVYYLGNPQNSRNNSYSKMYNPDWRSHPNFSWSGNQGGN---------------------NAGQMVAQKPSEGLFALLEKLMKQYMAD
        +H+YN+CP N +SVYYLGN  N+ NN YS  YN  W SHPNFSWS NQG N                     N GQ   QKP +G FA LE LMKQYM  
Subjt:  DHYYNNCPGNQESVYYLGNPQNSRNNSYSKMYNPDWRSHPNFSWSGNQGGN---------------------NAGQMVAQKPSEGLFALLEKLMKQYMAD

Query:  NEATRDGKEQCKALTLQRGKSITSGISKCSKAVNEVEQGEFQPAKDSEPAEIVSPTPPVQIVEQQREIENSSSEEVNPVNIKASNVGSTQTKVPKKRKQA
        N  T                     +   + ++  +E    Q A D +     +     ++ EQ ++ ++ +S+EVNPVN KASN G++  KV +KRK+ 
Subjt:  NEATRDGKEQCKALTLQRGKSITSGISKCSKAVNEVEQGEFQPAKDSEPAEIVSPTPPVQIVEQQREIENSSSEEVNPVNIKASNVGSTQTKVPKKRKQA

Query:  RYEDAPAEYRPTPPYPMRLQKKEHNILLMKFLGLLKQLHVN-TLVEALEQIPNY
         +EDAP E+RPTPPYP RL+KKE ++   KFL +L QLHVN  LVEA EQ+  Y
Subjt:  RYEDAPAEYRPTPPYPMRLQKKEHNILLMKFLGLLKQLHVN-TLVEALEQIPNY

TrEMBL top hitse value%identityAlignment
A0A6J1DAK9 uncharacterized protein LOC1110189106.6e-7240.87Show/hide
Query:  MKEIVDGVPVSADPEVAVPPLNVVLLADDIDREIRAYAAPTFYNFNQVITGLKIAAPKFELNP-------------------------------------
        ++EIVDGVPV+ + EV VP LNVVLLA  IDREIRAYAAPTFYNFN VIT  +I APKFEL                                       
Subjt:  MKEIVDGVPVSADPEVAVPPLNVVLLADDIDREIRAYAAPTFYNFNQVITGLKIAAPKFELNP-------------------------------------

Query:  -----------------------------------------------------------IETYYNGLDDATRLVIDVSTNGALLAKPYAEAFNILERISS
                                                                   IE YYNGLDDATRLV  VS N ALLAKPYAEAFNILERISS
Subjt:  -----------------------------------------------------------IETYYNGLDDATRLVIDVSTNGALLAKPYAEAFNILERISS

Query:  NNHSWSDLRAIHGRGSKGLNKSESYSALNSKTENLKDLVMKR-----------------------------DHYYNNCPGNQESVYYLGNPQNSRNNSYS
        N HS SD RAI GRG+K LN+S+SYS  NSK EN+ DLV +                               H+YNNCPGN ESVY LGN  NSRNNSYS
Subjt:  NNHSWSDLRAIHGRGSKGLNKSESYSALNSKTENLKDLVMKR-----------------------------DHYYNNCPGNQESVYYLGNPQNSRNNSYS

Query:  KMYNPDWRSHPNFSWSGNQGGNNAGQMVAQKPSEGLFALLEKLMKQYMADNEATRDGKEQCKALTLQRGKSITSGISKCSKAVNEVEQGEFQPAKDSEPA
          YNP  R+HPN                          + E++M  +      +R G                                           
Subjt:  KMYNPDWRSHPNFSWSGNQGGNNAGQMVAQKPSEGLFALLEKLMKQYMADNEATRDGKEQCKALTLQRGKSITSGISKCSKAVNEVEQGEFQPAKDSEPA

Query:  EIVSPTPPVQIVEQQREIENSSSEEVNPVNIKASNVGSTQTKVPKKRKQARYEDAPAEYRPTPPYPMRLQKKEHNILLMKFLGLLKQLHVN-TLVEALEQ
         ++     ++IVEQQRE +NSS+EEVNPVN  AS  GSTQ +V KKRKQ  +EDA AEY+  PPYP R QKKE N+  MKFL +LKQLHVN  LVEALE+
Subjt:  EIVSPTPPVQIVEQQREIENSSSEEVNPVNIKASNVGSTQTKVPKKRKQARYEDAPAEYRPTPPYPMRLQKKEHNILLMKFLGLLKQLHVN-TLVEALEQ

Query:  IPNY
        + NY
Subjt:  IPNY

A0A6J1DRG1 uncharacterized protein LOC1110236694.8e-5457.87Show/hide
Query:  IETYYNGLDDATRLVIDVSTNGALLAKPYAEAFNILERISSNNHSWSDLRAIHGRGSKGLNKSESYSALNSKTENLKDLVMKR-----------------
        IE YY GLDDATRLVID STNGALL KPYAEAFNILERISSNNHSWSD RAI GRG KGLN+SESY ALNSK ENL +LVM+                  
Subjt:  IETYYNGLDDATRLVIDVSTNGALLAKPYAEAFNILERISSNNHSWSDLRAIHGRGSKGLNKSESYSALNSKTENLKDLVMKR-----------------

Query:  ------------DHYYNNCPGNQESVYYLGNPQNSRNNSYSKMYNPDWRSHPNFSWSGNQGGNNA---------------------GQMVAQKPSEGLFA
                    +H+YNN P N ESVYYLGN QN+  NSYS  YNP WR+HPNFSWSGNQGGNNA                     GQ+  Q  SEG FA
Subjt:  ------------DHYYNNCPGNQESVYYLGNPQNSRNNSYSKMYNPDWRSHPNFSWSGNQGGNNA---------------------GQMVAQKPSEGLFA

Query:  LLEKLMKQYMADNEAT
         LE LMK+ M  N+ T
Subjt:  LLEKLMKQYMADNEAT

A0A6J1DWK1 uncharacterized protein LOC1110250535.3e-5343.96Show/hide
Query:  IETYYNGLDDATRLVIDVSTNGALLAKPYAEAFNILERISSNNHSWSDLRAIHGRGSKGLNKSESYSALNSKTENLKDLVMKRDHYYNNCPGNQESVYYL
        IETYY GLD+ATRLVID S NGALL KPYA+A NILERISS+NHSWSD RAI G+ SK L +SESY+ LNSK E L DL                     
Subjt:  IETYYNGLDDATRLVIDVSTNGALLAKPYAEAFNILERISSNNHSWSDLRAIHGRGSKGLNKSESYSALNSKTENLKDLVMKRDHYYNNCPGNQESVYYL

Query:  GNPQNSRNNSYSKMYNPDWRSHPNFSWSGNQGGNNA---------------------GQMVAQKPSEGLFALLEKLMKQYMADNEAT-------------
            N+RN SYS  YNP  R+HPNF WSGNQGG+N                      GQMV    S+G    LE +MKQYMA+N+AT             
Subjt:  GNPQNSRNNSYSKMYNPDWRSHPNFSWSGNQGGNNA---------------------GQMVAQKPSEGLFALLEKLMKQYMADNEAT-------------

Query:  -------------------------RDGKEQCKALTLQRGKSIT---SGISKCSKAVNEVEQGEFQPAKDSEPAEIVSPTPPVQIVEQQREIENSSSEEV
                                 RD KEQC ALTL+ GK++          +K   ++ QGE Q  +DSEPAE+V P PP QI EQ +E +N+S + V
Subjt:  -------------------------RDGKEQCKALTLQRGKSIT---SGISKCSKAVNEVEQGEFQPAKDSEPAEIVSPTPPVQIVEQQREIENSSSEEV

Query:  NPVNIKASNVGSTQTKVPKKRKQ
        NPV  +A   GS+Q  +P+K  +
Subjt:  NPVNIKASNVGSTQTKVPKKRKQ

A0A6J1DWN2 uncharacterized protein LOC1110252035.1e-4042.52Show/hide
Query:  DHYYNNCPGNQESVYYLGNPQNSRNNSYSKMYNPDWRSHPNFSWSGNQGGN---------------------NAGQMVAQKPSEGLFALLEKLMKQYMAD
        +H+YN+CP N +SVYYLGN  N+ NN YS  YN  W SHPNFSWS NQG N                     N GQ   QKP +G FA LE LMKQYM  
Subjt:  DHYYNNCPGNQESVYYLGNPQNSRNNSYSKMYNPDWRSHPNFSWSGNQGGN---------------------NAGQMVAQKPSEGLFALLEKLMKQYMAD

Query:  NEATRDGKEQCKALTLQRGKSITSGISKCSKAVNEVEQGEFQPAKDSEPAEIVSPTPPVQIVEQQREIENSSSEEVNPVNIKASNVGSTQTKVPKKRKQA
        N  T                     +   + ++  +E    Q A D +     +     ++ EQ ++ ++ +S+EVNPVN KASN G++  KV +KRK+ 
Subjt:  NEATRDGKEQCKALTLQRGKSITSGISKCSKAVNEVEQGEFQPAKDSEPAEIVSPTPPVQIVEQQREIENSSSEEVNPVNIKASNVGSTQTKVPKKRKQA

Query:  RYEDAPAEYRPTPPYPMRLQKKEHNILLMKFLGLLKQLHVN-TLVEALEQIPNY
         +EDAP E+RPTPPYP RL+KKE ++   KFL +L QLHVN  LVEA EQ+  Y
Subjt:  RYEDAPAEYRPTPPYPMRLQKKEHNILLMKFLGLLKQLHVN-TLVEALEQIPNY

A0A6J1E1F3 uncharacterized protein LOC1110250653.8e-5140.71Show/hide
Query:  IETYYNGLDDATRLVIDVSTNGALLAKPYAEAFNILERISSNNHSWSDLRAIHGRGSKGLNKSESYSALNSKTENLKDLVMKR-----------------
        IETYY  L+DATRL                                 D RA+ G+ SKGL +SESY+ LNS  ENL  LVM+                  
Subjt:  IETYYNGLDDATRLVIDVSTNGALLAKPYAEAFNILERISSNNHSWSDLRAIHGRGSKGLNKSESYSALNSKTENLKDLVMKR-----------------

Query:  ------------DHYYNNCPGNQESVYYLGNPQNSRNNSYSKMYNPDWRSHPNFSWSGNQGGNNA---------------------GQMVAQKPSEGLFA
                    DH+YNNCPGN ESVYYLGNPQN+RNN YS  YNP WR+HPNFSWSG+QGG+NA                     GQMVA++ SEG  A
Subjt:  ------------DHYYNNCPGNQESVYYLGNPQNSRNNSYSKMYNPDWRSHPNFSWSGNQGGNNA---------------------GQMVAQKPSEGLFA

Query:  LLEKLMKQYMADNEATRDGKEQCKALTLQRGKSITSGISKCSKAVNEVEQGEFQPAKDSEPAEIVSPTPPVQIVEQQREIENSSSEEVNPVNIKASNVGS
         LEKLMKQYMA+N+AT     Q +A +L+  K               ++ G+                                          A+++ S
Subjt:  LLEKLMKQYMADNEATRDGKEQCKALTLQRGKSITSGISKCSKAVNEVEQGEFQPAKDSEPAEIVSPTPPVQIVEQQREIENSSSEEVNPVNIKASNVGS

Query:  TQTKVPKKRKQARYEDAPAEYRPTPPYPMRLQKKEHNILLMKFLGLLKQLHVN-TLVEALEQIPNY
           +VP+KRKQA +E+APAEY P PPYP RLQKKE N+   KFL +LKQLHVN  LVEALEQ+PNY
Subjt:  TQTKVPKKRKQARYEDAPAEYRPTPPYPMRLQKKEHNILLMKFLGLLKQLHVN-TLVEALEQIPNY

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAAGAAATAGTAGATGGGGTTCCTGTTTCTGCTGACCCTGAGGTAGCAGTGCCCCCTCTCAATGTTGTATTACTAGCAGATGACATCGATAGAGAAATTAGG
GCGTATGCAGCACCAACATTTTATAATTTCAACCAAGTAATCACAGGCCTGAAAATTGCAGCCCCAAAGTTTGAACTCAATCCGATCGAAACATATTACAATGGT
TTGGACGATGCGACGCGTTTAGTCATCGATGTGTCAACAAATGGGGCTTTGTTAGCTAAACCTTATGCTGAAGCATTCAACATCTTGGAAAGGATATCGTCCAAC
AACCATTCATGGTCTGACCTTAGAGCTATACATGGTAGAGGAAGCAAGGGACTTAACAAATCTGAGTCATACTCTGCTCTAAACTCGAAGACTGAGAACCTGAAA
GACTTAGTGATGAAAAGAGATCATTATTATAATAATTGTCCTGGCAATCAAGAGTCGGTGTACTACTTGGGAAATCCACAGAATAGTAGAAACAACTCATACTCC
AAGATGTATAACCCCGACTGGAGAAGTCACCCCAACTTCAGTTGGAGTGGAAATCAGGGAGGAAATAATGCTGGTCAGATGGTAGCACAAAAGCCTTCAGAAGGA
TTGTTTGCATTGTTGGAGAAGCTGATGAAGCAGTACATGGCAGATAATGAAGCCACTAGAGATGGCAAGGAGCAATGTAAGGCTCTCACACTGCAAAGAGGAAAA
AGCATTACCTCCGGCATATCCAAATGCTCCAAGGCAGTGAACGAGGTAGAACAAGGAGAATTTCAGCCAGCAAAAGATAGTGAGCCAGCAGAGATAGTTTCACCT
ACCCCACCAGTACAGATTGTTGAACAACAAAGGGAAATTGAGAATTCTTCGAGTGAGGAGGTCAACCCAGTGAATATTAAAGCATCTAATGTAGGATCAACGCAG
ACCAAAGTGCCCAAGAAAAGAAAGCAGGCAAGGTATGAGGATGCTCCAGCAGAATACAGACCGACACCACCATATCCTATGCGGTTGCAGAAGAAAGAGCATAAT
ATTCTGCTTATGAAATTCTTAGGTTTGCTGAAGCAACTGCATGTTAACACTTTGGTGGAAGCTCTAGAGCAAATACCAAATTATAATGATGCTGATAATGTAGAG
GAGGACTTTGAGGGTATCTACCTAGACAGTATGAACAGTAGTGAAAGGTCAGCTAAGCACATGCATGAGTCTTTAAACCTCGCAAATCATGAGTATAAATTGCTT
AAGTCATCTATAGAAGAGCCACTAGTCCTGGAACTCAAAGCATTGGTTCAACATCTGAAGTATGCCTACCTAGGTATTGAAGTTCTGAATGTGGAGGACGATGAG
CCATGGTTTGCTGAGTTGGCCAACTATATCGGTAGTGGGATATTGCCATCTGATATGAATAAACAACAGCTAAAGAACAAGCCAGGTGCTGACGAAATGCCATTC
AGTCATTATGGTGGGCACTTTGGAGGGCAGCGCATAACAGCAAAGGTTGAAATCAGGCTTAAAGTGGTTTACAAGAAAATGTGGGCAGCGGAAGTGATGTCGTTG
GCGAAGACAGAGAAAGAAGGCTCTAGCGAAGAGTTCCAGGAAGCCAGGCAGACGATCCCGCTGCCGGAAGCTTCAGGCGAGCGGAGCTCTGTCCAGCAAACTCCG
ACTAGTTCGGAGCACACTCCCGCTGATTCAGAACTCCTACCACTGCTTCCATCAGTCGCCAGTTCTCAAGTAAGTGCTCCAGACCCCAGTGCCCCTCTCCAACAA
ATCCAAGCCAAAAACCCTCCCTCTTCTGTTGAAAACTTCTTATCGTTCAGCCCCGCTAAATCAACCGCTCCTCCTACCATCCCCACCACTGAATCACCCTCTACT
CATGAAAATTTGCCCCTCTTCCCGCCTAAATGGCTGTCTCACAAACCCCCGCCAACCCCACTCCCATCTCCATTGTCCCCTCACCCATAG
mRNA sequenceShow/hide mRNA sequence
ATGAAAGAAATAGTAGATGGGGTTCCTGTTTCTGCTGACCCTGAGGTAGCAGTGCCCCCTCTCAATGTTGTATTACTAGCAGATGACATCGATAGAGAAATTAGG
GCGTATGCAGCACCAACATTTTATAATTTCAACCAAGTAATCACAGGCCTGAAAATTGCAGCCCCAAAGTTTGAACTCAATCCGATCGAAACATATTACAATGGT
TTGGACGATGCGACGCGTTTAGTCATCGATGTGTCAACAAATGGGGCTTTGTTAGCTAAACCTTATGCTGAAGCATTCAACATCTTGGAAAGGATATCGTCCAAC
AACCATTCATGGTCTGACCTTAGAGCTATACATGGTAGAGGAAGCAAGGGACTTAACAAATCTGAGTCATACTCTGCTCTAAACTCGAAGACTGAGAACCTGAAA
GACTTAGTGATGAAAAGAGATCATTATTATAATAATTGTCCTGGCAATCAAGAGTCGGTGTACTACTTGGGAAATCCACAGAATAGTAGAAACAACTCATACTCC
AAGATGTATAACCCCGACTGGAGAAGTCACCCCAACTTCAGTTGGAGTGGAAATCAGGGAGGAAATAATGCTGGTCAGATGGTAGCACAAAAGCCTTCAGAAGGA
TTGTTTGCATTGTTGGAGAAGCTGATGAAGCAGTACATGGCAGATAATGAAGCCACTAGAGATGGCAAGGAGCAATGTAAGGCTCTCACACTGCAAAGAGGAAAA
AGCATTACCTCCGGCATATCCAAATGCTCCAAGGCAGTGAACGAGGTAGAACAAGGAGAATTTCAGCCAGCAAAAGATAGTGAGCCAGCAGAGATAGTTTCACCT
ACCCCACCAGTACAGATTGTTGAACAACAAAGGGAAATTGAGAATTCTTCGAGTGAGGAGGTCAACCCAGTGAATATTAAAGCATCTAATGTAGGATCAACGCAG
ACCAAAGTGCCCAAGAAAAGAAAGCAGGCAAGGTATGAGGATGCTCCAGCAGAATACAGACCGACACCACCATATCCTATGCGGTTGCAGAAGAAAGAGCATAAT
ATTCTGCTTATGAAATTCTTAGGTTTGCTGAAGCAACTGCATGTTAACACTTTGGTGGAAGCTCTAGAGCAAATACCAAATTATAATGATGCTGATAATGTAGAG
GAGGACTTTGAGGGTATCTACCTAGACAGTATGAACAGTAGTGAAAGGTCAGCTAAGCACATGCATGAGTCTTTAAACCTCGCAAATCATGAGTATAAATTGCTT
AAGTCATCTATAGAAGAGCCACTAGTCCTGGAACTCAAAGCATTGGTTCAACATCTGAAGTATGCCTACCTAGGTATTGAAGTTCTGAATGTGGAGGACGATGAG
CCATGGTTTGCTGAGTTGGCCAACTATATCGGTAGTGGGATATTGCCATCTGATATGAATAAACAACAGCTAAAGAACAAGCCAGGTGCTGACGAAATGCCATTC
AGTCATTATGGTGGGCACTTTGGAGGGCAGCGCATAACAGCAAAGGTTGAAATCAGGCTTAAAGTGGTTTACAAGAAAATGTGGGCAGCGGAAGTGATGTCGTTG
GCGAAGACAGAGAAAGAAGGCTCTAGCGAAGAGTTCCAGGAAGCCAGGCAGACGATCCCGCTGCCGGAAGCTTCAGGCGAGCGGAGCTCTGTCCAGCAAACTCCG
ACTAGTTCGGAGCACACTCCCGCTGATTCAGAACTCCTACCACTGCTTCCATCAGTCGCCAGTTCTCAAGTAAGTGCTCCAGACCCCAGTGCCCCTCTCCAACAA
ATCCAAGCCAAAAACCCTCCCTCTTCTGTTGAAAACTTCTTATCGTTCAGCCCCGCTAAATCAACCGCTCCTCCTACCATCCCCACCACTGAATCACCCTCTACT
CATGAAAATTTGCCCCTCTTCCCGCCTAAATGGCTGTCTCACAAACCCCCGCCAACCCCACTCCCATCTCCATTGTCCCCTCACCCATAG
Protein sequenceShow/hide protein sequence
MKEIVDGVPVSADPEVAVPPLNVVLLADDIDREIRAYAAPTFYNFNQVITGLKIAAPKFELNPIETYYNGLDDATRLVIDVSTNGALLAKPYAEAFNILERISSN
NHSWSDLRAIHGRGSKGLNKSESYSALNSKTENLKDLVMKRDHYYNNCPGNQESVYYLGNPQNSRNNSYSKMYNPDWRSHPNFSWSGNQGGNNAGQMVAQKPSEG
LFALLEKLMKQYMADNEATRDGKEQCKALTLQRGKSITSGISKCSKAVNEVEQGEFQPAKDSEPAEIVSPTPPVQIVEQQREIENSSSEEVNPVNIKASNVGSTQ
TKVPKKRKQARYEDAPAEYRPTPPYPMRLQKKEHNILLMKFLGLLKQLHVNTLVEALEQIPNYNDADNVEEDFEGIYLDSMNSSERSAKHMHESLNLANHEYKLL
KSSIEEPLVLELKALVQHLKYAYLGIEVLNVEDDEPWFAELANYIGSGILPSDMNKQQLKNKPGADEMPFSHYGGHFGGQRITAKVEIRLKVVYKKMWAAEVMSL
AKTEKEGSSEEFQEARQTIPLPEASGERSSVQQTPTSSEHTPADSELLPLLPSVASSQVSAPDPSAPLQQIQAKNPPSSVENFLSFSPAKSTAPPTIPTTESPST
HENLPLFPPKWLSHKPPPTPLPSPLSPHP