; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc09g14460 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc09g14460
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionUnknown protein
Genome locationchr9:12451519..12453153
RNA-Seq ExpressionMoc09g14460
SyntenyMoc09g14460
Gene Ontology termsNA
InterPro domainsIPR005162 - Retrotransposon gag domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022137317.1 uncharacterized protein LOC111008813 [Momordica charantia]6.7e-4029.44Show/hide
Query:  SSRNLPRLTGKL---QFDQLRREL---VSELKGDLKKAGRTTEDDEPDEPPFVQDILDAPISQKSRLPTFDKYDGTKNPINHVETYESIMDFHT------
        SSRN     G +   +FDQLR +L   V  LK   ++      D +  E PF  D+L+API  K + PT   YDG+K+P ++VE +ES+MDF        
Subjt:  SSRNLPRLTGKL---QFDQLRREL---VSELKGDLKKAGRTTEDDEPDEPPFVQDILDAPISQKSRLPTFDKYDGTKNPINHVETYESIMDFHT------

Query:  ----------------------SILSWKQLRKAFIAQFAAHKDAKHSTTYLFSIQQKPKEPIRDYIKCFLSEQIK-------------------------
                              SI ++ QLR+ F+A F++    K + T+L +I+QK  E +R+Y+  F  EQ+K                         
Subjt:  ----------------------SILSWKQLRKAFIAQFAAHKDAKHSTTYLFSIQQKPKEPIRDYIKCFLSEQIK-------------------------

Query:  -------------------------------------------------------------------------------------------------DSN
                                                                                                         +S 
Subjt:  -------------------------------------------------------------------------------------------------DSN

Query:  LD-LLVEPRKMQKDPEQRDK-------------------------------------------SNRKDQKRERSKTPPKREDRPLVINTIHGGPSGGQSG
        ++ LL  P K++  PE+R K                                           S  K ++R+RS+TPP+R DRP VINTI GGPSGGQSG
Subjt:  LD-LLVEPRKMQKDPEQRDK-------------------------------------------SNRKDQKRERSKTPPKREDRPLVINTIHGGPSGGQSG

Query:  HKRKELAKEASHEICMLQPEQQSCPIGFSDADLQ--HFPHNDALVISPLIDHIIVRRALIDGGASTNILTLSTYKALGW
         KRKELA+ A  E+C+++ ++ +CPI F  ADL+  H PHNDALVI+PLIDH++V R L+DGG S NIL+L TY ALGW
Subjt:  HKRKELAKEASHEICMLQPEQQSCPIGFSDADLQ--HFPHNDALVISPLIDHIIVRRALIDGGASTNILTLSTYKALGW

XP_022144033.1 uncharacterized protein LOC111013825 [Momordica charantia]8.1e-4638.26Show/hide
Query:  QFDQLRREL---VSELKGDLKKAGRTTEDDEPDEPPFVQDILDAPISQKSRLPTFDKYDGTKNPINHVETYESIMDFHTSILSWKQLRKAF-IAQFAAHK
        +FDQL+ +    V  LK   +K G T +D +  E PF  DIL+API  K + PT + YDG+K+P ++VE  E +M+F  +I   K    AF IA   +H 
Subjt:  QFDQLRREL---VSELKGDLKKAGRTTEDDEPDEPPFVQDILDAPISQKSRLPTFDKYDGTKNPINHVETYESIMDFHTSILSWKQLRKAF-IAQFAAHK

Query:  DAKHSTTYLFSIQQKPKEPIRDYIKCFLSEQIK------------------DSNLDLLVE--------------------------------PRKMQKDP
        D K + T+L  I+QK    +R+Y+  F  +Q+K                  D  L + +E                                 R  +K+P
Subjt:  DAKHSTTYLFSIQQKPKEPIRDYIKCFLSEQIK------------------DSNLDLLVE--------------------------------PRKMQKDP

Query:  EQ------------------------RDKSNRKDQKRERSKTPPKREDRPLVINTIHGGPSGGQSGHKRKELAKEASHEICMLQPEQQSCPIGFSDADLQ
         Q                        R  S  K ++R+RS+TPP+R+DRP VINTI GGPSGGQ G+KRKELA+EA  E+C+++ ++ +  I F D DL+
Subjt:  EQ------------------------RDKSNRKDQKRERSKTPPKREDRPLVINTIHGGPSGGQSGHKRKELAKEASHEICMLQPEQQSCPIGFSDADLQ

Query:  --HFPHNDALVISPLIDHIIVRRALIDGGASTNILTLSTYKALGW
          H PHNDALVI+PLIDH++VRR L+DGGAS NIL+L TY  LGW
Subjt:  --HFPHNDALVISPLIDHIIVRRALIDGGASTNILTLSTYKALGW

XP_022149836.1 uncharacterized protein LOC111018172 [Momordica charantia]1.7e-7550.66Show/hide
Query:  LRRELVSELKGDLKKAGRTTEDDEPDEPPFVQDILDAPISQKSRLPTFDKYDGTKNPINHVETYESIMDFHT----------------------------
        LRRELVSELKG+LKK  R TEDDEPDEPPFVQDILDAPISQKS  PTF+KYDGTK+P++HVETYE IMDFH                             
Subjt:  LRRELVSELKGDLKKAGRTTEDDEPDEPPFVQDILDAPISQKSRLPTFDKYDGTKNPINHVETYESIMDFHT----------------------------

Query:  SILSWKQLRKAFIAQFAAHKDAKHSTTYLFSIQQKPKEPIRDYIKCFLSEQIK-DSNLDLLV--------------------------------------
        SI S KQLRKAFIAQFAAHKDAKHS T              DYIK FLSEQIK ++  DLL                                       
Subjt:  SILSWKQLRKAFIAQFAAHKDAKHSTTYLFSIQQKPKEPIRDYIKCFLSEQIK-DSNLDLLV--------------------------------------

Query:  ---------------EPRKMQKDPE----------------------------QRDK-------SNRKDQKRERSKTPPKREDRPLVINTIHGGPSGGQS
                        P K++K+ +                            +RD+       S+RK+QKRERSKT PKRED+P +I+TI GG S GQS
Subjt:  ---------------EPRKMQKDPE----------------------------QRDK-------SNRKDQKRERSKTPPKREDRPLVINTIHGGPSGGQS

Query:  GHKRKELAKEASHEICMLQPEQQSCPIGFSDADLQ--HFPHNDALVISPLIDHIIVRRALIDGGASTNILTLSTYKALGWG
        GHKRKELA EA HE+CMLQP+QQS PI FSD DLQ  HFPHNDALVI+ LIDHI +RR LIDG ASTNIL+LSTYKAL WG
Subjt:  GHKRKELAKEASHEICMLQPEQQSCPIGFSDADLQ--HFPHNDALVISPLIDHIIVRRALIDGGASTNILTLSTYKALGWG

XP_022155139.1 uncharacterized protein LOC111022280 [Momordica charantia]2.5e-3933.6Show/hide
Query:  QFDQLRR---ELVSELKGDLKKAGRTTEDDEPDEPPFVQDILDAPISQKSRLPTFDKYDGTKNPINHVETYESIMDFHT---------------------
        +FD ++    E V  LK   +K     +DD+  E PF  DI++API  K + PT   YDG+K+P ++VE +E +MDF                       
Subjt:  QFDQLRR---ELVSELKGDLKKAGRTTEDDEPDEPPFVQDILDAPISQKSRLPTFDKYDGTKNPINHVETYESIMDFHT---------------------

Query:  -------SILSWKQLRKAFIAQFAAHKDAKHSTTYLFSIQQKPKEPI-----------------------------------------------------
               SI ++ QLRK FI QF+     + + T+L +I+QK  E +                                                     
Subjt:  -------SILSWKQLRKAFIAQFAAHKDAKHSTTYLFSIQQKPKEPI-----------------------------------------------------

Query:  ---------------------------RDYIKCF-LSEQIKDSNLDLLVEPRKMQK-DPEQRDKSNRKDQKRERSKTPPKREDRPLVINTIHGGPSGGQS
                                   R Y +C+ L  QI+D     L++    +K   + R  S  K ++R+RS+TPP+REDRP VINTI GGPSGGQ 
Subjt:  ---------------------------RDYIKCF-LSEQIKDSNLDLLVEPRKMQK-DPEQRDKSNRKDQKRERSKTPPKREDRPLVINTIHGGPSGGQS

Query:  GHKRKELAKEASHEICMLQPEQQSCPIGFSDADLQ--HFPHNDALVISPLIDHIIVRRALIDGGASTNILTLSTYKAL
         +KRKELA EA  ++ +++ ++ +C I F D DL+  H PHNDALVI+PLIDH++VRR L+DGGAS NIL+L TY AL
Subjt:  GHKRKELAKEASHEICMLQPEQQSCPIGFSDADLQ--HFPHNDALVISPLIDHIIVRRALIDGGASTNILTLSTYKAL

XP_022156088.1 uncharacterized protein LOC111023060 [Momordica charantia]2.6e-4434.68Show/hide
Query:  QFDQLRREL---VSELKGDLKKAGRTTEDDEPDEPPFVQDILDAPISQKSRLPTFDKYDGTKNPINHVETYESIMDFHT---------------------
        +F+QL+ +    V  LK   +K     +D +  E PF  DIL+A I  K + PT   YDG+K+P ++VE +E +MDF                       
Subjt:  QFDQLRREL---VSELKGDLKKAGRTTEDDEPDEPPFVQDILDAPISQKSRLPTFDKYDGTKNPINHVETYESIMDFHT---------------------

Query:  -------SILSWKQLRKAFIAQFAAHKDAKHSTTYLFSIQQKPKEPIRDYIKCFLSEQIK------DSNL------------------------------
               SI ++ QLRK FI+QF +    + +TT+L +I+QK  + +++YI  F  EQ+K      DS++                              
Subjt:  -------SILSWKQLRKAFIAQFAAHKDAKHSTTYLFSIQQKPKEPIRDYIKCFLSEQIK------DSNL------------------------------

Query:  --------DLL----------VEPRKMQKDPEQRDKSNR--------------------KDQKRERSKTPPKREDRPLVINTIHGGPSGGQSGHKRKELA
                +LL          ++ +K  +D  + D  ++                    K ++R+RS+TPP+ +DRP VINTI GGPSGGQSG+KRKELA
Subjt:  --------DLL----------VEPRKMQKDPEQRDKSNR--------------------KDQKRERSKTPPKREDRPLVINTIHGGPSGGQSGHKRKELA

Query:  KEASHEICMLQPEQQSCPIGFSDADLQ--HFPHNDALVISPLIDHIIVRRALIDGGASTNILTLSTYKALGW
        +EAS E+C+++ ++ +C + F D+DL+  H P+NDALVI+PLIDH++VRR L+DGGAS NIL+L    ALGW
Subjt:  KEASHEICMLQPEQQSCPIGFSDADLQ--HFPHNDALVISPLIDHIIVRRALIDGGASTNILTLSTYKALGW

TrEMBL top hitse value%identityAlignment
A0A6J1C7X5 uncharacterized protein LOC1110088133.2e-4029.44Show/hide
Query:  SSRNLPRLTGKL---QFDQLRREL---VSELKGDLKKAGRTTEDDEPDEPPFVQDILDAPISQKSRLPTFDKYDGTKNPINHVETYESIMDFHT------
        SSRN     G +   +FDQLR +L   V  LK   ++      D +  E PF  D+L+API  K + PT   YDG+K+P ++VE +ES+MDF        
Subjt:  SSRNLPRLTGKL---QFDQLRREL---VSELKGDLKKAGRTTEDDEPDEPPFVQDILDAPISQKSRLPTFDKYDGTKNPINHVETYESIMDFHT------

Query:  ----------------------SILSWKQLRKAFIAQFAAHKDAKHSTTYLFSIQQKPKEPIRDYIKCFLSEQIK-------------------------
                              SI ++ QLR+ F+A F++    K + T+L +I+QK  E +R+Y+  F  EQ+K                         
Subjt:  ----------------------SILSWKQLRKAFIAQFAAHKDAKHSTTYLFSIQQKPKEPIRDYIKCFLSEQIK-------------------------

Query:  -------------------------------------------------------------------------------------------------DSN
                                                                                                         +S 
Subjt:  -------------------------------------------------------------------------------------------------DSN

Query:  LD-LLVEPRKMQKDPEQRDK-------------------------------------------SNRKDQKRERSKTPPKREDRPLVINTIHGGPSGGQSG
        ++ LL  P K++  PE+R K                                           S  K ++R+RS+TPP+R DRP VINTI GGPSGGQSG
Subjt:  LD-LLVEPRKMQKDPEQRDK-------------------------------------------SNRKDQKRERSKTPPKREDRPLVINTIHGGPSGGQSG

Query:  HKRKELAKEASHEICMLQPEQQSCPIGFSDADLQ--HFPHNDALVISPLIDHIIVRRALIDGGASTNILTLSTYKALGW
         KRKELA+ A  E+C+++ ++ +CPI F  ADL+  H PHNDALVI+PLIDH++V R L+DGG S NIL+L TY ALGW
Subjt:  HKRKELAKEASHEICMLQPEQQSCPIGFSDADLQ--HFPHNDALVISPLIDHIIVRRALIDGGASTNILTLSTYKALGW

A0A6J1CS66 uncharacterized protein LOC1110138253.9e-4638.26Show/hide
Query:  QFDQLRREL---VSELKGDLKKAGRTTEDDEPDEPPFVQDILDAPISQKSRLPTFDKYDGTKNPINHVETYESIMDFHTSILSWKQLRKAF-IAQFAAHK
        +FDQL+ +    V  LK   +K G T +D +  E PF  DIL+API  K + PT + YDG+K+P ++VE  E +M+F  +I   K    AF IA   +H 
Subjt:  QFDQLRREL---VSELKGDLKKAGRTTEDDEPDEPPFVQDILDAPISQKSRLPTFDKYDGTKNPINHVETYESIMDFHTSILSWKQLRKAF-IAQFAAHK

Query:  DAKHSTTYLFSIQQKPKEPIRDYIKCFLSEQIK------------------DSNLDLLVE--------------------------------PRKMQKDP
        D K + T+L  I+QK    +R+Y+  F  +Q+K                  D  L + +E                                 R  +K+P
Subjt:  DAKHSTTYLFSIQQKPKEPIRDYIKCFLSEQIK------------------DSNLDLLVE--------------------------------PRKMQKDP

Query:  EQ------------------------RDKSNRKDQKRERSKTPPKREDRPLVINTIHGGPSGGQSGHKRKELAKEASHEICMLQPEQQSCPIGFSDADLQ
         Q                        R  S  K ++R+RS+TPP+R+DRP VINTI GGPSGGQ G+KRKELA+EA  E+C+++ ++ +  I F D DL+
Subjt:  EQ------------------------RDKSNRKDQKRERSKTPPKREDRPLVINTIHGGPSGGQSGHKRKELAKEASHEICMLQPEQQSCPIGFSDADLQ

Query:  --HFPHNDALVISPLIDHIIVRRALIDGGASTNILTLSTYKALGW
          H PHNDALVI+PLIDH++VRR L+DGGAS NIL+L TY  LGW
Subjt:  --HFPHNDALVISPLIDHIIVRRALIDGGASTNILTLSTYKALGW

A0A6J1D9M1 uncharacterized protein LOC1110181728.1e-7650.66Show/hide
Query:  LRRELVSELKGDLKKAGRTTEDDEPDEPPFVQDILDAPISQKSRLPTFDKYDGTKNPINHVETYESIMDFHT----------------------------
        LRRELVSELKG+LKK  R TEDDEPDEPPFVQDILDAPISQKS  PTF+KYDGTK+P++HVETYE IMDFH                             
Subjt:  LRRELVSELKGDLKKAGRTTEDDEPDEPPFVQDILDAPISQKSRLPTFDKYDGTKNPINHVETYESIMDFHT----------------------------

Query:  SILSWKQLRKAFIAQFAAHKDAKHSTTYLFSIQQKPKEPIRDYIKCFLSEQIK-DSNLDLLV--------------------------------------
        SI S KQLRKAFIAQFAAHKDAKHS T              DYIK FLSEQIK ++  DLL                                       
Subjt:  SILSWKQLRKAFIAQFAAHKDAKHSTTYLFSIQQKPKEPIRDYIKCFLSEQIK-DSNLDLLV--------------------------------------

Query:  ---------------EPRKMQKDPE----------------------------QRDK-------SNRKDQKRERSKTPPKREDRPLVINTIHGGPSGGQS
                        P K++K+ +                            +RD+       S+RK+QKRERSKT PKRED+P +I+TI GG S GQS
Subjt:  ---------------EPRKMQKDPE----------------------------QRDK-------SNRKDQKRERSKTPPKREDRPLVINTIHGGPSGGQS

Query:  GHKRKELAKEASHEICMLQPEQQSCPIGFSDADLQ--HFPHNDALVISPLIDHIIVRRALIDGGASTNILTLSTYKALGWG
        GHKRKELA EA HE+CMLQP+QQS PI FSD DLQ  HFPHNDALVI+ LIDHI +RR LIDG ASTNIL+LSTYKAL WG
Subjt:  GHKRKELAKEASHEICMLQPEQQSCPIGFSDADLQ--HFPHNDALVISPLIDHIIVRRALIDGGASTNILTLSTYKALGWG

A0A6J1DPC9 uncharacterized protein LOC1110222801.2e-3933.6Show/hide
Query:  QFDQLRR---ELVSELKGDLKKAGRTTEDDEPDEPPFVQDILDAPISQKSRLPTFDKYDGTKNPINHVETYESIMDFHT---------------------
        +FD ++    E V  LK   +K     +DD+  E PF  DI++API  K + PT   YDG+K+P ++VE +E +MDF                       
Subjt:  QFDQLRR---ELVSELKGDLKKAGRTTEDDEPDEPPFVQDILDAPISQKSRLPTFDKYDGTKNPINHVETYESIMDFHT---------------------

Query:  -------SILSWKQLRKAFIAQFAAHKDAKHSTTYLFSIQQKPKEPI-----------------------------------------------------
               SI ++ QLRK FI QF+     + + T+L +I+QK  E +                                                     
Subjt:  -------SILSWKQLRKAFIAQFAAHKDAKHSTTYLFSIQQKPKEPI-----------------------------------------------------

Query:  ---------------------------RDYIKCF-LSEQIKDSNLDLLVEPRKMQK-DPEQRDKSNRKDQKRERSKTPPKREDRPLVINTIHGGPSGGQS
                                   R Y +C+ L  QI+D     L++    +K   + R  S  K ++R+RS+TPP+REDRP VINTI GGPSGGQ 
Subjt:  ---------------------------RDYIKCF-LSEQIKDSNLDLLVEPRKMQK-DPEQRDKSNRKDQKRERSKTPPKREDRPLVINTIHGGPSGGQS

Query:  GHKRKELAKEASHEICMLQPEQQSCPIGFSDADLQ--HFPHNDALVISPLIDHIIVRRALIDGGASTNILTLSTYKAL
         +KRKELA EA  ++ +++ ++ +C I F D DL+  H PHNDALVI+PLIDH++VRR L+DGGAS NIL+L TY AL
Subjt:  GHKRKELAKEASHEICMLQPEQQSCPIGFSDADLQ--HFPHNDALVISPLIDHIIVRRALIDGGASTNILTLSTYKAL

A0A6J1DPN4 uncharacterized protein LOC1110230601.3e-4434.68Show/hide
Query:  QFDQLRREL---VSELKGDLKKAGRTTEDDEPDEPPFVQDILDAPISQKSRLPTFDKYDGTKNPINHVETYESIMDFHT---------------------
        +F+QL+ +    V  LK   +K     +D +  E PF  DIL+A I  K + PT   YDG+K+P ++VE +E +MDF                       
Subjt:  QFDQLRREL---VSELKGDLKKAGRTTEDDEPDEPPFVQDILDAPISQKSRLPTFDKYDGTKNPINHVETYESIMDFHT---------------------

Query:  -------SILSWKQLRKAFIAQFAAHKDAKHSTTYLFSIQQKPKEPIRDYIKCFLSEQIK------DSNL------------------------------
               SI ++ QLRK FI+QF +    + +TT+L +I+QK  + +++YI  F  EQ+K      DS++                              
Subjt:  -------SILSWKQLRKAFIAQFAAHKDAKHSTTYLFSIQQKPKEPIRDYIKCFLSEQIK------DSNL------------------------------

Query:  --------DLL----------VEPRKMQKDPEQRDKSNR--------------------KDQKRERSKTPPKREDRPLVINTIHGGPSGGQSGHKRKELA
                +LL          ++ +K  +D  + D  ++                    K ++R+RS+TPP+ +DRP VINTI GGPSGGQSG+KRKELA
Subjt:  --------DLL----------VEPRKMQKDPEQRDKSNR--------------------KDQKRERSKTPPKREDRPLVINTIHGGPSGGQSGHKRKELA

Query:  KEASHEICMLQPEQQSCPIGFSDADLQ--HFPHNDALVISPLIDHIIVRRALIDGGASTNILTLSTYKALGW
        +EAS E+C+++ ++ +C + F D+DL+  H P+NDALVI+PLIDH++VRR L+DGGAS NIL+L    ALGW
Subjt:  KEASHEICMLQPEQQSCPIGFSDADLQ--HFPHNDALVISPLIDHIIVRRALIDGGASTNILTLSTYKALGW

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGACCCAAGAAGGGAAAAAAAAGGTGGACGAGTACCTCCAGAAGAAAATAGCTGAGGAACCGATATCAAAGCTGGTCATAGGGGCAATAACCTACCAAGACCGGAAAAT
TACCAACCTGTGCTCTTCCCGCAACCTTCCAAGGTTGACAGGCAAACTACAGTTCGACCAACTAAGAAGAGAGCTGGTGAGCGAGTTGAAAGGGGATCTGAAGAAAGCAG
GGAGGACAACAGAGGACGACGAGCCTGACGAGCCGCCTTTCGTCCAGGACATCCTAGATGCCCCCATCTCGCAAAAGAGCAGACTTCCAACCTTCGACAAATACGATGGG
ACGAAGAATCCTATCAACCATGTGGAGACCTATGAGTCCATCATGGACTTTCACACCTCAATTTTGAGCTGGAAACAGCTTAGGAAGGCTTTCATAGCCCAGTTCGCAGC
CCACAAGGACGCGAAGCATTCAACCACGTATCTCTTCTCAATTCAACAGAAGCCAAAGGAACCTATCAGGGACTACATTAAGTGCTTCCTCTCGGAGCAGATCAAGGACT
CAAATCTGGATTTACTAGTAGAGCCGAGGAAGATGCAGAAAGATCCTGAGCAGCGTGATAAGTCAAACAGGAAAGATCAGAAAAGAGAAAGGTCTAAAACCCCACCCAAG
CGTGAAGACCGACCTCTAGTGATCAACACAATTCACGGGGGGCCAAGTGGTGGCCAGTCAGGTCACAAGAGGAAAGAGCTGGCCAAAGAGGCTAGCCATGAGATTTGCAT
GTTGCAACCAGAGCAGCAGTCTTGCCCAATTGGCTTCTCTGATGCCGACCTCCAGCACTTCCCTCACAATGACGCATTGGTCATTTCCCCCCTAATTGATCACATCATCG
TGCGTAGGGCCCTCATAGATGGTGGCGCCTCAACAAATATTCTTACCCTGTCCACCTACAAAGCGCTCGGATGGGGCTGA
mRNA sequenceShow/hide mRNA sequence
ATGACCCAAGAAGGGAAAAAAAAGGTGGACGAGTACCTCCAGAAGAAAATAGCTGAGGAACCGATATCAAAGCTGGTCATAGGGGCAATAACCTACCAAGACCGGAAAAT
TACCAACCTGTGCTCTTCCCGCAACCTTCCAAGGTTGACAGGCAAACTACAGTTCGACCAACTAAGAAGAGAGCTGGTGAGCGAGTTGAAAGGGGATCTGAAGAAAGCAG
GGAGGACAACAGAGGACGACGAGCCTGACGAGCCGCCTTTCGTCCAGGACATCCTAGATGCCCCCATCTCGCAAAAGAGCAGACTTCCAACCTTCGACAAATACGATGGG
ACGAAGAATCCTATCAACCATGTGGAGACCTATGAGTCCATCATGGACTTTCACACCTCAATTTTGAGCTGGAAACAGCTTAGGAAGGCTTTCATAGCCCAGTTCGCAGC
CCACAAGGACGCGAAGCATTCAACCACGTATCTCTTCTCAATTCAACAGAAGCCAAAGGAACCTATCAGGGACTACATTAAGTGCTTCCTCTCGGAGCAGATCAAGGACT
CAAATCTGGATTTACTAGTAGAGCCGAGGAAGATGCAGAAAGATCCTGAGCAGCGTGATAAGTCAAACAGGAAAGATCAGAAAAGAGAAAGGTCTAAAACCCCACCCAAG
CGTGAAGACCGACCTCTAGTGATCAACACAATTCACGGGGGGCCAAGTGGTGGCCAGTCAGGTCACAAGAGGAAAGAGCTGGCCAAAGAGGCTAGCCATGAGATTTGCAT
GTTGCAACCAGAGCAGCAGTCTTGCCCAATTGGCTTCTCTGATGCCGACCTCCAGCACTTCCCTCACAATGACGCATTGGTCATTTCCCCCCTAATTGATCACATCATCG
TGCGTAGGGCCCTCATAGATGGTGGCGCCTCAACAAATATTCTTACCCTGTCCACCTACAAAGCGCTCGGATGGGGCTGA
Protein sequenceShow/hide protein sequence
MTQEGKKKVDEYLQKKIAEEPISKLVIGAITYQDRKITNLCSSRNLPRLTGKLQFDQLRRELVSELKGDLKKAGRTTEDDEPDEPPFVQDILDAPISQKSRLPTFDKYDG
TKNPINHVETYESIMDFHTSILSWKQLRKAFIAQFAAHKDAKHSTTYLFSIQQKPKEPIRDYIKCFLSEQIKDSNLDLLVEPRKMQKDPEQRDKSNRKDQKRERSKTPPK
REDRPLVINTIHGGPSGGQSGHKRKELAKEASHEICMLQPEQQSCPIGFSDADLQHFPHNDALVISPLIDHIIVRRALIDGGASTNILTLSTYKALGWG