; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc04g15470 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc04g15470
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionRibonuclease H
Genome locationchr4:11653312..11656257
RNA-Seq ExpressionMoc04g15470
SyntenyMoc04g15470
Gene Ontology termsNA
InterPro domainsIPR005162 - Retrotransposon gag domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022141796.1 uncharacterized protein LOC111012081 [Momordica charantia]1.0e-4234.19Show/hide
Query:  PSKMKKPDMKPYDGSTDPSDHIDLYEGLMELNATGDEMKCRAFFITLKSQARSSYRQLKLKTVGSWKQFRKIFISQFSVQHDRKTPDTHLLTNYQLEDES
        P K K P +KPYDGS DP D+++++EGLM+  AT D +KCRAF I L   AR  YR+L  +++ ++ Q R+ F++QFS +H  K   THL T  Q E E+
Subjt:  PSKMKKPDMKPYDGSTDPSDHIDLYEGLMELNATGDEMKCRAFFITLKSQARSSYRQLKLKTVGSWKQFRKIFISQFSVQHDRKTPDTHLLTNYQLEDES

Query:  LRDFVQRFMAEKIKVTDCLDDAAR---------------------MTFTSTINN----LDLHQTIISRHAK-----ARGQSSQP----------------
        LR++V RF  E++KV  C DD+A                       TFT  +      +D H+ + ++  +     +RG+S +                 
Subjt:  LRDFVQRFMAEKIKVTDCLDDAAR---------------------MTFTSTINN----LDLHQTIISRHAK-----ARGQSSQP----------------

Query:  -----KRVRVHDRRLREFDHYTPLNVPLSEILANIENSDLHMMLEKLGKMKSSPDKWSKSKYYRFHRDYGHDTLQCFDLRDQVENLIGRGHLKKYVGKKD
             +R      R R ++ +TP  +P+ EIL  IE S +  +L++  K++   ++ SK KY RFHR++GH+T  C++L+ Q+E+LI  G+ KK+VGK  
Subjt:  -----KRVRVHDRRLREFDHYTPLNVPLSEILANIENSDLHMMLEKLGKMKSSPDKWSKSKYYRFHRDYGHDTLQCFDLRDQVENLIGRGHLKKYVGKKD

Query:  SCSSQGKRKFYRT
        + S++ K +  R+
Subjt:  SCSSQGKRKFYRT

XP_022150613.1 uncharacterized protein LOC111018708, partial [Momordica charantia]1.3e-4233.87Show/hide
Query:  PSKMKKPDMKPYDGSTDPSDHIDLYEGLMELNATGDEMKCRAFFITLKSQARSSYRQLKLKTVGSWKQFRKIFISQFSVQHDRKTPDTHLLTNYQLEDES
        P K K P +KPYDGS DP D+++++EGLM+ +A  D +KCRAF I L   AR  YR+L  +++ ++ Q R+ F++QFS +   K  +THL T  Q E  +
Subjt:  PSKMKKPDMKPYDGSTDPSDHIDLYEGLMELNATGDEMKCRAFFITLKSQARSSYRQLKLKTVGSWKQFRKIFISQFSVQHDRKTPDTHLLTNYQLEDES

Query:  LRDFVQRFMAEKIKVTDCLDDAARMTFTS----------------------------TINNLDLHQTIISRHAKARGQSSQPKRVRVHD-----------
        LR++V RF  E++KV  C DD+A   F +                             I+  +L +T   R  +  G+    K V   D           
Subjt:  LRDFVQRFMAEKIKVTDCLDDAARMTFTS----------------------------TINNLDLHQTIISRHAKARGQSSQPKRVRVHD-----------

Query:  ------------RRLREFDHYTPLNVPLSEILANIENSDLHMMLEKLGKMKSSPDKWSKSKYYRFHRDYGHDTLQCFDLRDQVENLIGRGHLKKYVGKKD
                     + R ++ +TP  +P+SEIL NIE S +  +L++  K++ +P++ SK KY RFHR++GH+T  C++L+ Q+E+LI  G+ KK+VGK  
Subjt:  ------------RRLREFDHYTPLNVPLSEILANIENSDLHMMLEKLGKMKSSPDKWSKSKYYRFHRDYGHDTLQCFDLRDQVENLIGRGHLKKYVGKKD

Query:  SCSSQGKRKFYRT
        + S++ K +  R+
Subjt:  SCSSQGKRKFYRT

XP_022151609.1 uncharacterized protein LOC111019523 [Momordica charantia]1.3e-4553.17Show/hide
Query:  PSKMKKPDMKPYDGSTDPSDHIDLYEGLMELNATGDEMKCRAFFITLKSQARSSYRQLKLKTVGSWKQFRKIFISQFSVQHDRKTPDTHLLTNYQLEDES
        P KMKKPDMKPYD   +P DHI+LYEGLMEL+A GD+MKCRAF +TLK +ARS YRQLK K++  W Q RK+FI+QFS QHDRK  DTHLLT +Q E ES
Subjt:  PSKMKKPDMKPYDGSTDPSDHIDLYEGLMELNATGDEMKCRAFFITLKSQARSSYRQLKLKTVGSWKQFRKIFISQFSVQHDRKTPDTHLLTNYQLEDES

Query:  LRDFVQRFMAEKIKVTDCLDDAARMTFTSTINNLDL-----------------------------------HQTIISRHAKARGQSSQPKRVRV--HDRR
        L +FV RF+ EKIKV DC +D ARMTF S IN  +L                                   HQT  S+H +ARGQSSQ K+      DRR
Subjt:  LRDFVQRFMAEKIKVTDCLDDAARMTFTSTINNLDL-----------------------------------HQTIISRHAKARGQSSQPKRVRV--HDRR

Query:  LREFD
           FD
Subjt:  LREFD

XP_022152854.1 uncharacterized protein LOC111020479 [Momordica charantia]1.3e-4237.54Show/hide
Query:  PSKMKKPDMKPYDGSTDPSDHIDLYEGLMELNATGDEMKCRAFFITLKSQARSSYRQLKLKTVGSWKQFRKIFISQFSVQH-DRKTPDTHLLTNYQLEDE
        P K K P MKPYDGS DP D+++++E LM+  A  D +KC AF I L   AR  YR+L  + + ++ Q RK FISQFS +H DRKTP THL T  Q E E
Subjt:  PSKMKKPDMKPYDGSTDPSDHIDLYEGLMELNATGDEMKCRAFFITLKSQARSSYRQLKLKTVGSWKQFRKIFISQFSVQH-DRKTPDTHLLTNYQLEDE

Query:  SLRDFVQRFMAEKIKVTDCLDDAARMTFTS----------------------------TINNLDLHQTIISR-----------------HAKARGQSSQP
        +LR++V RF  E++KV  C DD+A   F +                             I+  +L +T   R                  +K+R +    
Subjt:  SLRDFVQRFMAEKIKVTDCLDDAARMTFTS----------------------------TINNLDLHQTIISR-----------------HAKARGQSSQP

Query:  KRVRVHDRR-------LREFDHYTPLNVPLSEILANIENSDLHMMLEKLGKMKSSPDKWSKSKYYRFHRDYGHDTLQCFDLRDQVENLIGRGHLKKYVGK
           RV  RR        R ++HYTP  +P+ EIL NIE + +  +L++  K++  P+K +  KY RFHRD+GH+T   ++L+ Q+E+LI  G+ KK+VGK
Subjt:  KRVRVHDRR-------LREFDHYTPLNVPLSEILANIENSDLHMMLEKLGKMKSSPDKWSKSKYYRFHRDYGHDTLQCFDLRDQVENLIGRGHLKKYVGK

Query:  KDSCSSQGK--RKFYRT
          S S + K  RK  RT
Subjt:  KDSCSSQGK--RKFYRT

XP_022155997.1 uncharacterized protein LOC111022972 [Momordica charantia]1.8e-4748.03Show/hide
Query:  THLLTNYQLEDESLRDFVQRFMAEKIKVTDCLDDAARMTFTSTINNLDL-----------------------------------HQTIISRHAKARGQSS
        THL+T  Q E ESLR+FV RFM EKIKV D  DD ARMTF   +NNLDL                                   +QT+ S++ ++R   S
Subjt:  THLLTNYQLEDESLRDFVQRFMAEKIKVTDCLDDAARMTFTSTINNLDL-----------------------------------HQTIISRHAKARGQSS

Query:  Q----------------------PKRVRVHDRRLREFDHYTPLNVPLSEILANIENSDLHMMLEKLGKMKSSPDKWSKSKYYRFHRDYGHDTLQCFDLRD
                               P+ V V +RRLREFD YTPLNVP++EILANIE+ DLH MLEK GKMK S DK S++KY RFHRD+ HDT QC+  RD
Subjt:  Q----------------------PKRVRVHDRRLREFDHYTPLNVPLSEILANIENSDLHMMLEKLGKMKSSPDKWSKSKYYRFHRDYGHDTLQCFDLRD

Query:  QVENLIGRGHLKKYVGKKDSCSSQGKRKFYRTDLDDQDKFPSPRKREAAGKRPM
        Q+E+LI + HLKKYVGKKD+CS  GKRK+ R D  D+D   +P+KR+  GKRP+
Subjt:  QVENLIGRGHLKKYVGKKDSCSSQGKRKFYRTDLDDQDKFPSPRKREAAGKRPM

TrEMBL top hitse value%identityAlignment
A0A6J1CKB3 uncharacterized protein LOC1110120815.0e-4334.19Show/hide
Query:  PSKMKKPDMKPYDGSTDPSDHIDLYEGLMELNATGDEMKCRAFFITLKSQARSSYRQLKLKTVGSWKQFRKIFISQFSVQHDRKTPDTHLLTNYQLEDES
        P K K P +KPYDGS DP D+++++EGLM+  AT D +KCRAF I L   AR  YR+L  +++ ++ Q R+ F++QFS +H  K   THL T  Q E E+
Subjt:  PSKMKKPDMKPYDGSTDPSDHIDLYEGLMELNATGDEMKCRAFFITLKSQARSSYRQLKLKTVGSWKQFRKIFISQFSVQHDRKTPDTHLLTNYQLEDES

Query:  LRDFVQRFMAEKIKVTDCLDDAAR---------------------MTFTSTINN----LDLHQTIISRHAK-----ARGQSSQP----------------
        LR++V RF  E++KV  C DD+A                       TFT  +      +D H+ + ++  +     +RG+S +                 
Subjt:  LRDFVQRFMAEKIKVTDCLDDAAR---------------------MTFTSTINN----LDLHQTIISRHAK-----ARGQSSQP----------------

Query:  -----KRVRVHDRRLREFDHYTPLNVPLSEILANIENSDLHMMLEKLGKMKSSPDKWSKSKYYRFHRDYGHDTLQCFDLRDQVENLIGRGHLKKYVGKKD
             +R      R R ++ +TP  +P+ EIL  IE S +  +L++  K++   ++ SK KY RFHR++GH+T  C++L+ Q+E+LI  G+ KK+VGK  
Subjt:  -----KRVRVHDRRLREFDHYTPLNVPLSEILANIENSDLHMMLEKLGKMKSSPDKWSKSKYYRFHRDYGHDTLQCFDLRDQVENLIGRGHLKKYVGKKD

Query:  SCSSQGKRKFYRT
        + S++ K +  R+
Subjt:  SCSSQGKRKFYRT

A0A6J1D9W7 uncharacterized protein LOC1110187086.5e-4333.87Show/hide
Query:  PSKMKKPDMKPYDGSTDPSDHIDLYEGLMELNATGDEMKCRAFFITLKSQARSSYRQLKLKTVGSWKQFRKIFISQFSVQHDRKTPDTHLLTNYQLEDES
        P K K P +KPYDGS DP D+++++EGLM+ +A  D +KCRAF I L   AR  YR+L  +++ ++ Q R+ F++QFS +   K  +THL T  Q E  +
Subjt:  PSKMKKPDMKPYDGSTDPSDHIDLYEGLMELNATGDEMKCRAFFITLKSQARSSYRQLKLKTVGSWKQFRKIFISQFSVQHDRKTPDTHLLTNYQLEDES

Query:  LRDFVQRFMAEKIKVTDCLDDAARMTFTS----------------------------TINNLDLHQTIISRHAKARGQSSQPKRVRVHD-----------
        LR++V RF  E++KV  C DD+A   F +                             I+  +L +T   R  +  G+    K V   D           
Subjt:  LRDFVQRFMAEKIKVTDCLDDAARMTFTS----------------------------TINNLDLHQTIISRHAKARGQSSQPKRVRVHD-----------

Query:  ------------RRLREFDHYTPLNVPLSEILANIENSDLHMMLEKLGKMKSSPDKWSKSKYYRFHRDYGHDTLQCFDLRDQVENLIGRGHLKKYVGKKD
                     + R ++ +TP  +P+SEIL NIE S +  +L++  K++ +P++ SK KY RFHR++GH+T  C++L+ Q+E+LI  G+ KK+VGK  
Subjt:  ------------RRLREFDHYTPLNVPLSEILANIENSDLHMMLEKLGKMKSSPDKWSKSKYYRFHRDYGHDTLQCFDLRDQVENLIGRGHLKKYVGKKD

Query:  SCSSQGKRKFYRT
        + S++ K +  R+
Subjt:  SCSSQGKRKFYRT

A0A6J1DDJ7 uncharacterized protein LOC1110195236.3e-4653.17Show/hide
Query:  PSKMKKPDMKPYDGSTDPSDHIDLYEGLMELNATGDEMKCRAFFITLKSQARSSYRQLKLKTVGSWKQFRKIFISQFSVQHDRKTPDTHLLTNYQLEDES
        P KMKKPDMKPYD   +P DHI+LYEGLMEL+A GD+MKCRAF +TLK +ARS YRQLK K++  W Q RK+FI+QFS QHDRK  DTHLLT +Q E ES
Subjt:  PSKMKKPDMKPYDGSTDPSDHIDLYEGLMELNATGDEMKCRAFFITLKSQARSSYRQLKLKTVGSWKQFRKIFISQFSVQHDRKTPDTHLLTNYQLEDES

Query:  LRDFVQRFMAEKIKVTDCLDDAARMTFTSTINNLDL-----------------------------------HQTIISRHAKARGQSSQPKRVRV--HDRR
        L +FV RF+ EKIKV DC +D ARMTF S IN  +L                                   HQT  S+H +ARGQSSQ K+      DRR
Subjt:  LRDFVQRFMAEKIKVTDCLDDAARMTFTSTINNLDL-----------------------------------HQTIISRHAKARGQSSQPKRVRV--HDRR

Query:  LREFD
           FD
Subjt:  LREFD

A0A6J1DHB3 uncharacterized protein LOC1110204796.5e-4337.54Show/hide
Query:  PSKMKKPDMKPYDGSTDPSDHIDLYEGLMELNATGDEMKCRAFFITLKSQARSSYRQLKLKTVGSWKQFRKIFISQFSVQH-DRKTPDTHLLTNYQLEDE
        P K K P MKPYDGS DP D+++++E LM+  A  D +KC AF I L   AR  YR+L  + + ++ Q RK FISQFS +H DRKTP THL T  Q E E
Subjt:  PSKMKKPDMKPYDGSTDPSDHIDLYEGLMELNATGDEMKCRAFFITLKSQARSSYRQLKLKTVGSWKQFRKIFISQFSVQH-DRKTPDTHLLTNYQLEDE

Query:  SLRDFVQRFMAEKIKVTDCLDDAARMTFTS----------------------------TINNLDLHQTIISR-----------------HAKARGQSSQP
        +LR++V RF  E++KV  C DD+A   F +                             I+  +L +T   R                  +K+R +    
Subjt:  SLRDFVQRFMAEKIKVTDCLDDAARMTFTS----------------------------TINNLDLHQTIISR-----------------HAKARGQSSQP

Query:  KRVRVHDRR-------LREFDHYTPLNVPLSEILANIENSDLHMMLEKLGKMKSSPDKWSKSKYYRFHRDYGHDTLQCFDLRDQVENLIGRGHLKKYVGK
           RV  RR        R ++HYTP  +P+ EIL NIE + +  +L++  K++  P+K +  KY RFHRD+GH+T   ++L+ Q+E+LI  G+ KK+VGK
Subjt:  KRVRVHDRR-------LREFDHYTPLNVPLSEILANIENSDLHMMLEKLGKMKSSPDKWSKSKYYRFHRDYGHDTLQCFDLRDQVENLIGRGHLKKYVGK

Query:  KDSCSSQGK--RKFYRT
          S S + K  RK  RT
Subjt:  KDSCSSQGK--RKFYRT

A0A6J1DQW1 uncharacterized protein LOC1110229728.8e-4848.03Show/hide
Query:  THLLTNYQLEDESLRDFVQRFMAEKIKVTDCLDDAARMTFTSTINNLDL-----------------------------------HQTIISRHAKARGQSS
        THL+T  Q E ESLR+FV RFM EKIKV D  DD ARMTF   +NNLDL                                   +QT+ S++ ++R   S
Subjt:  THLLTNYQLEDESLRDFVQRFMAEKIKVTDCLDDAARMTFTSTINNLDL-----------------------------------HQTIISRHAKARGQSS

Query:  Q----------------------PKRVRVHDRRLREFDHYTPLNVPLSEILANIENSDLHMMLEKLGKMKSSPDKWSKSKYYRFHRDYGHDTLQCFDLRD
                               P+ V V +RRLREFD YTPLNVP++EILANIE+ DLH MLEK GKMK S DK S++KY RFHRD+ HDT QC+  RD
Subjt:  Q----------------------PKRVRVHDRRLREFDHYTPLNVPLSEILANIENSDLHMMLEKLGKMKSSPDKWSKSKYYRFHRDYGHDTLQCFDLRD

Query:  QVENLIGRGHLKKYVGKKDSCSSQGKRKFYRTDLDDQDKFPSPRKREAAGKRPM
        Q+E+LI + HLKKYVGKKD+CS  GKRK+ R D  D+D   +P+KR+  GKRP+
Subjt:  QVENLIGRGHLKKYVGKKDSCSSQGKRKFYRTDLDDQDKFPSPRKREAAGKRPM

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGATCCAAGGGAAGGACTGCGGGTGCATATTACAGCGGAGGCCCCAGGTCTTTGCCAGACGACAAGTGAGTGTAGTGCACCTCAGCAAGCTGGTCAGCAGACTCCGAC
TACACCGAATTGGGGATCTACCCCCACAATAGGTAGGGTCAATTCGAACACAATTCAGTGGCCACCAGCTGCGGCGACCGCCACAACTTCGACATATCAGGCAACGAGCA
GGGTGAACGAGAGGATCCTCATTTTAGGCAACCATAATGACATATCCACTCCATCAAAGATGAAGAAACCCGACATGAAACCGTACGATGGATCAACTGACCCGAGTGAT
CACATCGACCTATATGAAGGCTTGATGGAGTTAAATGCTACAGGAGACGAAATGAAATGCAGGGCGTTCTTCATCACTTTGAAGAGCCAAGCTAGGTCCTCGTACAGACA
ATTAAAGCTTAAAACGGTCGGGTCCTGGAAACAGTTTAGGAAGATCTTTATAAGCCAATTTTCGGTGCAGCATGATAGAAAAACGCCCGATACACATCTCCTCACCAATT
ATCAGTTGGAGGACGAGTCGCTGCGTGACTTCGTCCAGAGATTTATGGCAGAGAAAATCAAAGTCACAGACTGCTTGGATGATGCAGCCAGGATGACTTTCACTTCGACA
ATAAATAATCTCGACCTGCATCAAACAATAATATCTAGGCATGCAAAGGCTCGAGGGCAGAGTTCGCAGCCGAAGAGAGTACGAGTTCATGATCGTCGTTTAAGGGAGTT
CGACCATTACACCCCGCTCAACGTTCCTTTATCAGAGATCCTGGCCAACATCGAGAATAGCGATTTGCATATGATGTTGGAGAAGCTGGGTAAAATGAAGTCGAGTCCGG
ATAAGTGGAGTAAGAGCAAATACTACAGGTTTCATAGAGACTATGGACATGATACTTTGCAGTGCTTTGACTTGAGAGACCAGGTAGAAAACCTGATTGGGCGAGGTCAT
CTGAAAAAGTATGTTGGGAAGAAAGATTCCTGTTCTTCTCAGGGAAAGAGAAAATTCTACCGAACAGATCTTGATGATCAAGACAAATTTCCCTCACCCAGAAAAAGAGA
AGCGGCTGGGAAAAGGCCCATGTCGTCACACCCACCCCCTTTCGAACCTTCTTCTCCGGCGAAGGAGCACCAGCACGGCAACCTGATTACGCTTGTTTACATGCGTAAAA
AGAAGGACCATTTGGGAGTGCAAATTAATCAAAAGAAGCAAAAAGTCGAAAAAAGCATCATGGGAGGCGCCAGGCGCCTGGGAAGCCTGCAGAAAAACTGGTTTTCTTCC
AACTTTGCCCTTAATGAAACGCATCTTCCAATGCGTTTTGGTGGTTCCAACCGATGCATACGTGTAGAAGAAGTGTTCCACTATCAGTTTGAGCACGATTTGGATTTAGA
GAAGGCCACTGTAAGTCTTAACAAGTCAATACTGAAAAGGTTGCTTATTTGCATTGTTGAGTGGTTAAACCGAGGAGAAGAGCGCCTTAGACATAAGAGTTCCGCTTAA
mRNA sequenceShow/hide mRNA sequence
ATGGATCCAAGGGAAGGACTGCGGGTGCATATTACAGCGGAGGCCCCAGGTCTTTGCCAGACGACAAGTGAGTGTAGTGCACCTCAGCAAGCTGGTCAGCAGACTCCGAC
TACACCGAATTGGGGATCTACCCCCACAATAGGTAGGGTCAATTCGAACACAATTCAGTGGCCACCAGCTGCGGCGACCGCCACAACTTCGACATATCAGGCAACGAGCA
GGGTGAACGAGAGGATCCTCATTTTAGGCAACCATAATGACATATCCACTCCATCAAAGATGAAGAAACCCGACATGAAACCGTACGATGGATCAACTGACCCGAGTGAT
CACATCGACCTATATGAAGGCTTGATGGAGTTAAATGCTACAGGAGACGAAATGAAATGCAGGGCGTTCTTCATCACTTTGAAGAGCCAAGCTAGGTCCTCGTACAGACA
ATTAAAGCTTAAAACGGTCGGGTCCTGGAAACAGTTTAGGAAGATCTTTATAAGCCAATTTTCGGTGCAGCATGATAGAAAAACGCCCGATACACATCTCCTCACCAATT
ATCAGTTGGAGGACGAGTCGCTGCGTGACTTCGTCCAGAGATTTATGGCAGAGAAAATCAAAGTCACAGACTGCTTGGATGATGCAGCCAGGATGACTTTCACTTCGACA
ATAAATAATCTCGACCTGCATCAAACAATAATATCTAGGCATGCAAAGGCTCGAGGGCAGAGTTCGCAGCCGAAGAGAGTACGAGTTCATGATCGTCGTTTAAGGGAGTT
CGACCATTACACCCCGCTCAACGTTCCTTTATCAGAGATCCTGGCCAACATCGAGAATAGCGATTTGCATATGATGTTGGAGAAGCTGGGTAAAATGAAGTCGAGTCCGG
ATAAGTGGAGTAAGAGCAAATACTACAGGTTTCATAGAGACTATGGACATGATACTTTGCAGTGCTTTGACTTGAGAGACCAGGTAGAAAACCTGATTGGGCGAGGTCAT
CTGAAAAAGTATGTTGGGAAGAAAGATTCCTGTTCTTCTCAGGGAAAGAGAAAATTCTACCGAACAGATCTTGATGATCAAGACAAATTTCCCTCACCCAGAAAAAGAGA
AGCGGCTGGGAAAAGGCCCATGTCGTCACACCCACCCCCTTTCGAACCTTCTTCTCCGGCGAAGGAGCACCAGCACGGCAACCTGATTACGCTTGTTTACATGCGTAAAA
AGAAGGACCATTTGGGAGTGCAAATTAATCAAAAGAAGCAAAAAGTCGAAAAAAGCATCATGGGAGGCGCCAGGCGCCTGGGAAGCCTGCAGAAAAACTGGTTTTCTTCC
AACTTTGCCCTTAATGAAACGCATCTTCCAATGCGTTTTGGTGGTTCCAACCGATGCATACGTGTAGAAGAAGTGTTCCACTATCAGTTTGAGCACGATTTGGATTTAGA
GAAGGCCACTGTAAGTCTTAACAAGTCAATACTGAAAAGGTTGCTTATTTGCATTGTTGAGTGGTTAAACCGAGGAGAAGAGCGCCTTAGACATAAGAGTTCCGCTTAA
Protein sequenceShow/hide protein sequence
MDPREGLRVHITAEAPGLCQTTSECSAPQQAGQQTPTTPNWGSTPTIGRVNSNTIQWPPAAATATTSTYQATSRVNERILILGNHNDISTPSKMKKPDMKPYDGSTDPSD
HIDLYEGLMELNATGDEMKCRAFFITLKSQARSSYRQLKLKTVGSWKQFRKIFISQFSVQHDRKTPDTHLLTNYQLEDESLRDFVQRFMAEKIKVTDCLDDAARMTFTST
INNLDLHQTIISRHAKARGQSSQPKRVRVHDRRLREFDHYTPLNVPLSEILANIENSDLHMMLEKLGKMKSSPDKWSKSKYYRFHRDYGHDTLQCFDLRDQVENLIGRGH
LKKYVGKKDSCSSQGKRKFYRTDLDDQDKFPSPRKREAAGKRPMSSHPPPFEPSSPAKEHQHGNLITLVYMRKKKDHLGVQINQKKQKVEKSIMGGARRLGSLQKNWFSS
NFALNETHLPMRFGGSNRCIRVEEVFHYQFEHDLDLEKATVSLNKSILKRLLICIVEWLNRGEERLRHKSSA