; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0022095 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0022095
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRibonuclease H
Genome locationchr7:18173271..18176326
RNA-Seq ExpressionLag0022095
SyntenyLag0022095
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_015386823.1 uncharacterized protein LOC107177480 [Citrus sinensis]9.3e-2433.68Show/hide
Query:  LENLIGQVDPPFVDEIMQAEVPHKFKLPTFSQYDGKKDPIQHLDTYQSWMEFHGASEVL----------------------------------------G
        L+++  + +P F  +IM A+ P +F LP    YDG++DP +HL+ Y++ ME  GAS+ +                                         
Subjt:  LENLIGQVDPPFVDEIMQAEVPHKFKLPTFSQYDGKKDPIQHLDTYQSWMEFHGASEVL----------------------------------------G

Query:  ARDRRKPQFNLLTIKQWTGESLNGYITRFSNEVVQVEGYDDGVALTAVISGLQDKKLLNSVGEDQPRTYAKFVARAQKYINAEELMKSKRAER
        A+ RRKP   LLT+KQ   E+L  YI +++NE++QV+GYDDG++L  ++ GL+  KL  SV +  P +Y++ +AR +KY NAEE  K++  E+
Subjt:  ARDRRKPQFNLLTIKQWTGESLNGYITRFSNEVVQVEGYDDGVALTAVISGLQDKKLLNSVGEDQPRTYAKFVARAQKYINAEELMKSKRAER

XP_022158830.1 uncharacterized protein LOC111025293 [Momordica charantia]9.9e-2627.15Show/hide
Query:  EKGKGVLDEEKGETNSATSKLRKLKGGKEFDLKEPGSSKWVVRKGALNVPDATSTVGSSRRVEVEAEAKSLEKIKLETKIRAELGVKLRVEAEAATKAKA
        +K   VL++++ +        +K KG  E D +E                 +T++VGS  R+      +        T+I      K + ++ A    K+
Subjt:  EKGKGVLDEEKGETNSATSKLRKLKGGKEFDLKEPGSSKWVVRKGALNVPDATSTVGSSRRVEVEAEAKSLEKIKLETKIRAELGVKLRVEAEAATKAKA

Query:  EAEAEAKARIDIQK--------NTQPRDVDKE---NLENLIGQVDPPFVDEIMQAEVPHKFKLPTFSQYDGKKDPIQHLDTYQSWMEFHGASEV------
        + +      I + K        +++ R   KE   +LE L+ Q D PF +EIM+ +VP KFKLPT  Q+D   DP+ HLD Y+ WM+ +G SE       
Subjt:  EAEAEAKARIDIQK--------NTQPRDVDKE---NLENLIGQVDPPFVDEIMQAEVPHKFKLPTFSQYDGKKDPIQHLDTYQSWMEFHGASEV------

Query:  ----------------------------------LGARDRRKPQFNLLTIKQWTGESLNGYITRFSNEVVQVEGYDDGVALTAVISGLQDKKLLNSVGED
                                          +G R R +P   LLTIKQ T ESL  Y+ RF+ E +QVEG  D V+L A +SG++D+ L  S G+ 
Subjt:  ----------------------------------LGARDRRKPQFNLLTIKQWTGESLNGYITRFSNEVVQVEGYDDGVALTAVISGLQDKKLLNSVGED

Query:  QPRTYAKFVARAQKYINAEELMKS------KRAEREAQRVTTIDRGRRREERG--------KRF------------------------------GEMSRQ
         P T+++ ++RAQ+Y++A E   S      KR + + +R     +G R E+R         ++F                                 +++
Subjt:  QPRTYAKFVARAQKYINAEELMKS------KRAEREAQRVTTIDRGRRREERG--------KRF------------------------------GEMSRQ

Query:  AENRSWALHRDHGHTMRYCIQLRDEIESLIK
        ++ R    HRDHGH  + C  L++E+E LI+
Subjt:  AENRSWALHRDHGHTMRYCIQLRDEIESLIK

XP_022159109.1 uncharacterized protein LOC111025548 [Momordica charantia]9.9e-2634.18Show/hide
Query:  IDIQKNTQPRDVDKE---NLENLIGQVDPPFVDEIMQAEVPHKFKLPTFSQYDGKKDPIQHLDTYQSWMEFHGASEVL----------------------
        ID  ++++ R   KE   +LE L+GQ D PF +EIM+ +VP KFKLPT   +DG  +P+ HLD Y+ WM+ +G S+ +                      
Subjt:  IDIQKNTQPRDVDKE---NLENLIGQVDPPFVDEIMQAEVPHKFKLPTFSQYDGKKDPIQHLDTYQSWMEFHGASEVL----------------------

Query:  ------------------GARDRRKPQFNLLTIKQWTGESLNGYITRFSNEVVQVEGYDDGVALTAVISGLQDKKLLNSVGEDQPRTYAKFVARAQKYIN
                          G R R +P   LLTIKQ T ESL+ Y+ RF+ E +Q+EG  D V+L A +SG++D+ L  S  +  P T+++ ++RAQ+Y++
Subjt:  ------------------GARDRRKPQFNLLTIKQWTGESLNGYITRFSNEVVQVEGYDDGVALTAVISGLQDKKLLNSVGEDQPRTYAKFVARAQKYIN

Query:  AEELMKS------KRAEREAQRVTTIDRGRRREERGK
        A E   S      KR +++ +R     +G R E+R +
Subjt:  AEELMKS------KRAEREAQRVTTIDRGRRREERGK

XP_024041095.1 uncharacterized protein LOC112098853 [Citrus clementina]3.6e-2830.06Show/hide
Query:  LENLIGQVDPPFVDEIMQAEVPHKFKLPTFSQYDGKKDPIQHLDTYQSWMEFHGASEVL----------------------------------------G
        L+++  + +PPF  +IM A+ P +F LP    YDG++DP +HL+ Y++ ME  GAS+ +                                         
Subjt:  LENLIGQVDPPFVDEIMQAEVPHKFKLPTFSQYDGKKDPIQHLDTYQSWMEFHGASEVL----------------------------------------G

Query:  ARDRRKPQFNLLTIKQWTGESLNGYITRFSNEVVQVEGYDDGVALTAVISGLQDKKLLNSVGEDQPRTYAKFVARAQKYINAEELMKS------------
        AR R KP   LLT+KQ  GE+L  YI R++NE+ QV+GYDDG+AL+ ++ GL+  KL  SV +  P +Y++ +ARA+KY NAEE  K+            
Subjt:  ARDRRKPQFNLLTIKQWTGESLNGYITRFSNEVVQVEGYDDGVALTAVISGLQDKKLLNSVGEDQPRTYAKFVARAQKYINAEELMKS------------

Query:  -KRAEREAQRVTTIDR---------GRRREERGKR---------FGEM---------------------------SRQAENRSWALHRDHGHTMRYCIQL
         K+ ++  +RV   D+         G R E R  R         F E+                           SR+  N+    H+DHGH    C +L
Subjt:  -KRAEREAQRVTTIDR---------GRRREERGKR---------FGEM---------------------------SRQAENRSWALHRDHGHTMRYCIQL

Query:  RDEIESLIK----GEKIRRPSKPMSS
        +++IESL++     E +R P   + S
Subjt:  RDEIESLIK----GEKIRRPSKPMSS

XP_024047974.1 uncharacterized protein LOC112101548 [Citrus clementina]3.8e-2531.1Show/hide
Query:  ENLIGQVDPPFVDEIMQAEVPHKFKLPTFSQYDGKKDPIQHLDTYQSWMEFHGASEVL----------------------------------------GA
        ++++ + +PPF  EIM+A  P  F+LP+   YDG+K P++H++ Y+S ME  G S  +                                         A
Subjt:  ENLIGQVDPPFVDEIMQAEVPHKFKLPTFSQYDGKKDPIQHLDTYQSWMEFHGASEVL----------------------------------------GA

Query:  RDRRKPQFNLLTIKQWTGESLNGYITRFSNEVVQVEGYDDGVALTAVISGLQDKKLLNSVGEDQPRTYAKFVARAQKYINAEELMKSKRA-EREAQRVTT
        R R KP   LLT+KQ  GESL  YI R++ E  QV+GYDDGVAL+ ++ GLQ  +L  SV ++ P TY++ ++RA+KY NAEE  +SK+  E+       
Subjt:  RDRRKPQFNLLTIKQWTGESLNGYITRFSNEVVQVEGYDDGVALTAVISGLQDKKLLNSVGEDQPRTYAKFVARAQKYINAEELMKSKRA-EREAQRVTT

Query:  IDRGRRREERGKRFGEMSRQAENRSWALHRDHGHTMRYCIQLRDEIESLIKGEKIRRPSKPMSSMPMPTKLTMQHGRPESEAF
          +  RR+ R  R  + SR+ + R W          R  +Q  +  E     E I    K  +    P  L     R   + +
Subjt:  IDRGRRREERGKRFGEMSRQAENRSWALHRDHGHTMRYCIQLRDEIESLIKGEKIRRPSKPMSSMPMPTKLTMQHGRPESEAF

TrEMBL top hitse value%identityAlignment
A0A2N9F3Z1 Ribonuclease H4.1e-2527.96Show/hide
Query:  NLENLIGQVDPPFVDEIMQAEVPHKFKLPTFSQYDGKKDPIQHLDTYQSWMEFHGASEVL---------GARDRRKPQFNLLTIKQWTGESLNGYITRFS
        NL+NL+ + D PF+  I    +P +FK+P    +DG KDP  +L+ +++ M+     E L         G++ R +P  +LL++KQ  GESL  ++ RF+
Subjt:  NLENLIGQVDPPFVDEIMQAEVPHKFKLPTFSQYDGKKDPIQHLDTYQSWMEFHGASEVL---------GARDRRKPQFNLLTIKQWTGESLNGYITRFS

Query:  NEVVQVEGYDDGVALTAVISGLQDKKLLNSVGEDQPRTYAKFVARAQKYINAEELMKS--------------KRAEREAQRVTTIDRGRRREER------
         E ++++   + V +T  ++GL+    L  + +D P T ++ +  A K++NAE+ +++              ++ E   Q+ + I R  R  +       
Subjt:  NEVVQVEGYDDGVALTAVISGLQDKKLLNSVGEDQPRTYAKFVARAQKYINAEELMKS--------------KRAEREAQRVTTIDRGRRREER------

Query:  -GKRFGEMSRQAENRSWALHRDHGHTMRYCIQLRDEIESLIKGEK----IRRPSKPMSSMPMPTKLTMQHGRPESEAFRVHVVSTTRRLVRRSKGIRIGG
         GK   + + + +N     HRDHGH    CI L++++E+LI+  K    + RP+    + P   K   +H RP         V   R +V    G   GG
Subjt:  -GKRFGEMSRQAENRSWALHRDHGHTMRYCIQLRDEIESLIKGEK----IRRPSKPMSSMPMPTKLTMQHGRPESEAFRVHVVSTTRRLVRRSKGIRIGG

Query:  ES-SEILLYECAIHSWMALVKSLLRPLDA
         S +    Y   +   M L K  LRP+DA
Subjt:  ES-SEILLYECAIHSWMALVKSLLRPLDA

A0A2N9FY65 Uncharacterized protein7.7e-2431.91Show/hide
Query:  ENLENLIGQVDPPFVDEIMQAEVPHKFKLPTFSQYDGKKDPIQHLDTYQSWMEFHGASEVLGAR-------DRRKPQFNLLTIKQWTGESLNGYITRFSN
        +NL++L+   D PF   +    +P KF++PT   +DG KDP+ HL+++++ M   G    +  R          +  F  LT     GE+L  Y+TRF+ 
Subjt:  ENLENLIGQVDPPFVDEIMQAEVPHKFKLPTFSQYDGKKDPIQHLDTYQSWMEFHGASEVLGAR-------DRRKPQFNLLTIKQWTGESLNGYITRFSN

Query:  EVVQVEGYDDGVALTAVISGLQDKKLLNSVGEDQPRTYAKFVARAQKYINAEELMKSKRAEREAQR---------VTTIDRGRRREERGKRFGEMSRQAE
        E + V+G DD V LTA ISGLQ    L SV +D P T  + +  AQ+++N EE ++++      +R            I      +  GK      ++  
Subjt:  EVVQVEGYDDGVALTAVISGLQDKKLLNSVGEDQPRTYAKFVARAQKYINAEELMKSKRAEREAQR---------VTTIDRGRRREERGKRFGEMSRQAE

Query:  NRSWALHRDHGHTMRYCIQLRDEIESLIKGEKIRR
        ++    HRDHGH    C  L+ +IE LIK  K++R
Subjt:  NRSWALHRDHGHTMRYCIQLRDEIESLIKGEKIRR

A0A2N9J5E4 Ribonuclease H4.5e-2428.36Show/hide
Query:  NLENLIGQVDPPFVDEIMQAEVPHKFKLPTFSQYDGKKDPIQHLDTYQSWMEFHGASE-------VLGARD-------RRKPQFNLLTIKQWTGESLNGY
        NL+NL+ Q D PF+  I    +P +FK+P    +DG KDP  +L+ +++ M+     E        LG R        R +P  +LL IKQ  GESL  +
Subjt:  NLENLIGQVDPPFVDEIMQAEVPHKFKLPTFSQYDGKKDPIQHLDTYQSWMEFHGASE-------VLGARD-------RRKPQFNLLTIKQWTGESLNGY

Query:  ITRFSNEVVQVEGYDDGVALTAVISGLQDKKLLNSVGEDQPRTYAKFVARAQKYINAEELMKSKRAEREAQRVTTIDRGRRREER-----------GKRF
        + RF+ E ++++   + V +TA ++GL+    L  + +D P T ++ +  A K++NAE+ +++       +R  T DR  +  ++           GK  
Subjt:  ITRFSNEVVQVEGYDDGVALTAVISGLQDKKLLNSVGEDQPRTYAKFVARAQKYINAEELMKSKRAEREAQRVTTIDRGRRREER-----------GKRF

Query:  GEMSRQAENRSWALHRDHGHTMRYCIQLRDEIESLIKGEKIR----RPSKPMSSMPMPTKLTMQHGRP
         + + + +N  +  HRDH H    C+ L++++E+LI+  K++    RP+    + P   +   +H RP
Subjt:  GEMSRQAENRSWALHRDHGHTMRYCIQLRDEIESLIKGEKIR----RPSKPMSSMPMPTKLTMQHGRP

A0A6J1DWY0 uncharacterized protein LOC1110252934.8e-2627.15Show/hide
Query:  EKGKGVLDEEKGETNSATSKLRKLKGGKEFDLKEPGSSKWVVRKGALNVPDATSTVGSSRRVEVEAEAKSLEKIKLETKIRAELGVKLRVEAEAATKAKA
        +K   VL++++ +        +K KG  E D +E                 +T++VGS  R+      +        T+I      K + ++ A    K+
Subjt:  EKGKGVLDEEKGETNSATSKLRKLKGGKEFDLKEPGSSKWVVRKGALNVPDATSTVGSSRRVEVEAEAKSLEKIKLETKIRAELGVKLRVEAEAATKAKA

Query:  EAEAEAKARIDIQK--------NTQPRDVDKE---NLENLIGQVDPPFVDEIMQAEVPHKFKLPTFSQYDGKKDPIQHLDTYQSWMEFHGASEV------
        + +      I + K        +++ R   KE   +LE L+ Q D PF +EIM+ +VP KFKLPT  Q+D   DP+ HLD Y+ WM+ +G SE       
Subjt:  EAEAEAKARIDIQK--------NTQPRDVDKE---NLENLIGQVDPPFVDEIMQAEVPHKFKLPTFSQYDGKKDPIQHLDTYQSWMEFHGASEV------

Query:  ----------------------------------LGARDRRKPQFNLLTIKQWTGESLNGYITRFSNEVVQVEGYDDGVALTAVISGLQDKKLLNSVGED
                                          +G R R +P   LLTIKQ T ESL  Y+ RF+ E +QVEG  D V+L A +SG++D+ L  S G+ 
Subjt:  ----------------------------------LGARDRRKPQFNLLTIKQWTGESLNGYITRFSNEVVQVEGYDDGVALTAVISGLQDKKLLNSVGED

Query:  QPRTYAKFVARAQKYINAEELMKS------KRAEREAQRVTTIDRGRRREERG--------KRF------------------------------GEMSRQ
         P T+++ ++RAQ+Y++A E   S      KR + + +R     +G R E+R         ++F                                 +++
Subjt:  QPRTYAKFVARAQKYINAEELMKS------KRAEREAQRVTTIDRGRRREERG--------KRF------------------------------GEMSRQ

Query:  AENRSWALHRDHGHTMRYCIQLRDEIESLIK
        ++ R    HRDHGH  + C  L++E+E LI+
Subjt:  AENRSWALHRDHGHTMRYCIQLRDEIESLIK

A0A6J1E1E7 uncharacterized protein LOC1110255484.8e-2634.18Show/hide
Query:  IDIQKNTQPRDVDKE---NLENLIGQVDPPFVDEIMQAEVPHKFKLPTFSQYDGKKDPIQHLDTYQSWMEFHGASEVL----------------------
        ID  ++++ R   KE   +LE L+GQ D PF +EIM+ +VP KFKLPT   +DG  +P+ HLD Y+ WM+ +G S+ +                      
Subjt:  IDIQKNTQPRDVDKE---NLENLIGQVDPPFVDEIMQAEVPHKFKLPTFSQYDGKKDPIQHLDTYQSWMEFHGASEVL----------------------

Query:  ------------------GARDRRKPQFNLLTIKQWTGESLNGYITRFSNEVVQVEGYDDGVALTAVISGLQDKKLLNSVGEDQPRTYAKFVARAQKYIN
                          G R R +P   LLTIKQ T ESL+ Y+ RF+ E +Q+EG  D V+L A +SG++D+ L  S  +  P T+++ ++RAQ+Y++
Subjt:  ------------------GARDRRKPQFNLLTIKQWTGESLNGYITRFSNEVVQVEGYDDGVALTAVISGLQDKKLLNSVGEDQPRTYAKFVARAQKYIN

Query:  AEELMKS------KRAEREAQRVTTIDRGRRREERGK
        A E   S      KR +++ +R     +G R E+R +
Subjt:  AEELMKS------KRAEREAQRVTTIDRGRRREERGK

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTTACAAATTACTCAAAGACTAACCAAATTGTACCAGGAAACATAGCTCGCTATGCGCGCTATCAGTCGACCTCGGCATTGCCAGGCAGTTGGGCCCAATGGCCACC
TGGGTCGTTTGGGGAAGCTCTCTTGGGCCCAGTGAAGGCAGAGTCACGTCTTCCCCGGATTCAAACAAATTTACTGTTGGTGTCACGTGAAGGTCAGGATCGAGAGAAAG
GAAAAGGGGTCTTAGATGAAGAGAAGGGGGAAACAAACAGTGCTACCAGCAAGCTGCGAAAACTAAAGGGTGGCAAGGAATTTGACCTGAAGGAGCCAGGGTCGAGCAAA
TGGGTAGTGCGCAAAGGCGCACTTAACGTTCCAGATGCAACCAGTACAGTGGGTTCGAGCCGGAGGGTTGAGGTCGAGGCCGAAGCCAAGAGTCTGGAGAAGATTAAACT
TGAAACGAAGATCCGGGCCGAGCTTGGGGTTAAGTTGAGGGTCGAAGCTGAAGCCGCGACCAAAGCTAAGGCCGAGGCCGAGGCCGAGGCCAAAGCTAGGATCGATATTC
AGAAAAATACACAGCCCAGGGATGTAGATAAGGAAAACTTAGAAAATCTAATAGGTCAGGTTGATCCACCGTTCGTCGATGAAATCATGCAGGCAGAAGTTCCACATAAG
TTTAAGTTACCGACCTTCTCACAGTATGATGGGAAGAAGGACCCGATTCAACATTTGGATACCTACCAGTCTTGGATGGAGTTTCATGGTGCTTCCGAGGTCTTGGGGGC
AAGGGATCGAAGAAAGCCGCAATTTAATTTGTTGACTATTAAGCAGTGGACAGGGGAGAGCCTGAATGGGTACATCACACGTTTCAGCAATGAGGTTGTGCAAGTAGAAG
GGTACGATGACGGAGTAGCTCTAACGGCTGTTATTTCGGGGTTACAGGACAAGAAGTTGCTTAATTCCGTGGGAGAGGATCAACCACGAACATACGCCAAGTTCGTTGCT
AGGGCGCAAAAGTATATAAACGCAGAGGAGTTAATGAAGTCCAAGCGTGCAGAGAGGGAAGCGCAAAGGGTGACCACTATTGATAGAGGCAGGAGAAGAGAAGAAAGAGG
CAAGAGATTTGGTGAAATGTCCAGACAGGCTGAGAATAGATCCTGGGCGTTACATCGGGACCATGGACATACAATGCGATATTGCATACAGCTCCGAGACGAGATAGAGA
GTCTGATCAAGGGTGAGAAGATCAGGAGGCCGAGCAAGCCGATGTCGAGCATGCCGATGCCGACCAAGCTGACCATGCAGCATGGGAGGCCGGAATCCGAGGCCTTCAGA
GTCCATGTGGTCTCTACCACCAGAAGACTCGTCAGACGAAGCAAAGGCATCAGGATTGGTGGCGAGAGTAGCGAGATCCTCTTGTATGAATGTGCCATCCACAGTTGGAT
GGCGCTTGTTAAATCTCTTCTACGCCCACTTGATGCCGTCATTCCATATCTCGTCTTGCATGACGTAAAACTCGCTAGTTTTCTTGAACTCATCAGCCAGGTAGTCAGCA
CTGGCCAAGTGAGCCTTGGCCTCCATAAGTTGGGCTATGGTGGACTCCAACTCAACATCCCTAGATTTGAGTTTTGTTCGTACTTCCTCCAACAGACGATTTGCTTCATT
AAGCCTCTCCTGGGCCTAAATAAGCTCGCTCCATGGGACGCTATTCAATTGAAGCTCGTTAATTTTGTTCGCCAGGCTGATTGA
mRNA sequenceShow/hide mRNA sequence
ATGGTTACAAATTACTCAAAGACTAACCAAATTGTACCAGGAAACATAGCTCGCTATGCGCGCTATCAGTCGACCTCGGCATTGCCAGGCAGTTGGGCCCAATGGCCACC
TGGGTCGTTTGGGGAAGCTCTCTTGGGCCCAGTGAAGGCAGAGTCACGTCTTCCCCGGATTCAAACAAATTTACTGTTGGTGTCACGTGAAGGTCAGGATCGAGAGAAAG
GAAAAGGGGTCTTAGATGAAGAGAAGGGGGAAACAAACAGTGCTACCAGCAAGCTGCGAAAACTAAAGGGTGGCAAGGAATTTGACCTGAAGGAGCCAGGGTCGAGCAAA
TGGGTAGTGCGCAAAGGCGCACTTAACGTTCCAGATGCAACCAGTACAGTGGGTTCGAGCCGGAGGGTTGAGGTCGAGGCCGAAGCCAAGAGTCTGGAGAAGATTAAACT
TGAAACGAAGATCCGGGCCGAGCTTGGGGTTAAGTTGAGGGTCGAAGCTGAAGCCGCGACCAAAGCTAAGGCCGAGGCCGAGGCCGAGGCCAAAGCTAGGATCGATATTC
AGAAAAATACACAGCCCAGGGATGTAGATAAGGAAAACTTAGAAAATCTAATAGGTCAGGTTGATCCACCGTTCGTCGATGAAATCATGCAGGCAGAAGTTCCACATAAG
TTTAAGTTACCGACCTTCTCACAGTATGATGGGAAGAAGGACCCGATTCAACATTTGGATACCTACCAGTCTTGGATGGAGTTTCATGGTGCTTCCGAGGTCTTGGGGGC
AAGGGATCGAAGAAAGCCGCAATTTAATTTGTTGACTATTAAGCAGTGGACAGGGGAGAGCCTGAATGGGTACATCACACGTTTCAGCAATGAGGTTGTGCAAGTAGAAG
GGTACGATGACGGAGTAGCTCTAACGGCTGTTATTTCGGGGTTACAGGACAAGAAGTTGCTTAATTCCGTGGGAGAGGATCAACCACGAACATACGCCAAGTTCGTTGCT
AGGGCGCAAAAGTATATAAACGCAGAGGAGTTAATGAAGTCCAAGCGTGCAGAGAGGGAAGCGCAAAGGGTGACCACTATTGATAGAGGCAGGAGAAGAGAAGAAAGAGG
CAAGAGATTTGGTGAAATGTCCAGACAGGCTGAGAATAGATCCTGGGCGTTACATCGGGACCATGGACATACAATGCGATATTGCATACAGCTCCGAGACGAGATAGAGA
GTCTGATCAAGGGTGAGAAGATCAGGAGGCCGAGCAAGCCGATGTCGAGCATGCCGATGCCGACCAAGCTGACCATGCAGCATGGGAGGCCGGAATCCGAGGCCTTCAGA
GTCCATGTGGTCTCTACCACCAGAAGACTCGTCAGACGAAGCAAAGGCATCAGGATTGGTGGCGAGAGTAGCGAGATCCTCTTGTATGAATGTGCCATCCACAGTTGGAT
GGCGCTTGTTAAATCTCTTCTACGCCCACTTGATGCCGTCATTCCATATCTCGTCTTGCATGACGTAAAACTCGCTAGTTTTCTTGAACTCATCAGCCAGGTAGTCAGCA
CTGGCCAAGTGAGCCTTGGCCTCCATAAGTTGGGCTATGGTGGACTCCAACTCAACATCCCTAGATTTGAGTTTTGTTCGTACTTCCTCCAACAGACGATTTGCTTCATT
AAGCCTCTCCTGGGCCTAAATAAGCTCGCTCCATGGGACGCTATTCAATTGAAGCTCGTTAATTTTGTTCGCCAGGCTGATTGA
Protein sequenceShow/hide protein sequence
MVTNYSKTNQIVPGNIARYARYQSTSALPGSWAQWPPGSFGEALLGPVKAESRLPRIQTNLLLVSREGQDREKGKGVLDEEKGETNSATSKLRKLKGGKEFDLKEPGSSK
WVVRKGALNVPDATSTVGSSRRVEVEAEAKSLEKIKLETKIRAELGVKLRVEAEAATKAKAEAEAEAKARIDIQKNTQPRDVDKENLENLIGQVDPPFVDEIMQAEVPHK
FKLPTFSQYDGKKDPIQHLDTYQSWMEFHGASEVLGARDRRKPQFNLLTIKQWTGESLNGYITRFSNEVVQVEGYDDGVALTAVISGLQDKKLLNSVGEDQPRTYAKFVA
RAQKYINAEELMKSKRAEREAQRVTTIDRGRRREERGKRFGEMSRQAENRSWALHRDHGHTMRYCIQLRDEIESLIKGEKIRRPSKPMSSMPMPTKLTMQHGRPESEAFR
VHVVSTTRRLVRRSKGIRIGGESSEILLYECAIHSWMALVKSLLRPLDAVIPYLVLHDVKLASFLELISQVVSTGQVSLGLHKLGYGGLQLNIPRFEFCSYFLQQTICFI
KPLLGLNKLAPWDAIQLKLVNFVRQAD