; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0038540 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0038540
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRibonuclease H
Genome locationchr2:19957322..19958800
RNA-Seq ExpressionLag0038540
SyntenyLag0038540
Gene Ontology termsNA
InterPro domainsIPR005162 - Retrotransposon gag domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022147761.1 uncharacterized protein LOC111016619 [Momordica charantia]1.3e-5048.36Show/hide
Query:  MKAEVSQKFKVPTFKQYNGKKDPVQHLNAYRSWMDFHGVSNAIRCRTFFFTLTGSARHWFERLKKRSISCFKELAQSFLAQFMGAREQRKPHINLLTVKQ
        MK +V  KFK+P  K+Y+G  DP+ HL+ Y  W D +G++ AIRCR F FTLTGS R WF++LK++SIS FKELA++F+ QF G   + +P   LLT+KQ
Subjt:  MKAEVSQKFKVPTFKQYNGKKDPVQHLNAYRSWMDFHGVSNAIRCRTFFFTLTGSARHWFERLKKRSISCFKELAQSFLAQFMGAREQRKPHINLLTVKQ

Query:  QPGEILRDYITRFNDEALQVEGDSDGAALVAITAGLENERLLNSIGKSQPRTHAEFVSRAQKYMSVEELLKSKRSEWEYKRASSSDHDSKKDKRQRTEER
        +  E L+DY+ RFN E LQVEG +D   L+   +G+++E+L+ S GK    T +E  SRAQ YMSV EL+ SKR      +   +D+++   KR+R  E+
Subjt:  QPGEILRDYITRFNDEALQVEGDSDGAALVAITAGLENERLLNSIGKSQPRTHAEFVSRAQKYMSVEELLKSKRSEWEYKRASSSDHDSKKDKRQRTEER

Query:  GRG----RPDHGRGR---AHPFGKFEKYTPTVVPQEQVLMEIRN
          G    R D G+GR     P  KFEKYTPT VP EQVLMEI++
Subjt:  GRG----RPDHGRGR---AHPFGKFEKYTPTVVPQEQVLMEIRN

XP_022150035.1 uncharacterized protein LOC111018307 [Momordica charantia]4.9e-4247.8Show/hide
Query:  EEVMKAEVSQKFKVPTFKQYNGKKDPVQHLNAYRSWMDFHGVSNAIRCRTFFFTLTGSARHWFERLKKRSISCFKELAQSFLAQFMGAREQRKPHINLLT
        EE+MK +V  KFK+PT KQ++G  D V HL+AYR WMD +GVS A++CR F  TL+GSAR WF +LK+ SIS FK LA++F+ QF+G R + +P   LLT
Subjt:  EEVMKAEVSQKFKVPTFKQYNGKKDPVQHLNAYRSWMDFHGVSNAIRCRTFFFTLTGSARHWFERLKKRSISCFKELAQSFLAQFMGAREQRKPHINLLT

Query:  VKQQPGEILRDYITRFNDEALQVEGDSDGAALVAITAGLENERLLNSIGKSQPRTHAEFVSRAQKYMSVEELLKSKRSEWEYKRASSSDHDSKKDKRQRT
        +KQ+  E L DY+ RFN+E LQVEG ++  +L+A  + + +E L  S GK  P T +E +SRAQKYMS  E   SKR     +   + +    K +  R 
Subjt:  VKQQPGEILRDYITRFNDEALQVEGDSDGAALVAITAGLENERLLNSIGKSQPRTHAEFVSRAQKYMSVEELLKSKRSEWEYKRASSSDHDSKKDKRQRT

Query:  EERGR
        E+R R
Subjt:  EERGR

XP_022158344.1 uncharacterized protein LOC111024851 [Momordica charantia]1.6e-4547.39Show/hide
Query:  MKAEVSQKFKVPTFKQYNGKKDPVQHLNAYRSWMDFHGVSNAIRCRTFFFTLTGSARHWFERLKKRSISCFKELAQSFLAQFMGAREQRKPHINLLTVKQ
        MK +   KFK+P   +Y+G  DP+ HL+AYR W D + +  AIRCR F FTLTGSAR+WF +LK+ SIS FKELA++F+ QF+G R + KP   LLT+KQ
Subjt:  MKAEVSQKFKVPTFKQYNGKKDPVQHLNAYRSWMDFHGVSNAIRCRTFFFTLTGSARHWFERLKKRSISCFKELAQSFLAQFMGAREQRKPHINLLTVKQ

Query:  QPGEILRDYITRFNDEALQVEGDSDGAALVAITAGLENERLLNSIGKSQPRTHAEFVSRAQKYMSVEELLKSKRSEWEYKRAS-SSDHDSKKDKRQRTEE
        +  E L++Y+ RFN+E LQVEG +D  AL+A  +G+++ERL+ S GK  P T  E +SRAQKYMS  EL+   R + E +RA+ S+  + + +KR R+  
Subjt:  QPGEILRDYITRFNDEALQVEGDSDGAALVAITAGLENERLLNSIGKSQPRTHAEFVSRAQKYMSVEELLKSKRSEWEYKRAS-SSDHDSKKDKRQRTEE

Query:  RGRGRPDHGRGR---AHPFGKFEKYTPTVV
            R D G GR     P  KFEK   +++
Subjt:  RGRGRPDHGRGR---AHPFGKFEKYTPTVV

XP_022158830.1 uncharacterized protein LOC111025293 [Momordica charantia]3.4e-5941.25Show/hide
Query:  LVRDPRKGKEPMEHTAESETRSKGKKTDSMTSKVR-GLKPTDRTILRSPESSTLKGRDYTVSTPSYGHTKTDLRNLIVEKRISAKTAESEAKATEAEAWA
        LVRDP+KGK P E   E  T S G       SK+R G     RT +  P  +    + +    P+ G +  D RN             SE  + +     
Subjt:  LVRDPRKGKEPMEHTAESETRSKGKKTDSMTSKVR-GLKPTDRTILRSPESSTLKGRDYTVSTPSYGHTKTDLRNLIVEKRISAKTAESEAKATEAEAWA

Query:  AEAEAKKDNLPWKTELLNTLKELGNPQGDQQKSKDLGDQNLEELADQVDPSFTEEVMKAEVSQKFKVPTFKQYNGKKDPVQHLNAYRSWMDFHGVSNAIR
           + K  + P  +E  ++ KE G               +LEEL DQ D  FTEE+M+ +V  KFK+PT KQ++   DPV HL+AYR WMD +GVS A+R
Subjt:  AEAEAKKDNLPWKTELLNTLKELGNPQGDQQKSKDLGDQNLEELADQVDPSFTEEVMKAEVSQKFKVPTFKQYNGKKDPVQHLNAYRSWMDFHGVSNAIR

Query:  CRTFFFTLTGSARHWFERLKKRSISCFKELAQSFLAQFMGAREQRKPHINLLTVKQQPGEILRDYITRFNDEALQVEGDSDGAALVAITAGLENERLLNS
        CR F  TL GSAR WF +LK+ SIS FK LA++F+ QF+G R + +P   LLT+KQ+  E LRDY+ RFN+E LQVEG +D  +L+A  +G+ +E L  S
Subjt:  CRTFFFTLTGSARHWFERLKKRSISCFKELAQSFLAQFMGAREQRKPHINLLTVKQQPGEILRDYITRFNDEALQVEGDSDGAALVAITAGLENERLLNS

Query:  IGKSQPRTHAEFVSRAQKYMSVEELLKSKRSEWEYKRASSSDHDSKKDKRQRTEERGRGRPDHGRGRAHPFGKFEKYTPTVVPQEQVLMEIRNTGLLKLP
         GK  P T +E +SRAQ+YMS  E   SKR     +     +    K +  R E+R R        +  P  KFEKYTPT VP EQVLMEI++  LLK P
Subjt:  IGKSQPRTHAEFVSRAQKYMSVEELLKSKRSEWEYKRASSSDHDSKKDKRQRTEERGRGRPDHGRGRAHPFGKFEKYTPTVVPQEQVLMEIRNTGLLKLP

XP_022159109.1 uncharacterized protein LOC111025548 [Momordica charantia]8.7e-4745.71Show/hide
Query:  LNTLKELGNPQGDQQK--SKDLGDQNLEELADQVDPSFTEEVMKAEVSQKFKVPTFKQYNGKKDPVQHLNAYRSWMDFHGVSNAIRCRTFFFTLTGSARH
        LN  K +  P+  +++   K+ G  +LEEL  Q D  FTEE+M+ +V  KFK+PT K ++G  +PV HL+AYR WMD +GVS+AIRCR F  TL GSAR 
Subjt:  LNTLKELGNPQGDQQK--SKDLGDQNLEELADQVDPSFTEEVMKAEVSQKFKVPTFKQYNGKKDPVQHLNAYRSWMDFHGVSNAIRCRTFFFTLTGSARH

Query:  WFERLKKRSISCFKELAQSFLAQFMGAREQRKPHINLLTVKQQPGEILRDYITRFNDEALQVEGDSDGAALVAITAGLENERLLNSIGKSQPRTHAEFVS
        WF +LK+ SIS FK LA++F+ QF+G R + +P   LLT+KQ+  E L DY+ RFN+E LQ+EG +D  +L+A  +G+ +E L  S  K  P T +E +S
Subjt:  WFERLKKRSISCFKELAQSFLAQFMGAREQRKPHINLLTVKQQPGEILRDYITRFNDEALQVEGDSDGAALVAITAGLENERLLNSIGKSQPRTHAEFVS

Query:  RAQKYMSVEELLKSKRSEWEYKRASSSDHDSKKDKRQRTEERGRG
        RAQ+YMS  E   SKR     +     +    K +  R E+R RG
Subjt:  RAQKYMSVEELLKSKRSEWEYKRASSSDHDSKKDKRQRTEERGRG

TrEMBL top hitse value%identityAlignment
A0A6J1D3B7 uncharacterized protein LOC1110166196.3e-5148.36Show/hide
Query:  MKAEVSQKFKVPTFKQYNGKKDPVQHLNAYRSWMDFHGVSNAIRCRTFFFTLTGSARHWFERLKKRSISCFKELAQSFLAQFMGAREQRKPHINLLTVKQ
        MK +V  KFK+P  K+Y+G  DP+ HL+ Y  W D +G++ AIRCR F FTLTGS R WF++LK++SIS FKELA++F+ QF G   + +P   LLT+KQ
Subjt:  MKAEVSQKFKVPTFKQYNGKKDPVQHLNAYRSWMDFHGVSNAIRCRTFFFTLTGSARHWFERLKKRSISCFKELAQSFLAQFMGAREQRKPHINLLTVKQ

Query:  QPGEILRDYITRFNDEALQVEGDSDGAALVAITAGLENERLLNSIGKSQPRTHAEFVSRAQKYMSVEELLKSKRSEWEYKRASSSDHDSKKDKRQRTEER
        +  E L+DY+ RFN E LQVEG +D   L+   +G+++E+L+ S GK    T +E  SRAQ YMSV EL+ SKR      +   +D+++   KR+R  E+
Subjt:  QPGEILRDYITRFNDEALQVEGDSDGAALVAITAGLENERLLNSIGKSQPRTHAEFVSRAQKYMSVEELLKSKRSEWEYKRASSSDHDSKKDKRQRTEER

Query:  GRG----RPDHGRGR---AHPFGKFEKYTPTVVPQEQVLMEIRN
          G    R D G+GR     P  KFEKYTPT VP EQVLMEI++
Subjt:  GRG----RPDHGRGR---AHPFGKFEKYTPTVVPQEQVLMEIRN

A0A6J1D7D2 uncharacterized protein LOC1110183072.4e-4247.8Show/hide
Query:  EEVMKAEVSQKFKVPTFKQYNGKKDPVQHLNAYRSWMDFHGVSNAIRCRTFFFTLTGSARHWFERLKKRSISCFKELAQSFLAQFMGAREQRKPHINLLT
        EE+MK +V  KFK+PT KQ++G  D V HL+AYR WMD +GVS A++CR F  TL+GSAR WF +LK+ SIS FK LA++F+ QF+G R + +P   LLT
Subjt:  EEVMKAEVSQKFKVPTFKQYNGKKDPVQHLNAYRSWMDFHGVSNAIRCRTFFFTLTGSARHWFERLKKRSISCFKELAQSFLAQFMGAREQRKPHINLLT

Query:  VKQQPGEILRDYITRFNDEALQVEGDSDGAALVAITAGLENERLLNSIGKSQPRTHAEFVSRAQKYMSVEELLKSKRSEWEYKRASSSDHDSKKDKRQRT
        +KQ+  E L DY+ RFN+E LQVEG ++  +L+A  + + +E L  S GK  P T +E +SRAQKYMS  E   SKR     +   + +    K +  R 
Subjt:  VKQQPGEILRDYITRFNDEALQVEGDSDGAALVAITAGLENERLLNSIGKSQPRTHAEFVSRAQKYMSVEELLKSKRSEWEYKRASSSDHDSKKDKRQRT

Query:  EERGR
        E+R R
Subjt:  EERGR

A0A6J1DWY0 uncharacterized protein LOC1110252931.6e-5941.25Show/hide
Query:  LVRDPRKGKEPMEHTAESETRSKGKKTDSMTSKVR-GLKPTDRTILRSPESSTLKGRDYTVSTPSYGHTKTDLRNLIVEKRISAKTAESEAKATEAEAWA
        LVRDP+KGK P E   E  T S G       SK+R G     RT +  P  +    + +    P+ G +  D RN             SE  + +     
Subjt:  LVRDPRKGKEPMEHTAESETRSKGKKTDSMTSKVR-GLKPTDRTILRSPESSTLKGRDYTVSTPSYGHTKTDLRNLIVEKRISAKTAESEAKATEAEAWA

Query:  AEAEAKKDNLPWKTELLNTLKELGNPQGDQQKSKDLGDQNLEELADQVDPSFTEEVMKAEVSQKFKVPTFKQYNGKKDPVQHLNAYRSWMDFHGVSNAIR
           + K  + P  +E  ++ KE G               +LEEL DQ D  FTEE+M+ +V  KFK+PT KQ++   DPV HL+AYR WMD +GVS A+R
Subjt:  AEAEAKKDNLPWKTELLNTLKELGNPQGDQQKSKDLGDQNLEELADQVDPSFTEEVMKAEVSQKFKVPTFKQYNGKKDPVQHLNAYRSWMDFHGVSNAIR

Query:  CRTFFFTLTGSARHWFERLKKRSISCFKELAQSFLAQFMGAREQRKPHINLLTVKQQPGEILRDYITRFNDEALQVEGDSDGAALVAITAGLENERLLNS
        CR F  TL GSAR WF +LK+ SIS FK LA++F+ QF+G R + +P   LLT+KQ+  E LRDY+ RFN+E LQVEG +D  +L+A  +G+ +E L  S
Subjt:  CRTFFFTLTGSARHWFERLKKRSISCFKELAQSFLAQFMGAREQRKPHINLLTVKQQPGEILRDYITRFNDEALQVEGDSDGAALVAITAGLENERLLNS

Query:  IGKSQPRTHAEFVSRAQKYMSVEELLKSKRSEWEYKRASSSDHDSKKDKRQRTEERGRGRPDHGRGRAHPFGKFEKYTPTVVPQEQVLMEIRNTGLLKLP
         GK  P T +E +SRAQ+YMS  E   SKR     +     +    K +  R E+R R        +  P  KFEKYTPT VP EQVLMEI++  LLK P
Subjt:  IGKSQPRTHAEFVSRAQKYMSVEELLKSKRSEWEYKRASSSDHDSKKDKRQRTEERGRGRPDHGRGRAHPFGKFEKYTPTVVPQEQVLMEIRNTGLLKLP

A0A6J1DZ49 uncharacterized protein LOC1110248517.9e-4647.39Show/hide
Query:  MKAEVSQKFKVPTFKQYNGKKDPVQHLNAYRSWMDFHGVSNAIRCRTFFFTLTGSARHWFERLKKRSISCFKELAQSFLAQFMGAREQRKPHINLLTVKQ
        MK +   KFK+P   +Y+G  DP+ HL+AYR W D + +  AIRCR F FTLTGSAR+WF +LK+ SIS FKELA++F+ QF+G R + KP   LLT+KQ
Subjt:  MKAEVSQKFKVPTFKQYNGKKDPVQHLNAYRSWMDFHGVSNAIRCRTFFFTLTGSARHWFERLKKRSISCFKELAQSFLAQFMGAREQRKPHINLLTVKQ

Query:  QPGEILRDYITRFNDEALQVEGDSDGAALVAITAGLENERLLNSIGKSQPRTHAEFVSRAQKYMSVEELLKSKRSEWEYKRAS-SSDHDSKKDKRQRTEE
        +  E L++Y+ RFN+E LQVEG +D  AL+A  +G+++ERL+ S GK  P T  E +SRAQKYMS  EL+   R + E +RA+ S+  + + +KR R+  
Subjt:  QPGEILRDYITRFNDEALQVEGDSDGAALVAITAGLENERLLNSIGKSQPRTHAEFVSRAQKYMSVEELLKSKRSEWEYKRAS-SSDHDSKKDKRQRTEE

Query:  RGRGRPDHGRGR---AHPFGKFEKYTPTVV
            R D G GR     P  KFEK   +++
Subjt:  RGRGRPDHGRGR---AHPFGKFEKYTPTVV

A0A6J1E1E7 uncharacterized protein LOC1110255484.2e-4745.71Show/hide
Query:  LNTLKELGNPQGDQQK--SKDLGDQNLEELADQVDPSFTEEVMKAEVSQKFKVPTFKQYNGKKDPVQHLNAYRSWMDFHGVSNAIRCRTFFFTLTGSARH
        LN  K +  P+  +++   K+ G  +LEEL  Q D  FTEE+M+ +V  KFK+PT K ++G  +PV HL+AYR WMD +GVS+AIRCR F  TL GSAR 
Subjt:  LNTLKELGNPQGDQQK--SKDLGDQNLEELADQVDPSFTEEVMKAEVSQKFKVPTFKQYNGKKDPVQHLNAYRSWMDFHGVSNAIRCRTFFFTLTGSARH

Query:  WFERLKKRSISCFKELAQSFLAQFMGAREQRKPHINLLTVKQQPGEILRDYITRFNDEALQVEGDSDGAALVAITAGLENERLLNSIGKSQPRTHAEFVS
        WF +LK+ SIS FK LA++F+ QF+G R + +P   LLT+KQ+  E L DY+ RFN+E LQ+EG +D  +L+A  +G+ +E L  S  K  P T +E +S
Subjt:  WFERLKKRSISCFKELAQSFLAQFMGAREQRKPHINLLTVKQQPGEILRDYITRFNDEALQVEGDSDGAALVAITAGLENERLLNSIGKSQPRTHAEFVS

Query:  RAQKYMSVEELLKSKRSEWEYKRASSSDHDSKKDKRQRTEERGRG
        RAQ+YMS  E   SKR     +     +    K +  R E+R RG
Subjt:  RAQKYMSVEELLKSKRSEWEYKRASSSDHDSKKDKRQRTEERGRG

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGTAAGGGGATGGAGAAGAAAGATCAGGACATGAACGTAGAGCATTCGGATGGTGACCACCACCAGCGGAGGTCACGGGAGGAAGGCCGAGGCCGACCTCAAACCGA
ATCCCCTCGACCTCGATCTCCACTGCCCTCATCCCGAGAGAAGCAATCTGATTTAAAATTTGCTGCTCTCGAAAACAAAGTAAGTGTGATGGATCATAATTTGTCTAGGA
TACTAGGTATCTTGGATAAACCTGGTCCTAGCACTAAAACCCCTGATGAGAGGCTGGTTAGGGATCCGAGGAAGGGGAAGGAGCCTATGGAGCACACTGCAGAGTCAGAG
ACGAGGTCAAAGGGAAAGAAGACTGACAGCATGACCAGCAAGGTCAGGGGGCTCAAACCTACTGATCGCACGATTTTGAGGAGCCCAGAGTCAAGCACACTTAAGGGGCG
TGACTACACAGTTTCTACCCCAAGCTACGGTCATACTAAGACAGACTTGAGGAATCTGATCGTTGAGAAGCGCATAAGTGCCAAAACTGCCGAGTCCGAGGCCAAAGCCA
CCGAAGCTGAAGCCTGGGCTGCCGAGGCCGAGGCCAAGAAAGACAACCTCCCTTGGAAGACCGAGCTTCTAAACACACTAAAGGAGCTCGGAAATCCTCAGGGAGACCAG
CAGAAGTCAAAGGACCTTGGAGATCAAAACTTGGAAGAACTAGCCGACCAAGTCGATCCGTCCTTCACAGAAGAAGTCATGAAAGCCGAGGTGTCTCAAAAGTTTAAGGT
ACCCACGTTCAAACAGTATAATGGCAAGAAAGACCCCGTGCAACATCTAAACGCATACAGAAGCTGGATGGACTTCCACGGCGTCTCAAATGCAATCAGGTGCCGCACAT
TCTTTTTCACTCTGACAGGATCAGCCAGGCACTGGTTTGAGAGGCTGAAAAAGAGATCCATCAGCTGTTTCAAAGAGTTGGCCCAATCATTCCTTGCACAGTTCATGGGA
GCCAGAGAGCAGCGCAAGCCTCACATCAACCTCTTGACGGTCAAACAACAGCCAGGTGAGATTTTGCGTGATTACATAACTCGTTTTAATGACGAGGCACTACAGGTTGA
GGGGGACAGCGATGGAGCAGCCCTGGTAGCCATAACAGCTGGACTGGAAAACGAAAGACTGCTCAATTCAATAGGTAAGAGCCAACCTCGAACCCATGCGGAGTTTGTCT
CCCGGGCACAAAAGTATATGAGCGTAGAGGAGTTACTGAAATCAAAGAGGTCAGAATGGGAGTACAAGAGGGCTTCTTCATCTGACCACGACAGTAAAAAGGACAAGAGG
CAGCGGACAGAAGAAAGGGGCCGAGGCCGACCAGACCATGGCCGAGGCCGAGCACATCCTTTCGGTAAGTTTGAGAAATACACCCCAACTGTTGTTCCACAGGAGCAAGT
GCTGATGGAGATCCGAAATACGGGCCTCCTGAAACTCCCAGGAAAATAA
mRNA sequenceShow/hide mRNA sequence
ATGAGTAAGGGGATGGAGAAGAAAGATCAGGACATGAACGTAGAGCATTCGGATGGTGACCACCACCAGCGGAGGTCACGGGAGGAAGGCCGAGGCCGACCTCAAACCGA
ATCCCCTCGACCTCGATCTCCACTGCCCTCATCCCGAGAGAAGCAATCTGATTTAAAATTTGCTGCTCTCGAAAACAAAGTAAGTGTGATGGATCATAATTTGTCTAGGA
TACTAGGTATCTTGGATAAACCTGGTCCTAGCACTAAAACCCCTGATGAGAGGCTGGTTAGGGATCCGAGGAAGGGGAAGGAGCCTATGGAGCACACTGCAGAGTCAGAG
ACGAGGTCAAAGGGAAAGAAGACTGACAGCATGACCAGCAAGGTCAGGGGGCTCAAACCTACTGATCGCACGATTTTGAGGAGCCCAGAGTCAAGCACACTTAAGGGGCG
TGACTACACAGTTTCTACCCCAAGCTACGGTCATACTAAGACAGACTTGAGGAATCTGATCGTTGAGAAGCGCATAAGTGCCAAAACTGCCGAGTCCGAGGCCAAAGCCA
CCGAAGCTGAAGCCTGGGCTGCCGAGGCCGAGGCCAAGAAAGACAACCTCCCTTGGAAGACCGAGCTTCTAAACACACTAAAGGAGCTCGGAAATCCTCAGGGAGACCAG
CAGAAGTCAAAGGACCTTGGAGATCAAAACTTGGAAGAACTAGCCGACCAAGTCGATCCGTCCTTCACAGAAGAAGTCATGAAAGCCGAGGTGTCTCAAAAGTTTAAGGT
ACCCACGTTCAAACAGTATAATGGCAAGAAAGACCCCGTGCAACATCTAAACGCATACAGAAGCTGGATGGACTTCCACGGCGTCTCAAATGCAATCAGGTGCCGCACAT
TCTTTTTCACTCTGACAGGATCAGCCAGGCACTGGTTTGAGAGGCTGAAAAAGAGATCCATCAGCTGTTTCAAAGAGTTGGCCCAATCATTCCTTGCACAGTTCATGGGA
GCCAGAGAGCAGCGCAAGCCTCACATCAACCTCTTGACGGTCAAACAACAGCCAGGTGAGATTTTGCGTGATTACATAACTCGTTTTAATGACGAGGCACTACAGGTTGA
GGGGGACAGCGATGGAGCAGCCCTGGTAGCCATAACAGCTGGACTGGAAAACGAAAGACTGCTCAATTCAATAGGTAAGAGCCAACCTCGAACCCATGCGGAGTTTGTCT
CCCGGGCACAAAAGTATATGAGCGTAGAGGAGTTACTGAAATCAAAGAGGTCAGAATGGGAGTACAAGAGGGCTTCTTCATCTGACCACGACAGTAAAAAGGACAAGAGG
CAGCGGACAGAAGAAAGGGGCCGAGGCCGACCAGACCATGGCCGAGGCCGAGCACATCCTTTCGGTAAGTTTGAGAAATACACCCCAACTGTTGTTCCACAGGAGCAAGT
GCTGATGGAGATCCGAAATACGGGCCTCCTGAAACTCCCAGGAAAATAA
Protein sequenceShow/hide protein sequence
MSKGMEKKDQDMNVEHSDGDHHQRRSREEGRGRPQTESPRPRSPLPSSREKQSDLKFAALENKVSVMDHNLSRILGILDKPGPSTKTPDERLVRDPRKGKEPMEHTAESE
TRSKGKKTDSMTSKVRGLKPTDRTILRSPESSTLKGRDYTVSTPSYGHTKTDLRNLIVEKRISAKTAESEAKATEAEAWAAEAEAKKDNLPWKTELLNTLKELGNPQGDQ
QKSKDLGDQNLEELADQVDPSFTEEVMKAEVSQKFKVPTFKQYNGKKDPVQHLNAYRSWMDFHGVSNAIRCRTFFFTLTGSARHWFERLKKRSISCFKELAQSFLAQFMG
AREQRKPHINLLTVKQQPGEILRDYITRFNDEALQVEGDSDGAALVAITAGLENERLLNSIGKSQPRTHAEFVSRAQKYMSVEELLKSKRSEWEYKRASSSDHDSKKDKR
QRTEERGRGRPDHGRGRAHPFGKFEKYTPTVVPQEQVLMEIRNTGLLKLPGK