CuGenDBv2

Lag0018237 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0018237
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: RT_RNaseH_2 domain-containing protein
Genome location: chr5:20023736..20026796
RNA-Seq Expression: Lag0018237
Synteny: Lag0018237
Gene Ontology terms: GO:0015074 - DNA integration (biological process)
                     GO:0003676 - nucleic acid binding (molecular function)
InterPro domains: NA


Homology
GenBank top hits (e value, % identity, alignment)
PON35554.1 hypothetical protein PanWU01x14_335450, partial [Parasponia andersonii] | e value: 4.9e-23 | % identity: 31.92
Query:  IRFVNDLARAKYQ-EVLKRDFLFERGFGSD-------LPRFLESGIVNLGWRQFCAKPEPVNSNIVREFYANL---------------------------
        ++F  + A  +Y+  +  R    E+GF  D       LP F+   I    W+QFCA PE     +VREFYANL                           
Subjt:  IRFVNDLARAKYQ-EVLKRDFLFERGFGSD-------LPRFLESGIVNLGWRQFCAKPEPVNSNIVREFYANL---------------------------

Query:  ---DVKNDFEMVVAPSSDQLSAAVREVGIEGARWRVSQTRKHTFQAAYLKSEANTWMGFIRLRLLPTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSEI
            V    E +   +   L   +  V + GA W VS    +T   + L   A  W  F++  LLPTTH  TVS+DR+LL  ++L   SI+VG++I SEI
Subjt:  ---DVKNDFEMVVAPSSDQLSAAVREVGIEGARWRVSQTRKHTFQAAYLKSEANTWMGFIRLRLLPTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSEI

Query:  VDCSKKKVEKLFFPNTITMLCSRAGVPTVPEDMIMIDKGIIDTPNLARLQR---TEEARQ
          C+ +K   LFFP+ IT LC  A  P +  +  + + G ID   +AR+ +   TE  +Q
Subjt:  VDCSKKKVEKLFFPNTITMLCSRAGVPTVPEDMIMIDKGIIDTPNLARLQR---TEEARQ

PON46472.1 hypothetical protein PanWU01x14_251180, partial [Parasponia andersonii] | e value: 3.6e-34 | % identity: 32.08
Query:  IRFVNDLARAKYQ-EVLKRDFLFERGFGSD-------LPRFLESGIVNLGWRQFCAKPEPVNSNIVREFYANL---------------------------
        ++F  + A  +Y+  +  R    E+GF  D       LP F+   I    W+QFCA PE     +VREFYANL                           
Subjt:  IRFVNDLARAKYQ-EVLKRDFLFERGFGSD-------LPRFLESGIVNLGWRQFCAKPEPVNSNIVREFYANL---------------------------

Query:  ---DVKNDFEMVVAPSSDQLSAAVREVGIEGARWRVSQTRKHTFQAAYLKSEANTWMGFIRLRLLPTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSEI
            V    E +   +   L   +  V   GA W VS    +T   + L   A  W  F++ RLLPTTH  TVS+DR+LL  ++L   SI+VG++I SEI
Subjt:  ---DVKNDFEMVVAPSSDQLSAAVREVGIEGARWRVSQTRKHTFQAAYLKSEANTWMGFIRLRLLPTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSEI

Query:  VDCSKKKVEKLFFPNTITMLCSRAGVPTVPEDMIMIDKGIIDTPNLARLQR---TEEARQGGLVYGV--------NQILEQLAVLTSR------------
          C+ +K   LFFP+ IT LC  A  P +  +  + + G ID   +AR+ +   TE  +Q                 IL+QL  L  R            
Subjt:  VDCSKKKVEKLFFPNTITMLCSRAGVPTVPEDMIMIDKGIIDTPNLARLQR---TEEARQGGLVYGV--------NQILEQLAVLTSR------------

Query:  --LEFAERQAQTYWTYAKKRDDALMGALQTNFLRPYQAFPVFPDDL
          L+   +Q Q +W Y+K+RD AL  ALQ NF RP   FP FP ++
Subjt:  --LEFAERQAQTYWTYAKKRDDALMGALQTNFLRPYQAFPVFPDDL

PON59596.1 hypothetical protein PanWU01x14_158080 [Parasponia andersonii] | e value: 1.3e-23 | % identity: 36.84
Query:  QTRKHTFQAAYLKSEANTWMGFIRLRLLPTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSEIVDCSKKKVEKLFFPNTITMLCSRAGVPTVPEDMIMID
        Q R    +   L   A  W  F++ RLLPTTH  TVS+DR+LL +++L   SI+VG++I SEI  C+ +K   LFFP+ IT LC  A  P +  +  +  
Subjt:  QTRKHTFQAAYLKSEANTWMGFIRLRLLPTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSEIVDCSKKKVEKLFFPNTITMLCSRAGVPTVPEDMIMID

Query:  KGIIDTPNLARLQ---RTEEARQ--------GGLVYGVNQILEQLAVLTSR--------------LEFAERQAQTYWTYAKKRDDALMGALQTNFLRPYQ
         G ID   +AR+    +TE  +Q                 IL+QL  L  R              L+   +Q Q +W Y+K+RD AL  ALQ NF RP  
Subjt:  KGIIDTPNLARLQ---RTEEARQ--------GGLVYGVNQILEQLAVLTSR--------------LEFAERQAQTYWTYAKKRDDALMGALQTNFLRPYQ

Query:  AFPVFPDDL
         FP FP +L
Subjt:  AFPVFPDDL

PON78020.1 hypothetical protein PanWU01x14_023740 [Parasponia andersonii] | e value: 2.1e-26 | % identity: 36.28
Query:  EMVVAPSSDQLSAAVREVGIEGARWRVSQTRKHTFQAAYLKSEANTWMGFIRLRLLPTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSEIVDCSKKKVE
        E +   +  +L   +  V   GA W VS    +T   + L   A  W  F++ RLLPTTH   VS+DR+LL  ++L   SI+VG++I SEI  C+ +K  
Subjt:  EMVVAPSSDQLSAAVREVGIEGARWRVSQTRKHTFQAAYLKSEANTWMGFIRLRLLPTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSEIVDCSKKKVE

Query:  KLFFPNTITMLCSRAGVPTVPEDMIMIDKGIIDTPNLARLQR---TEEARQGGLVYGV--------NQILEQLAVLTSRL---EFAERQAQTYWTYAKKR
         LFFP+ IT LC  A  P +  +  + + G ID   +AR+ +   TE  +Q                 +L+QL  L  RL   E   +Q Q +W Y+K+R
Subjt:  KLFFPNTITMLCSRAGVPTVPEDMIMIDKGIIDTPNLARLQR---TEEARQGGLVYGV--------NQILEQLAVLTSRL---EFAERQAQTYWTYAKKR

Query:  DDALMGALQTNFLRPYQAFPVFPDDL
        D AL  ALQ NF RP   FP FP ++
Subjt:  DDALMGALQTNFLRPYQAFPVFPDDL

TYG52543.1 hypothetical protein ES288_D09G036700v1 [Gossypium darwinii] | e value: 4.9e-23 | % identity: 33.62
Query:  VNDLARAKYQEVLK-RDFLFERGFG---SDL---PRFLESGIVNLGWRQFCAKPEPVNSNIVREFYANLDVKNDFEMVVAPSS----DQLSAAVREVGIE
        +++  + ++  + K +  + E+GFG   +DL   P  +   I  L W +FC      +  +VREFYA+L  ++  E++V   S    D L   +  V   
Subjt:  VNDLARAKYQEVLK-RDFLFERGFG---SDL---PRFLESGIVNLGWRQFCAKPEPVNSNIVREFYANLDVKNDFEMVVAPSS----DQLSAAVREVGIE

Query:  GARWRVSQTRKHTFQAAYLKSEANTWMGFIRLRLLPTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSEIVDCSKKKVEKLFFPNTITMLCSRAGVPTVP
        G++W +     H+ Q  YLK  A  W  F+R   +P +H ST+S + +LL +AIL   SI+VGKII  EI +C+KKK    +FP+ IT LC +A V    
Subjt:  GARWRVSQTRKHTFQAAYLKSEANTWMGFIRLRLLPTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSEIVDCSKKKVEKLFFPNTITMLCSRAGVPTVP

Query:  EDMIMIDKGIIDTPNLARL-QRTEEARQG
               +G I   +L RL +R  E  QG
Subjt:  EDMIMIDKGIIDTPNLARL-QRTEEARQG

TrEMBL top hits (e value, % identity, alignment)
A0A2P5AGA5 Uncharacterized protein (Fragment) | e value: 2.4e-23 | % identity: 31.92
Query:  IRFVNDLARAKYQ-EVLKRDFLFERGFGSD-------LPRFLESGIVNLGWRQFCAKPEPVNSNIVREFYANL---------------------------
        ++F  + A  +Y+  +  R    E+GF  D       LP F+   I    W+QFCA PE     +VREFYANL                           
Subjt:  IRFVNDLARAKYQ-EVLKRDFLFERGFGSD-------LPRFLESGIVNLGWRQFCAKPEPVNSNIVREFYANL---------------------------

Query:  ---DVKNDFEMVVAPSSDQLSAAVREVGIEGARWRVSQTRKHTFQAAYLKSEANTWMGFIRLRLLPTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSEI
            V    E +   +   L   +  V + GA W VS    +T   + L   A  W  F++  LLPTTH  TVS+DR+LL  ++L   SI+VG++I SEI
Subjt:  ---DVKNDFEMVVAPSSDQLSAAVREVGIEGARWRVSQTRKHTFQAAYLKSEANTWMGFIRLRLLPTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSEI

Query:  VDCSKKKVEKLFFPNTITMLCSRAGVPTVPEDMIMIDKGIIDTPNLARLQR---TEEARQ
          C+ +K   LFFP+ IT LC  A  P +  +  + + G ID   +AR+ +   TE  +Q
Subjt:  VDCSKKKVEKLFFPNTITMLCSRAGVPTVPEDMIMIDKGIIDTPNLARLQR---TEEARQ

A0A2P5BCG4 Uncharacterized protein (Fragment) | e value: 1.8e-34 | % identity: 32.08
Query:  IRFVNDLARAKYQ-EVLKRDFLFERGFGSD-------LPRFLESGIVNLGWRQFCAKPEPVNSNIVREFYANL---------------------------
        ++F  + A  +Y+  +  R    E+GF  D       LP F+   I    W+QFCA PE     +VREFYANL                           
Subjt:  IRFVNDLARAKYQ-EVLKRDFLFERGFGSD-------LPRFLESGIVNLGWRQFCAKPEPVNSNIVREFYANL---------------------------

Query:  ---DVKNDFEMVVAPSSDQLSAAVREVGIEGARWRVSQTRKHTFQAAYLKSEANTWMGFIRLRLLPTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSEI
            V    E +   +   L   +  V   GA W VS    +T   + L   A  W  F++ RLLPTTH  TVS+DR+LL  ++L   SI+VG++I SEI
Subjt:  ---DVKNDFEMVVAPSSDQLSAAVREVGIEGARWRVSQTRKHTFQAAYLKSEANTWMGFIRLRLLPTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSEI

Query:  VDCSKKKVEKLFFPNTITMLCSRAGVPTVPEDMIMIDKGIIDTPNLARLQR---TEEARQGGLVYGV--------NQILEQLAVLTSR------------
          C+ +K   LFFP+ IT LC  A  P +  +  + + G ID   +AR+ +   TE  +Q                 IL+QL  L  R            
Subjt:  VDCSKKKVEKLFFPNTITMLCSRAGVPTVPEDMIMIDKGIIDTPNLARLQR---TEEARQGGLVYGV--------NQILEQLAVLTSR------------

Query:  --LEFAERQAQTYWTYAKKRDDALMGALQTNFLRPYQAFPVFPDDL
          L+   +Q Q +W Y+K+RD AL  ALQ NF RP   FP FP ++
Subjt:  --LEFAERQAQTYWTYAKKRDDALMGALQTNFLRPYQAFPVFPDDL

A0A2P5CEY2 Uncharacterized protein | e value: 6.3e-24 | % identity: 36.84
Query:  QTRKHTFQAAYLKSEANTWMGFIRLRLLPTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSEIVDCSKKKVEKLFFPNTITMLCSRAGVPTVPEDMIMID
        Q R    +   L   A  W  F++ RLLPTTH  TVS+DR+LL +++L   SI+VG++I SEI  C+ +K   LFFP+ IT LC  A  P +  +  +  
Subjt:  QTRKHTFQAAYLKSEANTWMGFIRLRLLPTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSEIVDCSKKKVEKLFFPNTITMLCSRAGVPTVPEDMIMID

Query:  KGIIDTPNLARLQ---RTEEARQ--------GGLVYGVNQILEQLAVLTSR--------------LEFAERQAQTYWTYAKKRDDALMGALQTNFLRPYQ
         G ID   +AR+    +TE  +Q                 IL+QL  L  R              L+   +Q Q +W Y+K+RD AL  ALQ NF RP  
Subjt:  KGIIDTPNLARLQ---RTEEARQ--------GGLVYGVNQILEQLAVLTSR--------------LEFAERQAQTYWTYAKKRDDALMGALQTNFLRPYQ

Query:  AFPVFPDDL
         FP FP +L
Subjt:  AFPVFPDDL

A0A2P5DXM3 Uncharacterized protein | e value: 1.0e-26 | % identity: 36.28
Query:  EMVVAPSSDQLSAAVREVGIEGARWRVSQTRKHTFQAAYLKSEANTWMGFIRLRLLPTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSEIVDCSKKKVE
        E +   +  +L   +  V   GA W VS    +T   + L   A  W  F++ RLLPTTH   VS+DR+LL  ++L   SI+VG++I SEI  C+ +K  
Subjt:  EMVVAPSSDQLSAAVREVGIEGARWRVSQTRKHTFQAAYLKSEANTWMGFIRLRLLPTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSEIVDCSKKKVE

Query:  KLFFPNTITMLCSRAGVPTVPEDMIMIDKGIIDTPNLARLQR---TEEARQGGLVYGV--------NQILEQLAVLTSRL---EFAERQAQTYWTYAKKR
         LFFP+ IT LC  A  P +  +  + + G ID   +AR+ +   TE  +Q                 +L+QL  L  RL   E   +Q Q +W Y+K+R
Subjt:  KLFFPNTITMLCSRAGVPTVPEDMIMIDKGIIDTPNLARLQR---TEEARQGGLVYGV--------NQILEQLAVLTSRL---EFAERQAQTYWTYAKKR

Query:  DDALMGALQTNFLRPYQAFPVFPDDL
        D AL  ALQ NF RP   FP FP ++
Subjt:  DDALMGALQTNFLRPYQAFPVFPDDL

A0A5D2B8V0 Uncharacterized protein | e value: 2.4e-23 | % identity: 33.62
Query:  VNDLARAKYQEVLK-RDFLFERGFG---SDL---PRFLESGIVNLGWRQFCAKPEPVNSNIVREFYANLDVKNDFEMVVAPSS----DQLSAAVREVGIE
        +++  + ++  + K +  + E+GFG   +DL   P  +   I  L W +FC      +  +VREFYA+L  ++  E++V   S    D L   +  V   
Subjt:  VNDLARAKYQEVLK-RDFLFERGFG---SDL---PRFLESGIVNLGWRQFCAKPEPVNSNIVREFYANLDVKNDFEMVVAPSS----DQLSAAVREVGIE

Query:  GARWRVSQTRKHTFQAAYLKSEANTWMGFIRLRLLPTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSEIVDCSKKKVEKLFFPNTITMLCSRAGVPTVP
        G++W +     H+ Q  YLK  A  W  F+R   +P +H ST+S + +LL +AIL   SI+VGKII  EI +C+KKK    +FP+ IT LC +A V    
Subjt:  GARWRVSQTRKHTFQAAYLKSEANTWMGFIRLRLLPTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSEIVDCSKKKVEKLFFPNTITMLCSRAGVPTVP

Query:  EDMIMIDKGIIDTPNLARL-QRTEEARQG
               +G I   +L RL +R  E  QG
Subjt:  EDMIMIDKGIIDTPNLARL-QRTEEARQG

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
No hits found

Sequences
CDS sequence
ATGCAAGCTGAAATTAGTAATAGGGAAATTAAAGCTATTTTAGAGAAAGTAGTCCATCCATCTAGAAAGGATTGGTCCTTTAGGTTGGATGAGGCTCTTTGGGCTTATAG
GACAGCTTATAAGACTCCTCTAGGTCCGTTTGTTGTGATTGAGGTTTTTCCCCATGGAGCAGTTACTTTGCAAGATGAAAAAGATGGGAGAGTATTCAAGGTTCAGAAGA
TTGTTGCAGCAAAGATAATGCTGGAGCAGAATTTCCGCACGAATAGGAAGGATTTTAATGATTTCAAATTGCTGGAGCCTTATTTATTTGCAGACATTGGTAAGTCTTCT
TTCTACTTCGCCCTCTTGTTTAATCTTGCCATCTACATTCTTTCTTTCTCCTTTACATTTTCTGCAAAACCCTTTGAGATATCTATGGCTAAAACAAGAGCTAGGAAAGA
GAGGGAGAGTGAAGAGGAGGAAGTACCGGTCACGCCAGAAGTGCAAAAAGGGAAAACCAAAAAGAAAAGAACGCCGGAGGAAAAGGAAGCAAAGAAAAGGAGAAGACAGC
AAAGGGCTGCAGAACAGGAGGAAGTTCAGGAGGTGGCAGACGTTGTTGCCACTACTGCGGAGGAAGGAAGTACTCAAGAACCTGAAGTACAAAACCCAGATACGGTTCAA
GAAAAGATTGCTGAGAAAAATCAAGAAACAGAGGTTGAAGAACGCCGCCGCATCAAGAGGAAGGCGGGTCGCGTGAGGGTGATTCGGAACACTCCATCACCTCCGACGTC
GGACTCTGAGGAAGAAAAAAGGGAAGCTGAGAATAAGGAAAAAGAAGAAGAGGCAAGAAAGGCAGAAGAAGAGCGTTTGCGTGAACAGAGAGAAAGCAAGGGCAAAGGAA
TTGCCGAAGCATCGGGAGAAATTGAGGAGCCGAGGGCACCATTCATTCGCTTCGTCAACGATCTTGCTCGAGCAAAATACCAGGAGGTGCTGAAACGGGACTTCTTGTTC
GAACGAGGATTTGGCAGTGATTTGCCCAGGTTCTTGGAGTCTGGAATAGTGAACCTCGGATGGAGGCAATTTTGTGCGAAACCAGAACCTGTCAATTCCAACATTGTTCG
AGAATTTTATGCCAATCTTGACGTTAAGAATGATTTTGAGATGGTGGTTGCACCATCTAGTGACCAACTGAGTGCGGCTGTCCGGGAGGTAGGCATTGAGGGGGCTCGAT
GGAGGGTGTCGCAGACGCGGAAGCATACGTTTCAAGCTGCTTATTTGAAGAGTGAAGCCAACACTTGGATGGGTTTCATCAGGCTACGCTTGCTGCCGACAACACACGAC
TCCACAGTATCTCGGGACAGGGTATTGCTTGCCTTTGCCATTCTTCGCTCGATGAGTATTGATGTAGGAAAAATTATTTCTTCTGAGATTGTTGATTGCTCGAAAAAGAA
GGTGGAGAAGCTGTTCTTTCCAAACACTATCACAATGTTATGCAGCAGGGCAGGAGTGCCCACGGTTCCAGAAGATATGATCATGATTGATAAGGGAATCATTGACACAC
CTAATCTGGCGCGGCTTCAGCGTACGGAAGAGGCTCGCCAGGGAGGGCTGGTGTATGGCGTTAATCAGATCCTAGAGCAACTGGCAGTGTTGACCAGTAGGTTAGAATTT
GCTGAAAGGCAAGCTCAGACCTATTGGACTTATGCTAAAAAGAGAGATGATGCACTCATGGGGGCCTTGCAAACCAATTTCTTAAGACCATATCAGGCCTTTCCAGTGTT
TCCCGATGATTTGTTTAATCTCTGGATTCCCCCCCACCTGTTGAAAAAGAAGAAGAGAATGATGATGAAGAGCAGGGTCAGGAAGATTGATGGATGA
mRNA sequence
ATGCAAGCTGAAATTAGTAATAGGGAAATTAAAGCTATTTTAGAGAAAGTAGTCCATCCATCTAGAAAGGATTGGTCCTTTAGGTTGGATGAGGCTCTTTGGGCTTATAG
GACAGCTTATAAGACTCCTCTAGGTCCGTTTGTTGTGATTGAGGTTTTTCCCCATGGAGCAGTTACTTTGCAAGATGAAAAAGATGGGAGAGTATTCAAGGTTCAGAAGA
TTGTTGCAGCAAAGATAATGCTGGAGCAGAATTTCCGCACGAATAGGAAGGATTTTAATGATTTCAAATTGCTGGAGCCTTATTTATTTGCAGACATTGGTAAGTCTTCT
TTCTACTTCGCCCTCTTGTTTAATCTTGCCATCTACATTCTTTCTTTCTCCTTTACATTTTCTGCAAAACCCTTTGAGATATCTATGGCTAAAACAAGAGCTAGGAAAGA
GAGGGAGAGTGAAGAGGAGGAAGTACCGGTCACGCCAGAAGTGCAAAAAGGGAAAACCAAAAAGAAAAGAACGCCGGAGGAAAAGGAAGCAAAGAAAAGGAGAAGACAGC
AAAGGGCTGCAGAACAGGAGGAAGTTCAGGAGGTGGCAGACGTTGTTGCCACTACTGCGGAGGAAGGAAGTACTCAAGAACCTGAAGTACAAAACCCAGATACGGTTCAA
GAAAAGATTGCTGAGAAAAATCAAGAAACAGAGGTTGAAGAACGCCGCCGCATCAAGAGGAAGGCGGGTCGCGTGAGGGTGATTCGGAACACTCCATCACCTCCGACGTC
GGACTCTGAGGAAGAAAAAAGGGAAGCTGAGAATAAGGAAAAAGAAGAAGAGGCAAGAAAGGCAGAAGAAGAGCGTTTGCGTGAACAGAGAGAAAGCAAGGGCAAAGGAA
TTGCCGAAGCATCGGGAGAAATTGAGGAGCCGAGGGCACCATTCATTCGCTTCGTCAACGATCTTGCTCGAGCAAAATACCAGGAGGTGCTGAAACGGGACTTCTTGTTC
GAACGAGGATTTGGCAGTGATTTGCCCAGGTTCTTGGAGTCTGGAATAGTGAACCTCGGATGGAGGCAATTTTGTGCGAAACCAGAACCTGTCAATTCCAACATTGTTCG
AGAATTTTATGCCAATCTTGACGTTAAGAATGATTTTGAGATGGTGGTTGCACCATCTAGTGACCAACTGAGTGCGGCTGTCCGGGAGGTAGGCATTGAGGGGGCTCGAT
GGAGGGTGTCGCAGACGCGGAAGCATACGTTTCAAGCTGCTTATTTGAAGAGTGAAGCCAACACTTGGATGGGTTTCATCAGGCTACGCTTGCTGCCGACAACACACGAC
TCCACAGTATCTCGGGACAGGGTATTGCTTGCCTTTGCCATTCTTCGCTCGATGAGTATTGATGTAGGAAAAATTATTTCTTCTGAGATTGTTGATTGCTCGAAAAAGAA
GGTGGAGAAGCTGTTCTTTCCAAACACTATCACAATGTTATGCAGCAGGGCAGGAGTGCCCACGGTTCCAGAAGATATGATCATGATTGATAAGGGAATCATTGACACAC
CTAATCTGGCGCGGCTTCAGCGTACGGAAGAGGCTCGCCAGGGAGGGCTGGTGTATGGCGTTAATCAGATCCTAGAGCAACTGGCAGTGTTGACCAGTAGGTTAGAATTT
GCTGAAAGGCAAGCTCAGACCTATTGGACTTATGCTAAAAAGAGAGATGATGCACTCATGGGGGCCTTGCAAACCAATTTCTTAAGACCATATCAGGCCTTTCCAGTGTT
TCCCGATGATTTGTTTAATCTCTGGATTCCCCCCCACCTGTTGAAAAAGAAGAAGAGAATGATGATGAAGAGCAGGGTCAGGAAGATTGATGGATGA
Protein sequence
MQAEISNREIKAILEKVVHPSRKDWSFRLDEALWAYRTAYKTPLGPFVVIEVFPHGAVTLQDEKDGRVFKVQKIVAAKIMLEQNFRTNRKDFNDFKLLEPYLFADIGKSS
FYFALLFNLAIYILSFSFTFSAKPFEISMAKTRARKERESEEEEVPVTPEVQKGKTKKKRTPEEKEAKKRRRQQRAAEQEEVQEVADVVATTAEEGSTQEPEVQNPDTVQ
EKIAEKNQETEVEERRRIKRKAGRVRVIRNTPSPPTSDSEEEKREAENKEKEEEARKAEEERLREQRESKGKGIAEASGEIEEPRAPFIRFVNDLARAKYQEVLKRDFLF
ERGFGSDLPRFLESGIVNLGWRQFCAKPEPVNSNIVREFYANLDVKNDFEMVVAPSSDQLSAAVREVGIEGARWRVSQTRKHTFQAAYLKSEANTWMGFIRLRLLPTTHD
STVSRDRVLLAFAILRSMSIDVGKIISSEIVDCSKKKVEKLFFPNTITMLCSRAGVPTVPEDMIMIDKGIIDTPNLARLQRTEEARQGGLVYGVNQILEQLAVLTSRLEF
AERQAQTYWTYAKKRDDALMGALQTNFLRPYQAFPVFPDDLFNLWIPPHLLKKKKRMMMKSRVRKIDG
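
The relationship between the CDS and protein sequences listed above can be spot-checked with a short Python sketch. This is an illustration only, not part of the database record: the function name `translate` is hypothetical, and the standard genetic code (NCBI translation table 1) is assumed. The two inputs are the first 30 and the last 18 nucleotides of the CDS.

```python
from itertools import product

# Standard genetic code (assumed: NCBI translation table 1),
# with codons enumerated in T, C, A, G order at each position.
BASES = "TCAG"
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds: str) -> str:
    """Translate a coding sequence in frame 0, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    # Ignore any incomplete trailing codon.
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First 30 nt of the CDS correspond to the start of the protein sequence.
print(translate("ATGCAAGCTGAAATTAGTAATAGGGAAATT"))  # MQAEISNREI
# Last 18 nt of the CDS (including the TGA stop) correspond to its end.
print(translate("AGGAAGATTGATGGATGA"))  # RKIDG
```

Passing the full CDS with its line breaks removed should reproduce the full protein sequence in the same way.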