; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg002381 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg002381
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionTransposon TX1 uncharacterized 149 kDa protein
Genome locationscaffold6:2417590..2421125
RNA-Seq ExpressionSpg002381
SyntenySpg002381
Gene Ontology termsGO:0006281 - DNA repair (biological process)
GO:0004518 - nuclease activity (molecular function)
InterPro domainsIPR004808 - AP endonuclease 1
IPR036691 - Endonuclease/exonuclease/phosphatase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0058980.1 uncharacterized protein E6C27_scaffold98G001710 [Cucumis melo var. makuwa]5.6e-2532.1Show/hide
Query:  DCWL----LGAGLLDDSNRQKRLALKSDLQEIALFEVR------YWSQRCKKLWLSDGDENTAYFHKDDDKYIDNFFFIIKSFEQASGLRINLSKSAVTA
        DC+L    +   L + ++  K   LKS L  + +   R      +W   C  L LS         +     ++ N+  + KS E+  GL I+  +    A
Subjt:  DCWL----LGAGLLDDSNRQKRLALKSDLQEIALFEVR------YWSQRCKKLWLSDGDENTAYFHKDDDKYIDNFFFIIKSFEQASGLRINLSKSAVTA

Query:  LLFKWLWRFFNEHGSLWSTLVKFKYRAVRLGSIPSISRLSSARSPWFAISKLQESFIQNSAWELRDGKSILFWFDKWAGPDSLCSINNRLFHLSEEKSLL
        LL KWLWR+F+E  +LW  L++ KY+    G IPS +  SS+++PW +I    + F  N +W+L +G  I FW+  W+    L +   RLF L+ +K + 
Subjt:  LLFKWLWRFFNEHGSLWSTLVKFKYRAVRLGSIPSISRLSSARSPWFAISKLQESFIQNSAWELRDGKSILFWFDKWAGPDSLCSINNRLFHLSEEKSLL

Query:  VSDAWSPESQRWNIKPRRNLLDRELQSWAAFTSDLPRPDVSKG
        V DAW+    +W I  RR L DRE  +W      LP P  ++G
Subjt:  VSDAWSPESQRWNIKPRRNLLDRELQSWAAFTSDLPRPDVSKG

KAA0058980.1 uncharacterized protein E6C27_scaffold98G001710 [Cucumis melo var. makuwa]4.4e-3045.99Show/hide
Query:  MKVLTWNVRGLGSPSKRALIKDTIQSLCPEFVILTETKIASFSKRMIKSLWSSISINWVALEAYGSSGGILIMWDELFCSISEQVKGCFSISVKVTLSDG
        MK+L WNVRG+GS  KR  IK  I    P+FV L+ETK+ + + +++KS+WSSISI ++  +  G SGGIL+MWD+L   +   V G F+IS     +DG
Subjt:  MKVLTWNVRGLGSPSKRALIKDTIQSLCPEFVILTETKIASFSKRMIKSLWSSISINWVALEAYGSSGGILIMWDELFCSISEQVKGCFSISVKVTLSDG

Query:  FSWWLSGIYGPANRKNRKLFWRELYDIYGLCGDCWLL
        ++WW++ +Y   NR  RK FW+EL ++   CG  WLL
Subjt:  FSWWLSGIYGPANRKNRKLFWRELYDIYGLCGDCWLL

TYK11012.1 uncharacterized protein E5676_scaffold874G00540 [Cucumis melo var. makuwa]2.0e-3040.83Show/hide
Query:  SDKDDNEEDNSAVGTIDDTEEAMKEDKMEENTEEEKDES-----------------FKKKMNKWL------PFGSGELRFSMKVLTWNVRGLGSPSKRAL
        S +D +   N+ V  I +TE   +  +M+    E  + S                 ++KK +K         +GSG++   MK+LTWN RGLGSPSKRAL
Subjt:  SDKDDNEEDNSAVGTIDDTEEAMKEDKMEENTEEEKDES-----------------FKKKMNKWL------PFGSGELRFSMKVLTWNVRGLGSPSKRAL

Query:  IKDTIQSLCPEFVILTETKIASFSKRMIKSLWSSISINWVALEAYGSSGGILIMWDELFCSISEQVKGCFSISVKVTLSDGFSWWLSGIYGPANRKNRKL
        IK+ I S  P+FVILTET +   +KR+IKS W S SINW+   A GSSGGILI+WD    S+  Q +  FS+S    L++  SWWL+G+YGP  R+ R  
Subjt:  IKDTIQSLCPEFVILTETKIASFSKRMIKSLWSSISINWVALEAYGSSGGILIMWDELFCSISEQVKGCFSISVKVTLSDGFSWWLSGIYGPANRKNRKL

Query:  FWRELYDIYGLCGDCWLL
        FW +L+++  L    W L
Subjt:  FWRELYDIYGLCGDCWLL

TYK11012.1 uncharacterized protein E5676_scaffold874G00540 [Cucumis melo var. makuwa]5.6e-2532.1Show/hide
Query:  DCWL----LGAGLLDDSNRQKRLALKSDLQEIALFEVR------YWSQRCKKLWLSDGDENTAYFHKDDDKYIDNFFFIIKSFEQASGLRINLSKSAVTA
        DC+L    +   L + ++  K   LKS L  + +   R      +W   C  L LS         +     ++ N+  + KS E+  GL I+  +    A
Subjt:  DCWL----LGAGLLDDSNRQKRLALKSDLQEIALFEVR------YWSQRCKKLWLSDGDENTAYFHKDDDKYIDNFFFIIKSFEQASGLRINLSKSAVTA

Query:  LLFKWLWRFFNEHGSLWSTLVKFKYRAVRLGSIPSISRLSSARSPWFAISKLQESFIQNSAWELRDGKSILFWFDKWAGPDSLCSINNRLFHLSEEKSLL
        LL KWLWR+F+E  +LW  L++ KY+    G IPS +  SS+++PW +I    + F  N +W+L +G  I FW+  W+    L +   RLF L+ +K + 
Subjt:  LLFKWLWRFFNEHGSLWSTLVKFKYRAVRLGSIPSISRLSSARSPWFAISKLQESFIQNSAWELRDGKSILFWFDKWAGPDSLCSINNRLFHLSEEKSLL

Query:  VSDAWSPESQRWNIKPRRNLLDRELQSWAAFTSDLPRPDVSKG
        V DAW+    +W I  RR L DRE  +W      LP P  ++G
Subjt:  VSDAWSPESQRWNIKPRRNLLDRELQSWAAFTSDLPRPDVSKG

TYK11012.1 uncharacterized protein E5676_scaffold874G00540 [Cucumis melo var. makuwa]2.0e-3040.83Show/hide
Query:  SDKDDNEEDNSAVGTIDDTEEAMKEDKMEENTEEEKDES-----------------FKKKMNKWL------PFGSGELRFSMKVLTWNVRGLGSPSKRAL
        S +D +   N+ V  I +TE   +  +M+    E  + S                 ++KK +K         +GSG++   MK+LTWN RGLGSPSKRAL
Subjt:  SDKDDNEEDNSAVGTIDDTEEAMKEDKMEENTEEEKDES-----------------FKKKMNKWL------PFGSGELRFSMKVLTWNVRGLGSPSKRAL

Query:  IKDTIQSLCPEFVILTETKIASFSKRMIKSLWSSISINWVALEAYGSSGGILIMWDELFCSISEQVKGCFSISVKVTLSDGFSWWLSGIYGPANRKNRKL
        IK+ I S  P+FVILTET +   +KR+IKS W S SINW+   A GSSGGILI+WD    S+  Q +  FS+S    L++  SWWL+G+YGP  R+ R  
Subjt:  IKDTIQSLCPEFVILTETKIASFSKRMIKSLWSSISINWVALEAYGSSGGILIMWDELFCSISEQVKGCFSISVKVTLSDGFSWWLSGIYGPANRKNRKL

Query:  FWRELYDIYGLCGDCWLL
        FW +L+++  L    W L
Subjt:  FWRELYDIYGLCGDCWLL

XP_022145142.1 uncharacterized protein LOC111014657 [Momordica charantia]9.2e-4461.59Show/hide
Query:  MKVLTWNVRGLGSPSKRALIKDTIQSLCPEFVILTETKIASFSKRMIKSLWSSISINWVALEAYGSSGGILIMWDELFCSISEQVKGCFSISVKVTLSDG
        M +L WNVRGLGS SKRA IKDTI SLCP+ VIL+ETK +S + + IKSLWSSISI W +L+A G+SGGI+++WD+L  S  E + G FSISV   L+D 
Subjt:  MKVLTWNVRGLGSPSKRALIKDTIQSLCPEFVILTETKIASFSKRMIKSLWSSISINWVALEAYGSSGGILIMWDELFCSISEQVKGCFSISVKVTLSDG

Query:  FSWWLSGIYGPANRKNRKLFWRELYDIYGLCGDCWLLG
        F+WWL+G+Y P  +K RKLFW+EL+D+ GLCG  WLLG
Subjt:  FSWWLSGIYGPANRKNRKLFWRELYDIYGLCGDCWLLG

XP_022158956.1 uncharacterized protein LOC111025405 [Momordica charantia]1.3e-2949.64Show/hide
Query:  MKVLTWNVRGLGSPSKRALIKDTIQSLCPEFVILTETKIASFSKRMIKSLWSSISINWVALEAYGSSGGILIMWDELFCSISEQVKGCFSISVKVTLSDG
        MK LTWNVRGL S  K ALIK  I  L P  VIL ETK++     ++KSLWS+  INW AL+A G + GILI+W++     +E ++G FS+++   LSDG
Subjt:  MKVLTWNVRGLGSPSKRALIKDTIQSLCPEFVILTETKIASFSKRMIKSLWSSISINWVALEAYGSSGGILIMWDELFCSISEQVKGCFSISVKVTLSDG

Query:  FSWWLSGIYGPANRKNRKLFWRELYDIYGLCGDCWLL
        F +W+SGIYGP+  +   LFW+EL D+  LC + W+L
Subjt:  FSWWLSGIYGPANRKNRKLFWRELYDIYGLCGDCWLL

TrEMBL top hitse value%identityAlignment
A0A5A7UV84 Reverse transcriptase domain-containing protein9.6e-3140.83Show/hide
Query:  SDKDDNEEDNSAVGTIDDTEEAMKEDKMEENTEEEKDES-----------------FKKKMNKWL------PFGSGELRFSMKVLTWNVRGLGSPSKRAL
        S +D +   N+ V  I +TE   +  +M+    E  + S                 ++KK +K         +GSG++   MK+LTWN RGLGSPSKRAL
Subjt:  SDKDDNEEDNSAVGTIDDTEEAMKEDKMEENTEEEKDES-----------------FKKKMNKWL------PFGSGELRFSMKVLTWNVRGLGSPSKRAL

Query:  IKDTIQSLCPEFVILTETKIASFSKRMIKSLWSSISINWVALEAYGSSGGILIMWDELFCSISEQVKGCFSISVKVTLSDGFSWWLSGIYGPANRKNRKL
        IK+ I S  P+FVILTET +   +KR+IKS W S SINW+   A GSSGGILI+WD    S+  Q +  FS+S    L++  SWWL+G+YGP  R+ R  
Subjt:  IKDTIQSLCPEFVILTETKIASFSKRMIKSLWSSISINWVALEAYGSSGGILIMWDELFCSISEQVKGCFSISVKVTLSDGFSWWLSGIYGPANRKNRKL

Query:  FWRELYDIYGLCGDCWLL
        FW +L+++  L    W L
Subjt:  FWRELYDIYGLCGDCWLL

A0A5A7UV84 Reverse transcriptase domain-containing protein2.7e-2532.1Show/hide
Query:  DCWL----LGAGLLDDSNRQKRLALKSDLQEIALFEVR------YWSQRCKKLWLSDGDENTAYFHKDDDKYIDNFFFIIKSFEQASGLRINLSKSAVTA
        DC+L    +   L + ++  K   LKS L  + +   R      +W   C  L LS         +     ++ N+  + KS E+  GL I+  +    A
Subjt:  DCWL----LGAGLLDDSNRQKRLALKSDLQEIALFEVR------YWSQRCKKLWLSDGDENTAYFHKDDDKYIDNFFFIIKSFEQASGLRINLSKSAVTA

Query:  LLFKWLWRFFNEHGSLWSTLVKFKYRAVRLGSIPSISRLSSARSPWFAISKLQESFIQNSAWELRDGKSILFWFDKWAGPDSLCSINNRLFHLSEEKSLL
        LL KWLWR+F+E  +LW  L++ KY+    G IPS +  SS+++PW +I    + F  N +W+L +G  I FW+  W+    L +   RLF L+ +K + 
Subjt:  LLFKWLWRFFNEHGSLWSTLVKFKYRAVRLGSIPSISRLSSARSPWFAISKLQESFIQNSAWELRDGKSILFWFDKWAGPDSLCSINNRLFHLSEEKSLL

Query:  VSDAWSPESQRWNIKPRRNLLDRELQSWAAFTSDLPRPDVSKG
        V DAW+    +W I  RR L DRE  +W      LP P  ++G
Subjt:  VSDAWSPESQRWNIKPRRNLLDRELQSWAAFTSDLPRPDVSKG

A0A5A7UV84 Reverse transcriptase domain-containing protein9.6e-3140.83Show/hide
Query:  SDKDDNEEDNSAVGTIDDTEEAMKEDKMEENTEEEKDES-----------------FKKKMNKWL------PFGSGELRFSMKVLTWNVRGLGSPSKRAL
        S +D +   N+ V  I +TE   +  +M+    E  + S                 ++KK +K         +GSG++   MK+LTWN RGLGSPSKRAL
Subjt:  SDKDDNEEDNSAVGTIDDTEEAMKEDKMEENTEEEKDES-----------------FKKKMNKWL------PFGSGELRFSMKVLTWNVRGLGSPSKRAL

Query:  IKDTIQSLCPEFVILTETKIASFSKRMIKSLWSSISINWVALEAYGSSGGILIMWDELFCSISEQVKGCFSISVKVTLSDGFSWWLSGIYGPANRKNRKL
        IK+ I S  P+FVILTET +   +KR+IKS W S SINW+   A GSSGGILI+WD    S+  Q +  FS+S    L++  SWWL+G+YGP  R+ R  
Subjt:  IKDTIQSLCPEFVILTETKIASFSKRMIKSLWSSISINWVALEAYGSSGGILIMWDELFCSISEQVKGCFSISVKVTLSDGFSWWLSGIYGPANRKNRKL

Query:  FWRELYDIYGLCGDCWLL
        FW +L+++  L    W L
Subjt:  FWRELYDIYGLCGDCWLL

A0A5D3CI86 Reverse transcriptase domain-containing protein2.7e-2532.1Show/hide
Query:  DCWL----LGAGLLDDSNRQKRLALKSDLQEIALFEVR------YWSQRCKKLWLSDGDENTAYFHKDDDKYIDNFFFIIKSFEQASGLRINLSKSAVTA
        DC+L    +   L + ++  K   LKS L  + +   R      +W   C  L LS         +     ++ N+  + KS E+  GL I+  +    A
Subjt:  DCWL----LGAGLLDDSNRQKRLALKSDLQEIALFEVR------YWSQRCKKLWLSDGDENTAYFHKDDDKYIDNFFFIIKSFEQASGLRINLSKSAVTA

Query:  LLFKWLWRFFNEHGSLWSTLVKFKYRAVRLGSIPSISRLSSARSPWFAISKLQESFIQNSAWELRDGKSILFWFDKWAGPDSLCSINNRLFHLSEEKSLL
        LL KWLWR+F+E  +LW  L++ KY+    G IPS +  SS+++PW +I    + F  N +W+L +G  I FW+  W+    L +   RLF L+ +K + 
Subjt:  LLFKWLWRFFNEHGSLWSTLVKFKYRAVRLGSIPSISRLSSARSPWFAISKLQESFIQNSAWELRDGKSILFWFDKWAGPDSLCSINNRLFHLSEEKSLL

Query:  VSDAWSPESQRWNIKPRRNLLDRELQSWAAFTSDLPRPDVSKG
        V DAW+    +W I  RR L DRE  +W      LP P  ++G
Subjt:  VSDAWSPESQRWNIKPRRNLLDRELQSWAAFTSDLPRPDVSKG

A0A5D3CI86 Reverse transcriptase domain-containing protein4.8e-3043.48Show/hide
Query:  MKVLTWNVRGLGSPSKRALIKDTIQSLCPEFVILTETKIASFSKRMIKSLWSSISINWVALEAYGSSGGILIMWDELFCSISEQVKGCFSISVKVTLSDG
        MK+++WN+RGLGS  KR L+K+ ++ L P+ VIL ETK     ++++  +W S    WV   + G SGGI ++W+    S+ + + G FS+S+++  + G
Subjt:  MKVLTWNVRGLGSPSKRALIKDTIQSLCPEFVILTETKIASFSKRMIKSLWSSISINWVALEAYGSSGGILIMWDELFCSISEQVKGCFSISVKVTLSDG

Query:  FSWWLSGIYGPANRKNRKLFWRELYDIYGLCGDCWLLG
          WWLSGIYGP  ++ R  FW EL D+YG CGD W LG
Subjt:  FSWWLSGIYGPANRKNRKLFWRELYDIYGLCGDCWLLG

A0A6J1CVN2 uncharacterized protein LOC1110146574.4e-4461.59Show/hide
Query:  MKVLTWNVRGLGSPSKRALIKDTIQSLCPEFVILTETKIASFSKRMIKSLWSSISINWVALEAYGSSGGILIMWDELFCSISEQVKGCFSISVKVTLSDG
        M +L WNVRGLGS SKRA IKDTI SLCP+ VIL+ETK +S + + IKSLWSSISI W +L+A G+SGGI+++WD+L  S  E + G FSISV   L+D 
Subjt:  MKVLTWNVRGLGSPSKRALIKDTIQSLCPEFVILTETKIASFSKRMIKSLWSSISINWVALEAYGSSGGILIMWDELFCSISEQVKGCFSISVKVTLSDG

Query:  FSWWLSGIYGPANRKNRKLFWRELYDIYGLCGDCWLLG
        F+WWL+G+Y P  +K RKLFW+EL+D+ GLCG  WLLG
Subjt:  FSWWLSGIYGPANRKNRKLFWRELYDIYGLCGDCWLLG

A0A6J1E2G6 uncharacterized protein LOC1110254056.2e-3049.64Show/hide
Query:  MKVLTWNVRGLGSPSKRALIKDTIQSLCPEFVILTETKIASFSKRMIKSLWSSISINWVALEAYGSSGGILIMWDELFCSISEQVKGCFSISVKVTLSDG
        MK LTWNVRGL S  K ALIK  I  L P  VIL ETK++     ++KSLWS+  INW AL+A G + GILI+W++     +E ++G FS+++   LSDG
Subjt:  MKVLTWNVRGLGSPSKRALIKDTIQSLCPEFVILTETKIASFSKRMIKSLWSSISINWVALEAYGSSGGILIMWDELFCSISEQVKGCFSISVKVTLSDG

Query:  FSWWLSGIYGPANRKNRKLFWRELYDIYGLCGDCWLL
        F +W+SGIYGP+  +   LFW+EL D+  LC + W+L
Subjt:  FSWWLSGIYGPANRKNRKLFWRELYDIYGLCGDCWLL

SwissProt top hitse value%identityAlignment
P0C2F6 Putative ribonuclease H protein At1g657504.6e-0626.4Show/hide
Query:  EQASGLRINLSKSAVTALLFKWLWRFFNEHGSLWSTLVKFKYRAVRLGSIPSISRLSSARSPWFAIS-KLQESFIQNSAWELRDGKSILFWFDKWAGPDS
        ++  GL +  +KS   AL+ K  WR   E  SLW+ +++ KY    +     +    S  S W +I+  L++       W   DG+ I FW D+W     
Subjt:  EQASGLRINLSKSAVTALLFKWLWRFFNEHGSLWSTLVKFKYRAVRLGSIPSISRLSSARSPWFAIS-KLQESFIQNSAWELRDGKSILFWFDKWAGPDS

Query:  LCSINNRLFHLSEEKSLLVSDAWSP
        L  ++N     ++  +++  D W P
Subjt:  LCSINNRLFHLSEEKSLLVSDAWSP

Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAACCAACAGTAGAAGAGGGTCCATCAAAGAGTCAAGATCACAGTACAAGTAATGAAAATTTCTCCCTCAGTGTGGACTTGGGCTATTTGTCCCCAATGTCAGACAC
CATCATATCAAGCCCAGAATATACCCCTTCACCTATGCCAAAGATTTCAGCGGAGCCACCATCAGCTATTATCAATGATAGTCTTAAATCTTTGCTAGCCCAAGAGGATA
ATAGTGACAAAGATGATAATGAAGAAGACAATAGCGCAGTTGGTACAATCGACGATACCGAGGAGGCAATGAAAGAAGACAAAATGGAAGAAAACACAGAGGAGGAAAAG
GATGAATCTTTCAAGAAAAAGATGAACAAATGGCTTCCGTTCGGCTCGGGGGAACTACGTTTCTCCATGAAAGTTCTTACTTGGAATGTAAGGGGGCTAGGCTCGCCCTC
CAAAAGAGCTTTAATAAAAGACACCATTCAATCTCTTTGCCCGGAATTTGTTATCCTTACGGAAACTAAGATTGCTTCTTTTTCTAAGCGTATGATTAAATCTCTTTGGA
GCTCTATTAGCATTAACTGGGTTGCTCTTGAAGCTTATGGCTCCTCTGGTGGTATTCTTATTATGTGGGATGAGCTTTTTTGCAGTATTTCTGAACAGGTCAAAGGTTGT
TTCTCCATTTCTGTGAAAGTTACATTATCTGATGGCTTCTCGTGGTGGCTTTCTGGTATCTATGGCCCAGCTAACAGAAAAAATCGAAAACTCTTCTGGAGAGAGCTCTA
TGACATCTATGGGCTATGTGGTGACTGTTGGCTATTAGGAGCTGGTCTTTTAGACGACTCGAACAGACAGAAGCGTTTGGCCCTTAAATCGGATCTCCAGGAGATTGCTC
TCTTTGAAGTCCGGTACTGGAGTCAACGTTGCAAAAAATTATGGCTCAGTGATGGCGATGAGAATACTGCCTATTTCCACAAAGATGATGACAAATATATTGATAACTTT
TTCTTCATCATCAAATCTTTCGAACAAGCCTCGGGCCTTCGAATCAACTTATCTAAGTCTGCAGTGACAGCCCTCCTCTTTAAATGGCTATGGAGGTTTTTCAATGAACA
TGGCTCTCTTTGGAGTACCCTCGTCAAATTCAAATACAGGGCTGTCAGATTGGGAAGTATTCCATCAATTTCTAGACTTTCAAGTGCACGTTCCCCTTGGTTTGCAATTT
CAAAGCTGCAAGAATCCTTTATTCAAAATTCCGCATGGGAGCTTAGAGACGGTAAATCCATTCTTTTTTGGTTTGATAAATGGGCTGGTCCCGATTCTCTATGTTCCATC
AACAATCGGCTTTTTCATCTATCTGAGGAGAAAAGCTTATTGGTGTCTGATGCTTGGTCCCCTGAATCTCAAAGGTGGAACATTAAGCCTAGAAGAAATCTTCTTGACAG
AGAGTTGCAATCTTGGGCTGCTTTCACCTCTGATTTACCTAGGCCTGATGTTTCCAAAGGCAAAGATTTTCTCAAATAG
mRNA sequenceShow/hide mRNA sequence
ATGGAACCAACAGTAGAAGAGGGTCCATCAAAGAGTCAAGATCACAGTACAAGTAATGAAAATTTCTCCCTCAGTGTGGACTTGGGCTATTTGTCCCCAATGTCAGACAC
CATCATATCAAGCCCAGAATATACCCCTTCACCTATGCCAAAGATTTCAGCGGAGCCACCATCAGCTATTATCAATGATAGTCTTAAATCTTTGCTAGCCCAAGAGGATA
ATAGTGACAAAGATGATAATGAAGAAGACAATAGCGCAGTTGGTACAATCGACGATACCGAGGAGGCAATGAAAGAAGACAAAATGGAAGAAAACACAGAGGAGGAAAAG
GATGAATCTTTCAAGAAAAAGATGAACAAATGGCTTCCGTTCGGCTCGGGGGAACTACGTTTCTCCATGAAAGTTCTTACTTGGAATGTAAGGGGGCTAGGCTCGCCCTC
CAAAAGAGCTTTAATAAAAGACACCATTCAATCTCTTTGCCCGGAATTTGTTATCCTTACGGAAACTAAGATTGCTTCTTTTTCTAAGCGTATGATTAAATCTCTTTGGA
GCTCTATTAGCATTAACTGGGTTGCTCTTGAAGCTTATGGCTCCTCTGGTGGTATTCTTATTATGTGGGATGAGCTTTTTTGCAGTATTTCTGAACAGGTCAAAGGTTGT
TTCTCCATTTCTGTGAAAGTTACATTATCTGATGGCTTCTCGTGGTGGCTTTCTGGTATCTATGGCCCAGCTAACAGAAAAAATCGAAAACTCTTCTGGAGAGAGCTCTA
TGACATCTATGGGCTATGTGGTGACTGTTGGCTATTAGGAGCTGGTCTTTTAGACGACTCGAACAGACAGAAGCGTTTGGCCCTTAAATCGGATCTCCAGGAGATTGCTC
TCTTTGAAGTCCGGTACTGGAGTCAACGTTGCAAAAAATTATGGCTCAGTGATGGCGATGAGAATACTGCCTATTTCCACAAAGATGATGACAAATATATTGATAACTTT
TTCTTCATCATCAAATCTTTCGAACAAGCCTCGGGCCTTCGAATCAACTTATCTAAGTCTGCAGTGACAGCCCTCCTCTTTAAATGGCTATGGAGGTTTTTCAATGAACA
TGGCTCTCTTTGGAGTACCCTCGTCAAATTCAAATACAGGGCTGTCAGATTGGGAAGTATTCCATCAATTTCTAGACTTTCAAGTGCACGTTCCCCTTGGTTTGCAATTT
CAAAGCTGCAAGAATCCTTTATTCAAAATTCCGCATGGGAGCTTAGAGACGGTAAATCCATTCTTTTTTGGTTTGATAAATGGGCTGGTCCCGATTCTCTATGTTCCATC
AACAATCGGCTTTTTCATCTATCTGAGGAGAAAAGCTTATTGGTGTCTGATGCTTGGTCCCCTGAATCTCAAAGGTGGAACATTAAGCCTAGAAGAAATCTTCTTGACAG
AGAGTTGCAATCTTGGGCTGCTTTCACCTCTGATTTACCTAGGCCTGATGTTTCCAAAGGCAAAGATTTTCTCAAATAG
Protein sequenceShow/hide protein sequence
MEPTVEEGPSKSQDHSTSNENFSLSVDLGYLSPMSDTIISSPEYTPSPMPKISAEPPSAIINDSLKSLLAQEDNSDKDDNEEDNSAVGTIDDTEEAMKEDKMEENTEEEK
DESFKKKMNKWLPFGSGELRFSMKVLTWNVRGLGSPSKRALIKDTIQSLCPEFVILTETKIASFSKRMIKSLWSSISINWVALEAYGSSGGILIMWDELFCSISEQVKGC
FSISVKVTLSDGFSWWLSGIYGPANRKNRKLFWRELYDIYGLCGDCWLLGAGLLDDSNRQKRLALKSDLQEIALFEVRYWSQRCKKLWLSDGDENTAYFHKDDDKYIDNF
FFIIKSFEQASGLRINLSKSAVTALLFKWLWRFFNEHGSLWSTLVKFKYRAVRLGSIPSISRLSSARSPWFAISKLQESFIQNSAWELRDGKSILFWFDKWAGPDSLCSI
NNRLFHLSEEKSLLVSDAWSPESQRWNIKPRRNLLDRELQSWAAFTSDLPRPDVSKGKDFLK