CuGenDBv2

Spg020954 (gene) of Sponge gourd (cylindrica) v1 genome

Gene ID: Spg020954
Organism: Luffa cylindrica (Sponge gourd (cylindrica) v1)
Description: Reverse transcriptase domain-containing protein
Genome location: scaffold9:1528462..1535285
RNA-Seq Expression: Spg020954
Synteny: Spg020954
Gene Ontology terms:
GO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domains:
IPR002156 - Ribonuclease H domain
IPR012337 - Ribonuclease H-like superfamily
IPR036397 - Ribonuclease H superfamily
IPR036691 - Endonuclease/exonuclease/phosphatase superfamily
IPR044730 - Ribonuclease H-like domain, plant type
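The genome location in this record follows a `seqid:start..end` pattern. As a minimal sketch, assuming the coordinates are 1-based and inclusive (the page does not state its convention), the length of the gene span can be computed as shown here. `parse_location` is a hypothetical helper written for illustration, not part of CuGenDB.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'seqid:start..end' string into its three parts."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


seqid, start, end = parse_location("scaffold9:1528462..1535285")

# Assumption: both endpoints are included in the feature (1-based, inclusive).
span = end - start + 1
print(seqid, span)  # scaffold9 6824
```

The span covers the whole gene model on the scaffold, so it can be longer than the CDS listed under Sequences.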


Homology
GenBank top hits (e value, % identity, alignment)
TXG69190.1 hypothetical protein EZV62_004125 [Acer yangbiense] (e value: 2.9e-28; % identity: 26.45)
Query:  MQEFRDVIDLCGLIDPGYVGPEFTWCNNQVHGVIIWERLDRFLLNVAMQEKCSFFKVTHLSRIASDHRPIIAEWSFEPLNQCFITPSHTKRFEEAWCKYE
        M  F++ ++ CGL D G++GP FTW N +     I ERLDR + N    +  S F + HL    SDHRPI+ E S +          H   ++  W + +
Subjt:  MQEFRDVIDLCGLIDPGYVGPEFTWCNNQVHGVIIWERLDRFLLNVAMQEKCSFFKVTHLSRIASDHRPIIAEWSFEPLNQCFITPSHTKRFEEAWCKYE

Query:  ECRDIVKQGEAGMPSAPLLGMCGWSVSDYCERFWKGNNNDALDTQSLQKNLIICWKIWMYRNSIRHNKQQPKSEILH--QQIDRSIDELMDKEDMHLRLT
           D V+   + +P    +  C            KG     LD    +   ++ W++W  RN + + K    S+ +H    +D +   + D +       
Subjt:  ECRDIVKQGEAGMPSAPLLGMCGWSVSDYCERFWKGNNNDALDTQSLQKNLIICWKIWMYRNSIRHNKQQPKSEILH--QQIDRSIDELMDKEDMHLRLT

Query:  TQQRPCNVAKPSTPQWKPVPEGAWKLSCDASWNDKTMWGGVGWLVRDWNETILMAGYKSVERGWKISWLEAMSICEGLRALLSDSFSPVSVESDSLQVVR
        T      V +   P+WKP P G++K++ DA+ + +    G+G ++RD    ++ +  +S     +   +EA+++  G R  L     P S+ESDSL VV 
Subjt:  TQQRPCNVAKPSTPQWKPVPEGAWKLSCDASWNDKTMWGGVGWLVRDWNETILMAGYKSVERGWKISWLEAMSICEGLRALLSDSFSPVSVESDSLQVVR

Query:  LLNGEDEDFTELALFIKEAQNLISLRKVTAISHISREHNFMAHSLARRASEEDDSKIWYSEFP
        L+N  D    E+ + + +   + S    +++S + R  N +AHSLA+ +   +   +W  + P
Subjt:  LLNGEDEDFTELALFIKEAQNLISLRKVTAISHISREHNFMAHSLARRASEEDDSKIWYSEFP

XP_022143535.1 uncharacterized protein LOC111013412 [Momordica charantia] (e value: 4.6e-26; % identity: 32.88)
Query:  DALDTQSLQKNLIICWKIWMYRNSIRHNKQQPKSEILHQQIDRSI-------DELMDK---EDMHLRLTTQQRPCNVAKPSTPQWKPVPEGAWKLSCDAS
        D    +  ++++II W+IW  RN        P++  +   IDR I         L  K   +D+HL    +         +  QWKP    +WKL+ +A+
Subjt:  DALDTQSLQKNLIICWKIWMYRNSIRHNKQQPKSEILHQQIDRSI-------DELMDK---EDMHLRLTTQQRPCNVAKPSTPQWKPVPEGAWKLSCDAS

Query:  WNDKTMWGGVGWLVRDWNETILMAGYKSVERGWKISWLEAMSICEGLRALLSDSFSPVSVESDSLQVVRLLNGEDEDFTELALFIKEAQNLISLRKVTAI
        W   T  GG+GW++RD    ++ A  + +     I++LE M+ICEGLRA+  +   P+ +ESDSL+ + LL+ + +D TE+   ++E   ++   ++ ++
Subjt:  WNDKTMWGGVGWLVRDWNETILMAGYKSVERGWKISWLEAMSICEGLRALLSDSFSPVSVESDSLQVVRLLNGEDEDFTELALFIKEAQNLISLRKVTAI

Query:  SHISREHNFMAHSLARRASEED
         HISRE N +AH LARRA E D
Subjt:  SHISREHNFMAHSLARRASEED

XP_022155262.1 uncharacterized protein LOC111022403 [Momordica charantia] (e value: 1.4e-27; % identity: 34.51)
Query:  DALDTQSLQKNLIICWKIWMYRNSIRHNKQQPKSEILHQQIDRSIDELMDKEDMHLRLTTQQRPCNVAKPSTPQWKPVPEGAWKLSCDASWNDKTMWGGV
        D    + L   LI  W IW +RN +    +      + QQ+ + + E   + +  L +       +    +  +W+P P   W L+ DASW+D T  GG+
Subjt:  DALDTQSLQKNLIICWKIWMYRNSIRHNKQQPKSEILHQQIDRSIDELMDKEDMHLRLTTQQRPCNVAKPSTPQWKPVPEGAWKLSCDASWNDKTMWGGV

Query:  GWLVRDWNETILMAGYKSVERGWKISWLEAMSICEGLRALLS-DSFSPVSVESDSLQVVRLLNGEDEDFTELALFIKEAQNLISLRKVTAISHISREHNF
        GW++R W+  I++AG + VE    +  LEA +I EGLR L +     P+ +E+DS +V  LLN + ED T+    ++E  NL    ++ A + + RE N 
Subjt:  GWLVRDWNETILMAGYKSVERGWKISWLEAMSICEGLRALLS-DSFSPVSVESDSLQVVRLLNGEDEDFTELALFIKEAQNLISLRKVTAISHISREHNF

Query:  MAHSLARRASEEDDSKIWYSEFPNWL
         AHSLA+RAS   +S IW   FPNWL
Subjt:  MAHSLARRASEEDDSKIWYSEFPNWL

XP_022155286.1 uncharacterized protein LOC111022423 [Momordica charantia] (e value: 3.6e-23; % identity: 33.04)
Query:  MQEFRDVIDLCGLIDPGYVGPEFTWCNNQVHGVIIWERLDRFLLNVAMQEKCSFFKVTHLSRIASDHRPIIAEWSFEPLNQCFITPSHTK-----RFEEA
        MQ F+D +DLCGL+DPG+VG  FTWC+   +   IWERLDRFL+N A+ +     ++ HL  +ASDHRPI+AEW    L     T    K     RFEE 
Subjt:  MQEFRDVIDLCGLIDPGYVGPEFTWCNNQVHGVIIWERLDRFLLNVAMQEKCSFFKVTHLSRIASDHRPIIAEWSFEPLNQCFITPSHTK-----RFEEA

Query:  WCKYEECRDIVKQGEAGMPSAPLLGMCGWSV-SDYCERFWKGNNNDALDTQSLQKNLIICWKIWMYRNSIRHNKQQPKSEILHQQID-------------
        W  ++EC++IV++               W+V  D C   ++G  N  L+        +I W +     S+R    + + EI     D             
Subjt:  WCKYEECRDIVKQGEAGMPSAPLLGMCGWSV-SDYCERFWKGNNNDALDTQSLQKNLIICWKIWMYRNSIRHNKQQPKSEILHQQID-------------

Query:  RSIDELMDKEDMHLRLTTQQRPCN
        R +++L+++E+ + R    Q P N
Subjt:  RSIDELMDKEDMHLRLTTQQRPCN

XP_022156777.1 uncharacterized protein LOC111023608 [Momordica charantia] (e value: 7.3e-24; % identity: 31.25)
Query:  DALDTQSLQKNLIICWKIWMYRNSIRHNKQQPKSEILHQQIDRSIDELMDKEDMHLRLTTQQRPCN----VAKPSTPQWKPVPEGAWKLSCDASWNDKTM
        D    +  ++++II W+IW  RN         ++  +   IDR I     + D +L+  +  +  +    +   +  +WKP    +WKL+ DA+W   T 
Subjt:  DALDTQSLQKNLIICWKIWMYRNSIRHNKQQPKSEILHQQIDRSIDELMDKEDMHLRLTTQQRPCN----VAKPSTPQWKPVPEGAWKLSCDASWNDKTM

Query:  WGGVGWLVRDWNETILMAGYKSVERGWKISWLEAMSICEGLRALLSDSFSPVS--------VESDSLQVVRLLNGEDEDFTELALFIKEAQNLISLRKVT
         GG+GW++RD    ++ A  + +     I++LE M+ICEGLRA+  +   P+         +ESDSL+ + LL+ + +D TE+   ++E   ++   K+ 
Subjt:  WGGVGWLVRDWNETILMAGYKSVERGWKISWLEAMSICEGLRALLSDSFSPVS--------VESDSLQVVRLLNGEDEDFTELALFIKEAQNLISLRKVT

Query:  AISHISREHNFMAHSLARRASEED
        ++ HISRE N +AH LARRA E D
Subjt:  AISHISREHNFMAHSLARRASEED

TrEMBL top hits (e value, % identity, alignment)
A0A5C7IIT4 Uncharacterized protein (e value: 1.4e-28; % identity: 26.45)
Query:  MQEFRDVIDLCGLIDPGYVGPEFTWCNNQVHGVIIWERLDRFLLNVAMQEKCSFFKVTHLSRIASDHRPIIAEWSFEPLNQCFITPSHTKRFEEAWCKYE
        M  F++ ++ CGL D G++GP FTW N +     I ERLDR + N    +  S F + HL    SDHRPI+ E S +          H   ++  W + +
Subjt:  MQEFRDVIDLCGLIDPGYVGPEFTWCNNQVHGVIIWERLDRFLLNVAMQEKCSFFKVTHLSRIASDHRPIIAEWSFEPLNQCFITPSHTKRFEEAWCKYE

Query:  ECRDIVKQGEAGMPSAPLLGMCGWSVSDYCERFWKGNNNDALDTQSLQKNLIICWKIWMYRNSIRHNKQQPKSEILH--QQIDRSIDELMDKEDMHLRLT
           D V+   + +P    +  C            KG     LD    +   ++ W++W  RN + + K    S+ +H    +D +   + D +       
Subjt:  ECRDIVKQGEAGMPSAPLLGMCGWSVSDYCERFWKGNNNDALDTQSLQKNLIICWKIWMYRNSIRHNKQQPKSEILH--QQIDRSIDELMDKEDMHLRLT

Query:  TQQRPCNVAKPSTPQWKPVPEGAWKLSCDASWNDKTMWGGVGWLVRDWNETILMAGYKSVERGWKISWLEAMSICEGLRALLSDSFSPVSVESDSLQVVR
        T      V +   P+WKP P G++K++ DA+ + +    G+G ++RD    ++ +  +S     +   +EA+++  G R  L     P S+ESDSL VV 
Subjt:  TQQRPCNVAKPSTPQWKPVPEGAWKLSCDASWNDKTMWGGVGWLVRDWNETILMAGYKSVERGWKISWLEAMSICEGLRALLSDSFSPVSVESDSLQVVR

Query:  LLNGEDEDFTELALFIKEAQNLISLRKVTAISHISREHNFMAHSLARRASEEDDSKIWYSEFP
        L+N  D    E+ + + +   + S    +++S + R  N +AHSLA+ +   +   +W  + P
Subjt:  LLNGEDEDFTELALFIKEAQNLISLRKVTAISHISREHNFMAHSLARRASEEDDSKIWYSEFP

A0A6J1CP26 uncharacterized protein LOC111013412 (e value: 2.2e-26; % identity: 32.88)
Query:  DALDTQSLQKNLIICWKIWMYRNSIRHNKQQPKSEILHQQIDRSI-------DELMDK---EDMHLRLTTQQRPCNVAKPSTPQWKPVPEGAWKLSCDAS
        D    +  ++++II W+IW  RN        P++  +   IDR I         L  K   +D+HL    +         +  QWKP    +WKL+ +A+
Subjt:  DALDTQSLQKNLIICWKIWMYRNSIRHNKQQPKSEILHQQIDRSI-------DELMDK---EDMHLRLTTQQRPCNVAKPSTPQWKPVPEGAWKLSCDAS

Query:  WNDKTMWGGVGWLVRDWNETILMAGYKSVERGWKISWLEAMSICEGLRALLSDSFSPVSVESDSLQVVRLLNGEDEDFTELALFIKEAQNLISLRKVTAI
        W   T  GG+GW++RD    ++ A  + +     I++LE M+ICEGLRA+  +   P+ +ESDSL+ + LL+ + +D TE+   ++E   ++   ++ ++
Subjt:  WNDKTMWGGVGWLVRDWNETILMAGYKSVERGWKISWLEAMSICEGLRALLSDSFSPVSVESDSLQVVRLLNGEDEDFTELALFIKEAQNLISLRKVTAI

Query:  SHISREHNFMAHSLARRASEED
         HISRE N +AH LARRA E D
Subjt:  SHISREHNFMAHSLARRASEED

A0A6J1DNV9 uncharacterized protein LOC111022403 (e value: 6.9e-28; % identity: 34.51)
Query:  DALDTQSLQKNLIICWKIWMYRNSIRHNKQQPKSEILHQQIDRSIDELMDKEDMHLRLTTQQRPCNVAKPSTPQWKPVPEGAWKLSCDASWNDKTMWGGV
        D    + L   LI  W IW +RN +    +      + QQ+ + + E   + +  L +       +    +  +W+P P   W L+ DASW+D T  GG+
Subjt:  DALDTQSLQKNLIICWKIWMYRNSIRHNKQQPKSEILHQQIDRSIDELMDKEDMHLRLTTQQRPCNVAKPSTPQWKPVPEGAWKLSCDASWNDKTMWGGV

Query:  GWLVRDWNETILMAGYKSVERGWKISWLEAMSICEGLRALLS-DSFSPVSVESDSLQVVRLLNGEDEDFTELALFIKEAQNLISLRKVTAISHISREHNF
        GW++R W+  I++AG + VE    +  LEA +I EGLR L +     P+ +E+DS +V  LLN + ED T+    ++E  NL    ++ A + + RE N 
Subjt:  GWLVRDWNETILMAGYKSVERGWKISWLEAMSICEGLRALLS-DSFSPVSVESDSLQVVRLLNGEDEDFTELALFIKEAQNLISLRKVTAISHISREHNF

Query:  MAHSLARRASEEDDSKIWYSEFPNWL
         AHSLA+RAS   +S IW   FPNWL
Subjt:  MAHSLARRASEEDDSKIWYSEFPNWL

A0A6J1DRA0 uncharacterized protein LOC111022423 (e value: 1.7e-23; % identity: 33.04)
Query:  MQEFRDVIDLCGLIDPGYVGPEFTWCNNQVHGVIIWERLDRFLLNVAMQEKCSFFKVTHLSRIASDHRPIIAEWSFEPLNQCFITPSHTK-----RFEEA
        MQ F+D +DLCGL+DPG+VG  FTWC+   +   IWERLDRFL+N A+ +     ++ HL  +ASDHRPI+AEW    L     T    K     RFEE 
Subjt:  MQEFRDVIDLCGLIDPGYVGPEFTWCNNQVHGVIIWERLDRFLLNVAMQEKCSFFKVTHLSRIASDHRPIIAEWSFEPLNQCFITPSHTK-----RFEEA

Query:  WCKYEECRDIVKQGEAGMPSAPLLGMCGWSV-SDYCERFWKGNNNDALDTQSLQKNLIICWKIWMYRNSIRHNKQQPKSEILHQQID-------------
        W  ++EC++IV++               W+V  D C   ++G  N  L+        +I W +     S+R    + + EI     D             
Subjt:  WCKYEECRDIVKQGEAGMPSAPLLGMCGWSV-SDYCERFWKGNNNDALDTQSLQKNLIICWKIWMYRNSIRHNKQQPKSEILHQQID-------------

Query:  RSIDELMDKEDMHLRLTTQQRPCN
        R +++L+++E+ + R    Q P N
Subjt:  RSIDELMDKEDMHLRLTTQQRPCN

A0A6J1DSV1 uncharacterized protein LOC111023608 (e value: 3.5e-24; % identity: 31.25)
Query:  DALDTQSLQKNLIICWKIWMYRNSIRHNKQQPKSEILHQQIDRSIDELMDKEDMHLRLTTQQRPCN----VAKPSTPQWKPVPEGAWKLSCDASWNDKTM
        D    +  ++++II W+IW  RN         ++  +   IDR I     + D +L+  +  +  +    +   +  +WKP    +WKL+ DA+W   T 
Subjt:  DALDTQSLQKNLIICWKIWMYRNSIRHNKQQPKSEILHQQIDRSIDELMDKEDMHLRLTTQQRPCN----VAKPSTPQWKPVPEGAWKLSCDASWNDKTM

Query:  WGGVGWLVRDWNETILMAGYKSVERGWKISWLEAMSICEGLRALLSDSFSPVS--------VESDSLQVVRLLNGEDEDFTELALFIKEAQNLISLRKVT
         GG+GW++RD    ++ A  + +     I++LE M+ICEGLRA+  +   P+         +ESDSL+ + LL+ + +D TE+   ++E   ++   K+ 
Subjt:  WGGVGWLVRDWNETILMAGYKSVERGWKISWLEAMSICEGLRALLSDSFSPVS--------VESDSLQVVRLLNGEDEDFTELALFIKEAQNLISLRKVT

Query:  AISHISREHNFMAHSLARRASEED
        ++ HISRE N +AH LARRA E D
Subjt:  AISHISREHNFMAHSLARRASEED

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
AT1G52990.1 thioredoxin family protein (e value: 5.2e-04; % identity: 23.65)
Query:  VPEGAWKLSCDASWNDKTMWGGVGWLVRDWNETILMAGYKSVERGWKISWLEAMSICEGLRALLSDSFSPVSVESDSLQVVRLLNGEDEDFTELALFIKE
        VP    K + DAS ++  +  G+GWL+R+   T+L  G    +        E  ++   ++A  +  ++ V  E D+  V RL+N +  D   L  ++  
Subjt:  VPEGAWKLSCDASWNDKTMWGGVGWLVRDWNETILMAGYKSVERGWKISWLEAMSICEGLRALLSDSFSPVSVESDSLQVVRLLNGEDEDFTELALFIKE

Query:  AQNLISLRKVTAISHISREHNFMAHSLARRASEEDDSKIWYSEFPNWL
         ++ I     T      RE N  A +L ++A +       ++  P++L
Subjt:  AQNLISLRKVTAISHISREHNFMAHSLARRASEEDDSKIWYSEFPNWL

AT5G61090.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein (e value: 4.0e-04; % identity: 24.51)
Query:  MSICEGLRALLSDSFSPVSVESDSLQVVRLLNGEDEDFTELALFIKEAQNLISLRKVTAISHISREHNFMAHSLARRASEEDDSKIWYSEFPNWLLSLNE
        ++I +GL+ L    F  + +E+ S +++  L  +   F +    +   +++I       I HIS+E N  A  LA+R+ E+    +++   P  L+   E
Subjt:  MSICEGLRALLSDSFSPVSVESDSLQVVRLLNGEDEDFTELALFIKEAQNLISLRKVTAISHISREHNFMAHSLARRASEEDDSKIWYSEFPNWLLSLNE

Query:  AD
         D
Subjt:  AD

AT5G65005.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein (e value: 1.7e-07; % identity: 21.4)
Query:  NNNDALDTQSLQKNLIICWKIWMYRNSIRHNKQQPKSEILHQQIDRSIDELMDKEDMHLRLTTQQRPCNVAKPS-TPQWKPVPEGAWKLSCDASWNDKTM
        +N+  +D  +      + W+IW   N +  N  + K +   +       E +D    +     QQ     A PS   +W P      K + DAS +++  
Subjt:  NNNDALDTQSLQKNLIICWKIWMYRNSIRHNKQQPKSEILHQQIDRSIDELMDKEDMHLRLTTQQRPCNVAKPS-TPQWKPVPEGAWKLSCDASWNDKTM

Query:  WGGVGWLVRDWNETILMAGYKSVERGWKISWLEAMSICEGLRALLSDSFSPVSVESDSLQVVRLLNGEDEDFTELALFIKEAQNLISLRKVTAISHISRE
          G+GW++R+   T++  G    +        E  ++   ++A        V  E D+  + R++N +  +   L  F+   Q+ I   +    S   RE
Subjt:  WGGVGWLVRDWNETILMAGYKSVERGWKISWLEAMSICEGLRALLSDSFSPVSVESDSLQVVRLLNGEDEDFTELALFIKEAQNLISLRKVTAISHISRE

Query:  HNFMAHSLARRASEEDDSKIWYSEFPNWL
         N  A  LA++A +E+     +   P +L
Subjt:  HNFMAHSLARRASEEDDSKIWYSEFPNWL


Sequences
CDS sequence
ATGCAAGAATTCCGAGATGTGATTGATTTATGTGGGCTTATTGACCCTGGATATGTTGGACCAGAGTTTACTTGGTGTAATAATCAGGTTCATGGAGTGATTATCTGGGA
AAGATTAGATCGTTTTCTGTTAAATGTAGCTATGCAGGAAAAGTGTAGCTTCTTTAAAGTTACTCACCTTTCACGTATAGCTTCCGACCATAGACCGATCATTGCAGAAT
GGTCTTTTGAGCCCTTAAACCAATGCTTCATAACTCCGAGTCACACAAAGAGATTTGAAGAAGCTTGGTGCAAATATGAGGAATGTAGAGACATTGTGAAACAGGGAGAA
GCCGGAATGCCTTCAGCACCTCTTTTGGGAATGTGTGGGTGGTCTGTTTCGGATTATTGTGAGAGGTTTTGGAAGGGGAACAACAATGACGCGCTGGATACTCAAAGTTT
ACAAAAAAACCTCATTATATGCTGGAAAATATGGATGTATCGGAACTCAATCAGACACAACAAGCAGCAACCCAAATCAGAAATATTGCACCAGCAGATTGACAGATCTA
TAGATGAGCTTATGGACAAAGAAGACATGCACCTTAGATTGACGACCCAACAACGACCCTGCAACGTTGCTAAACCGAGTACTCCTCAGTGGAAGCCGGTTCCGGAAGGT
GCTTGGAAACTCAGCTGCGACGCTAGCTGGAACGACAAGACGATGTGGGGTGGGGTAGGATGGCTGGTTCGCGACTGGAATGAAACGATTCTAATGGCGGGTTACAAATC
GGTCGAGCGTGGATGGAAAATTTCGTGGTTGGAGGCGATGTCCATTTGTGAGGGGCTGCGGGCGCTTCTCTCGGATTCCTTTTCTCCCGTGAGTGTCGAAAGTGACTCCC
TACAGGTGGTACGATTGTTAAATGGTGAGGATGAAGATTTTACTGAACTGGCTCTTTTCATTAAAGAAGCCCAAAACCTCATATCCTTAAGGAAGGTGACAGCTATCTCT
CATATATCTAGAGAGCATAACTTTATGGCCCATTCTTTGGCCCGTCGGGCTAGTGAGGAAGATGATTCCAAAATCTGGTATTCTGAGTTTCCTAATTGGCTTCTTTCTTT
AAATGAGGCTGATATAGGAGGTGTAAATCACGCAAGTGGGGGTTCTTGTCCCACTAGTGAAATATCTTCCGTTGTTTTTGCTTCTTCTTTATATTCTCATGATCTGCCTA
CGAGTCGCCCTGGGAGCGATCACCCTACGAAGGGCTTGATCATGGGAGTCAGAACAACGCAAACTCCAGAAATGGATAGGGTTTTCTTAGCTCCATTTCCAACTTTGACT
TTCCCCACGGTAGCATTGTTGGAGCCGACCTATGAGATCTGA
mRNA sequence
ATGCAAGAATTCCGAGATGTGATTGATTTATGTGGGCTTATTGACCCTGGATATGTTGGACCAGAGTTTACTTGGTGTAATAATCAGGTTCATGGAGTGATTATCTGGGA
AAGATTAGATCGTTTTCTGTTAAATGTAGCTATGCAGGAAAAGTGTAGCTTCTTTAAAGTTACTCACCTTTCACGTATAGCTTCCGACCATAGACCGATCATTGCAGAAT
GGTCTTTTGAGCCCTTAAACCAATGCTTCATAACTCCGAGTCACACAAAGAGATTTGAAGAAGCTTGGTGCAAATATGAGGAATGTAGAGACATTGTGAAACAGGGAGAA
GCCGGAATGCCTTCAGCACCTCTTTTGGGAATGTGTGGGTGGTCTGTTTCGGATTATTGTGAGAGGTTTTGGAAGGGGAACAACAATGACGCGCTGGATACTCAAAGTTT
ACAAAAAAACCTCATTATATGCTGGAAAATATGGATGTATCGGAACTCAATCAGACACAACAAGCAGCAACCCAAATCAGAAATATTGCACCAGCAGATTGACAGATCTA
TAGATGAGCTTATGGACAAAGAAGACATGCACCTTAGATTGACGACCCAACAACGACCCTGCAACGTTGCTAAACCGAGTACTCCTCAGTGGAAGCCGGTTCCGGAAGGT
GCTTGGAAACTCAGCTGCGACGCTAGCTGGAACGACAAGACGATGTGGGGTGGGGTAGGATGGCTGGTTCGCGACTGGAATGAAACGATTCTAATGGCGGGTTACAAATC
GGTCGAGCGTGGATGGAAAATTTCGTGGTTGGAGGCGATGTCCATTTGTGAGGGGCTGCGGGCGCTTCTCTCGGATTCCTTTTCTCCCGTGAGTGTCGAAAGTGACTCCC
TACAGGTGGTACGATTGTTAAATGGTGAGGATGAAGATTTTACTGAACTGGCTCTTTTCATTAAAGAAGCCCAAAACCTCATATCCTTAAGGAAGGTGACAGCTATCTCT
CATATATCTAGAGAGCATAACTTTATGGCCCATTCTTTGGCCCGTCGGGCTAGTGAGGAAGATGATTCCAAAATCTGGTATTCTGAGTTTCCTAATTGGCTTCTTTCTTT
AAATGAGGCTGATATAGGAGGTGTAAATCACGCAAGTGGGGGTTCTTGTCCCACTAGTGAAATATCTTCCGTTGTTTTTGCTTCTTCTTTATATTCTCATGATCTGCCTA
CGAGTCGCCCTGGGAGCGATCACCCTACGAAGGGCTTGATCATGGGAGTCAGAACAACGCAAACTCCAGAAATGGATAGGGTTTTCTTAGCTCCATTTCCAACTTTGACT
TTCCCCACGGTAGCATTGTTGGAGCCGACCTATGAGATCTGA
Protein sequence
MQEFRDVIDLCGLIDPGYVGPEFTWCNNQVHGVIIWERLDRFLLNVAMQEKCSFFKVTHLSRIASDHRPIIAEWSFEPLNQCFITPSHTKRFEEAWCKYEECRDIVKQGE
AGMPSAPLLGMCGWSVSDYCERFWKGNNNDALDTQSLQKNLIICWKIWMYRNSIRHNKQQPKSEILHQQIDRSIDELMDKEDMHLRLTTQQRPCNVAKPSTPQWKPVPEG
AWKLSCDASWNDKTMWGGVGWLVRDWNETILMAGYKSVERGWKISWLEAMSICEGLRALLSDSFSPVSVESDSLQVVRLLNGEDEDFTELALFIKEAQNLISLRKVTAIS
HISREHNFMAHSLARRASEEDDSKIWYSEFPNWLLSLNEADIGGVNHASGGSCPTSEISSVVFASSLYSHDLPTSRPGSDHPTKGLIMGVRTTQTPEMDRVFLAPFPTLT
FPTVALLEPTYEI
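
The CDS and protein sequences in this record can be cross-checked by translating codons. The sketch below assumes the standard genetic code applies (the page does not say which translation table was used). `translate` is a hypothetical helper, and only the first 30 and last 42 nucleotides of the CDS are copied in to keep the example short.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order
# (first base varies slowest, third base fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(nucleotides: str) -> str:
    """Translate an in-frame DNA string; '*' marks a stop codon."""
    seq = nucleotides.upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))


cds_start = "ATGCAAGAATTCCGAGATGTGATTGATTTA"             # first 30 nt of the CDS
cds_end = "TTCCCCACGGTAGCATTGTTGGAGCCGACCTATGAGATCTGA"   # last 42 nt of the CDS

print(translate(cds_start))  # MQEFRDVIDL
print(translate(cds_end))    # FPTVALLEPTYEI*
```

Both fragments agree with the start and end of the protein sequence listed in the record, with the final `TGA` read as the stop codon.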