; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0039619 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0039619
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionCCHC-type domain-containing protein
Genome locationchr2:47460239..47461195
RNA-Seq ExpressionLag0039619
SyntenyLag0039619
Gene Ontology termsNA
InterPro domainsIPR026960 - Reverse transcriptase zinc-binding domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022150918.1 uncharacterized protein LOC111018954 [Momordica charantia]3.8e-3530.74Show/hide
Query:  MRWRVGNGAIIRIKDDPWIPGGGNFKSVCITENFSEAYVKELID-ETGVWKEGVIRSSFIPSEADSILDIPLGGKDARDQILWDPDPKGFFSVKSAYHLA
        +RWR+GNG  + I  D W+P     K +        + V  L+D E G W+  V+R  F P EA  IL IP+G     D+++W+ +  G +SV+S Y +A
Subjt:  MRWRVGNGAIIRIKDDPWIPGGGNFKSVCITENFSEAYVKELID-ETGVWKEGVIRSSFIPSEADSILDIPLGGKDARDQILWDPDPKGFFSVKSAYHLA

Query:  YSIDK-VDSASPSDQSKIKKFWNRFWNIKAAPKEKLCAWKAILNILPSQSNISKRGIDVNNLCFLCRKKPESAEHVIWTCSFGKAVWCHFFPILVNSPIF
           +  V + S S   +++ +WN FW +    K K+  W+  L+ LP+  N+SKRG+++ N C+ C +  E + H+ W C F +A+W +     ++  + 
Subjt:  YSIDK-VDSASPSDQSKIKKFWNRFWNIKAAPKEKLCAWKAILNILPSQSNISKRGIDVNNLCFLCRKKPESAEHVIWTCSFGKAVWCHFFPILVNSPIF

Query:  FREGWDALDRWDFLVEAIKEDNMSKAINICWNIWNQRNSVKINLGSPDFSKIIREVM
         RE  ++L + DF           +   + W +WNQRN+   N  +    KI  E++
Subjt:  FREGWDALDRWDFLVEAIKEDNMSKAINICWNIWNQRNSVKINLGSPDFSKIIREVM

XP_023899813.1 uncharacterized protein LOC112011695 [Quercus suber]5.3e-3732.56Show/hide
Query:  MRWRVGNGAIIRIKDDPWIPGGGNFKSVCITENFSEA--YVKELIDE-TGVWKEGVIRSSFIPSEADSILDIPLGGKDARDQILWDPDPKGFFSVKSAYH
        +RWRVGNGA IR+ +D W+P    +K V     F +A   V+ELI+E T  WK  V+ + F+P +AD I  IP+  +   D+++W     G F+V+SAYH
Subjt:  MRWRVGNGAIIRIKDDPWIPGGGNFKSVCITENFSEA--YVKELIDE-TGVWKEGVIRSSFIPSEADSILDIPLGGKDARDQILWDPDPKGFFSVKSAYH

Query:  LAYS-IDKVDSASPSDQSKIKKFWNRFWNIKAAPKEKLCAWKAILNILPSQSNISKRGIDVNNLCFLCRKKPESAEHVIWTCSFGKAVW-CHFFPILVNS
        LA +    +   S S+ S +K FWN+ W+I    K +  AW+   + LP++SN+ +R +   +LC  CR+ PE+  HV+W+C   K  W C    I    
Subjt:  LAYS-IDKVDSASPSDQSKIKKFWNRFWNIKAAPKEKLCAWKAILNILPSQSNISKRGIDVNNLCFLCRKKPESAEHVIWTCSFGKAVW-CHFFPILVNS

Query:  PIFFREGWDALD------RWDFLVEA-IKEDNMSKAINICWNIWNQRNSVKINLGSPDFSKIIREVMSKDRDEVYLRACGKIVPAVIRSETLSSQ-RLWS
             EG+D  +       W+ LV     ED ++ A+   W +W+ RN V+         ++ R  +       YLR    +    +  E +  Q   WS
Subjt:  PIFFREGWDALD------RWDFLVEA-IKEDNMSKAINICWNIWNQRNSVKINLGSPDFSKIIREVMSKDRDEVYLRACGKIVPAVIRSETLSSQ-RLWS

Query:  P
        P
Subjt:  P

XP_023905045.1 uncharacterized protein LOC112016795 [Quercus suber]1.1e-3433.59Show/hide
Query:  RWRVGNGAIIRIKDDPWIPGGGNFKSVCITENFSE-AYVKELIDE-TGVWKEGVIRSSFIPSEADSILDIPLGGKDARDQILWDPDPKGFFSVKSAYHLA
        RW+VG+G  I+I  D W+P    F+ +      +E A V ELIDE TG W   +++  F+P +A +IL IP   K  RD+++W   PKG F+V SAY +A
Subjt:  RWRVGNGAIIRIKDDPWIPGGGNFKSVCITENFSE-AYVKELIDE-TGVWKEGVIRSSFIPSEADSILDIPLGGKDARDQILWDPDPKGFFSVKSAYHLA

Query:  YSIDKVDS-ASPSDQSKIKKFWNRFWNIKAAPKEKLCAWKAILNILPSQSNISKRGIDVNNLCFLCRKKPESAEHVIWTCSFGKAVWCHFFPILVNSPIF
         S+ +  +    SD S   +FW + W+++   K K  AW+A  NILP+++N+  RG+  +  C  C    E++ H+ W C     VW        N  + 
Subjt:  YSIDKVDS-ASPSDQSKIKKFWNRFWNIKAAPKEKLCAWKAILNILPSQSNISKRGIDVNNLCFLCRKKPESAEHVIWTCSFGKAVWCHFFPILVNSPIF

Query:  FREGWDALDRWDFLV-EAIKEDNMSKAINICWNIWNQRNSVKINLGSPDFSKIIREVMSKDR
        +R+  D L  W  +  + + +D +   I I W +W  RN  ++  GSP  S    E++ K R
Subjt:  FREGWDALDRWDFLV-EAIKEDNMSKAINICWNIWNQRNSVKINLGSPDFSKIIREVMSKDR

XP_030936391.1 uncharacterized protein LOC115961572 [Quercus lobata]3.4e-3635.89Show/hide
Query:  MRWRVGNGAIIRIKDDPWIPGGGNFKSVCITENF---SEAYVKELID-ETGVWKEGVIRSSFIPSEADSILDIPLGGKDARDQILWDPDPKGFFSVKSAY
        ++WRVGNGA IR+ +D W+P   + K   IT      S+  V +L+D E G W+  VI + F+P EADSI  IP+  +   D+++W   P G F+V+SAY
Subjt:  MRWRVGNGAIIRIKDDPWIPGGGNFKSVCITENF---SEAYVKELID-ETGVWKEGVIRSSFIPSEADSILDIPLGGKDARDQILWDPDPKGFFSVKSAY

Query:  HLAYS-IDKVDSASPSDQSKIKKFWNRFWNIKAAPKEKLCAWKAILNILPSQSNISKRGIDVNNLCFLCRKKPESAEHVIWTCSFGK-AVWCH--FFPIL
         LA + +   +  +PSD SK++ FW R W+I    K +   W+A  N LP++ N+ +R I  +++C  C++ PES  HV+W C   + A  C    FP L
Subjt:  HLAYS-IDKVDSASPSDQSKIKKFWNRFWNIKAAPKEKLCAWKAILNILPSQSNISKRGIDVNNLCFLCRKKPESAEHVIWTCSFGK-AVWCH--FFPIL

Query:  VNSPIFFREGWDALDRWDFLV-EAIKEDNMSKAINICWNIWNQRNSVK
          S + F +       W  ++ E + E+++++A    W IW+ RN V+
Subjt:  VNSPIFFREGWDALDRWDFLV-EAIKEDNMSKAINICWNIWNQRNSVK

XP_030939658.1 uncharacterized protein LOC115964500 [Quercus lobata]6.4e-3534.68Show/hide
Query:  MRWRVGNGAIIRIKDDPWIPGGGNFKSVCITENF---SEAYVKELID-ETGVWKEGVIRSSFIPSEADSILDIPLGGKDARDQILWDPDPKGFFSVKSAY
        ++WRVGNGA IR+ +D W+P   + K   IT      ++  V +L+D E G W+  VI + F+P EADSI  IP+  +   D+++W   P G F+V+SAY
Subjt:  MRWRVGNGAIIRIKDDPWIPGGGNFKSVCITENF---SEAYVKELID-ETGVWKEGVIRSSFIPSEADSILDIPLGGKDARDQILWDPDPKGFFSVKSAY

Query:  HLAYS-IDKVDSASPSDQSKIKKFWNRFWNIKAAPKEKLCAWKAILNILPSQSNISKRGIDVNNLCFLCRKKPESAEHVIWTCSFGK-AVWCH--FFPIL
         LA + +   +  +PSD SK++ FW R W+I    K +   W A  N LP++ N+ +R I  + +C  C++ PES  HV+W C   + A  C    FP L
Subjt:  HLAYS-IDKVDSASPSDQSKIKKFWNRFWNIKAAPKEKLCAWKAILNILPSQSNISKRGIDVNNLCFLCRKKPESAEHVIWTCSFGK-AVWCH--FFPIL

Query:  VNSPIFFRE-GWDALDRWDFLVEAIKEDNMSKAINICWNIWNQRNSVK
          S + F +  W  +     + E + E+++++     W +W+ RN V+
Subjt:  VNSPIFFRE-GWDALDRWDFLVEAIKEDNMSKAINICWNIWNQRNSVK

TrEMBL top hitse value%identityAlignment
A0A2N9F2B6 Uncharacterized protein2.0e-3431.08Show/hide
Query:  MRWRVGNGAIIRIKDDPWIPGGGNFKSVCITENF-SEAYVKELID-ETGVWKEGVIRSSFIPSEADSILDIPLGGKDARDQILWDPDPKGFFSVKSAYHL
        +RWRVG G+ I I  D W+    +FK +         A V+ LID E+  WK+ +I   F+P +A+ IL IPL  +   D+++W    KG ++VKS Y L
Subjt:  MRWRVGNGAIIRIKDDPWIPGGGNFKSVCITENF-SEAYVKELID-ETGVWKEGVIRSSFIPSEADSILDIPLGGKDARDQILWDPDPKGFFSVKSAYHL

Query:  AYSIDKVDSA-SPSDQSKIKKFWNRFWNIKAAPKEKLCAWKAILNILPSQSNISKRGIDVNNLCFLCRKKPESAEHVIWTCSFGKAVWCHFFPILVNSPI
          S  +   A + S  +   KFWN  W ++ APK +L  WKA  +ILP+Q+ +  R I  +  C  C ++PE+ +HV+W C F + VW  +     + P+
Subjt:  AYSIDKVDSA-SPSDQSKIKKFWNRFWNIKAAPKEKLCAWKAILNILPSQSNISKRGIDVNNLCFLCRKKPESAEHVIWTCSFGKAVWCHFFPILVNSPI

Query:  FFREGWDALDRWDFLVEAIKEDNMSKAINICWNIWNQRNSVKINLGSPDFSKIIREVMSKDRDEVYLRACGKIVPAVIRSETLSSQRLWSPLPAVR
         F       D  D     ++  ++       W++WN RN +  +       KI+         E+  RA G  +  +   E L S+   S +PAV+
Subjt:  FFREGWDALDRWDFLVEAIKEDNMSKAINICWNIWNQRNSVKINLGSPDFSKIIREVMSKDRDEVYLRACGKIVPAVIRSETLSSQRLWSPLPAVR

A0A2N9I509 Uncharacterized protein4.5e-3433.74Show/hide
Query:  MRWRVGNGAIIRIKDDPWIPGGGNFKSVCITENFSE-AYVKELIDE-TGVWKEGVIRSSFIPSEADSILDIPLGGKDARDQILWDPDPKGFFSVKSAYHL
        +RWRVGNG  I++  D W+P    FK +      S+ A V +LI+  T  WK  V+  SF P +A+ I  IPL  +   D ++W     G FSV+SAYH+
Subjt:  MRWRVGNGAIIRIKDDPWIPGGGNFKSVCITENFSE-AYVKELIDE-TGVWKEGVIRSSFIPSEADSILDIPLGGKDARDQILWDPDPKGFFSVKSAYHL

Query:  AYSIDKVDSASPSDQSKIK-KFWNRFWNIKAAPKEKLCAWKAILNILPSQSNISKRGIDVNNLCFLCRKKPESAEHVIWTCSFGKAVWCHFFPILVNSPI
          S    D  S S    ++ +FW+  W+++  PK KL  WKA  NI+P+Q+ +  +G+  +  C  C ++PE+  H++W C F + VW         S +
Subjt:  AYSIDKVDSASPSDQSKIK-KFWNRFWNIKAAPKEKLCAWKAILNILPSQSNISKRGIDVNNLCFLCRKKPESAEHVIWTCSFGKAVWCHFFPILVNSPI

Query:  FFREGWDALDRWDFLVEAIKEDNMSKAINI----CWNIWNQRN
              D    +  +VEA      S A+ I     W +WN RN
Subjt:  FFREGWDALDRWDFLVEAIKEDNMSKAINI----CWNIWNQRN

A0A2N9IMU5 Uncharacterized protein9.1e-3534.43Show/hide
Query:  MRWRVGNGAIIRIKDDPWIPGGGNFKSVC-ITENFSEAYVKELIDETGV-WKEGVIRSSFIPSEADSILDIPLGGKDARDQILWDPDPKGFFSVKSAYHL
        +RWRVGNG+ I+I  D W+P    F+ +  I ++ SEA V  LID   + W    +   F+P + + I  IPL  +  RD+++W     G FSVKSAY+L
Subjt:  MRWRVGNGAIIRIKDDPWIPGGGNFKSVC-ITENFSEAYVKELIDETGV-WKEGVIRSSFIPSEADSILDIPLGGKDARDQILWDPDPKGFFSVKSAYHL

Query:  AYSIDKVDSASPSD-QSKIKKFWNRFWNIKAAPKEKLCAWKAILNILPSQSNISKRGIDVNNLCFLCRKKPESAEHVIWTCSFGKAVWCHFFPILVNSPI
               DS S S+ +S ++  W+  W+ +  PK +L  W+A L+ILP+++ +  +G+  +  C  C + PE+A HV+W C F + +W    P+++ S  
Subjt:  AYSIDKVDSASPSD-QSKIKKFWNRFWNIKAAPKEKLCAWKAILNILPSQSNISKRGIDVNNLCFLCRKKPESAEHVIWTCSFGKAVWCHFFPILVNSPI

Query:  FFREGWDALDRWDFLVEAIKEDNMSKA-----INICWNIWNQRN
               +++  DF++  I  DN+S+A       I W IWN RN
Subjt:  FFREGWDALDRWDFLVEAIKEDNMSKA-----INICWNIWNQRN

A0A6J1DAR4 uncharacterized protein LOC1110189541.8e-3530.74Show/hide
Query:  MRWRVGNGAIIRIKDDPWIPGGGNFKSVCITENFSEAYVKELID-ETGVWKEGVIRSSFIPSEADSILDIPLGGKDARDQILWDPDPKGFFSVKSAYHLA
        +RWR+GNG  + I  D W+P     K +        + V  L+D E G W+  V+R  F P EA  IL IP+G     D+++W+ +  G +SV+S Y +A
Subjt:  MRWRVGNGAIIRIKDDPWIPGGGNFKSVCITENFSEAYVKELID-ETGVWKEGVIRSSFIPSEADSILDIPLGGKDARDQILWDPDPKGFFSVKSAYHLA

Query:  YSIDK-VDSASPSDQSKIKKFWNRFWNIKAAPKEKLCAWKAILNILPSQSNISKRGIDVNNLCFLCRKKPESAEHVIWTCSFGKAVWCHFFPILVNSPIF
           +  V + S S   +++ +WN FW +    K K+  W+  L+ LP+  N+SKRG+++ N C+ C +  E + H+ W C F +A+W +     ++  + 
Subjt:  YSIDK-VDSASPSDQSKIKKFWNRFWNIKAAPKEKLCAWKAILNILPSQSNISKRGIDVNNLCFLCRKKPESAEHVIWTCSFGKAVWCHFFPILVNSPIF

Query:  FREGWDALDRWDFLVEAIKEDNMSKAINICWNIWNQRNSVKINLGSPDFSKIIREVM
         RE  ++L + DF           +   + W +WNQRN+   N  +    KI  E++
Subjt:  FREGWDALDRWDFLVEAIKEDNMSKAINICWNIWNQRNSVKINLGSPDFSKIIREVM

A0A7N2LEC9 zf-RVT domain-containing protein2.6e-3433.33Show/hide
Query:  MRWRVGNGAIIRIKDDPWIPGGGNFKSVC-ITENFSEAYVKELID-ETGVWKEGVIRSSFIPSEADSILDIPLGGKDARDQILWDPDPKGFFSVKSAYHL
        M WR+GNG  +RIK+D W+    N  ++  +T   ++  V  LI+ E G+WK  V+   F+P EA  IL IPL  +   D I W   P G FS KSAY +
Subjt:  MRWRVGNGAIIRIKDDPWIPGGGNFKSVC-ITENFSEAYVKELID-ETGVWKEGVIRSSFIPSEADSILDIPLGGKDARDQILWDPDPKGFFSVKSAYHL

Query:  AYSID---KVDSASPSDQSKIKKFWNRFWNIKAAPKEKLCAWKAILNILPSQSNISKRGIDVNNLCFLCRKKPESAEHVIWTCSFGKAVWCHFFPILVNS
          S+D   +V S+SP  Q    KFW   W+++   K K  AW+A  N LP+  N+ +R I  ++LC  C  +PE   H +W CS  + VW      L  +
Subjt:  AYSID---KVDSASPSDQSKIKKFWNRFWNIKAAPKEKLCAWKAILNILPSQSNISKRGIDVNNLCFLCRKKPESAEHVIWTCSFGKAVWCHFFPILVNS

Query:  PIFFREGWDALDRWDFLVEA---IKEDNMSKA-INICWNIWNQRNSVKINLGSPDFSKIIREVMSKDRDEVYLRACGKIVPAVIRSETLSSQRLWSP
                 +L  ++ L++    +K+D   +  I I W IWN+RN++K    +   + II    S  ++ +  +      P+V    T+SSQ  W P
Subjt:  PIFFREGWDALDRWDFLVEA---IKEDNMSKA-INICWNIWNQRNSVKINLGSPDFSKIIREVMSKDRDEVYLRACGKIVPAVIRSETLSSQRLWSP

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G09510.1 Ribonuclease H-like superfamily protein1.4e-1621.91Show/hide
Query:  RWRVGNGAIIRIKDDPWIPGGGNFKSVCITENFSEAYVKELIDETG---VWKEGVIRSSFIPSEADSILDIPLGGKDARDQILWDPDPKGFFSVKSAYHL
        R  +G+G  IRI  D  I      + +   E + E  +  L +  G    W +  I      S+   I  I L      D+I+W+ +  G ++V+S Y L
Subjt:  RWRVGNGAIIRIKDDPWIPGGGNFKSVCITENFSEAYVKELIDETG---VWKEGVIRSSFIPSEADSILDIPLGGKDARDQILWDPDPKGFFSVKSAYHL

Query:  AYSIDKVDSASPSDQSKIKKFWNRFWNIKAAPKEKLCAWKAILNILPSQSNISKRGIDVNNLCFLCRKKPESAEHVIWTCSFGKAVWCHFFPILVNSPIF
               +  + +          R WN+   PK K   W+A+   L +   ++ RG+ ++  C  C ++ ES  H ++TC F    W      L+ + + 
Subjt:  AYSIDKVDSASPSDQSKIKKFWNRFWNIKAAPKEKLCAWKAILNILPSQSNISKRGIDVNNLCFLCRKKPESAEHVIWTCSFGKAVWCHFFPILVNSPIF

Query:  FREGWDALDRWDFLVEAIKEDNMSK-----AINICWNIWNQRNSVKINLGSPDFSKIIREVMSKDRDEVYLRACGKIVPAVIR
          +  + +     ++  +++  MS       + + W IW  RN+V  N      SK +    ++  D +      K  P+  R
Subjt:  FREGWDALDRWDFLVEAIKEDNMSK-----AINICWNIWNQRNSVKINLGSPDFSKIIREVMSKDRDEVYLRACGKIVPAVIR

AT3G25720.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein1.9e-0528.04Show/hide
Query:  GIDVNNLCFLCRKKPESAEHVIWTCSFGKAVWCHFFPIL-VNSPIFFREGWDALDRWDFLVEAIKEDNMSKAI--NICWNIWNQRNSVKINLGSPDFSKI
        G+ V+ LC LC   PE+ +H++  CSF KA+W      L ++S IF    W  L  W           + K +  ++ +  W QRN++  N    D + +
Subjt:  GIDVNNLCFLCRKKPESAEHVIWTCSFGKAVWCHFFPIL-VNSPIFFREGWDALDRWDFLVEAIKEDNMSKAI--NICWNIWNQRNSVKINLGSPDFSKI

Query:  IREVMSK
         + +  K
Subjt:  IREVMSK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGATGGAGGGTGGGTAATGGTGCTATCATTAGAATCAAGGATGACCCGTGGATTCCGGGGGGTGGGAATTTCAAGTCGGTCTGTATTACAGAAAATTTCTCTGAAGC
GTATGTAAAAGAGCTTATTGACGAGACAGGGGTTTGGAAGGAGGGGGTCATCCGTTCTTCTTTTATCCCTTCCGAAGCTGACTCTATCCTGGACATCCCTTTGGGGGGGA
AAGATGCGAGGGATCAAATTTTGTGGGATCCGGACCCGAAGGGATTTTTTTCTGTAAAAAGTGCATACCACCTTGCTTATAGCATTGACAAGGTGGATTCTGCTTCCCCT
TCAGACCAGTCGAAGATAAAGAAATTCTGGAATCGATTTTGGAATATTAAGGCAGCTCCAAAGGAAAAGCTATGCGCTTGGAAGGCTATTCTTAATATCTTACCTTCCCA
ATCTAATATTAGCAAAAGAGGGATTGATGTTAACAATTTGTGTTTTCTGTGCAGGAAGAAACCTGAATCTGCGGAGCACGTTATCTGGACCTGTAGCTTCGGTAAGGCTG
TGTGGTGCCACTTCTTCCCCATCCTTGTTAATTCTCCTATTTTTTTCAGGGAAGGTTGGGATGCGTTGGATAGATGGGATTTTTTGGTCGAGGCTATCAAAGAAGATAAT
ATGTCAAAAGCTATCAATATTTGTTGGAACATTTGGAATCAGCGCAATAGCGTCAAAATCAACTTAGGCAGTCCTGATTTCTCAAAGATTATCAGAGAAGTCATGAGCAA
GGATAGAGATGAAGTTTACCTTAGGGCCTGTGGGAAGATTGTTCCGGCAGTGATCAGATCAGAGACCCTCTCGAGTCAAAGGCTCTGGTCCCCTCTCCCTGCTGTTCGTG
GAAGATCAACACAGACGCTTCGTGGCTGGAAAAGGAAAGGCGTGGTAGGGTTGGATGGATCGCTTGTGACTCCTTAG
mRNA sequenceShow/hide mRNA sequence
ATGAGATGGAGGGTGGGTAATGGTGCTATCATTAGAATCAAGGATGACCCGTGGATTCCGGGGGGTGGGAATTTCAAGTCGGTCTGTATTACAGAAAATTTCTCTGAAGC
GTATGTAAAAGAGCTTATTGACGAGACAGGGGTTTGGAAGGAGGGGGTCATCCGTTCTTCTTTTATCCCTTCCGAAGCTGACTCTATCCTGGACATCCCTTTGGGGGGGA
AAGATGCGAGGGATCAAATTTTGTGGGATCCGGACCCGAAGGGATTTTTTTCTGTAAAAAGTGCATACCACCTTGCTTATAGCATTGACAAGGTGGATTCTGCTTCCCCT
TCAGACCAGTCGAAGATAAAGAAATTCTGGAATCGATTTTGGAATATTAAGGCAGCTCCAAAGGAAAAGCTATGCGCTTGGAAGGCTATTCTTAATATCTTACCTTCCCA
ATCTAATATTAGCAAAAGAGGGATTGATGTTAACAATTTGTGTTTTCTGTGCAGGAAGAAACCTGAATCTGCGGAGCACGTTATCTGGACCTGTAGCTTCGGTAAGGCTG
TGTGGTGCCACTTCTTCCCCATCCTTGTTAATTCTCCTATTTTTTTCAGGGAAGGTTGGGATGCGTTGGATAGATGGGATTTTTTGGTCGAGGCTATCAAAGAAGATAAT
ATGTCAAAAGCTATCAATATTTGTTGGAACATTTGGAATCAGCGCAATAGCGTCAAAATCAACTTAGGCAGTCCTGATTTCTCAAAGATTATCAGAGAAGTCATGAGCAA
GGATAGAGATGAAGTTTACCTTAGGGCCTGTGGGAAGATTGTTCCGGCAGTGATCAGATCAGAGACCCTCTCGAGTCAAAGGCTCTGGTCCCCTCTCCCTGCTGTTCGTG
GAAGATCAACACAGACGCTTCGTGGCTGGAAAAGGAAAGGCGTGGTAGGGTTGGATGGATCGCTTGTGACTCCTTAG
Protein sequenceShow/hide protein sequence
MRWRVGNGAIIRIKDDPWIPGGGNFKSVCITENFSEAYVKELIDETGVWKEGVIRSSFIPSEADSILDIPLGGKDARDQILWDPDPKGFFSVKSAYHLAYSIDKVDSASP
SDQSKIKKFWNRFWNIKAAPKEKLCAWKAILNILPSQSNISKRGIDVNNLCFLCRKKPESAEHVIWTCSFGKAVWCHFFPILVNSPIFFREGWDALDRWDFLVEAIKEDN
MSKAINICWNIWNQRNSVKINLGSPDFSKIIREVMSKDRDEVYLRACGKIVPAVIRSETLSSQRLWSPLPAVRGRSTQTLRGWKRKGVVGLDGSLVTP