CuGenDBv2

Spg019218 (gene) of Sponge gourd (cylindrica) v1 genome

Gene ID: Spg019218
Organism: Luffa cylindrica (Sponge gourd (cylindrica) v1)
Description: Retrotrans_gag domain-containing protein
Genome location: scaffold1:47374457..47390553
RNA-Seq Expression: Spg019218
Synteny: Spg019218
Gene Ontology terms:
GO:0006807 - nitrogen compound metabolic process (biological process)
GO:0043170 - macromolecule metabolic process (biological process)
GO:0044238 - primary metabolic process (biological process)
GO:0003824 - catalytic activity (molecular function)
InterPro domains: NA
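
The genome location in this record follows a `scaffold:start..end` pattern. As a minimal sketch, assuming the coordinates are 1-based and inclusive (the page does not state this), the span of the gene on the scaffold can be derived as follows. `parse_location` is a hypothetical helper written for illustration, not part of CuGenDB.

```python
import re

def parse_location(location):
    """Split 'scaffold:start..end' into (scaffold, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location: {location!r}")
    scaffold, start, end = match.groups()
    return scaffold, int(start), int(end)

scaffold, start, end = parse_location("scaffold1:47374457..47390553")
# Assumption: 1-based inclusive coordinates, so the span is end - start + 1.
span = end - start + 1
print(scaffold, start, end, span)
```

Under that assumption the gene spans 16,097 bp on scaffold1.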


Homology
GenBank top hits (e value, % identity, alignment)
KAA0051980.1 DNA/RNA polymerases superfamily protein [Cucumis melo var. makuwa] (e value: 1.8e-13; % identity: 42.54)
Query:  LFGLSYTEGTKRPRGNRSGLQTRELAKT-----------------VEIELPVPDTLPTSAESSRTSSSSMLQ----------------------------
        LFGL Y EGTKRPRGNR+ LQT EL KT                 VEIELPVPDTLPTSAESS ++SS+ L+                            
Subjt:  LFGLSYTEGTKRPRGNRSGLQTRELAKT-----------------VEIELPVPDTLPTSAESSRTSSSSMLQ----------------------------

Query:  -EGQRVSLEEDDVRWLHAIFRTELPGGPGGDIEP
          G    +  D+V WLHA+F  +  GGPGG + P
Subjt:  -EGQRVSLEEDDVRWLHAIFRTELPGGPGGDIEP

KAA0052218.1 uncharacterized protein E6C27_scaffold207G00290 [Cucumis melo var. makuwa] (e value: 6.5e-16; % identity: 39.87)
Query:  IGKAHAKPAIAESDFNLSTSVFLFGLSYTEGTK----------------------RPRGNRSGLQTRELAKTVEIELPVPDTLPTSAESSRTSSSSMLQ-
        I K +AKPAIAES  N    + LFGLSY E  +                       PRGNR  L T +L KTVEIELPVPDTLPTSAESS+++SS+ L+ 
Subjt:  IGKAHAKPAIAESDFNLSTSVFLFGLSYTEGTK----------------------RPRGNRSGLQTRELAKTVEIELPVPDTLPTSAESSRTSSSSMLQ-

Query:  -------------------------EGQRVSLEEDDVRWLHAIFRTELPGGPGGDIEP
                                 EG    +  DD+ WLHA+F  +  G PGG + P
Subjt:  -------------------------EGQRVSLEEDDVRWLHAIFRTELPGGPGGDIEP

PON35554.1 hypothetical protein PanWU01x14_335450, partial [Parasponia andersonii] (e value: 9.1e-18; % identity: 30)
Query:  MRKRDFLNEKGF-SNRARTLSE--FVTKVISQYKWQEFCAHPQEVVVPLVREFYAGLREERMSMAVVRGKMVSFSSVNINRVYRIKAPLHPRGNNAIKNP
        ++ R    EKGF  + + T+ +  F+ +VI+Q+ W++FCAHP++ +VPLVREFYA L +   +   VRG  VS+S   IN V+ +  P+    +  I+N 
Subjt:  MRKRDFLNEKGF-SNRARTLSE--FVTKVISQYKWQEFCAHPQEVVVPLVREFYAGLREERMSMAVVRGKMVSFSSVNINRVYRIKAPLHPRGNNAIKNP

Query:  SAKQMKEALKMVANKGVQWKESQTKVKT-----LVPESAVWLHFLKN----------------------------------REEILACGKKKVGKLFFGS
        +   +   L+ VA  G +W  S     T     L P + VW HFLK+                                    EI AC  +K G LFF S
Subjt:  SAKQMKEALKMVANKGVQWKESQTKVKT-----LVPESAVWLHFLKN----------------------------------REEILACGKKKVGKLFFGS

Query:  LITQLCQRVKIVPGKDEERHFFRPTIDLSLIGKLQQNNAQRKDKASTSQATPPSGLNPAS
        LIT+LC+  +     +EE+      ID   + ++ Q       +  T     PS   PA+
Subjt:  LITQLCQRVKIVPGKDEERHFFRPTIDLSLIGKLQQNNAQRKDKASTSQATPPSGLNPAS

PON46472.1 hypothetical protein PanWU01x14_251180, partial [Parasponia andersonii] (e value: 3.6e-22; % identity: 28.61)
Query:  MRKRDFLNEKGF-SNRARTLSE--FVTKVISQYKWQEFCAHPQEVVVPLVREFYAGLREERMSMAVVRGKMVSFSSVNINRVYRIKAPLHPRGNNAIKNP
        ++ R    EKGF  + + T+ +  F+ +VI+Q+ W++FCAHP++ +VPLVREFYA L +   +   VRG  VS+S   IN V+ +  P+    +  I+N 
Subjt:  MRKRDFLNEKGF-SNRARTLSE--FVTKVISQYKWQEFCAHPQEVVVPLVREFYAGLREERMSMAVVRGKMVSFSSVNINRVYRIKAPLHPRGNNAIKNP

Query:  SAKQMKEALKMVANKGVQWKESQTKVKT-----LVPESAVWLHFLKNR----------------------------------EEILACGKKKVGKLFFGS
        + + +   L+ VA  G +W  S     T     L P + VW HFLK+R                                   EI AC  +K G LFF S
Subjt:  SAKQMKEALKMVANKGVQWKESQTKVKT-----LVPESAVWLHFLKNR----------------------------------EEILACGKKKVGKLFFGS

Query:  LITQLCQRVKIVPGKDEERHFFRPTIDLSLIGKLQQNNAQRKDKASTSQATPPSGLNPASPAQHTPFSGPSPSSEAL--AISYQQLDQ---------IRD
        LIT+LC+  +     +EE+      ID   + ++ Q       +  T     PS   PA+ + +          +AL   +S Q++ Q            
Subjt:  LITQLCQRVKIVPGKDEERHFFRPTIDLSLIGKLQQNNAQRKDKASTSQATPPSGLNPASPAQHTPFSGPSPSSEAL--AISYQQLDQ---------IRD

Query:  NLRTYWAYAKERDEAIREFYLSISPSIALVFPDFPQTLLP--QEEKDSDDDEEDENEDEE
          + +WAY+KERD A+++   +        FP FPQ +L     E +++ D++  NE  E
Subjt:  NLRTYWAYAKERDEAIREFYLSISPSIALVFPDFPQTLLP--QEEKDSDDDEEDENEDEE

PON70375.1 hypothetical protein PanWU01x14_080440 [Parasponia andersonii] (e value: 6.7e-13; % identity: 27.23)
Query:  FPEVCKDVNFQERMEIMRKRDFLNEKGF---SNRARTLSEFVTKVISQYKWQEFCAHPQEVVVPLVREFYAGLREERMSMAVVRGKMVSFSSVNINRVYR
        F     ++ ++E ++    R    EK F   +++      F+  VI Q+ WQ FCAHP++ +VPLVREFY  +         +RG  V  S   IN ++ 
Subjt:  FPEVCKDVNFQERMEIMRKRDFLNEKGF---SNRARTLSEFVTKVISQYKWQEFCAHPQEVVVPLVREFYAGLREERMSMAVVRGKMVSFSSVNINRVYR

Query:  IKAPLHPRGNNAIKNPSAKQMKEALKMVANKGVQWKESQTKVKT-----LVPESAVWLHFLKNR----------------------------------EE
        +  P+    +  +++ +  ++   L+ VA  G +W  S     T     L P + VW HFLK+R                                   E
Subjt:  IKAPLHPRGNNAIKNPSAKQMKEALKMVANKGVQWKESQTKVKT-----LVPESAVWLHFLKNR----------------------------------EE

Query:  ILACGKKKVGKLFFGSLITQLCQRVKIVPGKDEER
        I AC  +K G LFF SLIT +C+  +     +EE+
Subjt:  ILACGKKKVGKLFFGSLITQLCQRVKIVPGKDEER

TrEMBL top hits (e value, % identity, alignment)
A0A2P5AGA5 Uncharacterized protein (Fragment) (e value: 4.4e-18; % identity: 30)
Query:  MRKRDFLNEKGF-SNRARTLSE--FVTKVISQYKWQEFCAHPQEVVVPLVREFYAGLREERMSMAVVRGKMVSFSSVNINRVYRIKAPLHPRGNNAIKNP
        ++ R    EKGF  + + T+ +  F+ +VI+Q+ W++FCAHP++ +VPLVREFYA L +   +   VRG  VS+S   IN V+ +  P+    +  I+N 
Subjt:  MRKRDFLNEKGF-SNRARTLSE--FVTKVISQYKWQEFCAHPQEVVVPLVREFYAGLREERMSMAVVRGKMVSFSSVNINRVYRIKAPLHPRGNNAIKNP

Query:  SAKQMKEALKMVANKGVQWKESQTKVKT-----LVPESAVWLHFLKN----------------------------------REEILACGKKKVGKLFFGS
        +   +   L+ VA  G +W  S     T     L P + VW HFLK+                                    EI AC  +K G LFF S
Subjt:  SAKQMKEALKMVANKGVQWKESQTKVKT-----LVPESAVWLHFLKN----------------------------------REEILACGKKKVGKLFFGS

Query:  LITQLCQRVKIVPGKDEERHFFRPTIDLSLIGKLQQNNAQRKDKASTSQATPPSGLNPAS
        LIT+LC+  +     +EE+      ID   + ++ Q       +  T     PS   PA+
Subjt:  LITQLCQRVKIVPGKDEERHFFRPTIDLSLIGKLQQNNAQRKDKASTSQATPPSGLNPAS

A0A2P5BCG4 Uncharacterized protein (Fragment) (e value: 1.7e-22; % identity: 28.61)
Query:  MRKRDFLNEKGF-SNRARTLSE--FVTKVISQYKWQEFCAHPQEVVVPLVREFYAGLREERMSMAVVRGKMVSFSSVNINRVYRIKAPLHPRGNNAIKNP
        ++ R    EKGF  + + T+ +  F+ +VI+Q+ W++FCAHP++ +VPLVREFYA L +   +   VRG  VS+S   IN V+ +  P+    +  I+N 
Subjt:  MRKRDFLNEKGF-SNRARTLSE--FVTKVISQYKWQEFCAHPQEVVVPLVREFYAGLREERMSMAVVRGKMVSFSSVNINRVYRIKAPLHPRGNNAIKNP

Query:  SAKQMKEALKMVANKGVQWKESQTKVKT-----LVPESAVWLHFLKNR----------------------------------EEILACGKKKVGKLFFGS
        + + +   L+ VA  G +W  S     T     L P + VW HFLK+R                                   EI AC  +K G LFF S
Subjt:  SAKQMKEALKMVANKGVQWKESQTKVKT-----LVPESAVWLHFLKNR----------------------------------EEILACGKKKVGKLFFGS

Query:  LITQLCQRVKIVPGKDEERHFFRPTIDLSLIGKLQQNNAQRKDKASTSQATPPSGLNPASPAQHTPFSGPSPSSEAL--AISYQQLDQ---------IRD
        LIT+LC+  +     +EE+      ID   + ++ Q       +  T     PS   PA+ + +          +AL   +S Q++ Q            
Subjt:  LITQLCQRVKIVPGKDEERHFFRPTIDLSLIGKLQQNNAQRKDKASTSQATPPSGLNPASPAQHTPFSGPSPSSEAL--AISYQQLDQ---------IRD

Query:  NLRTYWAYAKERDEAIREFYLSISPSIALVFPDFPQTLLP--QEEKDSDDDEEDENEDEE
          + +WAY+KERD A+++   +        FP FPQ +L     E +++ D++  NE  E
Subjt:  NLRTYWAYAKERDEAIREFYLSISPSIALVFPDFPQTLLP--QEEKDSDDDEEDENEDEE

A0A2P5DXM3 Uncharacterized protein (e value: 3.3e-13; % identity: 27.72)
Query:  VPLVREFYAGLREERMSMAVVRGKMVSFSSVNINRVYRIKAPLHPRGNNAIKNPSAKQMKEALKMVANKGVQWKESQTKVKT-----LVPESAVWLHFLK
        +PLVREFYA L +   +   VRG  VS+S   IN V+ +  P+    +  I+N +  ++   L+ VA  G +W  S     T     L P + VW HFLK
Subjt:  VPLVREFYAGLREERMSMAVVRGKMVSFSSVNINRVYRIKAPLHPRGNNAIKNPSAKQMKEALKMVANKGVQWKESQTKVKT-----LVPESAVWLHFLK

Query:  NR----------------------------------EEILACGKKKVGKLFFGSLITQLCQRVKIVPGKDEERHFFRPTIDLSLIGKLQQNNAQRKDKAS
        +R                                   EI AC  +K G LFF SLIT+LC+    +   +EE+      ID   + ++ Q       +  
Subjt:  NR----------------------------------EEILACGKKKVGKLFFGSLITQLCQRVKIVPGKDEERHFFRPTIDLSLIGKLQQNNAQRKDKAS

Query:  TSQATPPSGLNPASPAQHTPFSGPSPSSEALAISYQQLDQIRDNLRTYWAYAKERDEAIREFYLSISPSIALVFPDFPQTLLP--QEEKDSDDDEEDENE
        T     PS   PA+ +            +AL     Q +      + +WAY+KERD A+++   +        FP FPQ +L     E +++ D++  NE
Subjt:  TSQATPPSGLNPASPAQHTPFSGPSPSSEALAISYQQLDQIRDNLRTYWAYAKERDEAIREFYLSISPSIALVFPDFPQTLLP--QEEKDSDDDEEDENE

Query:  DEE
          E
Subjt:  DEE

A0A5A7U9X4 DNA/RNA polymerases superfamily protein (e value: 8.6e-14; % identity: 42.54)
Query:  LFGLSYTEGTKRPRGNRSGLQTRELAKT-----------------VEIELPVPDTLPTSAESSRTSSSSMLQ----------------------------
        LFGL Y EGTKRPRGNR+ LQT EL KT                 VEIELPVPDTLPTSAESS ++SS+ L+                            
Subjt:  LFGLSYTEGTKRPRGNRSGLQTRELAKT-----------------VEIELPVPDTLPTSAESSRTSSSSMLQ----------------------------

Query:  -EGQRVSLEEDDVRWLHAIFRTELPGGPGGDIEP
          G    +  D+V WLHA+F  +  GGPGG + P
Subjt:  -EGQRVSLEEDDVRWLHAIFRTELPGGPGGDIEP

A0A5A7UAH6 CCHC-type domain-containing protein (e value: 3.1e-16; % identity: 39.87)
Query:  IGKAHAKPAIAESDFNLSTSVFLFGLSYTEGTK----------------------RPRGNRSGLQTRELAKTVEIELPVPDTLPTSAESSRTSSSSMLQ-
        I K +AKPAIAES  N    + LFGLSY E  +                       PRGNR  L T +L KTVEIELPVPDTLPTSAESS+++SS+ L+ 
Subjt:  IGKAHAKPAIAESDFNLSTSVFLFGLSYTEGTK----------------------RPRGNRSGLQTRELAKTVEIELPVPDTLPTSAESSRTSSSSMLQ-

Query:  -------------------------EGQRVSLEEDDVRWLHAIFRTELPGGPGGDIEP
                                 EG    +  DD+ WLHA+F  +  G PGG + P
Subjt:  -------------------------EGQRVSLEEDDVRWLHAIFRTELPGGPGGDIEP

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
No hits found

Sequences
CDS sequence
ATGATTTTGAGGTTGAAGAAGATAGGAAAAGCTCATGCAAAGCCAGCCATAGCAGAATCTGACTTCAACCTGTCAACCTCTGTTTTTTTATTCGGTTTGAGCTACACAGA
AGGAACTAAGAGGCCAAGAGGAAATAGGTCGGGTCTACAAACCAGGGAACTAGCTAAGACTGTAGAGATCGAGCTCCCGGTGCCTGATACACTGCCAACGTCTGCTGAAA
GTTCCAGAACAAGCTCCAGTTCAATGTTACAGGAGGGTCAACGGGTATCGTTAGAGGAGGACGATGTTCGTTGGCTTCACGCCATCTTTCGGACTGAGTTACCAGGTGGT
CCGGGAGGGGATATTGAGCCCGAGGCTATATGGTACCGTGTGCACACAGGTTATATTCCGTTGTTGACGTTGAGTGTACTCCGTCACAACGATGCTGTCGTAGAGATCGA
GCTTCCGGTGCCTGATACACTGTCAGCGTCTGCTGAAAGTTTCAGATCAAGCACCAGTACGTGGGGTCAACAGGGATCGTTAGAGGAGGACGATGTCTGTTTGCTTCACG
CCATCTCCCTGACTAAGCTAGCAGGATTGGAATTCCGTAATTCTGTAAAGCGGAAGCGTGATTGGGACCGATTCATTAAATTCACCAACTGTTGTAAAATAATCACGAAC
TACTGCGAACACCACCACTATGGCTACACCGGTATGACTCTGAGACTTCTAGAGACAGGAGACTGGTGGGAGTATAGAGGGAGCCTTCAGGGGAATTCTCTGAGAGGTCC
ACCACTAAATAAGAGTTCCTCTCGGGCCAGGAGAGGACGACGCGCCTTTGTTCAAGCCCCGGAATCAGCGCTTAAGGGAACACACATCTACTTACCCAATAGGGGAAGGA
GTGAATTCCATCTTGTACTGTTATGTTCCCAGCCCCCATTCGGTCTTGCCCTTGAAATGGATACCCCCACCCGCATGTCTCCTACCTGGATGCTTTGGATCATTGCATCT
GTATCGAATACAAGGGGGGTCGTATCACATAAGGTCACCAGGATAAGGGACAAACCATTTGTTACTTACAGTGCAAGAAAGAAGAGTCCCAATGAGGTTGCATCTGAACA
GTCGCTTGTCATCGAGCCCCTCAAAGTGGCTAGAATGCCTCCGGATGTGTTTGAGGACATGATCTGCCAAGCTGTGGCACAAGTCCTCCTTATCGCTGAAGGTTATAGGG
CAGAGCAAAATGCCTTGATGGAGATCCGGGCTGAAAGAGAGATGGAAAACCAAACCTTTGAGCCACTGCACAAGGCTCAAAGTGAGGCTGAATTGATGCAAGGAAGAGAA
GAAAAGGCCCTTGAGGGGTCAAATGAAGAGAACCAAGAAAAAGAAGAAAAAGAAGAAGACGGGAATGAAGGCCAGAATGCGACCGCATTGGGGCCGCATTCTGAAGAAGG
CAAAGAAAAGGCCACTGAAGAGCAGCCAGCTGATGAGACTTTGGATCCTCTGTTTGAGTATGATATTAAGGAATTGGATAACGACCAAGTTCCTATCTCTGCGGCATTGA
GGAAAAAGAGAATAACAGAGATTAGGGCCGAAAGGAGGACCAAAAATAGAAATGATCTGATCTTTGCCAAGAGGTCGAGGACAAGGTCCGTGGACGCTTCTCCTGCATCT
CCTCCAACCATCTCACCTGCCAAGCCGAAAGCCAAATCGCCTAAGGCTCCATCTCCTAAAAATCCATTCCCAGAAGTTTGTAAAGATGTAAATTTTCAGGAACGGATGGA
GATCATGAGAAAAAGAGATTTCCTCAACGAGAAGGGATTCTCCAATAGAGCGAGGACACTGTCAGAGTTCGTAACCAAAGTTATCTCACAGTACAAGTGGCAGGAGTTCT
GTGCTCACCCTCAGGAGGTCGTGGTGCCTTTAGTTCGAGAATTTTACGCCGGACTGAGGGAGGAAAGGATGAGTATGGCAGTGGTGAGAGGCAAAATGGTCAGCTTCTCT
TCTGTTAACATCAACCGGGTGTACAGAATCAAAGCACCCTTACATCCAAGAGGGAACAATGCCATTAAGAACCCCTCGGCCAAACAAATGAAAGAGGCGCTAAAAATGGT
GGCCAACAAGGGTGTTCAGTGGAAAGAGTCCCAAACGAAGGTGAAGACATTAGTGCCAGAATCGGCAGTATGGCTTCACTTTCTGAAGAACAGGGAGGAAATCCTTGCCT
GTGGAAAGAAGAAGGTAGGGAAGCTTTTCTTCGGGTCGCTTATCACCCAGCTATGTCAGAGGGTGAAGATAGTTCCTGGTAAGGATGAGGAGCGTCACTTCTTCCGGCCT
ACCATCGACCTATCCCTGATTGGGAAGCTTCAACAGAACAACGCCCAAAGAAAAGACAAAGCTTCCACATCTCAAGCCACTCCACCATCAGGGCTGAATCCGGCTTCTCC
AGCTCAACACACTCCTTTTTCAGGGCCCTCACCGTCATCTGAAGCCCTAGCAATTTCCTACCAACAACTGGATCAAATCAGGGACAACCTGAGGACTTATTGGGCATATG
CCAAGGAGAGAGATGAAGCGATTAGAGAGTTTTACCTCTCTATCTCGCCGAGTATTGCTCTTGTCTTTCCTGATTTCCCTCAAACGCTGCTACCTCAAGAAGAAAAGGAC
TCTGATGATGATGAAGAAGATGAAAATGAAGATGAAGAAAAAGAGTTCCTCGGATGA
mRNA sequence
ATGATTTTGAGGTTGAAGAAGATAGGAAAAGCTCATGCAAAGCCAGCCATAGCAGAATCTGACTTCAACCTGTCAACCTCTGTTTTTTTATTCGGTTTGAGCTACACAGA
AGGAACTAAGAGGCCAAGAGGAAATAGGTCGGGTCTACAAACCAGGGAACTAGCTAAGACTGTAGAGATCGAGCTCCCGGTGCCTGATACACTGCCAACGTCTGCTGAAA
GTTCCAGAACAAGCTCCAGTTCAATGTTACAGGAGGGTCAACGGGTATCGTTAGAGGAGGACGATGTTCGTTGGCTTCACGCCATCTTTCGGACTGAGTTACCAGGTGGT
CCGGGAGGGGATATTGAGCCCGAGGCTATATGGTACCGTGTGCACACAGGTTATATTCCGTTGTTGACGTTGAGTGTACTCCGTCACAACGATGCTGTCGTAGAGATCGA
GCTTCCGGTGCCTGATACACTGTCAGCGTCTGCTGAAAGTTTCAGATCAAGCACCAGTACGTGGGGTCAACAGGGATCGTTAGAGGAGGACGATGTCTGTTTGCTTCACG
CCATCTCCCTGACTAAGCTAGCAGGATTGGAATTCCGTAATTCTGTAAAGCGGAAGCGTGATTGGGACCGATTCATTAAATTCACCAACTGTTGTAAAATAATCACGAAC
TACTGCGAACACCACCACTATGGCTACACCGGTATGACTCTGAGACTTCTAGAGACAGGAGACTGGTGGGAGTATAGAGGGAGCCTTCAGGGGAATTCTCTGAGAGGTCC
ACCACTAAATAAGAGTTCCTCTCGGGCCAGGAGAGGACGACGCGCCTTTGTTCAAGCCCCGGAATCAGCGCTTAAGGGAACACACATCTACTTACCCAATAGGGGAAGGA
GTGAATTCCATCTTGTACTGTTATGTTCCCAGCCCCCATTCGGTCTTGCCCTTGAAATGGATACCCCCACCCGCATGTCTCCTACCTGGATGCTTTGGATCATTGCATCT
GTATCGAATACAAGGGGGGTCGTATCACATAAGGTCACCAGGATAAGGGACAAACCATTTGTTACTTACAGTGCAAGAAAGAAGAGTCCCAATGAGGTTGCATCTGAACA
GTCGCTTGTCATCGAGCCCCTCAAAGTGGCTAGAATGCCTCCGGATGTGTTTGAGGACATGATCTGCCAAGCTGTGGCACAAGTCCTCCTTATCGCTGAAGGTTATAGGG
CAGAGCAAAATGCCTTGATGGAGATCCGGGCTGAAAGAGAGATGGAAAACCAAACCTTTGAGCCACTGCACAAGGCTCAAAGTGAGGCTGAATTGATGCAAGGAAGAGAA
GAAAAGGCCCTTGAGGGGTCAAATGAAGAGAACCAAGAAAAAGAAGAAAAAGAAGAAGACGGGAATGAAGGCCAGAATGCGACCGCATTGGGGCCGCATTCTGAAGAAGG
CAAAGAAAAGGCCACTGAAGAGCAGCCAGCTGATGAGACTTTGGATCCTCTGTTTGAGTATGATATTAAGGAATTGGATAACGACCAAGTTCCTATCTCTGCGGCATTGA
GGAAAAAGAGAATAACAGAGATTAGGGCCGAAAGGAGGACCAAAAATAGAAATGATCTGATCTTTGCCAAGAGGTCGAGGACAAGGTCCGTGGACGCTTCTCCTGCATCT
CCTCCAACCATCTCACCTGCCAAGCCGAAAGCCAAATCGCCTAAGGCTCCATCTCCTAAAAATCCATTCCCAGAAGTTTGTAAAGATGTAAATTTTCAGGAACGGATGGA
GATCATGAGAAAAAGAGATTTCCTCAACGAGAAGGGATTCTCCAATAGAGCGAGGACACTGTCAGAGTTCGTAACCAAAGTTATCTCACAGTACAAGTGGCAGGAGTTCT
GTGCTCACCCTCAGGAGGTCGTGGTGCCTTTAGTTCGAGAATTTTACGCCGGACTGAGGGAGGAAAGGATGAGTATGGCAGTGGTGAGAGGCAAAATGGTCAGCTTCTCT
TCTGTTAACATCAACCGGGTGTACAGAATCAAAGCACCCTTACATCCAAGAGGGAACAATGCCATTAAGAACCCCTCGGCCAAACAAATGAAAGAGGCGCTAAAAATGGT
GGCCAACAAGGGTGTTCAGTGGAAAGAGTCCCAAACGAAGGTGAAGACATTAGTGCCAGAATCGGCAGTATGGCTTCACTTTCTGAAGAACAGGGAGGAAATCCTTGCCT
GTGGAAAGAAGAAGGTAGGGAAGCTTTTCTTCGGGTCGCTTATCACCCAGCTATGTCAGAGGGTGAAGATAGTTCCTGGTAAGGATGAGGAGCGTCACTTCTTCCGGCCT
ACCATCGACCTATCCCTGATTGGGAAGCTTCAACAGAACAACGCCCAAAGAAAAGACAAAGCTTCCACATCTCAAGCCACTCCACCATCAGGGCTGAATCCGGCTTCTCC
AGCTCAACACACTCCTTTTTCAGGGCCCTCACCGTCATCTGAAGCCCTAGCAATTTCCTACCAACAACTGGATCAAATCAGGGACAACCTGAGGACTTATTGGGCATATG
CCAAGGAGAGAGATGAAGCGATTAGAGAGTTTTACCTCTCTATCTCGCCGAGTATTGCTCTTGTCTTTCCTGATTTCCCTCAAACGCTGCTACCTCAAGAAGAAAAGGAC
TCTGATGATGATGAAGAAGATGAAAATGAAGATGAAGAAAAAGAGTTCCTCGGATGA
Protein sequence
MILRLKKIGKAHAKPAIAESDFNLSTSVFLFGLSYTEGTKRPRGNRSGLQTRELAKTVEIELPVPDTLPTSAESSRTSSSSMLQEGQRVSLEEDDVRWLHAIFRTELPGG
PGGDIEPEAIWYRVHTGYIPLLTLSVLRHNDAVVEIELPVPDTLSASAESFRSSTSTWGQQGSLEEDDVCLLHAISLTKLAGLEFRNSVKRKRDWDRFIKFTNCCKIITN
YCEHHHYGYTGMTLRLLETGDWWEYRGSLQGNSLRGPPLNKSSSRARRGRRAFVQAPESALKGTHIYLPNRGRSEFHLVLLCSQPPFGLALEMDTPTRMSPTWMLWIIAS
VSNTRGVVSHKVTRIRDKPFVTYSARKKSPNEVASEQSLVIEPLKVARMPPDVFEDMICQAVAQVLLIAEGYRAEQNALMEIRAEREMENQTFEPLHKAQSEAELMQGRE
EKALEGSNEENQEKEEKEEDGNEGQNATALGPHSEEGKEKATEEQPADETLDPLFEYDIKELDNDQVPISAALRKKRITEIRAERRTKNRNDLIFAKRSRTRSVDASPAS
PPTISPAKPKAKSPKAPSPKNPFPEVCKDVNFQERMEIMRKRDFLNEKGFSNRARTLSEFVTKVISQYKWQEFCAHPQEVVVPLVREFYAGLREERMSMAVVRGKMVSFS
SVNINRVYRIKAPLHPRGNNAIKNPSAKQMKEALKMVANKGVQWKESQTKVKTLVPESAVWLHFLKNREEILACGKKKVGKLFFGSLITQLCQRVKIVPGKDEERHFFRP
TIDLSLIGKLQQNNAQRKDKASTSQATPPSGLNPASPAQHTPFSGPSPSSEALAISYQQLDQIRDNLRTYWAYAKERDEAIREFYLSISPSIALVFPDFPQTLLPQEEKD
SDDDEEDENEDEEKEFLG
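
The protein sequence can be cross-checked against the CDS by translating it codon by codon. The sketch below assumes the standard nuclear genetic code (the page does not say which translation table was used) and defines a hypothetical helper, `translate`, for illustration. It is applied to the final 57 nucleotides of the CDS listed in this record.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code, codons enumerated in TCAG order (assumed table).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a nucleotide string codon by codon; '*' marks a stop."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))

# Last line of the CDS sequence in this record.
tail = "TCTGATGATGATGAAGAAGATGAAAATGAAGATGAAGAAAAAGAGTTCCTCGGATGA"
print(translate(tail))
```

The translated tail, `SDDDEEDENEDEEKEFLG` followed by a stop codon, agrees with the last 18 residues of the protein sequence. Joining all CDS lines into one string before calling `translate` would check the full record in the same way.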