CuGenDBv2

Lag0006238 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0006238
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Ty3-gypsy retrotransposon protein
Genome location: chr6:39779308..39781564
RNA-Seq Expression: Lag0006238
Synteny: Lag0006238
Gene Ontology terms: NA
InterPro domains: IPR005162 - Retrotransposon gag domain
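
The genome location field above can be split into its parts when the record is processed by a script. The following is a minimal sketch only: the helper names `parse_location` and `span_length` are hypothetical, and the length calculation assumes the coordinates are 1-based and inclusive, which the record itself does not state.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location: {location!r}")
    chrom, start, end = match.groups()
    return chrom, int(start), int(end)


def span_length(start: int, end: int) -> int:
    """Length of the genomic span.

    Assumption: coordinates are 1-based and inclusive on both ends.
    """
    return end - start + 1


chrom, start, end = parse_location("chr6:39779308..39781564")
print(chrom, start, end, span_length(start, end))
```

Under the inclusive-coordinate assumption the span is 2257 bp.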


Homology
GenBank top hits (e-value, % identity, alignment)
KAA0047477.1 uncharacterized protein E6C27_scaffold498G00940 [Cucumis melo var. makuwa] | e-value: 7.1e-79 | identity: 62.14%
Query:  KKLNKPVQEDQQAREQRAYTNDKFLVKYNPLFEPDSDVVTVMMTETRTTEERMTEMQEHINNLMKAIEEKDSHIAQLKCQIENQHIAESSQTQVIKNHDK
        ++++KP +     +E  A      L + +    P  ++++VM+T+  T+E+RM E+++ +N LMKA+EE+D  IA LK  IE++  AESS T  IKN +K
Subjt:  KKLNKPVQEDQQAREQRAYTNDKFLVKYNPLFEPDSDVVTVMMTETRTTEERMTEMQEHINNLMKAIEEKDSHIAQLKCQIENQHIAESSQTQVIKNHDK

Query:  GKTTVQDDQPHCSASVASLSIQQLQDMITNCIRAQYGGPTQDSLLYSKPYTKRIDNLRMPIGYQPPKFQQFDGRGNPKQHIAHFVETCENAGTRGDLLVK
        GK  +Q+ QP  S S+ASLS+QQLQ+MI N I+ QYGGP Q   LYSKPYTKRIDN+RMP GYQPPKFQQFDG+GNPKQH+AHF+ETCE AGTRGDLLVK
Subjt:  GKTTVQDDQPHCSASVASLSIQQLQDMITNCIRAQYGGPTQDSLLYSKPYTKRIDNLRMPIGYQPPKFQQFDGRGNPKQHIAHFVETCENAGTRGDLLVK

Query:  QLVRTLKGNAFDWYTDLEPESIDSWEELEREFLNRFYSTRRTV
        Q VRTLKGNAFDWYTDLEPESIDSWE+LER+FLNRFYSTRR V
Subjt:  QLVRTLKGNAFDWYTDLEPESIDSWEELEREFLNRFYSTRRTV

XP_031735972.1 uncharacterized protein LOC116401693 [Cucumis sativus] | e-value: 7.8e-78 | identity: 57.73%
Query:  KLTSRFNRSSSQQADPRDHQVSRPIIQEDQQANKPIQEIINLAGRSSKKLNKPVQEDQQAREQRAYTNDKFLVKYNPLFEPDSDVVTVMMTETRTTEERM
        K  S+ + +S     P     S+ IIQ   Q +   Q I+       K+L +  +     +E   Y ND        L E   DV++VMM +    E  M
Subjt:  KLTSRFNRSSSQQADPRDHQVSRPIIQEDQQANKPIQEIINLAGRSSKKLNKPVQEDQQAREQRAYTNDKFLVKYNPLFEPDSDVVTVMMTETRTTEERM

Query:  TEMQEHINNLMKAIEEKDSHIAQLKCQIENQHIAESSQTQVIKNHDKGKTTVQDDQP-HCSASVASLSIQQLQDMITNCIRAQYGGPTQDSLLYSKPYTK
         EM+  IN LMK ++E+D  IA LK Q++ +  AESSQT V+K  DKGK  VQ++QP   S SVASLS+QQLQDMITN IRAQYGGP+Q S +YSKPYTK
Subjt:  TEMQEHINNLMKAIEEKDSHIAQLKCQIENQHIAESSQTQVIKNHDKGKTTVQDDQP-HCSASVASLSIQQLQDMITNCIRAQYGGPTQDSLLYSKPYTK

Query:  RIDNLRMPIGYQPPKFQQFDGRGNPKQHIAHFVETCENAGTRGDLLVKQLVRTLKGNAFDWYTDLEPESIDSWEELEREFLNRFYSTRRTV
        RIDNLRMP+GYQPPKFQQFDG+GNPKQH+AHFVETCENAG+RGD LV+Q VR+LKGNAF+WYTDLEPESI+SWE+LE+EFLNRFYSTRRTV
Subjt:  RIDNLRMPIGYQPPKFQQFDGRGNPKQHIAHFVETCENAGTRGDLLVKQLVRTLKGNAFDWYTDLEPESIDSWEELEREFLNRFYSTRRTV

XP_031739134.1 uncharacterized protein LOC116402863 [Cucumis sativus] | e-value: 3.0e-77 | identity: 57.39%
Query:  KLTSRFNRSSSQQADPRDHQVSRPIIQEDQQANKPIQEIINLAGRSSKKLNKPVQEDQQAREQRAYTNDKFLVKYNPLFEPDSDVVTVMMTETRTTEERM
        K  S+ + +S     P     S+ IIQ   Q +   Q I+       K+L +  +     +E   Y ND        L E   DV++VMM +    E  M
Subjt:  KLTSRFNRSSSQQADPRDHQVSRPIIQEDQQANKPIQEIINLAGRSSKKLNKPVQEDQQAREQRAYTNDKFLVKYNPLFEPDSDVVTVMMTETRTTEERM

Query:  TEMQEHINNLMKAIEEKDSHIAQLKCQIENQHIAESSQTQVIKNHDKGKTTVQDDQP-HCSASVASLSIQQLQDMITNCIRAQYGGPTQDSLLYSKPYTK
         EM+  IN LMK ++E+D  IA LK Q++ +  AESSQT V+K  DKGK  VQ++QP   S SVASLS+QQLQDMIT+ IRAQYGGP+Q S +YSKPYTK
Subjt:  TEMQEHINNLMKAIEEKDSHIAQLKCQIENQHIAESSQTQVIKNHDKGKTTVQDDQP-HCSASVASLSIQQLQDMITNCIRAQYGGPTQDSLLYSKPYTK

Query:  RIDNLRMPIGYQPPKFQQFDGRGNPKQHIAHFVETCENAGTRGDLLVKQLVRTLKGNAFDWYTDLEPESIDSWEELEREFLNRFYSTRRTV
        RIDNLRMP+GYQPPKFQQFDG+GNPKQH+AHFVETCENAG+RGD LV+Q VR+LKGNAF+WYTDLEPESI+SWE+LE+EFLNRFYSTRRTV
Subjt:  RIDNLRMPIGYQPPKFQQFDGRGNPKQHIAHFVETCENAGTRGDLLVKQLVRTLKGNAFDWYTDLEPESIDSWEELEREFLNRFYSTRRTV

XP_031742032.1 uncharacterized protein LOC116404025 [Cucumis sativus] | e-value: 3.0e-77 | identity: 57.39%
Query:  KLTSRFNRSSSQQADPRDHQVSRPIIQEDQQANKPIQEIINLAGRSSKKLNKPVQEDQQAREQRAYTNDKFLVKYNPLFEPDSDVVTVMMTETRTTEERM
        K  S+ + +S     P     S+ IIQ   Q +   Q I+       K+L +  +     +E   Y ND        L E   DV++VMM +    E  M
Subjt:  KLTSRFNRSSSQQADPRDHQVSRPIIQEDQQANKPIQEIINLAGRSSKKLNKPVQEDQQAREQRAYTNDKFLVKYNPLFEPDSDVVTVMMTETRTTEERM

Query:  TEMQEHINNLMKAIEEKDSHIAQLKCQIENQHIAESSQTQVIKNHDKGKTTVQDDQP-HCSASVASLSIQQLQDMITNCIRAQYGGPTQDSLLYSKPYTK
         EM+  IN LMK ++E+D  IA LK Q++ +  AESSQT V+K  DKGK  VQ++QP   S SVASLS+QQLQDMIT+ IRAQYGGP+Q S +YSKPYTK
Subjt:  TEMQEHINNLMKAIEEKDSHIAQLKCQIENQHIAESSQTQVIKNHDKGKTTVQDDQP-HCSASVASLSIQQLQDMITNCIRAQYGGPTQDSLLYSKPYTK

Query:  RIDNLRMPIGYQPPKFQQFDGRGNPKQHIAHFVETCENAGTRGDLLVKQLVRTLKGNAFDWYTDLEPESIDSWEELEREFLNRFYSTRRTV
        RIDNLRMP+GYQPPKFQQFDG+GNPKQH+AHFVETCENAG+RGD LV+Q VR+LKGNAF+WYTDLEPESI+SWE+LE+EFLNRFYSTRRTV
Subjt:  RIDNLRMPIGYQPPKFQQFDGRGNPKQHIAHFVETCENAGTRGDLLVKQLVRTLKGNAFDWYTDLEPESIDSWEELEREFLNRFYSTRRTV

XP_031742199.1 uncharacterized protein LOC105435721 [Cucumis sativus] | e-value: 7.8e-78 | identity: 57.73%
Query:  KLTSRFNRSSSQQADPRDHQVSRPIIQEDQQANKPIQEIINLAGRSSKKLNKPVQEDQQAREQRAYTNDKFLVKYNPLFEPDSDVVTVMMTETRTTEERM
        K  S+ + +S     P     S+ IIQ   Q +   Q I+       K+L +  +     +E   Y ND        L E   DV++VMM +    E  M
Subjt:  KLTSRFNRSSSQQADPRDHQVSRPIIQEDQQANKPIQEIINLAGRSSKKLNKPVQEDQQAREQRAYTNDKFLVKYNPLFEPDSDVVTVMMTETRTTEERM

Query:  TEMQEHINNLMKAIEEKDSHIAQLKCQIENQHIAESSQTQVIKNHDKGKTTVQDDQP-HCSASVASLSIQQLQDMITNCIRAQYGGPTQDSLLYSKPYTK
         EM+  IN LMK ++E+D  IA LK Q++ +  AESSQT V+K  DKGK  VQ++QP   S SVASLS+QQLQDMITN IRAQYGGP+Q S +YSKPYTK
Subjt:  TEMQEHINNLMKAIEEKDSHIAQLKCQIENQHIAESSQTQVIKNHDKGKTTVQDDQP-HCSASVASLSIQQLQDMITNCIRAQYGGPTQDSLLYSKPYTK

Query:  RIDNLRMPIGYQPPKFQQFDGRGNPKQHIAHFVETCENAGTRGDLLVKQLVRTLKGNAFDWYTDLEPESIDSWEELEREFLNRFYSTRRTV
        RIDNLRMP+GYQPPKFQQFDG+GNPKQH+AHFVETCENAG+RGD LV+Q VR+LKGNAF+WYTDLEPESI+SWE+LE+EFLNRFYSTRRTV
Subjt:  RIDNLRMPIGYQPPKFQQFDGRGNPKQHIAHFVETCENAGTRGDLLVKQLVRTLKGNAFDWYTDLEPESIDSWEELEREFLNRFYSTRRTV

TrEMBL top hits (e-value, % identity, alignment)
A0A5A7TZU9 Ribonuclease H | e-value: 3.4e-79 | identity: 62.14%
Query:  KKLNKPVQEDQQAREQRAYTNDKFLVKYNPLFEPDSDVVTVMMTETRTTEERMTEMQEHINNLMKAIEEKDSHIAQLKCQIENQHIAESSQTQVIKNHDK
        ++++KP +     +E  A      L + +    P  ++++VM+T+  T+E+RM E+++ +N LMKA+EE+D  IA LK  IE++  AESS T  IKN +K
Subjt:  KKLNKPVQEDQQAREQRAYTNDKFLVKYNPLFEPDSDVVTVMMTETRTTEERMTEMQEHINNLMKAIEEKDSHIAQLKCQIENQHIAESSQTQVIKNHDK

Query:  GKTTVQDDQPHCSASVASLSIQQLQDMITNCIRAQYGGPTQDSLLYSKPYTKRIDNLRMPIGYQPPKFQQFDGRGNPKQHIAHFVETCENAGTRGDLLVK
        GK  +Q+ QP  S S+ASLS+QQLQ+MI N I+ QYGGP Q   LYSKPYTKRIDN+RMP GYQPPKFQQFDG+GNPKQH+AHF+ETCE AGTRGDLLVK
Subjt:  GKTTVQDDQPHCSASVASLSIQQLQDMITNCIRAQYGGPTQDSLLYSKPYTKRIDNLRMPIGYQPPKFQQFDGRGNPKQHIAHFVETCENAGTRGDLLVK

Query:  QLVRTLKGNAFDWYTDLEPESIDSWEELEREFLNRFYSTRRTV
        Q VRTLKGNAFDWYTDLEPESIDSWE+LER+FLNRFYSTRR V
Subjt:  QLVRTLKGNAFDWYTDLEPESIDSWEELEREFLNRFYSTRRTV

A0A5A7UUI7 Ty3-gypsy retrotransposon protein | e-value: 9.3e-77 | identity: 60.49%
Query:  KKLNKPVQEDQQAREQRAYTNDKFLVKYNPLFEPDSDVVTVMMTETRTTEERMTEMQEHINNLMKAIEEKDSHIAQLKCQIENQHIAESSQTQVIKNHDK
        ++++KP +     +E  A      L + +    P  ++++VM+T   T+E RM E+++ +N LMK +EE+D  IA LK  IE++  AESS    +KN DK
Subjt:  KKLNKPVQEDQQAREQRAYTNDKFLVKYNPLFEPDSDVVTVMMTETRTTEERMTEMQEHINNLMKAIEEKDSHIAQLKCQIENQHIAESSQTQVIKNHDK

Query:  GKTTVQDDQPHCSASVASLSIQQLQDMITNCIRAQYGGPTQDSLLYSKPYTKRIDNLRMPIGYQPPKFQQFDGRGNPKQHIAHFVETCENAGTRGDLLVK
        GK  +Q+ QP  S S+ASLS+QQLQ+MI + I+ QYGGP Q   LYSKPYTKRIDNLRMP GYQPPKFQQFDG+GNPKQH+AHF+ETCE AGTRGDLLVK
Subjt:  GKTTVQDDQPHCSASVASLSIQQLQDMITNCIRAQYGGPTQDSLLYSKPYTKRIDNLRMPIGYQPPKFQQFDGRGNPKQHIAHFVETCENAGTRGDLLVK

Query:  QLVRTLKGNAFDWYTDLEPESIDSWEELEREFLNRFYSTRRTV
        Q VRTLKGNAFDWY DLEPESID+WE+LER+FLNRFYSTRR V
Subjt:  QLVRTLKGNAFDWYTDLEPESIDSWEELEREFLNRFYSTRRTV

A0A5A7UZD5 Ty3-gypsy retrotransposon protein | e-value: 1.2e-76 | identity: 59.67%
Query:  KKLNKPVQEDQQAREQRAYTNDKFLVKYNPLFEPDSDVVTVMMTETRTTEERMTEMQEHINNLMKAIEEKDSHIAQLKCQIENQHIAESSQTQVIKNHDK
        ++++KP +     +E  A      L + +    P  +++++M+T+  T+E+RMT++++ +N LMKA+EE+D  IA LK  IE+   AESS +  IKN +K
Subjt:  KKLNKPVQEDQQAREQRAYTNDKFLVKYNPLFEPDSDVVTVMMTETRTTEERMTEMQEHINNLMKAIEEKDSHIAQLKCQIENQHIAESSQTQVIKNHDK

Query:  GKTTVQDDQPHCSASVASLSIQQLQDMITNCIRAQYGGPTQDSLLYSKPYTKRIDNLRMPIGYQPPKFQQFDGRGNPKQHIAHFVETCENAGTRGDLLVK
        GK  +Q+ QP  S S+ASLS+QQLQ+MI N I+ QYGGP +   LYSKPYTKRIDN+RMP GYQPPKFQQFDG+GNPKQH+A+F+ETCE  GTRGDLLVK
Subjt:  GKTTVQDDQPHCSASVASLSIQQLQDMITNCIRAQYGGPTQDSLLYSKPYTKRIDNLRMPIGYQPPKFQQFDGRGNPKQHIAHFVETCENAGTRGDLLVK

Query:  QLVRTLKGNAFDWYTDLEPESIDSWEELEREFLNRFYSTRRTV
        Q VRTLKGNAFDWYTDLEPESIDSW++LER+FLNRFYSTRR V
Subjt:  QLVRTLKGNAFDWYTDLEPESIDSWEELEREFLNRFYSTRRTV

A0A5D3BRH6 Ty3-gypsy retrotransposon protein | e-value: 1.2e-76 | identity: 59.67%
Query:  KKLNKPVQEDQQAREQRAYTNDKFLVKYNPLFEPDSDVVTVMMTETRTTEERMTEMQEHINNLMKAIEEKDSHIAQLKCQIENQHIAESSQTQVIKNHDK
        ++++KP +     +E  A      L + +    P  +++++M+T+  T+E+RMT++++ +N LMKA+EE+D  IA LK  IE+   AESS +  IKN +K
Subjt:  KKLNKPVQEDQQAREQRAYTNDKFLVKYNPLFEPDSDVVTVMMTETRTTEERMTEMQEHINNLMKAIEEKDSHIAQLKCQIENQHIAESSQTQVIKNHDK

Query:  GKTTVQDDQPHCSASVASLSIQQLQDMITNCIRAQYGGPTQDSLLYSKPYTKRIDNLRMPIGYQPPKFQQFDGRGNPKQHIAHFVETCENAGTRGDLLVK
        GK  +Q+ QP  S S+ASLS+QQLQ+MI N I+ QYGGP +   LYSKPYTKRIDN+RMP GYQPPKFQQFDG+GNPKQH+A+F+ETCE  GTRGDLLVK
Subjt:  GKTTVQDDQPHCSASVASLSIQQLQDMITNCIRAQYGGPTQDSLLYSKPYTKRIDNLRMPIGYQPPKFQQFDGRGNPKQHIAHFVETCENAGTRGDLLVK

Query:  QLVRTLKGNAFDWYTDLEPESIDSWEELEREFLNRFYSTRRTV
        Q VRTLKGNAFDWYTDLEPESIDSW++LER+FLNRFYSTRR V
Subjt:  QLVRTLKGNAFDWYTDLEPESIDSWEELEREFLNRFYSTRRTV

A0A5D3D4X3 Ty3-gypsy retrotransposon protein | e-value: 3.2e-77 | identity: 60.08%
Query:  KKLNKPVQEDQQAREQRAYTNDKFLVKYNPLFEPDSDVVTVMMTETRTTEERMTEMQEHINNLMKAIEEKDSHIAQLKCQIENQHIAESSQTQVIKNHDK
        ++++KP ++    +E  A      L + +    P  ++++VM+T+   +E+RM E+++ +N LMK +EE+D  IA LK  IE++  AESS    +KN DK
Subjt:  KKLNKPVQEDQQAREQRAYTNDKFLVKYNPLFEPDSDVVTVMMTETRTTEERMTEMQEHINNLMKAIEEKDSHIAQLKCQIENQHIAESSQTQVIKNHDK

Query:  GKTTVQDDQPHCSASVASLSIQQLQDMITNCIRAQYGGPTQDSLLYSKPYTKRIDNLRMPIGYQPPKFQQFDGRGNPKQHIAHFVETCENAGTRGDLLVK
        GK  +Q+ QP  S S+ASLS+QQLQ+MI + I+ QYGGP Q   LYSKPYTKRIDNLRMP GYQPPKFQQFDG+GNPKQH+AHF+ETCE AGTRGDLLVK
Subjt:  GKTTVQDDQPHCSASVASLSIQQLQDMITNCIRAQYGGPTQDSLLYSKPYTKRIDNLRMPIGYQPPKFQQFDGRGNPKQHIAHFVETCENAGTRGDLLVK

Query:  QLVRTLKGNAFDWYTDLEPESIDSWEELEREFLNRFYSTRRTV
        Q VRTLKGNAFDWY DLEPESID+WE+LER+FLNRFYSTRR V
Subjt:  QLVRTLKGNAFDWYTDLEPESIDSWEELEREFLNRFYSTRRTV

SwissProt top hits: No hits found
Arabidopsis top hits: No hits found

Sequences
CDS sequence
ATGCCACGTGGCAATTGGACCCAAGCCCAAGCCCATATGAGGTCATCAAACTCTATAAATAGAGGAGGTCACCATTCAGATTCGGGAGATTCAGCCAGAGGCAGAGAGGG
CAGAGTCCAGAGCATTCTCCCAAGATTGACCGATCAAGAAGATCAACAACTAACAAGTCGATCCAACAGATCATCAAGCCAACAGGCTGATCCAAGAGATCAACAAGCCA
ACCGACCGATCAAGAAGATCAACAAGTCAGCAGGCCGATCATCCAAGAAGATCAACAAGCCAACCGATCGAACAGATCATCAAGCCAACAGGCCGATCAAGAAGGTCAAC
AAGTCAGCAGGCCGATCATCCAAGAGGATCAACAAGCTAACAAGTCGATTCAACAGATCATCAAGCCAACAGGCCGATCCAAGAGATCATCAAGTCAGCAGGCCGATCAT
CCAAGAGGATCAACAAGCTAACAAGCCGATCCAAGAGATCATCAACCTAGCAGGCCGATCATCCAAGAAGCTCAACAAACCAGTCCAAGAAGATCAACAAGCTAGAGAAC
AAAGGGCATATACCAACGACAAGTTTCTTGTTAAGTATAACCCTCTGTTTGAACCTGATTCTGACGTAGTGACTGTCATGATGACTGAGACAAGAACTACGGAAGAAAGA
ATGACTGAGATGCAAGAACACATCAACAATTTGATGAAGGCGATTGAAGAAAAAGATTCTCATATCGCGCAACTAAAGTGCCAAATTGAGAACCAACATATCGCCGAATC
AAGTCAAACCCAAGTCATAAAAAATCATGACAAAGGAAAGACTACAGTGCAAGATGATCAGCCACATTGTTCTGCTTCGGTCGCTTCACTATCCATCCAACAGCTCCAAG
ATATGATCACAAACTGTATCAGAGCTCAGTACGGTGGACCTACTCAAGATTCCCTCTTATATTCCAAACCTTATACTAAGAGGATTGATAACTTGAGAATGCCAATCGGG
TATCAGCCACCAAAATTTCAGCAGTTCGATGGAAGGGGCAATCCTAAACAACATATTGCTCACTTCGTTGAGACATGCGAGAACGCTGGTACTCGAGGGGACCTACTAGT
CAAACAGCTCGTTCGAACACTTAAAGGAAATGCTTTTGACTGGTACACTGATCTAGAACCTGAGTCAATAGACAGTTGGGAGGAACTCGAAAGAGAGTTCTTGAATCGCT
TCTACAGCACTAGAAGAACCGTAATTTTGGACCACCATGGCGTACAAGAAGCTGACGAGGACAACCGGGCAGAAAAAGAGTTGAGGAATGGATCCAAAGGGCAAAACCGG
CAAGTGGGACGGGCCAAGACCGAAGGGGTCGGCCAAAGGCCAAAGGCCGAGGCCGACCCTCGACCCGCCTGCGCGGGCCGGGCTTGTCCGGCTCCGCTTGGTCCCCACCG
TCTCTGGCCGCCTCGGTTCAGCCTGTTTGAGCAGGTATTTATATCCCTCTTCGCCACCGAAGAAGGGGACTCAAACTCTAATCCCGAAACTCATTATATATTCTCTACTC
TCTCCTCTTGCTCTTACTTTTCCGCTCCCCACCGTTCTGCTCGCTGA
mRNA sequence
ATGCCACGTGGCAATTGGACCCAAGCCCAAGCCCATATGAGGTCATCAAACTCTATAAATAGAGGAGGTCACCATTCAGATTCGGGAGATTCAGCCAGAGGCAGAGAGGG
CAGAGTCCAGAGCATTCTCCCAAGATTGACCGATCAAGAAGATCAACAACTAACAAGTCGATCCAACAGATCATCAAGCCAACAGGCTGATCCAAGAGATCAACAAGCCA
ACCGACCGATCAAGAAGATCAACAAGTCAGCAGGCCGATCATCCAAGAAGATCAACAAGCCAACCGATCGAACAGATCATCAAGCCAACAGGCCGATCAAGAAGGTCAAC
AAGTCAGCAGGCCGATCATCCAAGAGGATCAACAAGCTAACAAGTCGATTCAACAGATCATCAAGCCAACAGGCCGATCCAAGAGATCATCAAGTCAGCAGGCCGATCAT
CCAAGAGGATCAACAAGCTAACAAGCCGATCCAAGAGATCATCAACCTAGCAGGCCGATCATCCAAGAAGCTCAACAAACCAGTCCAAGAAGATCAACAAGCTAGAGAAC
AAAGGGCATATACCAACGACAAGTTTCTTGTTAAGTATAACCCTCTGTTTGAACCTGATTCTGACGTAGTGACTGTCATGATGACTGAGACAAGAACTACGGAAGAAAGA
ATGACTGAGATGCAAGAACACATCAACAATTTGATGAAGGCGATTGAAGAAAAAGATTCTCATATCGCGCAACTAAAGTGCCAAATTGAGAACCAACATATCGCCGAATC
AAGTCAAACCCAAGTCATAAAAAATCATGACAAAGGAAAGACTACAGTGCAAGATGATCAGCCACATTGTTCTGCTTCGGTCGCTTCACTATCCATCCAACAGCTCCAAG
ATATGATCACAAACTGTATCAGAGCTCAGTACGGTGGACCTACTCAAGATTCCCTCTTATATTCCAAACCTTATACTAAGAGGATTGATAACTTGAGAATGCCAATCGGG
TATCAGCCACCAAAATTTCAGCAGTTCGATGGAAGGGGCAATCCTAAACAACATATTGCTCACTTCGTTGAGACATGCGAGAACGCTGGTACTCGAGGGGACCTACTAGT
CAAACAGCTCGTTCGAACACTTAAAGGAAATGCTTTTGACTGGTACACTGATCTAGAACCTGAGTCAATAGACAGTTGGGAGGAACTCGAAAGAGAGTTCTTGAATCGCT
TCTACAGCACTAGAAGAACCGTAATTTTGGACCACCATGGCGTACAAGAAGCTGACGAGGACAACCGGGCAGAAAAAGAGTTGAGGAATGGATCCAAAGGGCAAAACCGG
CAAGTGGGACGGGCCAAGACCGAAGGGGTCGGCCAAAGGCCAAAGGCCGAGGCCGACCCTCGACCCGCCTGCGCGGGCCGGGCTTGTCCGGCTCCGCTTGGTCCCCACCG
TCTCTGGCCGCCTCGGTTCAGCCTGTTTGAGCAGGTATTTATATCCCTCTTCGCCACCGAAGAAGGGGACTCAAACTCTAATCCCGAAACTCATTATATATTCTCTACTC
TCTCCTCTTGCTCTTACTTTTCCGCTCCCCACCGTTCTGCTCGCTGA
Protein sequence
MPRGNWTQAQAHMRSSNSINRGGHHSDSGDSARGREGRVQSILPRLTDQEDQQLTSRSNRSSSQQADPRDQQANRPIKKINKSAGRSSKKINKPTDRTDHQANRPIKKVN
KSAGRSSKRINKLTSRFNRSSSQQADPRDHQVSRPIIQEDQQANKPIQEIINLAGRSSKKLNKPVQEDQQAREQRAYTNDKFLVKYNPLFEPDSDVVTVMMTETRTTEER
MTEMQEHINNLMKAIEEKDSHIAQLKCQIENQHIAESSQTQVIKNHDKGKTTVQDDQPHCSASVASLSIQQLQDMITNCIRAQYGGPTQDSLLYSKPYTKRIDNLRMPIG
YQPPKFQQFDGRGNPKQHIAHFVETCENAGTRGDLLVKQLVRTLKGNAFDWYTDLEPESIDSWEELEREFLNRFYSTRRTVILDHHGVQEADEDNRAEKELRNGSKGQNR
QVGRAKTEGVGQRPKAEADPRPACAGRACPAPLGPHRLWPPRFSLFEQVFISLFATEEGDSNSNPETHYIFSTLSSCSYFSAPHRSAR
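
The start of the CDS can be checked against the start of the protein sequence by translating it. The following is a minimal sketch, assuming the standard nuclear genetic code applies; the `translate` helper and the `CODON_TABLE` construction are illustrative and not part of the database. Only the first 42 nucleotides of the CDS are used here.

```python
from itertools import product

# Standard genetic code, written in the conventional TCAG codon order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 42 nucleotides of the CDS sequence listed in this record.
cds_prefix = "ATGCCACGTGGCAATTGGACCCAAGCCCAAGCCCATATGAGG"
print(translate(cds_prefix))
```

The printed peptide can be compared with the first 14 residues of the protein sequence above (MPRGNWTQAQAHMR).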