CuGenDBv2

Lag0025508 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0025508
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: DUF4216 domain-containing protein
Genome location: chr10:14089400..14100243
RNA-Seq Expression: Lag0025508
Synteny: Lag0025508
Gene Ontology terms: NA
InterPro domains:
    IPR004242 - Transposon, En/Spm-like
    IPR004252 - Probable transposase, Ptta/En/Spm, plant
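
The genome location above follows a chrom:start..end pattern. As a rough illustration, the Python sketch below splits such a string into its parts and computes the span. The function names `parse_location` and `span_length` are hypothetical, and the code assumes the coordinates are 1-based and inclusive, which this record does not state.

```python
import re


def parse_location(location):
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start, end):
    # Assumption: 1-based inclusive coordinates, so both endpoints are counted.
    return end - start + 1


chrom, start, end = parse_location("chr10:14089400..14100243")
print(chrom, start, end, span_length(start, end))
```

Under that assumption the gene spans 10,844 bp on chr10. The CDS listed further down is much shorter, so most of the span would be non-coding if the coordinates are read this way.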


Homology
GenBank top hits (columns: hit, e value, % identity, alignment)
KAA0032061.1 uncharacterized protein E6C27_scaffold223G00250 [Cucumis melo var. makuwa]; e value 6.3e-54; identity 50%
Query:  MSLLIPGPKSPGKEIDVYLQPLIEELKELWNNGVCTYDCVSGEYFQLHASLLWTINDFPAYGDLSCWSTKGYQACLICKEDTSSFGIRGKIAFMGHQRYL
        M LLIPGPKSPG+EIDVYLQPLIEELK+LW  GV TYD ++G++FQL+A+LLWTINDFP YGDLS WSTKGYQAC IC  D SSFGIRG I+FMGH+RYL
Subjt:  MSLLIPGPKSPGKEIDVYLQPLIEELKELWNNGVCTYDCVSGEYFQLHASLLWTINDFPAYGDLSCWSTKGYQACLICKEDTSSFGIRGKIAFMGHQRYL

Query:  PENHSWRKSKQHNGKLDRRPPPMTMDGESGMFADSISLSFWGQDRVGSWERSLTRWIHSFPILGEIEDLLLLFFYELVREKGVLSRIHQGQHLRLIRPQV
        P+NH WR+S+ H+GK++R+ PP+ +         +IS                             E+   +F + ++     +S  ++ +HLRL R   
Subjt:  PENHSWRKSKQHNGKLDRRPPPMTMDGESGMFADSISLSFWGQDRVGSWERSLTRWIHSFPILGEIEDLLLLFFYELVREKGVLSRIHQGQHLRLIRPQV

Query:  QSLTDLYKRHQLTFPDWFKS
        Q+  DLYK H+  FP+WF++
Subjt:  QSLTDLYKRHQLTFPDWFKS

KAA0038099.1 uncharacterized protein E6C27_scaffold36G003190 [Cucumis melo var. makuwa]; e value 1.5e-55; identity 76.56%
Query:  MSLLIPGPKSPGKEIDVYLQPLIEELKELWNNGVCTYDCVSGEYFQLHASLLWTINDFPAYGDLSCWSTKGYQACLICKEDTSSFGIRGKIAFMGHQRYL
        MS L+PGPKSPGKE+DVYLQPLI+ELKELWNNGV T+DC+  EYF+LHA LLWTINDFPAYGDLS WSTKGYQAC  CKEDTSSFGI+GKI+FMGH+RYL
Subjt:  MSLLIPGPKSPGKEIDVYLQPLIEELKELWNNGVCTYDCVSGEYFQLHASLLWTINDFPAYGDLSCWSTKGYQACLICKEDTSSFGIRGKIAFMGHQRYL

Query:  PENHSWRKSKQHNGKLDRRPPPMTMDGE
          +H WR+S+QH+GK +RRPPP+ M+G+
Subjt:  PENHSWRKSKQHNGKLDRRPPPMTMDGE

KAA0045598.1 uncharacterized protein E6C27_scaffold243G001040 [Cucumis melo var. makuwa]; e value 6.7e-72; identity 30.56%
Query:  MSLLIPGPKSPGKEIDVYLQPLIEELKELWNNGVCTYDCVSGEYFQLHASLLWTINDFPAYGDLSCWSTKGYQACLICKEDTSSFGIRGKIAFMGHQRYL
        MSLL+PGPKSPGKE+DVYLQPLI+ELKELWNNGV T+DC   EYF+LHA LLWTINDFPAYGDLS WSTKGYQAC  CKEDTSSFGI+GKI+FMGH+RYL
Subjt:  MSLLIPGPKSPGKEIDVYLQPLIEELKELWNNGVCTYDCVSGEYFQLHASLLWTINDFPAYGDLSCWSTKGYQACLICKEDTSSFGIRGKIAFMGHQRYL

Query:  PENHSWRKSKQHNGKLDRRPPPMTMDGESGMFADSISLSF------WGQDRVGSWERSLTRWI---------HSFPILGEI-------EDLLLLFFYELV
          +H WR+S+QH+GK +RRPPP+ M+G+  +     S++F         +R    + S+ R +          +   +GE        ++  +  +Y L 
Subjt:  PENHSWRKSKQHNGKLDRRPPPMTMDGESGMFADSISLSF------WGQDRVGSWERSLTRWI---------HSFPILGEI-------EDLLLLFFYELV

Query:  REKGVLSRIHQGQHLRLIRPQVQSLTDLYKRHQLTFPDWFKSHKIPAIKRRGDCC-------------------VFVR----WSVVGEERTIYNEAGV--
            + S  ++ +HL LI  + + + DL++RHQL FP+WF++H + +++ RG+                      FV      S+  + R     +G+  
Subjt:  REKGVLSRIHQGQHLRLIRPQVQSLTDLYKRHQLTFPDWFKSHKIPAIKRRGDCC-------------------VFVR----WSVVGEERTIYNEAGV--

Query:  --------------------------------------TDTREGL-----------TSRYW---------SRSVDT---------ENMSAVRRVQLR---
                                              TD ++             TSR+W         +++V            N   V+ VQ +   
Subjt:  --------------------------------------TDTREGL-----------TSRYW---------SRSVDT---------ENMSAVRRVQLR---

Query:  ------KFQRSRGEEAAAFSFVGASLAKNG----------QVYNEGEKTGWDQLRDAAVEYDNDDLDEFGPPSSTGETSVGNTTTRSYSRNIELD-----
              + +  R E   A S +G   + +            V +  E    DQ R    ++ ND+ ++     S G         R Y RNIELD     
Subjt:  ------KFQRSRGEEAAAFSFVGASLAKNG----------QVYNEGEKTGWDQLRDAAVEYDNDDLDEFGPPSSTGETSVGNTTTRSYSRNIELD-----

Query:  -----------------------SNAIGIAVRESSPVRCASTRTI-------------------------------------------------RSPEVA
                               +  IG AVR + P+ C + + +                                                 +  ++ 
Subjt:  -----------------------SNAIGIAVRESSPVRCASTRTI-------------------------------------------------RSPEVA

Query:  RANPPPRLKENIEDWHFLCTKFETPEWKKLAEANRNNQSALSFNHRVGSKSFLQIREDLKLEQGREIGPIDLFERTHSRNGE-WVNQKANDAHLSLR
         A   PR +   EDW+ +C ++ET  WKK  E N+ + SA+ FNH  G+KSFLQ+R +LK ++G ++  I++F  TH R  E W N KA DA+L ++
Subjt:  RANPPPRLKENIEDWHFLCTKFETPEWKKLAEANRNNQSALSFNHRVGSKSFLQIREDLKLEQGREIGPIDLFERTHSRNGE-WVNQKANDAHLSLR

KAA0066295.1 uncharacterized protein E6C27_scaffold21G003460 [Cucumis melo var. makuwa]; e value 5.2e-56; identity 50.63%
Query:  MSLLIPGPKSPGKEIDVYLQPLIEELKELWNNGVCTYDCVSGEYFQLHASLLWTINDFPAYGDLSCWSTKGYQACLICKEDTSSFGIRGKIAFMGHQRYL
        MSLLIP PKSPG+EIDVYLQPLIEELKELW  GV TYD ++G++FQL+A+LLWTINDFP YGDLS WSTKGYQAC IC  D SSFGIRG+I+FMGH+RYL
Subjt:  MSLLIPGPKSPGKEIDVYLQPLIEELKELWNNGVCTYDCVSGEYFQLHASLLWTINDFPAYGDLSCWSTKGYQACLICKEDTSSFGIRGKIAFMGHQRYL

Query:  PENHSWRKSKQHNGKLDRRPPPMTMDG----------ESGMFADSISLSFWGQDRVGSWERSLTRWIHSFPILGEIEDLLLLFFYELVR-EKGVLSRIHQ
        P+NH WR+S+ H GK++R+ PP+ M+G          E  + +   S+    + R  +W +        F  L     LLL    +++  EK V   + +
Subjt:  PENHSWRKSKQHNGKLDRRPPPMTMDG----------ESGMFADSISLSFWGQDRVGSWERSLTRWIHSFPILGEIEDLLLLFFYELVR-EKGVLSRIHQ

Query:  ------GQHLRLIRPQVQSLTDLYKRHQLTFPDWFKS
               +HLRL R   Q+  DLYK H+  FP+WF++
Subjt:  ------GQHLRLIRPQVQSLTDLYKRHQLTFPDWFKS

XP_022158896.1 uncharacterized protein LOC111025354 [Momordica charantia]; e value 6.1e-57; identity 82.03%
Query:  MSLLIPGPKSPGKEIDVYLQPLIEELKELWNNGVCTYDCVSGEYFQLHASLLWTINDFPAYGDLSCWSTKGYQACLICKEDTSSFGIRGKIAFMGHQRYL
        MSLLIPGPKSPGK+IDVYLQPLIEELKELW NGV TYDC SGEYF++HA+LLWTINDFPAYGDLS WSTKGYQAC ICKEDTSSF IRGKI+FMGH+ YL
Subjt:  MSLLIPGPKSPGKEIDVYLQPLIEELKELWNNGVCTYDCVSGEYFQLHASLLWTINDFPAYGDLSCWSTKGYQACLICKEDTSSFGIRGKIAFMGHQRYL

Query:  PENHSWRKSKQHNGKLDRRPPPMTMDGE
        PE HSWR+SK H+GK++RR PP  MDG+
Subjt:  PENHSWRKSKQHNGKLDRRPPPMTMDGE

TrEMBL top hits (columns: hit, e value, % identity, alignment)
A0A5A7SR78 DUF4216 domain-containing protein; e value 3.1e-54; identity 50%
Query:  MSLLIPGPKSPGKEIDVYLQPLIEELKELWNNGVCTYDCVSGEYFQLHASLLWTINDFPAYGDLSCWSTKGYQACLICKEDTSSFGIRGKIAFMGHQRYL
        M LLIPGPKSPG+EIDVYLQPLIEELK+LW  GV TYD ++G++FQL+A+LLWTINDFP YGDLS WSTKGYQAC IC  D SSFGIRG I+FMGH+RYL
Subjt:  MSLLIPGPKSPGKEIDVYLQPLIEELKELWNNGVCTYDCVSGEYFQLHASLLWTINDFPAYGDLSCWSTKGYQACLICKEDTSSFGIRGKIAFMGHQRYL

Query:  PENHSWRKSKQHNGKLDRRPPPMTMDGESGMFADSISLSFWGQDRVGSWERSLTRWIHSFPILGEIEDLLLLFFYELVREKGVLSRIHQGQHLRLIRPQV
        P+NH WR+S+ H+GK++R+ PP+ +         +IS                             E+   +F + ++     +S  ++ +HLRL R   
Subjt:  PENHSWRKSKQHNGKLDRRPPPMTMDGESGMFADSISLSFWGQDRVGSWERSLTRWIHSFPILGEIEDLLLLFFYELVREKGVLSRIHQGQHLRLIRPQV

Query:  QSLTDLYKRHQLTFPDWFKS
        Q+  DLYK H+  FP+WF++
Subjt:  QSLTDLYKRHQLTFPDWFKS

A0A5A7T9T1 Uncharacterized protein; e value 7.3e-56; identity 76.56%
Query:  MSLLIPGPKSPGKEIDVYLQPLIEELKELWNNGVCTYDCVSGEYFQLHASLLWTINDFPAYGDLSCWSTKGYQACLICKEDTSSFGIRGKIAFMGHQRYL
        MS L+PGPKSPGKE+DVYLQPLI+ELKELWNNGV T+DC+  EYF+LHA LLWTINDFPAYGDLS WSTKGYQAC  CKEDTSSFGI+GKI+FMGH+RYL
Subjt:  MSLLIPGPKSPGKEIDVYLQPLIEELKELWNNGVCTYDCVSGEYFQLHASLLWTINDFPAYGDLSCWSTKGYQACLICKEDTSSFGIRGKIAFMGHQRYL

Query:  PENHSWRKSKQHNGKLDRRPPPMTMDGE
          +H WR+S+QH+GK +RRPPP+ M+G+
Subjt:  PENHSWRKSKQHNGKLDRRPPPMTMDGE

A0A5A7TRX4 DUF4216 domain-containing protein; e value 3.2e-72; identity 30.56%
Query:  MSLLIPGPKSPGKEIDVYLQPLIEELKELWNNGVCTYDCVSGEYFQLHASLLWTINDFPAYGDLSCWSTKGYQACLICKEDTSSFGIRGKIAFMGHQRYL
        MSLL+PGPKSPGKE+DVYLQPLI+ELKELWNNGV T+DC   EYF+LHA LLWTINDFPAYGDLS WSTKGYQAC  CKEDTSSFGI+GKI+FMGH+RYL
Subjt:  MSLLIPGPKSPGKEIDVYLQPLIEELKELWNNGVCTYDCVSGEYFQLHASLLWTINDFPAYGDLSCWSTKGYQACLICKEDTSSFGIRGKIAFMGHQRYL

Query:  PENHSWRKSKQHNGKLDRRPPPMTMDGESGMFADSISLSF------WGQDRVGSWERSLTRWI---------HSFPILGEI-------EDLLLLFFYELV
          +H WR+S+QH+GK +RRPPP+ M+G+  +     S++F         +R    + S+ R +          +   +GE        ++  +  +Y L 
Subjt:  PENHSWRKSKQHNGKLDRRPPPMTMDGESGMFADSISLSF------WGQDRVGSWERSLTRWI---------HSFPILGEI-------EDLLLLFFYELV

Query:  REKGVLSRIHQGQHLRLIRPQVQSLTDLYKRHQLTFPDWFKSHKIPAIKRRGDCC-------------------VFVR----WSVVGEERTIYNEAGV--
            + S  ++ +HL LI  + + + DL++RHQL FP+WF++H + +++ RG+                      FV      S+  + R     +G+  
Subjt:  REKGVLSRIHQGQHLRLIRPQVQSLTDLYKRHQLTFPDWFKSHKIPAIKRRGDCC-------------------VFVR----WSVVGEERTIYNEAGV--

Query:  --------------------------------------TDTREGL-----------TSRYW---------SRSVDT---------ENMSAVRRVQLR---
                                              TD ++             TSR+W         +++V            N   V+ VQ +   
Subjt:  --------------------------------------TDTREGL-----------TSRYW---------SRSVDT---------ENMSAVRRVQLR---

Query:  ------KFQRSRGEEAAAFSFVGASLAKNG----------QVYNEGEKTGWDQLRDAAVEYDNDDLDEFGPPSSTGETSVGNTTTRSYSRNIELD-----
              + +  R E   A S +G   + +            V +  E    DQ R    ++ ND+ ++     S G         R Y RNIELD     
Subjt:  ------KFQRSRGEEAAAFSFVGASLAKNG----------QVYNEGEKTGWDQLRDAAVEYDNDDLDEFGPPSSTGETSVGNTTTRSYSRNIELD-----

Query:  -----------------------SNAIGIAVRESSPVRCASTRTI-------------------------------------------------RSPEVA
                               +  IG AVR + P+ C + + +                                                 +  ++ 
Subjt:  -----------------------SNAIGIAVRESSPVRCASTRTI-------------------------------------------------RSPEVA

Query:  RANPPPRLKENIEDWHFLCTKFETPEWKKLAEANRNNQSALSFNHRVGSKSFLQIREDLKLEQGREIGPIDLFERTHSRNGE-WVNQKANDAHLSLR
         A   PR +   EDW+ +C ++ET  WKK  E N+ + SA+ FNH  G+KSFLQ+R +LK ++G ++  I++F  TH R  E W N KA DA+L ++
Subjt:  RANPPPRLKENIEDWHFLCTKFETPEWKKLAEANRNNQSALSFNHRVGSKSFLQIREDLKLEQGREIGPIDLFERTHSRNGE-WVNQKANDAHLSLR

A0A6J1E2A8 uncharacterized protein LOC111025354; e value 3.0e-57; identity 82.03%
Query:  MSLLIPGPKSPGKEIDVYLQPLIEELKELWNNGVCTYDCVSGEYFQLHASLLWTINDFPAYGDLSCWSTKGYQACLICKEDTSSFGIRGKIAFMGHQRYL
        MSLLIPGPKSPGK+IDVYLQPLIEELKELW NGV TYDC SGEYF++HA+LLWTINDFPAYGDLS WSTKGYQAC ICKEDTSSF IRGKI+FMGH+ YL
Subjt:  MSLLIPGPKSPGKEIDVYLQPLIEELKELWNNGVCTYDCVSGEYFQLHASLLWTINDFPAYGDLSCWSTKGYQACLICKEDTSSFGIRGKIAFMGHQRYL

Query:  PENHSWRKSKQHNGKLDRRPPPMTMDGE
        PE HSWR+SK H+GK++RR PP  MDG+
Subjt:  PENHSWRKSKQHNGKLDRRPPPMTMDGE

A0A6J1E2A8 uncharacterized protein LOC111025354; e value 3.5e-02; identity 62.86%
Query:  RIHQGQHLRLIRPQVQSLTDLYKRHQLTFPDWFKS
        RI   QHLR++RP   S TDLY+ HQL F DWFKS
Subjt:  RIHQGQHLRLIRPQVQSLTDLYKRHQLTFPDWFKS

A0A6J1E2A8 uncharacterized protein LOC111025354; e value 2.5e-56; identity 50.63%
Query:  MSLLIPGPKSPGKEIDVYLQPLIEELKELWNNGVCTYDCVSGEYFQLHASLLWTINDFPAYGDLSCWSTKGYQACLICKEDTSSFGIRGKIAFMGHQRYL
        MSLLIP PKSPG+EIDVYLQPLIEELKELW  GV TYD ++G++FQL+A+LLWTINDFP YGDLS WSTKGYQAC IC  D SSFGIRG+I+FMGH+RYL
Subjt:  MSLLIPGPKSPGKEIDVYLQPLIEELKELWNNGVCTYDCVSGEYFQLHASLLWTINDFPAYGDLSCWSTKGYQACLICKEDTSSFGIRGKIAFMGHQRYL

Query:  PENHSWRKSKQHNGKLDRRPPPMTMDG----------ESGMFADSISLSFWGQDRVGSWERSLTRWIHSFPILGEIEDLLLLFFYELVR-EKGVLSRIHQ
        P+NH WR+S+ H GK++R+ PP+ M+G          E  + +   S+    + R  +W +        F  L     LLL    +++  EK V   + +
Subjt:  PENHSWRKSKQHNGKLDRRPPPMTMDG----------ESGMFADSISLSFWGQDRVGSWERSLTRWIHSFPILGEIEDLLLLFFYELVR-EKGVLSRIHQ

Query:  ------GQHLRLIRPQVQSLTDLYKRHQLTFPDWFKS
               +HLRL R   Q+  DLYK H+  FP+WF++
Subjt:  ------GQHLRLIRPQVQSLTDLYKRHQLTFPDWFKS
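
The % identity values listed with each hit can be related to the alignment midline, where a letter marks an identical residue, "+" marks a conservative substitution, and a space marks a mismatch or gap. The Python sketch below applies this to the short A0A6J1E2A8 alignment (e value 3.5e-02). The function name `percent_identity` is hypothetical, and dividing by the number of alignment columns is an assumption about how the reported value was derived.

```python
def percent_identity(midline, aligned_length):
    """Percentage of alignment columns whose midline character is a letter."""
    # Letters in the midline mark identical residues; '+' and spaces do not count.
    identical = sum(1 for ch in midline if ch.isalpha())
    return round(100.0 * identical / aligned_length, 2)


# Query and midline copied from the short A0A6J1E2A8 alignment.
query = "RIHQGQHLRLIRPQVQSLTDLYKRHQLTFPDWFKS"
midline = "RI   QHLR++RP   S TDLY+ HQL F DWFKS"

print(percent_identity(midline, len(query)))
```

For this alignment the midline contains 22 identical positions across 35 columns, which is consistent with the 62.86 listed for the hit.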

SwissProt top hits (columns: hit, e value, % identity, alignment)
No hits found

Arabidopsis top hits (columns: hit, e value, % identity, alignment)
AT3G30200.1 Plant transposase (Ptta/En/Spm family); e value 3.7e-04; identity 27.84%
Query:  NPPPRLKENIEDWHFLCTKFETPEWKKLAEANRNNQSALSFNHRVGSKSFLQIREDLKLEQGREIGPIDLF-ERTHSRNGEWVNQKANDAHLSLRCL
        N P ++ E  + W  L     T +W+K+ E N  NQ      H  G KSF + R+++K++ G+    ++ F E     +G +V+ +A    ++L  L
Subjt:  NPPPRLKENIEDWHFLCTKFETPEWKKLAEANRNNQSALSFNHRVGSKSFLQIREDLKLEQGREIGPIDLF-ERTHSRNGEWVNQKANDAHLSLRCL


Sequences
CDS sequence
ATGTCCTTACTCATACCCGGACCAAAATCTCCTGGTAAAGAAATTGACGTTTACTTGCAACCATTAATAGAAGAGTTAAAGGAGTTATGGAACAATGGTGTGTGCACTTA
TGACTGTGTTAGCGGTGAGTATTTTCAACTACATGCAAGTTTGTTGTGGACGATCAATGACTTTCCTGCGTATGGTGACTTATCTTGTTGGAGCACTAAGGGGTATCAAG
CATGTCTTATTTGTAAGGAGGATACATCTTCCTTTGGGATCCGAGGAAAAATTGCGTTCATGGGGCATCAACGTTATCTTCCAGAGAACCATAGTTGGCGCAAAAGTAAA
CAACACAACGGAAAGCTGGATCGTAGACCTCCTCCAATGACAATGGATGGTGAGAGTGGTATGTTCGCCGACTCAATAAGCCTATCATTTTGGGGACAAGACCGAGTGGG
GAGCTGGGAACGTAGTCTTACAAGATGGATTCACTCCTTCCCGATATTAGGTGAGATTGAGGATCTCTTGTTACTGTTTTTCTACGAATTAGTGAGGGAAAAAGGCGTTT
TGTCAAGAATTCATCAAGGGCAACACTTAAGACTAATTCGACCACAAGTACAAAGCTTGACCGACTTATATAAAAGACATCAGTTGACATTTCCTGATTGGTTCAAATCT
CATAAAATTCCAGCGATCAAGAGGCGAGGAGACTGCTGTGTTTTCGTTCGTTGGAGCGTCGTTGGCGAAGAACGGACAATCTACAACGAAGCTGGTGTTACGGACACTCG
TGAAGGACTAACTAGTCGATATTGGTCTAGATCCGTGGACACAGAAAATATGTCTGCAGTGAGAAGAGTGCAACTAAGAAAATTCCAGCGATCAAGAGGCGAAGAGGCTG
CTGCGTTTTCGTTCGTTGGAGCGTCGTTGGCGAAGAACGGTCAAGTCTACAACGAAGGGGAAAAGACGGGATGGGACCAATTGAGAGATGCAGCCGTAGAATATGATAAT
GATGATCTAGACGAGTTTGGCCCACCATCTTCCACTGGAGAGACCTCAGTTGGTAATACGACGACTCGCTCGTATTCGCGTAATATTGAATTGGACAGCAACGCAATTGG
TATTGCTGTGAGGGAATCATCTCCAGTTCGATGCGCCTCAACCCGAACAATACGAAGTCCTGAAGTGGCACGTGCAAACCCACCACCTCGGCTGAAAGAAAACATTGAAG
ATTGGCATTTTCTATGTACAAAGTTTGAGACCCCAGAGTGGAAGAAACTTGCAGAGGCCAATCGGAATAATCAATCTGCTTTGTCATTTAATCATAGGGTTGGGTCAAAG
TCTTTCCTTCAAATTCGAGAAGACTTGAAACTGGAACAAGGTCGCGAAATTGGACCAATTGACCTATTTGAGCGGACACACTCTAGAAATGGCGAGTGGGTCAATCAGAA
GGCTAATGATGCACATCTAAGTTTGCGTTGCTTGCAAGATGCTCATATTCTGGAAAGGTCTGAACCACTCGCAGGAGAGGAGATATTGGAGAAAGTGTTGGAAAACGACC
AGGCCATGTTAAAGGGCTTGGCCCAACTTCAGGAATCAGAAGCGAAGATTCACAAGGTAGAAGCAATGGTTGAGGAAGAGAGGAGTGCAAGACTCGAAACGAAAGCTGAG
TTGGGAACTTGA
mRNA sequence
ATGTCCTTACTCATACCCGGACCAAAATCTCCTGGTAAAGAAATTGACGTTTACTTGCAACCATTAATAGAAGAGTTAAAGGAGTTATGGAACAATGGTGTGTGCACTTA
TGACTGTGTTAGCGGTGAGTATTTTCAACTACATGCAAGTTTGTTGTGGACGATCAATGACTTTCCTGCGTATGGTGACTTATCTTGTTGGAGCACTAAGGGGTATCAAG
CATGTCTTATTTGTAAGGAGGATACATCTTCCTTTGGGATCCGAGGAAAAATTGCGTTCATGGGGCATCAACGTTATCTTCCAGAGAACCATAGTTGGCGCAAAAGTAAA
CAACACAACGGAAAGCTGGATCGTAGACCTCCTCCAATGACAATGGATGGTGAGAGTGGTATGTTCGCCGACTCAATAAGCCTATCATTTTGGGGACAAGACCGAGTGGG
GAGCTGGGAACGTAGTCTTACAAGATGGATTCACTCCTTCCCGATATTAGGTGAGATTGAGGATCTCTTGTTACTGTTTTTCTACGAATTAGTGAGGGAAAAAGGCGTTT
TGTCAAGAATTCATCAAGGGCAACACTTAAGACTAATTCGACCACAAGTACAAAGCTTGACCGACTTATATAAAAGACATCAGTTGACATTTCCTGATTGGTTCAAATCT
CATAAAATTCCAGCGATCAAGAGGCGAGGAGACTGCTGTGTTTTCGTTCGTTGGAGCGTCGTTGGCGAAGAACGGACAATCTACAACGAAGCTGGTGTTACGGACACTCG
TGAAGGACTAACTAGTCGATATTGGTCTAGATCCGTGGACACAGAAAATATGTCTGCAGTGAGAAGAGTGCAACTAAGAAAATTCCAGCGATCAAGAGGCGAAGAGGCTG
CTGCGTTTTCGTTCGTTGGAGCGTCGTTGGCGAAGAACGGTCAAGTCTACAACGAAGGGGAAAAGACGGGATGGGACCAATTGAGAGATGCAGCCGTAGAATATGATAAT
GATGATCTAGACGAGTTTGGCCCACCATCTTCCACTGGAGAGACCTCAGTTGGTAATACGACGACTCGCTCGTATTCGCGTAATATTGAATTGGACAGCAACGCAATTGG
TATTGCTGTGAGGGAATCATCTCCAGTTCGATGCGCCTCAACCCGAACAATACGAAGTCCTGAAGTGGCACGTGCAAACCCACCACCTCGGCTGAAAGAAAACATTGAAG
ATTGGCATTTTCTATGTACAAAGTTTGAGACCCCAGAGTGGAAGAAACTTGCAGAGGCCAATCGGAATAATCAATCTGCTTTGTCATTTAATCATAGGGTTGGGTCAAAG
TCTTTCCTTCAAATTCGAGAAGACTTGAAACTGGAACAAGGTCGCGAAATTGGACCAATTGACCTATTTGAGCGGACACACTCTAGAAATGGCGAGTGGGTCAATCAGAA
GGCTAATGATGCACATCTAAGTTTGCGTTGCTTGCAAGATGCTCATATTCTGGAAAGGTCTGAACCACTCGCAGGAGAGGAGATATTGGAGAAAGTGTTGGAAAACGACC
AGGCCATGTTAAAGGGCTTGGCCCAACTTCAGGAATCAGAAGCGAAGATTCACAAGGTAGAAGCAATGGTTGAGGAAGAGAGGAGTGCAAGACTCGAAACGAAAGCTGAG
TTGGGAACTTGA
Protein sequence
MSLLIPGPKSPGKEIDVYLQPLIEELKELWNNGVCTYDCVSGEYFQLHASLLWTINDFPAYGDLSCWSTKGYQACLICKEDTSSFGIRGKIAFMGHQRYLPENHSWRKSK
QHNGKLDRRPPPMTMDGESGMFADSISLSFWGQDRVGSWERSLTRWIHSFPILGEIEDLLLLFFYELVREKGVLSRIHQGQHLRLIRPQVQSLTDLYKRHQLTFPDWFKS
HKIPAIKRRGDCCVFVRWSVVGEERTIYNEAGVTDTREGLTSRYWSRSVDTENMSAVRRVQLRKFQRSRGEEAAAFSFVGASLAKNGQVYNEGEKTGWDQLRDAAVEYDN
DDLDEFGPPSSTGETSVGNTTTRSYSRNIELDSNAIGIAVRESSPVRCASTRTIRSPEVARANPPPRLKENIEDWHFLCTKFETPEWKKLAEANRNNQSALSFNHRVGSK
SFLQIREDLKLEQGREIGPIDLFERTHSRNGEWVNQKANDAHLSLRCLQDAHILERSEPLAGEEILEKVLENDQAMLKGLAQLQESEAKIHKVEAMVEEERSARLETKAE
LGT
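
The protein sequence corresponds to a codon-by-codon translation of the CDS sequence. A minimal Python sketch of that relationship follows, assuming the standard genetic code; the record does not say which translation table or tool was used, and the names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Standard genetic code with codons enumerated in T, C, A, G order
# (assumption: the record uses the standard table).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def translate(cds):
    """Translate a CDS into protein, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks from wrapped sequence
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # 'X' for unrecognized codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First six codons and last four codons of the CDS shown above.
print(translate("ATGTCCTTACTCATACCC"))
print(translate("TTGGGAACTTGA"))
```

The two fragments translate to MSLLIP and LGT, matching the start and end of the protein sequence. Passing the full CDS as a single string (line breaks are ignored) would translate the whole record the same way.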