; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg036001 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg036001
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionTransposase, Ptta/En/Spm, plant
Genome locationscaffold5:25134300..25146271
RNA-Seq ExpressionSpg036001
SyntenySpg036001
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KDO39328.1 hypothetical protein CISIN_1g041788mg, partial [Citrus sinensis]4.6e-2532.17Show/hide
Query:  GIDTKRIIEATGNRVSISWTPQQGRSVRRVVSMFSTEIGILARGYIPMKYAKKKDIPDEVMTKIVERLLNKFDVDYSQPHIKKFIRREIGARFNDYRSRL
        G+D  R+++A   R+ IS+T ++GR +    S F+ EIG+  R + P++Y     IPDE    + E+LL KF+ D S PHI  ++   +  R++D+R   
Subjt:  GIDTKRIIEATGNRVSISWTPQQGRSVRRVVSMFSTEIGILARGYIPMKYAKKKDIPDEVMTKIVERLLNKFDVDYSQPHIKKFIRREIGARFNDYRSRL

Query:  HQYYKKIGDPVAARERPHKDISPQDWQILCDKWEDPKWKRQEDDTYVSPIEVFRRTHWSEAKGWTDDAASEAHEKMVALTQEQ-ASSSTPMDADEVMATV
        H+++KK   P  AR+  ++  S QDW  LC+ +E  ++KR+      + IE+F  TH+S  KGW++  A   +EKM  + QE  A   TP+   E++  V
Subjt:  HQYYKKIGDPVAARERPHKDISPQDWQILCDKWEDPKWKRQEDDTYVSPIEVFRRTHWSEAKGWTDDAASEAHEKMVALTQEQ-ASSSTPMDADEVMATV

Query:  LGK-------RPSSEECLENQVREKKESEARVKEQMKEEMAELLAMQRQDYLRKRRLM
        +GK       RP+              S ++   Q++ EM  ++  Q + Y  K R+M
Subjt:  LGK-------RPSSEECLENQVREKKESEARVKEQMKEEMAELLAMQRQDYLRKRRLM

XP_022159083.1 uncharacterized protein LOC111025525 [Momordica charantia]2.5e-3145.62Show/hide
Query:  DEADEGVSSNSSATRGVSRGIDTKRIIEATGNRVSISWTPQQGRSVRRVVSMFSTEIGILARGYIPMKYAKKKDIPDEVMTKIVERLLNKFDVDYSQPHI
        D   E   +  +  +G +RG    R++   G ++ + WT QQGR V    + F++EIG LAR YI  K AKKK+I      KI++ LL KF VD SQPH+
Subjt:  DEADEGVSSNSSATRGVSRGIDTKRIIEATGNRVSISWTPQQGRSVRRVVSMFSTEIGILARGYIPMKYAKKKDIPDEVMTKIVERLLNKFDVDYSQPHI

Query:  KKFIRREIGARFNDYRSRLHQYYKKIGDPVAARERPHKDISPQDWQILCDKWEDPKWKRQ
         ++I  EIG RF DYR++LH++YKK  DP  AR++P+KDI  + W ILCD+WE P WK +
Subjt:  KKFIRREIGARFNDYRSRLHQYYKKIGDPVAARERPHKDISPQDWQILCDKWEDPKWKRQ

XP_038895319.1 uncharacterized protein LOC120083572 isoform X1 [Benincasa hispida]3.1e-4237.66Show/hide
Query:  ENIRGVDVNP--IIVSESSDISEKMLNLEEQILEEEDD--EDEEEGEEDE-----ADEGVSSNSSATRGVSRGIDTKRIIEATGNRVSISWTPQQGRSVR
        E++  +D  P  II+ E  +   K  N+   +LE++      E +G  D+         ++      RG SRG+   +   AT  R+ ++WTP QG+ + 
Subjt:  ENIRGVDVNP--IIVSESSDISEKMLNLEEQILEEEDD--EDEEEGEEDE-----ADEGVSSNSSATRGVSRGIDTKRIIEATGNRVSISWTPQQGRSVR

Query:  RVVSMFSTEIGILARGYIPMKYAKKKDIPDEVMTKIVERLLNKFDVDYSQPHIKKFIRREIGARFNDYRSRLHQYYKKIGDPVAARERPHKDISPQDWQI
         + S+F+ EIG+L R +IP+KY K+KDIP+E+   + E+LLN+FDVD SQPHIK++I  EIG RF DYR  L+++Y+K  DPV AR  P+K  +  DW I
Subjt:  RVVSMFSTEIGILARGYIPMKYAKKKDIPDEVMTKIVERLLNKFDVDYSQPHIKKFIRREIGARFNDYRSRLHQYYKKIGDPVAARERPHKDISPQDWQI

Query:  LCDKWEDPKWKR-------------------------------QEDDTYVSPIEVFRRTHWSEAKGWTDDAASEAHEKMVALTQEQASSSTPMDADEVMA
        LCD+WE   WK                                +ED TY+S IEVF  TH S +KGW D AA EA+E M+ L + +   +     +E++ 
Subjt:  LCDKWEDPKWKR-------------------------------QEDDTYVSPIEVFRRTHWSEAKGWTDDAASEAHEKMVALTQEQASSSTPMDADEVMA

Query:  TVLGKRPS
         VLGKR S
Subjt:  TVLGKRPS

XP_038895320.1 uncharacterized protein LOC120083572 isoform X2 [Benincasa hispida]5.6e-3938.41Show/hide
Query:  ENIRGVDVNP--IIVSESSDISEKMLNLEEQILEEEDD--EDEEEGEEDE-----ADEGVSSNSSATRGVSRGIDTKRIIEATGNRVSISWTPQQGRSVR
        E++  +D  P  II+ E  +   K  N+   +LE++      E +G  D+         ++      RG SRG+   +   AT  R+ ++WTP QG+ + 
Subjt:  ENIRGVDVNP--IIVSESSDISEKMLNLEEQILEEEDD--EDEEEGEEDE-----ADEGVSSNSSATRGVSRGIDTKRIIEATGNRVSISWTPQQGRSVR

Query:  RVVSMFSTEIGILARGYIPMKYAKKKDIPDEVMTKIVERLLNKFDVDYSQPHIKKFIRREIGARFNDYRSRLHQYYKKIGDPVAARERPHKDISPQDWQI
         + S+F+ EIG+L R +IP+KY K+KDIP+E+   + E+LLN+FDVD SQPHIK++I  EIG RF DYR  L+++Y+K  DPV AR  P+K  +  DW I
Subjt:  RVVSMFSTEIGILARGYIPMKYAKKKDIPDEVMTKIVERLLNKFDVDYSQPHIKKFIRREIGARFNDYRSRLHQYYKKIGDPVAARERPHKDISPQDWQI

Query:  LCDKWEDPKWKR-------------------------------QEDDTYVSPIEVFRRTHWSEAKGWTDDAASEAH
        LCD+WE   WK                                +ED TY+S IEVF  TH S +KGW D AA EA+
Subjt:  LCDKWEDPKWKR-------------------------------QEDDTYVSPIEVFRRTHWSEAKGWTDDAASEAH

XP_038895321.1 uncharacterized protein LOC120083572 isoform X3 [Benincasa hispida]5.6e-3938.41Show/hide
Query:  ENIRGVDVNP--IIVSESSDISEKMLNLEEQILEEEDD--EDEEEGEEDE-----ADEGVSSNSSATRGVSRGIDTKRIIEATGNRVSISWTPQQGRSVR
        E++  +D  P  II+ E  +   K  N+   +LE++      E +G  D+         ++      RG SRG+   +   AT  R+ ++WTP QG+ + 
Subjt:  ENIRGVDVNP--IIVSESSDISEKMLNLEEQILEEEDD--EDEEEGEEDE-----ADEGVSSNSSATRGVSRGIDTKRIIEATGNRVSISWTPQQGRSVR

Query:  RVVSMFSTEIGILARGYIPMKYAKKKDIPDEVMTKIVERLLNKFDVDYSQPHIKKFIRREIGARFNDYRSRLHQYYKKIGDPVAARERPHKDISPQDWQI
         + S+F+ EIG+L R +IP+KY K+KDIP+E+   + E+LLN+FDVD SQPHIK++I  EIG RF DYR  L+++Y+K  DPV AR  P+K  +  DW I
Subjt:  RVVSMFSTEIGILARGYIPMKYAKKKDIPDEVMTKIVERLLNKFDVDYSQPHIKKFIRREIGARFNDYRSRLHQYYKKIGDPVAARERPHKDISPQDWQI

Query:  LCDKWEDPKWKR-------------------------------QEDDTYVSPIEVFRRTHWSEAKGWTDDAASEAH
        LCD+WE   WK                                +ED TY+S IEVF  TH S +KGW D AA EA+
Subjt:  LCDKWEDPKWKR-------------------------------QEDDTYVSPIEVFRRTHWSEAKGWTDDAASEAH

TrEMBL top hitse value%identityAlignment
A0A067D926 Uncharacterized protein (Fragment)2.2e-2532.17Show/hide
Query:  GIDTKRIIEATGNRVSISWTPQQGRSVRRVVSMFSTEIGILARGYIPMKYAKKKDIPDEVMTKIVERLLNKFDVDYSQPHIKKFIRREIGARFNDYRSRL
        G+D  R+++A   R+ IS+T ++GR +    S F+ EIG+  R + P++Y     IPDE    + E+LL KF+ D S PHI  ++   +  R++D+R   
Subjt:  GIDTKRIIEATGNRVSISWTPQQGRSVRRVVSMFSTEIGILARGYIPMKYAKKKDIPDEVMTKIVERLLNKFDVDYSQPHIKKFIRREIGARFNDYRSRL

Query:  HQYYKKIGDPVAARERPHKDISPQDWQILCDKWEDPKWKRQEDDTYVSPIEVFRRTHWSEAKGWTDDAASEAHEKMVALTQEQ-ASSSTPMDADEVMATV
        H+++KK   P  AR+  ++  S QDW  LC+ +E  ++KR+      + IE+F  TH+S  KGW++  A   +EKM  + QE  A   TP+   E++  V
Subjt:  HQYYKKIGDPVAARERPHKDISPQDWQILCDKWEDPKWKRQEDDTYVSPIEVFRRTHWSEAKGWTDDAASEAHEKMVALTQEQ-ASSSTPMDADEVMATV

Query:  LGK-------RPSSEECLENQVREKKESEARVKEQMKEEMAELLAMQRQDYLRKRRLM
        +GK       RP+              S ++   Q++ EM  ++  Q + Y  K R+M
Subjt:  LGK-------RPSSEECLENQVREKKESEARVKEQMKEEMAELLAMQRQDYLRKRRLM

A0A438FRA2 Uncharacterized protein6.2e-2025.59Show/hide
Query:  DEGVSSNSSATRGVSRGIDTKRIIEATGNRV--SISWTPQQGRSVRRVVSMFSTEIGILARGYIPMKYAKKKDIPDEVMTKIVERLLNKFDVDYSQPHIK
        D   ++     RG SRG+ T ++I+A G+R    +   P+         S   +EIG + R Y P+   K  DI +     ++E+L  KF +D++Q H+K
Subjt:  DEGVSSNSSATRGVSRGIDTKRIIEATGNRV--SISWTPQQGRSVRRVVSMFSTEIGILARGYIPMKYAKKKDIPDEVMTKIVERLLNKFDVDYSQPHIK

Query:  KFIRREIGARFNDYRSRLHQYYKKIGDPVAARERPHKDISPQ-DWQILCDKWEDPKWKRQEDDTYVSPIEVFRRTHWSEAKGWTDDAASEAHEKMVALTQ
        K I  ++ AR+ D+R++ H+++KK      A++ P+K +S Q  W  LCD++   K++ +        IE+F+  HW+   GW +  A  ++EKM+ L +
Subjt:  KFIRREIGARFNDYRSRLHQYYKKIGDPVAARERPHKDISPQ-DWQILCDKWEDPKWKRQEDDTYVSPIEVFRRTHWSEAKGWTDDAASEAHEKMVALTQ

Query:  EQA-SSSTPMDADEVMATVLGKRPSSEECLENQVREKKESEARVKEQMKEEMAELLAMQRQDYLRKRRLMGFVATCDVWFVRRKLRRLTGFVASLRR
        +        +   E+   VLG R    + L +  R    S     +  +E       ++ ++ L+  R +       +   + ++ +L  FV+ +R+
Subjt:  EQA-SSSTPMDADEVMATVLGKRPSSEECLENQVREKKESEARVKEQMKEEMAELLAMQRQDYLRKRRLMGFVATCDVWFVRRKLRRLTGFVASLRR

A0A438HJV7 Uncharacterized protein1.3e-2234.6Show/hide
Query:  RGVSRGIDTKRIIEATGNRVSISWTPQQGRSVRRVVS-MFSTEIGILARGYIPMKYAKKKDIPDEVMTKIVERLLNKFDVDYSQPHIKKFIRREIGARFN
        RG +RG  T  I++ +G ++ + +  + GR      S  FS E G L R + P+++ K   I       +V+RLL+KFD+D S+ +I K++   +G R+ 
Subjt:  RGVSRGIDTKRIIEATGNRVSISWTPQQGRSVRRVVS-MFSTEIGILARGYIPMKYAKKKDIPDEVMTKIVERLLNKFDVDYSQPHIKKFIRREIGARFN

Query:  DYRSRLHQYYKKIGDPVAARERPHKDISPQDWQILCDKWEDPKWKRQEDDTYVSPIEVFRRTHWSEAKGWTDDAASEAHEKMVALTQEQAS--SSTPMDA
         YR  LHQ++K+      AR  P  D++ +DW  LC+ +   K+K          +E+FR TH++E+KGW +D A   +E+M  L QEQ++    TP+  
Subjt:  DYRSRLHQYYKKIGDPVAARERPHKDISPQDWQILCDKWEDPKWKRQEDDTYVSPIEVFRRTHWSEAKGWTDDAASEAHEKMVALTQEQAS--SSTPMDA

Query:  DEVMATVLGKR
         E+   VLGKR
Subjt:  DEVMATVLGKR

A0A443P6A9 Transposase, Ptta/En/Spm, plant2.3e-2227.39Show/hide
Query:  EDEADEGVSSNSSATRGVSRGIDTKRIIEATGNRVSISWTPQQGRSVRRVVSMFSTEIGILARGYIPMKYAKKKDIPDEVMTKIVERLLNKFDVDYSQPH
        E EA    S+++   RG S+ I   ++I+ TG ++ I       R V +    F TE+GI+ R Y P+       +  E      +R+L+KFD+D   P 
Subjt:  EDEADEGVSSNSSATRGVSRGIDTKRIIEATGNRVSISWTPQQGRSVRRVVSMFSTEIGILARGYIPMKYAKKKDIPDEVMTKIVERLLNKFDVDYSQPH

Query:  IKKFIRREIGARFNDYRSRLHQYYKKIGDPVAARERPHKDISPQDWQILCDKWEDPKWKR----------------------------QEDDTYVSPIEV
        +KK I   + + F +YR+RLH +YK++G+   A ++P++ +S +DW++ C+++   ++++                            +  D  +S IE+
Subjt:  IKKFIRREIGARFNDYRSRLHQYYKKIGDPVAARERPHKDISPQDWQILCDKWEDPKWKR----------------------------QEDDTYVSPIEV

Query:  FRRTHWSEAKGWTDDAASEAHEKMVAL-TQEQASSSTPMDADEVMATVLGKR--------------PSSEECLENQVR-EKKESEARVKEQMKEEMAELL
        + +TH+S+ KGW+       +EKM+ L +Q     S P+  DE+   VLG R              PSS   L N  R E+    A   E+  +E+AE L
Subjt:  FRRTHWSEAKGWTDDAASEAHEKMVAL-TQEQASSSTPMDADEVMATVLGKR--------------PSSEECLENQVR-EKKESEARVKEQMKEEMAELL

Query:  AMQRQDYLRKRRLM
                 +R+ M
Subjt:  AMQRQDYLRKRRLM

A0A6J1DXU5 uncharacterized protein LOC1110255251.2e-3145.62Show/hide
Query:  DEADEGVSSNSSATRGVSRGIDTKRIIEATGNRVSISWTPQQGRSVRRVVSMFSTEIGILARGYIPMKYAKKKDIPDEVMTKIVERLLNKFDVDYSQPHI
        D   E   +  +  +G +RG    R++   G ++ + WT QQGR V    + F++EIG LAR YI  K AKKK+I      KI++ LL KF VD SQPH+
Subjt:  DEADEGVSSNSSATRGVSRGIDTKRIIEATGNRVSISWTPQQGRSVRRVVSMFSTEIGILARGYIPMKYAKKKDIPDEVMTKIVERLLNKFDVDYSQPHI

Query:  KKFIRREIGARFNDYRSRLHQYYKKIGDPVAARERPHKDISPQDWQILCDKWEDPKWKRQ
         ++I  EIG RF DYR++LH++YKK  DP  AR++P+KDI  + W ILCD+WE P WK +
Subjt:  KKFIRREIGARFNDYRSRLHQYYKKIGDPVAARERPHKDISPQDWQILCDKWEDPKWKRQ

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCTCACCTCAATAGCTGAACAAGTTTATCGGCTAGAGGAATTAATTGATCCTCCTCCAGCGCCCGTTAAACCAATGTTCAGTCCTCTCGAGCCAGTCAATCTAAGAAC
TCCAAAGACAAGACTTTTGGAGGAAATTGACAGAAAGCTATCTGCCATATCAAAAGGTGAAAGCTCTAATCAGCTTAACACCCTTGATGATACAGAAAGTGAGGATGTTG
ATTCAATAATTGACCAGCTTAAAGGTCTTCACTTAGAAGACCCTCCATCTCTAAACAAACTGGGTTATAATCCCAGTGATAGAGTAAACTGGTACCAAAGATCAATTGGT
GGTGTCCCAATTGTTGACGCTTCGTCCAAAATCAAGAAAAGGTGTCCGTTGAAGTATTCAGAGTCGTCAACGACGATTAAGCTGCGGTTCAGCGACTTTCCGACCGGTTC
AAGCAGTTTTGGAGCTGGTTCATGCGGTCTGAAGCTGGTTCGGGCTGGTCCTGCGACGTCAATTGATTTACGTCGCAAGCGTGCGACGAAAATAACATTTCGTCGCATTA
TAGAAATTCGCGCACCCTTTTGGAATTTTTGGCAGCCCAGATATGCGACGAAAGGCACATTTGCGTCGCTGATCTGCGAAGAAAACCGTGTTCGTTGCAAACTCGGAAAG
CACATGAAAAATGGGCAGCCCAGATCTGCGACGAAAGACCCAATTTCGTCGCAGATCTGCAATGAAAATCATGTTTCGTCGCAAGGGTTAAAACCCACGTCGAATGGCGC
CTTTTTTGGCAGCCATATCTGCGACGAAAATCGTTTTTTCGACGCAGATCTGCAACGAAATTGGAATAGAAGGGGAGGTGTGGGTTTTGTGCATGCCTCAAATCGCTTGA
AAACACGGGAACCCGACCTCAAACCGCCGGAAAACATACGTGGAGTTGATGTGAATCCGATCATAGTTTCTGAAAGTAGTGATATCAGTGAAAAAATGTTGAATTTGGAA
GAACAAATTTTAGAAGAGGAAGATGATGAAGATGAGGAAGAAGGTGAAGAAGATGAAGCAGATGAAGGGGTATCGTCTAATAGTAGTGCAACGCGTGGAGTGTCACGTGG
AATTGACACAAAAAGAATTATTGAGGCTACTGGAAACAGAGTAAGTATTTCATGGACTCCTCAGCAAGGCAGATCGGTGAGAAGAGTTGTCAGTATGTTTAGCACTGAAA
TTGGCATTTTGGCAAGAGGGTATATCCCCATGAAGTATGCAAAGAAGAAAGACATTCCAGATGAAGTTATGACTAAGATCGTAGAACGACTATTAAATAAATTTGATGTG
GACTACTCGCAGCCGCATATTAAGAAGTTCATTCGTCGCGAGATTGGTGCCCGATTTAATGATTACAGATCTAGGTTACACCAATATTACAAAAAGATTGGTGATCCAGT
TGCAGCTCGTGAACGTCCACATAAGGACATTTCTCCTCAGGACTGGCAGATATTATGTGACAAATGGGAGGATCCAAAATGGAAGAGACAAGAAGACGATACATACGTGA
GCCCCATAGAAGTCTTCCGTCGAACTCATTGGTCCGAAGCAAAGGGATGGACTGATGATGCAGCAAGTGAAGCACATGAAAAAATGGTAGCGTTGACCCAAGAGCAGGCC
AGCTCGAGTACACCAATGGATGCTGATGAAGTTATGGCTACTGTTCTCGGAAAAAGACCATCGTCCGAAGAATGTTTGGAGAACCAAGTACGAGAAAAGAAAGAATCAGA
GGCTCGAGTGAAAGAACAAATGAAGGAAGAAATGGCAGAACTATTGGCCATGCAGCGTCAAGATTATCTGCGTAAGCGACGTTTAATGGGTTTCGTCGCAACTTGCGACG
TTTGGTTCGTTCGTCGCAAGTTGCGACGTTTAACTGGTTTCGTCGCAAGTTTGCGACGTTTATTGCTTTCGTCGCAACTTGTGACGTTTGATTGCTTTCGTCGCACCTTG
CGACGAAAATTACGAAAAACGGAGCTGATGATCGTCTGCGACACGCCTTTTGCGACGCTGTCCTGGCGCAACCATCCCTTCGGAGGGCCTGATCATAGGATTCAGAACAC
CGTGAACTCCTGA
mRNA sequenceShow/hide mRNA sequence
ATGCTCACCTCAATAGCTGAACAAGTTTATCGGCTAGAGGAATTAATTGATCCTCCTCCAGCGCCCGTTAAACCAATGTTCAGTCCTCTCGAGCCAGTCAATCTAAGAAC
TCCAAAGACAAGACTTTTGGAGGAAATTGACAGAAAGCTATCTGCCATATCAAAAGGTGAAAGCTCTAATCAGCTTAACACCCTTGATGATACAGAAAGTGAGGATGTTG
ATTCAATAATTGACCAGCTTAAAGGTCTTCACTTAGAAGACCCTCCATCTCTAAACAAACTGGGTTATAATCCCAGTGATAGAGTAAACTGGTACCAAAGATCAATTGGT
GGTGTCCCAATTGTTGACGCTTCGTCCAAAATCAAGAAAAGGTGTCCGTTGAAGTATTCAGAGTCGTCAACGACGATTAAGCTGCGGTTCAGCGACTTTCCGACCGGTTC
AAGCAGTTTTGGAGCTGGTTCATGCGGTCTGAAGCTGGTTCGGGCTGGTCCTGCGACGTCAATTGATTTACGTCGCAAGCGTGCGACGAAAATAACATTTCGTCGCATTA
TAGAAATTCGCGCACCCTTTTGGAATTTTTGGCAGCCCAGATATGCGACGAAAGGCACATTTGCGTCGCTGATCTGCGAAGAAAACCGTGTTCGTTGCAAACTCGGAAAG
CACATGAAAAATGGGCAGCCCAGATCTGCGACGAAAGACCCAATTTCGTCGCAGATCTGCAATGAAAATCATGTTTCGTCGCAAGGGTTAAAACCCACGTCGAATGGCGC
CTTTTTTGGCAGCCATATCTGCGACGAAAATCGTTTTTTCGACGCAGATCTGCAACGAAATTGGAATAGAAGGGGAGGTGTGGGTTTTGTGCATGCCTCAAATCGCTTGA
AAACACGGGAACCCGACCTCAAACCGCCGGAAAACATACGTGGAGTTGATGTGAATCCGATCATAGTTTCTGAAAGTAGTGATATCAGTGAAAAAATGTTGAATTTGGAA
GAACAAATTTTAGAAGAGGAAGATGATGAAGATGAGGAAGAAGGTGAAGAAGATGAAGCAGATGAAGGGGTATCGTCTAATAGTAGTGCAACGCGTGGAGTGTCACGTGG
AATTGACACAAAAAGAATTATTGAGGCTACTGGAAACAGAGTAAGTATTTCATGGACTCCTCAGCAAGGCAGATCGGTGAGAAGAGTTGTCAGTATGTTTAGCACTGAAA
TTGGCATTTTGGCAAGAGGGTATATCCCCATGAAGTATGCAAAGAAGAAAGACATTCCAGATGAAGTTATGACTAAGATCGTAGAACGACTATTAAATAAATTTGATGTG
GACTACTCGCAGCCGCATATTAAGAAGTTCATTCGTCGCGAGATTGGTGCCCGATTTAATGATTACAGATCTAGGTTACACCAATATTACAAAAAGATTGGTGATCCAGT
TGCAGCTCGTGAACGTCCACATAAGGACATTTCTCCTCAGGACTGGCAGATATTATGTGACAAATGGGAGGATCCAAAATGGAAGAGACAAGAAGACGATACATACGTGA
GCCCCATAGAAGTCTTCCGTCGAACTCATTGGTCCGAAGCAAAGGGATGGACTGATGATGCAGCAAGTGAAGCACATGAAAAAATGGTAGCGTTGACCCAAGAGCAGGCC
AGCTCGAGTACACCAATGGATGCTGATGAAGTTATGGCTACTGTTCTCGGAAAAAGACCATCGTCCGAAGAATGTTTGGAGAACCAAGTACGAGAAAAGAAAGAATCAGA
GGCTCGAGTGAAAGAACAAATGAAGGAAGAAATGGCAGAACTATTGGCCATGCAGCGTCAAGATTATCTGCGTAAGCGACGTTTAATGGGTTTCGTCGCAACTTGCGACG
TTTGGTTCGTTCGTCGCAAGTTGCGACGTTTAACTGGTTTCGTCGCAAGTTTGCGACGTTTATTGCTTTCGTCGCAACTTGTGACGTTTGATTGCTTTCGTCGCACCTTG
CGACGAAAATTACGAAAAACGGAGCTGATGATCGTCTGCGACACGCCTTTTGCGACGCTGTCCTGGCGCAACCATCCCTTCGGAGGGCCTGATCATAGGATTCAGAACAC
CGTGAACTCCTGA
Protein sequenceShow/hide protein sequence
MLTSIAEQVYRLEELIDPPPAPVKPMFSPLEPVNLRTPKTRLLEEIDRKLSAISKGESSNQLNTLDDTESEDVDSIIDQLKGLHLEDPPSLNKLGYNPSDRVNWYQRSIG
GVPIVDASSKIKKRCPLKYSESSTTIKLRFSDFPTGSSSFGAGSCGLKLVRAGPATSIDLRRKRATKITFRRIIEIRAPFWNFWQPRYATKGTFASLICEENRVRCKLGK
HMKNGQPRSATKDPISSQICNENHVSSQGLKPTSNGAFFGSHICDENRFFDADLQRNWNRRGGVGFVHASNRLKTREPDLKPPENIRGVDVNPIIVSESSDISEKMLNLE
EQILEEEDDEDEEEGEEDEADEGVSSNSSATRGVSRGIDTKRIIEATGNRVSISWTPQQGRSVRRVVSMFSTEIGILARGYIPMKYAKKKDIPDEVMTKIVERLLNKFDV
DYSQPHIKKFIRREIGARFNDYRSRLHQYYKKIGDPVAARERPHKDISPQDWQILCDKWEDPKWKRQEDDTYVSPIEVFRRTHWSEAKGWTDDAASEAHEKMVALTQEQA
SSSTPMDADEVMATVLGKRPSSEECLENQVREKKESEARVKEQMKEEMAELLAMQRQDYLRKRRLMGFVATCDVWFVRRKLRRLTGFVASLRRLLLSSQLVTFDCFRRTL
RRKLRKTELMIVCDTPFATLSWRNHPFGGPDHRIQNTVNS