; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0042163 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0042163
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRetrotransposon gag protein
Genome locationchr13:37644977..37647694
RNA-Seq ExpressionLag0042163
SyntenyLag0042163
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
CAE1312874.1 unnamed protein product [Sepia pharaonis]2.3e-0629.39Show/hide
Query:  WSFFSLSSKVLTRFVAVPSLQGRRFSRAALQFFLPKFEGSRTSLQFLLPNL--RVLTRFAAVPSSKFEGSHVALLEFLPPS------SKVLTCFAAVSSS
        +SF  L S++L RF+  P L      R  L+F L +F     S  F  P L  R+L RF   P+S+F  S  + +EF   S      S++L  F     S
Subjt:  WSFFSLSSKVLTRFVAVPSLQGRRFSRAALQFFLPKFEGSRTSLQFLLPNL--RVLTRFAAVPSSKFEGSHVALLEFLPPS------SKVLTCFAAVSSS

Query:  KFEGSHVLRCNSF-LRVRRFSRASLQFLPPSSKVLTRFVAVP---SLQDRSSFLQVRRFSRAPLQFLPPSLKVLTSLHYSSFLQVRRFSHASLQFLLSKV
        +F  S+ L+ +SF L V R S     F     ++L  F   P   SL+  S  L V R       F P S KV + L   SFL         L+FLLS+ 
Subjt:  KFEGSHVLRCNSF-LRVRRFSRASLQFLPPSSKVLTRFVAVP---SLQDRSSFLQVRRFSRAPLQFLPPSLKVLTSLHYSSFLQVRRFSHASLQFLLSKV

Query:  EGPHALRCSSFSPSSKV-HALRFSSFSQIRFTAVP------SSKFEGSPLL---------------QFIPPSSKVLTCSAA----VPSSKFEGSHVASLQ
             L     S  S V ++L+ SSF  +  ++ P      S K    PLL               +F+  SS  L  S      + SS+F         
Subjt:  EGPHALRCSSFSPSSKV-HALRFSSFSQIRFTAVP------SSKFEGSPLL---------------QFIPPSSKVLTCSAA----VPSSKFEGSHVASLQ

Query:  FLP-LSSKVLTCFVAVPSLQGRRSSSVALQFFLPKFEGSRTSLQFLLPNSKV----LTRFAAVPSSKFEGSPLLQFIPPSSKVLTCSAAVPSS----KFE
        F P L S++L    + P L  R    + L+F L +F      L  L   S++    L+RF+++ S K    PLL      S++L        S    +F 
Subjt:  FLP-LSSKVLTCFVAVPSLQGRRSSSVALQFFLPKFEGSRTSLQFLLPNSKV----LTRFAAVPSSKFEGSPLLQFIPPSSKVLTCSAAVPSS----KFE

Query:  GSHVTSLQFLPPSSKVLRCCSSFLQVRRF--SRAPLQFLPPSLKVLTSL------HYSSFLQVRRFSHASLQF--------LLSKV---EGPHALRCSSF
         S  +S++FL     +L   SSF    RF  SR  L+FL   L V   L       +SS   + RF  ++ +F        +LS V        L  SSF
Subjt:  GSHVTSLQFLPPSSKVLRCCSSFLQVRRF--SRAPLQFLPPSLKVLTSL------HYSSFLQVRRFSHASLQF--------LLSKV---EGPHALRCSSF

Query:  SPSSKVHALRF--SSFSQIRFTAVPSFKFEGSPLLQFIPPSSKVLTCSAAVPSSKFEGSHVASLQFLPLSSKVLTCFIAVPSLQGRRSSSVALQFFLPKF
           S++  LRF    FS + F+ V SF+F     L+    SS     S+ V S +F  + +    F  L S++L  F+    L    S  + L+F L +F
Subjt:  SPSSKVHALRF--SSFSQIRFTAVPSFKFEGSPLLQFIPPSSKVLTCSAAVPSSKFEGSHVASLQFLPLSSKVLTCFIAVPSLQGRRSSSVALQFFLPKF

Query:  EGSRTLLQF
          SR LL+F
Subjt:  EGSRTLLQF

KAA0051997.1 hypothetical protein E6C27_scaffold60G004810 [Cucumis melo var. makuwa]1.6e-0740.34Show/hide
Query:  MFDVHLHSTFSFPKAKCLHIEEKSTFNIYFYRLKVTSGQSKRKMDNLEMKLFDEVNNDKKLQSSIPSRMKRKFSVLINTDGSLK--LEGSSLYVVALF-L
        M +++LH  FSF K K    ++       F RLK+T  Q +R+M +L++K F E N+D K+ S +PSRMKRK SV INT+G+     +      + LF +
Subjt:  MFDVHLHSTFSFPKAKCLHIEEKSTFNIYFYRLKVTSGQSKRKMDNLEMKLFDEVNNDKKLQSSIPSRMKRKFSVLINTDGSLK--LEGSSLYVVALF-L

Query:  LQVRRFFVVCCCVVPSPSS
        LQ R F     CV P  SS
Subjt:  LQVRRFFVVCCCVVPSPSS

SSD60786.1 uncharacterized protein SCODWIG_02547 [Saccharomycodes ludwigii]7.8e-0732.33Show/hide
Query:  LLSKVEGPHALRCSSFSPSSKVHALRFSSFSQIRFTAVPSSKFEGSPLLQFIPPSSKVLTCSAAVPSSKFEGSHVASLQFLPLSSKVLTCFVAVPSLQGR
        L S V    A+  SS  PSS V  + +SS   + +++VPSS     P    I PSS  +  S+A+PSS    S V     +PLSS + +  V +PS    
Subjt:  LLSKVEGPHALRCSSFSPSSKVHALRFSSFSQIRFTAVPSSKFEGSPLLQFIPPSSKVLTCSAAVPSSKFEGSHVASLQFLPLSSKVLTCFVAVPSLQGR

Query:  RSSSVALQFFLPKFEGSRTSLQFLLPNSKVLTRFAAVPSSKFEGSPLLQFIPPSSKVLTCSAAVPSSKFEGSHVTSLQFLP-----PSSKVLRCCSSFLQ
         SSSV L   +P   G        +P+S VL   + VPSS     P    IP SS VL+ S A  S+    S V     +P     PSS ++   S    
Subjt:  RSSSVALQFFLPKFEGSRTSLQFLLPNSKVLTRFAAVPSSKFEGSPLLQFIPPSSKVLTCSAAVPSSKFEGSHVTSLQFLP-----PSSKVLRCCSSFLQ

Query:  VRRFSRAPLQFLPPSLKVLTSLHYSSFLQVRRFSHASLQFLL-SKVEGPHALRCSSFSPSSKVHALRFSSFSQIRFTAVPSFKFEGSPLLQFIPPSSKVL
        V   S  P   + PS  ++ S    S   V   S  S   +L S V    A+  SS  PSS +      S S    +A+PS    GS +      SS  +
Subjt:  VRRFSRAPLQFLPPSLKVLTSLHYSSFLQVRRFSHASLQFLL-SKVEGPHALRCSSFSPSSKVHALRFSSFSQIRFTAVPSFKFEGSPLLQFIPPSSKVL

Query:  TCSAAVPSSKFEGSHVASLQFLPLSSK-------VLTCFIA-----VPSLQGRRSSSVALQFFLP
          S+A+PSS    S      F+PLSS        VL+  +A     VPS     SSSV L   +P
Subjt:  TCSAAVPSSKFEGSHVASLQFLPLSSK-------VLTCFIA-----VPSLQGRRSSSVALQFFLP

TYK30948.1 hypothetical protein E5676_scaffold455G001730 [Cucumis melo var. makuwa]6.0e-0745.74Show/hide
Query:  MFDVHLHSTFSFPKAKCLHIEEKSTFNIYFYRLKVTSGQSKRKMDNLEMKLFDEVNNDKKLQSSIPSRMKRKFSVLINTDGSLKLEGSSLYVVA
        M +V+ HS FSF KAK LHIEE       F  LK+T+ Q +R+M  L+ K F E NND K+ S + S MKRK  V IN    L    S  ++VA
Subjt:  MFDVHLHSTFSFPKAKCLHIEEKSTFNIYFYRLKVTSGQSKRKMDNLEMKLFDEVNNDKKLQSSIPSRMKRKFSVLINTDGSLKLEGSSLYVVA

XP_005819407.1 hypothetical protein GUITHDRAFT_121393 [Guillardia theta CCMP2712]5.1e-0629.22Show/hide
Query:  LTQLRWSFFSLSSKVLTRFVAVPSLQGRRFSRAALQFFLPKFEGSRTSLQFLLPNLRVLTRFAAVPSSKFEGSHVALLEFLPPSSKVLTCFAAVSSSKFE
        LT++R S   L+S   T F  + SLQ    S   L            S++ L+ +   LT   +VP + F G  +A L++L   +  LT   +V ++ F 
Subjt:  LTQLRWSFFSLSSKVLTRFVAVPSLQGRRFSRAALQFFLPKFEGSRTSLQFLLPNLRVLTRFAAVPSSKFEGSHVALLEFLPPSSKVLTCFAAVSSSKFE

Query:  GSHVLRCNSFLRVRRFSRASLQFLPPSSKVLTR-----FVAVPSLQDRSSFLQVRRFSRAP---------LQFLPPSLKVLTSLHYSSFLQVRRFSHASL
        G                 ASLQ L  SS  LT      F  + SL  RS +L     +  P         LQ L      LTS+  + F  +     ASL
Subjt:  GSHVLRCNSFLRVRRFSRASLQFLPPSSKVLTR-----FVAVPSLQDRSSFLQVRRFSRAP---------LQFLPPSLKVLTSLHYSSFLQVRRFSHASL

Query:  QFLLSKVEGPHALRCSSFSPSSKVHALRFSSFSQIRFTAVPSSKFEGSPLLQFIPPSSKVLTCSAAVPSSKFEGSHVASLQFLPLSSKVL-----TCFVA
        Q L        ++  + F   + + +L + S++++  T+VP + F+G   LQ++  SS  LT   +VP++ F G  + SLQ L LS   L     T F  
Subjt:  QFLLSKVEGPHALRCSSFSPSSKVHALRFSSFSQIRFTAVPSSKFEGSPLLQFIPPSSKVLTCSAAVPSSKFEGSHVASLQFLPLSSKVL-----TCFVA

Query:  VPSLQGRRSSSVALQFFLPKFEGSRTSLQFLLPNSKVLTRFAAVPSSKFEGSPLLQFIPPSSKVLTCSAAVPSSKFEGSHVTSLQFLPPSSKVLRCCSSF
        + SLQ    SS  L            SLQ L  +S  LT   +VP++ F G   LQ++      LT   ++P++ F G  +TSLQ L  SS  L   +S 
Subjt:  VPSLQGRRSSSVALQFFLPKFEGSRTSLQFLLPNSKVLTRFAAVPSSKFEGSPLLQFIPPSSKVLTCSAAVPSSKFEGSHVTSLQFLPPSSKVLRCCSSF

Query:  LQVRRFSRAPLQFLPPSLKVLTSLHYSSFLQVRRFSHASLQFLLSKVEGPHALRCSSFSPSSKVHALRFSSFSQIRFTAVPSFKFEGSPLLQFIPPSSKV
         +      A LQ L  S   LTS+  + F  +     ASLQ L        ++  + F+  + +  L  SS      T++P   F G   LQ +  S   
Subjt:  LQVRRFSRAPLQFLPPSLKVLTSLHYSSFLQVRRFSHASLQFLLSKVEGPHALRCSSFSPSSKVHALRFSSFSQIRFTAVPSFKFEGSPLLQFIPPSSKV

Query:  LTCSAAVPSSKFEGSHVASLQFLPLSSKVL-----TCFIAVPSLQGRRSSSVALQFFLPKFEGSRTLLQFLLPNS
        LT   +VP + F G  +ASLQ L LS   L     T F  + SLQ    SS  L             LQ+L  +S
Subjt:  LTCSAAVPSSKFEGSHVASLQFLPLSSKVL-----TCFIAVPSLQGRRSSSVALQFFLPKFEGSRTLLQFLLPNS

TrEMBL top hitse value%identityAlignment
A0A376B7X3 Chitinase3.8e-0732.33Show/hide
Query:  LLSKVEGPHALRCSSFSPSSKVHALRFSSFSQIRFTAVPSSKFEGSPLLQFIPPSSKVLTCSAAVPSSKFEGSHVASLQFLPLSSKVLTCFVAVPSLQGR
        L S V    A+  SS  PSS V  + +SS   + +++VPSS     P    I PSS  +  S+A+PSS    S V     +PLSS + +  V +PS    
Subjt:  LLSKVEGPHALRCSSFSPSSKVHALRFSSFSQIRFTAVPSSKFEGSPLLQFIPPSSKVLTCSAAVPSSKFEGSHVASLQFLPLSSKVLTCFVAVPSLQGR

Query:  RSSSVALQFFLPKFEGSRTSLQFLLPNSKVLTRFAAVPSSKFEGSPLLQFIPPSSKVLTCSAAVPSSKFEGSHVTSLQFLP-----PSSKVLRCCSSFLQ
         SSSV L   +P   G        +P+S VL   + VPSS     P    IP SS VL+ S A  S+    S V     +P     PSS ++   S    
Subjt:  RSSSVALQFFLPKFEGSRTSLQFLLPNSKVLTRFAAVPSSKFEGSPLLQFIPPSSKVLTCSAAVPSSKFEGSHVTSLQFLP-----PSSKVLRCCSSFLQ

Query:  VRRFSRAPLQFLPPSLKVLTSLHYSSFLQVRRFSHASLQFLL-SKVEGPHALRCSSFSPSSKVHALRFSSFSQIRFTAVPSFKFEGSPLLQFIPPSSKVL
        V   S  P   + PS  ++ S    S   V   S  S   +L S V    A+  SS  PSS +      S S    +A+PS    GS +      SS  +
Subjt:  VRRFSRAPLQFLPPSLKVLTSLHYSSFLQVRRFSHASLQFLL-SKVEGPHALRCSSFSPSSKVHALRFSSFSQIRFTAVPSFKFEGSPLLQFIPPSSKVL

Query:  TCSAAVPSSKFEGSHVASLQFLPLSSK-------VLTCFIA-----VPSLQGRRSSSVALQFFLP
          S+A+PSS    S      F+PLSS        VL+  +A     VPS     SSSV L   +P
Subjt:  TCSAAVPSSKFEGSHVASLQFLPLSSK-------VLTCFIA-----VPSLQGRRSSSVALQFFLP

A0A5A7SRE2 Ty3-gypsy retrotransposon protein2.1e-0547.3Show/hide
Query:  KAKCLHIEEKSTFNIY-FYRLKVTSGQSKRKMDNLEMKLFDEVNNDKKLQSSIPSRMKRKFSVLINTDGSLKLE
        K   + I +K   + Y F RLK+T+ Q +R+M  L+ K F E N+D K+ S +PSRMKRK SV INT+GSL ++
Subjt:  KAKCLHIEEKSTFNIY-FYRLKVTSGQSKRKMDNLEMKLFDEVNNDKKLQSSIPSRMKRKFSVLINTDGSLKLE

A0A5A7UC34 Uncharacterized protein7.6e-0840.34Show/hide
Query:  MFDVHLHSTFSFPKAKCLHIEEKSTFNIYFYRLKVTSGQSKRKMDNLEMKLFDEVNNDKKLQSSIPSRMKRKFSVLINTDGSLK--LEGSSLYVVALF-L
        M +++LH  FSF K K    ++       F RLK+T  Q +R+M +L++K F E N+D K+ S +PSRMKRK SV INT+G+     +      + LF +
Subjt:  MFDVHLHSTFSFPKAKCLHIEEKSTFNIYFYRLKVTSGQSKRKMDNLEMKLFDEVNNDKKLQSSIPSRMKRKFSVLINTDGSLK--LEGSSLYVVALF-L

Query:  LQVRRFFVVCCCVVPSPSS
        LQ R F     CV P  SS
Subjt:  LQVRRFFVVCCCVVPSPSS

A0A5D3D209 Retrotransposon gag protein2.1e-0554.39Show/hide
Query:  FYRLKVTSGQSKRKMDNLEMKLFDEVNNDKKLQSSIPSRMKRKFSVLINTDGSLKLE
        F RLK+T+ Q KR+M + + K F E N+D K+ S +PSRMKRK SV INT+GSL ++
Subjt:  FYRLKVTSGQSKRKMDNLEMKLFDEVNNDKKLQSSIPSRMKRKFSVLINTDGSLKLE

A0A5D3E580 Uncharacterized protein2.9e-0745.74Show/hide
Query:  MFDVHLHSTFSFPKAKCLHIEEKSTFNIYFYRLKVTSGQSKRKMDNLEMKLFDEVNNDKKLQSSIPSRMKRKFSVLINTDGSLKLEGSSLYVVA
        M +V+ HS FSF KAK LHIEE       F  LK+T+ Q +R+M  L+ K F E NND K+ S + S MKRK  V IN    L    S  ++VA
Subjt:  MFDVHLHSTFSFPKAKCLHIEEKSTFNIYFYRLKVTSGQSKRKMDNLEMKLFDEVNNDKKLQSSIPSRMKRKFSVLINTDGSLKLEGSSLYVVA

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTCGATGTCCACCTCCACTCGACCTTCAGCTTTCCAAAGGCTAAGTGTCTCCACATTGAAGAAAAGTCGACCTTCAACATCTATTTTTATCGCCTCAAAGTAACAAG
CGGTCAATCTAAAAGAAAGATGGATAACTTGGAGATGAAACTTTTTGATGAAGTAAACAACGACAAGAAGCTTCAAAGTAGCATCCCATCACGTATGAAGAGAAAGTTCT
CTGTTCTCATAAATACAGATGGTTCCTTGAAGTTGGAAGGTTCTTCGTTGTATGTTGTTGCGTTGTTCCTTCTCCAAGTTCGAAGGTTCTTCGTTGTATGTTGTTGCGTT
GTTCCTTCTCCAAGTTCGAAGGTTCTTCGTTGTATCCTGCTGCGTTGTTCCTTCTCCAAGTTCGAGGGTCCTCAGTTGTACTACTACTACGTTGTTCCTCCTCCAAGTGC
GAAGGATATTATGTGGTGTGTTGTTGCATTGTTTCTCTCAAGTTTGATGGTTCTCACGCAGCTTCGCTGGAGTTTCTTCTCCCTAAGTTCGAAGGTTCTCACGCGCTTCG
TTGCAGTTCCTTCTCTCCAAGGTCGAAGGTTCTCACGTGCTGCGTTGCAGTTCTTTCTCCCCAAGTTCGAAGGTTCACGCACTTCGCTTCAGTTCCTTCTCCCAAATTTG
AGGGTTCTCACGCGCTTCGCTGCAGTTCCTTCCTCCAAGTTTGAAGGTTCTCACGTCGCTTTGTTGGAGTTCCTTCCTCCAAGTTCGAAGGTTCTCACGTGCTTCGCTGC
AGTTTCTTCCTCCAAGTTCGAAGGTTCTCACGTGCTTCGTTGCAATTCCTTCCTCAGAGTTCGAAGGTTCTCACGCGCTTCGTTGCAGTTCCTTCCTCCAAGTTCGAAGG
TTCTCACACGCTTTGTTGCAGTTCCTTCTCTCCAAGATCGAAGTTCATTCCTCCAAGTTCGAAGGTTCTCACGCGCTCCGCTGCAGTTCCTTCCTCCAAGTTTAAAGGTT
CTCACGTCGCTTCACTACAGTTCCTTCCTCCAAGTTCGAAGGTTCTCACACGCTTCGTTGCAGTTCCTTCTCTCCAAGGTCGAAGGTCCTCATGCGTTGCGTTGCAGTTC
TTTCTCCCCAAGTTCGAAGGTTCACGCACTTCGCTTCAGTTCCTTCTCCCAAATTCGCTTCACTGCAGTTCCTTCCTCCAAGTTCGAAGGTTCTCCGCTGCTGCAGTTCA
TTCCTCCAAGTTCGAAGGTTCTCACGTGCTCCGCTGCAGTTCCTTCCTCCAAGTTTGAAGGTTCTCACGTCGCTTCGCTGCAGTTCCTTCCTCTAAGTTCGAAGGTTCTC
ACGTGCTTCGTTGCAGTTCCTTCTCTCCAAGGTCGAAGGTCCTCAAGCGTTGCGTTGCAGTTCTTTCTCCCCAAGTTCGAAGGTTCACGCACTTCGCTTCAGTTCCTTCT
CCCAAATTCGAAGGTTCTCACGCGCTTCGCTGCAGTTCCTTCCTCCAAGTTCGAAGGTTCTCCGCTGCTGCAGTTCATTCCTCCAAGTTCGAAGGTTCTCACGTGCTCTG
CTGCAGTTCCTTCCTCCAAGTTTGAAGGTTCTCACGTCACTTCGCTGCAGTTCCTTCCTCCAAGTTCGAAGGTTCTCCGCTGCTGCAGTTCATTCCTCCAAGTTCGAAGG
TTCTCACGCGCTCCGCTGCAGTTCCTTCCTCCAAGTTTGAAGGTTCTCACGTCGCTTCACTACAGTTCCTTCCTCCAAGTTCGAAGGTTCTCACACGCTTCGTTGCAGTT
CCTTCTCTCCAAGGTCGAAGGTCCTCATGCGTTGCGTTGCAGTTCTTTCTCCCCAAGTTCGAAGGTTCACGCACTTCGCTTCAGTTCCTTCTCCCAAATTCGCTTCACTG
CAGTTCCTTCCTTCAAGTTCGAAGGTTCTCCGCTGCTGCAGTTCATTCCTCCAAGTTCGAAGGTTCTCACGTGCTCCGCTGCAGTTCCTTCCTCCAAGTTTGAAGGTTCT
CACGTCGCTTCGCTGCAGTTCCTTCCTCTAAGTTCGAAGGTTCTCACGTGCTTCATTGCAGTTCCTTCTCTCCAAGGTCGAAGGTCCTCAAGCGTTGCGTTGCAGTTCTT
TCTTCCCAAGTTCGAAGGTTCACGCACTTTGCTTCAGTTCCTTCTCCCAAATTCGCTTGAGGACCTTCGACCTTGGAGAGAAGGAACTGCAACGCAACGAATTTGGGAGA
AGGAACTGGAGAAAGAACTGCAACGCAACGATTTGACCAATAATTAA
mRNA sequenceShow/hide mRNA sequence
ATGTTCGATGTCCACCTCCACTCGACCTTCAGCTTTCCAAAGGCTAAGTGTCTCCACATTGAAGAAAAGTCGACCTTCAACATCTATTTTTATCGCCTCAAAGTAACAAG
CGGTCAATCTAAAAGAAAGATGGATAACTTGGAGATGAAACTTTTTGATGAAGTAAACAACGACAAGAAGCTTCAAAGTAGCATCCCATCACGTATGAAGAGAAAGTTCT
CTGTTCTCATAAATACAGATGGTTCCTTGAAGTTGGAAGGTTCTTCGTTGTATGTTGTTGCGTTGTTCCTTCTCCAAGTTCGAAGGTTCTTCGTTGTATGTTGTTGCGTT
GTTCCTTCTCCAAGTTCGAAGGTTCTTCGTTGTATCCTGCTGCGTTGTTCCTTCTCCAAGTTCGAGGGTCCTCAGTTGTACTACTACTACGTTGTTCCTCCTCCAAGTGC
GAAGGATATTATGTGGTGTGTTGTTGCATTGTTTCTCTCAAGTTTGATGGTTCTCACGCAGCTTCGCTGGAGTTTCTTCTCCCTAAGTTCGAAGGTTCTCACGCGCTTCG
TTGCAGTTCCTTCTCTCCAAGGTCGAAGGTTCTCACGTGCTGCGTTGCAGTTCTTTCTCCCCAAGTTCGAAGGTTCACGCACTTCGCTTCAGTTCCTTCTCCCAAATTTG
AGGGTTCTCACGCGCTTCGCTGCAGTTCCTTCCTCCAAGTTTGAAGGTTCTCACGTCGCTTTGTTGGAGTTCCTTCCTCCAAGTTCGAAGGTTCTCACGTGCTTCGCTGC
AGTTTCTTCCTCCAAGTTCGAAGGTTCTCACGTGCTTCGTTGCAATTCCTTCCTCAGAGTTCGAAGGTTCTCACGCGCTTCGTTGCAGTTCCTTCCTCCAAGTTCGAAGG
TTCTCACACGCTTTGTTGCAGTTCCTTCTCTCCAAGATCGAAGTTCATTCCTCCAAGTTCGAAGGTTCTCACGCGCTCCGCTGCAGTTCCTTCCTCCAAGTTTAAAGGTT
CTCACGTCGCTTCACTACAGTTCCTTCCTCCAAGTTCGAAGGTTCTCACACGCTTCGTTGCAGTTCCTTCTCTCCAAGGTCGAAGGTCCTCATGCGTTGCGTTGCAGTTC
TTTCTCCCCAAGTTCGAAGGTTCACGCACTTCGCTTCAGTTCCTTCTCCCAAATTCGCTTCACTGCAGTTCCTTCCTCCAAGTTCGAAGGTTCTCCGCTGCTGCAGTTCA
TTCCTCCAAGTTCGAAGGTTCTCACGTGCTCCGCTGCAGTTCCTTCCTCCAAGTTTGAAGGTTCTCACGTCGCTTCGCTGCAGTTCCTTCCTCTAAGTTCGAAGGTTCTC
ACGTGCTTCGTTGCAGTTCCTTCTCTCCAAGGTCGAAGGTCCTCAAGCGTTGCGTTGCAGTTCTTTCTCCCCAAGTTCGAAGGTTCACGCACTTCGCTTCAGTTCCTTCT
CCCAAATTCGAAGGTTCTCACGCGCTTCGCTGCAGTTCCTTCCTCCAAGTTCGAAGGTTCTCCGCTGCTGCAGTTCATTCCTCCAAGTTCGAAGGTTCTCACGTGCTCTG
CTGCAGTTCCTTCCTCCAAGTTTGAAGGTTCTCACGTCACTTCGCTGCAGTTCCTTCCTCCAAGTTCGAAGGTTCTCCGCTGCTGCAGTTCATTCCTCCAAGTTCGAAGG
TTCTCACGCGCTCCGCTGCAGTTCCTTCCTCCAAGTTTGAAGGTTCTCACGTCGCTTCACTACAGTTCCTTCCTCCAAGTTCGAAGGTTCTCACACGCTTCGTTGCAGTT
CCTTCTCTCCAAGGTCGAAGGTCCTCATGCGTTGCGTTGCAGTTCTTTCTCCCCAAGTTCGAAGGTTCACGCACTTCGCTTCAGTTCCTTCTCCCAAATTCGCTTCACTG
CAGTTCCTTCCTTCAAGTTCGAAGGTTCTCCGCTGCTGCAGTTCATTCCTCCAAGTTCGAAGGTTCTCACGTGCTCCGCTGCAGTTCCTTCCTCCAAGTTTGAAGGTTCT
CACGTCGCTTCGCTGCAGTTCCTTCCTCTAAGTTCGAAGGTTCTCACGTGCTTCATTGCAGTTCCTTCTCTCCAAGGTCGAAGGTCCTCAAGCGTTGCGTTGCAGTTCTT
TCTTCCCAAGTTCGAAGGTTCACGCACTTTGCTTCAGTTCCTTCTCCCAAATTCGCTTGAGGACCTTCGACCTTGGAGAGAAGGAACTGCAACGCAACGAATTTGGGAGA
AGGAACTGGAGAAAGAACTGCAACGCAACGATTTGACCAATAATTAA
Protein sequenceShow/hide protein sequence
MFDVHLHSTFSFPKAKCLHIEEKSTFNIYFYRLKVTSGQSKRKMDNLEMKLFDEVNNDKKLQSSIPSRMKRKFSVLINTDGSLKLEGSSLYVVALFLLQVRRFFVVCCCV
VPSPSSKVLRCILLRCSFSKFEGPQLYYYYVVPPPSAKDIMWCVVALFLSSLMVLTQLRWSFFSLSSKVLTRFVAVPSLQGRRFSRAALQFFLPKFEGSRTSLQFLLPNL
RVLTRFAAVPSSKFEGSHVALLEFLPPSSKVLTCFAAVSSSKFEGSHVLRCNSFLRVRRFSRASLQFLPPSSKVLTRFVAVPSLQDRSSFLQVRRFSRAPLQFLPPSLKV
LTSLHYSSFLQVRRFSHASLQFLLSKVEGPHALRCSSFSPSSKVHALRFSSFSQIRFTAVPSSKFEGSPLLQFIPPSSKVLTCSAAVPSSKFEGSHVASLQFLPLSSKVL
TCFVAVPSLQGRRSSSVALQFFLPKFEGSRTSLQFLLPNSKVLTRFAAVPSSKFEGSPLLQFIPPSSKVLTCSAAVPSSKFEGSHVTSLQFLPPSSKVLRCCSSFLQVRR
FSRAPLQFLPPSLKVLTSLHYSSFLQVRRFSHASLQFLLSKVEGPHALRCSSFSPSSKVHALRFSSFSQIRFTAVPSFKFEGSPLLQFIPPSSKVLTCSAAVPSSKFEGS
HVASLQFLPLSSKVLTCFIAVPSLQGRRSSSVALQFFLPKFEGSRTLLQFLLPNSLEDLRPWREGTATQRIWEKELEKELQRNDLTNN