CuGenDBv2

Lag0029177 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0029177
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Retrotransposon gag protein
Genome location: chr8:36079145..36085876
RNA-Seq Expression: Lag0029177
Synteny: Lag0029177
Gene Ontology terms: NA
InterPro domains: NA
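
The genome location field above follows a `seqid:start..end` pattern. A minimal sketch of parsing it is given here, assuming the coordinates are 1-based and inclusive (the record does not state this); `Location` and `parse_location` are hypothetical names introduced for illustration, not part of any database API.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """A genomic interval; coordinates are ASSUMED 1-based and inclusive."""
    seqid: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Inclusive interval, so both endpoints count.
        return self.end - self.start + 1


# Matches strings such as "chr8:36079145..36085876".
_LOCATION_RE = re.compile(r"^(?P<seqid>[^:]+):(?P<start>\d+)\.\.(?P<end>\d+)$")


def parse_location(text: str) -> Location:
    """Parse a 'seqid:start..end' string into a Location (hypothetical helper)."""
    match = _LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognized location string: {text!r}")
    start, end = int(match["start"]), int(match["end"])
    if start > end:
        raise ValueError(f"start {start} is greater than end {end}")
    return Location(match["seqid"], start, end)


loc = parse_location("chr8:36079145..36085876")
print(loc.seqid, loc.start, loc.end, loc.length)
```

Under the inclusive-coordinate assumption, the interval for this gene spans 6732 bp.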


Homology
GenBank top hits (e value, % identity, alignment)
KAA0032121.1 ty3-gypsy retrotransposon protein [Cucumis melo var. makuwa] | e value: 2.7e-09 | % identity: 34.27
Query:  FVLKDLILKLAMEGKIELDLDEVAQSNLATIKEKSK--------HQRKK-------DPKKLQPKRKRSKKFSQPQQ----------LVMLNKSFSKNVHK
        FVLK+LI KLA E KIELD+DEVAQ+N   +   S          QRK        +P  ++ ++K     SQ ++          +  L +SF ++  +
Subjt:  FVLKDLILKLAMEGKIELDLDEVAQSNLATIKEKSK--------HQRKK-------DPKKLQPKRKRSKKFSQPQQ----------LVMLNKSFSKNVHK

Query:  KEKENFA--TSYCIDV-------EEVDNSEMGEQRTSVFDRIKPPTTRPSAFQRMSMAATEEENQL----------------------------------
        +  E  A  T+  ++V       EE+DNS   +QRTSVFD IKP TTR S FQR+SMA  +EENQ                                   
Subjt:  KEKENFA--TSYCIDV-------EEVDNSEMGEQRTSVFDRIKPPTTRPSAFQRMSMAATEEENQL----------------------------------

Query:  --------------------NSDKKLQSSIPSRMKRKFSVLINTEGSL
                            N D K+ S +PSRMKRK SV INTEGSL
Subjt:  --------------------NSDKKLQSSIPSRMKRKFSVLINTEGSL

KAA0035697.1 retrotransposon gag protein [Cucumis melo var. makuwa] | e value: 3.6e-09 | % identity: 35.05
Query:  FVLKDLILKLAMEGKIELDLDEVAQSNLATIKEKSKHQRKKDPKKLQPKRKRSKKFSQPQQLVMLNKSFSKNVHKKEKENFATSYCIDVEEVDNSEMGEQ
        FVLK+LILKLA E KI+LD+DE+AQ+N   ++  S       P  L         + Q ++ V+   +       +   N+A+S     +EV NS   +Q
Subjt:  FVLKDLILKLAMEGKIELDLDEVAQSNLATIKEKSKHQRKKDPKKLQPKRKRSKKFSQPQQLVMLNKSFSKNVHKKEKENFATSYCIDVEEVDNSEMGEQ

Query:  RTSVFDRIKPPTTRPSAFQRMSMAATEEENQL------------------------------------------------------NSDKKLQSSIPSRM
         TSVF+RIKP TTR S FQR+SMA  EEENQ                                                       N+D K+ S + SRM
Subjt:  RTSVFDRIKPPTTRPSAFQRMSMAATEEENQL------------------------------------------------------NSDKKLQSSIPSRM

Query:  KRKFSVLINTEGSL
        KRK SV INTEGSL
Subjt:  KRKFSVLINTEGSL

KAA0050736.1 retrotransposon gag protein [Cucumis melo var. makuwa] | e value: 1.8e-08 | % identity: 27.97
Query:  FVLKDLILKLAMEGKIELDLDEVAQSNL------------------------------------------------------------------------
        FVLK+LILKLA E KI+LD+DEVAQ+N                                                                         
Subjt:  FVLKDLILKLAMEGKIELDLDEVAQSNL------------------------------------------------------------------------

Query:  ------------ATIKEKSKHQRKKDPKKLQPKRKRSKKFSQPQQLVMLNKSFSKNVHKKEKENFA------TSYCIDV-------EEVDNSEMGEQRTS
                      I  K K +R K   K +P +++ +KF QP++ + L + F ++  +   E         T+  ++V       EEVDNS   +QRTS
Subjt:  ------------ATIKEKSKHQRKKDPKKLQPKRKRSKKFSQPQQLVMLNKSFSKNVHKKEKENFA------TSYCIDV-------EEVDNSEMGEQRTS

Query:  VFDRIKPPTTRPSAFQRMSMAATEEENQL------------------------------------------------------NSDKKLQSSIPSRMKRK
        VFDRIKP TTR S FQR+SMA  EEENQ                                                       N D K+ S +PSR+KRK
Subjt:  VFDRIKPPTTRPSAFQRMSMAATEEENQL------------------------------------------------------NSDKKLQSSIPSRMKRK

Query:  FSVLINTEGSL
         S+ INTEGSL
Subjt:  FSVLINTEGSL

KAA0056121.1 ty3-gypsy retrotransposon protein [Cucumis melo var. makuwa] | e value: 1.6e-12 | % identity: 37.39
Query:  FVLKDLILKLAMEGKIELDLDEVAQSNLATIKEKSKHQRKKDPKKLQPKRK--------RSKKFSQPQQLVMLNKSFSKNVHKKEKENFATSYCIDVEEV
        FVLK+LILKLA E KIELD+DEVAQ+N A I+  S   + KD   LQ +R         RS     P++++ +    + ++ + +  N+ +S     +EV
Subjt:  FVLKDLILKLAMEGKIELDLDEVAQSNLATIKEKSKHQRKKDPKKLQPKRK--------RSKKFSQPQQLVMLNKSFSKNVHKKEKENFATSYCIDVEEV

Query:  DNSEMGEQRTSVFDRIKPPTTRPSAFQRMSMAATEEENQL------------------------------------------------------NSDKKL
        +NS    QRTSVFDRIKP TTR S FQR+S+A  EEENQ                                                       N D K+
Subjt:  DNSEMGEQRTSVFDRIKPPTTRPSAFQRMSMAATEEENQL------------------------------------------------------NSDKKL

Query:  QSSIPSRMKRKFSVLINTEGSL
         S +PSRMKRK  V INTEGSL
Subjt:  QSSIPSRMKRKFSVLINTEGSL

KAA0065608.1 retrotransposon gag protein [Cucumis melo var. makuwa] | e value: 3.8e-11 | % identity: 37.38
Query:  FVLKDLILKLAMEGKIELDLDEVAQSNLATIKEKSKHQRKKDPKKLQPKRKRSKKFSQPQQLVMLNKSFSKNVHKKEKENFATSYCIDVEEVDNSEMGEQ
        FVLK+LILKLA + KIELD+DEVAQ+N A +   S       P  L     +S     P++++ +    + ++   E +N   SY    EEVDNS   +Q
Subjt:  FVLKDLILKLAMEGKIELDLDEVAQSNLATIKEKSKHQRKKDPKKLQPKRKRSKKFSQPQQLVMLNKSFSKNVHKKEKENFATSYCIDVEEVDNSEMGEQ

Query:  RTSVFDRIKPPTTRPSAFQRMSMAATEEENQL------------------------------------------------------NSDKKLQSSIPSRM
        RT VFDRIKP TTR S FQR+SMA  EEE Q                                                       N D K+ S +PSR 
Subjt:  RTSVFDRIKPPTTRPSAFQRMSMAATEEENQL------------------------------------------------------NSDKKLQSSIPSRM

Query:  KRKFSVLINTEGSL
        KRK SV INTEGSL
Subjt:  KRKFSVLINTEGSL

TrEMBL top hits (e value, % identity, alignment)
A0A5A7SRE2 Ty3-gypsy retrotransposon protein | e value: 1.3e-09 | % identity: 34.27
Query:  FVLKDLILKLAMEGKIELDLDEVAQSNLATIKEKSK--------HQRKK-------DPKKLQPKRKRSKKFSQPQQ----------LVMLNKSFSKNVHK
        FVLK+LI KLA E KIELD+DEVAQ+N   +   S          QRK        +P  ++ ++K     SQ ++          +  L +SF ++  +
Subjt:  FVLKDLILKLAMEGKIELDLDEVAQSNLATIKEKSK--------HQRKK-------DPKKLQPKRKRSKKFSQPQQ----------LVMLNKSFSKNVHK

Query:  KEKENFA--TSYCIDV-------EEVDNSEMGEQRTSVFDRIKPPTTRPSAFQRMSMAATEEENQL----------------------------------
        +  E  A  T+  ++V       EE+DNS   +QRTSVFD IKP TTR S FQR+SMA  +EENQ                                   
Subjt:  KEKENFA--TSYCIDV-------EEVDNSEMGEQRTSVFDRIKPPTTRPSAFQRMSMAATEEENQL----------------------------------

Query:  --------------------NSDKKLQSSIPSRMKRKFSVLINTEGSL
                            N D K+ S +PSRMKRK SV INTEGSL
Subjt:  --------------------NSDKKLQSSIPSRMKRKFSVLINTEGSL

A0A5A7U974 Retrotransposon gag protein | e value: 8.6e-09 | % identity: 27.97
Query:  FVLKDLILKLAMEGKIELDLDEVAQSNL------------------------------------------------------------------------
        FVLK+LILKLA E KI+LD+DEVAQ+N                                                                         
Subjt:  FVLKDLILKLAMEGKIELDLDEVAQSNL------------------------------------------------------------------------

Query:  ------------ATIKEKSKHQRKKDPKKLQPKRKRSKKFSQPQQLVMLNKSFSKNVHKKEKENFA------TSYCIDV-------EEVDNSEMGEQRTS
                      I  K K +R K   K +P +++ +KF QP++ + L + F ++  +   E         T+  ++V       EEVDNS   +QRTS
Subjt:  ------------ATIKEKSKHQRKKDPKKLQPKRKRSKKFSQPQQLVMLNKSFSKNVHKKEKENFA------TSYCIDV-------EEVDNSEMGEQRTS

Query:  VFDRIKPPTTRPSAFQRMSMAATEEENQL------------------------------------------------------NSDKKLQSSIPSRMKRK
        VFDRIKP TTR S FQR+SMA  EEENQ                                                       N D K+ S +PSR+KRK
Subjt:  VFDRIKPPTTRPSAFQRMSMAATEEENQL------------------------------------------------------NSDKKLQSSIPSRMKRK

Query:  FSVLINTEGSL
         S+ INTEGSL
Subjt:  FSVLINTEGSL

A0A5A7URH1 Ty3-gypsy retrotransposon protein | e value: 7.5e-13 | % identity: 37.39
Query:  FVLKDLILKLAMEGKIELDLDEVAQSNLATIKEKSKHQRKKDPKKLQPKRK--------RSKKFSQPQQLVMLNKSFSKNVHKKEKENFATSYCIDVEEV
        FVLK+LILKLA E KIELD+DEVAQ+N A I+  S   + KD   LQ +R         RS     P++++ +    + ++ + +  N+ +S     +EV
Subjt:  FVLKDLILKLAMEGKIELDLDEVAQSNLATIKEKSKHQRKKDPKKLQPKRK--------RSKKFSQPQQLVMLNKSFSKNVHKKEKENFATSYCIDVEEV

Query:  DNSEMGEQRTSVFDRIKPPTTRPSAFQRMSMAATEEENQL------------------------------------------------------NSDKKL
        +NS    QRTSVFDRIKP TTR S FQR+S+A  EEENQ                                                       N D K+
Subjt:  DNSEMGEQRTSVFDRIKPPTTRPSAFQRMSMAATEEENQL------------------------------------------------------NSDKKL

Query:  QSSIPSRMKRKFSVLINTEGSL
         S +PSRMKRK  V INTEGSL
Subjt:  QSSIPSRMKRKFSVLINTEGSL

A0A5D3CA53 Retrotransposon gag protein | e value: 1.9e-11 | % identity: 37.38
Query:  FVLKDLILKLAMEGKIELDLDEVAQSNLATIKEKSKHQRKKDPKKLQPKRKRSKKFSQPQQLVMLNKSFSKNVHKKEKENFATSYCIDVEEVDNSEMGEQ
        FVLK+LILKLA + KIELD+DEVAQ+N A +   S       P  L     +S     P++++ +    + ++   E +N   SY    EEVDNS   +Q
Subjt:  FVLKDLILKLAMEGKIELDLDEVAQSNLATIKEKSKHQRKKDPKKLQPKRKRSKKFSQPQQLVMLNKSFSKNVHKKEKENFATSYCIDVEEVDNSEMGEQ

Query:  RTSVFDRIKPPTTRPSAFQRMSMAATEEENQL------------------------------------------------------NSDKKLQSSIPSRM
        RT VFDRIKP TTR S FQR+SMA  EEE Q                                                       N D K+ S +PSR 
Subjt:  RTSVFDRIKPPTTRPSAFQRMSMAATEEENQL------------------------------------------------------NSDKKLQSSIPSRM

Query:  KRKFSVLINTEGSL
        KRK SV INTEGSL
Subjt:  KRKFSVLINTEGSL

A0A5D3E4T1 Retrotransposon gag protein | e value: 1.7e-09 | % identity: 35.05
Query:  FVLKDLILKLAMEGKIELDLDEVAQSNLATIKEKSKHQRKKDPKKLQPKRKRSKKFSQPQQLVMLNKSFSKNVHKKEKENFATSYCIDVEEVDNSEMGEQ
        FVLK+LILKLA E KI+LD+DE+AQ+N   ++  S       P  L         + Q ++ V+   +       +   N+A+S     +EV NS   +Q
Subjt:  FVLKDLILKLAMEGKIELDLDEVAQSNLATIKEKSKHQRKKDPKKLQPKRKRSKKFSQPQQLVMLNKSFSKNVHKKEKENFATSYCIDVEEVDNSEMGEQ

Query:  RTSVFDRIKPPTTRPSAFQRMSMAATEEENQL------------------------------------------------------NSDKKLQSSIPSRM
         TSVF+RIKP TTR S FQR+SMA  EEENQ                                                       N+D K+ S + SRM
Subjt:  RTSVFDRIKPPTTRPSAFQRMSMAATEEENQL------------------------------------------------------NSDKKLQSSIPSRM

Query:  KRKFSVLINTEGSL
        KRK SV INTEGSL
Subjt:  KRKFSVLINTEGSL

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
No hits found

Sequences
CDS sequence
ATGTTTGTCCTAAAGGACTTAATTCTAAAGCTAGCAATGGAAGGAAAAATTGAGCTCGACCTTGATGAAGTAGCTCAATCAAATCTTGCTACAATCAAAGAAAAGAGCAA
ACATCAAAGAAAGAAGGATCCTAAGAAACTTCAACCCAAAAGGAAGAGAAGTAAAAAATTTTCTCAACCTCAACAACTGGTGATGTTGAATAAATCCTTCTCCAAAAATG
TCCACAAAAAGGAAAAAGAGAACTTTGCGACTTCCTACTGCATCGACGTAGAAGAAGTTGACAATTCTGAGATGGGTGAACAAAGGACCTCCGTCTTCGATCGCATCAAG
CCTCCAACTACTCGTCCTTCGGCATTCCAAAGAATGAGTATGGCCGCGACAGAGGAAGAAAATCAATTAAACAGCGATAAGAAGCTTCAAAGTAGCATCCCGTCACGTAT
GAAGAGGAAGTTCTCTGTTCTCATAAATACAGAAGGTTCCTTGAAGTTCCTTCTCTCCAAGTTCGAGGGTCCTTACACTGTACGCTATTGCGTTGTTCCTTCTCCAAGTT
CGAAGGTTCTTTGTTGTATCCTGCTGCGTTGTTCCTTCTCCAAGTTCGAGGGTTCTCAGTTGTACGACTGTTACGCTGTTCCTCCTCCAAGTGCGAAGGATCTTATGTGG
TGCGTTGTTGCATTGTTCCCTCTTTTCTCAAGTTCGATGGTTCTCACGCAGCTTTGCTGGAGTTTCTTCTCCCCAAGTTCGAAGGTTCTCACGCGCTCCGTTGCAGTTCC
TTCTTTCCAAGGTCGAAGGTTCTCACTCGCTGCGTTGCAGTTCTTTCTCCCCAAGTTCGAAGGTTCACGCACTTCGCTGCAGTTCCTTCTCCCAAATTCGAAGGTTCTCA
CGCGCTTCGCTGCAGTTCCTTCCTCCAAGTTCGAAGTTCCTTCTCTCCAAGGTCGAAGGTTCTCACGCGCTGCGTTGCAATTCTTTCTCCCCAAGTTCGAAGGTTCACGC
ACTTCGCTGCAGTTCCTTCTCCCAAATTCGAAGGTTCTCACGCGCTTCGCTCTGCAATTCCTTCCCCCAAGTTCGAAGGTTCTCACGCGCTTCATGCAGTTCCTTCCTCC
AAATTCGAAGGTTCTCACGCGCTTTGCTGCAGTTCCTTCCTCCAAGTTTGAAGGTTCTCACGCGCTTCGTTGCAGTTCCTTCCTCACAATTCGAAGGTTCTCACGCGCTT
CGGTGAAGTTCCTTCCTCCAAGTCTGAAGGTTCTCACGCGCTTCGCTGCAGTTCCTTCCCCCCAAGTTCGAAGGTTCTCACGCGCTTTGCTGCAGTTCCTTCCTCACAGT
TCGAAGGTTCTCACGCGCTTGCTGCAGTTCCTTTCCCCCAAGTTCGAAGGTTCTCACGTCGCTTCGCTGCAGTTCCTTCCTCCAAGTTTGAAGGTTCTCACATCGCTTCG
CTTCGCGCTGCGCTTCGTTGCAGTTCCTTCCTCCAAGTTCGAAGTTCCTTCCTCCAAGTTCGAAGGTTTTCATGCGCTTTGCTGCAGTTCCTTCTCTCCACGTTCAAAGG
TTCTCACGCATTTCGCTACAGTTCATTTCCTCCAAGTTCAAAGGTTCTCACGCGCTTCAGTGAAGTTCCTTCCTCCAAGTCTGAAGGTTCTCACGTGCTTCGGTGAAGTT
CCTTCCTCCCAAGTTCGAAGGTTCTTACGCGCTTCGCTGCAGTTCCTTCCTCCAAGTTCGAAGGTTCTCACGCGCTTCGCTGCAGTTTCTTCTCCTAAGTTCGAAGGTTC
TCATGCGCTTCGATGCAGTTCCTTCCTCCCTAAGTTCGAAGGTTCTCACTCGCTTCGCTGCAGTTCCTTCCTCCAAATTCGAAGGTTTCGAAGGTTCTCACGCGCTGTGA
TTCGTTGCAGTTCCTTCCTCCAAGTTCGAAGGTTCTCATGCGCTTCGCTGCACAGTTCCTTCCTCCAAGTTCGAAGGTTCTCATGCGCTTCGTTGCTACCTTCAAGTTCG
AAGGTTCTCTCACGCGCTGCTGCAGCTCCTTCCTCCAAGTTCGAAGGTTCCCTCACGCGCTTCGCTCGCTCCTTCTCCAAGTTCGAAGGCGCTTCTCTCCACTGCTCCTT
CTCCAAGTTCGAAGGCGCTTCTCTCCGTTGCTACCTTCCTCCAAGTTCGAAGGTTCTCTCACGCGCTGCTGCAGTTCCTTCCTCCAAGTTCGAAGGTTCCCTCACGCGCT
TCGCTCGCTCCTTCTCCAAGTTCGAAGGCGCTTCTCTCCACTGCTCCTTCTCCAAGTTCGAAGGCGCTTCTCTCCGTTGCTACTTCTCCAAGTTCGAAGGCGCTTCTCTC
CACTGCTTCTTCTCCAGTTCGAAGGCGCTTCTCTCCACTGCTCCTTCTCCAAGTTCGAAGGTGCTTCTCTCCACCCCTCTTTTTGAAGGTTCGCCACTAAGGTTCTCCTT
CTCCAAGTTCGAAGGTTCACCGTTGCTCCTTTTCAAATGTTTGGCGGCGGTTGACGTCCTCATTCCGCTTCATCTTCAAATGTTGGTAGTTGACGGCGTCCGCTGCGCTT
CATCTTCAAATGTTGGCAGTGGTGAAGTCACTGCAATTGAATCTGATGACGACCGTTGTAGGCGAGTCGGGTCTGGTGACCACCCCTGCAGGTTACTCAGATCACCCAAT
AAAATGGGGACTGGTCTAGCAGGAGTGCATCACTGTAGGCAAATCTGGTTACTCAGATCACCCAATAAAATGGGGACTGGTCTAGCAGGAGTGCATCACTGTAGGCAAAT
CTGGTTACTCAGATCACCCAATAAAATGGGGACTGGTCTAGCAGGAGTGCATCACTGTAGGCAAATCTGGAGTGCATCACTAAAGACGAATCTGGTGACTACCCCTGCAG
GTTACTCAGATCACCCAATAAAATGGGGACTGGGTCTAGCAGGAGTGCGTGAAGGTGAATCTGGTGACTACCCCTGCAGGTTACTCAGATCACCCAATAAAATGGGGACT
GGTCTAGTAGGAGAGTGCATCGCTGTAGGCGAATCTGGTGACTACCCCTGCAGGTTACCCAGATCACCCAATGAAATAGGGGACTGGTCTAGCAGGAGTGGGATCACTGC
AAGTGAAGCTAGTGACGACCGTTGCACGAAGCTTACACATTCAAGCAAGAAAGAAACAAATTGCAATAATATATGTGCAAAGAAAGAAATGGAAGCCAAAAGCTCTATAA
CCGATCATCCAAAAGATCAACAAGCCAACAGGCTGATCATCCAAGAAGATCAACAAGTCAGCAGACCGATCATCCGAGAAGATCAACAAGTCAGCAGACCGATCATCCAA
AAAGATCAACAAGCCAACAGGCCGATCATCCAAGAAGATCAACAAGTCACAACAGGCCGATCCAAGAGATCATTAAGCCGGCAGGCCGATCATCCAAGAAGATCATCAAG
CCAACAGGCCGATCCAAGTGATCATCAACCTAGCAAGCCGATCATCCAAGAAACTCAACAAGCCAACAGGTCGATCATCCAAGAAGATCATCAAGCCAACAGGCCGATCC
AAGTGATCATCAACCTAGCAAGTCGATCATCCAGGAAGATCAACAAGCCAACAAGCCGATCCAAGAGATCATCACGCCAACAGGCCGATCATCCAAGAAGATCAACAAGC
CCAATAGGTCGATCCAGGAGATCATCAACCTAA
mRNA sequence
ATGTTTGTCCTAAAGGACTTAATTCTAAAGCTAGCAATGGAAGGAAAAATTGAGCTCGACCTTGATGAAGTAGCTCAATCAAATCTTGCTACAATCAAAGAAAAGAGCAA
ACATCAAAGAAAGAAGGATCCTAAGAAACTTCAACCCAAAAGGAAGAGAAGTAAAAAATTTTCTCAACCTCAACAACTGGTGATGTTGAATAAATCCTTCTCCAAAAATG
TCCACAAAAAGGAAAAAGAGAACTTTGCGACTTCCTACTGCATCGACGTAGAAGAAGTTGACAATTCTGAGATGGGTGAACAAAGGACCTCCGTCTTCGATCGCATCAAG
CCTCCAACTACTCGTCCTTCGGCATTCCAAAGAATGAGTATGGCCGCGACAGAGGAAGAAAATCAATTAAACAGCGATAAGAAGCTTCAAAGTAGCATCCCGTCACGTAT
GAAGAGGAAGTTCTCTGTTCTCATAAATACAGAAGGTTCCTTGAAGTTCCTTCTCTCCAAGTTCGAGGGTCCTTACACTGTACGCTATTGCGTTGTTCCTTCTCCAAGTT
CGAAGGTTCTTTGTTGTATCCTGCTGCGTTGTTCCTTCTCCAAGTTCGAGGGTTCTCAGTTGTACGACTGTTACGCTGTTCCTCCTCCAAGTGCGAAGGATCTTATGTGG
TGCGTTGTTGCATTGTTCCCTCTTTTCTCAAGTTCGATGGTTCTCACGCAGCTTTGCTGGAGTTTCTTCTCCCCAAGTTCGAAGGTTCTCACGCGCTCCGTTGCAGTTCC
TTCTTTCCAAGGTCGAAGGTTCTCACTCGCTGCGTTGCAGTTCTTTCTCCCCAAGTTCGAAGGTTCACGCACTTCGCTGCAGTTCCTTCTCCCAAATTCGAAGGTTCTCA
CGCGCTTCGCTGCAGTTCCTTCCTCCAAGTTCGAAGTTCCTTCTCTCCAAGGTCGAAGGTTCTCACGCGCTGCGTTGCAATTCTTTCTCCCCAAGTTCGAAGGTTCACGC
ACTTCGCTGCAGTTCCTTCTCCCAAATTCGAAGGTTCTCACGCGCTTCGCTCTGCAATTCCTTCCCCCAAGTTCGAAGGTTCTCACGCGCTTCATGCAGTTCCTTCCTCC
AAATTCGAAGGTTCTCACGCGCTTTGCTGCAGTTCCTTCCTCCAAGTTTGAAGGTTCTCACGCGCTTCGTTGCAGTTCCTTCCTCACAATTCGAAGGTTCTCACGCGCTT
CGGTGAAGTTCCTTCCTCCAAGTCTGAAGGTTCTCACGCGCTTCGCTGCAGTTCCTTCCCCCCAAGTTCGAAGGTTCTCACGCGCTTTGCTGCAGTTCCTTCCTCACAGT
TCGAAGGTTCTCACGCGCTTGCTGCAGTTCCTTTCCCCCAAGTTCGAAGGTTCTCACGTCGCTTCGCTGCAGTTCCTTCCTCCAAGTTTGAAGGTTCTCACATCGCTTCG
CTTCGCGCTGCGCTTCGTTGCAGTTCCTTCCTCCAAGTTCGAAGTTCCTTCCTCCAAGTTCGAAGGTTTTCATGCGCTTTGCTGCAGTTCCTTCTCTCCACGTTCAAAGG
TTCTCACGCATTTCGCTACAGTTCATTTCCTCCAAGTTCAAAGGTTCTCACGCGCTTCAGTGAAGTTCCTTCCTCCAAGTCTGAAGGTTCTCACGTGCTTCGGTGAAGTT
CCTTCCTCCCAAGTTCGAAGGTTCTTACGCGCTTCGCTGCAGTTCCTTCCTCCAAGTTCGAAGGTTCTCACGCGCTTCGCTGCAGTTTCTTCTCCTAAGTTCGAAGGTTC
TCATGCGCTTCGATGCAGTTCCTTCCTCCCTAAGTTCGAAGGTTCTCACTCGCTTCGCTGCAGTTCCTTCCTCCAAATTCGAAGGTTTCGAAGGTTCTCACGCGCTGTGA
TTCGTTGCAGTTCCTTCCTCCAAGTTCGAAGGTTCTCATGCGCTTCGCTGCACAGTTCCTTCCTCCAAGTTCGAAGGTTCTCATGCGCTTCGTTGCTACCTTCAAGTTCG
AAGGTTCTCTCACGCGCTGCTGCAGCTCCTTCCTCCAAGTTCGAAGGTTCCCTCACGCGCTTCGCTCGCTCCTTCTCCAAGTTCGAAGGCGCTTCTCTCCACTGCTCCTT
CTCCAAGTTCGAAGGCGCTTCTCTCCGTTGCTACCTTCCTCCAAGTTCGAAGGTTCTCTCACGCGCTGCTGCAGTTCCTTCCTCCAAGTTCGAAGGTTCCCTCACGCGCT
TCGCTCGCTCCTTCTCCAAGTTCGAAGGCGCTTCTCTCCACTGCTCCTTCTCCAAGTTCGAAGGCGCTTCTCTCCGTTGCTACTTCTCCAAGTTCGAAGGCGCTTCTCTC
CACTGCTTCTTCTCCAGTTCGAAGGCGCTTCTCTCCACTGCTCCTTCTCCAAGTTCGAAGGTGCTTCTCTCCACCCCTCTTTTTGAAGGTTCGCCACTAAGGTTCTCCTT
CTCCAAGTTCGAAGGTTCACCGTTGCTCCTTTTCAAATGTTTGGCGGCGGTTGACGTCCTCATTCCGCTTCATCTTCAAATGTTGGTAGTTGACGGCGTCCGCTGCGCTT
CATCTTCAAATGTTGGCAGTGGTGAAGTCACTGCAATTGAATCTGATGACGACCGTTGTAGGCGAGTCGGGTCTGGTGACCACCCCTGCAGGTTACTCAGATCACCCAAT
AAAATGGGGACTGGTCTAGCAGGAGTGCATCACTGTAGGCAAATCTGGTTACTCAGATCACCCAATAAAATGGGGACTGGTCTAGCAGGAGTGCATCACTGTAGGCAAAT
CTGGTTACTCAGATCACCCAATAAAATGGGGACTGGTCTAGCAGGAGTGCATCACTGTAGGCAAATCTGGAGTGCATCACTAAAGACGAATCTGGTGACTACCCCTGCAG
GTTACTCAGATCACCCAATAAAATGGGGACTGGGTCTAGCAGGAGTGCGTGAAGGTGAATCTGGTGACTACCCCTGCAGGTTACTCAGATCACCCAATAAAATGGGGACT
GGTCTAGTAGGAGAGTGCATCGCTGTAGGCGAATCTGGTGACTACCCCTGCAGGTTACCCAGATCACCCAATGAAATAGGGGACTGGTCTAGCAGGAGTGGGATCACTGC
AAGTGAAGCTAGTGACGACCGTTGCACGAAGCTTACACATTCAAGCAAGAAAGAAACAAATTGCAATAATATATGTGCAAAGAAAGAAATGGAAGCCAAAAGCTCTATAA
CCGATCATCCAAAAGATCAACAAGCCAACAGGCTGATCATCCAAGAAGATCAACAAGTCAGCAGACCGATCATCCGAGAAGATCAACAAGTCAGCAGACCGATCATCCAA
AAAGATCAACAAGCCAACAGGCCGATCATCCAAGAAGATCAACAAGTCACAACAGGCCGATCCAAGAGATCATTAAGCCGGCAGGCCGATCATCCAAGAAGATCATCAAG
CCAACAGGCCGATCCAAGTGATCATCAACCTAGCAAGCCGATCATCCAAGAAACTCAACAAGCCAACAGGTCGATCATCCAAGAAGATCATCAAGCCAACAGGCCGATCC
AAGTGATCATCAACCTAGCAAGTCGATCATCCAGGAAGATCAACAAGCCAACAAGCCGATCCAAGAGATCATCACGCCAACAGGCCGATCATCCAAGAAGATCAACAAGC
CCAATAGGTCGATCCAGGAGATCATCAACCTAA
Protein sequence
MFVLKDLILKLAMEGKIELDLDEVAQSNLATIKEKSKHQRKKDPKKLQPKRKRSKKFSQPQQLVMLNKSFSKNVHKKEKENFATSYCIDVEEVDNSEMGEQRTSVFDRIK
PPTTRPSAFQRMSMAATEEENQLNSDKKLQSSIPSRMKRKFSVLINTEGSLKFLLSKFEGPYTVRYCVVPSPSSKVLCCILLRCSFSKFEGSQLYDCYAVPPPSAKDLMW
CVVALFPLFSSSMVLTQLCWSFFSPSSKVLTRSVAVPSFQGRRFSLAALQFFLPKFEGSRTSLQFLLPNSKVLTRFAAVPSSKFEVPSLQGRRFSRAALQFFLPKFEGSR
TSLQFLLPNSKVLTRFALQFLPPSSKVLTRFMQFLPPNSKVLTRFAAVPSSKFEGSHALRCSSFLTIRRFSRASVKFLPPSLKVLTRFAAVPSPQVRRFSRALLQFLPHS
SKVLTRLLQFLSPKFEGSHVASLQFLPPSLKVLTSLRFALRFVAVPSSKFEVPSSKFEGFHALCCSSFSPRSKVLTHFATVHFLQVQRFSRASVKFLPPSLKVLTCFGEV
PSSQVRRFLRASLQFLPPSSKVLTRFAAVSSPKFEGSHALRCSSFLPKFEGSHSLRCSSFLQIRRFRRFSRAVIRCSSFLQVRRFSCASLHSSFLQVRRFSCASLLPSSS
KVLSRAAAAPSSKFEGSLTRFARSFSKFEGASLHCSFSKFEGASLRCYLPPSSKVLSRAAAVPSSKFEGSLTRFARSFSKFEGASLHCSFSKFEGASLRCYFSKFEGASL
HCFFSSSKALLSTAPSPSSKVLLSTPLFEGSPLRFSFSKFEGSPLLLFKCLAAVDVLIPLHLQMLVVDGVRCASSSNVGSGEVTAIESDDDRCRRVGSGDHPCRLLRSPN
KMGTGLAGVHHCRQIWLLRSPNKMGTGLAGVHHCRQIWLLRSPNKMGTGLAGVHHCRQIWSASLKTNLVTTPAGYSDHPIKWGLGLAGVREGESGDYPCRLLRSPNKMGT
GLVGECIAVGESGDYPCRLPRSPNEIGDWSSRSGITASEASDDRCTKLTHSSKKETNCNNICAKKEMEAKSSITDHPKDQQANRLIIQEDQQVSRPIIREDQQVSRPIIQ
KDQQANRPIIQEDQQVTTGRSKRSLSRQADHPRRSSSQQADPSDHQPSKPIIQETQQANRSIIQEDHQANRPIQVIINLASRSSRKINKPTSRSKRSSRQQADHPRRSTS
PIGRSRRSST
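
The CDS and protein sequences listed above can be cross-checked by translating the CDS. The following is a minimal sketch, assuming the standard nuclear genetic code and a reading frame starting at the first base; `translate` and `CODON_TABLE` are hypothetical names introduced for illustration, and the record does not say which tool produced the protein sequence.

```python
from itertools import product

# Standard genetic code, with codons enumerated in TCAG order
# (TTT, TTC, TTA, TTG, TCT, ...). '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon.

    Whitespace and line breaks are removed first, so the sequence can be
    pasted as wrapped lines. Codons containing unexpected characters are
    reported as 'X'; a trailing partial codon is ignored.
    """
    seq = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 48 bases of the CDS listed in this record.
print(translate("ATGTTTGTCCTAAAGGACTTAATTCTAAAGCTAGCAATGGAAGGAAAA"))
```

For the first 48 bases of the CDS this gives `MFVLKDLILKLAMEGK`, which agrees with the start of the listed protein sequence. Pasting the full CDS into `translate` would extend the same check to the whole record.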