CuGenDBv2

Lag0041682 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0041682
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Retrovirus-related Pol polyprotein from transposon TNT 1-94
Genome location: chr13:23862724..23873878
RNA-Seq Expression: Lag0041682
Synteny: Lag0041682
Gene Ontology terms: GO:0044237 - cellular metabolic process (biological process)
InterPro domains: IPR025724 - GAG-pre-integrase domain
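
The genome location field above follows a `chromosome:start..end` pattern. As a minimal sketch, the snippet below splits such a string and computes the length of the span. The function names `parse_location` and `span_length` are hypothetical, and the length calculation assumes 1-based, inclusive coordinates, which the page itself does not state.

```python
import re


def parse_location(loc):
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    m = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", loc)
    if m is None:
        raise ValueError(f"unrecognized location: {loc!r}")
    return m.group(1), int(m.group(2)), int(m.group(3))


def span_length(start, end):
    # Assumption: coordinates are 1-based and inclusive on both ends.
    return end - start + 1


chrom, start, end = parse_location("chr13:23862724..23873878")
print(chrom, start, end, span_length(start, end))
```

Under that coordinate assumption, the locus spans 11155 bp on chr13.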


Homology

GenBank top hits

KAA0054901.1 T3P18.3 [Cucumis melo var. makuwa] (e value: 7.0e-20; % identity: 30)
Query:  FQQPTAQSTIKSVTTIKLDRGNFLLWKNLALPILRSYRLEGHLTGEKPCLAKFLQATGGRSNPSSSHAGAGASSSEVAISEAAESLPTQEK-----LLPK
        F  P     +    + KLDR N+LLWK +ALPIL++        GE         +T   S  +SS    G ++S     ++    P  EK     LL  
Subjt:  FQQPTAQSTIKSVTTIKLDRGNFLLWKNLALPILRSYRLEGHLTGEKPCLAKFLQATGGRSNPSSSHAGAGASSSEVAISEAAESLPTQEK-----LLPK

Query:  SWDVRML--------------KSYGQSYKNCLTFNRGRKRIISVKSSNNREKYSYPNFASPFGLDEEFNPVVAMIQGRSDISWSEIQAELLVFEKRLELQ
         W    +              K   ++ KN       +K+I  V SS    K S        GL + +NPV A+IQG+ DI   ++Q+ELL+FEK LE Q
Subjt:  SWDVRML--------------KSYGQSYKNCLTFNRGRKRIISVKSSNNREKYSYPNFASPFGLDEEFNPVVAMIQGRSDISWSEIQAELLVFEKRLELQ

Query:  NNLKS----SLSLGQGAAV------------------------------NMANDKATSKELLRGVLVEGLYRFDG-AIAAAVDISKSGNYSRGLSSMNKG
        N+ ++    ++ +G G  +                              NMA D  T   LL+G L +GLY  +G A+ +  ++  S +  +    +N  
Subjt:  NNLKS----SLSLGQGAAV------------------------------NMANDKATSKELLRGVLVEGLYRFDG-AIAAAVDISKSGNYSRGLSSMNKG

Query:  ASSAFVLSSSVNVAI--SKVIWHRRLGHPSEKVLDSIVKRCNLSYKVNEK
         +S  VLS  V+ +I  SK +W RRLGHPS K+L+S++K CNL    NE+
Subjt:  ASSAFVLSSSVNVAI--SKVIWHRRLGHPSEKVLDSIVKRCNLSYKVNEK

TYK22689.1 T3P18.3 [Cucumis melo var. makuwa] (e value: 7.0e-20; % identity: 30)
Query:  FQQPTAQSTIKSVTTIKLDRGNFLLWKNLALPILRSYRLEGHLTGEKPCLAKFLQATGGRSNPSSSHAGAGASSSEVAISEAAESLPTQEK-----LLPK
        F  P     +    + KLDR N+LLWK +ALPIL++        GE         +T   S  +SS    G ++S     ++    P  EK     LL  
Subjt:  FQQPTAQSTIKSVTTIKLDRGNFLLWKNLALPILRSYRLEGHLTGEKPCLAKFLQATGGRSNPSSSHAGAGASSSEVAISEAAESLPTQEK-----LLPK

Query:  SWDVRML--------------KSYGQSYKNCLTFNRGRKRIISVKSSNNREKYSYPNFASPFGLDEEFNPVVAMIQGRSDISWSEIQAELLVFEKRLELQ
         W    +              K   ++ KN       +K+I  V SS    K S        GL + +NPV A+IQG+ DI   ++Q+ELL+FEK LE Q
Subjt:  SWDVRML--------------KSYGQSYKNCLTFNRGRKRIISVKSSNNREKYSYPNFASPFGLDEEFNPVVAMIQGRSDISWSEIQAELLVFEKRLELQ

Query:  NNLKS----SLSLGQGAAV------------------------------NMANDKATSKELLRGVLVEGLYRFDG-AIAAAVDISKSGNYSRGLSSMNKG
        N+ ++    ++ +G G  +                              NMA D  T   LL+G L +GLY  +G A+ +  ++  S +  +    +N  
Subjt:  NNLKS----SLSLGQGAAV------------------------------NMANDKATSKELLRGVLVEGLYRFDG-AIAAAVDISKSGNYSRGLSSMNKG

Query:  ASSAFVLSSSVNVAI--SKVIWHRRLGHPSEKVLDSIVKRCNLSYKVNEK
         +S  VLS  V+ +I  SK +W RRLGHPS K+L+S++K CNL    NE+
Subjt:  ASSAFVLSSSVNVAI--SKVIWHRRLGHPSEKVLDSIVKRCNLSYKVNEK

XP_016902197.1 PREDICTED: uncharacterized protein LOC107991581 isoform X1 [Cucumis melo] (e value: 1.7e-18; % identity: 36.19)
Query:  FQQPTAQSTIKSVTTIKLDRGNFLLWKNLALPILRSYRLEGHLTGEKPCLAKFLQATGGRSNPSSSHAGA----GASSSEV--AISEAAESLPTQEKLLP
        F  P     +  +TT+KLDR N+LLWK LALPIL+ Y+LEGHLT E PC + F+  +   SN + +  GA    GASSS     ++   E   T + LL 
Subjt:  FQQPTAQSTIKSVTTIKLDRGNFLLWKNLALPILRSYRLEGHLTGEKPCLAKFLQATGGRSNPSSSHAGA----GASSSEV--AISEAAESLPTQEKLLP

Query:  KSWDVRMLKS----YGQSYKNCLTFNRGRKRIISVKSSNNREKYSYPNFASPFGLDEEFNPVVAMIQGRSDISWSEIQAELLVFEKRLELQNNLKSSL-S
          W    +          + N        +    V+S    +        +  GLDE +N V+ +IQG+ DISW ++Q++LL+FEKRL+ QN  K +  +
Subjt:  KSWDVRMLKS----YGQSYKNCLTFNRGRKRIISVKSSNNREKYSYPNFASPFGLDEEFNPVVAMIQGRSDISWSEIQAELLVFEKRLELQNNLKSSL-S

Query:  LGQGAAVNMA
        + Q  A+NMA
Subjt:  LGQGAAVNMA

XP_016902203.1 PREDICTED: uncharacterized protein LOC107991581 isoform X3 [Cucumis melo] (e value: 1.7e-18; % identity: 36.19)
Query:  FQQPTAQSTIKSVTTIKLDRGNFLLWKNLALPILRSYRLEGHLTGEKPCLAKFLQATGGRSNPSSSHAGA----GASSSEV--AISEAAESLPTQEKLLP
        F  P     +  +TT+KLDR N+LLWK LALPIL+ Y+LEGHLT E PC + F+  +   SN + +  GA    GASSS     ++   E   T + LL 
Subjt:  FQQPTAQSTIKSVTTIKLDRGNFLLWKNLALPILRSYRLEGHLTGEKPCLAKFLQATGGRSNPSSSHAGA----GASSSEV--AISEAAESLPTQEKLLP

Query:  KSWDVRMLKS----YGQSYKNCLTFNRGRKRIISVKSSNNREKYSYPNFASPFGLDEEFNPVVAMIQGRSDISWSEIQAELLVFEKRLELQNNLKSSL-S
          W    +          + N        +    V+S    +        +  GLDE +N V+ +IQG+ DISW ++Q++LL+FEKRL+ QN  K +  +
Subjt:  KSWDVRMLKS----YGQSYKNCLTFNRGRKRIISVKSSNNREKYSYPNFASPFGLDEEFNPVVAMIQGRSDISWSEIQAELLVFEKRLELQNNLKSSL-S

Query:  LGQGAAVNMA
        + Q  A+NMA
Subjt:  LGQGAAVNMA

XP_016902205.1 PREDICTED: uncharacterized protein LOC107991581 isoform X5 [Cucumis melo] (e value: 1.7e-18; % identity: 36.19)
Query:  FQQPTAQSTIKSVTTIKLDRGNFLLWKNLALPILRSYRLEGHLTGEKPCLAKFLQATGGRSNPSSSHAGA----GASSSEV--AISEAAESLPTQEKLLP
        F  P     +  +TT+KLDR N+LLWK LALPIL+ Y+LEGHLT E PC + F+  +   SN + +  GA    GASSS     ++   E   T + LL 
Subjt:  FQQPTAQSTIKSVTTIKLDRGNFLLWKNLALPILRSYRLEGHLTGEKPCLAKFLQATGGRSNPSSSHAGA----GASSSEV--AISEAAESLPTQEKLLP

Query:  KSWDVRMLKS----YGQSYKNCLTFNRGRKRIISVKSSNNREKYSYPNFASPFGLDEEFNPVVAMIQGRSDISWSEIQAELLVFEKRLELQNNLKSSL-S
          W    +          + N        +    V+S    +        +  GLDE +N V+ +IQG+ DISW ++Q++LL+FEKRL+ QN  K +  +
Subjt:  KSWDVRMLKS----YGQSYKNCLTFNRGRKRIISVKSSNNREKYSYPNFASPFGLDEEFNPVVAMIQGRSDISWSEIQAELLVFEKRLELQNNLKSSL-S

Query:  LGQGAAVNMA
        + Q  A+NMA
Subjt:  LGQGAAVNMA

TrEMBL top hits

A0A1S4E1U6 uncharacterized protein LOC107991581 isoform X1 (e value: 8.4e-19; % identity: 36.19)
Query:  FQQPTAQSTIKSVTTIKLDRGNFLLWKNLALPILRSYRLEGHLTGEKPCLAKFLQATGGRSNPSSSHAGA----GASSSEV--AISEAAESLPTQEKLLP
        F  P     +  +TT+KLDR N+LLWK LALPIL+ Y+LEGHLT E PC + F+  +   SN + +  GA    GASSS     ++   E   T + LL 
Subjt:  FQQPTAQSTIKSVTTIKLDRGNFLLWKNLALPILRSYRLEGHLTGEKPCLAKFLQATGGRSNPSSSHAGA----GASSSEV--AISEAAESLPTQEKLLP

Query:  KSWDVRMLKS----YGQSYKNCLTFNRGRKRIISVKSSNNREKYSYPNFASPFGLDEEFNPVVAMIQGRSDISWSEIQAELLVFEKRLELQNNLKSSL-S
          W    +          + N        +    V+S    +        +  GLDE +N V+ +IQG+ DISW ++Q++LL+FEKRL+ QN  K +  +
Subjt:  KSWDVRMLKS----YGQSYKNCLTFNRGRKRIISVKSSNNREKYSYPNFASPFGLDEEFNPVVAMIQGRSDISWSEIQAELLVFEKRLELQNNLKSSL-S

Query:  LGQGAAVNMA
        + Q  A+NMA
Subjt:  LGQGAAVNMA

A0A1S4E1U9 uncharacterized protein LOC107991581 isoform X4 (e value: 8.4e-19; % identity: 36.19)
Query:  FQQPTAQSTIKSVTTIKLDRGNFLLWKNLALPILRSYRLEGHLTGEKPCLAKFLQATGGRSNPSSSHAGA----GASSSEV--AISEAAESLPTQEKLLP
        F  P     +  +TT+KLDR N+LLWK LALPIL+ Y+LEGHLT E PC + F+  +   SN + +  GA    GASSS     ++   E   T + LL 
Subjt:  FQQPTAQSTIKSVTTIKLDRGNFLLWKNLALPILRSYRLEGHLTGEKPCLAKFLQATGGRSNPSSSHAGA----GASSSEV--AISEAAESLPTQEKLLP

Query:  KSWDVRMLKS----YGQSYKNCLTFNRGRKRIISVKSSNNREKYSYPNFASPFGLDEEFNPVVAMIQGRSDISWSEIQAELLVFEKRLELQNNLKSSL-S
          W    +          + N        +    V+S    +        +  GLDE +N V+ +IQG+ DISW ++Q++LL+FEKRL+ QN  K +  +
Subjt:  KSWDVRMLKS----YGQSYKNCLTFNRGRKRIISVKSSNNREKYSYPNFASPFGLDEEFNPVVAMIQGRSDISWSEIQAELLVFEKRLELQNNLKSSL-S

Query:  LGQGAAVNMA
        + Q  A+NMA
Subjt:  LGQGAAVNMA

A0A1S4E1V2 uncharacterized protein LOC107991581 isoform X3 (e value: 8.4e-19; % identity: 36.19)
Query:  FQQPTAQSTIKSVTTIKLDRGNFLLWKNLALPILRSYRLEGHLTGEKPCLAKFLQATGGRSNPSSSHAGA----GASSSEV--AISEAAESLPTQEKLLP
        F  P     +  +TT+KLDR N+LLWK LALPIL+ Y+LEGHLT E PC + F+  +   SN + +  GA    GASSS     ++   E   T + LL 
Subjt:  FQQPTAQSTIKSVTTIKLDRGNFLLWKNLALPILRSYRLEGHLTGEKPCLAKFLQATGGRSNPSSSHAGA----GASSSEV--AISEAAESLPTQEKLLP

Query:  KSWDVRMLKS----YGQSYKNCLTFNRGRKRIISVKSSNNREKYSYPNFASPFGLDEEFNPVVAMIQGRSDISWSEIQAELLVFEKRLELQNNLKSSL-S
          W    +          + N        +    V+S    +        +  GLDE +N V+ +IQG+ DISW ++Q++LL+FEKRL+ QN  K +  +
Subjt:  KSWDVRMLKS----YGQSYKNCLTFNRGRKRIISVKSSNNREKYSYPNFASPFGLDEEFNPVVAMIQGRSDISWSEIQAELLVFEKRLELQNNLKSSL-S

Query:  LGQGAAVNMA
        + Q  A+NMA
Subjt:  LGQGAAVNMA

A0A5A7UMW2 T3P18.3 (e value: 3.4e-20; % identity: 30)
Query:  FQQPTAQSTIKSVTTIKLDRGNFLLWKNLALPILRSYRLEGHLTGEKPCLAKFLQATGGRSNPSSSHAGAGASSSEVAISEAAESLPTQEK-----LLPK
        F  P     +    + KLDR N+LLWK +ALPIL++        GE         +T   S  +SS    G ++S     ++    P  EK     LL  
Subjt:  FQQPTAQSTIKSVTTIKLDRGNFLLWKNLALPILRSYRLEGHLTGEKPCLAKFLQATGGRSNPSSSHAGAGASSSEVAISEAAESLPTQEK-----LLPK

Query:  SWDVRML--------------KSYGQSYKNCLTFNRGRKRIISVKSSNNREKYSYPNFASPFGLDEEFNPVVAMIQGRSDISWSEIQAELLVFEKRLELQ
         W    +              K   ++ KN       +K+I  V SS    K S        GL + +NPV A+IQG+ DI   ++Q+ELL+FEK LE Q
Subjt:  SWDVRML--------------KSYGQSYKNCLTFNRGRKRIISVKSSNNREKYSYPNFASPFGLDEEFNPVVAMIQGRSDISWSEIQAELLVFEKRLELQ

Query:  NNLKS----SLSLGQGAAV------------------------------NMANDKATSKELLRGVLVEGLYRFDG-AIAAAVDISKSGNYSRGLSSMNKG
        N+ ++    ++ +G G  +                              NMA D  T   LL+G L +GLY  +G A+ +  ++  S +  +    +N  
Subjt:  NNLKS----SLSLGQGAAV------------------------------NMANDKATSKELLRGVLVEGLYRFDG-AIAAAVDISKSGNYSRGLSSMNKG

Query:  ASSAFVLSSSVNVAI--SKVIWHRRLGHPSEKVLDSIVKRCNLSYKVNEK
         +S  VLS  V+ +I  SK +W RRLGHPS K+L+S++K CNL    NE+
Subjt:  ASSAFVLSSSVNVAI--SKVIWHRRLGHPSEKVLDSIVKRCNLSYKVNEK

A0A5D3DHC7 T3P18.3 (e value: 3.4e-20; % identity: 30)
Query:  FQQPTAQSTIKSVTTIKLDRGNFLLWKNLALPILRSYRLEGHLTGEKPCLAKFLQATGGRSNPSSSHAGAGASSSEVAISEAAESLPTQEK-----LLPK
        F  P     +    + KLDR N+LLWK +ALPIL++        GE         +T   S  +SS    G ++S     ++    P  EK     LL  
Subjt:  FQQPTAQSTIKSVTTIKLDRGNFLLWKNLALPILRSYRLEGHLTGEKPCLAKFLQATGGRSNPSSSHAGAGASSSEVAISEAAESLPTQEK-----LLPK

Query:  SWDVRML--------------KSYGQSYKNCLTFNRGRKRIISVKSSNNREKYSYPNFASPFGLDEEFNPVVAMIQGRSDISWSEIQAELLVFEKRLELQ
         W    +              K   ++ KN       +K+I  V SS    K S        GL + +NPV A+IQG+ DI   ++Q+ELL+FEK LE Q
Subjt:  SWDVRML--------------KSYGQSYKNCLTFNRGRKRIISVKSSNNREKYSYPNFASPFGLDEEFNPVVAMIQGRSDISWSEIQAELLVFEKRLELQ

Query:  NNLKS----SLSLGQGAAV------------------------------NMANDKATSKELLRGVLVEGLYRFDG-AIAAAVDISKSGNYSRGLSSMNKG
        N+ ++    ++ +G G  +                              NMA D  T   LL+G L +GLY  +G A+ +  ++  S +  +    +N  
Subjt:  NNLKS----SLSLGQGAAV------------------------------NMANDKATSKELLRGVLVEGLYRFDG-AIAAAVDISKSGNYSRGLSSMNKG

Query:  ASSAFVLSSSVNVAI--SKVIWHRRLGHPSEKVLDSIVKRCNLSYKVNEK
         +S  VLS  V+ +I  SK +W RRLGHPS K+L+S++K CNL    NE+
Subjt:  ASSAFVLSSSVNVAI--SKVIWHRRLGHPSEKVLDSIVKRCNLSYKVNEK

SwissProt top hits: No hits found
Arabidopsis top hits: No hits found

Sequences

CDS sequence
ATGCTAGATTCAGTTAGTGGTGGTGATGCGCTTGGTTCGACACCCAATGAGGCACGAGAGATCAGTACAAACCTGACAGAAAACACTCAGTTCCATGCCACTTCCCATGA
TGAGCTAGAATGCCAAATTTCAAGCAGTAGCAGGCTCAGATGGTATTTAAATAATATGTTTGGAAAGGTTGAAGATTTCTTGATTAAAATTTCTTATGTTGTGCACCCTG
TGTGCAAGATGAGCCTTTCTATGCAGGAATTGAAGAAGAAGAAAGATCCTCTTAGGATCAAAGAGTCGCCTAGGAGGTCTTCACCACGATTTGCAGCTGCCAGTCACGAG
CTCCGATGTCACCTACGTACTCCAGCCAACAGCAAACCTCACGGCGGCGCGTTCTCTCCGGCGAGTCCTGACGTGCGCAGGCGCGACAGTAGCAGTTCTGGCGACTTCTC
TCGGTTCCGTCGGCAGCAACAGTGTTCGACGACGTCTTCTTGCGACGGTGTTGTGATTTTCCAGCGACTGGGTTATGTAGGAGCGACCAATCTTAAGCGAGGTCAAGACG
ACTTTCAACAGAGTTCGTGTGGGTATATTCCTTTGGCGTTTTTGGCGAATTTAGCTTTGGCCGCTGAAACACTGGAAAGAGGCCACCGCCGCTACTGCTTACACGAACGC
ACGTCGCCGGAGACAGATAGGGGAAGGGAAGAGAGAAATCGAGCGTGGGAGAGGAGGTTAGAGAGAGAAATCGACAGAAACGGAGAGACCTTACCAGAACAACCCAAATC
GCTTGAACTCCCTGCCGTGAACCACGAAGACGAAGCCGCCGGACTGCTCTTAAACTGCTGCCACGACACCTCTGCACTCCGGAGAGACACCAACGATCCAAGCTTCGCCG
GAGAGGAGACAACGGCGCCGGATAAATCCAAGAGGTCATTTGATCCTATAGTTTTTAGCTTGCAAGCAACAATGAGAAGCCGCTGCCACCAGAATGAATTTCAGCAGCCC
ACCGCTCAATCAACTATTAAATCAGTGACGACTATCAAACTAGACCGTGGAAATTTTCTTCTGTGGAAGAACCTTGCACTACCGATCCTTAGGAGCTATAGATTGGAGGG
ACATTTAACCGGTGAGAAACCATGCCTAGCAAAATTTCTTCAAGCTACTGGTGGAAGATCGAATCCTAGTTCTTCTCATGCTGGAGCAGGAGCATCGAGCTCAGAGGTCG
CAATAAGTGAGGCGGCAGAATCCCTCCCTACCCAAGAGAAGTTGCTACCCAAGTCATGGGATGTGAGAATGCTCAAGAGTTATGGTCAGTCATACAAGAACTGTTTGACG
TTCAATCGAGGGCGGAAGAGGATTATCTCTGTCAAGTCTTCCAACAATCGAGAAAAGTACTCGTACCCTAATTTTGCAAGTCCTTTTGGACTTGATGAGGAATTCAATCC
GGTCGTAGCCATGATACAAGGGAGGTCAGACATAAGTTGGTCTGAAATACAAGCAGAATTGCTTGTGTTTGAGAAGCGGTTGGAGTTGCAGAACAATTTGAAGAGTTCTT
TATCACTTGGCCAGGGAGCAGCAGTGAATATGGCCAATGACAAGGCTACGAGCAAGGAGCTCTTGAGAGGGGTGCTCGTTGAAGGTTTATATCGATTTGATGGTGCCATT
GCTGCTGCAGTTGATATTTCTAAGTCTGGAAACTACAGTAGAGGTCTATCTAGTATGAATAAGGGTGCATCATCTGCTTTTGTGTTGTCAAGTTCTGTTAATGTTGCGAT
TTCAAAAGTTATATGGCATAGGCGTCTTGGACATCCGTCTGAAAAAGTACTTGATTCAATTGTCAAACGATGTAATCTGTCTTATAAAGTTAATGAGAAGTTATTGTGCA
ATTTTTCGCAAGTGTACGGTTGCCCAAGTAATATTAAAACGAAAAGTATTTCGAGTATCGAATCCTCAGGGAATTTGAATTATATCCATTAA
mRNA sequence
ATGCTAGATTCAGTTAGTGGTGGTGATGCGCTTGGTTCGACACCCAATGAGGCACGAGAGATCAGTACAAACCTGACAGAAAACACTCAGTTCCATGCCACTTCCCATGA
TGAGCTAGAATGCCAAATTTCAAGCAGTAGCAGGCTCAGATGGTATTTAAATAATATGTTTGGAAAGGTTGAAGATTTCTTGATTAAAATTTCTTATGTTGTGCACCCTG
TGTGCAAGATGAGCCTTTCTATGCAGGAATTGAAGAAGAAGAAAGATCCTCTTAGGATCAAAGAGTCGCCTAGGAGGTCTTCACCACGATTTGCAGCTGCCAGTCACGAG
CTCCGATGTCACCTACGTACTCCAGCCAACAGCAAACCTCACGGCGGCGCGTTCTCTCCGGCGAGTCCTGACGTGCGCAGGCGCGACAGTAGCAGTTCTGGCGACTTCTC
TCGGTTCCGTCGGCAGCAACAGTGTTCGACGACGTCTTCTTGCGACGGTGTTGTGATTTTCCAGCGACTGGGTTATGTAGGAGCGACCAATCTTAAGCGAGGTCAAGACG
ACTTTCAACAGAGTTCGTGTGGGTATATTCCTTTGGCGTTTTTGGCGAATTTAGCTTTGGCCGCTGAAACACTGGAAAGAGGCCACCGCCGCTACTGCTTACACGAACGC
ACGTCGCCGGAGACAGATAGGGGAAGGGAAGAGAGAAATCGAGCGTGGGAGAGGAGGTTAGAGAGAGAAATCGACAGAAACGGAGAGACCTTACCAGAACAACCCAAATC
GCTTGAACTCCCTGCCGTGAACCACGAAGACGAAGCCGCCGGACTGCTCTTAAACTGCTGCCACGACACCTCTGCACTCCGGAGAGACACCAACGATCCAAGCTTCGCCG
GAGAGGAGACAACGGCGCCGGATAAATCCAAGAGGTCATTTGATCCTATAGTTTTTAGCTTGCAAGCAACAATGAGAAGCCGCTGCCACCAGAATGAATTTCAGCAGCCC
ACCGCTCAATCAACTATTAAATCAGTGACGACTATCAAACTAGACCGTGGAAATTTTCTTCTGTGGAAGAACCTTGCACTACCGATCCTTAGGAGCTATAGATTGGAGGG
ACATTTAACCGGTGAGAAACCATGCCTAGCAAAATTTCTTCAAGCTACTGGTGGAAGATCGAATCCTAGTTCTTCTCATGCTGGAGCAGGAGCATCGAGCTCAGAGGTCG
CAATAAGTGAGGCGGCAGAATCCCTCCCTACCCAAGAGAAGTTGCTACCCAAGTCATGGGATGTGAGAATGCTCAAGAGTTATGGTCAGTCATACAAGAACTGTTTGACG
TTCAATCGAGGGCGGAAGAGGATTATCTCTGTCAAGTCTTCCAACAATCGAGAAAAGTACTCGTACCCTAATTTTGCAAGTCCTTTTGGACTTGATGAGGAATTCAATCC
GGTCGTAGCCATGATACAAGGGAGGTCAGACATAAGTTGGTCTGAAATACAAGCAGAATTGCTTGTGTTTGAGAAGCGGTTGGAGTTGCAGAACAATTTGAAGAGTTCTT
TATCACTTGGCCAGGGAGCAGCAGTGAATATGGCCAATGACAAGGCTACGAGCAAGGAGCTCTTGAGAGGGGTGCTCGTTGAAGGTTTATATCGATTTGATGGTGCCATT
GCTGCTGCAGTTGATATTTCTAAGTCTGGAAACTACAGTAGAGGTCTATCTAGTATGAATAAGGGTGCATCATCTGCTTTTGTGTTGTCAAGTTCTGTTAATGTTGCGAT
TTCAAAAGTTATATGGCATAGGCGTCTTGGACATCCGTCTGAAAAAGTACTTGATTCAATTGTCAAACGATGTAATCTGTCTTATAAAGTTAATGAGAAGTTATTGTGCA
ATTTTTCGCAAGTGTACGGTTGCCCAAGTAATATTAAAACGAAAAGTATTTCGAGTATCGAATCCTCAGGGAATTTGAATTATATCCATTAA
Protein sequence
MLDSVSGGDALGSTPNEAREISTNLTENTQFHATSHDELECQISSSSRLRWYLNNMFGKVEDFLIKISYVVHPVCKMSLSMQELKKKKDPLRIKESPRRSSPRFAAASHE
LRCHLRTPANSKPHGGAFSPASPDVRRRDSSSSGDFSRFRRQQQCSTTSSCDGVVIFQRLGYVGATNLKRGQDDFQQSSCGYIPLAFLANLALAAETLERGHRRYCLHER
TSPETDRGREERNRAWERRLEREIDRNGETLPEQPKSLELPAVNHEDEAAGLLLNCCHDTSALRRDTNDPSFAGEETTAPDKSKRSFDPIVFSLQATMRSRCHQNEFQQP
TAQSTIKSVTTIKLDRGNFLLWKNLALPILRSYRLEGHLTGEKPCLAKFLQATGGRSNPSSSHAGAGASSSEVAISEAAESLPTQEKLLPKSWDVRMLKSYGQSYKNCLT
FNRGRKRIISVKSSNNREKYSYPNFASPFGLDEEFNPVVAMIQGRSDISWSEIQAELLVFEKRLELQNNLKSSLSLGQGAAVNMANDKATSKELLRGVLVEGLYRFDGAI
AAAVDISKSGNYSRGLSSMNKGASSAFVLSSSVNVAISKVIWHRRLGHPSEKVLDSIVKRCNLSYKVNEKLLCNFSQVYGCPSNIKTKSISSIESSGNLNYIH
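
The CDS and protein sequences listed above can be checked against each other by translating the CDS codon by codon. The following is a minimal sketch that assumes the standard genetic code (NCBI translation table 1); the names `CODON_TABLE` and `translate` are hypothetical helpers and not part of the database. It is not a replacement for an established library.

```python
# Standard genetic code (NCBI translation table 1), with bases ordered T, C, A, G.
BASES = "TCAG"
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds):
    """Translate a CDS string into protein, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # X marks an ambiguous codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First six codons of the CDS listed above.
print(translate("ATGCTAGATTCAGTTAGT"))  # prints "MLDSVS"
```

Passing the full CDS text, with its line breaks, to `translate` and comparing the result with the protein sequence would confirm that the two records are consistent.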