CuGenDBv2

Lag0041353 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0041353
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Retrovirus-related Pol polyprotein from transposon RE1
Genome location: chr13:16270915..16277839
RNA-Seq Expression: Lag0041353
Synteny: Lag0041353
Gene Ontology terms: NA
InterPro domains: NA
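
The genome location above can be split into its parts programmatically. The following is a minimal sketch, not part of the database itself: the function names `parse_location` and `span_length` are hypothetical, and the code assumes the `seqid:start..end` string uses 1-based, inclusive coordinates, which the page does not state explicitly.

```python
import re

# Assumption: locations look like "<seqid>:<start>..<end>" and the
# coordinates are 1-based and inclusive (not confirmed by the page).
LOCATION_RE = re.compile(r"^(?P<seqid>[^:]+):(?P<start>\d+)\.\.(?P<end>\d+)$")


def parse_location(text):
    """Split a location string into (seqid, start, end)."""
    match = LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    return match.group("seqid"), int(match.group("start")), int(match.group("end"))


def span_length(start, end):
    """Length of an inclusive, 1-based interval."""
    return end - start + 1


seqid, start, end = parse_location("chr13:16270915..16277839")
print(seqid, start, end, span_length(start, end))
```

Under the inclusive-coordinate assumption, the locus spans 6925 bp on chr13.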


Homology
GenBank top hits (e value, % identity, alignment)
GFS33695.1 hypothetical protein Acr_00g0030110 [Actinidia rufa] (e value: 9.2e-33; % identity: 44.25)
Query:  MSWIYSSLTEDKIGEIIGCSTAFEIWEHLRIVYESSSTARVMGLRFQLQKIHKDGLTVSQFLAQIKDIADKFSAIGEPLSYRDHLGYILEGLGTEYNPFV
        MSWIY+S+ E  +G+I+G ++A +IWE L  +Y ++S A +  LR  LQ I KDGLT   ++ + + + +  ++IGEP++Y DHL Y L GLG +YNPFV
Subjt:  MSWIYSSLTEDKIGEIIGCSTAFEIWEHLRIVYESSSTARVMGLRFQLQKIHKDGLTVSQFLAQIKDIADKFSAIGEPLSYRDHLGYILEGLGTEYNPFV

Query:  TSIQNRTDRPSLVDVRSLLIAYEARLEKQTSVDQLNMVQANLANFSLSNNQKRNQSNRSSNMPKIPLFPDCHSH
        TSIQ++  RPS+ +V SLL++Y+ARLE+Q++ D L+ +QANLAN +    + +N S  S        FP+ +S+
Subjt:  TSIQNRTDRPSLVDVRSLLIAYEARLEKQTSVDQLNMVQANLANFSLSNNQKRNQSNRSSNMPKIPLFPDCHSH

GFY82848.1 hypothetical protein Acr_02g0010880 [Actinidia rufa] (e value: 9.2e-33; % identity: 44.25)
Query:  MSWIYSSLTEDKIGEIIGCSTAFEIWEHLRIVYESSSTARVMGLRFQLQKIHKDGLTVSQFLAQIKDIADKFSAIGEPLSYRDHLGYILEGLGTEYNPFV
        MSWIY+S+ E  +G+I+G ++A +IWE L  +Y ++S A +  LR  LQ I KDGLT   ++ + + + +  ++IGEP++Y DHL Y L GLG +YNPFV
Subjt:  MSWIYSSLTEDKIGEIIGCSTAFEIWEHLRIVYESSSTARVMGLRFQLQKIHKDGLTVSQFLAQIKDIADKFSAIGEPLSYRDHLGYILEGLGTEYNPFV

Query:  TSIQNRTDRPSLVDVRSLLIAYEARLEKQTSVDQLNMVQANLANFSLSNNQKRNQSNRSSNMPKIPLFPDCHSH
        TSIQ++  RPS+ +V SLL++Y+ARLE+Q++ D L+ +QANLAN +    + +N S  S        FP+ +S+
Subjt:  TSIQNRTDRPSLVDVRSLLIAYEARLEKQTSVDQLNMVQANLANFSLSNNQKRNQSNRSSNMPKIPLFPDCHSH

KAF4357428.1 hypothetical protein F8388_011166 [Cannabis sativa] (e value: 2.1e-32; % identity: 39.89)
Query:  STWKVLLTKYNRTLMSWIYSSLTEDKIGEIIGCSTAFEIWEHLRIVYESSSTARVMGLRFQLQKIHKDGLTVSQFLAQIKDIADKFSAIGEPLSYRDHLG
        +TW     +YN+ LMSW+Y+SL++  +G+I+G +TA EIW  L   Y ++S AR    R  LQ + KD L  S +L ++K + +  +++G+P+S ++HL 
Subjt:  STWKVLLTKYNRTLMSWIYSSLTEDKIGEIIGCSTAFEIWEHLRIVYESSSTARVMGLRFQLQKIHKDGLTVSQFLAQIKDIADKFSAIGEPLSYRDHLG

Query:  YILEGLGTEYNPFVTSIQNRTDRPSLVDVRSLLIAYEARLEKQTSVDQLNMVQANLANFSLSNNQKRNQSNRSSNMPKIPLFP
        Y+L GLG EYN FVT I  R  +P++ +V +LL++YEARLE+Q +    + +QAN AN S    + ++ S + S+ P+ P  P
Subjt:  YILEGLGTEYNPFVTSIQNRTDRPSLVDVRSLLIAYEARLEKQTSVDQLNMVQANLANFSLSNNQKRNQSNRSSNMPKIPLFP

XP_022155181.1 uncharacterized protein LOC111022315 [Momordica charantia] (e value: 1.1e-57; % identity: 47.75)
Query:  YPTSTPFFPAPQPSP---SPFPTLTLPLNIMLTDSNYLLWKIYCSTTSLLSTWKVLL-------------------------TKYNRTLMSWIYSSLTED
        +P  TP F A  P+P   +PFPTL  PLN+ L D+N+LLWK       + +  +  L                          +YNR LM WIYSSL+E+
Subjt:  YPTSTPFFPAPQPSP---SPFPTLTLPLNIMLTDSNYLLWKIYCSTTSLLSTWKVLL-------------------------TKYNRTLMSWIYSSLTED

Query:  KIGEIIGCSTAFEIWEHLRIVYESSSTARVMGLRFQLQKIHKDGLTVSQFLAQIKDIADKFSAIGEPLSYRDHLGYILEGLGTEYNPFVTSIQNRTDRPS
        K+GE++   T  +IW  L  VY+S +TAR+MGL+ +LQ + KDG +VSQ+LA+IK+IADKF+A+GEPLSYRDHL ++L+GLG+EYN FVTSI NR D PS
Subjt:  KIGEIIGCSTAFEIWEHLRIVYESSSTARVMGLRFQLQKIHKDGLTVSQFLAQIKDIADKFSAIGEPLSYRDHLGYILEGLGTEYNPFVTSIQNRTDRPS

Query:  LVDVRSLLIAYEARLEKQTSVDQLNMVQANLANFSLSNNQKRNQSNRSSNMPKIPLFPDCHSHFLSSNAYLFKSQC--SWPSSNFSSSI
        L DVRSLL+AYEARL+KQ +VDQLN+ QANL N SL +N KR     S        FP+       S + L K Q    WP    SS I
Subjt:  LVDVRSLLIAYEARLEKQTSVDQLNMVQANLANFSLSNNQKRNQSNRSSNMPKIPLFPDCHSHFLSSNAYLFKSQC--SWPSSNFSSSI

XP_038887133.1 uncharacterized protein LOC120077323 [Benincasa hispida] (e value: 1.0e-47; % identity: 61.96)
Query:  IGEIIGCSTAFEIWEHLRIVYESSSTARVMGLRFQLQKIHKDGLTVSQFLAQIKDIADKFSAIGEPLSYRDHLGYILEGLGTEYNPFVTSIQNRTDRPSL
        +GEI+G  +AF+IWE LR VYESSS A +MG   QLQKI KDGLTVSQ+LAQIKD+ D F+AIGEPLSYRDHL YILEGLG+EYNPFV+SI NRT+RPS+
Subjt:  IGEIIGCSTAFEIWEHLRIVYESSSTARVMGLRFQLQKIHKDGLTVSQFLAQIKDIADKFSAIGEPLSYRDHLGYILEGLGTEYNPFVTSIQNRTDRPSL

Query:  VDVRSLLIAYEARLEKQTSVDQLNMVQANLANFSLSNN------QKRNQSNRSSNMPKIPLFP
         DVR+LLI Y++RLEKQT+ D L ++QAN+A+ S+++       Q+ N+S+  S+ P +  FP
Subjt:  VDVRSLLIAYEARLEKQTSVDQLNMVQANLANFSLSNN------QKRNQSNRSSNMPKIPLFP

TrEMBL top hits (e value, % identity, alignment)
A0A6J1DQX7 uncharacterized protein LOC111022315 (e value: 5.2e-58; % identity: 47.75)
Query:  YPTSTPFFPAPQPSP---SPFPTLTLPLNIMLTDSNYLLWKIYCSTTSLLSTWKVLL-------------------------TKYNRTLMSWIYSSLTED
        +P  TP F A  P+P   +PFPTL  PLN+ L D+N+LLWK       + +  +  L                          +YNR LM WIYSSL+E+
Subjt:  YPTSTPFFPAPQPSP---SPFPTLTLPLNIMLTDSNYLLWKIYCSTTSLLSTWKVLL-------------------------TKYNRTLMSWIYSSLTED

Query:  KIGEIIGCSTAFEIWEHLRIVYESSSTARVMGLRFQLQKIHKDGLTVSQFLAQIKDIADKFSAIGEPLSYRDHLGYILEGLGTEYNPFVTSIQNRTDRPS
        K+GE++   T  +IW  L  VY+S +TAR+MGL+ +LQ + KDG +VSQ+LA+IK+IADKF+A+GEPLSYRDHL ++L+GLG+EYN FVTSI NR D PS
Subjt:  KIGEIIGCSTAFEIWEHLRIVYESSSTARVMGLRFQLQKIHKDGLTVSQFLAQIKDIADKFSAIGEPLSYRDHLGYILEGLGTEYNPFVTSIQNRTDRPS

Query:  LVDVRSLLIAYEARLEKQTSVDQLNMVQANLANFSLSNNQKRNQSNRSSNMPKIPLFPDCHSHFLSSNAYLFKSQC--SWPSSNFSSSI
        L DVRSLL+AYEARL+KQ +VDQLN+ QANL N SL +N KR     S        FP+       S + L K Q    WP    SS I
Subjt:  LVDVRSLLIAYEARLEKQTSVDQLNMVQANLANFSLSNNQKRNQSNRSSNMPKIPLFPDCHSHFLSSNAYLFKSQC--SWPSSNFSSSI

A0A7J0DER3 Uncharacterized protein (e value: 4.5e-33; % identity: 44.25)
Query:  MSWIYSSLTEDKIGEIIGCSTAFEIWEHLRIVYESSSTARVMGLRFQLQKIHKDGLTVSQFLAQIKDIADKFSAIGEPLSYRDHLGYILEGLGTEYNPFV
        MSWIY+S+ E  +G+I+G ++A +IWE L  +Y ++S A +  LR  LQ I KDGLT   ++ + + + +  ++IGEP++Y DHL Y L GLG +YNPFV
Subjt:  MSWIYSSLTEDKIGEIIGCSTAFEIWEHLRIVYESSSTARVMGLRFQLQKIHKDGLTVSQFLAQIKDIADKFSAIGEPLSYRDHLGYILEGLGTEYNPFV

Query:  TSIQNRTDRPSLVDVRSLLIAYEARLEKQTSVDQLNMVQANLANFSLSNNQKRNQSNRSSNMPKIPLFPDCHSH
        TSIQ++  RPS+ +V SLL++Y+ARLE+Q++ D L+ +QANLAN +    + +N S  S        FP+ +S+
Subjt:  TSIQNRTDRPSLVDVRSLLIAYEARLEKQTSVDQLNMVQANLANFSLSNNQKRNQSNRSSNMPKIPLFPDCHSH

A0A7J0E8R3 Uncharacterized protein (e value: 4.5e-33; % identity: 44.25)
Query:  MSWIYSSLTEDKIGEIIGCSTAFEIWEHLRIVYESSSTARVMGLRFQLQKIHKDGLTVSQFLAQIKDIADKFSAIGEPLSYRDHLGYILEGLGTEYNPFV
        MSWIY+S+ E  +G+I+G ++A +IWE L  +Y ++S A +  LR  LQ I KDGLT   ++ + + + +  ++IGEP++Y DHL Y L GLG +YNPFV
Subjt:  MSWIYSSLTEDKIGEIIGCSTAFEIWEHLRIVYESSSTARVMGLRFQLQKIHKDGLTVSQFLAQIKDIADKFSAIGEPLSYRDHLGYILEGLGTEYNPFV

Query:  TSIQNRTDRPSLVDVRSLLIAYEARLEKQTSVDQLNMVQANLANFSLSNNQKRNQSNRSSNMPKIPLFPDCHSH
        TSIQ++  RPS+ +V SLL++Y+ARLE+Q++ D L+ +QANLAN +    + +N S  S        FP+ +S+
Subjt:  TSIQNRTDRPSLVDVRSLLIAYEARLEKQTSVDQLNMVQANLANFSLSNNQKRNQSNRSSNMPKIPLFPDCHSH

A0A7J6FHQ6 Uncharacterized protein (e value: 9.9e-33; % identity: 39.89)
Query:  STWKVLLTKYNRTLMSWIYSSLTEDKIGEIIGCSTAFEIWEHLRIVYESSSTARVMGLRFQLQKIHKDGLTVSQFLAQIKDIADKFSAIGEPLSYRDHLG
        +TW     +YN+ LMSW+Y+SL++  +G+I+G +TA EIW  L   Y ++S AR    R  LQ + KD L  S +L ++K + +  +++G+P+S ++HL 
Subjt:  STWKVLLTKYNRTLMSWIYSSLTEDKIGEIIGCSTAFEIWEHLRIVYESSSTARVMGLRFQLQKIHKDGLTVSQFLAQIKDIADKFSAIGEPLSYRDHLG

Query:  YILEGLGTEYNPFVTSIQNRTDRPSLVDVRSLLIAYEARLEKQTSVDQLNMVQANLANFSLSNNQKRNQSNRSSNMPKIPLFP
        Y+L GLG EYN FVT I  R  +P++ +V +LL++YEARLE+Q +    + +QAN AN S    + ++ S + S+ P+ P  P
Subjt:  YILEGLGTEYNPFVTSIQNRTDRPSLVDVRSLLIAYEARLEKQTSVDQLNMVQANLANFSLSNNQKRNQSNRSSNMPKIPLFP

A0A803NL56 Uncharacterized protein (e value: 7.6e-33; % identity: 34.38)
Query:  TSTPFFPAPQPSPS---PFPTLTLPLNIMLTDSNYLLWK-------------------IYC------STTSLLSTWKVLLTKYNRTLMSWIYSSLTEDKI
        T+ P   A   SPS    F +    +++ L D+NYL+W+                   + C      ST+S +        +YN+ LMSW+Y+SL++  +
Subjt:  TSTPFFPAPQPSPS---PFPTLTLPLNIMLTDSNYLLWK-------------------IYC------STTSLLSTWKVLLTKYNRTLMSWIYSSLTEDKI

Query:  GEIIGCSTAFEIWEHLRIVYESSSTARVMGLRFQLQKIHKDGLTVSQFLAQIKDIADKFSAIGEPLSYRDHLGYILEGLGTEYNPFVTSIQNRTDRPSLV
        G+I+G +TA EIW  L   Y ++S AR    R  LQ + KD L  S +L ++K + +  +++G+P+S ++HL Y+L GLG EYN FVT I  R  +P++ 
Subjt:  GEIIGCSTAFEIWEHLRIVYESSSTARVMGLRFQLQKIHKDGLTVSQFLAQIKDIADKFSAIGEPLSYRDHLGYILEGLGTEYNPFVTSIQNRTDRPSLV

Query:  DVRSLLIAYEARLEKQTSVDQLNMVQANLANFSLSNNQKRNQSNRSSNMPKIPLFP
        +V +LL++YEARLE+Q +    + +QAN AN S    + ++ S + S+ P+ P  P
Subjt:  DVRSLLIAYEARLEKQTSVDQLNMVQANLANFSLSNNQKRNQSNRSSNMPKIPLFP

SwissProt top hits (e value, % identity, alignment)
Q94HW2 Retrovirus-related Pol polyprotein from transposon RE1 (e value: 1.8e-10; % identity: 23.56)
Query:  WKVLLTKYNRTLMSWIYSSLTEDKIGEIIGCSTAFEIWEHLRIVYESSSTARVMGLRFQLQKIHKDGLTVSQFLAQIKDIADKFSAIGEPLSYRDHLGYI
        WK    + ++ + S +  +++      +   +TA +IWE LR +Y + S   V  LR QL++  K   T+  ++  +    D+ + +G+P+ + + +  +
Subjt:  WKVLLTKYNRTLMSWIYSSLTEDKIGEIIGCSTAFEIWEHLRIVYESSSTARVMGLRFQLQKIHKDGLTVSQFLAQIKDIADKFSAIGEPLSYRDHLGYI

Query:  LEGLGTEYNPFVTSIQNRTDRPSLVDVRSLLIAYEARLEKQTS----------VDQLNMVQANLANFSLSNNQKRNQSNRSSNMPKIPLFPDCHSHFLSS
        LE L  EY P +  I  +   P+L ++   L+ +E+++   +S          V   N    N  N    NN+  N++N +++ P      + H +   S
Subjt:  LEGLGTEYNPFVTSIQNRTDRPSLVDVRSLLIAYEARLEKQTS----------VDQLNMVQANLANFSLSNNQKRNQSNRSSNMPKIPLFPDCHSHFLSS

Query:  NAYLFKSQ
          YL K Q
Subjt:  NAYLFKSQ

Arabidopsis top hits (e value, % identity, alignment)
AT1G34070.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162) (e value: 9.9e-09; % identity: 21.98)
Query:  LPLNIMLTDSNYLLWK----IYCSTTSLLSTWKVLLTKYNRTLMSW----------IYSSLTEDKI-GEIIGCSTAFEIWEHLRIVYESSSTARVMGLRF
        +P+ + + +SNY  W+     +C +  ++      L   N   ++W          +Y +LT  +  G  +  ST+ +IW  ++  + ++  AR + L  
Subjt:  LPLNIMLTDSNYLLWK----IYCSTTSLLSTWKVLLTKYNRTLMSW----------IYSSLTEDKI-GEIIGCSTAFEIWEHLRIVYESSSTARVMGLRF

Query:  QLQKIHKDGLTVSQFLAQIKDIADKFSAIGEPLSYRDHLGYILEGLGTEYNPFVTSIQNRTDRPSLVDVRSLLIAYEARLEK
        +L+      + V+ +  ++K +AD    +  P++ R+ + Y+L GL  +++  +  I++R   PS  D  ++L   E RL++
Subjt:  QLQKIHKDGLTVSQFLAQIKDIADKFSAIGEPLSYRDHLGYILEGLGTEYNPFVTSIQNRTDRPSLVDVRSLLIAYEARLEK


Sequences
CDS sequence
ATGACATTCCACGTTACCTCCAACTTTTCAGTCATTGTGTCAAGAAGTTCATGTGCTCTATTCAAATCATCATTCCATTCATGCCCAATAATCAAAGTCATCATTCCGCA
CATACTCAATAATCAAAGTCTTGTTGCTATATTGTCGATCAACTTACAATTCTCAGTATTCTTCTTTTTGTCCACTTCAAGTTCTTGTTGCTTTGACAATAATGTCCATT
CTTCATTCCTCTTCCTCTACTTCAAAACTGACTCAAGAATTAATGCCTTGGCTTCTTCACCCATAATGGCCTTAGATTTGCCTATGATTATGTGCAAGTTCAAGAAGGGA
AACCCCCTCAAATCTGGCATTCGATCGGCGTTGGCGTTGGCGTTCCAACGTTCGATCGAAGGCAACGGCTGCAACGACGTTGATGTCAAGGTAGACGAGGAGGTTCCAGT
AGTCTCCAAGAACTCAGAAGGCACTGCTTTTGTCCCAATGGGACCTAATGGACCTACATATCAGAAGCTCCAACGATACGAGACTAATCGACTTAAACTCATTAACCAAG
TTAGTCTCCATTCGTTAACTGTGGGTCACTCCACTAAAGACCAACAACTGCACTCTTCTCACTGCAAAATATTTTGTGTCCACGGATATCGACCAATACTACAAGTAGAC
GAGGAGGTTCCAGTAGTCTTCAAGAACTTAGAAGGTACTGCTTTTGTCCCAAGCTGTTGTTTCTCCGTTCCAATGACTAGAGTATCGTCGACACTCATCTTCCTAGGTGA
ACAACAAAAAGCCAAGAAGCTCGGCTTGCCTTCATCCGGCCAAGACACTATTCCTGCACTCCCTACAACCTCAACAGTAGTCACAGTTCCCTCCCTTGTTTCCACCTCCA
TTGCCACCACCCCTGTTTCGACGTCTGGCTCTTCTCATCGAACTCAAATCAGAGGATCCACTCCCTTATCCAATACCAATACCAGACCCTTAAACCCTAATAATCCCCCA
TTTCATTCACATTTTCAGCCTCCCCCAATTTCGTCCACATTTCCCTTCCCAACTGCTGTGCCTCAACCTGGCTTTCAGTACCCTCCTCCCTCTACCCCTGCTTATCCCTT
CATTCCGTCTTACCCTACTTCAACTCCCTTTTTCCCAGCTCCGCAGCCATCACCTAGCCCTTTTCCCACCCTCACTCTGCCTCTCAATATCATGCTCACAGACTCAAATT
ATCTCCTCTGGAAGATTTATTGCTCAACCACATCATTGCTTTCGACATGGAAAGTCTTATTAACGAAATATAATCGCACGTTAATGAGTTGGATTTACTCTTCTCTGACT
GAGGATAAGATAGGTGAAATAATTGGTTGCTCTACTGCTTTTGAAATTTGGGAGCATCTTAGAATTGTTTATGAATCATCTTCCACTGCTCGTGTTATGGGGTTAAGGTT
TCAGTTACAAAAAATCCATAAAGATGGGCTCACTGTGTCTCAGTTCCTAGCTCAGATAAAGGATATAGCGGATAAATTCTCGGCCATTGGTGAACCACTGTCATATAGGG
ACCATTTAGGCTATATTCTTGAAGGATTAGGGACTGAATATAACCCATTTGTGACATCAATACAAAATCGCACTGACCGCCCATCTCTTGTGGATGTCCGTAGCTTGTTG
ATTGCTTATGAAGCTAGGCTTGAAAAACAAACCTCTGTCGACCAGTTGAACATGGTACAAGCTAATTTAGCTAATTTCTCCCTTTCCAACAACCAGAAACGAAACCAATC
CAACCGTTCCTCCAATATGCCCAAGATTCCCCTGTTCCCAGACTGCCACAGTCATTTCCTTTCCTCCAATGCCTACCTCTTCAAATCCCAGTGTTCTTGGCCGTCCTCAA
ACTTCTCCTCGTCCATATACCAACTCAAATTGATGGCCTAA
mRNA sequence
ATGACATTCCACGTTACCTCCAACTTTTCAGTCATTGTGTCAAGAAGTTCATGTGCTCTATTCAAATCATCATTCCATTCATGCCCAATAATCAAAGTCATCATTCCGCA
CATACTCAATAATCAAAGTCTTGTTGCTATATTGTCGATCAACTTACAATTCTCAGTATTCTTCTTTTTGTCCACTTCAAGTTCTTGTTGCTTTGACAATAATGTCCATT
CTTCATTCCTCTTCCTCTACTTCAAAACTGACTCAAGAATTAATGCCTTGGCTTCTTCACCCATAATGGCCTTAGATTTGCCTATGATTATGTGCAAGTTCAAGAAGGGA
AACCCCCTCAAATCTGGCATTCGATCGGCGTTGGCGTTGGCGTTCCAACGTTCGATCGAAGGCAACGGCTGCAACGACGTTGATGTCAAGGTAGACGAGGAGGTTCCAGT
AGTCTCCAAGAACTCAGAAGGCACTGCTTTTGTCCCAATGGGACCTAATGGACCTACATATCAGAAGCTCCAACGATACGAGACTAATCGACTTAAACTCATTAACCAAG
TTAGTCTCCATTCGTTAACTGTGGGTCACTCCACTAAAGACCAACAACTGCACTCTTCTCACTGCAAAATATTTTGTGTCCACGGATATCGACCAATACTACAAGTAGAC
GAGGAGGTTCCAGTAGTCTTCAAGAACTTAGAAGGTACTGCTTTTGTCCCAAGCTGTTGTTTCTCCGTTCCAATGACTAGAGTATCGTCGACACTCATCTTCCTAGGTGA
ACAACAAAAAGCCAAGAAGCTCGGCTTGCCTTCATCCGGCCAAGACACTATTCCTGCACTCCCTACAACCTCAACAGTAGTCACAGTTCCCTCCCTTGTTTCCACCTCCA
TTGCCACCACCCCTGTTTCGACGTCTGGCTCTTCTCATCGAACTCAAATCAGAGGATCCACTCCCTTATCCAATACCAATACCAGACCCTTAAACCCTAATAATCCCCCA
TTTCATTCACATTTTCAGCCTCCCCCAATTTCGTCCACATTTCCCTTCCCAACTGCTGTGCCTCAACCTGGCTTTCAGTACCCTCCTCCCTCTACCCCTGCTTATCCCTT
CATTCCGTCTTACCCTACTTCAACTCCCTTTTTCCCAGCTCCGCAGCCATCACCTAGCCCTTTTCCCACCCTCACTCTGCCTCTCAATATCATGCTCACAGACTCAAATT
ATCTCCTCTGGAAGATTTATTGCTCAACCACATCATTGCTTTCGACATGGAAAGTCTTATTAACGAAATATAATCGCACGTTAATGAGTTGGATTTACTCTTCTCTGACT
GAGGATAAGATAGGTGAAATAATTGGTTGCTCTACTGCTTTTGAAATTTGGGAGCATCTTAGAATTGTTTATGAATCATCTTCCACTGCTCGTGTTATGGGGTTAAGGTT
TCAGTTACAAAAAATCCATAAAGATGGGCTCACTGTGTCTCAGTTCCTAGCTCAGATAAAGGATATAGCGGATAAATTCTCGGCCATTGGTGAACCACTGTCATATAGGG
ACCATTTAGGCTATATTCTTGAAGGATTAGGGACTGAATATAACCCATTTGTGACATCAATACAAAATCGCACTGACCGCCCATCTCTTGTGGATGTCCGTAGCTTGTTG
ATTGCTTATGAAGCTAGGCTTGAAAAACAAACCTCTGTCGACCAGTTGAACATGGTACAAGCTAATTTAGCTAATTTCTCCCTTTCCAACAACCAGAAACGAAACCAATC
CAACCGTTCCTCCAATATGCCCAAGATTCCCCTGTTCCCAGACTGCCACAGTCATTTCCTTTCCTCCAATGCCTACCTCTTCAAATCCCAGTGTTCTTGGCCGTCCTCAA
ACTTCTCCTCGTCCATATACCAACTCAAATTGATGGCCTAA
Protein sequence
MTFHVTSNFSVIVSRSSCALFKSSFHSCPIIKVIIPHILNNQSLVAILSINLQFSVFFFLSTSSSCCFDNNVHSSFLFLYFKTDSRINALASSPIMALDLPMIMCKFKKG
NPLKSGIRSALALAFQRSIEGNGCNDVDVKVDEEVPVVSKNSEGTAFVPMGPNGPTYQKLQRYETNRLKLINQVSLHSLTVGHSTKDQQLHSSHCKIFCVHGYRPILQVD
EEVPVVFKNLEGTAFVPSCCFSVPMTRVSSTLIFLGEQQKAKKLGLPSSGQDTIPALPTTSTVVTVPSLVSTSIATTPVSTSGSSHRTQIRGSTPLSNTNTRPLNPNNPP
FHSHFQPPPISSTFPFPTAVPQPGFQYPPPSTPAYPFIPSYPTSTPFFPAPQPSPSPFPTLTLPLNIMLTDSNYLLWKIYCSTTSLLSTWKVLLTKYNRTLMSWIYSSLT
EDKIGEIIGCSTAFEIWEHLRIVYESSSTARVMGLRFQLQKIHKDGLTVSQFLAQIKDIADKFSAIGEPLSYRDHLGYILEGLGTEYNPFVTSIQNRTDRPSLVDVRSLL
IAYEARLEKQTSVDQLNMVQANLANFSLSNNQKRNQSNRSSNMPKIPLFPDCHSHFLSSNAYLFKSQCSWPSSNFSSSIYQLKLMA
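
The CDS and protein sequences above can be cross-checked by translating the CDS. The sketch below is illustrative only: `translate` is a hypothetical helper, and it assumes the standard nuclear genetic code (NCBI translation table 1), which is the usual choice for a plant nuclear gene but is not stated on the page.

```python
# Standard genetic code (NCBI table 1), codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: aa
    for (a, b, c), aa in zip(
        ((x, y, z) for x in BASES for y in BASES for z in BASES),
        AMINO_ACIDS,
    )
}


def translate(cds):
    """Translate a CDS in frame 0, stopping at the first stop codon.

    Whitespace and line breaks are ignored; a trailing partial codon is
    dropped; codons with non-ACGT characters become 'X'.
    """
    seq = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        residue = CODON_TABLE.get(seq[i:i + 3], "X")
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 18 nucleotides of the CDS listed above.
print(translate("ATGACATTCCACGTTACC"))
```

The first six codons of the CDS translate to MTFHVT, which agrees with the start of the protein sequence. Passing the full CDS text, line breaks included, and comparing the result with the protein sequence extends the same check to the whole gene.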