; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg002750 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg002750
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionDUF4283 domain-containing protein
Genome locationscaffold6:5587174..5593384
RNA-Seq ExpressionSpg002750
SyntenySpg002750
Gene Ontology termsNA
InterPro domainsIPR025558 - Domain of unknown function DUF4283
IPR026960 - Reverse transcriptase zinc-binding domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0039967.1 hypothetical protein E6C27_scaffold122G002490 [Cucumis melo var. makuwa]1.3e-3836.2Show/hide
Query:  ERTISHPLVLPETPCKGEVRKIAWDEVIIITKRYFHDDWGRILDILQRQLESGLVINPFQSNKALLKCPSMELANFLTTNKGWVSFGPVILKVKKWNRMR
        E + S  + + +T     +    W+  +++T+R+FHDDW RI++ L  QL++ +   PF+++KAL+   + E A  L  NKGW + G   +K ++W++  
Subjt:  ERTISHPLVLPETPCKGEVRKIAWDEVIIITKRYFHDDWGRILDILQRQLESGLVINPFQSNKALLKCPSMELANFLTTNKGWVSFGPVILKVKKWNRMR

Query:  HDRINVVSCYGGWVRIRKLPLHLWNTRIFKAIDDRLGGFIDYAKSNSLLIDCVETWIKIKDNYCGFIPSEVEVIDGDQ-VFRTQIITFQDGNLLIDRVAR
        H    V+  YGGW+++R +PLH WN   F  I D  GGF++ AK    L D  E  IKIKDNY GFIP+ +++ D ++  F  Q+I   +G    +R   
Subjt:  HDRINVVSCYGGWVRIRKLPLHLWNTRIFKAIDDRLGGFIDYAKSNSLLIDCVETWIKIKDNYCGFIPSEVEVIDGDQ-VFRTQIITFQDGNLLIDRVAR

Query:  IHGGFSPAAAHVFHRDPMDAE
        IHG F+  AA  F    M++E
Subjt:  IHGGFSPAAAHVFHRDPMDAE

KAA0044333.1 hypothetical protein E6C27_scaffold46G00570 [Cucumis melo var. makuwa]1.3e-3837.88Show/hide
Query:  WDEVIIITKRYFHDDWGRILDILQRQLESGLVINPFQSNKALLKCPSMELANFLTTNKGWVSFGPVILKVKKWNRMRHDRINVVSCYGGWVRIRKLPLHL
        W+  +++T+R+FHDDW +I++ L  QL++ +   PF ++KAL+   ++E AN +  NKGW + G   +K ++WN+  H    V+  YGGW+++R +PLH 
Subjt:  WDEVIIITKRYFHDDWGRILDILQRQLESGLVINPFQSNKALLKCPSMELANFLTTNKGWVSFGPVILKVKKWNRMRHDRINVVSCYGGWVRIRKLPLHL

Query:  WNTRIFKAIDDRLGGFIDYAKSNSLLIDCVETWIKIKDNYCGFIPSEVEVIDGDQ-VFRTQIITFQDGNLLIDRVARIHGGFSPAAAHVFHRDPMDAE
        WN   F  I D  GGF++ AK    L D +E  I++KDNY GFIP+ +++ D ++  F TQ+I    G   ++R   IHG F+  AA  F    +++E
Subjt:  WNTRIFKAIDDRLGGFIDYAKSNSLLIDCVETWIKIKDNYCGFIPSEVEVIDGDQ-VFRTQIITFQDGNLLIDRVARIHGGFSPAAAHVFHRDPMDAE

KAA0050054.1 hypothetical protein E6C27_scaffold675G00340 [Cucumis melo var. makuwa]1.3e-3838.38Show/hide
Query:  WDEVIIITKRYFHDDWGRILDILQRQLESGLVINPFQSNKALLKCPSMELANFLTTNKGWVSFGPVILKVKKWNRMRHDRINVVSCYGGWVRIRKLPLHL
        W+  +++T+R+FHDDW +I++ L  QL++ +   PF ++KAL+   + E AN +  NKGW + G   +K ++WN+  H    V+  YGGW+++R +PLH 
Subjt:  WDEVIIITKRYFHDDWGRILDILQRQLESGLVINPFQSNKALLKCPSMELANFLTTNKGWVSFGPVILKVKKWNRMRHDRINVVSCYGGWVRIRKLPLHL

Query:  WNTRIFKAIDDRLGGFIDYAKSNSLLIDCVETWIKIKDNYCGFIPSEVEVIDGDQ-VFRTQIITFQDGNLLIDRVARIHGGFSPAAAHVFHRDPMDAE
        WN   F  I D  GGFI+ AK    L D +E  I+IKDNY GFIP+ +++ D ++  F  Q+I   +G   ++R   IHG F+  AA  F    +++E
Subjt:  WNTRIFKAIDDRLGGFIDYAKSNSLLIDCVETWIKIKDNYCGFIPSEVEVIDGDQ-VFRTQIITFQDGNLLIDRVARIHGGFSPAAAHVFHRDPMDAE

TYK10355.1 hypothetical protein E5676_scaffold367G00330 [Cucumis melo var. makuwa]1.3e-3838.38Show/hide
Query:  WDEVIIITKRYFHDDWGRILDILQRQLESGLVINPFQSNKALLKCPSMELANFLTTNKGWVSFGPVILKVKKWNRMRHDRINVVSCYGGWVRIRKLPLHL
        W+  +++T+R+FHDDW +I++ L  QL++ +   PF ++KAL+   + E AN +  NKGW + G   +K ++WN+  H    V+  YGGW+++R +PLH 
Subjt:  WDEVIIITKRYFHDDWGRILDILQRQLESGLVINPFQSNKALLKCPSMELANFLTTNKGWVSFGPVILKVKKWNRMRHDRINVVSCYGGWVRIRKLPLHL

Query:  WNTRIFKAIDDRLGGFIDYAKSNSLLIDCVETWIKIKDNYCGFIPSEVEVIDGDQ-VFRTQIITFQDGNLLIDRVARIHGGFSPAAAHVFHRDPMDAE
        WN   F  I D  GGFI+ AK    L D +E  I+IKDNY GFIP+ +++ D ++  F  Q+I   +G   ++R   IHG F+  AA  F    +++E
Subjt:  WNTRIFKAIDDRLGGFIDYAKSNSLLIDCVETWIKIKDNYCGFIPSEVEVIDGDQ-VFRTQIITFQDGNLLIDRVARIHGGFSPAAAHVFHRDPMDAE

XP_022149859.1 uncharacterized protein LOC111018186 [Momordica charantia]3.7e-6549.11Show/hide
Query:  EVRKIAWDEVIIITKRYFHDDWGRILDILQRQLESGLVINPFQSNKALLKCPSMELANFLTTNKGWVSFGPVILKVKKWNRMRHDRINVVSCYGGWVRIR
        EVR++ W+E I+IT+R FHDDW RIL  ++ Q ES  +INPFQ++KAL+KCPS +LA  L TNKGWV+FGPV +K++ WN + H R  +   YG WV+IR
Subjt:  EVRKIAWDEVIIITKRYFHDDWGRILDILQRQLESGLVINPFQSNKALLKCPSMELANFLTTNKGWVSFGPVILKVKKWNRMRHDRINVVSCYGGWVRIR

Query:  KLPLHLWNTRIFKAIDDRLGGFIDYAKSNSLLIDCVETWIKIKDNYCGFIPSEVEVIDGDQVFRTQIITFQDGNLLIDRVARIHGGFSPAAAHVFHRDPM
         +PLHLW+   FKAI + LGGFIDY  +NS  I+C +  IK+K NYCGFIP+E+  +DG   F+ ++++F+D   L  +   IHGGFS  AA  FH+   
Subjt:  KLPLHLWNTRIFKAIDDRLGGFIDYAKSNSLLIDCVETWIKIKDNYCGFIPSEVEVIDGDQVFRTQIITFQDGNLLIDRVARIHGGFSPAAAHVFHRDPM

Query:  DAEFSTADKWRIEDGTFFPQADVQ
        +   ++ D+WR+E+G  +P  ++Q
Subjt:  DAEFSTADKWRIEDGTFFPQADVQ

TrEMBL top hitse value%identityAlignment
A0A5A7TEK8 DUF4283 domain-containing protein6.4e-3936.2Show/hide
Query:  ERTISHPLVLPETPCKGEVRKIAWDEVIIITKRYFHDDWGRILDILQRQLESGLVINPFQSNKALLKCPSMELANFLTTNKGWVSFGPVILKVKKWNRMR
        E + S  + + +T     +    W+  +++T+R+FHDDW RI++ L  QL++ +   PF+++KAL+   + E A  L  NKGW + G   +K ++W++  
Subjt:  ERTISHPLVLPETPCKGEVRKIAWDEVIIITKRYFHDDWGRILDILQRQLESGLVINPFQSNKALLKCPSMELANFLTTNKGWVSFGPVILKVKKWNRMR

Query:  HDRINVVSCYGGWVRIRKLPLHLWNTRIFKAIDDRLGGFIDYAKSNSLLIDCVETWIKIKDNYCGFIPSEVEVIDGDQ-VFRTQIITFQDGNLLIDRVAR
        H    V+  YGGW+++R +PLH WN   F  I D  GGF++ AK    L D  E  IKIKDNY GFIP+ +++ D ++  F  Q+I   +G    +R   
Subjt:  HDRINVVSCYGGWVRIRKLPLHLWNTRIFKAIDDRLGGFIDYAKSNSLLIDCVETWIKIKDNYCGFIPSEVEVIDGDQ-VFRTQIITFQDGNLLIDRVAR

Query:  IHGGFSPAAAHVFHRDPMDAE
        IHG F+  AA  F    M++E
Subjt:  IHGGFSPAAAHVFHRDPMDAE

A0A5A7TLE2 DUF4283 domain-containing protein6.4e-3937.88Show/hide
Query:  WDEVIIITKRYFHDDWGRILDILQRQLESGLVINPFQSNKALLKCPSMELANFLTTNKGWVSFGPVILKVKKWNRMRHDRINVVSCYGGWVRIRKLPLHL
        W+  +++T+R+FHDDW +I++ L  QL++ +   PF ++KAL+   ++E AN +  NKGW + G   +K ++WN+  H    V+  YGGW+++R +PLH 
Subjt:  WDEVIIITKRYFHDDWGRILDILQRQLESGLVINPFQSNKALLKCPSMELANFLTTNKGWVSFGPVILKVKKWNRMRHDRINVVSCYGGWVRIRKLPLHL

Query:  WNTRIFKAIDDRLGGFIDYAKSNSLLIDCVETWIKIKDNYCGFIPSEVEVIDGDQ-VFRTQIITFQDGNLLIDRVARIHGGFSPAAAHVFHRDPMDAE
        WN   F  I D  GGF++ AK    L D +E  I++KDNY GFIP+ +++ D ++  F TQ+I    G   ++R   IHG F+  AA  F    +++E
Subjt:  WNTRIFKAIDDRLGGFIDYAKSNSLLIDCVETWIKIKDNYCGFIPSEVEVIDGDQ-VFRTQIITFQDGNLLIDRVARIHGGFSPAAAHVFHRDPMDAE

A0A5A7U495 DUF4283 domain-containing protein6.4e-3938.38Show/hide
Query:  WDEVIIITKRYFHDDWGRILDILQRQLESGLVINPFQSNKALLKCPSMELANFLTTNKGWVSFGPVILKVKKWNRMRHDRINVVSCYGGWVRIRKLPLHL
        W+  +++T+R+FHDDW +I++ L  QL++ +   PF ++KAL+   + E AN +  NKGW + G   +K ++WN+  H    V+  YGGW+++R +PLH 
Subjt:  WDEVIIITKRYFHDDWGRILDILQRQLESGLVINPFQSNKALLKCPSMELANFLTTNKGWVSFGPVILKVKKWNRMRHDRINVVSCYGGWVRIRKLPLHL

Query:  WNTRIFKAIDDRLGGFIDYAKSNSLLIDCVETWIKIKDNYCGFIPSEVEVIDGDQ-VFRTQIITFQDGNLLIDRVARIHGGFSPAAAHVFHRDPMDAE
        WN   F  I D  GGFI+ AK    L D +E  I+IKDNY GFIP+ +++ D ++  F  Q+I   +G   ++R   IHG F+  AA  F    +++E
Subjt:  WNTRIFKAIDDRLGGFIDYAKSNSLLIDCVETWIKIKDNYCGFIPSEVEVIDGDQ-VFRTQIITFQDGNLLIDRVARIHGGFSPAAAHVFHRDPMDAE

A0A5D3CFS8 DUF4283 domain-containing protein6.4e-3938.38Show/hide
Query:  WDEVIIITKRYFHDDWGRILDILQRQLESGLVINPFQSNKALLKCPSMELANFLTTNKGWVSFGPVILKVKKWNRMRHDRINVVSCYGGWVRIRKLPLHL
        W+  +++T+R+FHDDW +I++ L  QL++ +   PF ++KAL+   + E AN +  NKGW + G   +K ++WN+  H    V+  YGGW+++R +PLH 
Subjt:  WDEVIIITKRYFHDDWGRILDILQRQLESGLVINPFQSNKALLKCPSMELANFLTTNKGWVSFGPVILKVKKWNRMRHDRINVVSCYGGWVRIRKLPLHL

Query:  WNTRIFKAIDDRLGGFIDYAKSNSLLIDCVETWIKIKDNYCGFIPSEVEVIDGDQ-VFRTQIITFQDGNLLIDRVARIHGGFSPAAAHVFHRDPMDAE
        WN   F  I D  GGFI+ AK    L D +E  I+IKDNY GFIP+ +++ D ++  F  Q+I   +G   ++R   IHG F+  AA  F    +++E
Subjt:  WNTRIFKAIDDRLGGFIDYAKSNSLLIDCVETWIKIKDNYCGFIPSEVEVIDGDQ-VFRTQIITFQDGNLLIDRVARIHGGFSPAAAHVFHRDPMDAE

A0A6J1D6X4 uncharacterized protein LOC1110181861.8e-6549.11Show/hide
Query:  EVRKIAWDEVIIITKRYFHDDWGRILDILQRQLESGLVINPFQSNKALLKCPSMELANFLTTNKGWVSFGPVILKVKKWNRMRHDRINVVSCYGGWVRIR
        EVR++ W+E I+IT+R FHDDW RIL  ++ Q ES  +INPFQ++KAL+KCPS +LA  L TNKGWV+FGPV +K++ WN + H R  +   YG WV+IR
Subjt:  EVRKIAWDEVIIITKRYFHDDWGRILDILQRQLESGLVINPFQSNKALLKCPSMELANFLTTNKGWVSFGPVILKVKKWNRMRHDRINVVSCYGGWVRIR

Query:  KLPLHLWNTRIFKAIDDRLGGFIDYAKSNSLLIDCVETWIKIKDNYCGFIPSEVEVIDGDQVFRTQIITFQDGNLLIDRVARIHGGFSPAAAHVFHRDPM
         +PLHLW+   FKAI + LGGFIDY  +NS  I+C +  IK+K NYCGFIP+E+  +DG   F+ ++++F+D   L  +   IHGGFS  AA  FH+   
Subjt:  KLPLHLWNTRIFKAIDDRLGGFIDYAKSNSLLIDCVETWIKIKDNYCGFIPSEVEVIDGDQVFRTQIITFQDGNLLIDRVARIHGGFSPAAAHVFHRDPM

Query:  DAEFSTADKWRIEDGTFFPQADVQ
        +   ++ D+WR+E+G  +P  ++Q
Subjt:  DAEFSTADKWRIEDGTFFPQADVQ

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G43760.1 DNAse I-like superfamily protein1.3e-0430.16Show/hide
Query:  QKCKIKWMREGDENTAFFHRWAGSMKSKSFIATLESEEGGFLSSKADIEKEIIGFFTKLYSKD
        QK +IKW+++GD NT FFH+   + ++K+ I  L  ++   + +   +++ I+ ++T L   D
Subjt:  QKCKIKWMREGDENTAFFHRWAGSMKSKSFIATLESEEGGFLSSKADIEKEIIGFFTKLYSKD


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATTAAGAACGTCGGCAATCGAACGACTTTTCATCGAGGACAAGCGATAAAGATTAGGAAATTTAGAGCAGAGAGAACTATCTCCCACCCACTTGTCCTCCCAGAAAC
ACCGTGCAAGGGCGAAGTGAGGAAGATTGCTTGGGATGAAGTGATTATTATCACTAAGAGATACTTTCACGATGACTGGGGAAGGATCCTCGACATTCTTCAGAGGCAGC
TAGAGAGTGGTCTTGTTATAAACCCTTTCCAATCGAACAAGGCTCTGCTCAAATGTCCTTCCATGGAGCTGGCGAATTTCTTAACAACAAACAAGGGATGGGTCAGCTTT
GGGCCTGTGATATTAAAAGTCAAGAAATGGAATAGGATGAGGCACGATAGAATCAATGTAGTGTCGTGTTATGGGGGTTGGGTGAGAATCAGAAAACTCCCGTTGCATTT
ATGGAACACCAGAATATTTAAGGCTATTGACGATCGCTTGGGAGGGTTTATTGATTATGCTAAATCTAACTCTCTCCTCATTGATTGCGTTGAGACATGGATTAAAATTA
AAGATAACTACTGTGGGTTTATTCCAAGTGAAGTCGAAGTTATCGACGGGGACCAAGTTTTCAGAACGCAAATCATCACATTTCAAGATGGAAACTTGCTTATTGACCGT
GTGGCCAGAATTCATGGGGGCTTCTCGCCGGCGGCAGCTCACGTCTTCCACAGGGATCCGATGGATGCTGAGTTTAGCACAGCGGACAAATGGAGAATTGAAGATGGAAC
CTTTTTTCCGCAGGCCGATGTCCAGAAAGTGTTTGCAGAGCACGAGGTTAATGTGTTGAAACGTGGAGAGATCCAACTTACGGATGAGAAAAGAAGCCCACAACATCAGG
AGCACCCCGACAGAGAAGAAGATGTGGATGAATCAGACTTTTCGGTCTCAAGTCTTGGGAGTAAAACAGGGGACGAGACTTCGGTTGAAAGGGATCAGTCTGATCTTAAC
GAAGGACTCCCAAAGGAATATTACAAGTGTTTTCAGAGGGAAGGGGAAGGAACATCTCGGTCAGAATATGCTGAAACAGAGCGAAGCAGCCTGGATGGTGACCCTGAGAG
TAATGAGATAATTCCTCTGTCTTTGACCAATACAGGGGAAGGAAGCAAGGAATTGGAGTTAGACGAGATCTGTGAAGAATTCCCGAAGGCAATTGCTATGATTCCGTCAG
TGGAGAAACAGAAAACAAAAGGGGTGGAAACTCCAGAAGGCTTCGTTATTAATAAAGAGATTATCCTCACCCTTAAGAAGAATAATCTTTGTATTAGGCTTATTTCAGGA
GCTTCGTCTAAAAAAGAGTGCACTTTCCACGGCCAGATTAAAGGTTGGGTGACTGGCGTTTATGGCCCATGTTCGATTAATGATAGGAAAAGATTCTTGCTAGAGTTGGC
TGATGTTGCTAGGTTGTGTCATGGCATATGGTGCGTTGCTGCTGAAAGGCTTAAGCTGAGAGCTTCCCTCCTTGATATCACGGTGACTGATCAAAGGAGGCTTCTTCAGA
AATGCAAAATCAAATGGATGAGGGAAGGTGATGAGAATACCGCTTTTTTCCATAGATGGGCAGGTTCCATGAAGAGCAAATCCTTCATTGCCACGCTTGAAAGTGAAGAG
GGAGGATTCCTCTCATCTAAAGCCGATATTGAGAAGGAGATAATTGGTTTCTTTACCAAATTGTACTCAAAAGATGATAGCTCCCGCTATGGGAGAAGAATAAGGTTCTG
GAGTGATGCTTGGTGTGATATGCAGCCCCTTAAAGCTGTTTTCCCAGATTTATTCAAGATCTCCCATAAGAAATCTGCTTCTGTTGTAGAATGCTGGGAGGACAGCACCC
AAACGTGGAACCTTGGCCTTCGAAGAGGGCTTTTTGACCGTGAGTTGAGTAGTTGGGTGGCCCTCATTGGCAAGTTAGACAATATCCGTATGGGAAATGAGATGGACAGA
ATCACCTGGAAGTTGGAAGGCTCGGGGCTTTTTACTACTAAATCTTTGTTCAGCAGTTCGGTTGGAAAATCCCCCAAGATCAATCTGTCTCTTGCAGGCAAGATTTGGAA
ACATAAGTCTCCTAAGAAAGTGAAAATCTTCCTTTGGAGCGTGGTTTATAGAAGTTTGAACACGGATGATAAAGTGCAAAGAAAGATGAAAAACTGGGCTCTATCCCCCT
CGGGGGTGCAGACTTTGTTTAAAAGAGAGTGA
mRNA sequenceShow/hide mRNA sequence
ATGATTAAGAACGTCGGCAATCGAACGACTTTTCATCGAGGACAAGCGATAAAGATTAGGAAATTTAGAGCAGAGAGAACTATCTCCCACCCACTTGTCCTCCCAGAAAC
ACCGTGCAAGGGCGAAGTGAGGAAGATTGCTTGGGATGAAGTGATTATTATCACTAAGAGATACTTTCACGATGACTGGGGAAGGATCCTCGACATTCTTCAGAGGCAGC
TAGAGAGTGGTCTTGTTATAAACCCTTTCCAATCGAACAAGGCTCTGCTCAAATGTCCTTCCATGGAGCTGGCGAATTTCTTAACAACAAACAAGGGATGGGTCAGCTTT
GGGCCTGTGATATTAAAAGTCAAGAAATGGAATAGGATGAGGCACGATAGAATCAATGTAGTGTCGTGTTATGGGGGTTGGGTGAGAATCAGAAAACTCCCGTTGCATTT
ATGGAACACCAGAATATTTAAGGCTATTGACGATCGCTTGGGAGGGTTTATTGATTATGCTAAATCTAACTCTCTCCTCATTGATTGCGTTGAGACATGGATTAAAATTA
AAGATAACTACTGTGGGTTTATTCCAAGTGAAGTCGAAGTTATCGACGGGGACCAAGTTTTCAGAACGCAAATCATCACATTTCAAGATGGAAACTTGCTTATTGACCGT
GTGGCCAGAATTCATGGGGGCTTCTCGCCGGCGGCAGCTCACGTCTTCCACAGGGATCCGATGGATGCTGAGTTTAGCACAGCGGACAAATGGAGAATTGAAGATGGAAC
CTTTTTTCCGCAGGCCGATGTCCAGAAAGTGTTTGCAGAGCACGAGGTTAATGTGTTGAAACGTGGAGAGATCCAACTTACGGATGAGAAAAGAAGCCCACAACATCAGG
AGCACCCCGACAGAGAAGAAGATGTGGATGAATCAGACTTTTCGGTCTCAAGTCTTGGGAGTAAAACAGGGGACGAGACTTCGGTTGAAAGGGATCAGTCTGATCTTAAC
GAAGGACTCCCAAAGGAATATTACAAGTGTTTTCAGAGGGAAGGGGAAGGAACATCTCGGTCAGAATATGCTGAAACAGAGCGAAGCAGCCTGGATGGTGACCCTGAGAG
TAATGAGATAATTCCTCTGTCTTTGACCAATACAGGGGAAGGAAGCAAGGAATTGGAGTTAGACGAGATCTGTGAAGAATTCCCGAAGGCAATTGCTATGATTCCGTCAG
TGGAGAAACAGAAAACAAAAGGGGTGGAAACTCCAGAAGGCTTCGTTATTAATAAAGAGATTATCCTCACCCTTAAGAAGAATAATCTTTGTATTAGGCTTATTTCAGGA
GCTTCGTCTAAAAAAGAGTGCACTTTCCACGGCCAGATTAAAGGTTGGGTGACTGGCGTTTATGGCCCATGTTCGATTAATGATAGGAAAAGATTCTTGCTAGAGTTGGC
TGATGTTGCTAGGTTGTGTCATGGCATATGGTGCGTTGCTGCTGAAAGGCTTAAGCTGAGAGCTTCCCTCCTTGATATCACGGTGACTGATCAAAGGAGGCTTCTTCAGA
AATGCAAAATCAAATGGATGAGGGAAGGTGATGAGAATACCGCTTTTTTCCATAGATGGGCAGGTTCCATGAAGAGCAAATCCTTCATTGCCACGCTTGAAAGTGAAGAG
GGAGGATTCCTCTCATCTAAAGCCGATATTGAGAAGGAGATAATTGGTTTCTTTACCAAATTGTACTCAAAAGATGATAGCTCCCGCTATGGGAGAAGAATAAGGTTCTG
GAGTGATGCTTGGTGTGATATGCAGCCCCTTAAAGCTGTTTTCCCAGATTTATTCAAGATCTCCCATAAGAAATCTGCTTCTGTTGTAGAATGCTGGGAGGACAGCACCC
AAACGTGGAACCTTGGCCTTCGAAGAGGGCTTTTTGACCGTGAGTTGAGTAGTTGGGTGGCCCTCATTGGCAAGTTAGACAATATCCGTATGGGAAATGAGATGGACAGA
ATCACCTGGAAGTTGGAAGGCTCGGGGCTTTTTACTACTAAATCTTTGTTCAGCAGTTCGGTTGGAAAATCCCCCAAGATCAATCTGTCTCTTGCAGGCAAGATTTGGAA
ACATAAGTCTCCTAAGAAAGTGAAAATCTTCCTTTGGAGCGTGGTTTATAGAAGTTTGAACACGGATGATAAAGTGCAAAGAAAGATGAAAAACTGGGCTCTATCCCCCT
CGGGGGTGCAGACTTTGTTTAAAAGAGAGTGA
Protein sequenceShow/hide protein sequence
MIKNVGNRTTFHRGQAIKIRKFRAERTISHPLVLPETPCKGEVRKIAWDEVIIITKRYFHDDWGRILDILQRQLESGLVINPFQSNKALLKCPSMELANFLTTNKGWVSF
GPVILKVKKWNRMRHDRINVVSCYGGWVRIRKLPLHLWNTRIFKAIDDRLGGFIDYAKSNSLLIDCVETWIKIKDNYCGFIPSEVEVIDGDQVFRTQIITFQDGNLLIDR
VARIHGGFSPAAAHVFHRDPMDAEFSTADKWRIEDGTFFPQADVQKVFAEHEVNVLKRGEIQLTDEKRSPQHQEHPDREEDVDESDFSVSSLGSKTGDETSVERDQSDLN
EGLPKEYYKCFQREGEGTSRSEYAETERSSLDGDPESNEIIPLSLTNTGEGSKELELDEICEEFPKAIAMIPSVEKQKTKGVETPEGFVINKEIILTLKKNNLCIRLISG
ASSKKECTFHGQIKGWVTGVYGPCSINDRKRFLLELADVARLCHGIWCVAAERLKLRASLLDITVTDQRRLLQKCKIKWMREGDENTAFFHRWAGSMKSKSFIATLESEE
GGFLSSKADIEKEIIGFFTKLYSKDDSSRYGRRIRFWSDAWCDMQPLKAVFPDLFKISHKKSASVVECWEDSTQTWNLGLRRGLFDRELSSWVALIGKLDNIRMGNEMDR
ITWKLEGSGLFTTKSLFSSSVGKSPKINLSLAGKIWKHKSPKKVKIFLWSVVYRSLNTDDKVQRKMKNWALSPSGVQTLFKRE