; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0010730 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0010730
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionReverse transcriptase
Genome locationchr1:4819359..4825565
RNA-Seq ExpressionLag0010730
SyntenyLag0010730
Gene Ontology termsNA
InterPro domainsIPR005162 - Retrotransposon gag domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022151688.1 uncharacterized protein LOC111019603 [Momordica charantia]5.3e-3739.02Show/hide
Query:  APPVLAAEALQAMLGNAF-LNNLQHVGANGAPAHGEDVQFIKSFMKAKPPSFDGHSDSSEAVVEWIAALEAIFQFLGANAQQRVQGAAFMFKGHARTWWK
        A   L   ALQA++ N+      Q +    A A   + QFI+ F +  PP+F+G S+ +  V EWI  LEA++ +LG + Q +V+GA FM +G A  WW 
Subjt:  APPVLAAEALQAMLGNAF-LNNLQHVGANGAPAHGEDVQFIKSFMKAKPPSFDGHSDSSEAVVEWIAALEAIFQFLGANAQQRVQGAAFMFKGHARTWWK

Query:  VVGQTKNRPENPISWLGFKGLVRDHFGCRFAGVEQEVEFVSLVQGTLSVEQYVRRFEELSYRVPGLVATEEIRINRFVNGLRAEIRGLVRLGRPTTFAAA
        VV   ++    PI+W   K L+ D++  +    E+E+EF+ L Q TL V QY ++F E S     L+ TE  +I RFV GL   I+G + L RPTT+A A
Subjt:  VVGQTKNRPENPISWLGFKGLVRDHFGCRFAGVEQEVEFVSLVQGTLSVEQYVRRFEELSYRVPGLVATEEIRINRFVNGLRAEIRGLVRLGRPTTFAAA

Query:  LASARMLDRD-IPRAEQSQEVGTSSGAKKKSEVEVLAASQKVRSSP
        +  A ++D+D I +A+  Q+VG SSG K+K  V  +++SQ  ++SP
Subjt:  LASARMLDRD-IPRAEQSQEVGTSSGAKKKSEVEVLAASQKVRSSP

XP_022155925.1 uncharacterized protein LOC111022925 [Momordica charantia]3.8e-3538.19Show/hide
Query:  PEVNSHPEANPPVPP--PAPPVLAAEALQAMLGNAFLNNLQHVGANGA----PAHG----EDVQFIKSFMKAKPPSFDGHSDSSEAVVEWIAALEAIFQF
        P       A+  VPP  P   VL AEALQ +L NA        GA GA    P+ G    E+VQFI+ F +  PP F+G S+   A  EW+  LEA++ +
Subjt:  PEVNSHPEANPPVPP--PAPPVLAAEALQAMLGNAFLNNLQHVGANGA----PAHG----EDVQFIKSFMKAKPPSFDGHSDSSEAVVEWIAALEAIFQF

Query:  LGANAQQRVQGAAFMFKGHARTWWKVVGQTKNRPENPISWLGFKGLVRDHFGCRFAGVEQEVEFVSLVQGTLSVEQYVRRFEELSYRVPGLVATEEIRIN
        LG +   +V+GA FM +G A  WW+ V   ++    P++W  FK L+ +++       E+  EF+ L Q +L V QY R+F ELS      + TE+++I+
Subjt:  LGANAQQRVQGAAFMFKGHARTWWKVVGQTKNRPENPISWLGFKGLVRDHFGCRFAGVEQEVEFVSLVQGTLSVEQYVRRFEELSYRVPGLVATEEIRIN

Query:  RFVNGLRAEIRGLVRLGRPTTFAAALASARMLDRDIPRAEQSQEVGTSSGAKKK
        +F++GLR EI+GL+ L  PTT+AAA+  A ++D+ +   +  Q +G+SSG K+K
Subjt:  RFVNGLRAEIRGLVRLGRPTTFAAALASARMLDRDIPRAEQSQEVGTSSGAKKK

XP_022156326.1 uncharacterized protein LOC111023247 [Momordica charantia]2.4e-3745.88Show/hide
Query:  DVQFIKSFMKAKPPSFDGHSDSSEAVVEWIAALEAIFQFLGANAQQRVQGAAFMFKGHARTWWKVVGQTKNRPENPISWLGFKGLVRDHFGCRFAGVEQE
        + +FIK F +  PP+FDG S+ + AV EWI  LEA++ +LG   Q +V+GA FM +G A  WW  V   ++    PI W  FK L+ D++        +E
Subjt:  DVQFIKSFMKAKPPSFDGHSDSSEAVVEWIAALEAIFQFLGANAQQRVQGAAFMFKGHARTWWKVVGQTKNRPENPISWLGFKGLVRDHFGCRFAGVEQE

Query:  VEFVSLVQGTLSVEQYVRRFEELSYRVPGLVATEEIRINRFVNGLRAEIRGLVRLGRPTTFAAALASARMLDRDIP-RAEQSQEVGTSSGAKKK
         EF+ LVQGTLSV QY R+F ELS     L+ TE ++I RFV GLR  IRG V L RPTT+A A+  A ++D+D+  +A    EVG+SSG K+K
Subjt:  VEFVSLVQGTLSVEQYVRRFEELSYRVPGLVATEEIRINRFVNGLRAEIRGLVRLGRPTTFAAALASARMLDRDIP-RAEQSQEVGTSSGAKKK

XP_022156328.1 LOW QUALITY PROTEIN: uncharacterized protein LOC111023249 [Momordica charantia]4.5e-3638.25Show/hide
Query:  PPVPPPAPP---------VLAAEALQAMLGNAFLNNLQHVGANGAPAH--------GEDVQFIKSFMKAKPPSFDGHSDSSEAVVEWIAALEAIFQFLGA
        PPVP  AP           L AEALQ +L NA        GA GA            ++VQFI+ F    PP F+G S+   A  EW+  LEA++ +LG 
Subjt:  PPVPPPAPP---------VLAAEALQAMLGNAFLNNLQHVGANGAPAH--------GEDVQFIKSFMKAKPPSFDGHSDSSEAVVEWIAALEAIFQFLGA

Query:  NAQQRVQGAAFMFKGHARTWWKVVGQTKNRPENPISWLGFKGLVRDHFGCRFAGVEQEVEFVSLVQGTLSVEQYVRRFEELSYRVPGLVATEEIRINRFV
        +   +V+GA FM +G A  WW+ V   ++    P++W  FK L+ +++    A  E+ VEF+ L QG+L+V QY R+F ELS      V TE+++I++F+
Subjt:  NAQQRVQGAAFMFKGHARTWWKVVGQTKNRPENPISWLGFKGLVRDHFGCRFAGVEQEVEFVSLVQGTLSVEQYVRRFEELSYRVPGLVATEEIRINRFV

Query:  NGLRAEIRGLVRLGRPTTFAAALASARMLDRDIPRAEQSQEVGTSSGAKKK
        +GLR EI+GL+ L  PTT+AAA+  A ++D+ +   +  Q +G++SG K+K
Subjt:  NGLRAEIRGLVRLGRPTTFAAALASARMLDRDIPRAEQSQEVGTSSGAKKK

XP_022157413.1 uncharacterized protein LOC111024114 [Momordica charantia]6.5e-3537.31Show/hide
Query:  GRSGPPDVPEVNSHPEANPPVPPPAPPVLAAEALQAMLGNAFLNNLQHVGANGAPAH--------GEDVQFIKSFMKAKPPSFDGHSDSSEAVVEWIAAL
        GR  PP VP+  + P+  P V P     L AEALQ +L NA        GA GA            ++VQFI+ F +  PP F+G S+   A  EW+  L
Subjt:  GRSGPPDVPEVNSHPEANPPVPPPAPPVLAAEALQAMLGNAFLNNLQHVGANGAPAH--------GEDVQFIKSFMKAKPPSFDGHSDSSEAVVEWIAAL

Query:  EAIFQFLGANAQQRVQGAAFMFKGHARTWWKVVGQTKNRPENPISWLGFKGLVRDHFGCRFAGVEQEVEFVSLVQGTLSVEQYVRRFEELSYRVPGLVAT
        EA++ +LG +   +V+GA FM +G A  WW+ V   ++    P++W  FK L+ +++       E+  EF+ L QG+L+V QY R+F ELS      + T
Subjt:  EAIFQFLGANAQQRVQGAAFMFKGHARTWWKVVGQTKNRPENPISWLGFKGLVRDHFGCRFAGVEQEVEFVSLVQGTLSVEQYVRRFEELSYRVPGLVAT

Query:  EEIRINRFVNGLRAEIRGLVRLGRPTTFAAALASARMLDRDIPRAEQSQEVGTSSGAKKK
        E+++I++F++GLR EI+GL+ +  PTT+AAA+  A ++D+ +   +  Q +G+SSG K+K
Subjt:  EEIRINRFVNGLRAEIRGLVRLGRPTTFAAALASARMLDRDIPRAEQSQEVGTSSGAKKK

TrEMBL top hitse value%identityAlignment
A0A6J1DCW8 uncharacterized protein LOC1110196032.6e-3739.02Show/hide
Query:  APPVLAAEALQAMLGNAF-LNNLQHVGANGAPAHGEDVQFIKSFMKAKPPSFDGHSDSSEAVVEWIAALEAIFQFLGANAQQRVQGAAFMFKGHARTWWK
        A   L   ALQA++ N+      Q +    A A   + QFI+ F +  PP+F+G S+ +  V EWI  LEA++ +LG + Q +V+GA FM +G A  WW 
Subjt:  APPVLAAEALQAMLGNAF-LNNLQHVGANGAPAHGEDVQFIKSFMKAKPPSFDGHSDSSEAVVEWIAALEAIFQFLGANAQQRVQGAAFMFKGHARTWWK

Query:  VVGQTKNRPENPISWLGFKGLVRDHFGCRFAGVEQEVEFVSLVQGTLSVEQYVRRFEELSYRVPGLVATEEIRINRFVNGLRAEIRGLVRLGRPTTFAAA
        VV   ++    PI+W   K L+ D++  +    E+E+EF+ L Q TL V QY ++F E S     L+ TE  +I RFV GL   I+G + L RPTT+A A
Subjt:  VVGQTKNRPENPISWLGFKGLVRDHFGCRFAGVEQEVEFVSLVQGTLSVEQYVRRFEELSYRVPGLVATEEIRINRFVNGLRAEIRGLVRLGRPTTFAAA

Query:  LASARMLDRD-IPRAEQSQEVGTSSGAKKKSEVEVLAASQKVRSSP
        +  A ++D+D I +A+  Q+VG SSG K+K  V  +++SQ  ++SP
Subjt:  LASARMLDRD-IPRAEQSQEVGTSSGAKKKSEVEVLAASQKVRSSP

A0A6J1DNV8 uncharacterized protein LOC1110229251.9e-3538.19Show/hide
Query:  PEVNSHPEANPPVPP--PAPPVLAAEALQAMLGNAFLNNLQHVGANGA----PAHG----EDVQFIKSFMKAKPPSFDGHSDSSEAVVEWIAALEAIFQF
        P       A+  VPP  P   VL AEALQ +L NA        GA GA    P+ G    E+VQFI+ F +  PP F+G S+   A  EW+  LEA++ +
Subjt:  PEVNSHPEANPPVPP--PAPPVLAAEALQAMLGNAFLNNLQHVGANGA----PAHG----EDVQFIKSFMKAKPPSFDGHSDSSEAVVEWIAALEAIFQF

Query:  LGANAQQRVQGAAFMFKGHARTWWKVVGQTKNRPENPISWLGFKGLVRDHFGCRFAGVEQEVEFVSLVQGTLSVEQYVRRFEELSYRVPGLVATEEIRIN
        LG +   +V+GA FM +G A  WW+ V   ++    P++W  FK L+ +++       E+  EF+ L Q +L V QY R+F ELS      + TE+++I+
Subjt:  LGANAQQRVQGAAFMFKGHARTWWKVVGQTKNRPENPISWLGFKGLVRDHFGCRFAGVEQEVEFVSLVQGTLSVEQYVRRFEELSYRVPGLVATEEIRIN

Query:  RFVNGLRAEIRGLVRLGRPTTFAAALASARMLDRDIPRAEQSQEVGTSSGAKKK
        +F++GLR EI+GL+ L  PTT+AAA+  A ++D+ +   +  Q +G+SSG K+K
Subjt:  RFVNGLRAEIRGLVRLGRPTTFAAALASARMLDRDIPRAEQSQEVGTSSGAKKK

A0A6J1DQB9 Reverse transcriptase2.2e-3638.25Show/hide
Query:  PPVPPPAPP---------VLAAEALQAMLGNAFLNNLQHVGANGAPAH--------GEDVQFIKSFMKAKPPSFDGHSDSSEAVVEWIAALEAIFQFLGA
        PPVP  AP           L AEALQ +L NA        GA GA            ++VQFI+ F    PP F+G S+   A  EW+  LEA++ +LG 
Subjt:  PPVPPPAPP---------VLAAEALQAMLGNAFLNNLQHVGANGAPAH--------GEDVQFIKSFMKAKPPSFDGHSDSSEAVVEWIAALEAIFQFLGA

Query:  NAQQRVQGAAFMFKGHARTWWKVVGQTKNRPENPISWLGFKGLVRDHFGCRFAGVEQEVEFVSLVQGTLSVEQYVRRFEELSYRVPGLVATEEIRINRFV
        +   +V+GA FM +G A  WW+ V   ++    P++W  FK L+ +++    A  E+ VEF+ L QG+L+V QY R+F ELS      V TE+++I++F+
Subjt:  NAQQRVQGAAFMFKGHARTWWKVVGQTKNRPENPISWLGFKGLVRDHFGCRFAGVEQEVEFVSLVQGTLSVEQYVRRFEELSYRVPGLVATEEIRINRFV

Query:  NGLRAEIRGLVRLGRPTTFAAALASARMLDRDIPRAEQSQEVGTSSGAKKK
        +GLR EI+GL+ L  PTT+AAA+  A ++D+ +   +  Q +G++SG K+K
Subjt:  NGLRAEIRGLVRLGRPTTFAAALASARMLDRDIPRAEQSQEVGTSSGAKKK

A0A6J1DTA8 uncharacterized protein LOC1110241143.2e-3537.31Show/hide
Query:  GRSGPPDVPEVNSHPEANPPVPPPAPPVLAAEALQAMLGNAFLNNLQHVGANGAPAH--------GEDVQFIKSFMKAKPPSFDGHSDSSEAVVEWIAAL
        GR  PP VP+  + P+  P V P     L AEALQ +L NA        GA GA            ++VQFI+ F +  PP F+G S+   A  EW+  L
Subjt:  GRSGPPDVPEVNSHPEANPPVPPPAPPVLAAEALQAMLGNAFLNNLQHVGANGAPAH--------GEDVQFIKSFMKAKPPSFDGHSDSSEAVVEWIAAL

Query:  EAIFQFLGANAQQRVQGAAFMFKGHARTWWKVVGQTKNRPENPISWLGFKGLVRDHFGCRFAGVEQEVEFVSLVQGTLSVEQYVRRFEELSYRVPGLVAT
        EA++ +LG +   +V+GA FM +G A  WW+ V   ++    P++W  FK L+ +++       E+  EF+ L QG+L+V QY R+F ELS      + T
Subjt:  EAIFQFLGANAQQRVQGAAFMFKGHARTWWKVVGQTKNRPENPISWLGFKGLVRDHFGCRFAGVEQEVEFVSLVQGTLSVEQYVRRFEELSYRVPGLVAT

Query:  EEIRINRFVNGLRAEIRGLVRLGRPTTFAAALASARMLDRDIPRAEQSQEVGTSSGAKKK
        E+++I++F++GLR EI+GL+ +  PTT+AAA+  A ++D+ +   +  Q +G+SSG K+K
Subjt:  EEIRINRFVNGLRAEIRGLVRLGRPTTFAAALASARMLDRDIPRAEQSQEVGTSSGAKKK

A0A6J1DUM2 uncharacterized protein LOC1110232471.2e-3745.88Show/hide
Query:  DVQFIKSFMKAKPPSFDGHSDSSEAVVEWIAALEAIFQFLGANAQQRVQGAAFMFKGHARTWWKVVGQTKNRPENPISWLGFKGLVRDHFGCRFAGVEQE
        + +FIK F +  PP+FDG S+ + AV EWI  LEA++ +LG   Q +V+GA FM +G A  WW  V   ++    PI W  FK L+ D++        +E
Subjt:  DVQFIKSFMKAKPPSFDGHSDSSEAVVEWIAALEAIFQFLGANAQQRVQGAAFMFKGHARTWWKVVGQTKNRPENPISWLGFKGLVRDHFGCRFAGVEQE

Query:  VEFVSLVQGTLSVEQYVRRFEELSYRVPGLVATEEIRINRFVNGLRAEIRGLVRLGRPTTFAAALASARMLDRDIP-RAEQSQEVGTSSGAKKK
         EF+ LVQGTLSV QY R+F ELS     L+ TE ++I RFV GLR  IRG V L RPTT+A A+  A ++D+D+  +A    EVG+SSG K+K
Subjt:  VEFVSLVQGTLSVEQYVRRFEELSYRVPGLVATEEIRINRFVNGLRAEIRGLVRLGRPTTFAAALASARMLDRDIP-RAEQSQEVGTSSGAKKK

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGAGGGCAGTGCACCTTTGTTCACGTCCCGGAGTCAGCACTTAAGGGAAGAACCTTTCTACTATCCCTAAGCCGCTCCTTCGTGCGTGGAGTTGCAGCCGTGGGTTC
TCTCCAGCGTCTCTCTCTCCCGTGTTCTGACGCCAACAGCAGCAGTCCGTGCCCTGTGTTCGCGAGCGTCGGCCAGCACCACAGCTTCAGTTTTTCTCCTTCAAGCTCGC
GTGTGCTGAAAGCTAATTGTGTTCAATATCCTGATCTGCAGCGCCGTCTAGGTGTTCGATTAAGTTCGAAACACTTCAACTTGGAATTAGGTGTATTCGGTTGCTGTTCA
GCGAGCGTTTGTCTCAAATATCGTGTTCAGCGAATACCCACAACTCGAAAGACCTTGATTTTGGTTACCCATAACCCTGGTGACTTGGGATCTTGGTTGTTGGGTGTTTC
GAACACGAGTCAGCTTGTTCTCGAGGTTGTTTGGGCACTAAGCGGTAAATTCCAGCACCCTTTACCTTCCTTTATTAAGTCTCTAGGGTGTTGTTTATCAATTCCTAACC
CTTGGGTCTCGGGTATAAATGGTCGAGGACTGATAATGTTCAAGGAAAGTAAAGTGATTATCAGGGCCTCGGGTATAAAAGGTCGGGGACTGATATATCACTATTGGTGT
CGATGCCTCGGGTATAAATGGTCGAGGGTCGATATGCCAATGTTAGATAAAGAGGAGCATCGAGGCCTTGGGTATAAATGGTCAAGGGTCGGTGTGATGAGTCCTGAGGC
AAGTATTGAGGCCTTGGGTATAAATGGTCAAGGGTCAGTACGTTGCTTAGTCATCGAGGCCTTGGGTATAAATGGTCAAGGGTCGATGCACAGTTCGAGGCCTTGGGTAG
GGGCTGCTTACCAGTACCTTAGTGTACTGACCCCCTCCACTCTCCCCCCAACTACCAGATTTTGCAGGAGGTGTCGTTTCAAGATTTTGTTTAAAGATGTAGTTGGTTTA
GCTAATACTTGTTTATGCGAGATCTGTTTGTCTGTTGGTTCTGTTTATGTTCTGCTGAGATGTGAAGTGGGAAGAGTTAGTAATGTCGCTGGGTTAGCTTTTAAATTCCT
GGGGCGTTACAGTTGGTATCAGAGCAGAGTTGTTCCTGTAGACTGGCCTAGGAAATCTAGGTTGTTTGGATGTTTAGGGTTATGGTCTTCCTCGTTCTCCTCTCCATCAC
CAGCGATGAGTTCGAGTAGCAGTCAAGGTAGTGGACGTTCTGGCCCTCCAGACGTTCCCGAGGTTAATTCGCATCCTGAAGCGAATCCTCCTGTTCCTCCCCCAGCGCCT
CCTGTGCTGGCAGCAGAGGCATTGCAGGCGATGCTTGGCAATGCGTTCCTGAACAACCTGCAGCATGTCGGTGCAAATGGAGCCCCTGCTCATGGCGAAGATGTGCAATT
TATCAAGAGCTTCATGAAGGCGAAGCCTCCTTCATTCGATGGGCACTCGGATAGTTCTGAAGCAGTGGTAGAATGGATCGCCGCGTTGGAAGCGATATTTCAATTTCTTG
GAGCTAATGCCCAACAGCGAGTCCAAGGAGCTGCCTTTATGTTCAAAGGCCATGCTCGCACTTGGTGGAAGGTTGTGGGTCAAACCAAGAACCGCCCAGAGAATCCCATT
TCCTGGTTGGGGTTCAAAGGTCTTGTGCGAGACCATTTTGGCTGTCGTTTTGCTGGTGTTGAGCAAGAAGTGGAGTTTGTCTCTCTTGTTCAAGGGACCTTGTCTGTGGA
GCAGTACGTCAGAAGGTTTGAAGAGTTATCCTACCGAGTCCCAGGGTTGGTTGCCACCGAAGAGATTAGGATCAACCGATTCGTTAATGGGCTCCGCGCAGAAATTCGAG
GTTTGGTCAGGCTTGGTCGACCGACCACCTTTGCAGCAGCCCTAGCGAGCGCTCGGATGTTGGATAGGGACATCCCCAGGGCGGAGCAGTCCCAAGAGGTTGGCACGTCG
TCTGGTGCCAAGAAGAAGAGCGAAGTGGAAGTGCTTGCAGCTAGTCAGAAGGTAAGAAGTTCTCCGTCAGGATCTAGCGCGTACGCTGAGGAGTTCTTGCCCTGTGTCAC
CGATGAGGAACTCAAGGCAGAATACCCAGAGCTTTACGAGGACGATGGTTCTGATGACGAAGATAGCTCCTAA
mRNA sequenceShow/hide mRNA sequence
ATGAGAGGGCAGTGCACCTTTGTTCACGTCCCGGAGTCAGCACTTAAGGGAAGAACCTTTCTACTATCCCTAAGCCGCTCCTTCGTGCGTGGAGTTGCAGCCGTGGGTTC
TCTCCAGCGTCTCTCTCTCCCGTGTTCTGACGCCAACAGCAGCAGTCCGTGCCCTGTGTTCGCGAGCGTCGGCCAGCACCACAGCTTCAGTTTTTCTCCTTCAAGCTCGC
GTGTGCTGAAAGCTAATTGTGTTCAATATCCTGATCTGCAGCGCCGTCTAGGTGTTCGATTAAGTTCGAAACACTTCAACTTGGAATTAGGTGTATTCGGTTGCTGTTCA
GCGAGCGTTTGTCTCAAATATCGTGTTCAGCGAATACCCACAACTCGAAAGACCTTGATTTTGGTTACCCATAACCCTGGTGACTTGGGATCTTGGTTGTTGGGTGTTTC
GAACACGAGTCAGCTTGTTCTCGAGGTTGTTTGGGCACTAAGCGGTAAATTCCAGCACCCTTTACCTTCCTTTATTAAGTCTCTAGGGTGTTGTTTATCAATTCCTAACC
CTTGGGTCTCGGGTATAAATGGTCGAGGACTGATAATGTTCAAGGAAAGTAAAGTGATTATCAGGGCCTCGGGTATAAAAGGTCGGGGACTGATATATCACTATTGGTGT
CGATGCCTCGGGTATAAATGGTCGAGGGTCGATATGCCAATGTTAGATAAAGAGGAGCATCGAGGCCTTGGGTATAAATGGTCAAGGGTCGGTGTGATGAGTCCTGAGGC
AAGTATTGAGGCCTTGGGTATAAATGGTCAAGGGTCAGTACGTTGCTTAGTCATCGAGGCCTTGGGTATAAATGGTCAAGGGTCGATGCACAGTTCGAGGCCTTGGGTAG
GGGCTGCTTACCAGTACCTTAGTGTACTGACCCCCTCCACTCTCCCCCCAACTACCAGATTTTGCAGGAGGTGTCGTTTCAAGATTTTGTTTAAAGATGTAGTTGGTTTA
GCTAATACTTGTTTATGCGAGATCTGTTTGTCTGTTGGTTCTGTTTATGTTCTGCTGAGATGTGAAGTGGGAAGAGTTAGTAATGTCGCTGGGTTAGCTTTTAAATTCCT
GGGGCGTTACAGTTGGTATCAGAGCAGAGTTGTTCCTGTAGACTGGCCTAGGAAATCTAGGTTGTTTGGATGTTTAGGGTTATGGTCTTCCTCGTTCTCCTCTCCATCAC
CAGCGATGAGTTCGAGTAGCAGTCAAGGTAGTGGACGTTCTGGCCCTCCAGACGTTCCCGAGGTTAATTCGCATCCTGAAGCGAATCCTCCTGTTCCTCCCCCAGCGCCT
CCTGTGCTGGCAGCAGAGGCATTGCAGGCGATGCTTGGCAATGCGTTCCTGAACAACCTGCAGCATGTCGGTGCAAATGGAGCCCCTGCTCATGGCGAAGATGTGCAATT
TATCAAGAGCTTCATGAAGGCGAAGCCTCCTTCATTCGATGGGCACTCGGATAGTTCTGAAGCAGTGGTAGAATGGATCGCCGCGTTGGAAGCGATATTTCAATTTCTTG
GAGCTAATGCCCAACAGCGAGTCCAAGGAGCTGCCTTTATGTTCAAAGGCCATGCTCGCACTTGGTGGAAGGTTGTGGGTCAAACCAAGAACCGCCCAGAGAATCCCATT
TCCTGGTTGGGGTTCAAAGGTCTTGTGCGAGACCATTTTGGCTGTCGTTTTGCTGGTGTTGAGCAAGAAGTGGAGTTTGTCTCTCTTGTTCAAGGGACCTTGTCTGTGGA
GCAGTACGTCAGAAGGTTTGAAGAGTTATCCTACCGAGTCCCAGGGTTGGTTGCCACCGAAGAGATTAGGATCAACCGATTCGTTAATGGGCTCCGCGCAGAAATTCGAG
GTTTGGTCAGGCTTGGTCGACCGACCACCTTTGCAGCAGCCCTAGCGAGCGCTCGGATGTTGGATAGGGACATCCCCAGGGCGGAGCAGTCCCAAGAGGTTGGCACGTCG
TCTGGTGCCAAGAAGAAGAGCGAAGTGGAAGTGCTTGCAGCTAGTCAGAAGGTAAGAAGTTCTCCGTCAGGATCTAGCGCGTACGCTGAGGAGTTCTTGCCCTGTGTCAC
CGATGAGGAACTCAAGGCAGAATACCCAGAGCTTTACGAGGACGATGGTTCTGATGACGAAGATAGCTCCTAA
Protein sequenceShow/hide protein sequence
MRGQCTFVHVPESALKGRTFLLSLSRSFVRGVAAVGSLQRLSLPCSDANSSSPCPVFASVGQHHSFSFSPSSSRVLKANCVQYPDLQRRLGVRLSSKHFNLELGVFGCCS
ASVCLKYRVQRIPTTRKTLILVTHNPGDLGSWLLGVSNTSQLVLEVVWALSGKFQHPLPSFIKSLGCCLSIPNPWVSGINGRGLIMFKESKVIIRASGIKGRGLIYHYWC
RCLGYKWSRVDMPMLDKEEHRGLGYKWSRVGVMSPEASIEALGINGQGSVRCLVIEALGINGQGSMHSSRPWVGAAYQYLSVLTPSTLPPTTRFCRRCRFKILFKDVVGL
ANTCLCEICLSVGSVYVLLRCEVGRVSNVAGLAFKFLGRYSWYQSRVVPVDWPRKSRLFGCLGLWSSSFSSPSPAMSSSSSQGSGRSGPPDVPEVNSHPEANPPVPPPAP
PVLAAEALQAMLGNAFLNNLQHVGANGAPAHGEDVQFIKSFMKAKPPSFDGHSDSSEAVVEWIAALEAIFQFLGANAQQRVQGAAFMFKGHARTWWKVVGQTKNRPENPI
SWLGFKGLVRDHFGCRFAGVEQEVEFVSLVQGTLSVEQYVRRFEELSYRVPGLVATEEIRINRFVNGLRAEIRGLVRLGRPTTFAAALASARMLDRDIPRAEQSQEVGTS
SGAKKKSEVEVLAASQKVRSSPSGSSAYAEEFLPCVTDEELKAEYPELYEDDGSDDEDSS