; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc09G16930 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc09G16930
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
DescriptionReverse transcriptase
Genome locationClcChr09:27070330..27077089
RNA-Seq ExpressionClc09G16930
SyntenyClc09G16930
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR001878 - Zinc finger, CCHC-type
IPR005162 - Retrotransposon gag domain
IPR036875 - Zinc finger, CCHC-type superfamily
IPR041588 - Integrase zinc-binding domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022155000.1 uncharacterized protein LOC111022144 [Momordica charantia]1.1e-5951.22Show/hide
Query:  KRYRPPTFDCWSEKATATEQWIAELESLFEYLNCEDHFKVRGVVFMLRDEARMWWRSIEASEDHAEGPMTWARFKDLLYDYYFPDKVKDDKEIEFLHL--
        KRY PPTFD  SE+ATA E+WI ELE+ + YL CED FKV+G VFMLR EA  WW SI A+EDHA   + WARFKDLLYDYY+ + VKD KE EFLHL  
Subjt:  KRYRPPTFDCWSEKATATEQWIAELESLFEYLNCEDHFKVRGVVFMLRDEARMWWRSIEASEDHAEGPMTWARFKDLLYDYYFPDKVKDDKEIEFLHL--

Query:  ------------TELSCFAPDLVSTSERRIKRFIRGLREEIRGAVALKEPMTFAAALKAALIMDKNMAKKPQVTHSRWEVGASSRFKRKSPPASSDQTSK
                    TELS FA +L+  +  +IKRF++GL + IRG V L+ P ++A A++ ALIMDK+++ K     S  EVG+SS  KRK  P  +D + +
Subjt:  ------------TELSCFAPDLVSTSERRIKRFIRGLREEIRGAVALKEPMTFAAALKAALIMDKNMAKKPQVTHSRWEVGASSRFKRKSPPASSDQTSK

Query:  AHRLASGQATTLPLCGSCNKRHLEQCWLGQSICFKCGKEGHFARMC
        A +  +      P+C +C KRH  QCW G   CF+CG+E HFAR C
Subjt:  AHRLASGQATTLPLCGSCNKRHLEQCWLGQSICFKCGKEGHFARMC

XP_022155925.1 uncharacterized protein LOC111022925 [Momordica charantia]6.3e-6050Show/hide
Query:  KRYRPPTFDCWSEKATATEQWIAELESLFEYLNCEDHFKVRGVVFMLRDEARMWWRSIEASEDHAEGPMTWARFKDLLYDYYFPDKVKDDKEIEFLHL--
        KR+ PP F+  SE+ TA E+W+ ELE+L+ YL C D FKVRG VFML+ EA  WW S+ A+EDHA  P+TWARFKDLLY+YYFP  V+++K  EFL L  
Subjt:  KRYRPPTFDCWSEKATATEQWIAELESLFEYLNCEDHFKVRGVVFMLRDEARMWWRSIEASEDHAEGPMTWARFKDLLYDYYFPDKVKDDKEIEFLHL--

Query:  ------------TELSCFAPDLVSTSERRIKRFIRGLREEIRGAVALKEPMTFAAALKAALIMDKNMAKKPQVTHSRWEVGASSRFKRKSPPASSDQTSK
                    TELS F    + T + +I +FI GLR EI+G + LKEP T+AAA++ AL+MDK + ++PQ   S+  +G+SS  KRK    SS Q S+
Subjt:  ------------TELSCFAPDLVSTSERRIKRFIRGLREEIRGAVALKEPMTFAAALKAALIMDKNMAKKPQVTHSRWEVGASSRFKRKSPPASSDQTSK

Query:  AHRLASGQATTLPLCGSCNKRHLEQCWLGQSICFKCGKEGHFARMC
         H+    + TT P C SC K H   CW+G+ IC++C KEGHFAR C
Subjt:  AHRLASGQATTLPLCGSCNKRHLEQCWLGQSICFKCGKEGHFARMC

XP_022156326.1 uncharacterized protein LOC111023247 [Momordica charantia]3.5e-6352.03Show/hide
Query:  KRYRPPTFDCWSEKATATEQWIAELESLFEYLNCEDHFKVRGVVFMLRDEARMWWRSIEASEDHAEGPMTWARFKDLLYDYYFPDKVKDDKEIEFLHL--
        KRY PPTFD  SE+ATA E+WI ELE+L+ YL CED FKV+G VFMLR EA  WW S+ A+ED+A  P+ WARFK+LLYDYY+P+ VKD KE EFLHL  
Subjt:  KRYRPPTFDCWSEKATATEQWIAELESLFEYLNCEDHFKVRGVVFMLRDEARMWWRSIEASEDHAEGPMTWARFKDLLYDYYFPDKVKDDKEIEFLHL--

Query:  ------------TELSCFAPDLVSTSERRIKRFIRGLREEIRGAVALKEPMTFAAALKAALIMDKNMAKKPQVTHSRWEVGASSRFKRKSPPASSDQTSK
                    TELS FA +L+ T   +IKRF++GLR+ IRG V L+ P T+A A++ AL+MDK+++ K        EVG+SS  KRK P   +D   +
Subjt:  ------------TELSCFAPDLVSTSERRIKRFIRGLREEIRGAVALKEPMTFAAALKAALIMDKNMAKKPQVTHSRWEVGASSRFKRKSPPASSDQTSK

Query:  AHRLASGQATTLPLCGSCNKRHLEQCWLGQSICFKCGKEGHFARMC
        A +  +      P+C +C KRH  QCW G   CF+CG+EGHFAR C
Subjt:  AHRLASGQATTLPLCGSCNKRHLEQCWLGQSICFKCGKEGHFARMC

XP_022156328.1 LOW QUALITY PROTEIN: uncharacterized protein LOC111023249 [Momordica charantia]4.8e-6047.41Show/hide
Query:  KRYRPPTFDCWSEKATATEQWIAELESLFEYLNCEDHFKVRGVVFMLRDEARMWWRSIEASEDHAEGPMTWARFKDLLYDYYFPDKVKDDKEIEFLHL--
        K + PP F+  SE+ TA E+W+ ELE+L+ YL C D FKVRG VFMLR EA  WW S+ A+EDHA  P+TWARFKDLLY+YYFP   +++K +EFL L  
Subjt:  KRYRPPTFDCWSEKATATEQWIAELESLFEYLNCEDHFKVRGVVFMLRDEARMWWRSIEASEDHAEGPMTWARFKDLLYDYYFPDKVKDDKEIEFLHL--

Query:  ------------TELSCFAPDLVSTSERRIKRFIRGLREEIRGAVALKEPMTFAAALKAALIMDKNMAKKPQVTHSRWEVGASSRFKRKSPPASSDQTSK
                    TELS F    V T + +I +FI GLR EI+G + LKEP T+AAA++ AL+MDK + ++PQ   S+  +G++S  KRK    S+ Q+S+
Subjt:  ------------TELSCFAPDLVSTSERRIKRFIRGLREEIRGAVALKEPMTFAAALKAALIMDKNMAKKPQVTHSRWEVGASSRFKRKSPPASSDQTSK

Query:  AHRLASGQATTLPLCGSCNKRHLEQCWLGQSICFKCGKEGHFARMC----QNKREASADKPTSKALPEAT
         H+  + + T  P+C SC K H   CWLG+ ICFKC KEGHF R C     N +  S   PT+ A    T
Subjt:  AHRLASGQATTLPLCGSCNKRHLEQCWLGQSICFKCGKEGHFARMC----QNKREASADKPTSKALPEAT

XP_022158750.1 uncharacterized protein LOC111025215 [Momordica charantia]5.0e-5750.21Show/hide
Query:  SEKATATEQWIAELESLFEYLNCEDHFKVRGVVFMLRDEARMWWRSIEASEDHAEGPMTWARFKDLLYDYYFPDKVKDDKEIEFLHL-------------
        SE+ TA E+W+ ELE+L+ YL C D FKVRG VFMLR EA  WW S+ A+EDHA  P+TWARFKDLLY+YYFP  V+++K +EFL L             
Subjt:  SEKATATEQWIAELESLFEYLNCEDHFKVRGVVFMLRDEARMWWRSIEASEDHAEGPMTWARFKDLLYDYYFPDKVKDDKEIEFLHL-------------

Query:  -TELSCFAPDLVSTSERRIKRFIRGLREEIRGAVALKEPMTFAAALKAALIMDKNMAKKPQVTHSRWEVGASSRFKRKSPPASSDQTSKAHRLASGQATT
         TELS F    + T + +I +FI GLR EI+G + LKEP T+AAA++ AL+MDK + ++PQ   S+  +G+SS  KRK    SS Q S+ H+    + T 
Subjt:  -TELSCFAPDLVSTSERRIKRFIRGLREEIRGAVALKEPMTFAAALKAALIMDKNMAKKPQVTHSRWEVGASSRFKRKSPPASSDQTSKAHRLASGQATT

Query:  LPLCGSCNKRHLEQCWLGQSICFKCGKEGHFARMC
         P+C SC K H   CW+G+ IC++C KEGHFAR C
Subjt:  LPLCGSCNKRHLEQCWLGQSICFKCGKEGHFARMC

TrEMBL top hitse value%identityAlignment
A0A6J1DL73 uncharacterized protein LOC1110221445.2e-6051.22Show/hide
Query:  KRYRPPTFDCWSEKATATEQWIAELESLFEYLNCEDHFKVRGVVFMLRDEARMWWRSIEASEDHAEGPMTWARFKDLLYDYYFPDKVKDDKEIEFLHL--
        KRY PPTFD  SE+ATA E+WI ELE+ + YL CED FKV+G VFMLR EA  WW SI A+EDHA   + WARFKDLLYDYY+ + VKD KE EFLHL  
Subjt:  KRYRPPTFDCWSEKATATEQWIAELESLFEYLNCEDHFKVRGVVFMLRDEARMWWRSIEASEDHAEGPMTWARFKDLLYDYYFPDKVKDDKEIEFLHL--

Query:  ------------TELSCFAPDLVSTSERRIKRFIRGLREEIRGAVALKEPMTFAAALKAALIMDKNMAKKPQVTHSRWEVGASSRFKRKSPPASSDQTSK
                    TELS FA +L+  +  +IKRF++GL + IRG V L+ P ++A A++ ALIMDK+++ K     S  EVG+SS  KRK  P  +D + +
Subjt:  ------------TELSCFAPDLVSTSERRIKRFIRGLREEIRGAVALKEPMTFAAALKAALIMDKNMAKKPQVTHSRWEVGASSRFKRKSPPASSDQTSK

Query:  AHRLASGQATTLPLCGSCNKRHLEQCWLGQSICFKCGKEGHFARMC
        A +  +      P+C +C KRH  QCW G   CF+CG+E HFAR C
Subjt:  AHRLASGQATTLPLCGSCNKRHLEQCWLGQSICFKCGKEGHFARMC

A0A6J1DNV8 uncharacterized protein LOC1110229253.0e-6050Show/hide
Query:  KRYRPPTFDCWSEKATATEQWIAELESLFEYLNCEDHFKVRGVVFMLRDEARMWWRSIEASEDHAEGPMTWARFKDLLYDYYFPDKVKDDKEIEFLHL--
        KR+ PP F+  SE+ TA E+W+ ELE+L+ YL C D FKVRG VFML+ EA  WW S+ A+EDHA  P+TWARFKDLLY+YYFP  V+++K  EFL L  
Subjt:  KRYRPPTFDCWSEKATATEQWIAELESLFEYLNCEDHFKVRGVVFMLRDEARMWWRSIEASEDHAEGPMTWARFKDLLYDYYFPDKVKDDKEIEFLHL--

Query:  ------------TELSCFAPDLVSTSERRIKRFIRGLREEIRGAVALKEPMTFAAALKAALIMDKNMAKKPQVTHSRWEVGASSRFKRKSPPASSDQTSK
                    TELS F    + T + +I +FI GLR EI+G + LKEP T+AAA++ AL+MDK + ++PQ   S+  +G+SS  KRK    SS Q S+
Subjt:  ------------TELSCFAPDLVSTSERRIKRFIRGLREEIRGAVALKEPMTFAAALKAALIMDKNMAKKPQVTHSRWEVGASSRFKRKSPPASSDQTSK

Query:  AHRLASGQATTLPLCGSCNKRHLEQCWLGQSICFKCGKEGHFARMC
         H+    + TT P C SC K H   CW+G+ IC++C KEGHFAR C
Subjt:  AHRLASGQATTLPLCGSCNKRHLEQCWLGQSICFKCGKEGHFARMC

A0A6J1DQB9 Reverse transcriptase2.3e-6047.41Show/hide
Query:  KRYRPPTFDCWSEKATATEQWIAELESLFEYLNCEDHFKVRGVVFMLRDEARMWWRSIEASEDHAEGPMTWARFKDLLYDYYFPDKVKDDKEIEFLHL--
        K + PP F+  SE+ TA E+W+ ELE+L+ YL C D FKVRG VFMLR EA  WW S+ A+EDHA  P+TWARFKDLLY+YYFP   +++K +EFL L  
Subjt:  KRYRPPTFDCWSEKATATEQWIAELESLFEYLNCEDHFKVRGVVFMLRDEARMWWRSIEASEDHAEGPMTWARFKDLLYDYYFPDKVKDDKEIEFLHL--

Query:  ------------TELSCFAPDLVSTSERRIKRFIRGLREEIRGAVALKEPMTFAAALKAALIMDKNMAKKPQVTHSRWEVGASSRFKRKSPPASSDQTSK
                    TELS F    V T + +I +FI GLR EI+G + LKEP T+AAA++ AL+MDK + ++PQ   S+  +G++S  KRK    S+ Q+S+
Subjt:  ------------TELSCFAPDLVSTSERRIKRFIRGLREEIRGAVALKEPMTFAAALKAALIMDKNMAKKPQVTHSRWEVGASSRFKRKSPPASSDQTSK

Query:  AHRLASGQATTLPLCGSCNKRHLEQCWLGQSICFKCGKEGHFARMC----QNKREASADKPTSKALPEAT
         H+  + + T  P+C SC K H   CWLG+ ICFKC KEGHF R C     N +  S   PT+ A    T
Subjt:  AHRLASGQATTLPLCGSCNKRHLEQCWLGQSICFKCGKEGHFARMC----QNKREASADKPTSKALPEAT

A0A6J1DUM2 uncharacterized protein LOC1110232471.7e-6352.03Show/hide
Query:  KRYRPPTFDCWSEKATATEQWIAELESLFEYLNCEDHFKVRGVVFMLRDEARMWWRSIEASEDHAEGPMTWARFKDLLYDYYFPDKVKDDKEIEFLHL--
        KRY PPTFD  SE+ATA E+WI ELE+L+ YL CED FKV+G VFMLR EA  WW S+ A+ED+A  P+ WARFK+LLYDYY+P+ VKD KE EFLHL  
Subjt:  KRYRPPTFDCWSEKATATEQWIAELESLFEYLNCEDHFKVRGVVFMLRDEARMWWRSIEASEDHAEGPMTWARFKDLLYDYYFPDKVKDDKEIEFLHL--

Query:  ------------TELSCFAPDLVSTSERRIKRFIRGLREEIRGAVALKEPMTFAAALKAALIMDKNMAKKPQVTHSRWEVGASSRFKRKSPPASSDQTSK
                    TELS FA +L+ T   +IKRF++GLR+ IRG V L+ P T+A A++ AL+MDK+++ K        EVG+SS  KRK P   +D   +
Subjt:  ------------TELSCFAPDLVSTSERRIKRFIRGLREEIRGAVALKEPMTFAAALKAALIMDKNMAKKPQVTHSRWEVGASSRFKRKSPPASSDQTSK

Query:  AHRLASGQATTLPLCGSCNKRHLEQCWLGQSICFKCGKEGHFARMC
        A +  +      P+C +C KRH  QCW G   CF+CG+EGHFAR C
Subjt:  AHRLASGQATTLPLCGSCNKRHLEQCWLGQSICFKCGKEGHFARMC

A0A6J1DWP4 uncharacterized protein LOC1110252152.4e-5750.21Show/hide
Query:  SEKATATEQWIAELESLFEYLNCEDHFKVRGVVFMLRDEARMWWRSIEASEDHAEGPMTWARFKDLLYDYYFPDKVKDDKEIEFLHL-------------
        SE+ TA E+W+ ELE+L+ YL C D FKVRG VFMLR EA  WW S+ A+EDHA  P+TWARFKDLLY+YYFP  V+++K +EFL L             
Subjt:  SEKATATEQWIAELESLFEYLNCEDHFKVRGVVFMLRDEARMWWRSIEASEDHAEGPMTWARFKDLLYDYYFPDKVKDDKEIEFLHL-------------

Query:  -TELSCFAPDLVSTSERRIKRFIRGLREEIRGAVALKEPMTFAAALKAALIMDKNMAKKPQVTHSRWEVGASSRFKRKSPPASSDQTSKAHRLASGQATT
         TELS F    + T + +I +FI GLR EI+G + LKEP T+AAA++ AL+MDK + ++PQ   S+  +G+SS  KRK    SS Q S+ H+    + T 
Subjt:  -TELSCFAPDLVSTSERRIKRFIRGLREEIRGAVALKEPMTFAAALKAALIMDKNMAKKPQVTHSRWEVGASSRFKRKSPPASSDQTSKAHRLASGQATT

Query:  LPLCGSCNKRHLEQCWLGQSICFKCGKEGHFARMC
         P+C SC K H   CW+G+ IC++C KEGHFAR C
Subjt:  LPLCGSCNKRHLEQCWLGQSICFKCGKEGHFARMC

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGGTAGGAGAAGTCACTTCACAATTGTCACAGTTGACAATTCAACTGACCTTGAGACAGAAGATTATTGATACACAACAAGGCGATCCTTATTTTGTGGAGAAGTT
TTGCGAGGTGCAGTCGGAACAAGATAGGGAGTTCTTAGTATACTCAAATAATGGTCTCTATTATCAGAGACGCCTATGTGTTCCTGCAAACGGAAATGTTAAGAACGAAT
TGTTGGCAGAAGCTCATAGCTCTCCATTCACAATACATCCTGAAGGTACCAAGATGTACCAGGACTTGAAACGTTATTACTGGTGGCATAATATGAAGAAAGAGGTAGTT
GGACTACTAATGAGGTCATTCAGAAGATCAGGGCAAGGATGCAGACAGCTTAGAGAAGACAGAAAGTTACACCGATTGGTTGTCTTTCTTTCTCTGCAAATTCCCTTAGC
CGCTGACCATTGCGACGCCGCTGCCTTCACCGTCATTCATGAGCCACAGTCCGACATCGTGCGTAGGGACACAGTCACCGATTGTCACCACCTCTGTTTGCTTGATTGTC
TGCTCATCGTTTCTGGTCGTCGCAGCTATTCTGTCGTTTTTGGACCAACCTCTTTATCTTCGTCATGCCGCCGGGCGTCGTTTTGGTCGTCGTTTCCATCCACCACCGGA
AGCCACCATTTGCTCGCTAGAAAGACCACCAGAGGTTTAATGTATATTTCGAGATTCACTAAAGGTTTTGTGTTTCCTATGAGAATCACTAGAGGTTATGTGTATCATTC
GGGATCTACCAGAGGTTTTGTGTTTCCTACGAAATTCACTAGAGACATGTGTTTCCCATCAAGATTCACTAGAGGCCCCGAGCCCTACATCGACCCTTGGCCATTTGTAC
CCAAGGCCTCGACACTTACTCGTCACACTGACCCTTGGCCATTTGTACCCAAAGCCTCAATGCTTGTCTTAGTACCCAAGGCCTCGACGCTTACCCTAAACTCGGTTGCC
ATACCGATCCTACATAAGCGTTATCGACCTCCGACCTTCGACTGCTGGTCAGAGAAAGCTACGGCAACTGAGCAATGGATTGCAGAGCTGGAGTCATTGTTTGAATACCT
GAATTGTGAGGACCATTTTAAGGTCAGAGGAGTAGTTTTCATGCTTCGAGACGAGGCGCGAATGTGGTGGAGATCCATTGAAGCCTCAGAAGATCATGCCGAAGGACCCA
TGACTTGGGCAAGGTTTAAGGACCTCCTATACGACTACTATTTTCCCGACAAAGTGAAGGACGACAAGGAAATAGAGTTCCTGCATCTAACCGAGTTGTCTTGTTTTGCA
CCCGACCTGGTGAGTACGTCAGAGAGGAGGATTAAAAGGTTCATTAGGGGCTTGCGCGAGGAAATTAGAGGTGCGGTCGCCTTAAAGGAGCCAATGACTTTTGCTGCAGC
ACTTAAGGCGGCACTGATCATGGACAAAAATATGGCCAAGAAACCTCAGGTGACACACTCACGTTGGGAGGTTGGCGCCTCATCTAGATTTAAAAGAAAGTCTCCCCCAG
CTTCGTCAGATCAAACTTCCAAGGCCCATCGTCTGGCCTCGGGGCAAGCTACCACCCTCCCATTGTGCGGCTCATGCAACAAGCGTCACTTGGAGCAGTGCTGGCTAGGC
CAGAGCATTTGCTTTAAGTGTGGAAAGGAAGGTCACTTTGCAAGGATGTGCCAAAATAAAAGGGAGGCCAGCGCAGACAAGCCGACCTCGAAGGCCTTACCAGAAGCTAC
TTAA
mRNA sequenceShow/hide mRNA sequence
ATGGCGGTAGGAGAAGTCACTTCACAATTGTCACAGTTGACAATTCAACTGACCTTGAGACAGAAGATTATTGATACACAACAAGGCGATCCTTATTTTGTGGAGAAGTT
TTGCGAGGTGCAGTCGGAACAAGATAGGGAGTTCTTAGTATACTCAAATAATGGTCTCTATTATCAGAGACGCCTATGTGTTCCTGCAAACGGAAATGTTAAGAACGAAT
TGTTGGCAGAAGCTCATAGCTCTCCATTCACAATACATCCTGAAGGTACCAAGATGTACCAGGACTTGAAACGTTATTACTGGTGGCATAATATGAAGAAAGAGGTAGTT
GGACTACTAATGAGGTCATTCAGAAGATCAGGGCAAGGATGCAGACAGCTTAGAGAAGACAGAAAGTTACACCGATTGGTTGTCTTTCTTTCTCTGCAAATTCCCTTAGC
CGCTGACCATTGCGACGCCGCTGCCTTCACCGTCATTCATGAGCCACAGTCCGACATCGTGCGTAGGGACACAGTCACCGATTGTCACCACCTCTGTTTGCTTGATTGTC
TGCTCATCGTTTCTGGTCGTCGCAGCTATTCTGTCGTTTTTGGACCAACCTCTTTATCTTCGTCATGCCGCCGGGCGTCGTTTTGGTCGTCGTTTCCATCCACCACCGGA
AGCCACCATTTGCTCGCTAGAAAGACCACCAGAGGTTTAATGTATATTTCGAGATTCACTAAAGGTTTTGTGTTTCCTATGAGAATCACTAGAGGTTATGTGTATCATTC
GGGATCTACCAGAGGTTTTGTGTTTCCTACGAAATTCACTAGAGACATGTGTTTCCCATCAAGATTCACTAGAGGCCCCGAGCCCTACATCGACCCTTGGCCATTTGTAC
CCAAGGCCTCGACACTTACTCGTCACACTGACCCTTGGCCATTTGTACCCAAAGCCTCAATGCTTGTCTTAGTACCCAAGGCCTCGACGCTTACCCTAAACTCGGTTGCC
ATACCGATCCTACATAAGCGTTATCGACCTCCGACCTTCGACTGCTGGTCAGAGAAAGCTACGGCAACTGAGCAATGGATTGCAGAGCTGGAGTCATTGTTTGAATACCT
GAATTGTGAGGACCATTTTAAGGTCAGAGGAGTAGTTTTCATGCTTCGAGACGAGGCGCGAATGTGGTGGAGATCCATTGAAGCCTCAGAAGATCATGCCGAAGGACCCA
TGACTTGGGCAAGGTTTAAGGACCTCCTATACGACTACTATTTTCCCGACAAAGTGAAGGACGACAAGGAAATAGAGTTCCTGCATCTAACCGAGTTGTCTTGTTTTGCA
CCCGACCTGGTGAGTACGTCAGAGAGGAGGATTAAAAGGTTCATTAGGGGCTTGCGCGAGGAAATTAGAGGTGCGGTCGCCTTAAAGGAGCCAATGACTTTTGCTGCAGC
ACTTAAGGCGGCACTGATCATGGACAAAAATATGGCCAAGAAACCTCAGGTGACACACTCACGTTGGGAGGTTGGCGCCTCATCTAGATTTAAAAGAAAGTCTCCCCCAG
CTTCGTCAGATCAAACTTCCAAGGCCCATCGTCTGGCCTCGGGGCAAGCTACCACCCTCCCATTGTGCGGCTCATGCAACAAGCGTCACTTGGAGCAGTGCTGGCTAGGC
CAGAGCATTTGCTTTAAGTGTGGAAAGGAAGGTCACTTTGCAAGGATGTGCCAAAATAAAAGGGAGGCCAGCGCAGACAAGCCGACCTCGAAGGCCTTACCAGAAGCTAC
TTAA
Protein sequenceShow/hide protein sequence
MAVGEVTSQLSQLTIQLTLRQKIIDTQQGDPYFVEKFCEVQSEQDREFLVYSNNGLYYQRRLCVPANGNVKNELLAEAHSSPFTIHPEGTKMYQDLKRYYWWHNMKKEVV
GLLMRSFRRSGQGCRQLREDRKLHRLVVFLSLQIPLAADHCDAAAFTVIHEPQSDIVRRDTVTDCHHLCLLDCLLIVSGRRSYSVVFGPTSLSSSCRRASFWSSFPSTTG
SHHLLARKTTRGLMYISRFTKGFVFPMRITRGYVYHSGSTRGFVFPTKFTRDMCFPSRFTRGPEPYIDPWPFVPKASTLTRHTDPWPFVPKASMLVLVPKASTLTLNSVA
IPILHKRYRPPTFDCWSEKATATEQWIAELESLFEYLNCEDHFKVRGVVFMLRDEARMWWRSIEASEDHAEGPMTWARFKDLLYDYYFPDKVKDDKEIEFLHLTELSCFA
PDLVSTSERRIKRFIRGLREEIRGAVALKEPMTFAAALKAALIMDKNMAKKPQVTHSRWEVGASSRFKRKSPPASSDQTSKAHRLASGQATTLPLCGSCNKRHLEQCWLG
QSICFKCGKEGHFARMCQNKREASADKPTSKALPEAT