; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0036245 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0036245
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionReverse transcriptase
Genome locationchr3:42382162..42389578
RNA-Seq ExpressionLag0036245
SyntenyLag0036245
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR001584 - Integrase, catalytic core
IPR005162 - Retrotransposon gag domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022151688.1 uncharacterized protein LOC111019603 [Momordica charantia]5.6e-6652.77Show/hide
Query:  TAPMLITPEALQTMFDNMAQRNARPPRNPNWVPENAEESQFIRDFKRYGPPSFDGQSENPLAAERWITDLEALFDLMNCNDSLKIRGAVFMLKDDARTWW
        TA + +   ALQ + DN     A     P        E+QFIRDF+RYGPP+F+G+SE     E WI +LEAL+  + C+D LK++GAVFML+ +A  WW
Subjt:  TAPMLITPEALQTMFDNMAQRNARPPRNPNWVPENAEESQFIRDFKRYGPPSFDGQSENPLAAERWITDLEALFDLMNCNDSLKIRGAVFMLKDDARTWW

Query:  QSVAAAEDHANRPISWERFKDLLYDYYFPETVKDDKEAEFLHLAQGSMSVVQYERKFTALSRFAPDLVSTPERKIKRFIKGLREEIRGSVALSRPATFAE
          VA  EDH N PI+W   KDLLYDYYFP+T+KD+KE EFLHL Q ++ V QYE+KFT  SRFA DL+ T  RKIKRF++GL + I+G + L RP T+AE
Subjt:  QSVAAAEDHANRPISWERFKDLLYDYYFPETVKDDKEAEFLHLAQGSMSVVQYERKFTALSRFAPDLVSTPERKIKRFIKGLREEIRGSVALSRPATFAE

Query:  ALTGALIMDKNVSKKPQPHLEKGSTSGVKRKLSPL
        A+ GAL+MDK+V +K QP  + G +SGVKRK+ P+
Subjt:  ALTGALIMDKNVSKKPQPHLEKGSTSGVKRKLSPL

XP_022155000.1 uncharacterized protein LOC111022144 [Momordica charantia]3.6e-6558.04Show/hide
Query:  ESQFIRDFKRYGPPSFDGQSENPLAAERWITDLEALFDLMNCNDSLKIRGAVFMLKDDARTWWQSVAAAEDHANRPISWERFKDLLYDYYFPETVKDDKE
        E+ FI+DFKRYGPP+FDG+SE   AAE WI +LEA +  + C D  K++GAVFML+ +A  WW S+AAAEDHAN  I W RFKDLLYDYY+ ETVKD KE
Subjt:  ESQFIRDFKRYGPPSFDGQSENPLAAERWITDLEALFDLMNCNDSLKIRGAVFMLKDDARTWWQSVAAAEDHANRPISWERFKDLLYDYYFPETVKDDKE

Query:  AEFLHLAQGSMSVVQYERKFTALSRFAPDLVSTPERKIKRFIKGLREEIRGSVALSRPATFAEALTGALIMDKNVSKKPQPHLEKGSTSGVKRKLSPLRT
        AEFLHL QG++SV QYERKFT LSRFA +L+     KIKRF+KGL + IRG V L RPA++AEA+ GALIMDK+VS K     E GS+SGVKRK  P   
Subjt:  AEFLHLAQGSMSVVQYERKFTALSRFAPDLVSTPERKIKRFIKGLREEIRGSVALSRPATFAEALTGALIMDKNVSKKPQPHLEKGSTSGVKRKLSPLRT

Query:  HLLSLLSISPDAKCPRRLAKQTSM
                 P  + P+  A+   M
Subjt:  HLLSLLSISPDAKCPRRLAKQTSM

XP_022155925.1 uncharacterized protein LOC111022925 [Momordica charantia]3.2e-6152.74Show/hide
Query:  PPTAP--MLITPEALQTMFDNMAQRNARPPRNPNWVPENAEESQFIRDFKRYGPPSFDGQSENPLAAERWITDLEALFDLMNCNDSLKIRGAVFMLKDDA
        PP  P  +++  EALQ + DN         + P+    + EE QFIRDFKR+GPP F+G SE P AAE W+ +LEAL+  + C+D  K+RGAVFML+ +A
Subjt:  PPTAP--MLITPEALQTMFDNMAQRNARPPRNPNWVPENAEESQFIRDFKRYGPPSFDGQSENPLAAERWITDLEALFDLMNCNDSLKIRGAVFMLKDDA

Query:  RTWWQSVAAAEDHANRPISWERFKDLLYDYYFPETVKDDKEAEFLHLAQGSMSVVQYERKFTALSRFAPDLVSTPERKIKRFIKGLREEIRGSVALSRPA
          WW+SVAAAEDHAN P++W RFKDLLY+YYFP TV+++K AEFL L Q S+ V QYERKFT LSRF    + T + KI +FI GLR EI+G + L  P 
Subjt:  RTWWQSVAAAEDHANRPISWERFKDLLYDYYFPETVKDDKEAEFLHLAQGSMSVVQYERKFTALSRFAPDLVSTPERKIKRFIKGLREEIRGSVALSRPA

Query:  TFAEALTGALIMDKNVSKKPQPHLEKGSTSGVKRKLS
        T+A A+  AL+MDK + ++PQ     GS+SGVKRK +
Subjt:  TFAEALTGALIMDKNVSKKPQPHLEKGSTSGVKRKLS

XP_022156326.1 uncharacterized protein LOC111023247 [Momordica charantia]1.3e-6757.38Show/hide
Query:  PPRNPNWVPENAEESQFIRDFKRYGPPSFDGQSENPLAAERWITDLEALFDLMNCNDSLKIRGAVFMLKDDARTWWQSVAAAEDHANRPISWERFKDLLY
        PP      P++  E++FI+DFKRYGPP+FDG+SE   A E WI +LEAL+  + C D  K++GAVFML+ +A  WW SVAAAED+AN PI W RFK+LLY
Subjt:  PPRNPNWVPENAEESQFIRDFKRYGPPSFDGQSENPLAAERWITDLEALFDLMNCNDSLKIRGAVFMLKDDARTWWQSVAAAEDHANRPISWERFKDLLY

Query:  DYYFPETVKDDKEAEFLHLAQGSMSVVQYERKFTALSRFAPDLVSTPERKIKRFIKGLREEIRGSVALSRPATFAEALTGALIMDKNVSKKPQPHLEKGS
        DYY+PETVKD KEAEFLHL QG++SV QYERKFT LSRFA +L+ T   KIKRF+KGLR+ IRG V L RP T+AEA+ GAL+MDK+VS K  P  E GS
Subjt:  DYYFPETVKDDKEAEFLHLAQGSMSVVQYERKFTALSRFAPDLVSTPERKIKRFIKGLREEIRGSVALSRPATFAEALTGALIMDKNVSKKPQPHLEKGS

Query:  TSGVKRKLSPLRTHLLSLLSISPDAKCPRRLAKQTSM
        +SGVKRK       L+         + P+R A+   M
Subjt:  TSGVKRKLSPLRTHLLSLLSISPDAKCPRRLAKQTSM

XP_022156546.1 uncharacterized protein LOC111023424 [Momordica charantia]2.8e-6553.46Show/hide
Query:  PPGQRRVDPPPPP-------PPPAPPTAPMLITPEALQTMFDNMAQRNARPPRNPNWVPENAEESQFIRDFKRYGPPSFDGQSENPLAAERWITDLEALF
        P G+   DPPPPP        PP PP A   +   AL      +     +PPR+ +  P++  E+QFI+DFKRYGPP+F G SE    AE W+ +LEAL+
Subjt:  PPGQRRVDPPPPP-------PPPAPPTAPMLITPEALQTMFDNMAQRNARPPRNPNWVPENAEESQFIRDFKRYGPPSFDGQSENPLAAERWITDLEALF

Query:  DLMNCNDSLKIRGAVFMLKDDARTWWQSVAAAEDHANRPISWERFKDLLYDYYFPETVKDDKEAEFLHLAQGSMSVVQYERKFTALSRFAPDLVSTPERK
          + C D  K++GAVFML+ +A  WW SVAA EDHAN P+ W RFK+LLYD+Y+ ETV+D KE EFLHL QG+++V QYERKFT LS FA +L+ T   K
Subjt:  DLMNCNDSLKIRGAVFMLKDDARTWWQSVAAAEDHANRPISWERFKDLLYDYYFPETVKDDKEAEFLHLAQGSMSVVQYERKFTALSRFAPDLVSTPERK

Query:  IKRFIKGLREEIRGSVALSRPATFAEALTGALIMDKNVSKKPQPHLEKGSTSGVKRKLSP
        IKRF+KGL + IRGSV L RP T+AEA+ G LIMDK+VS + QP +E GS+ GVKRK+ P
Subjt:  IKRFIKGLREEIRGSVALSRPATFAEALTGALIMDKNVSKKPQPHLEKGSTSGVKRKLSP

TrEMBL top hitse value%identityAlignment
A0A6J1DCW8 uncharacterized protein LOC1110196032.7e-6652.77Show/hide
Query:  TAPMLITPEALQTMFDNMAQRNARPPRNPNWVPENAEESQFIRDFKRYGPPSFDGQSENPLAAERWITDLEALFDLMNCNDSLKIRGAVFMLKDDARTWW
        TA + +   ALQ + DN     A     P        E+QFIRDF+RYGPP+F+G+SE     E WI +LEAL+  + C+D LK++GAVFML+ +A  WW
Subjt:  TAPMLITPEALQTMFDNMAQRNARPPRNPNWVPENAEESQFIRDFKRYGPPSFDGQSENPLAAERWITDLEALFDLMNCNDSLKIRGAVFMLKDDARTWW

Query:  QSVAAAEDHANRPISWERFKDLLYDYYFPETVKDDKEAEFLHLAQGSMSVVQYERKFTALSRFAPDLVSTPERKIKRFIKGLREEIRGSVALSRPATFAE
          VA  EDH N PI+W   KDLLYDYYFP+T+KD+KE EFLHL Q ++ V QYE+KFT  SRFA DL+ T  RKIKRF++GL + I+G + L RP T+AE
Subjt:  QSVAAAEDHANRPISWERFKDLLYDYYFPETVKDDKEAEFLHLAQGSMSVVQYERKFTALSRFAPDLVSTPERKIKRFIKGLREEIRGSVALSRPATFAE

Query:  ALTGALIMDKNVSKKPQPHLEKGSTSGVKRKLSPL
        A+ GAL+MDK+V +K QP  + G +SGVKRK+ P+
Subjt:  ALTGALIMDKNVSKKPQPHLEKGSTSGVKRKLSPL

A0A6J1DL73 uncharacterized protein LOC1110221441.8e-6558.04Show/hide
Query:  ESQFIRDFKRYGPPSFDGQSENPLAAERWITDLEALFDLMNCNDSLKIRGAVFMLKDDARTWWQSVAAAEDHANRPISWERFKDLLYDYYFPETVKDDKE
        E+ FI+DFKRYGPP+FDG+SE   AAE WI +LEA +  + C D  K++GAVFML+ +A  WW S+AAAEDHAN  I W RFKDLLYDYY+ ETVKD KE
Subjt:  ESQFIRDFKRYGPPSFDGQSENPLAAERWITDLEALFDLMNCNDSLKIRGAVFMLKDDARTWWQSVAAAEDHANRPISWERFKDLLYDYYFPETVKDDKE

Query:  AEFLHLAQGSMSVVQYERKFTALSRFAPDLVSTPERKIKRFIKGLREEIRGSVALSRPATFAEALTGALIMDKNVSKKPQPHLEKGSTSGVKRKLSPLRT
        AEFLHL QG++SV QYERKFT LSRFA +L+     KIKRF+KGL + IRG V L RPA++AEA+ GALIMDK+VS K     E GS+SGVKRK  P   
Subjt:  AEFLHLAQGSMSVVQYERKFTALSRFAPDLVSTPERKIKRFIKGLREEIRGSVALSRPATFAEALTGALIMDKNVSKKPQPHLEKGSTSGVKRKLSPLRT

Query:  HLLSLLSISPDAKCPRRLAKQTSM
                 P  + P+  A+   M
Subjt:  HLLSLLSISPDAKCPRRLAKQTSM

A0A6J1DNV8 uncharacterized protein LOC1110229251.5e-6152.74Show/hide
Query:  PPTAP--MLITPEALQTMFDNMAQRNARPPRNPNWVPENAEESQFIRDFKRYGPPSFDGQSENPLAAERWITDLEALFDLMNCNDSLKIRGAVFMLKDDA
        PP  P  +++  EALQ + DN         + P+    + EE QFIRDFKR+GPP F+G SE P AAE W+ +LEAL+  + C+D  K+RGAVFML+ +A
Subjt:  PPTAP--MLITPEALQTMFDNMAQRNARPPRNPNWVPENAEESQFIRDFKRYGPPSFDGQSENPLAAERWITDLEALFDLMNCNDSLKIRGAVFMLKDDA

Query:  RTWWQSVAAAEDHANRPISWERFKDLLYDYYFPETVKDDKEAEFLHLAQGSMSVVQYERKFTALSRFAPDLVSTPERKIKRFIKGLREEIRGSVALSRPA
          WW+SVAAAEDHAN P++W RFKDLLY+YYFP TV+++K AEFL L Q S+ V QYERKFT LSRF    + T + KI +FI GLR EI+G + L  P 
Subjt:  RTWWQSVAAAEDHANRPISWERFKDLLYDYYFPETVKDDKEAEFLHLAQGSMSVVQYERKFTALSRFAPDLVSTPERKIKRFIKGLREEIRGSVALSRPA

Query:  TFAEALTGALIMDKNVSKKPQPHLEKGSTSGVKRKLS
        T+A A+  AL+MDK + ++PQ     GS+SGVKRK +
Subjt:  TFAEALTGALIMDKNVSKKPQPHLEKGSTSGVKRKLS

A0A6J1DUM2 uncharacterized protein LOC1110232476.4e-6857.38Show/hide
Query:  PPRNPNWVPENAEESQFIRDFKRYGPPSFDGQSENPLAAERWITDLEALFDLMNCNDSLKIRGAVFMLKDDARTWWQSVAAAEDHANRPISWERFKDLLY
        PP      P++  E++FI+DFKRYGPP+FDG+SE   A E WI +LEAL+  + C D  K++GAVFML+ +A  WW SVAAAED+AN PI W RFK+LLY
Subjt:  PPRNPNWVPENAEESQFIRDFKRYGPPSFDGQSENPLAAERWITDLEALFDLMNCNDSLKIRGAVFMLKDDARTWWQSVAAAEDHANRPISWERFKDLLY

Query:  DYYFPETVKDDKEAEFLHLAQGSMSVVQYERKFTALSRFAPDLVSTPERKIKRFIKGLREEIRGSVALSRPATFAEALTGALIMDKNVSKKPQPHLEKGS
        DYY+PETVKD KEAEFLHL QG++SV QYERKFT LSRFA +L+ T   KIKRF+KGLR+ IRG V L RP T+AEA+ GAL+MDK+VS K  P  E GS
Subjt:  DYYFPETVKDDKEAEFLHLAQGSMSVVQYERKFTALSRFAPDLVSTPERKIKRFIKGLREEIRGSVALSRPATFAEALTGALIMDKNVSKKPQPHLEKGS

Query:  TSGVKRKLSPLRTHLLSLLSISPDAKCPRRLAKQTSM
        +SGVKRK       L+         + P+R A+   M
Subjt:  TSGVKRKLSPLRTHLLSLLSISPDAKCPRRLAKQTSM

A0A6J1DVA0 uncharacterized protein LOC1110234241.3e-6553.46Show/hide
Query:  PPGQRRVDPPPPP-------PPPAPPTAPMLITPEALQTMFDNMAQRNARPPRNPNWVPENAEESQFIRDFKRYGPPSFDGQSENPLAAERWITDLEALF
        P G+   DPPPPP        PP PP A   +   AL      +     +PPR+ +  P++  E+QFI+DFKRYGPP+F G SE    AE W+ +LEAL+
Subjt:  PPGQRRVDPPPPP-------PPPAPPTAPMLITPEALQTMFDNMAQRNARPPRNPNWVPENAEESQFIRDFKRYGPPSFDGQSENPLAAERWITDLEALF

Query:  DLMNCNDSLKIRGAVFMLKDDARTWWQSVAAAEDHANRPISWERFKDLLYDYYFPETVKDDKEAEFLHLAQGSMSVVQYERKFTALSRFAPDLVSTPERK
          + C D  K++GAVFML+ +A  WW SVAA EDHAN P+ W RFK+LLYD+Y+ ETV+D KE EFLHL QG+++V QYERKFT LS FA +L+ T   K
Subjt:  DLMNCNDSLKIRGAVFMLKDDARTWWQSVAAAEDHANRPISWERFKDLLYDYYFPETVKDDKEAEFLHLAQGSMSVVQYERKFTALSRFAPDLVSTPERK

Query:  IKRFIKGLREEIRGSVALSRPATFAEALTGALIMDKNVSKKPQPHLEKGSTSGVKRKLSP
        IKRF+KGL + IRGSV L RP T+AEA+ G LIMDK+VS + QP +E GS+ GVKRK+ P
Subjt:  IKRFIKGLREEIRGSVALSRPATFAEALTGALIMDKNVSKKPQPHLEKGSTSGVKRKLSP

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATTGAGCCACCGAAAAGGAAGGATCCCTCTCATTCGGATGTGATCTCGGTTCATTCATGTACCCCTCCTAACCTCGGGCGCCACATGATTGTAACGCCCCGCGTTTC
GAAATCGAGCCGTTTTGGTATTTACTGGGCCCGGCCAGCCCCGCTGCCACTCGAGCTCGCGCCACCGCCTCCCCCTACTGCCGCCGCTCGAGTTCCAGCCGGTCAGTCAC
CGTCGCAGCCGCCTTGTCGCGCCGTCGCCGCCAGTCTCCCTTCCGCACGACTCTATCTCTTTTGTCTCGCCGTCTCCGCGTCGTCTCCTCTCTGTCCGTGCGTTTTTGGT
CGAGTTAAGTATTGGCTCGGAGTCTCCTTTCCTCGCGTTTTCGCCTCTGTCCAGCAGCGTCGTTGGGCGTTTTCGGCGTGGGTTAGTGTTTCCGCGCCGTCTAGGTGTTC
GATTAAGTTCGAAACACTTCAACTTGGGTACCCACTGCTCGAAGAGCGTTCTAGCTCACTGGTTGTGGTTGGTATAACCCGTCTAGCGCAAAAGCGGGCTCTTCCGGTTG
TTCCAATTAGTCCTTTAGGGTCGTTTAGGAGCGTTAGAGATTCCATGGGCATGGTTCATGAGTCTAGAAGCATGTTGCAGGGCATATGCATTCTAGTGGTTGAGCGATAT
AGGGTCAATACGCTGCCTAGTCATCGAGGCCTTGGGTGTAAATGGTCAAGGGTCGATGCACAGTTCGAGGCCTTGGGTATAAATGGTCAAGGGTCGAATGCCGAGCTCTG
TAGAGGAGTGTCGAGGCCCTGGGTAGGAGGGGCTACTTACCAGTACCTTAGTGTACTGACCCCCTCCCCTTTCTCTCCCCCCAACTACCAGATTTTGCAGGTTGTTAAAA
AATTTTGGGTTAGTCCAATGTACGCCGTTATGCTGCCGAAATTTTCGGTGCTCACAGTTTGTTTTGGTCTAGGAATTAGTAATGTCGTTGGGTTAGCTTTTAAAATCCTG
GGGCGTTACAGTTGGTATCAGAGCAGAGTTGTTCCTGTAGACTGGCCTAGGAAATCTAGGTTGTTTGGTTGTTTAGGGTTATGGTCTTCCTCGTTCTCCTCTCCATCACC
AGTACCACATTCTCAGGCAATGTCTCGCGGTCATGACCCTGAAGTTCCAATTGTCGATCAAGATGATCAAGTAGAGGAAGTTACTACTCAGCAGGGGGTCGATCCTCTGG
CTCCCCCTATGCAGGAGGCTAATCCTCTGATTCCTCCCGGTCAGCGCAGGGTTGATCCTCCTCCTCCCCCGCCTCCTCCGGCCCCTCCTACGGCTCCTATGCTGATCACT
CCGGAAGCCCTCCAGACCATGTTCGATAACATGGCCCAGAGAAATGCTAGGCCACCGCGAAACCCTAATTGGGTACCTGAGAACGCGGAGGAATCCCAGTTCATTAGGGA
CTTCAAGCGCTACGGGCCTCCCTCTTTTGATGGGCAATCCGAAAATCCTTTAGCAGCAGAGCGATGGATCACTGATTTGGAGGCACTGTTTGACCTCATGAACTGTAATG
ATTCCTTGAAAATCAGAGGGGCAGTTTTCATGCTCAAGGATGACGCTCGCACGTGGTGGCAATCGGTGGCAGCAGCCGAAGACCATGCTAACCGACCGATCTCGTGGGAA
AGGTTCAAGGATCTATTGTATGATTATTACTTCCCGGAGACAGTCAAGGACGACAAAGAAGCGGAATTTCTTCATTTGGCCCAGGGGAGTATGTCTGTAGTGCAGTATGA
GAGGAAGTTCACTGCACTATCACGCTTTGCTCCTGACCTGGTCAGCACGCCAGAGCGGAAGATTAAGAGGTTCATTAAAGGTCTTCGTGAGGAAATTCGAGGCTCTGTGG
CCCTAAGCAGGCCCGCGACCTTTGCTGAAGCACTCACGGGTGCATTGATCATGGATAAGAATGTTTCTAAGAAGCCACAACCTCATCTCGAGAAGGGATCAACCTCTGGA
GTTAAAAGAAAGTTGTCTCCCCTGAGGACCCACCTATTGAGCCTACTCAGCATCAGCCCAGACGCCAAGTGCCCAAGGAGGTTAGCCAAGCAAACATCAATGGAGTCCTT
AAAGGTGGGAAGTCGGGTTCACCTCTCATTTCCAGCGTTTTGCAACTATTATGGAATCCATGAACCAAGTCTCAAGGTCTTTTTGGACTTGCTCATGATAATGCTCTTGG
TGTGCAAGAACCCAGTTCGGATTGGTGTTTTTCAGACCTTTGGTGTTCTTAATGCACTTGGTCAAGTTGCCGTTAAGCTTTGGGATTTCAGCGGAAGGTGGCTTGTTGAG
CAAAAAGCCACTGGTATTAAGCGCCTTGAGAAGTTAGTGTCAGCCGCAAACCAGCCCCCTTTCTTTACGTTTTTCTTCTCCCCTGCGCAGCCGCCAGCCCCGCTGCCACT
CGAGCTCGCGCCACCGCCTCCCCCTGCTGCCGCCGCTCGAGTTCCAGCCGGTCAGTCACCGTCGCAGCCGCCTTGTCGCGCCGTCGCCGCCAGTCTCCCTTCGCACGACT
CTATCTCTTTGTCTCGCCGTCTCCGCGTCGTCTCCTCTCTGTCCCTCGGAGTCTCCTTTCCTCGCGTTTTCGCCTCTGTCCAGCAGCGTCGTTGGGCGTTTTCGGCGTGG
GTTAGTGTTTCCGCGCCGTCTAGGTGTTCAATTAAGTTCGAAACACTTCAACTTGGGTACCCACTGCTCGAAGAGCGTTCTAGCTCACTGGTTGTGGTTGGTATAACCCG
TCTAGCGCCAAAAGCGGGTCCGTTTGGAATTAGGTGTATTCGGTTGCTGTTCAGCGAGCGTTTGGTCTCAAATATCGTGTTAGCGAATACCCACAACTCGAAAGACGTTG
ATTTTGGTTACCCATAA
mRNA sequenceShow/hide mRNA sequence
ATGATTGAGCCACCGAAAAGGAAGGATCCCTCTCATTCGGATGTGATCTCGGTTCATTCATGTACCCCTCCTAACCTCGGGCGCCACATGATTGTAACGCCCCGCGTTTC
GAAATCGAGCCGTTTTGGTATTTACTGGGCCCGGCCAGCCCCGCTGCCACTCGAGCTCGCGCCACCGCCTCCCCCTACTGCCGCCGCTCGAGTTCCAGCCGGTCAGTCAC
CGTCGCAGCCGCCTTGTCGCGCCGTCGCCGCCAGTCTCCCTTCCGCACGACTCTATCTCTTTTGTCTCGCCGTCTCCGCGTCGTCTCCTCTCTGTCCGTGCGTTTTTGGT
CGAGTTAAGTATTGGCTCGGAGTCTCCTTTCCTCGCGTTTTCGCCTCTGTCCAGCAGCGTCGTTGGGCGTTTTCGGCGTGGGTTAGTGTTTCCGCGCCGTCTAGGTGTTC
GATTAAGTTCGAAACACTTCAACTTGGGTACCCACTGCTCGAAGAGCGTTCTAGCTCACTGGTTGTGGTTGGTATAACCCGTCTAGCGCAAAAGCGGGCTCTTCCGGTTG
TTCCAATTAGTCCTTTAGGGTCGTTTAGGAGCGTTAGAGATTCCATGGGCATGGTTCATGAGTCTAGAAGCATGTTGCAGGGCATATGCATTCTAGTGGTTGAGCGATAT
AGGGTCAATACGCTGCCTAGTCATCGAGGCCTTGGGTGTAAATGGTCAAGGGTCGATGCACAGTTCGAGGCCTTGGGTATAAATGGTCAAGGGTCGAATGCCGAGCTCTG
TAGAGGAGTGTCGAGGCCCTGGGTAGGAGGGGCTACTTACCAGTACCTTAGTGTACTGACCCCCTCCCCTTTCTCTCCCCCCAACTACCAGATTTTGCAGGTTGTTAAAA
AATTTTGGGTTAGTCCAATGTACGCCGTTATGCTGCCGAAATTTTCGGTGCTCACAGTTTGTTTTGGTCTAGGAATTAGTAATGTCGTTGGGTTAGCTTTTAAAATCCTG
GGGCGTTACAGTTGGTATCAGAGCAGAGTTGTTCCTGTAGACTGGCCTAGGAAATCTAGGTTGTTTGGTTGTTTAGGGTTATGGTCTTCCTCGTTCTCCTCTCCATCACC
AGTACCACATTCTCAGGCAATGTCTCGCGGTCATGACCCTGAAGTTCCAATTGTCGATCAAGATGATCAAGTAGAGGAAGTTACTACTCAGCAGGGGGTCGATCCTCTGG
CTCCCCCTATGCAGGAGGCTAATCCTCTGATTCCTCCCGGTCAGCGCAGGGTTGATCCTCCTCCTCCCCCGCCTCCTCCGGCCCCTCCTACGGCTCCTATGCTGATCACT
CCGGAAGCCCTCCAGACCATGTTCGATAACATGGCCCAGAGAAATGCTAGGCCACCGCGAAACCCTAATTGGGTACCTGAGAACGCGGAGGAATCCCAGTTCATTAGGGA
CTTCAAGCGCTACGGGCCTCCCTCTTTTGATGGGCAATCCGAAAATCCTTTAGCAGCAGAGCGATGGATCACTGATTTGGAGGCACTGTTTGACCTCATGAACTGTAATG
ATTCCTTGAAAATCAGAGGGGCAGTTTTCATGCTCAAGGATGACGCTCGCACGTGGTGGCAATCGGTGGCAGCAGCCGAAGACCATGCTAACCGACCGATCTCGTGGGAA
AGGTTCAAGGATCTATTGTATGATTATTACTTCCCGGAGACAGTCAAGGACGACAAAGAAGCGGAATTTCTTCATTTGGCCCAGGGGAGTATGTCTGTAGTGCAGTATGA
GAGGAAGTTCACTGCACTATCACGCTTTGCTCCTGACCTGGTCAGCACGCCAGAGCGGAAGATTAAGAGGTTCATTAAAGGTCTTCGTGAGGAAATTCGAGGCTCTGTGG
CCCTAAGCAGGCCCGCGACCTTTGCTGAAGCACTCACGGGTGCATTGATCATGGATAAGAATGTTTCTAAGAAGCCACAACCTCATCTCGAGAAGGGATCAACCTCTGGA
GTTAAAAGAAAGTTGTCTCCCCTGAGGACCCACCTATTGAGCCTACTCAGCATCAGCCCAGACGCCAAGTGCCCAAGGAGGTTAGCCAAGCAAACATCAATGGAGTCCTT
AAAGGTGGGAAGTCGGGTTCACCTCTCATTTCCAGCGTTTTGCAACTATTATGGAATCCATGAACCAAGTCTCAAGGTCTTTTTGGACTTGCTCATGATAATGCTCTTGG
TGTGCAAGAACCCAGTTCGGATTGGTGTTTTTCAGACCTTTGGTGTTCTTAATGCACTTGGTCAAGTTGCCGTTAAGCTTTGGGATTTCAGCGGAAGGTGGCTTGTTGAG
CAAAAAGCCACTGGTATTAAGCGCCTTGAGAAGTTAGTGTCAGCCGCAAACCAGCCCCCTTTCTTTACGTTTTTCTTCTCCCCTGCGCAGCCGCCAGCCCCGCTGCCACT
CGAGCTCGCGCCACCGCCTCCCCCTGCTGCCGCCGCTCGAGTTCCAGCCGGTCAGTCACCGTCGCAGCCGCCTTGTCGCGCCGTCGCCGCCAGTCTCCCTTCGCACGACT
CTATCTCTTTGTCTCGCCGTCTCCGCGTCGTCTCCTCTCTGTCCCTCGGAGTCTCCTTTCCTCGCGTTTTCGCCTCTGTCCAGCAGCGTCGTTGGGCGTTTTCGGCGTGG
GTTAGTGTTTCCGCGCCGTCTAGGTGTTCAATTAAGTTCGAAACACTTCAACTTGGGTACCCACTGCTCGAAGAGCGTTCTAGCTCACTGGTTGTGGTTGGTATAACCCG
TCTAGCGCCAAAAGCGGGTCCGTTTGGAATTAGGTGTATTCGGTTGCTGTTCAGCGAGCGTTTGGTCTCAAATATCGTGTTAGCGAATACCCACAACTCGAAAGACGTTG
ATTTTGGTTACCCATAA
Protein sequenceShow/hide protein sequence
MIEPPKRKDPSHSDVISVHSCTPPNLGRHMIVTPRVSKSSRFGIYWARPAPLPLELAPPPPPTAAARVPAGQSPSQPPCRAVAASLPSARLYLFCLAVSASSPLCPCVFG
RVKYWLGVSFPRVFASVQQRRWAFSAWVSVSAPSRCSIKFETLQLGYPLLEERSSSLVVVGITRLAQKRALPVVPISPLGSFRSVRDSMGMVHESRSMLQGICILVVERY
RVNTLPSHRGLGCKWSRVDAQFEALGINGQGSNAELCRGVSRPWVGGATYQYLSVLTPSPFSPPNYQILQVVKKFWVSPMYAVMLPKFSVLTVCFGLGISNVVGLAFKIL
GRYSWYQSRVVPVDWPRKSRLFGCLGLWSSSFSSPSPVPHSQAMSRGHDPEVPIVDQDDQVEEVTTQQGVDPLAPPMQEANPLIPPGQRRVDPPPPPPPPAPPTAPMLIT
PEALQTMFDNMAQRNARPPRNPNWVPENAEESQFIRDFKRYGPPSFDGQSENPLAAERWITDLEALFDLMNCNDSLKIRGAVFMLKDDARTWWQSVAAAEDHANRPISWE
RFKDLLYDYYFPETVKDDKEAEFLHLAQGSMSVVQYERKFTALSRFAPDLVSTPERKIKRFIKGLREEIRGSVALSRPATFAEALTGALIMDKNVSKKPQPHLEKGSTSG
VKRKLSPLRTHLLSLLSISPDAKCPRRLAKQTSMESLKVGSRVHLSFPAFCNYYGIHEPSLKVFLDLLMIMLLVCKNPVRIGVFQTFGVLNALGQVAVKLWDFSGRWLVE
QKATGIKRLEKLVSAANQPPFFTFFFSPAQPPAPLPLELAPPPPPAAAARVPAGQSPSQPPCRAVAASLPSHDSISLSRRLRVVSSLSLGVSFPRVFASVQQRRWAFSAW
VSVSAPSRCSIKFETLQLGYPLLEERSSSLVVVGITRLAPKAGPFGIRCIRLLFSERLVSNIVLANTHNSKDVDFGYP